WorldWideScience

Sample records for complete genomic structure

  1. Structured Matrix Completion with Applications to Genomic Data Integration.

    Science.gov (United States)

    Cai, Tianxi; Cai, T Tony; Zhang, Anru

    2016-01-01

    Matrix completion has attracted significant recent attention in many fields, including statistics, applied mathematics and electrical engineering. The current literature on matrix completion focuses primarily on independent sampling models, under which the individual observed entries are sampled independently. Motivated by applications in genomic data integration, we propose a new framework of structured matrix completion (SMC) to treat structured missingness by design. Specifically, our proposed method aims at efficient matrix recovery when a subset of the rows and columns of an approximately low-rank matrix are observed. We provide theoretical justification for the proposed SMC method and derive a lower bound for the estimation errors, which together establish the optimal rate of recovery over certain classes of approximately low-rank matrices. Simulation studies show that the method performs well in finite samples under a variety of configurations. The method is applied to integrate several ovarian cancer genomic studies with different extents of genomic measurements, which enables us to construct more accurate prediction rules for ovarian cancer survival.
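
    As a rough illustration of the structured-missingness setting described in this abstract (some rows and some columns fully observed, the remaining block unobserved), the sketch below fills the missing block in the simplest possible case: a noiseless matrix of exactly rank one. This is an assumption-laden toy, not the authors' SMC estimator, which targets approximately low-rank matrices; the function name `complete_rank_one` and the block names `a11`, `a12`, `a21` are hypothetical.

```python
def complete_rank_one(a11, a12, a21):
    """Fill the unobserved block A22 of the block matrix [[A11, A12], [A21, A22]].

    Assumes the full matrix is exactly rank one and A11[0][0] is nonzero,
    so that A22[i][j] = A21[i][0] * A12[0][j] / A11[0][0].
    """
    pivot = a11[0][0]
    if pivot == 0:
        raise ValueError("pivot entry A11[0][0] must be nonzero")
    # Each missing entry is fixed by one observed column entry and one observed row entry.
    return [[row[0] * col_value / pivot for col_value in a12[0]] for row in a21]


# Toy data: the outer product of u = (1, 2, 3) and v = (2, 4, 5).
# The first two rows and first two columns are observed; the bottom-right entry is not.
a11 = [[2, 4], [4, 8]]   # observed rows x observed columns
a12 = [[5], [10]]        # observed rows x unobserved columns
a21 = [[6, 12]]          # unobserved rows x observed columns
a22 = complete_rank_one(a11, a12, a21)
print(a22)
```

    On this toy input the recovered entry equals 3 × 5 = 15, the value the rank-one structure implies. With noise or higher rank, a single pivot entry is no longer enough, which is the regime the abstract addresses.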

  2. The complete mitochondrial genome structure of the jaguar (Panthera onca).

    Science.gov (United States)

    Caragiulo, Anthony; Dougherty, Eric; Soto, Sofia; Rabinowitz, Salisa; Amato, George

    2016-01-01

    The jaguar (Panthera onca) is the largest felid in the Western hemisphere, and the only member of the Panthera genus in the New World. The jaguar inhabits most countries within Central and South America, and is considered near threatened by the International Union for the Conservation of Nature. This study represents the first sequence of the entire jaguar mitogenome, which was the only Panthera mitogenome that had not been sequenced. The jaguar mitogenome is 17,049 bases and possesses the same molecular structure as other felid mitogenomes. Bayesian inference (BI) and maximum likelihood (ML) were used to determine the phylogenetic placement of the jaguar within the Panthera genus. Both BI and ML analyses revealed the jaguar to be sister to the tiger/leopard/snow leopard clade.

  3. The First Complete Chloroplast Genome Sequences in Actinidiaceae: Genome Structure and Comparative Analysis.

    Science.gov (United States)

    Yao, Xiaohong; Tang, Ping; Li, Zuozhou; Li, Dawei; Liu, Yifei; Huang, Hongwen

    2015-01-01

    Actinidia chinensis is an important economic plant belonging to the basal lineage of the asterids. Availability of a complete Actinidia chloroplast genome sequence is crucial to understanding phylogenetic relationships among major lineages of angiosperms and facilitates kiwifruit genetic improvement. We report here the complete nucleotide sequences of the chloroplast genomes for Actinidia chinensis and A. chinensis var. deliciosa obtained through de novo assembly of Illumina paired-end reads produced by total DNA sequencing. The total genome size ranges from 155,446 to 157,557 bp, with an inverted repeat (IR) of 24,013 to 24,391 bp, a large single copy region (LSC) of 87,984 to 88,337 bp and a small single copy region (SSC) of 20,332 to 20,336 bp. The genome encodes 113 different genes, including 79 unique protein-coding genes, 30 tRNA genes and 4 ribosomal RNA genes, with 16 duplicated in the inverted repeats, and a tRNA gene (trnfM-CAU) duplicated once in the LSC region. Comparisons of IR boundaries among four asterid species showed that IR/LSC borders were extended into the 5' portion of the psbA gene and IR contraction occurred in Actinidia. The clpP gene has been lost from the chloroplast genome in Actinidia, and may have been transferred to the nucleus during chloroplast evolution. Twenty-seven polymorphic simple sequence repeat (SSR) loci were identified in the Actinidia chloroplast genome. Maximum parsimony analyses of a 72-gene, 16-taxon angiosperm dataset strongly support the placement of Actinidiaceae in Ericales within the basal asterids.

  4. The complete chloroplast genome sequence of Podocarpus lambertii: genome structure, evolutionary aspects, gene content and SSR detection.

    Directory of Open Access Journals (Sweden)

    Leila do Nascimento Vieira

    BACKGROUND: Podocarpus lambertii (Podocarpaceae) is a native conifer from the Brazilian Atlantic Forest Biome, which is considered one of the 25 biodiversity hotspots in the world. The advancement of next-generation sequencing technologies has enabled the rapid acquisition of whole chloroplast (cp) genome sequences at low cost. Several studies have proven the potential of cp genomes as tools to understand enigmatic and basal phylogenetic relationships at different taxonomic levels, as well as to further probe the structural and functional evolution of plants. In this work, we present the complete cp genome sequence of P. lambertii. METHODOLOGY/PRINCIPAL FINDINGS: The P. lambertii cp genome is 133,734 bp in length, and similar to other sequenced cupressophytes, it lacks one of the large inverted repeat regions (IR). It contains 118 unique genes and one duplicated tRNA (trnN-GUU), which occurs as an inverted repeat sequence. The rps16 gene was not found, as previously reported for the plastid genomes of another member of Podocarpaceae (Nageia nagi) and of Araucariaceae (Agathis dammara). Structurally, P. lambertii shows 4 inversions of a large DNA fragment of ∼20,000 bp compared to the Podocarpus totara cp genome. These unexpected characteristics may be attributed to geographical distance and different adaptive needs. The P. lambertii cp genome presents a total of 28 tandem repeats and 156 SSRs, with homo- and dipolymers being the most common and tri-, tetra-, penta-, and hexapolymers occurring with less frequency. CONCLUSION: The complete cp genome sequence of P. lambertii revealed significant structural changes, even in species from the same genus. These results reinforce the apparent loss of the rps16 gene in Podocarpaceae cp genomes. In addition, several SSRs in the P. lambertii cp genome are likely intraspecific polymorphism sites, which may allow highly sensitive phylogeographic and population structure studies, as well as phylogenetic studies of species of

  5. Complete Chloroplast Genomes of Papaver rhoeas and Papaver orientale: Molecular Structures, Comparative Analysis, and Phylogenetic Analysis

    Directory of Open Access Journals (Sweden)

    Jianguo Zhou

    2018-02-01

    Papaver rhoeas L. and P. orientale L., which belong to the family Papaveraceae, are used as ornamental and medicinal plants. The chloroplast genome has been used for molecular markers, evolutionary biology, and barcoding identification. In this study, the complete chloroplast genome sequences of P. rhoeas and P. orientale are reported. Results show that the complete chloroplast genomes of P. rhoeas and P. orientale have typical quadripartite structures and are circular molecules of 152,905 and 152,799 bp, respectively. A total of 130 genes were identified in each genome, including 85 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Sequence divergence analysis of four species from Papaveraceae indicated that the most divergent regions are found in the non-coding spacers with minimal differences among three Papaver species. These differences include the ycf1 gene and intergenic regions, such as rpoB-trnC, trnD-trnT, petA-psbJ, psbE-petL, and ccsA-ndhD. These regions are hypervariable regions, which can be used as specific DNA barcodes. This finding suggested that the chloroplast genome could be used as a powerful tool to resolve the phylogenetic positions and relationships of Papaveraceae. These results offer valuable information for future research in the identification of Papaver species and will benefit further investigations of these species.

  6. Complete Chloroplast Genome of the Wollemi Pine (Wollemia nobilis): Structure and Evolution.

    Science.gov (United States)

    Yap, Jia-Yee S; Rohner, Thore; Greenfield, Abigail; Van Der Merwe, Marlien; McPherson, Hannah; Glenn, Wendy; Kornfeld, Geoff; Marendy, Elessa; Pan, Annie Y H; Wilton, Alan; Wilkins, Marc R; Rossetto, Maurizio; Delaney, Sven K

    2015-01-01

    The Wollemi pine (Wollemia nobilis) is a rare Southern conifer with striking morphological similarity to fossil pines. A small population of W. nobilis was discovered in 1994 in a remote canyon system in the Wollemi National Park (near Sydney, Australia). This population contains fewer than 100 individuals and is critically endangered. Previous genetic studies of the Wollemi pine have investigated its evolutionary relationship with other pines in the family Araucariaceae, and have suggested that the Wollemi pine genome contains little or no variation. However, these studies were performed prior to the widespread use of genome sequencing, and their conclusions were based on a limited fraction of the Wollemi pine genome. In this study, we address this problem by determining the entire sequence of the W. nobilis chloroplast genome. A detailed analysis of the structure of the genome is presented, and the evolution of the genome is inferred by comparison with the chloroplast sequences of other members of the Araucariaceae and the related family Podocarpaceae. Pairwise alignments of whole genome sequences, and the presence of unique pseudogenes, gene duplications and insertions in W. nobilis and Araucariaceae, indicate that the W. nobilis chloroplast genome is most similar to that of its sister taxon Agathis. However, the W. nobilis genome contains an unusually high number of repetitive sequences, and these could be used in future studies to investigate and conserve any remnant genetic diversity in the Wollemi pine.

  7. Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

    Science.gov (United States)

    Jo, Yeong Deuk; Choi, Yoomi; Kim, Dong-Hwan; Kim, Byung-Dong; Kang, Byoung-Cheorl

    2014-07-04

    Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp. We obtained the complete mitochondrial genome sequences of the pepper CMS line FS4401 (507,452 bp) and the fertile line Jeju (511,530 bp). Comparative analysis between the mitochondrial genomes of pepper and tobacco, both members of Solanaceae, revealed extensive DNA rearrangements and poor conservation in non-coding DNA. In the comparison between pepper lines, FS4401 and Jeju mitochondrial DNAs contained the same complement of protein-coding genes except for one additional copy of an atp6 gene (ψatp6-2) in FS4401. In terms of genome structure, we found eighteen syntenic blocks in the two mitochondrial genomes, which have been rearranged in each genome. By contrast, sequences between syntenic blocks, which were specific to each line, accounted for 30,380 and 17,847 bp in FS4401 and Jeju, respectively. The previously reported CMS candidate genes, orf507 and ψatp6-2, were located on the edges of the largest sequence segments that were specific to FS4401. In this region, a large number of small sequence segments that were absent from, or found at different locations in, the Jeju mitochondrial genome were combined together. The incorporation of repeats and the overlapping of connected sequence segments by a few nucleotides implied that extensive rearrangements by homologous recombination might be involved in the evolution of this region. Further analysis using mtDNA pairs from other plant species revealed common features of DNA regions around CMS-associated genes. Although large portion of sequence context was

  8. The complete chloroplast genome sequence of an endemic monotypic genus Hagenia (Rosaceae): structural comparative analysis, gene content and microsatellite detection

    Directory of Open Access Journals (Sweden)

    Andrew W. Gichira

    2017-01-01

    Hagenia is an endangered monotypic genus endemic to the tropical mountains of Africa. The only species, Hagenia abyssinica (Bruce) J.F. Gmel, is an important medicinal plant producing bioactive compounds that have been traditionally used by African communities as a remedy for gastrointestinal ailments in both humans and animals. Complete chloroplast genomes have been applied in resolving phylogenetic relationships within plant families. We employed high-throughput sequencing technologies to determine the complete chloroplast genome sequence of H. abyssinica. The genome is a circular molecule of 154,961 base pairs (bp), with a pair of inverted repeats (IR) of 25,971 bp each, separated by two single-copy regions: a large (LSC, 84,320 bp) and a small single copy (SSC, 18,696 bp). H. abyssinica's chloroplast genome has a 37.1% GC content and encodes 112 unique genes, of which 78 code for proteins, 30 are tRNA genes and four are rRNA genes. A comparative analysis with twenty other species from the family Rosaceae sequenced to date revealed similarities in structural organization, gene content and arrangement. The observed size differences are attributed to the contraction/expansion of the inverted repeats. The translational initiation factor gene (infA), which had been previously reported in other chloroplast genomes, was conspicuously missing in H. abyssinica. A total of 172 microsatellites and 49 large repeat sequences were detected in the chloroplast genome. A maximum likelihood analysis of 71 protein-coding genes placed Hagenia in Rosoideae. The availability of a complete chloroplast genome, the first in the Sanguisorbeae tribe, is beneficial for further molecular studies on taxonomic and phylogenomic resolution within the Rosaceae family.

  9. The complete chloroplast genome sequence of an endemic monotypic genus Hagenia (Rosaceae): structural comparative analysis, gene content and microsatellite detection.

    Science.gov (United States)

    Gichira, Andrew W; Li, Zhizhong; Saina, Josphat K; Long, Zhicheng; Hu, Guangwan; Gituru, Robert W; Wang, Qingfeng; Chen, Jinming

    2017-01-01

    Hagenia is an endangered monotypic genus endemic to the tropical mountains of Africa. The only species, Hagenia abyssinica (Bruce) J.F. Gmel, is an important medicinal plant producing bioactive compounds that have been traditionally used by African communities as a remedy for gastrointestinal ailments in both humans and animals. Complete chloroplast genomes have been applied in resolving phylogenetic relationships within plant families. We employed high-throughput sequencing technologies to determine the complete chloroplast genome sequence of H. abyssinica. The genome is a circular molecule of 154,961 base pairs (bp), with a pair of inverted repeats (IR) of 25,971 bp each, separated by two single-copy regions: a large (LSC, 84,320 bp) and a small single copy (SSC, 18,696 bp). H. abyssinica's chloroplast genome has a 37.1% GC content and encodes 112 unique genes, of which 78 code for proteins, 30 are tRNA genes and four are rRNA genes. A comparative analysis with twenty other species from the family Rosaceae sequenced to date revealed similarities in structural organization, gene content and arrangement. The observed size differences are attributed to the contraction/expansion of the inverted repeats. The translational initiation factor gene (infA), which had been previously reported in other chloroplast genomes, was conspicuously missing in H. abyssinica. A total of 172 microsatellites and 49 large repeat sequences were detected in the chloroplast genome. A maximum likelihood analysis of 71 protein-coding genes placed Hagenia in Rosoideae. The availability of a complete chloroplast genome, the first in the Sanguisorbeae tribe, is beneficial for further molecular studies on taxonomic and phylogenomic resolution within the Rosaceae family.

  10. The complete mitochondrial genome and its remarkable secondary structure for a stonefly Acroneuria hainana Wu (Insecta: Plecoptera, Perlidae).

    Science.gov (United States)

    Huang, Mingchao; Wang, Yuyu; Liu, Xingyue; Li, Weihai; Kang, Zehui; Wang, Kai; Li, Xuankun; Yang, Ding

    2015-02-15

    The Plecoptera (stoneflies) is a hemimetabolous order of insects, whose larvae are commonly used as indicators for freshwater biomonitoring. Herein, we describe the complete mitochondrial (mt) genome of a stonefly species, namely Acroneuria hainana Wu, belonging to the family Perlidae. This mt genome contains 13 PCGs, 22 tRNA-coding genes and 2 rRNA-coding genes that are conserved in most insect mt genomes, and its gene order is identical to the insect ancestral gene order. However, there are three special initiation codons of ND1, ND5 and COI in PCGs: TTG, GTG and CGA, coding for L, V and R, respectively. Additionally, the 899-bp control region, with 73.30% A+T content, has two long repeated sequences which are found at the 3'-end close to the tRNA(Ile) gene. Both of them can be folded into a stem-loop structure, whose adjacent upstream and downstream sequences can also be folded into stem-loop structures. It is presumed that the four special structures in series could be associated with the D-loop replication. They might be able to adjust the replication speed of the two replication directions. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. The complete genome structure and phylogenetic relationship of infectious hematopoietic necrosis virus

    Science.gov (United States)

    Morzunov, Sergey P.; Winton, James R.; Nichol, Stuart T.

    1995-01-01

    Infectious hematopoietic necrosis virus (IHNV), a member of the family Rhabdoviridae, causes a severe disease with high mortality in salmonid fish. The nucleotide sequence (11,131 bases) of the entire genome was determined for the pathogenic WRAC strain of IHNV from southern Idaho. This allowed detailed analysis of all 6 genes, the deduced amino acid sequences of their encoded proteins, and important control motifs including leader, trailer and gene junction regions. Sequence analysis revealed that the 6 virus genes are located along the genome in the 3′ to 5′ order: nucleocapsid (N), polymerase-associated phosphoprotein (P or M1), matrix protein (M or M2), surface glycoprotein (G), a unique non-virion protein (NV) and virus polymerase (L). The IHNV genome RNA was found to have highly complementary termini (15 of 16 nucleotides). The gene junction regions display the highly conserved sequence UCURUC(U)7RCCGUG(N)4CACR (in the vRNA sense), which includes the typical rhabdovirus transcription termination/polyadenylation signal and a novel putative transcription initiation signal. Phylogenetic analysis of M, G and L protein sequences allowed insights into the evolutionary and taxonomic relationship of rhabdoviruses of fish relative to those of insects or mammals, and a broader sense of the relationship of non-segmented negative-strand RNA viruses. Based on these data, a new genus, piscivirus, is proposed which will initially contain IHNV, viral hemorrhagic septicemia virus and Hirame rhabdovirus.

  12. Complete plastid genomes from Ophioglossum californicum, Psilotum nudum, and Equisetum hyemale reveal an ancestral land plant genome structure and resolve the position of Equisetales among monilophytes

    Directory of Open Access Journals (Sweden)

    Grewe Felix

    2013-01-01

    Background: Plastid genome structure and content is remarkably conserved in land plants. This widespread conservation has facilitated taxon-rich phylogenetic analyses that have resolved organismal relationships among many land plant groups. However, the relationships among major fern lineages, especially the placement of Equisetales, remain enigmatic. Results: In order to understand the evolution of plastid genomes and to establish phylogenetic relationships among ferns, we sequenced the plastid genomes from three early diverging species: Equisetum hyemale (Equisetales), Ophioglossum californicum (Ophioglossales), and Psilotum nudum (Psilotales). A comparison of fern plastid genomes showed that some lineages have retained inverted repeat (IR) boundaries originating from the common ancestor of land plants, while other lineages have experienced multiple IR changes including expansions and inversions. Genome content has remained stable throughout ferns, except for a few lineage-specific losses of genes and introns. Notably, the losses of the rps16 gene and the rps12i346 intron are shared among Psilotales, Ophioglossales, and Equisetales, while the gain of a mitochondrial atp1 intron is shared between Marattiales and Polypodiopsida. These genomic structural changes support the placement of Equisetales as sister to Ophioglossales + Psilotales and Marattiales as sister to Polypodiopsida. This result is augmented by some molecular phylogenetic analyses that recover the same relationships, whereas others suggest a relationship between Equisetales and Polypodiopsida. Conclusions: Although molecular analyses were inconsistent with respect to the position of Marattiales and Equisetales, several genomic structural changes have for the first time provided a clear placement of these lineages within the ferns. These results further demonstrate the power of using rare genomic structural changes in cases where molecular data fail to provide strong phylogenetic

  13. One bacterial cell, one complete genome.

    Directory of Open Access Journals (Sweden)

    Tanja Woyke

    2010-04-01

    While the bulk of the finished microbial genomes sequenced to date are derived from cultured bacterial and archaeal representatives, the vast majority of microorganisms elude current culturing attempts, severely limiting the ability to recover complete or even partial genomes from these environmental species. Single cell genomics is a novel culture-independent approach, which enables access to the genetic material of an individual cell. No single cell genome has to our knowledge been closed and finished to date. Here we report the completed genome from an uncultured single cell of Candidatus Sulcia muelleri DMIN. Digital PCR on single symbiont cells isolated from the bacteriome of the green sharpshooter Draeculacephala minerva allowed us to assess that this bacterium is polyploid with genome copies ranging from approximately 200-900 per cell, making it a most suitable target for single cell finishing efforts. For single cell shotgun sequencing, an individual Sulcia cell was isolated and whole genome amplified by multiple displacement amplification (MDA). Sanger-based finishing methods allowed us to close the genome. To verify the correctness of our single cell genome and exclude MDA-derived artifacts, we independently shotgun sequenced and assembled the Sulcia genome from pooled bacteriomes using a metagenomic approach, yielding a nearly identical genome. Four variations we detected appear to be genuine biological differences between the two samples. Comparison of the single cell genome with bacteriome metagenomic sequence data detected two single nucleotide polymorphisms (SNPs), indicating extremely low genetic diversity within a Sulcia population. This study demonstrates the power of single cell genomics to generate a complete, high quality, non-composite reference genome within an environmental sample, which can be used for population genetic analyses.

  14. One Bacterial Cell, One Complete Genome

    Energy Technology Data Exchange (ETDEWEB)

    Woyke, Tanja; Tighe, Damon; Mavrommatis, Konstantinos; Clum, Alicia; Copeland, Alex; Schackwitz, Wendy; Lapidus, Alla; Wu, Dongying; McCutcheon, John P.; McDonald, Bradon R.; Moran, Nancy A.; Bristow, James; Cheng, Jan-Fang

    2010-04-26

    While the bulk of the finished microbial genomes sequenced to date are derived from cultured bacterial and archaeal representatives, the vast majority of microorganisms elude current culturing attempts, severely limiting the ability to recover complete or even partial genomes from these environmental species. Single cell genomics is a novel culture-independent approach, which enables access to the genetic material of an individual cell. No single cell genome has to our knowledge been closed and finished to date. Here we report the completed genome from an uncultured single cell of Candidatus Sulcia muelleri DMIN. Digital PCR on single symbiont cells isolated from the bacteriome of the green sharpshooter Draeculacephala minerva allowed us to assess that this bacterium is polyploid with genome copies ranging from approximately 200-900 per cell, making it a most suitable target for single cell finishing efforts. For single cell shotgun sequencing, an individual Sulcia cell was isolated and whole genome amplified by multiple displacement amplification (MDA). Sanger-based finishing methods allowed us to close the genome. To verify the correctness of our single cell genome and exclude MDA-derived artifacts, we independently shotgun sequenced and assembled the Sulcia genome from pooled bacteriomes using a metagenomic approach, yielding a nearly identical genome. Four variations we detected appear to be genuine biological differences between the two samples. Comparison of the single cell genome with bacteriome metagenomic sequence data detected two single nucleotide polymorphisms (SNPs), indicating extremely low genetic diversity within a Sulcia population. This study demonstrates the power of single cell genomics to generate a complete, high quality, non-composite reference genome within an environmental sample, which can be used for population genetic analyses.

  15. Complete mitochondrial genome of a wild Siberian tiger.

    Science.gov (United States)

    Sun, Yujiao; Lu, Taofeng; Sun, Zhaohui; Guan, Weijun; Liu, Zhensheng; Teng, Liwei; Wang, Shuo; Ma, Yuehui

    2015-01-01

    In this study, the complete mitochondrial genome of the Siberian tiger (Panthera tigris altaica) was sequenced, using muscle tissue obtained from a male wild tiger. The total length of the mitochondrial genome is 16,996 bp. The genome structure of this tiger is in accordance with that of other Siberian tigers, and it contains a 12S rRNA gene, a 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes, and 1 control region.

  16. Fine-structured multi-scaling long-range correlations in completely sequenced genomes - features, origin and classification.

    NARCIS (Netherlands)

    T.A. Knoch (Tobias); M. Göcker; R. Lohner (Rudolf); A. Abuseiris (Anis); F.G. Grosveld (Frank)

    2009-01-01

    The sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional organization of genomes is still a largely unresolved problem. Long-range power-law correlations were found using correlation

  17. Complete Genome Sequences of 44 Arthrobacter Phages.

    Science.gov (United States)

    Klyczek, Karen K; Jacobs-Sera, Deborah; Adair, Tamarah L; Adams, Sandra D; Ball, Sarah L; Benjamin, Robert C; Bonilla, J Alfred; Breitenberger, Caroline A; Daniels, Charles J; Gaffney, Bobby L; Harrison, Melinda; Hughes, Lee E; King, Rodney A; Krukonis, Gregory P; Lopez, A Javier; Monsen-Collar, Kirsten; Pizzorno, Marie C; Rinehart, Claire A; Staples, Amanda K; Stowe, Emily L; Garlena, Rebecca A; Russell, Daniel A; Cresawn, Steven G; Pope, Welkin H; Hatfull, Graham F

    2018-02-01

    We report here the complete genome sequences of 44 phages infecting Arthrobacter sp. strain ATCC 21022. These phages have double-stranded DNA genomes with sizes ranging from 15,680 to 70,707 bp and G+C contents from 45.1% to 68.5%. All three tail types (belonging to the families Siphoviridae, Myoviridae, and Podoviridae) are represented. Copyright © 2018 Klyczek et al.

  18. eGenomics: Cataloguing Our Complete Genome Collection III

    Directory of Open Access Journals (Sweden)

    Dawn Field

    2007-01-01

    This meeting report summarizes the proceedings of the “eGenomics: Cataloguing our Complete Genome Collection III” workshop held September 11–13, 2006, at the National Institute for Environmental eScience (NIEeS), Cambridge, United Kingdom. This 3rd workshop of the Genomic Standards Consortium was divided into two parts. The first half of the three-day workshop was dedicated to reviewing the genomic diversity of our current and future genome and metagenome collection, and exploring linkages to a series of existing projects through formal presentations. The second half was dedicated to strategic discussions. Outcomes of the workshop include a revised “Minimum Information about a Genome Sequence” (MIGS) specification (v1.1), consensus on a variety of features to be added to the Genome Catalogue (GCat), agreement by several researchers to adopt MIGS for imminent genome publications, and an agreement by the EBI and NCBI to input their genome collections into GCat for the purpose of quantifying the amount of optional data already available (e.g., for geographic location coordinates) and working towards a single, global list of all public genomes and metagenomes.

  19. Complete plastid genome sequence of Primula sinensis (Primulaceae): structure comparison, sequence variation and evidence for accD transfer to nucleus

    Directory of Open Access Journals (Sweden)

    Tong-Jian Liu

    2016-06-01

    The species-rich genus Primula L. is a typical plant group with which to understand genetic variance between species at different levels of relationship. Chloroplast genome sequences are used as an information resource for quantifying this difference and reconstructing evolutionary history. In this study, we report the complete chloroplast genome sequence of Primula sinensis and compare it with those of related species. The chloroplast genome shows a typical circular quadripartite structure, 150,859 bp in length with a GC content of 37.2%. Two inverted repeat regions (25,535 bp each) are separated by a large single-copy region (82,064 bp) and a small single-copy region (17,725 bp). The genome consists of 112 genes, including 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Among them, seven coding genes, seven tRNA genes and four rRNA genes have two copies due to their locations in the IR regions. The accD and infA genes, which lack intact open reading frames (ORFs), were identified as pseudogenes. SSR and sequence variation analyses were also performed on the plastome of Primula sinensis, in comparison with another available plastome, that of P. poissonii. The four most variable regions, rpl36–rps8, rps16–trnQ, trnH–psbA and ndhC–trnV, were identified. Phylogenetic relationship estimates using three sub-datasets extracted from a matrix of 57 protein-coding gene sequences gave identical results that were consistent with previous studies. A transcript found in the P. sinensis transcriptome showed high similarity to the plastid accD functional region, and a putative plastid transit peptide was identified at its N-terminal region. The result strongly suggests that plastid accD has been functionally transferred to the nucleus in P. sinensis.
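
    The figures in this abstract can be checked with simple arithmetic: in a quadripartite plastome the total length is the large single-copy region plus the small single-copy region plus two copies of the inverted repeat. The short sketch below applies that sum to the values quoted for Primula sinensis; the function name `quadripartite_length` is a hypothetical helper introduced only for illustration.

```python
def quadripartite_length(lsc, ssc, ir):
    """Total length of a circular plastome with one LSC, one SSC and two IR copies (bp)."""
    return lsc + ssc + 2 * ir


# Values quoted in the Primula sinensis abstract (all in bp).
reported_total = 150_859
computed_total = quadripartite_length(lsc=82_064, ssc=17_725, ir=25_535)

# Gene classes quoted in the same abstract: protein-coding, tRNA and rRNA genes.
gene_total = 78 + 30 + 4

print(computed_total, computed_total == reported_total)
print(gene_total)
```

    Both checks agree with the abstract: the regions sum to 150,859 bp and the gene classes sum to 112.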

  20. The complete chloroplast genome sequence of Abies nephrolepis (Pinaceae: Abietoideae)

    Directory of Open Access Journals (Sweden)

    Dong-Keun Yi

    2016-06-01

    The plant chloroplast (cp) genome has maintained a relatively conserved structure and gene content throughout evolution. Cp genome sequences have been used widely for resolving evolutionary and phylogenetic issues at various taxonomic levels of plants. Here, we report the complete cp genome of Abies nephrolepis. The A. nephrolepis cp genome is 121,336 base pairs (bp) in length, including a pair of short inverted repeat regions (IRa and IRb) of 139 bp each, separated by a small single-copy (SSC) region of 54,323 bp and a large single-copy (LSC) region of 66,735 bp. It contains 114 genes: 68 protein-coding genes, 35 tRNA genes, four rRNA genes, six open reading frames, and one pseudogene. Seventeen repeat units and 64 simple sequence repeats (SSRs) were detected in the A. nephrolepis cp genome. Large IR sequences are located at the 42-kb inversion points (1186 bp). The A. nephrolepis cp genome is identical to that of Abies koreana, a closely related taxon. Pairwise comparison of the two cp genomes revealed 140 polymorphic sites in each. The complete cp genome sequence of A. nephrolepis has significant potential to provide information on the evolutionary pattern of Abietoideae and valuable data for the development of DNA markers for easy identification and classification.

  1. Complete Mitochondrial Genome of the Medicinal Mushroom Ganoderma lucidum

    Science.gov (United States)

    Chen, Haimei; Chen, Xiangdong; Lan, Jin; Liu, Chang

    2013-01-01

    Ganoderma lucidum is one of the well-known medicinal basidiomycetes worldwide. The mitochondrion, referred to as the second genome, is an organelle found in most eukaryotic cells and participates in critical cellular functions. Elucidating the structure and function of this genome is important to understand completely the genetic contents of G. lucidum. In this study, we assembled the mitochondrial genome of G. lucidum and analyzed the differential expressions of its encoded genes across three developmental stages. The mitochondrial genome is a typical circular DNA molecule of 60,630 bp with a GC content of 26.67%. Genome annotation identified genes that encode 15 conserved proteins, 27 tRNAs, small and large rRNAs, four homing endonucleases, and two hypothetical proteins. Except for genes encoding trnW and two hypothetical proteins, all genes were located on the positive strand. For the repeat structure analysis, eight forward, two inverted, and three tandem repeats were detected. A pair of fragments with a total length around 5.5 kb was found in both the nuclear and mitochondrial genomes, which suggests the possible transfer of DNA sequences between two genomes. RNA-Seq data for samples derived from three stages, namely, mycelia, primordia, and fruiting bodies, were mapped to the mitochondrial genome and qualified. The protein-coding genes were expressed higher in mycelia or primordial stages compared with those in the fruiting bodies. The rRNA abundances were significantly higher in all three stages. Two regions were transcribed but did not contain any identified protein or tRNA genes. Furthermore, three RNA-editing sites were detected. Genome synteny analysis showed that significant genome rearrangements occurred in the mitochondrial genomes. This study provides valuable information on the gene contents of the mitochondrial genome and their differential expressions at various developmental stages of G. lucidum. 
The results contribute to the understanding of the

  2. Complete mitochondrial genome of the Loligo opalescence.

    Science.gov (United States)

    Jiang, Lihua; Liu, Wei; Zhu, Aiyi; Zhang, Jianshe; Wu, Changwen

    2016-09-01

    In this study, we determined the complete mitochondrial genome of the Loligo opalescence. The genome was 17,370 bp in length and contained 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 3 main non-coding regions. The composition and order of genes were similar to those of most other invertebrates. The overall base composition of L. opalescence is A 38.62%, C 19.40%, T 32.37% and G 9.61%, with a high A + T bias of 70.99%. All three control regions (CR) contain termination-associated sequences and conserved sequence blocks. These mitogenome sequence data should play an important role in the investigation of the phylogenetic relationships, taxonomic resolution and phylogeography of the Loliginidae.
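
    The A + T bias quoted in this abstract follows directly from the four base percentages. A small Python sketch of the arithmetic is given below; the variable names are illustrative, and the values are copied from the abstract.

```python
# Base composition (%) as reported for the L. opalescence mitogenome.
composition = {"A": 38.62, "C": 19.40, "T": 32.37, "G": 9.61}

# Round to two decimals to avoid floating-point noise in the sums.
at_content = round(composition["A"] + composition["T"], 2)
gc_content = round(composition["G"] + composition["C"], 2)
total = round(sum(composition.values()), 2)

print(at_content, gc_content, total)
```

    The A + T sum reproduces the reported 70.99%, and the four percentages add up to 100%.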

  3. Analysis of the complete DNA sequence of the temperate bacteriophage TP901-1: Evolution, structure, and genome organization of lactococcal bacteriophages

    DEFF Research Database (Denmark)

    Brøndsted, Lone; Østergaard, Solvej; Pedersen, Margit

    2001-01-01

    A complete analysis of the entire genome of the temperate lactococcal bacteriophage TP901-1 has been performed and the function of 21 of 56 TP901-1-encoded ORFs has been assigned. This knowledge has been used to propose 10 functional modules each responsible for specific functions during...

  4. The complete chloroplast genome sequence of Hibiscus syriacus.

    Science.gov (United States)

    Kwon, Hae-Yun; Kim, Joon-Hyeok; Kim, Sea-Hyun; Park, Ji-Min; Lee, Hyoshin

    2016-09-01

    The complete chloroplast genome sequence of Hibiscus syriacus L. is presented in this study. The genome is 161 019 bp in length, with a typical circular structure containing a pair of inverted repeats of 25 745 bp each, separated by a large single-copy region of 89 698 bp and a small single-copy region of 19 831 bp. The overall GC content is 36.8%. One hundred and fourteen genes were annotated, including 81 protein-coding genes, 4 ribosomal RNA genes and 29 transfer RNA genes.

  5. Structure completion for facade layouts

    KAUST Repository

    Fan, Lubin

    2014-11-18

    We present a method to complete missing structures in facade layouts. Starting from an abstraction of the partially observed layout as a set of shapes, we can propose one or multiple possible completed layouts. Structure completion with large missing parts is an ill-posed problem. Therefore, we combine two sources of information to derive our solution: the observed shapes and a database of complete layouts. The problem is also very difficult, because shape positions and attributes have to be estimated jointly. Our proposed solution is to break the problem into two components: a statistical model to evaluate layouts and a planning algorithm to generate candidate layouts. This ensures that the completed result is consistent with the observation and the layouts in the database.

  6. Two complete chloroplast genome sequences of Cannabis sativa varieties.

    Science.gov (United States)

    Oh, Hyehyun; Seo, Boyoung; Lee, Seunghwan; Ahn, Dong-Ha; Jo, Euna; Park, Jin-Kyoung; Min, Gi-Sik

    2016-07-01

    In this study, we determined the complete chloroplast (cp) genomes from two varieties of Cannabis sativa. The genome sizes were 153,848 bp (the Korean non-drug variety, Cheungsam) and 153,854 bp (the African variety, Yoruba Nigeria). The genome structures were identical, with 131 individual genes [86 protein-coding genes (PCGs), eight rRNA, and 37 tRNA genes]. Further, except for the presence of an intron in the rps3 genes of the two C. sativa varieties, the cp genomes of C. sativa had conserved features similar to those of all known species in the order Rosales. To verify the position of C. sativa within the order Rosales, we conducted a phylogenetic analysis using concatenated sequences of all PCGs from 17 complete cp genomes. The resulting tree strongly supported the monophyly of Rosales. Further, the family Cannabaceae, represented by C. sativa, showed a close relationship with the family Moraceae. The phylogenetic relationship outlined in our study is congruent with those previously shown for the order Rosales.

  7. Discovering Complete Quasispecies In Bacterial Genomes

    OpenAIRE

    Bertels, Frederic; Gokhale, Chaitanya; Traulsen, Arne

    2017-01-01

    Mobile genetic elements can be found in almost all genomes. Possibly the most common nonautonomous mobile genetic elements in bacteria are repetitive extragenic palindromic doublets forming hairpins (REPINs) that can occur hundreds of times within a genome. The sum of all REPINs in a genome can be viewed as an evolving population because REPINs replicate and mutate. In contrast to most other biological populations, we know the exact composition of the REPIN population and the sequence of each...

  8. Complete Genome Sequence of Staphylococcus epidermidis 1457.

    Science.gov (United States)

    Galac, Madeline R; Stam, Jason; Maybank, Rosslyn; Hinkle, Mary; Mack, Dietrich; Rohde, Holger; Roth, Amanda L; Fey, Paul D

    2017-06-01

    Staphylococcus epidermidis 1457 is a frequently utilized strain that is amenable to genetic manipulation and has been widely used for biofilm-related research. We report here the whole-genome sequence of this strain, which encodes 2,277 protein-coding genes and 81 RNAs within its 2.4-Mb genome and plasmid.

  9. Structural Completeness in Fuzzy Logics

    Czech Academy of Sciences Publication Activity Database

    Cintula, Petr; Metcalfe, G.

    2009-01-01

    Vol. 50, No. 2 (2009), pp. 153-183. ISSN 0029-4527. R&D Projects: GA MŠk(CZ) 1M0545. Institutional research plan: CEZ:AV0Z10300504. Keywords: structral logics * fuzzy logics * structural completeness * admissible rules * primitive variety * residuated lattices. Subject RIV: BA - General Mathematics

  10. Complete sequence of the mitochondrial genome of ...

    Indian Academy of Sciences (India)

    products were purified using the DNA Gel Extraction Kit. (Tiangen, Shanghai, China). The purified products obtained ..... Base composition of O. rubicundus mitochondrial genome. .... the help of fish sampled and identified by morphology.

  11. The Complete Chloroplast Genome Sequences of Six Rehmannia Species

    Directory of Open Access Journals (Sweden)

    Shuyun Zeng

    2017-03-01

    Rehmannia is a non-parasitic genus in Orobanchaceae comprising six species mainly distributed in central and north China. Its phylogenetic position and infrageneric relationships remain uncertain due to potential hybridization and polyploidization. In this study, we sequenced and compared the complete chloroplast genomes of six Rehmannia species using Illumina sequencing technology to elucidate the interspecific variations. Rehmannia plastomes exhibited typical quadripartite and circular structures with good synteny of gene order. The complete genomes ranged from 153,622 bp to 154,055 bp in length, including 133 genes encoding 88 proteins, 37 tRNAs, and 8 rRNAs. Three genes (rpoA, rpoC2, accD) have potentially experienced positive selection. Plastome size variation in Rehmannia was mainly ascribed to the expansion and contraction of the border regions between the inverted repeat (IR) region and the single-copy (SC) regions. Despite the conserved structure of Rehmannia plastomes, sequence variations provide useful phylogenetic information. Phylogenetic trees of 23 Lamiales species reconstructed with the complete plastomes suggested that Rehmannia was monophyletic and sister to the clade of Lindenbergia and the parasitic taxa in Orobanchaceae. The interspecific relationships within Rehmannia differed completely from those in previous studies. In the future, population phylogenomic work based on plastomes is urgently needed to clarify the evolutionary history of Rehmannia.

  12. Complete genome sequence of Ikoma lyssavirus.

    Science.gov (United States)

    Marston, Denise A; Ellis, Richard J; Horton, Daniel L; Kuzmin, Ivan V; Wise, Emma L; McElhinney, Lorraine M; Banyard, Ashley C; Ngeleja, Chanasa; Keyyu, Julius; Cleaveland, Sarah; Lembo, Tiziana; Rupprecht, Charles E; Fooks, Anthony R

    2012-09-01

    Lyssaviruses (family Rhabdoviridae) constitute one of the most important groups of viral zoonoses globally. All lyssaviruses cause the disease rabies, an acute progressive encephalitis for which, once symptoms occur, there is no effective cure. Currently available vaccines are highly protective against the predominantly circulating lyssavirus species. Using next-generation sequencing technologies, we have obtained the whole-genome sequence for a novel lyssavirus, Ikoma lyssavirus (IKOV), isolated from an African civet in Tanzania displaying clinical signs of rabies. Genetically, this virus is the most divergent within the genus Lyssavirus. Characterization of the genome will help to improve our understanding of lyssavirus diversity and enable investigation into vaccine-induced immunity and protection.

  13. Complete Genome Sequence of Ikoma Lyssavirus

    OpenAIRE

    Marston, Denise A.; Ellis, Richard J.; Horton, Daniel L.; Kuzmin, Ivan V.; Wise, Emma L.; McElhinney, Lorraine M.; Banyard, Ashley C.; Ngeleja, Chanasa; Keyyu, Julius; Cleaveland, Sarah; Lembo, Tiziana; Rupprecht, Charles E.; Fooks, Anthony R.

    2012-01-01

    Lyssaviruses (family Rhabdoviridae) constitute one of the most important groups of viral zoonoses globally. All lyssaviruses cause the disease rabies, an acute progressive encephalitis for which, once symptoms occur, there is no effective cure. Currently available vaccines are highly protective against the predominantly circulating lyssavirus species. Using next-generation sequencing technologies, we have obtained the whole-genome sequence for a novel lyssavirus, Ikoma lyssavirus (IKOV), isol...

  14. The Complete Chloroplast Genome of Catha edulis: A Comparative Analysis of Genome Features with Related Species

    Directory of Open Access Journals (Sweden)

    Cuihua Gu

    2018-02-01

    Qat (Catha edulis, Celastraceae) is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp) genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae.

  15. The Complete Chloroplast Genome of Catha edulis: A Comparative Analysis of Genome Features with Related Species

    Science.gov (United States)

    Tembrock, Luke R.; Zheng, Shaoyu; Wu, Zhiqiang

    2018-01-01

    Qat (Catha edulis, Celastraceae) is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp) genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae. PMID:29425128
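
    The gene counts in this abstract can be tallied to confirm that they are internally consistent. The Python sketch below uses only the numbers quoted in the abstract; the dictionary and variable names are illustrative.

```python
# Gene counts as reported for the Catha edulis cp genome.
genes = {"protein_coding": 84, "tRNA": 37, "rRNA": 8}

# Genes reported as duplicated in the inverted repeat regions.
duplicated_in_ir = {"protein_coding": 6, "tRNA": 7, "rRNA": 4}

total_genes = sum(genes.values())            # all annotated genes
duplicated = sum(duplicated_in_ir.values())  # genes with two copies
single_copy = total_genes - duplicated       # remaining single-copy genes

print(total_genes, duplicated, single_copy)  # 129 17 112
```

    The three totals match the 129 genes, 17 duplicated genes and 112 single-copy genes stated in the abstract.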

  16. Complete genome sequence of Gordonia bronchialis type strain (3410T)

    Energy Technology Data Exchange (ETDEWEB)

    Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Jando, Marlen [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Copeland, A [U.S. Department of Energy, Joint Genome Institute; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute; Chen, Feng [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Ovchinnikova, Galina [U.S. Department of Energy, Joint Genome Institute; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute; Chen, Amy [U.S. Department of Energy, Joint Genome Institute; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Chang, Yun-Juan [ORNL; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL); Chain, Patrick S. G. [Lawrence Livermore National Laboratory (LLNL); Saunders, Elizabeth H [Los Alamos National Laboratory (LANL); Han, Cliff [Los Alamos National Laboratory (LANL); Detter, J C [U.S. Department of Energy, Joint Genome Institute; Brettin, Thomas S [ORNL; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Bristow, James [U.S. Department of Energy, Joint Genome Institute; Eisen, Jonathan [U.S. 
Department of Energy, Joint Genome Institute; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute

    2010-01-01

    Gordonia bronchialis Tsukamura 1971 is the type species of the genus. G. bronchialis is a human-pathogenic organism that has been isolated from a large variety of human tissues. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first completed genome sequence of the family Gordoniaceae. The 5,290,012 bp long genome with its 4,944 protein-coding and 55 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  17. Complete genome sequence of Acidimicrobium ferrooxidans type strain (ICPT)

    Energy Technology Data Exchange (ETDEWEB)

    Clum, Alicia; Nolan, Matt; Lang, Elke; Glavina Del Rio, Tijana; Tice, Hope; Copeland, Alex; Cheng, Jan-Fang; Lucas, Susan; Chen, Feng; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ivanova, Natalia; Mavrommatis, Konstantinos; Mikhailova, Natalia; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Goker, Markus; Spring, Stefan; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia C.; Chain, Patrick; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter; Lapidus, Alla

    2009-05-20

    Acidimicrobium ferrooxidans (Clark and Norris 1996) is the sole and type species of the genus, which until recently was the only genus within the actinobacterial family Acidimicrobiaceae and in the order Acidomicrobiales. Rapid oxidation of iron pyrite during autotrophic growth in the absence of an enhanced CO2 concentration is characteristic for A. ferrooxidans. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of the order Acidomicrobiales, and the 2,158,157 bp long single replicon genome with its 2038 protein coding and 54 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  18. Using nanopore sequencing to get complete genomes from complex samples

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Nielsen, Per Halkjær

    The advantages of “next generation sequencing” have come at the cost of genome finishing. The dominant sequencing technology provides short reads of 150-300 bp, which has made genome assembly very difficult as the reads do not span important repeat regions. Genomes have thus been added...... to the databases as fragmented assemblies and not as finished contigs that resemble the chromosomes in which the DNA is organised within the cells. This is especially troublesome for genomes derived from complex metagenome sequencing. Databases with incomplete genomes can lead to false conclusions about...... the absence of genes and functional predictions of the organisms. Furthermore, it is common that repetitive elements and marker genes such as the 16S rRNA gene are missing completely from these genome bins. Using nanopore long reads, we demonstrate that it is possible to span these regions and make complete...

  19. Complete Genome Sequence of the Human Gut Symbiont Roseburia hominis

    DEFF Research Database (Denmark)

    Travis, Anthony J.; Kelly, Denise; Flint, Harry J

    2015-01-01

    We report here the complete genome sequence of the human gut symbiont Roseburia hominis A2-183(T) (= DSM 16839(T) = NCIMB 14029(T)), isolated from human feces. The genome is represented by a 3,592,125-bp chromosome with 3,405 coding sequences. A number of potential functions contributing to host...

  20. Complete genome sequence of pronghorn virus, a pestivirus

    Science.gov (United States)

    The complete genome sequence of Pronghorn virus, a member of the Pestivirus genus of the Flaviviridae, was determined. The virus, originally isolated from a pronghorn antelope, had a genome of 12,287 nucleotides with a single open reading frame of 11,694 bases encoding 3898 amino acids....
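
    The relationship between the open reading frame length and the encoded protein length in this abstract is a division by three, since each codon is three nucleotides long. The Python sketch below illustrates it; the function name `codons_in_orf` is hypothetical, and whether the reported ORF length includes a stop codon is not stated in the abstract.

```python
def codons_in_orf(orf_length_nt: int) -> int:
    """Number of codons in an ORF whose length is a multiple of three."""
    if orf_length_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3 nucleotides")
    return orf_length_nt // 3


genome_length = 12_287  # nucleotides, as reported
orf_length = 11_694     # bases, as reported

print(codons_in_orf(orf_length))   # 3898, matching the reported amino acid count
print(genome_length - orf_length)  # 593 nucleotides lie outside the ORF
```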

  1. Utilization of complete chloroplast genomes for phylogenetic studies

    NARCIS (Netherlands)

    Ramlee, Shairul Izan Binti

    2016-01-01

    Chloroplast DNA sequence polymorphisms are a primary source of data in many plant phylogenetic studies. The chloroplast genome is relatively conserved in its evolution making it an ideal molecule to retain phylogenetic signals. The chloroplast genome is also largely, but not completely, free from

  2. Complete mitochondrial genome of threatened mahseer Tor tor ...

    Indian Academy of Sciences (India)

    A.

    In the present study, complete mitochondrial genome of Tor tor has been sequenced .... Most of the genes were encoded on the heavy strand (H- strand), whereas only .... 4 bp in the DHU stem (figure 5 in electronic supplementary material).

  3. Complete Genome of Stachybotrys chartarum strain 51-11

    Data.gov (United States)

    U.S. Environmental Protection Agency — Complete genome sequence of the fungus Stachybotrys chartarum. Sequences can be used to identify genes, genetic pathways, gene clusters, genetic organization, etc....

  4. Complete genome sequences of six strains of the genus Methylobacterium

    Energy Technology Data Exchange (ETDEWEB)

    Marx, Christopher J [Harvard University; Bringel, Francoise O. [University of Strasbourg; Christoserdova, Ludmila [University of Washington, Seattle; Moulin, Lionel [UMR, France; Farhan Ul Haque, Muhammad [CNRS, Strasbourg, France; Fleischman, Darrell E. [Wright State University, Dayton, OH; Gruffaz, Christelle [CNRS, Strasbourg, France; Jourand, Philippe [UMR, France; Knief, Claudia [ETH Zurich, Switzerland; Lee, Ming-Chun [Harvard University; Muller, Emilie E. L. [CNRS, Strasbourg, France; Nadalig, Thierry [CNRS, Strasbourg, France; Peyraud, Remi [ETH Zurich, Switzerland; Roselli, Sandro [CNRS, Strasbourg, France; Russ, Lina [ETH Zurich, Switzerland; Aguero, Fernan [Universidad Nacional de General San Martin; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Lajus, Aurelie [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Land, Miriam L [ORNL; Medigue, Claudine [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Stolyar, Sergey [University of Washington; Vorholt, Julia A. [ETH Zurich, Switzerland; Vuilleumier, Stephane [University of Strasbourg

    2012-01-01

    The complete and assembled genome sequences were determined for six strains of the alphaproteobacterial genus Methylobacterium, chosen for their key adaptations to different plant-associated niches and environmental constraints.

  5. Complete Genome Sequences of Six Strains of the Genus Methylobacterium

    Energy Technology Data Exchange (ETDEWEB)

    Marx, Christopher J [Harvard University; Bringel, Francoise O. [University of Strasbourg; Christoserdova, Ludmila [University of Washington, Seattle; Moulin, Lionel [UMR, France; UI Hague, Muhammad Farhan [University of Strasbourg; Fleischman, Darrell E. [Wright State University, Dayton, OH; Gruffaz, Christelle [CNRS, Strasbourg, France; Jourand, Philippe [UMR, France; Knief, Claudia [ETH Zurich, Switzerland; Lee, Ming-Chun [Harvard University; Muller, Emilie E. L. [CNRS, Strasbourg, France; Nadalig, Thierry [CNRS, Strasbourg, France; Peyraud, Remi [ETH Zurich, Switzerland; Roselli, Sandro [CNRS, Strasbourg, France; Russ, Lina [ETH Zurich, Switzerland; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Ivanov, Pavel S. [University of Wyoming, Laramie; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Lajus, Aurelie [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Land, Miriam L [ORNL; Medigue, Claudine [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Stolyar, Sergey [University of Washington; Vorholt, Julia A. [ETH Zurich, Switzerland; Vuilleumier, Stephane [University of Strasbourg

    2012-01-01

    The complete and assembled genome sequences were determined for six strains of the alphaproteobacterial genus Methylobacterium, chosen for their key adaptations to different plant-associated niches and environmental constraints.

  6. Structure completion for facade layouts

    KAUST Repository

    Fan, Lubin; Musialski, Przemyslaw; Liu, Ligang; Wonka, Peter

    2014-01-01

    completion with large missing parts is an ill-posed problem. Therefore, we combine two sources of information to derive our solution: the observed shapes and a database of complete layouts. The problem is also very difficult, because shape positions

  7. Getting complete genomes from complex samples using nanopore sequencing

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Albertsen, Mads

    Short read sequencing and metagenomic binning workflows have made it possible to extract bacterial genome bins from environmental microbial samples containing hundreds to thousands of different species. However, these genome bins often do not represent complete genomes, as they are mostly...... fragmented, incomplete and often contaminated with foreign DNA, and there are no robust strategies to validate their quality. These `draft genomes` have limited lasting value to the scientific community, as gene synteny is broken and it is uncertain what is missing. The genetic material most often...... missed is important multi-copy and/or conserved marker genes such as the 16S rRNA gene, as sequence micro-heterogeneity prevents assembly of these genes in the de novo assembly. We demonstrate that using nanopore long reads it is now possible to overcome these issues and make complete genomes from...

  8. The complete chloroplast genome of the Dendrobium strongylanthum (Orchidaceae: Epidendroideae).

    Science.gov (United States)

    Li, Jing; Chen, Chen; Wang, Zhe-Zhi

    2016-07-01

    Complete chloroplast genome sequences are very useful for studying the phylogeny and evolution of species. In this study, the complete chloroplast genome of Dendrobium strongylanthum was constructed from whole-genome Illumina sequencing data. The chloroplast genome is 153 058 bp in length with 37.6% GC content and contains two inverted repeats (IRs) of 26 316 bp each. The IR regions are separated by a large single-copy (LSC, 85 836 bp) region and a small single-copy (SSC, 14 590 bp) region. A total of 130 chloroplast genes were successfully annotated, including 84 protein-coding genes, 38 tRNA genes, and eight rRNA genes. Phylogenetic analyses showed that the chloroplast genome of Dendrobium strongylanthum is related to that of Dendrobium officinale.

  9. Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis.

    Science.gov (United States)

    Fu, Jianmin; Liu, Huimin; Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng

    2016-01-01

    Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales.

  10. Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis.

    Directory of Open Access Journals (Sweden)

    Jianmin Fu

    Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales.

  11. Complete chloroplast genome sequence of Elodea canadensis and comparative analyses with other monocot plastid genomes.

    Science.gov (United States)

    Huotari, Tea; Korpelainen, Helena

    2012-10-15

    Elodea canadensis is an aquatic angiosperm native to North America. It has attracted great attention due to its invasive nature when transported to new areas in its non-native range. We have determined the complete nucleotide sequence of the chloroplast (cp) genome of Elodea. Taxonomically Elodea is a basal monocot, and only a few monocot cp genomes representing early lineages of monocots have been sequenced so far. The genome is a circular double-stranded DNA molecule 156,700 bp in length, and has a typical structure with large (LSC 86,194 bp) and small (SSC 17,810 bp) single-copy regions separated by a pair of inverted repeats (IRs 26,348 bp each). The Elodea cp genome contains 113 unique genes and 16 duplicated genes in the IR regions. A comparative analysis showed that the gene order and organization of the Elodea cp genome is almost identical to that of Amborella trichopoda, a basal angiosperm. The structure of IRs in Elodea is unique among monocot species with the whole cp genome sequenced. In Elodea and another monocot, Lemna minor, the borders between IRs and LSC are located upstream of the rps19 gene and downstream of the trnH-GUG gene, while in most monocots, IR has extended to include both trnH and rps19 genes. A phylogenetic analysis conducted using a Bayesian method, based on the DNA sequences of 81 chloroplast genes from 17 monocot taxa, provided support for the placement of Elodea together with Lemna as a basal monocot and the next diverging lineage of monocots after Acorales. In comparison with other monocots, the Elodea cp genome has gone through only a few rearrangements or gene losses. IR of Elodea has a unique structure among the monocot species studied so far as its structure is similar to that of a basal angiosperm Amborella. This result together with phylogenetic analyses supports the placement of Elodea as a basal monocot to the next diverging lineage of monocots after Acorales.

  12. Complete mitochondrial genome of a Asian lion (Panthera leo goojratensis).

    Science.gov (United States)

    Li, Yu-Fei; Wang, Qiang; Zhao, Jian-ning

    2016-01-01

    The entire mitochondrial genome of this Asian lion (Panthera leo goojratensis) was 17,183 bp in length. Its gene composition and arrangement conformed to those of other lions, with the typical structure of 22 tRNAs, 2 rRNAs, 13 protein-coding genes and a non-coding region. The characteristics of the mitochondrial genome were analyzed in detail.

  13. Complete mitochondrial genome of Eruca sativa Mill. (Garden rocket).

    Directory of Open Access Journals (Sweden)

    Yankun Wang

    Eruca sativa (Cruciferae family) is an ancient crop of great economic and agronomic importance. Here, the complete mitochondrial genome of Eruca sativa was sequenced and annotated. The circular molecule is 247,696 bp long, with a G+C content of 45.07%, containing 33 protein-coding genes, three rRNA genes, and 18 tRNA genes. The Eruca sativa mitochondrial genome may be divided into six master circles and four subgenomic molecules via three pairwise large repeats, resulting in a more dynamic structure of the Eruca sativa mtDNA compared with other cruciferous mitotypes. Comparison with the Brassica napus mtDNA revealed that most of the genes with known function are conserved between these two mitotypes except for the ccmFN2 and rrn18 genes, and 27 point mutations were scattered in the 14 protein-coding genes. Evolutionary relationships analysis suggested that Eruca sativa is more closely related to the Brassica species and to Raphanus sativus than to Arabidopsis thaliana.

  14. The Complete Chloroplast and Mitochondrial Genome Sequences of Boea hygrometrica: Insights into the Evolution of Plant Organellar Genomes

    Science.gov (United States)

    Wang, Xumin; Deng, Xin; Zhang, Xiaowei; Hu, Songnian; Yu, Jun

    2012-01-01

    The complete nucleotide sequences of the chloroplast (cp) and mitochondrial (mt) genomes of resurrection plant Boea hygrometrica (Bh, Gesneriaceae) have been determined, with lengths of 153,493 bp and 510,519 bp, respectively. The smaller chloroplast genome contains more genes (147) with a 72% coding sequence, and the larger mitochondrial genome has fewer genes (65) with a coding fraction of 12%. Similar to other seed plants, the Bh cp genome has a typical quadripartite organization with a conserved gene in each region. The Bh mt genome has three recombinant sequence repeats of 222 bp, 843 bp, and 1474 bp in length, which divide the genome into a single master circle (MC) and four isomeric molecules. Compared to other angiosperms, one remarkable feature of the Bh mt genome is the frequent transfer of genetic material from the cp genome during recent Bh evolution. We also analyzed organellar genome evolution in general regarding genome features as well as compositional dynamics of sequence and gene structure/organization, providing clues for the understanding of the evolution of organellar genomes in plants. The cp-derived sequences including tRNAs found in angiosperm mt genomes support the conclusion that frequent gene transfer events may have begun early in the land plant lineage. PMID:22291979

  15. Complete genome sequence of Nakamurella multipartita type strain (Y-104).

    Science.gov (United States)

    Tice, Hope; Mayilraj, Shanmugam; Sims, David; Lapidus, Alla; Nolan, Matt; Lucas, Susan; Glavina Del Rio, Tijana; Copeland, Alex; Cheng, Jan-Fang; Meincke, Linda; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ivanova, Natalia; Mavromatis, Konstantinos; Ovchinnikova, Galina; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jeffries, Cynthia D; Detter, John C; Brettin, Thomas; Rohde, Manfred; Göker, Markus; Bristow, Jim; Eisen, Jonathan A; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C; Klenk, Hans-Peter; Chen, Feng

    2010-03-30

    Nakamurella multipartita (Yoshimi et al. 1996) Tao et al. 2004 is the type species of the monospecific genus Nakamurella in the actinobacterial suborder Frankineae. The nonmotile, coccus-shaped strain was isolated from activated sludge acclimated with sugar-containing synthetic wastewater, and is capable of accumulating large amounts of polysaccharides in its cells. Here we describe the features of the organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of a member of the family Nakamurellaceae. The 6,060,298 bp long single replicon genome with its 5415 protein-coding and 56 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  16. Getting complete genomes from complex samples using nanopore sequencing

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Albertsen, Mads

    Background: Short read DNA sequencing and metagenomic binning workflows have made it possible to extract bacterial genome bins from environmental microbial samples containing hundreds to thousands of different species. However, these genome bins often do not represent complete genomes, as they are mostly fragmented, incomplete and often contaminated with foreign DNA. These `draft genomes` have limited lasting value to the scientific community, as gene synteny is broken and there is some uncertainty of what is missing [1]. The genetic material most often missed is important multi-copy and/or conserved marker genes such as the 16S rRNA gene, as sequence micro-heterogeneity prevents assembly of these genes in the de novo assembly. However, long read sequencing technologies are emerging, promising an end to fragmented genome assemblies [2]. Experimental design: We extracted DNA from a full...

  17. Complete genome sequence of the myxobacterium Sorangium cellulosum

    DEFF Research Database (Denmark)

    Schneiker, S; Perlova, O; Kaiser, O

    2007-01-01

    The genus Sorangium synthesizes approximately half of the secondary metabolites isolated from myxobacteria, including the anti-cancer metabolite epothilone. We report the complete genome sequence of the model Sorangium strain S. cellulosum Soce56, which produces several natural products and has morphological and physiological properties typical of the genus. The circular genome, comprising 13,033,779 base pairs, is the largest bacterial genome sequenced to date. No global synteny with the genome of Myxococcus xanthus is apparent, revealing an unanticipated level of divergence between these myxobacteria. A large percentage of the genome is devoted to regulation, particularly post-translational phosphorylation, which probably supports the strain's complex, social lifestyle. This regulatory network includes the highest number of eukaryotic protein kinase-like kinases discovered in any organism...

  18. The complete chloroplast genomes of Cannabis sativa and Humulus lupulus.

    Science.gov (United States)

    Vergara, Daniela; White, Kristin H; Keepers, Kyle G; Kane, Nolan C

    2016-09-01

    Cannabis and Humulus are sister genera comprising the entirety of the Cannabaceae sensu stricto, including C. sativa L. (marijuana, hemp), and H. lupulus L. (hops) as two economically important crops. These two plants have been used by humans for many purposes including as a fiber, food, medicine, or inebriant in the case of C. sativa, and as a flavoring component in beer brewing in the case of H. lupulus. In this study, we report the complete chloroplast genomes for two distinct hemp varieties of C. sativa, Italian "Carmagnola" and Russian "Dagestani", and one Czech variety of H. lupulus "Saazer". Both C. sativa genomes are 153 871 bp in length, while the H. lupulus genome is 153 751 bp. The genomes from the two C. sativa varieties differ in 16 single nucleotide polymorphisms (SNPs), while the H. lupulus genome differs in 1722 SNPs from both C. sativa cultivars.
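
    To illustrate what a SNP count between two equal-length genomes means in practice, here is a minimal sketch that counts mismatching positions between two pre-aligned sequences. It is not the method used in the cited study; the function `count_snps` and the two short toy sequences are hypothetical, and real plastome comparisons would first require a proper alignment that accounts for insertions and deletions.

```python
def count_snps(seq_a: str, seq_b: str) -> int:
    """Count positions where two aligned, equal-length sequences differ.

    Only positions where both sequences carry an unambiguous base (A, C, G, T)
    are counted, so gaps and ambiguity codes are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    bases = set("ACGT")
    return sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a != b and a in bases and b in bases
    )


# Toy sequences (made up for illustration, not real Cannabis data).
variety_1 = "ACGTACGTAC"
variety_2 = "ACGTTCGTAA"
print(count_snps(variety_1, variety_2))
```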

  19. Complete Genome Sequence of Bifidobacterium bifidum S17

    Science.gov (United States)

    Zhurina, Daria; Zomer, Aldert; Gleinser, Marita; Brancaccio, Vincenco Francesco; Auchter, Marc; Waidmann, Mark S.; Westermann, Christina; van Sinderen, Douwe; Riedel, Christian U.

    2011-01-01

    Here, we report on the first completely annotated genome sequence of a Bifidobacterium bifidum strain. B. bifidum S17, isolated from feces of a breast-fed infant, was shown to strongly adhere to intestinal epithelial cells and has potent anti-inflammatory activity in vitro and in vivo. The genome sequence will provide new insights into the biology of this potential probiotic organism and allow for the characterization of the molecular mechanisms underlying its beneficial properties. PMID:21037011

  20. The complete chloroplast genome sequence of Curcuma flaviflora (Curcuma).

    Science.gov (United States)

    Zhang, Yan; Deng, Jiabin; Li, Yangyi; Gao, Gang; Ding, Chunbang; Zhang, Li; Zhou, Yonghong; Yang, Ruiwu

    2016-09-01

    The complete chloroplast (cp) genome of Curcuma flaviflora, a medicinal plant in Southeast Asia, was sequenced. The genome was 160 478 bp in length, with 36.3% GC content. A pair of inverted repeats (IRs) of 26 946 bp each was separated by a large single-copy (LSC) region of 88 008 bp and a small single-copy (SSC) region of 18 578 bp. The cp genome contained 132 annotated genes, including 79 protein-coding genes, 30 tRNA genes, and four rRNA genes. Nineteen of these genes were duplicated in the inverted repeat regions.

  1. Quantitative measure of randomness and order for complete genomes

    Science.gov (United States)

    Kong, Sing-Guan; Fan, Wen-Lang; Chen, Hong-Da; Wigger, Jan; Torda, Andrew E.; Lee, H. C.

    2009-06-01

    We propose an order index, ϕ, which gives a quantitative measure of randomness and order of complete genomic sequences. It maps genomes to a number from 0 (random and of infinite length) to 1 (fully ordered) and applies regardless of sequence length. The 786 complete genomic sequences in GenBank were found to have ϕ values in a very narrow range, $\phi_g = 0.031^{+0.028}_{-0.015}$. We show this implies that genomes are halfway toward being completely random, or, at the “edge of chaos.” We further show that artificial “genomes” converted from literary classics have ϕ’s that almost exactly coincide with ϕg, but sequences of low information content do not. We infer that ϕg represents a high information-capacity “fixed point” in sequence space, and that genomes are driven to it by the dynamics of a robust growth and evolution process. We show that a growth process characterized by random segmental duplication can robustly drive genomes to the fixed point.

  2. Complete mitochondrial genome of Cynopterus sphinx (Pteropodidae: Cynopterus).

    Science.gov (United States)

    Li, Linmiao; Li, Min; Wu, Zhengjun; Chen, Jinping

    2015-01-01

    We have characterized the complete mitochondrial genome of Cynopterus sphinx (Pteropodidae: Cynopterus) and described its organization in this study. The total length of the C. sphinx complete mitochondrial genome was 16,895 bp, with a base composition of 32.54% A, 14.05% G, 25.82% T and 27.59% C. The complete mitochondrial genome included 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes (12S rRNA and 16S rRNA) and 1 control region (D-loop). The control region was 1435 bp long and contained the sequence CATACG repeated 64 times. Three protein-coding genes (ND1, COI and ND4) ended with an incomplete stop codon, TA or T.
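
    The base composition reported in this abstract lends itself to a quick consistency check: the four percentages should sum to 100%, and the G+C and A+T contents follow by addition. The sketch below is only illustrative; the variable names are hypothetical, and the derived G+C and A+T figures are computed here rather than stated in the abstract.

```python
# Base composition (%) as reported in the abstract above.
composition = {"A": 32.54, "G": 14.05, "T": 25.82, "C": 27.59}

# The four percentages should account for the whole genome.
total_percent = round(sum(composition.values()), 2)

# Derived quantities (not stated in the abstract, computed by simple addition).
gc_content = round(composition["G"] + composition["C"], 2)
at_content = round(composition["A"] + composition["T"], 2)

print(f"total = {total_percent}%, G+C = {gc_content}%, A+T = {at_content}%")
```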

  3. The Complete Sequence of a Human Parainfluenzavirus 4 Genome

    Science.gov (United States)

    Yea, Carmen; Cheung, Rose; Collins, Carol; Adachi, Dena; Nishikawa, John; Tellier, Raymond

    2009-01-01

    Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized. PMID:21994536

  4. The Complete Sequence of a Human Parainfluenzavirus 4 Genome

    Directory of Open Access Journals (Sweden)

    Carmen Yea

    2009-06-01

    Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized.

  5. Complete Genome Sequence of Escherichia coli Strain WG5

    DEFF Research Database (Denmark)

    Imamovic, Lejla; Misiakou, Maria-Anna; van der Helm, Eric

    2018-01-01

    Escherichia coli strain WG5 is a widely used host for phage detection, including somatic coliphages employed as standard ISO method 10705-1 (2000). Here, we present the complete genome sequence of a commercial E. coli WG5 strain.

  6. Normalization of Complete Genome Characteristics: Application to Evolution from Primitive Organisms to Homo sapiens.

    Science.gov (United States)

    Sorimachi, Kenji; Okayasu, Teiji; Ohhira, Shuji

    2015-04-01

    Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism's genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.

  7. Complete mitochondrial genome and phylogeny of Pleistocene mammoth Mammuthus primigenius.

    Directory of Open Access Journals (Sweden)

    Evgeny I Rogaev

    2006-03-01

    Phylogenetic relationships between the extinct woolly mammoth (Mammuthus primigenius), and the Asian (Elephas maximus) and African savanna (Loxodonta africana) elephants remain unresolved. Here, we report the sequence of the complete mitochondrial genome (16,842 base pairs) of a woolly mammoth extracted from permafrost-preserved remains from the Pleistocene epoch--the oldest mitochondrial genome sequence determined to date. We demonstrate that well-preserved mitochondrial genome fragments, as long as approximately 1,600-1,700 base pairs, can be retrieved from pre-Holocene remains of an extinct species. Phylogenetic reconstruction of the Elephantinae clade suggests that M. primigenius and E. maximus are sister species that diverged soon after their common ancestor split from the L. africana lineage. Low nucleotide diversity found between independently determined mitochondrial genomic sequences of woolly mammoths separated geographically and in time suggests that north-eastern Siberia was occupied by a relatively homogeneous population of M. primigenius throughout the late Pleistocene.

  8. Complete genome sequence of Truepera radiovictrix type strain (RQ-24).

    Science.gov (United States)

    Ivanova, Natalia; Rohde, Christine; Munk, Christine; Nolan, Matt; Lucas, Susan; Del Rio, Tijana Glavina; Tice, Hope; Deshpande, Shweta; Cheng, Jan-Fang; Tapia, Roxane; Han, Cliff; Goodwin, Lynne; Pitluck, Sam; Liolios, Konstantinos; Mavromatis, Konstantinos; Mikhailova, Natalia; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jeffries, Cynthia D; Brambilla, Evelyne; Rohde, Manfred; Göker, Markus; Tindall, Brian J; Woyke, Tanja; Bristow, James; Eisen, Jonathan A; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C; Klenk, Hans-Peter; Lapidus, Alla

    2011-02-22

    Truepera radiovictrix Albuquerque et al. 2005 is the type species of the genus Truepera within the phylum "Deinococcus/Thermus". T. radiovictrix is of special interest not only because of its isolated phylogenetic location in the order Deinococcales, but also because of its ability to grow under multiple extreme conditions in alkaline, moderately saline, and high temperature habitats. Of particular interest is the fact that T. radiovictrix is also remarkably resistant to ionizing radiation, a feature it shares with members of the genus Deinococcus. This is the first completed genome sequence of a member of the family Trueperaceae and the fourth type strain genome sequence from a member of the order Deinococcales. The 3,260,398 bp long genome with its 2,994 protein-coding and 52 RNA genes consists of one circular chromosome and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  9. Complete genome sequence of 'Thermobaculum terrenum' type strain (YNP1).

    Science.gov (United States)

    Kiss, Hajnalka; Cleland, David; Lapidus, Alla; Lucas, Susan; Del Rio, Tijana Glavina; Nolan, Matt; Tice, Hope; Han, Cliff; Goodwin, Lynne; Pitluck, Sam; Liolios, Konstantinos; Ivanova, Natalia; Mavromatis, Konstantinos; Ovchinnikova, Galina; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jeffries, Cynthia D; Lu, Megan; Brettin, Thomas; Detter, John C; Göker, Markus; Tindall, Brian J; Beck, Brian; McDermott, Timothy R; Woyke, Tanja; Bristow, James; Eisen, Jonathan A; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C; Klenk, Hans-Peter; Cheng, Jan-Fang

    2010-10-27

    'Thermobaculum terrenum' Botero et al. 2004 is the sole species within the proposed genus 'Thermobaculum'. Strain YNP1(T) is the only cultivated member of an acid tolerant, extremely thermophilic species belonging to a phylogenetically isolated environmental clone group within the phylum Chloroflexi. At present, the name 'Thermobaculum terrenum' is not yet validly published as it contravenes Rule 30 (3a) of the Bacteriological Code. The bacterium was isolated from a slightly acidic extreme thermal soil in Yellowstone National Park, Wyoming (USA). Depending on its final taxonomic allocation, this is likely to be the third completed genome sequence of a member of the class Thermomicrobia and the seventh type strain genome from the phylum Chloroflexi. The 3,101,581 bp long genome with its 2,872 protein-coding and 58 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  10. The complete chloroplast genome of Sinopodophyllum hexandrum (Berberidaceae).

    Science.gov (United States)

    Li, Huie; Guo, Qiqiang

    2016-07-01

    The complete chloroplast (cp) genome of the Sinopodophyllum hexandrum (Berberidaceae) was determined in this study. The circular genome is 157,940 bp in size, and comprises a pair of inverted repeat (IR) regions of 26,077 bp each, a large single-copy (LSC) region of 86,460 bp and a small single-copy (SSC) region of 19,326 bp. The GC content of the whole cp genome was 38.5%. A total of 133 genes were identified, including 88 protein-coding genes, 37 tRNA genes and eight rRNA genes. The whole cp genome consists of 114 unique genes, and 19 genes are duplicated in the IR regions. The phylogenetic analysis revealed that S. hexandrum is closely related to Nandina domestica within the family Berberidaceae.

  11. Complete nucleotide sequence and genome structure of a Japanese isolate of hibiscus latent Fort Pierce virus, a unique tobamovirus that contains an internal poly(A) region in its 3' end.

    Science.gov (United States)

    Yoshida, Tetsuya; Kitazawa, Yugo; Komatsu, Ken; Neriya, Yutaro; Ishikawa, Kazuya; Fujita, Naoko; Hashimoto, Masayoshi; Maejima, Kensaku; Yamaji, Yasuyuki; Namba, Shigetou

    2014-11-01

    In this study, we detected a Japanese isolate of hibiscus latent Fort Pierce virus (HLFPV-J), a member of the genus Tobamovirus, in a hibiscus plant in Japan and determined the complete sequence and organization of its genome. HLFPV-J has four open reading frames (ORFs), each of which shares more than 98 % nucleotide sequence identity with those of other HLFPV isolates. Moreover, HLFPV-J contains a unique internal poly(A) region of variable length, ranging from 44 to 78 nucleotides, in its 3'-untranslated region (UTR), as is the case with hibiscus latent Singapore virus (HLSV), another hibiscus-infecting tobamovirus. The length of the HLFPV-J genome was 6431 nucleotides, including the shortest internal poly(A) region. The sequence identities of ORFs 1, 2, 3 and 4 of HLFPV-J to other tobamoviruses were 46.6-68.7, 49.9-70.8, 31.0-70.8 and 39.4-70.1 %, respectively, at the nucleotide level and 39.8-75.0, 43.6-77.8, 19.2-70.4 and 31.2-74.2 %, respectively, at the amino acid level. The 5'- and 3'-UTRs of HLFPV-J showed 24.3-58.6 and 13.0-79.8 % identity, respectively, to other tobamoviruses. In particular, when compared to other tobamoviruses, each ORF and UTR of HLFPV-J showed the highest sequence identity to those of HLSV. Phylogenetic analysis showed that HLFPV-J, other HLFPV isolates and HLSV constitute a malvaceous-plant-infecting tobamovirus cluster. These results indicate that the genomic structure of HLFPV-J has unique features similar to those of HLSV. To our knowledge, this is the first report of the complete genome sequence of HLFPV.

  12. Complete genome sequence of Marivirga tractuosa type strain (H-43).

    Science.gov (United States)

    Pagani, Ioanna; Chertkov, Olga; Lapidus, Alla; Lucas, Susan; Del Rio, Tijana Glavina; Tice, Hope; Copeland, Alex; Cheng, Jan-Fang; Nolan, Matt; Saunders, Elizabeth; Pitluck, Sam; Held, Brittany; Goodwin, Lynne; Liolios, Konstantinos; Ovchinikova, Galina; Ivanova, Natalia; Mavromatis, Konstantinos; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Jeffries, Cynthia D; Detter, John C; Han, Cliff; Tapia, Roxanne; Ngatchou-Djao, Olivier D; Rohde, Manfred; Göker, Markus; Spring, Stefan; Sikorski, Johannes; Woyke, Tanja; Bristow, Jim; Eisen, Jonathan A; Markowitz, Victor; Hugenholtz, Philip; Klenk, Hans-Peter; Kyrpides, Nikos C

    2011-04-29

    Marivirga tractuosa (Lewin 1969) Nedashkovskaya et al. 2010 is the type species of the genus Marivirga, which belongs to the family Flammeovirgaceae. Members of this genus are of interest because of their gliding motility. The species is of interest because representative strains show resistance to several antibiotics, including gentamicin, kanamycin, neomycin, polymyxin and streptomycin. This is the first complete genome sequence of a member of the family Flammeovirgaceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,511,574 bp long chromosome and the 4,916 bp plasmid with their 3,808 protein-coding and 49 RNA genes are a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  13. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

    Science.gov (United States)

    Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

    2016-01-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326

  14. The complete chloroplast genome sequences of five Epimedium species: lights into phylogenetic and taxonomic analyses

    Directory of Open Access Journals (Sweden)

    Yanjun Zhang

    2016-03-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants.

  15. Complete mitochondrial genome of the fennec fox (Vulpes zerda).

    Science.gov (United States)

    Yang, Xiufeng; Zhao, Chao; Zhang, Honghai; Zhang, Jin; Chen, Lei; Sha, Weilai; Liu, Guangshuai

    2016-01-01

    In this study, the complete mitochondrial genome of the fennec fox (Vulpes zerda) was sequenced using blood samples obtained from a female individual in Shanghai Wildlife Park. Sequence analysis showed that the content of T (26.7%) in the total composition did not exceed that of C (27.2%), which differs from most Canidae individuals sequenced previously.

  16. Complete Genome Sequences of Four Isolates of Plutella xylostella Granulovirus

    OpenAIRE

    Spence, Robert J.; Noune, Christopher; Hauxwell, Caroline

    2016-01-01

    Granuloviruses are widespread pathogens of Plutella xylostella L. (diamondback moth) and potential biopesticides for control of this global insect pest. We report the complete genomes of four Plutella xylostella granulovirus isolates from China, Malaysia, and Taiwan exhibiting pairs of noncoding, homologous repeat regions with significant sequence variation but equivalent length.

  17. Complete Genome Sequences of Four Isolates of Plutella xylostella Granulovirus.

    Science.gov (United States)

    Spence, Robert J; Noune, Christopher; Hauxwell, Caroline

    2016-06-30

    Granuloviruses are widespread pathogens of Plutella xylostella L. (diamondback moth) and potential biopesticides for control of this global insect pest. We report the complete genomes of four Plutella xylostella granulovirus isolates from China, Malaysia, and Taiwan exhibiting pairs of noncoding, homologous repeat regions with significant sequence variation but equivalent length. Copyright © 2016 Spence et al.

  18. Complete Genome Sequence of Plesiomonas shigelloides Type Strain NCTC10360

    Science.gov (United States)

    Fazal, Mohammed-Abbas; Burnett, Edward; Deheer-Graham, Ana; Oliver, Karen; Holroyd, Nancy; Russell, Julie E.

    2016-01-01

    Plesiomonas shigelloides is a Gram-negative rod within the Enterobacteriaceae family. It is a gastrointestinal pathogen of increasing notoriety, often associated with diarrheal disease. P. shigelloides is waterborne, and infection is often linked to the consumption of seafood. Here, we describe the first complete genome for P. shigelloides type strain NCTC10360. PMID:27660796

  19. Complete mitochondrial genome of threatened mahseer Tor tor ...

    Indian Academy of Sciences (India)

    In the present study, complete mitochondrial genome of Tor tor has been ... ative mitogenome analysis shows higher divergence value at ND1 gene than COI gene. Further .... of these genes was 11,408 bp, accounting for 68.8% of the.

  20. Supplementary data: A complete mitochondrial genome of wheat ...

    Indian Academy of Sciences (India)

    Supplementary data: A complete mitochondrial genome of wheat (Triticum aestivum cv. Chinese Yumai), and fast evolving mitochondrial genes in higher plants. Peng Cui, Huitao Liu, Qiang Lin, Feng Ding, Guoyin Zhuo, Songnian Hu, Dongcheng Liu, Wenlong Yang, Kehui Zhan,. Aimin Zhang and Jun Yu. J. Genet.

  1. Complete genome sequences of six measles virus strains

    NARCIS (Netherlands)

    Phan, M.V.T. (My V.T.); C.M.E. Schapendonk (Claudia); B.B. Oude Munnink (Bas B.); M.P.G. Koopmans D.V.M. (Marion); R.L. de Swart (Rik); Cotten, M. (Matthew)

    2018-01-01

    Genetic characterization of wild-type measles virus (MV) strains is a critical component of measles surveillance and molecular epidemiology. We have obtained complete genome sequences of six MV strains belonging to different genotypes, using random-primed next generation sequencing.

  2. The complete mitochondrial genome sequence of Diaphorina citri (Hemiptera: Psyllidae)

    Science.gov (United States)

    The first complete mitochondrial genome (mitogenome) sequence of Asian citrus psyllid, Diaphorina citri (Hemiptera: Psyllidae), from Guangzhou, China is presented. The circular mitogenome is 14,996 bp in length with an A+T content of 74.5%, and contains 13 protein-coding genes (PCGs), 22 tRNA genes ...

  3. Complete Genome Sequence of Enterotoxigenic Escherichia coli Siphophage Seurat.

    Science.gov (United States)

    Doan, Dung P; Lessor, Lauren E; Hernandez, Adriana C; Kuty Everett, Gabriel F

    2015-02-26

    Enterotoxigenic Escherichia coli (ETEC) is one of the leading causes of diarrhea in developing countries. Bacteriophage therapy has the potential to aid in the prevention and treatment of ETEC-related illness. To that end, we present here the complete genome of ETEC siphophage Seurat and describe its major features. Copyright © 2015 Doan et al.

  4. Complete genome sequence of a novel pestivirus from sheep.

    Science.gov (United States)

    Becher, Paul; Schmeiser, Stefanie; Oguzoglu, Tuba Cigdem; Postel, Alexander

    2012-10-01

    We report here the complete genome sequence of pestivirus strain Aydin/04-TR, which is the prototype of a group of similar viruses currently present in sheep and goats in Turkey. Sequence data from this virus showed that it clusters separately from the established and previously proposed tentative pestivirus species.

  5. Complete Genome Sequence of a Novel Pestivirus from Sheep

    OpenAIRE

    Becher, Paul; Schmeiser, Stefanie; Oguzoglu, Tuba Cigdem; Postel, Alexander

    2012-01-01

    We report here the complete genome sequence of pestivirus strain Aydin/04-TR, which is the prototype of a group of similar viruses currently present in sheep and goats in Turkey. Sequence data from this virus showed that it clusters separately from the established and previously proposed tentative pestivirus species.

  6. Using Partial Genomic Fosmid Libraries for Sequencing Complete Organellar Genomes

    Energy Technology Data Exchange (ETDEWEB)

    McNeal, Joel R.; Leebens-Mack, James H.; Arumuganathan, K.; Kuehl, Jennifer V.; Boore, Jeffrey L.; dePamphilis, Claude W.

    2005-08-26

    Organellar genome sequences provide numerous phylogenetic markers and yield insight into organellar function and molecular evolution. These genomes are much smaller in size than their nuclear counterparts; thus, their complete sequencing is much less expensive than total nuclear genome sequencing, making broader phylogenetic sampling feasible. However, for some organisms it is challenging to isolate plastid DNA for sequencing using standard methods. To overcome these difficulties, we constructed partial genomic libraries from total DNA preparations of two heterotrophic and two autotrophic angiosperm species using fosmid vectors. We then used macroarray screening to isolate clones containing large fragments of plastid DNA. A minimum tiling path of clones comprising the entire genome sequence of each plastid was selected, and these clones were shotgun-sequenced and assembled into complete genomes. Although this method worked well for both heterotrophic and autotrophic plants, nuclear genome size had a dramatic effect on the proportion of screened clones containing plastid DNA and, consequently, the overall number of clones that must be screened to ensure full plastid genome coverage. This technique makes it possible to determine complete plastid genome sequences for organisms that defy other available organellar genome sequencing methods, especially those for which limited amounts of tissue are available.

  7. The Complete Chloroplast Genome of Wild Rice (Oryza minuta) and Its Comparison to Related Species.

    Science.gov (United States)

    Asaf, Sajjad; Waqas, Muhammad; Khan, Abdul L; Khan, Muhammad A; Kang, Sang-Mo; Imran, Qari M; Shahzad, Raheem; Bilal, Saqib; Yun, Byung-Wook; Lee, In-Jung

    2017-01-01

    Oryza minuta, a tetraploid wild relative of cultivated rice (family Poaceae), possesses a BBCC genome and contains genes that confer resistance to bacterial blight (BB) and white-backed (WBPH) and brown (BPH) plant hoppers. Based on the importance of this wild species, this study aimed to understand the phylogenetic relationships of O. minuta with other Oryza species through an in-depth analysis of the composition and diversity of the chloroplast (cp) genome. The analysis revealed a cp genome size of 135,094 bp with a typical quadripartite structure consisting of a pair of inverted repeats separated by small and large single copies, 139 representative genes, and 419 randomly distributed microsatellites. The genomic organization, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. Approximately 30 forward, 28 tandem and 20 palindromic repeats were detected in the O. minuta cp genome. Comparison of the complete O. minuta cp genome with those of eleven other Oryza species showed a high degree of sequence similarity and relatively high divergence of intergenic spacers. Phylogenetic analyses based on the complete genome sequence, 65 shared genes and the matK gene showed the same topologies, and O. minuta forms a single clade with parental O. punctata. Thus, the complete O. minuta cp genome provides interesting insights and valuable information that can be used to identify related species and reconstruct its phylogeny.

  8. Informational laws of genome structures

    Science.gov (United States)

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-06-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined.
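
    The choice of k described above can be illustrated with a minimal sketch. It assumes that lg2(n) means the base-2 logarithm, that k is rounded to the nearest integer, and that the entropy of interest is the Shannon entropy of the empirical k-mer distribution. The function names and the toy sequence are hypothetical and are not taken from the study.

```python
import math
from collections import Counter


def kmer_length(n: int) -> int:
    """Return k = log2(n), rounded to the nearest integer (rounding is an assumption)."""
    return max(1, round(math.log2(n)))


def kmer_entropy(genome: str, k: int) -> float:
    """Shannon entropy (in bits) of the empirical distribution of overlapping k-mers."""
    counts = Counter(genome[i:i + k] for i in range(len(genome) - k + 1))
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Toy sequence for illustration only; it is not one of the seventy genomes studied.
toy_genome = "ACGT" * 256           # n = 1024, so k = log2(1024) = 10
k = kmer_length(len(toy_genome))
entropy = kmer_entropy(toy_genome, k)
```

    A strictly periodic sequence such as this one contains only four distinct 10-mers, so its k-mer entropy stays near 2 bits, far below the roughly 10 bits that a sequence with all-distinct 10-mers of the same length would reach.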

  9. Complete plastid genome of Astragalus mongholicus var. nakaianus (Fabaceae).

    Science.gov (United States)

    Choi, In-Su; Kim, Joo-Hwan; Choi, Byoung-Hee

    2016-07-01

    The first complete plastid genome (plastome) of the largest angiosperm genus, Astragalus, was sequenced for the Korean endangered endemic species A. mongholicus var. nakaianus. Its genome is relatively short (123,633 bp) because it lacks an Inverted Repeat (IR) region. It comprises 110 genes, including four unique rRNAs, 30 tRNAs, and 76 protein-coding genes. Similar to other closely related plastomes, rpl22 and rps16 are absent. The putative pseudogene with abnormal stop codons is atpE. This plastome has no additional inversions when compared with highly variable plastomes from IRLC tribes Fabeae and Trifolieae. Our phylogenetic analysis confirms the non-monophyly of Galegeae.

  10. Complete genome sequence of the European sheatfish virus.

    Science.gov (United States)

    Mavian, Carla; López-Bueno, Alberto; Fernández Somalo, María Pilar; Alcamí, Antonio; Alejo, Alí

    2012-06-01

    Viral diseases are an increasing threat to the thriving aquaculture industry worldwide. An emerging group of fish pathogens is formed by several ranaviruses, which have been isolated at different locations from freshwater and seawater fish species since 1985. We report the complete genome sequence of European sheatfish ranavirus (ESV), the first ranavirus isolated in Europe, which causes high mortality rates in infected sheatfish (Silurus glanis) and in other species. Analysis of the genome sequence shows that ESV belongs to the amphibian-like ranaviruses and is closely related to the epizootic hematopoietic necrosis virus (EHNV), a disease agent geographically confined to the Australian continent and notifiable to the World Organization for Animal Health.

  11. Complete mitochondrial genome of yellow meal worm (Tenebrio molitor).

    Science.gov (United States)

    Liu, Li-Na; Wang, Cheng-Ye

    2014-11-18

    The yellow meal worm (Tenebrio molitor L.) is an important resource insect typically used as an animal feed additive. It is also widely used for biological research. The complete mitochondrial genome of T. molitor was determined for the first time by long PCR and conserved primer walking approaches. The results showed that the entire mitogenome of T. molitor was 15,785 bp long, with 72.35% A+T content [deposited in GenBank with accession number KF418153]. The gene order and orientation were the same as the most common type suggested as ancestral for insects. Two protein-coding genes used atypical start codons (CTA in ND2 and AAT in COX1), and the remaining 11 protein-coding genes started with a typical insect initiation codon ATN. All tRNAs showed standard clover-leaf structure, except for tRNA(Ser) (AGN), which lacked a dihydrouridine (DHU) arm. The newly added T. molitor mitogenome could provide information for future studies on yellow meal worm.

  12. The complete mitochondrial genome of the rice moth, Corcyra cephalonica.

    Science.gov (United States)

    Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong

    2012-01-01

    The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecule of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as in other lepidopterans. The nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%), like that of other insects. Twelve protein-coding genes start with a typical ATN codon; the exception is the cox1 gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and nad2, cox1, cox2, and nad4 have a single T as an incomplete stop codon. Twenty-two tRNA genes showed cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, including spacer 1 between the trnQ gene and the nad2 gene, which is common in Lepidoptera. Spacer 3, between trnE and trnF, includes the microsatellite-like repeat regions (AT)18 and (TTAT)3. Spacer 4 (16 bp), between the trnS2 gene and the nad1 gene, has the motif ATACTAT; another species, Sesamia inferens, encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. Spacer 6 is an A+T-rich region that includes the motif ATAGA, a 20-bp poly(T) stretch and two microsatellite elements, (AT)9 and (AT)8.
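
    Microsatellite-like repeats such as (AT)18 and (TTAT)3 can be located with a simple pattern search. The sketch below is one possible approach, not the method used in the study; the function name and the spacer sequence are hypothetical and were built only to contain the two repeat types named above.

```python
import re


def find_microsatellites(seq: str, unit: str, min_repeats: int):
    """Return (start, repeat_count) for each run of `unit` repeated at least `min_repeats` times."""
    pattern = re.compile(f"(?:{re.escape(unit)}){{{min_repeats},}}")
    return [(m.start(), len(m.group()) // len(unit)) for m in pattern.finditer(seq)]


# Hypothetical spacer built for illustration; not the real C. cephalonica sequence.
toy_spacer = "GGC" + "AT" * 18 + "CCG" + "TTAT" * 3 + "GA"

at_runs = find_microsatellites(toy_spacer, "AT", 8)       # runs of (AT)n with n >= 8
ttat_runs = find_microsatellites(toy_spacer, "TTAT", 3)   # runs of (TTAT)n with n >= 3
```

    Each result is a list of zero-based start positions paired with the number of repeat units, so a run written as (AT)18 in the abstract's notation appears here with a repeat count of 18.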

  13. The complete mitochondrial genome of Gossypium hirsutum and evolutionary analysis of higher plant mitochondrial genomes.

    Science.gov (United States)

    Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping

    2013-01-01

    Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant in the world. Sequencing of the cotton mitochondrial (mt) genome could be helpful for research on the evolution of plant mt genomes. We sequenced the Gossypium hirsutum mt genome using 454 technology combined with screening of a fosmid library and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that evolution of mt genomes is consistent with plant taxonomy but independent among different species.

  14. Structural genomics in endocrinology

    NARCIS (Netherlands)

    Smit, J. W.; Romijn, J. A.

    2001-01-01

    Traditionally, endocrine research evolved from the phenotypical characterisation of endocrine disorders to the identification of underlying molecular pathophysiology. This approach has been, and still is, extremely successful. The introduction of genomics and proteomics has resulted in a reversal of

  15. Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: insights into evolutionary changes in fern chloroplast genomes.

    Science.gov (United States)

    Gao, Lei; Yi, Xuan; Yang, Yong-Xia; Su, Ying-Juan; Wang, Ting

    2009-06-11

    Ferns have generally been neglected in studies of chloroplast genomics. Before this study, only one polypod and two basal ferns had their complete chloroplast (cp) genome reported. Tree ferns represent an ancient fern lineage that first occurred in the Late Triassic. In recent phylogenetic analyses, tree ferns were shown to be the sister group of polypods, the most diverse group of living ferns. Availability of cp genome sequence from a tree fern will facilitate interpretation of the evolutionary changes of fern cp genomes. Here we have sequenced the complete cp genome of a scaly tree fern Alsophila spinulosa (Cyatheaceae). The Alsophila cp genome is 156,661 base pairs (bp) in size, and has a typical quadripartite structure with the large (LSC, 86,308 bp) and small single copy (SSC, 21,623 bp) regions separated by two copies of an inverted repeat (IRs, 24,365 bp each). This genome contains 117 different genes encoding 85 proteins, 4 rRNAs and 28 tRNAs. Pseudogenes of ycf66 and trnT-UGU are also detected in this genome. A unique trnR-UCG gene (derived from trnR-CCG) is found between rbcL and accD. The Alsophila cp genome shares some unusual characteristics with the previously sequenced cp genome of the polypod fern Adiantum capillus-veneris, including the absence of 5 tRNA genes that exist in most other cp genomes. The genome shows a high degree of synteny with that of Adiantum, but differs considerably from two basal ferns (Angiopteris evecta and Psilotum nudum). At one endpoint of an ancient inversion we detected a highly repeated 565-bp-region that is absent from the Adiantum cp genome. An additional minor inversion of the trnD-GUC, which is possibly shared by all ferns, was identified by comparison between the fern and other land plant cp genomes. By comparing four fern cp genome sequences it was confirmed that two major rearrangements distinguish higher leptosporangiate ferns from basal fern lineages. The Alsophila cp genome is very similar to that of the

  16. Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: insights into evolutionary changes in fern chloroplast genomes

    Directory of Open Access Journals (Sweden)

    Yang Yong-Xia

    2009-06-01

    Background: Ferns have generally been neglected in studies of chloroplast genomics. Before this study, only one polypod and two basal ferns had their complete chloroplast (cp) genome reported. Tree ferns represent an ancient fern lineage that first occurred in the Late Triassic. In recent phylogenetic analyses, tree ferns were shown to be the sister group of polypods, the most diverse group of living ferns. Availability of cp genome sequence from a tree fern will facilitate interpretation of the evolutionary changes of fern cp genomes. Here we have sequenced the complete cp genome of a scaly tree fern Alsophila spinulosa (Cyatheaceae). Results: The Alsophila cp genome is 156,661 base pairs (bp) in size, and has a typical quadripartite structure with the large (LSC, 86,308 bp) and small single copy (SSC, 21,623 bp) regions separated by two copies of an inverted repeat (IRs, 24,365 bp each). This genome contains 117 different genes encoding 85 proteins, 4 rRNAs and 28 tRNAs. Pseudogenes of ycf66 and trnT-UGU are also detected in this genome. A unique trnR-UCG gene (derived from trnR-CCG) is found between rbcL and accD. The Alsophila cp genome shares some unusual characteristics with the previously sequenced cp genome of the polypod fern Adiantum capillus-veneris, including the absence of 5 tRNA genes that exist in most other cp genomes. The genome shows a high degree of synteny with that of Adiantum, but differs considerably from two basal ferns (Angiopteris evecta and Psilotum nudum). At one endpoint of an ancient inversion we detected a highly repeated 565-bp-region that is absent from the Adiantum cp genome. An additional minor inversion of the trnD-GUC, which is possibly shared by all ferns, was identified by comparison between the fern and other land plant cp genomes. Conclusion: By comparing four fern cp genome sequences it was confirmed that two major rearrangements distinguish higher leptosporangiate ferns from basal fern lineages. The

  17. Complete genome sequence of Oceanithermus profundus type strain (506T)

    Energy Technology Data Exchange (ETDEWEB)

    Pati, Amrita [U.S. Department of Energy, Joint Genome Institute; Zhang, Xiaojing [Los Alamos National Laboratory (LANL); Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute; Tapia, Roxanne [Los Alamos National Laboratory (LANL); Han, Cliff [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute; Pagani, Ioanna [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Chen, Amy [U.S. Department of Energy, Joint Genome Institute; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute; Hauser, Loren John [ORNL; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL); Brambilla, Evelyne-Marie [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Ruhl, Alina [U.S. Department of Energy, Joint Genome Institute; Mwirichia, Romano [University of Munster, Germany; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Tindall, Brian [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Wirth, Reinhard [Universitat Regensburg, Regensburg, Germany; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Bristow, James [U.S. Department of Energy, Joint Genome Institute; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Land, Miriam L [ORNL

    2011-01-01

    Oceanithermus profundus Miroshnichenko et al. 2003 is the type species of the genus Oceanithermus, which belongs to the family Thermaceae. The genus currently comprises two species whose members are thermophilic and are able to reduce sulfur compounds and nitrite. The organism is adapted to the salinity of sea water, is able to utilize a broad range of carbohydrates, some proteinaceous substrates, organic acids and alcohols. This is the first completed genome sequence of a member of the genus Oceanithermus and the fourth sequence from the family Thermaceae. The 2,439,291 bp long genome with its 2,391 protein-coding and 54 RNA genes consists of one chromosome and a 135,351 bp long plasmid, and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  18. The complete chloroplast genome sequence of Dendrobium nobile.

    Science.gov (United States)

    Yan, Wenjin; Niu, Zhitao; Zhu, Shuying; Ye, Meirong; Ding, Xiaoyu

    2016-11-01

    The complete chloroplast (cp) genome sequence of Dendrobium nobile, an endangered traditional Chinese medicinal plant with important economic value, is presented in this article. The total genome size is 150,793 bp, containing a large single copy (LSC) region (84,939 bp) and a small single copy (SSC) region (13,310 bp), which are separated by two inverted repeat (IR) regions (26,272 bp). The overall GC content of the plastid genome was 38.8%. In total, 130 unique genes were annotated, and they consisted of 76 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Fourteen genes contained one or two introns.
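
    The region sizes reported above can be checked against the total genome size with a short calculation. This is a sketch under the assumption that 26,272 bp is the length of each of the two IR copies; the function name is hypothetical.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a circular plastome with one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir


# Region sizes (bp) as reported in the abstract above.
total = quadripartite_length(lsc=84_939, ssc=13_310, ir=26_272)

# Compare with the reported total genome size of 150,793 bp.
matches_reported_size = (total == 150_793)
```

    Under that assumption the four regions add up to the reported genome size, which supports reading 26,272 bp as the per-copy IR length.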

  19. Complete genome sequence of Actinosynnema mirum type strain (101T)

    Energy Technology Data Exchange (ETDEWEB)

    Land, Miriam; Lapidus, Alla; Mayilraj, Shanmugam; Chen, Feng; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Chertkov, Olga; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Rohde, Manfred; Goker, Markus; Pati, Amrita; Ivanova, Natalia; Mavrommatis, Konstantinos; Chen, Amy; Palaniappan, Krishna; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia; Brettin, Thomas; Detter, John C.; Han, Cliff; Chain, Patrick; Tindall, Brian; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2009-05-20

    Actinosynnema mirum Hasegawa et al. 1978 is the type species of the genus, and is of phylogenetic interest because of its central phylogenetic location in the Actinosynnemataceae, a rapidly growing family within the actinobacterial suborder Pseudonocardineae. A. mirum is characterized by its motile spores borne on synnemata and as a producer of nocardicin antibiotics. It is capable of growing aerobically and under a moderate CO2 atmosphere. The strain is a Gram-positive, aerial and substrate mycelium producing bacterium, originally isolated from a grass blade collected from the Raritan River, New Jersey. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of a member of the family Actinosynnemataceae, and only the second sequence from the actinobacterial suborder Pseudonocardineae. The 8,248,144 bp long single replicon genome with its 7100 protein-coding and 77 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  20. [Complete genome sequencing and sequence analysis of BCG Tice].

    Science.gov (United States)

    Wang, Zhiming; Pan, Yuanlong; Wu, Jun; Zhu, Baoli

    2012-10-04

    The objective of this study is to obtain the complete genome sequence of Bacillus Calmette-Guerin Tice (BCG Tice), in order to provide more information about the molecular biology of BCG Tice and to design more reasonable vaccines to prevent tuberculosis. We assembled the data from high-throughput sequencing with SOAPdenovo software, obtaining many contigs and scaffolds. Many sequence gaps and physical gaps remained as a result of regional low coverage and low quality. We designed primers at the ends of contigs and performed PCR amplification in order to link these contigs and scaffolds. We closed all the gaps by performing PCR amplification with various enzymes, adjusting the PCR reaction conditions, and constructing clones for sequencing. We obtained the complete genome sequence of BCG Tice and submitted it to GenBank at the National Center for Biotechnology Information (NCBI). The genome of BCG Tice is 4,334,064 base pairs in length, with a GC content of 65.65%. The problems and strategies during the finishing step of BCG Tice sequencing are described here, with the hope of affording some experience to those who are involved in the finishing step of genome sequencing. The microarray data were verified by our results.

  1. The complete mitochondrial genome sequence of Eimeria magna (Apicomplexa: Coccidia).

    Science.gov (United States)

    Tian, Si-Qin; Cui, Ping; Fang, Su-Fang; Liu, Guo-Hua; Wang, Chun-Ren; Zhu, Xing-Quan

    2015-01-01

    In the present study, we determined the complete mitochondrial DNA (mtDNA) sequence of Eimeria magna from rabbits for the first time, and compared its gene contents and genome organization with those of seven Eimeria spp. from domestic chickens. The size of the complete mt genome sequence of E. magna is 6249 bp, which consists of 3 protein-coding genes (cytb, cox1 and cox3), 12 gene fragments for the large subunit (LSU) rRNA, and 7 gene fragments for the small subunit (SSU) rRNA, without transfer RNA genes, in accordance with that of Eimeria spp. from chickens. The putative direction of translation for three genes (cytb, cox1 and cox3) was the same as those of Eimeria species from domestic chickens. The content of A + T is 65.16% for E. magna mt genome (29.73% A, 35.43% T, 17.09% G and 17.75% C). The E. magna mt genome sequence provides novel mtDNA markers for studying the molecular epidemiology and population genetics of Eimeria spp. and has implications for the molecular diagnosis and control of rabbit coccidiosis.
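
    Base composition figures of this kind follow directly from counting nucleotides; in the abstract above, 29.73% A plus 35.43% T gives the stated 65.16% A + T. The sketch below shows the calculation on a toy sequence. The function name and the sequence are hypothetical, and ambiguous bases are simply ignored, which is an assumption rather than the study's stated procedure.

```python
from collections import Counter


def base_composition(seq: str) -> dict:
    """Percentage of each of A, C, G, T among the unambiguous bases in `seq`."""
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


# Toy sequence for illustration only; it is not the E. magna mitogenome.
toy_seq = "ATTATTGCAT"
comp = base_composition(toy_seq)
at_content = comp["A"] + comp["T"]   # A+T content as a percentage
```

    For the ten-base toy sequence, which has three A and five T, the A+T content is 80%.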

  2. Complete genome sequence of Desulfomicrobium baculatum type strain (XT)

    Energy Technology Data Exchange (ETDEWEB)

    Copeland, Alex; Spring, Stefan; Goker, Markus; Schneider, Susanne; Lapidus, Alla; Glavina Del Rio, Tijana; Tice, Hope; Cheng, Jan-Fang; Lucas, Susan; Chen, Feng; Nolan, Matt; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ivanova, Natalia; Mavrommatis, Konstantinos; Ovchinnikova, Galina; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia C; Meincke, Linda; Sims, David; Brettin, Thomas; Detter, John C; Han, Cliff; Chain, Patrick; Bristow, James; Eisen, Jonathan; Markowitz, Victor; Hugenholtz, Philip; Klenk, Hans-Peter; Kyrpides, Nikos C; Lucas, Susan

    2009-05-20

    Desulfomicrobium baculatum is the type species of the genus Desulfomicrobium, which is the type genus of the family Desulfomicrobiaceae. It is of phylogenetic interest because of the isolated location of the family Desulfomicrobiaceae within the order Desulfovibrionales. D. baculatum strain XT is a Gram-negative, motile, sulfate-reducing bacterium isolated from water-saturated manganese carbonate ore. It is strictly anaerobic and does not require NaCl for growth, although NaCl concentrations up to 6% (w/v) are tolerated. The metabolism is respiratory or fermentative. In the presence of sulfate, pyruvate and lactate are incompletely oxidized to acetate and CO2. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first completed genome sequence of a member of the deltaproteobacterial family Desulfomicrobiaceae, and this 3,942,657 bp long single replicon genome with its 3494 protein-coding and 72 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  3. The complete chloroplast genome sequence of Dodonaea viscosa: comparative and phylogenetic analyses.

    Science.gov (United States)

    Saina, Josphat K; Gichira, Andrew W; Li, Zhi-Zhong; Hu, Guang-Wan; Wang, Qing-Feng; Liao, Kuo

    2018-02-01

    The plant chloroplast (cp) genome is a highly conserved structure, which makes it useful for evolutionary and systematic research. Numerous complete cp genome sequences have been reported thanks to high-throughput sequencing technology. However, no complete chloroplast genome had previously been reported for the genus Dodonaea. To better understand the molecular basis of the Dodonaea viscosa chloroplast, we used Illumina sequencing technology to sequence its complete genome. The whole length of the cp genome is 159,375 base pairs (bp), with a pair of inverted repeats (IRs) of 27,099 bp separated by a large single copy (LSC) region of 87,204 bp and a small single copy (SSC) region of 17,972 bp. The annotation analysis revealed a total of 115 unique genes, of which 81 were protein-coding, 30 tRNA, and four ribosomal RNA genes. Comparative genome analysis with other closely related Sapindaceae members showed conserved gene order in the inverted and single copy regions. Phylogenetic analysis clustered D. viscosa with other species of Sapindaceae with strong bootstrap support. Finally, a total of 249 SSRs were detected. Moreover, a comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates in D. viscosa showed very low values. The availability of the cp genome reported here provides a valuable genetic resource for further comprehensive studies of genetic variation, taxonomy and phylogenetic evolution in the Sapindaceae family. In addition, the SSR markers detected will be used in further phylogeographic and population structure studies of species in this genus.

  4. Complete genome sequence of Marivirga tractuosa type strain (H-43).

    OpenAIRE

    Pagani, Ioanna; Chertkov, Olga; Lapidus, Alla; Lucas, Susan; Del Rio, Tijana Glavina; Tice, Hope; Copeland, Alex; Cheng, Jan-Fang; Nolan, Matt; Saunders, Elizabeth; Pitluck, Sam; Held, Brittany; Goodwin, Lynne; Liolios, Konstantinos; Ovchinikova, Galina

    2011-01-01

    Marivirga tractuosa (Lewin 1969) Nedashkovskaya et al. 2010 is the type species of the genus Marivirga, which belongs to the family Flammeovirgaceae. Members of this genus are of interest because of their gliding motility. The species is of interest because representative strains show resistance to several antibiotics, including gentamicin, kanamycin, neomycin, polymyxin and streptomycin. This is the first complete genome sequence of a member of the family Flammeovirgaceae. Here we describe t...

  5. The complete mitochondrial genome sequence of the maned wolf (Chrysocyon brachyurus).

    Science.gov (United States)

    Zhao, Chao; Yang, Xiufeng; Zhang, Honghai; Zhang, Jin; Chen, Lei; Sha, Weilai; Liu, Guangshuai

    2016-01-01

    In this study, the complete mitochondrial genome of the maned wolf (Chrysocyon brachyurus), the only species in the genus Chrysocyon, was sequenced and reported for the first time using blood samples obtained from a female individual in Shanghai Zoo, China. Sequence analysis showed that the genome structure was in accordance with that of other Canidae species and that it contained a 12S rRNA gene, a 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region.

  6. Chlorobium Tepidum: Insights into the Structure, Physiology, and Metabolism of a Green Sulfur Bacterium Derived from the Complete Genome Sequence

    DEFF Research Database (Denmark)

    Frigaard, Niels-Ulrik; Chew, Aline Gomez Maqueo; Li, Hui

    2003-01-01

    Green sulfur bacteria are obligate, anaerobic photolithoautotrophs that synthesize unique bacteriochlorophylls (BChls) and a unique light-harvesting antenna structure, the chlorosome. One organism, Chlorobium tepidum, has emerged as a model for this group of bacteria primarily due to its relative...

  7. The complete chloroplast genome sequence of Dendrobium officinale.

    Science.gov (United States)

    Yang, Pei; Zhou, Hong; Qian, Jun; Xu, Haibin; Shao, Qingsong; Li, Yonghua; Yao, Hui

    2016-01-01

    The complete chloroplast sequence of Dendrobium officinale, an endangered and economically important traditional Chinese medicine, was reported and characterized. The genome size is 152,018 bp, with 37.5% GC content. A pair of inverted repeats (IRs) of 26,284 bp are separated by a large single-copy region (LSC, 84,944 bp) and a small single-copy region (SSC, 14,506 bp). The complete cp DNA contains 83 protein-coding genes, 39 tRNA genes and 8 rRNA genes. Fourteen genes contained one or two introns.
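
    The region lengths reported in this abstract can be cross-checked with simple arithmetic: a circular plastome with one LSC, one SSC and two IR copies should measure LSC + SSC + 2 x IR. The sketch below is illustrative only; the function name `quadripartite_length` is an assumption made for this example and does not come from the cited study.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total plastome length implied by one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir


# Region lengths (bp) as reported in the Dendrobium officinale abstract.
reported_total = 152_018
implied_total = quadripartite_length(lsc=84_944, ssc=14_506, ir=26_284)

# Prints whether the reported genome size matches the sum of its regions.
print(implied_total == reported_total)
```

    The same check can be applied to any other record in this listing that reports LSC, SSC and IR lengths; a mismatch would suggest a rounding or transcription difference rather than necessarily an error in the underlying sequence.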

  8. Functional Coverage of the Human Genome by Existing Structures, Structural Genomics Targets, and Homology Models.

    Directory of Open Access Journals (Sweden)

    2005-08-01

    The bias in protein structure and function space resulting from experimental limitations and targeting of particular functional classes of proteins by structural biologists has long been recognized, but never continuously quantified. Using the Enzyme Commission and the Gene Ontology classifications as a reference frame, and integrating structure data from the Protein Data Bank (PDB), target sequences from the structural genomics projects, structure homology derived from the SUPERFAMILY database, and genome annotations from Ensembl and NCBI, we provide a quantified view, both at the domain and whole-protein levels, of the current and projected coverage of protein structure and function space relative to the human genome. Protein structures currently provide at least one domain that covers 37% of the functional classes identified in the genome; whole structure coverage exists for 25% of the genome. If all the structural genomics targets were solved (twice the current number of structures in the PDB), it is estimated that structures of one domain would cover 69% of the functional classes identified and complete structure coverage would be 44%. Homology models from existing experimental structures extend the 37% coverage to 56% of the genome as single domains and 25% to 31% for complete structures. Coverage from homology models is not evenly distributed by protein family, reflecting differing degrees of sequence and structure divergence within families. While these data provide coverage, conversely, they also systematically highlight functional classes of proteins for which structures should be determined. Current key functional families without structure representation are highlighted here; updated information on the "most wanted list" that should be solved is available on a weekly basis from http://function.rcsb.org:8080/pdb/function_distribution/index.html.

  9. Complete Genome Sequence of Rothia mucilaginosa DY-18: A Clinical Isolate with Dense Meshwork-Like Structures from a Persistent Apical Periodontitis Lesion

    Science.gov (United States)

    2010-09-25

    dermatitis associated with Rothia mucilaginosa bacteremia: a case report,” American Journal of Dermatopathology, vol. 32, no. 2, pp. 175–179, 2010. [5] P...root-filled teeth with chronic apical periodontitis,” International Endodontic Journal, vol. 34, no. 6, pp. 429–434, 2001. [12] L. C. de Paz...of Rothia mucilaginosa DY-18: A Clinical Isolate with Dense Meshwork-Like Structures from a Persistent Apical Periodontitis Lesion Kazuyoshi Yamane,1

  10. Complete genome sequence of Halanaerobium praevalens type strain (GSLT)

    Energy Technology Data Exchange (ETDEWEB)

    Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Chertkov, Olga [Los Alamos National Laboratory (LANL)]; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Hammon, Nancy [U.S. Department of Energy, Joint Genome Institute]; Deshpande, Shweta [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Huntemann, Marcel [U.S. Department of Energy, Joint Genome Institute]; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute]; Pagani, Ioanna [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Ovchinnikova, Galina [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [U.S. Department of Energy, Joint Genome Institute]; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Brambilla, Evelyne-Marie [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Kannan, K. Palani [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Tindall, Brian [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute]; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute]; Bristow, James [U.S. Department of Energy, Joint Genome Institute]; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute]; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]

    2011-01-01

    Halanaerobium praevalens Zeikus et al. 1984 is the type species of the genus Halanaerobium, which in turn is the type genus of the family Halanaerobiaceae. The species is of interest because it is able to reduce a variety of nitro-substituted aromatic compounds at a high rate, and because of its ability to degrade organic pollutants. The strain is also of interest because it functions as a hydrolytic bacterium, fermenting complex organic matter and producing intermediary metabolites for other trophic groups such as sulfate-reducing and methanogenic bacteria. It is further reported as being involved in carbon removal in the Great Salt Lake, its source of isolation. This is the first completed genome sequence of a representative of the genus Halanaerobium and the second genome sequence from a type strain of the family Halanaerobiaceae. The 2,309,262 bp long genome with its 2,110 protein-coding and 70 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  11. Genomic treasure troves: complete genome sequencing of herbarium and insect museum specimens.

    Science.gov (United States)

    Staats, Martijn; Erkens, Roy H J; van de Vossenberg, Bart; Wieringa, Jan J; Kraaijeveld, Ken; Stielow, Benjamin; Geml, József; Richardson, James E; Bakker, Freek T

    2013-01-01

    Unlocking the vast genomic diversity stored in natural history collections would create unprecedented opportunities for genome-scale evolutionary, phylogenetic, domestication and population genomic studies. Many researchers have been discouraged from using historical specimens in molecular studies because of both generally limited success of DNA extraction and the challenges associated with PCR-amplifying highly degraded DNA. In today's next-generation sequencing (NGS) world, opportunities and prospects for historical DNA have changed dramatically, as most NGS methods are actually designed for taking short fragmented DNA molecules as templates. Here we show that using a standard multiplex and paired-end Illumina sequencing approach, genome-scale sequence data can be generated reliably from dry-preserved plant, fungal and insect specimens collected up to 115 years ago, and with minimal destructive sampling. Using a reference-based assembly approach, we were able to produce the entire nuclear genome of a 43-year-old Arabidopsis thaliana (Brassicaceae) herbarium specimen with high and uniform sequence coverage. Nuclear genome sequences of three fungal specimens of 22-82 years of age (Agaricus bisporus, Laccaria bicolor, Pleurotus ostreatus) were generated with 81.4-97.9% exome coverage. Complete organellar genome sequences were assembled for all specimens. Using de novo assembly we retrieved between 16.2-71.0% of coding sequence regions, and hence remain somewhat cautious about prospects for de novo genome assembly from historical specimens. Non-target sequence contaminations were observed in 2 of our insect museum specimens. We anticipate that future museum genomics projects will perhaps not generate entire genome sequences in all cases (our specimens contained relatively small and low-complexity genomes), but at least generating vital comparative genomic data for testing (phylo)genetic, demographic and genetic hypotheses, that become increasingly more horizontal

  12. The complete chloroplast genomes of two Wisteria species, W. floribunda and W. sinensis (Fabaceae).

    Science.gov (United States)

    Kim, Na-Rae; Kim, Kyunghee; Lee, Sang-Choon; Lee, Jung-Hoon; Cho, Seong-Hyun; Yu, Yeisoo; Kim, Young-Dong; Yang, Tae-Jin

    2016-11-01

    Wisteria floribunda and Wisteria sinensis are ornamental woody vines in the Fabaceae. The complete chloroplast genome sequences of the two species were generated by de novo assembly using whole-genome next-generation sequences. The chloroplast genomes of W. floribunda and W. sinensis were 130,960 bp and 130,561 bp long, respectively, and lacked an inverted repeat (IR), as reported for the inverted repeat-lacking clade (IRLC) in the Fabaceae. The chloroplast genomes of both species contained the same number of protein-coding sequences (77), tRNA genes (30), and rRNA genes (4). The phylogenetic analysis with the reported chloroplast genomes confirmed the close taxonomic relationship of W. floribunda and W. sinensis.

  13. The complete mitochondrial genome of Ambastaia sidthimunki (Cypriniformes: Cobitidae).

    Science.gov (United States)

    Yu, Peng; Wei, Min; Yang, Qichao; Yang, Yingming; Wan, Quan

    2016-09-01

    Ambastaia sidthimunki is a small ornamental fish categorized as Endangered B2ab (iii,v) in the IUCN Red List. In this study, we report the complete mitochondrial genome of A. sidthimunki. The mitochondrial genome is a circular molecule 16,574 bp in length, containing 2 ribosomal RNA genes, 22 transfer RNA genes, 13 protein-coding genes, an L-strand replication origin (OL) and a control region (D-loop). The nucleotide composition of the entire mitogenome was 26.94% C, 15.55% G, 31.84% A and 25.67% T, with an AT content of 57.51%. This research contributes new molecular data for the conservation of this Endangered species.
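
    The base composition figures in this abstract are internally consistent, which can be confirmed with a short calculation. The following is a minimal sketch; the dictionary layout and the helper name `at_content` are assumptions made for illustration, and results are rounded to two decimals to avoid floating-point noise.

```python
# Base composition (%) as reported in the A. sidthimunki abstract.
composition = {"A": 31.84, "C": 26.94, "G": 15.55, "T": 25.67}


def at_content(comp: dict) -> float:
    """Return the combined A + T percentage, rounded to two decimals."""
    return round(comp["A"] + comp["T"], 2)


# The four percentages should account for the whole genome.
total = round(sum(composition.values()), 2)

print(at_content(composition), total)
```

    The same approach applies to the other mitogenome records in this listing that report all four base percentages.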

  14. The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).

    Science.gov (United States)

    Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu

    2017-05-01

    The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined based on next-generation sequencing technologies in this study. The genome was 157,203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25,960 bp, which were separated by a large single-copy (LSC) region of 87,065 bp and a small single-copy (SSC) region of 18,218 bp. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. Of these genes, eight harbored a single intron, and two (ycf3 and clpP) contained two introns. The AT content of S. hexandrum cpDNA is 61.5%.

  15. Complete mitochondrial genome of sublittoral macroalga Rhodymenia pseudopalmata (Rhodymeniales, Rhodophyta).

    Science.gov (United States)

    Kim, Kyeong Mi; Yang, Eun Chan; Yi, Gangman; Yoon, Hwan Su

    2014-08-01

    We sequenced and characterized the first complete mitochondrial genome of the sublittoral red alga Rhodymenia pseudopalmata (Rhodymeniales, Rhodophyta). The mitogenome is 26,166 bp in length with 29.5% GC content. The circular mitogenome contains 47 genes: 24 protein-coding, 2 rRNA and 21 tRNA genes, the latter including two copies each of trnG, trnL, trnM and trnS. There are two cases of gene overlap, between sdhD and nad4, and between secY and rps12. The R. pseudopalmata mitochondrial genome differs from that of Gracilariopsis lemaneiformis by three missing genes (orf60, rpl20 and trnH).

  16. The complete chloroplast genome sequence of Euonymus japonicus (Celastraceae).

    Science.gov (United States)

    Choi, Kyoung Su; Park, SeonJoo

    2016-09-01

    The complete chloroplast (cp) genome sequence of Euonymus japonicus, the first sequenced in the genus Euonymus, is reported in this study. The total length was 157,637 bp, containing a pair of 26,678 bp inverted repeat (IR) regions, which were separated by a small single copy (SSC) region and a large single copy (LSC) region of 18,340 bp and 85,941 bp, respectively. This genome contains 107 unique genes, including 74 coding genes, four rRNA genes, and 29 tRNA genes. Seventeen E. japonicus genes contain introns, of which three (clpP, ycf3, and rps12) include two introns. The maximum likelihood (ML) phylogenetic analysis revealed that E. japonicus was closely related to Manihot and Populus.

  17. The complete mitochondrial genome of the Border Collie dog.

    Science.gov (United States)

    Wu, An-Quan; Zhang, Yong-Liang; Li, Li-Li; Chen, Long; Yang, Tong-Wen

    2016-01-01

    The Border Collie is a well-known dog breed. In the present work we report the complete mitochondrial genome sequence of the Border Collie for the first time. The total length of the mitogenome was 16,730 bp, with a base composition of 31.6% A, 28.7% T, 25.5% C, and 14.2% G, and an A-T-rich (60.3%) feature was detected. It harbored 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and one non-coding control region (D-loop region). The arrangement of all genes was identical to that of typical dog mitochondrial genomes.

  18. Complete chloroplast genome sequence of a major economic species, Ziziphus jujuba (Rhamnaceae).

    Science.gov (United States)

    Ma, Qiuyue; Li, Shuxian; Bi, Changwei; Hao, Zhaodong; Sun, Congrui; Ye, Ning

    2017-02-01

    Ziziphus jujuba is an important woody plant with high economic and medicinal value. Here, we analyzed and characterized the complete chloroplast (cp) genome of Z. jujuba, the first member of the Rhamnaceae family for which the chloroplast genome sequence has been reported. We also built a web browser for navigating the cp genome of Z. jujuba ( http://bio.njfu.edu.cn/gb2/gbrowse/Ziziphus_jujuba_cp/ ). Sequence analysis showed that this cp genome is 161,466 bp long and has a typical quadripartite structure of large (LSC, 89,120 bp) and small (SSC, 19,348 bp) single-copy regions separated by a pair of inverted repeats (IRs, 26,499 bp). The sequence contained 112 unique genes, including 78 protein-coding genes, 30 transfer RNAs, and four ribosomal RNAs. The genome structure, gene order, GC content, and codon usage are similar to those of other typical angiosperm cp genomes. A total of 38 tandem repeats, two forward repeats, and three palindromic repeats were detected in the Z. jujuba cp genome. Simple sequence repeat (SSR) analysis revealed that most SSRs were AT-rich. The homopolymer regions in the cp genome of Z. jujuba were verified and manually corrected by Sanger sequencing. One-third of mononucleotide repeats were found to be erroneously sequenced by 454 pyrosequencing, which yielded sequences 1-4 bases shorter than those obtained by Sanger sequencing. Analyzing the cp genome of Z. jujuba revealed that the IR contraction and expansion events resulted in ycf1 and rps19 pseudogenes. A phylogenetic analysis based on 64 protein-coding genes showed that Z. jujuba was closely related to members of the Elaeagnaceae family, which will be helpful for phylogenetic studies of other Rosales species. The complete cp genome sequence of Z. jujuba will facilitate population, phylogenetic, and cp genetic engineering studies of this economic plant.

  19. Structural Completeness in The War is Over

    DEFF Research Database (Denmark)

    Nielsen, Jakob Isak

    2010-01-01

    This article argues that Nina Mimica's The War is Over achieves structural completeness on the basis of a number of choices regarding its visual style: shot scale, shot length, editing style and camera movement. Publication date: Autumn...

  20. Recombination analysis based on the complete genome of bocavirus

    Directory of Open Access Journals (Sweden)

    Chen Shengxia

    2011-04-01

    Bocaviruses include bovine parvovirus, minute virus of canines, porcine bocavirus, gorilla bocavirus, and human bocaviruses 1-4 (HBoVs). Although recent reports showed that recombination occurs in bocaviruses, no systematic study has investigated it. The present study performed phylogenetic and recombination analyses of bocaviruses over the complete genomes available in GenBank. Results confirmed that recombination exists among bocaviruses, including likely inter-genotype recombination between HBoV1 and HBoV4, and intra-genotype recombination among HBoV2 variants. Moreover, this is the first report revealing recombination between minute viruses of canines.

  1. The complete mitochondrial genome of Chrysopa pallens (Insecta, Neuroptera, Chrysopidae).

    Science.gov (United States)

    He, Kun; Chen, Zhe; Yu, Dan-Na; Zhang, Jia-Yong

    2012-10-01

    The complete mitochondrial genome of Chrysopa pallens (Neuroptera, Chrysopidae) was sequenced. It consists of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA (rRNA) genes, and a control region (AT-rich region). The total length of the C. pallens mitogenome is 16,723 bp with 79.5% AT content, and the length of the control region is 1905 bp with 89.1% AT content. The non-coding regions of C. pallens include the control region between the 12S rRNA and trnI genes, and a 75-bp spacer region between the trnI and trnQ genes.

  2. Complete Plastid Genome Sequencing of Four Tilia Species (Malvaceae): A Comparative Analysis and Phylogenetic Implications.

    Directory of Open Access Journals (Sweden)

    Jie Cai

    Tilia is an ecologically and economically important genus in the family Malvaceae. However, no complete plastid genome of Tilia has been sequenced to date, and the taxonomy of Tilia is difficult owing to frequent hybridization and polyploidization. Well-supported interspecific relationships for this genus are not available because the commonly used molecular markers provide few informative sites. We report here the complete plastid genome sequences of four Tilia species determined by Illumina technology. The Tilia plastid genome is 162,653 bp to 162,796 bp in length, encoding 113 unique genes and a total of 130 genes. The gene order and organization of the Tilia plastid genome exhibit the general structure of angiosperms and are very similar to other published plastid genomes of Malvaceae. As in other long-lived tree genera, the sequence divergence among the four Tilia plastid genomes is very low. We also analyzed the nucleotide substitution patterns and the evolution of insertions and deletions in the Tilia plastid genomes. Finally, we built a phylogeny of the four sampled Tilia species with high support using plastid phylogenomics, suggesting that it is an efficient way to resolve the phylogenetic relationships of this genus.

  3. Complete sequencing of five Araliaceae chloroplast genomes and the phylogenetic implications.

    Directory of Open Access Journals (Sweden)

    Rong Li

    BACKGROUND: The ginseng family (Araliaceae) includes a number of economically important plant species. Previous phylogenetic studies circumscribed three major clades within the core ginseng plant family, yet the internal relationships of each major group have been poorly resolved, perhaps due to rapid radiation of these lineages. Recent studies have shown that phylogenomics based on chloroplast genomes provides a viable way to resolve complex relationships. METHODOLOGY/PRINCIPAL FINDINGS: We report the complete nucleotide sequences of five Araliaceae chloroplast genomes using next-generation sequencing technology. The five chloroplast genomes are 156,333-156,459 bp in length including a pair of inverted repeats (25,551-26,108 bp) separated by the large single-copy (86,028-86,566 bp) and small single-copy (18,021-19,117 bp) regions. Each chloroplast genome contains the same 114 unique genes consisting of 30 transfer RNA genes, four ribosomal RNA genes, and 80 protein coding genes. Gene size, content, and order, AT content, and IR/SC boundary structure are similar among all Araliaceae chloroplast genomes. A total of 140 repeats were identified in the five chloroplast genomes with palindromic repeat as the most common type. Phylogenomic analyses using parsimony, likelihood, and Bayesian inference based on the complete chloroplast genomes strongly supported the monophyly of the Asian Palmate group and the Aralia-Panax group. Furthermore, the relationships among the sampled taxa within the Asian Palmate group were well resolved. Twenty-six DNA markers with the percentage of variable sites higher than 5% were identified, which may be useful for phylogenetic studies of Araliaceae. CONCLUSION: The chloroplast genomes of Araliaceae are highly conserved in all aspects of genome features. The large-scale phylogenomic data based on the complete chloroplast DNA sequences is shown to be effective for the phylogenetic reconstruction of Araliaceae.

  4. First Complete Genome Sequence of Pepper vein yellows virus from Australia

    Science.gov (United States)

    Maina, Solomon; Edwards, Owain R.

    2016-01-01

    We present here the first complete genomic RNA sequence of the polerovirus Pepper vein yellows virus (PeVYV) obtained from a pepper plant in Australia. We compare it with complete PeVYV genomes from Japan and China. The Australian genome was more closely related to the Japanese than the Chinese genome. PMID:27231375

  5. Complete genome sequence of an attenuated Sparfloxacin-resistant Streptococcus agalactiae strain 138spar

    Science.gov (United States)

    The complete genome of a sparfloxacin-resistant Streptococcus agalactiae vaccine strain 138spar is 1,838,126 bp in size. The genome has 1892 coding sequences and 82 RNAs. The annotation of the genome is added by the NCBI Prokaryotic Genome Annotation Pipeline. The publishing of this genome will allo...

  6. The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng

    Directory of Open Access Journals (Sweden)

    Jinhui Chen

    2015-06-01

    Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around ten species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be greatly helpful for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution.

  7. The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng.

    Science.gov (United States)

    Chen, Jinhui; Hao, Zhaodong; Xu, Haibin; Yang, Liming; Liu, Guangxin; Sheng, Yu; Zheng, Chen; Zheng, Weiwei; Cheng, Tielong; Shi, Jisen

    2015-01-01

    Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around 10 species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats, and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be greatly helpful for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution.

  8. The complete chloroplast genome of banana (Musa acuminata, Zingiberales): insight into plastid monocotyledon evolution.

    Science.gov (United States)

    Martin, Guillaume; Baurens, Franc-Christophe; Cardi, Céline; Aury, Jean-Marc; D'Hont, Angélique

    2013-01-01

    Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas.
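
    To put the IR expansion described in this abstract into perspective, one can compare the share of the plastome occupied by the two IR copies across records in this listing. The sketch below is illustrative only: the helper `ir_fraction` and the choice of comparison species are assumptions made for this example, and the lengths are taken from the abstracts as reported.

```python
def ir_fraction(total_bp: int, ir_bp: int) -> float:
    """Fraction of a plastome occupied by its two inverted-repeat copies."""
    return 2 * ir_bp / total_bp


# (total length, single IR length) in bp, taken from abstracts in this listing.
plastomes = {
    "Musa acuminata": (169_972, 35_433),
    "Ziziphus jujuba": (161_466, 26_499),
    "Dendrobium officinale": (152_018, 26_284),
}

# Print each species with its IR share formatted as a percentage.
for species, (total_bp, ir_bp) in plastomes.items():
    print(f"{species}: {ir_fraction(total_bp, ir_bp):.1%}")
```

    A larger fraction for Musa than for the other two species is what the abstract's description of an expanded IR would lead one to expect; the comparison is descriptive and is not a substitute for the boundary analysis reported by the authors.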

  9. The complete chloroplast genome of banana (Musa acuminata, Zingiberales): insight into plastid monocotyledon evolution.

    Directory of Open Access Journals (Sweden)

    Guillaume Martin

    Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas.

  10. The complete genome sequence of the Atlantic salmon paramyxovirus (ASPV)

    International Nuclear Information System (INIS)

    Nylund, Stian; Karlsen, Marius; Nylund, Are

    2008-01-01

    The complete RNA genome of the Atlantic salmon paramyxovirus (ASPV), isolated from Atlantic salmon suffering from proliferative gill inflammation (PGI), has been determined. The genome is 16,965 nucleotides in length and consists of six nonoverlapping genes in the order 3'- N - P/C/V - M - F - HN - L -5', coding for the nucleocapsid, phospho-, matrix, fusion, hemagglutinin-neuraminidase and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and trinucleotide intergenic regions similar to those of other Paramyxoviridae. The ASPV P-gene expression strategy is like that of the respiro- and morbilliviruses, which express the phosphoprotein from the primary transcript, and edit a portion of the mRNA to encode the accessory proteins V and W. It also encodes the C-protein by ribosomal choice of translation initiation. Pairwise comparisons of amino acid identities, and phylogenetic analysis of deduced ASPV protein sequences with homologous sequences from other Paramyxoviridae, show that ASPV has an affinity for the genus Respirovirus, but may represent a new genus within the subfamily Paramyxovirinae

  11. Complete nucleotide sequences of avian metapneumovirus subtype B genome.

    Science.gov (United States)

    Sugiyama, Miki; Ito, Hiroshi; Hata, Yusuke; Ono, Eriko; Ito, Toshihiro

    2010-12-01

    Complete nucleotide sequences were determined for subtype B avian metapneumovirus (aMPV), the attenuated vaccine strain VCO3/50 and its parental pathogenic strain VCO3/60616. The genomes of both strains comprised 13,508 nucleotides (nt), with a 42-nt leader at the 3'-end and a 46-nt trailer at the 5'-end. The genome contains eight genes in the order 3'-N-P-M-F-M2-SH-G-L-5', which is the same order shown in the other metapneumoviruses. The genes are flanked on either side by conserved transcriptional start and stop signals and have intergenic sequences varying in length from 1 to 88 nt. Comparison of nt and predicted amino acid (aa) sequences of VCO3/60616 with those of other metapneumoviruses revealed higher homology with aMPV subtype A virus than with other metapneumoviruses. A total of 18 nt and 10 deduced aa differences were seen between the strains, and one or a combination of several differences could be associated with attenuation of VCO3/50.

  12. The complete chloroplast genome sequence of Dianthus superbus var. longicalycinus.

    Science.gov (United States)

    Gurusamy, Raman; Lee, Do-Hyung; Park, SeonJoo

    2016-05-01

    The complete chloroplast genome (cpDNA) sequence of Dianthus superbus var. longicalycinus, an economically important traditional Chinese medicine, was reported and characterized. The cpDNA of Dianthus superbus var. longicalycinus is 149,539 bp, with 36.3% GC content. A pair of inverted repeats (IRs) of 24,803 bp is separated by a large single-copy region (LSC, 82,805 bp) and a small single-copy region (SSC, 17,128 bp). It encodes 85 protein-coding genes, 36 tRNA genes and 8 rRNA genes. Of the 129 individual genes, 13 contain one intron and three contain two introns.

  13. The complete plastid genomes of the two 'dinotoms' Durinskia baltica and Kryptoperidinium foliaceum.

    Directory of Open Access Journals (Sweden)

    Behzad Imanian

    2010-05-01

    In one small group of dinoflagellates, photosynthesis is carried out by a tertiary endosymbiont derived from a diatom, giving rise to a complex cell that we collectively refer to as a 'dinotom'. The endosymbiont is separated from its host by a single membrane and retains plastids, mitochondria, a large nucleus, and many other eukaryotic organelles and structures, a level of complexity suggesting an early stage of integration. Although the evolution of these endosymbionts has attracted considerable interest, the plastid genome has not been examined in detail, and indeed no tertiary plastid genome has yet been sequenced. Here we describe the complete plastid genomes of two closely related dinotoms, Durinskia baltica and Kryptoperidinium foliaceum. The D. baltica (116470 bp) and K. foliaceum (140426 bp) plastid genomes map as circular molecules featuring two large inverted repeats that separate distinct single copy regions. The organization and gene content of the D. baltica plastid closely resemble those of the pennate diatom Phaeodactylum tricornutum. The K. foliaceum plastid genome is much larger, has undergone more reorganization, and encodes a putative tyrosine recombinase (tyrC) also found in the plastid genome of the heterokont Heterosigma akashiwo, and two putative serine recombinases (serC1 and serC2) homologous to recombinases encoded by plasmids pCf1 and pCf2 in another pennate diatom, Cylindrotheca fusiformis. The K. foliaceum plastid genome also contains an additional copy of serC1, two degenerate copies of another plasmid-encoded ORF, and two non-coding regions whose sequences closely resemble portions of the pCf1 and pCf2 plasmids. These results suggest that while the plastid genomes of two dinotoms share very similar gene content and genome organization with that of the free-living pennate diatom P. tricornutum, the K. foliaceum plastid genome has absorbed two exogenous plasmids. Whether this took place before or after the tertiary

  14. Complete mitochondrial genome sequence from an endangered Indian snake, Python molurus molurus (Serpentes, Pythonidae).

    Science.gov (United States)

    Dubey, Bhawna; Meganathan, P R; Haque, Ikramul

    2012-07-01

    This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17,258 bp, comprising 37 genes (13 protein-coding genes, 22 tRNA genes, and 2 ribosomal RNA genes) along with duplicate control regions, is described herein. The P. molurus molurus mt genome is relatively similar to other snake mt genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows more A and C than T and G on the positive strand, as revealed by positive AT and CG skews. Comparison of individual protein-coding genes with those of other snake genomes suggests that the ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference for NNC codons over NNG codons in the mt genome of P. molurus. Also, the ratios of non-synonymous to synonymous substitution rates (ka/ks) suggest that most of the protein-coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein-coding genes of P. molurus molurus conformed to the previously established snake phylogeny.
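    The skews mentioned above are conventionally computed from base counts on one strand: AT skew = (A - T)/(A + T) and CG skew = (C - G)/(C + G), so a positive value means an excess of A over T, or of C over G. The sketch below assumes those definitions; the function name `skews` and the toy sequence are hypothetical and are not data from the python mitogenome.

```python
from collections import Counter


def skews(seq: str) -> tuple[float, float]:
    """Return (AT skew, CG skew) for one strand of a nucleotide sequence.

    Assumes the sequence contains at least one A or T and at least one C or G;
    otherwise the ratios are undefined and a ZeroDivisionError is raised.
    """
    counts = Counter(seq.upper())
    a, c, g, t = (counts[base] for base in "ACGT")
    at_skew = (a - t) / (a + t)
    cg_skew = (c - g) / (c + g)
    return at_skew, cg_skew


# Hypothetical toy sequence: 3 A, 1 T, 3 C, 1 G.
print(skews("AAATCCCG"))  # (0.5, 0.5)
```

    Both values are positive for the toy sequence, which is the pattern the abstract describes for the positive strand.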

  15. Viral Genome DataBase: storing and analyzing genes and proteins from complete viral genomes.

    Science.gov (United States)

    Hiscock, D; Upton, C

    2000-05-01

    The Viral Genome DataBase (VGDB) contains detailed information of the genes and predicted protein sequences from 15 completely sequenced genomes of large (>100 kb) viruses (2847 genes). The data that is stored includes DNA sequence, protein sequence, GenBank and user-entered notes, molecular weight (MW), isoelectric point (pI), amino acid content, A + T%, nucleotide frequency, dinucleotide frequency and codon use. The VGDB is a MySQL database with a user-friendly Java GUI. Results of queries can be easily sorted by any of the individual parameters. The software and additional figures and information are available at http://athena.bioc.uvic.ca/genomes/index.html .

  16. The complete mitochondrial genome of the spinner shark Carcharhinus brevipinna.

    Science.gov (United States)

    Chen, Xiao; Xiang, Dan; Peng, Xin; Ai, Weiming; Chen, Hao

    2016-05-01

    The mitochondrial genome of the spinner shark (Carcharhinus brevipinna) was determined in this study. It was 16,706 bp in length, with the genomic organization and gene order typical of most vertebrates. Whole nucleotide base composition was 31.3% A, 25.3% C, 13.2% G and 30.1% T. Among the protein-coding genes, there were three overlapping reading frames on the same strand and one on the opposite strand. Two start codons (ATG and GTG) and three stop codons (AGG, TAG and TAA/T) were used in the 13 protein-coding genes. The 22 tRNA genes ranged from 67 bp (tRNA-Cys and tRNA-Ser2) to 75 bp (tRNA-Leu1) in length. Only tRNA-Ser2 could not fold into the typical clover-leaf structure, because it lacks the dihydrouridine (DHU) arm, which is replaced by a simple loop. The control region was 1064 bp in length and showed a higher AT content (66.8%) than the average value of the whole mitogenome (61.4%).
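    The whole-mitogenome AT content quoted above follows directly from the reported base composition. A minimal sketch of that arithmetic, using only the percentages given in the abstract (the variable names are illustrative):

```python
# Base composition in percent, as reported in the abstract above.
composition = {"A": 31.3, "C": 25.3, "G": 13.2, "T": 30.1}

# AT content is the sum of the A and T percentages.
at_content = round(composition["A"] + composition["T"], 1)

# The four rounded percentages need not sum to exactly 100.
total = round(sum(composition.values()), 1)

print(at_content)  # 61.4
print(total)       # 99.9
```

    The computed 61.4% matches the whole-mitogenome value stated in the abstract; the 99.9% total presumably reflects rounding of the individual percentages.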

  17. Complete genome sequence and comparative genomics of the probiotic yeast Saccharomyces boulardii.

    Science.gov (United States)

    Khatri, Indu; Tomar, Rajul; Ganesan, K; Prasad, G S; Subramanian, Srikrishna

    2017-03-23

    The probiotic yeast, Saccharomyces boulardii (Sb) is known to be effective against many gastrointestinal disorders and antibiotic-associated diarrhea. To understand molecular basis of probiotic-properties ascribed to Sb we determined the complete genomes of two strains of Sb i.e. Biocodex and unique28 and the draft genomes for three other Sb strains that are marketed as probiotics in India. We compared these genomes with 145 strains of S. cerevisiae (Sc) to understand genome-level similarities and differences between these yeasts. A distinctive feature of Sb from other Sc is absence of Ty elements Ty1, Ty3, Ty4 and associated LTR. However, we could identify complete Ty2 and Ty5 elements in Sb. The genes for hexose transporters HXT11 and HXT9, and asparagine-utilization are absent in all Sb strains. We find differences in repeat periods and copy numbers of repeats in flocculin genes that are likely related to the differential adhesion of Sb as compared to Sc. Core-proteome based taxonomy places Sb strains along with wine strains of Sc. We find the introgression of five genes from Z. bailii into the chromosome IV of Sb and wine strains of Sc. Intriguingly, genes involved in conferring known probiotic properties to Sb are conserved in most Sc strains.

  18. Complete genome sequence of Arcanobacterium haemolyticum type strain (11018T)

    Energy Technology Data Exchange (ETDEWEB)

    Yasawong, Montri [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Teshima, Hazuki [Los Alamos National Laboratory (LANL)]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute]; Tice, Hope [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Bruce, David [Los Alamos National Laboratory (LANL)]; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [U.S. Department of Energy, Joint Genome Institute]; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Chang, Yun-Juan [ORNL]; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL)]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Pukall, Rudiger [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute]; Bristow, James [U.S. Department of Energy, Joint Genome Institute]; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute]; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]

    2010-01-01

    Vulcanisaeta distributa Itoh et al. 2002 belongs to the family Thermoproteaceae in the phylum Crenarchaeota. The genus Vulcanisaeta is characterized by a global distribution in hot and acidic springs. This is the first genome sequence from a member of the genus Vulcanisaeta and seventh genome sequence in the family Thermoproteaceae. The 2,374,137 bp long genome with its 2,544 protein-coding and 49 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  19. Functional Insights from Structural Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Forouhar, F.; Kuzin, A.; Seetharaman, J.; Lee, I.; Zhou, W.; Abashidze, M.; Chen, Y.; Montelione, G.; Tong, L.; et al

    2007-01-01

    Structural genomics efforts have produced structural information, either directly or by modeling, for thousands of proteins over the past few years. While many of these proteins have known functions, a large percentage of them have not been characterized at the functional level. The structural information has provided valuable functional insights on some of these proteins, through careful structural analyses, serendipity, and structure-guided functional screening. Some of the success stories based on structures solved at the Northeast Structural Genomics Consortium (NESG) are reported here. These include a novel methyl salicylate esterase with an important role in plant innate immunity, a novel RNA methyltransferase (H. influenzae yggJ (HI0303)), a novel spermidine/spermine N-acetyltransferase (B. subtilis PaiA), a novel methyltransferase or AdoMet binding protein (A. fulgidus AF_0241), an ATP:cob(I)alamin adenosyltransferase (B. subtilis YvqK), a novel carboxysome pore (E. coli EutN), a proline racemase homolog with a disrupted active site (B. melitensis BME11586), an FMN-dependent enzyme (S. pneumoniae SP_1951), and a 12-stranded β-barrel with a novel fold (V. parahaemolyticus VPA1032).

  20. 2004 Structural, Function and Evolutionary Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Douglas L. Brutlag; Nancy Ryan Gray

    2005-03-23

    This Gordon conference will cover the areas of structural, functional and evolutionary genomics. It will take a systematic approach to genomics, examining the evolution of proteins, protein functional sites, protein-protein interactions, regulatory networks, and metabolic networks. Emphasis will be placed on what we can learn from comparative genomics and entire genomes and proteomes.

  1. Complete Chloroplast Genome Sequence of Coptis chinensis Franch. and Its Evolutionary History

    Science.gov (United States)

    He, Yang; Deng, Cao; Fan, Gang; Qin, Shishang

    2017-01-01

    The Coptis chinensis Franch. is an important medicinal plant from the Ranunculales. We used next generation sequencing technology to determine the complete chloroplast genome of C. chinensis. This genome is 155,484 bp long with 38.17% GC content. Two 26,758 bp long inverted repeats separated the genome into a typical quadripartite structure. The C. chinensis chloroplast genome consists of 128 gene loci, including eight rRNA gene loci, 28 tRNA gene loci, and 92 protein-coding gene loci. Most of the SSRs in C. chinensis are poly-A/T. The numbers of mononucleotide SSRs in C. chinensis and other Ranunculaceae species are fewer than those in Berberidaceae species, while the number of dinucleotide SSRs is greater than that in the Berberidaceae. C. chinensis diverged from other Ranunculaceae species an estimated 81 million years ago (Mya). The divergence between Ranunculaceae and Berberidaceae was ~111 Mya, while the Ranunculales and Magnoliaceae shared a common ancestor during the Jurassic, ~153 Mya. Position 104 of the C. chinensis ndhG protein was identified as a positively selected site, indicating possible selection for the photosystem-chlororespiration system in C. chinensis. In summary, the complete sequencing and annotation of the C. chinensis chloroplast genome will facilitate future studies on this important medicinal species. PMID:28698879

  2. Complete Chloroplast Genome Sequence of Coptis chinensis Franch. and Its Evolutionary History

    Directory of Open Access Journals (Sweden)

    Yang He

    2017-01-01

    The Coptis chinensis Franch. is an important medicinal plant from the Ranunculales. We used next generation sequencing technology to determine the complete chloroplast genome of C. chinensis. This genome is 155,484 bp long with 38.17% GC content. Two 26,758 bp long inverted repeats separated the genome into a typical quadripartite structure. The C. chinensis chloroplast genome consists of 128 gene loci, including eight rRNA gene loci, 28 tRNA gene loci, and 92 protein-coding gene loci. Most of the SSRs in C. chinensis are poly-A/T. The numbers of mononucleotide SSRs in C. chinensis and other Ranunculaceae species are fewer than those in Berberidaceae species, while the number of dinucleotide SSRs is greater than that in the Berberidaceae. C. chinensis diverged from other Ranunculaceae species an estimated 81 million years ago (Mya). The divergence between Ranunculaceae and Berberidaceae was ~111 Mya, while the Ranunculales and Magnoliaceae shared a common ancestor during the Jurassic, ~153 Mya. Position 104 of the C. chinensis ndhG protein was identified as a positively selected site, indicating possible selection for the photosystem-chlororespiration system in C. chinensis. In summary, the complete sequencing and annotation of the C. chinensis chloroplast genome will facilitate future studies on this important medicinal species.

  3. Complete mitochondrial genome sequence of the polychaete annelid Platynereis dumerilii

    Energy Technology Data Exchange (ETDEWEB)

    Boore, Jeffrey L.

    2004-08-15

    Complete mitochondrial genome sequences are now available for 126 metazoans (see Boore 1999; Mitochondrial Genomics link at http://www.jgi.doe.gov), but the taxonomic representation is highly biased. For example, 80 are from a single phylum, Chordata, and show little variation for many molecular features. Arthropoda is represented by 16 taxa, Mollusca by eight, and Echinodermata by five, with only 17 others from the remaining ~30 metazoan phyla. With few exceptions (see Wolstenholme 1992 and Boore 1999) these are circular DNA molecules, about 16 kb in size, and encode the same set of 37 genes. A variety of non-standard names are sometimes used for animal mitochondrial genes; see Boore (1999) for gene nomenclature and a table of synonyms. Mitochondrial genome comparisons serve as a model of genome evolution. In this system, much smaller and simpler than that of the nucleus, are all of the same factors of genome evolution, where one may find tractable the changes in tRNA structure, base composition, genetic code, gene arrangement, etc. Further, patterns of mitochondrial gene rearrangements are an exceptionally reliable indicator of phylogenetic relationships (Smith et al. 1993; Boore et al. 1995; Boore, Lavrov, and Brown 1998; Boore and Brown 1998, 2000; Dowton 1999; Stechmann and Schlegel 1999; Kurabayashi and Ueshima 2000). To these ends, we are sampling further the variation among major animal groups in features of their mitochondrial genomes.

  4. The complete plastid genome of Bunge's pine Pinus bungeana (Pinaceae).

    Science.gov (United States)

    Li, Zhong-Hu; Zhu, Juan; Yang, Yi-Xin; Yang, Jie; He, Jing-Wen; Zhao, Gui-Fang

    2016-07-01

    The complete nucleotide sequence of the chloroplast genome (cpDNA) of Bunge's pine, Pinus bungeana Zucc. ex Endl., was determined in this study. The cpDNA was 117 861 bp in length, containing a pair of 475 bp inverted repeat regions (IRa and IRb), which were separated by large and small single copy regions (LSC and SSC) of 65 373 and 51 538 bp, respectively. The cpDNA contained 111 genes, including 71 protein-coding genes (71 PCG species), 4 ribosomal RNA genes (4 rRNA species) and 36 tRNA genes (32 tRNA species). Among these genes, 13 harbored a single intron and one (ycf3) contained two introns. The overall AT content of Bunge's pine cpDNA is 61.2%, while the corresponding values of the LSC, SSC and IR regions are 61.9%, 60.2% and 62.5%, respectively. A phylogenetic reconstruction based on maximum parsimony analysis suggested that all the sampled Pinus species clustered into a monophyletic clade with high bootstrap support, and that the cpDNA of P. bungeana is closely related to that of the congeneric P. gerardiana.

  5. Complete mitochondrial genome sequences from five Eimeria species (Apicomplexa; Coccidia; Eimeriidae) infecting domestic turkeys.

    Science.gov (United States)

    Ogedengbe, Mosun E; El-Sherry, Shiem; Whale, Julia; Barta, John R

    2014-07-17

    via host switching from another avian host. Phylogenetic analyses suggest E. necatrix and E. tenella are related distantly to other Eimeria of chickens. Mitochondrial genomes of Eimeria species sequenced to date are highly conserved with regard to gene content and structure. Nonetheless, complete mitochondrial genome sequences and, particularly the three CDS, possess sufficient sequence variability for differentiating Eimeria species of poultry. The mitochondrial genome sequences are highly suited for molecular diagnostics and phylogenetics of coccidia and, potentially, genetic markers for molecular epidemiology.

  6. The complete chloroplast genome sequence of Maddenia hypoleuca Koehne (Prunoideae, Rosaceae).

    Science.gov (United States)

    Chen, Tao; Zhang, Jing; Liu, Yin; Wang, Hao; Wang, Juan; Chen, Qing; Tang, Hao-Ru; Wang, Xiao-Rong

    2016-11-01

    Maddenia hypoleuca Koehne, which belongs to the family Rosaceae, is a species native to China. The complete chloroplast (cp) genome was generated by de novo assembly using low-coverage whole-genome sequencing data and manual correction. The cp genome was 158 084 bp in length, with a GC content of 36.63%. It exhibited a typical quadripartite structure: a pair of large inverted repeat regions (IRs, 26 246 bp each), a large single-copy region (LSC, 86 713 bp), and a small single-copy region (SSC, 18 879 bp). A total of 114 genes were predicted, which included 80 protein-coding genes, 30 tRNA genes, and four rRNA genes. Phylogenetic analysis indicated that M. hypoleuca is most closely related to Prunus padus within the Prunoideae subfamily, which conforms to the traditional classification.

  7. Complete Genome Sequence of Pediococcus pentosaceus Strain SL4

    DEFF Research Database (Denmark)

    Dantoft, Shruti Harnal; Bielak, Eliza Maria; Seo, Jae-Gu

    2013-01-01

    Pediococcus pentosaceus SL4 was isolated from a Korean fermented vegetable product, kimchi. We report here the whole-genome sequence (WGS) of P. pentosaceus SL4. The genome consists of a 1.79-Mb circular chromosome (G+C content of 37.3%) and seven distinct plasmids ranging in size from 4 kb to 50...

  8. Complete mitochondrial genome of a rhodolith, Sporolithon durum (Sporolithales, Rhodophyta).

    Science.gov (United States)

    Kim, Kyeong Mi; Yang, Eun Chan; Kim, Jeong Ha; Nelson, Wendy A; Yoon, Hwan Su

    2015-02-01

    We present the first mitochondrial genome of the nongeniculate coralline red alga, Sporolithon durum (Sporolithales). The genome consists of 45 genes, including 24 protein-coding, 2 rRNA and 19 tRNA genes in a circular molecule of 26,202 bp with overall 28.4% GC content.

  9. Complete genome sequence of Halorhodospira halophila SL1

    Energy Technology Data Exchange (ETDEWEB)

    Challacombe, Jean F [ORNL; Majid, Sophia [University of Chicago; Deole, Ratnakar [Oklahoma State University; Brettin, Thomas S. [Argonne National Laboratory (ANL); Bruce, David [Los Alamos National Laboratory (LANL); Delano, Susana [Los Alamos National Laboratory (LANL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Gleasner, Cheryl D. [Los Alamos National Laboratory (LANL); Han, Cliff [Los Alamos National Laboratory (LANL); Misra, Monica [Los Alamos National Laboratory (LANL); Reitenga, Krista K. [Los Alamos National Laboratory (LANL); Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Copeland, A [U.S. Department of Energy, Joint Genome Institute; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Saunders, Elizabeth H [Los Alamos National Laboratory (LANL); Tapia, Roxanne [Los Alamos National Laboratory (LANL); Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Hoff, Wouter D. [Oklahoma State University

    2013-01-01

    Halorhodospira halophila is among the most halophilic organisms known. It is an obligately photosynthetic and anaerobic purple sulfur bacterium that exhibits autotrophic growth up to saturated NaCl concentrations. The type strain H. halophila SL1 was isolated from a hypersaline lake in Oregon. Here we report the determination of its entire genome in a single contig. This is the first genome of a phototrophic extreme halophile. The genome consists of 2,678,452 bp, encoding 2493 predicted genes as determined by automated genome annotation. Of the 2407 predicted proteins, 1905 were assigned to a putative function. Future detailed analysis of this genome promises to yield insights into the halophilic adaptations of this organism, its ability for photoautotrophic growth under extreme conditions, and its characteristic sulfur metabolism.

  10. MIPS: a database for protein sequences and complete genomes.

    Science.gov (United States)

    Mewes, H W; Hani, J; Pfeiffer, F; Frishman, D

    1998-01-01

    The MIPS group [Munich Information Center for Protein Sequences of the German National Center for Environment and Health (GSF)] at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, is involved in a number of data collection activities, including a comprehensive database of the yeast genome, a database reflecting the progress in sequencing the Arabidopsis thaliana genome, the systematic analysis of other small genomes and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). Through its WWW server (http://www.mips.biochem.mpg.de ) MIPS provides access to a variety of generic databases, including a database of protein families as well as automatically generated data by the systematic application of sequence analysis algorithms. The yeast genome sequence and its related information was also compiled on CD-ROM to provide dynamic interactive access to the 16 chromosomes of the first eukaryotic genome unraveled. PMID:9399795

  11. The complete mitochondrial genomes for three Toxocara species of human and animal health significance

    Directory of Open Access Journals (Sweden)

    Wu Xiang-Yun

    2008-05-01

    Background: Studying mitochondrial (mt) genomics has important implications for various fundamental areas, including mt biochemistry, physiology and molecular biology. In addition, mt genome sequences have provided useful markers for investigating population genetic structures, systematics and phylogenetics of organisms. Toxocara canis, Toxocara cati and Toxocara malaysiensis cause significant health problems in animals and humans. Although they are of importance in human and animal health, no information on the mt genomes for any Toxocara species is available. Results: The sizes of the entire mt genome are 14,322 bp for T. canis, 14,029 bp for T. cati and 14,266 bp for T. malaysiensis. These circular genomes are amongst the largest reported to date for all secernentean nematodes. Their relatively large sizes relate mainly to an increased length in the AT-rich region. The mt genomes of the three Toxocara species all encode 12 proteins, two ribosomal RNAs and 22 transfer RNA genes, but lack the ATP synthetase subunit 8 gene, which is consistent with all other species of nematodes studied to date, with the exception of Trichinella spiralis. All genes are transcribed in the same direction and have a nucleotide composition high in A and T, but low in G and C. The A+T contents of the complete genomes are 68.57% for T. canis, 69.95% for T. cati and 68.86% for T. malaysiensis, among which the A+T for T. canis is the lowest among all nematodes studied to date. The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. The mt genome structures for the three Toxocara species, including genes and non-coding regions, are in the same order as for Ascaris suum and Anisakis simplex, but differ from Ancylostoma duodenale, Necator americanus and Caenorhabditis elegans only in the location of the AT-rich region, whereas there are substantial differences when compared with Onchocerca volvulus

  12. Complete Sequence and Analysis of the Mitochondrial Genome of Hemiselmis andersenii CCMP644 (Cryptophyceae)

    Directory of Open Access Journals (Sweden)

    Bowman Sharen

    2008-05-01

    Background: Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes: a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte Rhodomonas salina. Here, the second complete mitochondrial genome of the cryptophyte alga Hemiselmis andersenii CCMP644 is presented. Results: The H. andersenii mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of a ~20 Kbp long intergenic region composed of numerous tandem and dispersed repeat units of 22–336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The H. andersenii mtDNA shares a number of features with the genome of the cryptophyte Rhodomonas salina, including general architecture, gene content, and the presence of a large repeat region. However, the H. andersenii mtDNA is devoid of inverted repeats and introns, which are present in R. salina. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the H. andersenii mtDNA has lost or converted its original trnK(uuu) gene and possesses a trnS-derived 'trnK(uuu)', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher

  13. Improvement of genome assembly completeness and identification of novel full-length protein-coding genes by RNA-seq in the giant panda genome.

    Science.gov (United States)

    Chen, Meili; Hu, Yibo; Liu, Jingxing; Wu, Qi; Zhang, Chenglin; Yu, Jun; Xiao, Jingfa; Wei, Fuwen; Wu, Jiayan

    2015-12-11

    High-quality and complete gene models are the basis of whole genome analyses. The giant panda (Ailuropoda melanoleuca) genome was the first genome sequenced on the basis of solely short reads, but the genome annotation had lacked the support of transcriptomic evidence. In this study, we applied RNA-seq to globally improve the genome assembly completeness and to detect novel expressed transcripts in 12 tissues from giant pandas, by using a transcriptome reconstruction strategy that combined reference-based and de novo methods. Several aspects of genome assembly completeness in the transcribed regions were effectively improved by the de novo assembled transcripts, including genome scaffolding, the detection of small-size assembly errors, the extension of scaffold/contig boundaries, and gap closure. Through expression and homology validation, we detected three groups of novel full-length protein-coding genes. A total of 12.62% of the novel protein-coding genes were validated by proteomic data. GO annotation analysis showed that some of the novel protein-coding genes were involved in pigmentation, anatomical structure formation and reproduction, which might be related to the development and evolution of the black-white pelage, pseudo-thumb and delayed embryonic implantation of giant pandas. The updated genome annotation will help further giant panda studies from both structural and functional perspectives.

  14. Complete sequence and comparative analysis of the chloroplast genome of coconut palm (Cocos nucifera).

    Science.gov (United States)

    Huang, Ya-Yi; Matzke, Antonius J M; Matzke, Marjori

    2013-01-01

    Coconut, a member of the palm family (Arecaceae), is one of the most economically important trees used by mankind. Despite its diverse morphology, coconut is recognized taxonomically as only a single species (Cocos nucifera L.). There are two major coconut varieties, tall and dwarf, the latter of which displays traits resulting from selection by humans. We report here the complete chloroplast (cp) genome of a dwarf coconut plant, and describe the gene content and organization, inverted repeat fluctuations, repeated sequence structure, and occurrence of RNA editing. Phylogenetic relationships of monocots were inferred based on 47 chloroplast protein-coding genes. Potential nodes for events of gene duplication and pseudogenization related to inverted repeat fluctuation were mapped onto the tree using parsimony criteria. We compare our findings with those from other palm species for which complete cp genome sequences are available.

  15. Complete genome sequencing and evolutionary analysis of Indian isolates of Dengue virus type 2

    Energy Technology Data Exchange (ETDEWEB)

    Dash, Paban Kumar, E-mail: pabandash@rediffmail.com; Sharma, Shashi; Soni, Manisha; Agarwal, Ankita; Parida, Manmohan; Rao, P.V.Lakshmana

    2013-07-05

    Highlights: •Complete genome of Indian DENV-2 was deciphered for the first time in this study. •The recent Indian DENV-2 revealed presence of many unique amino acid residues. •Genotype shift (American to Cosmopolitan) characterizes evolution of DENV-2 in India. •Circulation of a unique clade of DENV-2 in South Asia was identified. -- Abstract: Dengue is the most important arboviral infection of global public health significance. It is now endemic in most parts of the South East Asia including India. Though Dengue virus type 2 (DENV-2) is predominantly associated with major outbreaks in India, complete genome information of Indian DENV-2 is not available. In this study, the full-length genome of five DENV-2 isolates (four from 2001 to 2011 and one from 1960), from different parts of India was determined. The complete genome of the Indian DENV-2 was found to be 10,670 bases long with an open reading frame coding for 3391 amino acids. The recent Indian DENV-2 (2001–2011) revealed a nucleotide sequence identity of around 90% and 97% with an older Indian DENV-2 (1960) and closely related Sri Lankan and Chinese DENV-2 respectively. Presence of unique amino acid residues and non-conservative substitutions in critical amino acid residues of major structural and non-structural proteins was observed in recent Indian DENV-2. Selection pressure analysis revealed positive selection in few amino acid sites of the genes encoding for structural and non-structural proteins. The molecular phylogenetic analysis based on comparison of both complete coding region and envelope protein gene with globally diverse DENV-2 viruses classified the recent Indian isolates into a unique South Asian clade within Cosmopolitan genotype. A shift of genotype from American to Cosmopolitan in 1970s characterized the evolution of DENV-2 in India. Present study is the first report on complete genome characterization of emerging DENV-2 isolates from India and highlights the circulation of a

  16. Complete genome sequencing and evolutionary analysis of Indian isolates of Dengue virus type 2

    International Nuclear Information System (INIS)

    Dash, Paban Kumar; Sharma, Shashi; Soni, Manisha; Agarwal, Ankita; Parida, Manmohan; Rao, P.V.Lakshmana

    2013-01-01

    Highlights: •Complete genome of Indian DENV-2 was deciphered for the first time in this study. •The recent Indian DENV-2 revealed presence of many unique amino acid residues. •Genotype shift (American to Cosmopolitan) characterizes evolution of DENV-2 in India. •Circulation of a unique clade of DENV-2 in South Asia was identified. -- Abstract: Dengue is the most important arboviral infection of global public health significance. It is now endemic in most parts of the South East Asia including India. Though Dengue virus type 2 (DENV-2) is predominantly associated with major outbreaks in India, complete genome information of Indian DENV-2 is not available. In this study, the full-length genome of five DENV-2 isolates (four from 2001 to 2011 and one from 1960), from different parts of India was determined. The complete genome of the Indian DENV-2 was found to be 10,670 bases long with an open reading frame coding for 3391 amino acids. The recent Indian DENV-2 (2001–2011) revealed a nucleotide sequence identity of around 90% and 97% with an older Indian DENV-2 (1960) and closely related Sri Lankan and Chinese DENV-2 respectively. Presence of unique amino acid residues and non-conservative substitutions in critical amino acid residues of major structural and non-structural proteins was observed in recent Indian DENV-2. Selection pressure analysis revealed positive selection in few amino acid sites of the genes encoding for structural and non-structural proteins. The molecular phylogenetic analysis based on comparison of both complete coding region and envelope protein gene with globally diverse DENV-2 viruses classified the recent Indian isolates into a unique South Asian clade within Cosmopolitan genotype. A shift of genotype from American to Cosmopolitan in 1970s characterized the evolution of DENV-2 in India. Present study is the first report on complete genome characterization of emerging DENV-2 isolates from India and highlights the circulation of a

  17. Complete genome sequence of Serratia plymuthica strain AS12

    Energy Technology Data Exchange (ETDEWEB)

    Neupane, Saraswoti [Uppsala University, Uppsala, Sweden; Finlay, Roger D. [Uppsala University, Uppsala, Sweden; Alstrom, Sadhna [Uppsala University, Uppsala, Sweden; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Peters, Lin [U.S. Department of Energy, Joint Genome Institute; Ovchinnikova, Galina [U.S. Department of Energy, Joint Genome Institute; Chertkov, Olga [Los Alamos National Laboratory (LANL); Han, James [U.S. Department of Energy, Joint Genome Institute; Han, Cliff [Los Alamos National Laboratory (LANL); Tapia, Roxanne [Los Alamos National Laboratory (LANL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Pagani, Ioanna [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Hogberg, Nils [Uppsala University, Uppsala, Sweden

    2012-01-01

    A plant associated member of the family Enterobacteriaceae, Serratia plymuthica strain AS12 was isolated from rapeseed roots. It is of scientific interest due to its plant growth promoting and plant pathogen inhibiting ability. The genome of S. plymuthica AS12 comprises a 5,443,009 bp long circular chromosome, which consists of 4,952 protein-coding genes, 87 tRNA genes and 7 rRNA operons. This genome was sequenced within the 2010 DOE-JGI Community Sequencing Program (CSP2010) as part of the project entitled 'Genomics of four rapeseed plant growth promoting bacteria with antagonistic effect on plant pathogens'.

  18. Comparative analysis of complete chloroplast genome sequence and inversion variation in Lasthenia burkei (Madieae, Asteraceae).

    Science.gov (United States)

    Walker, Joseph F; Zanis, Michael J; Emery, Nancy C

    2014-04-01

    Complete chloroplast genome studies can help resolve relationships among large, complex plant lineages such as Asteraceae. We present the first whole plastome from the Madieae tribe and compare its sequence variation to other chloroplast genomes in Asteraceae. We used high throughput sequencing to obtain the Lasthenia burkei chloroplast genome. We compared sequence structure and rates of molecular evolution in the small single copy (SSC), large single copy (LSC), and inverted repeat (IR) regions to those for eight Asteraceae accessions and one Solanaceae accession. The chloroplast sequence of L. burkei is 150 746 bp and contains 81 unique protein coding genes and 4 coding ribosomal RNA sequences. We identified three major inversions in the L. burkei chloroplast, all of which have been found in other Asteraceae lineages, and a previously unreported inversion in Lactuca sativa. Regions flanking inversions contained tRNA sequences, but did not have particularly high G + C content. Substitution rates varied among the SSC, LSC, and IR regions, and rates of evolution within each region varied among species. Some observed differences in rates of molecular evolution may be explained by the relative proportion of coding to noncoding sequence within regions. Rates of molecular evolution vary substantially within and among chloroplast genomes, and major inversion events may be promoted by the presence of tRNAs. Collectively, these results provide insight into different mechanisms that may promote intramolecular recombination and the inversion of large genomic regions in the plastome.

  19. The complete mitochondrial genome of rabbit pinworm Passalurus ambiguus: genome characterization and phylogenetic analysis.

    Science.gov (United States)

    Liu, Guo-Hua; Li, Sheng; Zou, Feng-Cai; Wang, Chun-Ren; Zhu, Xing-Quan

    2016-01-01

    Passalurus ambiguus (Nematoda: Oxyuridae) is a common pinworm which parasitizes the caecum and colon of rabbits. Despite its significance as a pathogen, the epidemiology, genetics, systematics, and biology of this pinworm remain poorly understood. In the present study, we sequenced the complete mitochondrial (mt) genome of P. ambiguus. The circular mt genome is 14,023 bp in size and encodes 36 genes, including 12 protein-coding, two ribosomal RNA, and 22 transfer RNA genes. The mt gene order of P. ambiguus is the same as that of Wellcomia siamensis, but distinct from that of Enterobius vermicularis. Phylogenetic analyses based on concatenated amino acid sequences of 12 protein-coding genes by Bayesian inference (BI) showed that P. ambiguus was more closely related to W. siamensis than to E. vermicularis. This mt genome provides novel genetic markers for studying the molecular epidemiology, population genetics, and systematics of pinworms of animals and humans, and should have implications for the diagnosis, prevention, and control of passaluriasis in rabbits and other animals.

  20. Complete Genome Sequence of Mycobacterium phlei Type Strain RIVM601174

    KAUST Repository

    Abdallah, A. M.; Rashid, M.; Adroub, S. A.; Arnoux, M.; Ali, Shahjahan; van Soolingen, D.; Bitter, W.; Pain, Arnab

    2012-01-01

    Mycobacterium phlei is a rapidly growing nontuberculous Mycobacterium species that is typically nonpathogenic, with few reported cases of human disease. Here we report the whole genome sequence of M. phlei type strain RIVM601174.

  1. Complete Genome Sequence of Mycobacterium phlei Type Strain RIVM601174

    KAUST Repository

    Abdallah, A. M.

    2012-05-24

    Mycobacterium phlei is a rapidly growing nontuberculous Mycobacterium species that is typically nonpathogenic, with few reported cases of human disease. Here we report the whole genome sequence of M. phlei type strain RIVM601174.

  2. Complete Genome Sequence of Pseudomonas aeruginosa Phage AAT-1.

    Science.gov (United States)

    Andrade-Domínguez, Andrés; Kolter, Roberto

    2016-08-25

    Aspects of the interaction between phages and animals are of interest and importance for medical applications. Here, we report the genome sequence of the lytic Pseudomonas phage AAT-1, isolated from mammalian serum. AAT-1 is a double-stranded DNA phage, with a genome of 57,599 bp, containing 76 predicted open reading frames. Copyright © 2016 Andrade-Domínguez and Kolter.

  3. Complete mitochondrial genome of the spadenose shark (Scoliodon macrorhynchos).

    Science.gov (United States)

    Chen, Xiao; Peng, Xin; Zhang, Peng; Yang, Shenyun; Liu, Min

    2014-04-01

    We present, for the first time, the complete mitogenome of the spadenose shark Scoliodon macrorhynchos (Carcharhinidae, Carcharhiniformes). The mitogenome is 16,693 bp long and contains 13 protein-coding genes, two rRNAs, 22 tRNAs and one control region, a typical vertebrate arrangement. The codon usage bias differed between the protein genes encoded on the H-strand and those encoded on the L-strand. All tRNA genes have the typical cloverleaf secondary structure except tRNA-Ser2, in which the dihydrouridine (DHU) arm is replaced by a simple loop with 12 unpaired nucleotides. A termination associated sequence and three conserved sequence blocks (CSB I-III) were identified in the control region, which are thought to be associated with the replication and transcription of the mitogenome.

  4. Complete mitochondrial genome of freshwater shark Wallago attu (Bloch & Schneider) from Indus River Sindh, Pakistan.

    Science.gov (United States)

    Laghari, Muhammad Younis; Lashari, Punhal; Xu, Peng; Zhao, Zixia; Jiang, Li; Narejo, Naeem Tariq; Xin, Baoping; Sun, Xiaowen; Zhang, Yan

    2016-01-01

    The complete mitochondrial genome of the freshwater giant catfish, Wallago attu, was isolated by LA PCR (TakaRa LAtaq, Dalian, China) and sequenced by Sanger's method. The complete mitogenome is 15,639 bp in length and contains 13 typical vertebrate protein-coding genes, 2 rRNA genes and 22 tRNA genes. The whole genome base composition was estimated to be 31.17% A, 28.15% C, 15.55% G and 25.12% T. The complete mitochondrial genome of the catfish W. attu provides fundamental tools for genetic breeding.

  5. Complete mitochondrial genome of the Freshwater Catfish Rita rita (Siluriformes, Bagridae).

    Science.gov (United States)

    Lashari, Punhal; Laghari, Muhammad Younis; Xu, Peng; Zhao, Zixia; Jiang, Li; Narejo, Naeem Tariq; Deng, Yulin; Sun, Xiaowen; Zhang, Yan

    2015-01-01

    The complete mitochondrial genome of the catfish Rita rita, a Red Listed species listed as Critically Endangered, was isolated by LA PCR (TakaRa LAtaq, Dalian, China) and sequenced by Sanger's method. The complete mitogenome is 16,449 bp in length and contains 13 typical vertebrate protein-coding genes, 2 rRNA genes and 22 tRNA genes. The whole genome base composition was estimated to be 33.40% A, 27.43% C, 14.26% G and 24.89% T. The complete mitochondrial genome of the catfish Rita rita provides the basis for genetic breeding and conservation studies.

  6. Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura

    Directory of Open Access Journals (Sweden)

    Boore Jeffrey L

    2004-09-01

    Background: Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although hundreds of these genome sequences have been reported, the taxonomic sampling is highly biased toward vertebrates and arthropods, with many whole phyla remaining unstudied. This is the first description of a complete mitochondrial genome sequence of a representative of the phylum Echiura, that of the fat innkeeper worm, Urechis caupo. Results: This mtDNA is 15,113 nts in length and 62% A+T. It contains the 37 genes that are typical for animal mtDNAs in an arrangement somewhat similar to that of annelid worms. All genes are encoded by the same DNA strand which is rich in A and C relative to the opposite strand. Codons ending with the dinucleotide GG are more frequent than would be expected from apparent mutational biases. The largest non-coding region is only 282 nts long, is 71% A+T, and has potential for secondary structures. Conclusions: Urechis caupo mtDNA shares many features with those of the few studied annelids, including the common usage of ATG start codons, unusual among animal mtDNAs, as well as gene arrangements, tRNA structures, and codon usage biases.

  7. Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura.

    Science.gov (United States)

    Boore, Jeffrey L

    2004-09-15

    Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although hundreds of these genome sequences have been reported, the taxonomic sampling is highly biased toward vertebrates and arthropods, with many whole phyla remaining unstudied. This is the first description of a complete mitochondrial genome sequence of a representative of the phylum Echiura, that of the fat innkeeper worm, Urechis caupo. This mtDNA is 15,113 nts in length and 62% A+T. It contains the 37 genes that are typical for animal mtDNAs in an arrangement somewhat similar to that of annelid worms. All genes are encoded by the same DNA strand which is rich in A and C relative to the opposite strand. Codons ending with the dinucleotide GG are more frequent than would be expected from apparent mutational biases. The largest non-coding region is only 282 nts long, is 71% A+T, and has potential for secondary structures. Urechis caupo mtDNA shares many features with those of the few studied annelids, including the common usage of ATG start codons, unusual among animal mtDNAs, as well as gene arrangements, tRNA structures, and codon usage biases.

  8. The complete chloroplast genome of Cinnamomum camphora and its comparison with related Lauraceae species

    Directory of Open Access Journals (Sweden)

    Caihui Chen

    2017-09-01

    Cinnamomum camphora, a member of the Lauraceae family, is a valuable aromatic and timber tree that is indigenous to the south of China and Japan. All parts of Cinnamomum camphora have secretory cells containing different volatile chemical compounds that are utilized as herbal medicines and essential oils. Here, we report the complete sequencing of the chloroplast genome of Cinnamomum camphora using Illumina technology. The chloroplast genome of Cinnamomum camphora is 152,570 bp in length and characterized by a relatively conserved quadripartite structure containing a large single copy region of 93,705 bp, a small single copy region of 19,093 bp and two inverted repeat (IR) regions of 19,886 bp each. Overall, the genome contained 123 coding regions, of which 15 were repeated in the IR regions. An analysis of chloroplast sequence divergence revealed that the small single copy region was highly variable among the different genera in the Lauraceae family. A total of 40 repeat structures and 83 simple sequence repeats were detected in both the coding and non-coding regions. A phylogenetic analysis indicated that Calycanthus is most closely related to Lauraceae, both being members of Laurales, which forms a sister group to Magnoliids. The complete sequence of the chloroplast genome of Cinnamomum camphora will aid in-depth taxonomical studies of the Lauraceae family in the future. The genetic sequence information will also have valuable applications for chloroplast genetic engineering.

  9. Complete mitochondrial genome of Porzana fusca and Porzana pusilla and phylogenetic relationship of 16 Rallidae species.

    Science.gov (United States)

    Chen, Peng; Han, Yuqing; Zhu, Chaoying; Gao, Bin; Ruan, Luzhang

    2017-12-01

    The complete mitochondrial genome sequences of Porzana fusca and Porzana pusilla were determined. The two avian species share a high degree of homology in terms of mitochondrial genome organization and gene arrangement. Their mitochondrial genomes are 16,935 and 16,978 bp long, respectively, and each consists of 37 genes and a control region. Their PCGs are both 11,365 bp long and have similar structure. Their tRNA gene sequences could be folded into the canonical cloverleaf secondary structure, except for tRNA-Ser (AGY), which lost its "DHU" arm. Based on the concatenated nucleotide sequences of the complete mitochondrial DNA genes of 16 Rallidae species, reconstruction of phylogenetic trees and analysis of the molecular clock indicated that P. fusca and P. pusilla form a sister group, which in turn is the sister group to Rallina eurizonoides. The genus Gallirallus is a sister group to the genus Lewinia, and these groups in turn are sister groups to the genus Porphyrio. Moreover, molecular clock analyses suggested that the basal divergence of Rallidae could be traced back to 40.47 (41.46‒39.45) million years ago (Mya), and the divergence of Porzana occurred approximately 5.80 (15.16‒0.79) Mya.

  10. The complete mitochondrial genome of a stonefly species, Togoperla sp. (Plecoptera: Perlidae).

    Science.gov (United States)

    Wang, Kai; Wang, Yuyu; Yang, Ding

    2016-05-01

    The complete mitochondrial (mt) genome of a stonefly species, Togoperla sp. (Plecoptera: Perlidae), was sequenced. The 15,723 bp long genome has the standard metazoan complement of 37 genes and an A+T-rich region, which is the same as the insect ancestral genome arrangement.

  11. Complete Genome Sequences of Mycobacteriophages Clautastrophe, Kingsolomon, Krypton555, and Nicholas

    OpenAIRE

    Chung, Hui-Min; D’Elia, Tom; Ross, Joseph F.; Alvarado, Samuel M.; Brantley, Molly-Catherine; Bricker, Lydia P.; Butler, Courtney R.; Crist, Carson; Dane, Julia M.; Farran, Brett W.; Hobbs, Sierra; Lapak, Michelle; Lovell, Conner; Ludergnani, Nicholas; McMullen, Allison

    2017-01-01

    We report here the complete genome sequences of four subcluster L3 mycobacteriophages newly isolated from soil samples, using Mycobacterium smegmatis mc²155 as the host. Comparative genomic analyses with four previously described subcluster L3 phages reveal strong nucleotide similarity and gene conservation, with several large insertions/deletions near their right genome ends.

  12. Complete Genome Sequences of Mycobacteriophages Clautastrophe, Kingsolomon, Krypton555, and Nicholas

    Science.gov (United States)

    Chung, Hui-Min; D’Elia, Tom; Ross, Joseph F.; Alvarado, Samuel M.; Brantley, Molly-Catherine; Bricker, Lydia P.; Butler, Courtney R.; Crist, Carson; Dane, Julia M.; Farran, Brett W.; Hobbs, Sierra; Lapak, Michelle; Lovell, Conner; McMullen, Allison; Mirza, Sohail A.; Thrift, Noah; Vaughan, Donald P.; Worley, Grace; Ejikemeuwa, Amara; Zaw, May; Albritton, Claude F.; Bertrand, Sarah C.; Chaudhry, Shanzay S.; Cheema, Vzair A.; Do, Camilla; Do, Michael L.; Duong, Huyen M.; El-Desoky, Dalia H.; Green, Kelsey M.; Lee, Rhea N.; Thornton, Lauren A.; Vu, James M.; Zahra, Mah Noor; Stoner, Ty H.; Garlena, Rebecca A.; Jacobs-Sera, Deborah; Russell, Daniel A.

    2017-01-01

    We report here the complete genome sequences of four subcluster L3 mycobacteriophages newly isolated from soil samples, using Mycobacterium smegmatis mc²155 as the host. Comparative genomic analyses with four previously described subcluster L3 phages reveal strong nucleotide similarity and gene conservation, with several large insertions/deletions near their right genome ends. PMID:29122864

  13. First Complete Genome Sequence of Suakwa aphid-borne yellows virus from East Timor

    Science.gov (United States)

    Maina, Solomon; Edwards, Owain R.; de Almeida, Luis; Ximenes, Abel

    2016-01-01

    We present here the first complete genomic RNA sequence of the polerovirus Suakwa aphid-borne yellows virus (SABYV), from East Timor. The isolate sequenced came from a virus-infected pumpkin plant. The East Timorese genome had a nucleotide identity of 86.5% with the only other SABYV genome available, which is from Taiwan. PMID:27469955

  14. Complete Genome Sequence of Lactobacillus kefiranofaciens ZW3

    Science.gov (United States)

    Wang, Yanping; Wang, Jingrui; Ahmed, Zaheer; Bai, Xiaojia; Wang, Jinju

    2011-01-01

    Lactobacillus kefiranofaciens ZW3 was isolated in Tibet, China, from kefir grain, a traditional dairy product that is known to provide many health benefits to humans. Here, we present the genome features of L. kefiranofaciens ZW3 and the identification of a gene cluster related to the synthesis of exopolysaccharide, an important constituent of the Tibetan kefir. PMID:21705607

  15. Complete genome sequence of Lactobacillus kefiranofaciens ZW3.

    Science.gov (United States)

    Wang, Yanping; Wang, Jingrui; Ahmed, Zaheer; Bai, Xiaojia; Wang, Jinju

    2011-08-01

    Lactobacillus kefiranofaciens ZW3 was isolated in Tibet, China, from kefir grain, a traditional dairy product that is known to provide many health benefits to humans. Here, we present the genome features of L. kefiranofaciens ZW3 and the identification of a gene cluster related to the synthesis of exopolysaccharide, an important constituent of the Tibetan kefir.

  16. Complete Genome Sequence of Beijerinckia indica subsp. indica

    Science.gov (United States)

    Tamas, Ivica; Dedysh, Svetlana N.; Liesack, Werner; Stott, Matthew B.; Alam, Maqsudul; Murrell, J. Colin; Dunfield, Peter F.

    2010-01-01

    Beijerinckia indica subsp. indica is an aerobic, acidophilic, exopolysaccharide-producing, N2-fixing soil bacterium. It is a generalist chemoorganotroph that is phylogenetically closely related to facultative and obligate methanotrophs of the genera Methylocella and Methylocapsa. Here we report the full genome sequence of this bacterium. PMID:20601475

  17. Complete genome sequence of Beijerinckia indica subsp. indica.

    Science.gov (United States)

    Tamas, Ivica; Dedysh, Svetlana N; Liesack, Werner; Stott, Matthew B; Alam, Maqsudul; Murrell, J Colin; Dunfield, Peter F

    2010-09-01

    Beijerinckia indica subsp. indica is an aerobic, acidophilic, exopolysaccharide-producing, N(2)-fixing soil bacterium. It is a generalist chemoorganotroph that is phylogenetically closely related to facultative and obligate methanotrophs of the genera Methylocella and Methylocapsa. Here we report the full genome sequence of this bacterium.

  18. Complete genome sequence of Rhodospirillum rubrum type strain (S1).

    Science.gov (United States)

    Munk, A Christine; Copeland, Alex; Lucas, Susan; Lapidus, Alla; Del Rio, Tijana Glavina; Barry, Kerrie; Detter, John C; Hammon, Nancy; Israni, Sanjay; Pitluck, Sam; Brettin, Thomas; Bruce, David; Han, Cliff; Tapia, Roxanne; Gilna, Paul; Schmutz, Jeremy; Larimer, Frank; Land, Miriam; Kyrpides, Nikos C; Mavromatis, Konstantinos; Richardson, Paul; Rohde, Manfred; Göker, Markus; Klenk, Hans-Peter; Zhang, Yaoping; Roberts, Gary P; Reslewic, Susan; Schwartz, David C

    2011-07-01

    Rhodospirillum rubrum (Esmarch 1887) Molisch 1907 is the type species of the genus Rhodospirillum, which is the type genus of the family Rhodospirillaceae in the class Alphaproteobacteria. The species is of special interest because it is an anoxygenic phototroph that produces extracellular elemental sulfur (instead of oxygen) while harvesting light. It contains one of the most simple photosynthetic systems currently known, lacking light harvesting complex 2. Strain S1(T) can grow on carbon monoxide as sole energy source. With currently over 1,750 PubMed entries, R. rubrum is one of the most intensively studied microbial species, in particular for physiological and genetic studies. Next to R. centenum strain SW, the genome sequence of strain S1(T) is only the second genome of a member of the genus Rhodospirillum to be published, but the first type strain genome from the genus. The 4,352,825 bp long chromosome and 53,732 bp plasmid with a total of 3,850 protein-coding and 83 RNA genes were sequenced as part of the DOE Joint Genome Institute Program DOEM 2002.

  19. Complete Genome Sequence of Mycobacterium vaccae Type Strain ATCC 25954

    KAUST Repository

    Ho, Y. S.; Adroub, S. A.; Abadi, Maram; Al Alwan, B.; Alkhateeb, R.; Gao, G.; Ragab, A.; Ali, Shahjahan; van Soolingen, D.; Bitter, W.; Pain, Arnab; Abdallah, A. M.

    2012-01-01

    Mycobacterium vaccae is a rapidly growing, nontuberculous Mycobacterium species that is generally not considered a human pathogen and is of major pharmaceutical interest as an immunotherapeutic agent. We report here the annotated genome sequence of the M. vaccae type strain, ATCC 25954.

  20. Complete genome sequence of Nocardia brasiliensis HUJEG-1.

    Science.gov (United States)

    Vera-Cabrera, Lucio; Ortiz-Lopez, Rocio; Elizondo-Gonzalez, Ramiro; Perez-Maya, Antonio Ali; Ocampo-Candiani, Jorge

    2012-05-01

    In Mexico, actinomycetoma is mainly caused by Nocardia brasiliensis, a soil-inhabiting actinobacterium. Here, we report for the first time the draft genome of a strain isolated from a human case that has largely been used in in vitro and experimental models of actinomycetoma, N. brasiliensis HUJEG-1.

  1. Complete Genome Sequence of Mycobacterium vaccae Type Strain ATCC 25954

    KAUST Repository

    Ho, Y. S.

    2012-10-26

    Mycobacterium vaccae is a rapidly growing, nontuberculous Mycobacterium species that is generally not considered a human pathogen and is of major pharmaceutical interest as an immunotherapeutic agent. We report here the annotated genome sequence of the M. vaccae type strain, ATCC 25954.

  2. A complete mitochondrial genome of wheat (Triticum aestivum cv ...

    Indian Academy of Sciences (India)

    role in the development and reproduction of the plant. They occupy a specific ... for biosynthetic pathways relative to their free-living cousins. (Gray et al. 1999; Itoh ... A mitochondrial genome BAC library was constructed following a previously ...

  3. Complete Genome Sequence of Mycoplasma hominis Strain Sprott (ATCC 33131), Isolated from a Patient with Nongonococcal Urethritis.

    Science.gov (United States)

    Calcutt, Michael J; Foecking, Mark F

    2015-07-09

    Presented here is the complete and annotated genome sequence of Mycoplasma hominis Sprott (ATCC 33131). The chromosome comprises 695,214 bp, which is approximately 30 kb larger than the syntenic genome of M. hominis PG21(T). Tetracycline resistance of strain Sprott is most probably conferred by the tetM determinant, harbored on a mosaic transposon-like structure. Copyright © 2015 Calcutt and Foecking.

  4. Complete Genome Sequence of Bradyrhizobium sp. S23321: Insights into Symbiosis Evolution in Soil Oligotrophs

    Science.gov (United States)

    Okubo, Takashi; Tsukui, Takahiro; Maita, Hiroko; Okamoto, Shinobu; Oshima, Kenshiro; Fujisawa, Takatomo; Saito, Akihiro; Futamata, Hiroyuki; Hattori, Reiko; Shimomura, Yumi; Haruta, Shin; Morimoto, Sho; Wang, Yong; Sakai, Yoriko; Hattori, Masahira; Aizawa, Shin-ichi; Nagashima, Kenji V. P.; Masuda, Sachiko; Hattori, Tsutomu; Yamashita, Akifumi; Bao, Zhihua; Hayatsu, Masahito; Kajiya-Kanegae, Hiromi; Yoshinaga, Ikuo; Sakamoto, Kazunori; Toyota, Koki; Nakao, Mitsuteru; Kohara, Mitsuyo; Anda, Mizue; Niwa, Rieko; Jung-Hwan, Park; Sameshima-Saito, Reiko; Tokuda, Shin-ichi; Yamamoto, Sumiko; Yamamoto, Syuji; Yokoyama, Tadashi; Akutsu, Tomoko; Nakamura, Yasukazu; Nakahira-Yanaka, Yuka; Hoshino, Yuko Takada; Hirakawa, Hideki; Mitsui, Hisayuki; Terasawa, Kimihiro; Itakura, Manabu; Sato, Shusei; Ikeda-Ohtsubo, Wakako; Sakakura, Natsuko; Kaminuma, Eli; Minamisawa, Kiwamu

    2012-01-01

    Bradyrhizobium sp. S23321 is an oligotrophic bacterium isolated from paddy field soil. Although S23321 is phylogenetically close to Bradyrhizobium japonicum USDA110, a legume symbiont, it is unable to induce root nodules in siratro, a legume often used for testing Nod factor-dependent nodulation. The genome of S23321 is a single circular chromosome, 7,231,841 bp in length, with an average GC content of 64.3%. The genome contains 6,898 potential protein-encoding genes, one set of rRNA genes, and 45 tRNA genes. Comparison of the genome structure between S23321 and USDA110 showed strong colinearity; however, the symbiosis islands present in USDA110 were absent in S23321, whose genome lacked a chaperonin gene cluster (groELS3) for symbiosis regulation found in USDA110. A comparison of sequences around the tRNA-Val gene strongly suggested that S23321 contains an ancestral-type genome that precedes the acquisition of a symbiosis island by horizontal gene transfer. Although S23321 contains a nif (nitrogen fixation) gene cluster, the organization, homology, and phylogeny of the genes in this cluster were more similar to those of photosynthetic bradyrhizobia ORS278 and BTAi1 than to those on the symbiosis island of USDA110. In addition, we found genes encoding a complete photosynthetic system, many ABC transporters for amino acids and oligopeptides, two types (polar and lateral) of flagella, multiple respiratory chains, and a system for lignin monomer catabolism in the S23321 genome. These features suggest that S23321 is able to adapt to a wide range of environments, probably including low-nutrient conditions, with multiple survival strategies in soil and rhizosphere. PMID:22452844

  5. The complete chloroplast genome sequence of Aster spathulifolius (Asteraceae); genomic features and relationship with Asteraceae.

    Science.gov (United States)

    Choi, Kyoung Su; Park, SeonJoo

    2015-11-10

    Aster spathulifolius, a member of the Asteraceae family, is distributed along the coast of Japan and Korea. This plant is used for medicinal and ornamental purposes. The complete chloroplast (cp) genome of A. spathulifolius consists of 149,473 bp that include a pair of inverted repeats of 24,751 bp separated by a large single copy region of 81,998 bp and a small single copy region of 17,973 bp. The chloroplast genome contains 78 coding genes, four rRNA genes and 29 tRNA genes. When compared to other cpDNA sequences of Asteraceae, A. spathulifolius showed the closest relationship with Jacobaea vulgaris, and its atpB gene was found to be a pseudogene, unlike that of J. vulgaris. Furthermore, evaluation of the gene compositions of J. vulgaris, Helianthus annuus, Guizotia abyssinica and A. spathulifolius revealed that a 13.6-kb region from ndhF to rps15 was inverted, unlike in Lactuca of Asteraceae. Comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates with J. vulgaris revealed that synonymous rates were highest for genes related to the small subunit of the ribosome (0.1558), while nonsynonymous rates were highest for genes related to ATP synthase (0.0118). These findings revealed that substitution has occurred at similar rates in most genes, and the substitution rates suggested that most genes are under purifying selection. Copyright © 2015 Elsevier B.V. All rights reserved.
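    The region lengths reported in this abstract can be cross-checked against the stated total, on the assumption that the genome has the usual quadripartite layout (one large single copy region, one small single copy region, and two copies of the inverted repeat). The following is a minimal sketch of that check; the function and variable names are illustrative and do not come from the paper.

```python
# Region lengths in bp, taken from the abstract above.
LSC = 81_998             # large single copy region
SSC = 17_973             # small single copy region
IR = 24_751              # one copy of the inverted repeat
REPORTED_TOTAL = 149_473  # total genome length stated in the abstract


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Length of a genome made of LSC + SSC + two inverted-repeat copies."""
    return lsc + ssc + 2 * ir


computed_total = quadripartite_length(LSC, SSC, IR)
matches = computed_total == REPORTED_TOTAL
print(f"computed {computed_total} bp, reported {REPORTED_TOTAL} bp, match: {matches}")
```

    Under that assumption the four regions add up exactly to the reported 149,473 bp.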

  6. Using ezRAD to reconstruct the complete mitochondrial genome of Porites fontanesii (Cnidaria: Scleractinia)

    KAUST Repository

    Terraneo, Tullia Isotta

    2018-02-09

    Corals in the genus Porites are among the major framework builders of reef structures worldwide, yet the genus has been challenging to study due to a lack of informative molecular markers. Here, we used ezRAD sequencing to reconstruct the complete mitochondrial genome of Porites fontanesii (GenBank accession number MG754069), a widespread coral species endemic to the Red Sea and Gulf of Aden. The gene arrangement of P. fontanesii did not differ from other Scleractinia and consisted of 18,658 bp, organized in 13 protein-coding genes, 2 rRNA genes, and 2 tRNA genes. This mitochondrial genome contributes essential data to work towards a better understanding of evolutionary relationships within Porites.

  7. Complete mitochondrial genome of the South Polar Skua Stercorarius maccormicki (Charadriiformes, Stercorariidae) in Antarctica.

    Science.gov (United States)

    Han, Yeong-Deok; Baek, Ye-Seul; Kim, Jeong-Hoon; Choi, Han-Gu; Kim, Sanghee

    2016-05-01

    The South Polar Skua, a gull-like seabird, is one of the most fascinating Antarctic seabirds; it lays two eggs at sites free of snow and ice and predominantly hunts pelagic fish and penguins. Blood samples of the South Polar Skua Stercorarius maccormicki were collected during the summer activity near King Sejong station in Antarctica. The complete mitochondrial DNA sequence of S. maccormicki was 16,669 bp, showing the conserved genome structure and orientation found in other avian species. The control region of S. maccormicki was 93 bp and 80 bp shorter than those of Chroicocephalus saundersi and Synthliboramphus antiquus, respectively. Interestingly, there is a (CAACAAACAA)6 repeat sequence in the control region. Our results on the S. maccormicki mt genome, including the repeat sequence, may provide useful genetic information for phylogenetic and phylogeographic histories of the southern skua complex.

  8. Using ezRAD to reconstruct the complete mitochondrial genome of Porites fontanesii (Cnidaria: Scleractinia)

    KAUST Repository

    Terraneo, Tullia Isotta; Arrigoni, Roberto; Benzoni, Francesca; Forsman, Zac H.; Berumen, Michael L.

    2018-01-01

    Corals in the genus Porites are among the major framework builders of reef structures worldwide, yet the genus has been challenging to study due to a lack of informative molecular markers. Here, we used ezRAD sequencing to reconstruct the complete mitochondrial genome of Porites fontanesii (GenBank accession number MG754069), a widespread coral species endemic to the Red Sea and Gulf of Aden. The gene arrangement of P. fontanesii did not differ from other Scleractinia and consisted of 18,658 bp, organized in 13 protein-coding genes, 2 rRNA genes, and 2 tRNA genes. This mitochondrial genome contributes essential data to work towards a better understanding of evolutionary relationships within Porites.

  9. Complete mitochondrial genome of the blue shark Prionace glauca (Elasmobranchii: Carcharhiniformes).

    Science.gov (United States)

    Chen, Xiao; Xiang, Dan; Ai, Weiming; Shi, Xiaofang

    2015-04-01

    In this study, we present for the first time the complete mitochondrial genome of the blue shark Prionace glauca, a pelagic and oceanic species. It is 16,705 bp in length and contains 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes and 1 putative control region. The overall base composition is 31.6% A, 24.4% C, 13.1% G and 30.9% T. Overlaps and short inter-genic spaces are located in the genome. The tRNA-Ser2 loses the dihydrouridine arm and cannot be folded into the typical clover-leaf secondary structure. Two start codons (GTG and ATG) with two stop codons (TAG and TAA) or with one incomplete stop codon (T) are found in the 13 protein-coding genes. The control region contains high A + T (69.9%) and low G (12.0%).
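    The overall base composition quoted in this abstract can be summarized as A+T and G+C content. The sketch below only re-adds the four published percentages; the variable names are illustrative, and the derived totals are arithmetic on the abstract's figures rather than values reported by the authors.

```python
# Overall base composition (percent) as reported in the abstract above.
composition = {"A": 31.6, "C": 24.4, "G": 13.1, "T": 30.9}

# The four percentages should account for the whole genome.
total_percent = sum(composition.values())

# A+T and G+C content derived from the same four numbers.
at_content = composition["A"] + composition["T"]
gc_content = composition["G"] + composition["C"]

print(f"total = {total_percent:.1f}%")
print(f"A+T   = {at_content:.1f}%")
print(f"G+C   = {gc_content:.1f}%")
```

    The percentages sum to 100%, and the genome-wide A+T content they imply (62.5%) is lower than the 69.9% A+T reported for the control region alone.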

  10. Intraspecific phylogenetic analysis of Siberian woolly mammoths using complete mitochondrial genomes

    DEFF Research Database (Denmark)

    Gilbert, M Thomas P; Drautz, Daniela I; Lesk, Arthur M

    2008-01-01

    We report five new complete mitochondrial DNA (mtDNA) genomes of Siberian woolly mammoth (Mammuthus primigenius), sequenced with up to 73-fold coverage from DNA extracted from hair shaft material. Three of the sequences present the first complete mtDNA genomes of mammoth clade II. Analysis...... to indicate any important functional difference between genomes belonging to the two clades, suggesting that the loss of clade II more likely is due to genetic drift than a selective sweep....

  11. Complete Genome Sequence of Genotype VI Newcastle Disease Viruses Isolated from Pigeons in Pakistan

    OpenAIRE

    Wajid, Abdul; Rehmani, Shafqat Fatima; Sharma, Poonam; Goraichuk, Iryna V.; Dimitrov, Kiril M.; Afonso, Claudio L.

    2016-01-01

    Two complete genome sequences of Newcastle disease virus (NDV) are described here. Virulent isolates pigeon/Pakistan/Lahore/21A/2015 and pigeon/Pakistan/Lahore/25A/2015 were obtained from racing pigeons sampled in the Pakistani province of Punjab during 2015. Phylogenetic analysis of the fusion protein genes and complete genomes classified the isolates as members of NDV class II, genotype VI.

  12. Complete Genome Sequence of a Novel Aquareovirus That Infects the Endangered Fountain Darter, Etheostoma fonticola.

    Science.gov (United States)

    Iwanowicz, Luke R; Iwanowicz, Deborah D; Adams, Cynthia R; Lewis, Teresa D; Brandt, Thomas M; Cornman, Robert S; Sanders, Lakyn

    2016-12-22

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1). Copyright © 2016 Iwanowicz et al.

  13. Complete Genome Sequence of a Novel Aquareovirus That Infects the Endangered Fountain Darter, Etheostoma fonticola

    OpenAIRE

    Iwanowicz, Luke R.; Iwanowicz, Deborah D.; Adams, Cynthia R.; Lewis, Teresa D.; Brandt, Thomas M.; Cornman, Robert S.; Sanders, Lakyn

    2016-01-01

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1).

  14. Complete genome sequence of a novel aquareovirus that infects the endangered fountain darter, Etheostoma fonticola

    Science.gov (United States)

    Iwanowicz, Luke R.; Iwanowicz, Deborah; Adams, Cynthia; Lewis, Teresa D.; Brandt, Thomas M.; Cornman, Robert S.; Sanders, Lakyn R.

    2016-01-01

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1).

  15. The Complete Genome of a New Betabaculovirus from Clostera anastomosis.

    Directory of Open Access Journals (Sweden)

    Feifei Yin

    Clostera anastomosis (Lepidoptera: Notodontidae) is a defoliating forest insect pest. Clostera anastomosis granulovirus-B (ClasGV-B), belonging to the genus Betabaculovirus of the family Baculoviridae, has been used for biological control of the pest. Here we reported the full genome sequence of ClasGV-B and compared it to other previously sequenced baculoviruses. The circular double-stranded DNA genome is 107,439 bp in length, with a G+C content of 37.8%, and contains 123 open reading frames (ORFs) representing 93% of the genome. ClasGV-B contains 37 baculovirus core genes, 25 lepidopteran baculovirus specific genes, 19 betabaculovirus specific genes, 39 other genes with homologues in baculoviruses and 3 ORFs unique to ClasGV-B. Hrs appear to be absent from the ClasGV-B genome; however, two non-hr repeats were found. A phylogenetic tree based on 37 core genes from 73 baculovirus genomes placed ClasGV-B in clade b of the betabaculoviruses, most closely related to Erinnyis ello GV (ErelGV). The gene arrangement of ClasGV-B also shared the strongest collinearity with ErelGV but differed from Clostera anachoreta GV (ClanGV), Clostera anastomosis GV-A (ClasGV-A, previously also called CaLGV) and Epinotia aporema GV (EpapGV) by a 20 kb inversion. The ClasGV-B genome contains three copies of the polyhedron envelope protein gene (pep), and a phylogenetic tree divides the PEPs of betabaculoviruses into three major clades: PEP-1, PEP-2 and PEP/P10. ClasGV-B also contains three homologues of P10, which all harbor an N-terminal coiled-coil domain and a C-terminal basic sequence. ClasGV-B encodes three fibroblast growth factor (FGF) homologues, which are conserved in all sequenced betabaculoviruses. Phylogenetic analysis placed these three FGFs into different groups and suggested that the FGFs evolved at the early stage of the betabaculovirus expansion. ClasGV-B is different from previously reported ClasGV-A and ClanGV isolated from Notodontidae in sequence and gene

  16. Structural Genomics of Minimal Organisms: Pipeline and Results

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Sung-Hou; Shin, Dong-Hae; Kim, Rosalind; Adams, Paul; Chandonia, John-Marc

    2007-09-14

    The initial objective of the Berkeley Structural Genomics Center was to obtain near-complete three-dimensional (3D) structural information for all soluble proteins of two minimal organisms, the closely related pathogens Mycoplasma genitalium and M. pneumoniae. The former has fewer than 500 genes and the latter has fewer than 700 genes. A semiautomated structural genomics pipeline was set up covering target selection, cloning, expression, purification, and ultimately structure determination. At the time of this writing, structural information for more than 93% of all soluble proteins of M. genitalium is available. This chapter summarizes the approaches taken by the authors' center.

  17. Complete mitochondrial genome of the tiger shark Galeocerdo cuvier (Carcharhiniformes: Carcharhinidae).

    Science.gov (United States)

    Chen, Xiao; Yu, Junqi; Zhang, Saile; Ding, Wenyong; Xiang, Dan

    2014-12-01

    The tiger shark Galeocerdo cuvier is the only member of the genus Galeocerdo. The complete mitochondrial genome of G. cuvier is presented for the first time in this study. The gene composition and arrangement in the mitogenome of G. cuvier are identical to those of most animal mitogenomes. There are 22 bp of short noncoding sequences and 44 bp of overlaps in the mitogenome. The overall base composition is 31.8% A, 23.9% C, 13.0% G and 31.3% T. The dihydrouridine arm of tRNA-Ser2 was replaced by a simple loop, and the other tRNAs could be folded into the typical cloverleaf structure.

  18. The complete genome sequence of Clostridium indolis DSM 755(T.).

    Science.gov (United States)

    Biddle, Amy S; Leschine, Susan; Huntemann, Marcel; Han, James; Chen, Amy; Kyrpides, Nikos; Markowitz, Victor; Palaniappan, Krishna; Ivanova, Natalia; Mikhailova, Natalia; Ovchinnikova, Galina; Schaumberg, Andrew; Pati, Amrita; Stamatis, Dimitrios; Reddy, Tatiparthi; Lobos, Elizabeth; Goodwin, Lynne; Nordberg, Henrik P; Cantor, Michael N; Hua, Susan X; Woyke, Tanja; Blanchard, Jeffrey L

    2014-06-15

    Clostridium indolis DSM 755(T) is a bacterium commonly found in soils and the feces of birds and mammals. Despite its prevalence, little is known about the ecology or physiology of this species. However, close relatives, C. saccharolyticum and C. hathewayi, have demonstrated interesting metabolic potentials related to plant degradation and human health. The genome of C. indolis DSM 755(T) reveals an abundance of genes in functional groups associated with the transport and utilization of carbohydrates, as well as citrate, lactate, and aromatics. Ecologically relevant gene clusters related to nitrogen fixation and a unique type of bacterial microcompartment, the CoAT BMC, are also detected. Our genome analysis suggests hypotheses to be tested in future culture based work to better understand the physiology of this poorly described species.

  19. Complete Genome Sequence of Mycobacterium xenopi Type Strain RIVM700367

    KAUST Repository

    Abdallah, A. M.; Rashid, M.; Adroub, S. A.; Elabdalaoui, H.; Ali, Shahjahan; van Soolingen, D.; Bitter, W.; Pain, Arnab

    2012-01-01

    Mycobacterium xenopi is a slow-growing, thermophilic, water-related Mycobacterium species. Like other nontuberculous mycobacteria, M. xenopi more commonly infects humans with altered immune function, such as chronic obstructive pulmonary disease patients. It is considered clinically relevant in a significant proportion of the patients from whom it is isolated. We report here the whole genome sequence of M. xenopi type strain RIVM700367.

  20. Complete Genome Sequence of Mycobacterium xenopi Type Strain RIVM700367

    KAUST Repository

    Abdallah, A. M.

    2012-05-24

    Mycobacterium xenopi is a slow-growing, thermophilic, water-related Mycobacterium species. Like other nontuberculous mycobacteria, M. xenopi more commonly infects humans with altered immune function, such as chronic obstructive pulmonary disease patients. It is considered clinically relevant in a significant proportion of the patients from whom it is isolated. We report here the whole genome sequence of M. xenopi type strain RIVM700367.

  1. A contig-based strategy for the genome-wide discovery of microRNAs without complete genome resources.

    Directory of Open Access Journals (Sweden)

    Jun-Zhi Wen

    MicroRNAs (miRNAs) are important regulators of many cellular processes and exist in a wide range of eukaryotes. High-throughput sequencing is a mainstream method of miRNA identification through which it is possible to obtain the complete small RNA profile of an organism. Currently, most approaches to miRNA identification rely on a reference genome for the prediction of hairpin structures. However, many species of economic and phylogenetic importance are non-model organisms without complete genome sequences, and this limits miRNA discovery. Here, to overcome this limitation, we have developed a contig-based miRNA identification strategy. We applied this method to a triploid species of edible banana (GCTCV-119, Musa spp. AAA group) and identified 180 pre-miRNAs and 314 mature miRNAs, which is three times more than were predicted by the available dataset-based methods (represented by EST+GSS). Based on the recently published miRNA data set of Musa acuminata, the recall rate and precision of our strategy are estimated to be 70.6% and 92.2%, respectively, significantly better than those of the EST+GSS-based strategy (10.2% and 50.0%, respectively). Our novel, efficient and cost-effective strategy facilitates the study of the functional and evolutionary role of miRNAs, as well as miRNA-based molecular breeding, in non-model species of economic or evolutionary interest.

  2. Insights into structural variations and genome rearrangements in prokaryotic genomes.

    Science.gov (United States)

    Periwal, Vinita; Scaria, Vinod

    2015-01-01

    Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most SVs, such as inversions, deletions and translocations, have been studied largely in the context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have a profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into the organization of bacterial genomes at a much better resolution. SVs can confer changes in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much finer resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss their potential applicability in the emerging fields of synthetic biology and genome engineering, where targeted SVs could serve to create sophisticated and accurate genome editing. © The Author 2014. Published by Oxford University Press.

  3. Complete chloroplast genome of Trachelium caeruleum: extensive rearrangements are associated with repeats and tRNAs

    Energy Technology Data Exchange (ETDEWEB)

    Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.; Jansen, Robert K.

    2006-01-09

    Chloroplast genome structure, gene order and content are highly conserved in land plants. We sequenced the complete chloroplast genome sequence of Trachelium caeruleum (Campanulaceae), a member of an angiosperm family known for highly rearranged chloroplast genomes. The total genome size is 162,321 bp with an IR of 27,273 bp, LSC of 100,113 bp and SSC of 7,661 bp. The genome encodes 115 unique genes, with 19 duplicated in the IR, a tRNA (trnI-CAU) duplicated once in the LSC and a protein coding gene (psbJ) duplicated twice, for a total of 137 genes. Four genes (ycf15, rpl23, infA and accD) are truncated and likely nonfunctional; three others (clpP, ycf1 and ycf2) are so highly diverged that they may now be pseudogenes. The most conspicuous feature of the Trachelium genome is the presence of eighteen internally unrearranged blocks of genes that have been inverted or relocated within the genome, relative to the typical gene order of most angiosperm chloroplast genomes. Recombination between repeats or tRNAs has been suggested as two means of chloroplast genome rearrangements. We compared the relative number of repeats in Trachelium to eight other angiosperm chloroplast genomes, and evaluated the location of repeats and tRNAs in relation to rearrangements. Trachelium has the highest number and largest repeats, which are concentrated near inversion endpoints or other rearrangements. tRNAs occur at many but not all inversion endpoints. There is likely no single mechanism responsible for the remarkable number of alterations in this genome, but both repeats and tRNAs are clearly associated with these rearrangements. Land plant chloroplast genomes are highly conserved in structure, gene order and content. The chloroplast genomes of ferns, the gymnosperm Ginkgo, and most angiosperms are nearly collinear, reflecting the gene order in lineages that diverged from lycopsids and the ancestral chloroplast gene order over 350 million years ago (Raubeson and Jansen, 1992). Although earlier mapping studies

  4. Genomics technologies to study structural variations in the grapevine genome

    Directory of Open Access Journals (Sweden)

    Cardone Maria Francesca

    2016-01-01

    Grapevine is one of the most important crop plants in the world. Recently there has been a great expansion of genomic resources for the grapevine genome, supporting increased efforts in molecular breeding. Current cultivars display a great level of inter-specific differentiation that needs to be investigated to reach a comprehensive understanding of the genetic basis of phenotypic differences, and to find the responsible genes selected by cross breeding programs. While there have been significant advances in resolving the pattern and nature of single nucleotide polymorphisms (SNPs) in plant genomes, few data are available on copy number variation (CNV). Furthermore, association between structural variations and phenotypes has been described in only a few cases. We combined high-throughput biotechnologies and bioinformatics tools to reveal the first inter-varietal atlas of structural variation (SV) for the grapevine genome. We sequenced and compared four table grape cultivars with the Pinot noir inbred line PN40024 genome as the reference. We detected roughly 8% of the grapevine genome affected by genomic variations. Taking into account the phenotypic differences among the studied varieties, we compared SVs among them and the reference, and then performed an in-depth analysis of the gene content of polymorphic regions. This allowed us to identify genes showing differences in copy number as putative functional candidates for important traits in grapevine cultivation.

  5. Evolution and phylogeny of the mud shrimps (Crustacea: Decapoda) revealed from complete mitochondrial genomes.

    Science.gov (United States)

    Lin, Feng-Jiau; Liu, Yuan; Sha, Zhongli; Tsang, Ling Ming; Chu, Ka Hou; Chan, Tin-Yam; Liu, Ruiyu; Cui, Zhaoxia

    2012-11-16

    The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophilus) share a unique gene order specific to their infraorders, and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genomes from more taxa, in particular the reptant infraorders Polychelida and Glypheidea, is required in further analysis.
Phylogenetic analyses on the mt genome sequences and the distinct gene orders provide further

  6. Complete genome sequence of Shigella flexneri 5b and comparison with Shigella flexneri 2a

    Directory of Open Access Journals (Sweden)

    Xue Ying

    2006-07-01

    Background. Shigella bacteria cause dysentery, which remains a significant threat to public health. Shigella flexneri is the most common species in both developing and developed countries. Five Shigella genomes have been sequenced, revealing dynamic and diverse features. To investigate the intra-species diversity of S. flexneri genomes further, we have sequenced the complete genome of S. flexneri 5b strain 8401 (abbreviated Sf8401) and compared it with S. flexneri 2a (Sf301). Results. The Sf8401 chromosome is 4.5 Mb in size, a little smaller than that of Sf301, mainly because the former lacks the SHI-1 pathogenicity island (PAI). Compared with Sf301, there are 6 inversions and one translocation in Sf8401, which are probably mediated by insertion sequences (IS). There are clear differences in the known PAIs between these two genomes. The bacteriophage SfV segment remaining in SHI-O of Sf8401 is clearly larger than the remnants of bacteriophage SfII in Sf301. SHI-1 is absent from Sf8401 but a specific related protein is found next to the pheV locus. SHI-2 is involved in one intra-replichore inversion near the origin of replication, which may change the expression of iut/iuc genes. Moreover, genes related to the glycine-betaine biosynthesis pathway are present only in Sf8401 among the known Shigella genomes. Conclusion. Our data show that the two S. flexneri genomes are very similar, which suggests a high level of structural and functional conservation between the two serotypes. The differences reflect different selection pressures during evolution. The ancestor of S. flexneri probably acquired SHI-1 and SHI-2 before SHI-O was integrated and the serotypes diverged. SHI-1 was subsequently deleted from the S. flexneri 5b genome by recombination, but stabilized in the S. flexneri 2a genome. These events may have contributed to the differences in pathogenicity and epidemicity between the two serotypes of S. flexneri.

  7. Genome sequencing and analysis of the first complete genome of Lactobacillus kunkeei strain MP2, an Apis mellifera gut isolate

    Directory of Open Access Journals (Sweden)

    Freddy Asenjo

    2016-04-01

    Background. The honey bee (Apis mellifera) is the most important pollinator in agriculture worldwide. However, the number of honey bees has fallen significantly since 2006, becoming a huge ecological problem nowadays. The principal cause is CCD, or Colony Collapse Disorder, characterized by the seemingly spontaneous abandonment of hives by their workers. One of the characteristics of CCD in honey bees is the alteration of the bacterial communities in their gastrointestinal tract, mainly due to the decrease of Firmicutes populations, such as the Lactobacilli. At this time, the causes of these alterations remain unknown. We recently isolated a strain of Lactobacillus kunkeei (L. kunkeei strain MP2) from the gut of Chilean honey bees. L. kunkeei is one of the most commonly isolated bacteria from the honey bee gut and is highly versatile in different ecological niches. In this study, we aimed to elucidate in detail the L. kunkeei genetic background and perform a comparative genome analysis with other Lactobacillus species. Methods. L. kunkeei MP2 was originally isolated from the guts of Chilean A. mellifera individuals. Genome sequencing was done using Pacific Biosciences single-molecule real-time sequencing technology. De novo assembly was performed using Celera assembler. The genome was annotated using Prokka, and functional information was added using the EggNOG 3.1 database. In addition, genomic islands were predicted using IslandViewer, and pro-phage sequences using PHAST. Comparisons of L. kunkeei MP2 with other L. kunkeei and Lactobacillus strains were done using Roary. Results. The complete genome of L. kunkeei MP2 comprises one circular chromosome of 1,614,522 nt with a GC content of 36.9%. Pangenome analysis with 16 L. kunkeei strains identified 113 unique genes, most of them related to phage insertions. A large and unique region of L. kunkeei MP2 genome contains several genes that encode for phage structural protein and

  8. Complete genome sequence of Sanguibacter keddieii type strain (ST-74T)

    Energy Technology Data Exchange (ETDEWEB)

    Ivanova, Natalia; Sikorski, Johannes; Sims, David; Brettin, Thomas; Detter, John C.; Han, Cliff; Lapidus, Alla; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Chen, Feng; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Pati, Amrita; Mavromatis, Konstantinos; Chen, Amy; Palaniappan, Krishna; D' haeseleer, Patrik; Chain, Patrick; Bristow, Jim; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Goker, Markus; Pukall, Rudiger; Klenk, Hans-Peter; Kyrpides, Nikos

    2009-05-20

    Sanguibacter keddieii is the type species of the genus Sanguibacter, the only described genus within the family of Sanguibacteraceae. Phylogenetically, this family is located in the neighbourhood of the genus Oerskovia and the family Cellulomonadaceae within the actinobacterial suborder Micrococcineae. The strain described in this report was isolated from blood of apparently healthy cows. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of the family Sanguibacteraceae, and the 4,253,413 bp long single replicon genome with its 3735 protein-coding and 70 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  9. Complete genome sequence of Calditerrivibrio nitroreducens type strain (Yu37-1T)

    Energy Technology Data Exchange (ETDEWEB)

    Pitluck, Sam [Joint Genome Institute, Walnut Creek, California]; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Zeytun, Ahmet [Los Alamos National Laboratory (LANL)]; Lapidus, Alla L. [Joint Genome Institute, Walnut Creek, California]; Nolan, Matt [Joint Genome Institute, Walnut Creek, California]; Lucas, Susan [Joint Genome Institute, Walnut Creek, California]; Hammon, Nancy [Joint Genome Institute, Walnut Creek, California]; Deshpande, Shweta [Joint Genome Institute, Walnut Creek, California]; Cheng, Jan-Fang [Joint Genome Institute, Walnut Creek, California]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Liolios, Konstantinos [Joint Genome Institute, Walnut Creek, California]; Pagani, Ioanna [Joint Genome Institute, Walnut Creek, California]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [Joint Genome Institute, Walnut Creek, California]; Palaniappan, Krishna [Joint Genome Institute, Walnut Creek, California]; Hauser, Loren John [ORNL]; Chang, Yun-Juan [ORNL]; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL)]; Detter, J. Chris [Joint Genome Institute, Walnut Creek, California]; Brambilla, Evelyne-Marie [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Ngatchou, Olivier Duplex [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Spring, Stefan [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Woyke, Tanja [Joint Genome Institute, Walnut Creek, California]; Bristow, James [Joint Genome Institute, Walnut Creek, California]; Eisen, Jonathan [Joint Genome Institute, Walnut Creek, California]; Markowitz, Victor [Joint Genome Institute, Walnut Creek, California]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [Joint Genome Institute, Walnut Creek, California]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Land, Miriam L [ORNL]

    2011-01-01

    Calditerrivibrio nitroreducens Iino et al. 2008 is the type species of the genus Calditerrivibrio. The species is of interest because of its important role in the nitrate cycle as nitrate reducer and for its isolated phylogenetic position in the Tree of Life. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the third complete genome sequence of a member of the family Deferribacteraceae. The 2,216,552 bp long genome with its 2,128 protein-coding and 50 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  10. The Complete Chloroplast Genome Sequences of the Medicinal Plant Forsythia suspensa (Oleaceae)

    Directory of Open Access Journals (Sweden)

    Wenbin Wang

    2017-10-01

    Forsythia suspensa is an important medicinal plant traditionally applied for the treatment of inflammation, pyrexia, gonorrhea, diabetes, and so on. However, there is limited sequence and genomic information available for F. suspensa. Here, we produced the complete chloroplast genome of F. suspensa using Illumina sequencing technology. F. suspensa is the first sequenced member within the genus Forsythia (Oleaceae). The gene order and organization of the chloroplast genome of F. suspensa are similar to other Oleaceae chloroplast genomes. The F. suspensa chloroplast genome is 156,404 bp in length and exhibits a conserved quadripartite structure with a large single-copy (LSC; 87,159 bp) region and a small single-copy (SSC; 17,811 bp) region interspersed between inverted repeat (IRa/b; 25,717 bp) regions. A total of 114 unique genes were annotated, including 80 protein-coding genes, 30 tRNA, and four rRNA. The low GC content (37.8%) and codon usage bias for A- or T-ending codons may largely affect gene codon usage. Sequence analysis identified a total of 26 forward repeats, 23 palindrome repeats with lengths >30 bp (identity > 90%), and 54 simple sequence repeats (SSRs) with an average rate of 0.35 SSRs/kb. We predicted 52 RNA editing sites in the chloroplast of F. suspensa, all for C-to-U transitions. IR expansion or contraction and the divergent regions were analyzed among several species including the reported F. suspensa in this study. Phylogenetic analysis based on whole-plastome revealed that F. suspensa, as a member of the Oleaceae family, diverged relatively early from Lamiales. This study will contribute to strengthening medicinal resource conservation, molecular phylogenetic, and genetic engineering research investigations of this species.
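    The region lengths quoted in this abstract can be cross-checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes the reported IR length (25,717 bp) refers to a single copy, so that the plastome total is LSC + SSC + 2 x IR.

```python
# Values as reported in the abstract above (all lengths in bp).
LSC = 87_159             # large single-copy region
SSC = 17_811             # small single-copy region
IR = 25_717              # assumed to be the length of ONE inverted repeat copy
TOTAL_REPORTED = 156_404
SSR_COUNT = 54


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total plastome length: LSC + SSC + two inverted repeat copies (IRa, IRb)."""
    return lsc + ssc + 2 * ir


def ssr_density_per_kb(ssr_count: int, genome_bp: int) -> float:
    """Average number of simple sequence repeats per kilobase."""
    return ssr_count / (genome_bp / 1000)


total = quadripartite_length(LSC, SSC, IR)
print(total == TOTAL_REPORTED)                          # regions add up to the reported length
print(round(ssr_density_per_kb(SSR_COUNT, total), 2))   # compare with the reported 0.35 SSRs/kb
```

    Under that assumption the four regions sum exactly to the reported 156,404 bp, and 54 SSRs over that length rounds to the reported 0.35 SSRs/kb.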

  11. The Complete Sequence of the Mitochondrial Genome of the Chambered Nautilus (Mollusca: Cephalopoda)

    Energy Technology Data Exchange (ETDEWEB)

    Boore, Jeffrey L.

    2005-12-01

    Background: Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although complete mitochondrial genome sequences have been reported for more than 600 animals, the taxonomic sampling is highly biased toward vertebrates and arthropods, leaving much of the diversity yet uncharacterized. Results: The mitochondrial genome of a cephalopod mollusk, the Chambered Nautilus, is 16,258 nts in length and 59.5 percent A+T, both values that are typical of animal mitochondrial genomes. It contains the 37 genes that are typical for animal mtDNAs, with 15 on one DNA strand and 22 on the other. The arrangement of these genes can be derived from that of the distantly related Katharina tunicata (Mollusca: Polyplacophora) by a switch in position of two large blocks of genes and transpositions of four tRNA genes. There is strong skew in the distribution of nucleotides between the two strands. There are an unusual number of non-coding regions and their function, if any, is not known; however, several of these demark abrupt shifts in nucleotide skew, suggesting that they may play roles in transcription and/or replication. One of the non-coding regions contains multiple repeats of a tRNA-like sequence. Some of the tRNA genes appear to overlap on the same strand, but this could be resolved if the polycistron were cleaved at the beginning of the downstream gene, followed by polyadenylation of the product of the upstream gene to form a fully paired structure. Conclusions: Nautilus sp. mtDNA contains an expected gene content that has experienced few rearrangements since the evolutionary split between cephalopods and polyplacophorans. It contains an unusual number of non-coding regions, especially considering that these otherwise often are generated by the same processes that produce gene rearrangements. This appears to be yet another case where polyadenylation of mitochondrial tRNAs restores

  12. ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes.

    Science.gov (United States)

    Otto, Thomas Dan; Catanho, Marcos; Tristão, Cristian; Bezerra, Márcia; Fernandes, Renan Mathias; Elias, Guilherme Steinberger; Scaglia, Alexandre Capeletto; Bovermann, Bill; Berstis, Viktors; Lifschitz, Sergio; de Miranda, Antonio Basílio; Degrave, Wim

    2010-03-01

    Many analyses in modern biological research are based on comparisons between biological sequences, resulting in functional, evolutionary and structural inferences. When large numbers of sequences are compared, heuristics are often used resulting in a certain lack of accuracy. In order to improve and validate results of such comparisons, we have performed radical all-against-all comparisons of 4 million protein sequences belonging to the RefSeq database, using an implementation of the Smith-Waterman algorithm. This extremely intensive computational approach was made possible with the help of World Community Grid, through the Genome Comparison Project. The resulting database, ProteinWorldDB, which contains coordinates of pairwise protein alignments and their respective scores, is now made available. Users can download, compare and analyze the results, filtered by genomes, protein functions or clusters. ProteinWorldDB is integrated with annotations derived from Swiss-Prot, Pfam, KEGG, NCBI Taxonomy database and gene ontology. The database is a unique and valuable asset, representing a major effort to create a reliable and consistent dataset of cross-comparisons of the whole protein content encoded in hundreds of completely sequenced genomes using a rigorous dynamic programming approach. The database can be accessed through http://proteinworlddb.org
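    The abstract contrasts heuristic comparison with rigorous dynamic programming via the Smith-Waterman algorithm. As a minimal sketch of that technique (not the ProteinWorldDB implementation), the function below computes the best local alignment score between two sequences. The scoring values (match +2, mismatch -1, linear gap -2) and the function name are assumptions chosen for illustration; protein comparisons in practice typically use a substitution matrix and affine gap penalties.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Best local alignment score of a and b (Smith-Waterman, linear gap penalty).

    Scoring parameters are illustrative assumptions, not those of any database.
    """
    prev = [0] * (len(b) + 1)   # previous row of the dynamic programming matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Extend the alignment diagonally with a match or a mismatch.
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap          # gap in b
            left = curr[j - 1] + gap    # gap in a
            # Flooring at zero lets a local alignment restart anywhere.
            curr[j] = max(0, diag, up, left)
            best = max(best, curr[j])
        prev = curr
    return best


print(smith_waterman_score("TTACGTTT", "GGACGGG"))  # shared local region "ACG"
```

    Only two rows of the matrix are kept because the sketch returns the score alone; recovering alignment coordinates, as stored in the database described above, would require a traceback over the full matrix.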

  13. The complete mitochondrial genome of the Korean skate: Hongeo koreana (Rajiformes, Rajidae).

    Science.gov (United States)

    Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho

    2014-12-01

    The complete mitochondrial genome of the Korean skate, Hongeo koreana, the sole member of its genus, is investigated for the first time. The genome is 16,906 bp in length and includes 2 rRNA, 22 tRNA and 13 protein-coding genes, with the same gene order and genome structure as those of other Rajidae species. The overall nucleotide composition of the L-strand is A = 29.8%, C = 27.9%, T = 27.9% and G = 14.3%, showing a high A + T bias. The anti-G bias (6.0%) is more significant in the third codon position. Twelve of the 13 protein-coding genes use ATG as their start codon, while the COX1 gene starts with GTG. As for stop codons, the ND3 and ND4 genes show the incomplete stop codon T. The mitogenome sequence of H. koreana will provide important information on the evolution and the phylogenetic relation of the genus Hongeo in relation to the other genera of the family Rajidae.

  14. Complete mitochondrial genome of the blacknose shark Carcharhinus acronotus (Elasmobranchii: Carcharhinidae).

    Science.gov (United States)

    Yang, Lei; Matthes-Rosana, Kerri A; Naylor, Gavin J P

    2016-01-01

    The complete mitochondrial genome of the blacknose shark Carcharhinus acronotus has been determined in this work. It has a length of 16,719 bp and consists of 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 1 control region. The gene composition and genome organization were similar to those of other vertebrates. This study represents part of an ongoing effort to obtain mitochondrial genome sequences for chondrichthyan species in order to better estimate their phylogenetic relationships.

  15. The complete genome sequence and analysis of the human pathogen Campylobacter lari

    DEFF Research Database (Denmark)

    Miller, WG; Wang, G; Binnewies, Tim Terence

    2008-01-01

    Campylobacter lari is a member of the epsilon subdivision of the Proteobacteria and is part of the thermotolerant Campylobacter group, a clade that includes the human pathogen C. jejuni. Here we present the complete genome sequence of the human clinical isolate, C. lari RM2100. The genome of strain...... RM2100 is approximately 1.53 Mb and includes the 46 kb megaplasmid pCL2100. Also present within the strain RM2100 genome is a 36 kb putative prophage, termed CLIE1, which is similar to CJIE4, a putative prophage present within the C. jejuni RM1221 genome. Nearly all (90%) of the gene content...... in strain RM2100 is similar to genes present in the genomes of other characterized thermotolerant campylobacters. However, several genes involved in amino acid biosynthesis and energy metabolism, identified previously in other Campylobacter genomes, are absent from the C. lari RM2100 genome. Therefore, C...

  16. Transcriptional and phylogenetic analysis of five complete ambystomatid salamander mitochondrial genomes.

    Science.gov (United States)

    Samuels, Amy K; Weisrock, David W; Smith, Jeramiah J; France, Katherine J; Walker, John A; Putta, Srikrishna; Voss, S Randal

    2005-04-11

    We report on a study that extended mitochondrial transcript information from a recent EST project to obtain complete mitochondrial genome sequence for 5 tiger salamander complex species (Ambystoma mexicanum, A. t. tigrinum, A. andersoni, A. californiense, and A. dumerilii). We describe, for the first time, aspects of mitochondrial transcription in a representative amphibian, and then use complete mitochondrial sequence data to examine salamander phylogeny at both deep and shallow levels of evolutionary divergence. The available mitochondrial ESTs for A. mexicanum (N=2481) and A. t. tigrinum (N=1205) provided 92% and 87% coverage of the mitochondrial genome, respectively. Complete mitochondrial sequences for all species were rapidly obtained by using long distance PCR and DNA sequencing. A number of genome structural characteristics (base pair length, base composition, gene number, gene boundaries, codon usage) were highly similar among all species and to other distantly related salamanders. Overall, mitochondrial transcription in Ambystoma approximated the pattern observed in other vertebrates. We inferred from the mapping of ESTs onto mtDNA that transcription occurs from both heavy and light strand promoters and continues around the entire length of the mtDNA, followed by post-transcriptional processing. However, the observation of many short transcripts corresponding to rRNA genes indicates that transcription may often terminate prematurely to bias transcription of rRNA genes; indeed an rRNA transcription termination signal sequence was observed immediately following the 16S rRNA gene. Phylogenetic analyses of salamander family relationships consistently grouped Ambystomatidae in a clade containing Cryptobranchidae and Hynobiidae, to the exclusion of Salamandridae. This robust result suggests a novel alternative hypothesis because previous studies have consistently identified Ambystomatidae and Salamandridae as closely related taxa. Phylogenetic analyses of tiger

  17. Complete Genome Sequence of Bacillus velezensis L-1, Which Has Antagonistic Activity against Pear Diseases

    OpenAIRE

    Sun, Pingping; Cui, Jianchao; Jia, Xiaohui; Wang, Wenhui

    2017-01-01

    Bacillus velezensis L-1 is an effective biocontrol agent against pear diseases. Here, we report the complete genome sequence of B. velezensis L-1 in which clusters related to the biosynthesis of secondary metabolites were predicted. This genome provides insights into the possible biocontrol mechanisms and furthers application of this specific bacterium.

  18. Complete Genome Sequence of Bacillus velezensis L-1, Which Has Antagonistic Activity against Pear Diseases.

    Science.gov (United States)

    Sun, Pingping; Cui, Jianchao; Jia, Xiaohui; Wang, Wenhui

    2017-11-30

    Bacillus velezensis L-1 is an effective biocontrol agent against pear diseases. Here, we report the complete genome sequence of B. velezensis L-1 in which clusters related to the biosynthesis of secondary metabolites were predicted. This genome provides insights into the possible biocontrol mechanisms and furthers application of this specific bacterium. Copyright © 2017 Sun et al.

  19. Characterization of the complete mitochondrial genome of the Rhinolophus sinicus sinicus (Chiroptera: Rhinolophidae) from Central China.

    Science.gov (United States)

    Xie, Lifen; Sun, Keping; Feng, Jiang

    2016-07-01

    We present a complete mitochondrial genome sequence of Rhinolophus sinicus sinicus from Central China and provide its annotation, together with its phylogenetic relationship to, and mitogenomic variation from, other published mitochondrial genomes of congeneric bat species. Our results revealed relatively high mitogenomic variation between two R. s. sinicus from Central and East China, similar to the interspecific divergence level.

  20. Complete Genome Sequence of the Novel Bacteriophage pSco-10 Infecting Staphylococcus cohnii.

    Science.gov (United States)

    Jun, Jin Woo; Giri, Sib Sankar; Kim, Hyoun Joong; Chi, Cheng; Yun, Saekil; Kim, Sang Guen; Kim, Sang Wha; Kang, Jeong Woo; Park, Se Chang

    2017-11-22

    Herein, we report the complete genome sequence of the Staphylococcus Myoviridae phage pSco-10 infecting Staphylococcus cohnii. The phage pSco-10 was isolated from duck feces collected from four farms in South Korea. The current report provides valuable information for genomic study of phages. Copyright © 2017 Jun et al.

  1. Complete Genome Sequence of the Novel Bacteriophage pSco-10 Infecting Staphylococcus cohnii

    OpenAIRE

    Jun, Jin Woo; Giri, Sib Sankar; Kim, Hyoun Joong; Chi, Cheng; Yun, Saekil; Kim, Sang Guen; Kim, Sang Wha; Kang, Jeong Woo; Park, Se Chang

    2017-01-01

    Herein, we report the complete genome sequence of the Staphylococcus Myoviridae phage pSco-10 infecting Staphylococcus cohnii. The phage pSco-10 was isolated from duck feces collected from four farms in South Korea. The current report provides valuable information for genomic study of phages.

  2. Complete genome sequence of a divergent strain of lettuce chlorosis virus from Periwinkle in China

    Science.gov (United States)

    A novel strain of Lettuce chlorosis virus (LCV) was identified from periwinkle in China (PW) with foliar interveinal chlorosis and plant dwarfing. Complete nucleotide (nt) sequences of genomic RNA1 and RNA2 of the virus are 8,602 nt and 8,456 nt, respectively. The genomic organization of LCV-PW rese...

  3. Complete Genome Sequence of the Anaerobic Halophilic Alkalithermophile Natranaerobius thermophilus JW/NM-WN-LFT

    Energy Technology Data Exchange (ETDEWEB)

    Mesbah, Noha [University of Georgia, Athens, GA; Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Chertkov, Olga [Los Alamos National Laboratory (LANL); Han, James [U.S. Department of Energy, Joint Genome Institute; Larimer, Frank W [ORNL; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Wiegel, Juergen [University of Georgia, Athens, GA

    2011-01-01

    The genome of the anaerobic halophilic alkalithermophile Natranaerobius thermophilus consists of one chromosome and two plasmids. The present study is the first to report the completely sequenced genome of a polyextremophile and the genes it harbors that are associated with roles in regulation of intracellular osmotic pressure, pH homeostasis, and thermophilic stability.

  4. Complete Genome Sequence of the Fruiting Myxobacterium Melittangium boletus DSM 14713.

    Science.gov (United States)

    Treuner-Lange, Anke; Bruckskotten, Marc; Rupp, Oliver; Goesmann, Alexander; Søgaard-Andersen, Lotte

    2017-11-09

    The formation of spore-filled fruiting bodies in response to starvation represents a hallmark of many members of the order Myxococcales. Here, we present the complete 9.9-Mb genome of the fruiting type strain Melittangium boletus DSM 14713, the first member of this genus to have its genome sequenced. Copyright © 2017 Treuner-Lange et al.

  5. Complete genome sequence of Paris mosaic necrosis virus, a distinct member of the genus Potyvirus

    Science.gov (United States)

    The complete genomic sequence of a novel potyvirus was determined from Paris polyphylla var. yunnanensis. Its genomic RNA consists of 9,660 nucleotides (nt) excluding the 3’-terminal poly (A) tail, containing a single open reading frame (ORF) encoding a large polyprotein. The virus shares 52.1-69.7%...

  6. Complete genome sequence of Bifidobacterium breve CECT 7263, a strain isolated from human milk.

    Science.gov (United States)

    Jiménez, Esther; Villar-Tajadura, M Antonia; Marín, María; Fontecha, Javier; Requena, Teresa; Arroyo, Rebeca; Fernández, Leónides; Rodríguez, Juan M

    2012-07-01

    Bifidobacterium breve is an actinobacterium frequently isolated from colonic microbiota of breastfeeding babies. Here, we report the complete and annotated genome sequence of a B. breve strain isolated from human milk, B. breve CECT 7263. The genome sequence will provide new insights into the biology of this potential probiotic organism and will allow the characterization of genes related to beneficial properties.

  7. Complete Genome Sequences of Getah Virus Strains Isolated from Horses in 2016 in Japan.

    Science.gov (United States)

    Nemoto, Manabu; Bannai, Hiroshi; Ochi, Akihiro; Niwa, Hidekazu; Murakami, Satoshi; Tsujimura, Koji; Yamanaka, Takashi; Kokado, Hiroshi; Kondo, Takashi

    2017-08-03

    Getah virus is mosquito-borne and causes disease in horses and pigs. We sequenced and analyzed the complete genomes of three strains isolated from horses in Ibaraki Prefecture, eastern Japan, in 2016. They were almost identical to the genomes of strains recently isolated from horses, pigs, and mosquitoes in Japan. Copyright © 2017 Nemoto et al.

  8. Complete genome sequence of thermophilic Bacillus smithii type strain DSM 4216T

    DEFF Research Database (Denmark)

    Bosma, Elleke Fenna; Koehorst, Jasper J.; van Hijum, Sacha A. F. T.

    2016-01-01

    determined the complete genomic sequence of the B. smithii type strain DSM 4216T, which consists of a 3,368,778 bp chromosome (GenBank accession number CP012024.1) and a 12,514 bp plasmid (GenBank accession number CP012025.1), together encoding 3880 genes. Genome annotation via RAST was complemented...

  9. Complete Genome Sequence of the Yogurt Isolate Lactobacillus delbrueckii subsp. bulgaricus ACA-DC 87.

    Science.gov (United States)

    Alexandraki, Voula; Kazou, Maria; Pot, Bruno; Tsakalidou, Effie; Papadimitriou, Konstantinos

    2017-08-24

    Lactobacillus delbrueckii subsp. bulgaricus is widely used in the production of yogurt and cheese. In this study, we present the complete genome sequence of L. delbrueckii subsp. bulgaricus ACA-DC 87 isolated from traditional Greek yogurt. Whole-genome analysis may reveal desirable technological traits of the strain for dairy fermentations. Copyright © 2017 Alexandraki et al.

  10. Complete genome sequence of a recent panzootic virulent Newcastle disease virus from Pakistan

    Science.gov (United States)

    Complete genome sequence of a new strain of Newcastle disease virus (NDV) (chicken/Pak/Lahore-611/2013) is reported. The strain was isolated from a vaccinated chicken flock in Pakistan in 2013 and has panzootic features. The genome is 15192 nucleotides in length and is classified as sub-genotype V...

  11. Complete genome sequences and comparative genome analysis of Lactobacillus plantarum strain 5-2 isolated from fermented soybean.

    Science.gov (United States)

    Liu, Chen-Jian; Wang, Rui; Gong, Fu-Ming; Liu, Xiao-Feng; Zheng, Hua-Jun; Luo, Yi-Yong; Li, Xiao-Ran

    2015-12-01

    Lactobacillus plantarum is an important probiotic and is mostly isolated from fermented foods. We sequenced the genome of L. plantarum strain 5-2, which was derived from fermented soybean isolated from Yunnan province, China. The strain was determined to contain 3114 genes. Fourteen complete insertion sequence (IS) elements were found in the 5-2 chromosome. There were 24 DNA replication proteins and 76 DNA repair proteins in the 5-2 genome. Consistent with the classification of L. plantarum as a facultative heterofermentative lactobacillus, the 5-2 genome encodes key enzymes required for the EMP (Embden-Meyerhof-Parnas) and phosphoketolase (PK) pathways. Several components of the secretion machinery are found in the 5-2 genome, which was compared with those of L. plantarum ST-III, JDM1 and WCFS1. Most of the specific proteins in the four genomes appeared to be related to their prophage elements. Copyright © 2015 Elsevier Inc. All rights reserved.

  12. Complete genome sequence of Leptospira alstonii serovar room 22, strain GWTS#1

    Science.gov (United States)

    We report the complete genome sequence of Leptospira alstonii serovar room 22 strain GWTS#1. This is the first isolate of L. alstonii to be cultured from a mammal, in Western Europe, and represents a new serovar of pathogenic leptospires....

  13. From Sequence to Morphology - Long-Range Correlations in Complete Sequenced Genomes

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2004-01-01

    The largely unresolved sequential organization, i.e. the relations within DNA sequences, and its connection to the three-dimensional organization of genomes was investigated by correlation analyses of completely sequenced chromosomes from Viroids, Archaea, Bacteria, Arabidopsis

  14. First Complete Genome Sequence of a Watermelon Mosaic Virus Isolated from Watermelon in the United States

    OpenAIRE

    Rajbanshi, Naveen; Ali, Akhtar

    2016-01-01

    Watermelon mosaic virus was first reported in 1965 from the Rio Grande Valley, TX. We report here the first complete genome sequence of a watermelon mosaic virus isolate from watermelon collected from the Rio Grande Valley of Texas.

  15. Complete genome sequence of Campylobacter jejuni strain 12567 a livestock-associated clade representative

    Science.gov (United States)

    We report the complete genome sequence of the Campylobacter jejuni strain 12567, a member of a C. jejuni livestock-associated clade that expresses glycoconjugates linked to improved gastrointestinal tract persistence....

  16. Complete genome sequence of Paenibacillus sp. strain JDR-2

    Science.gov (United States)

    Virginia Chow; Guang Nong; Franz J. St. John; John D. Rice; Ellen Dickstein; Olga Chertkov; David Bruce; Chris Detter; Thomas Brettin; James Han; Tanja Woyke; Sam Pitluck; Matt Nolan; Amrita Pati; Joel Martin; Alex Copeland; Miriam L. Land; Lynne Goodwin; Jeffrey B. Jones; Lonnie O. Ingram; Keelnathan T. Shanmugam; James F. Preston

    2012-01-01

    Paenibacillus sp. strain JDR-2, an aggressively xylanolytic bacterium isolated from sweetgum (Liquidambar styraciflua) wood, is able to efficiently depolymerize, assimilate and metabolize 4-O-methylglucuronoxylan, the predominant structural component of hardwood hemicelluloses. A basis for this capability was first supported by...

  17. Complete Genome Sequence of Genotype VI Newcastle Disease Viruses Isolated from Pigeons in Pakistan

    Science.gov (United States)

    Wajid, Abdul; Rehmani, Shafqat Fatima; Sharma, Poonam; Goraichuk, Iryna V.; Dimitrov, Kiril M.

    2016-01-01

    Two complete genome sequences of Newcastle disease virus (NDV) are described here. Virulent isolates pigeon/Pakistan/Lahore/21A/2015 and pigeon/Pakistan/Lahore/25A/2015 were obtained from racing pigeons sampled in the Pakistani province of Punjab during 2015. Phylogenetic analysis of the fusion protein genes and complete genomes classified the isolates as members of NDV class II, genotype VI. PMID:27540069

  18. The complete mitochondrial genome of Zebrias quagga (Pleuronectiformes: Soleidae).

    Science.gov (United States)

    Li, Dong-He; Shi, Wei; Miao, Xian-Guang; Kong, Xiao-Yu

    2016-01-01

    Zebrias quagga (Soleoidei, Soleidae) is a small to medium-sized commercial flatfish, characterized by both eyes on the right side of the body and a short dark brown tentacle on each eye. In this paper, the complete mitogenome sequence of Z. quagga was determined for the first time; it is 17,045 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, as well as a control region (CR) and an L-strand replication origin (OL). Gene contents, locations, and orders are identical to those of typical teleostean mtDNA. The nucleotide composition of the whole mitogenome is 28.8%, 29.3%, 15.8%, and 26.1% for A, C, G, and T, respectively, with a slight bias toward A+T content (54.9%). This result is expected to contribute to a better understanding of the phylogeny of Soleidae and Pleuronectiformes.
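    The A+T figure above follows directly from the per-base percentages. The sketch below is illustrative only (the helper names are hypothetical): it recomputes A+T from the reported composition and shows how the same percentages could be derived from a raw sequence string, counting only unambiguous A, C, G and T bases.

```python
from collections import Counter


def base_composition(seq: str) -> dict[str, float]:
    """Percentage of A, C, G and T in seq, ignoring any other symbols (e.g. N)."""
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


def at_content(composition: dict[str, float]) -> float:
    """A+T percentage from a per-base composition."""
    return composition["A"] + composition["T"]


# Composition reported for the Z. quagga mitogenome in the abstract above.
reported = {"A": 28.8, "C": 29.3, "G": 15.8, "T": 26.1}
print(round(at_content(reported), 1))  # compare with the reported 54.9%

# Toy sequence, purely for demonstration.
print(base_composition("AACGTT"))
```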

  19. Analysis of the complete genome sequence of Nocardia seriolae UTF1, the causative agent of fish nocardiosis: The first reference genome sequence of the fish pathogenic Nocardia species.

    Science.gov (United States)

    Yasuike, Motoshige; Nishiki, Issei; Iwasaki, Yuki; Nakamura, Yoji; Fujiwara, Atushi; Shimahara, Yoshiko; Kamaishi, Takashi; Yoshida, Terutoyo; Nagai, Satoshi; Kobayashi, Takanori; Katoh, Masaya

    2017-01-01

    Nocardiosis caused by Nocardia seriolae is one of the major threats in the aquaculture of Seriola species (yellowtail; S. quinqueradiata, amberjack; S. dumerili and kingfish; S. lalandi) in Japan. Here, we report the complete nucleotide genome sequence of N. seriolae UTF1, isolated from a cultured yellowtail. The genome is a circular chromosome of 8,121,733 bp with a G+C content of 68.1% that encodes 7,697 predicted proteins. In the N. seriolae UTF1 predicted genes, we found orthologs of virulence factors of pathogenic mycobacteria and human clinical Nocardia isolates involved in host cell invasion, modulation of phagocyte function and survival inside the macrophages. The virulence factor candidates provide an essential basis for understanding their pathogenic mechanisms at the molecular level by the fish nocardiosis research community in future studies. We also found many potential antibiotic resistance genes on the N. seriolae UTF1 chromosome. Comparative analysis with the four existing complete genomes, N. farcinica IFM 10152, N. brasiliensis HUJEG-1 and N. cyriacigeorgica GUH-2 and N. nova SH22a, revealed that 2,745 orthologous genes were present in all five Nocardia genomes (core genes) and 1,982 genes were unique to N. seriolae UTF1. In particular, the N. seriolae UTF1 genome contains a greater number of mobile elements and genes of unknown function that comprise the differences in structure and gene content from the other Nocardia genomes. In addition, a lot of the N. seriolae UTF1-specific genes were assigned to the ABC transport system. Because of limited resources in ocean environments, these N. seriolae UTF1 specific ABC transporters might facilitate adaptation strategies essential for marine environment survival. Thus, the availability of the complete N. seriolae UTF1 genome sequence will provide a valuable resource for comparative genomic studies of N. seriolae isolates, as well as provide new insights into the ecological and functional diversity of

  20. Complete chloroplast genome of Gracilaria firma (Gracilariaceae, Rhodophyta), with discussion on the use of chloroplast phylogenomics in the subclass Rhodymeniophycidae.

    Science.gov (United States)

    Ng, Poh-Kheng; Lin, Showe-Mei; Lim, Phaik-Eem; Liu, Li-Chia; Chen, Chien-Ming; Pai, Tun-Wen

    2017-01-06

    The chloroplast genome of Gracilaria firma was sequenced in view of its role as an economically important marine crop with wide industrial applications. To date, there are only 15 chloroplast genomes published for the Florideophyceae. Apart from presenting the complete chloroplast genome of G. firma, this study also assessed the utility of genome-scale data to address the phylogenetic relationships within the subclass Rhodymeniophycidae. The synteny and genome structure of the chloroplast genomes across the taxa of Eurhodophytina was also examined. The chloroplast genome of Gracilaria firma maps as a circular molecule of 187,001 bp and contains 252 genes, which are distributed on both strands and consist of 35 RNA genes (3 rRNAs, 30 tRNAs, tmRNA and a ribonuclease P RNA component) and 217 protein-coding genes, including the unidentified open reading frames. The chloroplast genome of G. firma is by far the largest reported for Gracilariaceae, featuring a unique intergenic region of about 7000 bp with discontinuous vestiges of red algal plasmid DNA sequences interspersed between the nblA and cpeB genes. This chloroplast genome shows similar gene content and order to other Florideophycean taxa. Phylogenomic analyses based on the concatenated amino acid sequences of 146 protein-coding genes confirmed the monophyly of the classes Bangiophyceae and Florideophyceae with full nodal support. Relationships within the subclass Rhodymeniophycidae in Florideophyceae received moderate to strong nodal support, and the monotypic family of Gracilariales were resolved with maximum support. Chloroplast genomes hold substantial information that can be tapped for resolving the phylogenetic relationships of difficult regions in the Rhodymeniophycidae, which are perceived to have experienced rapid radiation and thus received low nodal support, as exemplified in this study. The present study shows that chloroplast genome of G. firma could serve as a key link to the full resolution of

  1. Analysis of genotype diversity and evolution of Dengue virus serotype 2 using complete genomes

    Directory of Open Access Journals (Sweden)

    Vaishali P. Waman

    2016-08-01

    Full Text Available Background: Dengue is one of the most common arboviral diseases prevalent worldwide and is caused by Dengue viruses (genus Flavivirus, family Flaviviridae). There are four serotypes of Dengue Virus (DENV-1 to DENV-4), each of which is further subdivided into distinct genotypes. DENV-2 is frequently associated with severe dengue infections and epidemics. DENV-2 consists of six genotypes: Asian/American, Asian I, Asian II, Cosmopolitan, American and sylvatic. Comparative genomic study was carried out to infer population structure of DENV-2 and to analyze the role of evolutionary and spatiotemporal factors in emergence of diversifying lineages. Methods: Complete genome sequences of 990 strains of DENV-2 were analyzed using Bayesian-based population genetics and phylogenetic approaches to infer genetically distinct lineages. The role of spatiotemporal factors, genetic recombination and selection pressure in the evolution of DENV-2 is examined using the sequence-based bioinformatics approaches. Results: DENV-2 genetic structure is complex and consists of fifteen subpopulations/lineages. The Asian/American genotype is observed to be diversified into seven lineages. The Asian I, Cosmopolitan and sylvatic genotypes were found to be subdivided into two lineages each. The populations of American and Asian II genotypes were observed to be homogeneous. Significant evidence of episodic positive selection was observed in all the genes, except NS4A. Positive selection operational on a few codons in envelope gene confers antigenic and lineage diversity in the American strains of Asian/American genotype. Selection on codons of non-structural genes was observed to impact diversification of lineages in Asian I, Cosmopolitan and sylvatic genotypes. Evidence of intra/inter-genotype recombination was obtained and the uncertainty in classification of recombinant strains was resolved using the population genetics approach. Discussion: Complete genome-based analysis

  2. Complete Genomes of Classical Swine Fever Virus Cloned into Bacterial Artificial Chromosomes

    DEFF Research Database (Denmark)

    Rasmussen, Thomas Bruun; Reimann, I.; Uttenthal, Åse

    Complete genome amplification of viral RNA provides a new tool for the generation of modified pestiviruses. We have used our full-genome amplification strategy for generation of amplicons representing complete genomes of classical swine fever virus. The amplicons were cloned directly into a stable single-copy bacterial artificial chromosome (BAC), generating full-length pestivirus DNAs from which infectious RNA transcripts could also be derived. Our strategy allows construction of stable infectious BAC DNAs from a single full-length PCR product.

  3. Complete genome sequence of Catenulispora acidiphila type strain (ID 139908T)

    Energy Technology Data Exchange (ETDEWEB)

    Copeland, Alex; Lapidus, Alla; Rio, Tijana GlavinaDel; Nolan, Matt; Lucas, Susan; Chen, Feng; Tice, Hope; Cheng, Jan-Fang; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Mikhailova, Natalia; Pati, Amrita; Ivanova, Natalia; Mavromatis, Konstantinos; Chen, Amy; Palaniappan, Krishna; Chain, Patrick; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia C.; Chertkov, Olga; Brettin, Thomas; Detter, John C.; Han, Cliff; Ali, Zahid; Tindall, Brian J.; Goker, Markus; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2009-05-20

    Catenulispora acidiphila Busti et al. 2006 is the type species of the genus Catenulispora, and is of interest because of the rather isolated phylogenetic location of the genomically little studied suborder Catenulisporineae within the order Actinomycetales. C. acidiphila is known for its acidophilic, aerobic lifestyle, but can also grow scantly under anaerobic conditions. Under regular conditions C. acidiphila grows in long filaments of relatively short aerial hyphae with marked septation. It is a free-living, non-motile, Gram-positive bacterium isolated from a forest soil sample taken from a wooded area in Gerenzano, Italy. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of the actinobacterial family Catenulisporaceae, and the 10,467,782 bp long single replicon genome with its 9056 protein-coding and 69 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  4. Complete genome sequence of Haliangium ochraceum type strain (SMP-2T)

    Energy Technology Data Exchange (ETDEWEB)

    Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Daum, Chris [U.S. Department of Energy, Joint Genome Institute]; Lang, Elke [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Abt, Birte [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Kopitz, Marcus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Saunders, Elizabeth H [Los Alamos National Laboratory (LANL)]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute]; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute]; Tice, Hope [U.S. Department of Energy, Joint Genome Institute]; Copeland, A [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Chen, Feng [U.S. Department of Energy, Joint Genome Institute]; Bruce, David [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [U.S. Department of Energy, Joint Genome Institute]; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Chang, Yun-Juan [ORNL]; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL)]; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute]; Brettin, Thomas S [ORNL]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Bristow, James [U.S. Department of Energy, Joint Genome Institute]; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute]; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]

    2010-01-01

    Haliangium ochraceum Fudou et al. 2002 is the type species of the genus Haliangium in the myxococcal family Haliangiaceae. Members of the genus Haliangium are the first halophilic myxobacterial taxa described. The cells of the species follow a multicellular lifestyle in highly organized biofilms, called swarms, and decompose bacterial and yeast cells as most myxobacteria do. The fruiting bodies contain particularly small coccoid myxospores. H. ochraceum encodes the first actin homologue identified in a bacterial genome. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of a member of the myxococcal suborder Nannocystineae, and the 9,446,314 bp long single replicon genome with its 6,898 protein-coding and 53 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  5. Complete genome sequence of Leptotrichia buccalis type strain (C-1013-bT)

    Energy Technology Data Exchange (ETDEWEB)

    Ivanova, Natalia; Gronow, Sabine; Lapidus, Alla; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Lucas, Susan; Chen, Feng; Tice, Hope; Cheng, Jan-Fang; Saunders, Liz; Bruce, David; Goodwin, Lynne; Brettin, Thomas; Detter, John C.; Han, Cliff; Pitluck, Sam; Mikhailova, Natalia; Pati, Amrita; Mavromatis, Konstantinos; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia C.; Chain, Patrick; Rohde, Christine; Goker, Markus; Bristow, Jim; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2009-05-20

    Leptotrichia buccalis (Robin 1853) Trevisan 1879 is the type species of the genus, and is of phylogenetic interest because of its isolated location in the sparsely populated and neither taxonomically nor genomically adequately accessed family 'Leptotrichiaceae' within the phylum 'Fusobacteria'. Species of Leptotrichia are large fusiform non-motile, non-sporulating rods, which often populate the human oral flora. L. buccalis is anaerobic to aerotolerant, and saccharolytic. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of the order 'Fusobacteriales' and no more than the second sequence from the phylum 'Fusobacteria'. The 2,465,610 bp long single replicon genome with its 2306 protein-coding and 61 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  6. Complete genome sequence of Coraliomargarita akajimensis type strain (04OKA010-24T)

    Energy Technology Data Exchange (ETDEWEB)

    Mavromatis, Konstantinos; Abt, Birte; Brambilla, Evelyne; Lapidus, Alla; Copeland, Alex; Desphande, Shweta; Nolan, Matt; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Han, Cliff; Detter, John C.; Woyke, Tanja; Goodwin, Lynne; Pitluck, Sam; Held, Brittany; Brettin, Thomas; Tapia, Roxanne; Ivanova, Natalia; Mikhailova, Natalia; Pati, Amrita; Liolios, Konstantinos; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jeffries, Cynthia D.; Rohde, Manfred; Göker, Markus; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Klenk, Hans-Peter; Kyrpides, Nikos C.

    2010-06-25

    Coraliomargarita akajimensis Yoon et al. 2007 is the type species of the genus Coraliomargarita. C. akajimensis is an obligately aerobic, Gram-negative, non-spore-forming, non-motile, spherical bacterium which was isolated from seawater surrounding the hard coral Galaxea fascicularis. C. akajimensis is of special interest because of its phylogenetic position in a genomically poorly studied area of bacterial diversity. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of a member of the family Puniceicoccaceae. The 3,750,771 bp long genome with its 3,137 protein-coding and 55 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  7. Phylogenetic tree based on complete genomes using fractal and correlation analyses without sequence alignment

    Directory of Open Access Journals (Sweden)

    Zu-Guo Yu

    2006-06-01

    Full Text Available The complete genomes of living organisms have provided much information on their phylogenetic relationships. Similarly, the complete genomes of chloroplasts have helped resolve the evolution of this organelle in photosynthetic eukaryotes. In this review, we describe two algorithms to construct phylogenetic trees based on the theories of fractals and dynamic language using complete genomes. These algorithms were developed by our research group in the past few years. Our distance-based phylogenetic tree of 109 prokaryotes and eukaryotes agrees with the biologists' "tree of life" based on the 16S-like rRNA genes in a majority of basic branchings and most lower taxa. Our phylogenetic analysis also shows that the chloroplast genomes are separated into two major clades corresponding to chlorophytes s.l. and rhodophytes s.l. The interrelationships among the chloroplasts are largely in agreement with the current understanding on chloroplast evolution.

  8. Complete genome sequence of Parvibaculum lavamentivorans type strain (DS-1(T)).

    Science.gov (United States)

    Schleheck, David; Weiss, Michael; Pitluck, Sam; Bruce, David; Land, Miriam L; Han, Shunsheng; Saunders, Elizabeth; Tapia, Roxanne; Detter, Chris; Brettin, Thomas; Han, James; Woyke, Tanja; Goodwin, Lynne; Pennacchio, Len; Nolan, Matt; Cook, Alasdair M; Kjelleberg, Staffan; Thomas, Torsten

    2011-12-31

    Parvibaculum lavamentivorans DS-1(T) is the type species of the novel genus Parvibaculum in the novel family Rhodobiaceae (formerly Phyllobacteriaceae) of the order Rhizobiales of Alphaproteobacteria. Strain DS-1(T) is a non-pigmented, aerobic, heterotrophic bacterium and represents the first tier member of environmentally important bacterial communities that catalyze the complete degradation of synthetic laundry surfactants. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 3,914,745 bp long genome with its predicted 3,654 protein coding genes is the first completed genome sequence of the genus Parvibaculum, and the first genome sequence of a representative of the family Rhodobiaceae.

  9. Comparative Genetic Analyses of Human Rhinovirus C (HRV-C) Complete Genome from Malaysia

    Science.gov (United States)

    Khaw, Yam Sim; Chan, Yoke Fun; Jafar, Faizatul Lela; Othman, Norlijah; Chee, Hui Yee

    2016-01-01

    Human rhinovirus-C (HRV-C) has been implicated in more severe illnesses than HRV-A and HRV-B, however, the limited number of HRV-C complete genomes (complete 5′ and 3′ non-coding region and open reading frame sequences) has hindered the in-depth genetic study of this virus. This study aimed to sequence seven complete HRV-C genomes from Malaysia and compare their genetic characteristics with the 18 published HRV-Cs. Seven Malaysian HRV-C complete genomes were obtained with newly redesigned primers. The seven genomes were classified as HRV-C6, C12, C22, C23, C26, C42, and pat16 based on the VP4/VP2 and VP1 pairwise distance threshold classification. Five of the seven Malaysian isolates, namely, 3430-MY-10/C22, 8713-MY-10/C23, 8097-MY-11/C26, 1570-MY-10/C42, and 7383-MY-10/pat16 are the first newly sequenced complete HRV-C genomes. All seven Malaysian isolates genomes displayed nucleotide similarity of 63–81% among themselves and 63–96% with other HRV-Cs. Malaysian HRV-Cs had similar putative immunogenic sites, putative receptor utilization and potential antiviral sites as other HRV-Cs. The genomic features of Malaysian isolates were similar to those of other HRV-Cs. Negative selections were frequently detected in HRV-Cs complete coding sequences indicating that these sequences were under functional constraint. The present study showed that HRV-Cs from Malaysia have diverse genetic sequences but share conserved genomic features with other HRV-Cs. This genetic information could provide further aid in the understanding of HRV-C infection. PMID:27199901

  10. The complete mitochondrial genome of Gryllotalpa unispina Saussure, 1874 (Orthoptera: Gryllotalpoidea: Gryllotalpidae).

    Science.gov (United States)

    Zhang, Yulong; Shao, Dandan; Cai, Miao; Yin, Hong; Zhang, Daochuan

    2016-01-01

    The complete mitochondrial genome of Gryllotalpa unispina was 15,513 bp in length and contained 70.9% AT. All G. unispina protein-coding sequences except for nad2 started with a typical ATN codon. The usual termination codon (TAA) and incomplete stop codons (T) were found in the 13 protein-coding genes. All tRNA genes folded into the typical cloverleaf secondary structure, except trnS(AGN), which lacks the dihydrouridine arm. The sizes of the large and small ribosomal RNA genes were 1245 and 725 bp, respectively. The A + T-rich region was 917 bp in length with an A + T content of 76.8%. The orientation and gene order of the G. unispina mitogenome were identical to those of G. orientalis and G. pluvialis; there was no sign of the "DK rearrangement" that has been widely reported in Caelifera.

  11. Complete mitochondrial genome of the whitetip reef shark Triaenodon obesus (Carcharhiniformes: Carcharhinidae).

    Science.gov (United States)

    Chen, Xiao; Sonchaeng, Pichai; Yuvanatemiya, Vasin; Nuangsaeng, Bunlung; Ai, Weiming

    2016-01-01

    The complete mitochondrial genome of the whitetip reef shark Triaenodon obesus is determined in this study. It is 16,700 bp in length, with the typical gene composition, arrangement and transcriptional orientation in vertebrates. The overall base composition is 31.4% A, 25.8% C, 13.2% G and 29.7% T. Two start codons and two stop codons are found in the protein-coding genes. The 22 tRNA genes ranged from 67 to 75 nucleotides. The tRNA-Ser2 lost the DHU arm and could not be folded to the typical cloverleaf secondary structure. The origin of L-strand replication (OL) sequence was identified between tRNA-Asn and tRNA-Cys genes. The high A+T content of control region is due to a lot of poly A and poly T.

  12. Complete mitochondrial genome of the scalloped hammerhead Sphyrna lewini (Carcharhiniformes: Sphyrnidae).

    Science.gov (United States)

    Chen, Xiao; Xiang, Dan; Xu, Yuziwei; Shi, Xiaofang

    2015-08-01

    The complete mitochondrial genome of the endangered scalloped hammerhead Sphyrna lewini was determined for the first time in this study. It is 16,726 bp in length with the typical gene composition and order in vertebrates. The overall base composition is 31.4% A, 26.3% C, 13.2% G and 29.1% T. Two start codon (ATG and GTG) and three stop codon (TAG, AGA and TAA/TA/T) patterns were found in protein-coding genes. Except for tRNA-Ser2, the remaining 21 tRNAs can be folded into the typical cloverleaf structure. The control region possesses the highest A + T content (66.1%) and lowest G content (12.6%) among all mitochondrial partitions.

  13. Complete mitochondrial genome of the Spadenose shark Scoliodon laticaudus (Carcharhiniformes: Carcharhinidae).

    Science.gov (United States)

    Periasamy, Rengaiyan; Chen, Xiao; Ingole, Baban; Liu, Wenai

    2016-09-01

    The complete mitochondrial genome of the Spadenose shark Scoliodon laticaudus has been determined for the first time in this study. It was 16,695 bp in length and consisted of 37 genes with the typical gene order of vertebrate mitogenomes. The nucleotide base content of the S. laticaudus mitogenome was 31.5% A, 23.7% C, 13.2% G and 31.6% T. Two start codons (GTG and ATG) and three stop codons (AGA, TAG and TAA/T) were used in the protein-coding genes. The 22 tRNAs ranged from 67 bp (tRNA-Cys and tRNA-Ser2) to 75 bp (tRNA-Leu1) in length. The tRNA-Ser2 could not be folded into the typical cloverleaf secondary structure because it lacks the dihydrouridine (DHU) arm stem.

  14. Complete mitochondrial genome of the blacktip reef shark Carcharhinus melanopterus (Carcharhiniformes: Carcharhinidae).

    Science.gov (United States)

    Chen, Xiao; Shen, Xue-Juan; Arunrugstichai, Sirachai; Ai, Weiming; Xiang, Dan

    2016-01-01

    The complete mitochondrial genome of the blacktip reef shark Carcharhinus melanopterus is determined for the first time in this study. The gene composition and order in the mitogenome of C. melanopterus is identical to most vertebrates. The overall base composition is 31.3% A, 25.3% C, 13.3% G and 30.1% T. There are 29 bp overlaps and 21 bp short intergenic spaces in the mitogenome. Two start codons and three stop codons were found in protein-coding genes. The dihydrouridine arm of tRNA-Ser2 was replaced by a simple loop and the other tRNAs could be folded into the typical cloverleaf structure. The termination associated sequence (TAS) and the conserved sequence blocks (CSB1-3) are found in the control region.

  15. Complete Genome Sequence of an Avian Metapneumovirus Subtype A Strain Isolated from Chicken (Gallus gallus) in Brazil

    OpenAIRE

    Rizotto, Laís S.; Scagion, Guilherme P.; Cardoso, Tereza C.; Simão, Raphael M.; Caserta, Leonardo C.; Benassi, Julia C.; Keid, Lara B.; Oliveira, Trícia M. F. de S.; Soares, Rodrigo M.; Arns, Clarice W.; Van Borm, Steven; Ferreira, Helena L.

    2017-01-01

    ABSTRACT We report here the complete genome sequence of an avian metapneumovirus (aMPV) isolated from a tracheal tissue sample of a commercial layer flock. The complete genome sequence of aMPV-A/chicken/Brazil-SP/669/2003 was obtained using MiSeq (Illumina, Inc.) sequencing. Phylogenetic analysis of the complete genome classified the isolate as avian metapneumovirus subtype A.

  16. Complete genome analysis of three Acinetobacter baumannii clinical isolates in China for insight into the diversification of drug resistance elements.

    Science.gov (United States)

    Zhu, Lingxiang; Yan, Zhongqiang; Zhang, Zhaojun; Zhou, Qiming; Zhou, Jinchun; Wakeland, Edward K; Fang, Xiangdong; Xuan, Zhenyu; Shen, Dingxia; Li, Quan-Zhen

    2013-01-01

    The emergence and rapid spreading of multidrug-resistant Acinetobacter baumannii strains has become a major health threat worldwide. To better understand the genetic recombination related to the acquisition of drug-resistant elements during bacterial infection, we performed complete genome analysis on three newly isolated multidrug-resistant A. baumannii strains from Beijing using next-generation sequencing technology. Whole genome comparison revealed that all 3 strains share some common drug resistant elements including carbapenem-resistant bla OXA-23 and tetracycline (tet) resistance islands, but the genome structures are diversified among strains. Various genomic islands intersperse on the genome with transposons and insertions, reflecting the recombination flexibility during the acquisition of the resistant elements. The blood-isolated BJAB07104 and ascites-isolated BJAB0868 exhibit high similarity in their genome structure with most of the global clone II strains, suggesting these two strains belong to the dominant outbreak strains prevalent worldwide. A large resistance island (RI) of about 121-kb, carrying a cluster of resistance-related genes, was inserted into the ATPase gene on BJAB07104 and BJAB0868 genomes. A 78-kb insertion element carrying tra-locus and bla OXA-23 island can be either inserted into one of the tniB genes in the 121-kb RI on the chromosome, or transformed to conjugative plasmid in the two BJAB strains. The third strain of this study, BJAB0715, which was isolated from spinal fluid, exhibits much more divergence compared with the above two strains. It harbors multiple drug-resistance elements including a truncated AbaR-22-like RI on its genome. One of the unique features of this strain is that it carries both bla OXA-23 and bla OXA-58 genes on its genome. Besides, an Acinetobacter lwoffii adeABC efflux element was found inserted into the ATPase position in BJAB0715. Our comparative analysis on currently completed Acinetobacter baumannii

  17. Complete genome analysis of three Acinetobacter baumannii clinical isolates in China for insight into the diversification of drug resistance elements.

    Directory of Open Access Journals (Sweden)

    Lingxiang Zhu

    Full Text Available The emergence and rapid spreading of multidrug-resistant Acinetobacter baumannii strains has become a major health threat worldwide. To better understand the genetic recombination related to the acquisition of drug-resistant elements during bacterial infection, we performed complete genome analysis on three newly isolated multidrug-resistant A. baumannii strains from Beijing using next-generation sequencing technology. Whole genome comparison revealed that all 3 strains share some common drug resistant elements including carbapenem-resistant bla OXA-23 and tetracycline (tet) resistance islands, but the genome structures are diversified among strains. Various genomic islands intersperse on the genome with transposons and insertions, reflecting the recombination flexibility during the acquisition of the resistant elements. The blood-isolated BJAB07104 and ascites-isolated BJAB0868 exhibit high similarity in their genome structure with most of the global clone II strains, suggesting these two strains belong to the dominant outbreak strains prevalent worldwide. A large resistance island (RI) of about 121-kb, carrying a cluster of resistance-related genes, was inserted into the ATPase gene on BJAB07104 and BJAB0868 genomes. A 78-kb insertion element carrying tra-locus and bla OXA-23 island can be either inserted into one of the tniB genes in the 121-kb RI on the chromosome, or transformed to conjugative plasmid in the two BJAB strains. The third strain of this study, BJAB0715, which was isolated from spinal fluid, exhibits much more divergence compared with the above two strains. It harbors multiple drug-resistance elements including a truncated AbaR-22-like RI on its genome. One of the unique features of this strain is that it carries both bla OXA-23 and bla OXA-58 genes on its genome. Besides, an Acinetobacter lwoffii adeABC efflux element was found inserted into the ATPase position in BJAB0715. Our comparative analysis on currently completed

  18. Analysis of the Complete Mitochondrial Genome Sequence of the Diploid Cotton Gossypium raimondii by Comparative Genomics Approaches

    Directory of Open Access Journals (Sweden)

    Changwei Bi

    2016-01-01

    Full Text Available Cotton is one of the most important economic crops and the primary source of natural fiber and is an important protein source for animal feed. The complete nuclear and chloroplast (cp) genome sequences of G. raimondii are already available but not mitochondria. Here, we assembled the complete mitochondrial (mt) DNA sequence of G. raimondii into a circular genome of length of 676,078 bp and performed comparative analyses with other higher plants. The genome contains 39 protein-coding genes, 6 rRNA genes, and 25 tRNA genes. We also identified four larger repeats (63.9 kb, 10.6 kb, 9.1 kb, and 2.5 kb) in this mt genome, which may be active in intramolecular recombination in the evolution of cotton. Strikingly, nearly all of the G. raimondii mt genome has been transferred to nucleus on Chr1, and the transfer event must be very recent. Phylogenetic analysis reveals that G. raimondii, as a member of Malvaceae, is much closer to another cotton (G. barbadense) than other rosids, and the clade formed by two Gossypium species is sister to Brassicales. The G. raimondii mt genome may provide a crucial foundation for evolutionary analysis, molecular biology, and cytoplasmic male sterility in cotton and other higher plants.

  19. Characterization and Comparative Analysis of the Complete Chloroplast Genome of the Critically Endangered Species Streptocarpus teitensis (Gesneriaceae)

    Directory of Open Access Journals (Sweden)

    Cornelius M. Kyalo

    2018-01-01

    Streptocarpus teitensis (Gesneriaceae) is an endemic species listed as critically endangered in the International Union for Conservation of Nature (IUCN) red list of threatened species. However, sequence and genome information for this species remains limited. In this article, we present the complete chloroplast genome structure of Streptocarpus teitensis and its evolution inferred through comparative studies with other related species. S. teitensis displayed a chloroplast genome size of 153,207 bp, sheltering a pair of inverted repeats (IR) of 25,402 bp each split by small and large single-copy (SSC and LSC) regions of 18,300 and 84,103 bp, respectively. The chloroplast genome was observed to contain 116 unique genes, of which 80 are protein-coding, 32 are transfer RNAs, and four are ribosomal RNAs. In addition, a total of 196 SSR markers were detected in the chloroplast genome of Streptocarpus teitensis with mononucleotides (57.1%) being the majority, followed by trinucleotides (33.2%) and dinucleotides and tetranucleotides (both 4.1%), and pentanucleotides being the least (1.5%). Genome alignment indicated that this genome was comparable to other sequenced members of order Lamiales. The phylogenetic analysis suggested that Streptocarpus teitensis is closely related to Lysionotus pauciflorus and Dorcoceras hygrometricum.
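
    The region sizes quoted in this abstract can be cross-checked with simple arithmetic: in a quadripartite chloroplast genome, the total length is the LSC plus the SSC plus two copies of the IR. The following is a minimal sketch in Python that uses only the figures given above; the function name is illustrative and does not come from the study.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length (bp) of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region sizes (bp) as quoted in the abstract for S. teitensis.
total = quadripartite_length(lsc=84_103, ssc=18_300, ir=25_402)
print(total)  # 153207, matching the reported genome size
```

    The same check applies to any record in this list that reports LSC, SSC, and IR sizes together with a total length.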

  20. The complete chloroplast genome of Gracilariopsis lemaneiformis (Rhodophyta) gives new insight into the evolution of family Gracilariaceae.

    Science.gov (United States)

    Du, Qingwei; Bi, Guiqi; Mao, Yunxiang; Sui, Zhenghong

    2016-06-01

    The complete chloroplast genome of Gracilariopsis lemaneiformis was recovered from a Next Generation Sequencing data set. Without quadripartite structure, this chloroplast genome (183,013 bp, 27.40% GC content) contains 202 protein-coding genes, 34 tRNA genes, 3 rRNA genes, and 1 tmRNA gene. Synteny analysis showed plasmid incorporation regions in chloroplast genomes of three species of family Gracilariaceae and in Grateloupia taiwanensis of family Halymeniaceae. Combined with reported red algal plasmid sequences in nuclear and mitochondrial genomes, we postulated that red algal plasmids may have played an important role in ancient horizontal gene transfer among nuclear, chloroplast, and mitochondrial genomes. Substitution rate analysis showed that purifying selective forces maintaining stability of protein-coding genes of nine red algal chloroplast genomes over long periods must be strong and that the forces acting on gene groups and single genes of nine red algal chloroplast genomes were similar and consistent. The divergence of Gp. lemaneiformis occurred ~447.98 million years ago (Mya), close to the divergence time of genus Pyropia and Porphyra (443.62 Mya). © 2016 Phycological Society of America.

  1. The complete sequence of the mitochondrial genome of the African Penguin (Spheniscus demersus).

    Science.gov (United States)

    Labuschagne, Christiaan; Kotzé, Antoinette; Grobler, J Paul; Dalton, Desiré L

    2014-01-15

    The complete mitochondrial genome of the African Penguin (Spheniscus demersus) was sequenced. The molecule was sequenced via next generation sequencing and primer walking. The genome is 17,346 bp in length. It was compared with the mitochondrial DNA of the two other penguin genomes reported so far, namely the Little blue penguin (Eudyptula minor) and the Rockhopper penguin (Eudyptes chrysocome). This analysis made it possible to identify common penguin mitochondrial DNA characteristics. The S. demersus mtDNA genome is very similar, both in composition and length, to both the E. chrysocome and E. minor genomes. The gene content of the African penguin mitochondrial genome is typical of vertebrates and all three penguin species have the standard gene order originally identified in the chicken. The control region for S. demersus is located between tRNA-Glu and tRNA-Phe and all three species of penguins contain two sets of similar repeats with varying copy numbers towards the 3' end of the control region, accounting for the size variance. This is the first report of the complete nucleotide sequence for the mitochondrial genome of the African penguin, S. demersus. These results can be subsequently used to provide information for penguin phylogenetic studies and insights into the evolution of genomes. © 2013 Elsevier B.V. All rights reserved.

  2. Complete genomes of Hairstreak butterflies, their speciation, and nucleo-mitochondrial incongruence.

    Science.gov (United States)

    Cong, Qian; Shen, Jinhui; Borek, Dominika; Robbins, Robert K; Otwinowski, Zbyszek; Grishin, Nick V

    2016-04-28

    Comparison of complete genomes of closely related species enables research on speciation and how phenotype is determined by genotype. Lepidoptera, an insect order of 150,000 species with diverse phenotypes, is well-suited for such comparative genomics studies if new genomes, which cover additional Lepidoptera families, are acquired. We report a 729 Mbp genome assembly of Calycopis cecrops, the first genome from the family Lycaenidae and the largest available Lepidoptera genome. As a detritivore, Calycopis shows expansion in detoxification and digestion enzymes. We further obtained complete genomes of 8 Calycopis specimens: 3 C. cecrops and 5 C. isobeon, including a dry specimen stored in the museum for 30 years. The two species differ subtly in phenotype and cannot be differentiated by mitochondrial DNA. However, nuclear genomes revealed a deep split between them. Genes that can clearly separate the two species (speciation hotspots) mostly pertain to circadian clock, mating behavior, transcription regulation, development and cytoskeleton. The speciation hotspots and their function significantly overlap with those we previously found in Pterourus, suggesting common speciation mechanisms in these butterflies.

  3. Complete Genome Sequence of a Putative Densovirus of the Asian Citrus Psyllid, Diaphorina citri.

    Science.gov (United States)

    Nigg, Jared C; Nouri, Shahideh; Falk, Bryce W

    2016-07-28

    Here, we report the complete genome sequence of a putative densovirus of the Asian citrus psyllid, Diaphorina citri. Diaphorina citri densovirus (DcDNV) was originally identified through metagenomics, and here, we obtained the complete nucleotide sequence using PCR-based approaches. Phylogenetic analysis places DcDNV between viruses of the Ambidensovirus and Iteradensovirus genera. Copyright © 2016 Nigg et al.

  4. Complete Genome Sequence of a Putative Densovirus of the Asian Citrus Psyllid, Diaphorina citri

    OpenAIRE

    Nigg, Jared C.; Nouri, Shahideh; Falk, Bryce W.

    2016-01-01

    Here, we report the complete genome sequence of a putative densovirus of the Asian citrus psyllid, Diaphorina citri. Diaphorina citri densovirus (DcDNV) was originally identified through metagenomics, and here, we obtained the complete nucleotide sequence using PCR-based approaches. Phylogenetic analysis places DcDNV between viruses of the Ambidensovirus and Iteradensovirus genera.

  5. The Complete Chloroplast Genome Sequence of Tree of Heaven (Ailanthus altissima (Mill.) (Sapindales: Simaroubaceae)), an Important Pantropical Tree

    Directory of Open Access Journals (Sweden)

    Josphat K. Saina

    2018-03-01

    Ailanthus altissima (Mill.) Swingle (Simaroubaceae) is a deciduous tree widely distributed throughout temperate regions in China, hence suitable for genetic diversity and evolutionary studies. Previous studies in A. altissima have mainly focused on its biological activities, genetic diversity and genetic structure. However, until now there has been no published report regarding the genome of this plant species or the Simaroubaceae family. Therefore, in this paper, we first characterized the A. altissima complete chloroplast genome sequence. The tree of heaven chloroplast genome was found to be a circular molecule 160,815 base pairs (bp) in size and to possess a quadripartite structure. The A. altissima chloroplast genome contains 113 unique genes, of which 79 and 30 are protein-coding and transfer RNA (tRNA) genes, respectively, and 4 are ribosomal RNA (rRNA) genes, with an overall GC content of 37.6%. Microsatellite marker detection identified A/T mononucleotides as the majority of SSRs in all seven analyzed genomes. Repeat analyses of seven Sapindales revealed a total of 49 repeats in A. altissima, Rhus chinensis, Dodonaea viscosa, Leitneria floridana, while Azadirachta indica, Boswellia sacra, and Citrus aurantiifolia had a total of 48 repeats. The phylogenetic analysis using protein coding genes revealed that A. altissima is a sister to Leitneria floridana and also suggested that Simaroubaceae is a sister to Rutaceae family. The genome information reported here could be further applied for evolution and invasion, population genetics, and molecular studies in this plant species and family.

  6. Phylogenomic relationship of feijoa (Acca sellowiana (O.Berg) Burret) with other Myrtaceae based on complete chloroplast genome sequences.

    Science.gov (United States)

    Machado, Lilian de Oliveira; Vieira, Leila do Nascimento; Stefenon, Valdir Marcos; Oliveira Pedrosa, Fábio de; Souza, Emanuel Maltempi de; Guerra, Miguel Pedro; Nodari, Rubens Onofre

    2017-04-01

    Given their distribution, importance, and richness, Myrtaceae species comprise a model system for studying the evolution of tropical plant diversity. In addition, chloroplast (cp) genome sequencing is an efficient tool for phylogenetic relationship studies. Feijoa [Acca sellowiana (O. Berg) Burret; CN: pineapple-guava] is a Myrtaceae species that occurs naturally in southern Brazil and northern Uruguay. Feijoa is known for its exquisite perfume and flavorful fruits, pharmacological properties, ornamental value and increasing economic relevance. In the present work, we reported the complete cp genome of feijoa. The feijoa cp genome is a circular molecule of 159,370 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC 88,028 bp) and a Small Single Copy region (SSC 18,598 bp) separated by Inverted Repeat regions (IRs 26,372 bp). The genome structure, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. When compared to other cp genome sequences of Myrtaceae, feijoa showed closest relationship with pitanga (Eugenia uniflora L.). Furthermore, a comparison of pitanga synonymous (Ks) and nonsynonymous (Ka) substitution rates revealed extremely low values. Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of three Myrtoideae clades.

  7. The complete mitochondrial genome of eastern lowland gorilla, Gorilla beringei graueri, and comparative mitochondrial genomics of Gorilla species.

    Science.gov (United States)

    Hu, Xiao-di; Gao, Li-zhi

    2016-01-01

    In this study, we determined the complete mitochondrial (mt) genome of eastern lowland gorilla, Gorilla beringei graueri, for the first time. The total genome was 16,416 bp in length. It contained a total of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 1 control region (D-loop region). The base composition was A (30.88%), G (13.10%), C (30.89%) and T (25.13%), indicating that the percentage of A+T (56.01%) was higher than G+C (43.99%). Comparisons with the other publicly available Gorilla mitogenome showed conservation of gene order and base composition but considerable nucleotide diversity. This complete mitochondrial genome sequence will provide valuable genetic information for further studies on conservation genetics of eastern lowland gorilla.
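
    The A+T and G+C percentages reported here follow directly from per-base counts. As a minimal sketch of that calculation in Python, assuming a plain nucleotide string as input; the function name and the toy sequence are hypothetical and are not taken from the study:

```python
from collections import Counter


def base_composition(seq: str) -> dict[str, float]:
    """Percentage of each of A, C, G, T among the unambiguous bases in seq."""
    counts = Counter(seq.upper())
    total = sum(counts[base] for base in "ACGT")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


# Toy sequence for illustration only (not real gorilla mtDNA).
comp = base_composition("AATTCCGGAT")
at_content = comp["A"] + comp["T"]
gc_content = comp["G"] + comp["C"]

# Cross-check of the percentages quoted in the abstract.
reported_at = round(30.88 + 25.13, 2)  # A + T
reported_gc = round(13.10 + 30.89, 2)  # G + C
```

    Ambiguous bases such as N are excluded from the denominator in this sketch; whether the authors did the same is not stated in the abstract.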

  8. Complete DNA sequence of the mitochondrial genome of the treehopper Leptobelus gazella (Membracoidea: Hemiptera).

    Science.gov (United States)

    Zhao, Xing; Liang, Ai-Ping

    2016-09-01

    The first complete DNA sequence of the mitochondrial genome (mitogenome) of Leptobelus gazella (Membracoidea: Hemiptera) is determined in this study. The circular molecule is 16,007 bp in its full length, which encodes a set of 37 genes, including 13 proteins, 2 ribosomal RNAs, 22 transfer RNAs, and contains an A + T-rich region (CR). The gene numbers, content, and organization of L. gazella are similar to other typical metazoan mitogenomes. Twelve of the 13 PCGs are initiated with ATR methionine or ATT isoleucine codons, except the atp8 gene that uses the ATC isoleucine as start signal. Ten of the 13 PCGs have complete termination codons, either TAA (nine genes) or TAG (cytb). The remaining 3 PCGs (cox1, cox2 and nad5) have incomplete termination codons T (AA). All of the 22 tRNAs can be folded in the form of a typical clover-leaf structure. The complete mitogenome sequence data of L. gazella is useful for the phylogenetic and biogeographic studies of the Membracoidea and Hemiptera.

  9. Complete mitochondrial genome of four pheretimoid earthworms (Clitellata: Oligochaeta) and their phylogenetic reconstruction.

    Science.gov (United States)

    Zhang, Liangliang; Jiang, Jibao; Dong, Yan; Qiu, Jiangping

    2015-12-15

    Among oligochaetes, the Pheretima complex within the Megascolecidae is a major earthworm group. Recently, however, the systematics of the Pheretima complex based on morphology have been challenged by molecular studies. Since little comparative analysis of earthworm complete mitochondrial genomes has been reported yet, we sequenced mitogenomes of four pheretimoid earthworm species to explore their phylogenetic relationships. The general earthworm genomic features are also found in the four earthworms: all genes transcribed from the same strand, the same initiation codon ATG for each PCG, and conserved structures of RNA genes. Interestingly, we find an extra potential tRNA-leucine (CUN) in Amynthas longisiphonus. The earthworm mitochondrial ATP8 exhibits the highest evolutionary rate, while the gene CO1 evolves slowest. Phylogenetic analysis based on protein-coding genes (PCGs) strongly supports the monophyly of the Clitellata, Hirudinea, Oligochaeta, Megascolecidae and Pheretima complex. Our analysis, however, reveals non-monophyly within the genera Amynthas and Metaphire. Thus the generic divisions based on morphology in the Pheretima complex should be reconsidered. Copyright © 2015 Elsevier B.V. All rights reserved.

  10. Complete genome sequence of a commensal bacterium, Hafnia alvei CBA7124, isolated from human feces.

    Science.gov (United States)

    Song, Hye Seon; Kim, Joon Yong; Kim, Yeon Bee; Jeong, Myeong Seon; Kang, Jisu; Rhee, Jin-Kyu; Kwon, Joseph; Kim, Ju Suk; Choi, Jong-Soon; Choi, Hak-Jong; Nam, Young-Do; Roh, Seong Woon

    2017-01-01

    Members of the genus Hafnia have been isolated from the feces of mammals, birds, reptiles, and fish, as well as from soil, water, sewage, and foods. Hafnia alvei is an opportunistic pathogen that has been implicated in intestinal and extraintestinal infections in humans. However, its pathogenicity is still unclear. In this study, we isolated H. alvei from human feces and performed sequencing as well as comparative genomic analysis to better understand its pathogenicity. The genome of H. alvei CBA7124 comprised a single circular chromosome with 4,585,298 bp and a GC content of 48.8%. The genome contained 25 rRNA genes (9 5S rRNA genes, 8 16S rRNA genes, and 8 23S rRNA genes), 88 tRNA genes, and 4043 protein-coding genes. Using comparative genomic analysis, the genome of this strain was found to have 72 strain-specific singletons. The genome also contained genes for antibiotic and antimicrobial resistance, as well as toxin-antitoxin systems. We revealed the complete genome sequence of the opportunistic gut pathogen, H. alvei CBA7124. We also performed comparative genomic analysis of the sequences in the genome of H. alvei CBA7124, and found that it contained strain-specific singletons, antibiotic resistance genes, and toxin-antitoxin systems. These results could improve our understanding of the pathogenicity and the mechanism behind the antibiotic resistance of H. alvei strains.

  11. Complete Genome Sequence of the Soybean Symbiont Bradyrhizobium japonicum Strain USDA6T

    Directory of Open Access Journals (Sweden)

    Nobukazu Uchiike

    2011-10-01

    The complete nucleotide sequence of the genome of the soybean symbiont Bradyrhizobium japonicum strain USDA6T was determined. The genome of USDA6T is a single circular chromosome of 9,207,384 bp. The genome size is similar to that of the genome of another soybean symbiont, B. japonicum USDA110 (9,105,828 bp). Comparison of the whole-genome sequences of USDA6T and USDA110 showed colinearity of major regions in the two genomes, although a large inversion exists between them. A significantly high level of sequence conservation was detected in three regions on each genome. The gene constitution and nucleotide sequence features in these three regions indicate that they may have been derived from a symbiosis island. An ancestral, large symbiosis island, approximately 860 kb in total size, appears to have been split into these three regions by unknown large-scale genome rearrangements. The two integration events responsible for this appear to have taken place independently, but through comparable mechanisms, in both genomes.

  12. The complete genome sequence and comparative genome analysis of the high pathogenicity Yersinia enterocolitica strain 8081.

    Directory of Open Access Journals (Sweden)

    Nicholas R Thomson

    2006-12-01

    The human enteropathogen, Yersinia enterocolitica, is a significant link in the range of Yersinia pathologies extending from mild gastroenteritis to bubonic plague. Comparison at the genomic level is a key step in our understanding of the genetic basis for this pathogenicity spectrum. Here we report the genome of Y. enterocolitica strain 8081 (serotype 0:8; biotype 1B) and extensive microarray data relating to the genetic diversity of the Y. enterocolitica species. Our analysis reveals that the genome of Y. enterocolitica strain 8081 is a patchwork of horizontally acquired genetic loci, including a plasticity zone of 199 kb containing an extraordinarily high density of virulence genes. Microarray analysis has provided insights into species-specific Y. enterocolitica gene functions and the intraspecies differences between the high, low, and nonpathogenic Y. enterocolitica biotypes. Through comparative genome sequence analysis we provide new information on the evolution of the Yersinia. We identify numerous loci that represent ancestral clusters of genes potentially important in enteric survival and pathogenesis, which have been lost or are in the process of being lost, in the other sequenced Yersinia lineages. Our analysis also highlights large metabolic operons in Y. enterocolitica that are absent in the related enteropathogen, Yersinia pseudotuberculosis, indicating major differences in niche and nutrients used within the mammalian gut. These include clusters directing the production of hydrogenases, tetrathionate respiration, cobalamin synthesis, and propanediol utilisation. Along with ancestral gene clusters, the genome of Y. enterocolitica has revealed species-specific and enteropathogen-specific loci. This has provided important insights into the pathology of this bacterium and, more broadly, into the evolution of the genus.
Moreover, wider investigations looking at the patterns of gene loss and gain in the Yersinia have highlighted common

  13. The evolution of human influenza A viruses from 1999 to 2006: A complete genome study

    Directory of Open Access Journals (Sweden)

    Fomsgaard Anders

    2008-03-01

    Background: Knowledge about the complete genome constellation of seasonal influenza A viruses from different countries is valuable for monitoring and understanding of the evolution and migration of strains. Few complete genome sequences of influenza A viruses from Europe are publicly available at the present time and there have been few longitudinal genome studies of human influenza A viruses. We have studied the evolution of circulating human H3N2, H1N1 and H1N2 influenza A viruses from 1999 to 2006, we analysed 234 Danish human influenza A viruses and characterised 24 complete genomes. Results: H3N2 was the prevalent strain in Denmark during the study period, but H1N1 dominated the 2000–2001 season. H1N2 viruses were first observed in Denmark in 2002–2003. After years of little genetic change in the H1N1 viruses the 2005–2006 season presented H1N1 of greater variability than before. This indicates that H1N1 viruses are evolving and that H1N1 soon is likely to be the prevalent strain again. Generally, the influenza A haemagglutinin (HA) of H3N2 viruses formed seasonal phylogenetic clusters. Different lineages co-circulating within the same season were also observed. The evolution has been stochastic, influenced by small "jumps" in genetic distance rather than constant drift, especially with the introduction of the Fujian-like viruses in 2002–2003. Also evolutionary stasis-periods were observed which might indicate well fit viruses. The evolution of H3N2 viruses have also been influenced by gene reassortments between lineages from different seasons. None of the influenza genes were influenced by strong positive selection pressure. The antigenic site B in H3N2 HA was the preferred site for genetic change during the study period probably because the site A has been masked by glycosylations. Substitutions at CTL-epitopes in the genes coding for the neuraminidase (NA), polymerase acidic protein (PA), matrix protein 1 (M1), non-structural

  14. The evolution of human influenza A viruses from 1999 to 2006: a complete genome study.

    Science.gov (United States)

    Bragstad, Karoline; Nielsen, Lars P; Fomsgaard, Anders

    2008-03-07

    Knowledge about the complete genome constellation of seasonal influenza A viruses from different countries is valuable for monitoring and understanding of the evolution and migration of strains. Few complete genome sequences of influenza A viruses from Europe are publicly available at the present time and there have been few longitudinal genome studies of human influenza A viruses. We have studied the evolution of circulating human H3N2, H1N1 and H1N2 influenza A viruses from 1999 to 2006, we analysed 234 Danish human influenza A viruses and characterised 24 complete genomes. H3N2 was the prevalent strain in Denmark during the study period, but H1N1 dominated the 2000-2001 season. H1N2 viruses were first observed in Denmark in 2002-2003. After years of little genetic change in the H1N1 viruses the 2005-2006 season presented H1N1 of greater variability than before. This indicates that H1N1 viruses are evolving and that H1N1 soon is likely to be the prevalent strain again. Generally, the influenza A haemagglutinin (HA) of H3N2 viruses formed seasonal phylogenetic clusters. Different lineages co-circulating within the same season were also observed. The evolution has been stochastic, influenced by small "jumps" in genetic distance rather than constant drift, especially with the introduction of the Fujian-like viruses in 2002-2003. Also evolutionary stasis-periods were observed which might indicate well fit viruses. The evolution of H3N2 viruses have also been influenced by gene reassortments between lineages from different seasons. None of the influenza genes were influenced by strong positive selection pressure. The antigenic site B in H3N2 HA was the preferred site for genetic change during the study period probably because the site A has been masked by glycosylations. 
Substitutions at CTL-epitopes in the genes coding for the neuraminidase (NA), polymerase acidic protein (PA), matrix protein 1 (M1), non-structural protein 1 (NS1) and especially the nucleoprotein (NP

  15. Insights from the complete chloroplast genome into the evolution of Sesamum indicum L.

    Directory of Open Access Journals (Sweden)

    Haiyang Zhang

    Sesame (Sesamum indicum L.) is one of the oldest oilseed crops. In order to investigate the evolutionary characters according to the Sesame Genome Project, apart from sequencing its nuclear genome, we sequenced the complete chloroplast genome of S. indicum cv. Yuzhi 11 (white seeded) using Illumina and 454 sequencing. Comparisons of chloroplast genomes between S. indicum and the 18 other higher plants were then analyzed. The chloroplast genome of cv. Yuzhi 11 contains 153,338 bp and a total of 114 unique genes (KC569603). The number of chloroplast genes in sesame is the same as that in Nicotiana tabacum, Vitis vinifera and Platanus occidentalis. The variation in the length of the large single-copy (LSC) regions and inverted repeats (IR) in sesame compared to 18 other higher plant species was the main contributor to size variation in the cp genome in these species. The 77 functional chloroplast genes, except for ycf1 and ycf2, were highly conserved. The deletion of the cp ycf1 gene sequence in cp genomes may be due either to its transfer to the nuclear genome, as has occurred in sesame, or direct deletion, as has occurred in Panax ginseng and Cucumis sativus. The sesame ycf2 gene is only 5,721 bp in length and has lost about 1,179 bp. Nucleotides 1-585 of ycf2 when queried in BLAST had hits in the sesame draft genome. Five repeats (R10, R12, R13, R14 and R17) were unique to the sesame chloroplast genome. We also found that IR contraction/expansion in the cp genome alters its rate of evolution. Chloroplast genes and repeats display the signature of convergent evolution in sesame and other species. These findings provide a foundation for further investigation of cp genome evolution in Sesamum and other higher plants.

  16. The complete mitochondrial genome of the deep-sea sponge Poecillastra laminaris (Astrophorida, Vulcanellidae).

    Science.gov (United States)

    Zeng, Cong; Thomas, Leighton J; Kelly, Michelle; Gardner, Jonathan P A

    2016-05-01

    The complete mitochondrial genome of a New Zealand specimen of the deep-sea sponge Poecillastra laminaris (Sollas, 1886) (Astrophorida, Vulcanellidae), from the Colville Ridge, New Zealand, was sequenced using the 454 Life Science pyrosequencing system. To identify homologous mitochondrial sequences, the 454 reads were mapped to the complete mitochondrial genome sequence of Geodia neptuni (GenBank No. NC_006990). The P. laminaris genome is 18,413 bp in length and includes 14 protein-coding genes, 24 transfer RNA genes and 2 ribosomal RNA genes. Gene order resembled that of other demosponges. The base composition of the genome is A (29.1%), T (35.2%), C (14.0%) and G (21.7%). This is the second published mitogenome for a sponge of the order Astrophorida and will be useful in future phylogenetic analysis of deep-sea sponges.

  17. Complete genome sequence of Beutenbergia cavernae type strain (HKI 0122T)

    Energy Technology Data Exchange (ETDEWEB)

    Land, Miriam; Pukall, Rudiger; Abt, Birte; Goker, Markus; Rohde, Manfred; Glavina Del Rio, Tijana; Tice, Hope; Copeland, Alex; Cheng, Jan-Fang; Lucas, Susan; Chen, Feng; Nolan, Matt; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ivanova, Natalia; Mavrommatis, Konstantinos; Ovchinnikova, Galina; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia C.; Saunders, Elizabeth; Brettin, Thomas; Detter, John C.; Han, Cliff; Chain, Patrick; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter; Lapidus, Alla

    2009-05-20

    Beutenbergia cavernae (Groth et al. 1999) is the type species of the genus and is of phylogenetic interest because of its isolated location in the actinobacterial suborder Micrococcineae. B. cavernae HKI 0122T is a Gram-positive, non-motile, non-spore-forming bacterium isolated from a cave in Guangxi (China). B. cavernae grows best under aerobic conditions and shows a rod-coccus growth cycle. Its cell wall peptidoglycan contains the diagnostic L-lysine - L-glutamate interpeptide bridge. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first completed genome sequence from the poorly populated micrococcineal family Beutenbergiaceae, and this 4,669,183 bp long single replicon genome with its 4225 protein-coding and 53 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  18. Complete genome sequence of Cryptobacterium curtum type strain (12-3T)

    Energy Technology Data Exchange (ETDEWEB)

    Mavromatis, Konstantinos; Pukall, Rudiger; Rohde, Christine; Sims, David; Brettin, Thomas; Kuske, Cheryl; Detter, John C.; Han, Cliff; Lapidus, Alla; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ovchinnikova, Galina; Pati, Amrita; Ivanova, Natalia; Chen, Amy; Palaniappan, Krishna; Chain, Patrick; D'haeseleer, Patrik; Bristow, Jim; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Rohde, Manfred; Klenk, Hans-Peter; Kyrpides, Nikos C.

    2009-05-20

    Cryptobacterium curtum Nakazawa et al. 1999 is the type species of the genus, and is of phylogenetic interest because of its very distant and isolated position within the family Coriobacteriaceae. C. curtum is an asaccharolytic, opportunistic pathogen with a typical occurrence in the oral cavity, involved in dental and oral infections like periodontitis, inflammations and abscesses. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of the actinobacterial family Coriobacteriaceae, and this 1,617,804 bp long single replicon genome with its 1364 protein-coding and 58 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  19. Complete genome sequence of Pedobacter heparinus type strain (HIM 762-3T)

    Energy Technology Data Exchange (ETDEWEB)

    Han, Cliff; Spring, Stefan; Lapidus, Alla; Glavina Del Rio, Tijana; Tice, Hope; Copeland, Alex; Cheng, Jan-Fang; Lucas, Susan; Chen, Feng; Nolan, Matt; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ivanova, Natalia; Mavrommatis, Konstantinos; Mikhailova, Natalia; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia C.; Saunders, Elizabeth; Chertkov, Olga; Brettin, Thomas; Goker, Markus; Rohde, Manfred; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter; Detter, John C.

    2009-05-20

    Pedobacter heparinus (Payza and Korn 1956) Steyn et al. 1998 comb. nov. is the type species of the rapidly growing genus Pedobacter within the family Sphingobacteriaceae of the phylum 'Bacteroidetes'. P. heparinus is of interest, because it was the first isolated strain shown to grow with heparin as sole carbon and nitrogen source and because it produces several enzymes involved in the degradation of mucopolysaccharides. All available data about this species are based on a sole strain that was isolated from dry soil. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first report on a complete genome sequence of a member of the genus Pedobacter, and the 5,167,383 bp long single replicon genome with its 4287 protein-coding and 54 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  20. Complete genome sequence of Saccharomonospora viridis type strain (P101T)

    Energy Technology Data Exchange (ETDEWEB)

    Pati, Amrita; Sikorski, Johannes; Nolan, Matt; Lapidus, Alla; Copeland, Alex; Glavina Del Rio, Tijana; Lucas, Susan; Chen, Feng; Tice, Hope; Pitluck, Sam; Cheng, Jan-Fang; Chertkov, Olga; Brettin, Thomas; Han, Cliff; Detter, John C.; Kuske, Cheryl; Bruce, David; Goodwin, Lynne; Chain, Patrick; D' haeseleer, Patrik; Chen, Amy; Palaniappan, Krishna; Ivanova, Natalia; Mavromatis, Konstantinos; Mikhailova, Natalia; Rohde, Manfred; Tindall, Brian J.; Goker, Markus; Bristow, Jim; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2009-05-20

    Saccharomonospora viridis (Schuurmans et al. 1956) Nonomurea and Ohara 1971 is the type species of the genus Saccharomonospora which belongs to the family Pseudonocardiaceae. S. viridis is of interest because it is a Gram-negative organism classified amongst the usually Gram-positive actinomycetes. Members of the species are frequently found in hot compost and hay, and its spores can cause farmer's lung disease, bagassosis, and humidifier fever. Strains of the species S. viridis have been found to metabolize the xenobiotic pentachlorophenol (PCP). The strain described in this study has been isolated from peat-bog in Ireland. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of the family Pseudonocardiaceae, and the 4,308,349 bp long single replicon genome with its 3906 protein-coding and 64 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  1. Complete genome sequence of Dyadobacter fermentans type strain (NS114T)

    Energy Technology Data Exchange (ETDEWEB)

    Lang, Elke; Lapidus, Alla; Chertkov, Olga; Brettin, Thomas; Detter, John C.; Han, Cliff; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Chen, Feng; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jeffries, Cynthia; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ovchinnikova, Galina; Pati, Amrita; Ivanova, Natalia; Mavromatis, Konstantinos; Chen, Amy; Chain, Patrick; Bristow, Jim; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Goker, Markus; Rohde, Manfred; Kyrpides, Nikos C; Klenk, Hans-Peter

    2009-05-20

    Dyadobacter fermentans (Chelius MK and Triplett EW, 2000) is the type species of the genus Dyadobacter. It is of phylogenetic interest because of its location in the Cytophagaceae, a very diverse family within the order 'Sphingobacteriales'. D. fermentans has a mainly respiratory metabolism, stains Gram-negative, is non-motile and oxidase and catalase positive. It is characterized by the production of cell filaments in ageing cultures, a flexirubin-like pigment and its ability to ferment glucose, which is almost unique in the aerobically living members of this taxonomically difficult family. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of the 'sphingobacterial' genus Dyadobacter, and this 6,967,790 bp long single replicon genome with its 5804 protein-coding and 50 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  2. Complete genome sequence of Brachybacterium faecium type strain (Schefferle 6-10T)

    Energy Technology Data Exchange (ETDEWEB)

    Lapidus, Alla; Pukall, Rudiger; LaButti, Kurt; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Chen, Feng; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Rohde, Manfred; Goker, Markus; Pati, Amrita; Ivanova, Natalia; Mavrommatis, Konstantinos; Chen, Amy; Palaniappan, Krishna; D' haeseleer, Patrik; Chain, Patrick; Bristow, Jim; Eisen, Johnathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2009-05-20

    Brachybacterium faecium Collins et al. 1988 is the type species of the genus, and is of phylogenetic interest because of its location in the Dermabacteraceae, a rather isolated family within the actinobacterial suborder Micrococcineae. B. faecium is known for its rod-coccus growth cycle and the ability to degrade uric acid. It grows aerobically or weakly anaerobically. The strain described in this report is a free-living, nonmotile, Gram-positive bacterium, originally isolated from poultry deep litter. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of a member of the actinobacterial family Dermabacteraceae, and the 3,614,992 bp long single replicon genome with its 3129 protein-coding and 69 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  3. Isolation and complete genome sequencing of Mimivirus bombay, a Giant Virus in sewage of Mumbai, India

    Directory of Open Access Journals (Sweden)

    Anirvan Chatterjee

    2016-09-01

    We report the isolation and complete genome sequencing of a new Mimiviridae family member, infecting Acanthamoeba castellanii, from sewage in Mumbai, India. The isolated virus has a particle size of about 435 nm and a 1,182,200-bp genome. A phylogeny based on the DNA polymerase sequence placed the isolate as a new member of the Mimiviridae family lineage A, and it was named Mimivirus bombay. The extensive presence of Mimiviridae family members in different environmental niches, with remarkably similar genome size and genetic makeup, points towards an evolutionary advantage that needs to be further investigated. The complete genome sequence of Mimivirus bombay was deposited at GenBank/EMBL/DDBJ under the accession number KU761889.

  4. Complete mitochondrial genome of the Indian peafowl (Pavo cristatus), with phylogenetic analysis in phasianidae.

    Science.gov (United States)

    Zhou, Tai-Cheng; Sha, Tao; Irwin, David M; Zhang, Ya-Ping

    2015-01-01

    Pavo cristatus, known as the Indian peafowl, is endemic to India and Sri Lanka and has been domesticated for its ornamental and food value. However, its phylogenetic status is still debated. Here, to clarify the phylogenetic status of P. cristatus within Phasianidae, we analyzed its mitochondrial genome (mtDNA). The complete mtDNA genome was determined using 34 pairs of primers. Our data show that the mtDNA genome of P. cristatus is 16,686 bp in length. Molecular phylogenetic analyses of P. cristatus were performed along with 22 complete mtDNA genomes belonging to other species in Phasianidae using Bayesian and maximum likelihood methods, with Aythya americana and Anas platyrhynchos used as outgroups. Our results show that P. cristatus has its closest genetic affinity with Pavo muticus and belongs to a clade that contains Gallus, Bambusicola and Francolinus.

  5. Complete genome sequence of Desulfohalobium retbaense type strain (HR100T)

    Energy Technology Data Exchange (ETDEWEB)

    Spring, Stefan [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute]; Copeland, A [U.S. Department of Energy, Joint Genome Institute]; Tice, Hope [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Land, Miriam L [ORNL]; Chen, Feng [U.S. Department of Energy, Joint Genome Institute]; Bruce, David [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [U.S. Department of Energy, Joint Genome Institute]; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute]; Hauser, Loren John [ORNL]; Chang, Yun-Juan [ORNL]; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL)]; Munk, Christine [U.S. Department of Energy, Joint Genome Institute]; Kiss, Hajnalka [Los Alamos National Laboratory (LANL)]; Chain, Patrick S. G. [Lawrence Livermore National Laboratory (LLNL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Brettin, Thomas S [ORNL]; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute]; Schuler, Esther [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Bristow, James [U.S. Department of Energy, Joint Genome Institute]; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute]; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]

    2010-01-01

    Desulfohalobium retbaense (Ollivier et al. 1991) is the type species of the polyphyletic genus Desulfohalobium, which comprises, at the time of writing, two species and represents the family Desulfohalobiaceae within the Deltaproteobacteria. D. retbaense is a moderately halophilic sulfate-reducing bacterium, which can utilize H2 and a limited range of organic substrates, which are incompletely oxidized to acetate and CO2, for growth. The type strain HR100T was isolated from sediments of the hypersaline Retba Lake in Senegal. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first completed genome sequence of a member of the family Desulfohalobiaceae. The 2,909,567 bp genome (one chromosome and a 45,263 bp plasmid) with its 2,552 protein-coding and 57 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  6. Complete genome sequence of Mahella australiensis type strain (50-1 BONT)

    Energy Technology Data Exchange (ETDEWEB)

    Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Teshima, Hazuki [Los Alamos National Laboratory (LANL)]; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Hammon, Nancy [U.S. Department of Energy, Joint Genome Institute]; Deshpande, Shweta [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute]; Pagani, Ioanna [U.S. Department of Energy, Joint Genome Institute]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Huntemann, Marcel [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Ovchinnikova, Galina [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Chen, Amy [U.S. Department of Energy, Joint Genome Institute]; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Ngatchou, Olivier Duplex [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Pukall, Rudiger [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Spring, Stefan [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Abt, Birte [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute]; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute]; Bristow, James [U.S. Department of Energy, Joint Genome Institute]; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]

    2011-01-01

    Mahella australiensis Bonilla Salinas et al. 2004 is the type species of the genus Mahella, which belongs to the family Thermoanaerobacteraceae. The species is of interest because it differs from other known anaerobic spore-forming bacteria in its G+C content, and in certain phenotypic traits, such as carbon source utilization and relationship to temperature. Moreover, it has been discussed that this species might be an indigenous member of petroleum and oil reservoirs. This is the first completed genome sequence of a member of the genus Mahella and the ninth completed type strain genome sequence from the family Thermoanaerobacteraceae. The 3,135,972 bp long genome with its 2,974 protein-coding and 59 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  7. The complete chloroplast genome of a medicinal plant Epimedium koreanum Nakai (Berberidaceae).

    Science.gov (United States)

    Lee, Jung-Hoon; Kim, Kyunghee; Kim, Na-Rae; Lee, Sang-Choon; Yang, Tae-Jin; Kim, Young-Dong

    2016-11-01

    Epimedium koreanum is a perennial medicinal plant distributed in Eastern Asia. The complete chloroplast genome sequence of E. koreanum was obtained by de novo assembly using whole-genome next-generation sequences. The chloroplast genome of E. koreanum is 157,218 bp in length and is divided into four distinct regions: a large single-copy region (89,600 bp), a small single-copy region (17,222 bp) and a pair of inverted repeat regions (25,198 bp each). The genome contains a total of 112 genes, including 78 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Phylogenetic analysis with the reported chloroplast genomes revealed that E. koreanum is most closely related to Berberis bealei, a traditional medicinal plant in the Berberidaceae family.
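
    The region lengths reported in this abstract can be checked against the stated total: in a quadripartite chloroplast genome the inverted repeat is present in two copies, so the total is the two single-copy regions plus twice the repeat length. A minimal sketch in Python follows; the function and variable names are illustrative assumptions and do not come from the study.

```python
# Region lengths (bp) as reported in the abstract for E. koreanum.
LSC = 89_600  # large single-copy region
SSC = 17_222  # small single-copy region
IR = 25_198   # one copy of the inverted repeat


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total plastome length when the inverted repeat occurs twice."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(total)  # 157218, matching the reported genome length
```

    The sum agrees with the reported 157,218 bp only if the 25,198 bp figure is read as the length of each repeat copy, not of the pair.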

  8. Minimally destructive sampling of type specimens of Pyropia (Bangiales, Rhodophyta) recovers complete plastid and mitochondrial genomes.

    Science.gov (United States)

    Hughey, Jeffery R; Gabrielson, Paul W; Rohmer, Laurence; Tortolani, Jacquie; Silva, Mayra; Miller, Kathy Ann; Young, Joel D; Martell, Craig; Ruediger, Erik

    2014-06-04

    Plant species, including algae and fungi, are based on type specimens to which the name of a taxon is permanently attached. Applying a scientific name to any specimen therefore requires demonstrating correspondence between the type and that specimen. Traditionally, identifications are based on morpho-anatomical characters, but recently systematists are using DNA sequence data. These studies are flawed if the DNA is isolated from misidentified modern specimens. We propose a genome-based solution. Using 4 × 4 mm² of material from type specimens, we assembled 14 plastid and 15 mitochondrial genomes attributed to the red algae Pyropia perforata, Py. fucicola, and Py. kanakaensis. The chloroplast genomes were fairly conserved, but the mitochondrial genomes differed significantly among populations in content and length. Complete genomes are attainable from 19th and early 20th century type specimens; this validates the effort and cost of their curation as well as supports the practice of the type method.

  9. Minimally destructive sampling of type specimens of Pyropia (Bangiales, Rhodophyta) recovers complete plastid and mitochondrial genomes

    Science.gov (United States)

    Hughey, Jeffery R.; Gabrielson, Paul W.; Rohmer, Laurence; Tortolani, Jacquie; Silva, Mayra; Miller, Kathy Ann; Young, Joel D.; Martell, Craig; Ruediger, Erik

    2014-01-01

    Plant species, including algae and fungi, are based on type specimens to which the name of a taxon is permanently attached. Applying a scientific name to any specimen therefore requires demonstrating correspondence between the type and that specimen. Traditionally, identifications are based on morpho-anatomical characters, but recently systematists are using DNA sequence data. These studies are flawed if the DNA is isolated from misidentified modern specimens. We propose a genome-based solution. Using 4 × 4 mm² of material from type specimens, we assembled 14 plastid and 15 mitochondrial genomes attributed to the red algae Pyropia perforata, Py. fucicola, and Py. kanakaensis. The chloroplast genomes were fairly conserved, but the mitochondrial genomes differed significantly among populations in content and length. Complete genomes are attainable from 19th and early 20th century type specimens; this validates the effort and cost of their curation as well as supports the practice of the type method. PMID:24894641

  10. Characterization of the complete mitochondrial genome of Acanthoscelides obtectus (Coleoptera: Chrysomelidae: Bruchinae) with phylogenetic analysis.

    Science.gov (United States)

    Yao, Jie; Yang, Hong; Dai, Renhuai

    2017-10-01

    Acanthoscelides obtectus is a common species of the subfamily Bruchinae and a worldwide-distributed seed-feeding beetle. The complete mitochondrial genome of A. obtectus is 16,130 bp in length with an A + T content of 76.4%. It has a positive AT skew and a negative GC skew. The mitogenome of A. obtectus contains 13 protein-coding genes (PCGs), 22 tRNA genes, two rRNA genes and a non-coding region (D-loop). All PCGs start with an ATN codon; seven (ND3, ATP6, COIII, ND3, ND4L, ND6, and Cytb) terminate with TAA, five (COI, COII, ND1, ND4, and ND5) terminate with a single T, and ATP8 terminates with TGA. Except for tRNA-Ser, the remaining 21 tRNAs can be folded into a typical clover-leaf secondary structure. The secondary structures of lrRNA and srRNA were also predicted in this study. There are six domains with 48 helices in lrRNA and three domains with 32 helices in srRNA. The control region of A. obtectus is 1354 bp in size and has the highest A + T content (83.5%) in the mitogenome. Thirteen PCGs from 19 species were used to infer their phylogenetic relationships. Our results show that A. obtectus belongs to the family Chrysomelidae (subfamily Bruchinae). This is the first study on phylogenetic analyses involving the mitochondrial genes of A. obtectus and could provide basic data for future studies of mitochondrial genome diversity and the evolution of related insect lineages.

  11. A complete mitochondrial genome sequence from a mesolithic wild aurochs (Bos primigenius).

    LENUS (Irish Health Repository)

    Edwards, Ceiridwen J

    2010-01-01

    BACKGROUND: The derivation of domestic cattle from the extinct wild aurochs (Bos primigenius) has been well-documented by archaeological and genetic studies. Genetic studies point towards the Neolithic Near East as the centre of origin for Bos taurus, with some lines of evidence suggesting possible, albeit rare, genetic contributions from locally domesticated wild aurochsen across Eurasia. Inferences from these investigations have been based largely on the analysis of partial mitochondrial DNA sequences generated from modern animals, with limited sequence data from ancient aurochsen samples. Recent developments in DNA sequencing technologies, however, are affording new opportunities for the examination of genetic material retrieved from extinct species, providing new insight into their evolutionary history. Here we present DNA sequence analysis of the first complete mitochondrial genome (16,338 base pairs) from an archaeologically-verified and exceptionally-well preserved aurochs bone sample. METHODOLOGY: DNA extracts were generated from an aurochs humerus bone sample recovered from a cave site located in Derbyshire, England and radiocarbon-dated to 6,738 ± 68 calibrated years before present. These extracts were prepared for both Sanger and next generation DNA sequencing technologies (Illumina Genome Analyzer). In total, 289.9 megabases (22.48%) of the post-filtered DNA sequences generated using the Illumina Genome Analyzer from this sample mapped with confidence to the bovine genome. A consensus B. primigenius mitochondrial genome sequence was constructed and was analysed alongside all available complete bovine mitochondrial genome sequences. CONCLUSIONS: For all nucleotide positions where both Sanger and Illumina Genome Analyzer sequencing methods gave high-confidence calls, no discrepancies were observed. Sequence analysis reveals evidence of heteroplasmy in this sample and places this mitochondrial genome sequence securely within a previously identified

  12. Complete Genome Sequence of the Endophytic Biocontrol Strain Bacillus velezensis CC09.

    Science.gov (United States)

    Cai, Xunchao; Kang, Xingxing; Xi, Huan; Liu, Changhong; Xue, Yarong

    2016-09-29

    Bacillus velezensis is a heterotypic synonym of B. methylotrophicus, B. amyloliquefaciens subsp. plantarum, and Bacillus oryzicola, and has been used to control plant fungal diseases. To fully understand the genetic basis of its antimicrobial capacities, we sequenced the complete genome of the endophytic B. velezensis strain CC09. Genes tightly associated with biocontrol ability, including those for nonribosomal peptide synthetases, polyketide synthetases, iron acquisition, colonization, and volatile organic compound synthesis, were identified in the genome. Copyright © 2016 Cai et al.

  13. The complete mitochondrial DNA genome of a greater horseshoe bat subspecies, Rhinolophus ferrumequinum quelpartis (Chiroptera: Rhinolophidae).

    Science.gov (United States)

    Yoon, Kwang Bae; Kim, Ji Young; Kim, Hye Ri; Cho, Jae Youl; Park, Yung Chul

    2013-02-01

    There are two subspecies of Rhinolophus ferrumequinum currently recognized in South Korea. The Korean greater horseshoe bat subspecies, Rhinolophus ferrumequinum quelpartis, is distributed only on Jeju Island. The complete mitochondrial genome of the island subspecies was determined and showed 99.7% similarity to that of the mainland subspecies Rhinolophus ferrumequinum korai. When the D-loop region is excluded, the similarity of the two genomes is 99.9%.

  14. The complete mitochondrial genome of a Chinese rufous horseshoe bat subspecies, Rhinolophus sinicus sinicus (Chiroptera: Rhinolophidae).

    Science.gov (United States)

    Sun, Haijian; Dong, Ji; Shi, Huizhen; Ren, Min; Hua, Panyu

    2016-09-01

    There are two subspecies of Rhinolophus sinicus currently recognized in China. In this study, using next generation sequencing approaches, the complete mitochondrial genome of one subspecies, R. s. sinicus, was obtained. The total length of the genome sequence is 16,898 bp. The arrangement and contents of R. s. sinicus mitochondrial genes exhibit high similarity with other bats of the family Rhinolophidae. Phylogenetic reconstructions support the sister relationship of the two subspecies and confirm the subspecies status of our specimen.

  15. Complete Genome Sequence of an Avian Paramyxovirus Representative of Putative New Serotype 13

    OpenAIRE

    Goraichuk, Iryna; Sharma, Poonam; Stegniy, Borys; Muzyka, Denys; Pantin-Jackwood, Mary J.; Gerilovych, Anton; Solodiankin, Olexii; Bolotin, Vitaliy; Miller, Patti J.; Dimitrov, Kiril M.; Afonso, Claudio L.

    2016-01-01

    Here, we report the complete genome sequence of a virus of a putative new serotype of avian paramyxovirus (APMV). The virus was isolated from a white-fronted goose in Ukraine in 2011 and designated white-fronted goose/Ukraine/Askania-Nova/48-15-02/2011. The genomic characterization of the isolate suggests that it represents the novel avian paramyxovirus group APMV 13.

  16. Complete Genome Sequence of an Avian Paramyxovirus Representative of Putative New Serotype 13

    Science.gov (United States)

    Goraichuk, Iryna; Sharma, Poonam; Stegniy, Borys; Muzyka, Denys; Pantin-Jackwood, Mary J.; Gerilovych, Anton; Solodiankin, Olexii; Bolotin, Vitaliy; Miller, Patti J.; Dimitrov, Kiril M.

    2016-01-01

    Here, we report the complete genome sequence of a virus of a putative new serotype of avian paramyxovirus (APMV). The virus was isolated from a white-fronted goose in Ukraine in 2011 and designated white-fronted goose/Ukraine/Askania-Nova/48-15-02/2011. The genomic characterization of the isolate suggests that it represents the novel avian paramyxovirus group APMV 13. PMID:27469958

  17. Complete Genome Sequence of the Gamma-Aminobutyric Acid-Producing Strain Streptococcus thermophilus APC151.

    Science.gov (United States)

    Linares, Daniel M; Arboleya, Silvia; Ross, R Paul; Stanton, Catherine

    2017-04-27

    Here is presented the whole-genome sequence of Streptococcus thermophilus APC151, isolated from a marine fish. This bacterium produces gamma-aminobutyric acid (GABA) in high yields and is biotechnologically suitable to produce naturally GABA-enriched biofunctional yogurt. Its complete genome comprises 2,097 genes and 1,839,134 nucleotides, with an average G+C content of 39.1%. Copyright © 2017 Linares et al.

  18. The complete mitochondrial genome of the great white shark, Carcharodon carcharias (Chondrichthyes, Lamnidae).

    Science.gov (United States)

    Chang, Chia-Hao; Shao, Kwang-Tsao; Lin, Yeong-Shin; Fang, Yi-Chiao; Ho, Hsuan-Ching

    2014-10-01

    The complete mitochondrial genome of the great white shark has 16,744 bp and includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, 1 replication origin region and 1 control region. The mitochondrial gene arrangement of the great white shark is the same as the one observed in most vertebrates. Base composition of the genome is A (30.6%), T (28.7%), C (26.9%) and G (13.9%).

  19. The complete mitochondrial genome of a stonefly species, Kamimuria chungnanshana Wu, 1948 (Plecoptera: Perlidae).

    Science.gov (United States)

    Wang, Kai; Ding, Shuangmei; Yang, Ding

    2016-09-01

    This study determined the complete mitochondrial (mt) genome of the stonefly, Kamimuria chungnanshana Wu, 1948. The mt genome is 15,943 bp in size and contains 37 canonical genes, which include 22 transfer RNA genes, 13 protein-coding genes, and two ribosomal RNA genes; the control region is 1062 bp in length. The phylogenetic tree shows that Kamimuria chungnanshana is the sister group of Kamimuria wangi.

  20. Complete mitochondrial genome of the agarophyte red alga Gelidium vagum (Gelidiales).

    Science.gov (United States)

    Yang, Eun Chan; Kim, Kyeong Mi; Boo, Ga Hun; Lee, Jung-Hyun; Boo, Sung Min; Yoon, Hwan Su

    2014-08-01

    We describe the first complete mitochondrial genome of Gelidium vagum (Gelidiales) (24,901 bp, 30.4% GC content), an agar-producing red alga. The circular mitochondrial genome contains 43 genes, including 23 protein-coding, 18 tRNA and 2 rRNA genes. All the protein-coding genes have a typical ATG start codon. No introns were found. Two genes, secY and rps12, overlap by 41 bp.

  1. Complete genome of Martelella sp. AD-3, a moderately halophilic polycyclic aromatic hydrocarbons-degrading bacterium.

    Science.gov (United States)

    Cui, Changzheng; Li, Zhijie; Qian, Jiangchao; Shi, Jie; Huang, Ling; Tang, Hongzhi; Chen, Xin; Lin, Kuangfei; Xu, Ping; Liu, Yongdi

    2016-05-10

    Martelella sp. strain AD-3, a moderately halophilic bacterium, was isolated from a petroleum-contaminated soil with high salinity in China. Here, we report the complete genome of strain AD-3, which contains one circular chromosome and two circular plasmids. An array of genes related to the metabolism of polycyclic aromatic hydrocarbons and to the halophilic mechanism of this bacterium was identified by whole-genome analysis. Copyright © 2016 Elsevier B.V. All rights reserved.

  2. Complete genome sequence of Bifidobacterium breve CECT 7263, a strain isolated from human milk

    OpenAIRE

    Jiménez, Esther; Villar-Tajadura, M. Antonia; Marín, María; Fontecha, F. Javier; Requena, Teresa; Arroyo, Rebeca; Fernández, Leónides; Rodríguez, Juan M.

    2012-01-01

    Bifidobacterium breve is an actinobacterium frequently isolated from colonic microbiota of breastfeeding babies. Here, we report the complete and annotated genome sequence of a B. breve strain isolated from human milk, B. breve CECT 7263. The genome sequence will provide new insights into the biology of this potential probiotic organism and will allow the characterization of genes related to beneficial properties. © 2012, American Society for Microbiology.

  3. The first complete chloroplast genome sequence of a lycophyte, Huperzia lucidula (Lycopodiaceae)

    Energy Technology Data Exchange (ETDEWEB)

    Wolf, Paul G.; Karol, Kenneth G.; Mandoli, Dina F.; Kuehl, Jennifer V.; Arumuganathan, K.; Ellis, Mark W.; Mishler, Brent D.; Kelch, Dean G.; Olmstead, Richard G.; Boore, Jeffrey L.

    2005-02-01

    We used a unique combination of techniques to sequence the first complete chloroplast genome of a lycophyte, Huperzia lucidula. This plant belongs to a significant clade hypothesized to represent the sister group to all other vascular plants. We used fluorescence-activated cell sorting (FACS) to isolate the organelles, rolling circle amplification (RCA) to amplify the genome, and shotgun sequencing to 8x depth coverage to obtain the complete chloroplast genome sequence. The genome is 154,373 bp, containing inverted repeats of 15,314 bp each, a large single-copy region of 104,088 bp, and a small single-copy region of 19,671 bp. Gene order is more similar to those of mosses, liverworts, and hornworts than to gene order for other vascular plants. For example, the Huperzia chloroplast genome possesses the bryophyte gene order for a previously characterized 30 kb inversion, thus supporting the hypothesis that lycophytes are sister to all other extant vascular plants. The lycophyte chloroplast genome data also enable a better reconstruction of the basal tracheophyte genome, which is useful for inferring relationships among bryophyte lineages. Several unique characters are observed in Huperzia, such as movement of the gene ndhF from the small single copy region into the inverted repeat. We present several analyses of evolutionary relationships among land plants by using nucleotide data, amino acid sequences, and by comparing gene arrangements from chloroplast genomes. The results, while still tentative pending the large number of chloroplast genomes from other key lineages that are soon to be sequenced, are intriguing in themselves, and contribute to a growing comparative database of genomic and morphological data across the green plants.

  4. Complete Genome Sequence of Treponema paraluiscuniculi, Strain Cuniculi A: The Loss of Infectivity to Humans Is Associated with Genome Decay

    Science.gov (United States)

    Šmajs, David; Zobaníková, Marie; Strouhal, Michal; Čejková, Darina; Dugan-Rocha, Shannon; Pospíšilová, Petra; Norris, Steven J.; Albert, Tom; Qin, Xiang; Hallsworth-Pepin, Kym; Buhay, Christian; Muzny, Donna M.; Chen, Lei; Gibbs, Richard A.; Weinstock, George M.

    2011-01-01

    Treponema paraluiscuniculi is the causative agent of rabbit venereal spirochetosis. It is not infectious to humans, although its genome structure is very closely related to other pathogenic Treponema species including Treponema pallidum subspecies pallidum, the etiological agent of syphilis. In this study, the genome sequence of Treponema paraluiscuniculi, strain Cuniculi A, was determined by a combination of several high-throughput sequencing strategies. Whereas the overall size (1,133,390 bp), arrangement, and gene content of the Cuniculi A genome closely resembled those of the T. pallidum genome, the T. paraluiscuniculi genome contained a markedly higher number of pseudogenes and gene fragments (51). In addition to pseudogenes, 33 divergent genes were also found in the T. paraluiscuniculi genome. A set of 32 (out of 84) affected genes encoded proteins of known or predicted function in the Nichols genome. These proteins included virulence factors, gene regulators and components of DNA repair and recombination. The majority (52 or 61.9%) of the Cuniculi A pseudogenes and divergent genes were of unknown function. Our results indicate that T. paraluiscuniculi has evolved from a T. pallidum-like ancestor and adapted to a specialized host-associated niche (rabbits) during loss of infectivity to humans. The genes that are inactivated or altered in T. paraluiscuniculi are candidates for virulence factors important in the infectivity and pathogenesis of T. pallidum subspecies. PMID:21655244
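
    The gene counts quoted in this abstract can be cross-checked with a little arithmetic. The Python sketch below assumes that the 84 affected genes are the 51 pseudogenes and gene fragments plus the 33 divergent genes; this reading is consistent with the numbers given but is not stated explicitly in the abstract, and the variable names are illustrative only.

```python
# Gene counts quoted in the abstract above.
pseudogenes_and_fragments = 51
divergent_genes = 33
known_function = 32
unknown_function = 52

# Assumption: "affected genes" = pseudogenes/fragments + divergent genes.
affected_genes = pseudogenes_and_fragments + divergent_genes

# Share of affected genes with no known function, as a percentage.
unknown_share = round(100 * unknown_function / affected_genes, 1)

print(affected_genes, known_function + unknown_function, unknown_share)
```

    Under that assumption, the known-function and unknown-function groups together account for all affected genes, and the unknown-function share agrees with the percentage quoted in the abstract.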

  5. Complete genome sequence of Treponema paraluiscuniculi, strain Cuniculi A: the loss of infectivity to humans is associated with genome decay.

    Directory of Open Access Journals (Sweden)

    David Šmajs

    Treponema paraluiscuniculi is the causative agent of rabbit venereal spirochetosis. It is not infectious to humans, although its genome structure is very closely related to other pathogenic Treponema species including Treponema pallidum subspecies pallidum, the etiological agent of syphilis. In this study, the genome sequence of Treponema paraluiscuniculi, strain Cuniculi A, was determined by a combination of several high-throughput sequencing strategies. Whereas the overall size (1,133,390 bp), arrangement, and gene content of the Cuniculi A genome closely resembled those of the T. pallidum genome, the T. paraluiscuniculi genome contained a markedly higher number of pseudogenes and gene fragments (51). In addition to pseudogenes, 33 divergent genes were also found in the T. paraluiscuniculi genome. A set of 32 (out of 84) affected genes encoded proteins of known or predicted function in the Nichols genome. These proteins included virulence factors, gene regulators and components of DNA repair and recombination. The majority (52 or 61.9%) of the Cuniculi A pseudogenes and divergent genes were of unknown function. Our results indicate that T. paraluiscuniculi has evolved from a T. pallidum-like ancestor and adapted to a specialized host-associated niche (rabbits) during loss of infectivity to humans. The genes that are inactivated or altered in T. paraluiscuniculi are candidates for virulence factors important in the infectivity and pathogenesis of T. pallidum subspecies.

  6. Complete genome sequence of Spirochaeta smaragdinae type strain (SEBR 4228T)

    Energy Technology Data Exchange (ETDEWEB)

    Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Yasawong, Montri [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Chertkov, Olga [Los Alamos National Laboratory (LANL)]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute]; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute]; Tice, Hope [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Bruce, David [U.S. Department of Energy, Joint Genome Institute]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [U.S. Department of Energy, Joint Genome Institute]; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Chang, Yun-Juan [ORNL]; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL)]; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Brambilla, Evelyne-Marie [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Spring, Stefan [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute]; Bristow, James [U.S. Department of Energy, Joint Genome Institute]; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute]; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]

    2010-01-01

    Spirochaeta smaragdinae Magot et al. 1998 belongs to the family Spirochaetaceae. The species is a Gram-negative, motile, obligately halophilic and strictly anaerobic bacterium, which is of interest because it is able to ferment numerous polysaccharides. S. smaragdinae is the only species of the family Spirochaetaceae known to reduce thiosulfate or elemental sulfur to sulfide. This is the first complete genome sequence in the family Spirochaetaceae. The 4,653,970 bp long genome with its 4,363 protein-coding and 57 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  7. Complete genome sequence of jacquemontia yellow vein virus, a novel begomovirus infecting Jacquemontia tamnifolia in Venezuela.

    Science.gov (United States)

    Fiallo-Olivé, Elvira; Chirinos, Dorys T; Geraud-Pouey, Francis; Navas-Castillo, Jesús

    2017-08-01

    Wild plants of the family Convolvulaceae are hosts for a few New World begomoviruses (genus Begomovirus, family Geminiviridae). In this work, we report the complete genome sequence of a new begomovirus infecting the wild convolvulaceous plant Jacquemontia tamnifolia in Venezuela. The cloned bipartite genome showed the organization of typical New World begomoviruses and was found to be phylogenetically related to those of begomoviruses from Venezuela and other Caribbean countries. Several recombination events have been shown to have occurred involving genome fragment exchange with related begomoviruses infecting crops such as tomato and cucurbits and wild plants, including Jacquemontia sp. We propose the name jacquemontia yellow vein virus (JacYVV) for this new begomovirus.

  8. The complete mitochondrial genome of the tiger tail seahorse, Hippocampus comes (Teleostei, Syngnathidae).

    Science.gov (United States)

    Chang, Chia-Hao; Lin, Han-Yang; Jang-Liaw, Nian-Hong; Shao, Kwang-Tsao; Lin, Yeong-Shin; Ho, Hsuan-Ching

    2013-06-01

    The complete mitochondrial genome of the tiger tail seahorse was sequenced using a polymerase chain reaction-based method. The total length of the mitochondrial DNA is 16,525 bp, and it includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a control region. The mitochondrial gene arrangement of the tiger tail seahorse also matches the one observed in most vertebrates. The base composition of the genome is A (32.8%), T (29.8%), C (23.0%), and G (14.4%), showing the A+T-rich pattern typical of other vertebrate mitochondrial genomes.
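
    The A+T richness mentioned above follows directly from the four base percentages. The short Python sketch below adds them up; the percentages are those quoted in the abstract, while the dictionary and variable names are illustrative assumptions.

```python
# Base composition (%) quoted in the abstract above.
composition = {"A": 32.8, "T": 29.8, "C": 23.0, "G": 14.4}

# A+T and G+C content, rounded to one decimal place to avoid
# floating-point noise in the sums.
at_content = round(composition["A"] + composition["T"], 1)
gc_content = round(composition["C"] + composition["G"], 1)
total = round(sum(composition.values()), 1)

print(f"A+T = {at_content}%, G+C = {gc_content}%, total = {total}%")
```

    The four percentages sum to 100%, and A+T exceeds G+C by a wide margin, which is what the abstract summarizes as A+T-rich.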

  9. The complete mitochondrial genome of the three-spot seahorse, Hippocampus trimaculatus (Teleostei, Syngnathidae).

    Science.gov (United States)

    Chang, Chia-Hao; Shao, Kwang-Tsao; Lin, Yeong-Shin; Liao, Yun-Chih

    2013-12-01

    The complete mitochondrial genome of the three-spot seahorse was sequenced using a polymerase chain reaction-based method. The total length of the mitochondrial DNA is 16,535 bp, and it includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a control region. The mitochondrial gene order of the three-spot seahorse also conforms to the distinctive vertebrate mitochondrial gene order. The base composition of the genome is A (32.7%), T (29.3%), C (23.4%), and G (14.6%), showing the A+T-rich pattern typical of other vertebrate mitochondrial genomes.

  10. The complete mitochondrial genome of Octopus bimaculatus Verrill, 1883 from the Gulf of California.

    Science.gov (United States)

    Domínguez-Contreras, José Francisco; Munguia-Vega, Adrian; Ceballos-Vázquez, Bertha Patricia; García-Rodriguez, Francisco Javier; Arellano-Martinez, Marcial

    2016-11-01

    The complete mitochondrial genome of Octopus bimaculatus is 16,085 bp in length and includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a control region. The base composition of the genome is A (40.9%), T (34.7%), C (16.9%), and G (7.5%). The control region of O. bimaculatus contains a VNTR locus not present in the genomes of other octopus species. A phylogenetic analysis shows a close relationship between the mitogenomes of O. bimaculatus and O. vulgaris.

  11. The complete mitochondrial genome of the pirarucu (Arapaima gigas, Arapaimidae, Osteoglossiformes)

    OpenAIRE

    Hrbek, Tomas; Farias, Izeni Pires

    2008-01-01

    We sequenced the complete mitochondrial genome of the pirarucu, Arapaima gigas, the largest fish of the Amazon basin, and economically one of the most important species of the region. The total length of the Arapaima gigas mitochondrial genome is 16,433 bp. The mitochondrial genome contains 13 protein-coding genes, two rRNA genes and 22 tRNA genes. Twelve of the thirteen protein-coding genes are coded on the heavy strand, while nad6 is coded on the light strand. The Arapaima gene order and co...

  12. Complete genome sequence of the plant-associated Serratia plymuthica strain AS13

    Energy Technology Data Exchange (ETDEWEB)

    Neupane, Saraswoti [Uppsala University, Uppsala, Sweden]; Finlay, Roger D. [Uppsala University, Uppsala, Sweden]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Alstrom, Sadhna [Uppsala University, Uppsala, Sweden]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Land, Miriam L [ORNL]; Han, James [U.S. Department of Energy, Joint Genome Institute]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Bruce, David [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Peters, Lin [U.S. Department of Energy, Joint Genome Institute]; Ovchinnikova, Galina [U.S. Department of Energy, Joint Genome Institute]; Held, Brittany [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Detter, J C [U.S. Department of Energy, Joint Genome Institute]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Hauser, Loren John [ORNL]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Pagani, Ioanna [U.S. Department of Energy, Joint Genome Institute]; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Hogberg, Nils [Uppsala University, Uppsala, Sweden]

    2012-01-01

    Serratia plymuthica AS13 is a plant-associated gammaproteobacterium isolated from rapeseed roots. It is of special interest because of its ability to inhibit fungal pathogens of rapeseed and to promote plant growth. The complete genome of S. plymuthica AS13 consists of a 5,442,549 bp circular chromosome. The chromosome contains 4,951 protein-coding genes, 87 tRNA genes and 7 rRNA operons. This genome was sequenced as part of the project entitled "Genomics of four rapeseed plant growth promoting bacteria with antagonistic effect on plant pathogens" within the 2010 DOE-JGI Community Sequencing Program (CSP2010).

  13. The complete mitochondrial genome of Setaria digitata (Nematoda: Filarioidea): Mitochondrial gene content, arrangement and composition compared with other nematodes.

    Science.gov (United States)

    Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi

    2010-09-01

    In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839 bp) of the parasitic nematode Setaria digitata and compared its structure and organization with those of Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. The S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome has a high A+T content (75.1%) and a low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi, but it is distinctly different from that of the other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to the S. digitata mt genome, and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematics of parasitic nematodes of socio-economic importance. 2010 Elsevier B.V. All rights reserved.

  14. Complete mitochondrial genome of the aluminum-tolerant fungus Rhodotorula taiwanensis RS1 and comparative analysis of Basidiomycota mitochondrial genomes.

    Science.gov (United States)

    Zhao, Xue Qiang; Aizawa, Tomoko; Schneider, Jessica; Wang, Chao; Shen, Ren Fang; Sunairi, Michio

    2013-04-01

    The complete mitochondrial genome of Rhodotorula taiwanensis RS1, an aluminum-tolerant Basidiomycota fungus, was determined and compared with the known mitochondrial genomes of 12 Basidiomycota species. The mitochondrial genome of R. taiwanensis RS1 is a circular DNA molecule of 40,392 bp and encodes the typical 15 mitochondrial proteins, 23 tRNAs, and small and large rRNAs as well as 10 intronic open reading frames. These genes are apparently transcribed in two directions and do not show syntenies in gene order with other investigated Basidiomycota species. The average G+C content (41%) of the mitochondrial genome of R. taiwanensis RS1 is the highest among the Basidiomycota species. Two introns were detected in the sequence of the atp9 gene of R. taiwanensis RS1, but not in that of other Basidiomycota species. Rhodotorula taiwanensis is the first species of the genus Rhodotorula whose full mitochondrial genome has been sequenced; and the data presented here supply valuable information for understanding the evolution of fungal mitochondrial genomes and researching the mechanism of aluminum tolerance in microorganisms. © 2013 The Authors. Published by Blackwell Publishing Ltd.

  15. Complete Chloroplast Genome Sequence of Aquilaria sinensis (Lour.) Gilg and Evolution Analysis within the Malvales Order.

    Science.gov (United States)

    Wang, Ying; Zhan, Di-Feng; Jia, Xian; Mei, Wen-Li; Dai, Hao-Fu; Chen, Xiong-Ting; Peng, Shi-Qing

    2016-01-01

    Aquilaria sinensis (Lour.) Gilg is an important medicinal woody plant producing agarwood, which is widely used in traditional Chinese medicine. High-throughput sequencing of chloroplast (cp) genomes has enhanced the understanding of evolutionary relationships within plant families. In this study, we determined the complete cp genome sequence of A. sinensis. The size of the A. sinensis cp genome was 159,565 bp. This genome included a large single-copy region of 87,482 bp, a small single-copy region of 19,857 bp, and a pair of inverted repeats (IRa and IRb) of 26,113 bp each. The GC content of the genome was 37.11%. The A. sinensis cp genome encoded 113 functional genes, including 82 protein-coding genes, 27 tRNA genes, and 4 rRNA genes. Seven genes were duplicated in the protein-coding genes, whereas 11 genes were duplicated in the RNA genes. A total of 45 polymorphic simple-sequence repeat loci and 60 pairs of large repeats were identified. Most simple-sequence repeats were located in the noncoding sections of the large single-copy/small single-copy region and exhibited high A/T content. Moreover, 33 pairs of large repeat sequences were located in the protein-coding genes, whereas 27 pairs were located in the intergenic regions. Codon usage in the A. sinensis cp genome was biased toward codons ending in A/T. The distribution of codon usage in the A. sinensis cp genome was most similar to that in the Gonystylus bancanus cp genome. Comparative results of 82 protein-coding genes from 29 species of cp genomes demonstrated that A. sinensis was a sister species to G. bancanus within the Malvales order. The A. sinensis cp genome showed the highest sequence similarity (>90%) with the G. bancanus cp genome in a comparison using the CGView Comparison Tool. This finding strongly supports the placement of A. sinensis as a sister to G. bancanus within the Malvales order. The complete A. sinensis cp genome information will be highly beneficial for further studies on this traditional medicinal
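
    The region sizes reported above can be checked against the total genome length, since a chloroplast genome with this quadripartite layout consists of one large single-copy region, one small single-copy region, and two copies of the inverted repeat. The Python sketch below is only a consistency check on the quoted numbers; the variable names are illustrative assumptions.

```python
# Region sizes (bp) quoted in the abstract above.
lsc = 87_482         # large single-copy region
ssc = 19_857         # small single-copy region
ir = 26_113          # each inverted repeat (IRa and IRb)
reported_total = 159_565

# One LSC, one SSC, and two IR copies make up the circular genome.
computed_total = lsc + ssc + 2 * ir

print(computed_total, computed_total == reported_total)
```

    For this record the four regions add up exactly to the reported genome size.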

  16. [Sequencing and analysis of the complete genome of a rabies virus isolate from Sika deer].

    Science.gov (United States)

    Zhao, Yun-Jiao; Guo, Li; Huang, Ying; Zhang, Li-Shi; Qian, Ai-Dong

    2008-05-01

    One DRV strain was isolated from Sika deer brain and sequenced. Nine overlapping gene fragments were amplified by RT-PCR together with the 3'-RACE and 5'-RACE methods, and the complete DRV genome sequence was assembled. The length of the complete genome is 11,863 bp. The DRV genome organization was similar to that of other rabies viruses: it comprised five genes, and the initiation and termination sites were highly conserved. There were amino acid mutations in important antigenic sites of the nucleoprotein and glycoprotein. The nucleotide and amino acid homologies of the N, P, M, G and L genes were compared among strains with completed genomic sequencing. A phylogenetic tree was constructed by comparison with the N gene sequences of other typical rabies viruses. These results indicated that DRV belonged to genotype 1. The highest homology, 94%, was with the Chinese vaccine strain 3aG, and the lowest, 71%, was with WCBV. These findings provide a theoretical reference for further research on rabies virus.

  17. Structural genomic variation in ischemic stroke

    Science.gov (United States)

    Matarin, Mar; Simon-Sanchez, Javier; Fung, Hon-Chung; Scholz, Sonja; Gibbs, J. Raphael; Hernandez, Dena G.; Crews, Cynthia; Britton, Angela; Wavrant De Vrieze, Fabienne; Brott, Thomas G.; Brown, Robert D.; Worrall, Bradford B.; Silliman, Scott; Case, L. Douglas; Hardy, John A.; Rich, Stephen S.; Meschia, James F.; Singleton, Andrew B.

    2008-01-01

    Technological advances in molecular genetics allow rapid and sensitive identification of genomic copy number variants (CNVs). This, in turn, has sparked interest in the function such variation may play in disease. While a role for copy number mutations as a cause of Mendelian disorders is well established, it is unclear whether CNVs may affect risk for common complex disorders. We sought to investigate whether CNVs may modulate risk for ischemic stroke (IS) and to provide a catalog of CNVs in patients with this disorder by analyzing copy number metrics produced as a part of our previous genome-wide single-nucleotide polymorphism (SNP)-based association study of ischemic stroke in a North American white population. We examined CNVs in 263 patients with ischemic stroke (IS). Each identified CNV was compared with changes identified in 275 neurologically normal controls. Our analysis identified 247 CNVs, corresponding to 187 insertions (76%; 135 heterozygous; 25 homozygous duplications or triplications; 2 heterosomic) and 60 deletions (24%; 40 heterozygous deletions; 3 homozygous deletions; 14 heterosomic deletions). Most alterations (81%) were the same as, or overlapped with, previously reported CNVs. We report here the first genome-wide analysis of CNVs in IS patients. In summary, our study did not detect any common genomic structural variation unequivocally linked to IS, although we cannot exclude that smaller CNVs or CNVs in genomic regions poorly covered by this methodology may confer risk for IS. The application of genome-wide SNP arrays now facilitates the evaluation of structural changes through the entire genome as part of a genome-wide genetic association study. PMID:18288507
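
    The headline proportions in this abstract can be reproduced from the top-level counts. The Python sketch below uses only the totals for insertions and deletions; it makes no attempt to reconcile the finer subcategory counts, and the variable names are illustrative assumptions.

```python
# Top-level CNV counts quoted in the abstract above.
insertions = 187
deletions = 60
reported_total = 247

total = insertions + deletions

# Percentages rounded to whole numbers, as in the abstract.
insertion_pct = round(100 * insertions / total)
deletion_pct = round(100 * deletions / total)

print(total, insertion_pct, deletion_pct)
```

    The two top-level categories add up to the reported total, and the rounded shares match the percentages given for insertions and deletions.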

  18. Sequencing and analysis of the complete mitochondrial genome in Anopheles sinensis (Diptera: Culicidae).

    Science.gov (United States)

    Chen, Kai; Wang, Yan; Li, Xiang-Yu; Peng, Heng; Ma, Ya-Jun

    2017-10-02

    Anopheles sinensis (Diptera: Culicidae) is a primary vector of Plasmodium vivax and Brugia malayi in most regions of China. In addition, its phylogenetic relationship with the cryptic species of the Hyrcanus Group is complex and remains unresolved. Mitochondrial genome sequences are widely used as molecular markers for phylogenetic studies of mosquito species complexes, but mitochondrial genome data for An. sinensis are not available. An. sinensis samples were collected from Shandong, China, and identified using a molecular marker. Genomic DNA was extracted, followed by Illumina sequencing. Two complete mitochondrial genomes were assembled and annotated using the mitochondrial genome of An. gambiae as reference. The mitochondrial genome sequences of 28 known Anopheles species were aligned, and a phylogenetic tree was reconstructed by the maximum likelihood (ML) method. The lengths of the two complete mitochondrial genomes of An. sinensis were 15,076 bp and 15,138 bp, each consisting of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes, and an AT-rich control region. As in other insects, most mitochondrial genes are encoded on the J strand, except for ND5, ND4, ND4L, ND1, two rRNA and eight tRNA genes, which are encoded on the N strand. The number of bootstrap replicates was set to 1000 in the ML analyses. The topologies recovered the phylogenetic affinities within the subfamily Anophelinae. The ML tree showed four major clades, corresponding to the subgenera Cellia, Anopheles, Nyssorhynchus and Kerteszia of the genus Anopheles. The complete mitochondrial genomes of An. sinensis were obtained. The number, order and transcription direction of An. sinensis mitochondrial genes were the same as in other species of the family Culicidae.

  19. The complete nucleotide sequence, genome organization, and origin of human adenovirus type 11

    International Nuclear Information System (INIS)

    Stone, Daniel; Furthmann, Anne; Sandig, Volker; Lieber, Andre

    2003-01-01

    The complete DNA sequence and transcription map of human adenovirus type 11 are reported here. This is the first published sequence for a subgenus B human adenovirus and demonstrates a genome organization highly similar to those of other human adenoviruses. All of the genes from the early, intermediate, and late regions are present in the expected locations of the genome for a human adenovirus. The genome is 34,794 bp in length and has a GC content of 48.9%. Sequence alignment with genomes of groups A (Ad12), C (Ad5), D (Ad17), E (Simian adenovirus 25), and F (Ad40) revealed homologies of 64, 54, 68, 75, and 52%, respectively. Detailed genomic analysis demonstrated that Ads 11 and 35 are highly conserved in all areas except the hexon hypervariable regions and fiber. Similarly, comparison of Ad11 with subgroup E SAV25 revealed poor homology between fibers but high homology in proteins encoded by all other areas of the genome. We propose an evolutionary model in which functional viruses can be reconstituted following fiber substitution from one serotype to another. According to this model either the Ad11 genome is a derivative of Ad35, from which the fiber was substituted with Ad7, or the Ad35 genome is the product of a fiber substitution from Ad21 into the Ad11 genome. This model also provides a possible explanation for the origin of group E Ads, which are evolutionarily derived from a group C fiber substitution into a group B genome.

  20. The complete genome sequence of the plant growth-promoting bacterium Pseudomonas sp. UW4.

    Directory of Open Access Journals (Sweden)

    Jin Duan

    The plant growth-promoting bacterium (PGPB) Pseudomonas sp. UW4, previously isolated from the rhizosphere of common reeds growing on the campus of the University of Waterloo, promotes plant growth in the presence of different environmental stresses, such as flooding, high concentrations of salt, cold, heavy metals, drought and phytopathogens. In this work, the genome sequence of UW4 was obtained by pyrosequencing and the gaps between the contigs were closed by directed PCR. The P. sp. UW4 genome contains a single circular chromosome that is 6,183,388 bp with a 60.05% G+C content. The bacterial genome contains 5,423 predicted protein-coding sequences that occupy 87.2% of the genome. Nineteen genomic islands (GIs) were predicted and thirty one complete putative insertion sequences were identified. Genes potentially involved in plant growth promotion such as indole-3-acetic acid (IAA) biosynthesis, trehalose production, siderophore production, acetoin synthesis, and phosphate solubilization were determined. Moreover, genes that contribute to the environmental fitness of UW4 were also observed including genes responsible for heavy metal resistance such as nickel, copper, cadmium, zinc, molybdate, cobalt, arsenate, and chromate. Whole-genome comparison with other completely sequenced Pseudomonas strains and phylogeny of four concatenated "housekeeping" genes (16S rRNA, gyrB, rpoB and rpoD) of 128 Pseudomonas strains revealed that UW4 belongs to the fluorescens group, jessenii subgroup.

  1. The Complete Genome Sequence of the Plant Growth-Promoting Bacterium Pseudomonas sp. UW4

    Science.gov (United States)

    Duan, Jin; Jiang, Wei; Cheng, Zhenyu; Heikkila, John J.; Glick, Bernard R.

    2013-01-01

    The plant growth-promoting bacterium (PGPB) Pseudomonas sp. UW4, previously isolated from the rhizosphere of common reeds growing on the campus of the University of Waterloo, promotes plant growth in the presence of different environmental stresses, such as flooding, high concentrations of salt, cold, heavy metals, drought and phytopathogens. In this work, the genome sequence of UW4 was obtained by pyrosequencing and the gaps between the contigs were closed by directed PCR. The P. sp. UW4 genome contains a single circular chromosome that is 6,183,388 bp with a 60.05% G+C content. The bacterial genome contains 5,423 predicted protein-coding sequences that occupy 87.2% of the genome. Nineteen genomic islands (GIs) were predicted and thirty one complete putative insertion sequences were identified. Genes potentially involved in plant growth promotion such as indole-3-acetic acid (IAA) biosynthesis, trehalose production, siderophore production, acetoin synthesis, and phosphate solubilization were determined. Moreover, genes that contribute to the environmental fitness of UW4 were also observed including genes responsible for heavy metal resistance such as nickel, copper, cadmium, zinc, molybdate, cobalt, arsenate, and chromate. Whole-genome comparison with other completely sequenced Pseudomonas strains and phylogeny of four concatenated “housekeeping” genes (16S rRNA, gyrB, rpoB and rpoD) of 128 Pseudomonas strains revealed that UW4 belongs to the fluorescens group, jessenii subgroup. PMID:23516524

  2. Complete genome sequence of Paenibacillus riograndensis SBR5(T), a Gram-positive diazotrophic rhizobacterium.

    Science.gov (United States)

    Brito, Luciana Fernandes; Bach, Evelise; Kalinowski, Jörn; Rückert, Christian; Wibberg, Daniel; Passaglia, Luciane M; Wendisch, Volker F

    2015-08-10

    Paenibacillus riograndensis is a Gram-positive rhizobacterium which exhibits plant growth promoting activities. It was isolated from the rhizosphere of wheat grown in the state of Rio Grande do Sul, Brazil. Here we announce the complete genome sequence of P. riograndensis strain SBR5(T). The genome of P. riograndensis SBR5(T) consists of a circular chromosome of 7,893,056 bp. The genome was finished and fully annotated, containing 6705 protein coding genes, 87 tRNAs and 27 rRNAs. The knowledge of the complete genome helped to explain why P. riograndensis SBR5(T) can grow with the carbon sources arabinose and mannitol, but not myo-inositol, and to explain physiological features such as biotin auxotrophy and antibiotic resistances. The genome sequence will be valuable for functional genomics and ecological studies as well as for application of P. riograndensis SBR5(T) as a plant growth-promoting rhizobacterium. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. [Complete genome sequencing of polymalic acid-producing strain Aureobasidium pullulans CCTCC M2012223].

    Science.gov (United States)

    Wang, Yongkang; Song, Xiaodan; Li, Xiaorong; Yang, Sang-tian; Zou, Xiang

    2017-01-04

    The aim was to explore the genome sequence of Aureobasidium pullulans CCTCC M2012223, analyze the key genes related to the biosynthesis of important metabolites, and provide genetic background for metabolic engineering. The complete genome of A. pullulans CCTCC M2012223 was sequenced on the Illumina HiSeq high-throughput sequencing platform. Then, fragment assembly, gene prediction, functional annotation, and GO/COG clustering were carried out and compared with those of five other A. pullulans varieties. The complete genome sequence of A. pullulans CCTCC M2012223 was 30,756,831 bp with an average GC content of 47.49%, and 9452 genes were successfully predicted. Genome-wide analysis showed that A. pullulans CCTCC M2012223 had the largest genome assembly size. Protein sequences involved in the pullulan and polymalic acid pathways were highly conserved in all six A. pullulans varieties. Although A. pullulans CCTCC M2012223 and A. pullulans var. melanogenum are closely related, some point mutations and insertions occurred in protein sequences involved in melanin biosynthesis. The genome information of A. pullulans CCTCC M2012223 was annotated and genes involved in the melanin, pullulan and polymalic acid pathways were compared, which provides a theoretical basis for genetic modification of metabolic pathways in A. pullulans.

  4. The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes.

    Science.gov (United States)

    Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai

    2017-01-01

    The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.

  5. Complete Genome Sequence of Zucchini Yellow Mosaic Virus Strain Kurdistan, Iran.

    Science.gov (United States)

    Maghamnia, Hamid Reza; Hajizadeh, Mohammad; Azizi, Abdolbaset

    2018-03-01

    The complete genome sequence of Zucchini yellow mosaic virus strain Kurdistan (ZYMV-Kurdistan) infecting squash from Iran was determined from 13 overlapping fragments. Excluding the poly(A) tail, the ZYMV-Kurdistan genome consisted of 9593 nucleotides (nt), with 138 and 211 nt at the 5' and 3' non-translated regions, respectively. It contained two open-reading frames (ORFs), the large ORF encoding a polyprotein of 3080 amino acids (aa) and the small overlapping ORF encoding a P3N-PIPO protein of 74 aa. This isolate had six unique aa differences compared to other ZYMV isolates and shared 79.6-98.8% identities with other ZYMV genome sequences at the nt level and 90.1-99% identities at the aa level. A phylogenetic tree of ZYMV complete genomic sequences showed that Iranian and Central European isolates are closely related and form a phylogenetically homogenous group. All values of the ratio of substitution rates at non-synonymous and synonymous sites (dN/dS) were below 1, suggestive of strong negative selection forces during ZYMV protein history. This is the first report of complete genome sequence information of the most prevalent virus in the west of Iran. This study helps our understanding of the genetic diversity of ZYMV isolates infecting cucurbit plants in Iran, virus evolution and epidemiology and can assist in designing better diagnostic tools.
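
    As a rough illustration of how a dN/dS ratio is conventionally read, the Python sketch below labels a ratio as purifying, neutral or positive selection. The function name, the tolerance and the sample ratios are hypothetical and are not taken from the study; only the convention that ratios below 1 point to negative (purifying) selection comes from the abstract above.

```python
def classify_selection(dn_ds: float, tol: float = 1e-9) -> str:
    """Label a dN/dS ratio using the usual rule of thumb.

    < 1  -> purifying (negative) selection
    == 1 -> neutral evolution (within a small tolerance)
    > 1  -> positive (diversifying) selection
    """
    if dn_ds < 0:
        raise ValueError("dN/dS cannot be negative")
    if abs(dn_ds - 1.0) <= tol:
        return "neutral"
    return "purifying" if dn_ds < 1.0 else "positive"


# Hypothetical ratios, for illustration only (not values from the paper).
example_ratios = {"gene_a": 0.05, "gene_b": 0.42, "gene_c": 1.8}
labels = {gene: classify_selection(r) for gene, r in example_ratios.items()}
```

    A single genome-wide label hides variation between sites, so a check like this is only a first-pass reading of per-gene estimates.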

  6. Complete genome of a European hepatitis C virus subtype 1g isolate: phylogenetic and genetic analyses.

    Science.gov (United States)

    Bracho, Maria A; Saludes, Verónica; Martró, Elisa; Bargalló, Ana; González-Candelas, Fernando; Ausina, Vicent

    2008-06-05

    Hepatitis C virus isolates have been classified into six main genotypes and a variable number of subtypes within each genotype, mainly based on phylogenetic analysis. Analyses of the genetic relationship among genotypes and subtypes are more reliable when complete genome sequences (or at least the full coding region) are used; however, so far 31 of 80 confirmed or proposed subtypes have at least one complete genome available. Of these, 20 correspond to confirmed subtypes of epidemic interest. We present and analyse the first complete genome sequence of a HCV subtype 1g isolate. Phylogenetic and genetic distance analyses reveal that HCV-1g is the most divergent subtype among the HCV-1 confirmed subtypes. Potential genomic recombination events between genotypes or subtype 1 genomes were ruled out. We demonstrate phylogenetic congruence of previously deposited partial sequences of HCV-1g with respect to our sequence. In light of this, we propose changing the current status of its subtype-specific designation from provisional to confirmed.

  7. Complete genome of a European hepatitis C virus subtype 1g isolate: phylogenetic and genetic analyses

    Directory of Open Access Journals (Sweden)

    Bargalló Ana

    2008-06-01

    Background: Hepatitis C virus isolates have been classified into six main genotypes and a variable number of subtypes within each genotype, mainly based on phylogenetic analysis. Analyses of the genetic relationship among genotypes and subtypes are more reliable when complete genome sequences (or at least the full coding region) are used; however, so far 31 of 80 confirmed or proposed subtypes have at least one complete genome available. Of these, 20 correspond to confirmed subtypes of epidemic interest. Results: We present and analyse the first complete genome sequence of a HCV subtype 1g isolate. Phylogenetic and genetic distance analyses reveal that HCV-1g is the most divergent subtype among the HCV-1 confirmed subtypes. Potential genomic recombination events between genotypes or subtype 1 genomes were ruled out. We demonstrate phylogenetic congruence of previously deposited partial sequences of HCV-1g with respect to our sequence. Conclusion: In light of this, we propose changing the current status of its subtype-specific designation from provisional to confirmed.

  8. Complete genome sequence of Menghai rhabdovirus, a novel mosquito-borne rhabdovirus from China.

    Science.gov (United States)

    Sun, Qiang; Zhao, Qiumin; An, Xiaoping; Guo, Xiaofang; Zuo, Shuqing; Zhang, Xianglilan; Pei, Guangqian; Liu, Wenli; Cheng, Shi; Wang, Yunfei; Shu, Peng; Mi, Zhiqiang; Huang, Yong; Zhang, Zhiyi; Tong, Yigang; Zhou, Hongning; Zhang, Jiusong

    2017-04-01

    Menghai rhabdovirus (MRV) was isolated from Aedes albopictus in Menghai county of Yunnan Province, China, in August 2010. Whole-genome sequencing of MRV was performed using an Ion PGM™ Sequencer. We found that MRV is a single-stranded, negative-sense RNA virus. The complete genome of MRV has 10,744 nt, with short inverted repeat termini, encoding five typical rhabdovirus proteins (N, P, M, G, and L) and an additional small hypothetical protein. Nucleotide BLAST analysis using the BLASTn method showed that the genome sequence most similar to that of MRV is that of Arboretum virus (NC_025393.1), with a Max score of 322, query coverage of 14%, and 66% identity. Genomic and phylogenetic analyses both demonstrated that MRV should be considered a member of a novel species of the family Rhabdoviridae.

  9. The complete mitochondrial genome sequence of Eimeria innocua (Eimeriidae, Coccidia, Apicomplexa).

    Science.gov (United States)

    Hafeez, Mian Abdul; Vrba, Vladimir; Barta, John Robert

    2016-07-01

    The complete mitochondrial genome of Eimeria innocua KR strain (Eimeriidae, Coccidia, Apicomplexa) was sequenced. This coccidium infects turkeys (Meleagris gallopavo), Bobwhite quails (Colinus virginianus), and Grey partridges (Perdix perdix). Genome organization and gene contents were comparable with other Eimeria spp. infecting galliform birds. The circular-mapping mt genome of E. innocua is 6247 bp in length with three protein-coding genes (cox1, cox3, and cytb), 19 gene fragments encoding large subunit (LSU) rRNA and 14 gene fragments encoding small subunit (SSU) rRNA. Like other Apicomplexa, no tRNA was encoded. The mitochondrial genome of E. innocua confirms its close phylogenetic affinities to Eimeria dispersa.

  10. Complete genome sequence of Truepera radiovictrix type strain (RQ-24T)

    Energy Technology Data Exchange (ETDEWEB)

    Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Rohde, Christine [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Munk, Christine [Joint Genome Institute, Walnut Creek, California]; Nolan, Matt [Joint Genome Institute, Walnut Creek, California]; Lucas, Susan [Joint Genome Institute, Walnut Creek, California]; Glavina Del Rio, Tijana [Joint Genome Institute, Walnut Creek, California]; Tice, Hope [Joint Genome Institute, Walnut Creek, California]; Deshpande, Shweta [Joint Genome Institute, Walnut Creek, California]; Cheng, Jan-Fang [Joint Genome Institute, Walnut Creek, California]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [Joint Genome Institute, Walnut Creek, California]; Liolios, Konstantinos [Joint Genome Institute, Walnut Creek, California]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [Joint Genome Institute, Walnut Creek, California]; Palaniappan, Krishna [Joint Genome Institute, Walnut Creek, California]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Chang, Yun-Juan [ORNL]; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL)]; Brambilla, Evelyne-Marie [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Tindall, Brian [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Woyke, Tanja [Joint Genome Institute, Walnut Creek, California]; Bristow, James [Joint Genome Institute, Walnut Creek, California]; Eisen, Jonathan [Joint Genome Institute, Walnut Creek, California]; Markowitz, Victor [Joint Genome Institute, Walnut Creek, California]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [Joint Genome Institute, Walnut Creek, California]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Lapidus, Alla L. [Joint Genome Institute, Walnut Creek, California]

    2011-01-01

    Truepera radiovictrix Albuquerque et al. 2005 is the type species of the genus Truepera within the phylum Deinococcus/Thermus. T. radiovictrix is of special interest not only because of its isolated phylogenetic location in the order Deinococcales, but also because of its ability to grow under multiple extreme conditions in alkaline, moderately saline, and high temperature habitats. Of particular interest is the fact that T. radiovictrix is also remarkably resistant to ionizing radiation, a feature it shares with members of the genus Deinococcus. This is the first completed genome sequence of a member of the family Trueperaceae and the fourth type strain genome sequence from a member of the order Deinococcales. The 3,260,398 bp long genome with its 2,994 protein-coding and 52 RNA genes consists of one circular chromosome and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  11. The complete genome sequence of Haloferax volcanii DS2, a model archaeon.

    Directory of Open Access Journals (Sweden)

    Amber L Hartman

    2010-03-01

    Haloferax volcanii is an easily culturable moderate halophile that grows on simple defined media, is readily transformable, and has a relatively stable genome. This, in combination with its biochemical and genetic tractability, has made Hfx. volcanii a key model organism, not only for the study of halophilicity, but also for archaeal biology in general. We report here the sequencing and analysis of the genome of Hfx. volcanii DS2, the type strain of this species. The genome contains a main 2.848 Mb chromosome, three smaller chromosomes pHV1, 3, 4 (85, 438, 636 kb, respectively) and the pHV2 plasmid (6.4 kb). The completed genome sequence, presented here, provides an invaluable tool for further in vivo and in vitro studies of Hfx. volcanii.

  12. The complete chloroplast genome of an irreplaceable dietary and model crop, foxtail millet (Setaria italica).

    Science.gov (United States)

    Wang, Shuo; Gao, Li-Zhi

    2016-11-01

    The complete chloroplast genome sequence of foxtail millet (Setaria italica), an important food and fodder crop in the family Poaceae, is reported for the first time in this study. The genome consists of 135 516 bp containing a pair of inverted repeats (IRs) of 21 804 bp separated by a large single-copy (LSC) region and a small single-copy (SSC) region of 79 896 bp and 12 012 bp, respectively. Coding sequences constitute 58.8% of the genome harboring 111 unique genes, 71 of which are protein-coding genes, 4 are rRNA genes, and 36 are tRNA genes. Phylogenetic analysis indicated foxtail millet clustered with Panicum virgatum and Echinochloa crus-galli belonging to the tribe Paniceae of the subfamily Panicoideae. This newly determined chloroplast genome will provide valuable information for the future breeding programs of valuable cereal crops in the family Poaceae.
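
    The region lengths quoted for the foxtail millet chloroplast genome can be cross-checked with simple arithmetic, since a quadripartite genome is one LSC, one SSC and two IR copies. The Python sketch below is only a consistency check; the function name is hypothetical, and the numbers are the ones given in the abstract above.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a genome made of one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir


# Region lengths in bp, as reported in the abstract above.
total_bp = quadripartite_length(lsc=79_896, ssc=12_012, ir=21_804)
print(total_bp)  # 135516
```

    The same check applies to any chloroplast record that reports all three region lengths along with the total.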

  13. Complete genome sequence of the gliding, heparinolytic Pedobacter saltans type strain (113T)

    Science.gov (United States)

    Liolios, Konstantinos; Sikorski, Johannes; Lu, Meagan; Nolan, Matt; Lapidus, Alla; Lucas, Susan; Hammon, Nancy; Deshpande, Shweta; Cheng, Jan-Fang; Tapia, Roxanne; Han, Cliff; Goodwin, Lynne; Pitluck, Sam; Huntemann, Marcel; Ivanova, Natalia; Pagani, Ioanna; Mavromatis, Konstantinos; Ovchinikova, Galina; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Brambilla, Evelyne-Marie; Kotsyurbenko, Oleg; Rohde, Manfred; Tindall, Brian J.; Abt, Birte; Göker, Markus; Detter, John C.; Woyke, Tanja; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Klenk, Hans-Peter; Kyrpides, Nikos C.

    2011-01-01

    Pedobacter saltans Steyn et al. 1998 is one of currently 32 species in the genus Pedobacter within the family Sphingobacteriaceae. The species is of interest for its isolated location in the tree of life. Like other members of the genus, P. saltans is heparinolytic. Cells of P. saltans show a peculiar gliding, dancing motility and can be distinguished from other Pedobacter strains by their ability to utilize glycerol and the inability to assimilate D-cellobiose. The genome presented here is only the second completed genome sequence of a type strain from a member of the family Sphingobacteriaceae to be published. The 4,635,236 bp long genome with its 3,854 protein-coding and 67 RNA genes consists of one chromosome, and is a part of the Genomic Encyclopedia of Bacteria and Archaea project. PMID:22180808

  14. Complete genome sequence of the gliding, heparinolytic Pedobacter saltans type strain (113).

    Science.gov (United States)

    Liolios, Konstantinos; Sikorski, Johannes; Lu, Meagan; Nolan, Matt; Lapidus, Alla; Lucas, Susan; Hammon, Nancy; Deshpande, Shweta; Cheng, Jan-Fang; Tapia, Roxanne; Han, Cliff; Goodwin, Lynne; Pitluck, Sam; Huntemann, Marcel; Ivanova, Natalia; Pagani, Ioanna; Mavromatis, Konstantinos; Ovchinikova, Galina; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Brambilla, Evelyne-Marie; Kotsyurbenko, Oleg; Rohde, Manfred; Tindall, Brian J; Abt, Birte; Göker, Markus; Detter, John C; Woyke, Tanja; Bristow, James; Eisen, Jonathan A; Markowitz, Victor; Hugenholtz, Philip; Klenk, Hans-Peter; Kyrpides, Nikos C

    2011-10-15

    Pedobacter saltans Steyn et al. 1998 is one of currently 32 species in the genus Pedobacter within the family Sphingobacteriaceae. The species is of interest for its isolated location in the tree of life. Like other members of the genus, P. saltans is heparinolytic. Cells of P. saltans show a peculiar gliding, dancing motility and can be distinguished from other Pedobacter strains by their ability to utilize glycerol and the inability to assimilate D-cellobiose. The genome presented here is only the second completed genome sequence of a type strain from a member of the family Sphingobacteriaceae to be published. The 4,635,236 bp long genome with its 3,854 protein-coding and 67 RNA genes consists of one chromosome, and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  15. Complete chloroplast genome sequence of a major allogamous forage species, perennial ryegrass (Lolium perenne L.).

    Science.gov (United States)

    Diekmann, Kerstin; Hodkinson, Trevor R; Wolfe, Kenneth H; van den Bekerom, Rob; Dix, Philip J; Barth, Susanne

    2009-06-01

    Lolium perenne L. (perennial ryegrass) is globally one of the most important forage and grassland crops. We sequenced the chloroplast (cp) genome of Lolium perenne cultivar Cashel. The L. perenne cp genome is 135 282 bp with a typical quadripartite structure. It contains genes for 76 unique proteins, 30 tRNAs and four rRNAs. As in other grasses, the genes accD, ycf1 and ycf2 are absent. The genome is of average size within its subfamily Pooideae and of medium size within the Poaceae. Genome size differences are mainly due to length variations in non-coding regions. However, considerable length differences of 1-27 codons in comparison of L. perenne to other Poaceae and 1-68 codons among all Poaceae were also detected. Within the cp genome of this outcrossing cultivar, 10 insertion/deletion polymorphisms and 40 single nucleotide polymorphisms were detected. Two of the polymorphisms involve tiny inversions within hairpin structures. By comparing the genome sequence with RT-PCR products of transcripts for 33 genes, 31 mRNA editing sites were identified, five of them unique to Lolium. The cp genome sequence of L. perenne is available under Accession number AM777385 at the European Molecular Biology Laboratory, National Center for Biotechnology Information and DNA DataBank of Japan.

  16. Complete genome sequence of Francisella tularensis subspecies holarctica FTNF002-00.

    Directory of Open Access Journals (Sweden)

    Ravi D Barabote

    Francisella tularensis subspecies holarctica FTNF002-00 strain was originally obtained from the first known clinical case of bacteremic F. tularensis pneumonia in Southern Europe isolated from an immunocompetent individual. The FTNF002-00 complete genome contains the RD(23) deletion and represents a type strain for a clonal population from the first epidemic tularemia outbreak in Spain between 1997-1998. Here, we present the complete sequence analysis of the FTNF002-00 genome. The complete genome sequence of FTNF002-00 revealed several large as well as small genomic differences with respect to two other published complete genome sequences of F. tularensis subsp. holarctica strains, LVS and OSU18. The FTNF002-00 genome shares >99.9% sequence similarity with LVS and OSU18, and is also approximately 5 MB smaller by comparison. The overall organization of the FTNF002-00 genome is remarkably identical to those of LVS and OSU18, except for a single 3.9 kb inversion in FTNF002-00. Twelve regions of difference ranging from 0.1-1.5 kb and forty-two small insertions and deletions were identified in a comparative analysis of FTNF002-00, LVS, and OSU18 genomes. Two small deletions appear to inactivate two genes in FTNF002-00 causing them to become pseudogenes; the intact genes encode a protein of unknown function and a drug:H(+) antiporter. In addition, we identified ninety-nine proteins in FTNF002-00 containing amino acid mutations compared to LVS and OSU18. Several non-conserved amino acid replacements were identified, one of which occurs in the virulence-associated intracellular growth locus subunit D protein. Many of these changes in FTNF002-00 are likely the consequence of direct selection that increases the fitness of this subsp. holarctica clone within its endemic population. Our complete genome sequence analyses lay the foundation for experimental testing of these possibilities.

  17. Complete mitochondrial genome of Taharana fasciana (Insecta, Hemiptera: Cicadellidae) and comparison with other Cicadellidae insects.

    Science.gov (United States)

    Wang, Jiajia; Li, Hu; Dai, Renhuai

    2017-12-01

    Here, we describe the first complete mitochondrial genome (mitogenome) sequence of the leafhopper Taharana fasciana (Coelidiinae). The mitogenome sequence contains 15,161 bp with an A + T content of 77.9%. It includes 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and one non-coding (A + T-rich) region; in addition, a repeat region is also present (GenBank accession no. KY886913). These genes/regions are in the same order as in the inferred insect ancestral mitogenome. All protein-coding genes have ATN as the start codon, and TAA or single T as the stop codons, except the gene ND3, which ends with TAG. Furthermore, we predicted the secondary structures of the rRNAs in T. fasciana. Six domains (domain III is absent in arthropods) and 41 helices were predicted for 16S rRNA, and 12S rRNA comprised three structural domains and 24 helices. Phylogenetic tree analysis confirmed that T. fasciana and other members of the Cicadellidae are clustered into a clade, and it identified the relationships among the subfamilies Deltocephalinae, Coelidiinae, Idiocerinae, Cicadellinae, and Typhlocybinae.

  18. Sequencing and characterization of the complete mitochondrial genome of Japanese Swellshark (Cephalloscyllium umbratile).

    Science.gov (United States)

    Zhu, Ke-Cheng; Liang, Yin-Yin; Wu, Na; Guo, Hua-Yang; Zhang, Nan; Jiang, Shi-Gui; Zhang, Dian-Chang

    2017-11-10

    To better understand the genome features of Cephalloscyllium umbratile (Carcharhiniformes), an endangered species, the complete mitochondrial DNA (mtDNA) was sequenced and annotated for the first time. The full-length mtDNA of C. umbratile was 16,697 bp and contained ribosomal RNA (rRNA) genes, 13 protein-coding genes (PCGs), 23 transfer RNA (tRNA) genes, and a major non-coding control region. Each PCG was initiated by an authoritative ATN codon, except for COX1, which was initiated by a GTG codon. Seven of the 13 PCGs had a typical TAA termination codon, while the others terminated with a single T or TA. Moreover, the relative synonymous codon usage of the 13 PCGs was consistent with that of other published Carcharhiniformes. All tRNA genes had typical clover-leaf secondary structures, except for tRNA-Ser (GCT), which lacked the dihydrouridine 'DHU' arm. Furthermore, the analysis of the average Ka/Ks in the 13 PCGs of three Carcharhiniformes species indicated a strong purifying selection within this group. In addition, phylogenetic analysis revealed that C. umbratile was closely related to Glyphis glyphis and Glyphis garricki. Our data supply a useful resource for further studies on genetic diversity and population structure of C. umbratile.

  19. Complete genome sequence of a new enamovirus from Argentina infecting alfalfa plants showing dwarfism symptoms.

    Science.gov (United States)

    Bejerman, Nicolás; Giolitti, Fabián; Trucco, Verónica; de Breuil, Soledad; Dietzgen, Ralf G; Lenardon, Sergio

    2016-07-01

    Alfalfa dwarf disease, probably caused by synergistic interactions of mixed virus infections, is a major and emergent disease that threatens alfalfa production in Argentina. Deep sequencing of diseased alfalfa plant samples from the central region of Argentina resulted in the identification of a new virus genome resembling enamoviruses in sequence and genome structure. Phylogenetic analysis suggests that it is a new member of the genus Enamovirus, family Luteoviridae. The virus is tentatively named "alfalfa enamovirus 1" (AEV-1). The availability of the AEV-1 genome sequence will make it possible to assess the genetic variability of this virus and to construct an infectious clone to investigate its role in alfalfa dwarfism disease.

  20. The complete mitochondrial genome of the Feral Rock Pigeon (Columba livia breed feral).

    Science.gov (United States)

    Li, Chun-Hong; Liu, Fang; Wang, Li

    2014-10-01

    In the present work, we report the complete mitochondrial genome sequence of feral rock pigeon for the first time. The total length of the mitogenome was 17,239 bp with the base composition of 30.3% for A, 24.0% for T, 31.9% for C, and 13.8% for G and an A-T (54.3 %)-rich feature was detected. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region (D-loop region). The arrangement of all genes was identical to the typical mitochondrial genomes of pigeon. The complete mitochondrial genome sequence of feral rock pigeon would serve as an important data set of the germplasm resources for further study.
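
    The A-T figure quoted for the feral rock pigeon mitogenome follows directly from the per-base percentages. The Python sketch below recomputes it as a consistency check; the function name and the 0.5-point tolerance on the total are hypothetical choices, and the percentages are the ones given in the abstract above.

```python
def at_content(composition: dict) -> float:
    """Return the A+T percentage from per-base percentages.

    Raises ValueError if the four percentages do not sum to roughly 100,
    which usually signals a transcription error in the record.
    """
    total = sum(composition[base] for base in "ATCG")
    if abs(total - 100.0) > 0.5:
        raise ValueError(f"base percentages sum to {total:.1f}, expected ~100")
    return round(composition["A"] + composition["T"], 1)


# Base composition (%) of the feral rock pigeon mitogenome, from the abstract.
feral_rock_pigeon = {"A": 30.3, "T": 24.0, "C": 31.9, "G": 13.8}
print(at_content(feral_rock_pigeon))  # 54.3
```

    Published percentages are already rounded, so recomputed totals for other records can differ from the stated A-T value in the last digit.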

  1. The complete mitochondrial genome of the Fancy Pigeon, Columba livia (Columbiformes: Columbidae).

    Science.gov (United States)

    Zhang, Rui-Hua; Xu, Ming-Ju; Wang, Cun-Lian; Xu, Tong; Wei, Dong; Liu, Bao-Jian; Wang, Guo-Hua

    2015-02-01

    The fancy pigeons are domesticated varieties of the rock pigeon developed over many years of selective breeding. In the present work, we report the complete mitochondrial genome sequence of fancy pigeon for the first time. The total length of the mitogenome was 17,233 bp with the base composition of 30.1% for A, 24.0% for T, 31.9% for C, and 14.0% for G and an A-T (54.2 %)-rich feature was detected. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region (D-loop region). The arrangement of all genes was identical to the typical mitochondrial genomes of pigeon. The complete mitochondrial genome sequence of fancy pigeon would serve as an important data set of the germplasm resources for further study.

  2. The complete mitochondrial genome of the ice pigeon (Columba livia breed ice).

    Science.gov (United States)

    Zhang, Rui-Hua; He, Wen-Xiao

    2015-02-01

    The ice pigeon is a breed of fancy pigeon developed over many years of selective breeding. In the present work, we report the complete mitochondrial genome sequence of ice pigeon for the first time. The total length of the mitogenome was 17,236 bp with the base composition of 30.2% for A, 24.0% for T, 31.9% for C, and 13.9% for G and an A-T (54.2 %)-rich feature was detected. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region (D-loop region). The arrangement of all genes was identical to the typical mitochondrial genomes of pigeon. The complete mitochondrial genome sequence of ice pigeon would serve as an important data set of the germplasm resources for further study.

  3. Characterization of the complete mitochondrial genome of the king pigeon (Columba livia breed king).

    Science.gov (United States)

    Zhang, Rui-Hua; He, Wen-Xiao; Xu, Tong

    2015-06-01

    The king pigeon is a breed of pigeon developed over many years of selective breeding primarily as a utility breed. In the present work, we report the complete mitochondrial genome sequence of king pigeon for the first time. The total length of the mitogenome was 17,221 bp with the base composition of 30.14% for A, 24.05% for T, 31.82% for C, and 13.99% for G and an A-T (54.22 %)-rich feature was detected. It harbored 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and one non-coding control region (D-loop region). The arrangement of all genes was identical to the typical mitochondrial genomes of pigeon. The complete mitochondrial genome sequence of king pigeon would serve as an important data set of the germplasm resources for further study.

  4. The complete mitochondrial genome of the Jacobin pigeon (Columba livia breed Jacobin).

    Science.gov (United States)

    He, Wen-Xiao; Jia, Jin-Feng

    2015-06-01

    The Jacobin is a breed of fancy pigeon developed over many years of selective breeding that originated in Asia. In the present work, we report the complete mitochondrial genome sequence of Jacobin pigeon for the first time. The total length of the mitogenome was 17,245 bp with the base composition of 30.18% for A, 23.98% for T, 31.88% for C, and 13.96% for G and an A-T (54.17 %)-rich feature was detected. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region. The arrangement of all genes was identical to the typical mitochondrial genomes of pigeon. The complete mitochondrial genome sequence of Jacobin pigeon would serve as an important data set of the germplasm resources for further study.

  5. Complete genome sequence of Isosphaera pallida type strain (IS1BT)

    Energy Technology Data Exchange (ETDEWEB)

    Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Cleland, David M [ORNL]; Saunders, Elizabeth H [Los Alamos National Laboratory (LANL)]; Lapidus, Alla L. [Joint Genome Institute, Walnut Creek, California]; Nolan, Matt [Joint Genome Institute, Walnut Creek, California]; Lucas, Susan [Joint Genome Institute, Walnut Creek, California]; Hammon, Nancy [Joint Genome Institute, Walnut Creek, California]; Deshpande, Shweta [Joint Genome Institute, Walnut Creek, California]; Cheng, Jan-Fang [Joint Genome Institute, Walnut Creek, California]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [Joint Genome Institute, Walnut Creek, California]; Liolios, Konstantinos [Joint Genome Institute, Walnut Creek, California]; Pagani, Ioanna [Joint Genome Institute, Walnut Creek, California]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [Joint Genome Institute, Walnut Creek, California]; Palaniappan, Krishna [Joint Genome Institute, Walnut Creek, California]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Chang, Yun-Juan [ORNL]; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL)]; Detter, J. Chris [Joint Genome Institute, Walnut Creek, California]; Beck, Brian [ATCC - American Type Culture Collection]; Woyke, Tanja [Joint Genome Institute, Walnut Creek, California]; Bristow, James [Joint Genome Institute, Walnut Creek, California]; Eisen, Jonathan [Joint Genome Institute, Walnut Creek, California]; Markowitz, Victor [Joint Genome Institute, Walnut Creek, California]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [Joint Genome Institute, Walnut Creek, California]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]

    2011-01-01

    Isosphaera pallida (ex Woronichin 1927) Giovannoni et al. 1995 is the type species of the genus Isosphaera. The species is of interest because it was the first heterotrophic bacterium known to be phototactic, and it occupies an isolated phylogenetic position within the Planctomycetaceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of a member of the genus Isosphaera and the third of a member of the family Planctomycetaceae. The 5,472,964 bp long chromosome and the 56,340 bp long plasmid with a total of 3,763 protein-coding and 60 RNA genes are part of the Genomic Encyclopedia of Bacteria and Archaea project.

  6. The Complete Genome of Brucella Suis 019 Provides Insights on Cross-Species Infection

    Directory of Open Access Journals (Sweden)

    Yuanzhi Wang

    2016-01-01

    Brucella species are the most important zoonotic pathogens worldwide and cause considerable harm to humans and animals. In this study, we presented the complete genome of B. suis 019 isolated from sheep (ovine) with epididymitis. B. suis 019 has a rough phenotype and can infect sheep, rhesus monkeys and possibly humans. The comparative genome analysis demonstrated that B. suis 019 is closest to the vaccine strain B. suis bv. 1 str. S2. Further analysis associated the rsh gene to the pathogenicity of B. suis 019, and the WbkA gene to the rough phenotype of B. suis 019. The 019 complete genome data was deposited in the GenBank database with ID PRJNA308608.

  7. Complete genome sequence of Marivirga tractuosa type strain (H-43T)

    Science.gov (United States)

    Pagani, Ioanna; Chertkov, Olga; Lapidus, Alla; Lucas, Susan; Del Rio, Tijana Glavina; Tice, Hope; Copeland, Alex; Cheng, Jan-Fang; Nolan, Matt; Saunders, Elizabeth; Pitluck, Sam; Held, Brittany; Goodwin, Lynne; Liolios, Konstantinos; Ovchinikova, Galina; Ivanova, Natalia; Mavromatis, Konstantinos; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Jeffries, Cynthia D.; Detter, John C.; Han, Cliff; Tapia, Roxanne; Ngatchou-Djao, Olivier D.; Rohde, Manfred; Göker, Markus; Spring, Stefan; Sikorski, Johannes; Woyke, Tanja; Bristow, Jim; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Klenk, Hans-Peter; Kyrpides, Nikos C.

    2011-01-01

    Marivirga tractuosa (Lewin 1969) Nedashkovskaya et al. 2010 is the type species of the genus Marivirga, which belongs to the family Flammeovirgaceae. Members of this genus are of interest because of their gliding motility. The species is of interest because representative strains show resistance to several antibiotics, including gentamicin, kanamycin, neomycin, polymyxin and streptomycin. This is the first complete genome sequence of a member of the family Flammeovirgaceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,511,574 bp long chromosome and the 4,916 bp plasmid with their 3,808 protein-coding and 49 RNA genes are a part of the Genomic Encyclopedia of Bacteria and Archaea project. PMID:21677852

  8. Complete Genome Sequence of Biocontroller Bacillus velezensis Strain JTYP2, Isolated from Leaves of Echeveria laui.

    Science.gov (United States)

    Wang, Beibei; Liu, Hu; Ma, Hailin; Wang, Chengqiang; Liu, Kai; Li, Yuhuan; Hou, Qihui; Ge, Ruofei; Zhang, Tongrui; Liu, Fangchun; Ma, Jinjin; Wang, Yun; Wang, Haide; Xu, Baochao; Yao, Gan; Xu, Wenfeng; Fan, Lingchao; Ding, Yanqin; Du, Binghai

    2017-06-15

    Bacillus velezensis JTYP2 was isolated from the leaves of Echeveria laui in Qingzhou, China, and may control some of the fungal pathogens of the plant. Here, we present the complete genome sequence of B. velezensis JTYP2. Several gene clusters related to its biosynthesis of antimicrobial compounds were predicted. Copyright © 2017 Wang et al.

  9. Complete Genome Sequences of Two Escherichia coli O145:H28 Outbreak Strains of Food Origin

    OpenAIRE

    Cooper, Kerry K.; Mandrell, Robert E.; Louie, Jacqueline W.; Korlach, Jonas; Clark, Tyson A.; Parker, Craig T.; Huynh, Steven; Chain, Patrick S. G.; Ahmed, Sanaa; Carter, Michelle Qiu

    2014-01-01

    Escherichia coli O145:H28 strain RM12581 was isolated from bagged romaine lettuce during a 2010 U.S. lettuce-associated outbreak. E. coli O145:H28 strain RM12761 was isolated from ice cream during a 2007 ice cream-associated outbreak in Belgium. Here we report the complete genome sequences and annotation of both strains.

  10. Complete Genome Sequence of the Probiotic Strain Lactobacillus salivarius LPM01.

    Science.gov (United States)

    Chenoll, Empar; Codoñer, Francisco M; Martinez-Blanch, Juan F; Acevedo-Piérart, Marcelo; Ormeño, M Loreto; Ramón, Daniel; Genovés, Salvador

    2016-11-23

    Lactobacillus salivarius LPM01 (DSM 22150) is a probiotic strain able to improve health status in immunocompromised people. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insights into its functional activity and safety assessment. Copyright © 2016 Chenoll et al.

  11. Complete Genome Sequence of the Halophilic Methylotrophic Methanogen Archaeon Methanohalophilus portucalensis Strain FDF-1T

    KAUST Repository

    L’Haridon, Stéphane

    2018-01-17

    We report here the complete genome sequence (2.08 Mb) of Methanohalophilus portucalensis strain FDF-1T, a halophilic methylotrophic methanogen isolated from the sediment of a saltern in Figueira da Foz, Portugal. The average nucleotide identity and DNA-DNA hybridization analyses show that Methanohalophilus mahii, M. halophilus, and M. portucalensis are three different species within the Methanosarcinaceae family.

  12. Identification and Complete Genome Sequence Analysis of a Genotype XIV Newcastle Disease Virus from Nigeria

    OpenAIRE

    Shittu, Ismaila; Sharma, Poonam; Volkening, Jeremy D.; Solomon, Ponman; Sulaiman, Lanre K.; Joannis, Tony M.; Williams-Coplin, Dawn; Miller, Patti J.; Dimitrov, Kiril M.; Afonso, Claudio L.

    2016-01-01

    The first complete genome sequence of a strain of Newcastle disease virus (NDV) from genotype XIV is reported here. Strain duck/Nigeria/NG-695/KG.LOM.11-16/2009 was isolated from an apparently healthy domestic duck from a live bird market in Kogi State, Nigeria, in 2009. This strain is classified as a member of subgenotype XIVb of class II.

  13. Complete genome sequence of a tomato infecting tomato mottle mosaic virus in New York

    Science.gov (United States)

    The complete genome sequence of an emerging isolate of tomato mottle mosaic virus (ToMMV) infecting experimental Nicotiana benthamiana plants in upstate New York was obtained using small RNA deep sequencing. ToMMV_NY-13 shared 99% sequence identity with ToMMV isolates from Mexico and Florida. Broader d...

  14. Complete genome sequence of the bioleaching bacterium Leptospirillum sp. group II strain CF-1.

    Science.gov (United States)

    Ferrer, Alonso; Bunk, Boyke; Spröer, Cathrin; Biedendieck, Rebekka; Valdés, Natalia; Jahn, Martina; Jahn, Dieter; Orellana, Omar; Levicán, Gloria

    2016-03-20

    We describe the complete genome sequence of Leptospirillum sp. group II strain CF-1, an acidophilic bioleaching bacterium isolated from acid mine drainage (AMD). This work provides data for gaining insights into the adaptive response of Leptospirillum spp. to the extreme conditions of bioleaching environments. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. The complete mitochondrial genome of the gray garden slug Deroceras reticulatum (Gastropoda: Pulmonata: Stylommatophora)

    Science.gov (United States)

    The complete circular mitochondrial genome of D. reticulatum is 14,048 bp in length, consisting of 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, and 2 ribosomal RNA (rRNA) genes (GenBank accession number: KY765589). The overall base composition was 31.0 % A, 12.2 % C, 17.7 % G and 39...

  16. The complete mitochondrial genome and phylogenetic position of the Philippines spurdog, Squalus montalbani.

    Science.gov (United States)

    Kemper, Jenny M; Naylor, Gavin J P

    2016-11-01

    We present the complete mitochondrial genome sequence (16 555 bp) of the Philippines spurdog, Squalus montalbani, currently listed as Vulnerable due to population declines and fishing pressures. A phylogenetic analysis was carried out on S. montalbani and representative shark mitogenomes. Squalus montalbani was placed within the Squaliformes as a sister taxon to Squalus acanthias and Cirrhigaleus australis.

  17. Identification and complete genome analysis of novel picornavirus in bovine in Japan

    DEFF Research Database (Denmark)

    Nagai, Makoto; Omatsu, Tsutomu; Aoki, Hiroshi

    2015-01-01

    We identified novel viruses in feces from cattle with diarrhea collected in 2009 in Hokkaido Prefecture, Japan, by using a metagenomics approach and determined the (near) complete sequences of the virus. Sequence analyses revealed that they had a standard picornavirus genome organization, i.e. 5'...

  18. Complete genome sequence of currant latent virus (genus Cheravirus, family Secoviridae)

    Czech Academy of Sciences Publication Activity Database

    Petrzik, Karel; Koloniuk, Igor; Přibylová, Jaroslava; Špak, Josef

    2016-01-01

    Vol. 161, No. 2 (2016), pp. 491-493. ISSN 0304-8608. Institutional support: RVO:60077344. Keywords: stranded RNA; complete genome sequence; currant latent virus. Subject RIV: EE - Microbiology, Virology. Impact factor: 2.058 (2016).

  19. Complete genome sequences of blueberry red ringspot virus (Caulimoviridae) isolates from the Czech Republic and Slovenia

    Czech Academy of Sciences Publication Activity Database

    Petrzik, Karel; Přibylová, Jaroslava; Mavrič-Pleško, I.; Špak, Josef

    2011-01-01

    Vol. 156, No. 10 (2011), pp. 1901-1903. ISSN 0304-8608. Institutional research plan: CEZ:AV0Z50510513. Keywords: complete genome; blueberry virus; highbush blueberry. Subject RIV: EE - Microbiology, Virology. Impact factor: 2.111 (2011).

  20. Complete DNA sequence of the linear mitochondrial genome of the pathogenic yeast Candida parapsilosis

    DEFF Research Database (Denmark)

    Nosek, J.; Novotna, M.; Hlavatovicova, Z.

    2004-01-01

    The complete sequence of the mitochondrial DNA of the opportunistic yeast pathogen Candida parapsilosis was determined. The mitochondrial genome is represented by linear DNA molecules terminating with tandem repeats of a 738-bp unit. The number of repeats varies, thus generating a population...

  1. Complete Genome Sequence of Methylobacterium populi P-1M, Isolated from Pink-Pigmented Household Biofilm

    OpenAIRE

    Morohoshi, Tomohiro; Ikeda, Tsukasa

    2016-01-01

    Methylobacterium populi P-1M was isolated from a pink-pigmented household biofilm. Here, we present the complete genome sequence of P-1M, consisting of one chromosome of 5,705,640 bp and five plasmids of 64,864 bp, 59,879 bp, 42,569 bp, 41,417 bp, and 29,506 bp.

  2. Complete Whole-Genome Sequence of Salmonella enterica subsp. enterica Serovar Java NCTC5706.

    Science.gov (United States)

    Fazal, Mohammed-Abbas; Alexander, Sarah; Burnett, Edward; Deheer-Graham, Ana; Oliver, Karen; Holroyd, Nancy; Parkhill, Julian; Russell, Julie E

    2016-11-03

    Salmonellae are a significant cause of morbidity and mortality globally. Here, we report the first complete genome sequence for Salmonella enterica subsp. enterica serovar Java strain NCTC5706. This strain is of historical significance, having been isolated in the pre-antibiotic era and deposited into the National Collection of Type Cultures in 1939. © Crown copyright 2016.

  3. Complete Genome Sequence of the Quality Control Strain Staphylococcus aureus subsp. aureus ATCC 25923.

    Science.gov (United States)

    Treangen, Todd J; Maybank, Rosslyn A; Enke, Sana; Friss, Mary Beth; Diviak, Lynn F; Karaolis, David K R; Koren, Sergey; Ondov, Brian; Phillippy, Adam M; Bergman, Nicholas H; Rosovitz, M J

    2014-11-06

    Staphylococcus aureus subsp. aureus ATCC 25923 is commonly used as a control strain for susceptibility testing to antibiotics and as a quality control strain for commercial products. We present the completed genome sequence for the strain, consisting of the chromosome and a 27.5-kb plasmid. Copyright © 2014 Treangen et al.

  4. Complete Genome Sequence of the Halophilic Methylotrophic Methanogen Archaeon Methanohalophilus portucalensis Strain FDF-1T

    KAUST Repository

    L’Haridon, Stéphane; Corre, Erwan; Guan, Yue; Vinu, Manikandan; La Cono, Violetta; Yakimov, Michail; Stingl, Ulrich; Toffin, Laurent; Jebbar, Mohamed

    2018-01-01

    We report here the complete genome sequence (2.08 Mb) of Methanohalophilus portucalensis strain FDF-1T, a halophilic methylotrophic methanogen isolated from the sediment of a saltern in Figueira da Foz, Portugal. The average nucleotide identity and DNA-DNA hybridization analyses show that Methanohalophilus mahii, M. halophilus, and M. portucalensis are three different species within the Methanosarcinaceae family.

  5. Complete genome sequences of Escherichia coli strains 1303 and ECC-1470 isolated from bovine mastitis

    NARCIS (Netherlands)

    Leimbach, Andreas; Poehlein, Anja; Witten, Anika; Scheutz, Flemming; Schukken, Ynte; Daniel, Rolf; Dobrindt, Ulrich

    2016-01-01

    Escherichia coli is the leading causative agent of acute bovine mastitis. Here, we report the complete genome sequence of E. coli O70:H32 strain 1303, isolated from an acute case of bovine mastitis, and E. coli Ont:Hnt strain ECC-1470, isolated from a persistent infection.

  6. Complete Genome Sequence of Porcine Parvovirus N Strain Isolated from Guangxi, China

    OpenAIRE

    Su, Qian-Lian; Li, Bin; Zhao, Wu; Liang, Jia-Xing; He, Ying; Qin, Yi-Bin; Lu, Bing-Xia

    2015-01-01

    We report here the complete genomic sequence of the porcine parvovirus (PPV) N strain, isolated in 1989 from the viscera of a stillborn fetus farrowed by a gilt in Guangxi, southern China. Phylogenetic analyses suggest that the PPV-N strain is closely related to attenuated PPV NADL-2 strains. The PPV-N strain has good immunogenicity, genetic stability, and safety.

  7. On the representability of complete genomes by multiple competing finite-context (Markov) models.

    Directory of Open Access Journals (Sweden)

    Armando J Pinho

    A finite-context (Markov) model of order k yields the probability distribution of the next symbol in a sequence of symbols, given the recent past up to depth k. Markov modeling has long been applied to DNA sequences, for example to find gene-coding regions. With the first studies came the discovery that DNA sequences are non-stationary: distinct regions require distinct model orders. Since then, Markov and hidden Markov models have been extensively used to describe the gene structure of prokaryotes and eukaryotes. However, to our knowledge, a comprehensive study about the potential of Markov models to describe complete genomes is still lacking. We address this gap in this paper. Our approach relies on (i) multiple competing Markov models of different orders, (ii) careful programming techniques that allow orders as large as sixteen, (iii) adequate inverted repeat handling, and (iv) probability estimates suited to the wide range of context depths used. To measure how well a model fits the data at a particular position in the sequence, we use the negative logarithm of the probability estimate at that position. The measure yields information profiles of the sequence, which are of independent interest. The average over the entire sequence, which amounts to the average number of bits per base needed to describe the sequence, is used as a global performance measure. Our main conclusion is that, from the probabilistic or information theoretic point of view and according to this performance measure, multiple competing Markov models explain entire genomes almost as well as or even better than state-of-the-art DNA compression methods, such as XM, which rely on very different statistical models. This is surprising, because Markov models are local (short-range), contrasting with the statistical models underlying other methods, where the extensive data repetitions in DNA sequences are explored, and which therefore have a non-local character.
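
    The per-position measure described in this abstract (negative logarithm of the probability estimate, averaged into bits per base) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses a single adaptive order-k model rather than multiple competing models, omits inverted-repeat handling, and assumes a simple additive estimator. The function names and the smoothing parameter `alpha` are hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


def information_profile(seq, k, alpha=1.0, alphabet="ACGT"):
    """Return the cost in bits of each symbol under an adaptive order-k model.

    Assumption: probabilities are estimated from the counts seen so far with
    additive smoothing (alpha); the abstract does not specify the estimator.
    """
    counts = defaultdict(lambda: defaultdict(int))  # context -> symbol -> count
    totals = defaultdict(int)                       # context -> total count
    profile = []
    for i, sym in enumerate(seq):
        # The context is the previous k symbols (shorter near the start).
        ctx = seq[max(0, i - k):i]
        p = (counts[ctx][sym] + alpha) / (totals[ctx] + alpha * len(alphabet))
        profile.append(-math.log2(p))
        # Update the model only after the symbol has been scored.
        counts[ctx][sym] += 1
        totals[ctx] += 1
    return profile


def bits_per_base(seq, k, alpha=1.0):
    """Average of the information profile: the global performance measure."""
    profile = information_profile(seq, k, alpha)
    return sum(profile) / len(profile)
```

    For a strongly repetitive toy sequence such as "ACGT" repeated many times, an order-2 model of this kind needs far fewer bits per base than an order-0 model, which is the sense in which distinct regions can favor distinct model orders.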

  8. Using Genomics for Natural Product Structure Elucidation.

    Science.gov (United States)

    Tietz, Jonathan I; Mitchell, Douglas A

    2016-01-01

    Natural products (NPs) are the most historically bountiful source of chemical matter for drug development, especially for anti-infectives. With insights gleaned from genome mining, interest in natural product discovery has been reinvigorated. An essential stage in NP discovery is structural elucidation, which sheds light not only on the chemical composition of a molecule but also its novelty, properties, and derivatization potential. The history of structure elucidation is replete with technique-based revolutions: combustion analysis, crystallography, UV, IR, MS, and NMR have each provided game-changing advances; the latest such advance is genomics. All natural products have a genetic basis, and the ability to obtain and interpret genomic information for structure elucidation is increasingly available at low cost to non-specialists. In this review, we describe the value of genomics as a structural elucidation technique, especially from the perspective of the natural product chemist approaching an unknown metabolite. Herein we first introduce the databases and programs of interest to the natural products chemist, with an emphasis on those currently most suited for general usability. We describe strategies for linking observed natural product-linked phenotypes to their corresponding gene clusters. We then discuss techniques for extracting structural information from genes, illustrated with numerous case examples. We also provide an analysis of the biases and limitations of the field with recommendations for future development. Our overview is not only aimed at biologically-oriented researchers already at ease with bioinformatic techniques, but also, in particular, at natural product, organic, and/or medicinal chemists not previously familiar with genomic techniques.

  9. The Complete Mitochondrial Genome of Corizus tetraspilus (Hemiptera: Rhopalidae) and Phylogenetic Analysis of Pentatomomorpha

    Science.gov (United States)

    Guo, Zhong-Long; Wang, Juan; Shen, Yu-Ying

    2015-01-01

    Insect mitochondrial genomes (mitogenomes) are the most extensively used genetic information for molecular evolution, phylogenetics and population genetics. Pentatomomorpha (>14,000 species) is the second largest infraorder of Heteroptera and of great economic importance. To better understand the diversity and phylogeny within Pentatomomorpha, we sequenced and annotated the complete mitogenome of Corizus tetraspilus (Hemiptera: Rhopalidae), an important pest of alfalfa in China. We analyzed the main features of the C. tetraspilus mitogenome, and provided a comparative analysis with four other Coreoidea species. Our results reveal that gene content, gene arrangement, nucleotide composition, codon usage, rRNA structures and sequences of mitochondrial transcription termination factor are conserved in Coreoidea. Comparative analysis shows that different protein-coding genes have been subject to different evolutionary rates correlated with the G+C content. All the transfer RNA genes found in Coreoidea have the typical clover leaf secondary structure, except for trnS1 (AGN), which lacks the dihydrouridine (DHU) arm and possesses an unusual anticodon stem (9 bp vs. the normal 5 bp). The control regions (CRs) among Coreoidea are highly variable in size, of which the CR of C. tetraspilus is the smallest (440 bp), making the C. tetraspilus mitogenome the smallest (14,989 bp) within all completely sequenced Coreoidea mitogenomes. No conserved motifs are found in the CRs of Coreoidea. In addition, the A+T content (60.68%) of the CR of C. tetraspilus is much lower than that of the entire mitogenome (74.88%), and is lowest among Coreoidea. Phylogenetic analyses based on mitogenomic data support the monophyly of each superfamily within Pentatomomorpha, and recognize a phylogenetic relationship of (Aradoidea + (Pentatomoidea + (Lygaeoidea + (Pyrrhocoroidea + Coreoidea)))). PMID:26042898

  10. Interrogating the druggable genome with structural informatics.

    Science.gov (United States)

    Hambly, Kevin; Danzer, Joseph; Muskal, Steven; Debe, Derek A

    2006-08-01

    Structural genomics projects are producing protein structure data at an unprecedented rate. In this paper, we present the Target Informatics Platform (TIP), a novel structural informatics approach for amplifying the rapidly expanding body of experimental protein structure information to enhance the discovery and optimization of small molecule protein modulators on a genomic scale. In TIP, existing experimental structure information is augmented using a homology modeling approach, and binding sites across multiple target families are compared using a clique detection algorithm. We report here a detailed analysis of the structural coverage for the set of druggable human targets, highlighting drug target families where the level of structural knowledge is currently quite high, as well as those areas where structural knowledge is sparse. Furthermore, we demonstrate the utility of TIP's intra- and inter-family binding site similarity analysis using a series of retrospective case studies. Our analysis underscores the utility of a structural informatics infrastructure for extracting drug discovery-relevant information from structural data, aiding researchers in the identification of lead discovery and optimization opportunities as well as potential "off-target" liabilities.

  11. Complete genome of Phenylobacterium zucineum – a novel facultative intracellular bacterium isolated from human erythroleukemia cell line K562

    Directory of Open Access Journals (Sweden)

    Sun Jie

    2008-08-01

    Background: Phenylobacterium zucineum is a recently identified facultative intracellular species isolated from the human leukemia cell line K562. Unlike the known intracellular pathogens, P. zucineum maintains a stable association with its host cell without affecting the growth and morphology of the latter. Results: Here, we report the whole genome sequence of the type strain HLK1T. The genome consists of a circular chromosome (3,996,255 bp) and a circular plasmid (382,976 bp). It encodes 3,861 putative proteins, 42 tRNAs, and a 16S-23S-5S rRNA operon. Comparative genomic analysis revealed that it is phylogenetically closest to Caulobacter crescentus, a model species for cell cycle research. Notably, P. zucineum has a gene that is strikingly similar, both structurally and functionally, to the cell cycle master regulator CtrA of C. crescentus, and most of the genes directly regulated by CtrA in the latter have orthologs in the former. Conclusion: This work presents the first complete bacterial genome in the genus Phenylobacterium. Comparative genomic analysis indicated that the CtrA regulon is well conserved between C. crescentus and P. zucineum.

  12. Complete genome sequence of Capnocytophaga ochracea type strain (VPI 2845T)

    Energy Technology Data Exchange (ETDEWEB)

    Mavromatis, Konstantinos; Gronow, Sabine; Saunders, Elizabeth; Land, Miriam; Lapidus, Alla; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Lucas, Susan; Chen, Feng; Tice, Hope; Cheng, Jan-Fang; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Pati, Amrita; Ivanova, Natalia; Chen, Amy; Palaniappan, Krishna; Chain, Patrick; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia C.; Brettin, Thomas; Detter, John C.; Han, Cliff; Bristow, James; Goker, Markus; Rohde, Manfred; Eisen, Jonathan A.; Markowitz, Victor; Kyrpides, Nikos C.; Klenk, Hans-Peter; Hugenholtz, Philip

    2009-05-20

    Capnocytophaga ochracea (Prevot et al. 1956) Leadbetter et al. 1982 is the type species of the genus Capnocytophaga. It is of interest because of its location in the Flavobacteriaceae, a genomically yet uncharted family within the order Flavobacteriales. The species grows as fusiform to rod shaped cells which tend to form clumps and are able to move by gliding. C. ochracea is known as a capnophilic organism with the ability to grow under anaerobic as well as under aerobic conditions (oxygen concentration larger than 15%), here only in the presence of 5% CO2. Strain VPI 2845T, the type strain of the species, is portrayed in this report as a gliding, Gram-negative bacterium, originally isolated from a human oral cavity. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first completed genome sequence from the flavobacterial genus Capnocytophaga, and the 2,612,925 bp long single replicon genome with its 2193 protein-coding and 59 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  13. Complete genome sequence of Hydrogenobacter thermophilus type strain (TK-6T)

    Energy Technology Data Exchange (ETDEWEB)

    Zeytun, Ahmet [Los Alamos National Laboratory (LANL); Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Nolan, Matt [Joint Genome Institute, Walnut Creek, California; Lapidus, Alla L. [Joint Genome Institute, Walnut Creek, California; Lucas, Susan [Joint Genome Institute, Walnut Creek, California; Han, James [Joint Genome Institute; Tice, Hope [Joint Genome Institute, Walnut Creek, California; Cheng, Jan-Fang [Joint Genome Institute, Walnut Creek, California; Tapia, Roxanne [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Sam [Joint Genome Institute, Walnut Creek, California; Liolios, Konstantinos [Joint Genome Institute, Walnut Creek, California; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Ovchinnikova, Galina [U.S. Department of Energy, Joint Genome Institute; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute; Chen, Amy [Joint Genome Institute, Walnut Creek, California; Palaniappan, Krishna [Joint Genome Institute, Walnut Creek, California; Ngatchou, Olivier Duplex [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Chang, Yun-Juan [ORNL; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL); Han, Cliff [Los Alamos National Laboratory (LANL); Detter, J. Chris [Joint Genome Institute, Walnut Creek, California; Ubler, Susanne [Universitat Regensburg, Regensburg, Germany; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Tindall, Brian [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Wirth, Reinhard [Universitat Regensburg, Regensburg, Germany; Woyke, Tanja [Joint Genome Institute, Walnut Creek, California; Bristow, James [Joint Genome Institute, Walnut Creek, California; Eisen, Jonathan [Joint Genome Institute, Walnut Creek, California; Markowitz, Victor [Joint Genome Institute, Walnut Creek, California; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Kyrpides, Nikos C [Joint Genome Institute, Walnut Creek, California

    2011-01-01

    Hydrogenobacter thermophilus Kawasumi et al. 1984 is the type species of the genus Hydrogenobacter. H. thermophilus was the first obligate autotrophic organism reported among aerobic hydrogen-oxidizing bacteria. Strain TK-6T is of interest because of the unusually efficient hydrogen-oxidizing ability of this strain, which results in a faster generation time compared to other autotrophs. It is also able to grow anaerobically using nitrate as an electron acceptor when molecular hydrogen is used as the energy source, and able to aerobically fix CO2 via the reductive tricarboxylic acid cycle. This is the fifth completed genome sequence in the family Aquificaceae, and the second genome sequence determined from a strain derived from the original isolate. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 1,742,932 bp long genome with its 1,899 protein-coding and 49 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  14. Complete genome sequence of Capnocytophaga ochracea type strain (VPI 2845T)

    Energy Technology Data Exchange (ETDEWEB)

    Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Gronow, Sabine [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Saunders, Elizabeth H [Los Alamos National Laboratory (LANL); Land, Miriam L [ORNL; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Copeland, A [U.S. Department of Energy, Joint Genome Institute; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute; Chen, Feng [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Chen, Amy [U.S. Department of Energy, Joint Genome Institute; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute; Chain, Patrick S. G. [Lawrence Livermore National Laboratory (LLNL); Hauser, Loren John [ORNL; Chang, Yun-Juan [ORNL; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL); Brettin, Thomas S [ORNL; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Han, Cliff [Los Alamos National Laboratory (LANL); Bristow, James [U.S. Department of Energy, Joint Genome Institute; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute

    2009-01-01

    Capnocytophaga ochracea (Prévot et al. 1956) Leadbetter et al. 1982 is the type species of the genus Capnocytophaga. It is of interest because of its location in the Flavobacteriaceae, a genomically not yet charted family within the order Flavobacteriales. The species grows as fusiform to rod shaped cells which tend to form clumps and are able to move by gliding. C. ochracea is known as a capnophilic (CO2-requiring) organism with the ability to grow under anaerobic as well as aerobic conditions (oxygen concentration larger than 15%), here only in the presence of 5% CO2. Strain VPI 2845T, the type strain of the species, is portrayed in this report as a gliding, Gram-negative bacterium, originally isolated from a human oral cavity. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first completed genome sequence from the flavobacterial genus Capnocytophaga, and the 2,612,925 bp long single replicon genome with its 2193 protein-coding and 59 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  15. Characterization of the complete mitochondrial genome of Khawia sinensis belongs among platyhelminths, cestodes.

    Science.gov (United States)

    Feng, Yan; Feng, Han-Li; Fang, Yi-Hui; Su, Ying-Bing

    2017-06-01

    Khawia sinensis is an important species in freshwater fish causing considerable economic losses to the breeding industry. This is the first mt genome of a caryophyllidean cestode characterised. The entire mt genome of K. sinensis is 13,759 bp in length. This mt genome contains 12 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes and two non-coding regions. The arrangement of the K. sinensis mt genome is the same as in other tapeworms; however, the incomplete stop codon (A) is more frequent than in other species. Phylogenetic analyses based on concatenated amino-acid sequences of the 12 protein-coding genes of 17 tapeworms including K. sinensis were conducted to assess the relationship of K. sinensis with other species; the results indicated that K. sinensis is closely related to cestode species. This complete mt genome of K. sinensis will enrich the mitochondrial genome databases of tapeworms and provide important molecular markers for ecology, diagnostics, population variation and evolution of K. sinensis and other species. Copyright © 2017 Elsevier Inc. All rights reserved.

  16. Complete genome analysis of two new bacteriophages isolated from impetigo strains of Staphylococcus aureus.

    Science.gov (United States)

    Botka, Tibor; Růžičková, Vladislava; Konečná, Hana; Pantůček, Roman; Rychlík, Ivan; Zdráhal, Zbyněk; Petráš, Petr; Doškař, Jiří

    2015-08-01

    Exfoliative toxin A (ETA)-coding temperate bacteriophages are leading contributors to the toxic phenotype of impetigo strains of Staphylococcus aureus. Two distinct eta gene-positive bacteriophages isolated from S. aureus strains which recently caused massive outbreaks of pemphigus neonatorum in Czech maternity hospitals were characterized. The phages, designated ϕB166 and ϕB236, were able to transfer the eta gene into a prophageless S. aureus strain which afterwards converted into an ETA producer. Complete phage genome sequences were determined, and a comparative analysis of five designed genomic regions revealed major variances between them. They differed in the genome size, number of open reading frames, genome architecture, and virion protein patterns. Their high mutual sequence similarity was detected only in the terminal regions of the genome. When compared with the so far described eta phage genomes, noticeable differences were found. Thus, both phages represent two new lineages of as yet not characterized bacteriophages of the Siphoviridae family having impact on pathogenicity of impetigo strains of S. aureus.

  17. The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.

    Science.gov (United States)

    Raveendar, Sebastin; Na, Young-Wang; Lee, Jung-Ro; Shim, Donghwan; Ma, Kyung-Ho; Lee, Sok-Young; Chung, Jong-Wook

    2015-07-20

    Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of the LSC, SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeats (SSRs) and 48 insertion or deletion variants were found by sequence alignment of Capsicum cp genomes. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.

  18. Complete Sequence and Analysis of Coconut Palm (Cocos nucifera) Mitochondrial Genome.

    Science.gov (United States)

    Aljohi, Hasan Awad; Liu, Wanfei; Lin, Qiang; Zhao, Yuhui; Zeng, Jingyao; Alamer, Ali; Alanazi, Ibrahim O; Alawad, Abdullah O; Al-Sadi, Abdullah M; Hu, Songnian; Yu, Jun

    2016-01-01

    Coconut (Cocos nucifera L.), a member of the palm family (Arecaceae), is one of the most economically important crops in the tropics, serving as an important source of food, drink, fuel, medicine, and construction material. Here we report an assembly of the coconut (C. nucifera, Oman local Tall cultivar) mitochondrial (mt) genome based on next-generation sequencing data. This genome, 678,653 bp in length and 45.5% in GC content, encodes 72 proteins, 9 pseudogenes, 23 tRNAs, and 3 ribosomal RNAs. Within the assembly, we find that the chloroplast (cp) derived regions account for 5.07% of the total assembly length, including 13 proteins, 2 pseudogenes, and 11 tRNAs. The mt genome has a relatively large fraction of repeat content (17.26%), including both forward (tandem) and inverted (palindromic) repeats. Sequence variation analysis shows that the Ti/Tv ratio of the mt genome is lower as compared to that of the nuclear genome and neutral expectation. By combining public RNA-Seq data for coconut, we identify 734 RNA editing sites supported by at least two datasets. In summary, our data provide the second complete mt genome sequence in the family Arecaceae, essential for further investigations on mitochondrial biology of seed plants.

  19. Equid herpesvirus 8: Complete genome sequence and association with abortion in mares

    Science.gov (United States)

    Garvey, Marie; Suárez, Nicolás M.; Kerr, Karen; Hector, Ralph; Moloney-Quinn, Laura; Arkins, Sean; Davison, Andrew J.

    2018-01-01

    Equid herpesvirus 8 (EHV-8), formerly known as asinine herpesvirus 3, is an alphaherpesvirus that is closely related to equid herpesviruses 1 and 9 (EHV-1 and EHV-9). The pathogenesis of EHV-8 is relatively little studied and to date has only been associated with respiratory disease in donkeys in Australia and horses in China. A single EHV-8 genome sequence has been generated for strain Wh in China, but is apparently incomplete and contains frameshifts in two genes. In this study, the complete genome sequences of four EHV-8 strains isolated in Ireland between 2003 and 2015 were determined by Illumina sequencing. Two of these strains were isolated from cases of abortion in horses, and were misdiagnosed initially as EHV-1, and two were isolated from donkeys, one with neurological disease. The four genome sequences are very similar to each other, exhibiting greater than 98.4% nucleotide identity, and their phylogenetic clustering together demonstrated that genomic diversity is not dependent on the host. Comparative genomic analysis revealed 24 of the 76 predicted protein sequences are completely conserved among the Irish EHV-8 strains. Evolutionary comparisons indicate that EHV-8 is phylogenetically closer to EHV-9 than it is to EHV-1. In summary, the first complete genome sequences of EHV-8 isolates from two host species over a twelve year period are reported. The current study suggests that EHV-8 can cause abortion in horses. The potential threat of EHV-8 to the horse industry and the possibility that donkeys may act as reservoirs of infection warrant further investigation. PMID:29414990

  20. Complete Genome Sequence of an Avian Metapneumovirus Subtype A Strain Isolated from Chicken (Gallus gallus) in Brazil.

    Science.gov (United States)

    Rizotto, Laís S; Scagion, Guilherme P; Cardoso, Tereza C; Simão, Raphael M; Caserta, Leonardo C; Benassi, Julia C; Keid, Lara B; Oliveira, Trícia M F de S; Soares, Rodrigo M; Arns, Clarice W; Van Borm, Steven; Ferreira, Helena L

    2017-07-20

    We report here the complete genome sequence of an avian metapneumovirus (aMPV) isolated from a tracheal tissue sample of a commercial layer flock. The complete genome sequence of aMPV-A/chicken/Brazil-SP/669/2003 was obtained using MiSeq (Illumina, Inc.) sequencing. Phylogenetic analysis of the complete genome classified the isolate as avian metapneumovirus subtype A. Copyright © 2017 Rizotto et al.

  1. A complete mitochondrial genome sequence from a mesolithic wild aurochs (Bos primigenius).

    Directory of Open Access Journals (Sweden)

    Ceiridwen J Edwards

    BACKGROUND: The derivation of domestic cattle from the extinct wild aurochs (Bos primigenius) has been well-documented by archaeological and genetic studies. Genetic studies point towards the Neolithic Near East as the centre of origin for Bos taurus, with some lines of evidence suggesting possible, albeit rare, genetic contributions from locally domesticated wild aurochsen across Eurasia. Inferences from these investigations have been based largely on the analysis of partial mitochondrial DNA sequences generated from modern animals, with limited sequence data from ancient aurochsen samples. Recent developments in DNA sequencing technologies, however, are affording new opportunities for the examination of genetic material retrieved from extinct species, providing new insight into their evolutionary history. Here we present DNA sequence analysis of the first complete mitochondrial genome (16,338 base pairs) from an archaeologically-verified and exceptionally-well preserved aurochs bone sample. METHODOLOGY: DNA extracts were generated from an aurochs humerus bone sample recovered from a cave site located in Derbyshire, England and radiocarbon-dated to 6,738+/-68 calibrated years before present. These extracts were prepared for both Sanger and next generation DNA sequencing technologies (Illumina Genome Analyzer). In total, 289.9 megabases (22.48%) of the post-filtered DNA sequences generated using the Illumina Genome Analyzer from this sample mapped with confidence to the bovine genome. A consensus B. primigenius mitochondrial genome sequence was constructed and was analysed alongside all available complete bovine mitochondrial genome sequences. CONCLUSIONS: For all nucleotide positions where both Sanger and Illumina Genome Analyzer sequencing methods gave high-confidence calls, no discrepancies were observed. Sequence analysis reveals evidence of heteroplasmy in this sample and places this mitochondrial genome sequence securely within a previously

  2. Complete genome sequence of Brachyspira intermedia reveals unique genomic features in Brachyspira species and phage-mediated horizontal gene transfer

    Science.gov (United States)

    2011-01-01

    Background: Brachyspira spp. colonize the intestines of some mammalian and avian species and show different degrees of enteropathogenicity. Brachyspira intermedia can cause production losses in chickens and strain PWS/AT now becomes the fourth genome to be completed in the genus Brachyspira. Results: 15 classes of unique and shared genes were analyzed in B. intermedia, B. murdochii, B. hyodysenteriae and B. pilosicoli. The largest number of unique genes was found in B. intermedia and B. murdochii. This indicates the presence of larger pan-genomes. In general, hypothetical protein annotations are overrepresented among the unique genes. A 3.2 kb plasmid was found in B. intermedia strain PWS/AT. The plasmid was also present in the B. murdochii strain but not in nine other Brachyspira isolates. Within the Brachyspira genomes, genes had been translocated and also frequently switched between leading and lagging strands, a process that can be followed by different AT-skews in the third positions of synonymous codons. We also found evidence that bacteriophages were being remodeled and genes incorporated into them. Conclusions: The accessory gene pool shapes species-specific traits. It is also influenced by reductive genome evolution and horizontal gene transfer. Gene-transfer events can cross both species and genus boundaries and bacteriophages appear to play an important role in this process. A mechanism for horizontal gene transfer appears to be gene translocations leading to remodeling of bacteriophages in combination with broad tropism. PMID:21816042

  3. Completed sequence and corrected annotation of the genome of maize Iranian mosaic virus.

    Science.gov (United States)

    Ghorbani, Abozar; Izadpanah, Keramatollah; Dietzgen, Ralf G

    2018-03-01

    Maize Iranian mosaic virus (MIMV) is a negative-sense single-stranded RNA virus that is classified in the genus Nucleorhabdovirus, family Rhabdoviridae. The MIMV genome contains six open reading frames (ORFs) that encode in 3′ to 5′ order the nucleocapsid protein (N), phosphoprotein (P), putative movement protein (P3), matrix protein (M), glycoprotein (G) and RNA-dependent RNA polymerase (L). In this study, we determined the first complete genome sequence of MIMV using Illumina RNA-Seq and 3′/5′ RACE. The MIMV genome ('Fars' isolate) is 12,426 nucleotides in length. Unexpectedly, the predicted N gene ORF of this isolate and of four other Iranian isolates is 143 nucleotides shorter than that of the MIMV coding-complete reference isolate 'Shiraz 1' (GenBank NC_011542), possibly due to a minor error in the previous sequence. Genetic variability among the N, P, P3 and G ORFs of Iranian MIMV isolates was limited, but highest in the G gene ORF. Phylogenetic analysis of complete nucleorhabdovirus genomes demonstrated a close evolutionary relationship between MIMV, maize mosaic virus and taro vein chlorosis virus.

  4. Characterization of the complete mitochondrial genomes of Nematodirus oiratianus and Nematodirus spathiger of small ruminants.

    Science.gov (United States)

    Zhao, Guang-Hui; Jia, Yan-Qing; Cheng, Wen-Yu; Zhao, Wen; Bian, Qing-Qing; Liu, Guo-Hua

    2014-07-11

    Nematodirus spp. are among the most common nematodes of ruminants worldwide. N. oiratianus and N. spathiger are distributed worldwide as highly prevalent gastrointestinal nematodes, which cause emerging health problems and economic losses. Accurate identification of Nematodirus species is essential to develop effective control strategies for Nematodirus infection in ruminants. Mitochondrial DNA (mtDNA) could provide powerful genetic markers for identifying these closely related species and resolving phylogenetic relationships at different taxonomic levels. In the present study, the complete mitochondrial (mt) genomes of N. oiratianus and N. spathiger from small ruminants in China were obtained using Long-range PCR and sequencing. The complete mt genomes of N. oiratianus and N. spathiger were 13,765 bp and 13,519 bp in length, respectively. Both mt genomes were circular and consisted of 36 genes, including 12 genes encoding proteins, 2 genes encoding rRNA, and 22 genes encoding tRNA. Phylogenetic analyses based on the concatenated amino acid sequence data of all 12 protein-coding genes by Bayesian inference (BI), Maximum likelihood (ML) and Maximum parsimony (MP) showed that the two Nematodirus species (Molineidae) were closely related to Dictyocaulidae. The availability of the complete mtDNA sequences of N. oiratianus and N. spathiger not only provides new mtDNA sources for a better understanding of nematode mt genomics and phylogeny, but also provides novel and useful genetic markers for studying diagnosis, population genetics and molecular epidemiology of Nematodirus spp. in small ruminants.

  5. The complete mitochondrial genome sequence of Oceanic whitetip shark, Carcharhinus longimanus (Carcharhiniformes: Carcharhinidae).

    Science.gov (United States)

    Li, Weiwen; Dai, Xiaojie; Xu, Qianghua; Wu, Feng; Gao, Chunxia; Zhang, Yanbo

    2016-05-01

    The complete mitochondrial DNA sequence of Carcharhinus longimanus was determined and analyzed. The complete mtDNA genome sequence of C. longimanus was 16,706 bp in length. It contained 22 tRNA genes, 2 rRNA genes, 13 protein-coding genes and 2 non-coding regions: the control region (D-loop) and the origin of light-strand replication (OL). The complete mitogenome sequence information of C. longimanus can provide useful data for further studies on molecular systematics, stock evaluation, taxonomic status and conservation genetics.

  6. The complete mitochondrial genome sequence of the Tibetan red fox (Vulpes vulpes montana).

    Science.gov (United States)

    Zhang, Jin; Zhang, Honghai; Zhao, Chao; Chen, Lei; Sha, Weilai; Liu, Guangshuai

    2015-01-01

    In this study, the complete mitochondrial genome of the Tibetan red fox (Vulpes vulpes montana) was sequenced for the first time using blood samples obtained from a wild female red fox captured from Lhasa in Tibet, China. The Qinghai-Tibet Plateau is the highest plateau in the world, with an average elevation above 3500 m. Sequence analysis showed that the mitogenome contains the 12S rRNA gene, the 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region (CR). The variable tandem repeats in the CR are the main reason for the length variability of the mitochondrial genome among canid animals.

  7. The complete mitochondrial genome of a spiraling whitefly, Aleurodicus dispersus Russell (Hemiptera: Aleyrodidae).

    Science.gov (United States)

    Ming-Xing, Lu; Zhi-Teng, Chen; Wei-Wei, Yu; Yu-Zhou, Du

    2017-03-01

    We report the complete mitochondrial genome (mitogenome) of a spiraling whitefly, Aleurodicus dispersus (Hemiptera: Aleyrodidae). The 16,170 bp long genome consists of 13 protein-coding genes, 20 transfer RNAs, 2 ribosomal RNAs, and a control region. The A. dispersus mitogenome also includes a cytb-like non-coding region and shows several variations relative to the typical insect mitogenome. A phylogenetic tree has been constructed using the 13 protein-coding genes of 12 related species from Hemiptera. Our results would contribute to further study of phylogeny in Aleyrodidae and Hemiptera.

  8. Complete genome sequence of Defluviimonas alba cai42T, a microbial exopolysaccharides producer.

    Science.gov (United States)

    Zhao, Jie-Yu; Geng, Shuang; Xu, Lian; Hu, Bing; Sun, Ji-Quan; Nie, Yong; Tang, Yue-Qin; Wu, Xiao-Lei

    2016-12-10

    Defluviimonas alba cai42T, isolated from the oil-production water in Xinjiang Oilfield in China, has a strong ability to produce exopolysaccharides (EPS). We hereby present its complete genome sequence information which consists of a circular chromosome and three plasmids. The strain characteristically contains various genes encoding for enzymes involved in EPS biosynthesis, modification, and export. According to the genomic and physiochemical data, it is predicted that the strain has the potential to be utilized in industrial production of microbial EPS. Copyright © 2016 Elsevier B.V. All rights reserved.

  9. The complete mitochondrial genome of Porites harrisoni (Cnidaria: Scleractinia) obtained using next-generation sequencing

    KAUST Repository

    Terraneo, Tullia Isotta

    2018-02-24

    In this study, we sequenced the complete mitochondrial genome of Porites harrisoni using ezRAD and Illumina technology. Genome length consisted of 18,630 bp, with a base composition of 25.92% A, 13.28% T, 23.06% G, and 37.73% C. Consistent with other hard corals, P. harrisoni mitogenome was arranged in 13 protein-coding genes, 2 rRNA, and 2 tRNA genes. nad5 and cox1 contained embedded Group I Introns of 11,133 bp and 965 bp, respectively.

  10. Complete genome sequence of the aerobically denitrifying thermophilic bacterium Chelatococcus daeguensis TAD1

    Directory of Open Access Journals (Sweden)

    Yunlong Yang

    Chelatococcus daeguensis TAD1 is a thermophilic bacterium isolated from a biotrickling filter used to treat NOx in Ruiming Power Plant, located in Guangzhou, China, which shows an excellent aerobic denitrification activity at high temperature. The complete genome sequence of this strain was reported in the present study. Genes related to the aerobic denitrification were identified through whole genome analysis. This work will facilitate understanding of the mechanism of aerobic denitrification and provide evidence for its potential application in nitrogen removal.

  11. The complete mitochondrial genome of Porites harrisoni (Cnidaria: Scleractinia) obtained using next-generation sequencing

    KAUST Repository

    Terraneo, Tullia Isotta; Arrigoni, Roberto; Benzoni, Francesca; Forsman, Zac H.; Berumen, Michael L.

    2018-01-01

    In this study, we sequenced the complete mitochondrial genome of Porites harrisoni using ezRAD and Illumina technology. Genome length consisted of 18,630 bp, with a base composition of 25.92% A, 13.28% T, 23.06% G, and 37.73% C. Consistent with other hard corals, P. harrisoni mitogenome was arranged in 13 protein-coding genes, 2 rRNA, and 2 tRNA genes. nad5 and cox1 contained embedded Group I Introns of 11,133 bp and 965 bp, respectively.

  12. Complete mitochondrial genome sequence of the Barbour's seahorse Hippocampus barbouri Jordan & Richardson, 1908 (Gasterosteiformes: Syngnathidae).

    Science.gov (United States)

    Wang, Bo; Zhang, Yanhong; Zhang, Huixian; Lin, Qiang

    2015-01-01

    The complete mitochondrial genome sequence of the Barbour's seahorse Hippocampus barbouri was first determined in this paper. The total length of H. barbouri mitogenome is 16,526 bp, which consists of 13 protein-coding genes, 22 tRNA and 2 rRNA genes and 1 control region. The features of the H. barbouri mitochondrial genome were similar to the typical vertebrates. The overall base composition of H. barbouri is 32.68% A, 29.75% T, 22.91% C and 14.66% G, with an AT content of 62.43%.
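
    The AT content quoted in this abstract is simply the sum of the A and T percentages. As a minimal sketch (the function and variable names below are illustrative and do not come from the paper), the reported base composition can be checked like this:

```python
def at_content(composition):
    """Return the A+T percentage from a dict of per-base percentages."""
    return composition["A"] + composition["T"]

# Base composition of the H. barbouri mitogenome as reported in the abstract.
h_barbouri = {"A": 32.68, "T": 29.75, "C": 22.91, "G": 14.66}

# The four percentages should account for the whole sequence.
total = sum(h_barbouri.values())

print(f"AT content: {at_content(h_barbouri):.2f}%")   # AT content: 62.43%
print(f"Sum of all bases: {total:.2f}%")              # Sum of all bases: 100.00%
```

    The same check applied to the H. erectus record (31.8% A, 28.6% T) gives the 60.4% A + T figure quoted there.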

  13. Complete mitochondrial genome sequence of the lined seahorse Hippocampus erectus Perry, 1810 (Gasterosteiformes: Syngnathidae).

    Science.gov (United States)

    Zhang, Yanhong; Zhang, Huixian; Lin, Qiang; Huang, Liangmin

    2015-01-01

    The complete mitochondrial genome sequence of the lined seahorse Hippocampus erectus was first determined in this article. The total length of H. erectus mitogenome is 16,529 bp, which consists of 13 protein-coding genes, 22 tRNA and 2 rRNA genes and 1 control region. The features of the H. erectus mitochondrial genome were similar to the typical vertebrates. The overall base composition of H. erectus is 31.8% A, 28.6% T, 24.3% C and 15.3% G, with a slight A + T rich feature (60.4%).

  14. Complete plastid genome sequence of Daucus carota: implications for biotechnology and phylogeny of angiosperms.

    Science.gov (United States)

    Ruhlman, Tracey; Lee, Seung-Bum; Jansen, Robert K; Hostetler, Jessica B; Tallon, Luke J; Town, Christopher D; Daniell, Henry

    2006-08-31

    Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombinant proteins in carrot have had limited success in terms of protein accumulation in the edible tap roots. Plastid genetic engineering offers the potential to overcome this limitation, as demonstrated by the accumulation of BADH in chromoplasts of carrot taproots to confer exceedingly high levels of salt resistance. The complete plastid genome of carrot provides essential information required for genetic engineering. Additionally, the sequence data add to the rapidly growing database of plastid genomes for assessing phylogenetic relationships among angiosperms. The complete carrot plastid genome is 155,911 bp in length, with 115 unique genes and 21 duplicated genes within the IR. There are four ribosomal RNAs, 30 distinct tRNA genes and 18 intron-containing genes. Repeat analysis reveals 12 direct and 2 inverted repeats ≥ 30 bp with a sequence identity ≥ 90%. Phylogenetic analyses of nucleotide sequences for 61 protein-coding genes using both maximum parsimony (MP) and maximum likelihood (ML) were performed for 29 angiosperms. Phylogenies from both methods provide strong support for the monophyly of several major angiosperm clades, including monocots, eudicots, rosids, asterids, eurosids II, euasterids I, and euasterids II. The carrot plastid genome contains a number of dispersed direct and inverted repeats scattered throughout coding and non-coding regions. This is the first sequenced plastid genome of the family Apiaceae and only the second published genome sequence of the species-rich euasterid II clade. Both MP and ML trees provide very strong support (100% bootstrap) for the sister relationship of Daucus with Panax in the euasterid II clade.
These

  15. Complete plastid genome sequence of Daucus carota: Implications for biotechnology and phylogeny of angiosperms

    Directory of Open Access Journals (Sweden)

    Ruhlman Tracey

    2006-08-01

    Background: Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombinant proteins in carrot have had limited success in terms of protein accumulation in the edible tap roots. Plastid genetic engineering offers the potential to overcome this limitation, as demonstrated by the accumulation of BADH in chromoplasts of carrot taproots to confer exceedingly high levels of salt resistance. The complete plastid genome of carrot provides essential information required for genetic engineering. Additionally, the sequence data add to the rapidly growing database of plastid genomes for assessing phylogenetic relationships among angiosperms. Results: The complete carrot plastid genome is 155,911 bp in length, with 115 unique genes and 21 duplicated genes within the IR. There are four ribosomal RNAs, 30 distinct tRNA genes and 18 intron-containing genes. Repeat analysis reveals 12 direct and 2 inverted repeats ≥ 30 bp with a sequence identity ≥ 90%. Phylogenetic analyses of nucleotide sequences for 61 protein-coding genes using both maximum parsimony (MP) and maximum likelihood (ML) were performed for 29 angiosperms. Phylogenies from both methods provide strong support for the monophyly of several major angiosperm clades, including monocots, eudicots, rosids, asterids, eurosids II, euasterids I, and euasterids II. Conclusion: The carrot plastid genome contains a number of dispersed direct and inverted repeats scattered throughout coding and non-coding regions. This is the first sequenced plastid genome of the family Apiaceae and only the second published genome sequence of the species-rich euasterid II clade. Both MP and ML trees provide very strong support (100% bootstrap) for the sister relationship of

  16. Complete mitochondrial genome of the holotype specimen of Wildemania schizophylla (Bangiales: Rhodophyta).

    Science.gov (United States)

    Silva, Mayra Y; Hughey, Jeffery R

    2016-01-01

    Ion Proton data was used to assemble the complete mitochondrial genome from the holotype specimen of Wildemania schizophylla (29,156 bp). The mitogenome contains 50 genes, including 2 ribosomal RNA, 23 transfer RNA, 4 ribosomal proteins, 2 ymfs, 3 open reading frames (ORFs), and 19 genes involved in cellular respiration. Although gene synteny is conserved, the mitogenome of W. schizophylla is significantly smaller due to the lack of large intronic ORFs present in the cytochrome oxidase locus of other Bangiales. The results support the recognition of Wildemania as distinct from Porphyra, and demonstrate that small amounts of type material are suitable for genomic studies.

  17. Complete chloroplast genome of the multifunctional crop globe artichoke and comparison with other Asteraceae.

    Science.gov (United States)

    Curci, Pasquale L; De Paola, Domenico; Danzi, Donatella; Vendramin, Giovanni G; Sonnante, Gabriella

    2015-01-01

    With over 20,000 species, Asteraceae is the second largest plant family. High-throughput sequencing of nuclear and chloroplast genomes has allowed for a better understanding of the evolutionary relationships within large plant families. Here, the globe artichoke chloroplast (cp) genome was obtained by a combination of whole-genome and BAC clone high-throughput sequencing. The artichoke cp genome is 152,529 bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 25,155 bp, representing the longest IRs found in the Asteraceae family so far. The large (LSC) and the small (SSC) single-copy regions span 83,578 bp and 18,641 bp, respectively. The artichoke cp sequence was compared to the other eight Asteraceae complete cp genomes available, revealing an IR expansion at the SSC/IR boundary. This expansion consists of 17 bp of the ndhF gene generating an overlap between the ndhF and ycf1 genes. A total of 127 cp simple sequence repeats (cpSSRs) were identified in the artichoke cp genome, potentially suitable for future population studies in the Cynara genus. Parsimony-informative regions were evaluated and allowed to place a Cynara species within the Asteraceae family tree. The eight most informative coding regions were also considered and tested for "specific barcode" purpose in the Asteraceae family. Our results highlight the usefulness of cp genome sequencing in exploring plant genome diversity and retrieving reliable molecular resources for phylogenetic and evolutionary studies, as well as for specific barcodes in plants.
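
    The region lengths in this abstract follow the usual quadripartite layout of a chloroplast genome, in which the total length equals LSC + SSC + 2 × IR. A minimal sketch of that consistency check (the helper name and dictionary keys are illustrative, not taken from the paper):

```python
def quadripartite_length(lsc, ssc, ir):
    """Total plastome length for one LSC, one SSC and two copies of the IR."""
    return lsc + ssc + 2 * ir

# Region lengths (bp) for the globe artichoke cp genome, as reported in the abstract.
artichoke = {"lsc": 83_578, "ssc": 18_641, "ir": 25_155}
reported_total = 152_529

computed = quadripartite_length(**artichoke)
print(computed, computed == reported_total)  # 152529 True
```

    A mismatch in such a check would usually point to a transcription error in one of the region lengths, or to an IR figure that gives the combined length of both copies rather than the length of one.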

  18. The Complete Chloroplast Genome of Ye-Xing-Ba (Scrophularia dentata; Scrophulariaceae), an Alpine Tibetan Herb.

    Science.gov (United States)

    Ni, Lianghong; Zhao, Zhili; Dorje, Gaawe; Ma, Mi

    2016-01-01

    Scrophularia dentata is an important Tibetan medicinal plant and traditionally used for the treatment of exanthema and fever in Traditional Tibetan Medicine (TTM). However, there is little sequence and genomic information available for S. dentata. In this paper, we report the complete chloroplast genome sequence of S. dentata and it is the first sequenced member of the Sect. Tomiophyllum within Scrophularia (Scrophulariaceae). The gene order and organization of the chloroplast genome of S. dentata are similar to other Lamiales chloroplast genomes. The plastome is 152,553 bp in length and includes a pair of inverted repeats (IRs) of 25,523 bp that separate a large single copy (LSC) region of 84,058 bp and a small single copy (SSC) region of 17,449 bp. It has 38.0% GC content and includes 114 unique genes, of which 80 are protein-coding, 30 are transfer RNA, and 4 are ribosomal RNA. Also, it contains 21 forward repeats, 19 palindrome repeats and 41 simple sequence repeats (SSRs). The repeats and SSRs within S. dentata were compared with those of S. takesimensis and present certain discrepancies. The chloroplast genome of S. dentata was compared with five other publicly available Lamiales genomes from different families. All the coding regions and non-coding regions (introns and intergenic spacers) within the six chloroplast genomes have been extracted and analysed. Furthermore, the genome divergent hotspot regions were identified. Our studies could provide basic data for the alpine medicinal species conservation and molecular phylogenetic research of Scrophulariaceae and Lamiales.

  19. The Complete Chloroplast Genome of Ye-Xing-Ba (Scrophularia dentata; Scrophulariaceae), an Alpine Tibetan Herb.

    Directory of Open Access Journals (Sweden)

    Lianghong Ni

    Scrophularia dentata is an important Tibetan medicinal plant and traditionally used for the treatment of exanthema and fever in Traditional Tibetan Medicine (TTM). However, there is little sequence and genomic information available for S. dentata. In this paper, we report the complete chloroplast genome sequence of S. dentata and it is the first sequenced member of the Sect. Tomiophyllum within Scrophularia (Scrophulariaceae). The gene order and organization of the chloroplast genome of S. dentata are similar to other Lamiales chloroplast genomes. The plastome is 152,553 bp in length and includes a pair of inverted repeats (IRs) of 25,523 bp that separate a large single copy (LSC) region of 84,058 bp and a small single copy (SSC) region of 17,449 bp. It has 38.0% GC content and includes 114 unique genes, of which 80 are protein-coding, 30 are transfer RNA, and 4 are ribosomal RNA. Also, it contains 21 forward repeats, 19 palindrome repeats and 41 simple sequence repeats (SSRs). The repeats and SSRs within S. dentata were compared with those of S. takesimensis and present certain discrepancies. The chloroplast genome of S. dentata was compared with five other publicly available Lamiales genomes from different families. All the coding regions and non-coding regions (introns and intergenic spacers) within the six chloroplast genomes have been extracted and analysed. Furthermore, the genome divergent hotspot regions were identified. Our studies could provide basic data for the alpine medicinal species conservation and molecular phylogenetic research of Scrophulariaceae and Lamiales.

  20. The complete mitochondrial genome of Pseudocellus pearsei (Chelicerata: Ricinulei) and a comparison of mitochondrial gene rearrangements in Arachnida

    Directory of Open Access Journals (Sweden)

    Braband Anke

    2007-10-01

    Background: Mitochondrial genomes are widely utilized for phylogenetic and population genetic analyses among animals. In addition to sequence data, the mitochondrial gene order and RNA secondary structure data are used in phylogenetic analyses. Arachnid phylogeny is still highly debated and there is a lack of sufficient sequence data for many taxa. Ricinulei (hooded tickspiders) are a morphologically distinct clade of arachnids with uncertain phylogenetic affinities. Results: The first complete mitochondrial DNA genome of a member of the Ricinulei, Pseudocellus pearsei (Arachnida: Ricinulei), was sequenced using a PCR-based approach. The mitochondrial genome is a typical circular duplex DNA molecule with a size of 15,099 bp, showing the complete set of genes usually present in bilaterian mitochondrial genomes. Five tRNA genes (trnW, trnY, trnN, trnL(CUN), trnV) show different relative positions compared to other Chelicerata (e.g. Limulus polyphemus, Ixodes spp.). We propose that two events led to this derived gene order: (1) a tandem duplication followed by random deletion and (2) an independent translocation of trnN. Most of the inferred tRNA secondary structures show the common cloverleaf pattern, except tRNA-Glu, where the TψC-arm is missing. In phylogenetic analyses (maximum likelihood, maximum parsimony, Bayesian inference) using concatenated amino acid and nucleotide sequences of protein-coding genes, the basal relationships of arachnid orders remain unresolved. Conclusion: Phylogenetic analyses (ML, MP, BI) of arachnid mitochondrial genomes fail to resolve interordinal relationships of Arachnida and remain in a preliminary stage because there is still a lack of mitogenomic data from important taxa such as Opiliones and Pseudoscorpiones. Gene order varies considerably within Arachnida – only eight out of 23 species have retained the putative arthropod ground pattern. Some gene order changes are valuable characters in phylogenetic analysis of

  1. Dual Enrollment, Structural Reform, and the Completion Agenda

    Science.gov (United States)

    Karp, Melinda Mechur

    2015-01-01

    This chapter contextualizes and extends the previous chapters by addressing the intertwined issues of structural systems reform and college completion, as well as the role dual enrollment can play in ensuring equitable postsecondary outcomes for underrepresented students.

  2. Characterization of the complete mitochondrial genome of the giant silkworm moth, Eriogyna pyretorum (Lepidoptera: Saturniidae).

    Science.gov (United States)

    Jiang, Shao-Tong; Hong, Gui-Yun; Yu, Miao; Li, Na; Yang, Ying; Liu, Yan-Qun; Wei, Zhao-Jun

    2009-05-22

    The complete mitochondrial genome (mitogenome) of Eriogyna pyretorum (Lepidoptera: Saturniidae) was determined to be 15,327 base pairs (bp) long, including 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The arrangement of the PCGs is the same as that found in other sequenced lepidopterans. The AT skewness for the E. pyretorum mitogenome is slightly negative (-0.031), indicating the occurrence of more Ts than As. The nucleotide composition of the E. pyretorum mitogenome is also biased toward A + T nucleotides (80.82%). All PCGs are initiated by ATN codons, except for cytochrome c oxidase subunit 1 and 2 (cox1 and cox2). Two of the 13 PCGs harbor the incomplete termination codon T. All tRNA genes have the typical clover-leaf structure of mitochondrial tRNA, with the exception of trnS1(AGN) and trnS2(UCN). Phylogenetic analysis among the available lepidopteran species supports the current morphology-based hypothesis that Bombycoidea, Geometroidea, Notodontidea, Papilionoidea and Pyraloidea are monophyletic. As has been previously suggested, Bombycidae (Bombyx mori and Bombyx mandarina), Sphingoidae (Manduca sexta) and Saturniidae (Antheraea pernyi, Antheraea yamamai, E. pyretorum and Caligula boisduvalii) formed a group.

  3. The complete mitochondrial genome of Sesarmops sinensis reveals gene rearrangements and phylogenetic relationships in Brachyura.

    Science.gov (United States)

    Tang, Bo-Ping; Xin, Zhao-Zhe; Liu, Yu; Zhang, Dai-Zhen; Wang, Zheng-Fei; Zhang, Hua-Bin; Chai, Xin-Yue; Zhou, Chun-Lin; Liu, Qiu-Ning

    2017-01-01

    The mitochondrial genome (mitogenome) is very important for understanding molecular evolution and phylogenetics. In this study, the complete mitogenome of Sesarmops sinensis is reported. The mitogenome was 15,905 bp in size, and contained 13 protein-coding genes (PCGs), two ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes, and a control region (CR). The AT skew and the GC skew are both negative in the mitogenome of S. sinensis. The nucleotide composition of the S. sinensis mitogenome was also biased toward A + T nucleotides (75.7%). All tRNA genes displayed a typical mitochondrial tRNA cloverleaf structure, except for the trnS1 gene, which lacked a dihydrouridine arm. S. sinensis exhibits a novel rearrangement compared with the Pancrustacean ground pattern and other Brachyura species. Based on the 13 PCGs, the phylogenetic analysis showed that S. sinensis and Sesarma neglectum were clustered on one branch with high nodal support values, indicating that S. sinensis and S. neglectum have a sister group relationship. The group (S. sinensis + S. neglectum) was sister to (Parasesarmops tripectinis + Metopaulias depressus), suggesting that S. sinensis belongs to Grapsoidea, Sesarmidae. Phylogenetic trees based on amino acid sequences and nucleotide sequences of the 13 mitochondrial PCGs, using BI and ML respectively, clearly indicate that section Eubrachyura consists of four groups. These four groups correspond to the four subsections Raninoida, Heterotremata, Potamoida, and Thoracotremata. The resulting phylogeny supports the establishment of a separate subsection Potamoida.

  4. Complete mitochondrial genome of the hardnose shark Carcharhinus macloti (Carcharhiniformes: Carcharhinidae).

    Science.gov (United States)

    Chen, Xiao; Liu, Min; Xiao, Jiamei; Yang, Weidi; Peng, Zaiqing

    2016-01-01

    The complete mitochondrial genome of Carcharhinus macloti was determined in this study. It is 16,701 bp in length and contains 37 genes with the typical gene order and transcriptional orientation in vertebrates. A total of 29 bp of overlaps and 29 bp of short intergenic spaces are located in 22 gene junctions. The overall base composition is 31.6% A, 26.2% C, 13.0% G and 29.2% T. Two start codons (ATG and GTG) and three stop codons (AGG, TAG and TAA/T) were found in the 13 protein-coding genes. The length of the 22 tRNA genes ranged from 66 bp (tRNA-Ser2) to 75 bp (tRNA-Leu1). In tRNA-Ser2 (GCU), the dihydrouridine arm is replaced by a simple loop, so it cannot be folded into the typical cloverleaf structure. The control region is 1066 bp in length with high A+T content (68.2%).
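
    The base composition above is enough to derive the A+T content and the strand skews that several records in this listing report. The following is a minimal sketch, assuming the commonly used definitions AT skew = (A - T)/(A + T) and GC skew = (G - C)/(G + C); the function name `composition_stats` is hypothetical, and the resulting skew values are computed here for illustration rather than taken from the abstract.

```python
def composition_stats(a: float, c: float, g: float, t: float):
    """Return (A+T content, AT skew, GC skew) from base percentages.

    Assumes the common definitions:
      AT skew = (A - T) / (A + T)
      GC skew = (G - C) / (G + C)
    """
    at_content = a + t
    at_skew = (a - t) / (a + t)
    gc_skew = (g - c) / (g + c)
    return at_content, at_skew, gc_skew


# Base composition (%) reported for the C. macloti mitogenome.
at_content, at_skew, gc_skew = composition_stats(a=31.6, c=26.2, g=13.0, t=29.2)
print(round(at_content, 1))  # 60.8
print(round(at_skew, 3))     # 0.039
print(round(gc_skew, 3))     # -0.337
```

    Under these definitions a positive AT skew means more As than Ts on the reported strand, and a negative GC skew means more Cs than Gs.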

  5. Complete mitochondrial genome of Concholepas concholepas inferred by 454 pyrosequencing and mtDNA expression in two mollusc populations.

    Science.gov (United States)

    Núñez-Acuña, Gustavo; Aguilar-Espinoza, Andrea; Gallardo-Escárate, Cristian

    2013-03-01

    Despite the great relevance of mitochondrial genome analysis in evolutionary studies, there is scarce information on how the transcripts associated with the mitogenome are expressed and their role in the genetic structuring of populations. This work reports the complete mitochondrial genome of the marine gastropod Concholepas concholepas, obtained by 454 pyrosequencing, and an analysis of mitochondrial transcripts of two populations 1000 km apart along the Chilean coast. The mitochondrial genome of C. concholepas is 15,495 base pairs (bp) in size and contains the 37 subunits characteristic of metazoans, as well as a non-coding region of 330 bp. In silico analysis of mitochondrial gene variability showed significant differences among populations. In terms of the relative abundance of mitochondrial transcripts in the two populations (assessed by qPCR), the genes associated with complexes III and IV of the mitochondrial genome had the highest levels of expression in the northern population, while transcripts associated with the ATP synthase complex had the highest levels of expression in the southern population. Moreover, fifteen polymorphic SNPs were identified in silico between the mitogenomes of the two populations. Four of these markers implied different amino acid substitutions (non-synonymous SNPs). This work contributes novel information regarding the mitochondrial genome structure and mRNA expression levels of C. concholepas.

  6. Defining the diverse spectrum of inversions, complex structural variation, and chromothripsis in the morbid human genome

    NARCIS (Netherlands)

    Collins, Ryan L; Brand, Harrison; Redin, Claire E.; Hanscom, Carrie; Antolik, Caroline; Stone, Matthew R; Glessner, Joseph T.; Mason, Tamara; Pregno, Giulia; Dorrani, Naghmeh; Mandrile, Giorgia; Giachino, Daniela; Perrin, Danielle; Walsh, Cole; Cipicchio, Michelle; Costello, Maura; Stortchevoi, Alexei; An, Joon Yong; Currall, Benjamin B; Seabra, Catarina M; Ragavendran, Ashok; Margolin, Lauren; Martinez-Agosto, Julian A.; Lucente, Diane; Levy, Brynn; Sanders, Jan-Stephan; Wapner, Ronald J.; Quintero-Rivera, Fabiola; Kloosterman, Wigard; Talkowski, Michael E.

    2017-01-01

    Background: Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association studies. Results: We sequenced 689 participants with autism spectrum disorder (ASD) and other

  7. The Complete Mitochondrial Genome Sequence of Bactericera cockerelli and Comparison with Three Other Psylloidea Species.

    Directory of Open Access Journals (Sweden)

    Fengnian Wu

    Potato psyllid (Bactericera cockerelli) is an important pest of potato, tomato and pepper. Not only can a toxin secreted by nymphs result in serious phytotoxemia in some host plants, but over the past few years B. cockerelli has also been shown to transmit "Candidatus Liberibacter solanacearum", the putative bacterial pathogen of potato zebra chip (ZC) disease, to potato and tomato. ZC has caused devastating losses to potato production in the western U.S., Mexico, and elsewhere. New knowledge of the genetic diversity of B. cockerelli is needed to develop improved strategies to manage pest populations. Mitochondrial genome (mitogenome) sequencing provides important knowledge about insect evolution and diversity in and among populations. This report provides the first complete B. cockerelli mitogenome sequence as determined by next generation sequencing technology (Illumina MiSeq). The circular B. cockerelli mitogenome had a size of 15,220 bp with 13 protein-coding genes (PCGs), 2 ribosomal RNA genes (rRNAs), 22 transfer RNA genes (tRNAs), and a non-coding region of 975 bp. The overall gene order of the B. cockerelli mitogenome is identical to that of three other published Psylloidea mitogenomes: one species from the Triozidae, Paratrioza sinica; and two species from the Psyllidae, Cacopsylla coccinea and Pachypsylla venusta. This suggests all of these species share a common ancestral mitogenome. However, sequence analyses revealed differences between and among the insect families, in particular a unique region that can be folded into three stem-loop secondary structures present only within the B. cockerelli mitogenome. A phylogenetic tree based on the 13 PCGs matched an existing taxonomy scheme that was based on morphological characteristics. The available complete mitogenome sequence makes all genes accessible for future population diversity evaluation of B. cockerelli.

  8. Complete chloroplast genome sequence of MD-2 pineapple and its comparative analysis among nine other plants from the subclass Commelinidae.

    Science.gov (United States)

    Redwan, R M; Saidin, A; Kumar, S V

    2015-08-12

    Pineapple (Ananas comosus var. comosus) is known as the king of fruits for its crown and is the third most important tropical fruit after banana and citrus. The plant, which is indigenous to South America, is the most important species in the Bromeliaceae family and is largely traded for fresh fruit consumption. Here, we report the complete chloroplast sequence of the MD-2 pineapple, sequenced using PacBio sequencing technology. In this study, the high error rate of the long PacBio reads of A. comosus total genomic DNA was reduced by using highly accurate but short Illumina reads for error correction via the latest error correction module from Novocraft. Error-corrected long PacBio reads were assembled using a single tool to produce a contig representing the pineapple chloroplast genome. The genome, 159,636 bp in length, features the conserved quadripartite structure of chloroplasts, containing a large single copy region (LSC) of 87,482 bp, a small single copy region (SSC) of 18,622 bp and two inverted repeat regions (IRA and IRB) of 26,766 bp each. Overall, the genome contained 117 unique coding regions and 30 were repeated in the IR region, with gene content, structure and arrangement similar to those of its sister taxon, Typha latifolia. A total of 35 repeat structures were detected in both the coding and non-coding regions, the majority being tandem repeats. In addition, 205 SSRs were detected in the genome, with six protein-coding genes containing more than two SSRs. Comparison of chloroplast genomes from the subclass Commelinidae revealed a conserved protein-coding gene, albeit located in a highly divergent region. Analysis of selection pressure on protein-coding genes using the Ka/Ks ratio showed significant positive selection on the rps7 gene of the pineapple chloroplast (P less than 0.05). Phylogenetic analysis confirmed the recent taxonomical relation among the member of

  9. The complete chloroplast genome sequence of the medicinal plant Andrographis paniculata.

    Science.gov (United States)

    Ding, Ping; Shao, Yanhua; Li, Qian; Gao, Junli; Zhang, Runjing; Lai, Xiaoping; Wang, Deqin; Zhang, Huiye

    2016-07-01

    The complete chloroplast genome of Andrographis paniculata, an important medicinal plant with great economic value, is reported in this article. The genome is 150,249 bp in length, with 38.3% GC content. A pair of inverted repeats (IRs, 25,300 bp) is separated by a large single copy region (LSC, 82,459 bp) and a small single copy region (SSC, 17,190 bp). The chloroplast genome contains 114 unique genes, including 80 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Of these genes, 15 contain one intron and 3 contain two introns.
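
    The quadripartite layout described above implies a simple consistency check: the total plastome length should equal the LSC plus the SSC plus two copies of the IR. A minimal sketch of that arithmetic, using the region sizes from this abstract (the function name `plastome_length` is a hypothetical helper, not something from the paper):

```python
def plastome_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region sizes (bp) reported for Andrographis paniculata.
total = plastome_length(lsc=82_459, ssc=17_190, ir=25_300)
print(total)             # 150249
print(total == 150_249)  # True: matches the reported genome size
```

    The same check can be applied to the other chloroplast genomes in this listing that report all four region sizes.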

  10. The complete mitochondrial genome of the stonefly Dinocras cephalotes (Plecoptera, Perlidae).

    Science.gov (United States)

    Elbrecht, Vasco; Poettker, Lisa; John, Uwe; Leese, Florian

    2015-06-01

    The complete mitochondrial genome of the perlid stonefly Dinocras cephalotes (Curtis, 1827) was sequenced using a combined 454 and Sanger sequencing approach using the known sequence of Pteronarcys princeps Banks, 1907 (Pteronarcyidae), to identify homologous 454 reads. The genome is 15,666 bp in length and includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and a control region. Gene order resembles that of basal arthropods. The base composition of the genome is A (33.5%), T (29.0%), C (24.4%) and G (13.1%). This is the second published mitogenome for the order Plecoptera and will be useful in future phylogenetic analysis.

  11. Complete genome sequence of Tsukamurella paurometabola type strain (no. 33T)

    Energy Technology Data Exchange (ETDEWEB)

    Munk, Christine [U.S. Department of Energy, Joint Genome Institute]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute]; Tice, Hope [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute]; Huntemann, Marcel [U.S. Department of Energy, Joint Genome Institute]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute]; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [U.S. Department of Energy, Joint Genome Institute]; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Chang, Yun-Juan [ORNL]; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL)]; Brettin, Thomas S [ORNL]; Yasawong, Montri [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Brambilla, Evelyne-Marie [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute]; Bristow, James [U.S. Department of Energy, Joint Genome Institute]; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute]; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]

    2011-01-01

    Tsukamurella paurometabola corrig. (Steinhaus 1941) Collins et al. 1988 is the type species of the genus Tsukamurella, which is the type genus of the family Tsukamurellaceae. The species is not only of interest because of its isolated phylogenetic location, but also because it is a human opportunistic pathogen with some strains of the species reported to cause lung infection, lethal meningitis, and necrotizing tenosynovitis. This is the first completed genome sequence of a member of the genus Tsukamurella and the first genome sequence of a member of the family Tsukamurellaceae. The 4,479,724 bp long genome contains a 99,806 bp long plasmid and a total of 4,335 protein-coding and 56 RNA genes, and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  12. The complete mitochondrial genome of the medicinal fungus Ganoderma applanatum (Polyporales, Basidiomycota).

    Science.gov (United States)

    Wang, Xin-Cun; Shao, Junjie; Liu, Chang

    2016-07-01

    We have determined the complete nucleotide sequence of the mitochondrial genome of the medicinal fungus Ganoderma applanatum (Pers.) Pat. using next-generation sequencing technology. The circular molecule is 119,803 bp long with a GC content of 26.66%. Gene prediction revealed genes encoding 15 conserved proteins, 25 tRNAs, and the large and small ribosomal RNAs; all genes are located on the same strand except trnW-CCA. Compared with previously sequenced genomes of G. lucidum, G. meredithiae and G. sinense, the order of the protein and rRNA genes is highly conserved; however, the types of tRNA genes are slightly different. The mitochondrial genome of G. applanatum will contribute to the understanding of the phylogeny and evolution of Ganoderma and Ganodermataceae, a group containing many species with high medicinal value.

  13. Complete genome sequence of the industrial bacterium Bacillus licheniformis and comparisons with closely related Bacillus species

    Science.gov (United States)

    Rey, Michael W; Ramaiya, Preethi; Nelson, Beth A; Brody-Karpin, Shari D; Zaretsky, Elizabeth J; Tang, Maria; de Leon, Alfredo Lopez; Xiang, Henry; Gusti, Veronica; Clausen, Ib Groth; Olsen, Peter B; Rasmussen, Michael D; Andersen, Jens T; Jørgensen, Per L; Larsen, Thomas S; Sorokin, Alexei; Bolotin, Alexander; Lapidus, Alla; Galleron, Nathalie; Ehrlich, S Dusko; Berka, Randy M

    2004-01-01

    Background: Bacillus licheniformis is a Gram-positive, spore-forming soil bacterium that is used in the biotechnology industry to manufacture enzymes, antibiotics, biochemicals and consumer products. This species is closely related to the well studied model organism Bacillus subtilis, and produces an assortment of extracellular enzymes that may contribute to nutrient cycling in nature. Results: We determined the complete nucleotide sequence of the B. licheniformis ATCC 14580 genome, which comprises a circular chromosome of 4,222,336 base-pairs (bp) containing 4,208 predicted protein-coding genes with an average size of 873 bp, seven rRNA operons, and 72 tRNA genes. The B. licheniformis chromosome contains large regions that are colinear with the genomes of B. subtilis and Bacillus halodurans, and approximately 80% of the predicted B. licheniformis coding sequences have B. subtilis orthologs. Conclusions: Despite the unmistakable organizational similarities between the B. licheniformis and B. subtilis genomes, there are notable differences in the numbers and locations of prophages, transposable elements and a number of extracellular enzymes and secondary metabolic pathway operons that distinguish these species. Differences include a region of more than 80 kilobases (kb) that comprises a cluster of polyketide synthase genes and a second operon of 38 kb encoding plipastatin synthase enzymes that are absent in the B. licheniformis genome. The availability of a completed genome sequence for B. licheniformis should facilitate the design and construction of improved industrial strains and allow for comparative genomics and evolutionary studies within this group of Bacillaceae. PMID:15461803
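
    The gene count, average gene size, and chromosome length given above allow a rough estimate of how much of the chromosome is protein-coding. The sketch below is only an approximation under stated assumptions: it multiplies the gene count by the rounded average size and ignores any overlap between genes, and the function name `coding_fraction` is hypothetical. The resulting figure is computed here for illustration and is not reported in the abstract.

```python
def coding_fraction(n_genes: int, mean_gene_bp: float, genome_bp: int) -> float:
    """Approximate fraction of a genome covered by protein-coding genes.

    Assumes genes do not overlap and that the mean gene size is exact,
    so the result is an estimate only.
    """
    return n_genes * mean_gene_bp / genome_bp


# Figures reported for the B. licheniformis ATCC 14580 chromosome.
fraction = coding_fraction(n_genes=4_208, mean_gene_bp=873, genome_bp=4_222_336)
print(f"{fraction:.1%}")  # 87.0%
```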

  14. Gene Composer in a structural genomics environment

    International Nuclear Information System (INIS)

    Lorimer, Don; Raymond, Amy; Mixon, Mark; Burgin, Alex; Staker, Bart; Stewart, Lance

    2011-01-01

    For structural biology applications, protein-construct engineering is guided by comparative sequence analysis and structural information, which allow the researcher to better define domain boundaries for terminal deletions and nonconserved regions for surface mutants. A database software application called Gene Composer has been developed to facilitate construct design. The structural genomics effort at the Seattle Structural Genomics Center for Infectious Disease (SSGCID) requires the manipulation of large numbers of amino-acid sequences and the underlying DNA sequences which are to be cloned into expression vectors. To improve efficiency in high-throughput protein structure determination, a database software package, Gene Composer, has been developed which facilitates the information-rich design of protein constructs and their underlying gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bioinformatics steps used in modern structure-guided protein engineering and synthetic gene engineering. An example of the structure determination of H1N1 RNA-dependent RNA polymerase PB2 subunit is given

  15. The complete mitochondrial genomes of the Galápagos iguanas, Amblyrhynchus cristatus and Conolophus subcristatus.

    Science.gov (United States)

    MacLeod, Amy; Irisarri, Iker; Vences, Miguel; Steinfartz, Sebastian

    2016-09-01

    The Galápagos iguanas are among the oldest vertebrate lineages on the Galápagos archipelago, and the evolutionary history of this clade is of great interest to biologists. We describe here the complete mitochondrial genomes of the marine iguana, Amblyrhynchus cristatus (GenBank accession number: KT277937) and the land iguana Conolophus subcristatus (GenBank accession number: KT277936). The genomes contain 13 protein-coding genes, 22 transfer RNA genes, and two ribosomal RNA genes, as well as a control region (CR). Both species have an identical gene order, which matches that of Iguana iguana. The CR of both Galápagos iguanas features similar tandem repeat units, which are absent in I. iguana. We present a phylogeny of the Iguanidae based on complete mitochondrial genomes, which confirms the sister-group relationship of Galápagos iguanas. These new mitochondrial genomes constitute an important data source for future exploration of the phylogenetic relationships and evolutionary history of the Galápagos iguanas.

  16. Complete sequence and comparative analysis of the chloroplast genome of Plinia trunciflora

    Directory of Open Access Journals (Sweden)

    Maria Eguiluz

    2017-11-01

    Plinia trunciflora is a Brazilian native fruit tree from the Myrtaceae family, also known as jaboticaba. This species has great potential by its fruit production. Due to the high content of essential oils in their leaves and of anthocyanins in the fruits, there is also an increasing interest by the pharmaceutical industry. Nevertheless, there are few studies focusing on its molecular biology and genetic characterization. We herein report the complete chloroplast (cp) genome of P. trunciflora using high-throughput sequencing and compare it to other previously sequenced Myrtaceae genomes. The cp genome of P. trunciflora is 159,512 bp in size, comprising inverted repeats of 26,414 bp and single-copy regions of 88,097 bp (LSC) and 18,587 bp (SSC). The genome contains 111 single-copy genes (77 protein-coding, 30 tRNA and four rRNA genes). Phylogenetic analysis using 57 cp protein-coding genes demonstrated that P. trunciflora, Eugenia uniflora and Acca sellowiana form a cluster with closer relationship to Syzygium cumini than with Eucalyptus. The complete cp sequence reported here can be used in evolutionary and population genetics studies, contributing to resolve the complex taxonomy of this species and fill the gap in genetic characterization.

  17. Complete sequence and comparative analysis of the chloroplast genome of Plinia trunciflora

    Science.gov (United States)

    Eguiluz, Maria; Yuyama, Priscila Mary; Guzman, Frank; Rodrigues, Nureyev Ferreira; Margis, Rogerio

    2017-01-01

    Plinia trunciflora is a Brazilian native fruit tree from the Myrtaceae family, also known as jaboticaba. This species has great potential by its fruit production. Due to the high content of essential oils in their leaves and of anthocyanins in the fruits, there is also an increasing interest by the pharmaceutical industry. Nevertheless, there are few studies focusing on its molecular biology and genetic characterization. We herein report the complete chloroplast (cp) genome of P. trunciflora using high-throughput sequencing and compare it to other previously sequenced Myrtaceae genomes. The cp genome of P. trunciflora is 159,512 bp in size, comprising inverted repeats of 26,414 bp and single-copy regions of 88,097 bp (LSC) and 18,587 bp (SSC). The genome contains 111 single-copy genes (77 protein-coding, 30 tRNA and four rRNA genes). Phylogenetic analysis using 57 cp protein-coding genes demonstrated that P. trunciflora, Eugenia uniflora and Acca sellowiana form a cluster with closer relationship to Syzygium cumini than with Eucalyptus. The complete cp sequence reported here can be used in evolutionary and population genetics studies, contributing to resolve the complex taxonomy of this species and fill the gap in genetic characterization. PMID:29111566

  18. The complete mitochondrial genomes of five Eimeria species infecting domestic rabbits.

    Science.gov (United States)

    Liu, Guo-Hua; Tian, Si-Qin; Cui, Ping; Fang, Su-Fang; Wang, Chun-Ren; Zhu, Xing-Quan

    2015-12-01

    Rabbit coccidiosis caused by members of the genus Eimeria can have an enormous economic impact worldwide, but the genetics, epidemiology and biology of these parasites remain poorly understood. In the present study, we sequenced and annotated the complete mitochondrial (mt) genomes of five Eimeria species that commonly infect domestic rabbits. The complete mt genomes of Eimeria intestinalis, Eimeria flavescens, Eimeria media, Eimeria vejdovskyi and Eimeria irresidua were 6261 bp, 6258 bp, 6168 bp, 6254 bp and 6259 bp in length, respectively. All of the mt genomes consist of 3 genes for proteins (cytb, cox1, and cox3), 14 gene fragments for the large subunit (LSU) rRNA and 11 gene fragments for the small subunit (SSU) rRNA, but no transfer RNA (tRNA) genes. The gene order of the mt genomes is similar to that of Plasmodium, but distinct from Haemosporida and Theileria. Phylogenetic analyses based on full nucleotide sequences using Bayesian analysis revealed that the monophyly of the Eimeria of rabbits was strongly supported by Bayesian posterior probabilities. These data provide novel mtDNA markers for studying the population genetics and molecular epidemiology of the Eimeria species, and should have implications for the molecular diagnosis, prevention and control of coccidiosis in rabbits.

  19. Complete sequence and comparative analysis of the chloroplast genome of Plinia trunciflora.

    Science.gov (United States)

    Eguiluz, Maria; Yuyama, Priscila Mary; Guzman, Frank; Rodrigues, Nureyev Ferreira; Margis, Rogerio

    2017-01-01

    Plinia trunciflora is a Brazilian native fruit tree from the Myrtaceae family, also known as jaboticaba. This species has great potential by its fruit production. Due to the high content of essential oils in their leaves and of anthocyanins in the fruits, there is also an increasing interest by the pharmaceutical industry. Nevertheless, there are few studies focusing on its molecular biology and genetic characterization. We herein report the complete chloroplast (cp) genome of P. trunciflora using high-throughput sequencing and compare it to other previously sequenced Myrtaceae genomes. The cp genome of P. trunciflora is 159,512 bp in size, comprising inverted repeats of 26,414 bp and single-copy regions of 88,097 bp (LSC) and 18,587 bp (SSC). The genome contains 111 single-copy genes (77 protein-coding, 30 tRNA and four rRNA genes). Phylogenetic analysis using 57 cp protein-coding genes demonstrated that P. trunciflora, Eugenia uniflora and Acca sellowiana form a cluster with closer relationship to Syzygium cumini than with Eucalyptus. The complete cp sequence reported here can be used in evolutionary and population genetics studies, contributing to resolve the complex taxonomy of this species and fill the gap in genetic characterization.

  20. Complete genome sequence of Denitrovibrio acetiphilus type strain (N2460T)

    Energy Technology Data Exchange (ETDEWEB)

    Kiss, Hajnalka; Lang, Elke; Lapidus, Alla; Copeland, Alex; Nolan, Matt; Glavina Del Rio, Tijana; Chen, Feng; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Han, Cliff; Goodwin, Lynne; Pitluck, Sam; Liolios, Konstantinos; Pati, Amrita; Ivanova, Natalia; Mavromatis, Konstantinos; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jeffries, Cynthia D.; Detter, John C.; Brettin, Thomas; Spring, Stefan; Rohde, Manfred; Goker, Markus; Woyke, Tanja; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2010-06-25

    Denitrovibrio acetiphilus Myhr and Torsvik 2000 is the type species of the genus Denitrovibrio in the bacterial family Deferribacteraceae. It is of phylogenetic interest because there are only six genera described in the family Deferribacteraceae. D. acetiphilus was isolated as a representative of a population reducing nitrate to ammonia in a laboratory column simulating the conditions in off-shore oil recovery fields. When nitrate was added to this column undesirable hydrogen sulfide production was stopped because the sulfate reducing populations were superseded by these nitrate reducing bacteria. Here we describe the features of this marine, mesophilic, obligately anaerobic organism respiring by nitrate reduction, together with the complete genome sequence, and annotation. This is the second complete genome sequence of the order Deferribacterales and the class Deferribacteres, which is the sole class in the phylum Deferribacteres. The 3,222,077 bp genome with its 3,034 protein-coding and 51 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  1. Complete genome sequence of Kytococcus sedentarius type strain (strain 541T)

    Energy Technology Data Exchange (ETDEWEB)

    Sims, David; Brettin, Thomas; Detter, John C.; Han, Cliff; Lapidus, Alla; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Chen, Feng; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ovchinnikova, Galina; Pati, Amrita; Ivanova, Natalia; Mavrommatis, Konstantinos; Chen, Amy; Palaniappan, Krishna; D'haeseleer, Patrick; Chain, Patrick; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Schneider, Susanne; Goker, Markus; Pukall, Rudiger; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2009-05-20

    Kytococcus sedentarius (ZoBell and Upham 1944) Stackebrandt et al. 1995 is the type strain of the species, and is of phylogenetic interest because of its location in the Dermacoccaceae, a poorly studied family within the actinobacterial suborder Micrococcineae. K. sedentarius is known for the production of oligoketide antibiotics as well as for its role as an opportunistic pathogen causing valve endocarditis, hemorrhagic pneumonia, and pitted keratolysis. It is strictly aerobic and can only grow when several amino acids are provided in the medium. The strain described in this report is a free-living, nonmotile, Gram-positive bacterium, originally isolated from a marine environment. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of a member of the family Dermacoccaceae and the 2,785,024 bp long single replicon genome with its 2639 protein-coding and 64 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  2. Complete mitochondrial genome of the monogonont rotifer, Brachionus koreanus (Rotifera, Brachionidae).

    Science.gov (United States)

    Hwang, Dae-Sik; Suga, Koushirou; Sakakura, Yoshitaka; Park, Heum Gi; Hagiwara, Atsushi; Rhee, Jae-Sung; Lee, Jae-Seong

    2014-02-01

    The complete mitochondrial genome was obtained from the assembled genome data sequenced by next generation sequencing (NGS) technology from the monogonont rotifer Brachionus koreanus. The mitochondrial genome of B. koreanus was composed of two circular chromosomes designated as mtDNA-I (10,421 bp) and mtDNA-II (11,923 bp). The gene contents of B. koreanus were identical with previously reported B. plicatilis mitochondrial genomes. However, gene orders of B. koreanus showed one rearrangement between the two species. Of 12 protein-coding genes (PCGs), 3 genes (ATP6, ND1, and ND3) had an incomplete stop codon. The A + T base composition of B. koreanus mitochondrial genome was high (68.81%). They also showed anti-G bias (12.03% and 10.97%) on the second and third position of PCGs as well as slight anti-C bias (15.96% and 14.31%) on the first and third position of PCGs.

  3. Complete genome sequence of Clostridium estertheticum DSM 8809, a microbe identified in spoiled vacuum packed beef

    Directory of Open Access Journals (Sweden)

    Zhongyi Yu

    2016-11-01

    Blown pack spoilage (BPS) is a major issue for the beef industry. Aetiological agents of BPS involve members of a group of Clostridium species, including Clostridium estertheticum, which has the ability to produce gas, mostly carbon dioxide, under anaerobic psychrotrophic growth conditions. This spore-forming bacterium grows slowly under laboratory conditions, and it can take up to 3 months to produce a workable culture. These characteristics have limited the study of this commercially challenging bacterium. Consequently, information on this bacterium is limited and no effective controls are currently available to confidently detect and manage this production risk. In this study the complete genome of Clostridium estertheticum DSM 8809 was determined by SMRT® sequencing. The genome consists of a circular chromosome of 4.7 Mbp along with a single plasmid carrying a potential tellurite resistance gene tehB and a Tn3-like resolvase-encoding gene tnpR. The genome sequence was searched for central metabolic pathways that would support its biochemical profile and several enzymes contributing to this phenotype were identified. Several putative antibiotic/biocide/metal resistance-encoding genes and virulence factors were also identified in the genome, a feature that requires further research. The availability of the genome sequence will provide a basic blueprint from which to develop valuable biomarkers that could support and improve the detection and control of this bacterium along the beef production chain.

  4. Assembly and comparative analysis of complete mitochondrial genome sequence of an economic plant Salix suchowensis

    Directory of Open Access Journals (Sweden)

    Ning Ye

    2017-03-01

    Willow is a widely used dioecious woody plant of the Salicaceae family in China. Due to their high biomass yields, willows are promising bioenergy crops. In this study, we assembled the complete mitochondrial (mt) genome sequence of S. suchowensis, with a length of 644,437 bp, using Roche-454 GS FLX Titanium sequencing technologies. The base composition of the S. suchowensis mt genome is A (27.43%), T (27.59%), C (22.34%), and G (22.64%), giving a GC content comparable to that of other angiosperms. This long circular mt genome encodes 58 unique genes (32 protein-coding genes, 23 tRNA genes and 3 rRNA genes), and 9 of the 32 protein-coding genes contain 17 introns. Phylogenetic analysis of 35 species based on 23 protein-coding genes supports Salix as sister to Populus. With the detailed phylogenetic information and the identification of phylogenetic position, some ribosomal protein genes and succinate dehydrogenase genes were found to be frequently lost during evolution. As S. suchowensis is a native shrub willow species, this study of its mt genome will provide useful information for better understanding genomic breeding and the missing pieces of sex determination evolution in the future.
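
    The base composition quoted above can be turned into GC and AT content directly. The following is a minimal sketch using only the four percentages from the abstract; the helper names are illustrative assumptions, and results are rounded to two decimals to avoid floating-point noise.

```python
# Base composition (%) as quoted in the abstract.
composition = {"A": 27.43, "T": 27.59, "C": 22.34, "G": 22.64}


def gc_content(comp: dict) -> float:
    """GC content in percent, rounded to two decimals."""
    return round(comp["G"] + comp["C"], 2)


def at_content(comp: dict) -> float:
    """AT content in percent, rounded to two decimals."""
    return round(comp["A"] + comp["T"], 2)


# The four percentages should account for the whole genome.
checksum = round(sum(composition.values()), 2)

print(gc_content(composition), at_content(composition), checksum)
```

    With the quoted figures this gives a GC content of 44.98% and an AT content of 55.02%, and the four values sum to 100%.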

  5. Structural genomic variations and Parkinson's disease.

    Science.gov (United States)

    Bandrés-Ciga, Sara; Ruz, Clara; Barrero, Francisco J; Escamilla-Sevilla, Francisco; Pelegrina, Javier; Vives, Francisco; Duran, Raquel

    2017-10-01

    Parkinson's disease (PD) is the second most common neurodegenerative disease, whose prevalence is projected to be between 8.7 and 9.3 million by 2030. Until about 20 years ago, PD was considered to be the textbook example of a "non-genetic" disorder. Nowadays, PD is generally considered a multifactorial disorder that arises from the combination and complex interaction of genes and environmental factors. To date, a total of 7 genes including SNCA, LRRK2, PARK2, DJ-1, PINK1, VPS35 and ATP13A2 have been shown unequivocally to cause Mendelian PD. Also, variants with incomplete penetrance in the genes LRRK2 and GBA are considered to be strong risk factors for PD worldwide. Although genetic studies have provided valuable insights into the pathogenic mechanisms underlying PD, the role of structural variation in PD has been understudied in comparison with other genomic variations. Structural genomic variations might substantially account for such genetic substrates yet to be discovered. The present review aims to provide an overview of the structural genomic variants implicated in the pathogenesis of PD.

  6. Complete resequencing of 40 genomes reveals domestication events and genes in silkworm (Bombyx)

    DEFF Research Database (Denmark)

    Xia, Qingyou; Guo, Yiran; Zhang, Ze

    2009-01-01

    A single-base pair resolution silkworm genetic variation map was constructed from 40 domesticated and wild silkworms, each sequenced to approximately threefold coverage, representing 99.88% of the genome. We identified ~16 million single-nucleotide polymorphisms, many indels, and structural varia...

  7. Complete mitochondrial genome sequence of the hedgehog seahorse Hippocampus spinosissimus Weber, 1933 (Gasterosteiformes:Syngnathidae).

    Science.gov (United States)

    Liu, Shuaishuai; Zhang, Yanhong; Wang, Changming; Lin, Qiang

    2016-07-01

    The complete mitochondrial genome sequence of the hedgehog seahorse Hippocampus spinosissimus is reported here for the first time. The total length of the H. spinosissimus mitogenome is 16,527 bp, and it consists of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and 1 control region. The gene order and composition of H. spinosissimus were similar to those of most other vertebrates. The overall base composition of H. spinosissimus is 32.1% A, 30.3% T, 14.9% G and 22.7% C, with a slight A + T-rich feature (62.4%). Phylogenetic analyses based on the complete mitochondrial genome sequence showed that H. spinosissimus has a close genetic relationship to H. ingens and H. kuda.

  8. Complete Genome Sequence of the Endophytic Biocontrol Strain Bacillus velezensis CC09

    OpenAIRE

    Cai, Xunchao; Kang, Xingxing; Xi, Huan; Liu, Changhong; Xue, Yarong

    2016-01-01

    Bacillus velezensis is a heterotypic synonym of B. methylotrophicus, B. amyloliquefaciens subsp. plantarum, and Bacillus oryzicola, and has been used to control plant fungal diseases. In order to fully understand the genetic basis of antimicrobial capacities, we did a complete genome sequencing of the endophytic B. velezensis strain CC09. Genes tightly associated with biocontrol ability, including nonribosomal peptide synthetases, polyketide synthetases, iron acquisition, colonization, and vo...

  9. Identification and Complete Genome Sequence Analysis of a Genotype XIV Newcastle Disease Virus from Nigeria

    Science.gov (United States)

    Shittu, Ismaila; Sharma, Poonam; Volkening, Jeremy D.; Solomon, Ponman; Sulaiman, Lanre K.; Joannis, Tony M.; Williams-Coplin, Dawn; Miller, Patti J.; Dimitrov, Kiril M.

    2016-01-01

    The first complete genome sequence of a strain of Newcastle disease virus (NDV) from genotype XIV is reported here. Strain duck/Nigeria/NG-695/KG.LOM.11-16/2009 was isolated from an apparently healthy domestic duck from a live bird market in Kogi State, Nigeria, in 2009. This strain is classified as a member of subgenotype XIVb of class II. PMID:26823576

  10. The complete mitochondrial genome of the Giant Manta ray, Manta birostris.

    Science.gov (United States)

    Hinojosa-Alvarez, Silvia; Díaz-Jaimes, Pindaro; Marcet-Houben, Marina; Gabaldón, Toni

    2015-01-01

    The complete mitochondrial genome of the giant manta ray (Manta birostris) consists of 18,075 bp with rich A + T and low G content. Gene organization and length are similar to those of other species of ray. It comprises 13 protein-coding genes, 2 rRNA genes, 23 tRNA genes and 1 non-coding sequence, and the control region. We identified an AT tandem repeat region, similar to that reported in Mobula japanica.

  11. Complete genome sequence of Bacillus subtilis BSD-2, a microbial germicide isolated from cultivated cotton.

    Science.gov (United States)

    Liu, Hongwei; Yin, Shuli; An, Likang; Zhang, Genwei; Cheng, Huicai; Xi, Yanhua; Cui, Guanhui; Zhang, Feiyan; Zhang, Liping

    2016-07-20

    Bacillus subtilis BSD-2, isolated from cotton (Gossypium spp.), had strong antagonistic activity against Verticillium dahliae Kleb. and Botrytis cinerea. We sequenced and annotated the complete BSD-2 genome to support better use of this strain, which has surfactin, bacilysin, bacillibactin, subtilosin A, TasA and potential class IV lanthipeptide biosynthetic pathways. Copyright © 2016 Elsevier B.V. All rights reserved.

  12. Complete Genome Sequence of a Newcastle Disease Virus Isolated from Wild Peacock (Pavo cristatus) in India.

    Science.gov (United States)

    Khulape, Sagar A; Gaikwad, Satish S; Chellappa, Madhan Mohan; Mishra, Bishnu Prasad; Dey, Sohini

    2014-06-05

    We report here the complete genome sequence of a Newcastle disease virus (NDV) isolated from a wild peacock. Phylogenetic analysis showed that it belongs to genotype II, class II of NDV strains. This study helps to understand the ecology of NDV strains circulating in a wild avian host of this geographical region during the outbreak of 2012 in northwest India. Copyright © 2014 Khulape et al.

  13. The complete mitochondrial genome of the endangered spotback skate, Atlantoraja castelnaui.

    Science.gov (United States)

    Duckett, Drew J L; Naylor, Gavin J P

    2016-05-01

    Chondrichthyes are a highly threatened class of organisms, largely due to overfishing and other human activities. The present study describes the complete mitochondrial genome (16,750 bp) of the endangered spotback skate, Atlantoraja castelnaui. The mitogenome is arranged in a typical vertebrate fashion, containing 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and 1 control region.

  14. Complete Genome Sequence of Methylobacterium populi P-1M, Isolated from Pink-Pigmented Household Biofilm.

    Science.gov (United States)

    Morohoshi, Tomohiro; Ikeda, Tsukasa

    2016-06-16

    Methylobacterium populi P-1M was isolated from a pink-pigmented household biofilm. Here, we present the complete genome sequence of P-1M, consisting of one chromosome of 5,705,640 bp and five plasmids of 64,864 bp, 59,879 bp, 42,569 bp, 41,417 bp, and 29,506 bp. Copyright © 2016 Morohoshi and Ikeda.
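
    The abstract lists the replicon sizes but not a total. As a small worked example, the sketch below sums the quoted chromosome and plasmid lengths; the variable names are illustrative, and the total is derived here rather than stated in the source.

```python
# Replicon sizes (bp) as quoted in the abstract.
chromosome = 5_705_640
plasmids = [64_864, 59_879, 42_569, 41_417, 29_506]

plasmid_total = sum(plasmids)
genome_total = chromosome + plasmid_total

# Share of the genome carried on plasmids, in percent.
plasmid_share = 100 * plasmid_total / genome_total

print(genome_total, plasmid_total, round(plasmid_share, 1))
```

    On these figures the five plasmids contribute 238,235 bp, roughly 4% of a combined 5,943,875 bp.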

  15. Prediction of transcriptional regulatory sites in the complete genome sequence of Escherichia coli K-12.

    Science.gov (United States)

    Thieffry, D; Salgado, H; Huerta, A M; Collado-Vides, J

    1998-06-01

    As one of the best-characterized free-living organisms, Escherichia coli and its recently completed genomic sequence offer a special opportunity to exploit systematically the variety of regulatory data available in the literature in order to make a comprehensive set of regulatory predictions in the whole genome. The complete genome sequence of E.coli was analyzed for the binding of transcriptional regulators upstream of coding sequences. The biological information contained in RegulonDB (Huerta, A.M. et al., Nucleic Acids Res., 26, 55-60, 1998) for 56 different transcriptional proteins was used to implement a stringent strategy combining string search and weight matrices. We estimate that our search included representatives of 15-25% of the total number of regulatory binding proteins in E.coli. This search was performed on the set of 4288 putative regulatory regions, each 450 bp long. Within the regions with predicted sites, 89% are regulated by one protein and 81% involve only one site. These numbers are reasonably consistent with the distribution of experimental regulatory sites. Regulatory sites are found in 603 regions corresponding to 16% of operon regions and 10% of intra-operonic regions. Additional evidence gives stronger support to some of these predictions, including the position of the site, biological consistency with the function of the downstream gene, as well as genetic evidence for the regulatory interaction. The predictions described here were incorporated into the map presented in the paper describing the complete E.coli genome (Blattner, F.R. et al., Science, 277, 1453-1461, 1997). The complete set of predictions in GenBank format is available at the URL http://www.cifn.unam.mx/Computational_Biology/E.coli-predictions (contact: ecoli-reg@cifn.unam.mx, collado@cifn.unam.mx).
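
    The abstract mentions weight matrices without giving details, so the following is only a generic sketch of how a log-odds position weight matrix can be built from aligned sites and slid along an upstream region. The toy sites, the pseudocount, the uniform background and the threshold are all assumptions made for illustration; none of them come from RegulonDB or from the authors' actual procedure.

```python
import math


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log2-odds weight matrix from equal-length aligned sites."""
    pwm = []
    for pos in range(len(sites[0])):
        counts = {base: pseudocount for base in "ACGT"}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pwm.append({b: math.log2((counts[b] / total) / background) for b in "ACGT"})
    return pwm


def score(pwm, window):
    """Sum the per-position log-odds scores for one window."""
    return sum(column[base] for column, base in zip(pwm, window))


def scan(pwm, sequence, threshold):
    """Return (offset, score) for every window scoring at or above threshold."""
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        s = score(pwm, sequence[start:start + width])
        if s >= threshold:
            hits.append((start, s))
    return hits


# Hypothetical aligned binding sites and upstream region (toy data).
toy_sites = ["TGTGA", "TGTGA", "TGTCA", "TGAGA"]
pwm = build_pwm(toy_sites)
upstream = "ACCTGTGATTTCGA"
hits = scan(pwm, upstream, threshold=4.0)
print(hits)
```

    In this toy run the only window above the threshold starts at offset 3, where the consensus-like TGTGA occurs. A real analysis would choose thresholds per protein and, as the abstract notes, combine matrix scores with string search and other evidence.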

  16. Complete genome sequence of porcine parvovirus N strain isolated from Guangxi, China.

    Science.gov (United States)

    Su, Qian-Lian; Li, Bin; Zhao, Wu; Liang, Jia-Xing; He, Ying; Qin, Yi-Bin; Lu, Bing-Xia

    2015-01-08

    We report here the complete genomic sequence of the porcine parvovirus (PPV) N strain, isolated in 1989 from the viscera of a stillborn fetus farrowed by a gilt in Guangxi, southern China. Phylogenetic analyses suggest that the PPV-N strain is closely related to attenuated PPV NADL-2 strains. The PPV-N strain has good immunogenicity, genetic stability, and safety. Copyright © 2015 Su et al.

  17. Parvovirus B19 1A complete genome from a fatal case in Brazil

    Directory of Open Access Journals (Sweden)

    Liliane Costa Conteville

    2015-09-01

    Parvovirus B19 (B19V) infects individuals worldwide and is associated with an ample range of pathologies and clinical manifestations. B19V is classified into three distinct genotypes, all identified in Brazil. Here, we report a complete sequence of a B19V genotype 1A that was obtained by high-throughput metagenomic sequencing. This genome provides information that will contribute to the studies on B19V epidemiology and evolution.

  18. Complete genome sequence of Lactobacillus paracasei CAUH35, a new strain isolated from traditional fermented dairy product koumiss in China.

    Science.gov (United States)

    Wang, Guohong; Xiong, Yao; Xu, Qi; Yin, Jia; Hao, Yanling

    2015-11-20

    Lactobacillus paracasei CAUH35 was isolated from homemade koumiss, a traditional fermented dairy product with beneficial effects on human health. The genome consists of a circular 2,770,411 bp chromosome and four plasmids. Genome analysis revealed the presence of gene clusters involved in the production of exopolysaccharides and bacteriocin. The complete genome sequence of L. paracasei CAUH35 will provide genetic basis for further comparative and functional genomic analyses. Copyright © 2015. Published by Elsevier B.V.

  19. Characterization of the complete mitochondrial genome of Marshallagia marshalli and phylogenetic implications for the superfamily Trichostrongyloidea.

    Science.gov (United States)

    Sun, Miao-Miao; Han, Liang; Zhang, Fu-Kai; Zhou, Dong-Hui; Wang, Shu-Qing; Ma, Jun; Zhu, Xing-Quan; Liu, Guo-Hua

    2018-01-01

    Marshallagia marshalli (Nematoda: Trichostrongylidae) infection can lead to serious parasitic gastroenteritis in sheep, goats, and wild ruminants, causing significant socioeconomic losses worldwide. To date, studies of the molecular biology of M. marshalli have been limited. Herein, we sequenced the complete mitochondrial (mt) genome of M. marshalli and examined its phylogenetic relationship with selected members of the superfamily Trichostrongyloidea using Bayesian inference (BI) based on concatenated mt amino acid sequence datasets. The complete mt genome sequence of M. marshalli is 13,891 bp, including 12 protein-coding genes, 22 transfer RNA genes, and 2 ribosomal RNA genes. All protein-coding genes are transcribed in the same direction. Phylogenetic analyses based on concatenated amino acid sequences of the 12 protein-coding genes supported the monophylies of the families Haemonchidae, Molineidae, and Dictyocaulidae with strong statistical support, but rejected the monophyly of the family Trichostrongylidae. The determination of the complete mt genome sequence of M. marshalli provides novel genetic markers for studying the systematics, population genetics, and molecular epidemiology of M. marshalli and its congeners.

  20. Characterization of the complete mitochondrial genomes of two whipworms Trichuris ovis and Trichuris discolor (Nematoda: Trichuridae).

    Science.gov (United States)

    Liu, Guo-Hua; Wang, Yan; Xu, Min-Jun; Zhou, Dong-Hui; Ye, Yong-Gang; Li, Jia-Yuan; Song, Hui-Qun; Lin, Rui-Qing; Zhu, Xing-Quan

    2012-12-01

    For many years, whipworms (Trichuris spp.) have been described with a relatively narrow range of both morphological and biometrical features. Moreover, there has been insufficient discrimination between congeners (or closely related species). In the present study, we determined the complete mitochondrial (mt) genomes of two whipworms, Trichuris ovis and Trichuris discolor, compared them, and then tested the hypothesis that T. ovis and T. discolor are distinct species by phylogenetic analyses (using Bayesian inference, maximum likelihood and maximum parsimony) based on the deduced amino acid sequences of the mt protein-coding genes. The complete mt genomes of T. ovis and T. discolor were 13,946 bp and 13,904 bp in size, respectively. Both mt genomes are circular, and consist of 37 genes, including 13 genes coding for proteins, 2 genes for rRNA, and 22 genes for tRNA. The gene content and arrangement are identical to those of the human and pig whipworms Trichuris trichiura and Trichuris suis. Taken together, these analyses showed genetic distinctiveness and strongly supported the recent proposal, based on nuclear ribosomal DNA and a portion of the mtDNA sequence dataset, that T. ovis and T. discolor are distinct species. The availability of the complete mtDNA sequences of T. ovis and T. discolor provides novel genetic markers for studying the population genetics, diagnostics and molecular epidemiology of T. ovis and T. discolor. Copyright © 2012 Elsevier B.V. All rights reserved.

  1. Complete genome sequence of Pseudomonas antarctica PAMC 27494, a bacteriocin-producing psychrophile isolated from Antarctica.

    Science.gov (United States)

    Lee, Jaejin; Cho, Yong-Joon; Yang, Jae Young; Jung, You-Jung; Hong, Soon Gyu; Kim, Ok-Sun

    2017-10-10

    Antimicrobial-producing, cold-adapted microorganisms have great potential for biotechnological applications in food, pharmaceutical, and cosmetic industries. Pseudomonas antarctica PAMC 27494, a psychrophile exhibiting antimicrobial activity, was isolated from an Antarctic freshwater sample. Here we report the complete genome of P. antarctica PAMC 27494. The strain contains a gene cluster encoding microcin B, which inhibits DNA regulations by targeting the DNA gyrase. PAMC 27494 may produce R-type pyocins, contains a complete set of proteins for the biosynthesis of adenosylcobalamin, and possibly promotes plant growth by supplying pyrroloquinoline quinone molecules. Copyright © 2017 Elsevier B.V. All rights reserved.

  2. Analysis of the complete genome of the first Irkut virus isolate from China: comparison across the Lyssavirus genus.

    Science.gov (United States)

    Liu, Ye; Li, Nan; Zhang, Shoufeng; Zhang, Fei; Lian, Hai; Wang, Ying; Zhang, Jinxia; Hu, Rongliang

    2013-12-01

    The genome of Irkut virus, isolate IRKV-THChina12, the first non-rabies lyssavirus from China (of bat origin), has been completely sequenced. In general, coding and non-coding regions of this viral genome are similar to those of other lyssaviruses. However, alignment of the deduced amino acid sequences of the structural proteins of IRKV-THChina12 with those of other lyssavirus representatives revealed significant variability between viral species. The nucleoprotein and matrix protein were found to be the most conserved, followed by the large protein, glycoprotein and phosphoprotein. Differences in the antigenic sites in glycoprotein may result in only partial protection of the available rabies biologics against Irkut virus, which is of particular concern for pre- and post-exposure rabies prophylaxis. Copyright © 2013 Elsevier Inc. All rights reserved.

  3. Complete genome sequence of Bacillus velezensis G341, a strain with a broad inhibitory spectrum against plant pathogens.

    Science.gov (United States)

    Lee, Hyun-Hee; Park, Jungwook; Lim, Jae Yun; Kim, Hun; Choi, Gyung Ja; Kim, Jin-Cheol; Seo, Young-Su

    2015-10-10

    Bacillus velezensis G341 can suppress plant pathogens by producing antagonistic active compounds including bacillomycin D, fengycin, and (oxy) difficidin. The complete genome sequence of this bacterium was characterized by one circular chromosome of 4,009,746 bp with 3953 open reading frames. The genome contained 36 pseudogenes, 30 rRNA operons, and 95 tRNAs. This complete genome sequence provides an additional resource for the development of antimicrobial compounds. Copyright © 2015 Elsevier B.V. All rights reserved.

  4. Complete Genome Sequence of Enteroinvasive Escherichia coli O96:H19 Associated with a Severe Foodborne Outbreak

    Science.gov (United States)

    Pettengill, Emily A.; Hoffmann, Maria; Roberts, Richard J.; Payne, Justin; Allard, Marc; Michelacci, Valeria; Minelli, Fabio; Morabito, Stefano

    2015-01-01

    We present here the complete genome sequence of a strain of enteroinvasive Escherichia coli O96:H19 from a severe foodborne outbreak in a canteen in Italy in 2014. The complete genome may provide important information about the acquired pathogenicity of this strain and the transition between commensal and pathogenic E. coli. PMID:26251502

  5. Complete Genome Sequence of Vibrio campbellii LMB 29 Isolated from Red Drum with Four Native Megaplasmids

    Directory of Open Access Journals (Sweden)

    Jinxin Liu

    2017-10-01

    Vibrio spp. are the most common pathogens for animals reared in aquaculture. Vibrio campbellii, which is often involved in shrimp, fish and mollusk diseases, is widely distributed in the marine environment worldwide, but our knowledge about its pathogenesis and antimicrobial resistance is very limited. This knowledge gap exists at least partly because V. campbellii was originally classified as Vibrio harveyi, and detailed comparative genome analysis against other Vibrio spp. is currently lacking. In this study, the complete genome of a V. campbellii predominant strain, LMB29, was determined by MiSeq in conjunction with PacBio SMRT sequencing. This genome consists of two circular DNA chromosomes and four megaplasmids. Comparative genome analysis indicates that LMB29 shares a 96.66% similarity (average nucleotide identity) with the V. campbellii ATCC strain BAA-1116 based on a 75% AF (average fraction) calculation, and its functional profile is very similar to V. campbellii E1 and V. campbellii CAIM115. Both the type III secretion system (T3SS) and the type VI secretion system (T6SS), along with the tlh gene, which encodes a thermolabile hemolysin, are present in LMB29 and may contribute to the bacterial pathogenesis. The virulence of this strain was experimentally confirmed by performing an LDH assay on a fish cell infection model, and cell death was observed as early as 3 h post infection. Thirty-seven antimicrobial resistance genes (>45% identity) were predicted in LMB29, including a novel rifampicin ADP ribosyltransferase, arr-9, in plasmid pLMB157. The gene arr-9 was predicted to lie on a genomic island with horizontal transfer potential, which may facilitate the dissemination of rifampicin resistance. Further research is needed to explore the pathogenesis of V. campbellii LMB29, but the availability of this genome sequence will certainly serve as a basis for further analysis.

  6. Macromolecular structure determination in the post-genome era

    CERN Document Server

    Kuhn, P

    2001-01-01

    Recent advances in genetics, molecular biology and crystallographic instrumentation and methodology have led to a revolution in the field of Structural Molecular Biology (SMB). These combined advances have paved the way to a more complete and detailed understanding of the biological macromolecules that make up an organism, both in terms of their individual functions and also the interactions between them. In this paper we describe a large-scale, genomic approach to the three-dimensional structure determination of macromolecules and their complexes, using high-throughput methodology to streamline all aspects of the process. This task requires the development of automated high-intensity synchrotron beam lines for X-ray diffraction data collection from single crystal samples. Furthermore, these beam lines must be operated within a sophisticated software and hardware environment, which is capable of delivering a completely automated structure determination pipeline. The SMB resource at SSRL is developing a system...

  7. Comparative Analysis of the Complete Chloroplast Genomes of Four Aconitum Medicinal Species

    Directory of Open Access Journals (Sweden)

    Jing Meng

    2018-04-01

    Aconitum (Ranunculaceae) consists of approximately 400 species distributed in the temperate regions of the northern hemisphere. Many species are well-known herbs, mainly used for analgesia and anti-inflammatory purposes. This genus is well represented in China and has gained widespread attention for its toxicity and detoxification properties. In southwestern China, several Aconitum species, called ‘Dula’ in the Yi Nationality, were often used to control the poisonous effects of other Aconitum plants. In this study, the complete chloroplast (cp) genomes of these species were determined for the first time through Illumina paired-end sequencing. Our results indicate that their cp genomes ranged from 151,214 bp (A. episcopale) to 155,769 bp (A. delavayi) in length. A total of 111–112 unique genes were identified, including 85 protein-coding genes, 36–37 tRNA genes and eight ribosomal RNA genes (rRNA). We also analyzed codon usage, IR expansion or contraction and simple sequence repeats in the cp genomes. Eight variable regions were identified and these may potentially be useful as specific DNA barcodes for species identification of Aconitum. Phylogenetic analysis revealed that all five studied species formed a new clade and were resolved with 100% bootstrap support. This study will provide genomic resources and potential plastid markers for DNA barcoding, further taxonomy and germplasm exploration of Aconitum.

  8. The complete genome sequence of hyperthermophile Dictyoglomus turgidum DSM 6724™ reveals a specialized carbohydrate fermentor

    Directory of Open Access Journals (Sweden)

    Phillip Brumm

    2016-12-01

    Here we report the complete genome sequence of the chemoorganotrophic, extremely thermophilic bacterium, Dictyoglomus turgidum, which is a Gram-negative, strictly anaerobic bacterium. D. turgidum and D. thermophilum together form the Dictyoglomi phylum. The two Dictyoglomus genomes are highly syntenic, and both are distantly related to Caldicellulosiruptor spp. D. turgidum is able to grow on a wide variety of polysaccharide substrates due to significant genomic commitment to glycosyl hydrolases, sixteen of which were cloned and expressed in our study. The GH5, GH10 and GH42 enzymes characterized in this study suggest that D. turgidum can utilize most plant-based polysaccharides except crystalline cellulose. The DNA polymerase I enzyme was also expressed and characterized. The pure enzyme showed improved amplification of long PCR targets compared to Taq polymerase. The genome contains a full complement of DNA modifying enzymes, and an unusually high copy number (4) of a new, ancestral family of polB type nucleotidyltransferases designated as MNT (minimal nucleotidyltransferases). Considering its optimal growth at 72 °C, D. turgidum has an anomalously low G+C content of 39.9% that may account for the presence of reverse gyrase, usually associated with hyperthermophiles.

  9. Reanalysis and revision of the complete mitochondrial genome of Rachycentron canadum (Teleostei, Perciformes, Rachycentridae).

    Science.gov (United States)

    Musika, Jidapa; Khongchatee, Adison; Phinchongsakuldit, Jaros

    2014-08-01

    The complete mitochondrial genome of cobia, Rachycentron canadum, was reanalyzed and revised. The genome is 18,008 bp in length, containing 13 protein-coding genes, 2 ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes, and a control region or displacement loop (D-loop). The gene arrangement is identical to that observed in most vertebrates. Base composition on the heavy strand is 30.14% A, 25.22% C, 15.80% G and 28.84% T. The D-loop region exhibits an A + T rich pattern, containing short tandem repeats of TATATACATGG, TATATGCACAA and TATATGCACGG. The mitochondrial genome studied differs from the previously published genome in two segments: the control region to 12S, and ND5 to tRNA(Glu). The 12S sequence also differs from those published in the databases. Phylogenetic analyses revealed that the differences could be due to errors in sequence assembly and/or sample misidentification in the previous studies.

  10. Comparative Analysis of the Complete Chloroplast Genomes of Four Aconitum Medicinal Species.

    Science.gov (United States)

    Meng, Jing; Li, Xuepei; Li, Hongtao; Yang, Junbo; Wang, Hong; He, Jun

    2018-04-26

    Aconitum (Ranunculaceae) consists of approximately 400 species distributed in the temperate regions of the northern hemisphere. Many species are well-known herbs, mainly used for analgesia and anti-inflammatory purposes. This genus is well represented in China and has gained widespread attention for its toxicity and detoxification properties. In southwestern China, several Aconitum species, called ‘Dula’ in the Yi Nationality, were often used to control the poisonous effects of other Aconitum plants. In this study, the complete chloroplast (cp) genomes of these species were determined for the first time through Illumina paired-end sequencing. Our results indicate that their cp genomes ranged from 151,214 bp (A. episcopale) to 155,769 bp (A. delavayi) in length. A total of 111–112 unique genes were identified, including 85 protein-coding genes, 36–37 tRNA genes and eight ribosomal RNA genes (rRNA). We also analyzed codon usage, IR expansion or contraction and simple sequence repeats in the cp genomes. Eight variable regions were identified and these may potentially be useful as specific DNA barcodes for species identification of Aconitum. Phylogenetic analysis revealed that all five studied species formed a new clade and were resolved with 100% bootstrap support. This study will provide genomic resources and potential plastid markers for DNA barcoding, further taxonomy and germplasm exploration of Aconitum.

  11. A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis

    Directory of Open Access Journals (Sweden)

    Stajich Jason E

    2006-11-01

    Background: To date, most fungal phylogenies have been derived from single gene comparisons, or from concatenated alignments of a small number of genes. The increase in fungal genome sequencing presents an opportunity to reconstruct evolutionary events using entire genomes. As a tool for future comparative, phylogenomic and phylogenetic studies, we used both supertrees and concatenated alignments to infer relationships between 42 species of fungi for which complete genome sequences are available. Results: A dataset of 345,829 genes was extracted from 42 publicly available fungal genomes. Supertree methods were employed to derive phylogenies from 4,805 single gene families. We found that the average consensus supertree method may suffer from long-branch attraction artifacts, while matrix representation with parsimony (MRP) appears to be immune from these. A genome phylogeny was also reconstructed from a concatenated alignment of 153 universally distributed orthologs. Our MRP supertree and concatenated phylogeny are highly congruent. Within the Ascomycota, the sub-phyla Pezizomycotina and Saccharomycotina were resolved. Both phylogenies infer that the Leotiomycetes are the closest sister group to the Sordariomycetes. There is some ambiguity regarding the placement of Stagonospora nodorum, the sole member of the class Dothideomycetes present in the dataset. Within the Saccharomycotina, a monophyletic clade containing organisms that translate CTG as serine instead of leucine is evident. There is also strong support for two groups within the CTG clade, one containing the fully sexual species Candida lusitaniae, Candida guilliermondii and Debaryomyces hansenii, and the second group containing Candida albicans, Candida dubliniensis, Candida tropicalis, Candida parapsilosis and Lodderomyces elongisporus. The second major clade within the Saccharomycotina contains species whose genomes have undergone a whole genome duplication (WGD), and their close

  12. Complete genome sequence of a novel Plum pox virus strain W isolate determined by 454 pyrosequencing.

    Science.gov (United States)

    Sheveleva, Anna; Kudryavtseva, Anna; Speranskaya, Anna; Belenikin, Maxim; Melnikova, Natalia; Chirkov, Sergei

    2013-10-01

    The near-complete (99.7 %) genome sequence of a novel Russian Plum pox virus (PPV) isolate Pk, belonging to the strain Winona (W), has been determined by 454 pyrosequencing with the exception of the thirty-one 5'-terminal nucleotides. This region was amplified using a 5'RACE kit and sequenced by the Sanger method. Genomic RNA released from immunocaptured PPV particles was employed for generation of a cDNA library using the TransPlex Whole transcriptome amplification kit (WTA2, Sigma-Aldrich). The entire Pk genome has an identity level of 92.8-94.5 % when compared to the complete nucleotide sequences of other PPV-W isolates (W3174, LV-141pl, LV-145bt, and UKR 44189), confirming a high degree of variability within the PPV-W strain. The isolates Pk and LV-141pl are most closely related. Pk was found in a wild plum (Prunus domestica) in a new region of Russia, indicating widespread dissemination of the PPV-W strain in the European part of the former USSR.

  13. [Sequencing and analysis of the complete mitochondrial genome of the King Cobra, Ophiophagus hannah (Serpentes: Elapidae)].

    Science.gov (United States)

    Chen, Nian; Lai, Xiao-Ping

    2010-07-01

    We obtained the complete mitochondrial genome of the King Cobra (GenBank accession number: EU_921899) by Ex Taq-PCR, TA-cloning and primer-walking methods. The genome, which is very similar to those of other vertebrates, is 17,267 bp in length and encodes 38 genes (13 protein-coding, 2 ribosomal RNA and 23 transfer RNA genes) and two long non-coding regions. The duplication of the tRNA-Ile gene forms a new mitochondrial gene rearrangement model. Eight tRNA genes and one protein-coding gene were transcribed from the L strand, and the other genes were transcribed from the H strand. Genes on the H strand show fairly similar adenine and thymine contents, whereas those on the L strand have a higher proportion of A than T. Combined rDNA sequence data (12S+16S rRNA) were used to reconstruct the phylogeny of 21 snake species for which complete mitochondrial genome sequences were available in the public databases. This large data set and an appropriate range of outgroup taxa demonstrated that Elapidae is more closely related to Colubridae than to Viperidae, which supports the traditional viewpoint.

  14. The complete chloroplast genome sequence of the CAM epiphyte Spanish moss (Tillandsia usneoides, Bromeliaceae) and its comparative analysis.

    Directory of Open Access Journals (Sweden)

    Péter Poczai

    Spanish moss (Tillandsia usneoides) is an epiphytic bromeliad widely distributed throughout tropical and warm temperate America. This plant is highly adapted to extreme environmental conditions. Striking features of this species include specialized trichomes (scales) covering the surface of its shoots aiding the absorption of water and nutrients directly from the atmosphere and a specific photosynthesis using crassulacean acid metabolism (CAM). Here we report the plastid genome of Spanish moss and present the comparison of genome organization and sequence evolution within Poales. The plastome of Spanish moss has a quadripartite structure consisting of a large single copy (LSC, 87,439 bp), two inverted regions (IRa and IRb, 26,803 bp) and short single copy (SSC, 18,612 bp) region. The plastid genome had 37.2% GC content and 134 genes with 88 being unique protein-coding genes and 20 of these are duplicated in the IR, similar to other reported bromeliads. Our study shows that early diverging lineages of Poales do not have high substitution rates as compared to grasses, and plastid genomes of bromeliads show structural features considered to be ancestral in graminids. These include the loss of the introns in the clpP and rpoC1 genes and the complete loss or partial degradation of accD and ycf genes in the Graminid clade. Further structural rearrangements appeared in the graminids lacking in Spanish moss, which include a 28-kb inversion between the trnG-UCC-rps14 region and 6-kb in the trnG-UCC-psbD, followed by a third <1kb inversion in the trnT sequence.

  15. The complete chloroplast genome sequence of the CAM epiphyte Spanish moss (Tillandsia usneoides, Bromeliaceae) and its comparative analysis.

    Science.gov (United States)

    Poczai, Péter; Hyvönen, Jaakko

    2017-01-01

    Spanish moss (Tillandsia usneoides) is an epiphytic bromeliad widely distributed throughout tropical and warm temperate America. This plant is highly adapted to extreme environmental conditions. Striking features of this species include specialized trichomes (scales) covering the surface of its shoots aiding the absorption of water and nutrients directly from the atmosphere and a specific photosynthesis using crassulacean acid metabolism (CAM). Here we report the plastid genome of Spanish moss and present the comparison of genome organization and sequence evolution within Poales. The plastome of Spanish moss has a quadripartite structure consisting of a large single copy (LSC, 87,439 bp), two inverted regions (IRa and IRb, 26,803 bp) and short single copy (SSC, 18,612 bp) region. The plastid genome had 37.2% GC content and 134 genes with 88 being unique protein-coding genes and 20 of these are duplicated in the IR, similar to other reported bromeliads. Our study shows that early diverging lineages of Poales do not have high substitution rates as compared to grasses, and plastid genomes of bromeliads show structural features considered to be ancestral in graminids. These include the loss of the introns in the clpP and rpoC1 genes and the complete loss or partial degradation of accD and ycf genes in the Graminid clade. Further structural rearrangements appeared in the graminids lacking in Spanish moss, which include a 28-kb inversion between the trnG-UCC-rps14 region and 6-kb in the trnG-UCC-psbD, followed by a third <1kb inversion in the trnT sequence.
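    The quadripartite layout described above implies a total plastome length equal to the sum of its four regions. The following is a minimal sketch of that arithmetic, not a figure taken from the abstract: it assumes the quoted 26,803 bp is the length of each inverted-repeat copy (IRa and IRb), and `plastome_length` is a hypothetical helper name.

```python
# Region lengths (bp) as quoted in the abstract above.
LSC = 87_439
IR = 26_803   # ASSUMPTION: length of EACH inverted-repeat copy (IRa, IRb)
SSC = 18_612


def plastome_length(lsc, ir, ssc):
    """Total length of a quadripartite plastome: LSC + IRa + IRb + SSC."""
    return lsc + 2 * ir + ssc


total = plastome_length(LSC, IR, SSC)
print(f"Implied plastome length: {total:,} bp")
print(f"Share of the genome in the two IR copies: {2 * IR / total:.1%}")
```

    If the 26,803 bp figure instead covered both IR copies combined, the factor of 2 would have to be dropped, so the result should be read as conditional on that assumption.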

  16. [Complete genome sequencing and analyses of rabies viruses isolated from wild animals (Chinese Ferret-Badger) in Zhejiang province].

    Science.gov (United States)

    Lei, Yong-Liang; Wang, Xiao-Guang; Liu, Fu-Ming; Chen, Xiu-Ying; Ye, Bi-Feng; Mei, Jian-Hua; Lan, Jin-Quan; Tang, Qing

    2009-08-01

    By sequencing the full-length genomes of two rabies viruses isolated from Chinese ferret-badgers, we analyzed the genetic variation of rabies viruses at the molecular level, to obtain information on the prevalence and variation of rabies viruses in Zhejiang and to enrich the genome database of rabies virus street strains isolated from Chinese wildlife. Overlapping fragments were amplified by RT-PCR and the full-length genomes were assembled; nucleotide and deduced protein similarities were analyzed, and phylogenetic analyses of the N genes from Chinese ferret-badger, sika deer, vole, dog and vaccine strains were performed. The two full-length genomes were completely sequenced and had the same genetic structure, 11,923 nt in length, comprising a 58-nt leader, 1353-nt NP, 894-nt PP, 609-nt MP, 1575-nt GP and 6386-nt LP, intergenic regions (IGRs) of 2, 5 and 5 nt, a 423-nt pseudogene-like sequence (Psi) and a 70-nt trailer. BLAST and multiple sequence alignment showed that the two full-length genomes were consistent with the properties of Lyssavirus in the family Rhabdoviridae. The nucleotide and amino acid sequences of the Chinese strains had the highest similarity, especially among animals of the same species. For the two full-length genomes, similarity at the amino acid level was markedly higher than at the nucleotide level, so the nucleotide mutations in these two genomes were most probably synonymous. Compared with the reference rabies viruses, the five protein-coding regions showed no change in length and no recombination, only a few point mutations. The five proteins thus appeared to be stable. The sites and types of variation in the two ferret-badger genomes were similar to those of the reference vaccine or street strains. According to the multiple sequence alignment and phylogenetic analyses, the two strains were genotype 1 and possessed the distinct geographic characteristics of China. All the evidence suggested that these two ferret badgers

  17. [Sequencing and analysis of complete genome of rabies viruses isolated from Chinese Ferret-Badger and dog in Zhejiang province].

    Science.gov (United States)

    Lei, Yong-Liang; Wang, Xiao-Guang; Tao, Xiao-Yan; Li, Hao; Meng, Sheng-Li; Chen, Xiu-Ying; Liu, Fu-Ming; Ye, Bi-Feng; Tang, Qing

    2010-01-01

    By sequencing the full-length genomes of four rabies viruses isolated from Chinese ferret-badgers and a dog, we analyzed the genetic variation of rabies viruses at the molecular level, to obtain information on rabies virus prevalence and variation in Zhejiang and to enrich the genome database of rabies virus street strains isolated in China. Rabies viruses were isolated in suckling mice, overlapping fragments were amplified by RT-PCR and the full-length genomes were assembled; nucleotide and deduced protein similarities were analyzed, and phylogenetic analyses of strains from Chinese ferret-badger, dog, sika deer and vole, together with vaccine strains, were performed. The four full-length genomes were sequenced completely and had the same genetic structure, 11,923 nt or 11,925 nt in length, comprising a 58-nt leader, 1353-nt NP, 894-nt PP, 609-nt MP, 1575-nt GP and 6386-nt LP, intergenic regions (IGRs) of 2, 5 and 5 nt, a 423-nt pseudogene-like sequence (psi) and a 70-nt trailer. BLAST and multiple sequence alignment showed that the four full-length genomes were consistent with the properties of Lyssavirus in the family Rhabdoviridae. The nucleotide and amino acid sequences of the Chinese strains had the highest similarity, especially among animals of the same species. For the four full-length genomes, similarity at the amino acid level was markedly higher than at the nucleotide level, so most of the nucleotide mutations in these genomes were synonymous. Compared with the reference rabies viruses, the five protein-coding regions showed no change in length and no recombination, only a few point mutations. The five proteins thus appeared to be stable. The sites and types of variation in the four genomes were similar to those of the reference vaccine or street strains. According to the multiple sequence alignment and phylogenetic analyses, the four strains were genotype 1 and possessed the distinct regional characteristics of China. Therefore, these four rabies viruses are likely to be street viruses

  18. Complete mitochondrial genome of the big-eared horseshoe bat Rhinolophus macrotis (Chiroptera, Rhinolophidae).

    Science.gov (United States)

    Zhang, Lin; Sun, Keping; Feng, Jiang

    2016-11-01

    We sequenced and characterized the complete mitochondrial genome of the big-eared horseshoe bat, Rhinolophus macrotis. The total length of the mitogenome is 16,848 bp, with a base composition of 31.2% A, 25.3% T, 28.8% C and 14.7% G. The mitogenome consists of 13 protein-coding genes, 2 rRNA (12S and 16S rRNA) genes, 22 tRNA genes and 1 control region. It has the same gene arrangement pattern as typical vertebrate mitochondrial genomes. The results will contribute to our understanding of the taxonomic status and evolution of bats in the genus Rhinolophus.

  19. Complete mitochondrial genome of the geophilous grasshopper Trilophidia annulata (Acrididae: Oedipodinae: Trilophidia).

    Science.gov (United States)

    Guan, De-Long; Xu, Sheng-Quan

    2016-09-01

    The complete mitogenome of the geophilous grasshopper Trilophidia annulata was reconstructed from whole-genome Illumina sequencing data. After annotation, the circular genome was found to be 16,501 bp in length and to have the typical content of 37 genes, comprising 13 protein-coding genes (PCGs), 22 transfer RNAs (tRNAs) and 2 ribosomal RNAs (rRNAs), plus 1 D-loop region. All PCGs were initiated with ATN codons, except ND2, which had the start codon GTG. Most of the PCGs used TAA as their stop codon, while the others (COX1, COX3 and ND1) used TAG. The nucleotide composition was asymmetric (42.3% A, 15.0% C, 11.0% G, 31.8% T) with an overall GC content of 25.9%. These data would contribute to the design of novel molecular markers for population and evolutionary research of T. annulata.

  20. A study of the peopling of Greenland using next generation sequencing of complete mitochondrial genomes

    DEFF Research Database (Denmark)

    Lopopolo, Maria; Børsting, Claus; Pereira, Vania

    2016-01-01

    Objectives: The Greenlandic population history is characterized by a number of migrations of people of various ethnicities. In this work, the analysis of the complete mtDNA genome aimed to contribute to the ongoing debate on the origin of current Greenlanders and, at the same time, to address the migration patterns in the Greenlandic population from a female inheritance demographic perspective. Methods: We investigated the maternal genetic variation in the Greenlandic population by sequencing the whole mtDNA genome in 127 Greenlandic individuals using the Illumina MiSeq® platform. Results: All Greenlandic individuals belonged to the Inuit mtDNA lineages A2a, A2b1, and D4b1a2a1. No European haplogroup was found. Discussion: The mtDNA lineages seem to support the hypothesis that the Inuit in Greenland are descendants from the Thule migration. The results also reinforce the importance of isolation...

  1. The complete mitochondrial genome of the redeye mullet Liza haematocheila (Teleostei, Mugilidae).

    Science.gov (United States)

    Chen, Jianhua; Li, Yinglei; Chen, Haigang; Yan, Binlun; Meng, Xueping

    2015-01-01

    The complete mitochondrial sequence of the redeye mullet Liza haematocheila has been determined. The circular genome is 16,822 bp in size, and consists of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a control region. The gene order and composition of L. haematocheila are similar to those of most other teleosts. The base composition of the H-strand is 26.42% (A), 26.38% (T), 16.72% (G) and 30.47% (C), with an AT content of 52.8%. All genes are encoded on the heavy strand with the exception of ND6 and eight tRNA genes. The mitochondrial genome of L. haematocheila presented here will help resolve phylogenetic relationships within the family Scatophagidae and the Mugiliformes.
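    The AT content quoted above follows directly from the reported base composition. As an illustrative sketch only, the snippet below recomputes it from the four H-strand percentages given in the abstract; `at_content` is a hypothetical helper name, not something from the paper.

```python
# H-strand base composition (percent) as quoted in the abstract above.
composition = {"A": 26.42, "T": 26.38, "G": 16.72, "C": 30.47}


def at_content(comp):
    """Return the combined A + T percentage (hypothetical helper)."""
    return comp["A"] + comp["T"]


print(f"AT content: {at_content(composition):.1f}%")
# The four rounded percentages need not sum to exactly 100.
print(f"Sum of all four bases: {sum(composition.values()):.2f}%")
```

    The recomputed value agrees with the reported 52.8% after rounding to one decimal place.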

  2. The Complete Genome Sequence of the Fish Pathogen Tenacibaculum maritimum Provides Insights into Virulence Mechanisms

    Directory of Open Access Journals (Sweden)

    David Pérez-Pascual

    2017-08-01

    Tenacibaculum maritimum is a devastating bacterial pathogen of wild and farmed marine fish with a broad host range and a worldwide distribution. We report here the complete genome sequence of the T. maritimum type strain NCIMB 2154T. The genome consists of a 3,435,971-base pair circular chromosome with 2,866 predicted protein-coding genes. Genes encoding the biosynthesis of exopolysaccharides, the type IX secretion system, iron uptake systems, adhesins, hemolysins, proteases, and glycoside hydrolases were identified. They are likely involved in the virulence process including immune escape, invasion, colonization, destruction of host tissues, and nutrient scavenging. Among the predicted virulence factors, type IX secretion-mediated and cell-surface exposed proteins were identified, including an atypical sialidase, a sphingomyelinase and a chondroitin AC lyase whose activities were demonstrated in vitro.

  3. Complete genome sequence of Tolumonas auensis type strain (TA 4T)

    Energy Technology Data Exchange (ETDEWEB)

    Chertkov, Olga; Copeland, Alex; Lucas, Susan; Lapidus, Alla; Berry, Kerrie W.; Detter, John C.; Glavina Del Rio, Tijana; Hammon, Nancy; Dalin, Eileen; Tice, Hope; Pitluck, Sam; Richardson, Paul; Bruce, David; Goodwin, Lynne; Han, Cliff; Tapia, Roxanne; Saunders, Elizabeth; Schmutz, Jeremy; Brettin, Thomas; Larimer, Frank; Land, Miriam; Hauser, Loren; Spring, Stefan; Rohde, Manfred; Kyrpides, Nikos C.; Ivanova, Natalia; Göker, Markus; Beller, Harry R.; Klenk, Hans-Peter; Woyke, Tanja

    2011-10-04

    Tolumonas auensis (Fischer-Romero et al. 1996) is currently the only validly named species of the genus Tolumonas in the family Aeromonadaceae. The strain is of interest because of its ability to produce toluene from phenylalanine and other phenyl precursors, as well as phenol from tyrosine. This is of interest because toluene is normally considered to be a tracer of anthropogenic pollution in lakes, but T. auensis represents a biogenic source of toluene. Other than Aeromonas hydrophila subsp. hydrophila, T. auensis strain TA 4T is the only other member in the family Aeromonadaceae with a completely sequenced type-strain genome. The 3,471,292-bp chromosome with a total of 3,288 protein-coding and 116 RNA genes was sequenced as part of the DOE Joint Genome Institute Program JBEI 2008.

  4. Complete genome sequence of Tolumonas auensis type strain (TA 4T)

    Energy Technology Data Exchange (ETDEWEB)

    Chertkov, Olga [Los Alamos National Laboratory (LANL); Copeland, A [U.S. Department of Energy, Joint Genome Institute; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Berry, Alison M [California Institute of Technology, University of California, Davis; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Hammon, Nancy [U.S. Department of Energy, Joint Genome Institute; Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Richardson, P M [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Han, Cliff [Los Alamos National Laboratory (LANL); Tapia, Roxanne [Los Alamos National Laboratory (LANL); Saunders, Elizabeth H [Los Alamos National Laboratory (LANL); Schmutz, Jeremy [Stanford University; Brettin, Thomas S [ORNL; Larimer, Frank W [ORNL; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Spring, Stefan [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Beller, Harry R. [Lawrence Berkeley National Laboratory (LBNL); Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute

    2011-01-01

    Tolumonas auensis Fischer-Romero et al. 1996 is currently the only validly named species of the genus Tolumonas in the family Aeromonadaceae. The strain is of interest because of its ability to produce toluene from phenylalanine and other phenyl precursors, as well as phenol from tyrosine. This is of interest because toluene is normally considered to be a tracer of anthropogenic pollution in lakes, but T. auensis represents a biogenic source of toluene. Other than Aeromonas hydrophila subsp. hydrophila, T. auensis strain TA 4T is the only other member in the family Aeromonadaceae with a completely sequenced type-strain genome. The 3,471,292 bp chromosome with a total of 3,288 protein-coding and 116 RNA genes was sequenced as part of the DOE Joint Genome Institute Program JBEI 2008.

  5. Complete genome sequence of a new bipartite begomovirus infecting fluted pumpkin (Telfairia occidentalis) plants in Cameroon.

    Science.gov (United States)

    Leke, Walter N; Khatabi, Behnam; Fondong, Vincent N; Brown, Judith K

    2016-08-01

    The complete genome sequence was determined and characterized for a previously unreported bipartite begomovirus from fluted pumpkin (Telfairia occidentalis, family Cucurbitaceae) plants displaying mosaic symptoms in Cameroon. The DNA-A and DNA-B components were ~2.7 kb and ~2.6 kb in size, and the arrangement of viral coding regions on the genomic components was like those characteristic of other known bipartite begomoviruses originating in the Old World. While the DNA-A component was more closely related to that of chayote yellow mosaic virus (ChaYMV), at 78 %, the DNA-B component was more closely related to that of soybean chlorotic blotch virus (SbCBV), at 64 %. This newly discovered bipartite Old World virus is herein named telfairia mosaic virus (TelMV).

  6. Complete genome sequence of Olsenella uli type strain (VPI D76D-27CT)

    Energy Technology Data Exchange (ETDEWEB)

    Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Held, Brittany [Los Alamos National Laboratory (LANL); Lucas, Susan [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Yasawong, Montri [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Tapia, Roxanne [Los Alamos National Laboratory (LANL); Han, Cliff [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Ovchinnikova, Galina [U.S. Department of Energy, Joint Genome Institute; Pati, Amrita [U.S. Department of Energy, Joint Genome Institute; Chen, Amy [U.S. Department of Energy, Joint Genome Institute; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Chang, Yun-Juan [ORNL; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL); Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Sikorski, Johannes [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Pukall, Rudiger [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Bristow, James [U.S. Department of Energy, Joint Genome Institute; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany

    2010-01-01

    Olsenella uli (Olsen et al. 1991) Dewhirst et al. 2001 is the type species of the genus Olsenella, which belongs to the actinobacterial family Coriobacteriaceae. The species is of interest because it is frequently isolated from dental plaque in periodontitis patients and can cause primary endodontic infection. The species is a Gram-positive, non-motile and non-sporulating bacterium. The strain described in this study was isolated from human gingival crevices in 1982. This is the first completed sequence of the genus Olsenella and the fifth sequence from the family Coriobacteriaceae. The 2,051,896 bp long genome with its 1,795 protein-coding and 55 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.

  7. Complete mitochondrial genome of Lutzomyia (Nyssomyia) umbratilis (Diptera: Psychodidae), the main vector of Leishmania guyanensis.

    Science.gov (United States)

    Kocher, Arthur; Gantier, Jean-Charles; Holota, Hélène; Jeziorski, Céline; Coissac, Eric; Bañuls, Anne-Laure; Girod, Romain; Gaborit, Pascal; Murienne, Jérôme

    2016-11-01

    The nearly complete mitochondrial genome of Lutzomyia umbratilis Ward & Fraiha, 1977 (Psychodidae: Phlebotominae), considered the main vector of Leishmania guyanensis, is presented. The sequencing was performed on an Illumina Hiseq 2500 platform, with a genome skimming strategy. The full nuclear ribosomal RNA segment was also assembled. The mitogenome of L. umbratilis was determined to be at least 15,717 bp long and presents an architecture found in many insect mitogenomes (13 protein-coding genes, 22 transfer RNAs, two ribosomal RNAs, and one non-coding region also referred to as the control region). The control region contains a large repeated element of c. 370 bp and a poly-AT region of unknown length. This is the first mitogenome of Psychodidae to be described.

  8. Implications of structural genomics target selection strategies: Pfam5000, whole genome, and random approaches

    Energy Technology Data Exchange (ETDEWEB)

    Chandonia, John-Marc; Brenner, Steven E.

    2004-07-14

    The structural genomics project is an international effort to determine the three-dimensional shapes of all important biological macromolecules, with a primary focus on proteins. Target proteins should be selected according to a strategy which is medically and biologically relevant, of good value, and tractable. As an option to consider, we present the Pfam5000 strategy, which involves selecting the 5000 most important families from the Pfam database as sources for targets. We compare the Pfam5000 strategy to several other proposed strategies that would require similar numbers of targets. These include complete solution of several small to moderately sized bacterial proteomes, partial coverage of the human proteome, and random selection of approximately 5000 targets from sequenced genomes. We measure the impact that successful implementation of these strategies would have upon structural interpretation of the proteins in Swiss-Prot, TrEMBL, and 131 complete proteomes (including 10 of eukaryotes) from the Proteome Analysis database at EBI. Solving the structures of proteins from the 5000 largest Pfam families would allow accurate fold assignment for approximately 68 percent of all prokaryotic proteins (covering 59 percent of residues) and 61 percent of eukaryotic proteins (40 percent of residues). More fine-grained coverage which would allow accurate modeling of these proteins would require an order of magnitude more targets. The Pfam5000 strategy may be modified in several ways, for example to focus on larger families, bacterial sequences, or eukaryotic sequences; as long as secondary consideration is given to large families within Pfam, coverage results vary only slightly. In contrast, focusing structural genomics on a single tractable genome would have only a limited impact on structural knowledge of other proteomes: a significant fraction (about 30-40 percent of the proteins, and 40-60 percent of the residues) of each proteome is classified in small

  9. Molecular Characterization of Five Potyviruses Infecting Korean Sweet Potatoes Based on Analyses of Complete Genome Sequences

    Directory of Open Access Journals (Sweden)

    Hae-Ryun Kwak

    2015-12-01

    Sweet potatoes (Ipomoea batatas L.) are grown extensively in tropical and temperate regions and are important food crops worldwide. In Korea, potyviruses, including Sweet potato feathery mottle virus (SPFMV), Sweet potato virus C (SPVC), Sweet potato virus G (SPVG), Sweet potato virus 2 (SPV2), and Sweet potato latent virus (SPLV), have been detected in sweet potato fields at a high (~95%) incidence. In the present work, complete genome sequences of 18 isolates, representing the five potyviruses mentioned above, were compared with previously reported genome sequences. The complete genomes consisted of 10,081 to 10,830 nucleotides, excluding the poly-A tails. Their genomic organizations were typical of the Potyvirus genus, including one target open reading frame coding for a putative polyprotein. Based on phylogenetic analyses and sequence comparisons, the Korean SPFMV isolates belonged to the strains RC and O with >98% nucleotide sequence identity. Korean SPVC isolates had 99% identity to the Japanese isolate SPVC-Bungo and 70% identity to the SPFMV isolates. The Korean SPVG isolates showed 99% identity to the three previously reported SPVG isolates. Korean SPV2 isolates had 97% identity to the SPV2 GWB-2 isolate from the USA. Korean SPLV isolates had a relatively low (88%) nucleotide sequence identity with the Taiwanese SPLV-TW isolates, and they were phylogenetically distantly related to SPFMV isolates. Recombination analysis revealed that possible recombination events occurred in the P1, HC-Pro and NIa-NIb regions of SPFMV and SPLV isolates, and these regions were identified as hotspots for recombination in the sweet potato potyviruses.
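
    As a rough illustration of how percent nucleotide identity figures like those quoted above can be derived from a pairwise alignment, here is a minimal Python sketch. The function name `percent_identity`, the toy sequences, and the convention of skipping gapped columns are assumptions made for illustration; the abstract does not say which tool or identity definition the authors used.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the ungapped columns of a pairwise alignment.

    Assumption: columns where either sequence has a gap are excluded from
    the denominator. Other conventions exist and give different numbers.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for base_a, base_b in zip(aligned_a.upper(), aligned_b.upper()):
        if base_a == gap or base_b == gap:
            continue  # skip gapped columns
        compared += 1
        if base_a == base_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy sequences (hypothetical, not taken from any isolate in the study).
toy_identity = percent_identity("ACGTACGT", "ACGTTCGT")  # 7 of 8 columns match
gapped_identity = percent_identity("AC-T", "ACGT")       # gap column is skipped
```

    Because the result depends on how gaps are treated, identity values are only comparable between studies that use the same convention.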

  10. Complete genome sequencing and phylogenetic analysis of dengue type 1 virus isolated from Jeddah, Saudi Arabia.

    Science.gov (United States)

    Azhar, Esam I; Hashem, Anwar M; El-Kafrawy, Sherif A; Abol-Ela, Said; Abd-Alla, Adly M M; Sohrab, Sayed Sartaj; Farraj, Suha A; Othman, Norah A; Ben-Helaby, Huda G; Ashshi, Ahmed; Madani, Tariq A; Jamjoom, Ghazi

    2015-01-16

    Dengue viruses (DENVs) are mosquito-borne viruses which can cause disease ranging from mild fever to severe dengue infection. These viruses are endemic in several tropical and subtropical regions. Multiple outbreaks of DENV serotypes 1, 2 and 3 (DENV-1, DENV-2 and DENV-3) have been reported from the western region in Saudi Arabia since 1994. Strains from at least two genotypes of DENV-1 (Asia and America/Africa genotypes) have been circulating in western Saudi Arabia until 2006. However, all previous studies reported from Saudi Arabia were based on partial sequencing data of the envelope (E) gene without any reports of full genome sequences for any DENV serotypes circulating in Saudi Arabia. Here, we report the isolation and the first complete genome sequence of a DENV-1 strain (DENV-1-Jeddah-1-2011) isolated from a patient from Jeddah, Saudi Arabia in 2011. Whole genome sequence alignment and phylogenetic analysis showed high similarity between DENV-1-Jeddah-1-2011 strain and D1/H/IMTSSA/98/606 isolate (Asian genotype) reported from Djibouti in 1998. Further analysis of the full envelope gene revealed a close relationship between DENV-1-Jeddah-1-2011 strain and isolates reported between 2004-2006 from Jeddah as well as recent isolates from Somalia, suggesting the widespread of the Asian genotype in this region. These data suggest that strains belonging to the Asian genotype might have been introduced into Saudi Arabia long before 2004 most probably by African pilgrims and continued to circulate in western Saudi Arabia at least until 2011. Most importantly, these results indicate that pilgrims from dengue endemic regions can play an important role in the spread of new DENVs in Saudi Arabia and the rest of the world. Therefore, availability of complete genome sequences would serve as a reference for future epidemiological studies of DENV-1 viruses.

  11. The complete mitochondrial genome of Meloidogyne graminicola (Tylenchina): a unique gene arrangement and its phylogenetic implications.

    Directory of Open Access Journals (Sweden)

    Longhua Sun

    Meloidogyne graminicola is one of the most economically important plant-parasitic nematodes (PPNs). In the present study, we determined the complete mitochondrial (mt) DNA genome sequence of this plant pathogen. Compared with other PPN genera, this genome (19,589 bp) is only slightly smaller than that of Pratylenchus vulnus (21,656 bp). The nucleotide composition of the whole mtDNA sequence of M. graminicola is significantly biased toward A and T, with T being the most favored nucleotide and C being the least favored. The A+T content of the entire genome is 83.51%. The mt genome of M. graminicola contains 36 genes (lacking atp8) that are transcribed in the same direction. The gene arrangement of the mt genome of M. graminicola is unique. A total of 21 out of 22 tRNAs possess a DHU loop only, while tRNASer(AGN) lacks a DHU loop. The two large noncoding regions (2,031 bp and 5,063 bp) are disrupted by tRNASer(UCN). Phylogenetic analysis based on concatenated amino acid sequences of 12 protein-coding genes supports the monophylies of the three orders Rhabditida, Mermithida and Trichinellida, the suborder Rhabditina and the three infraorders Spiruromorpha, Oxyuridomorpha and Ascaridomorpha, but does not support the monophylies of the two suborders Spirurina and Tylenchina, and the three infraorders Rhabditomorpha, Panagrolaimomorpha and Tylenchomorpha. The four Tylenchomorpha species including M. graminicola, P. vulnus, H. glycines and R. similis from the superfamily Tylenchoidea are placed within a well-supported monophyletic clade, but far from the other two Tylenchomorpha species B. xylophilus and B. mucronatus of Aphelenchoidea. In the clade of Tylenchoidea, M. graminicola is sister to P. vulnus, and H. glycines is sister to R. similis, which suggests that root-knot nematodes have a closer relationship to Pratylenchidae nematodes than to cyst nematodes.

  12. The complete multipartite genome sequence of Cupriavidus necator JMP134, a versatile pollutant degrader.

    Directory of Open Access Journals (Sweden)

    Athanasios Lykidis

    BACKGROUND: Cupriavidus necator JMP134 is a Gram-negative beta-proteobacterium able to grow on a variety of aromatic and chloroaromatic compounds as its sole carbon and energy source. METHODOLOGY/PRINCIPAL FINDINGS: Its genome consists of four replicons (two chromosomes and two plasmids) containing a total of 6631 protein coding genes. Comparative analysis identified 1910 core genes common to the four genomes compared (C. necator JMP134, C. necator H16, C. metallidurans CH34, R. solanacearum GMI1000). Although secondary chromosomes found in the Cupriavidus, Ralstonia, and Burkholderia lineages are all derived from plasmids, analyses of the plasmid partition proteins located on those chromosomes indicate that different plasmids gave rise to the secondary chromosomes in each lineage. The C. necator JMP134 genome contains 300 genes putatively involved in the catabolism of aromatic compounds and encodes most of the central ring-cleavage pathways. This strain also shows additional metabolic capabilities towards alicyclic compounds and the potential for catabolism of almost all proteinogenic amino acids. This remarkable catabolic potential seems to be sustained by a high degree of genetic redundancy, most probably enabling this catabolically versatile bacterium with different levels of metabolic responses and alternative regulation necessary to cope with a challenging environment. From the comparison of Cupriavidus genomes, it is possible to state that a broad metabolic capability is a general trait for the Cupriavidus genus; however, certain specialization towards a nutritional niche (xenobiotics degradation, chemolithoautotrophy or symbiotic nitrogen fixation) seems to be shaped mostly by the acquisition of "specialized" plasmids. CONCLUSIONS/SIGNIFICANCE: The availability of the complete genome sequence for C. necator JMP134 provides the groundwork for further elucidation of the mechanisms and regulation of chloroaromatic compound biodegradation.

  13. Complete genomes reveal signatures of demographic and genetic declines in the woolly mammoth

    Science.gov (United States)

    Palkopoulou, Eleftheria; Mallick, Swapan; Skoglund, Pontus; Enk, Jacob; Rohland, Nadin; Li, Heng; Omrak, Ayça; Vartanyan, Sergey; Poinar, Hendrik; Götherström, Anders; Reich, David; Dalén, Love

    2015-01-01

    Summary The processes leading up to species extinctions are typically characterized by prolonged declines in population size and geographic distribution, followed by a phase in which populations are very small and may be subject to intrinsic threats, including loss of genetic diversity and inbreeding [1]. However, whether such genetic factors have had an impact on species prior to their extinction is unclear [2, 3]; examining this would require a detailed reconstruction of a species’ demographic history as well as changes in genome-wide diversity leading up to its extinction. Here, we present high-quality complete genome sequences from two woolly mammoths (Mammuthus primigenius). The first mammoth was sequenced at 17.1-fold coverage, and dates to ~4,300 years before present, constituting one of the last surviving individuals on Wrangel Island. The second mammoth, sequenced at 11.2-fold coverage, was obtained from a ~44,800 year old specimen from the Late Pleistocene population in northeastern Siberia. The demographic trajectories inferred from the two genomes are qualitatively similar and reveal a population bottleneck during the Middle or Early Pleistocene, and a more recent severe decline in the ancestors of the Wrangel mammoth at the end of the last glaciation. A comparison of the two genomes shows that the Wrangel mammoth has a 20% reduction in heterozygosity as well as a 28-fold increase in the fraction of the genome that is comprised of runs of homozygosity. We conclude that the population on Wrangel Island, which was the last surviving woolly mammoth population, was subject to reduced genetic diversity shortly before it became extinct. PMID:25913407

  14. Systematic determination of the mosaic structure of bacterial genomes: species backbone versus strain-specific loops

    Directory of Open Access Journals (Sweden)

    Gendrault-Jacquemard A

    2005-07-01

    Background: Public databases now contain a multitude of complete bacterial genomes, including several genomes of the same species. The available data offer new opportunities to address questions about bacterial genome evolution, a task that requires reliable fine comparison data of closely related genomes. Recent analyses have shown, using pairwise whole genome alignments, that it is possible to segment bacterial genomes into a common conserved backbone and strain-specific sequences called loops. Results: Here, we generalize this approach and propose a strategy that allows systematic and non-biased genome segmentation based on multiple genome alignments. Segmentation analyses, as applied to 13 different bacterial species, confirmed the feasibility of our approach to discern the 'mosaic' organization of bacterial genomes. Segmentation results are available through a Web interface permitting functional analysis, extraction and visualization of the backbone/loops structure of documented genomes. To illustrate the potential of this approach, we performed a precise analysis of the mosaic organization of three E. coli strains and functional characterization of the loops. Conclusion: The segmentation results including the backbone/loops structure of 13 bacterial species genomes are new and available for use by the scientific community at the URL: http://genome.jouy.inra.fr/mosaic.

  15. The complete genome sequence of Trueperella pyogenes UFV1 reveals a processing system involved in the quorum-sensing signal response

    DEFF Research Database (Denmark)

    Duarte, Vinicius da Silva; Treu, Laura; Campanaro, Stefano

    2017-01-01

    We present here the complete genome sequence of Trueperella pyogenes UFV1. The 2.3-Mbp genome contains an extremely interesting AI-2 transporter and processing system related to the quorum-sensing signal response. This specific feature is described in this species for the first time and might be responsible for a new pathogenic behavior.

  16. Complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera, and comparative analyses with other grass genomes

    Science.gov (United States)

    Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu

    2009-01-01

    Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593

  17. Analysis of complete logical structures in system reliability assessment

    International Nuclear Information System (INIS)

    Amendola, A.; Clarotti, C.A.; Contini, S.; Spizzichino, F.

    1980-01-01

    The application field of the fault-tree techniques has been explored in order to assess whether the AND-OR structures covered all possible actual binary systems. This resulted in the identification of various situations requiring the complete AND-OR-NOT structures for their analysis. We do not use the term non-coherent for such cases, since the monotonicity or not of a structure function is not a characteristic of a system, but of the particular top event being examined. The report presents different examples of complete fault-trees, which can be examined according to different degrees of approximation. In fact, the exact analysis for the determination of the smallest irredundant bases is very time consuming and actually necessary only in some particular cases (multi-state systems, incidental situations). Therefore, together with the exact procedure, the report shows two different methods of logical analysis that permit the reduction of complete fault-trees to AND-OR structures. Moreover, it discusses the problems concerning the evaluation of the probability distribution of the time to first top event occurrence, once the hypothesis of structure function monotonicity is removed
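
    The point that monotonicity belongs to the top event's structure function, not to the system itself, can be made concrete with a brute-force check. The following Python sketch is only an illustration: the two toy top events and the helper `is_monotone` are hypothetical and are not taken from the report. Each component state is 1 when the basic event has occurred and 0 otherwise.

```python
from itertools import product


def is_monotone(phi, n_components: int) -> bool:
    """Return True if switching any basic event from 0 to 1 never lowers phi.

    Exhaustive over all 2**n state vectors, so only suitable for small trees.
    """
    for state in product((0, 1), repeat=n_components):
        for i in range(n_components):
            if state[i] == 0:
                raised = state[:i] + (1,) + state[i + 1:]
                if phi(raised) < phi(state):
                    return False
    return True


def and_or_top(state) -> int:
    """Hypothetical AND-OR top event: (a AND b) OR c."""
    a, b, c = state
    return int((a and b) or c)


def and_or_not_top(state) -> int:
    """Hypothetical AND-OR-NOT top event: a AND (NOT b)."""
    a, b = state
    return int(a and not b)


and_or_is_monotone = is_monotone(and_or_top, 3)
and_or_not_is_monotone = is_monotone(and_or_not_top, 2)
```

    In the second toy tree, the occurrence of event b cancels the top event, which is exactly the kind of behavior a pure AND-OR structure cannot express.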

  18. Statistical properties of thermodynamically predicted RNA secondary structures in viral genomes

    Science.gov (United States)

    Spanò, M.; Lillo, F.; Miccichè, S.; Mantegna, R. N.

    2008-10-01

    By performing a comprehensive study on 1832 segments of 1212 complete genomes of viruses, we show that in viral genomes the hairpin structures of thermodynamically predicted RNA secondary structures are more abundant than expected under a simple random null hypothesis. The detected hairpin structures of RNA secondary structures are present both in coding and in noncoding regions for the four groups of viruses categorized as dsDNA, dsRNA, ssDNA and ssRNA. For all groups, hairpin structures of RNA secondary structures are detected more frequently than expected for a random null hypothesis in noncoding rather than in coding regions. However, potential RNA secondary structures are also present in coding regions of dsDNA group. In fact, we detect evolutionary conserved RNA secondary structures in conserved coding and noncoding regions of a large set of complete genomes of dsDNA herpesviruses.

  19. Isolation, identification, and complete genome sequence of a bovine adenovirus type 3 from cattle in China

    Directory of Open Access Journals (Sweden)

    Zhu Yuan-Mao

    2011-12-01

    Background: Bovine adenovirus type 3 (BAV-3) belongs to the Mastadenovirus genus of the family Adenoviridae and is involved in respiratory and enteric infections of calves. The isolation of BAV-3 has not been reported prior to this study in China. In 2009, there were many cases in cattle showing similar clinical signs to BAV-3 infection and a virus strain, showing cytopathic effect in Madin-Darby bovine kidney cells, was isolated from a bovine nasal swab collected from feedlot cattle in Heilongjiang Province, China. The isolate was confirmed as a bovine adenovirus type 3 by PCR and immunofluorescence assay, and named HLJ0955. So far only the complete genome sequence of the prototype BAV-3 WBR-1 strain has been reported. In order to further characterize the Chinese isolate HLJ0955, the complete genome sequence of HLJ0955 was determined. Results: The genome of the Chinese isolate HLJ0955 is 34,132 nucleotides in length with a G+C content of 53.6%. The coding sequences for gene regions of the HLJ0955 isolate were similar to the prototype BAV-3 WBR-1 strain, with 80.0-98.6% nucleotide and 87.5-98.8% amino acid identities. The genome of the HLJ0955 strain contains 16 regions and four deletions in inverted terminal repeats, E1B region and E4 region, respectively. The complete genome and DNA binding protein gene based phylogenetic analysis with other adenoviruses were performed and the results showed that the HLJ0955 isolate belonged to BAV-3 and clustered within the Mastadenovirus genus of the family Adenoviridae. Conclusions: This is the first study to report the isolation and molecular characterization of BAV-3 from cattle in China. The phylogenetic analysis performed in this study supported the use of the DNA binding protein gene of adenovirus as an appropriate subgenomic target for the classification of different genera of the family Adenoviridae on a molecular basis. 
Meanwhile, a large-scale pathogen and serological epidemiological

  20. The complete mitochondrial genome of the pink stem borer, Sesamia inferens, in comparison with four other Noctuid moths.

    Science.gov (United States)

    Chai, Huan-Na; Du, Yu-Zhou

    2012-01-01

    The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif "ATAGA" followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite "(AT)(7)", without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae.

  1. The complete mitochondrial genome of Somanniathelphusa boyangensis and phylogenetic analysis of Genus Somanniathelphusa (Crustacea: Decapoda: Parathelphusidae).

    Directory of Open Access Journals (Sweden)

    Xin-Nan Jia

    In this study, the authors obtained the mitochondrial genome of Somanniathelphusa boyangensis for the first time. The results showed that the mitochondrial genome is 17,032 bp in length, includes 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and 1 putative control region, and has the A+T bias characteristic of metazoan mitochondrial genomes. All tRNA genes display the typical clover-leaf secondary structure except tRNASer(AGN), which has lost the dihydrouridine arm. The GenBank database contains the mitochondrial genomes of representatives of approximately 22 families of Brachyura, comprising 56 species, including 4 species of freshwater crab. The authors established the phylogenetic relationships using the maximum likelihood and Bayesian inference methods. The phylogenetic relationship indicated that the molecular taxonomy of S. boyangensis is consistent with current morphological classification, and Parathelphusidae and Potamidae are derived within the freshwater clade or as part of it. In addition, the authors used the COX1 sequence of Somanniathelphusa in GenBank and the COX1 sequence of S. boyangensis to estimate the divergence time of this genus. The result showed that the divergence time of Somanniathelphusa qiongshanensis is consistent with the separation of Hainan Island from mainland China in the Beibu Gulf, and the divergence time for Somanniathelphusa taiwanensis and Somanniathelphusa amoyensis is consistent with the separation of Taiwan Province from Mainland China at Fujian Province. These data indicate that geologic events influenced speciation of the genus Somanniathelphusa.

  2. The nearly complete mitochondrial genome of a stonefly species, Styloperla sp. (Plecoptera: Styloperlidae).

    Science.gov (United States)

    Chen, Zhi-Teng; Wu, Hai-Yan; Du, Yu-Zhou

    2016-07-01

    We report the nearly complete mitochondrial genome of a stonefly species, Styloperla sp. (Plecoptera: Styloperlidae), which is a circular molecule of 15,416 bp in length and consists of 13 protein-coding genes, 2 ribosomal RNAs, 20 transfer RNAs and a partial control region (645 bp). Using the 13 protein-coding genes of 8 stoneflies and 3 other related species, we constructed a phylogenetic tree to verify the accuracy of the new determined mitogenome sequences. Our results provide basic data for further study of phylogeny in Plecoptera.

  3. Complete mitochondrial genome of the pacific seahorse Hippocampus ingens Girard, 1858 (Gasterosteiformes: Syngnathidae).

    Science.gov (United States)

    Zhang, Huixian; Zhang, Yanhong; Lin, Qiang

    2015-01-01

    The complete mitochondrial genome sequence of the pacific seahorse Hippocampus ingens was determined using long polymerase chain reactions. The total length of H. ingens mitogenome is 16,526 bp and consists of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a control region. The gene order and composition of H. ingens were similar to those of most other vertebrates. The overall base composition of H. ingens is 32.6% A, 29.3% T, 23.5% G and 14.6% C, with a slight A+T rich feature (61.9%).
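
    The A+T figure quoted above is simply the sum of the A and T percentages. A minimal Python sketch of that arithmetic follows; the helper names `base_composition` and `at_content` and the toy sequence are assumptions for illustration, and only the four reported percentages come from the abstract.

```python
from collections import Counter


def base_composition(sequence: str) -> dict:
    """Percentage of A, C, G and T in a sequence, ignoring other symbols."""
    counts = Counter(base for base in sequence.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


def at_content(composition: dict) -> float:
    """A+T percentage from a base-composition mapping."""
    return composition["A"] + composition["T"]


# Percentages reported for the H. ingens mitogenome in the abstract above.
reported = {"A": 32.6, "T": 29.3, "G": 23.5, "C": 14.6}
reported_at = round(at_content(reported), 1)

# Toy sequence (hypothetical) to show the same calculation from raw bases.
toy = base_composition("AATTGC")
toy_at = at_content(toy)
```

    A quick sanity check on any reported composition is that the four percentages sum to 100 within rounding.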

  4. Complete mitochondrial genome sequence of the longsnout seahorse Hippocampus reidi (Ginsburg, 1933; Gasterosteiformes: Syngnathidae).

    Science.gov (United States)

    Wang, Xin; Zhang, Yanhong; Zhang, Huixian; Meng, Tan; Lin, Qiang

    2016-01-01

    The complete mitochondrial genome sequence of the longsnout seahorse Hippocampus reidi was first determined in this article. The total length of the H. reidi mitogenome is 16,529 bp and consists of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and 1 control region. The gene order and composition of H. reidi were similar to those of most other vertebrates. The overall base composition of H. reidi is 32.47% A, 29.41% T, 14.75% G and 23.37% C, with a slight A + T rich feature (61.88%).

  5. Complete mitochondrial genome sequence of the common bean anthracnose pathogen Colletotrichum lindemuthianum.

    Science.gov (United States)

    Gutiérrez, Pablo; Alzate, Juan; Yepes, Mauricio Salazar; Marín, Mauricio

    2016-01-01

    Colletotrichum lindemuthianum is the causal agent of anthracnose in common bean (Phaseolus vulgaris), one of the most limiting factors for this crop in South and Central America. In this work, the mitochondrial sequence of a Colombian isolate of C. lindemuthianum obtained from a common bean plant (var. Cargamanto) with anthracnose symptoms is presented. The mtDNA codes for 13 proteins of the respiratory chain, 1 ribosomal protein, 2 homing endonucleases, 2 ribosomal RNAs and 28 tRNAs. This is the first report of a complete mtDNA genome sequence from C. lindemuthianum.

  6. The First Complete Mitochondrial Genome Sequences for Stomatopod Crustaceans: Implications for Phylogeny

    Energy Technology Data Exchange (ETDEWEB)

    Swinstrom, Kirsten; Caldwell, Roy; Fourcade, H. Matthew; Boore, Jeffrey L.

    2005-09-07

    We report the first complete mitochondrial genome sequences of stomatopods and compare their features to each other and to those of other crustaceans. Phylogenetic analyses of the concatenated mitochondrial protein-coding sequences were used to explore relationships within the Stomatopoda, within the malacostracan crustaceans, and among crustaceans and insects. Although these analyses support the monophyly of both Malacostraca and, within it, Stomatopoda, it also confirms the view of a paraphyletic Crustacea, with Malacostraca being more closely related to insects than to the branchiopod crustaceans.

  7. The complete mitochondrial genome of Octopus conispadiceus (Sasaki, 1917) (Cephalopoda: Octopodidae).

    Science.gov (United States)

    Ma, Yuanyuan; Zheng, Xiaodong; Cheng, Rubin; Li, Qi

    2016-01-01

    In this paper, we determined the complete mitochondrial genome of Octopus conispadiceus (Cephalopoda: Octopodidae). The whole mitogenome of O. conispadiceus is 16,027 basepairs (bp) in length with a base composition of 41.4% A, 34.8% T, 16.1% C, 7.7% G and contains 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a major non-coding region (MNR). The gene arrangements of O. conispadiceus showed remarkable similarity to that of O. vulgaris, Amphioctopus fangsiao, Cistopus chinensis and C. taiwanicus.

  8. Complete Genome Sequence of a Double-Stranded RNA Virus from Avocado

    OpenAIRE

    Villanueva, Francisco; Sabanadzovic, Sead; Valverde, Rodrigo A.; Navas-Castillo, Jesús

    2012-01-01

    A number of avocado (Persea americana) cultivars are known to contain high-molecular-weight double-stranded RNA (dsRNA) molecules for which a viral nature has been suggested, although sequence data are not available. Here we report the cloning and complete sequencing of a 13.5-kbp dsRNA virus isolated from avocado and show that it corresponds to the genome of a new species of the genus Endornavirus (family Endornaviridae), tentatively named Persea americana endornavirus (PaEV).

  9. Complete Genome Sequence of a Double-Stranded RNA Virus from Avocado

    Science.gov (United States)

    Villanueva, Francisco; Sabanadzovic, Sead; Valverde, Rodrigo A.

    2012-01-01

    A number of avocado (Persea americana) cultivars are known to contain high-molecular-weight double-stranded RNA (dsRNA) molecules for which a viral nature has been suggested, although sequence data are not available. Here we report the cloning and complete sequencing of a 13.5-kbp dsRNA virus isolated from avocado and show that it corresponds to the genome of a new species of the genus Endornavirus (family Endornaviridae), tentatively named Persea americana endornavirus (PaEV). PMID:22205720

  10. Sequencing and characterization of the complete mitochondrial genome of Japanese Swellshark (Cephalloscyllium umbratile)

    OpenAIRE

    Zhu, Ke-Cheng; Liang, Yin-Yin; Wu, Na; Guo, Hua-Yang; Zhang, Nan; Jiang, Shi-Gui; Zhang, Dian-Chang

    2017-01-01

    To further comprehend the genome features of Cephalloscyllium umbratile (Carcharhiniformes), an endangered species, the complete mitochondrial DNA (mtDNA) was sequenced and annotated for the first time. The full-length mtDNA of C. umbratile was 16,697 bp and contained ribosomal RNA (rRNA) genes, 13 protein-coding genes (PCGs), 23 transfer RNA (tRNA) genes, and a major non-coding control region. Each PCG was initiated by an authoritative ATN codon, except for COX1 initiated by a GTG codon. Seven of 13 PC...

  11. The complete genomic sequence of a tentative new polerovirus identified in barley in South Korea.

    Science.gov (United States)

    Zhao, Fumei; Lim, Seungmo; Yoo, Ran Hee; Igori, Davaajargal; Kim, Sang-Min; Kwak, Do Yeon; Kim, Sun Lim; Lee, Bong Choon; Moon, Jae Sun

    2016-07-01

    The complete nucleotide sequence of a new barley polerovirus, tentatively named barley virus G (BVG), which was isolated in Gimje, South Korea, has been determined using an RNA sequencing technique combined with polymerase chain reaction methods. The viral genomic RNA of BVG is 5,620 nucleotides long and contains six typical open reading frames commonly observed in other poleroviruses. Sequence comparisons revealed that BVG is most closely related to maize yellow dwarf virus-RMV, with the highest amino acid identities being less than 90 % for all of the corresponding proteins. These results suggested that BVG is a member of a new species in the genus Polerovirus.

  12. A complete mitochondrial genome sequence of the wild two-humped camel (Camelus bactrianus ferus): an evolutionary history of Camelidae

    Directory of Open Access Journals (Sweden)

    Meng He

    2007-07-01

    Background: The family Camelidae that evolved in North America during the Eocene survived with two distinct tribes, Camelini and Lamini. To investigate the evolutionary relationship between them and to further understand the evolutionary history of this family, we determined the complete mitochondrial genome sequence of the wild two-humped camel (Camelus bactrianus ferus), the only wild survivor of the Old World camel. Results: The mitochondrial genome sequence (16,680 bp) from C. bactrianus ferus contains 13 protein-coding, two rRNA, and 22 tRNA genes as well as a typical control region; this basic structure is shared by all metazoan mitochondrial genomes. Its protein-coding region exhibits codon usage common to all mammals and possesses the three cryptic stop codons shared by all vertebrates. C. bactrianus ferus together with the rest of mammalian species do not share a triplet nucleotide insertion (GCC) that encodes a proline residue found only in the nd1 gene of the New World camelid Lama pacos. This lineage-specific insertion in the L. pacos mtDNA, which occurred after the split between the Old and New World camelids, may have functional implications, since a proline insertion in a protein backbone usually alters protein conformation significantly, and the nd1 gene has not been seen to be as polymorphic as the rest of the ND family genes among camelids. Our phylogenetic study based on complete mitochondrial genomes excluding the control region suggested that the divergence of the two tribes may have occurred in the early Miocene; this is much earlier than what was deduced from the fossil record (11 million years). An evolutionary history reconstructed for the family Camelidae based on cytb sequences suggested that the split of bactrian camel and dromedary may have occurred in North America before the tribe Camelini migrated from North America to Asia. Conclusion: Molecular clock analysis of complete mitochondrial genomes from C. bactrianus ferus and L

  13. Complete genomic characterisation of two novel poxviruses (WKPV and EKPV) from western and eastern grey kangaroos.

    Science.gov (United States)

    Bennett, Mark; Tu, Shin-Lin; Upton, Chris; McArtor, Cassie; Gillett, Amber; Laird, Tanya; O'Dea, Mark

    2017-10-15

    Poxviruses have previously been detected in macropods with cutaneous papillomatous lesions; however, to date no comprehensive analysis of a poxvirus from kangaroos has been performed. Here we report the genome sequences of a western grey kangaroo poxvirus (WKPV) and an eastern grey kangaroo poxvirus (EKPV), named for the host species from which they were isolated, western grey (Macropus fuliginosus) and eastern grey (Macropus giganteus) kangaroos. Poxvirus DNA from WKPV and EKPV was isolated and the entire coding genome regions were determined through Roche GS Junior and Illumina MiSeq sequencing, respectively. Viral genomes were assembled using MIRA and SPAdes, and annotations were performed using tools available from the Viral Bioinformatics Resource Centre. Histopathology and transmission electron microscopy analyses were also performed on WKPV and its associated lesions. The WKPV and EKPV genomes show 96% nucleotide identity to each other, and phylogenetic analysis places them on a distinct branch between the established Molluscipoxvirus and Avipoxvirus genera. WKPV and EKPV are 170 kbp and 167 kbp long, containing 165 and 162 putative genes, respectively. Together, their genomes encode up to 47 novel unique hypothetical proteins, and possess virulence proteins including a major histocompatibility complex class II inhibitor, a semaphorin-like protein, a serpin, a 3-β-hydroxysteroid dehydrogenase/δ 5→4 isomerase, and a CD200-like protein. These viruses also encode a large putative protein (WKPV-WA-039 and EKPV-SC-038) with a C-terminal domain that is structurally similar to the C-terminal domain of a cullin, suggestive of a role in the control of host ubiquitination. The relationship of these viruses to members of the Molluscipoxvirus and Avipoxvirus genera is discussed in terms of sequence similarity, gene content and nucleotide composition. A novel genus within subfamily Chordopoxvirinae is proposed to accommodate these two poxvirus species from kangaroos; we suggest

  14. Complete mitochondrial genome of the larch hawk moth, Sphinx morio (Lepidoptera: Sphingidae).

    Science.gov (United States)

    Kim, Min Jee; Choi, Sei-Woong; Kim, Iksoo

    2013-12-01

    The larch hawk moth, Sphinx morio, belongs to the lepidopteran family Sphingidae, which has long been studied as a family of model insects in diverse fields. In this study, we describe the complete mitochondrial genome (mitogenome) sequence of the species in terms of general genomic features and characteristic short repetitive sequences found in the A + T-rich region. The 15,299-bp-long genome consisted of a typical set of genes (13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes) and one major non-coding A + T-rich region, with the typical arrangement found in Lepidoptera. The 316-bp-long A + T-rich region located between srRNA and tRNA(Met) harbored the conserved sequence blocks that are typically found in lepidopteran insects. Additionally, the A + T-rich region of S. morio contained three characteristic repeat sequences that are rarely found in Lepidoptera: two identical 12-bp repeats, three identical 5-bp-long tandem repeats, and six nearly identical 5-6-bp-long repeat sequences.

  15. A complete mitochondrial genome sequence of Asian black bear Sichuan subspecies (Ursus thibetanus mupinensis)

    Science.gov (United States)

    Hou, Wan-ru; Chen, Yu; Wu, Xia; Hu, Jin-chu; Peng, Zheng-song; Yang, Jung; Tang, Zong-xiang; Zhou, Cai-Quan; Li, Yu-ming; Yang, Shi-kui; Du, Yu-jie; Kong, Ling-lu; Ren, Zheng-long; Zhang, Huai-yu; Shuai, Su-rong

    2007-01-01

    We obtained the complete mitochondrial genome of U. thibetanus mupinensis by DNA sequencing based on the PCR fragments of 18 primers we designed. The results indicate that the mtDNA is 16 868 bp in size and encodes 13 protein genes, 22 tRNA genes, and 2 rRNA genes, with an overall H-strand base composition of 31.2% A, 25.4% C, 15.5% G and 27.9% T. The control region (CR), located between tRNA-Pro and tRNA-Phe, is 1422 bp in size, makes up 8.43% of the whole genome, has a GC content of 51.9%, and contains one 6-bp tandem repeat and two 10-bp tandem repeats identified using the Tandem Repeats Finder. The U. thibetanus mupinensis mitochondrial genome shares high similarity with those of three other Ursidae: U. americanus (91.46%), U. arctos (89.25%) and U. maritimus (87.66%). PMID:17205108
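
    The percentages in this abstract can be cross-checked with simple arithmetic. The following is a minimal sketch, not the authors' method: the constant and function names are illustrative, and the whole-genome G + C figure it derives is computed here from the quoted H-strand composition rather than stated in the abstract.

```python
# Figures quoted in the abstract; all names below are illustrative assumptions.
GENOME_LENGTH_BP = 16_868
CONTROL_REGION_BP = 1_422
H_STRAND_COMPOSITION = {"A": 31.2, "C": 25.4, "G": 15.5, "T": 27.9}  # percent


def percent_of_genome(region_bp: int, genome_bp: int) -> float:
    """Return the share of the genome occupied by a region, in percent."""
    return 100.0 * region_bp / genome_bp


# Share of the genome taken up by the control region.
cr_share = percent_of_genome(CONTROL_REGION_BP, GENOME_LENGTH_BP)

# The four base percentages should add up to about 100.
composition_total = sum(H_STRAND_COMPOSITION.values())

# Derived value (not stated in the abstract): overall G + C percentage.
overall_gc = H_STRAND_COMPOSITION["G"] + H_STRAND_COMPOSITION["C"]

print(f"Control region share: {cr_share:.2f}%")
print(f"Base composition total: {composition_total:.1f}%")
print(f"Overall G + C (derived): {overall_gc:.1f}%")
```

    Under these figures the control region share agrees with the reported 8.43%, and the derived overall G + C value is lower than the 51.9% reported for the control region alone.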

  16. Complete plastid genome sequence of goosegrass (Eleusine indica) and comparison with other Poaceae.

    Science.gov (United States)

    Zhang, Hui; Hall, Nathan; McElroy, J Scott; Lowe, Elijah K; Goertzen, Leslie R

    2017-02-05

    Eleusine indica, also known as goosegrass, is a serious weed in at least 42 countries. In this paper we report the complete plastid genome sequence of goosegrass obtained by de novo assembly of paired-end and mate-paired reads generated by Illumina sequencing of total genomic DNA. The goosegrass plastome is a circular molecule of 135,151 bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 20,919 bases. The large (LSC) and the small (SSC) single-copy regions span 80,667 bases and 12,646 bases, respectively. The plastome of goosegrass has 38.19% GC content and includes 108 unique genes, of which 76 are protein-coding, 28 are transfer RNA, and 4 are ribosomal RNA. The goosegrass plastome sequence was compared to those of eight other species of Poaceae. Although the plastome is generally conserved with respect to other Poaceae, this genomic resource will be useful for evolutionary studies within this weed species and the genus Eleusine. Copyright © 2016. Published by Elsevier B.V.
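
    The quadripartite structure described here implies that the total length equals LSC + SSC + 2 × IR. The sketch below checks that relation and the gene tally against the quoted figures. The `Plastome` class and its field names are hypothetical, and the code assumes that the 20,919-base figure refers to a single IR copy.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plastome:
    """Region lengths of a quadripartite plastome (hypothetical helper)."""

    lsc_bp: int  # large single-copy region
    ssc_bp: int  # small single-copy region
    ir_bp: int   # ONE inverted-repeat copy (assumption about the quoted figure)

    @property
    def total_bp(self) -> int:
        # Two IR copies separate the two single-copy regions.
        return self.lsc_bp + self.ssc_bp + 2 * self.ir_bp


# Figures quoted in the goosegrass abstract.
goosegrass = Plastome(lsc_bp=80_667, ssc_bp=12_646, ir_bp=20_919)
gene_counts = {"protein_coding": 76, "tRNA": 28, "rRNA": 4}

print("Total length (bp):", goosegrass.total_bp)
print("Unique genes:", sum(gene_counts.values()))
```

    With the quoted region lengths the sum reproduces the reported 135,151 bp, which supports reading the IR length as per copy, and the three gene classes add up to the reported 108 unique genes.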

  17. The complete mitochondrial genome of the pirarucu (Arapaima gigas, Arapaimidae, Osteoglossiformes)

    Directory of Open Access Journals (Sweden)

    Tomas Hrbek

    2008-01-01

    We sequenced the complete mitochondrial genome of the pirarucu, Arapaima gigas, the largest fish of the Amazon basin and economically one of the most important species of the region. The total length of the Arapaima gigas mitochondrial genome is 16,433 bp. The mitochondrial genome contains 13 protein-coding genes, two rRNA genes and 22 tRNA genes. Twelve of the thirteen protein-coding genes are coded on the heavy strand, while nad6 is coded on the light strand. The Arapaima gene order and content are identical to the common vertebrate form, as are codon usage and base composition. Its control region is atypical in being short at 767 bp. The control region also contains a conserved ATGTA motif recently identified in the Asian arowana and three conserved sequence blocks (CSB-1, CSB-2 and CSB-3), and its 3' end contains a long series of di- and mono-nucleotide microsatellite repeats. Other osteoglossiform species for which control region sequences have been published show similar control region characteristics.

  18. Gene characteristics of the complete mitochondrial genomes of Paratoxodera polyacantha and Toxodera hauseri (Mantodea: Toxoderidae).

    Science.gov (United States)

    Zhang, Le-Ping; Cai, Yin-Yin; Yu, Dan-Na; Storey, Kenneth B; Zhang, Jia-Yong

    2018-01-01

    The family Toxoderidae (Mantodea) contains an ecologically diverse group of praying mantis species that have in common greatly elongated bodies. In this study, we sequenced the complete mitochondrial genomes of two Toxoderidae species, Paratoxodera polyacantha and Toxodera hauseri, and compared their mitochondrial genome characteristics with those of another member of the Toxoderidae, Stenotoxodera porioni (KY689118). The lengths of the mitogenomes of T. hauseri and P. polyacantha were 15,616 bp and 15,999 bp, respectively, which is similar to that of S. porioni (15,846 bp). The size of each gene as well as the A+T-rich region and the A+T content of the whole genome were also very similar among the three species, as were the protein-coding genes, the A+T content and the codon usages. The mitogenome of T. hauseri had the typical 22 tRNAs, whereas that of P. polyacantha had 26 tRNAs including an extra two copies of trnA-trnR. Intergenic regions of 67 bp and 76 bp were found in T. hauseri and P. polyacantha, respectively, between COX2 and trnK; these can be explained as residues of a tandem duplication/random loss of trnK and trnD. This non-coding region may be synapomorphic for Toxoderidae. In BI and ML analyses, the monophyly of Toxoderidae was supported and P. polyacantha was the sister clade to T. hauseri and S. porioni.

  19. The complete chloroplast genome of traditional Chinese medical plants Paris polyphylla var. yunnanensis.

    Science.gov (United States)

    Song, Yun; Xu, Jin; Chen, NaiZhong; Li, MingFu

    2017-03-01

    Paris polyphylla var. yunnanensis is a perennial medicinal plant widely used in traditional Chinese medicine. Here, we report the complete chloroplast genome of P. polyphylla var. yunnanensis. The genome is 157 675 bp in length, including a small single-copy region (SSC, 18 319 bp) and a large single-copy region (LSC, 84 108 bp) separated by a pair of inverted repeats (IRs, 27 624 bp). The genome contained 115 genes, including 81 protein-coding genes, 4 ribosomal RNA genes, and 30 tRNA genes. Among these genes, 13 harbored a single intron and 2 contained two introns. The overall G + C content of the cpDNA is 37.4%, while the corresponding values of the LSC, SSC, and IR regions are 35.71%, 31.43%, and 41.87%, respectively. A maximum-likelihood phylogenetic analysis suggested that the genera Trillium, Paris, Fritillaria, and Lilium were strongly supported as monophyletic and that P. polyphylla var. yunnanensis is closely related to Trillium.
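
    The overall G + C content of a genome is the length-weighted mean of the G + C content of its regions, so the regional values in this abstract can be checked against the overall figure. The sketch below does this under two assumptions: the 27 624 bp figure is the length of one IR copy, and both copies share the quoted 41.87% value. The function and dictionary names are illustrative.

```python
# (length in bp, G + C percent) per region, as quoted in the abstract.
# Treating the IR figure as per copy, listed twice, is an assumption.
REGIONS = {
    "LSC": (84_108, 35.71),
    "SSC": (18_319, 31.43),
    "IRa": (27_624, 41.87),
    "IRb": (27_624, 41.87),
}


def weighted_gc(regions: dict[str, tuple[int, float]]) -> float:
    """Length-weighted mean G + C percentage across regions."""
    total_bp = sum(length for length, _ in regions.values())
    weighted_sum = sum(length * gc for length, gc in regions.values())
    return weighted_sum / total_bp


total_length = sum(length for length, _ in REGIONS.values())
overall_gc = weighted_gc(REGIONS)

print("Total length (bp):", total_length)
print(f"Length-weighted G + C: {overall_gc:.2f}%")
```

    The region lengths add up to the reported 157 675 bp, and the weighted mean rounds to the reported 37.4%, so the regional and overall figures are mutually consistent under these assumptions.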

  20. Comparison of the complete mitochondrial genome of the stonefly Sweltsa longistyla (Plecoptera: Chloroperlidae) with mitogenomes of three other stoneflies.

    Science.gov (United States)

    Chen, Zhi-Teng; Du, Yu-Zhou

    2015-03-01

    The complete mitochondrial genome of the stonefly, Sweltsa longistyla Wu (Plecoptera: Chloroperlidae), was sequenced in this study. The mitogenome of S. longistyla is 16,151 bp and contains 37 genes including 13 protein-coding genes (PCGs), 22 tRNA genes, two rRNA genes, and a large non-coding region. S. longistyla, Pteronarcys princeps Banks, Kamimuria wangi Du and Cryptoperla stilifera Sivec belong to the Plecoptera, and the gene order and orientation of their mitogenomes were similar. The overall AT content for the four stoneflies was below 72%, and the AT content of tRNA genes was above 69%. The four genomes were compact and contained only 65-127 bp of non-coding intergenic DNA. Overlapping nucleotides existed in all four genomes and ranged from 24 bp (P. princeps) to 178 bp (K. wangi). There was a 7-bp motif (ATGATAA) of overlapping DNA and an 8-bp motif (AAGCCTTA) conserved in three stonefly species (P. princeps, K. wangi and C. stilifera). The control regions of the four stoneflies contained a stem-loop structure. Four conserved sequence blocks (CSBs) were present in the A+T-rich regions of all four stoneflies. Copyright © 2014 Elsevier B.V. All rights reserved.