WorldWideScience

Sample records for citrus genome sequence

  1. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

    OpenAIRE

    Wu, G. Albert; Prochnik, Simon; Jenkins, Jerry; Salse, Jerome; Hellsten, Uffe; Murat, Florent; Perrier, Xavier; Ruiz, Manuel; Scalabrin, Simone; Terol, Javier; Takita, Marco Aurelio; Labadie, Karine; Poulain, Julie; Couloux, Arnaud; Jabbari, Kamel

    2014-01-01

    Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes (a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-orange genomes) and show that cultivated types derive from two progenitor species. Although cultivated pummelos represent selections from one progenitor species, Citrus maxima, cultivated mandarins ...

  2. Characterizing the citrus variety Carrizo genome through 454 shotgun sequencing

    Science.gov (United States)

    Citrus production is of global importance both in economic impact and significance to nutrition. The number of natural citrus species appears extremely limited. The genome size is small (haploid approximately 367 Mb), arranged on 18 chromosomes. The citrus variety Carrizo, generated by a ‘Washingt...

  3. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

    Science.gov (United States)

    Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes—a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-o...

  4. Recovery and characterization of a Citrus clementina Hort. ex Tan. 'Clemenules' haploid plant selected to establish the reference whole Citrus genome sequence

    OpenAIRE

    Navarro Luis; Ollitrault Patrick; Pina José A; Hernández María; Juárez José; Aleza Pablo

    2009-01-01

    Background: In recent years, the development of structural genomics has generated a growing interest in obtaining haploid plants. The use of homozygous lines presents a significant advantage for the accomplishment of sequencing projects. Commercial citrus species are characterized by high heterozygosity, making it difficult to assemble large genome sequences. Thus, the International Citrus Genomic Consortium (ICGC) decided to establish a reference whole citrus genome sequence from a h...

  5. Citrus Genomics

    OpenAIRE

    Talon, Manuel; Gmitter, Fred G. Jr.

    2008-01-01

    Citrus is one of the most widespread fruit crops globally, with great economic and health value. It is among the most difficult plants to improve through traditional breeding approaches. Currently, there is risk of devastation by diseases threatening to limit production and future availability to the human population. As technologies rapidly advance in genomic science, they are quickly adapted to address the biological challenges of the citrus plant system and the world's industries. The hist...

  6. Draft Genome Sequence of "Candidatus Liberibacter asiaticus" from a Citrus Tree in San Gabriel, California.

    Science.gov (United States)

    Wu, F; Kumagai, L; Liang, G; Deng, X; Zheng, Z; Keremane, M; Chen, J

    2015-01-01

    The draft genome sequence of "Candidatus Liberibacter asiaticus" strain SGCA5 from an orange citrus tree in San Gabriel, California, is reported here. SGCA5 has a genome size of 1,201,445 bp, a G+C content of 36.4%, 1,152 predicted open reading frames (ORFs), and 42 RNA genes. PMID:26701083

  7. Complete Genome Sequence of a Putative Densovirus of the Asian Citrus Psyllid, Diaphorina citri.

    Science.gov (United States)

    Nigg, Jared C; Nouri, Shahideh; Falk, Bryce W

    2016-01-01

    Here, we report the complete genome sequence of a putative densovirus of the Asian citrus psyllid, Diaphorina citri. Diaphorina citri densovirus (DcDNV) was originally identified through metagenomics, and here, we obtained the complete nucleotide sequence using PCR-based approaches. Phylogenetic analysis places DcDNV between viruses of the Ambidensovirus and Iteradensovirus genera. PMID:27469948

  8. Genome Sequence of Streptomyces wadayamensis Strain A23, an Endophytic Actinobacterium from Citrus reticulata

    OpenAIRE

    de Oliveira, Luciana G; Tormet Gonzalez, Gabriela D.; Samborsky, Markyian; Marcon, Joelma; Araujo, Welington L.; de Azevedo, João Lucio

    2014-01-01

    The actinobacterium Streptomyces wadayamensis A23 is an endophyte of Citrus reticulata that produces the antimycin and mannopeptimycin antibiotics, among others. The strain has the capability to inhibit Xylella fastidiosa growth. The draft genome of S. wadayamensis A23 has ~7.0 Mb and 6,006 protein-coding sequences, with a 73.5% G+C content.

  9. Complete genome sequence analysis of two Citrus tatter leaf virus (CTLV) isolates from China

    Institute of Scientific and Technical Information of China (English)

    SONG Zhen; LI Zhong-an; LIU Ke-hong; ZHOU Chang-yong

    2015-01-01

    To understand the molecular characteristics of Citrus tatter leaf virus (CTLV) isolates from China, full-length cDNAs of CTLV-MTH and CTLV-XHC from Citrus reticulata and Citrus sinensis were cloned and sequenced based on whole-genome amplification by RT-PCR. The complete nucleotide sequences of CTLV-MTH and CTLV-XHC were determined to be 6,497 nucleotides in length and shared 79.9–91.0% and 78.8–98.0% nucleotide sequence identity, respectively, with other Apple stem grooving virus (ASGV) or CTLV strains available in GenBank. Unexpectedly, CTLV-MTH showed the highest nucleotide sequence identity (91%) with an apple isolate of ASGV, followed by 86.5% with ASGV-HH and 85.7% with ASGV-CHN. Furthermore, CTLV-MTH and three ASGV strains were grouped into a separate cluster in the phylogenetic tree, suggesting that it is more closely related to ASGV than to CTLV. Therefore, it can be roughly concluded that CTLV may not be a distinct strain of ASGV. We propose that Citrus tatter leaf virus be renamed Apple stem grooving virus.

  10. Genome Sequence of Streptomyces wadayamensis Strain A23, an Endophytic Actinobacterium from Citrus reticulata.

    Science.gov (United States)

    de Oliveira, Luciana G; Tormet Gonzalez, Gabriela D; Samborsky, Markyian; Marcon, Joelma; Araujo, Welington L; de Azevedo, João Lucio

    2014-01-01

    The actinobacterium Streptomyces wadayamensis A23 is an endophyte of Citrus reticulata that produces the antimycin and mannopeptimycin antibiotics, among others. The strain has the capability to inhibit Xylella fastidiosa growth. The draft genome of S. wadayamensis A23 has ~7.0 Mb and 6,006 protein-coding sequences, with a 73.5% G+C content. PMID:24994795

  11. Complete chloroplast genome sequence of Omani lime (Citrus aurantiifolia) and comparative analysis within the rosids.

    Directory of Open Access Journals (Sweden)

    Huei-Jiun Su

    The genus Citrus contains many economically important fruits that are grown worldwide for their high nutritional and medicinal value. Due to frequent hybridizations among species and cultivars, the exact number of natural species and the taxonomic relationships within this genus are unclear. To compare the differences between the Citrus chloroplast genomes and to develop useful genetic markers, we used a reference-assisted approach to assemble the complete chloroplast genome of Omani lime (C. aurantiifolia). The complete C. aurantiifolia chloroplast genome is 159,893 bp in length; the organization and gene content are similar to most of the rosids lineages characterized to date. Through comparison with the sweet orange (C. sinensis) chloroplast genome, we identified three intergenic regions and 94 simple sequence repeats (SSRs) that are potentially informative markers with resolution for interspecific relationships. These markers can be utilized to better understand the origin of cultivated Citrus. A comparison among 72 species belonging to 10 families of representative rosids lineages also provides new insights into their chloroplast genome evolution.

  12. Complete genome sequence of citrus huanglongbing bacterium, 'Candidatus Liberibacter asiaticus' obtained through metagenomics.

    Science.gov (United States)

    Duan, Yongping; Zhou, Lijuan; Hall, David G; Li, Wenbin; Doddapaneni, Harshavardhan; Lin, Hong; Liu, Li; Vahling, Cheryl M; Gabriel, Dean W; Williams, Kelly P; Dickerman, Allan; Sun, Yijun; Gottwald, Tim

    2009-08-01

    Citrus huanglongbing is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with a low-titer, phloem-limited infection by any of three uncultured species of alpha-Proteobacteria, 'Candidatus Liberibacter asiaticus', 'Ca. L. americanus', and 'Ca. L. africanus'. A complete circular 'Ca. L. asiaticus' genome has been obtained by metagenomics, using the DNA extracted from a single 'Ca. L. asiaticus'-infected psyllid. The 1.23-Mb genome has an average 36.5% GC content. Annotation revealed a high percentage of genes involved in both cell motility (4.5%) and active transport in general (8.0%), which may contribute to its virulence. 'Ca. L. asiaticus' appears to have a limited ability for aerobic respiration and is likely auxotrophic for at least five amino acids. Consistent with its intracellular nature, 'Ca. L. asiaticus' lacks type III and type IV secretion systems as well as typical free-living or plant-colonizing extracellular degradative enzymes. 'Ca. L. asiaticus' appears to have all type I secretion system genes needed for both multidrug efflux and toxin effector secretion. Multi-protein phylogenetic analysis confirmed 'Ca. L. asiaticus' as an early-branching and highly divergent member of the family Rhizobiaceae. This is the first genome sequence of an uncultured alpha-proteobacteria that is both an intracellular plant pathogen and insect symbiont. PMID:19589076

  13. Novel insights into the genomic basis of citrus canker based on the genome sequences of two strains of Xanthomonas fuscans subsp. aurantifolii

    OpenAIRE

    Nociti Letícia A; Rodrigues Neto Julio; Leite Rui P; Nishiyama Milton Y; Laia Marcelo L; Kitajima Elliot W.; Jones Jeffrey B; Gimenez Daniele F; Furlan Luiz R; Ferro Maria I; Ferraz André L; Facincani Agda P; de Souza Robson F; de Oliveira Julio C; de Moraes Fabrício E

    2010-01-01

    Background: Citrus canker is a disease that has severe economic impact on the citrus industry worldwide. There are three types of canker, called A, B, and C. The three types have different phenotypes and affect different citrus species. The causative agent for type A is Xanthomonas citri subsp. citri, whose genome sequence was made available in 2002. Xanthomonas fuscans subsp. aurantifolii strain B causes canker B and Xanthomonas fuscans subsp. aurantifolii strain C causes canker C. R...

  14. The Genome Sequence of an Isolate of Indian Citrus Ringspot Virus Infecting the Sweet Orange in India

    OpenAIRE

    K, Prabha; Baranwal, V.K.

    2012-01-01

    Whole-genome sequencing of an isolate of Mandarivirus infecting the sweet orange [Citrus sinensis (L) Blanco] in the western part of India (Pune) was done. The single-stranded positive-sense RNA genome of Indian citrus ringspot virus (ICRSV) Pune has 7,560 nucleotides (nt), excluding a poly(A) tail, comprised of 27.98% (2,115 nt) A, 32.12% (2,428 nt) C, 19.68% (1,488 nt) G, and 20.22% (1,529 nt) T residues. The genome, organized into six open reading frames (ORFs), shares 97.7% sequence ident...
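
The base composition quoted in this record can be checked with simple arithmetic. The following is a minimal sketch, not part of the original record; the function name `composition` is hypothetical, and the only inputs are the four base counts given in the abstract.

```python
# Base counts reported in the abstract for ICRSV Pune (poly(A) tail excluded).
counts = {"A": 2115, "C": 2428, "G": 1488, "T": 1529}


def composition(base_counts):
    """Return (genome length, percentage of each base rounded to 2 decimals)."""
    total = sum(base_counts.values())
    percent = {base: round(100 * n / total, 2) for base, n in base_counts.items()}
    return total, percent


total, percent = composition(counts)
# G+C content follows from the same counts; it is derived here, not quoted in the abstract.
gc_percent = round(100 * (counts["G"] + counts["C"]) / total, 2)

print(total)       # genome length in nucleotides
print(percent)     # per-base percentages
print(gc_percent)  # derived G+C percentage
```

The four counts sum to the stated 7,560 nt, and the rounded percentages agree with the figures in the abstract.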

  15. Whole-Genome Sequence of "Candidatus Liberibacter asiaticus" from a Huanglongbing-Affected Citrus Tree in Central Florida.

    Science.gov (United States)

    Zheng, Z; Sun, X; Deng, X; Chen, J

    2015-01-01

    Here, we report the draft genome sequence of "Candidatus Liberibacter asiaticus" strain FL17, isolated from a huanglongbing (HLB)-affected citrus tree in central Florida. The FL17 genome comprised 1,227,253 bp, with a G+C content of 36.5%, 1,175 predicted open reading frames, and 53 RNA genes. PMID:25792067

  16. Whole-Genome Sequence of “Candidatus Liberibacter asiaticus” from a Huanglongbing-Affected Citrus Tree in Central Florida

    OpenAIRE

    Z. Zheng; X. Sun; Deng, X.; J. Chen

    2015-01-01

    Here, we report the draft genome sequence of “Candidatus Liberibacter asiaticus” strain FL17, isolated from a huanglongbing (HLB)-affected citrus tree in central Florida. The FL17 genome comprised 1,227,253 bp, with a G+C content of 36.5%, 1,175 predicted open reading frames, and 53 RNA genes.

  17. Draft Genome Sequence of “Candidatus Liberibacter asiaticus” from a Citrus Tree in San Gabriel, California

    OpenAIRE

    Wu, F.; Kumagai, L.; Liang, G; Deng, X.; Z. Zheng; Keremane, M.; J. Chen

    2015-01-01

    The draft genome sequence of “Candidatus Liberibacter asiaticus” strain SGCA5 from an orange citrus tree in San Gabriel, California, is reported here. SGCA5 has a genome size of 1,201,445 bp, a G+C content of 36.4%, 1,152 predicted open reading frames (ORFs), and 42 RNA genes.

  18. Whole-Genome Sequence of “Candidatus Liberibacter asiaticus” from a Huanglongbing-Affected Citrus Tree in Central Florida

    Science.gov (United States)

    Zheng, Z.; Sun, X.

    2015-01-01

    Here, we report the draft genome sequence of “Candidatus Liberibacter asiaticus” strain FL17, isolated from a huanglongbing (HLB)-affected citrus tree in central Florida. The FL17 genome comprised 1,227,253 bp, with a G+C content of 36.5%, 1,175 predicted open reading frames, and 53 RNA genes. PMID:25792067

  19. Recovery and characterization of a Citrus clementina Hort. ex Tan. 'Clemenules' haploid plant selected to establish the reference whole Citrus genome sequence

    Directory of Open Access Journals (Sweden)

    Navarro Luis

    2009-08-01

    Background: In recent years, the development of structural genomics has generated a growing interest in obtaining haploid plants. The use of homozygous lines presents a significant advantage for the accomplishment of sequencing projects. Commercial citrus species are characterized by high heterozygosity, making it difficult to assemble large genome sequences. Thus, the International Citrus Genomic Consortium (ICGC) decided to establish a reference whole citrus genome sequence from a homozygous plant. Due to the existence of important molecular resources and previous success in obtaining haploid clementine plants, haploid clementine was selected as the target for the implementation of the reference whole genome citrus sequence. Results: To obtain haploid clementine lines we used the technique of in situ gynogenesis induced by irradiated pollen. Flow cytometry, chromosome counts and SSR (Simple Sequence Repeats) marker analysis facilitated the identification of six different haploid lines (2n = x = 9), one aneuploid line (2n = 2x+4 = 22) and one doubled haploid plant (2n = 2x = 18) of 'Clemenules' clementine. One of the haploids, obtained directly from an original haploid embryo, grew vigorously and produced flowers after four years. This is the first haploid plant of clementine that has bloomed and we have, for the first time, characterized the histology of haploid and diploid flowers of clementine. Additionally, a doubled haploid plant was obtained spontaneously from this haploid line. Conclusion: The first haploid plant of 'Clemenules' clementine produced directly by germination of a haploid embryo, which grew vigorously and produced flowers, has been obtained in this work. This haploid line has been selected and it is being used by the ICGC to establish the reference sequence of the nuclear genome of citrus.

  20. Genome sequence of the necrotrophic fungus Penicillium digitatum, the main postharvest pathogen of citrus

    OpenAIRE

    Marcet-Houben Marina; Ballester Ana-Rosa; de la Fuente Beatriz; Harries Eleonora; Marcos Jose F; González-Candelas Luis; Gabaldón Toni

    2012-01-01

    Background: Penicillium digitatum is a fungal necrotroph causing a common citrus postharvest disease known as green mold. In order to gain insight into the genetic bases of its virulence mechanisms and its high degree of host-specificity, the genomes of two P. digitatum strains that differ in their antifungal resistance traits have been sequenced and compared with those of 28 other Pezizomycotina. Results: The two sequenced genomes are highly similar, but important differences between ...

  1. The complete genome sequence of 'Candidatus Liberibacter americanus', associated with Citrus huanglongbing.

    Science.gov (United States)

    Wulff, Nelson A; Zhang, Shujian; Setubal, João C; Almeida, Nalvo F; Martins, Elaine C; Harakava, Ricardo; Kumar, Dibyendu; Rangel, Luiz Thiberio; Foissac, Xavier; Bové, Joseph M; Gabriel, Dean W

    2014-02-01

    Liberibacter spp. form a Rhizobiaceae clade of phloem-limited pathogens of limited host range. Two obligately parasitic species have been sequenced: 'Candidatus Liberibacter asiaticus', which causes citrus huanglongbing (HLB) worldwide, and 'Ca. L. solanacearum', which causes potato "zebra chip" disease. A third (proposed) species, Liberibacter crescens, was isolated from mountain papaya, grown in axenic culture, and sequenced. In an effort to identify common host determinants, the complete genomic DNA sequence of a second HLB species, 'Ca. L. americanus' strain 'São Paulo' was determined. The circular genome of 1,195,201 bp had an average 31.12% GC content and 983 predicted protein encoding genes, 800 (81.4%) of which had a predicted function. There were 658 genes common to all sequenced Liberibacter spp. and only 8 genes common to 'Ca. L. americanus' and 'Ca. L. asiaticus' but not found in 'Ca. L. solanacearum'. Surprisingly, most of the lipopolysaccharide biosynthetic genes were missing from the 'Ca. L. americanus' genome, as well as OmpA and a key regulator of flagellin, all indicating a 'Ca. L. americanus' strategy of avoiding production of major pathogen-associated molecular patterns present in 'Ca. L. asiaticus' and 'Ca. L. solanacearum'. As with 'Ca. L. asiaticus', one of two 'Ca. L. americanus' prophages replicated as an excision plasmid and carried potential lysogenic conversion genes that appeared fragmentary or degenerated in 'Ca. L. solanacearum'. PMID:24200077

  2. The complete genome sequence of Citrus vein enation virus from China

    Institute of Scientific and Technical Information of China (English)

    HUANG Ai-jun; SONG Zhen; CAO Meng-ji; CHEN Hong-ming; LI Zhong-an; ZHOU Chang-yong

    2015-01-01

    The complete nucleotide sequence of an isolate of Citrus vein enation virus (CVEV-XZG) from China has been determined for the first time. The genome consists of 5,983 nucleotides, codes for five open reading frames (ORFs), and has a genomic organization similar to that of Pea enation mosaic virus (PEMV). Nucleotide and deduced amino acid sequence identities of the five ORFs compared to isolate CVEV VE-1 range from 97.1 to 99.0% and 97.4 to 100.0%, respectively, whereas the corresponding values compared to isolate PEMV-1 range from 45.2 to 51.6% and 31.1 to 45.2%. Phylogenetic analysis based on the complete genome sequence showed that the isolate CVEV-XZG is closely related to Pea enation mosaic virus. The results support the view that CVEV may be a new member of the genus Enamovirus. The full sequence of CVEV-XZG presented here may serve as a basis for future study of CVEV in China.

  3. Complete Chloroplast Genome Sequence of Omani Lime (Citrus aurantiifolia) and Comparative Analysis within the Rosids

    OpenAIRE

    Huei-Jiun Su; Hogenhout, Saskia A.; Al-Sadi, Abdullah M.; Chih-Horng Kuo

    2014-01-01

    The genus Citrus contains many economically important fruits that are grown worldwide for their high nutritional and medicinal value. Due to frequent hybridizations among species and cultivars, the exact number of natural species and the taxonomic relationships within this genus are unclear. To compare the differences between the Citrus chloroplast genomes and to develop useful genetic markers, we used a reference-assisted approach to assemble the complete chloroplast genome of Omani lime (C....

  4. Genome sequence of the necrotrophic fungus Penicillium digitatum, the main postharvest pathogen of citrus

    Directory of Open Access Journals (Sweden)

    Marcet-Houben Marina

    2012-11-01

    Background: Penicillium digitatum is a fungal necrotroph causing a common citrus postharvest disease known as green mold. In order to gain insight into the genetic bases of its virulence mechanisms and its high degree of host-specificity, the genomes of two P. digitatum strains that differ in their antifungal resistance traits have been sequenced and compared with those of 28 other Pezizomycotina. Results: The two sequenced genomes are highly similar, but important differences between them include the presence of a unique gene cluster in the resistant strain, and mutations previously shown to confer fungicide resistance. The two strains, which were isolated in Spain, and another isolated in China have identical mitochondrial genome sequences suggesting a recent worldwide expansion of the species. Comparison with the closely-related but non-phytopathogenic P. chrysogenum reveals a much smaller gene content in P. digitatum, consistent with a more specialized lifestyle. We show that large regions of the P. chrysogenum genome, including entire supercontigs, are absent from P. digitatum, and that this is the result of large gene family expansions rather than acquisition through horizontal gene transfer. Our analysis of the P. digitatum genome is indicative of heterothallic sexual reproduction and reveals the molecular basis for the inability of this species to assimilate nitrate or produce the metabolites patulin and penicillin. Finally, we identify the predicted secretome, which provides a first approximation to the protein repertoire used during invasive growth. Conclusions: The complete genome of P. digitatum, the first of a phytopathogenic Penicillium species, is a valuable tool for understanding the virulence mechanisms and host-specificity of this economically important pest.

  5. Novel insights into the genomic basis of citrus canker based on the genome sequences of two strains of Xanthomonas fuscans subsp. aurantifolii

    Directory of Open Access Journals (Sweden)

    Nociti Letícia A

    2010-04-01

    Background: Citrus canker is a disease that has severe economic impact on the citrus industry worldwide. There are three types of canker, called A, B, and C. The three types have different phenotypes and affect different citrus species. The causative agent for type A is Xanthomonas citri subsp. citri, whose genome sequence was made available in 2002. Xanthomonas fuscans subsp. aurantifolii strain B causes canker B and Xanthomonas fuscans subsp. aurantifolii strain C causes canker C. Results: We have sequenced the genomes of strains B and C to draft status. We have compared their genomic content to X. citri subsp. citri and to other Xanthomonas genomes, with special emphasis on type III secreted effector repertoires. In addition to pthA, already known to be present in all three citrus canker strains, two additional effector genes, xopE3 and xopAI, are also present in all three strains and are both located on the same putative genomic island. These two effector genes, along with one other effector-like gene in the same region, are thus good candidates for being pathogenicity factors on citrus. Numerous gene content differences also exist between the three canker strains, which can be correlated with their different virulence and host range. Particular attention was placed on the analysis of genes involved in biofilm formation and quorum sensing, type IV secretion, flagellum synthesis and motility, lipopolysaccharide synthesis, and on the gene xacPNP, which codes for a natriuretic protein. Conclusion: We have uncovered numerous commonalities and differences in gene content between the genomes of the pathogenic agents causing citrus canker A, B, and C and other Xanthomonas genomes. Molecular genetics can now be employed to determine the role of these genes in plant-microbe interactions. The gained knowledge will be instrumental for improving citrus canker control.

  6. Draft Genome Sequence of Methylobacterium mesophilicum Strain SR1.6/6, Isolated from Citrus sinensis

    OpenAIRE

    Marinho Almeida, Diogo; Dini-Andreote, Francisco; Camargo Neves, Aline Aparecida; Jucá Ramos, Rommel Thiago; Andreote, Fernando Dini; Carneiro, Adriana Ribeiro; Oliveira de Souza Lima, André; Caracciolo Gomes de Sá, Pablo Henrique; Ribeiro Barbosa, Maria Silvanira; Araújo, Welington Luiz; Silva, Artur

    2013-01-01

    Methylobacterium mesophilicum strain SR1.6/6 is an endophytic bacterium isolated from a surface-sterilized Citrus sinensis branch. Ecological and biotechnological aspects of this bacterium, such as the genes involved in its association with the host plant and the primary oxidation of methanol, were annotated in the draft genome.

  7. Nuclear Species-Diagnostic SNP Markers Mined from 454 Amplicon Sequencing Reveal Admixture Genomic Structure of Modern Citrus Varieties

    Science.gov (United States)

    Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick

    2015-01-01

    Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. The developed species-diagnostic SNP

  8. Nuclear species-diagnostic SNP markers mined from 454 amplicon sequencing reveal admixture genomic structure of modern citrus varieties.

    Science.gov (United States)

    Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick

    2015-01-01

    Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. The developed species-diagnostic SNP

  9. The complete chloroplast genome sequence of Citrus sinensis (L.) Osbeck var 'Ridge Pineapple': organization and phylogenetic relationships to other angiosperms

    Directory of Open Access Journals (Sweden)

    Jansen Robert K

    2006-09-01

    Background: The production of Citrus, the largest fruit crop of international economic value, has recently been imperiled due to the introduction of the bacterial disease Citrus canker. No significant improvements have been made to combat this disease by plant breeding and nuclear transgenic approaches. Chloroplast genetic engineering has a number of advantages over nuclear transformation; it not only increases transgene expression but also facilitates transgene containment, which is one of the major impediments for development of transgenic trees. We have sequenced the Citrus chloroplast genome to facilitate genetic improvement of this crop and to assess phylogenetic relationships among major lineages of angiosperms. Results: The complete chloroplast genome sequence of Citrus sinensis is 160,129 bp in length, and contains 133 genes (89 protein-coding, 4 rRNAs and 30 distinct tRNAs). Genome organization is very similar to the inferred ancestral angiosperm chloroplast genome. However, in Citrus the infA gene is absent. The inverted repeat region has expanded to duplicate rps19 and the first 84 amino acids of rpl22. The rpl22 gene in the IRb region has a nonsense mutation resulting in 9 stop codons. This was confirmed by PCR amplification and sequencing using primers that flank the IR/LSC boundaries. Repeat analysis identified 29 direct and inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Comparison of protein-coding sequences with expressed sequence tags revealed six putative RNA edits, five of which resulted in non-synonymous modifications in petL, psbH, ycf2 and ndhA. Phylogenetic analyses using maximum parsimony (MP) and maximum likelihood (ML) methods of a dataset composed of 61 protein-coding genes for 30 taxa provide strong support for the monophyly of several major clades of angiosperms, including monocots, eudicots, rosids and asterids. The MP and ML trees are incongruent in three areas: the position of Amborella and

  10. Development of genomic resources for Citrus clementina: Characterization of three deep-coverage BAC libraries and analysis of 46,000 BAC end sequences

    Directory of Open Access Journals (Sweden)

    Talon Manuel

    2008-09-01

    Full Text Available Abstract Background Citrus species constitute one of the major tree fruit crops of the subtropical regions with great economic importance. However, their peculiar reproductive characteristics, low genetic diversity and the long-term nature of tree breeding mostly impair citrus variety improvement. In woody plants, genomic science holds promise of improvements and in the Citrus genera the development of genomic tools may be crucial for further crop improvements. In this work we report the characterization of three BAC libraries from Clementine (Citrus clementina), one of the most relevant citrus fresh fruit market cultivars, and the analyses of 46,000 BAC end sequences. Clementine is a diploid plant with an estimated haploid genome size of 367 Mb and 2n = 18 chromosomes, which makes feasible the use of genomics tools to boost genetic improvement. Results Three genomic BAC libraries of Citrus clementina were constructed through EcoRI, MboI and HindIII digestions and 56,000 clones, representing an estimated genomic coverage of 19.5 haploid genome-equivalents, were picked. BAC end sequencing (BES) of 28,000 clones produced 28.1 Mb of genomic sequence that allowed the identification of the repetitive fraction (12.5% of the genome) and estimation of gene content (31,000 genes) of this species. BES analyses identified 3,800 SSRs and 6,617 putative SNPs. Comparative genomic studies showed that citrus gene homology and microsynteny with Populus trichocarpa was rather higher than with Arabidopsis thaliana, a species phylogenetically closer to citrus. Conclusion In this work, we report the characterization of three BAC libraries from C. clementina, and a new set of genomic resources that may be useful for isolation of genes underlying economically important traits, physical mapping and eventually crop improvement in Citrus species. 
In addition, BAC end sequencing has provided a first insight on the basic structure and organization of the citrus genome and has
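
    A note on the coverage figure quoted in the record above: library coverage in haploid genome-equivalents is commonly estimated as (number of clones × mean insert size) / haploid genome size. The abstract excerpt does not state the mean insert size, so the sketch below back-calculates the value implied by 56,000 clones, 19.5 genome-equivalents and a 367 Mb genome. The function names are illustrative, and the result is an implied average under this simple model, not a figure reported by the authors.

```python
# Simple library-coverage model (an assumption, not the authors' stated method):
#   coverage = clones * mean_insert_size / haploid_genome_size
# Units: 1 Mb = 1000 kb.

def library_coverage(clones: int, mean_insert_kb: float, genome_mb: float) -> float:
    """Coverage in haploid genome-equivalents for a clone library."""
    return clones * mean_insert_kb / (genome_mb * 1000.0)


def implied_mean_insert_kb(clones: int, coverage: float, genome_mb: float) -> float:
    """Mean insert size (kb) implied by a clone count and a coverage estimate."""
    return coverage * genome_mb * 1000.0 / clones


# Figures taken from the abstract: 56,000 clones, 19.5x coverage, 367 Mb genome.
insert_kb = implied_mean_insert_kb(clones=56_000, coverage=19.5, genome_mb=367.0)
print(f"implied mean insert size: {insert_kb:.1f} kb")
```

    Plugging the implied insert size back into library_coverage returns the quoted 19.5 genome-equivalents, which is only a consistency check on the arithmetic.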

  11. Genome-wide comparative analysis reveals possible common ancestors of nucleotide-binding sites domain containing genes in hybrid Citrus sinensis genome and original Citrus clementina genome

    Science.gov (United States)

    We identified and re-annotated candidate disease resistance (R) genes with nucleotide-binding sites (NBS) domain from a Citrus clementina genome and two complete Citrus sinensis genome sequences (one from the USA and one from China). We found similar numbers of NBS genes from three citrus genomes, r...

  12. Draft Genome Sequence of “Candidatus Liberibacter americanus” Bacterium Associated with Citrus Huanglongbing in Brazil

    OpenAIRE

    Lin, Hong; Coletta-Filho, Helvecio D; Han, Cliff S.; Lou, Binghai; Civerolo, Edwin L; Machado, Marcos A; Gupta, Goutam

    2013-01-01

    We report here the draft genome sequence of “Candidatus Liberibacter americanus” strain PW_SP. The 1,176,071-bp genome, with 31.6% G+C content, comprises 948 open reading frames, 38 tRNAs, and three complete rRNAs.

  13. Complete Genome Sequence of "Candidatus Liberibacter africanus," a Bacterium Associated with Citrus Huanglongbing.

    Science.gov (United States)

    Lin, Hong; Pietersen, Gerhard; Han, Cliff; Read, David Alan; Lou, Binghai; Gupta, Goutam; Civerolo, Edwin L

    2015-01-01

    We report here the complete genome sequence of "Candidatus Liberibacter africanus" strain PTSAPSY. The 1,192,232-bp genome with 34.5% G+C content comprises 1,017 open reading frames, 44 tRNAs, and three complete rRNAs in a circular chromosome. PMID:26184931
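
    The G+C content quoted in genome announcements such as the one above is the percentage of G and C bases among all called bases. A minimal sketch of that calculation follows; the function name and the toy sequence are illustrative assumptions and do not come from the Liberibacter records.

```python
def gc_content(seq: str) -> float:
    """Return the G+C percentage of a nucleotide sequence.

    Only A, C, G and T are counted; ambiguity codes such as N are ignored.
    """
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return 100.0 * gc / (gc + at)


toy = "ATGCGCATTA"  # toy sequence for illustration, not real genome data
print(f"G+C content: {gc_content(toy):.1f}%")
```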

  14. Complete Genome sequence of citrus huanglongbing bacterium, ‘Candidatus Liberibacter asiaticus’ obtained through metagenomics

    Science.gov (United States)

    Citrus huanglongbing (HLB) is the most destructive disease of citrus worldwide. It is spread by the citrus psyllids (Diaphorina citri and Trioza erytreae), and is associated with low-titer, phloem-limited infections by any of the three uncultured species of α-Proteobacteria: 'Candidatus Liberibact...

  15. Citrus breeding, genetics and genomics in Japan.

    Science.gov (United States)

    Omura, Mitsuo; Shimada, Takehiko

    2016-01-01

    Citrus is one of the most cultivated fruits in the world, and satsuma mandarin (Citrus unshiu Marc.) is a major cultivated citrus in Japan. Many excellent cultivars derived from satsuma mandarin have been released through the improvement of mandarins using a conventional breeding method. The citrus breeding program is a lengthy process owing to the long juvenility, and it is predicted that marker-assisted selection (MAS) will overcome the obstacle and improve the efficiency of conventional breeding methods. To promote citrus molecular breeding in Japan, a genetic mapping was initiated in 1987, and the experimental tools and resources necessary for citrus functional genomics have been developed in relation to the physiological analysis of satsuma mandarin. In this paper, we review the progress of citrus breeding and genome researches in Japan and report the studies on genetic mapping, expression sequence tag cataloguing, and molecular characterization of breeding characteristics, mainly in terms of the metabolism of bio-functional substances as well as factors relating to, for example, fruit quality, disease resistance, polyembryony, and flowering. PMID:27069387

  16. Complete Genome Sequence of Diaphorina citri-associated C virus, a Novel Putative RNA Virus of the Asian Citrus Psyllid, Diaphorina citri

    Science.gov (United States)

    Nouri, Shahideh; Salem, Nidà

    2016-01-01

    We present here the complete nucleotide sequence and genome organization of a novel putative RNA virus identified in field populations of the Asian citrus psyllid, Diaphorina citri, through sequencing of the transcriptome followed by reverse transcription-PCR (RT-PCR). We tentatively named this virus Diaphorina citri-associated C virus (DcACV). DcACV is an unclassified positive-sense RNA virus. PMID:27445370

  17. Complete Genome Sequence of Diaphorina citri-associated C virus, a Novel Putative RNA Virus of the Asian Citrus Psyllid, Diaphorina citri.

    Science.gov (United States)

    Nouri, Shahideh; Salem, Nidà; Falk, Bryce W

    2016-01-01

    We present here the complete nucleotide sequence and genome organization of a novel putative RNA virus identified in field populations of the Asian citrus psyllid, Diaphorina citri, through sequencing of the transcriptome followed by reverse transcription-PCR (RT-PCR). We tentatively named this virus Diaphorina citri-associated C virus (DcACV). DcACV is an unclassified positive-sense RNA virus. PMID:27445370

  18. Development of genomic resources for Citrus clementina: Characterization of three deep-coverage BAC libraries and analysis of 46,000 BAC end sequences

    OpenAIRE

    Talon Manuel; Ollitrault Patrick; Naranjo M Angel; Terol Javier

    2008-01-01

    Abstract Background Citrus species constitute one of the major tree fruit crops of the subtropical regions with great economic importance. However, their peculiar reproductive characteristics, low genetic diversity and the long-term nature of tree breeding mostly impair citrus variety improvement. In woody plants, genomic science holds promise of improvements and in the Citrus genera the development of genomic tools may be crucial for further crop improvements. In this work we report the char...

  19. Sequence analysis of three citrus viroids infecting a single Tunisian citrus tree (Citrus reticulata, Clementine)

    Directory of Open Access Journals (Sweden)

    Amine Elleuch

    2006-01-01

    Full Text Available We report the nucleotide sequences of three citrus viroids belonging to three different genera: Citrus exocortis viroid (CEVd), Hop stunt viroid (HSVd) and Citrus viroid-III (CVd-III), isolated from a single naturally infected Citrus reticulata var. Clementine tree growing in a tree nursery in Manouba (near Tunis Capital). We describe the sequence variability of these viroids from their natural host without using an alternative passage by an indicator host or an artificial inoculation. This work confirms that naturally occurring viroid infections contain a mixture of sequence variants. These are the first sequences of citrus viroids from Africa.

  20. The Asian Citrus Psyllid Genome (Diaphorina citri, Hemiptera)

    OpenAIRE

    Hunter, Wayne B.; Reese, Justin; International Psyllid Genome Consortium, The

    2014-01-01

    The Asian citrus psyllid, Diaphorina citri (Hemiptera), is an important pest of citrus because it vectors bacteria responsible for huanglongbing, which is one of the most serious diseases of citrus worldwide. The first genome draft of D. citri (DIACI_1.0) was completed in 2011 (ARS, Ft. Pierce, FL); however, gaps in the assembly prompted additional sequencing using the long run PacBio system at the Los Alamos National Lab, NM. The revised draft genome (DIACI_1.1) was assembled using the new...

  1. Complete Genome Sequence of “Candidatus Liberibacter africanus,” a Bacterium Associated with Citrus Huanglongbing

    OpenAIRE

    Lin, Hong; Pietersen, Gerhard; Han, Cliff; Read, David Alan; Lou, Binghai; Gupta, Goutam; Civerolo, Edwin L

    2015-01-01

    We report here the complete genome sequence of “Candidatus Liberibacter africanus” strain PTSAPSY. The 1,192,232-bp genome with 34.5% G+C content comprises 1,017 open reading frames, 44 tRNAs, and three complete rRNAs in a circular chromosome.

  2. Complete Genome Sequence of “Candidatus Liberibacter africanus,” a Bacterium Associated with Citrus Huanglongbing

    Science.gov (United States)

    Pietersen, Gerhard; Han, Cliff; Read, David Alan; Lou, Binghai; Gupta, Goutam; Civerolo, Edwin L.

    2015-01-01

    We report here the complete genome sequence of “Candidatus Liberibacter africanus” strain PTSAPSY. The 1,192,232-bp genome with 34.5% G+C content comprises 1,017 open reading frames, 44 tRNAs, and three complete rRNAs in a circular chromosome. PMID:26184931

  3. Citrus sinensis Annotation Project (CAP): A Comprehensive Database for Sweet Orange Genome

    OpenAIRE

    Wang, Jia; Chen, DiJun; Lei, Yang; Chang, Ji-Wei; Hao, Bao-Hai; Xing, Feng; Li, Sen; Xu, Qiang; Deng, Xiu-Xin; Chen, Ling-ling

    2014-01-01

    Citrus is one of the most important and widely grown fruit crops, with global production ranking first among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-...

  4. Genomics meets induced mutations in citrus: identification of deleted genes through comparative genomic hybridization

    International Nuclear Information System (INIS)

    We report on the use of genomic approaches to identify pivotal genes in induced citrus mutants. Citrus is the most economically important fruit crop in the world, while Spain is the leading fresh citrus producer. The survival of the Citrus industry is critically dependent on genetically superior cultivars but improvements in fruit quality traits through traditional techniques are extremely difficult due to the unusual combination of biological characteristics of citrus. Genomic science, however, holds promise of improvements in breeding. In this work, we report the successful identification of genes included in hemizygous deletions induced by fast neutron irradiation on Citrus clementina. Microarray-based CGH was used to identify underrepresented genes in a citrus mutant that shows color break delay. Subsequent confirmation of gene doses through quantitative PCR and comparison of best hits of putative deleted citrus genes against annotated genomes from other eudicots, especially poplar, enabled the prediction that these genes were clustered into a 700 kb fragment. The availability of Citrus BAC end sequences helped to draw a partial physical map of the deletion. Furthermore, gene content and order in the deleted segment were established by PCR location of gene hits on the physical map. Finally, a lower chlorophyll a/b ratio was found in green tissues from the mutant, an observation that can be related to the hemizygous deletion of a ClpC-like gene, encoding a putative subunit of a multifunctional protease complex located in the chloroplast. Analysis of gene content and order inside this Citrus deletion led to the conclusion that microsynteny and local gene colinearity with Populus trichocarpa were higher than with the phylogenetically closer Arabidopsis thaliana genome. In conclusion, a combined strategy including genomics tools and induced citrus mutations has proved to be a successful approach to identify genes with major roles in citrus fruit development.

  5. Genomics Meets Induced Mutations in Citrus: Identification of Deleted Genes Through Comparative Genomic Hybridization

    International Nuclear Information System (INIS)

    We report on the use of genomic approaches to identify pivotal genes in induced citrus mutants. Citrus is the most economically important fruit crop in the world and Spain is the leading fresh citrus producer. The survival of the citrus industry is critically dependent on genetically superior cultivars but improvements in fruit quality traits through traditional techniques are extremely difficult due to the unusual combination of biological characteristics of citrus. Genomic science, however, holds promise of improvements in breeding. In this work, we report the successful identification of genes included in hemizygous deletions induced by fast neutron irradiation on Citrus clementina. Microarray-based CGH was used to identify underrepresented genes in a citrus mutant that shows color break delay. Subsequent confirmation of gene doses through quantitative PCR and comparison of best hits of putative deleted citrus genes against annotated genomes from other eudicots, especially poplar, enabled the prediction that these genes were clustered into a 700 kb fragment. The availability of Citrus BAC end sequences helped to draw a partial physical map of the deletion. Furthermore, gene content and order in the deleted segment were established by PCR location of gene hits on the physical map. Finally, a lower chlorophyll a/b ratio was found in green tissues from the mutant, an observation that can be related to the hemizygous deletion of a ClpC-like gene, encoding a putative subunit of a multifunctional protease complex located in the chloroplast. Analysis of gene content and order inside this Citrus deletion led to the conclusion that microsynteny and local gene colinearity with Populus trichocarpa were higher than with the phylogenetically closer Arabidopsis thaliana genome. In conclusion, a combined strategy including genomics tools and induced citrus mutations has proved to be a successful approach to identify genes with major roles in citrus fruit development.

  6. Genome Sequencing

    DEFF Research Database (Denmark)

    Sato, Shusei; Andersen, Stig Uggerhøj

    2014-01-01

    The current Lotus japonicus reference genome sequence is based on a hybrid assembly of Sanger TAC/BAC, Sanger shotgun and Illumina shotgun sequencing data generated from the Miyakojima-MG20 accession. It covers nearly all expressed L. japonicus genes and has been annotated mainly based on transcr...

  7. Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

    OpenAIRE

    Chen, Chunxian; Gmitter Jr, Fred G

    2013-01-01

    Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for...

  8. Transcriptome analysis of root response to citrus blight based on the newly assembled Swingle citrumelo draft genome

    OpenAIRE

    Zhang, Yunzeng; Barthe, Gary; Grosser, Jude W.; Wang, Nian

    2016-01-01

    Background Citrus blight is a citrus tree overall decline disease and causes serious losses in the citrus industry worldwide. Although it was described more than one hundred years ago, its causal agent remains unknown and its pathophysiology is not well determined, which hampers our understanding of the disease and design of suitable disease management. Results In this study, we sequenced and assembled the draft genome for Swingle citrumelo, one important citrus rootstock. The draft genome is...

  9. Complete Genome Sequence of Xanthomonas citri subsp. citri Strain Aw12879, a Restricted-Host-Range Citrus Canker-Causing Bacterium

    OpenAIRE

    Jalan, Neha; Kumar, Dibyendu; Yu, Fahong; Jones, Jeffrey B; Graham, James H; Wang, Nian

    2013-01-01

    Xanthomonas citri subsp. citri causes citrus canker. The Asiatic strain has a broad host range, whereas the Wellington variant has a restricted host range. Here, we present the complete genome of X. citri subsp. citri strain AW12879. This study lays the foundation to further characterize the mechanisms for virulence and host range of X. citri.

  10. Genome Assembly of Citrus Leprosis Virus Nuclear Type Reveals a Close Association with Orchid Fleck Virus

    OpenAIRE

    Roy, Avijit; Stone, Andrew; Otero-Colina, Gabriel; Wei, Gang; Choudhary, Nandlal; Achor, Diann; Shao, Jonathan; Levy, Laurene; Nakhla, Mark K.; Hollingsworth, Charla R.; Hartung, John S.; Schneider, William L.; Brlansky, Ronald H.

    2013-01-01

    The complete genome of citrus leprosis virus nuclear type (CiLV-N) was identified by small RNA sequencing utilizing leprosis-affected citrus samples collected from the state of Querétaro, Mexico. The nucleotide identity and phylogenetic analysis indicate that CiLV-N is very closely related to orchid fleck virus, which typically infects Cymbidium species.

  11. Clues into the metagenome of huanglongbing infected citrus by analysis of ancillary sequences from Ion Torrent whole genome Candidatus Liberibacter asiaticus sequencing

    Science.gov (United States)

    Huanglongbing (HLB) is a globally devastating disease of citrus. Presently, three etiological agents are associated with HLB: Candidatus Liberibacter asiaticus (CLas), Candidatus Liberibacter americanus, and Candidatus Liberibacter africanus. Attempts to determine alternate (non-Liberi...

  12. Clues into the metagenome of Huanglongbing infected Citrus by analysis of ancillary sequences from Ion Torrent whole genome Candidatus Liberibacter asiaticus sequencing

    OpenAIRE

    Morgan, J K; Shatters, R. G.; Stover, E; Duan, Y. P.; Moore, G. A.; Powell, C A; Jarra-Cavieres, A.; Clark, S

    2014-01-01

    Huanglongbing (HLB) is a globally devastating disease of citrus. Presently, three etiological agents are associated with HLB: Candidatus Liberibacter asiaticus (CLas), Candidatus Liberibacter americanus, and Candidatus Liberibacter africanus. Attempts to determine alternate (non-Liberibacter) associated etiological agents of HLB have been performed, namely by metagenomic analyses with HLB phenotypic citrus of phloem tissue isolated from bark [1] and whole leaf midribs [2]. Thes...

  13. Whole Genome Sequencing

    Science.gov (United States)

    What is whole genome sequencing? Whole genome sequencing is the mapping out ...

  14. Breeding, genetic and genomic of citrus for disease resistance

    Directory of Open Access Journals (Sweden)

    Marcos A. Machado

    2011-10-01

    Full Text Available Although citriculture is one of the most important economic activities in Brazil, it is based on a small number of varieties. This fact has contributed to the vulnerability of the crop to phytosanitary problems. A higher number of varieties/genotypes with potential for commercial growing, either for industry or the fresh market, has been one of the main objectives of citrus breeding programs. The genetic breeding of citrus has improved in recent decades owing to the possibility of combining biotechnological tools with classical breeding methods. The use of molecular markers for early selection of zygotic seedlings from controlled crosses made it possible to select a high number of new combinations and, as a consequence, to establish a great number of hybrids in field experiments. The faster new tools are incorporated into the program, the faster new genotypes can be obtained and tested as new varieties. Good traits should be kept or incorporated, whereas bad traits have to be excluded or minimized in the new genotype. Scion and rootstock cannot be considered separately, and graft compatibility, fruit quality and productivity are essential traits to be evaluated in the last stages of the program. The mapping of QTLs has favored breeding programs of several perennial species, and in citrus it was possible to map several characteristics with qualitative and quantitative inheritance. The existence of linkage maps and already mapped QTLs, the development of EST and BAC libraries and the sequencing of the complete Citrus genome together make it urgent to explore such data to launch a wider genetic study of citrus. The growing amount of genome information for several organisms has opened new approaches seeking integration between breeding, genetics and genomics. 
Genome assisted selection (GAS) involves more than gene or complete genome sequencing and is becoming

  15. Genome Wide Characterization of Short Tandem Repeat Markers in Sweet Orange (Citrus sinensis)

    OpenAIRE

    Biswas, Manosh Kumar; Xu, Qiang; Mayer, Christoph; Deng, Xiuxin

    2014-01-01

    Sweet orange (Citrus sinensis) is one of the major cultivated and most-consumed citrus species. With the goal of enhancing the genomic resources in citrus, we surveyed, developed and characterized microsatellite markers in the ≈347 Mb sequence assembly of the sweet orange genome. A total of 50,846 SSRs were identified with a frequency of 146.4 SSRs/Mbp. Dinucleotide repeats are the most frequent repeat class and the highest density of SSRs was found in chromosome 4. SSRs are non-randomly dist...

  16. Genome wide characterization of short tandem repeat markers in sweet orange (Citrus sinensis).

    Directory of Open Access Journals (Sweden)

    Manosh Kumar Biswas

    Full Text Available Sweet orange (Citrus sinensis) is one of the major cultivated and most-consumed citrus species. With the goal of enhancing the genomic resources in citrus, we surveyed, developed and characterized microsatellite markers in the ≈347 Mb sequence assembly of the sweet orange genome. A total of 50,846 SSRs were identified with a frequency of 146.4 SSRs/Mbp. Dinucleotide repeats are the most frequent repeat class and the highest density of SSRs was found in chromosome 4. SSRs are non-randomly distributed in the genome and most of the SSRs (62.02%) are located in the intergenic regions. We found that AT-rich SSRs are more frequent than GC-rich SSRs. A total number of 21,248 SSR primers were successfully developed, which represents 89 SSR markers per Mb of the genome. A subset of 950 developed SSR primer pairs were synthesized and tested by wet lab experiments on a set of 16 citrus accessions. In total we identified 534 (56.21%) polymorphic SSR markers that will be useful in citrus improvement. The number of amplified alleles ranges from 2 to 12 with an average of 4 alleles per marker and an average PIC value of 0.75. The newly developed sweet orange primer sequences, their in silico PCR products, exact position in the genome assembly and putative function are made publicly available. We present the largest number of SSR markers ever developed for a citrus species. Almost two thirds of the markers are transferable to 16 citrus relatives and may be used for constructing a high density linkage map. In addition, they are valuable for marker-assisted selection studies, population structure analyses and comparative genomic studies of C. sinensis with other citrus related species. Altogether, these markers provide a significant contribution to the citrus research community.
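
    To illustrate what a genome-wide SSR survey of this kind involves, the sketch below scans a sequence for perfect di-, tri- and tetranucleotide repeats with a regular expression. It is a simplified illustration and not the pipeline used in the study: the function name, the minimum repeat counts (6, 5 and 4) and the toy sequence are all assumptions chosen for the example.

```python
import re

# Hypothetical minimum repeat counts per motif length (not taken from the study).
DEFAULT_MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, repeat_count) for perfect SSRs in seq.

    start is a 0-based offset. Motifs that are themselves repeats of a
    shorter unit (AA, ATAT, ...) are skipped so each locus is reported
    under its shortest motif only.
    """
    if min_repeats is None:
        min_repeats = DEFAULT_MIN_REPEATS
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # A string is periodic exactly when it occurs inside its own
            # doubling with the first and last characters removed.
            if motif in (motif + motif)[1:-1]:
                continue
            hits.append((m.start(), motif, len(m.group(0)) // unit_len))
    return hits


toy = "GGC" + "AT" * 7 + "CCGT" + "AAG" * 5 + "TTC"  # toy sequence, not citrus data
for start, motif, count in find_ssrs(toy):
    print(start, motif, count)
```

    On a real assembly, dividing the number of hits by the assembly length in Mb gives a density comparable in kind to the SSRs/Mbp figure quoted above, although the value depends strongly on the thresholds chosen.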

  17. Genome-wide comparative analysis reveals similar types of NBS genes in hybrid Citrus sinensis genome and original Citrus clementine genome and provides new insights into non-TIR NBS genes.

    Directory of Open Access Journals (Sweden)

    Yunsheng Wang

    Full Text Available In this study, we identified and compared nucleotide-binding site (NBS) domain-containing genes from three Citrus genomes (C. clementina, C. sinensis from USA and C. sinensis from China). Phylogenetic analysis of all Citrus NBS genes across these three genomes revealed that there are three approximately evenly numbered groups: one group contains the Toll-Interleukin receptor (TIR) domain and two different Non-TIR groups in which most of the proteins contain the Coiled Coil (CC) domain. Motif analysis confirmed that the two groups of CC-containing NBS genes are from different evolutionary origins. We partitioned NBS genes into clades using NBS domain sequence distances and found most clades include NBS genes from all three Citrus genomes. This suggests that three Citrus genomes have similar numbers and types of NBS genes. We also mapped the re-sequenced reads of three pomelo and three mandarin genomes onto the C. sinensis genome. We found that most NBS genes of the hybrid C. sinensis genome have corresponding homologous genes in both pomelo and mandarin genomes. The homologous NBS genes in pomelo and mandarin suggest that the parental species of C. sinensis may contain similar types of NBS genes. This explains why the hybrid C. sinensis and original C. clementina have similar types of NBS genes in this study. Furthermore, we found that sequence variation amongst Citrus NBS genes was shaped by multiple independent and shared accelerated mutation accumulation events among different groups of NBS genes and in different Citrus genomes. Our comparative analyses yield valuable insight into the structure, organization and evolution of NBS genes in Citrus genomes. Furthermore, our comprehensive analysis showed that the non-TIR NBS genes can be divided into two groups that come from different evolutionary origins. This provides new insights into non-TIR genes, which have not received much attention.

  18. Citrus plastid-related gene profiling based on expressed sequence tag analyses

    Directory of Open Access Journals (Sweden)

    Tercilio Calsa Jr.

    2007-01-01

    Full Text Available Plastid-related sequences, derived from putative nuclear or plastome genes, were searched in a large collection of expressed sequence tags (ESTs) and genomic sequences from the Citrus Biotechnology initiative in Brazil. The identified putative Citrus chloroplast gene sequences were compared to those from Arabidopsis, Eucalyptus and Pinus. Differential expression profiling for plastid-directed nuclear-encoded proteins and photosynthesis-related gene expression variation between Citrus sinensis and Citrus reticulata, when inoculated or not with Xylella fastidiosa, were also analyzed. Presumed Citrus plastome regions were more similar to Eucalyptus. Some putative genes appeared to be preferentially expressed in vegetative tissues (leaves and bark) or in reproductive organs (flowers and fruits). Genes preferentially expressed in fruit and flower may be associated with hypothetical physiological functions. Expression pattern clustering analysis suggested that photosynthesis- and carbon fixation-related genes appeared to be up- or down-regulated in a resistant or susceptible Citrus species after Xylella inoculation in comparison to non-infected controls, generating novel information which may be helpful to develop novel genetic manipulation strategies to control Citrus variegated chlorosis (CVC).

  19. Complete Genome Sequence of the Largest Known Flavi-Like Virus, Diaphorina citri flavi-like virus, a Novel Virus of the Asian Citrus Psyllid, Diaphorina citri.

    Science.gov (United States)

    Matsumura, Emilyn E; Nerva, Luca; Nigg, Jared C; Falk, Bryce W; Nouri, Shahideh

    2016-01-01

    A novel flavi-like virus tentatively named Diaphorina citri flavi-like virus (DcFLV) was identified in field populations of Diaphorina citri through small RNA and transcriptome sequencing followed by reverse transcription (RT)-PCR. We report here the complete nucleotide sequence and genome organization of DcFLV, the largest flavi-like virus identified to date. PMID:27609921

  20. A genome-wide 20 K citrus microarray for gene expression analysis

    Directory of Open Access Journals (Sweden)

    Gadea Jose

    2008-07-01

    Full Text Available Abstract Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with high genome coverage. In recent years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that includes 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, like general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global

  1. Citrus breeding, genetics and genomics in Japan

    OpenAIRE

    Omura, Mitsuo; Shimada, Takehiko

    2016-01-01

    Citrus is one of the most cultivated fruits in the world, and satsuma mandarin (Citrus unshiu Marc.) is a major cultivated citrus in Japan. Many excellent cultivars derived from satsuma mandarin have been released through the improvement of mandarins using a conventional breeding method. The citrus breeding program is a lengthy process owing to the long juvenility, and it is predicted that marker-assisted selection (MAS) will overcome the obstacle and improve the efficiency of conventional br...

  2. Next generation haplotyping to decipher nuclear genomic interspecific admixture in Citrus species: analysis of chromosome 2

    OpenAIRE

    Curk, Franck; Ancillo, Gema; Garcia-Lor, Andres; Luro, François; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Navarro, Luis; Ollitrault, Patrick

    2014-01-01

    Background The most economically important Citrus species originated by natural interspecific hybridization between four ancestral taxa (Citrus reticulata, Citrus maxima, Citrus medica, and Citrus micrantha) and from limited subsequent interspecific recombination as a result of apomixis and vegetative propagation. Such reticulate evolution coupled with vegetative propagation results in mosaic genomes with large chromosome fragments from the basic taxa in frequent interspecific heterozygosity....

  3. Development and characterization of genomic and expressed SSRs in citrus by genome-wide analysis.

    Directory of Open Access Journals (Sweden)

    Sheng-Rui Liu

    Full Text Available Microsatellites or simple sequence repeats (SSRs) are one of the most popular sources of genetic markers and play a significant role in plant genetics and breeding. In this study, we identified citrus SSRs in the genome of Clementine mandarin and analyzed their frequency and distribution in different genomic regions. A total of 80,708 SSRs were detected in the genome with an overall density of 268 SSRs/Mb. While di-nucleotide repeats were the most frequent microsatellites in genomic DNA sequence, tetra-nucleotides, which had more repeat units than any other SSR types, had the highest cumulative sequence length. We identified 6,834 transcripts as containing 8,989 SSRs in 33,929 Clementine mandarin transcripts, among which tri-nucleotide motifs (36.0%) were the most common, followed by di-nucleotide (26.9%) and hexa-nucleotide motifs (15.1%). The motif AG (16.7%) was most abundant among these SSRs, while motifs AAG (6.6%), AAT (5.0%), and TAG (2.2%) were most common among tri-nucleotides. Functional categorization of transcripts containing SSRs revealed that 5,879 (86.0%) of such transcripts had homology with known proteins. GO and KEGG annotation revealed that transcripts containing SSRs were implicated in diverse biological processes in plants, including binding, development, transcription, and protein degradation. When 27 genomic and 78 randomly selected SSRs were tested on Clementine mandarin, 95 SSRs revealed polymorphism. These 95 SSRs were further deployed on 18 genotypes of the three genera of Rutaceae for genetic diversity assessment; genomic SSRs generally showed low transferability in comparison to SSRs developed from expressed sequences. The transcript markers identified in our study may provide a valuable genetic and genomic tool for further genetic research and varietal development in citrus, such as diversity study, QTL mapping, molecular breeding, comparative mapping and other genetic analyses.
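    The kind of genome-wide SSR scan described in this record can be illustrated with a minimal sketch. The abstract does not name the detection tool or its thresholds, so the regular-expression approach, the function names and the minimum repeat counts below are all assumptions chosen for illustration only.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem repeats.
# These values are an assumption, not the settings used in the study.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, motif, repeat_count) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # A k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (for example "AA" or "AGAG"), so each locus is reported once.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


def ssr_density_per_mb(n_ssrs, genome_bp):
    """SSR density expressed as SSRs per megabase."""
    return n_ssrs / (genome_bp / 1_000_000)


if __name__ == "__main__":
    demo = "TTT" + "AG" * 8 + "CCC" + "AAT" * 5 + "GG"
    print(find_ssrs(demo))
    # The reported 80,708 SSRs at 268 SSRs/Mb imply roughly 301 Mb of sequence.
    print(80_708 / 268)
```

    On the toy sequence the scan reports one AG locus and one AAT locus. A real analysis would also handle imperfect and compound repeats, which this sketch ignores.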

  4. Comparative genomic characterization of citrus-associated Xylella fastidiosa strains

    Directory of Open Access Journals (Sweden)

    Nunes Luiz R

    2007-12-01

    Full Text Available Abstract Background The xylem-inhabiting bacterium Xylella fastidiosa (Xf) is the causal agent of Pierce's disease (PD) in vineyards and citrus variegated chlorosis (CVC) in orange trees. Both of these economically-devastating diseases are caused by distinct strains of this complex group of microorganisms, which has motivated researchers to conduct extensive genomic sequencing projects with Xf strains. This sequence information, along with other molecular tools, has been used to estimate the evolutionary history of the group and provide clues to understand the capacity of Xf to infect different hosts, causing a variety of symptoms. Nonetheless, although significant amounts of information have been generated from Xf strains, a large proportion of these efforts has concentrated on the study of North American strains, limiting our understanding about the genomic composition of South American strains, which is particularly important for CVC-associated strains. Results This paper describes the first genome-wide comparison among South American Xf strains, involving 6 distinct citrus-associated bacteria. Comparative analyses performed through a microarray-based approach allowed identification and characterization of large mobile genetic elements that seem to be exclusive to South American strains. Moreover, a large-scale sequencing effort, based on Suppressive Subtraction Hybridization (SSH), identified 290 new ORFs, distributed in 135 Groups of Orthologous Elements, throughout the genomes of these bacteria. Conclusion Results from microarray-based comparisons provide further evidence concerning activity of horizontally transferred elements, reinforcing their importance as major mediators in the evolution of Xf. Moreover, the microarray-based genomic profiles showed similarity between Xf strains 9a5c and Fb7, which is unexpected, given the geographical and chronological differences associated with the isolation of these microorganisms. The newly

  5. Sequence diversity on four ORFs of citrus tristeza virus correlates with pathogenicity

    OpenAIRE

    Martínez-Soriano Juan; Rivera-Bustamante Rafael; Ochoa-Sánchez Juan; Herrera-Isidrón Lisset

    2009-01-01

    Abstract The molecular characterization of isolates of citrus tristeza virus (CTV) from eight locations in Mexico was undertaken by analyzing five regions located at the opposite ends of the virus genome. Two regions have been previously used to study CTV variability (coat protein and p23), while the other three correspond to other genomic segments (p349-B, p349-C and p13). Our comparative nucleotide analyses included CTV sequences from different geographical origins already deposited in the ...

  6. Comparative Genomic Analysis of Xanthomonas axonopodis pv. citrumelo F1, Which Causes Citrus Bacterial Spot Disease, and Related Strains Provides Insights into Virulence and Host Specificity

    OpenAIRE

    Jalan, Neha; Aritua, Valente; Kumar, Dibyendu; Yu, Fahong; Jones, Jeffrey B; Graham, James H; Setubal, João C; Wang, Nian

    2011-01-01

    Xanthomonas axonopodis pv. citrumelo is a citrus pathogen causing citrus bacterial spot disease that is geographically restricted within the state of Florida. Illumina, 454 sequencing, and optical mapping were used to obtain a complete genome sequence of X. axonopodis pv. citrumelo strain F1, 4.9 Mb in size. The strain lacks plasmids, in contrast to other citrus Xanthomonas pathogens. Phylogenetic analysis revealed that this pathogen is very close to the tomato bacterial spot pathogen X. camp...

  7. Genomic variability of Citrus tristeza virus (CTV) isolates introduced into Morocco

    OpenAIRE

    Lbida, B.; Fonseca, Filomena; C. Santos; Zemzami, M.; Bennani, A; Nolasco, Gustavo

    2004-01-01

    Genomic variability of the coat protein gene of Citrus tristeza virus isolates obtained from old Meyer lemon introductions in Morocco and more recent budwood introductions from Spain were studied. The coat protein gene of the virus was amplified directly from infected tissue by immunocapture RT-PCR and analysed by single stranded conformation polymorphism (SSCP) and sequencing. Each isolate consisted of several related genomic variants, typical of a quasi-species. Although SSCP analysis has o...

  8. The complete chloroplast genome sequence of Citrus sinensis (L.) Osbeck var 'Ridge Pineapple': organization and phylogenetic relationships to other angiosperms

    OpenAIRE

    Jansen Robert K; Lee Seung-Bum; Singh Nameirakpam D; Bausher Michael G; Daniell Henry

    2006-01-01

    Abstract Background The production of Citrus, the largest fruit crop of international economic value, has recently been imperiled due to the introduction of the bacterial disease Citrus canker. No significant improvements have been made to combat this disease by plant breeding and nuclear transgenic approaches. Chloroplast genetic engineering has a number of advantages over nuclear transformation; it not only increases transgene expression but also facilitates transgene containment, which is ...

  9. Complex history of admixture during citrus domestication revealed by genome analysis

    Energy Technology Data Exchange (ETDEWEB)

    Wu, G. Albert; Prochnik, Simon; Jenkins, Jerry; Salse, Jerome; Hellsten, Uffe; Murat, Florent; Perrier, Xavier; Ruiz, Manuel; Scalabrin, Simone; Terol, Javier; Takita, Marco Aurélio; Labadie, Karine; Poulain, Julie; Couloux, Arnaud; Jabbari, Kamel; Cattonaro, Federica; Fabbro, Cristian Del; Pinosio, Sara; Zuccolo, Andrea; Chapman, Jarrod; Grimwood, Jane; Tadeo, Francisco; Estornell, Leandro H.; Muñoz-Sanz, Juan V.; Ibanez, Victoria; Herrero-Ortega, Amparo; Aleza, Pablo; Pérez, Julián Pérez; Ramon, Daniel; Brunel, Dominique; Luro, Francois; Chen, Chunxian; Farmerie, William G.; Desany, Brian; Kodira, Chinnappa; Mohiuddin, Mohammed; Harkins, Tim; Fredrikson, Karin; Burns, Paul; Lomsadze, Alexandre; Borodovsky, Mark; Reforgiato, Giuseppe; Freitas-Astua, Juliana; Quetier, Francis; Navarro, Luis; Roose, Mikeal; Wincker, Patrick; Schmutz, Jeremy; Morgante, Michele; Machado, Marcos Antonio; Talon, Manuel; Jaillon, Olivier; Ollitrault, Patrick; Gmitter, Frederick; Rokhsar, Daniel

    2014-06-30

    Although Citrus is the most globally significant tree fruit, its domestication history is poorly understood. Cultivated citrus types are believed to comprise selections from and/or hybrids of several wild progenitor species, but the identities of these progenitors, and their contribution to modern cultivars, remain controversial. Here we report the genomes of a collection of mandarins, pummelos, and oranges, including a high quality reference sequence from a haploid Clementine mandarin. By comparative genome analysis we show that these cultivated types can be derived from two progenitor species. Cultivated pummelos represent selections from a single progenitor species C. maxima. Unexpectedly, however, we find that cultivated mandarins are introgressions of C. maxima into a distinct second population that we identify with the ancestral wild mandarin species C. reticulata. Sweet and sour oranges are found to be interspecific hybrids. Sweet orange, the most widely cultivated citrus, arose as the offspring of previously admixed individuals. In contrast, sour (or Seville) orange is an F1 hybrid of pure C. maxima and C. reticulata parents, implying that wild mandarins were part of the early breeding germplasm. Surprisingly, we also find that a wild Chinese mandarin from Mangshan, China shows substantial sequence divergence from C. reticulata and appears to represent a distinct taxon. Understanding the relationships and phylogeny of cultivated citrus through genome analysis will clarify taxonomic relationships and enable previously inconceivable opportunities for sequence-directed genetic improvement. Citrus are widely consumed worldwide as juice or fresh fruit, providing important sources of vitamin C and other health-promoting compounds. Global production in 2012 exceeded 86 million metric tons, with an estimated value of US$9 billion (http://www.fas.usda.gov/psdonline/circulars/citrus.pdf). The very narrow genetic diversity of cultivated citrus makes it highly

  10. Analysis of SSR in Citrus Sequences from EMBL Database

    Institute of Scientific and Technical Information of China (English)

    MENG Hai-jun; CAO Qing-qin; HU Zhi-yong; LIU Gao-ping; CHENG Yun-jiang; DENG Xiu-xin

    2005-01-01

    The abundance of simple sequence repeats (SSRs) in Citrus sequences from the EMBL database was investigated using the computer program MISA (MIcroSAtellite), with the aim of providing useful information for the development of SSR markers. Among 32 896 sequences of Citrus, 4987 SSRs were found in 4167 sequences and the average distance between SSRs was approximately 3.5 kb. Mononucleotide repeats (50.6%) were the most abundant repeats. Di-, tri-, tetra-, penta- and hexa-nucleotide repeats were 22.8, 25.2, 1, 0.08, and 0.36%, respectively. The most abundant motif was A/T, followed in descending order by AG/CT, AC/GT, AT/TA, AAT/ATT, AAG/CTT, AGC/CGT, ACG/CTG and C/G. They comprised about 90% of all microsatellites. Ten primer pairs were designed, and three of them produced clear visible bands among Citrus and its related genera.

  11. Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome.

    Directory of Open Access Journals (Sweden)

    Jia Wang

    Full Text Available Citrus is one of the most important and widely grown fruit crops, with global production ranking first among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to the researchers of sweet orange and other woody plants, and is freely available at http://citrus.hzau.edu.cn/.

  12. Deep sequencing of viral small-RNAs of citrus tristeza virus (CTV) reveals genomic differences between two Italian isolates of CTV

    Science.gov (United States)

    A recent Citrus tristeza virus (CTV) epidemic of quick decline (QD) killed many sweet orange trees grafted on sour orange rootstock in Sicily but left some asymptomatic trees in the same field. Recent reports indicated cross-protection involves exclusion of a severe CTV strain by a mild strain of th...

  13. Yeast genome sequencing:

    DEFF Research Database (Denmark)

    Piskur, Jure; Langkjær, Rikke Breinhold

    2004-01-01

    For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...

  14. Unique Features of a Japanese ‘Candidatus Liberibacter asiaticus’ Strain Revealed by Whole Genome Sequencing

    OpenAIRE

    Hiroshi Katoh; Shin-Ichi Miyata; Hiromitsu Inoue; Toru Iwanami

    2014-01-01

    Citrus greening (huanglongbing) is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, 'Candidatus Liberibacter asiaticus', 'Ca. L. americanus', and 'Ca. L. africanus'. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol), in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative 'C...

  15. Genome-wide identification of sweet orange (Citrus sinensis) histone modification gene families and their expression analysis during the fruit development and fruit-blue mold infection process

    OpenAIRE

    Xu, Jidi; Xu, Haidan; Liu, Yuanlong; Wang, Xia; Xu, Qiang; Deng, Xiuxin

    2015-01-01

    In eukaryotes, histone acetylation and methylation have been known to be involved in regulating diverse developmental processes and plant defense. These histone modification events are controlled by a series of histone modification gene families. To date, there is no study regarding genome-wide characterization of histone modification related genes in citrus species. Based on the two recent sequenced sweet orange genome databases, a total of 136 CsHMs (Citrus sinensis histone modification gen...

  16. Genome Similarity Implies that Citrus-Parasitic Burrowing Nematodes do not Represent a Unique Species

    OpenAIRE

    Kaplan, D. T.; Opperman, C. H.

    1997-01-01

    Burrowing nematodes from Central America, Dominican Republic, Florida, Guadeloupe, Hawaii, and Puerto Rico were characterized for their ability to parasitize citrus, but citrus parasites were found only in Florida. Sequence tag sites originally amplified from a citrus-parasitic burrowing nematode were polymorphic among 37 burrowing nematode isolates and were not correlated with citrus parasitism, nematode isolate collection site, or amplification of a 2.4-kb sequence tag site (DK#1). Results ...

  17. Classifying Genomic Sequences by Sequence Feature Analysis

    Institute of Scientific and Technical Information of China (English)

    Zhi-Hua Liu; Dian Jiao; Xiao Sun

    2005-01-01

    Traditional sequence analysis depends on sequence alignment. In this study, we analyzed various functional regions of the human genome based on sequence features, including word frequency, dinucleotide relative abundance, and base-base correlation. We analyzed the human chromosome 22 and classified the upstream, exon, intron, downstream, and intergenic regions by principal component analysis and discriminant analysis of these features. The results show that we could classify the functional regions of genome based on sequence feature and discriminant analysis.
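    Two of the alignment-free features named in this record, word frequency and dinucleotide relative abundance, can be sketched as follows. The abstract gives no formulas, so this assumes the commonly used single-strand odds ratio rho_xy = f_xy / (f_x * f_y); the function names are illustrative and the paper's exact definitions may differ (for example, a strand-symmetrized variant).

```python
from collections import Counter
from itertools import product


def word_frequencies(seq, k):
    """Relative frequency of every overlapping k-mer made only of A, C, G, T."""
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    counts = {w: c for w, c in counts.items() if set(w) <= set("ACGT")}
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def dinucleotide_relative_abundance(seq):
    """Odds ratio rho_xy = f_xy / (f_x * f_y) for each dinucleotide xy.

    Values near 1 mean the dinucleotide occurs about as often as expected
    from base composition alone; dinucleotides whose bases are absent from
    the sequence are omitted.
    """
    f1 = word_frequencies(seq, 1)
    f2 = word_frequencies(seq, 2)
    rho = {}
    for x, y in product("ACGT", repeat=2):
        expected = f1.get(x, 0.0) * f1.get(y, 0.0)
        if expected > 0:
            rho[x + y] = f2.get(x + y, 0.0) / expected
    return rho


if __name__ == "__main__":
    for word, value in sorted(dinucleotide_relative_abundance("ACGTACGT").items()):
        print(word, round(value, 3))
```

    Each region (upstream, exon, intron and so on) would yield one such feature vector, which could then be passed to principal component or discriminant analysis.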

  18. A genome-wide 20 K citrus microarray for gene expression analysis

    OpenAIRE

    Gadea Jose; Forment Javier; Santiago Julia; Marques M Carmen; Juarez Jose; Mauri Nuria; Martinez-Godoy M Angeles

    2008-01-01

    Abstract Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-...

  19. A genome-wide 20 K citrus microarray for gene expression analysis

    OpenAIRE

    Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose

    2008-01-01

    Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA...

  20. SNP mining in C. clementina BAC end sequences; transferability in the Citrus genus (Rutaceae), phylogenetic inferences and perspectives for genetic mapping

    OpenAIRE

    Ollitrault Patrick; Terol Javier; Garcia-Lor Andres; Bérard Aurélie; Chauveau Aurélie; Froelicher Yann; Belzile Caroline; Morillon Raphaël; Navarro Luis; Brunel Dominique; Talon Manuel

    2012-01-01

    Abstract Background With the increasing availability of EST databases and whole genome sequences, SNPs have become the most abundant and powerful polymorphic markers. However, SNP chip data generally suffers from ascertainment biases caused by the SNP discovery and selection process in which a small number of individuals are used as discovery panels. The ongoing International Citrus Genome Consortium sequencing project of the highly heterozygous Clementine and sweet orange genomes will soon r...

  1. Genome Sequence Databases (Overview): Sequencing and Assembly

    Energy Technology Data Exchange (ETDEWEB)

    Lapidus, Alla L.

    2009-01-01

    From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three dimensional structure of the DNA molecule, biologists have tried to decode the secrets of life hidden within these long molecules, and technologists have invented and improved methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly of whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality (approximately 1 error/10,000 bp), validated through a number of computer and laboratory experiments.
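    To make the two consensus error rates quoted above concrete, the sketch below converts them into an expected error count and a Phred-style quality value. The 5 Mb genome size is a hypothetical example, and the Phred conversion (Q = -10 log10 p) is a standard convention applied here for illustration rather than something stated in the record.

```python
import math


def expected_errors(genome_bp, errors_per_bp):
    """Expected number of consensus errors for a genome of the given size."""
    return genome_bp * errors_per_bp


def phred_quality(errors_per_bp):
    """Phred-style quality value Q = -10 * log10(error probability)."""
    return -10 * math.log10(errors_per_bp)


if __name__ == "__main__":
    genome_bp = 5_000_000  # hypothetical 5 Mb microbial genome
    for label, rate in (("draft", 1 / 2000), ("finished", 1 / 10_000)):
        print(label,
              round(expected_errors(genome_bp, rate)),
              round(phred_quality(rate), 1))
```

    Under these assumptions, finishing reduces the expected error count fivefold, from about 2,500 to about 500 errors.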

  2. Taxonomy and phylogeny of the genus Citrus based on the nuclear ribosomal DNA ITS region sequence

    International Nuclear Information System (INIS)

    The genus Citrus (Aurantioideae, Rutaceae) is the sole source of the citrus fruits of commerce, which have high economic value. In this study, the taxonomy and phylogeny of Citrus species are evaluated using sequence analysis of the ITS region of nrDNA. This study is based on 26 plant materials belonging to 22 Citrus species, including wild, domesticated, and cultivated species. Through DNA alignment of the ITS sequence, the ITS1 and ITS2 regions showed relatively high variation in sequence length and nucleotide composition among these Citrus species. Relative to the six-tribe discrimination theory of Swingle and Reece, the grouping in our phylogenetic tree reconstructed from ITS sequences was related not to tribe discrimination but to species discrimination. However, the molecular analysis could provide more information on citrus taxonomy. Combined with ITS sequences of other subgenera in the true citrus fruit tree group, the ITS phylogenetic tree indicated that subgenus Citrus was monophyletic and nearer to Fortunella, Poncirus, and Clymenia than to Microcitrus and Eremocitrus. The abundant sequence variation of the ITS region shown in this study would help species identification and tribe differentiation of the genus Citrus. (author)

  3. Characterization of hemizygous deletions in Citrus using array-Comparative Genomic Hybridization and microsynteny comparisons with the poplar genome

    OpenAIRE

    Usach Antonio; Geraud Marion; Ruiz-Rivero Omar; Iglesias Domingo J; Naranjo Miguel A; Ríos Gabino; Talón Manuel

    2008-01-01

    Abstract Background Many fruit-tree species, including relevant Citrus spp varieties exhibit a reproductive biology that impairs breeding and strongly constrains genetic improvements. In citrus, juvenility increases the generation time while sexual sterility, inbreeding depression and self-incompatibility prevent the production of homozygous cultivars. Genomic technology may provide citrus researchers with a new set of tools to address these various restrictions. In this work, we report a val...

  4. The complete chloroplast genome sequence of Zanthoxylum piperitum.

    Science.gov (United States)

    Lee, Jonghoon; Lee, Hyeon Ju; Kim, Kyunghee; Lee, Sang-Choon; Sung, Sang Hyun; Yang, Tae-Jin

    2016-09-01

    The complete chloroplast genome sequence of Zanthoxylum piperitum, a plant species with useful aromatic oils in family Rutaceae, was generated in this study by de novo assembly with whole-genome sequence data. The chloroplast genome was 158 154 bp in length with a typical quadripartite structure containing a pair of inverted repeats of 27 644 bp, separated by large single copy and small single copy regions of 85 340 bp and 17 526 bp, respectively. The chloroplast genome harbored 112 genes consisting of 78 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Phylogenetic analysis of the complete chloroplast genome sequences with those of known relatives revealed that Z. piperitum is most closely related to the Citrus species. PMID:26260183
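    The figures in this record can be cross-checked with simple arithmetic: in a quadripartite plastome the total length is the large single copy region plus the small single copy region plus two copies of the inverted repeat. The class and field names below are illustrative; only the numbers come from the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QuadripartitePlastome:
    """Region lengths (bp) of a chloroplast genome with two inverted repeats."""
    lsc: int  # large single copy region
    ssc: int  # small single copy region
    ir: int   # one inverted repeat copy

    def total_length(self):
        # The inverted repeat is present twice in the circular genome.
        return self.lsc + self.ssc + 2 * self.ir


# Values reported for Zanthoxylum piperitum in the abstract above.
z_piperitum = QuadripartitePlastome(lsc=85_340, ssc=17_526, ir=27_644)
gene_counts = {"protein_coding": 78, "tRNA": 30, "rRNA": 4}

if __name__ == "__main__":
    print(z_piperitum.total_length())  # 158154, matching the reported length
    print(sum(gene_counts.values()))   # 112, matching the reported gene total
```

    Both sums agree with the totals stated in the abstract (158 154 bp and 112 genes).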

  5. Genome-Wide Comparative Analysis Reveals Similar Types of NBS Genes in Hybrid Citrus sinensis Genome and Original Citrus clementine Genome and Provides New Insights into Non-TIR NBS Genes

    Science.gov (United States)

    In this study, we identified and compared nucleotide-binding site (NBS) domain-containing genes from three Citrus genomes (C. clementina, C. sinensis from USA and C. sinensis from China). Phylogenetic analysis of all Citrus NBS genes across these three genomes revealed that there are three approxima...

  6. Sequence diversity on four ORFs of citrus tristeza virus correlates with pathogenicity

    Directory of Open Access Journals (Sweden)

    Martínez-Soriano Juan

    2009-07-01

    Full Text Available Abstract The molecular characterization of isolates of citrus tristeza virus (CTV) from eight locations in Mexico was undertaken by analyzing five regions located at the opposite ends of the virus genome. Two regions have been previously used to study CTV variability (coat protein and p23), while the other three correspond to other genomic segments (p349-B, p349-C and p13). Our comparative nucleotide analyses included CTV sequences from different geographical origins already deposited in the GenBank databases. The largest nucleotide differences were located in two fragments located at the 5' end of the genome (p349-B and p349-C). Phylogenetic analyses on those five regions showed that the degree of nucleotide divergence among strains tended to correlate with their pathogenicity. Two main groups were defined: mild, with almost no noticeable effects on the indicator plants, and severe, with drastic symptoms. Mild isolates clustered together in every analyzed ORF, sharing a genetic distance below 0.022, in contrast with the severe isolates, which showed a more dispersed distribution and a genetic distance of 0.276. Analyses of the p349-B and p349-C regions evidenced two lineages within the severe group: a severe common subgroup (most of the severe isolates) and a severe divergent subgroup (T36-like isolates). This study represents the first attempt to analyze the genetic variability of CTV in Mexico by constructing phylogenetic trees based on new genomic regions that use group-specific nucleotide and amino acid sequences. These results may be useful to implement specific assays for strain discrimination. Moreover, it would be an excellent reference for the CTV situation in Mexico to face the recent arrival of brown citrus aphid.

  7. Whole-Genome Sequence of “Candidatus Liberibacter asiaticus” from Guangdong, China

    OpenAIRE

    Z. Zheng; Deng, X.; J. Chen

    2014-01-01

    The draft genome sequence of “Candidatus Liberibacter asiaticus” strain A4, isolated from a mandarin citrus in Guangdong, People’s Republic of China, is reported. The A4 strain has a genome size of 1,208,625 bp, G+C content of 36.4%, 1,107 predicted open reading frames, and 53 RNA genes.

  8. Draft Genome Sequence of “Candidatus Liberibacter asiaticus” from Diaphorina citri in Guangdong, China

    OpenAIRE

    Wu, F.; Z. Zheng; Deng, X.; Cen, Y.; Liang, G; J. Chen

    2015-01-01

    The draft genome sequence of “Candidatus Liberibacter asiaticus” strain YCPsy from an Asian citrus psyllid (Diaphorina citri) in Guangdong, China, is reported here. The YCPsy strain has a genome size of 1,233,647 bp, 36.5% G+C content, 1,171 open reading frames (ORFs), and 53 RNAs.

  9. Draft Genome Sequence of "Candidatus Liberibacter asiaticus" from Diaphorina citri in Guangdong, China.

    Science.gov (United States)

    Wu, F; Zheng, Z; Deng, X; Cen, Y; Liang, G; Chen, J

    2015-01-01

    The draft genome sequence of "Candidatus Liberibacter asiaticus" strain YCPsy from an Asian citrus psyllid (Diaphorina citri) in Guangdong, China, is reported here. The YCPsy strain has a genome size of 1,233,647 bp, 36.5% G+C content, 1,171 open reading frames (ORFs), and 53 RNAs. PMID:26543132

  10. A genome-wide 20 K citrus microarray for gene expression analysis

    OpenAIRE

    Martínez-Godoy, M. Ángeles; Mauri, Nuria; Juárez, José; Marqués, M.Carmen; Santiago, Julia; Forment, Javier; Gadea Vacas, José

    2008-01-01

    Background: Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genomewide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results: We have designed and constructed a publicly available ...

  11. Defective RNAs of Citrus tristeza virus analogous to Crinivirus genomic RNAs

    International Nuclear Information System (INIS)

    The family Closteroviridae includes the genera Closterovirus and Ampelovirus with monopartite genomes and the genus Crinivirus with bipartite genomes. Plants infected with the Closterovirus Citrus tristeza virus (CTV) often contain one or more populations of defective RNAs (dRNAs). Although most dRNAs are comparatively small (2-5 kb), consisting of the genomic RNA termini with large internal deletions, we recently characterized large dRNAs of ∼12 kb that retained the open reading frames (ORFs) 1a plus 1b. These were self-replicating RNAs and appeared to be analogous to the genomic RNA 1 of the bipartite criniviruses. The present report describes the finding of an additional group of large dRNAs (LdRNAs) that retained all or most of the 10 3' ORFs and appeared to be analogous to genomic RNA 2 of criniviruses. Isolates associated with LdRNAs were found associated with double-recombinant dRNAs (DR-dRNAs) of various sizes (1.7 to 5.1 kb) that comprised the two termini and a noncontiguous internal sequence from ORF2. The genetic and epidemiological implications of the architectural identities of LdRNAs and DR-dRNAs and their apparent analogy with the genomic RNA 2 of criniviruses are discussed.

  12. Nucleotide heterogeneity at the genomic 5’- and 3’-termini of California (CA) isolates of Citrus tristeza virus (CTV)

    Science.gov (United States)

    Nucleotide (nt) sequences in the genomic ends of sense (+)-RNA viruses serve essential biological functions and are important considerations in the construction of infectious clones. Two isolates of Citrus tristeza virus (CTV) from California (CA) having a T30- and a T36-genotype were inoculated in ...

  13. Unique Features of a Japanese ‘Candidatus Liberibacter asiaticus’ Strain Revealed by Whole Genome Sequencing

    OpenAIRE

    Katoh, Hiroshi; Miyata, Shin-ichi; Inoue, Hiromitsu; Iwanami, Toru

    2014-01-01

    Citrus greening (huanglongbing) is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, ‘Candidatus Liberibacter asiaticus’, ‘Ca. L. americanus’, and ‘Ca. L. africanus’. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol), in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative ‘C...

  14. A Phylogenetic Analysis of 34 Chloroplast Genomes Elucidates the Relationships between Wild and Domestic Species within the Genus Citrus

    OpenAIRE

    Carbonell-Caballero, Jose; Alonso, Roberto; Ibañez, Victoria; Terol, Javier; Talon, Manuel; Dopazo, Joaquin

    2015-01-01

    Citrus genus includes some of the most important cultivated fruit trees worldwide. Despite being extensively studied because of its commercial relevance, the origin of cultivated citrus species and the history of its domestication still remain an open question. Here, we present a phylogenetic analysis of the chloroplast genomes of 34 citrus genotypes which constitutes the most comprehensive and detailed study to date on the evolution and variability of the genus Citrus. A statistical model wa...

  15. A reference genetic map of C. clementina hort. ex Tan.; citrus evolution inferences from comparative mapping

    OpenAIRE

    Ollitrault Patrick; Terol Javier; Chen Chunxian; Federici Claire T; Lotfy Samia; Hippolyte Isabelle; Ollitrault Frédérique; Bérard Aurélie; Chauveau Aurélie; Cuenca Jose; Costantino Gilles; Kacar Yildiz; Mu Lisa; Garcia-Lor Andres; Froelicher Yann

    2012-01-01

    Abstract Background Most modern citrus cultivars have an interspecific origin. As a foundational step towards deciphering the interspecific genome structures, a reference whole genome sequence was produced by the International Citrus Genome Consortium from a haploid derived from Clementine mandarin. The availability of a saturated genetic map of Clementine was identified as an essential prerequisite to assist the whole genome sequence assembl...

  16. Characterization of an ATP/ADP translocase in the citrus huanglongbing bacterium, Candidatus Liberibacter

    Science.gov (United States)

    Citrus huanglongbing (HLB), a disease currently threatening the citrus industry worldwide, has been associated with three different species of Alphaproteobacteria known as Candidatus Liberibacter. A complete genome sequence was recently obtained via metagenomics for Ca. L. asiaticus (Las), the prom...

  17. Genome-Wide Comparative Analysis Reveals Similar Types of NBS Genes in Hybrid Citrus sinensis Genome and Original Citrus clementine Genome and Provides New Insights into Non-TIR NBS Genes

    OpenAIRE

    Wang, Yunsheng; Zhou, Lijuan; Li, Dazhi; Dai, Liangying; Lawton-Rauh, Amy; Srimani, Pradip K.; Duan, Yongping; Luo, Feng

    2015-01-01

    In this study, we identified and compared nucleotide-binding site (NBS) domain-containing genes from three Citrus genomes (C. clementina, C. sinensis from USA and C. sinensis from China). Phylogenetic analysis of all Citrus NBS genes across these three genomes revealed that there are three approximately evenly numbered groups: one group contains the Toll-Interleukin receptor (TIR) domain and two different Non-TIR groups in which most of proteins contain the Coiled Coil (CC) domain. Motif anal...

  18. Characterization of hemizygous deletions in Citrus using array-Comparative Genomic Hybridization and microsynteny comparisons with the poplar genome

    Directory of Open Access Journals (Sweden)

    Usach Antonio

    2008-08-01

    Background: Many fruit-tree species, including relevant Citrus spp. varieties, exhibit a reproductive biology that impairs breeding and strongly constrains genetic improvement. In citrus, juvenility increases the generation time, while sexual sterility, inbreeding depression and self-incompatibility prevent the production of homozygous cultivars. Genomic technology may provide citrus researchers with a new set of tools to address these various restrictions. In this work, we report a valuable genomics-based protocol for the structural analysis of deletion mutations on a heterozygous background. Results: Two independent fast neutron mutants of self-incompatible clementine (Citrus clementina Hort. ex Tan. cv. Clemenules) were the subject of the study. Both mutants, named 39B3 and 39E7, were expected to carry DNA deletions in hemizygous dosage. Array-based Comparative Genomic Hybridization (array-CGH) using a Citrus cDNA microarray allowed the identification of underrepresented genes in these two mutants. Subsequent comparison of citrus deleted genes with annotated plant genomes, especially poplar, made it possible to predict the presence of a large deletion in 39B3 of about 700 kb and at least two deletions of approximately 100 and 500 kb in 39E7. The deletion in 39B3 was further characterized by PCR on available Citrus BACs, which helped us to build a partial physical map of the deletion. Among the deleted genes, a ClpC-like gene coding for a putative subunit of a multifunctional chloroplastic protease involved in the regulation of chlorophyll b synthesis was directly related to the mutated phenotype, since the mutant showed a reduced chlorophyll a/b ratio in green tissues. Conclusion: In this work, we report the use of array-CGH for the successful identification of genes included in a hemizygous deletion induced by fast neutron irradiation on Citrus clementina. The study of gene content and order into the 39B3 deletion also led to the unexpected

  19. Determination of genomic diversity within mutant lemon (Citrus limon L.) and mandarin (Citrus reticulata) using molecular markers

    International Nuclear Information System (INIS)

    Mutation breeding is considered a successful way to obtain new cultivars and to expand the genetic base of new lemon and mandarin varieties in Citrus. In this study, different doses of gamma radiation (60Co) were used to generate novel mutants of the common varieties Yerli Yuvarlak lemon and Yerli mandarin. The differences within the mutants were examined using simple sequence repeat (SSR) and sequence-related amplified polymorphism (SRAP) markers. A total of 26 SSR primers and 28 SRAP primer combinations were screened for linkage to the different traits. Six SSR primers showed polymorphism and were closely linked to shoot tip color in lemon, whereas none of the SRAP primer combinations produced any scorable polymorphic bands in either Yerli Yuvarlak lemon or Yerli mandarin. In this study, SSR and SRAP molecular markers were used together for the first time worldwide to determine genetic differences in citrus mutation breeding. (author)

  20. The ARF, AUX/IAA and GH3 gene families in citrus: genome-wide identification and expression analysis during fruitlet drop from abscission zone A.

    Science.gov (United States)

    Xie, Rangjin; Pang, Shaoping; Ma, Yanyan; Deng, Lie; He, Shaolan; Yi, Shilai; Lv, Qiang; Zheng, Yongqiang

    2015-12-01

    Completion of the whole genome sequencing of citrus enabled us to perform genome-wide identification and functional analysis of the gene families involved in agronomic traits and morphological diversity of citrus. In this study, 22 CitARF, 11 CitGH3 and 26 CitAUX/IAA genes were identified in citrus. Phylogenetic analysis revealed that the genes of each family could be subdivided into three groups and showed strong evolutionary conservation. The GH3 and AUX/IAA gene families shrank, and the ARF gene family was highly conserved, in the citrus genome after speciation from Arabidopsis thaliana. Tissue-specific expression profiles revealed that 54 genes were expressed in at least one tissue, while just 5 genes (CitARF07, CitARF20, CitGH3.04, CitAUX/IAA25 and CitAUX/IAA26) had very low expression levels in all tissues tested, suggesting that the CitARF, CitGH3 and CitAUX/IAA gene families play important roles in the development of citrus organs. In addition, our data showed that the expression of 2 CitARF, 4 CitGH3 and 4 AUX/IAA genes was affected by IAA treatment, and that 7 genes, including CitGH3.04, CitGH3.07, CitAUX/IAA03, CitAUX/IAA04, CitAUX/IAA18, CitAUX/IAA19 and CitAUX/IAA23, were related to fruitlet abscission. This study provides a foundation for future studies elucidating the precise role of citrus ARF, GH3 and AUX/IAA genes in the early steps of auxin signal transduction and opens up a new opportunity to uncover the molecular mechanism underlying citrus fruitlet abscission. PMID:25982744

  1. Sequencing and annotation of the Wolbachia endosymbiont of Diaphorina citri by the CG-HLB Genome Resources group reveals candidate sources of interaction with the insect host

    OpenAIRE

    Saha, Surya; Hunter, Wayne; Lindeberg, Magdalen

    2014-01-01

    The Citrus Greening – Huanglongbing (CG-HLB) Genome Resources group serves as a bioinformatics resource for diverse projects related to the biology of CG-HLB. A major recent project concerns the generation and annotation of a draft genome sequence for the Wolbachia endosymbiont (wDi) of the Asian citrus psyllid, of particular interest given the potential for control of psyllid behavior through manipulation of its bacterial endosymbionts. The Wolbachia draft genome was assembled and contigs...

  2. Breeding, genetic and genomic of citrus for disease resistance

    OpenAIRE

    Machado, Marcos A; Mariângela Cristofani-Yaly; Marinês Bastianel

    2011-01-01

    Although citriculture is one of the most important economic activities in Brazil, it is based on a small number of varieties. This has contributed to the crop's vulnerability to phytosanitary problems. A larger number of varieties/genotypes with potential for commercial growing, either for industry or for the fresh market, has been one of the main objectives of citrus breeding programs. The genetic breeding of citrus has improved, in the last decades, due to the possi...

  3. Comparative Analysis of SSR and SRAP Sequence Divergence in Citrus Germplasm

    Directory of Open Access Journals (Sweden)

    Mohamed Hamdy Amar

    2012-01-01

    In the present study, we sought to determine whether one simple criterion, sequence divergence, can reasonably guide phylogenetic analysis across a broad scale in Citrus germplasm. The performance of SSR and SRAP markers in phylogenetic analysis was compared by sequencing the PCR products in the genus Citrus and its relatives. The maximum composite likelihood model was used for pairwise distance calculation. To determine whether the results depend on the method chosen, phylogenetic trees were constructed using two algorithms, Neighbor Joining (NJ) and Maximum Parsimony (MP), in MEGA 4 software. Some differences in the positioning of some genotypes were observed between the phylogenetic trees created using the two methods, and the MP dendrogram based on SRAP sequences was the most congruent with Swingle and Reece's treatment of the subfamily Aurantioideae. The results of the present study suggest that evaluating SRAP variation at the sequence level can be more effective than SSR variation in exploring the evolutionary relationships among Citrus species. These results provide new information for future Citrus breeding studies, such as germplasm characterization, screening of zygotic and nucellar seedlings, and the study of sequence divergence in Citrus and its relatives.

  4. Sequence Analysis of a 282-Kilobase Region Surrounding the Citrus Tristeza Virus Resistance Gene (Ctv) Locus in Poncirus trifoliata L. Raf.

    Science.gov (United States)

    Yang, Zhong-Nan; Ye, Xin-Rong; Molina, Joe; Roose, Mikeal L.; Mirkov, T. Erik

    2003-01-01

    Citrus tristeza virus (CTV) is the major virus pathogen causing significant economic damage to citrus worldwide, and a single dominant gene, Ctv, provides broad spectrum resistance to CTV in Poncirus trifoliata L. Raf. Ctv was physically mapped to a 282-kb region using a P. trifoliata bacterial artificial chromosome library. This region was completely sequenced to about 8× coverage using a shotgun sequencing strategy and primer walking for gap closure. Sequence analysis predicts 22 putative genes, two mutator-like transposons and eight retrotransposons. This sequence analysis also revealed some interesting features of this region of the P. trifoliata genome: a disease resistance gene cluster with seven members and eight retrotransposons clustered in a 125-kb gene-poor region. Comparative sequence analysis suggests that six genes in the Ctv region have significant sequence similarity with their orthologs in bacterial artificial chromosome clones F7H2 and F21T11 from Arabidopsis chromosome I. However, the analysis of gene colinearity between P. trifoliata and Arabidopsis indicates that Arabidopsis genome sequence information may be of limited use for positional gene cloning in P. trifoliata and citrus. Analysis of candidate genes for Ctv is also discussed. PMID:12586873
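
    As a rough worked example of the coverage figure quoted above (about 8× over a 282-kb region), the sketch below estimates how many shotgun reads such a depth implies. The 500-bp read length is an assumption chosen purely for illustration, since the abstract does not state it, and the function name is hypothetical.

```python
import math


def reads_for_coverage(region_bp, coverage, read_len_bp):
    """Smallest read count with reads * read_len_bp / region_bp >= coverage.

    Average depth is total sequenced bases divided by region length,
    so the read count follows by rearranging that ratio.
    """
    if region_bp <= 0 or read_len_bp <= 0 or coverage < 0:
        raise ValueError("lengths must be positive and coverage non-negative")
    return math.ceil(region_bp * coverage / read_len_bp)


if __name__ == "__main__":
    # 282 kb at 8x with an ASSUMED 500-bp read length (not from the abstract)
    print(reads_for_coverage(282_000, 8, 500))  # prints 4512
```

    The estimate is a lower bound on effort: it ignores failed reads and the uneven read distribution that made primer walking necessary for gap closure.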

  5. Sequence analysis of a 282-kilobase region surrounding the citrus Tristeza virus resistance gene (Ctv) locus in Poncirus trifoliata L. Raf.

    Science.gov (United States)

    Yang, Zhong-Nan; Ye, Xin-Rong; Molina, Joe; Roose, Mikeal L; Mirkov, T Erik

    2003-02-01

    Citrus tristeza virus (CTV) is the major virus pathogen causing significant economic damage to citrus worldwide, and a single dominant gene, Ctv, provides broad spectrum resistance to CTV in Poncirus trifoliata L. Raf. Ctv was physically mapped to a 282-kb region using a P. trifoliata bacterial artificial chromosome library. This region was completely sequenced to about 8x coverage using a shotgun sequencing strategy and primer walking for gap closure. Sequence analysis predicts 22 putative genes, two mutator-like transposons and eight retrotransposons. This sequence analysis also revealed some interesting features of this region of the P. trifoliata genome: a disease resistance gene cluster with seven members and eight retrotransposons clustered in a 125-kb gene-poor region. Comparative sequence analysis suggests that six genes in the Ctv region have significant sequence similarity with their orthologs in bacterial artificial chromosome clones F7H2 and F21T11 from Arabidopsis chromosome I. However, the analysis of gene colinearity between P. trifoliata and Arabidopsis indicates that Arabidopsis genome sequence information may be of limited use for positional gene cloning in P. trifoliata and citrus. Analysis of candidate genes for Ctv is also discussed. PMID:12586873

  6. The diploid genome sequence of Candida albicans

    OpenAIRE

    Jones, Ted; Federspiel, Nancy A.; Chibana, Hiroji; Dungan, Jan; Kalman, Sue; Magee, B. B.; Newport, George; Thorstenson, Yvonne R.; Agabian, Nina; Magee, P T; Davis, Ronald W.; Scherer, Stewart

    2004-01-01

    We present the diploid genome sequence of the fungal pathogen Candida albicans. Because C. albicans has no known haploid or homozygous form, sequencing was performed as a whole-genome shotgun of the heterozygous diploid genome in strain SC5314, a clinical isolate that is the parent of strains widely used for molecular analysis. We developed computational methods to assemble a diploid genome sequence in good agreement with available physical mapping data. We provide a whole-genome description ...

  7. Development in Rice Genome Research Based on Accurate Genome Sequence

    OpenAIRE

    2008-01-01

    Rice is one of the most important crops in the world. Although genetic improvement is a key technology for the acceleration of rice breeding, a lack of genome information had restricted efforts in molecular-based breeding until the completion of the high-quality rice genome sequence, which opened new opportunities for research in various areas of genomics. The syntenic relationship of the rice genome to other cereal genomes makes the rice genome invaluable for understanding how cereal genomes...

  8. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis)

    Directory of Open Access Journals (Sweden)

    si-bei eli

    2015-03-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus has not been available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologues of auxin response genes. A total of 19 non-redundant ARF genes (CiARF) were identified and validated from the sweet orange genome. A comprehensive overview of the CiARF gene family was undertaken, including the gene structures, phylogeny, chromosome locations, conserved motifs, and cis-elements in promoter sequences. Furthermore, expression profiling using real-time PCR revealed expression of many CiARF genes, albeit with different patterns depending on tissue type and/or developmental stage. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid and N-1-naphthylphthalamic acid treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members in citrus growth and development.

  9. Value of a newly sequenced bacterial genome

    DEFF Research Database (Denmark)

    Barbosa, Eudes; Aburjaile, Flavia F; Ramos, Rommel Tj;

    2014-01-01

    Next-generation sequencing (NGS) technologies have made high-throughput sequencing available to medium- and small-size laboratories, culminating in a tidal wave of genomic information. The quantity of sequenced bacterial genomes has not only brought excitement to the field of genomics but also heightened expectations that NGS would boost antibacterial discovery and vaccine development. Although many possible drug and vaccine targets have been discovered, the success rate of genome-based analysis has remained below expectations. Furthermore, NGS has had consequences for genome quality, resulting in an exponential increase in draft (partial data) genome deposits in public databases. If no further interests are expressed for a particular bacterial genome, it is more likely that the sequencing of its genome will be limited to a draft stage, and the painstaking tasks of completing the sequencing of its genome...

  10. Sequencing intractable DNA to close microbial genomes.

    Directory of Open Access Journals (Sweden)

    Richard A Hurt

    Advancement in high-throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult-to-sequence regions in microbial genomes are ruled "intractable", resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC-rich secondary structures, making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation and provide a step-wise method that reduces the effort required for genome finishing.
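
    The abstract attributes the intractable gaps to GC-rich flanking sequence. As an illustrative aid only, and not the authors' procedure, the sketch below scans a sequence with a sliding window and reports windows whose GC fraction meets a cutoff. The window size, step, and 0.7 threshold are assumed values, and both function names are hypothetical.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def gc_rich_windows(seq, window=20, step=5, threshold=0.7):
    """Return (start, gc_fraction) for each window at or above threshold.

    Windows start every `step` bases; a sequence shorter than `window`
    is evaluated once as a single shorter window.
    """
    hits = []
    last_start = max(len(seq) - window, 0)
    for start in range(0, last_start + 1, step):
        frac = gc_fraction(seq[start:start + window])
        if frac >= threshold:
            hits.append((start, frac))
    return hits


if __name__ == "__main__":
    toy = "AT" * 10 + "GC" * 10  # AT-only half followed by GC-only half
    print(gc_rich_windows(toy, window=20, step=20))  # prints [(20, 1.0)]
```

    GC fraction alone does not predict secondary structure; it is only a cheap first screen for regions that might resist amplification.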

  11. Fungal genome sequencing: basic biology to biotechnology.

    Science.gov (United States)

    Sharma, Krishna Kant

    2016-08-01

    The genome sequences provide a first glimpse into the genomic basis of the biological diversity of filamentous fungi and yeast. The genome sequence of the budding yeast, Saccharomyces cerevisiae, with a small genome size, unicellular growth, and rich history of genetic and molecular analyses was a milestone of early genomics in the 1990s. The subsequent completion of fission yeast, Schizosaccharomyces pombe and genetic model, Neurospora crassa initiated a revolution in the genomics of the fungal kingdom. In due course of time, a substantial number of fungal genomes have been sequenced and publicly released, representing the widest sampling of genomes from any eukaryotic kingdom. An ambitious genome-sequencing program provides a wealth of data on metabolic diversity within the fungal kingdom, thereby enhancing research into medical science, agriculture science, ecology, bioremediation, bioenergy, and the biotechnology industry. Fungal genomics have higher potential to positively affect human health, environmental health, and the planet's stored energy. With a significant increase in sequenced fungal genomes, the known diversity of genes encoding organic acids, antibiotics, enzymes, and their pathways has increased exponentially. Currently, over a hundred fungal genome sequences are publicly available; however, no inclusive review has been published. This review is an initiative to address the significance of the fungal genome-sequencing program and provides the road map for basic and applied research. PMID:25721271

  12. Deep-sequencing revealed Citrus bark cracking viroid (CBCVd) as a highly aggressive pathogen on hop

    Czech Academy of Sciences Publication Activity Database

    Jakše, J.; Radišek, S.; Pokorn, T.; Matoušek, Jaroslav; Javornik, B.

    2015-01-01

    Vol. 64, No. 4 (2015), pp. 831-842. ISSN 0032-0862. R&D Projects: GA MŠk(CZ) LH14255. Institutional support: RVO:60077344. Keywords: Bioinformatic; Citrus bark cracking viroid; Hop; Next-generation sequencing. Subject RIV: EB - Genetics; Molecular Biology. Impact factor: 2.121 (year: 2014).

  13. SNP mining in C. clementina BAC end sequences; transferability in the Citrus genus (Rutaceae), phylogenetic inferences and perspectives for genetic mapping

    Directory of Open Access Journals (Sweden)

    Ollitrault Patrick

    2012-01-01

    Background: With the increasing availability of EST databases and whole genome sequences, SNPs have become the most abundant and powerful polymorphic markers. However, SNP chip data generally suffer from ascertainment biases caused by the SNP discovery and selection process, in which a small number of individuals are used as discovery panels. The ongoing International Citrus Genome Consortium sequencing project of the highly heterozygous Clementine and sweet orange genomes will soon result in the release of several hundred thousand SNPs. The primary goals of this study were: (i) to estimate the transferability within the genus Citrus of SNPs discovered from Clementine BAC-end sequencing (BES), (ii) to estimate bias associated with the very narrow discovery panel, and (iii) to evaluate the usefulness of the Clementine-derived SNP markers for diversity analysis and comparative mapping studies between the different cultivated Citrus species. Results: Fifty-four accessions covering the main Citrus species and 52 interspecific hybrids between pummelo and Clementine were genotyped on a GoldenGate array platform using 1,457 SNPs mined from Clementine BES and 37 SNPs identified between and within C. maxima, C. medica, C. reticulata and C. micrantha. Consistent results were obtained from 622 SNP loci. Of these markers, 116 displayed incomplete transferability, primarily in C. medica, C. maxima and wild Citrus species. The two primary biases associated with the SNP mining in Clementine were an overestimation of the C. reticulata diversity and an underestimation of the interspecific differentiation. However, the genetic stratification of the gene pool was high, with very frequent significant linkage disequilibrium. Furthermore, the shared intraspecific polymorphism and accession heterozygosity were generally enough to perform interspecific comparative genetic mapping. Conclusions: A set of 622 SNP markers providing consistent results was selected.
Of the

  14. Draft Genome Sequence of Lactobacillus rhamnosus 2166.

    OpenAIRE

    Karlyshev, Andrey V.; Melnikov, Vyacheslav G.; Kosarev, Igor V.; Abramov, Vyacheslav M.

    2014-01-01

    In this report, we present a draft sequence of the genome of Lactobacillus rhamnosus strain 2166, a potential novel probiotic. Genome annotation and read mapping onto a reference genome of L. rhamnosus strain GG allowed for the identification of the differences and similarities in the genomic contents and gene arrangements of these strains.

  15. Value of a newly sequenced bacterial genome.

    Science.gov (United States)

    Barbosa, Eudes Gv; Aburjaile, Flavia F; Ramos, Rommel Tj; Carneiro, Adriana R; Le Loir, Yves; Baumbach, Jan; Miyoshi, Anderson; Silva, Artur; Azevedo, Vasco

    2014-05-26

    Next-generation sequencing (NGS) technologies have made high-throughput sequencing available to medium- and small-size laboratories, culminating in a tidal wave of genomic information. The quantity of sequenced bacterial genomes has not only brought excitement to the field of genomics but also heightened expectations that NGS would boost antibacterial discovery and vaccine development. Although many possible drug and vaccine targets have been discovered, the success rate of genome-based analysis has remained below expectations. Furthermore, NGS has had consequences for genome quality, resulting in an exponential increase in draft (partial data) genome deposits in public databases. If no further interests are expressed for a particular bacterial genome, it is more likely that the sequencing of its genome will be limited to a draft stage, and the painstaking tasks of completing the sequencing of its genome and annotation will not be undertaken. It is important to know what is lost when we settle for a draft genome and to determine the "scientific value" of a newly sequenced genome. This review addresses the expected impact of newly sequenced genomes on antibacterial discovery and vaccinology. Also, it discusses the factors that could be leading to the increase in the number of draft deposits and the consequent loss of relevant biological information. PMID:24921006

  16. Value of a newly sequenced bacterial genome

    Institute of Scientific and Technical Information of China (English)

    Eudes GV Barbosa; Flavia F Aburjaile; Rommel TJ Ramos; Adriana R Carneiro; Yves Le Loir; Jan Baumbach; Anderson Miyoshi; Artur Silva; Vasco Azevedo

    2014-01-01

    Next-generation sequencing (NGS) technologies have made high-throughput sequencing available to medium- and small-size laboratories, culminating in a tidal wave of genomic information. The quantity of sequenced bacterial genomes has not only brought excitement to the field of genomics but also heightened expectations that NGS would boost antibacterial discovery and vaccine development. Although many possible drug and vaccine targets have been discovered, the success rate of genome-based analysis has remained below expectations. Furthermore, NGS has had consequences for genome quality, resulting in an exponential increase in draft (partial data) genome deposits in public databases. If no further interests are expressed for a particular bacterial genome, it is more likely that the sequencing of its genome will be limited to a draft stage, and the painstaking tasks of completing the sequencing of its genome and annotation will not be undertaken. It is important to know what is lost when we settle for a draft genome and to determine the "scientific value" of a newly sequenced genome. This review addresses the expected impact of newly sequenced genomes on antibacterial discovery and vaccinology. Also, it discusses the factors that could be leading to the increase in the number of draft deposits and the consequent loss of relevant biological information.

  17. Accurate and comprehensive sequencing of personal genomes

    OpenAIRE

    Ajay, Subramanian S.; Parker, Stephen C.J.; Ozel Abaan, Hatice; Fuentes Fajardo, Karin V.; Margulies, Elliott H.

    2011-01-01

    As whole-genome sequencing becomes commoditized and we begin to sequence and analyze personal genomes for clinical and diagnostic purposes, it is necessary to understand what constitutes a complete sequencing experiment for determining genotypes and detecting single-nucleotide variants. Here, we show that the current recommendation of ∼30× coverage is not adequate to produce genotype calls across a large fraction of the genome with acceptably low error rates. Our results are based on analyses...

  18. Automated correction of genome sequence errors

    OpenAIRE

    Gajer, Pawel; Schatz, Michael; Salzberg, Steven L

    2004-01-01

    By using information from an assembly of a genome, a new program called AutoEditor significantly improves base calling accuracy over that achieved by previous algorithms. This in turn improves the overall accuracy of genome sequences and facilitates the use of these sequences for polymorphism discovery. We describe the algorithm and its application in a large set of recent genome sequencing projects. The number of erroneous base calls in these projects was reduced by 80%. In an analysis of ov...

  19. Sequence Maneuverer: tool for sequence extraction from genomes

    OpenAIRE

    Yasmin, Tayyaba; Rehman, Inayat Ur; Ansari, Adnan Ahmad; Liaqat, Khurrum; Khan, Muhammad Irfan

    2012-01-01

    The availability of genomic sequences of many organisms has opened new challenges in many areas, particularly in genome analysis. Sequence extraction is a vital step, and many tools have been developed to address it. These tools are publicly available but have limitations with respect to sequence extraction, the length of the sequence to be extracted, organism specificity and the lack of a user-friendly interface. We have developed a Java-based software package having three modul...

  20. A Phylogenetic Analysis of 34 Chloroplast Genomes Elucidates the Relationships between Wild and Domestic Species within the Genus Citrus.

    Science.gov (United States)

    Carbonell-Caballero, Jose; Alonso, Roberto; Ibañez, Victoria; Terol, Javier; Talon, Manuel; Dopazo, Joaquin

    2015-08-01

    Citrus genus includes some of the most important cultivated fruit trees worldwide. Despite being extensively studied because of its commercial relevance, the origin of cultivated citrus species and the history of its domestication still remain an open question. Here, we present a phylogenetic analysis of the chloroplast genomes of 34 citrus genotypes which constitutes the most comprehensive and detailed study to date on the evolution and variability of the genus Citrus. A statistical model was used to estimate divergence times between the major citrus groups. Additionally, a complete map of the variability across the genome of different citrus species was produced, including single nucleotide variants, heteroplasmic positions, indels (insertions and deletions), and large structural variants. The distribution of all these variants provided further independent support to the phylogeny obtained. An unexpected finding was the high level of heteroplasmy found in several of the analyzed genomes. The use of the complete chloroplast DNA not only paves the way for a better understanding of the phylogenetic relationships within the Citrus genus but also provides original insights into other elusive evolutionary processes, such as chloroplast inheritance, heteroplasmy, and gene selection. PMID:25873589

  1. Towards a reference pecan genome sequence

    Science.gov (United States)

    The cost of generating DNA sequence data has declined dramatically over the previous 15 years as a result of the Human Genome Project and the potential applications of genome sequencing for human medicine. This cost reduction has generated renewed interest among crop breeding scientists in applying...

  2. De novo Transcriptome Analysis of Chinese Citrus Fly, Bactrocera minax (Diptera: Tephritidae), by High-Throughput Illumina Sequencing

    Science.gov (United States)

    Wang, Jia; Xiong, Ke-Cai; Liu, Ying-Hong

    2016-01-01

    The Chinese citrus fly, Bactrocera minax (Enderlein), is one of the most devastating pests of citrus in the temperate areas of Asia. So far, studies involving molecular biology and physiology of B. minax are still scarce, partly because of the lack of genomic information and the inability to rear this insect in the laboratory. In this study, de novo assembly of a transcriptome was performed using Illumina sequencing technology. A total of 20,928,907 clean reads were obtained and assembled into 33,324 unigenes, with an average length of 908.44 bp. Unigenes were annotated by alignment against the NCBI non-redundant protein (Nr), Swiss-Prot, Clusters of Orthologous Groups (COG), Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) databases. Genes potentially involved in stress tolerance, including 20 heat shock protein (Hsps) genes, 26 glutathione S-transferases (GSTs) genes, and 2 ferritin subunit genes, were identified. These genes may play roles in stress tolerance during the B. minax diapause stage. It has previously been found that 20E application on B. minax pupae could avert diapause, but the underlying mechanisms remain unknown. Thus, genes encoding enzymes in the 20E biosynthesis pathway, including Neverland, Spook, Phantom, Disembodied, Shadow, Shade, and Cyp18a1, and genes encoding 20E receptor proteins, ecdysone receptor (EcR) and ultraspiracle (USP), were identified. The expression patterns of 20E-related genes among developmental stages and between 20E-treated and untreated pupae demonstrated their roles in the diapause program. In addition, 1,909 simple sequence repeats (SSRs) were detected, which will contribute to molecular marker development. The findings in this study greatly improve our genetic understanding of B. minax, and lay the foundation for future studies on this species. PMID:27331903

  3. Genome-wide identification and expression analysis of the polyamine oxidase gene family in sweet orange (Citrus sinensis).

    Science.gov (United States)

    Wang, Wei; Liu, Ji-Hong

    2015-01-25

    Polyamine oxidases (PAOs) are FAD-dependent enzymes associated with polyamine catabolism. In plants, increasing evidence supports that PAO genes play essential roles in abiotic and biotic stress responses. In this study, six putative PAO genes (CsPAO1-CsPAO6) were identified in sweet orange (Citrus sinensis) using the released citrus genome sequences. A total of 203 putative cis-regulatory elements involved in hormone and stress response were predicted in 1.5-kb promoter regions upstream of the CsPAOs. The CsPAOs can be divided into four major groups, with organizations similar to those of their counterparts in Arabidopsis thaliana. Transcripts of CsPAOs were detected in leaf, stem, cotyledon, and root, with the highest levels detected in the roots. The CsPAOs displayed various responses to exogenous treatments with polyamines and ABA and were differentially altered by abiotic stresses, including cold, salt, and mannitol. Overexpression of CsPAO3 in tobacco demonstrated that spermidine and spermine were decreased in the transgenic line, while putrescine was significantly enhanced, implying a potential role of this gene in polyamine back conversion. These data provide valuable knowledge for understanding the roles of the PAO genes in the future. PMID:25445392

  4. Sequence Maneuverer: tool for sequence extraction from genomes

    Science.gov (United States)

    Yasmin, Tayyaba; Rehman, Inayat Ur; Ansari, Adnan Ahmad; liaqat, Khurrum; khan, Muhammad Irfan

    2012-01-01

    The availability of genomic sequences of many organisms has opened new challenges in many aspects, particularly in terms of genome analysis. Sequence extraction is a vital step and many tools have been developed to solve this issue. These tools are available publicly but have limitations with reference to the sequence extraction, length of the sequence to be extracted, organism specificity and lack of a user-friendly interface. We have developed a Java-based software package having three modules which can be used independently or sequentially. The tool efficiently extracts sequences from large datasets in a few simple steps. It can efficiently extract multiple sequences of any desired length from a genome of any organism. The results are crosschecked against published data. Availability URL 1: http://ww3.comsats.edu.pk/bio/ResearchProjects.aspx URL 2: http://ww3.comsats.edu.pk/bio/SequenceManeuverer.aspx PMID:23275734
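
    The kind of extraction described in this record can be illustrated with a minimal Python sketch. It is not the Sequence Maneuverer implementation (which the abstract describes as a Java package with three modules); the function names `read_fasta` and `extract`, the toy record `chr1`, and the 1-based coordinate convention are all assumptions made for illustration.

```python
from io import StringIO


def read_fasta(handle):
    """Parse FASTA text into a dict mapping record id -> sequence string."""
    records = {}
    name = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if name is not None:
                records[name] = "".join(chunks)
            name = line[1:].split()[0]  # id is the first word of the header
            chunks = []
        else:
            chunks.append(line.upper())
    if name is not None:
        records[name] = "".join(chunks)
    return records


def extract(records, name, start, length):
    """Return `length` bases of record `name`, starting at 1-based `start`."""
    seq = records[name]
    if start < 1 or length < 0 or start - 1 + length > len(seq):
        raise ValueError("requested region falls outside the sequence")
    return seq[start - 1:start - 1 + length]


# Hypothetical toy genome; a real run would open a FASTA file instead.
demo = StringIO(">chr1 toy record\nACGTACGTAC\nGGTTAACC\n")
genome = read_fasta(demo)
region = extract(genome, "chr1", 9, 6)
print(region)
```

    Reading the whole file into memory keeps the sketch short; for large genomes an indexed reader would be the more practical choice.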

  5. The complete mitochondrial genome of the citrus red mite Panonychus citri (Acari: Tetranychidae): high genome rearrangement and extremely truncated tRNAs

    Directory of Open Access Journals (Sweden)

    Dou Wei

    2010-10-01

    Background: The family Tetranychidae (Chelicerata: Acari) includes ~1200 species, many of which are of agronomic importance. To date, mitochondrial genomes of only two Tetranychidae species have been sequenced, and it has been found that these two mitochondrial genomes are characterized by many unusual features in genome organization and structure such as gene order and nucleotide frequency. The scarcity of available sequence data has greatly impeded evolutionary studies in Acari (mites and ticks). Information on Tetranychidae mitochondrial genomes is quite important for phylogenetic evaluation and population genetics, as well as the molecular evolution of functional genes such as acaricide-resistance genes. In this study, we sequenced the complete mitochondrial genome of Panonychus citri (Family Tetranychidae), a worldwide citrus pest, and provide a comparison to other Acari. Results: The mitochondrial genome of P. citri is a typical circular molecule of 13,077 bp, and contains the complete set of 37 genes that are usually found in metazoans. This is the smallest mitochondrial genome within all sequenced Acari and other Chelicerata, primarily due to the significant size reduction of protein coding genes (PCGs), a large rRNA gene, and the A + T-rich region. The mitochondrial gene order for P. citri is the same as those for P. ulmi and Tetranychus urticae, but distinctly different from other Acari by a series of gene translocations and/or inversions. The majority of the P. citri mitochondrial genome has a high A + T content (85.28%), which is also reflected by AT-rich codons being used more frequently, but exhibits a positive GC-skew (0.03). The Acari mitochondrial nad1 exhibits a faster amino acid substitution rate than other genes, and the variation of nucleotide substitution patterns of PCGs is significantly correlated with the G + C content. Most tRNA genes of P. citri are extremely truncated and atypical (44-65, 54.1 ± 4.1 bp), lacking
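
    The A + T content and GC-skew figures quoted in this record are simple functions of base counts. The sketch below assumes the conventional definition GC-skew = (G - C) / (G + C), computed on a single strand; the abstract does not state the formula or the strand used, and the function name `base_composition` and the toy sequence are illustrative only.

```python
from collections import Counter


def base_composition(seq):
    """Return (A+T fraction, GC-skew) for a DNA string.

    GC-skew uses the conventional definition (G - C) / (G + C); this is an
    assumption, since the abstract does not spell out the formula it used.
    Characters other than A, C, G, T (e.g. N) are ignored.
    """
    counts = Counter(seq.upper())
    a, c, g, t = (counts[base] for base in "ACGT")
    total = a + c + g + t
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    at_content = (a + t) / total
    gc_skew = (g - c) / (g + c) if (g + c) else 0.0
    return at_content, gc_skew


# Toy sequence, not P. citri data: 4 A, 4 T, 3 G, 1 C.
at_content, gc_skew = base_composition("AATTAATTGGGC")
print(f"A+T content: {at_content:.2%}, GC-skew: {gc_skew:+.2f}")
```

    A positive skew, as reported for P. citri, means G outnumbers C on the strand analysed.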

  6. Genomic sequencing of Pleistocene cave bears

    Energy Technology Data Exchange (ETDEWEB)

    Noonan, James P.; Hofreiter, Michael; Smith, Doug; Priest, James R.; Rohland, Nadin; Rabeder, Gernot; Krause, Johannes; Detter, J. Chris; Paabo, Svante; Rubin, Edward M.

    2005-04-01

    Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of ~1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.

  7. Plantagora: modeling whole genome sequencing and assembly of plant genomes.

    Directory of Open Access Journals (Sweden)

    Roger Barthelson

    BACKGROUND: Genomics studies are being revolutionized by the next generation sequencing technologies, which have made whole genome sequencing much more accessible to the average researcher. Whole genome sequencing with the new technologies is a developing art that, despite the large volumes of data that can be produced, may still fail to provide a clear and thorough map of a genome. The Plantagora project was conceived to address specifically the gap between having the technical tools for genome sequencing and knowing precisely the best way to use them. METHODOLOGY/PRINCIPAL FINDINGS: For Plantagora, a platform was created for generating simulated reads from several different plant genomes of different sizes. The resulting read files mimicked either 454 or Illumina reads, with varying paired end spacing. Thousands of datasets of reads were created, most derived from our primary model genome, rice chromosome one. All reads were assembled with different software assemblers, including Newbler, Abyss, and SOAPdenovo, and the resulting assemblies were evaluated by an extensive battery of metrics chosen for these studies. The metrics included both statistics of the assembly sequences and fidelity-related measures derived by alignment of the assemblies to the original genome source for the reads. The results were presented in a website, which includes a data graphing tool, all created to help the user compare rapidly the feasibility and effectiveness of different sequencing and assembly strategies prior to testing an approach in the lab. Some of our own conclusions regarding the different strategies were also recorded on the website. CONCLUSIONS/SIGNIFICANCE: Plantagora provides a substantial body of information for comparing different approaches to sequencing a plant genome, and some conclusions regarding some of the specific approaches. Plantagora also provides a platform of metrics and tools for studying the process of sequencing and assembly

  8. Deep sequencing discovery of novel and conserved microRNAs in trifoliate orange (Citrus trifoliata)

    Directory of Open Access Journals (Sweden)

    Yu Huaping

    2010-07-01

    Background: MicroRNAs (miRNAs) play a critical role in post-transcriptional gene regulation and have been shown to control many genes involved in various biological and metabolic processes. There have been extensive studies to discover miRNAs and analyze their functions in model plant species, such as Arabidopsis and rice. Deep sequencing technologies have facilitated identification of species-specific or lowly expressed as well as conserved or highly expressed miRNAs in plants. Results: In this research, we used Solexa sequencing to discover new microRNAs in trifoliate orange (Citrus trifoliata), which is an important rootstock of citrus. A total of 13,106,753 reads representing 4,876,395 distinct sequences were obtained from a short RNA library generated from small RNA extracted from C. trifoliata flower and fruit tissues. Based on sequence similarity and hairpin structure prediction, we found that 156,639 reads representing 63 sequences from 42 highly conserved miRNA families have perfect matches to known miRNAs. We also identified 10 novel miRNA candidates whose precursors were all potentially generated from citrus ESTs. In addition, five miRNA* sequences were also sequenced. These sequences had not been earlier described in other plant species, and accumulation of the 10 novel miRNAs was confirmed by qRT-PCR analysis. Potential target genes were predicted for most conserved and novel miRNAs. Moreover, four target genes, including one encoding IRX12 copper ion binding/oxidoreductase and three genes encoding NB-LRR disease resistance protein, have been experimentally verified by detection of the miRNA-mediated mRNA cleavage in C. trifoliata. Conclusion: Deep sequencing of short RNAs from C. trifoliata flowers and fruits identified 10 new potential miRNAs and 42 highly conserved miRNA families, indicating that specific miRNAs exist in C. trifoliata. These results show that regulatory miRNAs exist in agronomically important trifoliate orange

  9. Comparison of 61 Sequenced Escherichia coli Genomes

    DEFF Research Database (Denmark)

    Lukjancenko, Oksana; Wassenaar, T. M.; Ussery, David

    2010-01-01

    Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publicly available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics...... the pan-genome and about 80% of a typical genome; some of these variable genes tend to be co-localized on genomic islands. The diversity within the species E. coli, and the overlap in gene content between this and related species, suggests a continuum rather than sharp species borders in this group of...

  10. Polymorphism of the 5' terminal region of Citrus tristeza virus (CTV) RNA: incidence of three sequence types in isolates of different origin and pathogenicity.

    Science.gov (United States)

    Ayllón, M A; López, C; Navas-Castillo, J; Garnsey, S M; Guerri, J; Flores, R; Moreno, P

    2001-01-01

    Sequences of the 5' terminal region of the genomic RNA from eight isolates of Citrus tristeza virus (CTV) were previously classified into three types (I, II and III), with intragroup sequence identity higher than 88% and intergroup sequence identity as low as 44%. Sequencing of an additional 58 cDNA clones from 15 virus isolates showed that all sequences could be unequivocally assigned to one of the three types previously established. The relative frequency of each sequence type was assessed in 57 CTV isolates of different geographic origin and pathogenic characteristics by RT-PCR with sets of type-specific primers using CTV dsRNA as template. None of the isolates yielded amplification of the type I or II sequences alone, but in 19 of them type III sequences were the only amplification product detected. Within isolates containing more than one sequence type, eight had type II and III sequences, 11 had type I and III sequences, and 19 had sequences of the three types. Isolates containing only type III sequences caused only mild to moderate symptoms in Mexican lime, an indicator species for most CTV isolates, whereas isolates causing stem pitting in sweet orange and/or grapefruit generally contained type II sequences. None of the sequence types could be traced to a precise geographic area, as all types were detected in isolates from at least nine of the 12 countries from which samples were taken. PMID:11266215
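
    As a quick consistency check on the counts reported in this record, the tally below sums the four reported combinations of sequence types and derives how many isolates carried each type. The dictionary layout and variable names are illustrative; the numbers are taken directly from the abstract.

```python
# Number of isolates per combination of 5' sequence types, as reported.
isolates_by_types = {
    ("III",): 19,
    ("II", "III"): 8,
    ("I", "III"): 11,
    ("I", "II", "III"): 19,
}

# The four groups should account for all isolates assayed.
total = sum(isolates_by_types.values())

# Isolates in which each individual type was detected.
with_type = {
    t: sum(n for combo, n in isolates_by_types.items() if t in combo)
    for t in ("I", "II", "III")
}

print(total)
print(with_type)
```

    The total agrees with the 57 isolates assayed, and every isolate carried type III sequences.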

  11. Microbial species delineation using whole genome sequences

    Energy Technology Data Exchange (ETDEWEB)

    Kyrpides, Nikos; Mukherjee, Supratim; Ivanova, Natalia; Mavrommatics, Kostas; Pati, Amrita; Konstantinidis, Konstantinos

    2014-10-20

    Species assignments in prokaryotes use a manual, poly-phasic approach utilizing both phenotypic traits and sequence information of phylogenetic marker genes. With thousands of genomes being sequenced every year, an automated, uniform and scalable approach exploiting the rich genomic information in whole genome sequences is desired, at least for the initial assignment of species to an organism. We have evaluated pairwise genome-wide Average Nucleotide Identity (gANI) values and alignment fractions (AFs) for nearly 13,000 genomes using our fast implementation of the computation, identifying robust and widely applicable hard cut-offs for species assignments based on AF and gANI. Using these cutoffs, we generated stable species-level clusters of organisms, which enabled the identification of several species mis-assignments and facilitated the assignment of species for organisms without species definitions.

  12. Applications of Genome-based Science in Shaping Citrus Industries of the World (JGI Seventh Annual User Meeting, 2012: Genomics of Energy and Environment)

    Energy Technology Data Exchange (ETDEWEB)

    Gmitter Jr, Fred [University of Florida

    2012-03-21

    Fred Gmitter from the University of Florida on "Applications of Genome-based Science in Shaping the Future of the World's Citrus Industries" at the 7th Annual Genomics of Energy & Environment Meeting on March 21, 2012 in Walnut Creek, California.

  13. Unique features of a Japanese 'Candidatus Liberibacter asiaticus' strain revealed by whole genome sequencing.

    Directory of Open Access Journals (Sweden)

    Hiroshi Katoh

    Citrus greening (huanglongbing) is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, 'Candidatus Liberibacter asiaticus', 'Ca. L. americanus', and 'Ca. L. africanus'. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol), in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative 'Ca. L. asiaticus' Japanese isolate Ishi-1 was determined by metagenomic analysis of DNA extracted from 'Ca. L. asiaticus'-infected psyllids and leaf midribs. The 1.19-Mb genome has an average 36.32% GC content. Annotation revealed 13 operons encoding rRNA and 44 tRNA genes, but no typical bacterial pathogenesis-related genes were located within the genome, similar to the Floridian psy62 and Chinese gxpsy. In contrast to other 'Ca. L. asiaticus' strains, the genome of the Japanese Ishi-1 strain lacks a prophage-related region.

  14. The characterization of twenty sequenced human genomes.

    Directory of Open Access Journals (Sweden)

    Kimberly Pelak

    2010-09-01

    We present the analysis of twenty human genomes to evaluate the prospects for identifying rare functional variants that contribute to a phenotype of interest. We sequenced at high coverage ten "case" genomes from individuals with severe hemophilia A and ten "control" genomes. We summarize the number of genetic variants emerging from a study of this magnitude, and provide a proof of concept for the identification of rare and highly-penetrant functional variants by confirming that the cause of hemophilia A is easily recognizable in this data set. We also show that the number of novel single nucleotide variants (SNVs) discovered per genome seems to stabilize at about 144,000 new variants per genome, after the first 15 individuals have been sequenced. Finally, we find that, on average, each genome carries 165 homozygous protein-truncating or stop loss variants in genes representing a diverse set of pathways.

  15. Genome sequence and analysis of Lactobacillus helveticus

    Directory of Open Access Journals (Sweden)

    Paola Cremonesi

    2013-01-01

    The microbiological characterization of lactobacilli is historically well developed, but the genomic analysis is recent. Because of the widespread use of L. helveticus in cheese technology, information concerning the heterogeneity in this species is accumulating rapidly. Recently, the genomes of five L. helveticus strains were sequenced to completion and compared with other genomically characterized lactobacilli. The genomic analysis of the first sequenced strain, L. helveticus DPC 4571, isolated from cheese and selected for its characteristics of rapid lysis and high proteolytic activity, has revealed a plethora of genes with industrial potential including those responsible for key metabolic functions such as proteolysis, lipolysis, and cell lysis. These genes and their derived enzymes can facilitate the production of cheese and cheese derivatives with potential for use as ingredients in consumer foods. In addition, L. helveticus has the potential to produce peptides with a biological function, such as angiotensin converting enzyme (ACE) inhibitory activity, in fermented dairy products, demonstrating the therapeutic value of this species. A most intriguing feature of the genome of L. helveticus is the remarkable similarity in gene content with many intestinal lactobacilli. Comparative genomics has allowed the identification of key gene sets that facilitate a variety of lifestyles including adaptation to food matrices or the gastrointestinal tract. As genome sequence and functional genomic information continues to explode, key features of the genomes of L. helveticus strains continue to be discovered, answering many questions but also raising many new ones.

  16. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis)

    OpenAIRE

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the Ci...

  17. Genome-Wide Analysis of Citrus R2R3MYB Genes and Their Spatiotemporal Expression under Stresses and Hormone Treatments

    OpenAIRE

    Xie, Rangjin; Li, Yongjie; He, Shaolan; Zheng, Yongqiang; Yi, Shilai; Lv, Qiang; Deng, Lie

    2014-01-01

    The R2R3MYB proteins represent one of the largest families of transcription factors, which play important roles in plant growth and development. Although genome-wide analysis of this family has been conducted in many species, little is known about R2R3MYB genes in citrus. In this study, 101 R2R3MYB genes have been identified in the citrus (Citrus sinensis and Citrus clementina) genomes, which is almost equal to the number in rice. Phylogenetic analysis revealed that they could be subdivided in...

  18. Sequencing and comparing whole mitochondrial genomes of animals

    Energy Technology Data Exchange (ETDEWEB)

    Boore, Jeffrey L.; Macey, J. Robert; Medina, Monica

    2005-04-22

    Comparing complete animal mitochondrial genome sequences is becoming increasingly common for phylogenetic reconstruction and as a model for genome evolution. Not only are they much more informative than shorter sequences of individual genes for inferring evolutionary relatedness, but these data also provide sets of genome-level characters, such as the relative arrangements of genes, that can be especially powerful. We describe here the protocols commonly used for physically isolating mtDNA, for amplifying these by PCR or RCA, for cloning, sequencing, assembly, validation, and gene annotation, and for comparing both sequences and gene arrangements. On several topics, we offer general observations based on our experiences to date with determining and comparing complete mtDNA sequences.

  19. Multilocus Sequence Typing of Total-Genome-Sequenced Bacteria

    DEFF Research Database (Denmark)

    Larsen, Mette Voldby; Cosentino, Salvatore; Rasmussen, Simon;

    2012-01-01

    Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the "gold standard" of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS)...

  20. Genomic prediction using QTL derived from whole genome sequence data

    DEFF Research Database (Denmark)

    Brøndum, Rasmus Froberg; Su, Guosheng; Janss, Luc;

    This study investigated the gain in accuracy of genomic prediction when a small number of significant variants from single marker analysis based on whole genome sequence data were added to the regular 54k SNP data. Analyses were performed for Nordic Holstein and Danish Jersey animals, using either...... a genomic BLUP or a Bayesian variable selection model. When using the genomic BLUP model, results showed increases in accuracy of up to two percentage points for production traits in both Holstein and Jersey animals by including the extra variants in the analysis, and an extra 1.5 percentage points...

  1. Simple sequence repeats in mycobacterial genomes

    Indian Academy of Sciences (India)

    Vattipally B Sreenu; Pankaj Kumar; Javaregowda Nagaraju; Hampapathalu A Nagarajaram

    2007-01-01

    Simple sequence repeats (SSRs) or microsatellites are the repetitive nucleotide sequences of motifs of length 1–6 bp. They are scattered throughout the genomes of all the known organisms ranging from viruses to eukaryotes. Microsatellites undergo mutations in the form of insertions and deletions (INDELS) of their repeat units with some bias towards insertions that lead to microsatellite tract expansion. Although prokaryotic genomes derive some plasticity from microsatellite mutations, they have in-built mechanisms to arrest undue expansions of microsatellites, and one such mechanism is constituted by the post-replicative DNA repair enzymes MutL, MutH and MutS. The mycobacterial genomes lack these enzymes and as a null hypothesis one could expect these genomes to harbour many long tracts. It is therefore interesting to analyse the mycobacterial genomes for distribution and abundance of microsatellite tracts and to look for potentially polymorphic microsatellites. Available mycobacterial genomes, Mycobacterium avium, M. leprae, M. bovis and the two strains of M. tuberculosis (CDC1551 and H37Rv), were analysed for frequencies and abundance of SSRs. Our analysis revealed that the SSRs are distributed throughout the mycobacterial genomes at an average of 220–230 SSR tracts per kb. All the mycobacterial genomes contain a few regions that are conspicuously denser or poorer in microsatellites compared to their expected genome averages. The genomes distinctly show scarcity of long microsatellites despite the absence of a post-replicative DNA repair system. Such severe scarcity of long microsatellites could arise as a result of strong selection pressures operating against long and unstable sequences, although the influence of GC-content and the role of point mutations in arresting microsatellite expansions cannot be ruled out. Nonetheless, the long tracts occasionally found in coding as well as non-coding regions may account for limited genome plasticity in these genomes.
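
    The definition used in this record (tandem repeats of a 1–6 bp motif) lends itself to a short regular-expression scan. The sketch below is not the authors' method; the function name `find_ssrs`, the `min_repeats` threshold of 3, and the toy sequence are assumptions chosen for illustration, and only perfect (uninterrupted) repeats are reported.

```python
import re


def find_ssrs(seq, min_repeats=3, max_motif=6):
    """Find perfect tandem repeats of motifs 1..max_motif bp long.

    Returns a list of (0-based start, motif, repeat count) tuples. The
    threshold `min_repeats` is a hypothetical choice, not from the abstract.
    """
    if min_repeats < 2:
        raise ValueError("min_repeats must be at least 2")
    # Lazy motif group: the shortest repeating unit is tried first, so a run
    # such as AAAAAA is reported as A x 6 rather than AA x 3.
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        repeats = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, repeats))
    return hits


# Toy sequence with an (AT)4 tract and a (T)6 tract.
ssrs = find_ssrs("GCATATATATCCGTTTTTTGA")
for start, motif, repeats in ssrs:
    print(start, motif, repeats)
```

    Dividing the number of tracts found by the genome length in kb would give a density figure comparable in form to the one quoted in the abstract, although the value depends heavily on the thresholds chosen.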

  2. Genomic Prediction from Whole Genome Sequence in Livestock: The 1000 Bull Genomes Project

    DEFF Research Database (Denmark)

    Hayes, Benjamin J; MacLeod, Iona M; Daetwyler, Hans D;

    Advantages of using whole genome sequence data to predict genomic estimated breeding values (GEBV) include better persistence of accuracy of GEBV across generations and more accurate GEBV across breeds. The 1000 Bull Genomes Project provides a database of whole genome sequenced key ancestor bulls......, for imputing sequence variant genotypes into reference sets for genomic prediction. Run 3.0 included 429 sequences, with 31.8 million variants detected. BayesRC, a new method for genomic prediction, addresses some challenges associated with using the sequence data, and takes advantage of biological...... information. In a dairy data set, predictions using BayesRC and imputed sequence data from 1000 Bull Genomes were 2% more accurate than with 800k data. We could demonstrate the method identified causal mutations in some cases. Further improvements will come from more accurate imputation of sequence variant...

  3. Genome Sequence of Pseudomonas chlororaphis Strain 189

    Science.gov (United States)

    Town, Jennifer; Audy, Patrice; Boyetchko, Susan M.

    2016-01-01

    Pseudomonas chlororaphis strain 189 is a potent inhibitor of the growth of the potato pathogen Phytophthora infestans. We determined the complete, finished sequence of the 6.8-Mbp genome of this strain, consisting of a single contiguous molecule. Strain 189 is closely related to previously sequenced strains of P. chlororaphis. PMID:27340063

  4. Genome-wide identification of sweet orange (Citrus sinensis) histone modification gene families and their expression analysis during the fruit development and fruit-blue mold infection process

    Directory of Open Access Journals (Sweden)

    Jidi Xu

    2015-08-01

    In eukaryotes, histone acetylation and methylation have been known to be involved in regulating diverse developmental processes and plant defense. These histone modification events are controlled by a series of histone modification gene families. To date, there is no study regarding genome-wide characterization of histone modification related genes in citrus species. Based on the two recently sequenced sweet orange genome databases, a total of 136 CsHMs (Citrus sinensis histone modification genes), including 47 CsHMTs (histone methyltransferase genes), 23 CsHDMs (histone demethylase genes), 50 CsHATs (histone acetyltransferase genes), and 16 CsHDACs (histone deacetylase genes), were identified. These genes were categorized into 11 gene families. A comprehensive analysis of these 11 gene families was performed with chromosome locations, phylogenetic comparison, gene structures and conserved domain compositions of proteins. In order to gain an insight into the potential roles of these genes in citrus fruit development, 42 CsHMs with high mRNA abundance in fruit tissues were selected to further analyze their expression profiles at six stages of fruit development. Interestingly, a number of genes were expressed highly in the flesh of ripening fruit and some of them showed increasing expression levels along with fruit development. Furthermore, we analyzed the expression patterns of all 136 CsHMs in response to infection by blue mold (Penicillium digitatum), which is the most devastating pathogen in the citrus postharvest process. The results indicated that 20 of them showed strong alterations of their expression levels during the fruit-pathogen infection. In conclusion, this study presents a comprehensive analysis of the histone modification gene families in sweet orange and further elucidates their behaviors during fruit development and the blue mold infection responses.

  5. Genome-wide identification of sweet orange (Citrus sinensis) histone modification gene families and their expression analysis during the fruit development and fruit-blue mold infection process.

    Science.gov (United States)

    Xu, Jidi; Xu, Haidan; Liu, Yuanlong; Wang, Xia; Xu, Qiang; Deng, Xiuxin

    2015-01-01

    In eukaryotes, histone acetylation and methylation have been known to be involved in regulating diverse developmental processes and plant defense. These histone modification events are controlled by a series of histone modification gene families. To date, there is no study regarding genome-wide characterization of histone modification related genes in citrus species. Based on the two recently sequenced sweet orange genome databases, a total of 136 CsHMs (Citrus sinensis histone modification genes), including 47 CsHMTs (histone methyltransferase genes), 23 CsHDMs (histone demethylase genes), 50 CsHATs (histone acetyltransferase genes), and 16 CsHDACs (histone deacetylase genes), were identified. These genes were categorized into 11 gene families. A comprehensive analysis of these 11 gene families was performed with chromosome locations, phylogenetic comparison, gene structures, and conserved domain compositions of proteins. In order to gain an insight into the potential roles of these genes in citrus fruit development, 42 CsHMs with high mRNA abundance in fruit tissues were selected to further analyze their expression profiles at six stages of fruit development. Interestingly, a number of genes were expressed highly in the flesh of ripening fruit and some of them showed increasing expression levels along with fruit development. Furthermore, we analyzed the expression patterns of all 136 CsHMs in response to infection by blue mold (Penicillium digitatum), which is the most devastating pathogen in the citrus post-harvest process. The results indicated that 20 of them showed strong alterations of their expression levels during the fruit-pathogen infection. In conclusion, this study presents a comprehensive analysis of the histone modification gene families in sweet orange and further elucidates their behaviors during fruit development and the blue mold infection responses. PMID:26300904

  6. Next-generation sequencing: applications beyond genomes

    OpenAIRE

    Marguerat, Samuel; Wilhelm, Brian T.; Bähler, Jürg

    2008-01-01

    The development of DNA sequencing more than 30 years ago has profoundly impacted biological research. In the last couple of years, remarkable technological innovations have emerged that allow the direct and cost-effective sequencing of complex samples at unprecedented scale and speed. These next-generation technologies make it feasible to sequence not only static genomes, but also entire transcriptomes expressed under different conditions. These and other powerful applications of next-generat...

  7. Genome Sequence of the Palaeopolyploid soybean

    Energy Technology Data Exchange (ETDEWEB)

    Schmutz, Jeremy; Cannon, Steven B.; Schlueter, Jessica; Ma, Jianxin; Mitros, Therese; Nelson, William; Hyten, David L.; Song, Qijian; Thelen, Jay J.; Cheng, Jianlin; Xu, Dong; Hellsten, Uffe; May, Gregory D.; Yu, Yeisoo; Sakura, Tetsuya; Umezawa, Taishi; Bhattacharyya, Madan K.; Sandhu, Devinder; Valliyodan, Babu; Lindquist, Erika; Peto, Myron; Grant, David; Shu, Shengqiang; Goodstein, David; Barry, Kerrie; Futrell-Griggs, Montona; Abernathy, Brian; Du, Jianchang; Tian, Zhixi; Zhu, Liucun; Gill, Navdeep; Joshi, Trupti; Libault, Marc; Sethuraman, Anand; Zhang, Xue-Cheng; Shinozaki, Kazuo; Nguyen, Henry T.; Wing, Rod A.; Cregan, Perry; Specht, James; Grimwood, Jane; Rokhsar, Dan; Stacey, Gary; Shoemaker, Randy C.; Jackson, Scott A.

    2009-08-03

    Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.

  8. Rhipicephalus (Boophilus) microplus strain Deutsch, whole genome shotgun sequencing project first submission of genome sequence

    Science.gov (United States)

    The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence difficult. Cot filtration/selection techniques were used to reduce the repetitive fraction of the tick genome and enrich for the fraction of DNA with gene-containing regions. The Cot-selected ...

  9. Identification, characterization and expression analysis of lineage-specific genes within sweet orange (Citrus sinensis)

    OpenAIRE

    Xu, Yuantao; Wu, Guizhi; Hao, Baohai; Chen, Lingling; Deng, Xiuxin; Xu, Qiang

    2015-01-01

    Background: With the availability of a rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Results: Two sets of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcr...

  10. A 5'-proximal region of the Citrus tristeza virus genome encoding two leader proteases is involved in virus superinfection exclusion.

    Science.gov (United States)

    Atallah, Osama O; Kang, Sung-Hwan; El-Mohtar, Choaa A; Shilts, Turksen; Bergua, María; Folimonova, Svetlana Y

    2016-02-01

    Superinfection exclusion (SIE), a phenomenon in which a primary virus infection prevents a secondary infection with the same or closely related virus, has been observed with various viruses. Earlier we demonstrated that SIE by Citrus tristeza virus (CTV) requires viral p33 protein. In this work we show that p33 alone is not sufficient for virus exclusion. To define the additional viral components that are involved in this phenomenon, we engineered a hybrid virus in which a 5'-proximal region in the genome of the T36 isolate containing coding sequences for the two leader proteases L1 and L2 has been substituted with a corresponding region from the genome of a heterologous T68-1 isolate. Sequential inoculation of plants pre-infected with the CTV L1L2T68 hybrid with T36 CTV resulted in superinfection with the challenge virus, which indicated that the substitution of the L1-L2 coding region affected SIE ability of the virus. PMID:26748332

  11. Viral genome sequencing by random priming methods

    Directory of Open Access Journals (Sweden)

    Zhang Xinsheng

    2008-01-01

    Background: Most emerging health threats are of zoonotic origin. For the overwhelming majority, their causative agents are RNA viruses, which include but are not limited to HIV, Influenza, SARS, Ebola, Dengue, and Hantavirus. Of increasing importance, therefore, is a better understanding of global viral diversity to enable better surveillance and prediction of pandemic threats; this will require rapid and flexible methods for complete viral genome sequencing. Results: We have adapted the SISPA methodology [1-3] to genome sequencing of RNA and DNA viruses. We have demonstrated the utility of the method on various types and sources of viruses, obtaining near complete genome sequence of viruses ranging in size from 3,000–15,000 kb with a median depth of coverage of 14.33. We used this technique to generate full viral genome sequence in the presence of host contaminants, using viral preparations from cell culture supernatant, allantoic fluid and fecal matter. Conclusion: The method described is of great utility in generating whole genome assemblies for viruses with little or no available sequence information, viruses from greatly divergent families, previously uncharacterized viruses, or to more fully describe mixed viral infections.

  12. Complete Genome Sequence of “Candidatus Tremblaya princeps” Strain PCVAL, an Intriguing Translational Machine below the Living-Cell Status

    OpenAIRE

    López-Madrigal, Sergio; Latorre, Amparo; Porcar, Manuel; Moya, Andrés; Gil Benso, Rosario

    2011-01-01

    The sequence of the genome of “Candidatus Tremblaya princeps” strain PCVAL, the primary endosymbiont of the citrus mealybug Planococcus citri, has been determined. “Ca. Tremblaya princeps” presents an unusual nested endosymbiosis and harbors a gammaproteobacterial symbiont within its cytoplasm in all analyzed mealybugs. The genome sequence reveals that “Ca. Tremblaya princeps” cannot be considered an independent organism but that the consortium with its gammaproteobacterial symbiotic associat...

  13. Sequencing and Analysis of a Genomic Fragment Provide an Insight into the Dunaliella viridis Genomic Sequence

    Institute of Scientific and Technical Information of China (English)

    Xiao-Ming SUN; Yuan-Ping TANG; Xiang-Zong MENG; Wen-Wen ZHANG; Shan LI; Zhi-Rui DENG; Zheng-Kai XU; Ren-Tao SONG

    2006-01-01

    Dunaliella is a genus of wall-less unicellular eukaryotic green algae. Its exceptional resistance to salt and various other stresses has made it an ideal model for stress tolerance studies. However, very little is known about its genome and genomic sequences. In this study, we sequenced and analyzed a 29,268 bp genomic fragment from Dunaliella viridis. The fragment showed low sequence homology to the GenBank database. At the nucleotide level, only a segment with significant sequence homology to 18S rRNA was found. The fragment contained six putative genes, but only one gene showed significant homology at the protein level to the GenBank database. The average GC content of this sequence was 51.1%, which was much lower than that of the closely related green alga Chlamydomonas (65.7%). Significant segmental duplications were found within this fragment. The duplicated sequences accounted for about 35.7% of the entire region. Large amounts of simple sequence repeats (microsatellites) were found, with a strong bias towards the (AC)n type (76%). Analysis of other Dunaliella genomic sequences in the GenBank database (total 25,749 bp) was in agreement with these findings. These sequence features made it difficult to sequence Dunaliella genomic sequences. Further investigation should be made to reveal the biological significance of these unique sequence features.
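
    The GC-content figures quoted in this abstract (51.1% versus 65.7%) are simple base-composition percentages. As a minimal sketch of how such a value can be computed, assuming a plain nucleotide string as input (the function name and the toy sequence are illustrative and are not taken from the study):

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the unambiguous A/C/G/T bases."""
    seq = seq.upper()
    # Count only unambiguous bases so that N or gap characters do not dilute the result.
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / counted


# Toy sequence for illustration only; it is not data from the Dunaliella fragment.
example = "ACACACGGCCTTAG"
print(f"GC content: {gc_content(example):.1f}%")
```

    Ignoring ambiguous bases is a design choice made here for the sketch; the abstract does not say how the authors handled them.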

  14. Sequencing and analysis of a genomic fragment provide an insight into the Dunaliella viridis genomic sequence.

    Science.gov (United States)

    Sun, Xiao-Ming; Tang, Yuan-Ping; Meng, Xiang-Zong; Zhang, Wen-Wen; Li, Shan; Deng, Zhi-Rui; Xu, Zheng-Kai; Song, Ren-Tao

    2006-11-01

    Dunaliella is a genus of wall-less unicellular eukaryotic green algae. Its exceptional resistance to salt and various other stresses has made it an ideal model for stress tolerance studies. However, very little is known about its genome and genomic sequences. In this study, we sequenced and analyzed a 29,268 bp genomic fragment from Dunaliella viridis. The fragment showed low sequence homology to the GenBank database. At the nucleotide level, only a segment with significant sequence homology to 18S rRNA was found. The fragment contained six putative genes, but only one gene showed significant homology at the protein level to the GenBank database. The average GC content of this sequence was 51.1%, which was much lower than that of the closely related green alga Chlamydomonas (65.7%). Significant segmental duplications were found within this fragment. The duplicated sequences accounted for about 35.7% of the entire region. Large amounts of simple sequence repeats (microsatellites) were found, with a strong bias towards the (AC)n type (76%). Analysis of other Dunaliella genomic sequences in the GenBank database (total 25,749 bp) was in agreement with these findings. These sequence features made it difficult to sequence Dunaliella genomic sequences. Further investigation should be made to reveal the biological significance of these unique sequence features. PMID:17091199

  15. Complete Mitochondrial Genome of the Citrus Spiny Whitefly Aleurocanthus spiniferus (Quaintance) (Hemiptera: Aleyrodidae): Implications for the Phylogeny of Whiteflies.

    Science.gov (United States)

    Chen, Zhi-Teng; Mu, Li-Xia; Wang, Ji-Rui; Du, Yu-Zhou

    2016-01-01

    In this study, we sequenced the complete mitochondrial genome (15,220 bp) of the citrus spiny whitefly, Aleurocanthus spiniferus (Quaintance), a well-known pest from the superfamily Aleyrodidae. The A. spiniferus mitogenome contains 36 genes, including 13 protein-coding genes (PCGs), 21 transfer RNAs (tRNA), two ribosomal RNAs (rRNA) and a large non-coding region (control region, CR). Like most whiteflies, the A. spiniferus mitogenome had a large degree of rearrangement due to translocation of the nad3-trnG-cox3 gene cluster. The 13 PCGs initiated with ATN and generally terminated with TAA, although some used TAG or T as stop codons; atp6 showed the highest evolutionary rate, whereas cox2 appeared to have the lowest rate. The A. spiniferus mitogenome had 21 tRNAs with a typical cloverleaf secondary structure composed of four arms. Modeling of the two rRNA genes indicated that their secondary structure was similar to that of other insects. The CR of A. spiniferus was 920 bp and mapped between the nad3-trnG-cox3 and trnI-trnM gene clusters. One potential stem-loop structure and five tandem repeats were identified in the CR. Phylogenetic relationships of 11 species from the Aleyrodidae were analyzed based on the deduced amino acid sequences of the 13 PCGs and evolutionary characteristics were explored. Species with more genetic rearrangements were generally more evolved within the Aleyrodidae. PMID:27551782

  16. Signaling pathways in a Citrus EST database

    Directory of Open Access Journals (Sweden)

    Angela Mehta

    2007-01-01

    Citrus spp. are economically important crops, which in Brazil are grown mainly in the State of São Paulo. Citrus cultures are attacked by several pathogens, causing severe yield losses. In order to better understand this culture, the Millenium Project (IAC Cordeirópolis) was launched in order to sequence Citrus ESTs (expressed sequence tags) from different tissues, including leaf, bark, fruit, root and flower. Plants were submitted to biotic and abiotic stresses and investigated under different development stages (adult vs. juvenile). Several cDNA libraries were constructed and the sequences obtained formed the Citrus ESTs database with almost 200,000 sequences. Searches were performed in the Citrus database to investigate the presence of different signaling pathway components. Several of the genes involved in the signaling of sugar, calcium, cytokinin, plant hormones, inositol phosphate, MAPKinase and COP9 were found in the citrus genome and are discussed in this paper. The results obtained may indicate that similar mechanisms described in other plants, such as Arabidopsis, occur in citrus. Further experimental studies must be conducted in order to understand the different signaling pathways present.

  17. Sorghum genome sequencing by methylation filtration.

    Directory of Open Access Journals (Sweden)

    Joseph A Bedell

    2005-01-01

    Sorghum bicolor is a close relative of maize and is a staple crop in Africa and much of the developing world because of its superior tolerance of arid growth conditions. We have generated sequence from the hypomethylated portion of the sorghum genome by applying methylation filtration (MF) technology. The evidence suggests that 96% of the genes have been sequence tagged, with an average coverage of 65% across their length. Remarkably, this level of gene discovery was accomplished after generating a raw coverage of less than 300 megabases of the 735-megabase genome. MF preferentially captures exons and introns, promoters, microRNAs, and simple sequence repeats, and minimizes interspersed repeats, thus providing a robust view of the functional parts of the genome. The sorghum MF sequence set is beneficial to research on sorghum and is also a powerful resource for comparative genomics among the grasses and across the entire plant kingdom. Thousands of hypothetical gene predictions in rice and Arabidopsis are supported by the sorghum dataset, and genomic similarities highlight evolutionarily conserved regions that will lead to a better understanding of rice and Arabidopsis.

  18. Sorghum genome sequencing by methylation filtration.

    Science.gov (United States)

    Bedell, Joseph A; Budiman, Muhammad A; Nunberg, Andrew; Citek, Robert W; Robbins, Dan; Jones, Joshua; Flick, Elizabeth; Rholfing, Theresa; Fries, Jason; Bradford, Kourtney; McMenamy, Jennifer; Smith, Michael; Holeman, Heather; Roe, Bruce A; Wiley, Graham; Korf, Ian F; Rabinowicz, Pablo D; Lakey, Nathan; McCombie, W Richard; Jeddeloh, Jeffrey A; Martienssen, Robert A

    2005-01-01

    Sorghum bicolor is a close relative of maize and is a staple crop in Africa and much of the developing world because of its superior tolerance of arid growth conditions. We have generated sequence from the hypomethylated portion of the sorghum genome by applying methylation filtration (MF) technology. The evidence suggests that 96% of the genes have been sequence tagged, with an average coverage of 65% across their length. Remarkably, this level of gene discovery was accomplished after generating a raw coverage of less than 300 megabases of the 735-megabase genome. MF preferentially captures exons and introns, promoters, microRNAs, and simple sequence repeats, and minimizes interspersed repeats, thus providing a robust view of the functional parts of the genome. The sorghum MF sequence set is beneficial to research on sorghum and is also a powerful resource for comparative genomics among the grasses and across the entire plant kingdom. Thousands of hypothetical gene predictions in rice and Arabidopsis are supported by the sorghum dataset, and genomic similarities highlight evolutionarily conserved regions that will lead to a better understanding of rice and Arabidopsis. PMID:15660154

  19. Population genetic inference from genomic sequence variation

    OpenAIRE

    Pool, John E.; Hellmann, Ines; Jensen, Jeffrey D.; Nielsen, Rasmus

    2010-01-01

    Population genetics has evolved from a theory-driven field with little empirical data into a data-driven discipline in which genome-scale data sets test the limits of available models and computational analysis methods. In humans and a few model organisms, analyses of whole-genome sequence polymorphism data are currently under way. And in light of the falling costs of next-generation sequencing technologies, such studies will soon become common in many other organisms as well. Here, we assess...

  20. An International Plan to Sequence the Onion Genome

    Science.gov (United States)

    The cost of DNA sequencing continues to decline and, in the near future, it will become reasonable to undertake sequencing of the enormous nuclear genome of onion. We undertook sequencing of expressed and genomic regions of the onion genome to learn about the structure of the onion genome, as well a...

  1. The diploid genome sequence of an Asian individual

    DEFF Research Database (Denmark)

    Wang, Jun; Wang, Wei; Li, Ruiqiang;

    2008-01-01

    Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we...

  2. Multilocus sequence typing of total-genome-sequenced bacteria.

    Science.gov (United States)

    Larsen, Mette V; Cosentino, Salvatore; Rasmussen, Simon; Friis, Carsten; Hasman, Henrik; Marvig, Rasmus Lykke; Jelsbak, Lars; Sicheritz-Pontén, Thomas; Ussery, David W; Aarestrup, Frank M; Lund, Ole

    2012-04-01

    Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the "gold standard" of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST. PMID:22238442
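
    The last step described in this abstract, determining the sequence type from the combination of alleles identified, amounts to a table lookup on the allele profile. A minimal sketch follows, assuming the best-matching allele number for each locus has already been found (the abstract's BLAST-based ranking is not reproduced here). The locus names, the profile table and the function name are hypothetical and do not come from any real MLST scheme or from the published service.

```python
from typing import Dict, Optional, Tuple

# Hypothetical three-locus scheme; real MLST schemes typically use more loci.
LOCI: Tuple[str, ...] = ("locusA", "locusB", "locusC")

# Hypothetical profile table mapping an ordered allele combination to a sequence type (ST).
PROFILES: Dict[Tuple[int, ...], int] = {
    (1, 1, 1): 1,
    (1, 2, 1): 2,
    (3, 2, 5): 7,
}


def sequence_type(alleles: Dict[str, int]) -> Optional[int]:
    """Return the ST for a complete allele profile, or None if the combination is unknown."""
    # Order the allele numbers by the scheme's locus order before the lookup.
    profile = tuple(alleles[locus] for locus in LOCI)
    return PROFILES.get(profile)


isolate = {"locusA": 3, "locusB": 2, "locusC": 5}
print(sequence_type(isolate))
```

    Returning None for an unseen combination keeps a novel profile distinguishable from a known one; how the published method reports such cases is not stated in the abstract.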

  3. Mauve: Multiple Alignment of Conserved Genomic Sequence With Rearrangements

    OpenAIRE

    Darling, Aaron C.E.; Mau, Bob; Blattner, Frederick R.; Perna, Nicole T.

    2004-01-01

    As genomes evolve, they undergo large-scale evolutionary processes that present a challenge to sequence comparison not posed by short sequences. Recombination causes frequent genome rearrangements, horizontal transfer introduces new sequences into bacterial chromosomes, and deletions remove segments of the genome. Consequently, each genome is a mosaic of unique lineage-specific segments, regions shared with a subset of other genomes and segments conserved among all the genomes under considera...

  4. Hidden ribozymes in eukaryotic genome sequence

    OpenAIRE

    Sean P Ryder

    2010-01-01

    The small self-cleaving ribozymes fold into complex tertiary structures to promote autocatalytic cleavage or ligation at a precise position within their sequence. Until recently, relatively few examples had been identified. Two papers now reveal that self-cleaving ribozymes are prevalent in eukaryotic genomes and, in some cases, might play a role in regulating gene expression.

  5. Whole genome sequences of four Brucella strains.

    Science.gov (United States)

    Ding, Jiabo; Pan, Yuanlong; Jiang, Hai; Cheng, Junsheng; Liu, Taotao; Qin, Nan; Yang, Yi; Cui, Buyun; Chen, Chen; Liu, Cuihua; Mao, Kairong; Zhu, Baoli

    2011-07-01

    Brucella melitensis and Brucella suis are intracellular pathogens of livestock and humans. Here we report four genome sequences, those of the virulent strain B. melitensis M28-12 and vaccine strains B. melitensis M5 and M111 and B. suis S2, which show different virulences and pathogenicities, which will help to design a more effective brucellosis vaccine. PMID:21602346

  6. Genome Sequence of Lactobacillus amylovorus GRL1112

    OpenAIRE

    Kant, R.; Paulin, L.; Alatalo, E.; de Vos, W.M.; Palva, A.

    2010-01-01

    Lactobacillus amylovorus is a common member of the normal gastrointestinal tract (GIT) microbiota in pigs. Here, we report the genome sequence of L. amylovorus GRL1112, a porcine feces isolate displaying strong adherence to the pig intestinal epithelial cells. The strain is of interest, as it is a potential probiotic bacterium.

  7. Genome sequence of Lactobacillus amylovorus GRL1112.

    Science.gov (United States)

    Kant, Ravi; Paulin, Lars; Alatalo, Edward; de Vos, Willem M; Palva, Airi

    2011-02-01

    Lactobacillus amylovorus is a common member of the normal gastrointestinal tract (GIT) microbiota in pigs. Here, we report the genome sequence of L. amylovorus GRL1112, a porcine feces isolate displaying strong adherence to the pig intestinal epithelial cells. The strain is of interest, as it is a potential probiotic bacterium. PMID:21131492

  8. Mapping whole genome shotgun sequence and variant calling in mammalian species without their reference genomes

    Science.gov (United States)

    Genomics research in mammals has produced reference genome sequences that are essential for identifying variation associated with disease. High quality reference genome sequences are now available for humans, model species, and economically important agricultural animals. Comparisons between these s...

  9. The Theory and Practice of Genome Sequence Assembly.

    Science.gov (United States)

    Simpson, Jared T; Pop, Mihai

    2015-01-01

    The current genomic revolution was made possible by joint advances in genome sequencing technologies and computational approaches for analyzing sequence data. The close interaction between biologists and computational scientists is perhaps most apparent in the development of approaches for sequencing entire genomes, a feat that would not be possible without sophisticated computational tools called genome assemblers (short for genome sequence assemblers). Here, we survey the key developments in algorithms for assembling genome sequences since the development of the first DNA sequencing methods more than 35 years ago. PMID:25939056

  10. Analysis of expressed sequence tags from Citrus sinensis L. Osbeck infected with Xylella fastidiosa

    Directory of Open Access Journals (Sweden)

    Alessandra A. de Souza

    2007-01-01

    In order to understand the genetic responses resulting from physiological changes that occur in plants displaying citrus variegated chlorosis (CVC) symptoms, we adopted a strategy of comparing two EST libraries from sweet orange [Citrus sinensis (L.) Osbeck]. One of them was prepared with plants showing typical CVC symptoms caused by Xylella fastidiosa and the other with non-inoculated plants. We obtained 15,944 ESTs by sequencing the two cDNA libraries. Using an in silico hybridization strategy, 37 genes were found to have significant variation at the transcriptional level. Within this subset, 21 were up-regulated and 16 were down-regulated in plants with CVC. The main functional categories of the down-regulated transcripts in plants with CVC were associated with metabolism, protein modification, energy and transport facilitation. The majority of the up-regulated transcripts were associated with metabolism and defense response. Some transcripts associated with adaptation to stress conditions were up-regulated in plants with CVC and could explain why plants remain alive even under severe water and nutritional stress. Others of the up-regulated transcripts are related to defense response, suggesting that sweet orange plants activate their defense machinery. The genes associated with stress response might be expressed as part of a secondary response related to physiological alterations caused by the infection.

  11. Genome-Wide Characterization and Expression Analysis of Major Intrinsic Proteins during Abiotic and Biotic Stresses in Sweet Orange (Citrus sinensis L. Osb.).

    Directory of Open Access Journals (Sweden)

    Cristina de Paula Santos Martins

    The family of aquaporins (AQPs), or major intrinsic proteins (MIPs), includes integral membrane proteins that function as transmembrane channels for water and other small molecules of physiological significance. MIPs are classified into five subfamilies in higher plants, including plasma membrane (PIPs), tonoplast (TIPs), NOD26-like (NIPs), small basic (SIPs) and unclassified X (XIPs) intrinsic proteins. This study reports a genome-wide survey of MIP encoding genes in sweet orange (Citrus sinensis L. Osb.), the most widely cultivated Citrus spp. A total of 34 different genes encoding C. sinensis MIPs (CsMIPs) were identified and assigned into five subfamilies (CsPIPs, CsTIPs, CsNIPs, CsSIPs and CsXIPs) based on sequence analysis and also on their phylogenetic relationships with clearly classified MIPs of Arabidopsis thaliana. Analysis of key amino acid residues allowed the assessment of the substrate specificity of each CsMIP. Gene structure analysis revealed that the CsMIPs possess an exon-intron organization that is highly conserved within each subfamily. CsMIP loci were precisely mapped on every sweet orange chromosome, indicating a wide distribution of the gene family in the sweet orange genome. Investigation of their expression patterns in different tissues and upon drought and salt stress treatments, as well as with 'Candidatus Liberibacter asiaticus' infection, revealed a tissue-specific and coordinated regulation of the different CsMIP isoforms, consistent with the organization of the stress-responsive cis-acting regulatory elements observed in their promoter regions. A special role in regulating the flow of water and nutrients is proposed for CsTIPs and CsXIPs during drought stress, and for most CsMIPs during salt stress and the development of HLB disease. These results provide a valuable reference for further exploration of the CsMIPs functions and applications to the genetic improvement of both abiotic and biotic stress tolerance in citrus.

  12. Genome-Wide Characterization and Expression Analysis of Major Intrinsic Proteins during Abiotic and Biotic Stresses in Sweet Orange (Citrus sinensis L. Osb.).

    Science.gov (United States)

    Martins, Cristina de Paula Santos; Pedrosa, Andresa Muniz; Du, Dongliang; Gonçalves, Luana Pereira; Yu, Qibin; Gmitter, Frederick G; Costa, Marcio Gilberto Cardoso

    2015-01-01

    The family of aquaporins (AQPs), or major intrinsic proteins (MIPs), includes integral membrane proteins that function as transmembrane channels for water and other small molecules of physiological significance. MIPs are classified into five subfamilies in higher plants, including plasma membrane (PIPs), tonoplast (TIPs), NOD26-like (NIPs), small basic (SIPs) and unclassified X (XIPs) intrinsic proteins. This study reports a genome-wide survey of MIP encoding genes in sweet orange (Citrus sinensis L. Osb.), the most widely cultivated Citrus spp. A total of 34 different genes encoding C. sinensis MIPs (CsMIPs) were identified and assigned into five subfamilies (CsPIPs, CsTIPs, CsNIPs, CsSIPs and CsXIPs) based on sequence analysis and also on their phylogenetic relationships with clearly classified MIPs of Arabidopsis thaliana. Analysis of key amino acid residues allowed the assessment of the substrate specificity of each CsMIP. Gene structure analysis revealed that the CsMIPs possess an exon-intron organization that is highly conserved within each subfamily. CsMIP loci were precisely mapped on every sweet orange chromosome, indicating a wide distribution of the gene family in the sweet orange genome. Investigation of their expression patterns in different tissues and upon drought and salt stress treatments, as well as with 'Candidatus Liberibacter asiaticus' infection, revealed a tissue-specific and coordinated regulation of the different CsMIP isoforms, consistent with the organization of the stress-responsive cis-acting regulatory elements observed in their promoter regions. A special role in regulating the flow of water and nutrients is proposed for CsTIPs and CsXIPs during drought stress, and for most CsMIPs during salt stress and the development of HLB disease. These results provide a valuable reference for further exploration of the CsMIPs functions and applications to the genetic improvement of both abiotic and biotic stress tolerance in citrus. PMID:26397813

  13. Construction of citrus gene coexpression networks from microarray data using random matrix theory

    OpenAIRE

    Dongliang Du; Nidhi Rawat; Zhanao Deng; Gmitter Jr., Fred G.

    2015-01-01

    After the sequencing of citrus genomes, gene function annotation is becoming a new challenge. Gene coexpression analysis can be employed for function annotation using publicly available microarray data sets. In this study, 230 sweet orange (Citrus sinensis) microarrays were used to construct seven coexpression networks, including one condition-independent and six condition-dependent (Citrus canker, Huanglongbing, leaves, flavedo, albedo, and flesh) networks. In total, these networks contain 3...

  14. Whole Genome Sequencing: Cracking the Genetic Code for Foodborne Illness

    Science.gov (United States)

    ... Bacteria that cause disease have millions of different genomes, or sequences of genetic code, each as unique ...

  15. Sequence analysis and editing for bisulphite genomic sequencing projects

    OpenAIRE

    Carr, IM; Valleley, EMA; Cordery, SF; Markham, AF; Bonthron, DT

    2007-01-01

    Bisulphite genomic sequencing is a widely used technique for detailed analysis of the methylation status of a region of DNA. It relies upon the selective deamination of unmethylated cytosine to uracil after treatment with sodium bisulphite, usually followed by PCR amplification of the chosen target region. Since this two-step procedure replaces all unmethylated cytosine bases with thymine, PCR products derived from unmethylated templates contain only three types of nucleotide, in unequal prop...
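
    The conversion described above can be illustrated with a minimal in-silico sketch. It assumes a single top-strand template and a hypothetical set of 0-based methylated positions; the function name `bisulphite_convert` is illustrative and is not taken from the software described in the record.

```python
def bisulphite_convert(seq, methylated_positions=frozenset()):
    """Simulate bisulphite treatment followed by PCR on one strand.

    Unmethylated C is deaminated to U and read as T after PCR;
    methylated C (positions listed in methylated_positions) stays C.
    """
    out = []
    for i, base in enumerate(seq.upper()):
        if base == "C" and i not in methylated_positions:
            out.append("T")
        else:
            out.append(base)
    return "".join(out)


template = "ACGTCCGA"
print(bisulphite_convert(template))          # fully unmethylated template
print(bisulphite_convert(template, {1, 5}))  # C at positions 1 and 5 methylated
```

    Comparing the converted read with the original template column by column then gives the methylation call at each cytosine: a retained C indicates methylation, a T indicates none.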

  16. Sequence motif discovery with computational genome-wide analysis

    OpenAIRE

    Akashi, Hirofumi; Aoki, Fumio; Toyota, Minoru; Maruyama, Reo; Sasaki, Yasushi; Mita, Hiroaki; Tokura, Hajime; Imai, Kohzoh; Tatsumi, Haruyuki

    2006-01-01

    As a result of the human genome project and advancements in DNA sequencing technology, we can utilize a huge amount of nucleotide sequence data and can search for DNA sequence motifs in the whole human genome. However, searching motifs with the naked eye is an enormous task and searching throughout the whole genome is absolutely impossible. Therefore, we have developed a computational genome-wide analyzing system for detecting DNA sequence motifs with biological significance. We used a multi-parallel...
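
    As a rough illustration of computational motif searching (not the multi-parallel system the record describes), the sketch below scans a sequence for a motif written with IUPAC ambiguity codes. The helper name `find_motif` and the example sequences are assumptions made for this example.

```python
import re

# IUPAC nucleotide ambiguity codes mapped to regular-expression classes
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_motif(seq, motif):
    """Return 0-based start positions of every (possibly overlapping) match."""
    pattern = "".join(IUPAC[ch] for ch in motif.upper())
    # A lookahead keeps the regex engine from skipping overlapping hits.
    return [m.start() for m in re.finditer("(?=(" + pattern + "))", seq.upper())]


print(find_motif("GATTACAGATTACA", "GATTACA"))
print(find_motif("TATAAATATATA", "TATAWA"))
```

    A genome-scale search would normally also scan the reverse complement and split the genome into chunks processed in parallel; both are omitted here for brevity.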

  17. What Will We Do with a Cotton Genome Sequence?

    Institute of Scientific and Technical Information of China (English)

    BRUBAKER Curt

    2008-01-01

    With the publication of "Toward Sequencing Cotton (Gossypium) Genomes" [Chen et al. Plant Physiology, 2007, 145:1303-1310], a clear consensus emerged from the cotton genomics community not only that cotton genome sequences were a critical resource for research and commercial innovation in cotton genomics, but that there was a logical means of achieving this goal.

  18. The predictive capacity of personal genome sequencing.

    Science.gov (United States)

    Roberts, Nicholas J; Vogelstein, Joshua T; Parmigiani, Giovanni; Kinzler, Kenneth W; Vogelstein, Bert; Velculescu, Victor E

    2012-05-01

    New DNA sequencing methods will soon make it possible to identify all germline variants in any individual at a reasonable cost. However, the ability of whole-genome sequencing to predict predisposition to common diseases in the general population is unknown. To estimate this predictive capacity, we use the concept of a "genometype." A specific genometype represents the genomes in the population conferring a specific level of genetic risk for a specified disease. Using this concept, we estimated the maximum capacity of whole-genome sequencing to identify individuals at clinically significant risk for 24 different diseases. Our estimates were derived from the analysis of large numbers of monozygotic twin pairs; twins of a pair share the same genometype and therefore identical genetic risk factors. Our analyses indicate that (i) for 23 of the 24 diseases, most of the individuals will receive negative test results; (ii) these negative test results will, in general, not be very informative, because the risk of developing 19 of the 24 diseases in those who test negative will still be, at minimum, 50 to 80% of that in the general population; and (iii) on the positive side, in the best-case scenario, more than 90% of tested individuals might be alerted to a clinically significant predisposition to at least one disease. These results have important implications for the valuation of genetic testing by industry, health insurance companies, public policy-makers, and consumers. PMID:22472521

  19. Enhanced Dynamic Algorithm of Genome Sequence Alignments

    Directory of Open Access Journals (Sweden)

    Arabi E. keshk

    2014-05-01

    The merging of biology and computer science has created a new field called computational biology, or bioinformatics, that explores the capacity of computers to gain knowledge from biological data. Computational biology is rooted in the life sciences as well as in computer science, information science, and technology. The main problem in computational biology is sequence alignment, a way of arranging sequences of DNA, RNA or protein to identify regions of similarity and relationships between sequences. This paper introduces an enhancement of the dynamic algorithm of genome sequence alignment, called EDAGSA. It fills only the three main diagonals, without filling the entire matrix with unused data. It obtains the optimal solution while decreasing the execution time, and therefore performance is increased. To illustrate the effectiveness of the proposed algorithm, it is compared with traditional methods such as the Needleman-Wunsch, Smith-Waterman and longest common subsequence algorithms. In addition, a database is implemented so that the algorithm can be used in multi-sequence alignments to search for the optimal sequence that matches a given sequence.
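
    The idea of filling only a few diagonals of the dynamic-programming matrix can be sketched with a generic banded Needleman-Wunsch score. This is not the EDAGSA algorithm itself, whose details are not given in the record; the function name, the scoring values and the `band` parameter are assumptions. With `band=1`, only the main diagonal and its two neighbours are filled.

```python
def banded_global_score(a, b, band=1, match=1, mismatch=-1, gap=-1):
    """Global alignment score using only cells with |i - j| <= band."""
    n, m = len(a), len(b)
    if abs(n - m) > band:
        raise ValueError("length difference exceeds band; no path stays inside it")
    neg_inf = float("-inf")
    # Row 0: only columns inside the band are stored.
    prev = {j: j * gap for j in range(0, min(m, band) + 1)}
    for i in range(1, n + 1):
        cur = {}
        for j in range(max(0, i - band), min(m, i + band) + 1):
            if j == 0:
                cur[j] = i * gap  # first column: i leading gaps
                continue
            s = match if a[i - 1] == b[j - 1] else mismatch
            diag = prev.get(j - 1, neg_inf) + s   # align a[i-1] with b[j-1]
            up = prev.get(j, neg_inf) + gap       # gap in b
            left = cur.get(j - 1, neg_inf) + gap  # gap in a
            cur[j] = max(diag, up, left)
        prev = cur
    return prev[m]


print(banded_global_score("GATTACA", "GATCACA"))
print(banded_global_score("ACGT", "AGT"))
```

    The band trades generality for speed: the work drops from O(nm) to O(n * band), but the result equals the full Needleman-Wunsch score only when an optimal alignment stays inside the band.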

  20. Swine Genome Sequencing Consortium (SGSC): A Strategic Roadmap for Sequencing The Pig Genome

    Directory of Open Access Journals (Sweden)

    Kellye Eversole

    2006-04-01

    The Swine Genome Sequencing Consortium (SGSC) was formed in September 2003 by academic, government and industry representatives to provide international coordination for sequencing the pig genome. The SGSC’s mission is to advance biomedical research for animal production and health by the development of DNA-based tools and products resulting from the sequencing of the swine genome. During the past 2 years, the SGSC has met bi-annually to develop a strategic roadmap for creating the required scientific resources, to integrate existing physical maps, and to create a sequencing strategy that captured international participation and a broad funding base. During the past year, SGSC members have integrated their respective physical mapping data with the goal of creating a minimal tiling path (MTP) that will be used as the sequencing template. During the recent Plant and Animal Genome meeting (January 16, 2005, San Diego, CA), presentations demonstrated that a human–pig comparative map has been completed, that BAC fingerprint contigs (FPC) for each of the autosomes and the X chromosome have been constructed, and that BAC end-sequencing has permitted, through BLAST analysis and RH-mapping, anchoring of the contigs. Thus, significant progress has been made towards the creation of an MTP. In addition, whole-genome (WG) shotgun libraries have been constructed and are currently being sequenced in various laboratories around the globe. Thus, a hybrid sequencing approach, combining 3x coverage of the BACs comprising the MTP with 3x coverage of the WG shotgun libraries, will be used to develop a draft 6x coverage of the pig genome.

  1. Underlying Data for Sequencing the Mitochondrial Genome with the Massively Parallel Sequencing Platform Ion Torrent™ PGM™

    OpenAIRE

    Seo, Seung Bum; Zeng, Xiangpei; King, Jonathan L.; Larue, Bobby L; Assidi, Mourad; Al-Qahtani, Mohamed H; Sajantila, Antti; Budowle, Bruce

    2015-01-01

    Background: Massively parallel sequencing (MPS) technologies have the capacity to sequence targeted regions or whole genomes of multiple nucleic acid samples with high coverage by sequencing millions of DNA fragments simultaneously. Compared with Sanger sequencing, MPS also can reduce labor and cost on a per nucleotide basis and indeed on a per sample basis. In this study, whole genomes of human mitochondria (mtGenome) were sequenced on the Personal Genome Machine (PGM™) (L...

  2. Underlying Data for Sequencing the Mitochondrial Genome with the Massively Parallel Sequencing Platform Ion Torrent™ PGM™

    OpenAIRE

    Seo, Seung Bum; Zeng, Xiangpei; King, Jonathan L.; Larue, Bobby L; Assidi, Mourad; Al-Qahtani, Mohamed H; Sajantila, Antti; Budowle, Bruce

    2015-01-01

    Background: Massively parallel sequencing (MPS) technologies have the capacity to sequence targeted regions or whole genomes of multiple nucleic acid samples with high coverage by sequencing millions of DNA fragments simultaneously. Compared with Sanger sequencing, MPS also can reduce labor and cost on a per nucleotide basis and indeed on a per sample basis. In this study, whole genomes of human mitochondria (mtGenome) were sequenced on the Personal Genome Machine (PGM™) (Life Technologies, S...

  3. Genome-wide classification and evolutionary and expression analyses of citrus MYB transcription factor families in sweet orange.

    Directory of Open Access Journals (Sweden)

    Xiao-Jin Hou

    MYB family genes are widely distributed in plants and comprise one of the largest transcription factor families involved in various developmental processes and defense responses of plants. To date, few MYB genes and little expression profiling have been reported for citrus. Here, we describe and classify 177 members of the sweet orange MYB gene (CsMYB) family in terms of their genomic gene structures and similarity to their putative Arabidopsis orthologs. According to these analyses, these CsMYBs were categorized into four groups (4R-MYB, 3R-MYB, 2R-MYB and 1R-MYB). Gene structure analysis revealed that 1R-MYB genes possess relatively more introns as compared with 2R-MYB genes. Investigation of their chromosomal localizations revealed that these CsMYBs are distributed across nine chromosomes. Sweet orange includes a relatively small number of MYB genes compared with the 198 members in Arabidopsis, presumably due to a paralog reduction related to repetitive sequence insertion into promoter and non-coding transcribed regions of the genes. Comparative studies of CsMYBs and Arabidopsis showed that CsMYBs had fewer gene duplication events. Expression analysis revealed that the MYB gene family has a wide expression profile in sweet orange development and plays important roles in development and stress responses. In addition, 337 new putative microsatellites with flanking sequences sufficient for primer design were also identified from the 177 CsMYBs. These results provide a useful reference for the selection of candidate MYB genes for cloning and further functional analysis for citrus.

  4. Identifying driver mutations in sequenced cancer genomes

    DEFF Research Database (Denmark)

    Raphael, Benjamin J; Dobson, Jason R; Oesper, Layla;

    2014-01-01

    ..., and random mutations. Here, we review computational approaches to identify somatic mutations in cancer genome sequences and to distinguish the driver mutations that are responsible for cancer from random, passenger mutations. First, we describe approaches to detect somatic mutations from high-throughput DNA sequencing data, particularly for tumor samples that comprise heterogeneous populations of cells. Next, we review computational approaches that aim to predict driver mutations according to their frequency of occurrence in a cohort of samples, or according to their predicted functional impact on protein sequence or structure. Finally, we review techniques to identify recurrent combinations of somatic mutations, including approaches that examine mutations in known pathways or protein-interaction networks, as well as de novo approaches that identify combinations of mutations according to...

  5. Why Assembling Plant Genome Sequences Is So Challenging

    Directory of Open Access Journals (Sweden)

    Pedro Seoane

    2012-09-01

    In spite of the biological and economic importance of plants, relatively few plant species have been sequenced. Only the genome sequence of plants with relatively small genomes, most of them angiosperms, in particular eudicots, has been determined. The arrival of next-generation sequencing technologies has allowed the rapid and efficient development of new genomic resources for non-model or orphan plant species. But the sequencing pace of plants is far from that of animals and microorganisms. This review focuses on the typical challenges of plant genomes that can explain why plant genomics is less developed than animal genomics. Explanations about the impact of some confounding factors emerging from the nature of plant genomes are given. As a result of these challenges and confounding factors, the correct assembly and annotation of plant genomes is hindered, genome drafts are produced, and advances in plant genomics are delayed.

  6. Genome-wide analysis of citrus R2R3MYB genes and their spatiotemporal expression under stresses and hormone treatments.

    Directory of Open Access Journals (Sweden)

    Rangjin Xie

    The R2R3MYB proteins represent one of the largest families of transcription factors, which play important roles in plant growth and development. Although genome-wide analysis of this family has been conducted in many species, little is known about R2R3MYB genes in citrus. In this study, 101 R2R3MYB genes were identified in the citrus (Citrus sinensis and Citrus clementina) genomes, a number almost equal to that in rice. Phylogenetic analysis revealed that they could be subdivided into 21 subgroups. The evolutionary relationships and the intron-exon organizations were also analyzed, revealing strong gene conservation but also the expansion of particular functional genes during plant evolution. Tissue-specific expression profiles showed that 95 citrus R2R3MYB genes were expressed in at least one tissue and the other 6 genes showed very low expression in all tissues tested, suggesting that citrus R2R3MYB genes play important roles in the development of all citrus organs. Analysis of transcript abundance under abiotic conditions (NaCl, abscisic acid, jasmonic acid, drought and low temperature) identified a group of R2R3MYB genes that responded to one or multiple treatments, showing promise for improving citrus adaptation to stresses. Our results provide an essential foundation for the future selection of citrus R2R3MYB genes for cloning and functional dissection, with the aim of uncovering their roles in citrus growth and development.

  7. Simple sequence repeats in bryophyte mitochondrial genomes.

    Science.gov (United States)

    Zhao, Chao-Xian; Zhu, Rui-Liang; Liu, Yang

    2016-01-01

    Simple sequence repeats (SSRs) are thought to be common in plant mitochondrial (mt) genomes, but have yet to be fully described for bryophytes. We screened the mt genomes of two liverworts (Marchantia polymorpha and Pleurozia purpurea), two mosses (Physcomitrella patens and Anomodon rugelii) and two hornworts (Phaeoceros laevis and Nothoceros aenigmaticus), and detected 475 SSRs. Some SSRs are conserved during evolution; of these, one occurs in both liverworts and mosses, while all others are shared only by the two liverworts, the two mosses or the two hornworts. SSRs are known as DNA tracts having high mutation rates; however, according to our observations, they can still evolve slowly. The conservation of these SSRs suggests that they are under strong selection and could play critical roles in maintaining gene function. PMID:24491104
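
    SSR screening of the kind summarized above is commonly done by scanning for tandem repeats of short units. The sketch below is a minimal regex-based version; the minimum repeat counts (10 for mono-, 5 for di-, 4 for trinucleotide units) and the function name `find_ssrs` are assumptions for illustration, not the criteria used in the study.

```python
import re


def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, unit, repeat_count) tuples for perfect SSRs."""
    if min_repeats is None:
        # Hypothetical thresholds: unit length -> minimum number of repeats
        min_repeats = {1: 10, 2: 5, 3: 4}
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # One copy of the unit followed by at least (min_rep - 1) more copies
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are repeats of a shorter unit (e.g. "AA")
            if any(unit == unit[:k] * (unit_len // k)
                   for k in range(1, unit_len) if unit_len % k == 0):
                continue
            hits.append((m.start(), unit, len(m.group(0)) // unit_len))
    return sorted(hits)


example = "GG" + "AT" * 6 + "C" + "A" * 10 + "G"
print(find_ssrs(example))
```

    Positions are 0-based. Dedicated tools also handle imperfect and compound repeats, which this sketch ignores.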

  8. Initial sequencing and comparative analysis of the mouse genome

    Energy Technology Data Exchange (ETDEWEB)

    Waterston, Robert H.; Lindblad-Toh, Kerstin; Birney, Ewan; Rogers, Jane; Abril, Josep F.; Agarwal, Pankaj; Agarwala, Richa; Ainscough, Rachel; Alexandersson, Marina; An, Peter; Antonarakis, Stylianos E.; Attwood, John; Baertsch, Robert; Bailey, Jonathon; Barlow, Karen; Beck, Stephan; Berry, Eric; Birren, Bruce; Bloom, Toby; Bork, Peer; Botcherby, Marc; Bray, Nicolas; Brent, Michael R.; Brown, Daniel G.; Brown, Stephen D.; Bult, Carol; Burton, John; Butler, Jonathan; Campbell, Robert D.; Carninci, Piero; Cawley, Simon; Chiaromonte, Francesca; Chinwalla, Asif T.; Church, Deanna M.; Clamp, Michele; Clee, Christopher; Collins, Francis S.; Cook, Lisa L.; Copley, Richard R.; Coulson, Alan; Couronne, Olivier; Cuff, James; Curwen, Val; Cutts, Tim; Daly, Mark; David, Robert; Davies, Joy; Delehaunty, Kimberly D.; Deri, Justin; Dermitzakis, Emmanouil T.; Dewey, Colin; Dickens, Nicholas J.; Diekhans, Mark; Dodge, Sheila; Dubchak, Inna; Dunn, Diane M.; Eddy, Sean R.; Elnitski, Laura; Emes, Richard D.; Eswara, Pallavi; Eyras, Eduardo; Felsenfeld, Adam; Fewell, Ginger A.; Flicek, Paul; Foley, Karen; Frankel, Wayne N.; Fulton, Lucinda A.; Fulton, Robert S.; Furey, Terrence S.; Gage, Diane; Gibbs, Richard A.; Glusman, Gustavo; Gnerre, Sante; Goldman, Nick; Goodstadt, Leo; Grafham, Darren; Graves, Tina A.; Green, Eric D.; Gregory, Simon; Guigo, Roderic; Guyer, Mark; Hardison, Ross C.; Haussler, David; Hayashizaki, Yoshihide; Hillier, LaDeana W.; Hinrichs, Angela; Hlavina, Wratko; Holzer, Timothy; Hsu, Fan; Hua, Axin; Hubbard, Tim; Hunt, Adrienne; Jackson, Ian; Jaffe, David B.; Johnson, L. Steven; Jones, Matthew; Jones, Thomas A.; Joy, Ann; Kamal, Michael; Karlsson, Elinor K.; Karolchik, Donna; Kasprzyk, Arkadiusz; Kawai, Jun; Keibler, Evan; Kells, Cristyn; Kent, W. James; Kirby, Andrew; Kolbe, Diana L.; Korf, Ian; Kucherlapati, Raju S.; Kulbokas III, Edward J.; Kulp, David; Landers, Tom; Leger, J.P.; Leonard, Steven; Letunic, Ivica; Levine, Rosie; et al.

    2002-12-15

    The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

  9. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis)

    OpenAIRE

    Si-Bei Li; Wei-Zhi Ouyang; Xiao-Jin Hou; Liang-Liang Xie; Chun-Gen Hu; Jin-Zhi Zhang

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologues of auxin response genes. A total of 19 non-redundant ARF genes (CiARF) were identified and validated from the sweet orange genome. A c...

  10. Genome sequence of Haemophilus parasuis strain 29755

    OpenAIRE

    Mullins, Michael A.; Register, Karen B.; Bayles, Darrell O; Dyer, David W.; Joanna S Kuehn; Phillips, Gregory J.

    2011-01-01

    Haemophilus parasuis is a member of the family Pasteurellaceae and is the etiologic agent of Glässer’s disease in pigs, a systemic syndrome associated with only a subset of isolates. The genetic basis for virulence and systemic spread of particular H. parasuis isolates is currently unknown. Strain 29755 is an invasive isolate that has long been used in the study of Glässer’s disease. Accordingly, the genome sequence of strain 29755 is of considerable importance to investigators endeavoring to...

  11. Transferability of the EST-SSRs developed on Nules clementine (Citrus clementina Hort ex Tan) to other Citrus species and their effectiveness for genetic mapping

    OpenAIRE

    Ollitrault Patrick; Talon Manuel; Wincker Patrick; Allario Thierry; Argout Xavier; Terol Javier; Costantino Gilles; Luro François L; Morillon Raphael

    2008-01-01

    Background: During the last decade, numerous microsatellite markers were developed for genotyping and to identify closely related plant genotypes. In citrus, previously developed microsatellite markers arose from genomic libraries and were more often located in non-coding DNA sequences. To optimize the use of these EST-SSRs as genetic markers in genome mapping programs and citrus systematic analysis, we have investigated their polymorphism related to the type (di or trinucleotide) or...

  12. Building the sequence map of the human pan-genome

    DEFF Research Database (Denmark)

    Li, Ruiqiang; Li, Yingrui; Zheng, Hancheng;

    2009-01-01

    Here we integrate the de novo assembly of an Asian and an African genome with the NCBI reference human genome, as a step toward constructing the human pan-genome. We identified approximately 5 Mb of novel sequences not present in the reference genome in each of these assemblies. Most novel sequences are individual or population specific, as revealed by their comparison to all available human DNA sequence and by PCR validation using the human genome diversity cell line panel. We found novel sequences present in patterns consistent with known human migration paths. Cross-species conservation...

  13. Insights from 20 years of bacterial genome sequencing

    DEFF Research Database (Denmark)

    Land, Miriam; Hauser, Loren; Jun, Se-Ran;

    2015-01-01

    Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genome sequences is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in...

  14. Draft Genome Sequence of Alternaria alternata ATCC 34957.

    Science.gov (United States)

    Nguyen, Hai D T; Lewis, Christopher T; Lévesque, C André; Gräfenhan, Tom

    2016-01-01

    We report the draft genome sequence of Alternaria alternata ATCC 34957. This strain was previously reported to produce alternariol and alternariol monomethyl ether on weathered grain sorghum. The genome was sequenced with PacBio technology and assembled into 27 scaffolds with a total genome size of 33.5 Mb. PMID:26769939

  15. Draft Genome Sequence of Fungus Clonostachys rosea Strain YKD0085.

    Science.gov (United States)

    Liu, Shuai; Chang, Yaowen; Hu, Xujia; Gong, Xuanyun; Di, Yingtong; Dong, Jinyan; Hao, Xiaojiang

    2016-01-01

    Here, we report the draft genome sequence of Clonostachys rosea (strain YKD0085). The functional annotation of C. rosea provides important information related to its ability to produce secondary metabolites. The genome sequence presented here builds the basis for further genome mining. PMID:27340057

  16. Complete Genome Sequence of Staphylococcus aureus Siphovirus Phage JS01

    OpenAIRE

    Jia, Hongying; Bai, Qinqin; Yang, Yongchun; Yao, Huochun

    2013-01-01

    Staphylococcus aureus is the most prevalent and economically significant pathogen causing bovine mastitis. We isolated and characterized one staphylophage from the milk of mastitis-affected cattle and sequenced its genome. Transmission electron microscopy (TEM) observation shows that it belongs to the family Siphovirus. We announce here its complete genome sequence and report major findings from the genomic analysis.

  17. First Draft Genome Sequence of Staphylococcus condimenti F-2T

    Science.gov (United States)

    Zheng, Beiwen; Hu, Xinjun; Jiang, Xiawei; Li, Ang; Yao, Jian

    2016-01-01

    This report describes the draft genome sequence of S. condimenti strain F-2T (DSM 11674), a potential starter culture. The genome assembly comprised 2,616,174 bp with 34.6% GC content. To the best of our knowledge, this is the first documentation that reports the whole-genome sequence of S. condimenti. PMID:27257207

  18. Draft Genome Sequence of Streptomyces hygroscopicus subsp. hygroscopicus NBRC 16556.

    Science.gov (United States)

    Komaki, Hisayuki; Ichikawa, Natsuko; Oguchi, Akio; Hamada, Moriyuki; Tamura, Tomohiko; Suzuki, Ken-Ichiro; Fujita, Nobuyuki

    2016-01-01

    Here, we report the draft genome sequence of strain NBRC 16556, deposited as Streptomyces hygroscopicus subsp. hygroscopicus into the NBRC culture collection. An average nucleotide identity analysis confirmed that the taxonomic identification is correct. The genome sequence will serve as a valuable reference for genome mining to search new secondary metabolites. PMID:27198007

  19. Whole-Genome Shotgun Sequencing of a Colonizing Multilocus Sequence Type 17 Streptococcus agalactiae Strain

    Science.gov (United States)

    Singh, Pallavi; Springman, A. Cody; Davies, H. Dele

    2012-01-01

    This report highlights the whole-genome shotgun draft sequence for a Streptococcus agalactiae strain representing multilocus sequence type (ST) 17, isolated from a colonized woman at 8 weeks postpartum. This sequence represents an important addition to the published genomes and will promote comparative genomic studies of S. agalactiae recovered from diverse sources. PMID:23045509

  20. Whole-Genome Shotgun Sequencing of a Colonizing Multilocus Sequence Type 17 Streptococcus agalactiae Strain

    OpenAIRE

    Singh, Pallavi; Springman, A. Cody; Davies, H Dele; Manning, Shannon D.

    2012-01-01

    This report highlights the whole-genome shotgun draft sequence for a Streptococcus agalactiae strain representing multilocus sequence type (ST) 17, isolated from a colonized woman at 8 weeks postpartum. This sequence represents an important addition to the published genomes and will promote comparative genomic studies of S. agalactiae recovered from diverse sources.

  1. Genome Sequence of Stachybotrys chartarum Strain 51-11

    OpenAIRE

    Betancourt, Doris A; Dean, Timothy R.; Kim, Jean; Levy, Josh

    2015-01-01

    The Stachybotrys chartarum strain 51-11 genome was sequenced by shotgun sequencing utilizing Illumina HiSeq 2000 and PacBio technologies. Since S. chartarum has been implicated as having health impacts within water-damaged buildings, any information extracted from the genomic sequence data relating to toxins or the metabolism of the fungus might be useful.

  2. First Complete Genome Sequence of Cherry virus A.

    Science.gov (United States)

    Koinuma, Hiroaki; Nijo, Takamichi; Iwabuchi, Nozomu; Yoshida, Tetsuya; Keima, Takuya; Okano, Yukari; Maejima, Kensaku; Yamaji, Yasuyuki; Namba, Shigetou

    2016-01-01

    The 5'-terminal genomic sequence of Cherry virus A (CVA) has long been unknown. We determined the first complete genome sequence of an apricot isolate of CVA (7,434 nucleotides [nt]). The 5'-untranslated region was 107 nt in length, which was 53 nt longer than those of known CVA sequences. PMID:27284130

  3. Complete Genome Sequence of Rift Valley Fever Virus Strain Lunyo

    OpenAIRE

    Lumley, Sarah; Horton, Daniel L.; Marston, Denise A.; Johnson, Nicholas; Ellis, Richard J.; Fooks, Anthony R.; Hewson, Roger

    2016-01-01

    Using next-generation sequencing technologies, the first complete genome sequence of Rift Valley fever virus strain Lunyo is reported here. Originally reported as an attenuated antigenic variant strain from Uganda, genomic sequence analysis shows that Lunyo clusters together with other Ugandan isolates.

  4. Draft genome sequence of the sexually transmitted pathogen Trichomonas vaginalis

    DEFF Research Database (Denmark)

    Carlton, Jane M.; Hirt, Robert P.; Silva, Joana C.;

    2007-01-01

    We describe the genome sequence of the protist Trichomonas vaginalis, a sexually transmitted human pathogen. Repeats and transposable elements comprise about two-thirds of the approximately 160-megabase genome, reflecting a recent massive expansion of genetic material. This expansion...

  5. Draft Genome Sequence of Brevibacterium massiliense Strain 541308T

    OpenAIRE

    Roux, Véronique; Robert, Catherine; Gimenez, Grégory; Raoult, Didier

    2012-01-01

    A draft genome sequence of Brevibacterium massiliense, an aerobic bacterium isolated from a human ankle discharge, is described here. CRISPR-associated proteins were found to be encoded in the genome, and analysis of transport proteins was performed.

  6. First complete genome sequence of infectious laryngotracheitis virus

    Directory of Open Access Journals (Sweden)

    Ficorilli Nino P

    2011-04-01

    Background: Infectious laryngotracheitis virus (ILTV) is an alphaherpesvirus that causes acute respiratory disease in chickens worldwide. To date, only one complete genomic sequence of ILTV has been reported. This sequence was generated by concatenating partial sequences from six different ILTV strains. Thus, the full genomic sequence of a single (individual) strain of ILTV has not been determined previously. This study aimed to use high throughput sequencing technology to determine the complete genomic sequence of a live attenuated vaccine strain of ILTV. Results: The complete genomic sequence of the Serva vaccine strain of ILTV was determined, annotated and compared to the concatenated ILTV reference sequence. The genome size of the Serva strain was 152,628 bp, with a G + C content of 48%. A total of 80 predicted open reading frames were identified. The Serva strain had 96.5% DNA sequence identity with the concatenated ILTV sequence. Notably, the concatenated ILTV sequence was found to lack four large regions of sequence, including 528 bp and 594 bp of sequence in the UL29 and UL36 genes, respectively, and two copies of a 1,563 bp sequence in the repeat regions. Considerable differences in the size of the predicted translation products of 4 other genes (UL54, UL30, UL37 and UL38) were also identified. More than 530 single-nucleotide polymorphisms (SNPs) were identified. Most SNPs were located within three genomic regions, corresponding to sequence from the SA-2 ILTV vaccine strain in the concatenated ILTV sequence. Conclusions: This is the first complete genomic sequence of an individual ILTV strain. This sequence will facilitate future comparative genomic studies of ILTV by providing an appropriate reference sequence for the sequence analysis of other ILTV strains.

  7. First complete genome sequence of infectious laryngotracheitis virus

    OpenAIRE

    Ficorilli Nino P; Browning Glenn F; Petermann Ivonne; Noormohammadi Amir H; Markham John F; Markham Philip F; Lee Sang-Won; Hartley Carol A; Devlin Joanne M

    2011-01-01

    Background: Infectious laryngotracheitis virus (ILTV) is an alphaherpesvirus that causes acute respiratory disease in chickens worldwide. To date, only one complete genomic sequence of ILTV has been reported. This sequence was generated by concatenating partial sequences from six different ILTV strains. Thus, the full genomic sequence of a single (individual) strain of ILTV has not been determined previously. This study aimed to use high throughput sequencing technology to determine t...

  8. Coevolution between simple sequence repeats (SSRs) and virus genome size

    Directory of Open Access Journals (Sweden)

    Zhao Xiangyan

    2012-08-01

    Background: The relationship between the level of repetitiveness in genomic sequence and genome size has been investigated using complete prokaryotic and eukaryotic genomes, but relevant studies have rarely been made in virus genomes. Results: In this study, a total of 257 viruses were examined, which cover 90% of genera. The results showed that the content of simple sequence repeats (SSRs) is strongly, positively and significantly correlated with genome size. Each repeat class is distributed within a certain range of genome sequence length. Mono-, di- and tri- repeats are widely distributed in all virus genomes; tetra- SSRs are a common component of genomes more than 100 kb in size; in the range of genome ... Conclusions: We conducted this research at the level of viruses as a whole. We concluded that genome size is an important factor affecting the occurrence of SSRs; hosts are also responsible, to a certain degree, for the variance in SSR content.

  9. Whole Genome Sequencing: Innovation Dream or Privacy Nightmare?

    OpenAIRE

    De Cristofaro, Emiliano

    2012-01-01

    Over the past several years, DNA sequencing has emerged as one of the driving forces in life-sciences, paving the way for affordable and accurate whole genome sequencing. As genomes represent the entirety of an organism's hereditary information, the availability of complete human genomes prompts a wide range of revolutionary applications. The hope for improving modern healthcare and better understanding the human genome propels many interesting and challenging research frontiers. Unfortunatel...

  10. Draft Genome Sequences of Klebsiella variicola Plant Isolates

    OpenAIRE

    Martínez-Romero, Esperanza; Silva-Sanchez, Jesús; Barrios, Humberto; Rodríguez-Medina, Nadia; Martínez-Barnetche, Jesús; Téllez-Sosa, Juan; Gómez-Barreto, Rosa Elena; Garza-Ramos, Ulises

    2015-01-01

    Three endophytic Klebsiella variicola isolates—T29A, 3, and 6A2, obtained from sugar cane stem, maize shoots, and banana leaves, respectively—were used for whole-genome sequencing. Here, we report the draft genome sequences of circular chromosomes and plasmids. The genomes contain plant colonization and cellulases genes. This study will help toward understanding the genomic basis of K. variicola interaction with plant hosts.

  11. Integration of new alternative reference strain genome sequences into the Saccharomyces genome database.

    Science.gov (United States)

    Song, Giltae; Balakrishnan, Rama; Binkley, Gail; Costanzo, Maria C; Dalusag, Kyla; Demeter, Janos; Engel, Stacia; Hellerstedt, Sage T; Karra, Kalpana; Hitz, Benjamin C; Nash, Robert S; Paskov, Kelley; Sheppard, Travis; Skrzypek, Marek; Weng, Shuai; Wong, Edith; Michael Cherry, J

    2016-01-01

    The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. To provide a wider scope of genetic and phenotypic variation in yeast, the genome sequences and their corresponding annotations from 11 alternative S. cerevisiae reference strains have been integrated into SGD. Genomic and protein sequence information for genes from these strains is now available on the Sequence and Protein tab of the corresponding Locus Summary pages. We illustrate how these genome sequences can be utilized to aid our understanding of strain-specific functional and phenotypic differences. Database URL: www.yeastgenome.org. PMID:27252399

  12. Next-generation sequencing and large genome assemblies

    OpenAIRE

    Henson, Joseph; Tischler, German; Ning, Zemin

    2012-01-01

    The next-generation sequencing (NGS) revolution has drastically reduced time and cost requirements for sequencing of large genomes, and also qualitatively changed the problem of assembly. This article reviews the state of the art in de novo genome assembly, paying particular attention to mammalian-sized genomes. The strengths and weaknesses of the main sequencing platforms are highlighted, leading to a discussion of assembly and the new challenges associated with NGS data. Current approaches ...

  13. Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus Saccharum

    OpenAIRE

    Grativol, Clícia; Regulski, Michael; Bertalan, Marcelo; McCombie, W Richard; da Silva, Felipe Rodrigues; Neto, Adhemar Zerlotini; Vicentini, Renato; Farinelli, Laurent; Hemerly, Adriana Silva; Martienssen, Robert A; Ferreira, Paulo Cavalcanti Gomes

    2014-01-01

    Many economically important crops have large and complex genomes, which hampers sequencing of their genome by standard methods such as WGS. Large tracts of methylated repeats occur at plant genomes interspersed by hypomethylated gene-rich regions. Gene enrichment strategies based on methylation profile offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration (MF) with McrBC digestion to enrich for euchromatic regions of sugarcane genome. To verify the eff...

  14. Insights from twenty years of bacterial genome sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Jun, Se Ran [ORNL; Nookaew, Intawat [ORNL; Leuze, Michael Rex [ORNL; Ahn, Tae-Hyuk [ORNL; Karpinets, Tatiana V [ORNL; Lund, Ole [Technical University of Denmark; Kora, Guruprasad H [ORNL; Wassenaar, Trudy [Molecular Microbiology & Genomics Consultants, Zotzenheim, Germany; Poudel, Suresh [ORNL; Ussery, David W [ORNL

    2015-01-01

    Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genomes is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in less characterized taxonomic groups. The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system provides bacteria with immunity against viruses, which outnumber bacteria by tenfold. How fast can we go? Second-generation sequencing has produced a large number of draft genomes (close to 90% of bacterial genomes in GenBank are currently not complete); third-generation sequencing can potentially produce a finished genome in a few hours, and at the same time provide methylation sites along the entire chromosome. The diversity of bacterial communities is extensive as is evident from the genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. Genome sequencing can help in classifying an organism, and in the case where multiple genomes of the same species are available, it is possible to calculate the pan- and core genomes; comparison of more than 2000 Escherichia coli genomes finds an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families. Why do we care about bacterial genome...
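The pan- and core-genome calculation mentioned in the abstract above reduces to set operations once each genome is represented as a set of gene-family identifiers. The sketch below assumes that clustering of genes into families has already been done; the function name `pan_and_core` and the toy family names are hypothetical.

```python
def pan_and_core(genomes):
    """genomes: dict mapping genome name -> set of gene-family identifiers.

    Returns (pan, core): the union and the intersection of all family sets.
    """
    families = list(genomes.values())
    pan = set().union(*families)                            # families seen in any genome
    core = set.intersection(*families) if families else set()  # families seen in every genome
    return pan, core

# Toy input: three genomes sharing a single family.
toy = {
    "g1": {"famA", "famB", "famC"},
    "g2": {"famA", "famB", "famD"},
    "g3": {"famA", "famC", "famE"},
}
pan, core = pan_and_core(toy)
```

With these toy sets the pan-genome holds five families and the core genome one. As more genomes are added, the core can only stay the same or shrink while the pan-genome can only stay the same or grow, which matches the pattern the abstract reports for E. coli.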

  15. Genome Project Standards in a New Era of Sequencing

    Energy Technology Data Exchange (ETDEWEB)

    GSC Consortia; HMP Jumpstart Consortia; Chain, P. S. G.; Grafham, D. V.; Fulton, R. S.; FitzGerald, M. G.; Hostetler, J.; Muzny, D.; Detter, J. C.; Ali, J.; Birren, B.; Bruce, D. C.; Buhay, C.; Cole, J. R.; Ding, Y.; Dugan, S.; Field, D.; Garrity, G. M.; Gibbs, R.; Graves, T.; Han, C. S.; Harrison, S. H.; Highlander, S.; Hugenholtz, P.; Khouri, H. M.; Kodira, C. D.; Kolker, E.; Kyrpides, N. C.; Lang, D.; Lapidus, A.; Malfatti, S. A.; Markowitz, V.; Metha, T.; Nelson, K. E.; Parkhill, J.; Pitluck, S.; Qin, X.; Read, T. D.; Schmutz, J.; Sozhamannan, S.; Strausberg, R.; Sutton, G.; Thomson, N. R.; Tiedje, J. M.; Weinstock, G.; Wollam, A.

    2009-06-01

    For over a decade, genome sequences have adhered to only two standards that are relied on for purposes of sequence analysis by interested third parties (1, 2). However, ongoing developments in revolutionary sequencing technologies have resulted in a redefinition of traditional whole genome sequencing that requires a careful reevaluation of such standards. With commercially available 454 pyrosequencing (followed by Illumina, SOLiD, and now Helicos), there has been an explosion of genomes sequenced under the moniker 'draft'; however, these can be very poor quality genomes (due to inherent errors in the sequencing technologies, and the inability of assembly programs to fully address these errors). Further, one can only infer that such draft genomes may be of poor quality by navigating through the databases to find the number and type of reads deposited in sequence trace repositories (and not all genomes have this available), or to identify the number of contigs or genome fragments deposited to the database. The difficulty in assessing the quality of such deposited genomes has created some havoc for genome analysis pipelines and contributed to many wasted hours of (mis)interpretation. These same novel sequencing technologies have also brought an exponential leap in raw sequencing capability, and at greatly reduced prices that have further skewed the time- and cost-ratios of draft data generation versus the painstaking process of improving and finishing a genome. The resulting effect is an ever-widening gap between drafted and finished genomes that only promises to continue (Figure 1), hence there is an urgent need to distinguish good and poor datasets. The sequencing institutes in the authorship, along with the NIH's Human Microbiome Project Jumpstart Consortium (3), strongly believe that a new set of standards is required for genome sequences. The following represents a set of six community-defined categories of genome sequence standards that better

  16. Genome Wide Characterization of Simple Sequence Repeats in Cucumber

    Science.gov (United States)

    The whole genome sequence of the cucumber cultivar Gy14 was recently sequenced at 15× coverage with the Roche 454 Titanium technology. The microsatellite DNA sequences (simple sequence repeats, SSRs) in the assembled scaffolds were computationally explored and characterized. A total of 112,073 SSRs ...

  17. Why size really matters when sequencing plant genomes

    Czech Academy of Sciences Publication Activity Database

    Kelly, L.J.; Leitch, A.R.; Fay, M. F.; Renny-Byfield, S.; Pellicer, J.; Macas, Jiří; Leitch, I.J.

    2012-01-01

    Vol. 5, No. 4 (2012), pp. 415-425. ISSN 1755-0874 Institutional research plan: CEZ:AV0Z50510513 Institutional support: RVO:60077344 Keywords: C-value * genome assembly * genome size evolution * genome sequencing Subject RIV: EB - Genetics; Molecular Biology Impact factor: 0.924, year: 2012

  18. Genome Sequence of Mushroom Soft-Rot Pathogen Janthinobacterium agaricidamnosum.

    Science.gov (United States)

    Graupner, Katharina; Lackner, Gerald; Hertweck, Christian

    2015-01-01

    Janthinobacterium agaricidamnosum causes soft-rot disease of the cultured button mushroom Agaricus bisporus and is thus responsible for agricultural losses. Here, we present the genome sequence of J. agaricidamnosum DSM 9628. The 5.9-Mb genome harbors several secondary metabolite biosynthesis gene clusters, which renders this neglected bacterium a promising source for genome mining approaches. PMID:25883287

  19. Genome Sequence of Mushroom Soft-Rot Pathogen Janthinobacterium agaricidamnosum

    OpenAIRE

    Graupner, Katharina; Lackner, Gerald; Hertweck, Christian

    2015-01-01

    Janthinobacterium agaricidamnosum causes soft-rot disease of the cultured button mushroom Agaricus bisporus and is thus responsible for agricultural losses. Here, we present the genome sequence of J. agaricidamnosum DSM 9628. The 5.9-Mb genome harbors several secondary metabolite biosynthesis gene clusters, which renders this neglected bacterium a promising source for genome mining approaches.

  20. Nucleotide sequence and genome organization of carnation mottle virus RNA.

    OpenAIRE

    Guilley, H; Carrington, J C; Balàzs, E; Jonard, G; Richards, K; Morris, T J

    1985-01-01

    The complete nucleotide sequence of carnation mottle genomic RNA (4003 nucleotides) is presented. The sequence was determined for cloned cDNA copies of viral RNA containing over 99% of the sequence and was completed by direct sequence analysis of RNA and cDNA transcripts. The sequence contains two long open reading frames which together can account for observed translation products. One translation product would arise by suppression of an amber termination codon and the sequence raises the po...

  1. Sequence resources at the Candida Genome Database

    OpenAIRE

    Arnaud, Martha B.; Costanzo, Maria C.; Skrzypek, Marek S.; Shah, Prachi; Binkley, Gail; Lane, Christopher; Miyasato, Stuart R.; Sherlock, Gavin

    2006-01-01

    The Candida Genome Database (CGD) contains a curated collection of genomic information and community resources for researchers who are interested in the molecular biology of the opportunistic pathogen Candida albicans. With the recent release of a new assembly of the C. albicans genome, Assembly 20, C. albicans genomics has entered a new era. Although the C. albicans genome assembly continues to undergo refinement, multiple assemblies and gene nomenclatures will remain in widespread use by the...

  2. Complete Genome Sequence of the Human Gut Symbiont Roseburia hominis

    DEFF Research Database (Denmark)

    Travis, Anthony J.; Kelly, Denise; Flint, Harry J; Aminov, Rustam

    2015-01-01

    We report here the complete genome sequence of the human gut symbiont Roseburia hominis A2-183(T) (= DSM 16839(T) = NCIMB 14029(T)), isolated from human feces. The genome is represented by a 3,592,125-bp chromosome with 3,405 coding sequences. A number of potential functions contributing to host...

  3. Draft Genome Sequence of the Fish Pathogen Piscirickettsia salmonis

    OpenAIRE

    Eppinger, Mark; McNair, Katelyn; Zogaj, Xhavit; Dinsdale, Elizabeth A.; Edwards, Robert A.; Klose, Karl E.

    2013-01-01

    Piscirickettsia salmonis is a Gram-negative intracellular fish pathogen that has a significant impact on the salmon industry. Here, we report the genome sequence of P. salmonis strain LF-89. This is the first draft genome sequence of P. salmonis, and it reveals interesting attributes, including flagellar genes, despite this bacterium being considered nonmotile.

  4. Draft Genome Sequence of the Fish Pathogen Piscirickettsia salmonis.

    Science.gov (United States)

    Eppinger, Mark; McNair, Katelyn; Zogaj, Xhavit; Dinsdale, Elizabeth A; Edwards, Robert A; Klose, Karl E

    2013-01-01

    Piscirickettsia salmonis is a Gram-negative intracellular fish pathogen that has a significant impact on the salmon industry. Here, we report the genome sequence of P. salmonis strain LF-89. This is the first draft genome sequence of P. salmonis, and it reveals interesting attributes, including flagellar genes, despite this bacterium being considered nonmotile. PMID:24201203

  5. Biased distribution of DNA uptake sequences towards genome maintenance genes

    DEFF Research Database (Denmark)

    Davidsen, T.; Rodland, E.A.; Lagesen, K.;

    2004-01-01

    Repeated sequence signatures are characteristic features of all genomic DNA. We have made a rigorous search for repeat genomic sequences in the human pathogens Neisseria meningitidis, Neisseria gonorrhoeae and Haemophilus influenzae and found that by far the most frequent 9-10mers residing within...

  6. Complete genome sequence of ‘Candidatus Liberibacter africanus’

    Science.gov (United States)

    The complete genome sequence of ‘Candidatus Liberibacter africanus’ (Laf), strain ptsapsy, was obtained by an Illumina HiSeq 2000. The Laf genome comprises 1,192,232 nucleotides, 34.5% GC content, 1,141 predicted coding sequences, 44 tRNAs, 3 complete copies of ribosomal RNA genes (16S, 23S and 5S) ...

  7. Draft Genome Sequence of the Wolbachia Endosymbiont of Drosophila suzukii

    OpenAIRE

    Siozios, Stefanos; Cestaro, Alessandro; Kaur, Rupinder; Pertot, Ilaria; Rota-Stabelli, Omar; Anfora, Gianfranco

    2013-01-01

    Wolbachia is one of the most successful and abundant symbiotic bacteria in nature, infecting more than 40% of the terrestrial arthropod species. Here we report the draft genome sequence of a novel Wolbachia strain named “wSuzi” that was retrieved from the genome sequencing of its host, the invasive pest Drosophila suzukii.

  8. Draft Genome Sequence of the Wolbachia Endosymbiont of Drosophila suzukii.

    Science.gov (United States)

    Siozios, Stefanos; Cestaro, Alessandro; Kaur, Rupinder; Pertot, Ilaria; Rota-Stabelli, Omar; Anfora, Gianfranco

    2013-01-01

    Wolbachia is one of the most successful and abundant symbiotic bacteria in nature, infecting more than 40% of the terrestrial arthropod species. Here we report the draft genome sequence of a novel Wolbachia strain named "wSuzi" that was retrieved from the genome sequencing of its host, the invasive pest Drosophila suzukii. PMID:23472225

  9. Draft Genome Sequence of Klebsiella pneumoniae Isolate PR04

    OpenAIRE

    Zulkifli, M. H.; L. K. Teh; L. S. Lee; Z. A. Zakaria; Salleh, M. Z.

    2013-01-01

    Klebsiella pneumoniae PR04 was isolated from a patient hospitalized in Malaysia. The draft genome sequence of K. pneumoniae PR04 shows differences compared to the reference sequences of K. pneumoniae strains MGH 78578 and NTUH-K2044 in terms of their genomic structures.

  10. The carrot genome sequence brings colors out of the dark.

    Science.gov (United States)

    Garcia-Mas, Jordi; Rodriguez-Concepcion, Manuel

    2016-05-27

    The genome sequence of carrot (Daucus carota L.) is the first completed for an Apiaceae species, furthering knowledge of the evolution of the important euasterid II clade. Analyzing the whole-genome sequence allowed for the identification of a gene that may regulate the accumulation of carotenoids in the root. PMID:27230684

  11. Complete Genome Sequences of Five Paenibacillus larvae Bacteriophages.

    Science.gov (United States)

    Sheflo, Michael A; Gardner, Adam V; Merrill, Bryan D; Fisher, Joshua N B; Lunt, Bryce L; Breakwell, Donald P; Grose, Julianne H; Burnett, Sandra H

    2013-01-01

    Paenibacillus larvae is a pathogen of honeybees that causes American foulbrood (AFB). We isolated bacteriophages from soil containing bee debris collected near beehives in Utah. We announce five high-quality complete genome sequences, which represent the first completed genome sequences submitted to GenBank for any P. larvae bacteriophage. PMID:24233582

  12. Genome sequence of Kocuria palustris strain W4

    DEFF Research Database (Denmark)

    Herschend, Jakob; Raghupathi, Prem Krishnan; Røder, Henriette Lyng;

    2016-01-01

    We report the 3.09 Mb draft genome sequence of Kocuria palustris W4, isolated from a slaughterhouse in Denmark.

  13. Nearly Complete Genome Sequence of Lactobacillus plantarum Strain NIZO2877

    NARCIS (Netherlands)

    Martino, M.E.; Bayjanov, J.R.; Joncour, P.; Hughes, S.; Gillet, B.; Kleerebezem, M; Siezen, R.; Hijum, S.A.F.T. van; Leulier, F.

    2015-01-01

    Lactobacillus plantarum is a versatile bacterial species that is isolated mostly from foods. Here, we present the first genome sequence of L. plantarum strain NIZO2877 isolated from a hot dog in Vietnam. Its two contigs represent a nearly complete genome sequence.

  14. On the current status of Phakopsora pachyrhizi genome sequencing

    Directory of Open Access Journals (Sweden)

    Marco Loehrer

    2014-08-01

    Full Text Available Recent advances in the field of sequencing technologies and bioinformatics allow more rapid access to genomes of non-model organisms at falling costs. Accordingly, draft genomes of several economically important cereal rust fungi have been released in the last three years. Aside from the very recent flax rust and poplar rust draft assemblies there are no genomic data available for other dicot-infecting rust fungi. In this article we outline rust fungus sequencing efforts and comment on the current status of Phakopsora pachyrhizi (Asian soybean rust) genome sequencing.

  15. Unexpected cross-species contamination in genome sequencing projects

    Directory of Open Access Journals (Sweden)

    Samier Merchant

    2014-11-01

    Full Text Available The raw data from a genome sequencing project sometimes contains DNA from contaminating organisms, which may be introduced during sample collection or sequence preparation. In some instances, these contaminants remain in the sequence even after assembly and deposition of the genome into public databases. As a result, searches of these databases may yield erroneous and confusing results. We used efficient microbiome analysis software to scan the draft assembly of domestic cow, Bos taurus, and identify 173 small contigs that appeared to derive from microbial contaminants. In the course of verifying these findings, we discovered that one genome, Neisseria gonorrhoeae TCDC-NG08107, although putatively a complete genome, contained multiple sequences that actually derived from the cow and sheep genomes. Our findings illustrate the need to carefully validate findings of anomalous DNA that rely on comparisons to either draft or finished genomes.

  16. Minimum taxonomic criteria for bacterial genome sequence depositions and announcements.

    Science.gov (United States)

    Bull, Matthew J; Marchesi, Julian R; Vandamme, Peter; Plummer, Sue; Mahenthiralingam, Eshwar

    2012-04-01

    Multiple bioinformatic methods are available to analyse the information encoded within the complete genome sequence of a bacterium and accurately assign its species status or nearest phylogenetic neighbour. However, it is clear that even now in what is the third decade of bacterial genomics, taxonomically incorrect genome sequence depositions are still being made. We outline a simple scheme of bioinformatic analysis and a set of minimum criteria that should be applied to all bacterial genomic data to ensure that they are accurately assigned to the species or genus level prior to database deposition. To illustrate the utility of the bioinformatic workflow, we analysed the recently deposited genome sequence of Lactobacillus acidophilus 30SC and demonstrated that this DNA was in fact derived from a strain of Lactobacillus amylovorus. Using these methods researchers can ensure that the taxonomic accuracy of genome sequence depositions is maintained within the ever increasing nucleic acid datasets. PMID:22366464

  17. Genome sequencing and annotation of Serratia sp. strain TEL.

    Science.gov (United States)

    Lephoto, Tiisetso E; Gray, Vincent M

    2015-12-01

    We present the annotation of the draft genome sequence of Serratia sp. strain TEL (GenBank accession number KP711410). This organism was isolated from entomopathogenic nematode Oscheius sp. strain TEL (GenBank accession number KM492926) collected from grassland soil and has a genome size of 5,000,541 bp and 542 subsystems. The genome sequence can be accessed at DDBJ/EMBL/GenBank under the accession number LDEG00000000. PMID:26697332

  18. Comparative Copy Number Variation From Whole Genome Sequencing

    OpenAIRE

    Janevski, A.; Varadan, V.; Kamalakaran, S.; Banerjee, N.; Dimitrova, D

    2011-01-01

    Whole genome sequencing enables a high resolution view of the human genome and enables unique insights into copy number variations in an unprecedented scale. Numerous tools and studies have already been introduced that provide confirmatory and new genomic variability data in individuals and across populations. We investigate two such methods, CNV-seq and FREEC and compare their outputs when applied to five whole genome sequences representing four populations. We focus on the ability of these tool...

  19. Whole-genome sequence-based analysis of thyroid function

    DEFF Research Database (Denmark)

    Taylor, Peter N.; Porcu, Eleonora; Chew, Shelby;

    2015-01-01

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N = 2,287). Using additional whole-genome...... association with FT4 in NRG1. Our results demonstrate that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function....

  20. Marsupial Genome Sequences: Providing Insight into Evolution and Disease

    OpenAIRE

    Deakin, Janine E.

    2012-01-01

    Marsupials (metatherians), with their position in vertebrate phylogeny and their unique biological features, have been studied for many years by a dedicated group of researchers, but it has only been since the sequencing of the first marsupial genome that their value has been more widely recognised. We now have genome sequences for three distantly related marsupial species (the grey short-tailed opossum, the tammar wallaby, and Tasmanian devil), with the promise of many more genomes to be seq...

  1. Whole-genome sequencing in bacteriology: state of the art

    OpenAIRE

    Dark, Michael

    2013-01-01

    Michael J Dark. Department of Infectious Diseases and Pathology and Emerging Pathogens Institute, University of Florida, Gainesville, FL, USA. Abstract: Over the last ten years, genome sequencing capabilities have expanded exponentially. There have been tremendous advances in sequencing technology, DNA sample preparation, genome assembly, and data analysis. This has led to advances in a number of facets of bacterial genomics, including metagenomics, clinical medicine, bacterial archaeology, and b...

  2. Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities

    OpenAIRE

    Chen, Kevin; Pachter, Lior

    2005-01-01

    The application of whole-genome shotgun sequencing to microbial communities represents a major development in metagenomics, the study of uncultured microbes via the tools of modern genomic analysis. In the past year, whole-genome shotgun sequencing projects of prokaryotic communities from an acid mine biofilm, the Sargasso Sea, Minnesota farm soil, three deep-sea whale falls, and deep-sea sediments have been reported, adding to previously published work on viral communities from marine and fe...

  3. Perspectives of Integrative Cancer Genomics in Next Generation Sequencing Era

    OpenAIRE

    Kwon, So Mee; Cho, Hyunwoo; Choi, Ji Hye; Jee, Byul A; Jo, Yuna; Woo, Hyun Goo

    2012-01-01

    The explosive development of genomics technologies including microarrays and next generation sequencing (NGS) has provided comprehensive maps of cancer genomes, including the expression of mRNAs and microRNAs, DNA copy numbers, sequence variations, and epigenetic changes. These genome-wide profiles of the genetic aberrations could reveal the candidates for diagnostic and/or prognostic biomarkers as well as mechanistic insights into tumor development and progression. Recent efforts to establis...

  4. Whole-Genome Sequences of Thirteen Isolates of Borrelia burgdorferi

    Energy Technology Data Exchange (ETDEWEB)

    Schutzer S. E.; Dunn J.; Fraser-Liggett, C. M.; Casjens, S. R.; Qiu, W.-G.; Mongodin, E. F.; Luft, B. J.

    2011-02-01

    Borrelia burgdorferi is a causative agent of Lyme disease in North America and Eurasia. The first complete genome sequence of B. burgdorferi strain 31, available for more than a decade, has assisted research on the pathogenesis of Lyme disease. Because a single genome sequence is not sufficient to understand the relationship between genotypic and geographic variation and disease phenotype, we determined the whole-genome sequences of 13 additional B. burgdorferi isolates that span the range of natural variation. These sequences should allow improved understanding of pathogenesis and provide a foundation for novel detection, diagnosis, and prevention strategies.

  5. Generation of physical map contig-specific sequences useful for whole genome sequence scaffolding.

    Directory of Open Access Journals (Sweden)

    Yanliang Jiang

    Full Text Available Along with the rapid advances of the nextgen sequencing technologies, more and more species are added to the list of organisms whose whole genomes are sequenced. However, the assembled draft genome of many organisms consists of numerous small contigs, due to the short length of the reads generated by nextgen sequencing platforms. In order to improve the assembly and bring the genome contigs together, more genome resources are needed. In this study, we developed a strategy to generate a valuable genome resource, physical map contig-specific sequences, which are randomly distributed genome sequences in each physical contig. A two-dimensional tagging method was used to create specific tags for 1,824 physical contigs, which dramatically reduced the cost. A total of 94,111,841 100-bp reads and 315,277 assembled contigs were identified as containing physical map contig-specific tags. The physical map contig-specific sequences along with the currently available BAC end sequences were then used to anchor the catfish draft genome contigs. A total of 156,457 genome contigs (~79% of the whole genome sequencing assembly) were anchored and grouped into 1,824 pools, in which 16,680 unique genes were annotated. The physical map contig-specific sequences are valuable resources to link the physical map, genetic linkage map and draft whole genome sequences, and consequently have the capability to improve whole genome sequence assembly and scaffolding, and to improve genome-wide comparative analysis as well. The strategy developed in this study could also be adopted in other species whose whole genome assembly is still facing a challenge.

  6. Complete genome sequence and comparative genomic analysis of an emerging human pathogen, serotype V Streptococcus agalactiae

    OpenAIRE

    Tettelin, Hervé; Masignani, Vega; Cieslewicz, Michael J.; Eisen, Jonathan A.; Peterson, Scott; Wessels, Michael R.; Paulsen, Ian T.; Nelson, Karen E.; Margarit, Immaculada; Read, Timothy D.; Madoff, Lawrence C.; Wolf, Alex M.; Beanan, Maureen J; Brinkac, Lauren M.; Sean C Daugherty

    2002-01-01

    The 2,160,267 bp genome sequence of Streptococcus agalactiae, the leading cause of bacterial sepsis, pneumonia, and meningitis in neonates in the U.S. and Europe, is predicted to encode 2,175 genes. Genome comparisons among S. agalactiae, Streptococcus pneumoniae, Streptococcus pyogenes, and the other completely sequenced genomes identified genes specific to the streptococci and to S. agalactiae. These in silico analyses, combined with comparative genome hybridization experiments between the ...

  7. Progress in Understanding and Sequencing the Genome of Brassica rapa

    OpenAIRE

    Hong, Chang Pyo; Kwon, Soo-Jin; Kim, Jung Sun; Yang, Tae-Jin; Park, Beom-Seok; Lim, Yong Pyo

    2008-01-01

    Brassica rapa, which is closely related to Arabidopsis thaliana, is an important crop and a model plant for studying genome evolution via polyploidization. We report the current understanding of the genome structure of B. rapa and efforts for the whole-genome sequencing of the species. The tribe Brassicaceae, which comprises ca. 240 species, descended from a common hexaploid ancestor with a basic genome similar to that of Arabidopsis. Chromosome rearrangements, including fusions and/or fissio...

  8. Complete Genome Sequence of Probiotic Strain Lactobacillus acidophilus La-14.

    Science.gov (United States)

    Stahl, Buffy; Barrangou, Rodolphe

    2013-01-01

    We present the 1,991,830-bp complete genome sequence of Lactobacillus acidophilus strain La-14 (SD-5212). Comparative genomic analysis revealed 99.98% similarity overall to the L. acidophilus NCFM genome. Globally, 111 single nucleotide polymorphisms (SNPs) (95 SNPs, 16 indels) were observed throughout the genome. Also, a 416-bp deletion in the LA14_1146 sugar ABC transporter was identified. PMID:23788546

  9. Complete Genome Sequence of Probiotic Strain Lactobacillus acidophilus La-14

    OpenAIRE

    Stahl, Buffy; Barrangou, Rodolphe

    2013-01-01

    We present the 1,991,830-bp complete genome sequence of Lactobacillus acidophilus strain La-14 (SD-5212). Comparative genomic analysis revealed 99.98% similarity overall to the L. acidophilus NCFM genome. Globally, 111 single nucleotide polymorphisms (SNPs) (95 SNPs, 16 indels) were observed throughout the genome. Also, a 416-bp deletion in the LA14_1146 sugar ABC transporter was identified.

  10. Analysis of the Thermotoga maritima genome combining a variety of sequence similarity and genome context tools

    OpenAIRE

    Kyrpides, Nikos C; Ouzounis, Christos A; Iliopoulos, Ioannis; Vonstein, Veronika; Overbeek, Ross

    2000-01-01

    The proliferation of genome sequence data has led to the development of a number of tools and strategies that facilitate computational analysis. These methods include the identification of motif patterns, membership of the query sequences in family databases, metabolic pathway involvement and gene proximity. We re-examined the completely sequenced genome of Thermotoga maritima by employing the combined use of the above methods. By analyzing all 1877 proteins encoded in this genome, we identif...

  11. Phylogenetic Analysis of Citrus tristeza virus Isolates of Wild Type Citrus in China

    Institute of Scientific and Technical Information of China (English)

    YI Long; ZHOU Chang-yong

    2014-01-01

    The genetic variation and phylogenetic relationships of Citrus tristeza virus (CTV) isolates collected from Chinese wild type citrus were analyzed by comparing the sequences of nine genomic regions (p23, p20, p13, p18, p25, p27, POL, HEL and k17) with the CTV isolates of cultivated citrus from different countries. The results showed that the divergence pattern of genomic RNA of the CTV isolates from wild type citrus was similar to that of other isolates from cultivated citrus: the 3´ proximal region was relatively conserved, and the 5´ proximal region had greater variability. The nine genomic regions of CTV isolates analyzed were found to have been under purifying selection in the evolution process. Phylogenetic analysis showed that the eleven Chinese wild CTV isolates were located at different clades and did not reflect their geographical origins, suggesting genetic diversity among the Chinese wild CTV populations. These results will aid in the understanding of molecular evolution of the Chinese CTV populations.

  12. Using Partial Genomic Fosmid Libraries for Sequencing Complete Organellar Genomes

    Energy Technology Data Exchange (ETDEWEB)

    McNeal, Joel R.; Leebens-Mack, James H.; Arumuganathan, K.; Kuehl, Jennifer V.; Boore, Jeffrey L.; dePamphilis, Claude W.

    2005-08-26

    Organellar genome sequences provide numerous phylogenetic markers and yield insight into organellar function and molecular evolution. These genomes are much smaller in size than their nuclear counterparts; thus, their complete sequencing is much less expensive than total nuclear genome sequencing, making broader phylogenetic sampling feasible. However, for some organisms it is challenging to isolate plastid DNA for sequencing using standard methods. To overcome these difficulties, we constructed partial genomic libraries from total DNA preparations of two heterotrophic and two autotrophic angiosperm species using fosmid vectors. We then used macroarray screening to isolate clones containing large fragments of plastid DNA. A minimum tiling path of clones comprising the entire genome sequence of each plastid was selected, and these clones were shotgun-sequenced and assembled into complete genomes. Although this method worked well for both heterotrophic and autotrophic plants, nuclear genome size had a dramatic effect on the proportion of screened clones containing plastid DNA and, consequently, the overall number of clones that must be screened to ensure full plastid genome coverage. This technique makes it possible to determine complete plastid genome sequences for organisms that defy other available organellar genome sequencing methods, especially those for which limited amounts of tissue are available.

  13. Real-time, portable genome sequencing for Ebola surveillance.

    Science.gov (United States)

    Quick, Joshua; Loman, Nicholas J; Duraffour, Sophie; Simpson, Jared T; Severi, Ettore; Cowley, Lauren; Bore, Joseph Akoi; Koundouno, Raymond; Dudas, Gytis; Mikhail, Amy; Ouédraogo, Nobila; Afrough, Babak; Bah, Amadou; Baum, Jonathan H J; Becker-Ziaja, Beate; Boettcher, Jan Peter; Cabeza-Cabrerizo, Mar; Camino-Sánchez, Álvaro; Carter, Lisa L; Doerrbecker, Juliane; Enkirch, Theresa; García-Dorival, Isabel; Hetzelt, Nicole; Hinzmann, Julia; Holm, Tobias; Kafetzopoulou, Liana Eleni; Koropogui, Michel; Kosgey, Abigael; Kuisma, Eeva; Logue, Christopher H; Mazzarelli, Antonio; Meisel, Sarah; Mertens, Marc; Michel, Janine; Ngabo, Didier; Nitzsche, Katja; Pallasch, Elisa; Patrono, Livia Victoria; Portmann, Jasmine; Repits, Johanna Gabriella; Rickett, Natasha Y; Sachse, Andreas; Singethan, Katrin; Vitoriano, Inês; Yemanaberhan, Rahel L; Zekeng, Elsa G; Racine, Trina; Bello, Alexander; Sall, Amadou Alpha; Faye, Ousmane; Faye, Oumar; Magassouba, N'Faly; Williams, Cecelia V; Amburgey, Victoria; Winona, Linda; Davis, Emily; Gerlach, Jon; Washington, Frank; Monteil, Vanessa; Jourdain, Marine; Bererd, Marion; Camara, Alimou; Somlare, Hermann; Camara, Abdoulaye; Gerard, Marianne; Bado, Guillaume; Baillet, Bernard; Delaune, Déborah; Nebie, Koumpingnin Yacouba; Diarra, Abdoulaye; Savane, Yacouba; Pallawo, Raymond Bernard; Gutierrez, Giovanna Jaramillo; Milhano, Natacha; Roger, Isabelle; Williams, Christopher J; Yattara, Facinet; Lewandowski, Kuiama; Taylor, James; Rachwal, Phillip; Turner, Daniel J; Pollakis, Georgios; Hiscox, Julian A; Matthews, David A; O'Shea, Matthew K; Johnston, Andrew McD; Wilson, Duncan; Hutley, Emma; Smit, Erasmus; Di Caro, Antonino; Wölfel, Roman; Stoecker, Kilian; Fleischmann, Erna; Gabriel, Martin; Weller, Simon A; Koivogui, Lamine; Diallo, Boubacar; Keïta, Sakoba; Rambaut, Andrew; Formenty, Pierre; Günther, Stephan; Carroll, Miles W

    2016-02-11

    The Ebola virus disease epidemic in West Africa is the largest on record, responsible for over 28,599 cases and more than 11,299 deaths. Genome sequencing in viral outbreaks is desirable to characterize the infectious agent and determine its evolutionary rate. Genome sequencing also allows the identification of signatures of host adaptation, identification and monitoring of diagnostic targets, and characterization of responses to vaccines and treatments. The Ebola virus (EBOV) genome substitution rate in the Makona strain has been estimated at between 0.87 × 10^-3 and 1.42 × 10^-3 mutations per site per year. This is equivalent to 16-27 mutations in each genome, meaning that sequences diverge rapidly enough to identify distinct sub-lineages during a prolonged epidemic. Genome sequencing provides a high-resolution view of pathogen evolution and is increasingly sought after for outbreak surveillance. Sequence data may be used to guide control measures, but only if the results are generated quickly enough to inform interventions. Genomic surveillance during the epidemic has been sporadic owing to a lack of local sequencing capacity coupled with practical difficulties transporting samples to remote sequencing facilities. To address this problem, here we devise a genomic surveillance system that utilizes a novel nanopore DNA sequencing instrument. In April 2015 this system was transported in standard airline luggage to Guinea and used for real-time genomic surveillance of the ongoing epidemic. We present sequence data and analysis of 142 EBOV samples collected during the period March to October 2015. We were able to generate results less than 24 h after receiving an Ebola-positive sample, with the sequencing process taking as little as 15-60 min. We show that real-time genomic surveillance is possible in resource-limited settings and can be established rapidly to monitor outbreaks. PMID:26840485
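
    The per-genome figure quoted in this abstract can be checked with a line of arithmetic: a per-site rate multiplied by the genome length gives mutations per genome per year. The sketch below is only an illustration. The genome length of 18,959 nt is an assumption brought in from outside the abstract (it is not stated in the record), and the function name is hypothetical.

```python
# Minimal sketch: convert a per-site substitution rate into mutations per genome per year.
# ASSUMPTION: an EBOV genome length of 18,959 nt; the abstract does not state the length.
GENOME_LENGTH_NT = 18_959


def mutations_per_genome_per_year(rate_per_site_per_year: float,
                                  genome_length_nt: int = GENOME_LENGTH_NT) -> float:
    """Return the expected number of mutations per genome per year."""
    return rate_per_site_per_year * genome_length_nt


# Lower and upper rate estimates quoted in the abstract (mutations per site per year).
low = mutations_per_genome_per_year(0.87e-3)
high = mutations_per_genome_per_year(1.42e-3)
print(f"{low:.1f} to {high:.1f} mutations per genome per year")
```

    Under that assumed length, the two bounds round to the 16-27 range given in the abstract.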

  14. Determining and comparing protein function in Bacterial genome sequences

    DEFF Research Database (Denmark)

    Vesth, Tammi Camilla

    predictions were made in about 60% of the cases. This project has highlighted the difficulties and challenges in functional annotation and computational analysis of sequence data. It has provided possible solutions for creating reproducible pipelines for comparative genomics as well as constructed a number of ... In November 2013, there were around 21,000 different prokaryotic genomes sequenced and publicly available, and the number is growing daily, with another 20,000 or more genomes expected to be sequenced and deposited by the end of 2014. An important part of the analysis of this data is the functional ... known functions. This thesis describes the development of new tools for comparative functional annotation and a system for comparative genomics in general. As novel sequenced genomes are becoming more readily available, there is a need for standard analysis tools. The system CMG-biotools is presented...

  15. Marsupial genome sequences: providing insight into evolution and disease.

    Science.gov (United States)

    Deakin, Janine E

    2012-01-01

    Marsupials (metatherians), with their position in vertebrate phylogeny and their unique biological features, have been studied for many years by a dedicated group of researchers, but it has only been since the sequencing of the first marsupial genome that their value has been more widely recognised. We now have genome sequences for three distantly related marsupial species (the grey short-tailed opossum, the tammar wallaby, and Tasmanian devil), with the promise of many more genomes to be sequenced in the near future, making this a particularly exciting time in marsupial genomics. The emergence of a transmissible cancer, which is obliterating the Tasmanian devil population, has increased the importance of obtaining and analysing marsupial genome sequence for understanding such diseases as well as for conservation efforts. In addition, these genome sequences have facilitated studies aimed at answering questions regarding gene and genome evolution and provided insight into the evolution of epigenetic mechanisms. Here I highlight the major advances in our understanding of evolution and disease, facilitated by marsupial genome projects, and speculate on the future contributions to be made by such sequences. PMID:24278712

  16. The complete mitochondrial genome of the citrus red mite Panonychus citri (Acari: Tetranychidae): high genome rearrangement and extremely truncated tRNAs

    OpenAIRE

    Dou Wei; Wang Bao-Jun; Wei Dan-Dan; Yuan Ming-Long; Wang Jin-Jun

    2010-01-01

    Background: The family Tetranychidae (Chelicerata: Acari) includes ~1200 species, many of which are of agronomic importance. To date, mitochondrial genomes of only two Tetranychidae species have been sequenced, and it has been found that these two mitochondrial genomes are characterized by many unusual features in genome organization and structure such as gene order and nucleotide frequency. The scarcity of available sequence data has greatly impeded evolutionary studies in Acari (mit...

  17. Draft genome sequence of Enterococcus faecium strain LMG 8148.

    Science.gov (United States)

    Michiels, Joran E; Van den Bergh, Bram; Fauvart, Maarten; Michiels, Jan

    2016-01-01

    Enterococcus faecium, traditionally considered a harmless gut commensal, is emerging as an important nosocomial pathogen showing increasing rates of multidrug resistance. We report the draft genome sequence of E. faecium strain LMG 8148, isolated in 1968 from a human in Gothenburg, Sweden. The draft genome has a total length of 2,697,490 bp, a GC-content of 38.3 %, and 2,402 predicted protein-coding sequences. The isolation of this strain predates the emergence of E. faecium as a nosocomial pathogen. Consequently, its genome can be useful in comparative genomic studies investigating the evolution of E. faecium as a pathogen. PMID:27610213

  18. Complete genome sequence of Acidimicrobium ferrooxidans type strain (ICPT)

    Energy Technology Data Exchange (ETDEWEB)

    Clum, Alicia; Nolan, Matt; Lang, Elke; Glavina Del Rio, Tijana; Tice, Hope; Copeland, Alex; Cheng, Jan-Fang; Lucas, Susan; Chen, Feng; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ivanova, Natalia; Mavrommatis, Konstantinos; Mikhailova, Natalia; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Goker, Markus; Spring, Stefan; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia C.; Chain, Patrick; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter; Lapidus, Alla

    2009-05-20

    Acidimicrobium ferrooxidans (Clark and Norris 1996) is the sole and type species of the genus, which until recently was the only genus within the actinobacterial family Acidimicrobiaceae and in the order Acidimicrobiales. Rapid oxidation of iron pyrite during autotrophic growth in the absence of an enhanced CO2 concentration is characteristic for A. ferrooxidans. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of the order Acidimicrobiales, and the 2,158,157 bp long single replicon genome with its 2038 protein coding and 54 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  19. Puzzling sequences: studying microbial genomes from 'Ötzi'

    International Nuclear Information System (INIS)

    Ancient remains, and mummies in particular, are of central value for archaeological research. The Tyrolean iceman “Ötzi” was conserved in a glacier of the Ötztal Alps about 5000 years ago. Aside from morphological and phenotypical classification, the determination of DNA sequences and the subsequent genome analyses have been first applied to mitochondrial DNA and then been extended to genomic DNA. Typically also ancient microbial DNA is sequenced. These sequences allow the identification of pathogens as well as studying the evolution of microorganisms. The talk will explain the metagenomic aspects of the “Ötzi” genome project and discuss the first results. (author)

  20. Genomic treasure troves: complete genome sequencing of herbarium and insect museum specimens.

    Directory of Open Access Journals (Sweden)

    Martijn Staats

    Unlocking the vast genomic diversity stored in natural history collections would create unprecedented opportunities for genome-scale evolutionary, phylogenetic, domestication and population genomic studies. Many researchers have been discouraged from using historical specimens in molecular studies because of both generally limited success of DNA extraction and the challenges associated with PCR-amplifying highly degraded DNA. In today's next-generation sequencing (NGS) world, opportunities and prospects for historical DNA have changed dramatically, as most NGS methods are actually designed for taking short fragmented DNA molecules as templates. Here we show that using a standard multiplex and paired-end Illumina sequencing approach, genome-scale sequence data can be generated reliably from dry-preserved plant, fungal and insect specimens collected up to 115 years ago, and with minimal destructive sampling. Using a reference-based assembly approach, we were able to produce the entire nuclear genome of a 43-year-old Arabidopsis thaliana (Brassicaceae) herbarium specimen with high and uniform sequence coverage. Nuclear genome sequences of three fungal specimens of 22-82 years of age (Agaricus bisporus, Laccaria bicolor, Pleurotus ostreatus) were generated with 81.4-97.9% exome coverage. Complete organellar genome sequences were assembled for all specimens. Using de novo assembly we retrieved between 16.2-71.0% of coding sequence regions, and hence remain somewhat cautious about prospects for de novo genome assembly from historical specimens. Non-target sequence contaminations were observed in 2 of our insect museum specimens. We anticipate that future museum genomics projects will perhaps not generate entire genome sequences in all cases (our specimens contained relatively small and low-complexity genomes), but at least generating vital comparative genomic data for testing (phylogenetic, demographic and genetic) hypotheses, that become increasingly more...

  1. BAC-pool 454-sequencing: A rapid and efficient approach to sequence complex tetraploid cotton genomes

    Science.gov (United States)

    New and emerging next generation sequencing technologies have been promising in reducing sequencing costs, but not significantly for complex polyploid plant genomes such as cotton. The large and highly repetitive genome of G. hirsutum (~2.5 Gb) is less amenable to, and cost-intensive with, traditional BAC-by...

  2. Sequencing of chloroplast genome using whole cellular DNA and Solexa sequencing technology

    Directory of Open Access Journals (Sweden)

    Jian Wu

    2012-11-01

    Sequencing of the chloroplast genome using traditional sequencing methods has been difficult because of its size (>120 kb) and the complicated procedures required to prepare templates. To explore the feasibility of sequencing the chloroplast genome using DNA extracted from whole cells and Solexa sequencing technology, we sequenced whole cellular DNA isolated from leaves of three Brassica rapa accessions with one lane per accession. In total, 246 Mb, 362 Mb and 361 Mb of sequence data were generated for the three accessions Chiifu-401-42, Z16 and FT, respectively. Microreads were assembled by reference-guided assembly using the cpDNA sequences of B. rapa, Arabidopsis thaliana, and Nicotiana tabacum. We achieved coverage of more than 99.96% of the cp genome in the three tested accessions using the B. rapa sequence as the reference. When A. thaliana or N. tabacum sequences were used as references, 99.7–99.8% or 95.5–99.7% of the B. rapa chloroplast genome was covered, respectively. These results demonstrated that sequencing of whole cellular DNA isolated from young leaves using the Illumina Genome Analyzer is an efficient method for high-throughput sequencing of the chloroplast genome.

  3. Genome sequencing and analysis of the model grass Brachypodium distachyon.

    Science.gov (United States)

    2010-02-11

    Three subfamilies of grasses, the Ehrhartoideae, Panicoideae and Pooideae, provide the bulk of human nutrition and are poised to become major sources of renewable energy. Here we describe the genome sequence of the wild grass Brachypodium distachyon (Brachypodium), which is, to our knowledge, the first member of the Pooideae subfamily to be sequenced. Comparison of the Brachypodium, rice and sorghum genomes shows a precise history of genome evolution across a broad diversity of the grasses, and establishes a template for analysis of the large genomes of economically important pooid grasses such as wheat. The high-quality genome sequence, coupled with ease of cultivation and transformation, small size and rapid life cycle, will help Brachypodium reach its potential as an important model system for developing new energy and food crops. PMID:20148030

  4. The Arabidopsis lyrata genome sequence and the basis of rapid genome size change

    Energy Technology Data Exchange (ETDEWEB)

    Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.; Cao, Jun; Cheng, Jan-Fang; Clark, Richard M.; Fahlgren, Noah; Fawcett, Jeffrey A.; Grimwood, Jane; Gundlach, Heidrun; Haberer, Georg; Hollister, Jesse D.; Ossowski, Stephan; Ottilar, Robert P.; Salamov, Asaf A.; Schneeberger, Korbinian; Spannagl, Manuel; Wang, Xi; Yang, Liang; Nasrallah, Mikhail E.; Bergelson, Joy; Carrington, James C.; Gaut, Brandon S.; Schmutz, Jeremy; Mayer, Klaus F. X.; Van de Peer, Yves; Grigoriev, Igor V.; Nordborg, Magnus; Weigel, Detlef; Guo, Ya-Long

    2011-04-29

    In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies, etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspect (centromeres, intergenic regions, transposable elements, gene family number) is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.

  5. Perspectives of integrative cancer genomics in next generation sequencing era.

    Science.gov (United States)

    Kwon, So Mee; Cho, Hyunwoo; Choi, Ji Hye; Jee, Byul A; Jo, Yuna; Woo, Hyun Goo

    2012-06-01

    The explosive development of genomics technologies including microarrays and next generation sequencing (NGS) has provided comprehensive maps of cancer genomes, including the expression of mRNAs and microRNAs, DNA copy numbers, sequence variations, and epigenetic changes. These genome-wide profiles of the genetic aberrations could reveal the candidates for diagnostic and/or prognostic biomarkers as well as mechanistic insights into tumor development and progression. Recent efforts to establish the huge cancer genome compendium and integrative omics analyses, so-called "integromics", have extended our understanding on the cancer genome, showing its daunting complexity and heterogeneity. However, the challenges of the structured integration, sharing, and interpretation of the big omics data still remain to be resolved. Here, we review several issues raised in cancer omics data analysis, including NGS, focusing particularly on the study design and analysis strategies. This might be helpful to understand the current trends and strategies of the rapidly evolving cancer genomics research. PMID:23105932

  6. Analysis of Simple Sequence Repeats in Genomes of Rhizobia

    Institute of Scientific and Technical Information of China (English)

    GAO Ya-mei; HAN Yi-qiang; TANG Hui; SUN Dong-mei; WANG Yan-jie; WANG Wei-dong

    2008-01-01

    Simple sequence repeats (SSRs) or microsatellites, as genetic markers, are ubiquitous in genomes of various organisms. The analysis of SSRs in rhizobia genomes provides useful information for a variety of applications in population genetics of rhizobia. We analyzed the occurrences, relative abundance, and relative density of SSRs, the most common in Bradyrhizobium japonicum, Mesorhizobium loti, and Sinorhizobium meliloti genomes sequenced in the microorganisms tandem repeats database, and SSRs in the three species genomes were compared with each other. The result showed that there were 1,410, 859, and 638 SSRs in B. japonicum, M. loti, and S. meliloti genomes, respectively. In the genomes of B. japonicum, M. loti, and S. meliloti, tetranucleotide, pentanucleotide, and hexanucleotide repeats were more abundant and indicated higher mutation rates in these species. The least abundant was the mononucleotide repeat. The SSR type and distribution were similar among these species.
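
    As a rough illustration of the kind of SSR counting this abstract describes, the sketch below scans a DNA string for perfect tandem repeats with motif lengths of 1 to 6 nt. It is not the method of the cited study, which drew on a tandem repeats database. The minimum repeat thresholds and the function names are assumptions chosen only for the example.

```python
import re
from collections import Counter

# ASSUMPTION: minimum number of tandem copies per motif length (1-6 nt).
# The abstract does not state the thresholds used in the study.
MIN_REPEATS = {1: 10, 2: 6, 3: 4, 4: 3, 5: 3, 6: 3}


def find_ssrs(seq: str) -> list[tuple[int, str, int]]:
    """Return (start, motif, copies) for perfect SSRs, sorted by start position."""
    seq = seq.upper()
    hits = []
    for k, min_copies in MIN_REPEATS.items():
        # A k-nt motif followed by at least (min_copies - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_copies - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are just a shorter unit repeated (e.g. "AGAG" = "AG" x 2).
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // k))
    return sorted(hits)


def count_by_motif_length(hits: list[tuple[int, str, int]]) -> Counter:
    """Tally SSRs by motif length (1 = mononucleotide, ..., 6 = hexanucleotide)."""
    return Counter(len(motif) for _, motif, _ in hits)


example = "GG" + "ACGT" * 4 + "TTC" + "AG" * 7 + "C"
ssrs = find_ssrs(example)
print(ssrs)
print(count_by_motif_length(ssrs))
```

    Relative abundance and density, as named in the abstract, would then follow by dividing such counts (or total SSR length) by the genome size; the exact definitions used by the authors are not given in this record.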

  7. Draft Genome Sequences of Gammaproteobacterial Methanotrophs Isolated from Marine Ecosystems.

    Science.gov (United States)

    Flynn, James D; Hirayama, Hisako; Sakai, Yasuyoshi; Dunfield, Peter F; Klotz, Martin G; Knief, Claudia; Op den Camp, Huub J M; Jetten, Mike S M; Khmelenina, Valentina N; Trotsenko, Yuri A; Murrell, J Colin; Semrau, Jeremy D; Svenning, Mette M; Stein, Lisa Y; Kyrpides, Nikos; Shapiro, Nicole; Woyke, Tanja; Bringel, Françoise; Vuilleumier, Stéphane; DiSpirito, Alan A; Kalyuzhnaya, Marina G

    2016-01-01

    The genome sequences of Methylobacter marinus A45, Methylobacter sp. strain BBA5.1, and Methylomarinum vadi IT-4 were obtained. These aerobic methanotrophs are typical members of coastal and hydrothermal vent marine ecosystems. PMID:26798114

  8. Draft Genome Sequences of Gammaproteobacterial Methanotrophs Isolated from Marine Ecosystems

    OpenAIRE

    Flynn, James D.; Hirayama, Hisako; Sakai, Yasuyoshi; Dunfield, Peter F.; Klotz, Martin G.; Knief, Claudia; Op Den Camp, Huub J M; Jetten, Mike S. M.; Khmelenina, Valentina N; Trotsenko, Yuri A.; Murrell, J. Colin; Semrau, Jeremy D.; Svenning, Mette M.; Stein, Lisa Y.; Kyrpides, Nikos

    2016-01-01

    The genome sequences of Methylobacter marinus A45, Methylobacter sp. strain BBA5.1, and Methylomarinum vadi IT-4 were obtained. These aerobic methanotrophs are typical members of coastal and hydrothermal vent marine ecosystems.

  9. Draft Genome Sequences of Gammaproteobacterial Methanotrophs Isolated from Marine Ecosystems

    Science.gov (United States)

    Flynn, James D.; Hirayama, Hisako; Sakai, Yasuyoshi; Dunfield, Peter F.; Knief, Claudia; Op den Camp, Huub J. M.; Jetten, Mike S. M.; Khmelenina, Valentina N.; Trotsenko, Yuri A.; Murrell, J. Colin; Semrau, Jeremy D.; Svenning, Mette M.; Stein, Lisa Y.; Kyrpides, Nikos; Shapiro, Nicole; Woyke, Tanja; Bringel, Françoise; Vuilleumier, Stéphane; DiSpirito, Alan A.

    2016-01-01

    The genome sequences of Methylobacter marinus A45, Methylobacter sp. strain BBA5.1, and Methylomarinum vadi IT-4 were obtained. These aerobic methanotrophs are typical members of coastal and hydrothermal vent marine ecosystems. PMID:26798114

  10. Draft Genome Sequence of Paecilomyces hepiali, Isolated from Cordyceps sinensis.

    Science.gov (United States)

    Yu, Yi; Wang, Wenting; Wang, Linping; Pang, Fang; Guo, Lanping; Song, Lai; Liu, Guiming; Feng, Chengqiang

    2016-01-01

    Paecilomyces hepiali is an endoparasitic fungus that commonly exists in the natural Cordyceps sinensis. Here, we report the draft genome sequence of P. hepiali, which will facilitate the exploitation of medicinal compounds produced by the fungus. PMID:27389266

  11. Draft Genome Sequence of Paecilomyces hepiali, Isolated from Cordyceps sinensis

    Science.gov (United States)

    Yu, Yi; Wang, Wenting; Wang, Linping; Pang, Fang; Guo, Lanping; Song, Lai

    2016-01-01

    Paecilomyces hepiali is an endoparasitic fungus that commonly exists in the natural Cordyceps sinensis. Here, we report the draft genome sequence of P. hepiali, which will facilitate the exploitation of medicinal compounds produced by the fungus. PMID:27389266

  12. First Draft Genome Sequence of a Mycobacterium gordonae Clinical Isolate

    Science.gov (United States)

    Smirnova, T.; Blagodatskikh, K.; Varlamov, D.; Sochivko, D.; Larionova, E.; Andreevskaya, S.; Andrievskaya, I.; Chernousova, L.

    2016-01-01

    Here, we report the first draft genome sequence of the clinically relevant species Mycobacterium gordonae. The clinical isolate Mycobacterium gordonae 14-8773 was obtained from the sputum of a patient with mycobacteriosis. PMID:27365356

  13. Genome Sequence of Bacillus thuringiensis subsp. kurstaki Strain HD-1

    OpenAIRE

    Day, Michael; Ibrahim, Mohamed; Dyer, David; Bulla, Lee

    2014-01-01

    We report here the complete genome sequence of Bacillus thuringiensis subsp. kurstaki strain HD-1, which serves as the primary U.S. reference standard for all commercial insecticidal formulations of B. thuringiensis manufactured around the world.

  14. Bacterial epidemiology and biology - lessons from genome sequencing.

    OpenAIRE

    Parkhill, J.; Wren, BW

    2011-01-01

    Next-generation sequencing has ushered in a new era of microbial genomics, enabling the detailed historical and geographical tracing of bacteria. This is helping to shape our understanding of bacterial evolution.

  15. Seeing chordate evolution through the Ciona genome sequence

    OpenAIRE

    Cañestro, Cristian; Bassham, Susan; Postlethwait, John H.

    2003-01-01

    A draft sequence of the compact genome of the sea squirt Ciona intestinalis, a non-vertebrate chordate that diverged very early from other chordates, including vertebrates, illuminates how chordates originated and how vertebrate developmental innovations evolved.

  16. Complete Genome Sequence of Rahnella aquatilis CIP 78.65

    Energy Technology Data Exchange (ETDEWEB)

    Martinez, Robert J [University of Alabama, Tuscaloosa; Bruce, David [Los Alamos National Laboratory (LANL); Detter, J C [U.S. Department of Energy, Joint Genome Institute; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Han, James [U.S. Department of Energy, Joint Genome Institute; Han, Cliff [Los Alamos National Laboratory (LANL); Held, Brittany [Los Alamos National Laboratory (LANL); Land, Miriam L [ORNL; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Pennacchio, Len [U.S. Department of Energy, Joint Genome Institute; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Tapia, Roxanne [Los Alamos National Laboratory (LANL); Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Sobecky, Patricia A. [University of Alabama, Tuscaloosa

    2012-01-01

    Rahnella aquatilis CIP 78.65 is a gammaproteobacterium isolated from a drinking water source in Lille, France. Here we report the complete genome sequence of Rahnella aquatilis CIP 78.65, the type strain of R. aquatilis.

  17. Cancer Genome Sequencing and Its Implications for Personalized Cancer Vaccines

    International Nuclear Information System (INIS)

    New DNA sequencing platforms have revolutionized human genome sequencing. The dramatic advances in genome sequencing technologies predict that the $1,000 genome will become a reality within the next few years. Applied to cancer, the availability of cancer genome sequences permits real-time decision-making with the potential to affect diagnosis, prognosis, and treatment, and has opened the door towards personalized medicine. A promising strategy is the identification of mutated tumor antigens, and the design of personalized cancer vaccines. Supporting this notion are preliminary analyses of the epitope landscape in breast cancer suggesting that individual tumors express significant numbers of novel antigens to the immune system that can be specifically targeted through cancer vaccines.

  18. Brucella abortus S19 genome sequenced, points toward virulence genes

    OpenAIRE

    Whyte, Barry James

    2008-01-01

    Researchers at the Virginia Bioinformatics Institute at Virginia Tech; the National Animal Disease Center in Ames, Iowa; and collaborators at 454 Life Sciences, Branford, Conn., have sequenced the genome of Brucella abortus strain S19.

  19. Complete Genome Sequence of Mycobacterium phlei Type Strain RIVM601174

    KAUST Repository

    Abdallah, A. M.

    2012-05-24

    Mycobacterium phlei is a rapidly growing nontuberculous Mycobacterium species that is typically nonpathogenic, with few reported cases of human disease. Here we report the whole genome sequence of M. phlei type strain RIVM601174.

  20. Complete Genome Sequences of Six Strains of the Genus Methylobacterium

    Energy Technology Data Exchange (ETDEWEB)

    Marx, Christopher J [Harvard University; Bringel, Francoise O. [University of Strasbourg; Christoserdova, Ludmila [University of Washington, Seattle; Moulin, Lionel [UMR, France; Ul Haque, Muhammad Farhan [University of Strasbourg; Fleischman, Darrell E. [Wright State University, Dayton, OH; Gruffaz, Christelle [CNRS, Strasbourg, France; Jourand, Philippe [UMR, France; Knief, Claudia [ETH Zurich, Switzerland; Lee, Ming-Chun [Harvard University; Muller, Emilie E. L. [CNRS, Strasbourg, France; Nadalig, Thierry [CNRS, Strasbourg, France; Peyraud, Remi [ETH Zurich, Switzerland; Roselli, Sandro [CNRS, Strasbourg, France; Russ, Lina [ETH Zurich, Switzerland; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Ivanov, Pavel S. [University of Wyoming, Laramie; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Lajus, Aurelie [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Land, Miriam L [ORNL; Medigue, Claudine [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Stolyar, Sergey [University of Washington; Vorholt, Julia A. [ETH Zurich, Switzerland; Vuilleumier, Stephane [University of Strasbourg

    2012-01-01

    The complete and assembled genome sequences were determined for six strains of the alphaproteobacterial genus Methylobacterium, chosen for their key adaptations to different plant-associated niches and environmental constraints.

  1. Complete genome sequences of six strains of the genus methylobacterium

    Energy Technology Data Exchange (ETDEWEB)

    Marx, Christopher J [Harvard University; Bringel, Francoise O. [University of Strasbourg; Christoserdova, Ludmila [University of Washington, Seattle; Moulin, Lionel [UMR, France; Farhan Ul Haque, Muhammad [CNRS, Strasbourg, France; Fleischman, Darrell E. [Wright State University, Dayton, OH; Gruffaz, Christelle [CNRS, Strasbourg, France; Jourand, Philippe [UMR, France; Knief, Claudia [ETH Zurich, Switzerland; Lee, Ming-Chun [Harvard University; Muller, Emilie E. L. [CNRS, Strasbourg, France; Nadalig, Thierry [CNRS, Strasbourg, France; Peyraud, Remi [ETH Zurich, Switzerland; Roselli, Sandro [CNRS, Strasbourg, France; Russ, Lina [ETH Zurich, Switzerland; Aguero, Fernan [Universidad Nacional de General San Martin; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Lajus, Aurelie [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Land, Miriam L [ORNL; Medigue, Claudine [Genoscope/Centre National de la Recherche Scientifique-Unite Mixte de Recherche; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute; Stolyar, Sergey [University of Washington; Vorholt, Julia A. [ETH Zurich, Switzerland; Vuilleumier, Stephane [University of Strasbourg

    2012-01-01

    The complete and assembled genome sequences were determined for six strains of the alphaproteobacterial genus Methylobacterium, chosen for their key adaptations to different plant-associated niches and environmental constraints.

  2. Sequencing of Wheat Chromosome 6B: Toward Functional Genomics

    Czech Academy of Sciences Publication Activity Database

    Tanaka, T.; Kobayashi, F.; Joshi, G.P.; Onuki, R.; Šimková, Hana; Nasuda, S.; Doležel, Jaroslav; Ogihara, Y.; Itoh, T.; Handa, H.

    Publisher: Springer, 2015 - (Handa, H.), pp. 111-116. ISBN 978-4-431-55674-9. Institutional support: RVO:61389030. Keywords: Chromosome 6B * Genome sequencing * Marker construction. Subject RIV: EB - Genetics; Molecular Biology

  3. Sequence analysis of the complete mitochondrial genome of Youxian sheldrake.

    Science.gov (United States)

    He, Shao-Ping; Liu, Li-Li; Yu, Qi-Fang; Li, Si; He, Jian-Hua

    2016-01-01

    The Youxian sheldrake is an excellent native breed of Hunan province, China. The complete mitochondrial (mt) genome sequence plays an important role in the accurate determination of phylogenetic relationships among metazoans. This is the first study to determine the complete mitochondrial genome sequence of the Youxian sheldrake using PCR-based amplification and Sanger sequencing. The characteristics of the entire mitochondrial genome were analyzed in detail: the total length of the mitogenome is 16,605 bp, with a base composition of 29.21% A, 22.18% T, 32.84% C and 15.77% G. It contains 2 ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and a major non-coding control region (D-loop region). The complete mitochondrial genome sequence of the Youxian sheldrake provides important data for further study of the phylogenetics of poultry, and useful data for genetics and breeding. PMID:25090395
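
    As a quick arithmetic check on the base composition quoted in this record, the four percentages should sum to 100% and the G+C content follows directly. The following is only a minimal Python sketch; the variable names are illustrative and not taken from the study:

```python
# Base composition reported for the Youxian sheldrake mitogenome (percent).
composition = {"A": 29.21, "T": 22.18, "C": 32.84, "G": 15.77}

# Round to two decimals to absorb floating-point noise.
total = round(sum(composition.values()), 2)
gc_content = round(composition["G"] + composition["C"], 2)
at_content = round(composition["A"] + composition["T"], 2)

print(total, gc_content, at_content)
```

    Under these figures the mitogenome is slightly A+T-rich.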

  4. Complete genome sequence of Treponema pallidum strain DAL-1

    Science.gov (United States)

    Zobaníková, Marie; Mikolka, Pavol; Čejková, Darina; Pospíšilová, Petra; Chen, Lei; Strouhal, Michal; Qin, Xiang; Weinstock, George M.; Šmajs, David

    2012-01-01

    Treponema pallidum strain DAL-1 is a human uncultivable pathogen causing the sexually transmitted disease syphilis. Strain DAL-1 was isolated from the amniotic fluid of a pregnant woman in the secondary stage of syphilis. Here we describe the 1,139,971 bp long genome of T. pallidum strain DAL-1 which was sequenced using two independent sequencing methods (454 pyrosequencing and Illumina). In rabbits, strain DAL-1 replicated better than the T. pallidum strain Nichols. The comparison of the complete DAL-1 genome sequence with the Nichols sequence revealed a list of genetic differences that are potentially responsible for the increased rabbit virulence of the DAL-1 strain. PMID:23449808

  5. Complete genome sequencing and comparative genomic analysis of functionally diverse Lysinibacillus sphaericus III(3)7.

    Science.gov (United States)

    Rey, Andrés; Silva-Quintero, Laura; Dussán, Jenny

    2016-09-01

    Lysinibacillus sphaericus III(3)7 is a native Colombian strain, the first one isolated from soil samples. This strain has shown high levels of pathogenic activity against Culex quinquefasciatus larvae in laboratory assays compared to other members of the same species. Using Pacific Biosciences sequencing technology we sequenced, annotated (de novo) and described the genome of strain III(3)7, achieving a complete genome sequence status. We then performed a comparative analysis between the newly sequenced genome and the ones previously reported for Colombian isolates L. sphaericus OT4b.31, CBAM5 and OT4b.25, with the inclusion of L. sphaericus C3-41, which has been used as a reference genome for most previous genome sequencing projects. We concluded that L. sphaericus III(3)7 is highly similar to strain OT4b.25 and shares high levels of synteny with isolates CBAM5 and C3-41. PMID:27419068

  6. Analysis of the bread wheat genome using whole-genome shotgun sequencing

    OpenAIRE

    Brenchley, Rachel; Spannagl, Manuel; Pfeifer, Matthias; Barker, Gary L. A.; D'Amore, Rosalinda; Allen, Alexandra M.; McKenzie, Neil; Kramer, Melissa

    2012-01-01

    Summary: Bread wheat (Triticum aestivum) is a globally important crop, accounting for 20% of the calories consumed by mankind. We sequenced its large and challenging 17 Gb hexaploid genome using 454 pyrosequencing and compared this with the sequences of diploid ancestral and progenitor genomes. Between 94,000 and 96,000 genes were identified, and two-thirds were assigned to the A, B and D genomes. High-resolution synteny maps identified many small disruptions to conserved gene order. We show the h...

  7. Intra-species sequence comparisons for annotating genomes

    Energy Technology Data Exchange (ETDEWEB)

    Boffelli, Dario; Weer, Claire V.; Weng, Li; Lewis, Keith D.; Shoukry, Malak I.; Pachter, Lior; Keys, David N.; Rubin, Edward M.

    2004-07-15

    Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intra-species sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents and a set of genomic intervals was amplified, resequenced and analyzed to determine the mutation rate at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C. intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom, and raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species. The sequence data from this study have been submitted to GenBank under accession nos. AY667278-AY667407.

  8. Whole genome sequencing in clinical and public health microbiology

    OpenAIRE

    Kwong, J. C.; McCallum, N; Sintchenko, V.; Howden, B. P.

    2015-01-01

    Summary: Genomics and whole genome sequencing (WGS) have the capacity to greatly enhance knowledge and understanding of infectious diseases and clinical microbiology. The growth and availability of bench-top WGS analysers has made genomics feasible in clinical and public health microbiology. Given current resource and infrastructure limitations, WGS is most applicable to use in public health laboratories, reference laboratories, and hospital infection control-affiliated laborat...

  9. Genome sequence and comparative analysis of Avibacterium paragallinarum

    OpenAIRE

    Requena, David; Chumbe, Ana; Torres, Michael; Alzamora, Ofelia; Ramirez, Manuel; Valdivia-Olarte, Hugo; Gutierrez, Andres Hazaet; Izquierdo-Lara, Ray; Saravia, Luis Enrique; Zavaleta, Milagros; Tataje-Lavanda, Luis; Best, Ivan; Fernández-Sánchez, Manolo; Icochea, Eliana; Zimic, Mirko

    2013-01-01

    Background: Avibacterium paragallinarum is the causative agent of infectious coryza, a highly contagious acute respiratory disease of poultry that affects commercial chickens, laying hens and broilers worldwide. Methodology: In this study, we performed the whole genome sequencing, assembly and annotation of a Peruvian isolate of A. paragallinarum. The genome was sequenced on a 454 GS FLX Titanium system. De novo assembly was performed and annotation was completed with GS De Novo Assembler 2.6 ...

  10. Mapping Challenging Mutations by Whole-Genome Sequencing

    OpenAIRE

    Smith, Harold E.; Fabritius, Amy S.; Jaramillo-Lambert, Aimee; Golden, Andy

    2016-01-01

    Whole-genome sequencing provides a rapid and powerful method for identifying mutations on a global scale, and has spurred a renewed enthusiasm for classical genetic screens in model organisms. The most commonly characterized category of mutation consists of monogenic, recessive traits, due to their genetic tractability. Therefore, most of the mapping methods for mutation identification by whole-genome sequencing are directed toward alleles that fulfill those criteria (i.e., single-gene, homoz...

  11. Whole genome and transcriptome sequencing of a B3 thymoma.

    Directory of Open Access Journals (Sweden)

    Iacopo Petrini

    Molecular pathology of thymomas is poorly understood. Genomic aberrations are frequently identified in tumors but no extensive sequencing has been reported in thymomas. Here we present the first comprehensive view of a B3 thymoma at whole genome and transcriptome levels. A 55-year-old Caucasian female underwent complete resection of a stage IVA B3 thymoma. RNA and DNA were extracted from a snap frozen tumor sample with a fraction of cancer cells over 80%. We performed array comparative genomic hybridization using Agilent platform, transcriptome sequencing using HiSeq 2000 (Illumina) and whole genome sequencing using Complete Genomics Inc platform. Whole genome sequencing determined, in tumor and normal, the sequence of both alleles in more than 95% of the reference genome (NCBI Build 37). Copy number (CN) aberrations were comparable with those previously described for B3 thymomas, with CN gain of chromosome 1q, 5, 7 and X and CN loss of 3p, 6, 11q42.2-qter and q13. One translocation t(11;X) was identified by whole genome sequencing and confirmed by PCR and Sanger sequencing. Ten single nucleotide variations (SNVs) and 2 insertion/deletions (INDELs) were identified; these mutations resulted in non-synonymous amino acid changes or affected splicing sites. The lack of common cancer-associated mutations in this patient suggests that thymomas may evolve through mechanisms distinctive from other tumor types, and supports the rationale for additional high-throughput sequencing screens to better understand the somatic genetic architecture of thymoma.

  12. Genome Sequence of the Biocontrol Strain Pseudomonas fluorescens F113

    OpenAIRE

    Redondo-Nieto, M.; Barret, M.; Morrisey, J.; Germaine, K.; Martínez-Granero, F.; Barahona, E.; Navazo, A.; Sánchez-Contreras, M.; Moynihan, J.; Giddens, S.; Coppoolse, E.; Muriel, C.; Stiekema, W.; Rainey, P.; Dowling, D.

    2012-01-01

    Pseudomonas fluorescens F113 is a plant growth-promoting rhizobacterium (PGPR) that has biocontrol activity against fungal plant pathogens and is a model for rhizosphere colonization. Here, we present its complete genome sequence, which shows that besides a core genome very similar to those of other strains sequenced within this species, F113 possesses a wide array of genes encoding specialized functions for thriving in the rhizosphere and interacting with eukaryotic organisms.

  13. Draft genome sequence of Thermincola potens strain JR

    Energy Technology Data Exchange (ETDEWEB)

    Byrne-Bailey, K.G.; Wrighton, K.C.; Melnyk, R.A.; Agbo, P.; Hazen, T.C.; Coates, J.D.

    2010-07-01

    'Thermincola potens' strain JR is one of the first Gram-positive dissimilatory metal-reducing bacteria (DMRB) for which there is a complete genome sequence. Consistent with the physiology of this organism, preliminary annotation revealed an abundance of multiheme c-type cytochromes that are putatively associated with the periplasm and cell surface in a Gram-positive bacterium. Here we report the complete genome sequence of strain JR.

  14. Genome Sequence of Pantoea agglomerans Strain IG1

    OpenAIRE

    Matsuzawa, Tomohiko; Mori, Kazuki; Kadowaki, Takeshi; Shimada, Misato; Tashiro, Kosuke; Kuhara, Satoru; Inagawa, Hiroyuki; Soma, Gen-Ichiro; Takegawa, Kaoru

    2012-01-01

    Pantoea agglomerans is a Gram-negative bacterium that grows symbiotically with various plants. Here we report the 4.8-Mb genome sequence of P. agglomerans strain IG1. The lipopolysaccharides derived from P. agglomerans IG1 have been shown to be effective in the prevention of various diseases, such as bacterial or viral infections and lifestyle-related diseases. This genome sequence represents a substantial step toward the elucidation of pathways for production of lipopolysaccharides.

  15. Complete Genome Sequence of Pseudomonas aeruginosa Phage AAT-1.

    Science.gov (United States)

    Andrade-Domínguez, Andrés; Kolter, Roberto

    2016-01-01

    Aspects of the interaction between phages and animals are of interest and importance for medical applications. Here, we report the genome sequence of the lytic Pseudomonas phage AAT-1, isolated from mammalian serum. AAT-1 is a double-stranded DNA phage, with a genome of 57,599 bp, containing 76 predicted open reading frames. PMID:27563032

  16. Draft Genome Sequence of Avibacterium paragallinarum Strain 221

    OpenAIRE

    Xu, Fuzhou; Miao, Deyuan; Du, Yu; Chen, Xiaoling; Zhang, Peijun; Sun, Huiling

    2013-01-01

    Avibacterium paragallinarum is the causative agent of infectious coryza. Here we report the draft genome sequence of reference strain 221 of A. paragallinarum serovar A. The genome is composed of 135 contigs for 2,685,568 bp with a 41% G+C content.

  17. Draft Genome Sequence of Amycolatopsis decaplanina Strain DSM 44594T

    OpenAIRE

    Kaur, Navjot; Kumar, Shailesh; Bala, Monu; Raghava, Gajendra Pal Singh; Mayilraj, Shanmugam

    2013-01-01

    We report the 8.5-Mb genome sequence of Amycolatopsis decaplanina strain DSM 44594T, isolated from a soil sample from India. The draft genome of strain DSM 44594T consists of 8,533,276 bp with a 68.6% G+C content, 7,899 protein-coding genes, and 57 RNAs.

  18. Complete genome sequence of Aeromonas hydrophila AL06-06

    Science.gov (United States)

    Aeromonas hydrophila occurs in freshwater environments and infects fish and mammals. In this work, we report the complete genome sequence of Aeromonas hydrophila AL06-06, which was isolated from diseased goldfish and is being used for comparative genomic studies with A. hydrophila strains causing ba...

  19. A snapshot of the emerging tomato genome sequence

    NARCIS (Netherlands)

    Mueller, L.A.; Klein Lankhorst, R.M.; Tanksley, S.D.; Peters, R.M.; Staveren, van M.J.; Datema, E.; Fiers, M.W.E.J.; Ham, van R.C.H.J.; Szinay, D.; Jong, de J.H.S.G.M.

    2009-01-01

    The genome of tomato (Solanum lycopersicum L.) is being sequenced by an international consortium of 10 countries (Korea, China, the United Kingdom, India, the Netherlands, France, Japan, Spain, Italy, and the United States) as part of the larger “International Solanaceae Genome Project (SOL): System

  20. Draft Genome Sequence of Rhodococcus sp. Strain 311R

    Science.gov (United States)

    Ehsani, Elham; Jauregui, Ruy; Geffers, Robert; Jareck, Michael; Boon, Nico; Pieper, Dietmar H.

    2015-01-01

    Here, we report the draft genome sequence of Rhodococcus sp. strain 311R, which was isolated from a site contaminated with alkanes and aromatic compounds. Strain 311R shares 90% of the genome of Rhodococcus erythropolis SK121, which is the most closely related bacterium. PMID:25999565

  1. Whole-Genome Sequences of Three Symbiotic Endozoicomonas Bacteria

    KAUST Repository

    Neave, Matthew J.

    2014-08-14

    Members of the genus Endozoicomonas associate with a wide range of marine organisms. Here, we report on the whole-genome sequencing, assembly, and annotation of three Endozoicomonas type strains. These data will assist in exploring interactions between Endozoicomonas organisms and their hosts, and it will aid in the assembly of genomes from uncultivated Endozoicomonas spp.

  2. Complete Genome Sequence of Pediococcus pentosaceus Strain SL4

    DEFF Research Database (Denmark)

    Dantoft, Shruti Harnal; Bielak, Eliza Maria; Seo, Jae-Gu;

    2013-01-01

    Pediococcus pentosaceus SL4 was isolated from a Korean fermented vegetable product, kimchi. We report here the whole-genome sequence (WGS) of P. pentosaceus SL4. The genome consists of a 1.79-Mb circular chromosome (G+C content of 37.3%) and seven distinct plasmids ranging in size from 4 kb to 50...

  3. Genome Sequence of Chinese Porcine Parvovirus Strain PPV2010

    OpenAIRE

    Cui, Jin; Wang, Xin; Ren, Yudong; Cui, Shangjin; Li, Guangxing; Ren, Xiaofeng

    2012-01-01

    Porcine parvovirus (PPV) isolate PPV2010 has recently emerged in China. Herein, we analyze the complete genome sequence of PPV2010. Our results indicate that the genome of PPV2010 bears mixed characteristics of virulent PPV and vaccine strains. Importantly, PPV2010 has the potential to be a naturally attenuated candidate vaccine strain.

  4. The tomato genome sequence provides insight into fleshy fruit evolution

    Science.gov (United States)

    The genome of the inbred tomato cultivar ‘Heinz 1706’ was sequenced and assembled using a combination of Sanger and “next generation” technologies. The predicted genome size is ~900 Mb, consistent with prior estimates, of which 760 Mb were assembled in 91 scaffolds aligned to the 12 tomato chromosom...

  5. Genome sequence of the cultivated cotton Gossypium arboreum

    Science.gov (United States)

    Cotton is one of the most economically important natural fiber crops in the world, and the complex tetraploid nature of its genome (AADD, 2n = 52) makes genetic, genomic and functional analyses extremely challenging. Here we sequenced and assembled 98.3% of the 1.7-gigabase G. arboreum (AA, 2n = 26...

  6. Complete Genome Sequence of Bacillus thuringiensis Bacteriophage Smudge

    Science.gov (United States)

    Cornell, Jessica L.; Breslin, Eileen; Schuhmacher, Zachary; Himelright, Madison; Berluti, Cassandra; Boyd, Charles; Carson, Rachel; Del Gallo, Elle; Giessler, Caris; Gilliam, Benjamin; Heatherly, Catherine; Nevin, Julius; Nguyen, Bryan; Nguyen, Justin; Parada, Jocelyn; Sutterfield, Blake; Tukruni, Muruj

    2016-01-01

    Smudge, a bacteriophage enriched from soil using Bacillus thuringiensis DSM-350 as the host, had its complete genome sequenced. Smudge is a myovirus with a genome consisting of 292 genes and was identified as belonging to the C1 cluster of Bacillus phages. PMID:27540049

  7. Complete Genome Sequence of Mycobacterium bovis Strain BCG-1 (Russia).

    Science.gov (United States)

    Sotnikova, Evgeniya A; Shitikov, Egor A; Malakhova, Maja V; Kostryukova, Elena S; Ilina, Elena N; Atrasheuskaya, Alena V; Ignatyev, Georgy M; Vinokurova, Nataliya V; Gorbachyov, Vyacheslav Y

    2016-01-01

    Mycobacterium bovis BCG (Bacille Calmette-Guérin) is a vaccine strain used for protection against tuberculosis. Here, we announce the complete genome sequence of M. bovis strain BCG-1 (Russia). Extensive use of this strain necessitates the study of its genome stability by comparative analysis. PMID:27034492

  8. Complete Chloroplast Genome Sequence of Phagomixotrophic Green Alga Cymbomonas tetramitiformis

    Science.gov (United States)

    Paasch, Amber E.; Graham, Linda E.; Kim, Eunsoo

    2016-01-01

    We report here the complete chloroplast genome sequence of Cymbomonas tetramitiformis strain PLY262, which is a prasinophycean green alga that retains a phagomixotrophic mode of nutrition. The genome is 84,524 bp in length, with a G+C content of 37%, and contains 3 rRNAs, 26 tRNAs, and 76 protein-coding genes. PMID:27313295

  9. Complete genome sequence of Campylobacter gracilis ATCC 33236T

    Science.gov (United States)

    The human oral pathogen Campylobacter gracilis has been isolated from periodontal and endodontal infections, and also from non-oral head, neck or lung infections. This study describes the whole-genome sequence of the human periodontal isolate ATCC 33236T (=FDC 1084), which is the first closed genome...

  10. Draft genome sequence of the sexually transmitted pathogen Trichomonas vaginalis

    NARCIS (Netherlands)

    Carlton, Jane M.; Hirt, Robert P.; Silva, Joana C.; Delcher, Arthur L.; Schatz, Michael; Zhao, Qi; Wortman, Jennifer R.; Bidwell, Shelby L.; Alsmark, U. Cecilia M.; Besteiro, Sebastien; Sicheritz-Ponten, Thomas; Noel, Christophe J.; Dacks, Joel B.; Foster, Peter G.; Simillion, Cedric; Van de Peer, Yves; Miranda-Saavedra, Diego; Barton, Geoffrey J.; Westrop, Gareth D.; Mueller, Sylke; Dessi, Daniele; Fiori, Pier Luigi; Ren, Qinghu; Paulsen, Ian; Zhang, Hanbang; Bastida-Corcuera, Felix D.; Simoes-Barbosa, Augusto; Brown, Mark T.; Hayes, Richard D.; Mukherjee, Mandira; Okumura, Cheryl Y.; Schneider, Rachel; Smith, Alias J.; Vanacova, Stepanka; Villalvazo, Maria; Haas, Brian J.; Pertea, Mihaela; Feldblyum, Tamara V.; Utterback, Terry R.; Shu, Chung-Li; Osoegawa, Kazutoyo; de Jong, Pieter J.; Hrdy, Ivan; Horvathova, Lenka; Zubacova, Zuzana; Dolezal, Pavel; Malik, Shehre-Banoo; Logsdon, John M.; Henze, Katrin; Gupta, Arti; Wang, Ching C.; Dunne, Rebecca L.; Upcroft, Jacqueline A.; Upcroft, Peter; White, Owen; Salzberg, Steven L.; Tang, Petrus; Chiu, Cheng-Hsun; Lee, Ying-Shiung; Embley, T. Martin; Coombs, Graham H.; Mottram, Jeremy C.; Tachezy, Jan; Fraser-Liggett, Claire M.; Johnson, Patricia J.

    2007-01-01

    We describe the genome sequence of the protist Trichomonas vaginalis, a sexually transmitted human pathogen. Repeats and transposable elements comprise about two-thirds of the ~160-megabase genome, reflecting a recent massive expansion of genetic material. This expansion, in conjunction wi

  11. Complete Genome Sequence of Bacillus thuringiensis Strain 407 Cry-

    OpenAIRE

    Poehlein, Anja; Liesegang, Heiko

    2013-01-01

    Bacillus thuringiensis is an insect pathogen that has been used widely as a biopesticide. Here, we report the genome sequence of strain 407 Cry-, which is used to study the genetic determinants of pathogenicity. The genome consists of a 5.5-Mb chromosome and nine plasmids, including a novel 502-kb megaplasmid.

  12. Complete Genome Sequence of Cyanobacterial Siphovirus KBS2A.

    Science.gov (United States)

    Ponsero, Alise J; Chen, Feng; Lennon, Jay T; Wilhelm, Steven W

    2013-01-01

    We present the genome of a cyanosiphovirus (KBS2A) that infects a marine Synechococcus sp. (strain WH7803). Unique to this genome, relative to other sequenced cyanosiphoviruses, is the absence of elements associated with integration into the host chromosome, suggesting this virus may not be able to establish a lysogenic relationship. PMID:23969045

  13. Complete Genome Sequence of Cyanobacterial Siphovirus KBS2A

    OpenAIRE

    Ponsero, Alise J.; Chen, Feng; Lennon, Jay T.; Wilhelm, Steven W.

    2013-01-01

    We present the genome of a cyanosiphovirus (KBS2A) that infects a marine Synechococcus sp. (strain WH7803). Unique to this genome, relative to other sequenced cyanosiphoviruses, is the absence of elements associated with integration into the host chromosome, suggesting this virus may not be able to establish a lysogenic relationship.

  14. Complete Genome Sequence of Bacillus thuringiensis Bacteriophage Smudge.

    Science.gov (United States)

    Cornell, Jessica L; Breslin, Eileen; Schuhmacher, Zachary; Himelright, Madison; Berluti, Cassandra; Boyd, Charles; Carson, Rachel; Del Gallo, Elle; Giessler, Caris; Gilliam, Benjamin; Heatherly, Catherine; Nevin, Julius; Nguyen, Bryan; Nguyen, Justin; Parada, Jocelyn; Sutterfield, Blake; Tukruni, Muruj; Temple, Louise

    2016-01-01

    Smudge, a bacteriophage enriched from soil using Bacillus thuringiensis DSM-350 as the host, had its complete genome sequenced. Smudge is a myovirus with a genome consisting of 292 genes and was identified as belonging to the C1 cluster of Bacillus phages. PMID:27540049

  15. Complete Genome Sequence of Cyanobacterium Leptolyngbya sp. NIES-3755

    Science.gov (United States)

    Fujisawa, Takatomo; Ohtsubo, Yoshiyuki; Katayama, Mitsunori; Misawa, Naomi; Wakazuki, Sachiko; Shimura, Yohei; Nakamura, Yasukazu; Kawachi, Masanobu; Yoshikawa, Hirofumi; Eki, Toshihiko

    2016-01-01

    Cyanobacterial genus Leptolyngbya comprises genetically diverse species, but the availability of their complete genome information is limited. Here, we isolated Leptolyngbya sp. strain NIES-3755 from soil at the Toyohashi University of Technology, Japan. We determined the complete genome sequence of the NIES-3755 strain, which is composed of one chromosome and three plasmids. PMID:26988037

  16. The complete chloroplast genome sequence of Abies nephrolepis (Pinaceae: Abietoideae)

    Directory of Open Access Journals (Sweden)

    Dong-Keun Yi

    2016-06-01

    The plant chloroplast (cp) genome has maintained a relatively conserved structure and gene content throughout evolution. Cp genome sequences have been used widely for resolving evolutionary and phylogenetic issues at various taxonomic levels of plants. Here, we report the complete cp genome of Abies nephrolepis. The A. nephrolepis cp genome is 121,336 base pairs (bp) in length, including a pair of short inverted repeat regions (IRa and IRb) of 139 bp each separated by a small single copy (SSC) region of 54,323 bp and a large single copy (LSC) region of 66,735 bp. It contains 114 genes, 68 of which are protein coding genes, 35 tRNA and four rRNA genes, six open reading frames, and one pseudogene. Seventeen repeat units and 64 simple sequence repeats (SSRs) have been detected in the A. nephrolepis cp genome. Large IR sequences locate in 42-kb inversion points (1186 bp). The A. nephrolepis cp genome is identical to Abies koreana’s which is closely related to taxa. Pairwise comparison between two cp genomes revealed 140 polymorphic sites in each. The complete cp genome sequence of A. nephrolepis has significant potential to provide information on the evolutionary pattern of Abietoideae and valuable data for development of DNA markers for easy identification and classification.
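
    The region lengths quoted in this record can be checked against the stated total: in a quadripartite chloroplast genome the total length is the sum of the LSC, the SSC and the two IR copies. The following is a minimal Python sketch of that arithmetic; the function and variable names are illustrative, not from the paper:

```python
# Region lengths (bp) as reported in the abstract for A. nephrolepis.
lsc = 66_735  # large single copy region
ssc = 54_323  # small single copy region
ir = 139      # each of the two inverted repeats (IRa, IRb)


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total cp genome length: LSC + SSC + two copies of the IR."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(lsc, ssc, ir)
print(total)
```

    The result agrees with the 121,336 bp reported for the whole genome.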

  17. Finished Genome Sequence of Collimonas arenae Cal35

    NARCIS (Netherlands)

    Wu, Je-Jia; de Jager, Victor; Deng, Wen-ling; Leveau, Johan

    2015-01-01

    We announce the finished genome sequence of soil forest isolate Collimonas arenae Cal35, which comprises a 5.6-Mbp chromosome and 41-kb plasmid. The Cal35 genome is the second one published for the bacterial genus Collimonas and represents the first opportunity for high-resolution comparison of geno

  18. Analysis Method of Citrus Genome Microarray

    Institute of Scientific and Technical Information of China (English)

    杨雪莲; 贝学军; 朱友娟

    2012-01-01

    cDNA microarrays and oligonucleotide microarrays are currently used for analysing citrus gene expression profiles. Data analysis for genome microarrays mainly comprises data preprocessing, screening for differentially expressed genes, and further analysis of those genes. Through data analysis and integration of the biological information of the samples, this paper studies plant physiological changes.

  19. Dissection of the octoploid strawberry genome by deep sequencing of the genomes of Fragaria species.

    Science.gov (United States)

    Hirakawa, Hideki; Shirasawa, Kenta; Kosugi, Shunichi; Tashiro, Kosuke; Nakayama, Shinobu; Yamada, Manabu; Kohara, Mistuyo; Watanabe, Akiko; Kishida, Yoshie; Fujishiro, Tsunakazu; Tsuruoka, Hisano; Minami, Chiharu; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Komaki, Akiko; Yanagi, Tomohiro; Guoxin, Qin; Maeda, Fumi; Ishikawa, Masami; Kuhara, Satoru; Sato, Shusei; Tabata, Satoshi; Isobe, Sachiko N

    2014-01-01

    Cultivated strawberry (Fragaria x ananassa) is octoploid and shows allogamous behaviour. The present study aims at dissecting this octoploid genome through comparison with its wild relatives, F. iinumae, F. nipponica, F. nubicola, and F. orientalis, by de novo whole-genome sequencing on Illumina and Roche 454 platforms. The total length of the assembled Illumina genome sequences obtained was 698 Mb for F. x ananassa, and ∼200 Mb each for the four wild species. Subsequently, a virtual reference genome termed FANhybrid_r1.2 was constructed by integrating the sequences of the four homoeologous subgenomes of F. x ananassa, from which heterozygous regions in the Roche 454 and Illumina genome sequences were eliminated. The total length of FANhybrid_r1.2 thus created was 173.2 Mb with an N50 length of 5137 bp. The Illumina-assembled genome sequences of F. x ananassa and the four wild species were then mapped onto the reference genome, along with the previously published F. vesca genome sequence, to establish the subgenomic structure of F. x ananassa. The strategy adopted in this study has turned out to be successful in dissecting the genome of octoploid F. x ananassa and appears promising when applied to the analysis of other polyploid plant species. PMID:24282021
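
    The N50 length cited in this record is a standard assembly statistic: the largest length L such that sequences of length L or longer together cover at least half of the total assembly. The following is a minimal Python sketch of that definition, using hypothetical toy contig lengths rather than data from the study:

```python
def n50(lengths):
    """Return the N50: the largest length L such that sequences of
    length >= L together cover at least half of the total assembly."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical contig lengths in bp, for illustration only.
toy_contigs = [100, 80, 60, 40, 20]
print(n50(toy_contigs))
```

    For the toy input the two longest contigs (100 and 80 bp) already cover more than half of the 300 bp total, so the N50 is 80.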

  20. Comparative genomics beyond sequence-based alignments

    DEFF Research Database (Denmark)

    Þórarinsson, Elfar; Yao, Zizhen; Wiklund, Eric D.;

    2008-01-01

    Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment me...

  1. First fungal genome sequence from Africa: A preliminary analysis

    Directory of Open Access Journals (Sweden)

    Rene Sutherland

    2012-01-01

    Some of the most significant breakthroughs in the biological sciences this century will emerge from the development of next generation sequencing technologies. The ease of availability of DNA sequence made possible through these new technologies has given researchers opportunities to study organisms in a manner that was not possible with Sanger sequencing. Scientists will, therefore, need to embrace genomics, as well as develop and nurture the human capacity to sequence genomes and utilise the 'tsunami' of data that emerge from genome sequencing. In response to these challenges, we sequenced the genome of Fusarium circinatum, a fungal pathogen of pine that causes pitch canker, a disease of great concern to the South African forestry industry. The sequencing work was conducted in South Africa, making F. circinatum the first eukaryotic organism for which the complete genome has been sequenced locally. Here we report on the process that was followed to sequence, assemble and perform a preliminary characterisation of the genome. Furthermore, details of the computer annotation and manual curation of this genome are presented. The F. circinatum genome was found to be nearly 44 million bases in size, which is similar to that of four other Fusarium genomes that have been sequenced elsewhere. The genome contains just over 15 000 open reading frames, which is fewer than that of the related species, Fusarium oxysporum, but more than that for Fusarium verticillioides. Amongst the various putative gene clusters identified in F. circinatum, those encoding the secondary metabolites fumonisin and fusarin appeared to harbour evidence of gene translocation. It is anticipated that similar comparisons of other loci will provide insights into the genetic basis for pathogenicity of the pitch canker pathogen. Perhaps more importantly, this project has engaged a relatively large group of scientists

  2. Enabling technologies of genomic-scale sequence enrichment for targeted high-throughput sequencing

    OpenAIRE

    Summerer, Daniel

    2009-01-01

    Next-generation sequencing has still not reached its full potential due to the technical inability to effectively target desired genomic regions of interest. Once available, methods addressing this bottleneck will dramatically reduce cost and enable the efficient analysis of complex samples. Recently, a number of possible approaches for genomic-scale sequence enrichment have been reported using different strategies. All methods basically rely on sequence-specific nucleic acid hybridization,...

  3. Identification of photoperception and light signal transduction pathways in citrus

    Directory of Open Access Journals (Sweden)

    Vera Quecini

    2007-01-01

    Studies employing model species have elucidated several aspects of photoperception and light signal transduction that control plant development. However, the information available for economically important crops is scarce. Citrus genome databases of expressed sequence tags (EST) were investigated in order to identify genes coding for functionally characterized proteins responsible for light-regulated developmental control in model plants. Approximately 176,200 EST sequences from 53 libraries were queried and all bona fide and putative photoreceptor gene families were found in citrus species. We have identified 53 orthologs for several families of transcriptional regulators and cytoplasmic proteins mediating photoreceptor-induced responses, although some important Arabidopsis phytochrome- and cryptochrome-signaling components are absent from citrus sequence databases. The main gene families responsible for phototropin-mediated signal transduction were present in the citrus transcriptome, including general regulatory factors (14-3-3 proteins), scaffolding elements and auxin-responsive transcription factors and transporters. A working model of light perception, signal transduction and response-eliciting in citrus is proposed based on the identified key components. These results demonstrate the power of comparative genomics between model systems and economically important crop species to elucidate several aspects of plant physiology and metabolism.

  4. Ancient Human Genome Sequence of an Extinct Palaeo-Eskimo

    DEFF Research Database (Denmark)

    Rasmussen, Morten; Li, Yingrui; Lindgreen, Stinus;

    2010-01-01

    We report here the genome sequence of an ancient human. Obtained from approximately 4,000-year-old permafrost-preserved hair, the genome represents a male individual from the first known culture to settle in Greenland. Sequenced to an average depth of 20x, we recover 79% of the diploid genome, an...... possible phenotypic characteristics of the individual that belonged to a culture whose location has yielded only trace human remains. We compare the high-confidence SNPs to those of contemporary populations to find the populations most closely related to the individual. This provides evidence for a...

  5. Complete mitochondrial genome sequence of Aoluguya reindeer (Rangifer tarandus).

    Science.gov (United States)

    Ju, Yan; Liu, Huamiao; Rong, Min; Yang, Yifeng; Wei, Haijun; Shao, Yuanchen; Chen, Xiumin; Xing, Xiumei

    2016-05-01

    The complete mitochondrial genome of the reindeer, Rangifer tarandus, was determined by accurate polymerase chain reaction. The entire genome is 16,357 bp in length and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a D-loop region, all of which are arranged in a typical vertebrate manner. The overall base composition of the reindeer's mitochondrial genome is 33.7% A, 23.1% C, 30.1% T and 13.2% G. A termination associated sequence and several conserved central sequence block domains were discovered within the control region. PMID:25469816
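
    A minimal sketch of how a base composition like the one reported above can be computed from a sequence string. The function name `base_composition` and the toy sequence are illustrative assumptions and are not taken from the paper. The reported percentages (33.7, 23.1, 30.1 and 13.2) sum to 100.1, which is consistent with rounding.

```python
from collections import Counter


def base_composition(seq):
    """Return the percentage of A, C, G and T in a DNA sequence.

    Characters other than A/C/G/T (for example N) are ignored.
    This is a hypothetical helper for illustration only.
    """
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


# Toy sequence (not reindeer data): 3 A, 1 C, 1 G, 3 T
comp = base_composition("AACGTTAT")
for base, pct in comp.items():
    print(f"{base}: {pct:.1f}%")
```

    Applied to a full mitochondrial genome read from a FASTA file, the same function would give the four percentages in one pass over the sequence.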

  6. Monitoring genomic sequences during SELEX using high-throughput sequencing: neutral SELEX.

    Directory of Open Access Journals (Sweden)

    Bob Zimmermann

    BACKGROUND: SELEX is a well-established in vitro selection tool to analyze the structure of ligand-binding nucleic acid sequences called aptamers. Genomic SELEX transforms SELEX into a tool to discover novel, genomically encoded RNA or DNA sequences binding a ligand of interest, called genomic aptamers. Concerns have been raised regarding requirements imposed on RNA sequences undergoing SELEX selection. METHODOLOGY/PRINCIPAL FINDINGS: To evaluate SELEX and assess the extent of these effects, we designed and performed a Neutral SELEX experiment omitting the selection step, such that the sequences are under the sole selective pressure of SELEX's amplification steps. Using high-throughput sequencing, we obtained thousands of full-length sequences from the initial genomic library and the pools after each of the 10 rounds of Neutral SELEX. We compared these to sequences obtained from a Genomic SELEX experiment deriving from the same initial library, but screening for RNAs binding with high affinity to the E. coli regulator protein Hfq. With each round of Neutral SELEX, sequences became less stable and changed in nucleotide content, but no sequences were enriched. In contrast, we detected substantial enrichment in the Hfq-selected set with enriched sequences having structural stability similar to the neutral sequences but with significantly different nucleotide selection. CONCLUSIONS/SIGNIFICANCE: Our data indicate that positive selection in SELEX acts independently of the neutral selective requirements imposed on the sequences. We conclude that Genomic SELEX, when combined with high-throughput sequencing of positively and neutrally selected pools, as well as the genomic library, is a powerful method to identify genomic aptamers.

  7. Specialized microbial databases for inductive exploration of microbial genome sequences

    Directory of Open Access Journals (Sweden)

    Cabau Cédric

    2005-02-01

    Background: The enormous amount of genome sequence data calls for user-oriented databases to manage sequences and annotations. Queries must include search tools permitting function identification through exploration of related objects. Methods: The GenoList package for collecting and mining microbial genome databases has been rewritten using MySQL as the database management system. Functions that were not available in MySQL, such as nested subquery, have been implemented. Results: Inductive reasoning in the study of genomes starts from "islands of knowledge", centered around genes with some known background. With this concept of "neighborhood" in mind, a modified version of the GenoList structure has been used for organizing sequence data from prokaryotic genomes of particular interest in China. GenoChore (http://bioinfo.hku.hk/genochore.html), a set of 17 specialized end-user-oriented microbial databases (including one instance of Microsporidia, Encephalitozoon cuniculi, a member of Eukarya), has been made publicly available. These databases allow the user to browse genome sequence and annotation data using standard queries. In addition, they provide a weekly update of searches against the worldwide protein sequence data libraries, allowing one to monitor annotation updates on genes of interest. Finally, they allow users to search for patterns in DNA or protein sequences, taking into account a clustering of genes into formal operons, as well as providing extra facilities to query sequences using predefined sequence patterns. Conclusion: This growing set of specialized microbial databases organizes data created by the first Chinese bacterial genome programs (ThermaList, Thermoanaerobacter tengcongensis; LeptoList, with two different genomes of Leptospira interrogans; and SepiList, Staphylococcus epidermidis), associated to related organisms for comparison.

  8. Genome sequence analysis of the model grass Brachypodium distachyon: insights into grass genome evolution

    Energy Technology Data Exchange (ETDEWEB)

    Schulman, Al

    2009-08-09

    Three subfamilies of grasses, the Ehrhartoideae (rice), the Panicoideae (maize, sorghum, sugar cane and millet), and the Pooideae (wheat, barley and cool season forage grasses) provide the basis of human nutrition and are poised to become major sources of renewable energy. Here we describe the complete genome sequence of the wild grass Brachypodium distachyon (Brachypodium), the first member of the Pooideae subfamily to be completely sequenced. Comparison of the Brachypodium, rice and sorghum genomes reveals a precise sequence-based history of genome evolution across a broad diversity of the grass family and identifies nested insertions of whole chromosomes into centromeric regions as a predominant mechanism driving chromosome evolution in the grasses. The relatively compact genome of Brachypodium is maintained by a balance of retroelement replication and loss. The complete genome sequence of Brachypodium, coupled to its exceptional promise as a model system for grass research, will support the development of new energy and food crops.

  9. Sequencing and comparative analyses of the genomes of zoysiagrasses.

    Science.gov (United States)

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-04-01

    Zoysia is a warm-season turfgrass, which comprises 11 allotetraploid species (2n = 4x = 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession 'Nagirizaki' (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella 'Wakaba' and Z. pacifica 'Zanpa' were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica 'Kyoto', Z. japonica 'Miyagi' and Z. matrella 'Chiba Fair Green', were accumulated, and aligned against the reference genome of 'Nagirizaki' along with those from 'Wakaba' and 'Zanpa'. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the 'Zoysia Genome Database' at http://zoysia.kazusa.or.jp. PMID:26975196

  10. Population of Endogenous Pararetrovirus Genomes in Carrizo Citrange

    OpenAIRE

    Roy, Avijit; Shao, Jonathan; Schneider, William L.; Hartung, John S.; Brlansky, Ronald H.

    2014-01-01

    The complete genome sequences of three related endogenous pararetroviruses (EPRVs) were obtained by 454 sequencing of nucleic acid extracts from Carrizo citrange, used as a citrus rootstock. Numerous homologous sequences have been found in the sweet orange genome. The new EPRVs are most closely related to petunia vein-clearing virus.

  11. Human genome and genetic sequencing research and informed consent

    International Nuclear Information System (INIS)

    On March 29, 2001, the Ethical Guidelines for Human Genome and Genetic Sequencing Research were established. They are intended to serve as ethical guidelines for all human genome and genetic sequencing research practice, for the purpose of upholding respect for human dignity and rights and enforcing use of proper methods in the pursuit of human genome and genetic sequencing research, with the understanding and cooperation of the public. The RadGenomics Project has prepared a research protocol and informed consent document that follow these ethical guidelines. We have endeavored to protect the privacy of individual information, and have established a procedure for examination of research practices by an ethics committee. Here we report our procedure in order to offer this concept to the patients. (authors)

  12. Open access to sequence: Browsing the Pichia pastoris genome

    Directory of Open Access Journals (Sweden)

    Graf Alexandra

    2009-10-01

    The first genome sequences of the important yeast protein production host Pichia pastoris have been released into the public domain this spring. In order to provide the scientific community easy and versatile access to the sequence, two websites have been set up as a resource for genomic sequence, gene and protein information for P. pastoris: a GBrowse-based genome browser at http://www.pichiagenome.org and a genome portal with gene annotation and browsing functionality at http://bioinformatics.psb.ugent.be/webtools/bogas. Both websites offer information on gene annotation and function, regulation and structure. In addition, a wiki-based platform allows all users to create additional information on genes, proteins, physiology and other items of P. pastoris research, so that the Pichia community can benefit from exchange of knowledge, data and materials.

  13. A Probabilistic Genome-Wide Gene Reading Frame Sequence Model

    DEFF Research Database (Denmark)

    Have, Christian Theil; Mørk, Søren

    We introduce a new type of probabilistic sequence model that models the sequential composition of reading frames of genes in a genome. Our approach extends gene finders with a model of the sequential composition of genes at the genome-level -- effectively producing a sequential genome annotation as...... output. The model can be used to obtain the most probable genome annotation based on a combination of i: a gene finder score of each gene candidate and ii: the sequence of the reading frames of gene candidates through a genome. The model --- as well as a higher order variant --- is developed and tested...... using the probabilistic logic programming language and machine learning system PRISM - a fast and efficient model prototyping environment, using bacterial gene finding performance as a benchmark of signal strength. The model is used to prune a set of gene predictions from an underlying gene finder and...

  14. Complete Genome Sequence of the Alfalfa latent virus.

    Science.gov (United States)

    Nemchinov, Lev G; Shao, Jonathan; Postnikova, Olga A

    2015-01-01

    The first complete genome sequence of the Alfalfa latent carlavirus (ALV) was obtained by primer walking and Illumina RNA sequencing. The virus differs substantially from the Czech ALV isolate and the Pea streak virus isolate from Wisconsin. The absence of a clear nucleic acid-binding protein indicates ALV divergence from other carlaviruses. PMID:25883281

  15. Draft Genome Sequence of Biocontrol Agent Bacillus cereus UW85.

    Science.gov (United States)

    Lozano, Gabriel L; Holt, Jonathan; Ravel, Jacques; Rasko, David A; Thomas, Michael G; Handelsman, Jo

    2016-01-01

    Bacillus cereus UW85 was isolated from a root of a field-grown alfalfa plant from Arlington, WI, and identified for its ability to suppress damping off, a disease caused by Phytophthora megasperma f. sp. medicaginis on alfalfa. Here, we report the draft genome sequence of B. cereus UW85, obtained by a combination of Sanger and Illumina sequencing. PMID:27587823

  16. Genome sequence of Stachybotrys chartarum Strain 51-11

    Science.gov (United States)

    Stachybotrys chartarum strain 51-11 genome was sequenced by shotgun sequencing utilizing Illumina Hiseq 2000 and PacBio long read technology. Since Stachybotrys chartarum has been implicated in health impacts within water-damaged buildings, any information extracted from the geno...

  17. Complete Genome Sequence of Vibrio alginolyticus ZJ-T.

    Science.gov (United States)

    Deng, Yiqin; Chen, Chang; Zhao, Zhe; Huang, Xiaochun; Yang, Yiying; Ding, Xiongqi

    2016-01-01

    Vibrio alginolyticus is a ubiquitous Gram-negative bacterium which is normally distributed in coastal and estuarine environments. It has been suggested to be an opportunistic pathogen of both marine animals and humans. Here, the complete genome sequence of V. alginolyticus ZJ-T was determined by Illumina high-throughput sequencing. PMID:27587824

  18. Sequencing and analysis of an Irish human genome.

    LENUS (Irish Health Repository)

    Tong, Pin

    2010-01-01

    Recent studies generating complete human sequences from Asian, African and European subgroups have revealed population-specific variation and disease susceptibility loci. Here, choosing a DNA sample from a population of interest due to its relative geographical isolation and genetic impact on further populations, we extend the above studies through the generation of 11-fold coverage of the first Irish human genome sequence.

  19. Draft Genome Sequence of Type Strain Streptococcus gordonii ATCC 10558

    DEFF Research Database (Denmark)

    Rasmussen, Louise Hesselbjerg; Dargis, Rimtas; Christensen, Jens Jørgen Elmer;

    2016-01-01

    Streptococcus gordonii ATCC 10558T was isolated from a patient with infective endocarditis in 1946 and announced as a type strain in 1989. Here, we report the 2,154,510-bp draft genome sequence of S. gordonii ATCC 10558T. This sequence will contribute to knowledge about the pathogenesis of...

  20. Genomic insight into the common carp (Cyprinus carpio) genome by sequencing analysis of BAC-end sequences

    Directory of Open Access Journals (Sweden)

    Wang Jintu

    2011-04-01

    Background: Common carp is one of the most important aquaculture teleost fish in the world. Common carp and other closely related Cyprinidae species provide over 30% of aquaculture production in the world. However, common carp genomic resources are still relatively underdeveloped. BAC end sequences (BES) are important resources for genome research on BAC-anchored genetic marker development, linkage map and physical map integration, and whole genome sequence assembling and scaffolding. Results: To develop such valuable resources in common carp (Cyprinus carpio), a total of 40,224 BAC clones were sequenced on both ends, generating 65,720 clean BES with an average read length of 647 bp after sequence processing, representing 42,522,168 bp or 2.5% of the common carp genome. The first survey of the common carp genome was conducted with various bioinformatics tools. The common carp genome contains over 17.3% of repetitive elements with GC content of 36.8% and 518 transposon ORFs. To identify and develop BAC-anchored microsatellite markers, a total of 13,581 microsatellites were detected from 10,355 BES. The coding regions of 7,127 genes were recognized from 9,443 BES on 7,453 BACs, with 1,990 BACs having genes on both ends. To evaluate the similarity to the genome of the closely related zebrafish, BES of common carp were aligned against the zebrafish genome. A total of 39,335 BES of common carp have conserved homologs on the zebrafish genome, which demonstrates the high similarity between the zebrafish and common carp genomes and indicates the feasibility of comparative mapping between zebrafish and common carp once a physical map of common carp is available. Conclusion: BAC end sequences are great resources for the first genome-wide survey of common carp. The repetitive DNA was estimated to be approximately 28% of the common carp genome, indicating the higher complexity of the genome. Comparative analysis had mapped around 40,000 BES to zebrafish genome and established over 3
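
    The BES figures quoted in this record can be cross-checked with simple arithmetic. The sketch below uses only the numbers stated in the abstract; the variable names are illustrative, and the implied genome size is a derived estimate (it depends on the rounded 2.5% figure) rather than a value reported in the record.

```python
# Figures quoted in the abstract
clean_bes = 65_720            # number of clean BAC end sequences
total_bp = 42_522_168         # total bases covered by the BES
genome_fraction = 0.025       # "2.5% of common carp genome" (rounded)

# Mean read length implied by the totals; should be close to the stated 647 bp
mean_read_length = total_bp / clean_bes

# Genome size implied by the 2.5% figure (derived, approximate)
implied_genome_bp = total_bp / genome_fraction

print(f"mean read length: {mean_read_length:.1f} bp")
print(f"implied genome size: {implied_genome_bp / 1e9:.2f} Gb")
```

    The totals agree with the stated average read length, and the 2.5% figure implies a genome on the order of 1.7 Gb under these assumptions.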

  1. Genomic Sequencing of Single Microbial Cells from Environmental Samples

    Energy Technology Data Exchange (ETDEWEB)

    Ishoey, Thomas; Woyke, Tanja; Stepanauskas, Ramunas; Novotny, Mark; Lasken, Roger S.

    2008-02-01

    Recently developed techniques allow genomic DNA sequencing from single microbial cells [Lasken RS: Single-cell genomic sequencing using multiple displacement amplification, Curr Opin Microbiol 2007, 10:510-516]. Here, we focus on research strategies for putting these methods into practice in the laboratory setting. An immediate consequence of single-cell sequencing is that it provides an alternative to culturing organisms as a prerequisite for genomic sequencing. The microgram amounts of DNA required as template are amplified from a single bacterium by a method called multiple displacement amplification (MDA) avoiding the need to grow cells. The ability to sequence DNA from individual cells will likely have an immense impact on microbiology considering the vast numbers of novel organisms, which have been inaccessible unless culture-independent methods could be used. However, special approaches have been necessary to work with amplified DNA. MDA may not recover the entire genome from the single copy present in most bacteria. Also, some sequence rearrangements can occur during the DNA amplification reaction. Over the past two years many research groups have begun to use MDA, and some practical approaches to single-cell sequencing have been developed. We review the consensus that is emerging on optimum methods, reliability of amplified template, and the proper interpretation of 'composite' genomes which result from the necessity of combining data from several single-cell MDA reactions in order to complete the assembly. Preferred laboratory methods are considered on the basis of experience at several large sequencing centers where >70% of genomes are now often recovered from single cells. Methods are reviewed for preparation of bacterial fractions from environmental samples, single-cell isolation, DNA amplification by MDA, and DNA sequencing.

  2. Draft Genome Sequence of Neisseria gonorrhoeae Sequence Type 1407, a Multidrug-Resistant Clinical Isolate.

    Science.gov (United States)

    Anselmo, A; Ciammaruconi, A; Carannante, A; Neri, A; Fazio, C; Fortunato, A; Palozzi, A M; Vacca, P; Fillo, S; Lista, F; Stefanelli, P

    2015-01-01

    Gonorrhea may become untreatable due to the spread of resistant or multidrug-resistant strains. Cefixime-resistant gonococci belonging to sequence type 1407 have been described worldwide. We report the genome sequence of Neisseria gonorrhoeae strain G2891, a multidrug-resistant isolate of sequence type 1407, collected in Italy in 2013. PMID:26272575

  3. Comparison of two Next Generation sequencing platforms for full genome sequencing of Classical Swine Fever Virus

    DEFF Research Database (Denmark)

    Fahnøe, Ulrik; Pedersen, Anders Gorm; Höper, Dirk;

    2013-01-01

    Next Generation Sequencing (NGS) is being increasingly adopted in viral research and will be the preferred technology in the years to come. We have recently sequenced several strains of Classical Swine Fever Virus (CSFV) by NGS on both Genome Sequencer FLX (GS FLX) and Ion Torrent PGM platforms. In...

  4. Identification and in silico analysis of the Citrus HSP70 molecular chaperone gene family

    OpenAIRE

    Fietto, Luciano G.; Costa, Maximiller D.L.; Cruz, Cosme D.; De Souza, Alessandra A.; Machado, Marcos A.; Fontes, Elizabeth P. B.

    2007-01-01

    The completion of the genome sequencing of the Arabidopsis thaliana model system provided a powerful molecular tool for comparative analysis of gene families present in the genome of economically relevant plant species. In this investigation, we used the sequences of the Arabidopsis Hsp70 gene family to identify and annotate the Citrus Hsp70 genes represented in the CitEST database. Based on sequence comparison analysis, we identified 18 clusters that were further divided into 5 subgroups enc...

  5. Chemical rationale for selection of isolates for genome sequencing

    DEFF Research Database (Denmark)

    Rank, Christian; Larsen, Thomas Ostenfeld; Frisvad, Jens Christian

    The advances in gene sequencing will in the near future enable researchers to affordably acquire the full genomes of handpicked isolates. We here present a method to evaluate the chemical potential of an entire species and select representatives for genome sequencing. The selection criteria for new...... strains to be sequenced can be manifold, but for studying the functional phenotype, using a metabolome based approach offers a cheap and rapid assessment of critical strains to cover the chemical diversity. We have applied this methodology on the complex A. flavus/A. oryzae group. Though these two species...

  6. Complete genome sequence of Treponema pallidum, the syphilis spirochete.

    Science.gov (United States)

    Fraser, C M; Norris, S J; Weinstock, G M; White, O; Sutton, G G; Dodson, R; Gwinn, M; Hickey, E K; Clayton, R; Ketchum, K A; Sodergren, E; Hardham, J M; McLeod, M P; Salzberg, S; Peterson, J; Khalak, H; Richardson, D; Howell, J K; Chidambaram, M; Utterback, T; McDonald, L; Artiach, P; Bowman, C; Cotton, M D; Fujii, C; Garland, S; Hatch, B; Horst, K; Roberts, K; Sandusky, M; Weidman, J; Smith, H O; Venter, J C

    1998-07-17

    The complete genome sequence of Treponema pallidum was determined and shown to be 1,138,006 base pairs containing 1041 predicted coding sequences (open reading frames). Systems for DNA replication, transcription, translation, and repair are intact, but catabolic and biosynthetic activities are minimized. The number of identifiable transporters is small, and no phosphoenolpyruvate:phosphotransferase carbohydrate transporters were found. Potential virulence factors include a family of 12 potential membrane proteins and several putative hemolysins. Comparison of the T. pallidum genome sequence with that of another pathogenic spirochete, Borrelia burgdorferi, the agent of Lyme disease, identified unique and common genes and substantiates the considerable diversity observed among pathogenic spirochetes. PMID:9665876

  7. Pig genome sequence - analysis and publication strategy

    DEFF Research Database (Denmark)

    Archibald, Alan L.; Bolund, Lars; Churcher, Carol;

    2010-01-01

    preferentially selected for sequencing. In accordance with the Bermuda and Fort Lauderdale agreements and the more recent Toronto Statement the data have been released into public sequence repositories (Genbank/EMBL, NCBI/Ensembl trace repositories) in a timely manner and in advance of publication. CONCLUSIONS...

  8. DNA sequencing leads to genomics progress in China

    Institute of Scientific and Technical Information of China (English)

    WU JiaYan; XIAO JingFa; ZHANG RuoSi; YU Jun

    2011-01-01

    1 Science in the large-scale sequencing era Ten years ago, the first draft sequence assembly of the human genome was completed [1], bringing biomedical research one step closer toward the goal of revolutionizing diagnosis, prevention, and treatment of human diseases. Recently, journalists from the journal Nature surveyed more than 1000 life scientists regarding this laudable aim [2], obtaining substantially negative responses [3]. However, almost all of those surveyed had been influenced, in one way or another, by the availability of the human genome sequence, and they also agreed with the notion that the "sequence is the start." The complexity of genome biology and almost every aspect of human biology is far greater than previously thought [4].

  9. Genomic multiple sequence alignments: refinement using a genetic algorithm

    Directory of Open Access Journals (Sweden)

    Lefkowitz Elliot J

    2005-08-01

    Background: Genomic sequence data cannot be fully appreciated in isolation. Comparative genomics, the practice of comparing genomic sequences from different species, plays an increasingly important role in understanding the genotypic differences between species that result in phenotypic differences as well as in revealing patterns of evolutionary relationships. One of the major challenges in comparative genomics is producing a high-quality alignment between two or more related genomic sequences. In recent years, a number of tools have been developed for aligning large genomic sequences. Most utilize heuristic strategies to identify a series of strong sequence similarities, which are then used as anchors to align the regions between the anchor points. The resulting alignment is globally correct, but in many cases is suboptimal locally. We describe a new program, GenAlignRefine, which improves the overall quality of global multiple alignments by using a genetic algorithm to improve local regions of alignment. Regions of low quality are identified, realigned using the program T-Coffee, and then refined using a genetic algorithm. Because a better COFFEE (Consistency based Objective Function For alignmEnt Evaluation) score generally reflects greater alignment quality, the algorithm searches for an alignment that yields a better COFFEE score. To improve the intrinsic slowness of the genetic algorithm, GenAlignRefine was implemented as a parallel, cluster-based program. Results: We tested the GenAlignRefine algorithm by running it on a Linux cluster to refine sequences from a simulation, as well as refine a multiple alignment of 15 Orthopoxvirus genomic sequences approximately 260,000 nucleotides in length that initially had been aligned by Multi-LAGAN. It took approximately 150 minutes for a 40-processor Linux cluster to optimize some 200 fuzzy (poorly aligned) regions of the orthopoxvirus alignment. Overall sequence identity increased only

  10. Construction of citrus gene coexpression networks from microarray data using random matrix theory.

    Science.gov (United States)

    Du, Dongliang; Rawat, Nidhi; Deng, Zhanao; Gmitter, Fred G

    2015-01-01

    After the sequencing of citrus genomes, gene function annotation is becoming a new challenge. Gene coexpression analysis can be employed for function annotation using publicly available microarray data sets. In this study, 230 sweet orange (Citrus sinensis) microarrays were used to construct seven coexpression networks, including one condition-independent and six condition-dependent (Citrus canker, Huanglongbing, leaves, flavedo, albedo, and flesh) networks. In total, these networks contain 37 633 edges among 6256 nodes (genes), which account for 52.11% of the measurable genes of the citrus microarray. Then, these networks were partitioned into functional modules using the Markov Cluster Algorithm. Significantly enriched Gene Ontology biological process terms and KEGG pathway terms were detected for 343 and 60 modules, respectively. Finally, independent verification of these networks was performed using another expression data set of 371 genes. This study provides new targets for further functional analyses in citrus. PMID:26504573
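
    To illustrate what a coexpression network is, the sketch below links two genes with an edge when the Pearson correlation of their expression profiles passes a cutoff. This is a simplified stand-in: the study above chose its thresholds with random matrix theory, whereas the fixed cutoff of 0.9, the function names and the toy expression values here are all assumptions made for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / (sd_x * sd_y)


def coexpression_edges(profiles, threshold=0.9):
    """Return (gene_a, gene_b, r) for gene pairs with |r| >= threshold.

    profiles: dict mapping gene id -> list of expression values,
    one value per microarray. The fixed threshold is an assumption;
    it is not the random-matrix-theory cutoff used in the study.
    """
    edges = []
    for gene_a, gene_b in combinations(sorted(profiles), 2):
        r = pearson(profiles[gene_a], profiles[gene_b])
        if abs(r) >= threshold:
            edges.append((gene_a, gene_b, r))
    return edges


# Toy data: geneB is a scaled copy of geneA, geneC is only weakly related
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [4.0, 1.0, 3.0, 2.0],
}
edges = coexpression_edges(profiles)
print(edges)
```

    The resulting edge list is the kind of input a clustering step such as the Markov Cluster Algorithm would then partition into modules.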

  11. Molecular epidemiology of dengue viruses from complete genome sequences

    OpenAIRE

    Ong, Swee Hoe

    2010-01-01

    The availability of the complete genetic blueprint of the dengue virus is essential in molecular epidemiological studies to uncover the role of the virus in dengue pathogenesis. During the course of this project, over two hundred complete genomes of the dengue virus were generated from clinical samples collected in three dengue-endemic Southeast Asian countries. In addition, a bioinformatics platform integrating a sequence database, sequence retrieval tools, sequence annotation data and a var...

  12. Information-theoretic View of Sequence Organization in a Genome

    OpenAIRE

    Luo, Liaofu; Gao, Yang; Lu, Jun

    2010-01-01

    Sequence organizations are viewed from two points of view: one is informational redundancy or informational correlation (IC) and the other is k-mer frequency statistics. Two problems are investigated. The first is how the ICs exceed the fluctuation bound and order emerges from fluctuation in a genome when the sequence length attains some critical value. We demonstrated that the transition from fluctuation to order takes place at a sequence length of about 200-300 thousand bases for human and ...

  13. Physical map-assisted whole-genome shotgun sequence assemblies

    OpenAIRE

    Warren, René L.; Varabei, Dmitry; Platt, Darren; Huang, Xiaoqiu; Messina, David; Yang, Shiaw-Pyng; Kronstad, James W.; Krzywinski, Martin; Warren, Wesley C; Wallis, John W.; Hillier, LaDeana W.; Chinwalla, Asif T.; Schein, Jacqueline E.; Siddiqui, Asim S.; Marra, Marco A.

    2006-01-01

    We describe a targeted approach to improve the contiguity of whole-genome shotgun sequence (WGS) assemblies at run-time, using information from Bacterial Artificial Chromosome (BAC)-based physical maps. Clone sizes and overlaps derived from clone fingerprints are used for the calculation of length constraints between any two BAC neighbors sharing 40% of their size. These constraints are used to promote the linkage and guide the arrangement of sequence contigs within a sequence scaffold at the...

  14. A label-free differential quantitative mass spectrometry method for the characterization and identification of protein changes during citrus fruit development

    Directory of Open Access Journals (Sweden)

    Lin Dawei

    2010-12-01

    Full Text Available Abstract Background: Citrus is one of the most important and widely grown commodity fruit crops. In this study a label-free LC-MS/MS based shotgun proteomics approach was taken to explore three main stages of citrus fruit development. These approaches were used to identify and evaluate changes occurring in juice sac cells in various metabolic pathways affecting citrus fruit development and quality. Results: Protein changes in citrus juice sac cells were identified and quantified using label-free shotgun methodologies. Two alternative methods, differential mass-spectrometry (dMS) and spectral counting (SC), were used to analyze protein changes occurring during early and late stages of fruit development. Both methods were compared in order to develop a proteomics workflow that could be used in a non-model plant lacking a sequenced genome. In order to resolve the bioinformatics limitations of EST databases from species that lack a fully sequenced genome, we established iCitrus. iCitrus is a comprehensive sequence database created by merging three major sources of sequences (HarvEST:citrus, NCBI/citrus/unigenes, NCBI/citrus/proteins) and improving the annotation of existing unigenes. iCitrus provided a useful bioinformatics tool for the high-throughput identification of citrus proteins. We have identified approximately 1500 citrus proteins expressed in fruit juice sac cells and quantified the changes of their expression during fruit development. Our results showed that both dMS and SC provided significant information on protein changes, with dMS providing a higher accuracy. Conclusion: Our data support the complementary use of dMS and SC for label-free comparative proteomics, broadening the identification spectrum and strengthening the identification of trends in protein expression changes during the particular processes being compared.

  15. Draft Genome Sequences of Two Virulent Serotypes of Avian Pasteurella multocida

    OpenAIRE

    Abrahante, Juan E.; Johnson, Timothy J.; Hunter, Samuel S.; Maheswaran, Samuel K.; Hauglund, Melissa J.; Bayles, Darrell O.; Tatum, Fred M.; Briggs, Robert E.

    2013-01-01

    Here we report the draft genome sequences of two virulent avian strains of Pasteurella multocida. Comparative analyses of these genomes were done with the published genome sequence of avirulent P. multocida strain Pm70.

  16. Mitochondrial genome sequencing helps show the evolutionary mechanism of mitochondrial genome formation in Brassica

    Directory of Open Access Journals (Sweden)

    Yan Jiyong

    2011-10-01

    Full Text Available Abstract Background: Angiosperm mitochondrial genomes are more complex than those of other organisms. Analyses of the mitochondrial genome sequences of at least 11 angiosperm species have shown several common properties; these cannot easily explain, however, how the diverse mitotypes evolved within each genus or species. We analyzed the evolutionary relationships of Brassica mitotypes by sequencing. Results: We sequenced the mitotypes of cam (Brassica rapa), ole (B. oleracea), jun (B. juncea), and car (B. carinata) and analyzed them together with two previously sequenced mitotypes of B. napus (pol and nap). The sizes of whole single circular genomes of cam, jun, ole, and car are 219,747 bp, 219,766 bp, 360,271 bp, and 232,241 bp, respectively. The mitochondrial genome of ole is largest as a result of the duplication of a 141.8 kb segment. The jun mitotype is the result of an inherited cam mitotype, and pol is also derived from the cam mitotype with evolutionary modifications. Genes with known functions are conserved in all mitotypes, but clear variation in open reading frames (ORFs) with unknown functions among the six mitotypes was observed. Sequence relationship analysis showed that there has been genome compaction and inheritance in the course of Brassica mitotype evolution. Conclusions: We have sequenced four Brassica mitotypes, compared six Brassica mitotypes and suggested a mechanism for mitochondrial genome formation in Brassica, including evolutionary events such as inheritance, duplication, rearrangement, genome compaction, and mutation.

  17. The diploid genome sequence of an individual human.

    Directory of Open Access Journals (Sweden)

    Samuel Levy

    2007-09-01

    Full Text Available Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2-206 bp), 292,102 heterozygous insertion/deletion events (indels) (1-571 bp), 559,473 homozygous indels (1-82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor; however, these events involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.
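    The 22% figure can be checked against the variant counts quoted in the abstract. The short calculation below is a reader's sanity check, not part of the original study. It counts only the five itemized variant classes and assumes the uncounted segmental duplications and copy number variation regions do not materially change the total.

```python
# Variant counts as quoted in the abstract.
variants = {
    "SNP": 3_213_401,
    "block substitution": 53_823,
    "heterozygous indel": 292_102,
    "homozygous indel": 559_473,
    "inversion": 90,
}

total = sum(variants.values())        # all itemized events
non_snp = total - variants["SNP"]     # every class except SNPs
non_snp_share = non_snp / total       # fraction of events that are non-SNP

print(f"total events: {total:,}")
print(f"non-SNP events: {non_snp:,} ({non_snp_share:.1%})")
```

    The itemized classes sum to 4,118,889 events, consistent with "more than 4.1 million", and the non-SNP share rounds to 22%.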

  18. Genome-wide sequence variations among Mycobacterium avium subspecies paratuberculosis.

    Directory of Open Access Journals (Sweden)

    Adel M Talaat

    2011-12-01

    Full Text Available Mycobacterium avium subspecies paratuberculosis (M. ap), the causative agent of Johne’s disease (JD), infects many farmed ruminants, wildlife animals and humans. To better understand the molecular pathogenesis of these infections, we analyzed the whole genome sequences of several M. ap and M. avium subspecies avium (M. avium) strains isolated from various hosts and environments. Using next-generation sequencing technology, all 6 M. ap isolates showed a high percentage of homology (98%) to the reference genome sequence of M. ap K-10 isolated from cattle. However, 2 M. avium isolates (DT 78 and Env 77) showed significant sequence diversity from the reference strain M. avium 104. The genomes of M. avium isolates DT 78 and Env 77 exhibited only 87% and 40% homology, respectively, to the M. avium 104 reference genome. Within the M. ap isolates, genomic rearrangements (insertions/deletions, Indels) were not detected, and only unique single nucleotide polymorphisms (SNPs) were observed among the 6 M. ap strains. While most of the SNPs (~100) in M. ap genomes were non-synonymous, a total of ~6000 SNPs were detected among M. avium genomes, most of which were synonymous, suggesting a differential selective pressure between M. ap and M. avium isolates. In addition, SNP-based phylogenomic analysis showed that isolates from goat and Oryx are closely related to the cattle (K-10) strain while the human isolate (M. ap 4B) is closely related to the environmental strains, indicating an environmental source of human infections. Overall, SNPs were the most common variations among M. ap isolates while SNPs in addition to Indels were prevalent among M. avium isolates. Genomic variations will be useful in designing host-specific markers for the analysis of mycobacterial evolution and for developing novel diagnostics directed against Johne’s disease in animals.

  19. Comparison of methods for genomic localization of gene trap sequences

    Directory of Open Access Journals (Sweden)

    Ferrin Thomas E

    2006-09-01

    Full Text Available Abstract Background: Gene knockouts in a model organism such as mouse provide a valuable resource for the study of basic biology and human disease. Determining which gene has been inactivated by an untargeted gene trapping event poses a challenging annotation problem because gene trap sequence tags, which represent sequence near the vector insertion site of a trapped gene, are typically short and often contain unresolved residues. To better understand the localization of these sequences on the mouse genome, we compared stand-alone versions of the alignment programs BLAT, SSAHA, and MegaBLAST. A set of 3,369 sequence tags was aligned to build 34 of the mouse genome using default parameters for each algorithm. Known genome coordinates for the cognate set of full-length genes (1,659 sequences) were used to evaluate localization results. Results: In general, all three programs performed well in terms of localizing sequences to a general region of the genome, with only relatively subtle errors identified for a small proportion of the sequence tags. However, large differences in performance were noted with regard to correctly identifying exon boundaries. BLAT correctly identified the vast majority of exon boundaries, while SSAHA and MegaBLAST missed the majority of exon boundaries. SSAHA consistently reported the fewest false positives and was the fastest algorithm. MegaBLAST was comparable to BLAT in speed, but was the most susceptible to localizing sequence tags incorrectly to pseudogenes. Conclusion: The differences in performance for sequence tags and full-length reference sequences were surprisingly small. Characteristic variations in localization results for each program were noted that affect the localization of sequence at exon boundaries, in particular.

  20. Evolution Analysis of Simple Sequence Repeats in Plant Genome.

    Directory of Open Access Journals (Sweden)

    Zhen Qin

    Full Text Available Simple sequence repeats (SSRs) are widespread units in genome sequences and play many important roles in plants. In order to reveal the evolution of plant genomes, we investigated the evolutionary regularities of SSRs during the evolution of plant species and the plant kingdom by analyzing twelve sequenced plant genomes. First, in the twelve studied plant genomes, the main SSRs were those containing repeats of 1-3 nucleotide combinations. Second, in mononucleotide SSRs, the A/T percentage gradually increased along with the evolution of plants (except for P. patens). With the increase of SSR repeat number, the percentage of A/T in C. reinhardtii showed no significant change, while the percentage of A/T in terrestrial plant species gradually declined. Third, in dinucleotide SSRs, the percentage of AT/TA increased along with the evolution of the plant kingdom, and the repeat number increased in terrestrial plant species. This trend was more obvious in dicotyledons than in monocotyledons. The percentage of CG/GC showed the opposite pattern to that of AT/TA. Fourth, in trinucleotide SSRs, the percentages of combinations including two or three A/T showed a rising trend along with the evolution of the plant kingdom; meanwhile, with the increase of SSR repeat number in plant species, different species chose different combinations as dominant SSRs. SSRs in C. reinhardtii, P. patens, Z. mays and A. thaliana showed specific patterns related to evolutionary position or specific changes of genome sequences. The results showed that SSRs not only followed a general pattern in the evolution of the plant kingdom, but also were associated with the evolution of the specific genome sequence. The study of the evolutionary regularities of SSRs provided new insights for the analysis of plant genome evolution.
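    As an illustration of the kind of survey summarized above, the sketch below locates mono-, di- and trinucleotide SSRs with a regular expression. It is not the authors' pipeline. The function name, the minimum of four repeats and the toy sequence are assumptions chosen for the example.

```python
import re


def find_ssrs(seq, max_motif=3, min_repeats=4):
    """Return sorted (start, motif, repeat_count) tuples for tandem repeats.

    Motif lengths 1..max_motif are scanned. min_repeats is an assumed
    cutoff, not a value taken from the study.
    """
    seq = seq.upper()
    hits = []
    for k in range(1, max_motif + 1):
        # A k-mer followed by at least (min_repeats - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip homopolymers reported as longer motifs, e.g. "AA".
            # This check is sufficient for motif lengths up to 3.
            if k > 1 and len(set(motif)) == 1:
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


# Hypothetical sequence containing one SSR of each motif length.
toy_seq = "GG" + "A" * 6 + "C" + "AT" * 5 + "T" + "CAG" * 4 + "TT"
ssrs = find_ssrs(toy_seq)

# Share of mononucleotide SSRs whose motif is A or T.
mono = [h for h in ssrs if len(h[1]) == 1]
at_share = sum(h[1] in "AT" for h in mono) / len(mono) if mono else 0.0
```

    Tallying such hits by motif and repeat count per genome would give the kind of A/T and AT/TA percentages the abstract compares across species.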

  1. Enhanced Dynamic Algorithm of Genome Sequence Alignments

    OpenAIRE

    Arabi E. keshk

    2014-01-01

    The merging of biology and computer science has created a new field called computational biology, or bioinformatics, that explores the capacity of computers to gain knowledge from biological data. Computational biology is rooted in life sciences as well as computers, information sciences, and technologies. The main problem in computational biology is sequence alignment, a way of arranging the sequences of DNA, RNA or protein to identify regions of similarity and relationships between se...
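    For readers unfamiliar with dynamic-programming alignment, the sketch below computes a global alignment score with the standard Needleman-Wunsch recurrence. It is the textbook baseline, not the enhanced algorithm this record proposes, whose details are truncated here. The scoring values (match +1, mismatch -1, gap -1) and the function name are assumptions.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment score using two rolling rows."""
    # Row 0: aligning the empty prefix of `a` against prefixes of `b`.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # column 0: prefix of `a` against empty string
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap inserted in `b`
            left = curr[j - 1] + gap  # gap inserted in `a`
            curr.append(max(diag, up, left))
        prev = curr
    return prev[-1]


score_same = global_alignment_score("ACGT", "ACGT")
score_gap = global_alignment_score("ACGT", "AGT")
```

    Keeping only two rows reduces memory to O(len(b)) when just the score is needed. Recovering the alignment itself would require the full matrix or a traceback scheme.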

  2. Isolation, classification and transcription profiles of the AP2/ERF transcription factor superfamily in citrus.

    Science.gov (United States)

    Xie, Xiu-lan; Shen, Shu-ling; Yin, Xue-ren; Xu, Qian; Sun, Chong-de; Grierson, Donald; Ferguson, Ian; Chen, Kun-song

    2014-07-01

    The AP2/ERF gene family encodes plant-specific transcription factors. In model plants, AP2/ERF genes have been shown to be expressed in response to developmental and environmental stimuli, and many function downstream of the ethylene, biotic, and abiotic stress signaling pathways. In citrus, ethylene is effective in regulating fruit quality traits such as degreening and aroma. However, information about the citrus AP2/ERF family is limited, and more of it would enhance our understanding of fruit responses to environmental stress, fruit development and quality. CitAP2/ERF genes were isolated using the citrus genome database, and their expression patterns analyzed by real-time PCR using various orange organs and samples from a fruit developmental series. 126 sequences with homologies to AP2/ERF proteins were identified from the citrus genome, and, on the basis of their structure and sequence, assigned to the ERF family (102), AP2 family (18), RAV family (4) and Soloist (2). MEME motif analysis predicted the defining AP2/ERF domain and EAR repressor domains. Analysis of transcript accumulation in Citrus sinensis cv. 'Newhall' indicated that CitAP2/ERF genes show organ-specific and temporal expression, and provided a framework for understanding the transcriptional regulatory roles of AP2/ERF gene family members in citrus. Hierarchical cluster analysis and t tests identified regulators that potentially function during orange fruit growth and development. PMID:24566692

  3. Toward Complete Bacterial Genome Sequencing Through the Combined Use of Multiple Next-Generation Sequencing Platforms.

    Science.gov (United States)

    Jeong, Haeyoung; Lee, Dae-Hee; Ryu, Choong-Min; Park, Seung-Hwan

    2016-01-01

    PacBio's long-read sequencing technologies can be successfully used for a complete bacterial genome assembly using recently developed non-hybrid assemblers in the absence of second-generation, high-quality short reads. However, standardized procedures that take into account multiple pre-existing second-generation sequencing platforms are scarce. In addition to Illumina HiSeq and Ion Torrent PGM-based genome sequencing results derived from previous studies, we generated further sequencing data, including from the PacBio RS II platform, and applied various bioinformatics tools to obtain complete genome assemblies for five bacterial strains. Our approach revealed that the hierarchical genome assembly process (HGAP) non-hybrid assembler resulted in nearly complete assemblies at a moderate coverage of ~75x, but that different versions produced non-compatible results requiring post-processing. The other two platforms further improved the PacBio assembly through scaffolding and a final error correction. PMID:26464377

  4. Genome-Wide Characterization and Expression Analysis of Major Intrinsic Proteins during Abiotic and Biotic Stresses in Sweet Orange (Citrus sinensis L. Osb.)

    OpenAIRE

    de Paula Santos Martins, Cristina; Pedrosa, Andresa Muniz; Du, Dongliang; Gonçalves, Luana Pereira; Yu, Qibin; Gmitter, Frederick G.; Costa, Marcio Gilberto Cardoso

    2015-01-01

    The family of aquaporins (AQPs), or major intrinsic proteins (MIPs), includes integral membrane proteins that function as transmembrane channels for water and other small molecules of physiological significance. MIPs are classified into five subfamilies in higher plants, including plasma membrane (PIPs), tonoplast (TIPs), NOD26-like (NIPs), small basic (SIPs) and unclassified X (XIPs) intrinsic proteins. This study reports a genome-wide survey of MIP encoding genes in sweet orange (Citrus sin...

  5. The complete genome sequence of 'Candidatus Liberibacter solanacearum', the bacterium associated with potato zebra chip disease.

    Directory of Open Access Journals (Sweden)

    Hong Lin

    Full Text Available Zebra Chip (ZC) is an emerging plant disease that causes aboveground decline of potato shoots and generally results in unusable tubers. This disease has led to multi-million dollar losses for growers in the central and western United States over the past decade and impacts the livelihood of potato farmers in Mexico and New Zealand. ZC is associated with 'Candidatus Liberibacter solanacearum', a fastidious alpha-proteobacterium that is transmitted by a phloem-feeding psyllid vector, Bactericera cockerelli Sulc. Research on this disease has been hampered by a lack of robust culture methods and paucity of genome sequence information for 'Ca. L. solanacearum'. Here we present the sequence of the 1.26 Mbp metagenome of 'Ca. L. solanacearum', based on DNA isolated from potato psyllids. The coding inventory of the 'Ca. L. solanacearum' genome was analyzed and compared to related Rhizobiaceae to better understand 'Ca. L. solanacearum' physiology and identify potential targets to develop improved treatment strategies. This analysis revealed a number of unique transporters and pathways, all potentially contributing to ZC pathogenesis. Some of these factors may have been acquired through horizontal gene transfer. Taxonomically, 'Ca. L. solanacearum' is related to 'Ca. L. asiaticus', a suspected causative agent of citrus huanglongbing, yet many genome rearrangements and several gene gains/losses are evident when comparing these two Liberibacter species. Relative to 'Ca. L. asiaticus', 'Ca. L. solanacearum' probably has reduced capacity for nucleic acid modification, increased amino acid and vitamin biosynthesis functionalities, and gained a high-affinity iron transport system characteristic of several pathogenic microbes.

  6. Plasmodium knowlesi genome sequences from clinical isolates reveal extensive genomic dimorphism.

    Directory of Open Access Journals (Sweden)

    Miguel M Pinheiro

    Full Text Available Plasmodium knowlesi is a newly described zoonosis that causes malaria in the human population that can be severe and fatal. The study of P. knowlesi parasites from human clinical isolates is relatively new and, in order to obtain maximum information from patient sample collections, we explored the possibility of generating P. knowlesi genome sequences from archived clinical isolates. Our patient sample collection consisted of frozen whole blood samples that contained excessive human DNA contamination and, in that form, were not suitable for parasite genome sequencing. We developed a method to reduce the amount of human DNA in the thawed blood samples in preparation for high throughput parasite genome sequencing using Illumina HiSeq and MiSeq sequencing platforms. Seven of fifteen samples processed had sufficiently pure P. knowlesi DNA for whole genome sequencing. The reads were mapped to the P. knowlesi H strain reference genome and an average mapping of 90% was obtained. Genes with low coverage were removed leaving 4623 genes for subsequent analyses. Previously we identified a DNA sequence dimorphism on a small fragment of the P. knowlesi normocyte binding protein xa gene on chromosome 14. We used the genome data to assemble full-length Pknbpxa sequences and discovered that the dimorphism extended along the gene. An in-house algorithm was developed to detect SNP sites co-associating with the dimorphism. More than half of the P. knowlesi genome was dimorphic, involving genes on all chromosomes and suggesting that two distinct types of P. knowlesi infect the human population in Sarawak, Malaysian Borneo. We use P. knowlesi clinical samples to demonstrate that Plasmodium DNA from archived patient samples can produce high quality genome data. We show that analyses, of even small numbers of difficult clinical malaria isolates, can generate comprehensive genomic information that will improve our understanding of malaria parasite diversity and

  7. Complete genome sequence of Arcobacter nitrofigilis type strain (CIT)

    Energy Technology Data Exchange (ETDEWEB)

    Pati, Amrita [U.S. Department of Energy, Joint Genome Institute; Gronow, Sabine [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Copeland, A [U.S. Department of Energy, Joint Genome Institute; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute; Han, Cliff [Los Alamos National Laboratory (LANL); Chertkov, Olga [Los Alamos National Laboratory (LANL); Bruce, David [Los Alamos National Laboratory (LANL); Tapia, Roxanne [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute; Ivanova, N [U.S. Department of Energy, Joint Genome Institute; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute; Chen, Amy [U.S. Department of Energy, Joint Genome Institute; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL); Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Bristow, James [U.S. Department of Energy, Joint Genome Institute; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]

    2010-01-01

    Arcobacter nitrofigilis (McClung et al. 1983) Vandamme et al. 1991 is the type species of the genus Arcobacter in the epsilonproteobacterial family Campylobacteraceae. The species was first described in 1983 as Campylobacter nitrofigilis [1] after its detection as a free-living, nitrogen-fixing Campylobacter species associated with Spartina alterniflora Loisel. roots [2]. It is of phylogenetic interest because of its lifestyle as a symbiotic organism in a marine environment in contrast to many other Arcobacter species which are associated with warm-blooded animals and tend to be pathogenic. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first complete genome sequence of a type strain of the genus Arcobacter. The 3,192,235 bp genome with its 3,154 protein-coding and 70 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  8. The complete plastid genome sequence of Bomarea edulis (Alstroemeriaceae: Liliales).

    Science.gov (United States)

    Kim, Jung Sung; Kim, Hyoung Tae; Yoon, Chang Young; Kim, Joo-Hwan

    2016-05-01

    Bomarea, a member of the family Alstroemeriaceae, is distributed from Chile to Mexico and includes approximately 120 species. Recent molecular phylogenetic studies have clarified the monophyly of the family within the order Liliales and the sister relationship with the family Colchicaceae. At this time, five plastid genomes of Liliales have been analyzed at the familial level. To examine plastid genome variation at the generic level, we sequenced the plastid genome of Bomarea edulis, which is the most widely distributed species in the genus, and compared it with Alstroemeria aurea. The plastid genome sequence of B. edulis was 154,925 bp in length with a similar structure as A. aurea, excluding the IR-LSC junction. Ycf68 and infA were pseudogenes caused by frameshift mutations, and the ycf15 gene was deleted, similar to A. aurea. PMID:25319309

  9. The complete mitochondrial genome sequence of Malus hupehensis var. pinyiensis.

    Science.gov (United States)

    Duan, Naibin; Sun, Honghe; Wang, Nan; Fei, Zhangjun; Chen, Xuesen

    2016-07-01

    The complete mitochondrial genome sequence of Malus hupehensis var. pinyiensis, a widely used apple rootstock, was determined using the Illumina high-throughput sequencing approach. The genome is 422,555 bp in length and has a GC content of 45.21%. It is separated by a pair of inverted repeats of 32,504 bp, to form a large single copy region of 213,055 bp and a small single copy region of 144,492 bp. The genome contains 38 protein-coding genes, four pseudogenes, 25 tRNA genes, and three rRNA genes. The genome is 25,608 bp longer than that of M. domestica, and several structural variations between these two mitogenomes were detected. PMID:26539696
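    The region lengths quoted above can be checked for internal consistency: a genome divided by a pair of inverted repeats into a large and a small single copy region should have a total length equal to the two single copy regions plus twice the repeat length. The snippet below is a reader's sanity check on the abstract's numbers, and the variable names are illustrative.

```python
# Lengths in bp, as quoted in the abstract.
total_length = 422_555
inverted_repeat = 32_504       # one copy of the pair
large_single_copy = 213_055
small_single_copy = 144_492

# Expected total: both single copy regions plus two repeat copies.
reconstructed = large_single_copy + small_single_copy + 2 * inverted_repeat

# Implied M. domestica mitogenome length from the stated difference.
m_domestica_length = total_length - 25_608

print(reconstructed == total_length, m_domestica_length)
```

    The parts sum exactly to the reported 422,555 bp, and the stated 25,608 bp difference implies an M. domestica mitogenome of 396,947 bp.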

  10. Draft genome sequence of the rubber tree Hevea brasiliensis

    Directory of Open Access Journals (Sweden)

    Rahman Ahmad Yamin Abdul

    2013-02-01

    Full Text Available Abstract Background: Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Results: Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,955 gene models, of which 12.7% are unique to Hevea. Most of the key genes associated with rubber biosynthesis, rubberwood formation, disease resistance, and allergenicity have been identified. Conclusions: The knowledge gained from this genome sequence will aid in the future development of high-yielding clones to keep up with the ever increasing need for natural rubber.

  11. An automated annotation tool for genomic DNA sequences using GeneScan and BLAST

    Indian Academy of Sciences (India)

    Andrew M. Lynn; Chakresh Kumar Jain; K. Kosalai; Pranjan Barman; Nupur Thakur; Harish Batra; Alok Bhattacharya

    2001-04-01

    Genomic sequence data are often available well before the annotated sequence is published. We present a method for analysis of genomic DNA to identify coding sequences using the GeneScan algorithm and characterize these resultant sequences by BLAST. The routines are used to develop a system for automated annotation of genome DNA sequences.

  12. Accuracy of genomic prediction using imputed whole-genome sequence data in white layers.

    Science.gov (United States)

    Heidaritabar, M; Calus, M P L; Megens, H-J; Vereijken, A; Groenen, M A M; Bastiaansen, J W M

    2016-06-01

    There is an increasing interest in using whole-genome sequence data in genomic selection breeding programmes. Prediction of breeding values is expected to be more accurate when whole-genome sequence is used, because the causal mutations are assumed to be in the data. We performed genomic prediction for the number of eggs in white layers using imputed whole-genome resequence data including ~4.6 million SNPs. The prediction accuracies based on sequence data were compared with the accuracies from the 60 K SNP panel. Predictions were based on genomic best linear unbiased prediction (GBLUP) as well as a Bayesian variable selection model (BayesC). Moreover, the prediction accuracy from using different types of variants (synonymous, non-synonymous and non-coding SNPs) was evaluated. Genomic prediction using the 60 K SNP panel resulted in a prediction accuracy of 0.74 when GBLUP was applied. With sequence data, there was a small increase (~1%) in prediction accuracy over the 60 K genotypes. With both 60 K SNP panel and sequence data, GBLUP slightly outperformed BayesC in predicting the breeding values. Selection of SNPs more likely to affect the phenotype (i.e. non-synonymous SNPs) did not improve the accuracy of genomic prediction. The fact that sequence data were based on imputation from a small number of sequenced animals may have limited the potential to improve the prediction accuracy. A small reference population (n = 1004) and possible exclusion of many causal SNPs during quality control can be other possible reasons for limited benefit of sequence data. We expect, however, that the limited improvement is because the 60 K SNP panel was already sufficiently dense to accurately determine the relationships between animals in our data. PMID:26776363
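    GBLUP relies on a genomic relationship matrix built from SNP genotypes. The abstract does not state how that matrix was computed, so the sketch below uses one widely used construction (VanRaden's first method) purely as an assumption: G = ZZ' / (2 Σ p(1 - p)), where Z holds genotypes coded 0/1/2 and centred by twice the allele frequency p. The function name and toy genotypes are hypothetical.

```python
def genomic_relationship_matrix(genotypes):
    """Build G = ZZ' / (2 * sum(p * (1 - p))) from 0/1/2 genotype rows.

    genotypes: one list per animal, one 0/1/2 allele count per SNP.
    Assumes at least one SNP is polymorphic, so the scale is non-zero.
    """
    n_animals = len(genotypes)
    n_snps = len(genotypes[0])
    # Allele frequency per SNP, estimated from the sample itself.
    freqs = [sum(row[j] for row in genotypes) / (2 * n_animals)
             for j in range(n_snps)]
    scale = 2 * sum(p * (1 - p) for p in freqs)
    # Centre each genotype by its expected value 2p.
    centred = [[row[j] - 2 * freqs[j] for j in range(n_snps)]
               for row in genotypes]
    return [[sum(a * b for a, b in zip(zi, zk)) / scale for zk in centred]
            for zi in centred]


# Hypothetical genotypes: three animals, two SNPs.
toy_genotypes = [[0, 2], [1, 1], [2, 0]]
G = genomic_relationship_matrix(toy_genotypes)
```

    With a 60 K panel or ~4.6 million imputed SNPs the same construction applies, only with far more columns. This is consistent with the authors' remark that the denser data added little when the panel already captured relationships between animals.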

  13. Genome and exome sequencing in the clinic: unbiased genomic approaches with a high diagnostic yield

    NARCIS (Netherlands)

    Nelen, M.; Veltman, J.A.

    2012-01-01

    For the reasons discussed here, we think whole-genome- or exome-based approaches are currently most suited for diagnostic implementation in genetically heterogeneous diseases, initially to complement and later to replace Sanger sequencing, qPCR and genomic microarrays. Patients do need to be counsel

  14. Complete genome sequence and comparative genomic analysis of an emerging human pathogen, serotype V Streptococcus agalactiae

    NARCIS (Netherlands)

    Tettelin, H; Masignani; Cieslewicz, MJ; Eisen, JA; Peterson, S; Paulsen, IT; Nelson, KE; Margarit; Read, TD; Madoff, LC; Beanan, MJ; Brinkac, LM; Daugherty, SC; DeBoy, RT; Durkin, AS; Kolonay, JF; Madupu, R; Lewis, MR; Radune, D; Fedorova, NB; Scanlan, D; Khouri, H; Mulligan, S; Carty, HA; Cline, RT; Van Aken, SE; Gill, J; Scarselli, M; Mora, M; Iacobini, ET; Brettoni, C; Galli, G; Mariani, M; Vegni, F; Maione, D; Rinaudo, D; Rappuoli, R; Telford, JL; Kasper, DL; Grandi, G; Fraser, CM

    2002-01-01

    The 2,160,267 bp genome sequence of Streptococcus agalactiae, the leading cause of bacterial sepsis, pneumonia, and meningitis in neonates in the U.S. and Europe, is predicted to encode 2,175 genes. Genome comparisons among S. agalactiae, Streptococcus pneumoniae, Streptococcus pyogenes, and the oth

  15. Candida albicans genome sequence: a platform for genomics in the absence of genetics

    OpenAIRE

    Odds, Frank C.; Brown, Alistair JP; Gow, Neil AR

    2004-01-01

    Publication of the complete diploid genome sequence of the yeast Candida albicans will accelerate research into the pathogenesis of Candida infections. Comparative genomic analysis highlights genes that may contribute to C. albicans survival and its fitness as a human commensal and pathogen.

  16. Complete Genome Sequence of Streptomyces ambofaciens DSM 40697, a Paradigm for Genome Plasticity Studies

    Science.gov (United States)

    Thibessard, Annabelle

    2016-01-01

    The sequence of Streptomyces ambofaciens DSM 40697 was completely determined. The genome consists of an 8.1-Mbp linear chromosome with terminal inverted repeats of 210 kb. Genomic islands were identified, one of which corresponds to a new putative integrative and conjugative element (ICE) called pSAM3. PMID:27257195

  17. Complete genome sequence of Croceibacter atlanticus HTCC2559T.

    Science.gov (United States)

    Oh, Hyun-Myung; Kang, Ilnam; Ferriera, Steve; Giovannoni, Stephen J; Cho, Jang-Cheon

    2010-09-01

    Here we announce the complete genome sequence of Croceibacter atlanticus HTCC2559(T), which was isolated by high-throughput dilution-to-extinction culturing from the Bermuda Atlantic Time Series station in the Western Sargasso Sea. Strain HTCC2559(T) contained genes for carotenoid biosynthesis, flavonoid biosynthesis, and several macromolecule-degrading enzymes. The genome confirmed physiological observations of cultivated Croceibacter atlanticus strain HTCC2559(T), which identified it as an obligate chemoheterotroph. PMID:20639333

  18. The genome sequence of the filamentous fungus Neurospora crassa

    OpenAIRE

    Read, Nick D; et al.

    2003-01-01

    Neurospora crassa is a central organism in the history of twentieth-century genetics, biochemistry and molecular biology. Here, we report a high-quality draft sequence of the N. crassa genome. The approximately 40-megabase genome encodes about 10,000 protein-coding genes—more than twice as many as in the fission yeast Schizosaccharomyces pombe and only about 25% fewer than in the fruitfly Drosophila melanogaster. Analysis of the gene set yields insights into unexpected aspects of Neu...

  19. Complete Genome Sequence of a Novel Porcine Parvovirus in China

    OpenAIRE

    Dai, Xiao-Fang; Wang, Qiu-Ju; Jiang, Shi-Jin; Xie, Zhi-Jing

    2012-01-01

    The porcine parvovirus JT strain (PPV-JT) was isolated from a piglet showing nonsuppurative myocarditis in Shandong, China, in 2010. The complete genomic sequence of PPV-JT, 4,941 bp long, was determined from clones made from replicative form (RF) DNA. The genomic analysis demonstrated that PPV-JT might have been involved in a recombination event, which will help us understand the molecular characteristics and evolution of PPV in China.

  20. Complete Genome Sequence of the Endophytic Fungus Diaporthe (Phomopsis) ampelina.

    Science.gov (United States)

    Savitha, J; Bhargavi, S D; Praveen, V K

    2016-01-01

    Diaporthe ampelina was isolated as an endophytic fungus from the root of Commiphora wightii, a medicinal plant collected from Dhanvantri Vana, Bangalore University, Bangalore, India. The whole genome is 59 Mb, contains a total of 905 scaffolds, and has a G+C content of 51.74%. The genome sequence of D. ampelina shows a complete absence of lovastatin (an anticholesterol drug) gene cluster. PMID:27257198

  1. Whole Genome and Transcriptome Sequencing of a B3 Thymoma

    OpenAIRE

    Iacopo Petrini; Arun Rajan; Trung Pham; Donna Voeller; Sean Davis; James Gao; Yisong Wang; Giuseppe Giaccone

    2013-01-01

    Molecular pathology of thymomas is poorly understood. Genomic aberrations are frequently identified in tumors but no extensive sequencing has been reported in thymomas. Here we present the first comprehensive view of a B3 thymoma at whole genome and transcriptome levels. A 55-year-old Caucasian female underwent complete resection of a stage IVA B3 thymoma. RNA and DNA were extracted from a snap frozen tumor sample with a fraction of cancer cells over 80%. We performed array comparative genomi...

  2. Microsatellite evolution inferred from human–chimpanzee genomic sequence alignments

    OpenAIRE

    Webster, Matthew T.; Smith, Nick G.C.; Ellegren, Hans

    2002-01-01

    Most studies of microsatellite evolution utilize long, highly mutable loci, which are unrepresentative of the majority of simple repeats in the human genome. Here we use an unbiased sample of 2,467 microsatellite loci derived from alignments of 5.1 Mb of genomic sequence from human and chimpanzee to investigate the mutation process of tandemly repetitive DNA. The results indicate that the process of microsatellite evolution is highly heterogeneous, exhibiting differences between loci of diffe...

  3. Complete genome sequence of the European sheatfish virus

    OpenAIRE

    Mavian, Carla; López-Bueno, Alberto; Somalo, María Pilar Fernández; Alcamí, Antonio; Alejo, Alí

    2012-01-01

    Viral diseases are an increasing threat to the thriving aquaculture industry worldwide. An emerging group of fish pathogens is formed by several ranaviruses, which have been isolated at different locations from freshwater and seawater fish species since 1985. We report the complete genome sequence of European sheatfish ranavirus (ESV), the first ranavirus isolated in Europe, which causes high mortality rates in infected sheatfish (Silurus glanis) and in other species. Analysis of the genome se...

  4. Complete genome sequence of the European sheatfish virus

    OpenAIRE

    Mavian, Carla; López-Bueno, Alberto; Alcamí, Antonio; Alejo, Alí; Fernández Somalo, María Pilar

    2012-01-01

    Viral diseases are an increasing threat to the thriving aquaculture industry worldwide. An emerging group of fish pathogens is formed by several ranaviruses, which have been isolated at different locations from freshwater and seawater fish species since 1985. We report the complete genome sequence of European sheatfish ranavirus (ESV), the first ranavirus isolated in Europe, which causes high mortality rates in infected sheatfish (Silurus glanis) and in other species. Analysis of the genome seq...

  5. Arrangement of repetitive sequences in the genome of herpesvirus Sylvilagus.

    OpenAIRE

    Medveczky, M M; Geck, P; Clarke, C; Byrnes, J; Sullivan, J L; Medveczky, P G

    1989-01-01

    Herpesvirus sylvilagus is a lymphotropic (type gamma) herpesvirus of cottontail rabbits (Sylvilagus floridanus). Analysis of virion DNA of herpesvirus sylvilagus has revealed that the genome consists of one stretch of about 120 kilobase pairs of internal, unique DNA flanked by a variable number of 553-base-pair tandem repeats. The G + C content of the repetitive DNA is extremely high (83%), as determined by sequencing. The organization of the herpesvirus sylvilagus genome is, therefore, simil...

  6. Complete Genome Sequence of the Endophytic Fungus Diaporthe (Phomopsis) ampelina

    Science.gov (United States)

    Bhargavi, S. D.; Praveen, V. K.

    2016-01-01

    Diaporthe ampelina was isolated as an endophytic fungus from the root of Commiphora wightii, a medicinal plant collected from Dhanvantri Vana, Bangalore University, Bangalore, India. The whole genome is 59 Mb, contains a total of 905 scaffolds, and has a G+C content of 51.74%. The genome sequence of D. ampelina shows a complete absence of lovastatin (an anticholesterol drug) gene cluster. PMID:27257198

  7. Genome sequencing, annotation of Citrobacter freundii strain GTC 09479

    Directory of Open Access Journals (Sweden)

    Kazuyuki Kimura

    2014-12-01

    We report the 4.9-Mb genome sequence of Citrobacter freundii strain GTC 09479, isolated from a urine sample collected in 1983 at Gifu University Graduate School of Medicine, Japan. This draft genome consists of 4,899,578 bp with 51.62% G + C, 4,574 predicted CDSs, 72 tRNAs and 10 rRNAs.
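
    Several records in this listing report a G + C percentage alongside genome size. As a minimal sketch of how that summary statistic is computed, the function below counts G and C over all unambiguous bases. The function name and the toy sequence are hypothetical and are not taken from any of the cited genomes.

```python
def gc_content(sequence: str) -> float:
    """Return the G+C percentage of a DNA sequence.

    Ambiguous symbols (for example N) are excluded from the denominator.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * gc / (gc + at)


# Hypothetical toy contig; a real draft genome would be read from FASTA.
toy_contig = "ATGCGCGTTANNCCGGAT"
print(f"{gc_content(toy_contig):.2f}% G+C")
```

    For a multi-contig draft, the counts would be summed over all contigs before dividing, rather than averaging per-contig percentages.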

  8. Identification of photoperception and light signal transduction pathways in citrus

    OpenAIRE

    Vera Quecini

    2007-01-01

    Studies employing model species have elucidated several aspects of photoperception and light signal transduction that control plant development. However, the information available for economically important crops is scarce. Citrus genome databases of expressed sequence tags (EST) were investigated in order to identify genes coding for functionally characterized proteins responsible for light-regulated developmental control in model plants. Approximately 176,200 EST sequences from 53 libraries...

  9. cis-acting elements at opposite ends of the Citrus tristeza virus genome differ in initiation and termination of subgenomic RNAs

    International Nuclear Information System (INIS)

    Citrus tristeza virus (CTV), a member of the Closteroviridae with a plus-stranded genomic RNA of approximately 20 kb, produces 10 3'-coterminal subgenomic (sg) RNAs that serve as messenger (m)RNAs for its internal genes. In addition, a population of 5'-terminal sgRNAs of approximately 700 nts is highly abundant in infected cells. Previous analysis demonstrated that the controller elements (CEs) responsible for the 3'-terminal mRNAs and the small 5'-terminal sgRNAs differ in the number of additional sgRNAs produced. A feature of both types of CE is production of 5'- and 3'-terminal positive-stranded sgRNAs, but the 3' CEs additionally produce a negative-stranded complement of the 3'-terminal mRNAs. Here, we found that the termination (for 5'-terminal sgRNAs) and initiation (for 3'-terminal sgRNAs) sites of the 5' vs. the 3' CEs occur at opposite ends of the respective minimal active CEs. The initiation site for the 3' CE of the major coat protein gene, and probably those of the p20 and p23 genes, was outside (3' in terms of the genomic RNA) the minimal unit, whereas the termination sites were located within the minimal CE, 30-50 nts upstream of the initiation site (referring to the positive-strand sequence). In contrast, the initiation site for the 5' CE was in the 5' region of the minimal unit, with the termination sites 20-35 nts downstream (referring to the positive-strand sequence). Furthermore, the CEs differ in initiation nucleotide and response to mutagenesis of that nucleotide. The 3' CE initiates sgRNA synthesis from a uridylate, whereas the 5' CE initiates from a cytidylate. We previously found that the 3' CEs were unusually tolerant to mutagenesis of the initiation sites, with initiation proceeding from alternative sites. Mutagenesis of the initiation site of the 5' CE prevented synthesis of either the 5'- or 3'-terminal sgRNAs. Thus, the cis-acting elements at opposite ends of the genome are remarkably different, perhaps having arisen from different

  10. Standardized metadata for human pathogen/vector genomic sequences.

    Directory of Open Access Journals (Sweden)

    Vivien G Dugan

    High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium's minimal information (MIxS) and NCBI's BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will

  11. Sequence analysis reveals mosaic genome of Aichi virus

    Directory of Open Access Journals (Sweden)

    Han Xiaohong

    2011-08-01

    Aichi virus is a positive-sense, single-stranded RNA virus that has been shown to be associated with diarrhea in children. In the present study, phylogenetic and recombination analyses based on the Aichi virus complete genomes available in GenBank reveal a mosaic genome sequence [GenBank: FJ890523], of which the nt 261-852 region (the nt position was based on the aligned sequence file) shows a close relationship with AB010145/Japan with 97.9% sequence identity, while the other genomic regions show a close relationship with AY747174/German with 90.1% sequence identity. Our results will provide valuable hints for future research on Aichi virus diversity. Aichi virus is a member of the Kobuvirus genus of the Picornaviridae family [1, 2] and is a positive-sense, single-stranded RNA virus. Its presence in fecal specimens of children suffering from diarrhea has been demonstrated in several Asian countries [3-6], in Brazil and Germany [7], in France [8] and in Tunisia [9]. Some reports showed a high level of seroprevalence in adults [7, 10], suggesting widespread exposure to Aichi virus during childhood. The genome of Aichi virus contains 8,280 nucleotides and a poly(A) tail. The single large open reading frame (nt 713-8014 according to the strain AB010145) encodes a polyprotein of 2,432 amino acids that is cleaved into the typical picornavirus structural proteins VP0, VP3, VP1, and nonstructural proteins 2A, 2B, 2C, 3A, 3B, 3C and 3D [2, 11]. Based on the phylogenetic analysis of 519-bp sequences at the 3C-3D (3CD) junction, Aichi viruses can be divided into two genotypes, A and B, with approximately 90% sequence homology [12]. Although only six complete genomes of Aichi virus were deposited in GenBank at present, mosaic genomes can be found in strains from different countries.
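
    The mosaic pattern described above rests on comparing percent identity to different reference strains over different alignment regions. The sketch below shows that calculation for one region of a pairwise alignment, assuming 1-based inclusive alignment coordinates and that columns with a gap in either sequence are skipped. The function name and the toy sequences are hypothetical; the study's own software and gap handling are not stated in the abstract.

```python
def percent_identity(seq_a, seq_b, start=1, end=None):
    """Percent identity between two aligned sequences over columns start..end.

    Coordinates are 1-based and inclusive, in alignment space.
    Columns where either sequence has a gap ('-') are not compared.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    if end is None:
        end = len(seq_a)
    if not 1 <= start <= end <= len(seq_a):
        raise ValueError("region lies outside the alignment")
    matches = 0
    compared = 0
    region_a = seq_a[start - 1:end].upper()
    region_b = seq_b[start - 1:end].upper()
    for x, y in zip(region_a, region_b):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns in the region")
    return 100.0 * matches / compared


# Hypothetical toy alignment: the query matches the reference in columns 1-4 only.
query = "ACGTACGT"
reference = "ACGTTTTT"
print(percent_identity(query, reference, 1, 4))
print(percent_identity(query, reference, 5, 8))
```

    Running the same comparison against each candidate parent, region by region, is the simplest way to see where the closest relative switches along the genome.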

  12. Transcription of densovirus endogenous sequences in the Myzus persicae genome.

    Science.gov (United States)

    Clavijo, Gabriel; van Munster, Manuella; Monsion, Baptiste; Bochet, Nicole; Brault, Véronique

    2016-04-01

    Integration of non-retroviral sequences in the genome of different organisms has been observed and, in some cases, a relationship of these integrations with immunity has been established. The genome of the green peach aphid, Myzus persicae (clone G006), was screened for densovirus-like sequence (DLS) integrations. A total of 21 DLSs localized on 10 scaffolds were retrieved that mostly shared sequence identity with two aphid-infecting viruses, Myzus persicae densovirus (MpDNV) and Dysaphis plantaginea densovirus (DplDNV). In some cases, uninterrupted potential ORFs corresponding to non-structural viral proteins or capsid proteins were found within DLSs identified in the aphid genome. In particular, one scaffold harboured a complete virus-like genome, while another scaffold contained two virus-like genomes in reverse orientation. Remarkably, transcription of some of these ORFs was observed in M. persicae, suggesting a biological effect of these viral integrations. In contrast to most of the other densoviruses identified so far that induce acute host infection, it has been reported previously that MpDNV has only a minor effect on M. persicae fitness, while DplDNV can even have a beneficial effect on its aphid host. This suggests that DLS integration in the M. persicae genome may be responsible for the latency of MpDNV infection in the aphid host. PMID:26758080

  13. Low-pass sequencing for microbial comparative genomics

    Directory of Open Access Journals (Sweden)

    Kennedy Sean

    2004-01-01

    Background: We studied four extremely halophilic archaea by low-pass shotgun sequencing: (1) the metabolically versatile Haloarcula marismortui; (2) the non-pigmented Natrialba asiatica; (3) the psychrophile Halorubrum lacusprofundi and (4) the Dead Sea isolate Halobaculum gomorrense. Approximately one thousand single pass genomic sequences per genome were obtained. The data were analyzed by comparative genomic analyses using the completed Halobacterium sp. NRC-1 genome as a reference. Low-pass shotgun sequencing is a simple, inexpensive, and rapid approach that can readily be performed on any cultured microbe. Results: As expected, the four archaeal halophiles analyzed exhibit both bacterial and eukaryotic characteristics as well as uniquely archaeal traits. All five halophiles exhibit greater than sixty percent GC content and low isoelectric points (pI) for their predicted proteins. Multiple insertion sequence (IS) elements, often involved in genome rearrangements, were identified in H. lacusprofundi and H. marismortui. The core biological functions that govern cellular and genetic mechanisms of H. sp. NRC-1 appear to be conserved in these four other halophiles. Multiple TATA box binding protein (TBP) and transcription factor IIB (TFB) homologs were identified from most of the four shotgunned halophiles. The reconstructed molecular tree of all five halophiles shows a large divergence between these species, but with the closest relationship being between H. sp. NRC-1 and H. lacusprofundi. Conclusion: Despite the diverse habitats of these species, all five halophiles share (1) high GC content and (2) low protein isoelectric points, which are characteristics associated with environmental exposure to UV radiation and hypersalinity, respectively. Identification of multiple IS elements in the genomes of H. lacusprofundi and H. marismortui suggests that genome structure and dynamic genome reorganization might be similar to that previously observed in the

  14. The complete mitochondrial genome sequence of the budgerigar, Melopsittacus undulatus.

    Science.gov (United States)

    Guan, Xiaojing; Xu, Jun; Smith, Edward J

    2016-01-01

    Here, we describe the budgie's mitochondrial genome sequence, a resource that can facilitate this parrot's use as a model organism as well as for determining its phylogenetic relatedness to other parrots/Psittaciformes. The estimated total length of the sequence was 18,193 bp. In addition to the 13 protein and tRNA and rRNA coding regions, the sequence also includes a duplicated hypervariable region, a feature unique to only a few birds. The two hypervariable regions shared a sequence identity of about 86%. PMID:24660934

  15. Pittosporum cryptic virus 1: genome sequence completion using next-generation sequencing.

    Science.gov (United States)

    Elbeaino, Toufic; Kubaa, Raied Abou; Tuzlali, Hasan Tuna; Digiaro, Michele

    2016-07-01

    Next-generation sequencing (NGS) was applied to dsRNAs extracted from an Italian pittosporum plant infected with pittosporum cryptic virus 1 (PiCV1). NGS allowed assembly of the full genome sequence of PiCV1, comprising dsRNA1 (1.9 kbp) and dsRNA2 (1.5 kbp), which encode the RNA-dependent RNA polymerase and capsid protein genes, respectively. Phylogenetic and sequence analyses confirmed that PiCV1 is a new member of the genus Deltapartitivirus, family Partitiviridae. From the same plant, NGS also permitted assembly of the complete genome sequence of eggplant mottled dwarf virus (EMDV), which shared 86% to 98% nucleotide sequence identity with complete and partial sequences (ca. 6750 nt) of other known EMDV isolates with sequences available in the GenBank database. PMID:27087112

  16. Sequence Classification: 892357 [TMBETA-GENOME[Archive

    Lifescience Database Archive (English)

    Full Text Available Non-TMB Non-TMH Non-TMB Non-TMB Non-TMB Non-TMB >gi|6322971|ref|NP_013043.1| Fe(II)-dependent su ... dioxygenase, involved in sulfonate catabolism for use ... as a sulfur source, contains sequence that closely ...

  17. Genome Sequencing and Annotation of Mycobacterium tuberculosis PR08 strain

    Directory of Open Access Journals (Sweden)

    Mohammad Maaruf Jaafar

    2016-03-01

    Mycobacterium tuberculosis is an acid-fast bacterial species in the family Mycobacteriaceae and is the causative agent of most cases of tuberculosis. Here, we report the genomic features of Mycobacterium tuberculosis isolated from the cerebrospinal fluid (CSF) of a patient diagnosed with both pulmonary and extrapulmonary tuberculosis (TB). The isolated strain was identified as Mycobacterium tuberculosis PR08 (MTB PR08). Genomic DNA of the MTB PR08 strain was extracted and subjected to whole genome sequencing using MiSeq (Illumina, CA, USA). The draft genome size of the MTB PR08 strain is 4,292,364 bp with a G + C content of 65.2%. This strain was annotated to have 4723 genes and 48 RNAs. This whole genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number CP010895.

  18. Sequencing and analysis of the giant panda genome

    Institute of Scientific and Technical Information of China (English)

    YANG HuanMing

    2010-01-01

    The giant panda (Ailuropoda melanoleuca) is loved all over the world and is considered a symbol of China, as illustrated by its being one of the mascots for the Beijing 2008 Olympic Games. It is also one of the world's most endangered animals and a flagship species for conservation. Using next-generation sequencing technology (Illumina Genome Analyzer) and our in-house assembly software, we have generated the first map of the giant panda genome sequence. This map will provide an unparalleled amount of information to aid in understanding the genetic and biological nature of this unique species and will contribute significantly to disease control and conservation efforts for this endangered species. In March 2008, the giant panda genome sequencing and analysis project was started at the Beijing Genomics Institute (BGI) in Shenzhen with collaborators from the Kunming Institute of Zoology and the Chengdu Research Base of Giant Panda Breeding. On 21 Jan. 2010, this collaboration resulted in the publication, as a cover story in the journal Nature, of the sequencing and analysis of the giant panda genome.

  19. Genome sequence of the pea aphid Acyrthosiphon pisum

    DEFF Research Database (Denmark)

    Richards, S.; Gibbs, R. A.; Gerardo, N. M.;

    2010-01-01

    Aphids are important agricultural pests and also biological models for studies of insect-plant interactions, symbiosis, virus vectoring, and the developmental causes of extreme phenotypic plasticity. Here we present the 464 Mb draft genome assembly of the pea aphid Acyrthosiphon pisum. This first...... published whole genome sequence of a basal hemimetabolous insect provides an outgroup to the multiple published genomes of holometabolous insects. Pea aphids are host-plant specialists, they can reproduce both sexually and asexually, and they have coevolved with an obligate bacterial symbiont. Here we...

  20. The complete chloroplast genome sequence of Curcuma flaviflora (Curcuma).

    Science.gov (United States)

    Zhang, Yan; Deng, Jiabin; Li, Yangyi; Gao, Gang; Ding, Chunbang; Zhang, Li; Zhou, Yonghong; Yang, Ruiwu

    2016-09-01

    The complete chloroplast (cp) genome of Curcuma flaviflora, a medicinal plant in Southeast Asia, was sequenced. The genome was 160 478 bp in length, with 36.3% GC content. A pair of inverted repeats (IRs) of 26 946 bp each was separated by a large single copy (LSC) region of 88 008 bp and a small single copy (SSC) region of 18 578 bp. The cp genome contained 132 annotated genes, including 79 protein coding genes, 30 tRNA genes, and four rRNA genes. Nineteen of these genes were duplicated in the inverted repeat regions. PMID:26367332
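
    The region sizes quoted above can be cross-checked against the reported total, since a quadripartite plastid genome consists of one LSC, one SSC and two IR copies. The short sketch below does that arithmetic with the figures from this abstract and from the Undaria pinnatifida plastid record later in this listing. The function name is hypothetical.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total plastid genome length: LSC + SSC + two inverted-repeat copies."""
    return lsc + ssc + 2 * ir


# Curcuma flaviflora figures from the abstract above.
curcuma_total = quadripartite_length(lsc=88_008, ssc=18_578, ir=26_946)
print(curcuma_total, curcuma_total == 160_478)

# Undaria pinnatifida figures from the plastid record later in this listing.
undaria_total = quadripartite_length(lsc=76_598, ssc=42_977, ir=5_404)
print(undaria_total, undaria_total == 130_383)
```

    Both reported totals agree with the sum of their parts, which is a quick sanity check worth running on any newly annotated plastid genome.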

  1. The complete mitochondrial genome sequence of Emperor Penguins (Aptenodytes forsteri).

    Science.gov (United States)

    Xu, Qiwu; Xia, Yan; Dang, Xiao; Chen, Xiaoli

    2016-09-01

    The emperor penguin (Aptenodytes forsteri) is the largest living species of penguin. Here we report, for the first time, the complete mitochondrial genome of the emperor penguin. The mitochondrial genome is a circular molecule of 17 301 bp in length, consisting of 13 protein-coding genes, 22 tRNA genes, two rRNA genes, and one control region. To verify the accuracy and the utility of the newly determined mitogenome sequence, we constructed a species phylogenetic tree of the emperor penguin together with 10 other closely related species. This is the second complete mitochondrial genome of a penguin, and it will be an important resource for studying the mitochondrial evolution of birds. PMID:26403091

  2. Draft genome sequence of the rubber tree Hevea brasiliensis

    OpenAIRE

    Rahman Ahmad Yamin Abdul; Usharraj Abhilash O; Misra Biswapriya B; Thottathil Gincy P; Jayasekaran Kandakumar; Feng Yun; Hou Shaobin; Ong Su Yean; Ng Fui Ling; Lee Ling Sze; Tan Hock Siew; Sakaff Muhd Khairul Luqman Muhd; Teh Beng Soon; Khoo Bee; Badai Siti Suriawati

    2013-01-01

    Abstract Background Hevea brasiliensis, a member of the Euphorbiaceae family, is the major commercial source of natural rubber (NR). NR is a latex polymer with high elasticity, flexibility, and resilience that has played a critical role in the world economy since 1876. Results Here, we report the draft genome sequence of H. brasiliensis. The assembly spans ~1.1 Gb of the estimated 2.15 Gb haploid genome. Overall, ~78% of the genome was identified as repetitive DNA. Gene prediction shows 68,95...

  3. Establishing a framework for comparative analysis of genome sequences

    Energy Technology Data Exchange (ETDEWEB)

    Bansal, A.K.

    1995-06-01

    This paper describes a framework and a high-level language toolkit for comparative analysis of genome sequence alignment. The framework integrates the information derived from multiple sequence alignment and phylogenetic tree (hypothetical tree of evolution) to derive new properties about sequences. Multiple sequence alignments are treated as an abstract data type. Abstract operations have been described to manipulate a multiple sequence alignment and to derive mutation related information from a phylogenetic tree by superimposing parsimonious analysis. The framework has been applied on protein alignments to derive constrained columns (in a multiple sequence alignment) that exhibit evolutionary pressure to preserve a common property in a column despite mutation. A Prolog toolkit based on the framework has been implemented and demonstrated on alignments containing 3000 sequences and 3904 columns.
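
    To illustrate what a "constrained column" in a multiple sequence alignment might look like, the sketch below flags columns in which a single residue reaches a chosen frequency. This is a deliberately simpler stand-in: the paper's toolkit is written in Prolog and combines the alignment with a phylogenetic tree and parsimony analysis, none of which is reproduced here. The function name, the threshold parameter and the toy alignment are all hypothetical.

```python
from collections import Counter


def conserved_columns(alignment, min_fraction=1.0):
    """Return (column index, residue) pairs for conserved alignment columns.

    A column counts as conserved when its most common non-gap residue occurs
    in at least min_fraction of all rows. Column indices are 0-based.
    """
    if not alignment:
        return []
    width = len(alignment[0])
    if any(len(row) != width for row in alignment):
        raise ValueError("all aligned rows must have the same length")
    hits = []
    for col in range(width):
        counts = Counter(row[col] for row in alignment if row[col] != "-")
        if not counts:
            continue  # column made up entirely of gaps
        residue, n = counts.most_common(1)[0]
        if n / len(alignment) >= min_fraction:
            hits.append((col, residue))
    return hits


# Hypothetical toy protein alignment (4 sequences, 5 columns).
toy_alignment = ["MKTAY", "MRTAY", "MKSAY", "MK-AY"]
print(conserved_columns(toy_alignment))
print(conserved_columns(toy_alignment, min_fraction=0.75))
```

    A tree-aware method such as the one the paper describes could additionally distinguish columns conserved by common descent from columns preserved despite independent mutations, which a plain frequency threshold cannot do.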

  4. Complete Plastid Genome Sequence of the Brown Alga Undaria pinnatifida.

    Directory of Open Access Journals (Sweden)

    Lei Zhang

    In this study, we fully sequenced the circular plastid genome of a brown alga, Undaria pinnatifida. The genome is 130,383 base pairs (bp) in size; it contains a large single-copy region (LSC, 76,598 bp) and a small single-copy region (SSC, 42,977 bp), separated by two inverted repeats (IRa and IRb: 5,404 bp). The genome contains 139 protein-coding, 28 tRNA, and 6 rRNA genes; none of these genes contains introns. Organization and gene contents of the U. pinnatifida plastid genome were similar to those of Saccharina japonica. There is a co-linear relationship between the plastid genome of U. pinnatifida and that of three previously sequenced large brown algal species. Phylogenetic analyses of 43 taxa based on 23 plastid protein-coding genes grouped all plastids into a red or green lineage. In the large brown algae branch, U. pinnatifida and S. japonica formed a sister clade with much closer relationship to Ectocarpus siliculosus than to Fucus vesiculosus. For the first time, the start codon ATT was identified in the plastid genome of large brown algae, in the atpA gene of U. pinnatifida. In addition, we found a gene-length change induced by a 3-bp repetitive DNA in ycf35 and ilvB genes of the U. pinnatifida plastid genome.

  5. Sequence modelling and an extensible data model for genomic database

    Energy Technology Data Exchange (ETDEWEB)

    Li, Peter Wei-Der [California Univ., San Francisco, CA (United States)]|[Lawrence Berkeley Lab., CA (United States)]

    1992-01-01

    The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of this information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMSs do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data model that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the "Extensible Object Model", to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implemented the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.

  6. Sequence modelling and an extensible data model for genomic database

    Energy Technology Data Exchange (ETDEWEB)

    Li, Peter Wei-Der (California Univ., San Francisco, CA (United States) Lawrence Berkeley Lab., CA (United States))

    1992-01-01

    The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of this information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMSs do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data model that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the "Extensible Object Model", to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implemented the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.

  7. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

    Directory of Open Access Journals (Sweden)

    Weitemier Kevin

    2011-05-01

    Background: Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and complement these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results: A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. Conclusions: The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species.

  8. Mitochondrial DNA sequences in the nuclear genome of a locust.

    Science.gov (United States)

    Gellissen, G; Bradfield, J Y; White, B N; Wyatt, G R

    The endosymbiotic theory of the origin of mitochondria is widely accepted, and implies that loss of genes from the mitochondria to the nucleus of eukaryotic cells has occurred over evolutionary time. However, evidence at the DNA sequence level for gene transfer between these organelles has so far been limited to a single example, the demonstration that a mitochondrial ATPase subunit gene of Neurospora crassa has an homologous partner in the nuclear genome. From a gene library of the insect, Locusta migratoria, we have now isolated two clones, representing separate fragments of nuclear DNA, which contain sequences homologous to the mitochondrial genes for ribosomal RNA, as well as regions of homology with highly repeated nuclear sequences. The results suggest the transfer of sequences between mitochondrial and nuclear genomes, followed by evolutionary divergence. PMID:6298629

  9. Complete genome sequence of Allochromatium vinosum DSM 180T

    Energy Technology Data Exchange (ETDEWEB)

    Weissgerber, Thomas [Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany; Zigann, Renate [Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany; Bruce, David [Los Alamos National Laboratory (LANL); Chang, Yun-Juan [ORNL; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Han, Cliff [Los Alamos National Laboratory (LANL); Hauser, Loren John [ORNL; Jeffries, Cynthia [Oak Ridge National Laboratory (ORNL); Land, Miriam L [ORNL; Munk, Christine [U.S. Department of Energy, Joint Genome Institute; Tapia, Roxanne [Los Alamos National Laboratory (LANL); Dahl, Christiane [Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany

    2011-01-01

    Allochromatium vinosum, formerly Chromatium vinosum, is a mesophilic purple sulfur bacterium belonging to the family Chromatiaceae in the bacterial class Gammaproteobacteria. The genus Allochromatium currently contains five species. All members were isolated from freshwater, brackish water or marine habitats and are predominantly obligate phototrophs. Here we describe the features of the organism, together with the complete genome sequence and annotation. This is the first completed genome sequence of a member of the Chromatiaceae within the purple sulfur bacteria thriving in globally occurring habitats. The 3,669,074 bp genome with its 3,302 protein-coding and 64 RNA genes was sequenced within the Joint Genome Institute Community Sequencing Program.

  10. Complete genome sequence of Thauera aminoaromatica strain MZ1T

    Energy Technology Data Exchange (ETDEWEB)

    Sanseverino, John [ORNL; Chauhan, Archana [University of Tennessee, Knoxville (UTK); Lucas, Susan [U.S. Department of Energy, Joint Genome Institute; Copeland, A [U.S. Department of Energy, Joint Genome Institute; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute; Glavina Del Rio, Tijana [U.S. Department of Energy, Joint Genome Institute; Dalin, Eileen [U.S. Department of Energy, Joint Genome Institute; Tice, Hope [U.S. Department of Energy, Joint Genome Institute; Bruce, David [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute; Sims, David [Los Alamos National Laboratory (LANL); Brettin, Thomas S [ORNL; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute; Han, Cliff [Los Alamos National Laboratory (LANL); Chang, Yun-Juan [ORNL; Larimer, Frank W [ORNL; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute; Moser, Scott [University of Tennessee, Knoxville (UTK); Jegier, Patricia [University of Tennessee, Knoxville (UTK); Close, Dan [University of Tennessee, Knoxville (UTK); Wang, Ying [University of Tennessee, Knoxville (UTK); Layton, Alice [University of Tennessee, Knoxville (UTK); Allen, Michael S. [University of Tennessee, Knoxville (UTK); Sayler, Gary [University of Tennessee, Knoxville (UTK)

    2012-01-01

    Thauera aminoaromatica strain MZ1T, an isolate belonging to the genus Thauera, of the family Rhodocyclaceae and the class Betaproteobacteria, has been characterized for its ability to produce abundant exopolysaccharide and degrade various aromatic compounds with nitrate as an electron acceptor. These properties, if fully understood at the genome-sequence level, can aid in environmental processing of organic matter in anaerobic cycles by short-circuiting a central anaerobic metabolite, acetate, from microbiological conversion to methane, a critical greenhouse gas. Strain MZ1T is the first strain from the genus Thauera with a completely sequenced genome. The 4,496,212 bp chromosome and 78,374 bp plasmid contain 4,071 protein-coding and 71 RNA genes, and were sequenced as part of the DOE Community Sequencing Program CSP_776774.

  11. Complete chloroplast genome sequence of Fritillaria unibracteata var. wabuensis based on SMRT Sequencing Technology.

    Science.gov (United States)

    Li, Ying; Li, Qiushi; Li, Xiwen; Song, Jingyuan; Sun, Chao

    2016-09-01

    Fritillaria unibracteata var. wabuensis is an important medicinal plant used for the treatment of cough symptoms related to the respiratory system. The chloroplast genome of F. unibracteata var. wabuensis (GenBank accession no. KF769142) was assembled using the PacBio RS platform (Pacific Biosciences, Beverly, MA) as a circular sequence of 151,009 bp. The assembled genome contains 133 genes, including 88 protein-coding, 37 tRNA, and eight rRNA genes. This genome sequence will provide an important resource for further studies on the evolution of the genus Fritillaria and the molecular identification of Fritillaria herbs and their adulterants. This work suggests that PacBio RS is a powerful tool to sequence and assemble chloroplast genomes. PMID:26370383

  12. Cloning and analysis of 5' A and F variation sequences of Citrus tristeza virus genomic RNA of isolates from different regions

    Institute of Scientific and Technical Information of China (English)

    丁芳; 洪霓; 钟云; 易干军; 王国平

    2008-01-01

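    Citrus tristeza virus (CTV) populations show strain differentiation under natural conditions. In this study, RT-PCR was used to amplify and clone the 5'-terminal A and F variable regions of 21 CTV isolates from different regions of China. Analysis showed considerable variation in the 5' A and F regions among isolates of different origins. Among the 21 isolates, sequence similarity in the A region ranged from 85.8% to 99.8%, with a mean of 95.9%; the mean similarity to nine representative strains in GenBank was 84.2%. Sequence similarity in the F region was higher than in the A region, with a mean of 98.0% and a range of 94.3% to 99.1%. The results show that the A and F regions of the 5' sequence vary considerably among CTV isolates of different origins.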
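
    The minimum, maximum and mean similarity figures reported above can be illustrated with a short calculation. The abstract does not say how similarity was computed, so this is only a minimal sketch that assumes plain pairwise percent identity over pre-aligned sequences; the function names and the three toy sequences are hypothetical and are not CTV data.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent of compared alignment columns with identical residues.

    Assumes the two sequences are already aligned to equal length;
    columns with a gap ('-') in either sequence are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def similarity_summary(aligned: dict) -> tuple:
    """Return (lowest, highest, mean) pairwise identity over all isolate pairs."""
    values = [
        percent_identity(aligned[p], aligned[q])
        for p, q in combinations(sorted(aligned), 2)
    ]
    return min(values), max(values), sum(values) / len(values)


# Hypothetical toy alignment, for illustration only.
toy = {
    "iso1": "ACGTACGTAC",
    "iso2": "ACGTACGTAT",
    "iso3": "ACGAACGTAT",
}
low, high, mean = similarity_summary(toy)
print(f"lowest {low:.1f}%, highest {high:.1f}%, mean {mean:.1f}%")
```

    A real analysis would first produce a multiple sequence alignment with a dedicated tool; the sketch only covers the summary step.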

  13. Complete Genome Sequence of Streptococcus agalactiae CNCTC 10/84, a Hypervirulent Sequence Type 26 Strain

    OpenAIRE

    Hooven, Thomas A.; Randis, Tara M.; Sean C Daugherty; Narechania, Apurva; Planet, Paul J.; Tettelin, Hervé; Ratner, Adam J.

    2014-01-01

    Streptococcus agalactiae (group B Streptococcus [GBS]) is a human pathogen with a propensity to cause neonatal infections. We report the complete genome sequence of GBS strain CNCTC 10/84, a hypervirulent clinical isolate frequently used to study GBS pathogenesis. Comparative analysis of this sequence may shed light on novel pathogenic mechanisms.

  14. Genome sequencing highlights the dynamic early history of dogs.

    OpenAIRE

    Freedman, Adam H.; Ilan Gronau; Schweizer, Rena M.; Diego Ortega-Del Vecchyo; Eunjung Han; Silva, Pedro M.; Marco Galaverni; Zhenxin Fan; Peter Marx; Belen Lorente-Galdos; Holly Beale; Oscar Ramirez; Farhad Hormozdiari; Can Alkan; Carles Vilà

    2014-01-01

    To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we generated high-quality genome sequences from three gray wolves, one from each of the three putative centers of dog domestication, two basal dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. Analysis of these sequences supports a demographic model in which dogs and wolves diverged through a dynamic process involving population bottlenecks in both lineages and post-di...

  15. Genome Sequencing Highlights the Dynamic Early History of Dogs

    OpenAIRE

    Freedman, A.H.; Gronau, I.; Schweizer, R.M.; Han, E; Silva, P.M.; Galaverni, M.; Fan, Z; Marx, P; Lorente-Galdos, B.; Beale, H.; Ramirez, O.; Hormozdiari, Fereydoun; Alkan, Can; Vilà, Carles; Geffen, E

    2014-01-01

    To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we generated high-quality genome sequences from three gray wolves, one from each of the three putative centers of dog domestication, two basal dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. Analysis of these sequences supports a demographic model in which dogs and wolves diverged through a dynamic process involving population bottlenecks in both lineages and post-di...

  16. Sequencing Crop Genomes: A Gateway to Improve Tropical Agriculture

    OpenAIRE

    Thottathil, Gincy Paily; Jayasekaran, Kandakumar; Othman, Ahmad Sofiman

    2016-01-01

    Agricultural development in the tropics lags behind development in the temperate latitudes due to the lack of advanced technology, and various biotic and abiotic factors. To cope with the increasing demand for food and other plant-based products, improved crop varieties have to be developed. To breed improved varieties, a better understanding of crop genetics is necessary. With the advent of next-generation DNA sequencing technologies, many important crop genomes have been sequenced. Primary ...

  17. The impact of next-generation sequencing on genomics

    OpenAIRE

    Zhang, Jun; Chiodini, Rod; Badr, Ahmed; Zhang, Genfa

    2011-01-01

    This article reviews basic concepts, general applications, and the potential impact of next-generation sequencing (NGS) technologies on genomics, with particular reference to currently available and possible future platforms and bioinformatics. NGS technologies have demonstrated the capacity to sequence DNA at unprecedented speed, thereby enabling previously unimaginable scientific achievements and novel biological applications. But, the massive data produced by NGS also presents a significan...

  18. Analysis of Chimpanzee History Based on Genome Sequence Alignments

    OpenAIRE

    Caswell, Jennifer L.; Richter, Daniel J.; Neubauer, Julie; Schirmer, Christine; Gnerre, Sante; Mallick, Swapan; Reich, David Emil

    2008-01-01

    Population geneticists often study small numbers of carefully chosen loci, but it has become possible to obtain orders of magnitude for more data from overlaps of genome sequences. Here, we generate tens of millions of base pairs of multiple sequence alignments from combinations of three western chimpanzees, three central chimpanzees, an eastern chimpanzee, a bonobo, a human, an orangutan, and a macaque. Analysis provides a more precise understanding of demographic history than was previously...

  19. Complete mitochondrial genome sequence of Romanogobio tenuicorpus (Amur whitefin gudgeon).

    Science.gov (United States)

    Dong, Fang; Tong, Guang-Xiang; Kuang, You-Yi; Sun, Xiao-Wen

    2015-01-01

    Amur whitefin gudgeon (Romanogobio tenuicorpus) belongs to the family Cyprinidae and is a freshwater aquaculture species in China. In this report, we determined the complete mitochondrial genome sequence of Romanogobio tenuicorpus, which is a 16,600 bp circular molecule with 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and a control region; the conserved sequence blocks CSB1, CSB2 and CSB3 were also detected. PMID:24409923

  20. Molecular evolution of herpesviruses: genomic and protein sequence comparisons.

    OpenAIRE

    Karlin, S; Mocarski, E S; Schachtel, G A

    1994-01-01

    Phylogenetic reconstruction of herpesvirus evolution is generally founded on amino acid sequence comparisons of specific proteins. These are relevant to the evolution of the specific gene (or set of genes), but the resulting phylogeny may vary depending on the particular sequence chosen for analysis (or comparison). In the first part of this report, we compare 13 herpesvirus genomes by using a new multidimensional methodology based on distance measures and partial orderings of dinucleotide re...

  1. Mitochondrial genome sequences and comparative genomics of Phytophthora ramorum and P. sojae

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Frank N.; Douda, Bensasson; Tyler, Brett M.; Boore, Jeffrey L.

    2007-01-01

    The complete sequences of the mitochondrial genomes of the oomycetes Phytophthora ramorum and P. sojae were determined during the course of their complete nuclear genome sequencing (Tyler, et al. 2006). Both are circular, with sizes of 39,314 bp for P. ramorum and 42,975 bp for P. sojae. Each contains a total of 37 identifiable protein-encoding genes, 25 or 26 tRNAs (P. sojae and P. ramorum, respectively) specifying 19 amino acids, and a variable number of ORFs (7 for P. ramorum and 12 for P. sojae) which are potentially additional functional genes. Non-coding regions comprise approximately 11.5 percent and 18.4 percent of the genomes of P. ramorum and P. sojae, respectively. Relative to P. sojae, there is an inverted repeat of 1,150 bp in P. ramorum that includes an unassigned unique ORF, a tRNA gene, and adjacent non-coding sequences, but otherwise the gene order in both species is identical. Comparisons of these genomes with published sequences of the P. infestans mitochondrial genome reveal a number of similarities, but the gene order in P. infestans differs in two adjacent locations due to inversions. Sequence alignments of the three genomes indicated sequence conservation ranging from 75 to 85 percent and that specific regions were more variable than others.

  2. Targeted or whole genome sequencing of formalin fixed tissue samples: potential applications in cancer genomics.

    Science.gov (United States)

    Munchel, Sarah; Hoang, Yen; Zhao, Yue; Cottrell, Joseph; Klotzle, Brandy; Godwin, Andrew K; Koestler, Devin; Beyerlein, Peter; Fan, Jian-Bing; Bibikova, Marina; Chien, Jeremy

    2015-09-22

    Current genomic studies are limited by the poor availability of fresh-frozen tissue samples. Although formalin-fixed diagnostic samples are abundant, they are seldom used in current genomic studies because of concern about formalin-fixation artifacts. Better characterization of these artifacts will allow the use of archived clinical specimens in translational and clinical research studies. To provide a systematic analysis of formalin-fixation artifacts on Illumina sequencing, we generated 26 DNA sequencing data sets from 13 pairs of matched formalin-fixed paraffin-embedded (FFPE) and fresh-frozen (FF) tissue samples. The results indicate a high rate of concordant calls between matched FF/FFPE pairs at reference and variant positions in three commonly used sequencing approaches (whole genome, whole exome, and targeted exon sequencing). Global mismatch rates and C · G > T · A substitutions were comparable between matched FF/FFPE samples, and discordant rates were low (<0.26%) in all samples. Finally, low-pass whole genome sequencing produces a similar pattern of copy number alterations between FF/FFPE pairs. The results from our studies suggest the potential use of diagnostic FFPE samples for cancer genomic studies to characterize and catalog variations in cancer genomes. PMID:26305677

  3. Identification of cancer-driver genes in focal genomic alterations from whole genome sequencing data.

    Science.gov (United States)

    Jang, Ho; Hur, Youngmi; Lee, Hyunju

    2016-01-01

    DNA copy number alterations (CNAs) are the main genomic events that occur during the initiation and development of cancer. Distinguishing driver aberrant regions from passenger regions, which might contain candidate target genes for cancer therapies, is an important issue. Several methods for identifying cancer-driver genes from multiple cancer patients have been developed for single nucleotide polymorphism (SNP) arrays. However, for NGS data, methods for the SNP array cannot be directly applied because of different characteristics of NGS such as higher resolutions of data without predefined probes and incorrectly mapped reads to reference genomes. In this study, we developed a wavelet-based method for identification of focal genomic alterations for sequencing data (WIFA-Seq). We applied WIFA-Seq to whole genome sequencing data from glioblastoma multiforme, ovarian serous cystadenocarcinoma and lung adenocarcinoma, and identified focal genomic alterations, which contain candidate cancer-related genes as well as previously known cancer-driver genes. PMID:27156852

  5. Citrus tristeza virus-based RNA-interference (RNAi) vector and its potential in combating citrus Huanglongbing (HLB)

    OpenAIRE

    Hajeri, Shubash; El-Mohtar, Choaa; Dawson, William O.; Gowda, Siddarame

    2014-01-01

    Citrus tristeza virus (CTV), a plus-sense ssRNA virus, is a member of the genus Closterovirus, family Closteroviridae. RNA viruses are inducers as well as targets of the gene-silencing defense mechanism of host plants, and this has been exploited as a tool in functional genomics. CTV was developed into a virus-induced gene silencing (VIGS) or RNA-interference (RNAi) vector, which interferes with expression of endogenous genes in citrus or a GFP transgene in Nicotiana benthamiana (16c) in a sequence spec...

  6. Sequence Determination from Overlapping Fragments: A Simple Model of Whole-Genome Shotgun Sequencing

    Science.gov (United States)

    Derrida, Bernard; Fink, Thomas M.

    2002-02-01

    Assembling fragments randomly sampled from along a sequence is the basis of whole-genome shotgun sequencing, a technique used to map the DNA of the human and other genomes. We calculate the probability that a random sequence can be recovered from a collection of overlapping fragments. We provide an exact solution for an infinite alphabet and in the case of constant overlaps. For the general problem we apply two assembly strategies and give the probability that the assembly puzzle can be solved in the limit of infinitely many fragments.
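
    The recovery probability discussed above can also be approximated numerically. The following is a minimal Monte Carlo sketch, not the exact solution of the paper. It assumes a circular sequence (to avoid end effects), fragments of one fixed length with uniformly random start positions, and that any overlap of at least `min_overlap` positions is unambiguous, which is loosely analogous to the infinite-alphabet case. All names and parameter values are hypothetical.

```python
import random


def covers_circle(starts, genome_len, frag_len, min_overlap):
    """True if fragments tile the circle with sufficient consecutive overlaps.

    A fragment starting at s covers [s, s + frag_len). Two neighbours
    overlap by at least min_overlap when their start positions differ
    by at most frag_len - min_overlap.
    """
    ordered = sorted(starts)
    max_step = frag_len - min_overlap
    # Steps between neighbouring start positions, plus the wrap-around step.
    steps = [b - a for a, b in zip(ordered, ordered[1:])]
    steps.append(ordered[0] + genome_len - ordered[-1])
    return max(steps) <= max_step


def estimate_success(genome_len, frag_len, n_frags, min_overlap,
                     trials=1000, seed=0):
    """Fraction of random trials in which the fragments can be chained."""
    rng = random.Random(seed)
    wins = 0
    for _ in range(trials):
        starts = [rng.randrange(genome_len) for _ in range(n_frags)]
        if covers_circle(starts, genome_len, frag_len, min_overlap):
            wins += 1
    return wins / trials


if __name__ == "__main__":
    # Hypothetical parameters: the estimate rises as fragments are added.
    for n in (10, 20, 40, 80):
        print(n, estimate_success(1000, 100, n, 20))
```

    With a finite alphabet, repeated subsequences can make overlaps ambiguous, so a simulation of that case would need to compare the fragment contents themselves rather than only their positions.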

  7. A Pan-HIV Strategy for Complete Genome Sequencing.

    Science.gov (United States)

    Berg, Michael G; Yamaguchi, Julie; Alessandri-Gradt, Elodie; Tell, Robert W; Plantier, Jean-Christophe; Brennan, Catherine A

    2016-04-01

    Molecular surveillance is essential to monitor HIV diversity and track emerging strains. We have developed a universal library preparation method (HIV-SMART [i.e., switching mechanism at 5' end of RNA transcript]) for next-generation sequencing that harnesses the specificity of HIV-directed priming to enable full genome characterization of all HIV-1 groups (M, N, O, and P) and HIV-2. Broad application of the HIV-SMART approach was demonstrated using a panel of diverse cell-cultured virus isolates. HIV-1 non-subtype B-infected clinical specimens from Cameroon were then used to optimize the protocol to sequence directly from plasma. When multiplexing 8 or more libraries per MiSeq run, full genome coverage at a median ∼2,000× depth was routinely obtained for either sample type. The method reproducibly generated the same consensus sequence, consistently identified viral sequence heterogeneity present in specimens, and at viral loads of ≤4.5 log copies/ml yielded sufficient coverage to permit strain classification. HIV-SMART provides an unparalleled opportunity to identify diverse HIV strains in patient specimens and to determine phylogenetic classification based on the entire viral genome. Easily adapted to sequence any RNA virus, this technology illustrates the utility of next-generation sequencing (NGS) for viral characterization and surveillance. PMID:26699702

  8. A shot in the genome: how accurately do shotgun 454 sequences represent a genome?

    Directory of Open Access Journals (Sweden)

    Meglécz Emese

    2012-05-01

    Background: Next generation sequencing (NGS) provides a valuable method to quickly obtain sequence information from non-model organisms at a genomic scale. In principle, if sequencing is not targeted for a genomic region or sequence type (e.g. coding region, microsatellites), NGS reads can be used as a genome snapshot and provide information on the different types of sequences in the genome. However, no study has ascertained whether a typical 454 dataset of low coverage (1/4-1/8 of a PicoTiter plate, leading to generally less than 0.1x of coverage) represents all parts of genomes equally. Findings: Partial genome shotgun sequencing of total DNA (without enrichment) on a 454 NGS platform was used to obtain reads of Apis mellifera (454 reads hereafter). These 454 reads were compared to the assembled chromosomes of this species in three different aspects: (i) dimer and trimer compositions, (ii) the distribution of mapped 454 sequences along the chromosomes and (iii) the numbers of different classes of microsatellites. Highly significant chi-square tests for all three types of analyses indicated that the 454 data are not a perfect random sample of the genome. Only the number of 454 reads mapped to each of the 16 chromosomes and the number of microsatellites pooled by motif (repeat unit) length were not significantly different from the expected values. However, a very strong correlation (correlation coefficients greater than 0.97) was observed between most of the 454 variables (the number of different dimers and trimers, the number of 454 reads mapped to each chromosome fragment of one Mb, the number of 454 reads mapped to each chromosome, the number of microsatellites of each class) and their corresponding genomic variables. Conclusions: The results of chi-square tests suggest that 454 shotgun reads cannot be regarded as a perfect representation of the genome, especially if the comparison is done on a finer scale (e.g. chromosome fragments instead of whole
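
    The dimer and trimer comparison described above can be sketched in a few lines. The abstract does not give the exact test procedure, so this is only an illustration under stated assumptions: k-mers are counted on one strand with overlapping windows, the expected read counts are taken from the reference k-mer frequencies, and a Pearson chi-square statistic is computed without a p-value. The function names and toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product


def kmer_counts(seqs, k):
    """Count overlapping k-mers (A/C/G/T only) across a list of sequences."""
    counts = Counter()
    for seq in seqs:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if set(kmer) <= set("ACGT"):  # skip windows containing N etc.
                counts[kmer] += 1
    return counts


def chi_square_vs_reference(sample, reference, k):
    """Pearson chi-square statistic of sample k-mer counts against
    expectations derived from the reference k-mer frequencies.

    k-mers absent from the reference (expected count 0) are skipped.
    """
    n_sample = sum(sample.values())
    n_ref = sum(reference.values())
    stat = 0.0
    for letters in product("ACGT", repeat=k):
        kmer = "".join(letters)
        expected = n_sample * reference[kmer] / n_ref
        if expected > 0:
            stat += (sample[kmer] - expected) ** 2 / expected
    return stat


# Hypothetical toy data: a short "genome" and two "reads".
genome = kmer_counts(["ACGTACGTACGT"], 2)
reads = kmer_counts(["ACGTAC", "GTACGT"], 2)
stat = chi_square_vs_reference(reads, genome, 2)
print(f"chi-square statistic for dimers: {stat:.3f}")
```

    Overlapping k-mers are not independent observations, so a statistic computed this way is best read as a descriptive measure of deviation rather than a strict test.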

  9. Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus Saccharum.

    Science.gov (United States)

    Grativol, Clícia; Regulski, Michael; Bertalan, Marcelo; McCombie, W Richard; da Silva, Felipe Rodrigues; Zerlotini Neto, Adhemar; Vicentini, Renato; Farinelli, Laurent; Hemerly, Adriana Silva; Martienssen, Robert A; Ferreira, Paulo Cavalcanti Gomes

    2014-07-01

    Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun (WGS). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene-rich regions. Gene-enrichment strategies based on methylation profiles offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration with McrBC endonuclease digestion to enrich for euchromatic regions in the sugarcane genome. To verify the efficiency of methylation filtration and the assembly quality of sequences submitted to the gene-enrichment strategy, we have compared assemblies using methyl-filtered (MF) and unfiltered (UF) libraries. The use of methyl filtration allowed a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5× more scaffolds and 1.7× more assembled Mb in length compared with the unfiltered dataset. The coverage of sorghum coding sequences (CDS) by MF scaffolds was at least 36% higher than by the use of UF scaffolds. Using MF technology, we increased by 134× the coverage of gene regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds that covered all genes of the sugarcane bacterial artificial chromosomes (BACs), 97.2% of sugarcane expressed sequence tags (ESTs), 92.7% of sugarcane RNA-seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds from encoded enzymes of the sucrose/starch pathway discovered 291 single-nucleotide polymorphisms (SNPs) in the wild sugarcane species, S. spontaneum and S. officinarum. A large number of microRNA genes was also identified in the MF scaffolds. The information obtained from the MF dataset provides a valuable tool for genomic research in the genus Saccharum and for improvement of sugarcane as a biofuel crop. PMID:24773339

  10. The complete plastid genome sequence of Abies koreana (Pinaceae: Abietoideae).

    Science.gov (United States)

    Yi, Dong-Keun; Yang, Jong Cheol; So, Soonku; Joo, Minjung; Kim, Dong-Kap; Shin, Chang Ho; Lee, You-Mi; Choi, Kyung

    2016-07-01

    The nucleotide sequence of the chloroplast genome from Abies koreana is the first complete genome sequence from the genus Abies of the family Pinaceae. The circular double-stranded DNA, which consists of 121,373 base pairs (bp), contains a pair of very short inverted repeat regions (IRa and IRb) of 264 bp each, which are separated by small and large single-copy regions (SSC and LSC) of 54,197 and 66,648 bp, respectively. The genome contents of 114 genes (68 peptide-encoding genes, 35 tRNA genes, four rRNA genes, six open reading frames and one pseudogene) are similar to the chloroplast DNA of other species of Abietoideae. Loss of ndh genes was also identified in the genome of A. koreana, as in other genomes in the family Pinaceae. Thirteen genes contain one (11 genes) or two (rps12 and ycf3 genes) introns. In phylogenetic analysis, the tree confirms that Abies, Keteleeria and Cedrus are strongly supported as monophyletic. Other inverted repeat sequences located in 42-kb inversion points (1186 bp) include trnS-psaM-ycf12- ψtrnG genes. PMID:25812052
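
    The region sizes quoted above can be checked against the reported total, since a quadripartite plastid genome consists of the large single-copy region, the small single-copy region and two copies of the inverted repeat. The short sketch below does only that arithmetic; the function name is hypothetical, and the figures are the ones given in the abstract.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a plastid genome built from LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region sizes in bp, as stated in the abstract.
total = quadripartite_length(lsc=66_648, ssc=54_197, ir=264)
print(total)  # 121373, matching the reported genome size
```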

  11. [Genome sequencing and personalized medicine: perspectives and limitations].

    Science.gov (United States)

    Le Gall, Jean-Yves; Debré, Patrice

    2014-01-01

    DNA sequencing technologies have advanced at an exponential rate in recent years: the first human genome was sequenced in 2001 after many years of effort by dozens of international laboratories at a cost of tens of millions of dollars, while in 2013 a genome can be sequenced within 24 hours for a few hundred dollars (exome sequencing takes only a few hours). More and more hospital laboratories are acquiring new high-throughput sequencing devices ("next-generation sequencers", NGS), allowing them to analyze tens or hundreds of genes, or even the entire exome. This is having a major impact on medical concepts and practices, especially with respect to genetics and oncology. This ability to search for mutations simultaneously in a large number of genes is finding applications in the diagnosis of Mendelian diseases (including at birth), routine screening for heterozygotes, and pre-conception diagnosis. NGS is now sufficiently sensitive to analyze circulating fetal DNA in maternal blood (cell-free fetal DNA, cffDNA), enabling applications such as noninvasive diagnosis of fetal sex (and X-linked diseases), fetal rhesus among rhesus-negative women, trisomy and, in the near future, Mendelian mutations. Data on multifactorial diseases are still preliminary, but it should soon be possible to identify "strong" factors of genetic predisposition that have so far been beyond the scope of genome-wide association studies (GWAS). In the field of constitutional oncogenetics, NGS can also be used for simultaneous analysis of genes involved in "hereditary" cancers (21 breast cancer genes, 6 colon cancer genes, etc.). More generally, NGS can identify all genomic abnormalities (deletions, translocations, mutations) in a given malignant tissue (hemopathy or solid tumor), and has the potential to distinguish important mutations (those that drive tumor progression) from "bystander" or accessory mutations, and also to identify "druggable" mutations amenable to targeted therapies.

  12. An RNA-Seq-based reference transcriptome for Citrus.

    Science.gov (United States)

    Terol, Javier; Tadeo, Francisco; Ventimilla, Daniel; Talon, Manuel

    2016-03-01

    Previous RNA-Seq studies in citrus have focused on physiological processes relevant to fruit quality and productivity of the major species, especially sweet orange. Less attention has been paid to vegetative or reproductive tissues, while most Citrus species have never been analysed. In this work, we characterized the transcriptome of vegetative and reproductive tissues from 12 Citrus species from all main phylogenetic groups. Our aims were to acquire a complete view of the citrus transcriptome landscape, to improve previous functional annotations and to obtain genetic markers associated with genes of agronomic interest. 28 samples were used for RNA-Seq analysis, obtained from 12 Citrus species: C. medica, C. aurantifolia, C. limon, C. bergamia, C. clementina, C. deliciosa, C. reshni, C. maxima, C. paradisi, C. aurantium, C. sinensis and Poncirus trifoliata. Four different organs were analysed: root, phloem, leaf and flower. A total of 3421 million Illumina reads were produced and mapped against the reference C. clementina genome sequence. The transcript discovery pipeline revealed 3326 new genes, the number of genes with alternative splicing was increased to 19,739, and a total of 73,797 transcripts were identified. Differential expression studies between the four tissues showed that gene expression is overall related to the physiological function of the specific organs above any other variable. Variant discovery analysis revealed the presence of indels and SNPs in genes associated with fruit quality and productivity. Pivotal pathways in citrus such as those of flavonoids, flavonols, ethylene and auxin were also analysed in detail. PMID:26261026

  13. Structural characterization of the thermally-tolerant pectin methylesterase purified from Citrus sinensis fruit and its gene sequence

    Science.gov (United States)

    Despite the longstanding importance for the thermally-tolerant pectin methylesterase (TT-PME) activity in citrus juice processing and product quality, unequivocal identification of the protein and its corresponding gene has remained elusive. We purified TT-PME from sweet orange [Citrus sinensis (L.)...

  14. Draft Genome Sequence of Rice Isolate Pseudomonas chlororaphis EA105

    OpenAIRE

    McCully, Lucy M.; Bitzer, Adam S.; Spence, Carla A.; Bais, Harsh P.; Silby, Mark W.

    2014-01-01

    Pseudomonas chlororaphis EA105, a strain isolated from rice rhizosphere, has shown antagonistic activities against a rice fungal pathogen, and could be important in defense against rice blast. We report the draft genome sequence of EA105, which is an estimated size of 6.6 Mb.

  15. Complete Genome Sequence of the Haloalkaliphilic, Hydrogen Producing Halanaerobium hydrogenoformans

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Steven D [ORNL; Begemann, Matthew B [University of Wisconsin, Madison; Mormile, Dr. Melanie R. [Missouri University of Science and Technology; Wall, Judy D. [University of Missouri; Han, Cliff [Los Alamos National Laboratory (LANL); Goodwin, Lynne A. [Los Alamos National Laboratory (LANL); Pitluck, Samual [U.S. Department of Energy, Joint Genome Institute; Land, Miriam L [ORNL; Hauser, Loren John [ORNL; Elias, Dwayne A [ORNL

    2011-01-01

    Halanaerobium hydrogenoformans is an alkaliphilic bacterium capable of biohydrogen production at pH 11 and 7% (w/v) salt. We present the 2.6 Mb genome sequence to provide insights into its physiology and potential for bioenergy applications.

  16. Genome Sequence of the Yeast Cyberlindnera fabianii (Hansenula fabianii)

    OpenAIRE

    Freel, Kelle C.; Sarilar, Veronique; Neuvéglise, Cécile; Devillers, Hugo; Friedrich, Anne; Schacherer, Joseph

    2014-01-01

    The yeast Cyberlindnera fabianii is used in wastewater treatment, fermentation of alcoholic beverages, and has caused blood infections. To assist in the accurate identification of this species, and to determine the genetic basis for properties involved in fermentation and water treatment, we sequenced and annotated the genome of C. fabianii (YJS4271).

  17. Complete Genome Sequence of Haemophilus parasuis SH0165

    OpenAIRE

    Yue, Min; Yang, Fan; Yang, Jian; Bei, Weicheng; Cai, Xuwang; Chen, Lihong; Dong, Jie; Zhou, Rui; Jin, Meilin; Jin, Qi; Chen, Huanchun

    2008-01-01

    Haemophilus parasuis is the causative agent of Glässer's disease, which produces big losses in swine populations worldwide. H. parasuis SH0165, belonging to the dominant serovar 5 in China, is a clinically isolated strain with high-level virulence. Here, we report the first completed genome sequence of this species.

  18. Complete Genome Sequence of a Novel Human Betapapillomavirus, HPV-159

    OpenAIRE

    Kocjan, Boštjan J.; Hošnjak, Lea; Seme, Katja; Poljak, Mario

    2013-01-01

    A novel human papillomavirus (HPV), now officially recognized as HPV-159, isolated from an anal swab, was fully cloned, sequenced, and genetically characterized. HPV-159 has a genomic organization that is typical of cutaneotrophic HPV types, and it belongs to the genus Betapapillomavirus.

  19. Complete Genome Sequence of Beijerinckia indica subsp. indica

    OpenAIRE

    Tamas, Ivica; Dedysh, Svetlana N.; Liesack, Werner; Stott, Matthew B.; Alam, Maqsudul; Murrell, J. Colin; Dunfield, Peter F.

    2010-01-01

    Beijerinckia indica subsp. indica is an aerobic, acidophilic, exopolysaccharide-producing, N2-fixing soil bacterium. It is a generalist chemoorganotroph that is phylogenetically closely related to facultative and obligate methanotrophs of the genera Methylocella and Methylocapsa. Here we report the full genome sequence of this bacterium.

  20. The mitochondrial genome sequence of the Tasmanian tiger (Thylacinus cynocephalus)

    DEFF Research Database (Denmark)

    Miller, Webb; Drautz, Daniela I; Janecka, Jan E; Lesk, Arthur M; Ratan, Aakrosh; Tomsho, Lynn P; Packard, Mike; Zhang, Yeting; McClellan, Lindsay R; Qi, Ji; Zhao, Fangqing; Gilbert, M Thomas P; Dalén, Love; Arsuaga, Juan Luis; Ericson, Per G P; Huson, Daniel H; Helgen, Kristofer M; Murphy, William J; Götherström, Anders; Schuster, Stephan C

    2009-01-01

    We report the first two complete mitochondrial genome sequences of the thylacine (Thylacinus cynocephalus), or so-called Tasmanian tiger, extinct since 1936. The thylacine's phylogenetic position within australidelphian marsupials has long been debated, and here we provide strong support for the...

  1. Complete Genome Sequence of Robiginitalea biformata HTCC2501

    OpenAIRE

    Oh, Hyun-Myung; Giovannoni, Stephen J.; Lee, Kiyoung; Ferriera, Steve; Johnson, Justin; Cho, Jang-Cheon

    2009-01-01

    Robiginitalea biformata HTCC2501, isolated from the Sargasso Sea by dilution-to-extinction culturing, has been known as an aerobic chemoheterotroph with carotenoid pigments and dimorphic growth phases. Here, we announce the complete sequence of the R. biformata HTCC2501 genome, which contains genes for carotenoid biosynthesis and several macromolecule-degrading enzymes.

  2. Genome sequence of the human pathogen Vibrio cholerae Amazonia.

    NARCIS (Netherlands)

    Thompson, C.C.; Marin, M.A.; Dias, G.M.; Dutilh, B.E.; Edwards, R.A.; Iida, T.; Thompson, F.L.; Vicente, A.C.

    2011-01-01

    Vibrio cholerae O1 Amazonia is a pathogen that was isolated from cholera-like diarrhea cases in at least two countries, Brazil and Ghana. Based on multilocus sequence analysis, this lineage belongs to a distinct profile compared to strains from El Tor and classical biotypes. The genomic analysis rev

  3. Genome Sequence of the Tick-Borne Pathogen Rickettsia raoultii.

    Science.gov (United States)

    El Karkouri, Khalid; Mediannikov, Oleg; Robert, Catherine; Raoult, Didier; Fournier, Pierre-Edouards

    2016-01-01

    Rickettsia raoultii is a tick-associated spotted fever group (SFG) organism, causing scalp eschar and neck lymphadenopathy after tick bite (SENLAT) in humans. We report here the genome sequence of R. raoultii strain Khabarovsk(T) (CSUR R3(T), ATCC VR-1596(T)), which was isolated from a Dermacentor silvarum tick collected in Russia. PMID:27103706

  4. Genome Sequence of the Tick-Borne Pathogen Rickettsia raoultii

    OpenAIRE

    El Karkouri, Khalid; Mediannikov, Oleg; Robert, Catherine; Raoult, Didier; Fournier, Pierre-Edouards

    2016-01-01

    Rickettsia raoultii is a tick-associated spotted fever group (SFG) organism, causing scalp eschar and neck lymphadenopathy after tick bite (SENLAT) in humans. We report here the genome sequence of R. raoultii strain KhabarovskT (CSUR R3T, ATCC VR-1596T), which was isolated from a Dermacentor silvarum tick collected in Russia.

  5. Genome Sequence of the Paleopolyploid Soybean (Glycine max (L.) Merr.)

    Science.gov (United States)

    We report the genome sequence for soybean (Glycine max var. Williams 82), one of the most important crop plants worldwide because of its ability to produce both protein and oil. Soybean is a recently domesticated legume that plays a vital role in crop rotation as it fixes atmospheric nitrogen via s...

  6. Draft Genome Sequence of Bacillus subtilis strain KATMIRA1933

    OpenAIRE

    Karlyshev, Andrey V.; Melnikov, Vyacheslav G.; Chikindas, Michael L.

    2014-01-01

    In this report, we present a draft sequence of Bacillus subtilis KATMIRA1933. Previous studies demonstrated probiotic properties of this strain partially attributed to production of an antibacterial compound, subtilosin. Comparative analysis of this strain’s genome with that of a commercial probiotic strain, B. subtilis Natto, is presented.

  7. Complete Genome Sequences of Four Isolates of Plutella xylostella Granulovirus

    Science.gov (United States)

    2016-01-01

    Granuloviruses are widespread pathogens of Plutella xylostella L. (diamondback moth) and potential biopesticides for control of this global insect pest. We report the complete genomes of four Plutella xylostella granulovirus isolates from China, Malaysia, and Taiwan exhibiting pairs of noncoding, homologous repeat regions with significant sequence variation but equivalent length. PMID:27365355

  8. Draft Genome Sequence of Streptococcus agalactiae PR06

    OpenAIRE

    MZ, Irma Syakina; L. K. Teh; Salleh, M. Z.

    2013-01-01

    Streptococcus agalactiae (group B streptococcus [GBS]) is a Gram-positive bacterium that was first recognized as a causative agent of bovine mastitis. S. agalactiae has subsequently emerged as a significant cause of human diseases. Here, we report the draft genome sequence of S. agalactiae PR06, which was isolated from a septicemic patient in a local hospital in Malaysia.

  9. Draft Genome Sequence of Pectobacterium wasabiae Strain CFIA1002.

    Science.gov (United States)

    Yuan, Kat Xiaoli; Adam, Zaky; Tambong, James; Lévesque, C André; Chen, Wen; Lewis, Christopher T; De Boer, Solke H; Li, Xiang Sean

    2014-01-01

    Pectobacterium wasabiae, originally causing soft rot disease in horseradish in Japan, was recently found to cause blackleg-like symptoms on potato in the United States, Canada, and Europe. A draft genome sequence of a Canadian potato isolate of P. wasabiae CFIA1002 will enhance the characterization of its pathogenicity and host specificity features. PMID:24831134

  10. Draft Genome Sequence of Pectobacterium wasabiae Strain CFIA1002

    OpenAIRE

    Yuan, Kat (Xiaoli); Adam, Zaky; Tambong, James; Lévesque, C. André; Chen, Wen; Lewis, Christopher T.; De Boer, Solke H.; LI, XIANG

    2014-01-01

    Pectobacterium wasabiae, originally causing soft rot disease in horseradish in Japan, was recently found to cause blackleg-like symptoms on potato in the United States, Canada, and Europe. A draft genome sequence of a Canadian potato isolate of P. wasabiae CFIA1002 will enhance the characterization of its pathogenicity and host specificity features.

  11. Draft Genome Sequences of the Turfgrass Pathogen Sclerotinia homoeocarpa.

    Science.gov (United States)

    Green, Robert; Sang, Hyunkyu; Chang, Taehyun; Allan-Perkins, Elisha; Petit, Elsa; Jung, Geunhwa

    2016-01-01

    Sclerotinia homoeocarpa (F. T. Bennett) is one of the most economically important pathogens on high-amenity cool-season turfgrasses, where it causes dollar spot. To understand the genetic mechanisms of fungicide resistance, which has become highly prevalent, the whole genomes of two isolates with varied resistance levels to fungicides were sequenced. PMID:26868400

  12. Complete Genome Sequence of Bacillus thuringiensis Bacteriophage BMBtp2

    OpenAIRE

    Dong, Zhaoxia; Peng, Donghai; Wang, Yueying; Zhu, Lei; Ruan, Lifang; Sun, Ming

    2013-01-01

    Bacillus thuringiensis is an insect pathogen which has been widely used for biocontrol. During B. thuringiensis fermentation, lysogenic bacteriophages cause severe losses of yield. Here, we announce the complete genome sequence of a bacteriophage, BMBtp2, which is induced from B. thuringiensis strain YBT-1765, which may be helpful to clarify the mechanism involved in bacteriophage contamination.

  13. Draft Genome Sequence of Halomonas smyrnensis AAD6T

    OpenAIRE

    Sogutcu, Elif; Emrence, Zeliha; Arikan, Muzzaffer; Cakiris, Aris; Abaci, Neslihan; Öner, Ebru Toksoy; Üstek, Duran; Arga, Kazim Yalcin

    2012-01-01

    Halomonas smyrnensis AAD6T is a Gram-negative, aerobic, exopolysaccharide-producing, and moderately halophilic bacterium that produces levan, a fructose homopolymer with many potential uses in various industries. We report the draft genome sequence of H. smyrnensis AAD6T, which will accelerate research on the rational design and optimization of microbial levan production.

  14. Genome Sequence of Ureaplasma diversum Strain ATCC 49782.

    Science.gov (United States)

    Marques, Lucas M; Guimarães, Ana M S; Martins, Hellen B; Rezende, Izadora S; Barbosa, Maysa S; Campos, Guilherme B; do Nascimento, Naíla C; Dos Santos, Andrea P; Amorim, Aline T; Santos, Verena M; Messick, Joanne B; Timenetsky, Jorge

    2015-01-01

    Here, we report the complete genome sequence of Ureaplasma diversum strain ATCC 49782. This species is of bovine origin, having an association with reproductive disorders in cattle, including placentitis, fetal alveolitis, abortion, and birth of weak calves. It has a small circular chromosome of 975,425 bp. PMID:25883297

  15. Genome Sequence of Klebsiella pneumoniae Urinary Tract Isolate Top52

    OpenAIRE

    Johnson, Jeremiah G.; Spurbeck, Rachel R.; Sandhu, Sukhinder K.; Matson, Jyl S.

    2014-01-01

    Klebsiella pneumoniae is a significant cause of nosocomial infections, including ventilator-associated pneumonias and catheter-associated urinary tract infections. K. pneumoniae strain TOP52 #1721 (Top52) was isolated from a woman presenting with acute cystitis and subsequently characterized using various murine models of infection. Here we present the genome sequence of K. pneumoniae Top52.

  16. Complete Genome Sequence of Mycobacterium vaccae Type Strain ATCC 25954

    KAUST Repository

    Ho, Y. S.

    2012-10-26

    Mycobacterium vaccae is a rapidly growing, nontuberculous Mycobacterium species that is generally not considered a human pathogen and is of major pharmaceutical interest as an immunotherapeutic agent. We report here the annotated genome sequence of the M. vaccae type strain, ATCC 25954.

  17. Complete Genome Sequence of Biocontrol Strain Pseudomonas fluorescens LBUM 223

    OpenAIRE

    Roquigny, Roxane; Arseneault, Tanya; Gadkar, Vijay J.; Novinscak, Amy; Joly, David L.; Filion, Martin

    2015-01-01

    Pseudomonas fluorescens LBUM 223 is a plant growth-promoting rhizobacterium (PGPR) with biocontrol activity against various plant pathogens. It produces the antimicrobial metabolite phenazine-1-carboxylic acid, which is involved in the biocontrol of Streptomyces scabies, the causal agent of common scab of potato. Here, we report the complete genome sequence of P. fluorescens LBUM 223.

  18. Whole Genome Sequences of Four Brucella Strains

    OpenAIRE

    Ding, Jiabo; Pan, Yuanlong; Jiang, Hai; Cheng, Junsheng; Liu, Taotao; Qin, Nan; Yi YANG; Cui, Buyun; Chen, Chen; Liu, Cuihua; Mao, Kairong; Zhu, Baoli

    2011-01-01

    Brucella melitensis and Brucella suis are intracellular pathogens of livestock and humans. Here we report four genome sequences, those of the virulent strain B. melitensis M28-12 and vaccine strains B. melitensis M5 and M111 and B. suis S2, which show different virulences and pathogenicities, which will help to design a more effective brucellosis vaccine.

  19. Genome Sequence of Klebsiella pneumoniae Respiratory Isolate IA565

    OpenAIRE

    Johnson, Jeremiah G.; Spurbeck, Rachel R.; Sandhu, Sukhinder K.; Matson, Jyl S

    2014-01-01

    Klebsiella pneumoniae is a clinically significant opportunistic bacterial pathogen as well as a normal member of the human microbiota. K. pneumoniae strain IA565 was isolated from a tracheal aspirate at the University of Iowa Hospitals and Clinics. Here, we present the genome sequence of K. pneumoniae IA565.

  20. Complete Genome Sequence of Anaplasma marginale subsp. centrale

    Science.gov (United States)

    Anaplasma marginale subsp. centrale is a naturally attenuated subtype that has been used as a vaccine for a century. We sequenced the genome of this organism and compared it to those of virulent sensu stricto A. marginale strains. The comparison markedly narrows the number of outer membrane protein ...

  1. Complete Genome Sequences of Four Isolates of Plutella xylostella Granulovirus.

    Science.gov (United States)

    Spence, Robert J; Noune, Christopher; Hauxwell, Caroline

    2016-01-01

    Granuloviruses are widespread pathogens of Plutella xylostella L. (diamondback moth) and potential biopesticides for control of this global insect pest. We report the complete genomes of four Plutella xylostella granulovirus isolates from China, Malaysia, and Taiwan exhibiting pairs of noncoding, homologous repeat regions with significant sequence variation but equivalent length. PMID:27365355

  2. Genome Sequences of Six Paenibacillus larvae Siphoviridae Phages.

    Science.gov (United States)

    Carson, Susan; Bruff, Emily; DeFoor, William; Dums, Jacob; Groth, Adam; Hatfield, Taylor; Iyer, Aruna; Joshi, Kalyani; McAdams, Sarah; Miles, Devon; Miller, Delanie; Oufkir, Abdoullah; Raynor, Brinkley; Riley, Sara; Roland, Shelby; Rozier, Horace; Talley, Sarah; Miller, Eric S

    2015-01-01

    Six sequenced and annotated genomes of Paenibacillus larvae phages isolated from the combs of American foulbrood-diseased beehives are 37 to 45 kbp and have approximately 42% G+C content and 60 to 74 protein-coding genes. Phage Lily is most divergent from Diva, Rani, Redbud, Shelly, and Sitara. PMID:26089405

  3. Draft Genome Sequences of Four Plant Probiotic Bacillus Strains.

    Science.gov (United States)

    Jeong, Haeyoung; Park, Seung-Hwan; Choi, Soo-Keun

    2016-01-01

    Here, we report the whole-genome sequences of four Bacillus strains that exhibit plant probiotic activities. Three of them are the type strains of Bacillus endophyticus, "Bacillus gaemokensis," and Bacillus trypoxylicola, and the other, Bacillus sp. strain KCTC 13219, should be reclassified into a species belonging to the genus Lysinibacillus. PMID:27174273

  4. Large-Scale Sequencing: The Future of Genomic Sciences Colloquium

    Energy Technology Data Exchange (ETDEWEB)

    Margaret Riley; Merry Buckley

    2009-01-01

    Genetic sequencing and the various molecular techniques it has enabled have revolutionized the field of microbiology. Examining and comparing the genetic sequences borne by microbes - including bacteria, archaea, viruses, and microbial eukaryotes - provides researchers insights into the processes microbes carry out, their pathogenic traits, and new ways to use microorganisms in medicine and manufacturing. Until recently, sequencing entire microbial genomes has been laborious and expensive, and the decision to sequence the genome of an organism was made on a case-by-case basis by individual researchers and funding agencies. Now, thanks to new technologies, the cost and effort of sequencing is within reach for even the smallest facilities, and the ability to sequence the genomes of a significant fraction of microbial life may be possible. The availability of numerous microbial genomes will enable unprecedented insights into microbial evolution, function, and physiology. However, the current ad hoc approach to gathering sequence data has resulted in an unbalanced and highly biased sampling of microbial diversity. A well-coordinated, large-scale effort to target the breadth and depth of microbial diversity would result in the greatest impact. The American Academy of Microbiology convened a colloquium to discuss the scientific benefits of engaging in a large-scale, taxonomically-based sequencing project. A group of individuals with expertise in microbiology, genomics, informatics, ecology, and evolution deliberated on the issues inherent in such an effort and generated a set of specific recommendations for how best to proceed. The vast majority of microbes are presently uncultured and, thus, pose significant challenges to such a taxonomically-based approach to sampling genome diversity. However, we have yet to even scratch the surface of the genomic diversity among cultured microbes. 
A coordinated sequencing effort of cultured organisms is an appropriate place to begin.

  5. Complete Genome Sequence of Actinobaculum schaalii Strain CCUG 27420

    DEFF Research Database (Denmark)

    Kristiansen, Rikke; Dueholm, Morten S; Bank, Steffen;

    2014-01-01

    Complete genome sequencing of the emerging uropathogen Actinobaculum schaalii indicates that an important mechanism of its virulence is attachment pili, which allow the organism to adhere to the surface of animal cells, greatly enhancing the ability of this organism to colonize the urinary tract....

  6. Characterization of reniform nematode genome through shotgun sequencing

    Science.gov (United States)

    The reniform nematode (RN), a major agricultural pest particularly on cotton in the United States (U.S.), is among the major plant parasitic nematodes for which limited genomic information exists. In this study, over 380 Mb of sequence data were generated from four pooled adult female RN and assembl...

  7. Bioinformatics for whole-genome shotgun sequencing of microbial communities.

    Directory of Open Access Journals (Sweden)

    Kevin Chen

    2005-07-01

    The application of whole-genome shotgun sequencing to microbial communities represents a major development in metagenomics, the study of uncultured microbes via the tools of modern genomic analysis. In the past year, whole-genome shotgun sequencing projects of prokaryotic communities from an acid mine biofilm, the Sargasso Sea, Minnesota farm soil, three deep-sea whale falls, and deep-sea sediments have been reported, adding to previously published work on viral communities from marine and fecal samples. The interpretation of this new kind of data poses a wide variety of exciting and difficult bioinformatics problems. The aim of this review is to introduce the bioinformatics community to this emerging field by surveying existing techniques and promising new approaches for several of the most interesting of these computational problems.

  8. The complete plastid genome sequence of Picea jezoensis (Pinaceae: Piceoideae).

    Science.gov (United States)

    Yang, Jong Cheol; Joo, Minjung; So, Soonku; Yi, Dong-Keun; Shin, Chang Ho; Lee, You-Mi; Choi, Kyung

    2016-09-01

    The nucleotide sequence of the complete chloroplast genome of P. jezoensis was completed. The total genome size was 124 146 bp, containing a pair of very short inverted repeats (IRa and IRb) of 422 bp, which were separated by large single copy (LSC) and small single copy (SSC) with 66 956 bp and 56 346 bp, respectively. The overall GC contents of the plastid genome were determined as 38.8%. One hundred fifteen genes including 68 peptide-encoding genes, 35 tRNA genes, four rRNA genes, six open-reading frames, and two pseudogenes were annotated. In these genes, 15 genes contained only one or two introns. Phylogenetic analyses using maximum likelihood (ML) methods were performed from fully sequenced Gymnosperms and other species of dataset composed of 69 protein-coding genes. PMID:26332576

  9. Complete genome sequence of the fish pathogen Flavobacterium psychrophilum.

    Science.gov (United States)

    Duchaud, Eric; Boussaha, Mekki; Loux, Valentin; Bernardet, Jean-François; Michel, Christian; Kerouault, Brigitte; Mondot, Stanislas; Nicolas, Pierre; Bossy, Robert; Caron, Christophe; Bessières, Philippe; Gibrat, Jean-François; Claverol, Stéphane; Dumetz, Fabien; Le Hénaff, Michel; Benmansour, Abdenour

    2007-07-01

    We report here the complete genome sequence of the virulent strain JIP02/86 (ATCC 49511) of Flavobacterium psychrophilum, a widely distributed pathogen of wild and cultured salmonid fish. The genome consists of a 2,861,988-base pair (bp) circular chromosome with 2,432 predicted protein-coding genes. Among these predicted proteins, stress response mediators, gliding motility proteins, adhesins and many putative secreted proteases are probably involved in colonization, invasion and destruction of the host tissues. The genome sequence provides the basis for explaining the relationships of the pathogen to the host and opens new perspectives for the development of more efficient disease control strategies. It also allows for a better understanding of the physiology and evolution of a significant representative of the family Flavobacteriaceae, whose members are associated with an interesting diversity of lifestyles and habitats. PMID:17592475

  10. Genetically based location from triploid populations and gene ontology of a 3.3-mb genome region linked to alternaria brown spot resistance in citrus reveal clusters of resistance genes

    OpenAIRE

    José Cuenca; Pablo Aleza; Antonio Vicent; Dominique Brunel; Patrick Ollitrault; Luis Navarro

    2013-01-01

    Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR) to map a genome region linked to Alternaria brown spot (ABS) resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria ...

  11. Screening Potential DNA Barcode Regions of Chloroplast Coding Genome for Citrus and Its Related Genera

    Institute of Scientific and Technical Information of China (English)

    于杰; 闫化学; 鲁振华; 周志钦

    2011-01-01

    [Objective] Four coding regions of the chloroplast genome of Citrus and its close relatives were analyzed in an attempt to find suitable DNA barcoding markers for species identification and to lay a foundation for further study of non-coding regions. [Method] Four chloroplast DNA regions (matK, rpoB, rpoC1 and rbcL) of 59 Citrus accessions were sequenced, the intergeneric, interspecific and intraspecific genetic distances were calculated, and a phylogenetic tree of all the accessions tested was built based on the distance data obtained. [Result] The intergeneric and interspecific sequence variations of matK were the highest among the four coding regions tested and differed significantly from those of the other regions studied. In contrast, no obvious variations were found in the rpoB and rpoC1 regions. The sequence variation of rbcL was intermediate among the fragments sequenced. [Conclusion] The matK sequence could be used as a potential candidate fragment for future DNA barcoding studies of Citrus and its closely related genera.

  12. Construction and analysis of full-length and normalized cDNA libraries from citrus

    OpenAIRE

    Marqués, M.Carmen; Pérez-Amador, Miguel A.

    2012-01-01

    We have developed an integrated method to generate a normalized cDNA collection enriched in full-length and rare transcripts from citrus, using different species and multiple tissues and developmental stages. Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. In this regard, the availability of full-length cDNA clones facilitates functional an...

  13. Bacterial diversity analysis of Huanglongbing pathogen-infected citrus, using PhyloChip and 16S rRNA gene clone library sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Shankar Sagaram, U.; DeAngelis, K.M.; Trivedi, P.; Andersen, G.L.; Lu, S.-E.; Wang, N.

    2009-03-01

    The bacterial diversity associated with citrus leaf midribs was characterized from citrus groves that contained the Huanglongbing (HLB) pathogen, which has yet to be cultivated in vitro. We employed a combination of high-density phylogenetic 16S rDNA microarray and 16S rDNA clone library sequencing to determine the microbial community composition of symptomatic and asymptomatic citrus midribs. Our results revealed that citrus leaf midribs can support a diversity of microbes. PhyloChip analysis indicated that 47 orders of bacteria from 15 phyla were present in the citrus leaf midribs while 20 orders from phyla were observed with the cloning and sequencing method. PhyloChip arrays indicated that nine taxa were significantly more abundant in symptomatic midribs compared to asymptomatic midribs. Candidatus Liberibacter asiaticus (Las) was detected at a very low level in asymptomatic plants, but was over 200 times more abundant in symptomatic plants. The PhyloChip analysis was further verified by sequencing 16S rDNA clone libraries, which indicated the dominance of Las in symptomatic leaves. These data implicate Las as the pathogen responsible for HLB disease. Citrus is the most important commercial fruit crop in Florida. In recent years, citrus Huanglongbing (HLB), also called citrus greening, has severely affected Florida's citrus production and hence has drawn an enormous amount of attention. HLB is one of the most devastating diseases of citrus (6,13), characterized by blotchy mottling with green islands on leaves, as well as stunting, fruit decline, and small, lopsided fruits with poor coloration. The disease tends to be associated with a phloem-limited fastidious α-proteobacterium given a provisional Candidatus status (Candidatus Liberobacter spp. later changed to Candidatus Liberibacter spp.) in nomenclature (18,25,34). Previous studies indicate that HLB infection causes disorder in the phloem and severely impairs the translocation of assimilates in

  14. Draft genome sequence of the Algerian bee Apis mellifera intermissa.

    Science.gov (United States)

    Haddad, Nizar Jamal; Loucif-Ayad, Wahida; Adjlane, Noureddine; Saini, Deepti; Manchiganti, Rushiraj; Krishnamurthy, Venkatesh; AlShagoor, Banan; Batainh, Ahmed Mahmud; Mugasimangalam, Raja

    2015-06-01

    Apis mellifera intermissa is the native honeybee subspecies of Algeria. A. m. intermissa occurs in Tunisia, Algeria and Morocco, between the Atlas and the Mediterranean and Atlantic coasts. This bee is very important due to its high ability to adapt to great variations in climatic conditions and due to its preferable cleaning behavior. Here we report the draft genome sequence of this honey bee; its Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession JSUV00000000. The 240-Mb genome is being annotated and analyzed. Comparison with the genome of other Apis mellifera sub-species promises to yield insights into the evolution of adaptations to high temperature and resistance to Varroa parasite infestation. PMID:26484171

  15. Draft genome sequence of the Algerian bee Apis mellifera intermissa

    Directory of Open Access Journals (Sweden)

    Nizar Jamal Haddad

    2015-06-01

    Apis mellifera intermissa is the native honeybee subspecies of Algeria. A. m. intermissa occurs in Tunisia, Algeria and Morocco, between the Atlas and the Mediterranean and Atlantic coasts. This bee is very important due to its high ability to adapt to great variations in climatic conditions and due to its preferable cleaning behavior. Here we report the draft genome sequence of this honey bee; its Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession JSUV00000000. The 240-Mb genome is being annotated and analyzed. Comparison with the genome of other Apis mellifera sub-species promises to yield insights into the evolution of adaptations to high temperature and resistance to Varroa parasite infestation.

  16. Contrasting DNA sequence organisation patterns in sauropsidian genomes.

    Science.gov (United States)

    Epplen, J T; Diedrich, U; Wagenmann, M; Schmidtke, J; Engel, W

    1979-11-01

    The genomic DNA organisation patterns of four sauropsidian species, namely Python reticularis, Caiman crocodilus, Terrapene carolina triungius and Columba livia domestica were investigated by reassociation of short and long DNA fragments, by hyperchromicity measurements of reannealed fragments and by length estimations of S1-nuclease resistant repetitive duplexes. While the genomic DNA of the three reptilian species shows a short period interspersion pattern, the genome of the avian species is organised in a long period interspersion pattern apparently typical for birds. These findings are discussed in view of the close phylogenetic relationships of birds and reptiles, and also with regard to a possible relationship between the extent of sequence interspersion and genome size. PMID:533670

  17. Quantifying Next Generation Sequencing Sample Pre-Processing Bias in HIV-1 Complete Genome Sequencing

    Directory of Open Access Journals (Sweden)

    Bram Vrancken

    2016-01-01

    Full Text Available Genetic analyses play a central role in infectious disease research. Massively parallelized “mechanical cloning” and sequencing technologies were quickly adopted by HIV researchers in order to broaden the understanding of the clinical importance of minor drug-resistant variants. These efforts have, however, remained largely limited to small genomic regions. The growing need to monitor multiple genome regions for drug resistance testing, as well as the obvious benefit for studying evolutionary and epidemic processes, makes complete genome sequencing an important goal in viral research. In addition, a major drawback for NGS applications to RNA viruses is the need for large quantities of input DNA. Here, we use a generic overlapping amplicon-based near full-genome amplification protocol to compare low-input enzymatic fragmentation (Nextera™) with conventional mechanical shearing for Roche 454 sequencing. We find that the fragmentation method has only a modest impact on the characterization of the population composition and that, for reliable results, the variation introduced at all steps of the procedure, from nucleic acid extraction to sequencing, should be taken into account, a finding that is also relevant for NGS technologies that are now more commonly used. Furthermore, by applying our protocol to deep sequence a number of pre-therapy plasma and PBMC samples, we illustrate the potential benefits of a near complete genome sequencing approach in routine genotyping.

  18. Whole genome sequencing reveals genomic heterogeneity and antibiotic purification in Mycobacterium tuberculosis isolates

    KAUST Repository

    Black, PA

    2015-10-24

    Background: Whole genome sequencing has revolutionised the interrogation of mycobacterial genomes. Recent studies have reported conflicting findings on the genomic stability of Mycobacterium tuberculosis during the evolution of drug resistance. In an age where whole genome sequencing is increasingly relied upon for defining the structure of bacterial genomes, it is important to investigate the reliability of next generation sequencing to identify clonal variants present in a minor percentage of the population. This study aimed to define a reliable cut-off for identification of low frequency sequence variants and to subsequently investigate genetic heterogeneity and the evolution of drug resistance in M. tuberculosis. Methods: Genomic DNA was isolated from single colonies from 14 rifampicin mono-resistant M. tuberculosis isolates, as well as the primary cultures and follow up MDR cultures from two of these patients. The whole genomes of the M. tuberculosis isolates were sequenced using either the Illumina MiSeq or Illumina HiSeq platforms. Sequences were analysed with an in-house pipeline. Results: Using next-generation sequencing in combination with Sanger sequencing and statistical analysis, we defined a read frequency cut-off of 30% to identify low frequency M. tuberculosis variants with high confidence. Using this cut-off we demonstrated a high rate of genetic diversity between single colonies isolated from one population, showing that by using the current sequencing technology, single colonies are not a true reflection of the genetic diversity within a whole population and vice versa. We further showed that numerous heterogeneous variants emerge and then disappear during the evolution of isoniazid resistance within individual patients. Our findings allowed us to formulate a model for the selective bottleneck which occurs during the course of infection, acting as a genomic purification event. Conclusions: Our study demonstrated true levels of genetic diversity
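
    The read-frequency cut-off described in this abstract can be illustrated with a minimal sketch. The record does not describe the in-house pipeline, so the `VariantCall` type, the function names, and the choice to treat the 30% threshold as inclusive are all assumptions made for illustration only.

```python
from typing import List, NamedTuple


class VariantCall(NamedTuple):
    """Hypothetical container for one candidate variant (not from the paper)."""
    position: int      # genomic coordinate of the variant
    alt_reads: int     # reads supporting the variant allele
    total_reads: int   # total reads covering the position


def variant_frequency(call: VariantCall) -> float:
    """Fraction of covering reads that support the variant allele."""
    if call.total_reads == 0:
        return 0.0
    return call.alt_reads / call.total_reads


def filter_by_frequency(calls: List[VariantCall], cutoff: float = 0.30) -> List[VariantCall]:
    """Keep calls whose read frequency is at or above the cut-off.

    The 0.30 default mirrors the 30% figure in the abstract; whether the
    authors used >= or > is an assumption.
    """
    return [c for c in calls if variant_frequency(c) >= cutoff]


calls = [
    VariantCall(position=1200, alt_reads=12, total_reads=100),  # 12%: dropped
    VariantCall(position=7610, alt_reads=30, total_reads=100),  # 30%: kept
    VariantCall(position=9431, alt_reads=85, total_reads=100),  # 85%: kept
]
kept = filter_by_frequency(calls)
print([c.position for c in kept])
```

    In this toy input only the calls at 30% and 85% survive the filter; a real pipeline would also weigh coverage depth and base quality, which this sketch ignores.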

  19. MetaSim: a sequencing simulator for genomics and metagenomics.

    Directory of Open Access Journals (Sweden)

    Daniel C Richter

    Full Text Available BACKGROUND: The new research field of metagenomics is providing exciting insights into various, previously unclassified ecological systems. Next-generation sequencing technologies are producing a rapid increase of environmental data in public databases. There is great need for specialized software solutions and statistical methods for dealing with complex metagenome data sets. METHODOLOGY/PRINCIPAL FINDINGS: To facilitate the development and improvement of metagenomic tools and the planning of metagenomic projects, we introduce a sequencing simulator called MetaSim. Our software can be used to generate collections of synthetic reads that reflect the diverse taxonomical composition of typical metagenome data sets. Based on a database of given genomes, the program allows the user to design a metagenome by specifying the number of genomes present at different levels of the NCBI taxonomy, and then to collect reads from the metagenome using a simulation of a number of different sequencing technologies. A population sampler optionally produces evolved sequences based on source genomes and a given evolutionary tree. CONCLUSIONS/SIGNIFICANCE: MetaSim allows the user to simulate individual read datasets that can be used as standardized test scenarios for planning sequencing projects or for benchmarking metagenomic software.
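
    The idea of drawing synthetic reads from a designed community can be sketched in a few lines. This is not MetaSim's algorithm or interface: the function name, the abundance dictionary, and the error-free fixed-length reads are assumptions chosen to keep the illustration small.

```python
import random
from typing import Dict, List, Tuple


def simulate_reads(
    genomes: Dict[str, str],
    abundances: Dict[str, float],
    n_reads: int,
    read_length: int,
    seed: int = 0,
) -> List[Tuple[str, str]]:
    """Sample error-free reads from genomes in proportion to their abundance.

    Returns (source_name, read_sequence) pairs. Sequencing-error models and
    evolved population variants, which the abstract mentions, are not modelled.
    """
    for name, seq in genomes.items():
        if len(seq) < read_length:
            raise ValueError(f"genome {name!r} is shorter than the read length")
    rng = random.Random(seed)  # seeded so a test scenario is reproducible
    names = list(genomes)
    weights = [abundances[name] for name in names]
    reads = []
    for _ in range(n_reads):
        name = rng.choices(names, weights=weights, k=1)[0]
        seq = genomes[name]
        start = rng.randrange(len(seq) - read_length + 1)
        reads.append((name, seq[start:start + read_length]))
    return reads


toy_genomes = {"taxon_a": "ACGTACGTACGGTA", "taxon_b": "GGGGCCCCTTTTAA"}
toy_abundances = {"taxon_a": 0.9, "taxon_b": 0.1}
reads = simulate_reads(toy_genomes, toy_abundances, n_reads=50, read_length=5)
print(len(reads), reads[0])
```

    Fixing the seed makes the toy dataset repeatable, which matches the abstract's stated use of simulated reads as standardized test scenarios.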

  20. Sequencing Crop Genomes: A Gateway to Improve Tropical Agriculture.

    Science.gov (United States)

    Thottathil, Gincy Paily; Jayasekaran, Kandakumar; Othman, Ahmad Sofiman

    2016-02-01

    Agricultural development in the tropics lags behind development in the temperate latitudes due to the lack of advanced technology, and various biotic and abiotic factors. To cope with the increasing demand for food and other plant-based products, improved crop varieties have to be developed. To breed improved varieties, a better understanding of crop genetics is necessary. With the advent of next-generation DNA sequencing technologies, many important crop genomes have been sequenced. Primary importance has been given to food crops, including cereals, tuber crops, vegetables, and fruits. The DNA sequence information is extremely valuable for identifying key genes controlling important agronomic traits and for identifying genetic variability among the cultivars. However, massive DNA re-sequencing and gene expression studies have to be performed to substantially improve our understanding of crop genetics. Application of the knowledge obtained from the genomes, transcriptomes, expression studies, and epigenetic studies would enable the development of improved varieties and may lead to a second green revolution. The applications of next generation DNA sequencing technologies in crop improvement, its limitations, future prospects, and the features of important crop genome projects are reviewed herein. PMID:27019684

  1. Secure distributed genome analysis for GWAS and sequence comparison computation

    Science.gov (United States)

    2015-01-01

    Background: The rapid increase in the availability and volume of genomic data makes significant advances in biomedical research possible, but sharing of genomic data poses challenges due to the highly sensitive nature of such data. To address the challenges, a competition for secure distributed processing of genomic data was organized by the iDASH research center. Methods: In this work we propose techniques for securing computation with real-life genomic data for minor allele frequency and chi-squared statistics computation, as well as distance computation between two genomic sequences, as specified by the iDASH competition tasks. We put forward novel optimizations, including a generalization of a version of mergesort, which might be of independent interest. Results: We provide implementation results of our techniques based on secret sharing that demonstrate practicality of the suggested protocols and also report on performance improvements due to our optimization techniques. Conclusions: This work describes our techniques, findings, and experimental results developed and obtained as part of iDASH 2015 research competition to secure real-life genomic computations and shows feasibility of securely computing with genomic data in practice. PMID:26733307
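
    For orientation, the two statistics named in this abstract can be computed in the clear as sketched below. This is ordinary plaintext arithmetic, not the secret-sharing protocol the authors propose; the function names and the 0/1/2 genotype encoding are assumptions for illustration.

```python
from typing import Sequence


def minor_allele_frequency(genotypes: Sequence[int]) -> float:
    """MAF at one biallelic site.

    `genotypes` holds the count of alternate alleles per diploid individual
    (0, 1 or 2); this encoding is an assumption, not taken from the paper.
    """
    n_alleles = 2 * len(genotypes)
    alt_freq = sum(genotypes) / n_alleles
    return min(alt_freq, 1.0 - alt_freq)


def allelic_chi_squared(case_alt: int, case_ref: int, ctrl_alt: int, ctrl_ref: int) -> float:
    """Pearson chi-squared statistic for a 2x2 allele-count table.

    Uses the closed form n(ad - bc)^2 / ((a+b)(c+d)(a+c)(b+d)).
    """
    a, b, c, d = case_alt, case_ref, ctrl_alt, ctrl_ref
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        return 0.0  # degenerate table: no variation to test
    return n * (a * d - b * c) ** 2 / denominator


print(minor_allele_frequency([0, 1, 2, 0]))   # 3 alt alleles out of 8
print(allelic_chi_squared(20, 0, 0, 20))      # perfectly separated table
print(allelic_chi_squared(10, 10, 10, 10))    # identical case/control counts
```

    A secure version would evaluate the same sums and products over secret-shared inputs, so that no single party sees the raw genotypes.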

  2. Multiplexed DNA sequence capture of mitochondrial genomes using PCR products.

    Directory of Open Access Journals (Sweden)

    Tomislav Maricic

    Full Text Available BACKGROUND: To utilize the power of high-throughput sequencers, target enrichment methods have been developed. The majority of these require reagents and equipment that are only available from commercial vendors and are not suitable for the targets that are a few kilobases in length. METHODOLOGY/PRINCIPAL FINDINGS: We describe a novel and economical method in which custom made long-range PCR products are used to capture complete human mitochondrial genomes from complex DNA mixtures. We use the method to capture 46 complete mitochondrial genomes in parallel and we sequence them on a single lane of an Illumina GA(II) instrument. CONCLUSIONS/SIGNIFICANCE: This method is economical and simple and particularly suitable for targets that can be amplified by PCR and do not contain highly repetitive sequences such as mtDNA. It has applications in population genetics and forensics, as well as studies of ancient DNA.

  3. SVA: software for annotating and visualizing sequenced human genomes

    Science.gov (United States)

    Ge, Dongliang; Ruzzo, Elizabeth K.; Shianna, Kevin V.; He, Min; Pelak, Kimberly; Heinzen, Erin L.; Need, Anna C.; Cirulli, Elizabeth T.; Maia, Jessica M.; Dickson, Samuel P.; Zhu, Mingfu; Singh, Abanish; Allen, Andrew S.; Goldstein, David B.

    2011-01-01

    Summary: Here we present Sequence Variant Analyzer (SVA), a software tool that assigns a predicted biological function to variants identified in next-generation sequencing studies and provides a browser to visualize the variants in their genomic contexts. SVA also provides for flexible interaction with software implementing variant association tests allowing users to consider both the bioinformatic annotation of identified variants and the strength of their associations with studied traits. We illustrate the annotation features of SVA using two simple examples of sequenced genomes that harbor Mendelian mutations. Availability and implementation: Freely available on the web at http://www.svaproject.org. Contact: d.ge@duke.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21624899

  4. Isolation and Characterization of Copia-like Retrotransposons from 12 Sweet Orange (Citrus sinensis Osbeck) Cultivars

    Institute of Scientific and Technical Information of China (English)

    Neng-Guo TAO; Juan XU; Yun-Jiang CHENG; Liu HONG; Wen-Wu GUO; Hua-Lin YI; Xiu-Xin DENG

    2005-01-01

    As the largest class of transposable elements in plant genomes, retrotransposons are thought to be involved in citrus genetic instability and genome evolution, especially in sweet orange, which is prone to bud mutation. In the present study, the presence of copia-like retrotransposons, their heterogeneity, genomic distribution, and transcriptional activities in Citrus were investigated in 12 sweet orange (Citrus sinensis Osbeck) cultivars using a PCR assay designed to detect copia-like reverse transcriptase (RT) sequences. Twelve amplification products from each cultivar were cloned and sequenced. The cloned sequences showed great heterogeneity, except "Dream" navel and "Hamlin", both of which shared the same sequence. Frame shifting, termination, deletion, and substitution accounted for the heterogeneity of RT sequences. Southern blot hybridization using the RT1 clone from the "Cara Cara" navel as a probe showed that multiple copies were integrated throughout the sweet orange genomes, which makes the retrotransposon a potentially effective molecular marker for detecting citrus evolution events and revealing its relationship with bud mutation. No transcriptional activities of the retrotransposon were detected by RT-PCR and Northern analysis in the fruits and leaves of either "Cara Cara" or "Seike" navels.

  5. Downsizing genomic medicine: approaching the ethical complexity of whole-genome sequencing by starting small.

    Science.gov (United States)

    Sharp, Richard R

    2011-03-01

    As we look to a time when whole-genome sequencing is integrated into patient care, it is possible to anticipate a number of ethical challenges that will need to be addressed. The most intractable of these concern informed consent and the responsible management of very large amounts of genetic information. Given the range of possible findings, it remains unclear to what extent it will be possible to obtain meaningful patient consent to genomic testing. Equally unclear is how clinicians will disseminate the enormous volume of genetic information produced by whole-genome sequencing. Toward developing practical strategies for managing these ethical challenges, we propose a research agenda that approaches multiplexed forms of clinical genetic testing as natural laboratories in which to develop best practices for managing the ethical complexities of genomic medicine. PMID:21311340

  6. Complete genome sequence of Actinosynnema mirum type strain (101T)

    Energy Technology Data Exchange (ETDEWEB)

    Land, Miriam; Lapidus, Alla; Mayilraj, Shanmugam; Chen, Feng; Copeland, Alex; Glavina Del Rio, Tijana; Nolan, Matt; Lucas, Susan; Tice, Hope; Cheng, Jan-Fang; Chertkov, Olga; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Rohde, Manfred; Goker, Markus; Pati, Amrita; Ivanova, Natalia; Mavrommatis, Konstantinos; Chen, Amy; Palaniappan, Krishna; Hauser, Loren; Chang, Yun-Juan; Jefferies, Cynthia; Brettin, Thomas; Detter, John C.; Han, Cliff; Chain, Patrick; Tindall, Brian; Bristow, James; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2009-05-20

    Actinosynnema mirum Hasegawa et al. 1978 is the type species of the genus, and is of phylogenetic interest because of its central phylogenetic location in the Actinosynnemataceae, a rapidly growing family within the actinobacterial suborder Pseudonocardineae. A. mirum is characterized by its motile spores borne on synnemata and as a producer of nocardicin antibiotics. It is capable of growing aerobically and under a moderate CO2 atmosphere. The strain is a Gram-positive, aerial and substrate mycelium producing bacterium, originally isolated from a grass blade collected from the Raritan River, New Jersey. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of a member of the family Actinosynnemataceae, and only the second sequence from the actinobacterial suborder Pseudonocardineae. The 8,248,144 bp long single replicon genome with its 7100 protein-coding and 77 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.

  7. Insights into hominid evolution from the gorilla genome sequence.

    Science.gov (United States)

    Scally, Aylwyn; Dutheil, Julien Y; Hillier, LaDeana W; Jordan, Gregory E; Goodhead, Ian; Herrero, Javier; Hobolth, Asger; Lappalainen, Tuuli; Mailund, Thomas; Marques-Bonet, Tomas; McCarthy, Shane; Montgomery, Stephen H; Schwalie, Petra C; Tang, Y Amy; Ward, Michelle C; Xue, Yali; Yngvadottir, Bryndis; Alkan, Can; Andersen, Lars N; Ayub, Qasim; Ball, Edward V; Beal, Kathryn; Bradley, Brenda J; Chen, Yuan; Clee, Chris M; Fitzgerald, Stephen; Graves, Tina A; Gu, Yong; Heath, Paul; Heger, Andreas; Karakoc, Emre; Kolb-Kokocinski, Anja; Laird, Gavin K; Lunter, Gerton; Meader, Stephen; Mort, Matthew; Mullikin, James C; Munch, Kasper; O'Connor, Timothy D; Phillips, Andrew D; Prado-Martinez, Javier; Rogers, Anthony S; Sajjadian, Saba; Schmidt, Dominic; Shaw, Katy; Simpson, Jared T; Stenson, Peter D; Turner, Daniel J; Vigilant, Linda; Vilella, Albert J; Whitener, Weldon; Zhu, Baoli; Cooper, David N; de Jong, Pieter; Dermitzakis, Emmanouil T; Eichler, Evan E; Flicek, Paul; Goldman, Nick; Mundy, Nicholas I; Ning, Zemin; Odom, Duncan T; Ponting, Chris P; Quail, Michael A; Ryder, Oliver A; Searle, Stephen M; Warren, Wesley C; Wilson, Richard K; Schierup, Mikkel H; Rogers, Jane; Tyler-Smith, Chris; Durbin, Richard

    2012-03-01

    Gorillas are humans' closest living relatives after chimpanzees, and are of comparable importance for the study of human origins and evolution. Here we present the assembly and analysis of a genome sequence for the western lowland gorilla, and compare the whole genomes of all extant great ape genera. We propose a synthesis of genetic and fossil evidence consistent with placing the human-chimpanzee and human-chimpanzee-gorilla speciation events at approximately 6 and 10 million years ago. In 30% of the genome, gorilla is closer to human or chimpanzee than the latter are to each other; this is rarer around coding genes, indicating pervasive selection throughout great ape evolution, and has functional consequences in gene expression. A comparison of protein coding genes reveals approximately 500 genes showing accelerated evolution on each of the gorilla, human and chimpanzee lineages, and evidence for parallel acceleration, particularly of genes involved in hearing. We also compare the western and eastern gorilla species, estimating an average sequence divergence time 1.75 million years ago, but with evidence for more recent genetic exchange and a population bottleneck in the eastern species. The use of the genome sequence in these and future analyses will promote a deeper understanding of great ape biology and evolution. PMID:22398555

  8. The impact of next-generation sequencing on genomics

    Institute of Scientific and Technical Information of China (English)

    Jun Zhang; Rod Chiodini; Ahmed Badr; Genfa Zhang

    2011-01-01

    This article reviews basic concepts, general applications, and the potential impact of next-generation sequencing (NGS) technologies on genomics, with particular reference to currently available and possible future platforms and bioinformatics. NGS technologies have demonstrated the capacity to sequence DNA at unprecedented speed, thereby enabling previously unimaginable scientific achievements and novel biological applications. However, the massive data produced by NGS also present a significant challenge for data storage, analysis, and management solutions. Advanced bioinformatic tools are essential for the successful application of NGS technology. As evidenced throughout this review, NGS technologies will have a striking impact on genomic research and the entire biological field. With its ability to tackle the challenges left unsolved by previous genomic technologies, NGS is likely to unravel the complexity of the human genome in terms of genetic variations, some of which may be confined to susceptible loci for some common human conditions. The impact of NGS technologies on genomics will be far reaching and will likely change the field for years to come.

  9. First High-Quality Draft Genome Sequence of Pasteurella multocida Sequence Type 128 Isolated from Infected Bone

    OpenAIRE

    Kavousi, Niloofar; Eng, Wilhelm Wei Han; Lee, Yin Peng; Tan, Lian Huat; Thuraisingham, Ravindran; Yule, Catherine M; Gan, Han Ming

    2016-01-01

    We report here the first high-quality draft genome sequence of Pasteurella multocida sequence type 128, which was isolated from the infected finger bone of an adult female who was bitten by a domestic dog. The draft genome will be a valuable addition to the scarce genomic resources available for P. multocida.

  10. First High-Quality Draft Genome Sequence of Pasteurella multocida Sequence Type 128 Isolated from Infected Bone.

    Science.gov (United States)

    Kavousi, Niloofar; Eng, Wilhelm Wei Han; Lee, Yin Peng; Tan, Lian Huat; Thuraisingham, Ravindran; Yule, Catherine M; Gan, Han Ming

    2016-01-01

    We report here the first high-quality draft genome sequence of Pasteurella multocida sequence type 128, which was isolated from the infected finger bone of an adult female who was bitten by a domestic dog. The draft genome will be a valuable addition to the scarce genomic resources available for P. multocida. PMID:26941132

  11. In silico analysis of phytohormone metabolism and communication pathways in citrus transcriptome

    Directory of Open Access Journals (Sweden)

    Vera Quecini

    2007-01-01

    Full Text Available Plant hormones play a crucial role in integrating endogenous and exogenous signals and in determining developmental responses to form the plant body throughout its life cycle. In citrus species, several economically important processes are controlled by phytohormones, including seed germination, secondary growth, fruit abscission and ripening. Integrative genomics is a powerful tool for linking newly researched organisms, such as tropical woody species, to functional studies already carried out on established model organisms. Based on gene orthology analyses and expression patterns, we searched the Citrus Genome Sequencing Consortium (CitEST) database for Expressed Sequence Tags (EST) consensus sequences sharing similarity to known components of hormone metabolism and signaling pathways in model species. More than 600 homologs of functionally characterized hormone metabolism and signal transduction members from model species were identified in citrus, allowing us to propose a framework for phytohormone signaling mechanisms in citrus. A number of components from hormone-related metabolic pathways were absent in citrus, suggesting the presence of distinct metabolic pathways. Our results demonstrated the power of comparative genomics between model systems and economically important crop species to elucidate several aspects of plant physiology and metabolism.

  12. ReRep: Computational detection of repetitive sequences in genome survey sequences (GSS)

    Directory of Open Access Journals (Sweden)

    Alves-Ferreira Marcelo

    2008-09-01

    Full Text Available Abstract Background: Genome survey sequences (GSS) offer a preliminary global view of a genome since, unlike ESTs, they cover coding as well as non-coding DNA and include repetitive regions of the genome. A more precise estimation of the nature, quantity and variability of repetitive sequences very early in a genome sequencing project is of considerable importance, as such data strongly influence the estimation of genome coverage, library quality and progress in scaffold construction. Also, the elimination of repetitive sequences from the initial assembly process is important to avoid errors and unnecessary complexity. Repetitive sequences are also of interest in a variety of other studies, for instance as molecular markers. Results: We designed and implemented a straightforward pipeline called ReRep, which combines bioinformatics tools for identifying repetitive structures in a GSS dataset. In a case study, we first applied the pipeline to a set of 970 GSSs, sequenced in our laboratory from the human pathogen Leishmania braziliensis, the causative agent of leishmaniosis, an important public health problem in Brazil. We also verified the applicability of ReRep to new sequencing technologies using a set of 454 reads of Escherichia coli. The behaviour of several parameters in the algorithm is evaluated and suggestions are made for tuning of the analysis. Conclusion: The ReRep approach for identification of repetitive elements in GSS datasets proved to be straightforward and efficient. Several potential repetitive sequences were found in a L. braziliensis GSS dataset generated in our laboratory, and further validated by the analysis of a more complete genomic dataset from the EMBL and Sanger Centre databases. ReRep also identified most of the E. coli K12 repeats prior to assembly in an example dataset obtained by automated sequencing using 454 technology. The parameters controlling the algorithm behaved consistently and may be tuned to the properties

  13. Whole-genome sequencing and analysis of the Malaysian cynomolgus macaque (Macaca fascicularis) genome

    OpenAIRE

    Higashino, Atsunori; Sakate, Ryuichi; Kameoka, Yosuke; Takahashi, Ichiro; Hirata, Makoto; Tanuma, Reiko; Masui, Tohru; Yasutomi, Yasuhiro; Osada, Naoki

    2012-01-01

    Background: The genetic background of the cynomolgus macaque (Macaca fascicularis) is made complex by the high genetic diversity, population structure, and gene introgression from the closely related rhesus macaque (Macaca mulatta). Herein we report the whole-genome sequence of a Malaysian cynomolgus macaque male with more than 40-fold coverage, which was determined using a resequencing method based on the Indian rhesus macaque genome. Results: We identified approximately 9.7 million single nuc...

  14. Genomic organization and sequence analysis of the vomeronasal receptor V2R genes in mouse genome

    Institute of Scientific and Technical Information of China (English)

    YANG Hui; Zhang YaPing

    2007-01-01

    Two multigene superfamilies, named V1R and V2R, encoding seven-transmembrane-domain G-protein coupled receptors (GPCRs) have been identified as pheromone receptors in mammals. Three V2R gene families have been described in mouse and rat. Here we screened the updated mouse genome sequence database and retrieved 63 putative functional V2R genes, including three newly identified genes that form an additional new family. We describe the genomic organization of these genes and characterize the conservation of mouse V2R protein sequences. The genomic and sequence information described here is useful as part of the evidence for inferring the functional domains of V2Rs and should aid future functional studies.

  15. Draft genome sequence of Acidithiobacillus ferrooxidans YQH-1

    Directory of Open Access Journals (Sweden)

    Lei Yan

    2015-12-01

    Full Text Available Acidithiobacillus ferrooxidans YQH-1 is a moderate acidophilic bacterium isolated from a river in a volcano of Northeast China. Here, we describe the draft genome of strain YQH-1, which was assembled into 123 contigs containing 3,111,222 bp with a G + C content of 58.63%. A large number of genes related to carbon dioxide fixation, dinitrogen fixation, pH tolerance, heavy metal detoxification, and oxidative stress defense were detected. The genome sequence can be accessed at DDBJ/EMBL/GenBank under the accession no. LJBT00000000.
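
    Summary figures of the kind reported here (contig count, total length, G + C content) can be derived from an assembly as in the following sketch. The function name and the toy contigs are assumptions for illustration; the actual YQH-1 contigs would have to be retrieved from the cited accession.

```python
from typing import Dict, Iterable


def assembly_summary(contigs: Iterable[str]) -> Dict[str, float]:
    """Return contig count, total length in bp, and G + C content in percent.

    Only unambiguous A/C/G/T bases enter the G + C denominator; treating
    N and other ambiguity codes this way is an assumption of this sketch.
    """
    n_contigs = 0
    total_bp = 0
    gc = 0
    acgt = 0
    for contig in contigs:
        seq = contig.upper()
        n_contigs += 1
        total_bp += len(seq)
        gc += seq.count("G") + seq.count("C")
        acgt += sum(seq.count(base) for base in "ACGT")
    gc_percent = 100.0 * gc / acgt if acgt else 0.0
    return {"contigs": n_contigs, "total_bp": total_bp, "gc_percent": gc_percent}


toy_contigs = ["GGCC", "ATGC"]  # placeholder sequences, not real data
print(assembly_summary(toy_contigs))
```

    For the toy input, six of the eight bases are G or C, so the reported G + C content is 75%.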

  16. Arrangement of repetitive sequences in the genome of herpesvirus Sylvilagus.

    Science.gov (United States)

    Medveczky, M M; Geck, P; Clarke, C; Byrnes, J; Sullivan, J L; Medveczky, P G

    1989-02-01

    Herpesvirus sylvilagus is a lymphotropic (type gamma) herpesvirus of cottontail rabbits (Sylvilagus floridanus). Analysis of virion DNA of herpesvirus sylvilagus has revealed that the genome consists of one stretch of about 120 kilobase pairs of internal, unique DNA flanked by a variable number of 553-base-pair tandem repeats. The G + C content of the repetitive DNA is extremely high (83%), as determined by sequencing. The organization of the herpesvirus sylvilagus genome is, therefore, similar to that of the primate lymphotropic viruses herpesvirus saimiri and herpesvirus ateles. PMID:2911114

  17. Complete genome sequence of Croceibacter bacteriophage P2559S.

    Science.gov (United States)

    Kang, Ilnam; Kang, Dongmin; Cho, Jang-Cheon

    2012-08-01

    Croceibacter atlanticus HTCC2559(T), a marine bacterium isolated from the Sargasso Sea, is a phylogenetically unique member of the family Flavobacteriaceae. Strain HTCC2559(T) possesses genes related to interaction with primary producers, which makes studies on bacteriophages infecting the strain interesting. Here we report the genome sequence of bacteriophage P2559S, which was isolated off the coast of the Republic of Korea and lytically infects HTCC2559(T). Many genes predicted in the P2559S genome had their homologs in Bacteroides phages. PMID:22843867

  18. The complete chloroplast genome sequence of Clematis terniflora DC. (Ranunculaceae).

    Science.gov (United States)

    Li, Mengzhu; Yang, Bingxian; Chen, Qinyi; Zhu, Wei; Ma, Ji; Tian, Jingkui

    2016-07-01

    Clematis terniflora DC. is an important medicinal plant used in the treatment of inflammatory symptoms related to respiratory and urinary systems. In this study, we found that the complete cp genome of C. terniflora DC. is 159,528 bp. The phylogenetic analysis of 32 taxa showed a strong sister relationship with Ranunculus macranthus, which also strongly supports the position of Ranunculales. The complete cp genome sequence of Clematis terniflora DC. reported here has the potential to advance population and phylogenetic studies of this medicinal plant. PMID:25865739

  19. The complete chloroplast genome sequence of Hibiscus syriacus.

    Science.gov (United States)

    Kwon, Hae-Yun; Kim, Joon-Hyeok; Kim, Sea-Hyun; Park, Ji-Min; Lee, Hyoshin

    2016-09-01

    The complete chloroplast genome sequence of Hibiscus syriacus L. is presented in this study. The genome is 161 019 bp in length, with a typical circular structure containing a pair of inverted repeats of 25 745 bp each, separated by a large single-copy region of 89 698 bp and a small single-copy region of 19 831 bp. The overall GC content is 36.8%. One hundred and fourteen genes were annotated, including 81 protein-coding genes, 4 ribosomal RNA genes and 29 transfer RNA genes. PMID:26357910
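
    The figures in this record are internally consistent, which a short check makes explicit: a quadripartite chloroplast genome has one large single-copy region, one small single-copy region, and two copies of the inverted repeat. The variable and function names below are illustrative; the numbers are taken from the abstract.

```python
LSC_BP = 89_698   # large single-copy region
SSC_BP = 19_831   # small single-copy region
IR_BP = 25_745    # one inverted repeat; the genome carries two copies


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a circular genome with LSC, SSC and a pair of IRs."""
    return lsc + ssc + 2 * ir


genome_bp = quadripartite_length(LSC_BP, SSC_BP, IR_BP)
gene_total = 81 + 4 + 29  # protein-coding + rRNA + tRNA genes

print(genome_bp)   # 161019, matching the reported genome length
print(gene_total)  # 114, matching the reported gene count
```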

  20. Phytophthora Genome Sequences Uncover Evolutionary Origins and Mechanisms of Pathogenesis

    Energy Technology Data Exchange (ETDEWEB)

    Tyler, Brett M.; Tripathy, Sucheta; Zhang, Xuemin; Dehal, Paramvir; Jiang, Rays H. Y.; Aerts, Andrea; Arredondo, Felipe D.; Baxter, Laura; Bensasson, Douda; Beynon, Jim L.; Chapman, Jarrod; Damasceno, Cynthia M. B.; Dorrance, Anne E.; Dou, Daolong; Dickerman, Allan W.; Dubchak, Inna L.; Garbelotto, Matteo; Gijzen, Mark; Gordon, Stuart G.; Govers, Francine; Grunwald, Niklaus J.; Huang, Wayne; Ivors, Kelly L.; Jones, Richard W.; Kamoun, Sophien; Krampis, Konstantinos; Lamour, Kurt H.; Lee, Mi-Kyung; McDonald, W. Hayes; Medina, Monica; Meijer, Harold J. G.; Nordberg, Erik K.; Maclean, Donald J.; Ospina-Giraldo, Manuel D.; Morris, Paul F.; Phuntumart, Vipaporn; Putnam, Nicholas J.; Rash, Sam; Rose, Jocelyn K. C.; Sakihama, Yasuko; Salamov, Asaf A.; Savidor, Alon; Scheuring, Chantel F.; Smith, Brian M.; Sobral, Bruno W. S.; Terry, Astrid; Torto-Alalibo, Trudy A.; Win, Joe; Xu, Zhanyou; Zhang, Hongbin; Grigoriev, Igor V.; Rokhsar, Daniel S.; Boore, Jeffrey L.

    2006-04-17

    Draft genome sequences have been determined for the soybean pathogen Phytophthora sojae and the sudden oak death pathogen Phytophthora ramorum. Oömycetes such as these Phytophthora species share the kingdom Stramenopila with photosynthetic algae such as diatoms, and the presence of many Phytophthora genes of probable phototroph origin supports a photosynthetic ancestry for the stramenopiles. Comparison of the two species' genomes reveals a rapid expansion and diversification of many protein families associated with plant infection such as hydrolases, ABC transporters, protein toxins, proteinase inhibitors, and, in particular, a superfamily of 700 proteins with similarity to known oömycete avirulence genes.