WorldWideScience

Sample records for pyrosequencing based transcriptome

  1. 454 pyrosequencing based transcriptome analysis of Zygaena filipendulae with focus on genes involved in biosynthesis of cyanogenic glucosides.

    Science.gov (United States)

    Zagrobelny, Mika; Scheibye-Alsing, Karsten; Jensen, Niels Bjerg; Møller, Birger Lindberg; Gorodkin, Jan; Bak, Søren

    2009-12-02

    An essential driving component in the co-evolution of plants and insects is the ability to produce and handle bioactive compounds. Plants produce bioactive natural products for defense, but some insects detoxify and/or sequester the compounds, opening up for new niches with fewer competitors. To study the molecular mechanism behind the co-adaption in plant-insect interactions, we have investigated the interactions between Lotus corniculatus and Zygaena filipendulae. They both contain cyanogenic glucosides which liberate toxic hydrogen cyanide upon breakdown. Moths belonging to the Zygaena family are the only insects known, able to carry out both de novo biosynthesis and sequestration of the same cyanogenic glucosides as those from their feed plants. The biosynthetic pathway for cyanogenic glucoside biosynthesis in Z. filipendulae proceeds using the same intermediates as in the well known pathway from plants, but none of the enzymes responsible have been identified. A genomics strategy founded on 454 pyrosequencing of the Z. filipendulae transcriptome was undertaken to identify some of these enzymes in Z. filipendulae. Comparisons of the Z. filipendulae transcriptome with the sequenced genomes of Bombyx mori, Drosophila melanogaster, Tribolium castaneum, Apis mellifera and Anopheles gambiae indicate a high coverage of the Z. filipendulae transcriptome. 11% of the Z. filipendulae transcriptome sequences were assigned to Gene Ontology categories. Candidate genes for enzymes functioning in the biosynthesis of cyanogenic glucosides (cytochrome P450 and family 1 glycosyltransferases) were identified based on sequence length, number of copies and presence/absence of close homologs in D. melanogaster, B. mori and the cyanogenic butterfly Heliconius. Examination of biased codon usage, GC content and selection on gene candidates support the notion of cyanogenesis as an "old" trait within Ditrysia, as well as its origins being convergent between plants and insects

  2. 454 pyrosequencing based transcriptome analysis of Zygaena filipendulae with focus on genes involved in biosynthesis of cyanogenic glucosides

    Directory of Open Access Journals (Sweden)

    Jensen Niels

    2009-12-01

    Full Text Available Abstract Background An essential driving component in the co-evolution of plants and insects is the ability to produce and handle bioactive compounds. Plants produce bioactive natural products for defense, but some insects detoxify and/or sequester the compounds, opening up for new niches with fewer competitors. To study the molecular mechanism behind the co-adaption in plant-insect interactions, we have investigated the interactions between Lotus corniculatus and Zygaena filipendulae. They both contain cyanogenic glucosides which liberate toxic hydrogen cyanide upon breakdown. Moths belonging to the Zygaena family are the only insects known, able to carry out both de novo biosynthesis and sequestration of the same cyanogenic glucosides as those from their feed plants. The biosynthetic pathway for cyanogenic glucoside biosynthesis in Z. filipendulae proceeds using the same intermediates as in the well known pathway from plants, but none of the enzymes responsible have been identified. A genomics strategy founded on 454 pyrosequencing of the Z. filipendulae transcriptome was undertaken to identify some of these enzymes in Z. filipendulae. Results Comparisons of the Z. filipendulae transcriptome with the sequenced genomes of Bombyx mori, Drosophila melanogaster, Tribolium castaneum, Apis mellifera and Anopheles gambiae indicate a high coverage of the Z. filipendulae transcriptome. 11% of the Z. filipendulae transcriptome sequences were assigned to Gene Ontology categories. Candidate genes for enzymes functioning in the biosynthesis of cyanogenic glucosides (cytochrome P450 and family 1 glycosyltransferases were identified based on sequence length, number of copies and presence/absence of close homologs in D. melanogaster, B. mori and the cyanogenic butterfly Heliconius. Examination of biased codon usage, GC content and selection on gene candidates support the notion of cyanogenesis as an "old" trait within Ditrysia, as well as its origins being

  3. Pyrosequencing the Manduca sexta larval midgut transcriptome: messages for digestion, detoxification and defence.

    Science.gov (United States)

    Pauchet, Y; Wilkinson, P; Vogel, H; Nelson, D R; Reynolds, S E; Heckel, D G; ffrench-Constant, R H

    2010-02-01

    The tobacco hornworm Manduca sexta is an important model for insect physiology but genomic and transcriptomic data are currently lacking. Following a recent pyrosequencing study generating immune related expressed sequence tags (ESTs), here we use this new technology to define the M. sexta larval midgut transcriptome. We generated over 387,000 midgut ESTs, using a combination of Sanger and 454 sequencing, and classified predicted proteins into those involved in digestion, detoxification and immunity. In many cases the depth of 454 pyrosequencing coverage allowed us to define the entire cDNA sequence of a particular gene. Many new M. sexta genes are described including up to 36 new cytochrome P450s, some of which have been implicated in the metabolism of host plant-derived nicotine. New lepidopteran gene families such as the beta-fructofuranosidases, previously thought to be restricted to Bombyx mori, are also described. An unexpectedly high number of ESTs were involved in immunity, for example 39 contigs encoding serpins, and the increasingly appreciated role of the midgut in insect immunity is discussed. Similar studies of other tissues will allow for a tissue by tissue description of the M. sexta transcriptome and will form an essential complimentary step on the road to genome sequencing and annotation.

  4. Pyrosequencing the Bemisia tabaci transcriptome reveals a highly diverse bacterial community and a robust system for insecticide resistance.

    Directory of Open Access Journals (Sweden)

    Wen Xie

    Full Text Available BACKGROUND: Bemisia tabaci (Gennadius is a phloem-feeding insect poised to become one of the major insect pests in open field and greenhouse production systems throughout the world. The high level of resistance to insecticides is a main factor that hinders continued use of insecticides for suppression of B. tabaci. Despite its prevalence, little is known about B. tabaci at the genome level. To fill this gap, an invasive B. tabaci B biotype was subjected to pyrosequencing-based transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes. METHODOLOGY AND PRINCIPAL FINDINGS: Using Roche 454 pyrosequencing, 857,205 reads containing approximately 340 megabases were obtained from the B. tabaci transcriptome. De novo assembly generated 178,669 unigenes including 30,980 from insects, 17,881 from bacteria, and 129,808 from the nohit. A total of 50,835 (28.45% unigenes showed similarity to the non-redundant database in GenBank with a cut-off E-value of 10-5. Among them, 40,611 unigenes were assigned to one or more GO terms and 6,917 unigenes were assigned to 288 known pathways. De novo metatranscriptome analysis revealed highly diverse bacterial symbionts in B. tabaci, and demonstrated the host-symbiont cooperation in amino acid production. In-depth transcriptome analysis indentified putative molecular markers, and genes potentially involved in insecticide resistance and nutrient digestion. The utility of this transcriptome was validated by a thiamethoxam resistance study, in which annotated cytochrome P450 genes were significantly overexpressed in the resistant B. tabaci in comparison to its susceptible counterparts. CONCLUSIONS: This transcriptome/metatranscriptome analysis sheds light on the molecular understanding of symbiosis and insecticide resistance in an agriculturally important phloem-feeding insect pest, and lays the foundation for future functional genomics research of the

  5. Profiling the venom gland transcriptomes of Costa Rican snakes by 454 pyrosequencing

    Directory of Open Access Journals (Sweden)

    Sanz Libia

    2011-05-01

    Full Text Available Abstract Background A long term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of genome sequence, transcriptomes represent also valuable searchable databases for proteomic projects. Results The venom gland transcriptomes of 8 Costa Rican taxa from 5 genera (Crotalus, Bothrops, Atropoides, Cerrophidion, and Bothriechis of pitvipers were investigated using high-throughput 454 pyrosequencing. 100,394 out of 330,010 masked reads produced significant hits in the available databases. 5.165,220 nucleotides (8.27% were masked by RepeatMasker, the vast majority of which corresponding to class I (retroelements and class II (DNA transposons mobile elements. BLAST hits included 79,991 matches to entries of the taxonomic suborder Serpentes, of which 62,433 displayed similarity to documented venom proteins. Strong discrepancies between the transcriptome-computed and the proteome-gathered toxin compositions were obvious at first sight. Although the reasons underlaying this discrepancy are elusive, since no clear trend within or between species is apparent, the data indicate that individual mRNA species may be translationally controlled in a species-dependent manner. The minimum number of genes from each toxin family transcribed into the venom gland transcriptome of each species was calculated from multiple alignments of reads matched to a full-length reference sequence of each toxin family. Reads encoding ORF regions of Kazal-type inhibitor-like proteins were uniquely found in Bothriechis schlegelii and B. lateralis transcriptomes, suggesting a genus-specific recruitment event during the early-Middle Miocene. A transcriptome-based cladogram supports the large

  6. Characterization of the Zoarces viviparus liver transcriptome using massively parallel pyrosequencing

    Directory of Open Access Journals (Sweden)

    Asker Noomi

    2009-07-01

    Full Text Available Abstract Background The teleost Zoarces viviparus (eelpout lives along the coasts of Northern Europe and has long been an established model organism for marine ecology and environmental monitoring. The scarce information about this species genome has however restrained the use of efficient molecular-level assays, such as gene expression microarrays. Results In the present study we present the first comprehensive characterization of the Zoarces viviparus liver transcriptome. From 400,000 reads generated by massively parallel pyrosequencing, more than 50,000 pieces of putative transcripts were assembled, annotated and functionally classified. The data was estimated to cover roughly 40% of the total transcriptome and homologues for about half of the genes of Gasterosteus aculeatus (stickleback were identified. The sequence data was consequently used to design an oligonucleotide microarray for large-scale gene expression analysis. Conclusion Our results show that one run using a Genome Sequencer FLX from 454 Life Science/Roche generates enough genomic information for adequate de novo assembly of a large number of genes in a higher vertebrate. The generated sequence data, including the validated microarray probes, are publicly available to promote genome-wide research in Zoarces viviparus.

  7. Identification and analysis of common bean (Phaseolus vulgaris L. transcriptomes by massively parallel pyrosequencing

    Directory of Open Access Journals (Sweden)

    Thimmapuram Jyothi

    2011-10-01

    Full Text Available Abstract Background Common bean (Phaseolus vulgaris is the most important food legume in the world. Although this crop is very important to both the developed and developing world as a means of dietary protein supply, resources available in common bean are limited. Global transcriptome analysis is important to better understand gene expression, genetic variation, and gene structure annotation in addition to other important features. However, the number and description of common bean sequences are very limited, which greatly inhibits genome and transcriptome research. Here we used 454 pyrosequencing to obtain a substantial transcriptome dataset for common bean. Results We obtained 1,692,972 reads with an average read length of 207 nucleotides (nt. These reads were assembled into 59,295 unigenes including 39,572 contigs and 19,723 singletons, in addition to 35,328 singletons less than 100 bp. Comparing the unigenes to common bean ESTs deposited in GenBank, we found that 53.40% or 31,664 of these unigenes had no matches to this dataset and can be considered as new common bean transcripts. Functional annotation of the unigenes carried out by Gene Ontology assignments from hits to Arabidopsis and soybean indicated coverage of a broad range of GO categories. The common bean unigenes were also compared to the bean bacterial artificial chromosome (BAC end sequences, and a total of 21% of the unigenes (12,724 including 9,199 contigs and 3,256 singletons match to the 8,823 BAC-end sequences. In addition, a large number of simple sequence repeats (SSRs and transcription factors were also identified in this study. Conclusions This work provides the first large scale identification of the common bean transcriptome derived by 454 pyrosequencing. This research has resulted in a 150% increase in the number of Phaseolus vulgaris ESTs. The dataset obtained through this analysis will provide a platform for functional genomics in common bean and related legumes and

  8. Pyrosequencing the Midgut Transcriptome of the Banana Weevil Cosmopolites sordidus (Germar (Coleoptera: Curculionidae Reveals Multiple Protease-Like Transcripts.

    Directory of Open Access Journals (Sweden)

    Arnubio Valencia

    Full Text Available The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest.

  9. Pyrosequencing the Midgut Transcriptome of the Banana Weevil Cosmopolites sordidus (Germar) (Coleoptera: Curculionidae) Reveals Multiple Protease-Like Transcripts.

    Science.gov (United States)

    Valencia, Arnubio; Wang, Haichuan; Soto, Alberto; Aristizabal, Manuel; Arboleda, Jorge W; Eyun, Seong-Il; Noriega, Daniel D; Siegfried, Blair

    2016-01-01

    The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest.

  10. Pyrosequencing the transcriptome of the greenhouse whitefly, Trialeurodes vaporariorum reveals multiple transcripts encoding insecticide targets and detoxifying enzymes

    Directory of Open Access Journals (Sweden)

    Gorman Kevin

    2011-01-01

    Full Text Available Abstract Background The whitefly Trialeurodes vaporariorum is an economically important crop pest in temperate regions that has developed resistance to most classes of insecticides. However, the molecular mechanisms underlying resistance have not been characterised and, to date, progress has been hampered by a lack of nucleotide sequence data for this species. Here, we use pyrosequencing on the Roche 454-FLX platform to produce a substantial and annotated EST dataset. This 'unigene set' will form a critical reference point for quantitation of over-expressed messages via digital transcriptomics. Results Pyrosequencing produced around a million sequencing reads that assembled into 54,748 contigs, with an average length of 965 bp, representing a dramatic expansion of existing cDNA sequences available for T. vaporariorum (only 43 entries in GenBank at the time of this publication. BLAST searching of non-redundant databases returned 20,333 significant matches and those gene families potentially encoding gene products involved in insecticide resistance were manually curated and annotated. These include, enzymes potentially involved in the detoxification of xenobiotics and those encoding the targets of the major chemical classes of insecticides. A total of 57 P450s, 17 GSTs and 27 CCEs were identified along with 30 contigs encoding the target proteins of six different insecticide classes. Conclusion Here, we have developed new transcriptomic resources for T. vaporariorum. These include a substantial and annotated EST dataset that will serve the community studying this important crop pest and will elucidate further the molecular mechanisms underlying insecticide resistance.

  11. Comparative transcriptomic analysis of two closely related ground beetle species with marked genital divergence using pyrosequencing.

    Science.gov (United States)

    Fujimaki, Kotaro; Fujisawa, Tomochika; Yazawa, Shigenobu; Nishimura, Osamu; Sota, Teiji

    2014-09-01

    Ground beetles of the subgenus Ohomopterus (genus Carabus) show marked divergence in species-specific male and female genital morphologies, which contributes to reproductive isolation among species. Characterizing the genetic basis of species-specific genital morphology is essential for understanding their diversification, but genomic information on Ohomopterus is not yet available. We analyzed mRNA extracted from abdominal sections of the last instar larvae and pupae of two sister species, Carabus (Ohomopterus) iwawakianus and C. (O.) uenoi, which show marked differences in genital morphology, to compare transcriptomic profiles using Roche 454 pyrosequencing. We obtained 1,608,572 high-quality reads and assembled them into 176,278 unique sequences, of which 66,049 sequences were combined into 12,662 clusters. Differential expression analyses for sexed pupae suggested that four and five clusters were differentially expressed between species for males and females, respectively. We also identified orthologous sequences of genes involved in genital development in Drosophila, which potentially affect genital development and species-specific genital morphology in Ohomopterus. This study provides the first large transcriptomic data set for a morphologically diversified beetle group, which can facilitate future studies on the genetic basis of species-specific genitalia.

  12. Transcriptome sequencing and characterization for the sea cucumber Apostichopus japonicus (Selenka, 1867.

    Directory of Open Access Journals (Sweden)

    Huixia Du

    Full Text Available BACKGROUND: Sea cucumbers are a special group of marine invertebrates. They occupy a taxonomic position that is believed to be important for understanding the origin and evolution of deuterostomes. Some of them such as Apostichopus japonicus represent commercially important aquaculture species in Asian countries. Many efforts have been devoted to increasing the number of expressed sequence tags (ESTs for A. japonicus, but a comprehensive characterization of its transcriptome remains lacking. Here, we performed the large-scale transcriptome profiling and characterization by pyrosequencing diverse cDNA libraries from A. japonicus. RESULTS: In total, 1,061,078 reads were obtained by 454 sequencing of eight cDNA libraries representing different developmental stages and adult tissues in A. japonicus. These reads were assembled into 29,666 isotigs, which were further clustered into 21,071 isogroups. Nearly 40% of the isogroups showed significant matches to known proteins based on sequence similarity. Gene ontology (GO and KEGG pathway analyses recovered diverse biological functions and processes. Candidate genes that were potentially involved in aestivation were identified. Transcriptome comparison with the sea urchin Strongylocentrotus purpuratus revealed similar patterns of GO term representation. In addition, 4,882 putative orthologous genes were identified, of which 202 were not present in the non-echinoderm organisms. More than 700 simple sequence repeats (SSRs and 54,000 single nucleotide polymorphisms (SNPs were detected in the A. japonicus transcriptome. CONCLUSION: Pyrosequencing was proven to be efficient in rapidly identifying a large set of genes for the sea cucumber A. japonicus. Through the large-scale transcriptome sequencing as well as public EST data integration, we performed a comprehensive characterization of the A. japonicus transcriptome and identified candidate aestivation-related genes. A large number of potential genetic

  13. Comparing de novo assemblers for 454 transcriptome data.

    Science.gov (United States)

    Kumar, Sujai; Blaxter, Mark L

    2010-10-16

    Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base) reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcriptome assembly projects use only one program for assembling 454 pyrosequencing reads, but there is no evidence that the programs used to date are optimal. We have carried out a systematic comparison of five assemblers (CAP3, MIRA, Newbler, SeqMan and CLC) to establish best practices for transcriptome assemblies, using a new dataset from the parasitic nematode Litomosoides sigmodontis. Although no single assembler performed best on all our criteria, Newbler 2.5 gave longer contigs, better alignments to some reference sequences, and was fast and easy to use. SeqMan assemblies performed best on the criterion of recapitulating known transcripts, and had more novel sequence than the other assemblers, but generated an excess of small, redundant contigs. The remaining assemblers all performed almost as well, with the exception of Newbler 2.3 (the version currently used by most assembly projects), which generated assemblies that had significantly lower total length. As different assemblers use different underlying algorithms to generate contigs, we also explored merging of assemblies and found that the merged datasets not only aligned better to reference sequences than individual assemblies, but were also more consistent in the number and size of contigs. Transcriptome assemblies are smaller than genome assemblies and thus should be more computationally tractable, but are often harder because individual contigs can have highly variable read coverage. Comparing single assemblers, Newbler 2.5 performed best on our trial data set, but other assemblers were closely comparable. Combining differently optimal assemblies from different programs however gave a more credible

  14. Comparing de novo assemblers for 454 transcriptome data

    Directory of Open Access Journals (Sweden)

    Blaxter Mark L

    2010-10-01

    Full Text Available Abstract Background Roche 454 pyrosequencing has become a method of choice for generating transcriptome data from non-model organisms. Once the tens to hundreds of thousands of short (250-450 base reads have been produced, it is important to correctly assemble these to estimate the sequence of all the transcripts. Most transcriptome assembly projects use only one program for assembling 454 pyrosequencing reads, but there is no evidence that the programs used to date are optimal. We have carried out a systematic comparison of five assemblers (CAP3, MIRA, Newbler, SeqMan and CLC to establish best practices for transcriptome assemblies, using a new dataset from the parasitic nematode Litomosoides sigmodontis. Results Although no single assembler performed best on all our criteria, Newbler 2.5 gave longer contigs, better alignments to some reference sequences, and was fast and easy to use. SeqMan assemblies performed best on the criterion of recapitulating known transcripts, and had more novel sequence than the other assemblers, but generated an excess of small, redundant contigs. The remaining assemblers all performed almost as well, with the exception of Newbler 2.3 (the version currently used by most assembly projects, which generated assemblies that had significantly lower total length. As different assemblers use different underlying algorithms to generate contigs, we also explored merging of assemblies and found that the merged datasets not only aligned better to reference sequences than individual assemblies, but were also more consistent in the number and size of contigs. Conclusions Transcriptome assemblies are smaller than genome assemblies and thus should be more computationally tractable, but are often harder because individual contigs can have highly variable read coverage. Comparing single assemblers, Newbler 2.5 performed best on our trial data set, but other assemblers were closely comparable. Combining differently optimal assemblies

  15. Transcriptomic analysis of grain amaranth (Amaranthus hypochondriacus using 454 pyrosequencing: comparison with A. tuberculatus, expression profiling in stems and in response to biotic and abiotic stress

    Directory of Open Access Journals (Sweden)

    Vargas-Ortiz Erandi

    2011-07-01

    Full Text Available Abstract Background Amaranthus hypochondriacus, a grain amaranth, is a C4 plant noted by its ability to tolerate stressful conditions and produce highly nutritious seeds. These possess an optimal amino acid balance and constitute a rich source of health-promoting peptides. Although several recent studies, mostly involving subtractive hybridization strategies, have contributed to increase the relatively low number of grain amaranth expressed sequence tags (ESTs, transcriptomic information of this species remains limited, particularly regarding tissue-specific and biotic stress-related genes. Thus, a large scale transcriptome analysis was performed to generate stem- and (abiotic stress-responsive gene expression profiles in grain amaranth. Results A total of 2,700,168 raw reads were obtained from six 454 pyrosequencing runs, which were assembled into 21,207 high quality sequences (20,408 isotigs + 799 contigs. The average sequence length was 1,064 bp and 930 bp for isotigs and contigs, respectively. Only 5,113 singletons were recovered after quality control. Contigs/isotigs were further incorporated into 15,667 isogroups. All unique sequences were queried against the nr, TAIR, UniRef100, UniRef50 and Amaranthaceae EST databases for annotation. Functional GO annotation was performed with all contigs/isotigs that produced significant hits with the TAIR database. Only 8,260 sequences were found to be homologous when the transcriptomes of A. tuberculatus and A. hypochondriacus were compared, most of which were associated with basic house-keeping processes. Digital expression analysis identified 1,971 differentially expressed genes in response to at least one of four stress treatments tested. These included several multiple-stress-inducible genes that could represent potential candidates for use in the engineering of stress-resistant plants. The transcriptomic data generated from pigmented stems shared similarity with findings reported in developing

  16. Transcriptome sequencing and comparative transcriptome analysis of the scleroglucan producer Sclerotium rolfsii

    Directory of Open Access Journals (Sweden)

    Stahl Ulf

    2010-05-01

    Full Text Available Abstract Background The plant pathogenic basidiomycete Sclerotium rolfsii produces the industrially exploited exopolysaccharide scleroglucan, a polymer that consists of (1 → 3-β-linked glucose with a (1 → 6-β-glycosyl branch on every third unit. Although the physicochemical properties of scleroglucan are well understood, almost nothing is known about the genetics of scleroglucan biosynthesis. Similarly, the biosynthetic pathway of oxalate, the main by-product during scleroglucan production, has not been elucidated yet. In order to provide a basis for genetic and metabolic engineering approaches, we studied scleroglucan and oxalate biosynthesis in S. rolfsii using different transcriptomic approaches. Results Two S. rolfsii transcriptomes obtained from scleroglucan-producing and scleroglucan-nonproducing conditions were pooled and sequenced using the 454 pyrosequencing technique yielding ~350,000 reads. These could be assembled into 21,937 contigs and 171,833 singletons, for which 6,951 had significant matches in public protein data bases. Sequence data were used to obtain first insights into the genomics of scleroglucan and oxalate production and to predict putative proteins involved in the synthesis of both metabolites. Using comparative transcriptomics, namely Agilent microarray hybridization and suppression subtractive hybridization, we identified ~800 unigenes which are differently expressed under scleroglucan-producing and non-producing conditions. From these, candidate genes were identified which could represent potential leads for targeted modification of the S. rolfsii metabolism for increased scleroglucan yields. Conclusions The results presented in this paper provide for the first time genomic and transcriptomic data about S. rolfsii and demonstrate the power and usefulness of combined transcriptome sequencing and comparative microarray analysis. The data obtained allowed us to predict the biosynthetic pathways of scleroglucan and

  17. Antarctic krill 454 pyrosequencing reveals chaperone and stress transcriptome.

    Directory of Open Access Journals (Sweden)

    Melody S Clark

    Full Text Available BACKGROUND: The Antarctic krill Euphausia superba is a keystone species in the Antarctic food chain. Not only is it a significant grazer of phytoplankton, but it is also a major food item for charismatic megafauna such as whales and seals and an important Southern Ocean fisheries crop. Ecological data suggest that this species is being affected by climate change and this will have considerable consequences for the balance of the Southern Ocean ecosystem. Hence, understanding how this organism functions is a priority area and will provide fundamental data for life history studies, energy budget calculations and food web models. METHODOLOGY/PRINCIPAL FINDINGS: The assembly of the 454 transcriptome of E. superba resulted in 22,177 contigs with an average size of 492bp (ranging between 137 and 8515bp. In depth analysis of the data revealed an extensive catalogue of the cellular chaperone systems and the major antioxidant proteins. Full length sequences were characterised for the chaperones HSP70, HSP90 and the super-oxide dismutase antioxidants, with the discovery of potentially novel duplications of these genes. The sequence data contained 41,470 microsatellites and 17,776 Single Nucleotide Polymorphisms (SNPs/INDELS, providing a resource for population and also gene function studies. CONCLUSIONS: This paper details the first 454 generated data for a pelagic Antarctic species or any pelagic crustacean globally. The classical "stress proteins", such as HSP70, HSP90, ferritin and GST were all highly expressed. These genes were shown to be over expressed in the transcriptomes of Antarctic notothenioid fish and hypothesized as adaptations to living in the cold, with the associated problems of decreased protein folding efficiency and increased vulnerability to damage by reactive oxygen species. Hence, these data will provide a major resource for future physiological work on krill, but in particular a suite of "stress" genes for studies understanding

  18. Sequencing and analysis of the Mediterranean amphioxus (Branchiostoma lanceolatum transcriptome.

    Directory of Open Access Journals (Sweden)

    Silvan Oulion

    Full Text Available BACKGROUND: The basally divergent phylogenetic position of amphioxus (Cephalochordata, as well as its conserved morphology, development and genetics, make it the best proxy for the chordate ancestor. Particularly, studies using the amphioxus model help our understanding of vertebrate evolution and development. Thus, interest for the amphioxus model led to the characterization of both the transcriptome and complete genome sequence of the American species, Branchiostoma floridae. However, recent technical improvements allowing induction of spawning in the laboratory during the breeding season on a daily basis with the Mediterranean species Branchiostoma lanceolatum have encouraged European Evo-Devo researchers to adopt this species as a model even though no genomic or transcriptomic data have been available. To fill this need we used the pyrosequencing method to characterize the B. lanceolatum transcriptome and then compared our results with the published transcriptome of B. floridae. RESULTS: Starting with total RNA from nine different developmental stages of B. lanceolatum, a normalized cDNA library was constructed and sequenced on Roche GS FLX (Titanium mode. Around 1.4 million of reads were produced and assembled into 70,530 contigs (average length of 490 bp. Overall 37% of the assembled sequences were annotated by BlastX and their Gene Ontology terms were determined. These results were then compared to genomic and transcriptomic data of B. floridae to assess similarities and specificities of each species. CONCLUSION: We obtained a high-quality amphioxus (B. lanceolatum reference transcriptome using a high throughput sequencing approach. We found that 83% of the predicted genes in the B. floridae complete genome sequence are also found in the B. lanceolatum transcriptome, while only 41% were found in the B. floridae transcriptome obtained with traditional Sanger based sequencing. Therefore, given the high degree of sequence conservation

  19. A comparison of parallel pyrosequencing and sanger clone-based sequencing and its impact on the characterization of the genetic diversity of HIV-1.

    Directory of Open Access Journals (Sweden)

    Binhua Liang

    Full Text Available BACKGROUND: Pyrosequencing technology has the potential to rapidly sequence HIV-1 viral quasispecies without requiring the traditional approach of cloning. In this study, we investigated the utility of ultra-deep pyrosequencing to characterize genetic diversity of the HIV-1 gag quasispecies and assessed the possible contribution of pyrosequencing technology in studying HIV-1 biology and evolution. METHODOLOGY/PRINCIPAL FINDINGS: HIV-1 gag gene was amplified from 96 patients using nested PCR. The PCR products were cloned and sequenced using capillary based Sanger fluorescent dideoxy termination sequencing. The same PCR products were also directly sequenced using the 454 pyrosequencing technology. The two sequencing methods were evaluated for their ability to characterize quasispecies variation, and to reveal sites under host immune pressure for their putative functional significance. A total of 14,034 variations were identified by 454 pyrosequencing versus 3,632 variations by Sanger clone-based (SCB sequencing. 11,050 of these variations were detected only by pyrosequencing. These undetected variations were located in the HIV-1 Gag region which is known to contain putative cytotoxic T lymphocyte (CTL and neutralizing antibody epitopes, and sites related to virus assembly and packaging. Analysis of the positively selected sites derived by the two sequencing methods identified several differences. All of them were located within the CTL epitope regions. CONCLUSIONS/SIGNIFICANCE: Ultra-deep pyrosequencing has proven to be a powerful tool for characterization of HIV-1 genetic diversity with enhanced sensitivity, efficiency, and accuracy. It also improved reliability of downstream evolutionary and functional analysis of HIV-1 quasispecies.

  20. Evaluation of culture-based techniques and 454 pyrosequencing for the analysis of fungal diversity in potting media and organic fertilizers.

    Science.gov (United States)

    Al-Sadi, A M; Al-Mazroui, S S; Phillips, A J L

    2015-08-01

    Potting media and organic fertilizers (OFs) are commonly used in agricultural systems. However, there is a lack of studies on the efficiency of culture-based techniques in assessing the level of fungal diversity in these products. A study was conducted to investigate the efficiency of seven culture-based techniques and pyrosequencing for characterizing fungal diversity in potting media and OFs. Fungal diversity was evaluated using serial dilution, direct plating and baiting with carrot slices, potato slices, radish seeds, cucumber seeds and cucumber cotyledons. Identity of all the isolates was confirmed on the basis of the internal transcribed spacer region of the ribosomal RNA (ITS rRNA) sequence data. The direct plating technique was found to be superior over other culture-based techniques in the number of fungal species detected. It was also found to be simple and the least time consuming technique. Comparing the efficiency of direct plating with 454 pyrosequencing revealed that pyrosequencing detected 12 and 15 times more fungal species from potting media and OFs respectively. Analysis revealed that there were differences between potting media and OFs in the dominant phyla, classes, orders, families, genera and species detected. Zygomycota (52%) and Chytridiomycota (60%) were the predominant phyla in potting media and OFs respectively. The superiority of pyrosequencing over cultural methods could be related to the ability to detect obligate fungi, slow growing fungi and fungi that exist at low population densities. The evaluated methods in this study, especially direct plating and pyrosequencing, may be used as tools to help detect and reduce movement of unwanted fungi between countries and regions. © 2015 The Society for Applied Microbiology.

  1. 454 Pyrosequencing of Olive (Olea europaea L.) Transcriptome in Response to Salinity.

    Science.gov (United States)

    Bazakos, Christos; Manioudaki, Maria E; Sarropoulou, Elena; Spano, Thodhoraq; Kalaitzis, Panagiotis

    2015-01-01

    Olive (Olea europaea L.) is one of the most important crops in the Mediterranean region. The expansion of cultivation in areas irrigated with low quality and saline water has negative effects on growth and productivity however the investigation of the molecular basis of salt tolerance in olive trees has been only recently initiated. To this end, we investigated the molecular response of cultivar Kalamon to salinity stress using next-generation sequencing technology to explore the transcriptome profile of olive leaves and roots and identify differentially expressed genes that are related to salt tolerance response. Out of 291,958 obtained trimmed reads, 28,270 unique transcripts were identified of which 35% are annotated, a percentage that is comparable to similar reports on non-model plants. Among the 1,624 clusters in roots that comprise more than one read, 24 were differentially expressed comprising 9 down- and 15 up-regulated genes. Respectively, inleaves, among the 2,642 clusters, 70 were identified as differentially expressed, with 14 down- and 56 up-regulated genes. Using next-generation sequencing technology we were able to identify salt-response-related transcripts. Furthermore we provide an annotated transcriptome of olive as well as expression data, which are both significant tools for further molecular studies in olive.

  2. Transcriptome Analysis of Barbarea vulgaris Infested with Diamondback Moth (Plutella xylostella) Larvae

    Science.gov (United States)

    Shen, Di; Wang, Haiping; Wu, Qingjun; Lu, Peng; Qiu, Yang; Song, Jiangping; Zhang, Youjun; Li, Xixiang

    2013-01-01

    Background The diamondback moth (DBM, Plutella xylostella) is a crucifer-specific pest that causes significant crop losses worldwide. Barbarea vulgaris (Brassicaceae) can resist DBM and other herbivorous insects by producing feeding-deterrent triterpenoid saponins. Plant breeders have long aimed to transfer this insect resistance to other crops. However, a lack of knowledge on the biosynthetic pathways and regulatory networks of these insecticidal saponins has hindered their practical application. A pyrosequencing-based transcriptome analysis of B. vulgaris during DBM larval feeding was performed to identify genes and gene networks responsible for saponin biosynthesis and its regulation at the genome level. Principal Findings Approximately 1.22, 1.19, 1.16, 1.23, 1.16, 1.20, and 2.39 giga base pairs of clean nucleotides were generated from B. vulgaris transcriptomes sampled 1, 4, 8, 12, 24, and 48 h after onset of P. xylostella feeding and from non-inoculated controls, respectively. De novo assembly using all data of the seven transcriptomes generated 39,531 unigenes. A total of 37,780 (95.57%) unigenes were annotated, 14,399 of which were assigned to one or more gene ontology terms and 19,620 of which were assigned to 126 known pathways. Expression profiles revealed 2,016–4,685 up-regulated and 557–5188 down-regulated transcripts. Secondary metabolic pathways, such as those of terpenoids, glucosinolates, and phenylpropanoids, and its related regulators were elevated. Candidate genes for the triterpene saponin pathway were found in the transcriptome. Orthological analysis of the transcriptome with four other crucifer transcriptomes identified 592 B. vulgaris-specific gene families with a P-value cutoff of 1e−5. Conclusion This study presents the first comprehensive transcriptome analysis of B. vulgaris subjected to a series of DBM feedings. The biosynthetic and regulatory pathways of triterpenoid saponins and other DBM deterrent metabolites in this plant were

  3. Characteristics of 454 pyrosequencing data--enabling realistic simulation with flowsim.

    Science.gov (United States)

    Balzer, Susanne; Malde, Ketil; Lanzén, Anders; Sharma, Animesh; Jonassen, Inge

    2010-09-15

    The commercial launch of 454 pyrosequencing in 2005 was a milestone in genome sequencing in terms of performance and cost. Throughout the three available releases, average read lengths have increased to approximately 500 base pairs and are thus approaching read lengths obtained from traditional Sanger sequencing. Study design of sequencing projects would benefit from being able to simulate experiments. We explore 454 raw data to investigate its characteristics and derive empirical distributions for the flow values generated by pyrosequencing. Based on our findings, we implement Flowsim, a simulator that generates realistic pyrosequencing data files of arbitrary size from a given set of input DNA sequences. We finally use our simulator to examine the impact of sequence lengths on the results of concrete whole-genome assemblies, and we suggest its use in planning of sequencing projects, benchmarking of assembly methods and other fields. Flowsim is freely available under the General Public License from http://blog.malde.org/index.php/flowsim/.

  4. Pyrosequencing of Haliotis diversicolor transcriptomes: insights into early developmental molluscan gene expression.

    Directory of Open Access Journals (Sweden)

    Zi-Xia Huang

    Full Text Available BACKGROUND: The abalone Haliotis diversicolor is a good model for study of the settlement and metamorphosis, which are widespread marine ecological phenomena. However, information on the global gene backgrounds and gene expression profiles for the early development of abalones is lacking. METHODOLOGY/PRINCIPAL FINDINGS: In this study, eight non-normalized and multiplex barcode-labeled transcriptomes were sequenced using a 454 GS system to cover the early developmental stages of the abalone H. diversicolor. The assembly generated 35,415 unigenes, of which 7,566 were assigned GO terms. A global gene expression profile containing 636 scaffolds/contigs was constructed and was proven reliable using qPCR evaluation. It indicated that there may be existing dramatic phase transitions. Bioprocesses were proposed, including the 'lock system' in mature eggs, the collagen shells of the trochophore larvae and the development of chambered extracellular matrix (ECM structures within the earliest postlarvae. CONCLUSION: This study globally details the first 454 sequencing data for larval stages of H. diversicolor. A basic analysis of the larval transcriptomes and cluster of the gene expression profile indicates that each stage possesses a batch of specific genes that are indispensable during embryonic development, especially during the two-cell, trochophore and early postlarval stages. These data will provide a fundamental resource for future physiological works on abalones, revealing the mechanisms of settlement and metamorphosis at the molecular level.

  5. Development of an ELA-DRA gene typing method based on pyrosequencing technology.

    Science.gov (United States)

    Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G

    2008-11-01

    The polymorphism of equine lymphocyte antigen (ELA) class II DRA gene had been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed to identify 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.

  6. Transcriptome dynamics-based operon prediction in prokaryotes.

    Science.gov (United States)

    Fortino, Vittorio; Smolander, Olli-Pekka; Auvinen, Petri; Tagliaferri, Roberto; Greco, Dario

    2014-05-16

    Inferring operon maps is crucial to understanding the regulatory networks of prokaryotic genomes. Recently, RNA-seq based transcriptome studies revealed that in many bacterial species the operon structure vary with the change of environmental conditions. Therefore, new computational solutions that use both static and dynamic data are necessary to create condition specific operon predictions. In this work, we propose a novel classification method that integrates RNA-seq based transcriptome profiles with genomic sequence features to accurately identify the operons that are expressed under a measured condition. The classifiers are trained on a small set of confirmed operons and then used to classify the remaining gene pairs of the organism studied. Finally, by linking consecutive gene pairs classified as operons, our computational approach produces condition-dependent operon maps. We evaluated our approach on various RNA-seq expression profiles of the bacteria Haemophilus somni, Porphyromonas gingivalis, Escherichia coli and Salmonella enterica. Our results demonstrate that, using features depending on both transcriptome dynamics and genome sequence characteristics, we can identify operon pairs with high accuracy. Moreover, the combination of DNA sequence and expression data results in more accurate predictions than each one alone. We present a computational strategy for the comprehensive analysis of condition-dependent operon maps in prokaryotes. Our method can be used to generate condition specific operon maps of many bacterial organisms for which high-resolution transcriptome data is available.

  7. A first insight into Pycnoporus sanguineus BAFC 2126 transcriptome.

    Directory of Open Access Journals (Sweden)

    Cristian O Rohr

    Full Text Available Fungi of the genus Pycnoporus are white-rot basidiomycetes widely studied because of their ability to synthesize high added-value compounds and enzymes of industrial interest. Here we report the sequencing, assembly and analysis of the transcriptome of Pycnoporus sanguineus BAFC 2126 grown at stationary phase, in media supplemented with copper sulfate. Using the 454 pyrosequencing platform we obtained a total of 226,336 reads (88,779,843 bases that were filtered and de novo assembled to generate a reference transcriptome of 7,303 transcripts. Putative functions were assigned for 4,732 transcripts by searching similarities of six-frame translated sequences against a customized protein database and by the presence of conserved protein domains. Through the analysis of translated sequences we identified transcripts encoding 178 putative carbohydrate active enzymes, including representatives of 15 families with roles in lignocellulose degradation. Furthermore, we found many transcripts encoding enzymes related to lignin hydrolysis and modification, including laccases and peroxidases, as well as GMC oxidoreductases, copper radical oxidases and other enzymes involved in the generation of extracellular hydrogen peroxide and iron homeostasis. Finally, we identified the transcripts encoding all of the enzymes involved in terpenoid backbone biosynthesis pathway, various terpene synthases related to the biosynthesis of sesquiterpenoids and triterpenoids precursors, and also cytochrome P450 monooxygenases, glutathione S-transferases and epoxide hydrolases with potential functions in the biodegradation of xenobiotics and the enantioselective biosynthesis of biologically active drugs. To our knowledge this is the first report of a transcriptome of genus Pycnoporus and a resource for future molecular studies in P. sanguineus.

  8. Gene discovery using massively parallel pyrosequencing to develop ESTs for the flesh fly Sarcophaga crassipalpis

    Directory of Open Access Journals (Sweden)

    Hahn Daniel A

    2009-05-01

    Full Text Available Abstract Background Flesh flies in the genus Sarcophaga are important models for investigating endocrinology, diapause, cold hardiness, reproduction, and immunity. Despite the prominence of Sarcophaga flesh flies as models for insect physiology and biochemistry, and in forensic studies, little genomic or transcriptomic data are available for members of this genus. We used massively parallel pyrosequencing on the Roche 454-FLX platform to produce a substantial EST dataset for the flesh fly Sarcophaga crassipalpis. To maximize sequence diversity, we pooled RNA extracted from whole bodies of all life stages and normalized the cDNA pool after reverse transcription. Results We obtained 207,110 ESTs with an average read length of 241 bp. These reads assembled into 20,995 contigs and 31,056 singletons. Using BLAST searches of the NR and NT databases we were able to identify 11,757 unique gene elements (ES. crassipalpis unigenes among GO Biological Process functional groups with that of the Drosophila melanogaster transcriptome suggests that our ESTs are broadly representative of the flesh fly transcriptome. Insertion and deletion errors in 454 sequencing present a serious hurdle to comparative transcriptome analysis. Aided by a new approach to correcting for these errors, we performed a comparative analysis of genetic divergence across GO categories among S. crassipalpis, D. melanogaster, and Anopheles gambiae. The results suggest that non-synonymous substitutions occur at similar rates across categories, although genes related to response to stimuli may evolve slightly faster. In addition, we identified over 500 potential microsatellite loci and more than 12,000 SNPs among our ESTs. Conclusion Our data provides the first large-scale EST-project for flesh flies, a much-needed resource for exploring this model species. In addition, we identified a large number of potential microsatellite and SNP markers that could be used in population and systematic

  9. Metabarcoding Analysis of Phytophthora Diversity Using Genus-Specific Primers and 454 Pyrosequencing.

    Science.gov (United States)

    Prigigallo, Maria I; Abdelfattah, Ahmed; Cacciola, Santa O; Faedda, Roberto; Sanzani, Simona M; Cooke, David E L; Schena, L

    2016-03-01

    A metabarcoding method based on genus-specific primers and 454 pyrosequencing was utilized to investigate the genetic diversity of Phytophthora spp. in soil and root samples of potted plants, from eight nurseries. Pyrosequencing enabled the detection of 25 Phytophthora phylotypes distributed in seven different clades and provided a much higher resolution than a corresponding cloning/Sanger sequencing approach. Eleven of these phylotypes, including P. cactorum, P. citricola s.str., P. palmivora, P. palmivora-like, P. megasperma or P. gonapodyides, P. ramorum, and five putative new Phytophthora species phylogenetically related to clades 1, 2, 4, 6, and 7 were detected only with the 454 pyrosequencing approach. We also found an additional 18 novel records of a phylotype in a particular nursery that were not detected with cloning/Sanger sequencing. Several aspects confirmed the reliability of the method: (i) many identical sequence types were identified independently in different nurseries, (ii) most sequence types identified with 454 pyrosequencing were identical to those from the cloning/Sanger sequencing approach and/or perfectly matched GenBank deposited sequences, and (iii) the divergence noted between sequence types of putative new Phytophthora species and all other detected sequences was sufficient to rule out sequencing errors. The proposed method represents a powerful tool to study Phytophthora diversity providing that particular attention is paid to the analysis of 454 pyrosequencing raw read sequences and to the identification of sequence types.

  10. Rapid transcriptome and proteome profiling of a non-model marine invertebrate, Bugula neritina

    KAUST Repository

    Wang, Hao

    2010-06-10

    Non-model organisms represent the majority of life forms in our planet. However, the lack of genetic information hinders us to understand the unique biological phenomena in non-model organisms at the molecular level. In this study, we applied a tandem transcriptome and proteome profiling on a non-model marine fouling organism, Bugula neritina. Using a 454 pyrosequencing platform with the updated titanium reagents, we generated a total of 48M bp transcriptome data consisting of 131 450 high-quality reads. Of these, 122 650 reads (93%) were assembled to produce 6392 contigs with an average length of 538 bases and the remaining 8800 reads were singletons. Of the total 15 192 unigenes, 13 863 ORFs were predicated, of which 6917 were functionally annotated based on gene ontology and eukaryotic orthologous groups. Subsequent proteome analysis identified and quantified 882 proteins from B. neritina. These results would provide fundamental and important information for the subsequent studies of molecular mechanism in larval biology, development, antifouling research. Furthermore, we demonstrated, for the first time, the combined use of two high-throughput technologies as a powerful approach for accelerating the studies of non-model but otherwise important species. © 2010 Wiley-VCH Verlag GmbH & Co. KGaA.

  11. Transcriptomic underpinning of toxicant-mediated physiological function alterations in three terrestrial invertebrate taxa: A review

    Energy Technology Data Exchange (ETDEWEB)

    Brulle, Franck [Univ Lille Nord de France, F59000 Lille (France); LGCgE-Lille 1, Ecologie Numerique et Ecotoxicologie, F-59650 Villeneuve d' Ascq (France); Morgan, A. John [Cardiff School of Biosciences, Cardiff University, P.O. Box 915, Cardiff, CF10 3US Wales (United Kingdom); Cocquerelle, Claude [Univ Lille Nord de France, F59000 Lille (France); LGCgE-Lille 1, Ecologie Numerique et Ecotoxicologie, F-59650 Villeneuve d' Ascq (France); Vandenbulcke, Franck, E-mail: franck.vandenbulcke@univ-lille1.f [Univ Lille Nord de France, F59000 Lille (France); LGCgE-Lille 1, Ecologie Numerique et Ecotoxicologie, F-59650 Villeneuve d' Ascq (France)

    2010-09-15

    Diverse anthropogenic activities often lead to the accumulation of inorganic and organic residues in topsoils. Biota living in close contact with contaminated soils may experience stress at different levels of biological organisation throughout the continuum from the molecular-genetic to ecological and community levels. To date, the relationship between changes at the molecular (mRNA expression) and biochemical/physiological levels evoked by exposures to chemical compounds has been partially established in a limited number of terrestrial invertebrate species. Recently, the advent of a family of transcriptomic tools (e.g. Real-time PCR, Subtractive Suppressive Hybridization, Expressed Sequence Tag sequencing, pyro-sequencing technologies, Microarray chips), together with supporting informatic and statistical procedures, have permitted the robust analyses of global gene expression changes within an ecotoxicological context. This review focuses on how transcriptomics is enlightening our understanding of the molecular-genetic responses of three contrasting terrestrial macroinvertebrate taxa (nematodes, earthworms, and springtails) to inorganics, organics, and agrochemicals. - Environmental toxicology and transcriptomics in soil macroinvertebrates.

  12. Transcriptomic underpinning of toxicant-mediated physiological function alterations in three terrestrial invertebrate taxa: A review

    International Nuclear Information System (INIS)

    Brulle, Franck; Morgan, A. John; Cocquerelle, Claude; Vandenbulcke, Franck

    2010-01-01

    Diverse anthropogenic activities often lead to the accumulation of inorganic and organic residues in topsoils. Biota living in close contact with contaminated soils may experience stress at different levels of biological organisation throughout the continuum from the molecular-genetic to ecological and community levels. To date, the relationship between changes at the molecular (mRNA expression) and biochemical/physiological levels evoked by exposures to chemical compounds has been partially established in a limited number of terrestrial invertebrate species. Recently, the advent of a family of transcriptomic tools (e.g. Real-time PCR, Subtractive Suppressive Hybridization, Expressed Sequence Tag sequencing, pyro-sequencing technologies, Microarray chips), together with supporting informatic and statistical procedures, have permitted the robust analyses of global gene expression changes within an ecotoxicological context. This review focuses on how transcriptomics is enlightening our understanding of the molecular-genetic responses of three contrasting terrestrial macroinvertebrate taxa (nematodes, earthworms, and springtails) to inorganics, organics, and agrochemicals. - Environmental toxicology and transcriptomics in soil macroinvertebrates.

  13. Pyrosequencing for classification of human FcγRIIIA allotypes: a comparison with PCR-based techniques.

    Science.gov (United States)

    Matlawska-Wasowska, Ksenia; Gale, James M; Nickl, Christian K; Khalili, Parisa; Shirley, Brian; Wilson, Bridget S; Vasef, Mohammad A; Winter, Stuart S

    2014-12-01

    Surface-specific antigens expressed by hematopoietic cells are attractive targets for antibody-mediated immunotherapy. Monoclonal antibodies (mAbs) involve various mechanisms to eliminate target cells, including antibody-dependent cellular cytotoxicity (ADCC)- and phagocytosis (ADCP)-mediated killing through natural killer (NK) and macrophage effector cells bearing FcγRIIIA (CD16). The clinical efficacy of ADCC is particularly impacted by a single nucleotide polymorphism (SNP) found in the gene encoding FcγRIIIA (FCGR3A), which generates a variable distribution of the 158 V/V, F/V or F/F CD16 allotypes (F = phenylalanine, V = valine) in the normal human population. Currently, most patients are not screened for CD16 allotypes, creating the potential to include in their treatment a mAb-based therapy that may have limited benefit. Therefore, it is important to identify CD16 allotypes when considering mAb therapies that require ADCC/ADCP. The objective of this study was to develop a reliable PCR-based assay for classification of human FcγRIIIA allotypes. We studied 42 normal human subjects for the incidence of FcγRIIIA-158 polymorphisms using comparative molecular approaches. The results of our study showed 100% accuracy in genotyping by pyrosequencing. In contrast, nested PCR-based allele-specific restriction assay and quantitative PCR techniques proved to be relatively less sensitive and less specific in distinguishing variant genotypes. Since the efficacy of the mAb-based targeted immunotherapy may be highly dependent upon the CD16 polymorphism in a given individual, we recommend pyrosequencing for CD16 allotype testing.

  14. Strategien zur HLA-Typisierung mit PyrosequencingTM

    OpenAIRE

    Entz, Patricia

    2006-01-01

    Der Haupthistokompatibilitätskomplex ist durch seine biologische Funktion eine für die Diagnostik und Forschung äußerst wichtige Region im humanen Genom. Die Untersuchung von HLA-Genorten stellt ein wichtiges Instrument in der molekulargenetischen Praxis dar. Die Pyrosequencing-Technik ist gut geeignet, um kurze DNA-Abschnitte mit weitgehend bekannter Sequenz schnell und effizient zu untersuchen. Ziel dieser Arbeit war die Entwicklung von Pyrosequencing-basierten Methoden zur HLA-Typisierung....

  15. Developmental gene discovery in a hemimetabolous insect: de novo assembly and annotation of a transcriptome for the cricket Gryllus bimaculatus.

    Directory of Open Access Journals (Sweden)

    Victor Zeng

    Full Text Available Most genomic resources available for insects represent the Holometabola, which are insects that undergo complete metamorphosis like beetles and flies. In contrast, the Hemimetabola (direct developing insects, representing the basal branches of the insect tree, have very few genomic resources. We have therefore created a large and publicly available transcriptome for the hemimetabolous insect Gryllus bimaculatus (cricket, a well-developed laboratory model organism whose potential for functional genetic experiments is currently limited by the absence of genomic resources. cDNA was prepared using mRNA obtained from adult ovaries containing all stages of oogenesis, and from embryo samples on each day of embryogenesis. Using 454 Titanium pyrosequencing, we sequenced over four million raw reads, and assembled them into 21,512 isotigs (predicted transcripts and 120,805 singletons with an average coverage per base pair of 51.3. We annotated the transcriptome manually for over 400 conserved genes involved in embryonic patterning, gametogenesis, and signaling pathways. BLAST comparison of the transcriptome against the NCBI non-redundant protein database (nr identified significant similarity to nr sequences for 55.5% of transcriptome sequences, and suggested that the transcriptome may contain 19,874 unique transcripts. For predicted transcripts without significant similarity to known sequences, we assessed their similarity to other orthopteran sequences, and determined that these transcripts contain recognizable protein domains, largely of unknown function. We created a searchable, web-based database to allow public access to all raw, assembled and annotated data. This database is to our knowledge the largest de novo assembled and annotated transcriptome resource available for any hemimetabolous insect. We therefore anticipate that these data will contribute significantly to more effective and higher-throughput deployment of molecular analysis tools in

  16. Transcriptome sequencing and de novo analysis of the copepod Calanus sinicus using 454 GS FLX.

    Directory of Open Access Journals (Sweden)

    Juan Ning

    Full Text Available BACKGROUND: Despite their species abundance and primary economic importance, genomic information about copepods is still limited. In particular, genomic resources are lacking for the copepod Calanus sinicus, which is a dominant species in the coastal waters of East Asia. In this study, we performed de novo transcriptome sequencing to produce a large number of expressed sequence tags for the copepod C. sinicus. RESULTS: Copepodid larvae and adults were used as the basic material for transcriptome sequencing. Using 454 pyrosequencing, a total of 1,470,799 reads were obtained, which were assembled into 56,809 high quality expressed sequence tags. Based on their sequence similarity to known proteins, about 14,000 different genes were identified, including members of all major conserved signaling pathways. Transcripts that were putatively involved with growth, lipid metabolism, molting, and diapause were also identified among these genes. Differentially expressed genes related to several processes were found in C. sinicus copepodid larvae and adults. We detected 284,154 single nucleotide polymorphisms (SNPs that provide a resource for gene function studies. CONCLUSION: Our data provide the most comprehensive transcriptome resource available for C. sinicus. This resource allowed us to identify genes associated with primary physiological processes and SNPs in coding regions, which facilitated the quantitative analysis of differential gene expression. These data should provide foundation for future genetic and genomic studies of this and related species.

  17. Toward an understanding of the molecular mechanisms of barnacle larval settlement: A comparative transcriptomic approach

    KAUST Repository

    Chen, Zhang-Fan

    2011-07-29

    Background: The barnacle Balanus amphitrite is a globally distributed biofouler and a model species in intertidal ecology and larval settlement studies. However, a lack of genomic information has hindered the comprehensive elucidation of the molecular mechanisms coordinating its larval settlement. The pyrosequencing-based transcriptomic approach is thought to be useful to identify key molecular changes during larval settlement. Methodology and Principal Findings: Using 454 pyrosequencing, we collected totally 630,845 reads including 215,308 from the larval stages and 415,537 from the adults; 23,451 contigs were generated while 77,785 remained as singletons. We annotated 31,720 of the 92,322 predicted open reading frames, which matched hits in the NCBI NR database, and identified 7,954 putative genes that were differentially expressed between the larval and adult stages. Of these, several genes were further characterized with quantitative real-time PCR and in situ hybridization, revealing some key findings: 1) vitellogenin was uniquely expressed in late nauplius stage, suggesting it may be an energy source for the subsequent non-feeding cyprid stage; 2) the locations of mannose receptors suggested they may be involved in the sensory system of cyprids; 3) 20 kDa-cement protein homologues were expressed in the cyprid cement gland and probably function during attachment; and 4) receptor tyrosine kinases were expressed higher in cyprid stage and may be involved in signal perception during larval settlement. Conclusions: Our results provide not only the basis of several new hypotheses about gene functions during larval settlement, but also the availability of this large transcriptome dataset in B. amphitrite for further exploration of larval settlement and developmental pathways in this important marine species. © 2011 Chen et al.

  18. Characterization of Olkiluoto bacterial and archaeal communities by 454 pyrosequencing

    Energy Technology Data Exchange (ETDEWEB)

    Bomberg, M.; Nyyssoenen, M.; Itaevaara, M. [VTT Technical Research Centre of Finland, Espoo (Finland)

    2012-06-15

    Recent advancement in sequencing technologies, 'Next Generation Sequencing', such as FLX 454 pyrosequencing has made it possible to obtain large amounts of sequence data where previously only few sequences could be obtained. This technique is especially useful for the study of community composition of uncultured microbial populations in environmental samples. In this project, the FLX 454 pyrosequencing technique was used to obtain up to 20 000 16S rRNA sequences or 10 000 mRNA sequences from each sample for identification of the microbial species composition as well as for comparison of the microbial communities between different samples. This project focused on the characterization of active microbial communities in the groundwater at the final disposal site of high radioactive wastes in Olkiluoto by FLX 454 pyrosequencing of the bacterial and archaeal ribosomal RNA as well as of the mRNA transcripts of the dsrB gene and mcrA gene of sulphate reducing bacteria and methanogenic archaea, respectively. Specific emphasis was put on studying the relationship of active and latent sulphate reducers and methanogens by qPCR due to their important roles in deep geobiochemical processes connected to copper corrosion. Seven packered boreholes were sampled anaerobically in Olkiluoto during 2009-2010. Groundwater was pumped from specific depths and the microbial cells werecollected by filtration on a membrane. Active microbial communities were studied based on RNA extracted from the membranes and translated to copy DNA, followed by sequencing by 454 Tag pyrosequencing. A total of 27 different bacterial and 17 archaeal taxonomic groups were detected.

  19. Characterization of Olkiluoto bacterial and archaeal communities by 454 pyrosequencing

    Energy Technology Data Exchange (ETDEWEB)

    Bomberg, M; Nyyssoenen, M; Itaevaara, M [VTT Technical Research Centre of Finland, Espoo (Finland)

    2012-06-15

    Recent advancement in sequencing technologies, 'Next Generation Sequencing', such as FLX 454 pyrosequencing has made it possible to obtain large amounts of sequence data where previously only few sequences could be obtained. This technique is especially useful for the study of community composition of uncultured microbial populations in environmental samples. In this project, the FLX 454 pyrosequencing technique was used to obtain up to 20 000 16S rRNA sequences or 10 000 mRNA sequences from each sample for identification of the microbial species composition as well as for comparison of the microbial communities between different samples. This project focused on the characterization of active microbial communities in the groundwater at the final disposal site of high radioactive wastes in Olkiluoto by FLX 454 pyrosequencing of the bacterial and archaeal ribosomal RNA as well as of the mRNA transcripts of the dsrB gene and mcrA gene of sulphate reducing bacteria and methanogenic archaea, respectively. Specific emphasis was put on studying the relationship of active and latent sulphate reducers and methanogens by qPCR due to their important roles in deep geobiochemical processes connected to copper corrosion. Seven packered boreholes were sampled anaerobically in Olkiluoto during 2009-2010. Groundwater was pumped from specific depths and the microbial cells werecollected by filtration on a membrane. Active microbial communities were studied based on RNA extracted from the membranes and translated to copy DNA, followed by sequencing by 454 Tag pyrosequencing. A total of 27 different bacterial and 17 archaeal taxonomic groups were detected.

  20. Characterization of Olkiluoto bacterial and archaeal communities by 454 pyrosequencing

    International Nuclear Information System (INIS)

    Bomberg, M.; Nyyssoenen, M.; Itaevaara, M.

    2012-06-01

    Recent advancement in sequencing technologies, 'Next Generation Sequencing', such as FLX 454 pyrosequencing has made it possible to obtain large amounts of sequence data where previously only few sequences could be obtained. This technique is especially useful for the study of community composition of uncultured microbial populations in environmental samples. In this project, the FLX 454 pyrosequencing technique was used to obtain up to 20 000 16S rRNA sequences or 10 000 mRNA sequences from each sample for identification of the microbial species composition as well as for comparison of the microbial communities between different samples. This project focused on the characterization of active microbial communities in the groundwater at the final disposal site of high radioactive wastes in Olkiluoto by FLX 454 pyrosequencing of the bacterial and archaeal ribosomal RNA as well as of the mRNA transcripts of the dsrB gene and mcrA gene of sulphate reducing bacteria and methanogenic archaea, respectively. Specific emphasis was put on studying the relationship of active and latent sulphate reducers and methanogens by qPCR due to their important roles in deep geobiochemical processes connected to copper corrosion. Seven packered boreholes were sampled anaerobically in Olkiluoto during 2009-2010. Groundwater was pumped from specific depths and the microbial cells werecollected by filtration on a membrane. Active microbial communities were studied based on RNA extracted from the membranes and translated to copy DNA, followed by sequencing by 454 Tag pyrosequencing. A total of 27 different bacterial and 17 archaeal taxonomic groups were detected

  1. Pyrosequencing Based Microbial Community Analysis of Stabilized Mine Soils

    Science.gov (United States)

    Park, J. E.; Lee, B. T.; Son, A.

    2015-12-01

    Heavy metals leached from exhausted mines have been causing severe environmental problems in nearby soils and groundwater. Environmental mitigation was performed based on the heavy metal stabilization using Calcite and steel slag in Korea. Since the soil stabilization only temporarily immobilizes the contaminants to soil matrix, the potential risk of re-leaching heavy metal still exists. Therefore the follow-up management of stabilized soils and the corresponding evaluation methods are required to avoid the consequent contamination from the stabilized soils. In this study, microbial community analysis using pyrosequencing was performed for assessing the potential leaching of the stabilized soils. As a result of rarefaction curve and Chao1 and Shannon indices, the stabilized soil has shown lower richness and diversity as compared to non-contaminated negative control. At the phyla level, as the degree of contamination increases, most of phyla decreased with only exception of increased proteobacteria. Among proteobacteria, gamma-proteobacteria increased against the heavy metal contamination. At the species level, Methylobacter tundripaludum of gamma-proteobacteria showed the highest relative portion of microbial community, indicating that methanotrophs may play an important role in either solubilization or immobilization of heavy metals in stabilized soils.

  2. Bacterial flora-typing with targeted, chip-based Pyrosequencing

    Directory of Open Access Journals (Sweden)

    El-Sayed Yasser Y

    2007-11-01

    Full Text Available Abstract Background The metagenomic analysis of microbial communities holds the potential to improve our understanding of the role of microbes in clinical conditions. Recent, dramatic improvements in DNA sequencing throughput and cost will enable such analyses on individuals. However, such advances in throughput generally come at the cost of shorter read-lengths, limiting the discriminatory power of each read. In particular, classifying the microbial content of samples by sequencing the Results We describe a method for identifying the phylogenetic content of bacterial samples using high-throughput Pyrosequencing targeted at the 16S rRNA gene. Our analysis is adapted to the shorter read-lengths of such technology and uses a database of 16S rDNA to determine the most specific phylogenetic classification for reads, resulting in a weighted phylogenetic tree characterizing the content of the sample. We present results for six samples obtained from the human vagina during pregnancy that corroborates previous studies using conventional techniques. Next, we analyze the power of our method to classify reads at each level of the phylogeny using simulation experiments. We assess the impacts of read-length and database completeness on our method, and predict how we do as technology improves and more bacteria are sequenced. Finally, we study the utility of targeting specific 16S variable regions and show that such an approach considerably improves results for certain types of microbial samples. Using simulation, our method can be used to determine the most informative variable region. Conclusion This study provides positive validation of the effectiveness of targeting 16S metagenomes using short-read sequencing technology. Our methodology allows us to infer the most specific assignment of the sequence reads within the phylogeny, and to identify the most discriminative variable region to target. The analysis of high-throughput Pyrosequencing on human flora

  3. Analysis of genetically modified organisms by pyrosequencing on a portable photodiode-based bioluminescence sequencer.

    Science.gov (United States)

    Song, Qinxin; Wei, Guijiang; Zhou, Guohua

    2014-07-01

    A portable bioluminescence analyser for detecting the DNA sequence of genetically modified organisms (GMOs) was developed by using a photodiode (PD) array. Pyrosequencing on eight genes (zSSIIb, Bt11 and Bt176 gene of genetically modified maize; Lectin, 35S-CTP4, CP4EPSPS, CaMV35S promoter and NOS terminator of the genetically modified Roundup ready soya) was successfully detected with this instrument. The corresponding limit of detection (LOD) was 0.01% with 35 PCR cycles. The maize and soya available from three different provenances in China were detected. The results indicate that pyrosequencing using the small size of the detector is a simple, inexpensive, and reliable way in a farm/field test of GMO analysis. Copyright © 2014 Elsevier Ltd. All rights reserved.

  4. Evaluation of de novo assembly technique in the South African abalone Haliotis midae transcriptome: A comparison from Illumina and 454 systems

    Directory of Open Access Journals (Sweden)

    Barbara Picone

    2016-12-01

    Full Text Available Next generation sequencing platforms have recently been used to rapidly characterize transcriptome sequences from a number of non-model organisms. The present study compares two of the most frequently used platforms, the Roche 454-pyrosequencing and the Illumina sequencing-by-synthesis (SBS, on the same RNA sample obtained from an intertidal gastropod mollusc species, Haliotis midae. All the sequencing reads were deposited in the Short Read Archive (SRA database are retrievable under the accession number [SRR071314 (Illumina Genome Analyzer II] and [SRR1737738, SRR1737737, SRR1737735, SRR1737734 (454 GS FLX] in the SRA database of NCBI. Three transcriptomes, composed of either pure 454 or Illumina reads or a mixture of read types (Hybrid, were assembled using CLC Genomics Workbench software. Illumina assemblies performed the best de novo transcriptome characterization in terms of contig length, whereas the 454 assemblies tended to improve the complete assembly of gene transcripts. Both the Hybrid and Illumina assemblies produced longer contigs covering more of the transcriptome than 454 assemblies. However, the addition of 454 significantly increased the number of genes annotated.

  5. De novo assembly and transcriptome analysis of five major tissues of Jatropha curcas L. using GS FLX titanium platform of 454 pyrosequencing

    Directory of Open Access Journals (Sweden)

    Parani Madasamy

    2011-04-01

    Full Text Available Abstract Background Jatropha curcas L. is an important non-edible oilseed crop with promising future in biodiesel production. However, factors like oil yield, oil composition, toxic compounds in oil cake, pests and diseases limit its commercial potential. Well established genetic engineering methods using cloned genes could be used to address these limitations. Earlier, 10,983 unigenes from Sanger sequencing of ESTs, and 3,484 unique assembled transcripts from 454 pyrosequencing of uncloned cDNAs were reported. In order to expedite the process of gene discovery, we have undertaken 454 pyrosequencing of normalized cDNAs prepared from roots, mature leaves, flowers, developing seeds, and embryos of J. curcas. Results From 383,918 raw reads, we obtained 381,957 quality-filtered and trimmed reads that are suitable for the assembly of transcript sequences. De novo contig assembly of these reads generated 17,457 assembled transcripts (contigs and 54,002 singletons. Average length of the assembled transcripts was 916 bp. About 30% of the transcripts were longer than 1000 bases, and the size of the longest transcript was 7,173 bases. BLASTX analysis revealed that 2,589 of these transcripts are full-length. The assembled transcripts were validated by RT-PCR analysis of 28 transcripts. The results showed that the transcripts were correctly assembled and represent actively expressed genes. KEGG pathway mapping showed that 2,320 transcripts are related to major biochemical pathways including the oil biosynthesis pathway. Overall, the current study reports 14,327 new assembled transcripts which included 2589 full-length transcripts and 27 transcripts that are directly involved in oil biosynthesis. Conclusion The large number of transcripts reported in the current study together with existing ESTs and transcript sequences will serve as an invaluable genetic resource for crop improvement in jatropha. Sequence information of those genes that are involved in oil

  6. De novo assembly and transcriptome analysis of five major tissues of Jatropha curcas L. using GS FLX titanium platform of 454 pyrosequencing.

    Science.gov (United States)

    Natarajan, Purushothaman; Parani, Madasamy

    2011-04-15

    Jatropha curcas L. is an important non-edible oilseed crop with promising future in biodiesel production. However, factors like oil yield, oil composition, toxic compounds in oil cake, pests and diseases limit its commercial potential. Well established genetic engineering methods using cloned genes could be used to address these limitations. Earlier, 10,983 unigenes from Sanger sequencing of ESTs, and 3,484 unique assembled transcripts from 454 pyrosequencing of uncloned cDNAs were reported. In order to expedite the process of gene discovery, we have undertaken 454 pyrosequencing of normalized cDNAs prepared from roots, mature leaves, flowers, developing seeds, and embryos of J. curcas. From 383,918 raw reads, we obtained 381,957 quality-filtered and trimmed reads that are suitable for the assembly of transcript sequences. De novo contig assembly of these reads generated 17,457 assembled transcripts (contigs) and 54,002 singletons. Average length of the assembled transcripts was 916 bp. About 30% of the transcripts were longer than 1000 bases, and the size of the longest transcript was 7,173 bases. BLASTX analysis revealed that 2,589 of these transcripts are full-length. The assembled transcripts were validated by RT-PCR analysis of 28 transcripts. The results showed that the transcripts were correctly assembled and represent actively expressed genes. KEGG pathway mapping showed that 2,320 transcripts are related to major biochemical pathways including the oil biosynthesis pathway. Overall, the current study reports 14,327 new assembled transcripts which included 2589 full-length transcripts and 27 transcripts that are directly involved in oil biosynthesis. The large number of transcripts reported in the current study together with existing ESTs and transcript sequences will serve as an invaluable genetic resource for crop improvement in jatropha. Sequence information of those genes that are involved in oil biosynthesis could be used for metabolic engineering of

  7. Microbial analysis in primary and persistent endodontic infections by using pyrosequencing.

    Science.gov (United States)

    Hong, Bo-Young; Lee, Tae-Kwon; Lim, Sang-Min; Chang, Seok Woo; Park, Joonhong; Han, Seung Hyun; Zhu, Qiang; Safavi, Kamran E; Fouad, Ashraf F; Kum, Kee Yeon

    2013-09-01

    The aim of this study was to investigate the bacterial community profile of intracanal microbiota in primary and persistent endodontic infections associated with asymptomatic chronic apical periodontitis by using GS-FLX Titanium pyrosequencing. The null hypothesis was that there is no difference in diversity of overall bacterial community profiles between primary and persistent infections. Pyrosequencing analysis from 10 untreated and 8 root-filled samples was conducted. Analysis from 18 samples yielded total of 124,767 16S rRNA gene sequences (with a mean of 6932 reads per sample) that were taxonomically assigned into 803 operational taxonomic units (3% distinction), 148 genera, and 10 phyla including unclassified. Bacteroidetes was the most abundant phylum in both primary and persistent infections. There were no significant differences in bacterial diversity between the 2 infection groups (P > .05). The bacterial community profile that was based on dendrogram showed that bacterial population in both infections was not significantly different in their structure and composition (P > .05). The present pyrosequencing study demonstrates that persistent infections have as diverse bacterial community as primary infections. Copyright © 2013 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  8. DOGMA: domain-based transcriptome and proteome quality assessment.

    Science.gov (United States)

    Dohmen, Elias; Kremer, Lukas P M; Bornberg-Bauer, Erich; Kemena, Carsten

    2016-09-01

    Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations are crucial aspects of genome analysis pipelines. We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/ CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  9. Next-generation transcriptome assembly

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Jeffrey A.; Wang, Zhong

    2011-09-01

    Transcriptomics studies often rely on partial reference transcriptomes that fail to capture the full catalog of transcripts and their variations. Recent advances in sequencing technologies and assembly algorithms have facilitated the reconstruction of the entire transcriptome by deep RNA sequencing (RNA-seq), even without a reference genome. However, transcriptome assembly from billions of RNA-seq reads, which are often very short, poses a significant informatics challenge. This Review summarizes the recent developments in transcriptome assembly approaches - reference-based, de novo and combined strategies-along with some perspectives on transcriptome assembly in the near future.

  10. Transcriptome Analysis of the Small Brown Planthopper, Laodelphax striatellus Carrying Rice stripe virus

    Directory of Open Access Journals (Sweden)

    Joo Hyun Lee

    2013-09-01

    Full Text Available Rice stripe virus (RSV, the type member of the genus Tenuivirus, transmits by the feeding behavior of small brown planthopper (SBPH, Laodelphax striatellus. To investigate the interactions between the virus and vector insect, total RNA was extracted from RSV-viruliferous SBPH (RVLS and non-viruliferous SBPH (NVLS adults to construct expressed sequence tag databases for comparative transcriptome analysis. Over 30 million bases were sequenced by 454 pyrosequencing to construct 1,538 and 953 of isotigs from the mRNA of RVLS and NVLS, respectively. The gene ontology (GO analysis demonstrated that both libraries have similar GO structures, however, the gene expression pattern analysis revealed that 17.8% and 16.8% of isotigs were up- and down-regulated significantly in the RVLS, respectively. These RSV-dependently regulated genes possibly have important roles in the physiology of SBPH, transmission of RSV, and RSV and SBPH interaction.

  11. Allele-Specific DNA Methylation Detection by Pyrosequencing®

    DEFF Research Database (Denmark)

    Kristensen, Lasse Sommer; Johansen, Jens Vilstrup; Grønbæk, Kirsten

    2015-01-01

    DNA methylation is an epigenetic modification that plays important roles in healthy as well as diseased cells, by influencing the transcription of genes. In spite the fact that human somatic cells are diploid, most of the currently available methods for the study of DNA methylation do not provide......-effective protocol for allele-specific DNA methylation detection based on Pyrosequencing(®) of methylation-specific PCR (MSP) products including a single nucleotide polymorphism (SNP) within the amplicon....

  12. Sugarcane giant borer transcriptome analysis and identification of genes related to digestion.

    Science.gov (United States)

    Fonseca, Fernando Campos de Assis; Firmino, Alexandre Augusto Pereira; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; de Souza Júnior, José Dijair Antonino; de Sousa Júnior, José Dijair Antonino; Silva-Junior, Orzenil Bonfim; Togawa, Roberto Coiti; Pappas, Georgios Joannis; de Góis, Luiz Avelar Brandão; da Silva, Maria Cristina Mattar; Grossi-de-Sá, Maria Fátima

    2015-01-01

    Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus), a lepidopteran that presents a long life cycle and which efforts to control it using pesticides have been inefficient. Although its economical relevance, only a few DNA sequences are available for this species in the GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality score and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number, compared to B. mori genome. From the sugarcane giant borer (SGB) transcriptome, a number of aminopeptidase N (APN) cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect's biology and to guide the development of new strategies for insect-pest control.

  13. Sugarcane giant borer transcriptome analysis and identification of genes related to digestion.

    Directory of Open Access Journals (Sweden)

    Fernando Campos de Assis Fonseca

    Full Text Available Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus, a lepidopteran that presents a long life cycle and which efforts to control it using pesticides have been inefficient. Although its economical relevance, only a few DNA sequences are available for this species in the GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality score and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number, compared to B. mori genome. From the sugarcane giant borer (SGB transcriptome, a number of aminopeptidase N (APN cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect's biology and to guide the development of new strategies for insect-pest control.

  14. Construction of an EST-SSR-based interspecific transcriptome ...

    Indian Academy of Sciences (India)

    Construction of an EST-SSR-based interspecific transcriptome linkage map of fibre development in cotton. CHUANXIANG LIU, DAOJUN YUAN and ZHONGXU LIN. ∗. National Key Laboratory of Crop Genetic Improvement and National Centre of Plant Gene Research (Wuhan),. Huazhong Agricultural University, Wuhan ...

  15. Wrinkles in the rare biosphere: Pyrosequencing errors can lead to artificial inflation of diversity estimates

    Energy Technology Data Exchange (ETDEWEB)

    Kunin, Victor; Engelbrektson, Anna; Ochman, Howard; Hugenholtz, Philip

    2009-08-01

    Massively parallel pyrosequencing of the small subunit (16S) ribosomal RNA gene has revealed that the extent of rare microbial populations in several environments, the 'rare biosphere', is orders of magnitude higher than previously thought. One important caveat with this method is that sequencing error could artificially inflate diversity estimates. Although the per-base error of 16S rDNA amplicon pyrosequencing has been shown to be as good as or lower than Sanger sequencing, no direct assessments of pyrosequencing errors on diversity estimates have been reported. Using only Escherichia coli MG1655 as a reference template, we find that 16S rDNA diversity is grossly overestimated unless relatively stringent read quality filtering and low clustering thresholds are applied. In particular, the common practice of removing reads with unresolved bases and anomalous read lengths is insufficient to ensure accurate estimates of microbial diversity. Furthermore, common and reproducible homopolymer length errors can result in relatively abundant spurious phylotypes further confounding data interpretation. We suggest that stringent quality-based trimming of 16S pyrotags and clustering thresholds no greater than 97% identity should be used to avoid overestimates of the rare biosphere.

  16. Transcriptome sequencing and annotation of the microalgae Dunaliella tertiolecta: Pathway description and gene discovery for production of next-generation biofuels

    Directory of Open Access Journals (Sweden)

    Bibby Kyle

    2011-03-01

    Full Text Available Abstract Background Biodiesel or ethanol derived from lipids or starch produced by microalgae may overcome many of the sustainability challenges previously ascribed to petroleum-based fuels and first generation plant-based biofuels. The paucity of microalgae genome sequences, however, limits gene-based biofuel feedstock optimization studies. Here we describe the sequencing and de novo transcriptome assembly for the non-model microalgae species, Dunaliella tertiolecta, and identify pathways and genes of importance related to biofuel production. Results Next generation DNA pyrosequencing technology applied to D. tertiolecta transcripts produced 1,363,336 high quality reads with an average length of 400 bases. Following quality and size trimming, ~ 45% of the high quality reads were assembled into 33,307 isotigs with a 31-fold coverage and 376,482 singletons. Assembled sequences and singletons were subjected to BLAST similarity searches and annotated with Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG orthology (KO identifiers. These analyses identified the majority of lipid and starch biosynthesis and catabolism pathways in D. tertiolecta. Conclusions The construction of metabolic pathways involved in the biosynthesis and catabolism of fatty acids, triacylglycrols, and starch in D. tertiolecta as well as the assembled transcriptome provide a foundation for the molecular genetics and functional genomics required to direct metabolic engineering efforts that seek to enhance the quantity and character of microalgae-based biofuel feedstock.

  17. [Sensitivity and specificity of nested PCR pyrosequencing in hepatitis B virus drug resistance gene testing].

    Science.gov (United States)

    Sun, Shumei; Zhou, Hao; Zhou, Bin; Hu, Ziyou; Hou, Jinlin; Sun, Jian

    2012-05-01

    To evaluate the sensitivity and specificity of nested PCR combined with pyrosequencing in the detection of HBV drug-resistance gene. RtM204I (ATT) mutant and rtM204 (ATG) nonmutant plasmids mixed at different ratios were detected for mutations using nested-PCR combined with pyrosequencing, and the results were compared with those by conventional PCR pyrosequencing to analyze the linearity and consistency of the two methods. Clinical specimens with different viral loads were examined for drug-resistant mutations using nested PCR pyrosequencing and nested PCR combined with dideoxy sequencing (Sanger) for comparison of the detection sensitivity and specificity. The fitting curves demonstrated good linearity of both conventional PCR pyrosequencing and nested PCR pyrosequencing (R(2)>0.99, PNested PCR showed a better consistency with the predicted value than conventional PCR, and was superior to conventional PCR for detection of samples containing 90% mutant plasmid. In the detection of clinical specimens, Sanger sequencing had a significantly lower sensitivity than nested PCR pyrosequencing (92% vs 100%, Pnested PCR and Sanger sequencing method, nested PCR pyrosequencing has a higher sensitivity especially in clinical specimens with low viral copies, which can be important for early detection of HBV mutant strains and hence more effective clinical management.

  18. A physiologically-oriented transcriptomic analysis of the midgut of Tenebrio molitor.

    Science.gov (United States)

    Moreira, Nathalia R; Cardoso, Christiane; Dias, Renata O; Ferreira, Clelia; Terra, Walter R

    2017-05-01

    Physiological data showed that T. molitor midgut is buffered at pH 5.6 at the two anterior thirds and at 7.9 at the posterior third. Furthermore, water is absorbed and secreted at the anterior and posterior midgut, respectively, driving a midgut counter flux of fluid. To look for the molecular mechanisms underlying these phenomena and nutrient absorption as well, a transcriptomic approach was used. For this, 11 types of transporters were chosen from the midgut transcriptome obtained by pyrosequencing (Roche 454). After annotation with the aid of databanks and manual curation, the sequences were validated by RT-PCR. The expression level of each gene at anterior, middle and posterior midgut and carcass (larva less midgut) was evaluated by RNA-seq taking into account reference sequences based on 454 contigs and reads obtained by Illumina sequencing. The data showed that sugar and amino acid uniporters and symporters are expressed along the whole midgut. In the anterior midgut are found transporters for NH 3 and NH 4 + that with a chloride channel may be responsible for acidifying the lumen. At the posterior midgut, bicarbonate-Cl - antiporter with bicarbonate supplied by carbonic anhydrase may alkalinize the lumen. Water absorption caused mainly by an anterior Na + -K + -2Cl - symporter and water secretion caused by a posterior K + -Cl - may drive the midgut counter flux. Transporters that complement the action of those described were also found. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Transcriptome analysis in cotton boll weevil (Anthonomus grandis and RNA interference in insect pests.

    Directory of Open Access Journals (Sweden)

    Alexandre Augusto Pereira Firmino

    Full Text Available Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi as techniques for insect control are promising strategies, which has been applied in the last few years. For this insect, there are not much available molecular information on databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500.000 reads and a data set of high quality 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against NCBI non-redundant protein database and 65.7% was similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families' data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in RNAi mechanism were found. PAZ Domains sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ Domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed alive larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of RNAi mechanism in insects.

  20. Transcriptome analysis in cotton boll weevil (Anthonomus grandis) and RNA interference in insect pests.

    Science.gov (United States)

    Firmino, Alexandre Augusto Pereira; Fonseca, Fernando Campos de Assis; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; Antonino de Souza, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima

    2013-01-01

    Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which has been applied in the last few years. For this insect, there are not much available molecular information on databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500.000 reads and a data set of high quality 20,841 contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against NCBI non-redundant protein database and 65.7% was similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families' data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in RNAi mechanism were found. PAZ Domains sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ Domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed alive larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of RNAi mechanism in insects.

  1. Characterization of gonadal transcriptomes from the turbot (Scophthalmus maximus).

    Science.gov (United States)

    Hu, Yulong; Huang, Meng; Wang, Weiji; Guan, Jiantao; Kong, Jie

    2016-01-01

    The mechanisms underlying sexual reproduction and sex ratio determination remains unclear in turbot, a flatfish of great commercial value. And there is limited information in the turbot database regarding genes related to the reproductive system. Here, we conducted high-throughput transcriptome profiling of turbot gonad tissues to better understand their reproductive functions and to supply essential gene sequence information for marker-assisted selection programs in the turbot industry. In this study, two gonad libraries representing sex differences in Scophthalmus maximus yielded 453 818 high-quality reads that were assembled into 24 611 contigs and 33 713 singletons by using 454 pyrosequencing, 13 936 contigs and singletons (CS) of which were annotated using BLASTx. GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analyses revealed that various biological functions and processes were associated with many of the annotated CS. Expression analyses showed that 510 genes were differentially expressed in males versus females; 80% of these genes were annotated. In addition, 6484 and 6036 single nucleotide polymorphisms (SNPs) were identified in male and female libraries, respectively. This transcriptome resource will serve as the foundation for cDNA or SNP microarray construction, gene expression characterization, and sex-specific linkage mapping in turbot.

  2. Position-specific automated processing of V3 env ultra-deep pyrosequencing data for predicting HIV-1 tropism.

    Science.gov (United States)

    Jeanne, Nicolas; Saliou, Adrien; Carcenac, Romain; Lefebvre, Caroline; Dubois, Martine; Cazabat, Michelle; Nicot, Florence; Loiseau, Claire; Raymond, Stéphanie; Izopet, Jacques; Delobel, Pierre

    2015-11-20

    HIV-1 coreceptor usage must be accurately determined before starting CCR5 antagonist-based treatment as the presence of undetected minor CXCR4-using variants can cause subsequent virological failure. Ultra-deep pyrosequencing of HIV-1 V3 env allows to detect low levels of CXCR4-using variants that current genotypic approaches miss. However, the computation of the mass of sequence data and the need to identify true minor variants while excluding artifactual sequences generated during amplification and ultra-deep pyrosequencing is rate-limiting. Arbitrary fixed cut-offs below which minor variants are discarded are currently used but the errors generated during ultra-deep pyrosequencing are sequence-dependant rather than random. We have developed an automated processing of HIV-1 V3 env ultra-deep pyrosequencing data that uses biological filters to discard artifactual or non-functional V3 sequences followed by statistical filters to determine position-specific sensitivity thresholds, rather than arbitrary fixed cut-offs. It allows to retain authentic sequences with point mutations at V3 positions of interest and discard artifactual ones with accurate sensitivity thresholds.

  3. A transcriptome resource for Antarctic krill (Euphausia superba Dana) exposed to short-term stress

    KAUST Repository

    Martins, Maria Joã o F; Lago-Leston, Asuncion; Anjos, Antonio; Duarte, Carlos M.; Agusti, Susana; Serrã o, Ester A.; Pearson, Gareth A.

    2015-01-01

    Euphausia superba is a keystone species in Antarctic food webs. However, the continued decrease in stock density raises concerns over the resilience and adaptive potential of krill to withstand the current rate of environmental change. We undertook a transcriptome-scale approach (454 pyrosequencing) as a baseline for future studies addressing the physiological response of krill to short-term food shortage and natural UV-B stress. The final assembly resulted in a total of 26,415 contigs, 39.8% of which were putatively annotated. Exploratory analyses indicate an overall reduction in protein synthesis under food shortage while UV stress resulted in the activation of photo-protective mechanisms. © 2015.

  4. A transcriptome resource for Antarctic krill (Euphausia superba Dana) exposed to short-term stress

    KAUST Repository

    Martins, Maria João F

    2015-10-01

    Euphausia superba is a keystone species in Antarctic food webs. However, the continued decrease in stock density raises concerns over the resilience and adaptive potential of krill to withstand the current rate of environmental change. We undertook a transcriptome-scale approach (454 pyrosequencing) as a baseline for future studies addressing the physiological response of krill to short-term food shortage and natural UV-B stress. The final assembly resulted in a total of 26,415 contigs, 39.8% of which were putatively annotated. Exploratory analyses indicate an overall reduction in protein synthesis under food shortage while UV stress resulted in the activation of photo-protective mechanisms. © 2015.

  5. Removing Noise From Pyrosequenced Amplicons

    Directory of Open Access Journals (Sweden)

    Davenport Russell J

    2011-01-01

    Full Text Available Abstract Background In many environmental genomics applications a homologous region of DNA from a diverse sample is first amplified by PCR and then sequenced. The next generation sequencing technology, 454 pyrosequencing, has allowed much larger read numbers from PCR amplicons than ever before. This has revolutionised the study of microbial diversity as it is now possible to sequence a substantial fraction of the 16S rRNA genes in a community. However, there is a growing realisation that because of the large read numbers and the lack of consensus sequences it is vital to distinguish noise from true sequence diversity in this data. Otherwise this leads to inflated estimates of the number of types or operational taxonomic units (OTUs present. Three sources of error are important: sequencing error, PCR single base substitutions and PCR chimeras. We present AmpliconNoise, a development of the PyroNoise algorithm that is capable of separately removing 454 sequencing errors and PCR single base errors. We also introduce a novel chimera removal program, Perseus, that exploits the sequence abundances associated with pyrosequencing data. We use data sets where samples of known diversity have been amplified and sequenced to quantify the effect of each of the sources of error on OTU inflation and to validate these algorithms. Results AmpliconNoise outperforms alternative algorithms substantially reducing per base error rates for both the GS FLX and latest Titanium protocol. All three sources of error lead to inflation of diversity estimates. In particular, chimera formation has a hitherto unrealised importance which varies according to amplification protocol. We show that AmpliconNoise allows accurate estimates of OTU number. Just as importantly AmpliconNoise generates the right OTUs even at low sequence differences. We demonstrate that Perseus has very high sensitivity, able to find 99% of chimeras, which is critical when these are present at high

  6. Accurate CpG and non-CpG cytosine methylation analysis by high-throughput locus-specific pyrosequencing in plants.

    Science.gov (United States)

    How-Kit, Alexandre; Daunay, Antoine; Mazaleyrat, Nicolas; Busato, Florence; Daviaud, Christian; Teyssier, Emeline; Deleuze, Jean-François; Gallusci, Philippe; Tost, Jörg

    2015-07-01

    Pyrosequencing permits accurate quantification of DNA methylation of specific regions where the proportions of the C/T polymorphism induced by sodium bisulfite treatment of DNA reflects the DNA methylation level. The commercially available high-throughput locus-specific pyrosequencing instruments allow for the simultaneous analysis of 96 samples, but restrict the DNA methylation analysis to CpG dinucleotide sites, which can be limiting in many biological systems. In contrast to mammals where DNA methylation occurs nearly exclusively on CpG dinucleotides, plants genomes harbor DNA methylation also in other sequence contexts including CHG and CHH motives, which cannot be evaluated by these pyrosequencing instruments due to software limitations. Here, we present a complete pipeline for accurate CpG and non-CpG cytosine methylation analysis at single base-resolution using high-throughput locus-specific pyrosequencing. The devised approach includes the design and validation of PCR amplification on bisulfite-treated DNA and pyrosequencing assays as well as the quantification of the methylation level at every cytosine from the raw peak intensities of the Pyrograms by two newly developed Visual Basic Applications. Our method presents accurate and reproducible results as exemplified by the cytosine methylation analysis of the promoter regions of two Tomato genes (NOR and CNR) encoding transcription regulators of fruit ripening during different stages of fruit development. Our results confirmed a significant and temporally coordinated loss of DNA methylation on specific cytosines during the early stages of fruit development in both promoters as previously shown by WGBS. The manuscript describes thus the first high-throughput locus-specific DNA methylation analysis in plants using pyrosequencing.

  7. Pyrosequencing analysis of the gyrB gene to differentiate bacteria responsible for diarrheal diseases.

    Science.gov (United States)

    Hou, X-L; Cao, Q-Y; Jia, H-Y; Chen, Z

    2008-07-01

    Pathogens causing acute diarrhea include a large variety of species from Enterobacteriaceae and Vibrionaceae. A method based on pyrosequencing was used here to differentiate bacteria commonly associated with diarrhea in China; the method is targeted to a partial amplicon of the gyrB gene, which encodes the B subunit of DNA gyrase. Twenty-eight specific polymorphic positions were identified from sequence alignment of a large sequence dataset and targeted using 17 sequencing primers. Of 95 isolates tested, belonging to 13 species within 7 genera, most could be identified to the species level; O157 type could be differentiated from other E. coli types; Salmonella enterica subsp. enterica could be identified at the serotype level; the genus Shigella, except for S. boydii and S. dysenteriae, could also be identified. All these isolates were also subjected to conventional sequencing of a relatively long ( approximately1.2 kb) region of gyrB DNA; these results confirmed those with pyrosequencing. Twenty-two fecal samples were surveyed, the results of which were concordant with culture-based bacterial identification, and the pathogen detection limit with simulated stool specimens was 10(4) CFU/ml. DNA from different pathogens was also mixed to simulate a case of multibacterial infection, and the generated signals correlated well with the mix ratio. In summary, the gyrB-based pyrosequencing approach proved to have significant reliability and discriminatory power for enteropathogenic bacterial identification and provided a fast and effective method for clinical diagnosis.

  8. Rapid detection and identification of Bacillus anthracis in food using pyrosequencing technology.

    Science.gov (United States)

    Amoako, Kingsley K; Janzen, Timothy W; Shields, Michael J; Hahn, Kristen R; Thomas, Matthew C; Goji, Noriko

    2013-08-01

    The development of advanced methodologies for the detection of Bacillus anthracis has been evolving rapidly since the release of the anthrax spores in the mail in 2001. Recent advances in detection and identification techniques could prove to be an essential component in the defense against biological attacks. Sequence based such as pyrosequencing, which has the capability to determine short DNA stretches in real-time using biotinylated PCR amplicons, has potential biodefense applications. Using markers from the virulence plasmids (pXO1 and pXO2) and chromosomal regions, we have demonstrated the power of this technology in the rapid, specific and sensitive detection of B. anthracis spores in food matrices including milk, juice, bottled water, and processed meat. The combined use of immunomagnetic separation and pyrosequencing showed positive detection when liquid foods (bottled water, milk, juice), and processed meat were experimentally inoculated with 6CFU/mL and 6CFU/g, respectively, without an enrichment step. Pyrosequencing is completed in about 60min (following PCR amplification) and yields accurate and reliable results with an added layer of confidence. The entire assay (from sample preparation to sequencing information) can be completed in about 7.5h. A typical run on food samples yielded 67-80bp reads with 94-100% identity to the expected sequence. This sequence based approach is a novel application for the detection of anthrax spores in food with potential application in foodborne bioterrorism response and biodefense involving the use of anthrax spores. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.

  9. Transcriptome survey of Patagonian southern beech Nothofagus nervosa (= N. Alpina: assembly, annotation and molecular marker discovery

    Directory of Open Access Journals (Sweden)

    Torales Susana L

    2012-07-01

    Full Text Available Abstract Background Nothofagus nervosa is one of the most emblematic native tree species of Patagonian temperate forests. Here, the shotgun RNA-sequencing (RNA-Seq of the transcriptome of N. nervosa, including de novo assembly, functional annotation, and in silico discovery of potential molecular markers to support population and associations genetic studies, are described. Results Pyrosequencing of a young leaf cDNA library generated a total of 111,814 high quality reads, with an average length of 447 bp. De novo assembly using Newbler resulted into 3,005 tentative isotigs (including alternative transcripts. The non-assembled sequences (singletons were clustered with CD-HIT-454 to identify natural and artificial duplicates from pyrosequencing reads, leading to 21,881 unique singletons. 15,497 out of 24,886 non-redundant sequences or unigenes, were successfully annotated against a plant protein database. A substantial number of simple sequence repeat markers (SSRs were discovered in the assembled and annotated sequences. More than 40% of the SSR sequences were inside ORF sequences. To confirm the validity of these predicted markers, a subset of 73 SSRs selected through functional annotation evidences were successfully amplified from six seedlings DNA samples, being 14 polymorphic. Conclusions This paper is the first report that shows a highly precise representation of the mRNAs diversity present in young leaves of a native South American tree, N. nervosa, as well as its in silico deduced putative functionality. The reported Nothofagus transcriptome sequences represent a unique resource for genetic studies and provide a tool to discover genes of interest and genetic markers that will greatly aid questions involving evolution, ecology, and conservation using genetic and genomic approaches in the genus.

  10. Transcriptome sequencing of the Antarctic vascular plant Deschampsia antarctica Desv. under abiotic stress.

    Science.gov (United States)

    Lee, Jungeun; Noh, Eun Kyeung; Choi, Hyung-Seok; Shin, Seung Chul; Park, Hyun; Lee, Hyoungseok

    2013-03-01

    Antarctic hairgrass (Deschampsia antarctica Desv.) is the only natural grass species in the maritime Antarctic. It has been studied as an extremophile that has successfully adapted to marginal land with the harshest environment for terrestrial plants. However, limited genetic research has focused on this species due to the lack of genomic resources. Here, we present the first de novo assembly of its transcriptome by massive parallel sequencing and its expression profile using D. antarctica grown under various stress conditions. Total sequence reads generated by pyrosequencing were assembled into 60,765 unigenes (28,177 contigs and 32,588 singletons). A total of 29,173 unique protein-coding genes were identified based on sequence similarities to known proteins. The combined results from all three stress conditions indicated differential expression of 3,110 genes. Quantitative reverse transcription polymerase chain reaction showed that several well-known stress-responsive genes encoding late embryogenesis abundant protein, dehydrin 1, and ice recrystallization inhibition protein were induced dramatically and that genes encoding U-box-domain-containing protein, electron transfer flavoprotein-ubiquinone, and F-box-containing protein were induced by abiotic stressors in a manner conserved with other plant species. We identified more than 2,000 simple sequence repeats that can be developed as functional molecular markers. This dataset is the most comprehensive transcriptome resource currently available for D. antarctica and is therefore expected to be an important foundation for future genetic studies of grasses and extremophiles.

  11. Comparative analysis of transcriptomes in aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing

    Directory of Open Access Journals (Sweden)

    Taketo Okada

    2016-12-01

    Full Text Available Ephedra plants are taxonomically classified as gymnosperms, and are medicinally important as the botanical origin of crude drugs and as bioresources that contain pharmacologically active chemicals. Here we show a comparative analysis of the transcriptomes of aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing by RNA-Seq. De novo assembly of short cDNA sequence reads generated 23,358, 13,373, and 28,579 contigs longer than 200 bases from aerial stems, roots, or both aerial stems and roots, respectively. The presumed functions encoded by these contig sequences were annotated by BLAST (blastx. Subsequently, these contigs were classified based on gene ontology slims, Enzyme Commission numbers, and the InterPro database. Furthermore, comparative gene expression analysis was performed between aerial stems and roots. These transcriptome analyses revealed differences and similarities between the transcriptomes of aerial stems and roots in E. sinica. Deep transcriptome sequencing of Ephedra should open the door to molecular biological studies based on the entire transcriptome, tissue- or organ-specific transcriptomes, or targeted genes of interest.

  12. Transcriptome

    Science.gov (United States)

    ... Also: Talking Glossary of Genetic Terms Definitions for genetic terms used on this page En Español: Transcriptoma Transcriptome What is a transcriptome? What can a transcriptome tell us? How can transcriptome data be used to explore gene function? What is ...

  13. Complete genome sequence of a novel Plum pox virus strain W isolate determined by 454 pyrosequencing.

    Science.gov (United States)

    Sheveleva, Anna; Kudryavtseva, Anna; Speranskaya, Anna; Belenikin, Maxim; Melnikova, Natalia; Chirkov, Sergei

    2013-10-01

    The near-complete (99.7 %) genome sequence of a novel Russian Plum pox virus (PPV) isolate Pk, belonging to the strain Winona (W), has been determined by 454 pyrosequencing with the exception of the thirty-one 5'-terminal nucleotides. This region was amplified using 5'RACE kit and sequenced by the Sanger method. Genomic RNA released from immunocaptured PPV particles was employed for generation of cDNA library using TransPlex Whole transcriptome amplification kit (WTA2, Sigma-Aldrich). The entire Pk genome has identity level of 92.8-94.5 % when compared to the complete nucleotide sequences of other PPV-W isolates (W3174, LV-141pl, LV-145bt, and UKR 44189), confirming a high degree of variability within the PPV-W strain. The isolates Pk and LV-141pl are most closely related. The Pk has been found in a wild plum (Prunus domestica) in a new region of Russia indicating widespread dissemination of the PPV-W strain in the European part of the former USSR.

  14. Transcriptome analysis of the honey bee fungal pathogen, Ascosphaera apis: implications for host pathogenesis

    Directory of Open Access Journals (Sweden)

    Cornman R

    2012-06-01

    Full Text Available Abstract Background We present a comprehensive transcriptome analysis of the fungus Ascosphaera apis, an economically important pathogen of the Western honey bee (Apis mellifera that causes chalkbrood disease. Our goals were to further annotate the A. apis reference genome and to identify genes that are candidates for being differentially expressed during host infection versus axenic culture. Results We compared A. apis transcriptome sequence from mycelia grown on liquid or solid media with that dissected from host-infected tissue. 454 pyrosequencing provided 252 Mb of filtered sequence reads from both culture types that were assembled into 10,087 contigs. Transcript contigs, protein sequences from multiple fungal species, and ab initio gene predictions were included as evidence sources in the Maker gene prediction pipeline, resulting in 6,992 consensus gene models. A phylogeny based on 12 of these protein-coding loci further supported the taxonomic placement of Ascosphaera as sister to the core Onygenales. Several common protein domains were less abundant in A. apis compared with related ascomycete genomes, particularly cytochrome p450 and protein kinase domains. A novel gene family was identified that has expanded in some ascomycete lineages, but not others. We manually annotated genes with homologs in other fungal genomes that have known relevance to fungal virulence and life history. Functional categories of interest included genes involved in mating-type specification, intracellular signal transduction, and stress response. Computational and manual annotations have been made publicly available on the Bee Pests and Pathogens website. Conclusions This comprehensive transcriptome analysis substantially enhances our understanding of the A. apis genome and its expression during infection of honey bee larvae. It also provides resources for future molecular studies of chalkbrood disease and ultimately improved disease management.

  15. Transcriptome analysis of the honey bee fungal pathogen, Ascosphaera apis: implications for host pathogenesis

    Science.gov (United States)

    2012-01-01

    Background We present a comprehensive transcriptome analysis of the fungus Ascosphaera apis, an economically important pathogen of the Western honey bee (Apis mellifera) that causes chalkbrood disease. Our goals were to further annotate the A. apis reference genome and to identify genes that are candidates for being differentially expressed during host infection versus axenic culture. Results We compared A. apis transcriptome sequence from mycelia grown on liquid or solid media with that dissected from host-infected tissue. 454 pyrosequencing provided 252 Mb of filtered sequence reads from both culture types that were assembled into 10,087 contigs. Transcript contigs, protein sequences from multiple fungal species, and ab initio gene predictions were included as evidence sources in the Maker gene prediction pipeline, resulting in 6,992 consensus gene models. A phylogeny based on 12 of these protein-coding loci further supported the taxonomic placement of Ascosphaera as sister to the core Onygenales. Several common protein domains were less abundant in A. apis compared with related ascomycete genomes, particularly cytochrome p450 and protein kinase domains. A novel gene family was identified that has expanded in some ascomycete lineages, but not others. We manually annotated genes with homologs in other fungal genomes that have known relevance to fungal virulence and life history. Functional categories of interest included genes involved in mating-type specification, intracellular signal transduction, and stress response. Computational and manual annotations have been made publicly available on the Bee Pests and Pathogens website. Conclusions This comprehensive transcriptome analysis substantially enhances our understanding of the A. apis genome and its expression during infection of honey bee larvae. It also provides resources for future molecular studies of chalkbrood disease and ultimately improved disease management. PMID:22747707

  16. An Integrated Transcriptome-Wide Analysis of Cave and Surface Dwelling Astyanax mexicanus

    Science.gov (United States)

    Gross, Joshua B.; Furterer, Allison; Carlson, Brian M.; Stahl, Bethany A.

    2013-01-01

    Numerous organisms around the globe have successfully adapted to subterranean environments. A powerful system in which to study cave adaptation is the freshwater characin fish, Astyanax mexicanus. Prior studies in this system have established a genetic basis for the evolution of numerous regressive traits, most notably vision and pigmentation reduction. However, identification of the precise genetic alterations that underlie these morphological changes has been delayed by limited genetic and genomic resources. To address this, we performed a transcriptome analysis of cave and surface dwelling Astyanax morphs using Roche/454 pyrosequencing technology. Through this approach, we obtained 576,197 Pachón cavefish-specific reads and 438,978 surface fish-specific reads. Using this dataset, we assembled transcriptomes of cave and surface fish separately, as well as an integrated transcriptome that combined 1,499,568 reads from both morphotypes. The integrated assembly was the most successful approach, yielding 22,596 high quality contiguous sequences comprising a total transcriptome length of 21,363,556 bp. Sequence identities were obtained through exhaustive blast searches, revealing an adult transcriptome represented by highly diverse Gene Ontology (GO) terms. Our dataset facilitated rapid identification of sequence polymorphisms between morphotypes. These data, along with positional information collected from the Danio rerio genome, revealed several syntenic regions between Astyanax and Danio. We demonstrated the utility of this positional information through a QTL analysis of albinism in a surface x Pachón cave F2 pedigree, using 65 polymorphic markers identified from our integrated assembly. We also adapted our dataset for an RNA-seq study, revealing many genes responsible for visual system maintenance in surface fish, whose expression was not detected in adult Pachón cavefish. Conversely, several metabolism-related genes expressed in cavefish were not detected in

  17. Pyrosequencing for Rapid Detection of Mycobacterium tuberculosis Resistance to Rifampin, Isoniazid, and Fluoroquinolones ▿

    Science.gov (United States)

    Bravo, Lulette Tricia C.; Tuohy, Marion J.; Ang, Concepcion; Destura, Raul V.; Mendoza, Myrna; Procop, Gary W.; Gordon, Steven M.; Hall, Geraldine S.; Shrestha, Nabin K.

    2009-01-01

    After isoniazid and rifampin (rifampicin), the next pivotal drug class in Mycobacterium tuberculosis treatment is the fluoroquinolone class. Mutations in resistance-determining regions (RDR) of the rpoB, katG, and gyrA genes occur with frequencies of 97%, 50%, and 85% among M. tuberculosis isolates resistant to rifampin, isoniazid, and fluoroquinolones, respectively. Sequences are highly conserved, and certain mutations correlate well with phenotypic resistance. We developed a pyrosequencing assay to determine M. tuberculosis genotypic resistance to rifampin, isoniazid, and fluoroquinolones. We characterized 102 M. tuberculosis clinical isolates from the Philippines for susceptibility to rifampin, isoniazid, and ofloxacin by using the conventional submerged-disk proportion method and validated our pyrosequencing assay using these isolates. DNA was extracted and amplified by using PCR primers directed toward the RDR of the rpoB, katG, and gyrA genes, and pyrosequencing was performed on the extracts. The M. tuberculosis H37Rv strain (ATCC 25618) was used as the reference strain. The sensitivities and specificities of pyrosequencing were 96.7% and 97.3%, 63.8% and 100%, and 70.0% and 100% for the detection of resistance to rifampin, isoniazid, and ofloxacin, respectively. Pyrosequencing is thus a rapid and accurate method for detecting M. tuberculosis resistance to these three drugs. PMID:19846642

  18. Rapid strategy for screening by pyrosequencing of influenza virus reassortants--candidates for live attenuated vaccines.

    Science.gov (United States)

    Shcherbik, Svetlana V; Pearce, Nicholas C; Levine, Marnie L; Klimov, Alexander I; Villanueva, Julie M; Bousse, Tatiana L

    2014-01-01

    Live attenuated influenza vaccine viruses (LAIVs) can be generated by classical reassortment of gene segments between a cold adapted, temperature sensitive and attenuated Master Donor Virus (MDV) and a seasonal wild-type (wt) virus. The vaccine candidates contain hemagglutinin (HA) and neuraminidase (NA) genes derived from the circulating wt viruses and the remaining six genes derived from the MDV strains. Rapid, efficient selection of the viruses with 6∶2 genome compositions from the large number of genetically different viruses generated during reassortment is essential for the biannual production schedule of vaccine viruses. This manuscript describes a new approach for the genotypic analysis of LAIV reassortant virus clones based on pyrosequencing. LAIV candidate viruses were created by classical reassortment of seasonal influenza A (H3N2) (A/Victoria/361/2011, A/Ohio/02/2012, A/Texas/50/2012) or influenza A (H7N9) (A/Anhui/1/2013) wt viruses with the MDV A/Leningrad/134/17/57(H2N2). Using strain-specific pyrosequencing assays, mixed gene variations were detected in the allantoic progenies during the cloning procedure. The pyrosequencing analysis also allowed for estimation of the relative abundance of segment variants in mixed populations. This semi-quantitative approach was used for selecting specific clones for the subsequent cloning procedures. The present study demonstrates that pyrosequencing analysis is a useful technique for rapid and reliable genotyping of reassortants and intermediate clones during the preparation of LAIV candidates, and can expedite the selection of vaccine virus candidates.

  19. Rapid strategy for screening by pyrosequencing of influenza virus reassortants--candidates for live attenuated vaccines.

    Directory of Open Access Journals (Sweden)

    Svetlana V Shcherbik

    Full Text Available BACKGROUND: Live attenuated influenza vaccine viruses (LAIVs can be generated by classical reassortment of gene segments between a cold adapted, temperature sensitive and attenuated Master Donor Virus (MDV and a seasonal wild-type (wt virus. The vaccine candidates contain hemagglutinin (HA and neuraminidase (NA genes derived from the circulating wt viruses and the remaining six genes derived from the MDV strains. Rapid, efficient selection of the viruses with 6∶2 genome compositions from the large number of genetically different viruses generated during reassortment is essential for the biannual production schedule of vaccine viruses. METHODOLOGY/PRINCIPAL FINDINGS: This manuscript describes a new approach for the genotypic analysis of LAIV reassortant virus clones based on pyrosequencing. LAIV candidate viruses were created by classical reassortment of seasonal influenza A (H3N2 (A/Victoria/361/2011, A/Ohio/02/2012, A/Texas/50/2012 or influenza A (H7N9 (A/Anhui/1/2013 wt viruses with the MDV A/Leningrad/134/17/57(H2N2. Using strain-specific pyrosequencing assays, mixed gene variations were detected in the allantoic progenies during the cloning procedure. The pyrosequencing analysis also allowed for estimation of the relative abundance of segment variants in mixed populations. This semi-quantitative approach was used for selecting specific clones for the subsequent cloning procedures. CONCLUSIONS/SIGNIFICANCE: The present study demonstrates that pyrosequencing analysis is a useful technique for rapid and reliable genotyping of reassortants and intermediate clones during the preparation of LAIV candidates, and can expedite the selection of vaccine virus candidates.

  20. Tracking fungal community responses to maize plants by DNA- and RNA-based pyrosequencing.

    Directory of Open Access Journals (Sweden)

    Eiko E Kuramae

    Full Text Available We assessed soil fungal diversity and community structure at two sampling times (t1 = 47 days and t2 = 104 days of plant age in pots associated with four maize cultivars, including two genetically modified (GM cultivars by high-throughput pyrosequencing of the 18S rRNA gene using DNA and RNA templates. We detected no significant differences in soil fungal diversity and community structure associated with different plant cultivars. However, DNA-based analyses yielded lower fungal OTU richness as compared to RNA-based analyses. Clear differences in fungal community structure were also observed in relation to sampling time and the nucleic acid pool targeted (DNA versus RNA. The most abundant soil fungi, as recovered by DNA-based methods, did not necessary represent the most "active" fungi (as recovered via RNA. Interestingly, RNA-derived community compositions at t1 were highly similar to DNA-derived communities at t2, based on presence/absence measures of OTUs. We recovered large proportions of fungal sequences belonging to arbuscular mycorrhizal fungi and Basidiomycota, especially at the RNA level, suggesting that these important and potentially beneficial fungi are not affected by the plant cultivars nor by GM traits (Bt toxin production. Our results suggest that even though DNA- and RNA-derived soil fungal communities can be very different at a given time, RNA composition may have a predictive power of fungal community development through time.

  1. Identification of strong promoters based on the transcriptome of Bacillus licheniformis.

    Science.gov (United States)

    Liu, Xin; Yang, Haiyan; Zheng, Junwei; Ye, Yanrui; Pan, Li

    2017-06-01

    To expand the repertoire of strong promoters for high level expression of proteins based on the transcriptome of Bacillus licheniformis. The transcriptome of B. licheniformis ATCC14580 grown to the early stationary phase was analyzed and the top 10 highly expressed genes/operons out of the 3959 genes and 1249 operons identified were chosen for study promoter activity. Using beta-galactosidase gene as a reporter, the candidate promoter pBL9 exhibited the strongest activity which was comparable to that of the widely used strong promoter p43. Furthermore, the pro-transglutaminase from Streptomyces mobaraensis (pro-MTG) was expressed under the control of promoter pBL9 and the activity of pro-MTG reached 82 U/ml after 36 h, which is 23% higher than that of promoter p43 (66.8 U/ml). In our analyses of the transcriptome of B. licheniformis, we have identified a strong promoter pBL9, which could be adapted for high level expression of proteins in the host Bacillus subtilis.

  2. Community analysis of chronic wound bacteria using 16S rRNA gene-based pyrosequencing: impact of diabetes and antibiotics on chronic wound microbiota.

    Directory of Open Access Journals (Sweden)

    Lance B Price

    Full Text Available BACKGROUND: Bacterial colonization is hypothesized to play a pathogenic role in the non-healing state of chronic wounds. We characterized wound bacteria from a cohort of chronic wound patients using a 16S rRNA gene-based pyrosequencing approach and assessed the impact of diabetes and antibiotics on chronic wound microbiota. METHODOLOGY/PRINCIPAL FINDINGS: We prospectively enrolled 24 patients at a referral wound center in Baltimore, MD; sampled patients' wounds by curette; cultured samples under aerobic and anaerobic conditions; and pyrosequenced the 16S rRNA V3 hypervariable region. The 16S rRNA gene-based analyses revealed an average of 10 different bacterial families in wounds--approximately 4 times more than estimated by culture-based analyses. Fastidious anaerobic bacteria belonging to the Clostridiales family XI were among the most prevalent bacteria identified exclusively by 16S rRNA gene-based analyses. Community-scale analyses showed that wound microbiota from antibiotic treated patients were significantly different from untreated patients (p = 0.007 and were characterized by increased Pseudomonadaceae abundance. These analyses also revealed that antibiotic use was associated with decreased Streptococcaceae among diabetics and that Streptococcaceae was more abundant among diabetics as compared to non-diabetics. CONCLUSIONS/SIGNIFICANCE: The 16S rRNA gene-based analyses revealed complex bacterial communities including anaerobic bacteria that may play causative roles in the non-healing state of some chronic wounds. Our data suggest that antimicrobial therapy alters community structure--reducing some bacteria while selecting for others.

  3. New approach for the study of mite reproduction: The first transcriptome analysis of a mite, Phytoseiulus persimilis (Acari: Phytoseiidae).

    Science.gov (United States)

    Cabrera, Ana R; Donohue, Kevin V; Khalil, Sayed M S; Scholl, Elizabeth; Opperman, Charles; Sonenshine, Daniel E; Roe, R Michael

    2011-01-01

    Many species of mites and ticks are of agricultural and medical importance. Much can be learned from the study of transcriptomes of acarines which can generate DNA-sequence information of potential target genes for the control of acarine pests. High throughput transcriptome sequencing can also yield sequences of genes critical during physiological processes poorly understood in acarines, i.e., the regulation of female reproduction in mites. The predatory mite, Phytoseiulus persimilis, was selected to conduct a transcriptome analysis using 454 pyrosequencing. The objective of this project was to obtain DNA-sequence information of expressed genes from P. persimilis with special interest in sequences corresponding to vitellogenin (Vg) and the vitellogenin receptor (VgR). These genes are critical to the understanding of vitellogenesis, and they will facilitate the study of the regulation of mite female reproduction. A total of 12,556 contiguous sequences (contigs) were assembled with an average size of 935bp. From these sequences, the putative translated peptides of 11 contigs were similar in amino acid sequences to other arthropod Vgs, while 6 were similar to VgRs. We selected some of these sequences to conduct stage-specific expression studies to further determine their function. 2010 Elsevier Ltd. All rights reserved.

  4. Massive sequencing of Ulmus minor's transcriptome provides new molecular tools for a genus under the constant threat of Dutch elm disease

    Directory of Open Access Journals (Sweden)

    Pedro ePerdiguero

    2015-07-01

    Full Text Available Elms, especially Ulmus minor and Ulmus americana, are carrying out a hard battle against Dutch elm disease (DED. This vascular wilt disease, caused by Ophiostoma ulmi and O. novo-ulmi, appeared in the twentieth century and killed millions of elms across North America and Europe. Elm breeding and conservation programmes have identified a reduced number of DED tolerant genotypes. In this study, three U. minor genotypes with contrasted levels of tolerance to DED were exposed to several biotic and abiotic stresses in order to (i obtain a de novo assembled transcriptome of U. minor using 454 pyrosequencing, (ii perform a functional annotation of the assembled transcriptome, (iii identify genes potentially involved in the molecular response to environmental stress, and (iv develop gene-based markers to support breeding programmes. A total of 58,429 putative unigenes were identified after assembly and filtering of the transcriptome. 32,152 of these unigenes showed homology with proteins identified in the genome from the most common plant model species. Well-known family proteins and transcription factors involved in abiotic, biotic or both stresses were identified after functional annotation. A total of 30,693 polymorphisms were identified in 7,125 isotigs, a large number of them corresponding to SNPs (27,359. In a subset randomly selected for validation, 87 % of the SNPs were confirmed. The material generated may be valuable for future Ulmus gene expression, population genomics and association genetics studies, especially taking into account the scarce molecular information available for this genus and the great impact that DED has on elm populations.

  5. Efficient alignment of pyrosequencing reads for re-sequencing applications

    Directory of Open Access Journals (Sweden)

    Russo Luis MS

    2011-05-01

    Full Text Available Abstract Background Over the past few years, new massively parallel DNA sequencing technologies have emerged. These platforms generate massive amounts of data per run, greatly reducing the cost of DNA sequencing. However, these techniques also raise important computational difficulties mostly due to the huge volume of data produced, but also because of some of their specific characteristics such as read length and sequencing errors. Among the most critical problems is that of efficiently and accurately mapping reads to a reference genome in the context of re-sequencing projects. Results We present an efficient method for the local alignment of pyrosequencing reads produced by the GS FLX (454 system against a reference sequence. Our approach explores the characteristics of the data in these re-sequencing applications and uses state of the art indexing techniques combined with a flexible seed-based approach, leading to a fast and accurate algorithm which needs very little user parameterization. An evaluation performed using real and simulated data shows that our proposed method outperforms a number of mainstream tools on the quantity and quality of successful alignments, as well as on the execution time. Conclusions The proposed methodology was implemented in a software tool called TAPyR--Tool for the Alignment of Pyrosequencing Reads--which is publicly available from http://www.tapyr.net.

  6. Comparative transcriptome analysis of the Asteraceae halophyte Karelinia caspica under salt stress.

    Science.gov (United States)

    Zhang, Xia; Liao, Maoseng; Chang, Dan; Zhang, Fuchun

    2014-12-17

    Much attention has been given to the potential of halophytes as sources of tolerance traits for introduction into cereals. However, a great deal remains unknown about the diverse mechanisms employed by halophytes to cope with salinity. To characterize salt tolerance mechanisms underlying Karelinia caspica, an Asteraceae halophyte, we performed Large-scale transcriptomic analysis using a high-throughput Illumina sequencing platform. Comparative gene expression analysis was performed to correlate the effects of salt stress and ABA regulation at the molecular level. Total sequence reads generated by pyrosequencing were assembled into 287,185 non-redundant transcripts with an average length of 652 bp. Using the BLAST function in the Swiss-Prot, NCBI nr, GO, KEGG, and KOG databases, a total of 216,416 coding sequences associated with known proteins were annotated. Among these, 35,533 unigenes were classified into 69 gene ontology categories, and 18,378 unigenes were classified into 202 known pathways. Based on the fold changes observed when comparing the salt stress and control samples, 60,127 unigenes were differentially expressed, with 38,122 and 22,005 up- and down-regulated, respectively. Several of the differentially expressed genes are known to be involved in the signaling pathway of the plant hormone ABA, including ABA metabolism, transport, and sensing as well as the ABA signaling cascade. Transcriptome profiling of K. caspica contribute to a comprehensive understanding of K. caspica at the molecular level. Moreover, the global survey of differentially expressed genes in this species under salt stress and analyses of the effects of salt stress and ABA regulation will contribute to the identification and characterization of genes and molecular mechanisms underlying salt stress responses in Asteraceae plants.

  7. Investigation of the koala (Phascolarctos cinereus) hindgut microbiome via 16S pyrosequencing.

    Science.gov (United States)

    Barker, Christopher J; Gillett, Amber; Polkinghorne, Adam; Timms, Peter

    2013-12-27

    As a dietary source, the foliage of Eucalyptus spp. is low in available protein and carbohydrate while containing polyphenolic compounds that interfere with enzymatic digestion. To overcome this, the koala (Phascolarctos cinereus) has evolved a range of anatomical and physiological adaptations to assist with digestion and absorption of nutrients from this food source. Microbial fermentation of partially digested eucalyptus leaves is thought to be critical in this process, however, little is known about the composition and diversity of microorganisms that are associated with digestive health in this native species. In this study, we performed 16S rRNA gene pyrosequencing of caecum, colon and faecal pellet samples from two wild, free ranging, Queensland koalas. Our results reveal a highly complex and diverse ecosystem with considerable intra-individual variation. Although samples were dominated by sequences from the Bacteroidetes and Firmicutes phyla there was considerable variation at the genus level. This study is the first non-culture based microbiota analysis, using 454-amplicon pyrosequencing, and provides preliminary data to expand our understanding of the koala hindgut. Copyright © 2013 Elsevier B.V. All rights reserved.

  8. Pyrosequencing-Based Assays for Rapid Detection of HER2 and HER3 Mutations in Clinical Samples Uncover an E332E Mutation Affecting HER3 in Retroperitoneal Leiomyosarcoma

    Directory of Open Access Journals (Sweden)

    Paula González-Alonso

    2015-08-01

    Full Text Available Mutations in Human Epidermal Growth Factor Receptors (HER are associated with poor prognosis of several types of solid tumors. Although HER-mutation detection methods are currently available, such as Next-Generation Sequencing (NGS, alternative pyrosequencing allow the rapid characterization of specific mutations. We developed specific PCR-based pyrosequencing assays for identification of most prevalent HER2 and HER3 mutations, including S310F/Y, R678Q, L755M/P/S/W, V777A/L/M, 774-776 insertion, and V842I mutations in HER2, as well as M91I, V104M/L, D297N/V/Y, and E332E/K mutations in HER3. We tested 85 Formalin Fixed and Paraffin Embbeded (FFPE samples and we detected three HER2-V842I mutations in colorectal carcinoma (CRC, ovarian carcinoma, and pancreatic carcinoma patients, respectively, and a HER2-L755M mutation in a CRC specimen. We also determined the presence of a HER3-E332K mutation in an urothelial carcinoma sample, and two HER3-D297Y mutations, in both gastric adenocarcinoma and CRC specimens. The D297Y mutation was previously detected in breast and gastric tumors, but not in CRC. Moreover, we found a not-previously-described HER3-E332E synonymous mutation in a retroperitoneal leiomyosarcoma patient. The pyrosequencing assays presented here allow the detection and characterization of specific HER2 and HER3 mutations. These pyrosequencing assays might be implemented in routine diagnosis for molecular characterization of HER2/HER3 receptors as an alternative to complex NGS approaches.

  9. Transcriptome resources for the perennial sunflower Helianthus maximiliani obtained from ecologically divergent populations.

    Science.gov (United States)

    Kawakami, Takeshi; Darby, Brian J; Ungerer, Mark C

    2014-07-01

    Next-generation sequencing (NGS) technologies provide a rapid means to generate genomic resources for species exhibiting interesting ecological and evolutionary variation but for which such resources are scant or nonexistent. In the current report, we utilize 454 pyrosequencing to obtain transcriptome information for multiple individuals and tissue types from geographically disparate and ecologically differentiated populations of the perennial sunflower species Helianthus maximiliani. A total of 850 275 raw reads were obtained averaging 355 bp in length. Reads were assembled, postprocessing, into 16 681 unique contigs with an N50 of 898 bp and a total length of 13.6 Mb. A majority (67%) of these contigs were annotated based on comparison with the Arabidopsis thaliana genome (TAIR10). Contigs were identified that exhibit high similarity to genes associated with natural variation in flowering time and freezing tolerance in other plant species and will facilitate future studies aimed at elucidating the molecular basis of clinal life history variation and adaptive differentiation in H. maximiliani. Large numbers of gene-associated simple sequence repeats (SSRs) and single-nucleotide polymorphisms (SNPs) also were identified that can be deployed in mapping and population genomic analyses. © 2014 John Wiley & Sons Ltd.

  10. Investigating bacterial populations in styrene-degrading biofilters by 16S rDNA tag pyrosequencing.

    Science.gov (United States)

    Portune, Kevin J; Pérez, M Carmen; Álvarez-Hornos, F Javier; Gabaldón, Carmen

    2015-01-01

    Microbial biofilms are essential components in the elimination of pollutants within biofilters, yet still little is known regarding the complex relationships between microbial community structure and biodegradation function within these engineered ecosystems. To further explore this relationship, 16S rDNA tag pyrosequencing was applied to samples taken at four time points from a styrene-degrading biofilter undergoing variable operating conditions. Changes in microbial structure were observed between different stages of biofilter operation, and the level of styrene concentration was revealed to be a critical factor affecting these changes. Bacterial genera Azoarcus and Pseudomonas were among the dominant classified genera in the biofilter. Canonical correspondence analysis (CCA) and correlation analysis revealed that the genera Brevundimonas, Hydrogenophaga, and Achromobacter may play important roles in styrene degradation under increasing styrene concentrations. No significant correlations (P > 0.05) could be detected between biofilter operational/functional parameters and biodiversity measurements, although biological heterogeneity within biofilms and/or technical variability within pyrosequencing may have considerably affected these results. Percentages of selected bacterial taxonomic groups detected by fluorescence in situ hybridization (FISH) were compared to results from pyrosequencing in order to assess the effectiveness and limitations of each method for identifying each microbial taxon. Comparison of results revealed discrepancies between the two methods in the detected percentages of numerous taxonomic groups. Biases and technical limitations of both FISH and pyrosequencing, such as the binding of FISH probes to non-target microbial groups and lack of classification of sequences for defined taxonomic groups from pyrosequencing, may partially explain some differences between the two methods.

  11. Rapid detection of the CYP2A6*12 hybrid allele by Pyrosequencing® technology

    Directory of Open Access Journals (Sweden)

    Gallagher Margaret L

    2009-08-01

    Full Text Available Abstract Background Identification of CYP2A6 alleles associated with reduced enzyme activity is important in the study of inter-individual differences in drug metabolism. CYP2A6*12 is a hybrid allele that results from unequal crossover between CYP2A6 and CYP2A7 genes. The 5' regulatory region and exons 1–2 are derived from CYP2A7, and exons 3–9 are derived from CYP2A6. Conventional methods for detection of CYP2A6*12 consist of two-step PCR protocols that are laborious and unsuitable for high-throughput genotyping. We developed a rapid and accurate method to detect the CYP2A6*12 allele by Pyrosequencing technology. Methods A single set of PCR primers was designed to specifically amplify both the CYP2A6*1 wild-type allele and the CYP2A6*12 hybrid allele. An internal Pyrosequencing primer was used to generate allele-specific sequence information, which detected homozygous wild-type, heterozygous hybrid, and homozygous hybrid alleles. We first validated the assay on 104 DNA samples that were also genotyped by conventional two-step PCR and by cycle sequencing. CYP2A6*12 allele frequencies were then determined using the Pyrosequencing assay on 181 multi-ethnic DNA samples from subjects of African American, European Caucasian, Pacific Rim, and Hispanic descent. Finally, we streamlined the Pyrosequencing assay by integrating liquid handling robotics into the workflow. Results Pyrosequencing results demonstrated 100% concordance with conventional two-step PCR and cycle sequencing methods. Allele frequency data showed slightly higher prevalence of the CYP2A6*12 allele in European Caucasians and Hispanics. Conclusion This Pyrosequencing assay proved to be a simple, rapid, and accurate alternative to conventional methods, which can be easily adapted to the needs of higher-throughput studies.

  12. Transcriptomic analysis of Siberian ginseng (Eleutherococcus senticosus) to discover genes involved in saponin biosynthesis.

    Science.gov (United States)

    Hwang, Hwan-Su; Lee, Hyoshin; Choi, Yong Eui

    2015-03-14

    Eleutherococcus senticosus, Siberian ginseng, is a highly valued woody medicinal plant belonging to the family Araliaceae. E. senticosus produces a rich variety of saponins such as oleanane-type, noroleanane-type, 29-hydroxyoleanan-type, and lupane-type saponins. Genomic or transcriptomic approaches have not been used to investigate the saponin biosynthetic pathway in this plant. In this study, de novo sequencing was performed to select candidate genes involved in the saponin biosynthetic pathway. A half-plate 454 pyrosequencing run produced 627,923 high-quality reads with an average sequence length of 422 bases. De novo assembly generated 72,811 unique sequences, including 15,217 contigs and 57,594 singletons. Approximately 48,300 (66.3%) unique sequences were annotated using BLAST similarity searches. All of the mevalonate pathway genes for saponin biosynthesis starting from acetyl-CoA were isolated. Moreover, 206 reads of cytochrome P450 (CYP) and 145 reads of uridine diphosphate glycosyltransferase (UGT) sequences were isolated. Based on methyl jasmonate (MeJA) treatment and real-time PCR (qPCR) analysis, 3 CYPs and 3 UGTs were finally selected as candidate genes involved in the saponin biosynthetic pathway. The identified sequences associated with saponin biosynthesis will facilitate the study of the functional genomics of saponin biosynthesis and genetic engineering of E. senticosus.

  13. Swine transcriptome characterization by combined Iso-Seq and RNA-seq for annotating the emerging long read-based reference genome

    Science.gov (United States)

    PacBio long-read sequencing technology is increasingly popular in genome sequence assembly and transcriptome cataloguing. Recently, a new-generation pig reference genome was assembled based on long reads from this technology. To finely annotate this genome assembly, transcriptomes of nine tissues fr...

  14. Pyrosequencing and genetic diversity of microeukaryotes

    DEFF Research Database (Denmark)

    Harder, Christoffer Bugge

    carefully selected waterworks (Article IV), where the bacterial metabolic diversity and its important for water purification was described. Building on this, the most important part of the thesis consists of two pyrosequencing analyses of protozoa with newly developed 18S primers. One specifically targets...... Cercozoa, a particularly abundant phylum of protozoa (Article III), on heath land that had been subjected to prolonged artificially induced drought in a Danish free-air climate-manipulation experiment (CLIMAITE). Article III showed that the testate cercozoan forms responded negatively to prolonged drought...

  15. Pyrosequencing of environmental soil samples reveals biodiversity of the Phytophthora resident community in chestnut forests.

    Science.gov (United States)

    Vannini, Andrea; Bruni, Natalia; Tomassini, Alessia; Franceschini, Selma; Vettraino, Anna Maria

    2013-09-01

    Pyrosequencing analysis was performed on soils from Italian chestnut groves to evaluate the diversity of the resident Phytophthora community. Sequences analysed with a custom database discriminated 15 pathogenic Phytophthoras including species common to chestnut soils, while a total of nine species were detected with baiting. The two sites studied differed in Phytophthora diversity and the presence of specific taxa responded to specific ecological traits of the sites. Furthermore, some species not previously recorded were represented by a discrete number of reads; among these species, Phytophthora ramorum was detected at both sites. Pyrosequencing was demonstrated to be a very sensitive technique to describe the Phytophthora community in soil and was able to detect species not easy to be isolated from soil with standard baiting techniques. In particular, pyrosequencing is an highly efficient tool for investigating the colonization of new environments by alien species, and for ecological and adaptive studies coupled with biological detection methods. This study represents the first application of pyrosequencing for describing Phytophthoras in environmental soil samples. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  16. CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.

    Science.gov (United States)

    Li, Pei; Ji, Guoli; Dong, Min; Schmidt, Emily; Lenox, Douglas; Chen, Liangliang; Liu, Qi; Liu, Lin; Zhang, Jie; Liang, Chun

    2012-09-15

    To address the impending need for exploring rapidly increased transcriptomics data generated for non-model organisms, we developed CBrowse, an AJAX-based web browser for visualizing and analyzing transcriptome assemblies and contigs. Designed in a standard three-tier architecture with a data pre-processing pipeline, CBrowse is essentially a Rich Internet Application that offers many seamlessly integrated web interfaces and allows users to navigate, sort, filter, search and visualize data smoothly. The pre-processing pipeline takes the contig sequence file in FASTA format and its relevant SAM/BAM file as the input; detects putative polymorphisms, simple sequence repeats and sequencing errors in contigs and generates image, JSON and database-compatible CSV text files that are directly utilized by different web interfaces. CBowse is a generic visualization and analysis tool that facilitates close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors in transcriptome sequencing projects. CBrowse is distributed under the GNU General Public License, available at http://bioinfolab.muohio.edu/CBrowse/ liangc@muohio.edu or liangc.mu@gmail.com; glji@xmu.edu.cn Supplementary data are available at Bioinformatics online.

  17. PageRank-based identification of signaling crosstalk from transcriptomics data: the case of Arabidopsis thaliana.

    Science.gov (United States)

    Omranian, Nooshin; Mueller-Roeber, Bernd; Nikoloski, Zoran

    2012-04-01

    The levels of cellular organization, from gene transcription to translation to protein-protein interaction and metabolism, operate via tightly regulated mutual interactions, facilitating organismal adaptability and various stress responses. Characterizing the mutual interactions between genes, transcription factors, and proteins involved in signaling, termed crosstalk, is therefore crucial for understanding and controlling cells' functionality. We aim at using high-throughput transcriptomics data to discover previously unknown links between signaling networks. We propose and analyze a novel method for crosstalk identification which relies on transcriptomics data and overcomes the lack of complete information for signaling pathways in Arabidopsis thaliana. Our method first employs a network-based transformation of the results from the statistical analysis of differential gene expression in given groups of experiments under different signal-inducing conditions. The stationary distribution of a random walk (similar to the PageRank algorithm) on the constructed network is then used to determine the putative transcripts interrelating different signaling pathways. With the help of the proposed method, we analyze a transcriptomics data set including experiments from four different stresses/signals: nitrate, sulfur, iron, and hormones. We identified promising gene candidates, downstream of the transcription factors (TFs), associated to signaling crosstalk, which were validated through literature mining. In addition, we conduct a comparative analysis with the only other available method in this field which used a biclustering-based approach. Surprisingly, the biclustering-based approach fails to robustly identify any candidate genes involved in the crosstalk of the analyzed signals. We demonstrate that our proposed method is more robust in identifying gene candidates involved downstream of the signaling crosstalk for species for which large transcriptomics data sets

  18. Construction of coffee transcriptome networks based on gene annotation semantics

    Directory of Open Access Journals (Sweden)

    Castillo Luis F.

    2012-12-01

    Full Text Available Gene annotation is a process that encompasses multiple approaches on the analysis of nucleic acids or protein sequences in order to assign structural and functional characteristics to gene models. When thousands of gene models are being described in an organism genome, construction and visualization of gene networks impose novel challenges in the understanding of complex expression patterns and the generation of new knowledge in genomics research. In order to take advantage of accumulated text data after conventional gene sequence analysis, this work applied semantics in combination with visualization tools to build transcriptome networks from a set of coffee gene annotations. A set of selected coffee transcriptome sequences, chosen by the quality of the sequence comparison reported by Basic Local Alignment Search Tool (BLAST and Interproscan, were filtered out by coverage, identity, length of the query, and e-values. Meanwhile, term descriptors for molecular biology and biochemistry were obtained along the Wordnet dictionary in order to construct a Resource Description Framework (RDF using Ruby scripts and Methontology to find associations between concepts. Relationships between sequence annotations and semantic concepts were graphically represented through a total of 6845 oriented vectors, which were reduced to 745 non-redundant associations. A large gene network connecting transcripts by way of relational concepts was created where detailed connections remain to be validated for biological significance based on current biochemical and genetics frameworks. Besides reusing text information in the generation of gene connections and for data mining purposes, this tool development opens the possibility to visualize complex and abundant transcriptome data, and triggers the formulation of new hypotheses in metabolic pathways analysis.

  19. Rapid Molecular Identification of Human Taeniid Cestodes by Pyrosequencing Approach

    Science.gov (United States)

    Thanchomnang, Tongjit; Tantrawatpan, Chairat; Intapan, Pewpan M.; Sanpool, Oranuch; Janwan, Penchom; Lulitanond, Viraphong; Tourtip, Somjintana; Yamasaki, Hiroshi; Maleewong, Wanchai

    2014-01-01

    Taenia saginata, T. solium, and T. asiatica are causative agents of taeniasis in humans. The difficulty of morphological identification of human taeniids can lead to misdiagnosis or confusion. To overcome this problem, several molecular methods have been developed, but use of these tends to be time-consuming. Here, a rapid and high-throughput pyrosequencing approach was developed for the identification of three human taeniids originating from various countries. Primers targeting the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene of the three Taenia species were designed. Variations in a 26-nucleotide target region were used for identification. The reproducibility and accuracy of the pyrosequencing technology was confirmed by Sanger sequencing. This technique will be a valuable tool to distinguish between sympatric human taeniids that occur in Thailand, Asia and Pacific countries. This method could potentially be used for the molecular identification of the taeniid species that might be associated with suspicious cysts and lesions, or cyst residues in humans or livestock at the slaughterhouse. PMID:24945530

  20. Transcriptome Analysis in Sheepgrass (Leymus chinensis). A Dominant Perennial Grass of the Eurasian Steppe

    Energy Technology Data Exchange (ETDEWEB)

    Chen, Shuangyan [Chinese Academy of Sciences (CAS), Institute of Botany (IB), Beijing; Huang, Xin [Chinese Academy of Sciences (CAS), Institute of Botany (IB), Beijing; Yang, Xiaohan [Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Liu, Gongshe [Chinese Academy of Sciences (CAS), Institute of Botany (IB), Beijing

    2013-07-04

    BACKGROUND: Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. RESULTS: The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. CONCLUSIONS: This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.

  1. Transcriptome analysis in sheepgrass (Leymus chinensis): a dominant perennial grass of the Eurasian Steppe.

    Science.gov (United States)

    Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe

    2013-01-01

    Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.

  2. DIMM-SC: a Dirichlet mixture model for clustering droplet-based single cell transcriptomic data.

    Science.gov (United States)

    Sun, Zhe; Wang, Ting; Deng, Ke; Wang, Xiao-Feng; Lafyatis, Robert; Ding, Ying; Hu, Ming; Chen, Wei

    2018-01-01

    Single cell transcriptome sequencing (scRNA-Seq) has become a revolutionary tool to study cellular and molecular processes at single cell resolution. Among existing technologies, the recently developed droplet-based platform enables efficient parallel processing of thousands of single cells with direct counting of transcript copies using Unique Molecular Identifier (UMI). Despite the technology advances, statistical methods and computational tools are still lacking for analyzing droplet-based scRNA-Seq data. Particularly, model-based approaches for clustering large-scale single cell transcriptomic data are still under-explored. We developed DIMM-SC, a Dirichlet Mixture Model for clustering droplet-based Single Cell transcriptomic data. This approach explicitly models UMI count data from scRNA-Seq experiments and characterizes variations across different cell clusters via a Dirichlet mixture prior. We performed comprehensive simulations to evaluate DIMM-SC and compared it with existing clustering methods such as K-means, CellTree and Seurat. In addition, we analyzed public scRNA-Seq datasets with known cluster labels and in-house scRNA-Seq datasets from a study of systemic sclerosis with prior biological knowledge to benchmark and validate DIMM-SC. Both simulation studies and real data applications demonstrated that overall, DIMM-SC achieves substantially improved clustering accuracy and much lower clustering variability compared to other existing clustering methods. More importantly, as a model-based approach, DIMM-SC is able to quantify the clustering uncertainty for each single cell, facilitating rigorous statistical inference and biological interpretations, which are typically unavailable from existing clustering methods. DIMM-SC has been implemented in a user-friendly R package with a detailed tutorial available on www.pitt.edu/∼wec47/singlecell.html. wei.chen@chp.edu or hum@ccf.org. Supplementary data are available at Bioinformatics online. © The Author

  3. Pyrosequencing based assessment of bacterial diversity in Turkish Rhipicephalus annulatus and Dermacentor marginatus ticks (Acari: Ixodidae).

    Science.gov (United States)

    Tekin, Saban; Dowd, Scot E; Davinic, Marko; Bursali, Ahmet; Keskin, Adem

    2017-03-01

    Ticks continue to be a threat to human and animal health in Turkey, as they are considered important vectors of human and animal diseases. The objectives of this investigation are to characterize the microbial communities of two tick species, Rhipicephalus annulatus and Dermacenter marginatus, analyze patterns of co-occurrence among microbial taxa, identify and compare pathogens contributing human diseases, and determine whether avirulent symbionts could exclude human pathogens from tick communities. Furthermore, this study explores a microbiome of the R. annulatus and D. marginatus via the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP) technique to describe their bacterial diversity. Pyrosequencing was performed on adult males and females isolated from humans from two high-risk Turkish provinces, Sivas and Amasya, during tick outbreaks in 2009. A total of 36,253 sequences were utilized for analyses of the 8 tick samples. Several pathogenic genera such as Francisella, Coxiella, Rickettsia, and Shigella were detected in the ticks tested. The most distinguishable difference between the two species of ticks was the lack of known human pathogen Rickettsia in R. annulatus and in samples 9 and 10 of D. marginatus. These samples had higher relative abundance of Flavobacterium sp., Curvibacter sp., Acidovorax sp., and Bacteroidaceae genera mostly representing symbionts which form a large component of normal tick microbiota. The outcome of this study is consistent with the predictions of the community ecological theory that diversity-rich bacteriomes are more resistant to bacterial invasion (and consequent pathogen dissemination) than diversity-deprived ones.

  4. Detection of polyoma virus in brain tissue of patients with progressive multifocal leukoencephalopathy by real-time PCR and pyrosequencing.

    Science.gov (United States)

    Beck, Rose C; Kohn, Debra J; Tuohy, Marion J; Prayson, Richard A; Yen-Lieberman, Belinda; Procop, Gary W

    2004-03-01

    We evaluated 2 methods, a LightCycler PCR assay and pyrosequencing for the detection of the JC polyoma virus (JCV) in fixed brain tissue of 10 patients with and 3 control patients without progressive multifocal leukoencephalopathy (PML). Nucleic acid extraction was performed after deparaffinization and proteinase K digestion. The LightCycler assay differentiates the BK virus (BKV), JCV, and SV40 using melt curve analysis. Conventional PCR was used with the same primers to generate products for pyrosequencing. Two sequencing primers were used that differentiate the polyoma viruses. Seven of 11 biopsies (1 patient had 2 biopsies) with PML were positive for JCV by real-time PCR and/or PCR/pyrosequencing. Three of 4 remaining biopsies were positive by real-time PCR but had melting points between JCV and SV40. The 4 specimens that were negative or atypical by LightCycler PCR were positive by traditional PCR, but 1 had an amplicon of lower molecular weight by gel electrophoresis. These were shown to represent JCV by at least 1 of the 2 pyrosequencing primers. The biopsies from patients without PML were PCR negative. Both the LightCycler and pyrosequencing assays are useful for confirming JCV in brain biopsies from patients with PML, but variant JCVs may require supplementary methods to confirm JCV infection.

  5. Evaluating de Bruijn graph assemblers on 454 transcriptomic data.

    Directory of Open Access Journals (Sweden)

    Xianwen Ren

    Full Text Available Next generation sequencing (NGS technologies have greatly changed the landscape of transcriptomic studies of non-model organisms. Since there is no reference genome available, de novo assembly methods play key roles in the analysis of these data sets. Because of the huge amount of data generated by NGS technologies for each run, many assemblers, e.g., ABySS, Velvet and Trinity, are developed based on a de Bruijn graph due to its time- and space-efficiency. However, most of these assemblers were developed initially for the Illumina/Solexa platform. The performance of these assemblers on 454 transcriptomic data is unknown. In this study, we evaluated and compared the relative performance of these de Bruijn graph based assemblers on both simulated and real 454 transcriptomic data. The results suggest that Trinity, the Illumina/Solexa-specialized transcriptomic assembler, performs the best among the multiple de Bruijn graph assemblers, comparable to or even outperforming the standard 454 assembler Newbler which is based on the overlap-layout-consensus algorithm. Our evaluation is expected to provide helpful guidance for researchers to choose assemblers when analyzing 454 transcriptomic data.

  6. Rapid detection and subtyping of human influenza A viruses and reassortants by pyrosequencing.

    Directory of Open Access Journals (Sweden)

    Yi-Mo Deng

    Full Text Available BACKGROUND: Given the continuing co-circulation of the 2009 H1N1 pandemic influenza A viruses with seasonal H3N2 viruses, rapid and reliable detection of newly emerging influenza reassortant viruses is important to enhance our influenza surveillance. METHODOLOGY/PRINCIPAL FINDINGS: A novel pyrosequencing assay was developed for the rapid identification and subtyping of potential human influenza A virus reassortants based on all eight gene segments of the virus. Except for HA and NA genes, one universal set of primers was used to amplify and subtype each of the six internal genes. With this method, all eight gene segments of 57 laboratory isolates and 17 original specimens of seasonal H1N1, H3N2 and 2009 H1N1 pandemic viruses were correctly matched with their corresponding subtypes. In addition, this method was shown to be capable of detecting reassortant viruses by correctly identifying the source of all 8 gene segments from three vaccine production reassortant viruses and three H1N2 viruses. CONCLUSIONS/SIGNIFICANCE: In summary, this pyrosequencing assay is a sensitive and specific procedure for screening large numbers of viruses for reassortment events amongst the commonly circulating human influenza A viruses, which is more rapid and cheaper than using conventional sequencing approaches.

  7. Rapid detection and subtyping of human influenza A viruses and reassortants by pyrosequencing.

    Science.gov (United States)

    Deng, Yi-Mo; Caldwell, Natalie; Barr, Ian G

    2011-01-01

    Given the continuing co-circulation of the 2009 H1N1 pandemic influenza A viruses with seasonal H3N2 viruses, rapid and reliable detection of newly emerging influenza reassortant viruses is important to enhance our influenza surveillance. A novel pyrosequencing assay was developed for the rapid identification and subtyping of potential human influenza A virus reassortants based on all eight gene segments of the virus. Except for HA and NA genes, one universal set of primers was used to amplify and subtype each of the six internal genes. With this method, all eight gene segments of 57 laboratory isolates and 17 original specimens of seasonal H1N1, H3N2 and 2009 H1N1 pandemic viruses were correctly matched with their corresponding subtypes. In addition, this method was shown to be capable of detecting reassortant viruses by correctly identifying the source of all 8 gene segments from three vaccine production reassortant viruses and three H1N2 viruses. In summary, this pyrosequencing assay is a sensitive and specific procedure for screening large numbers of viruses for reassortment events amongst the commonly circulating human influenza A viruses, which is more rapid and cheaper than using conventional sequencing approaches.

  8. PIVOT: platform for interactive analysis and visualization of transcriptomics data.

    Science.gov (United States)

    Zhu, Qin; Fisher, Stephen A; Dueck, Hannah; Middleton, Sarah; Khaladkar, Mugdha; Kim, Junhyong

    2018-01-05

    Many R packages have been developed for transcriptome analysis but their use often requires familiarity with R and integrating results of different packages requires scripts to wrangle the datatypes. Furthermore, exploratory data analyses often generate multiple derived datasets such as data subsets or data transformations, which can be difficult to track. Here we present PIVOT, an R-based platform that wraps open source transcriptome analysis packages with a uniform user interface and graphical data management that allows non-programmers to interactively explore transcriptomics data. PIVOT supports more than 40 popular open source packages for transcriptome analysis and provides an extensive set of tools for statistical data manipulations. A graph-based visual interface is used to represent the links between derived datasets, allowing easy tracking of data versions. PIVOT further supports automatic report generation, publication-quality plots, and program/data state saving, such that all analysis can be saved, shared and reproduced. PIVOT will allow researchers with broad background to easily access sophisticated transcriptome analysis tools and interactively explore transcriptome datasets.

  9. Gene set-based module discovery in the breast cancer transcriptome

    Directory of Open Access Journals (Sweden)

    Zhang Michael Q

    2009-02-01

    Full Text Available Abstract Background Although microarray-based studies have revealed global view of gene expression in cancer cells, we still have little knowledge about regulatory mechanisms underlying the transcriptome. Several computational methods applied to yeast data have recently succeeded in identifying expression modules, which is defined as co-expressed gene sets under common regulatory mechanisms. However, such module discovery methods are not applied cancer transcriptome data. Results In order to decode oncogenic regulatory programs in cancer cells, we developed a novel module discovery method termed EEM by extending a previously reported module discovery method, and applied it to breast cancer expression data. Starting from seed gene sets prepared based on cis-regulatory elements, ChIP-chip data, and gene locus information, EEM identified 10 principal expression modules in breast cancer based on their expression coherence. Moreover, EEM depicted their activity profiles, which predict regulatory programs in each subtypes of breast tumors. For example, our analysis revealed that the expression module regulated by the Polycomb repressive complex 2 (PRC2 is downregulated in triple negative breast cancers, suggesting similarity of transcriptional programs between stem cells and aggressive breast cancer cells. We also found that the activity of the PRC2 expression module is negatively correlated to the expression of EZH2, a component of PRC2 which belongs to the E2F expression module. E2F-driven EZH2 overexpression may be responsible for the repression of the PRC2 expression modules in triple negative tumors. Furthermore, our network analysis predicts regulatory circuits in breast cancer cells. Conclusion These results demonstrate that the gene set-based module discovery approach is a powerful tool to decode regulatory programs in cancer cells.

  10. Combining flow cytometry and 16S rRNA gene pyrosequencing: A promising approach for drinking water monitoring and characterization

    KAUST Repository

    Prest, Emmanuelle I E C

    2014-10-01

    The combination of flow cytometry (FCM) and 16S rRNA gene pyrosequencing data was investigated for the purpose of monitoring and characterizing microbial changes in drinking water distribution systems. High frequency sampling (5min intervals for 1h) was performed at the outlet of a treatment plant and at one location in the full-scale distribution network. In total, 52 bulk water samples were analysed with FCM, pyrosequencing and conventional methods (adenosine-triphosphate, ATP; heterotrophic plate count, HPC). FCM and pyrosequencing results individually showed that changes in the microbial community occurred in the water distribution system, which was not detected with conventional monitoring. FCM data showed an increase in the total bacterial cell concentrations (from 345±15×103 to 425±35×103cellsmL-1) and in the percentage of intact bacterial cells (from 39±3.5% to 53±4.4%) during water distribution. This shift was also observed in the FCM fluorescence fingerprints, which are characteristic of each water sample. A similar shift was detected in the microbial community composition as characterized with pyrosequencing, showing that FCM and genetic fingerprints are congruent. FCM and pyrosequencing data were subsequently combined for the calculation of cell concentration changes for each bacterial phylum. The results revealed an increase in cell concentrations of specific bacterial phyla (e.g., Proteobacteria), along with a decrease in other phyla (e.g., Actinobacteria), which could not be concluded from the two methods individually. The combination of FCM and pyrosequencing methods is a promising approach for future drinking water quality monitoring and for advanced studies on drinking water distribution pipeline ecology. © 2014 Elsevier Ltd.

  11. Pyrosequencing Analysis of Cryogenically Ground Samples from Primary and Secondary/Persistent Endodontic Infections.

    Science.gov (United States)

    Keskin, Cangül; Demiryürek, Ebru Özsezer; Onuk, Ertan Emek

    2017-08-01

    This study aimed to characterize the microbial communities of primary and secondary/persistent endodontic infections using high-throughput pyrosequencing from the pulverized samples. The roots of 20 extracted human teeth with primary endodontic infection and 20 teeth with secondary/persistent endodontic infection were collected. The outer surfaces of the roots were disinfected, and whole roots were cryopulverized. 16S amplicon pyrosequencing data from the DNA extracted from the pulverized root powders were obtained, and microorganism abundance and diversity were calculated. Data were analyzed using statistical and bioinformatic methods. Pyrosequencing analysis resulted a total of 2,606,128 sequences from 40 samples. A total of 15 phyla, 160 genera, and 368 species were detected. No significant difference between primary and secondary/persistent endodontic infections was found regarding the diversity and richness of operational taxonomic units at the phyla, genera, and species levels (P > .005). The present study revealed that the microbial diversity of secondary/persistent endodontic infections did not differ than those of primary endodontic infections. A new archaeal species, Candidatus Nitrosoarchaeum limnia, was detected in root canals of 1 patient with primary endodontic infection for the first time. Copyright © 2017 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  12. Multiplex pyrosequencing assay using AdvISER-MH-PYRO algorithm: a case for rapid and cost-effective genotyping analysis of prostate cancer risk-associated SNPs.

    Science.gov (United States)

    Ambroise, Jérôme; Butoescu, Valentina; Robert, Annie; Tombal, Bertrand; Gala, Jean-Luc

    2015-06-25

    Single Nucleotide Polymorphisms (SNPs) identified in Genome Wide Association Studies (GWAS) have generally moderate association with related complex diseases. Accordingly, Multilocus Genetic Risk Scores (MGRSs) have been computed in previous studies in order to assess the cumulative association of multiple SNPs. When several SNPs have to be genotyped for each patient, using successive uniplex pyrosequencing reactions increases analytical reagent expenses and Turnaround Time (TAT). While a set of several pyrosequencing primers could theoretically be used to analyze multiplex amplicons, this would generate overlapping primer-specific pyro-signals that are visually uninterpretable. In the current study, two multiplex assays were developed consisting of a quadruplex (n=4) and a quintuplex (n=5) polymerase chain reaction (PCR) each followed by multiplex pyrosequencing analysis. The aim was to reliably but rapidly genotype a set of prostate cancer-related SNPs (n=9). The nucleotide dispensation order was selected using SENATOR software. Multiplex pyro-signals were analyzed using the new AdvISER-MH-PYRO software based on a sparse representation of the signal. Using uniplex assays as gold standard, the concordance between multiplex and uniplex assays was assessed on DNA extracted from patient blood samples (n = 10). All genotypes (n=90) generated with the quadruplex and the quintuplex pyroquencing assays were perfectly (100 %) concordant with uniplex pyrosequencing. Using multiplex genotyping approach for analyzing a set of 90 patients allowed reducing TAT by approximately 75 % (i.e., from 2025 to 470 min) while reducing reagent consumption and cost by approximately 70 % (i.e., from ~229 US$ /patient to ~64 US$ /patient). This combination of quadruplex and quintuplex pyrosequencing and PCR assays enabled to reduce the amount of DNA required for multi-SNP analysis, and to lower the global TAT and costs of SNP genotyping while providing results as reliable as uniplex

  13. TRAM (Transcriptome Mapper: database-driven creation and analysis of transcriptome maps from multiple sources

    Directory of Open Access Journals (Sweden)

    Danieli Gian

    2011-02-01

    Full Text Available Abstract Background Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria as well as to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format and they typically accept only gene lists as input. Results TRAM (Transcriptome Mapper is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays, implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile, useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples and identifies if segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps, showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene

  14. Transcriptome analysis reveals the time of the fourth round of genome duplication in common carp (Cyprinus carpio)

    Science.gov (United States)

    2012-01-01

    Background Common carp (Cyprinus carpio) is thought to have undergone one extra round of genome duplication compared to zebrafish. Transcriptome analysis has been used to study the existence and timing of genome duplication in species for which genome sequences are incomplete. Large-scale transcriptome data for the common carp genome should help reveal the timing of the additional duplication event. Results We have sequenced the transcriptome of common carp using 454 pyrosequencing. After assembling the 454 contigs and the published common carp sequences together, we obtained 49,669 contigs and identified genes using homology searches and an ab initio method. We identified 4,651 orthologous pairs between common carp and zebrafish and found 129,984 paralogous pairs within the common carp. An estimation of the synonymous substitution rate in the orthologous pairs indicated that common carp and zebrafish diverged 120 million years ago (MYA). We identified one round of genome duplication in common carp and estimated that it had occurred 5.6 to 11.3 MYA. In zebrafish, no genome duplication event after speciation was observed, suggesting that, compared to zebrafish, common carp had undergone an additional genome duplication event. We annotated the common carp contigs with Gene Ontology terms and KEGG pathways. Compared with zebrafish gene annotations, we found that a set of biological processes and pathways were enriched in common carp. Conclusions The assembled contigs helped us to estimate the time of the fourth-round of genome duplication in common carp. The resource that we have built as part of this study will help advance functional genomics and genome annotation studies in the future. PMID:22424280

  15. Rapid molecular identification of human taeniid cestodes by pyrosequencing approach.

    Directory of Open Access Journals (Sweden)

    Tongjit Thanchomnang

    Full Text Available Taenia saginata, T. solium, and T. asiatica are causative agents of taeniasis in humans. The difficulty of morphological identification of human taeniids can lead to misdiagnosis or confusion. To overcome this problem, several molecular methods have been developed, but use of these tends to be time-consuming. Here, a rapid and high-throughput pyrosequencing approach was developed for the identification of three human taeniids originating from various countries. Primers targeting the mitochondrial cytochrome c oxidase subunit 1 (cox1 gene of the three Taenia species were designed. Variations in a 26-nucleotide target region were used for identification. The reproducibility and accuracy of the pyrosequencing technology was confirmed by Sanger sequencing. This technique will be a valuable tool to distinguish between sympatric human taeniids that occur in Thailand, Asia and Pacific countries. This method could potentially be used for the molecular identification of the taeniid species that might be associated with suspicious cysts and lesions, or cyst residues in humans or livestock at the slaughterhouse.

  16. Comparative transcriptomic analysis of roots of contrasting Gossypium herbaceum genotypes revealing adaptation to drought

    Directory of Open Access Journals (Sweden)

    Ranjan Alok

    2012-11-01

    Full Text Available Abstract Background Root length and its architecture govern the adaptability of plants to various stress conditions, including drought stress. Genetic variations in root growth, length, and architecture are genotypes dependent. In this study, we compared the drought-induced transcriptome of four genotypes of Gossypium herbaceum that differed in their drought tolerance adaptability. Three different methodologies, namely, microarray, pyrosequencing, and qRT–PCR, were used for transcriptome analysis and validation. Results The variations in root length and growth were found among four genotypes of G.herbaceum when exposed to mannitol-induced osmotic stress. Under osmotic stress, the drought tolerant genotypes Vagad and GujCot-21 showed a longer root length than did by drought sensitive RAHS-14 and RAHS-IPS-187. Further, the gene expression patterns in the root tissue of all genotypes were analyzed. We obtained a total of 794 differentially expressed genes by microarray and 104928 high-quality reads representing 53195 unigenes from the root transcriptome. The Vagad and GujCot-21 respond to water stress by inducing various genes and pathways such as response to stresses, response to water deprivation, and flavonoid pathways. Some key regulatory genes involved in abiotic stress such as AP2 EREBP, MYB, WRKY, ERF, ERD9, and LEA were highly expressed in Vagad and GujCot-21. The genes RHD3, NAP1, LBD, and transcription factor WRKY75, known for root development under various stress conditions, were expressed specifically in Vagad and GujCot-21. The genes related to peroxidases, transporters, cell wall-modifying enzymes, and compatible solutes (amino acids, amino sugars, betaine, sugars, or sugar alcohols were also highly expressed in Vagad and Gujcot-21. Conclusion Our analysis highlights changes in the expression pattern of genes and depicts a small but highly specific set of drought responsive genes induced in response to drought stress. Some of these

  17. De novo Transcriptome Assembly of Common Wild Rice (Oryza rufipogon Griff.) and Discovery of Drought-Response Genes in Root Tissue Based on Transcriptomic Data.

    Science.gov (United States)

    Tian, Xin-Jie; Long, Yan; Wang, Jiao; Zhang, Jing-Wen; Wang, Yan-Yan; Li, Wei-Min; Peng, Yu-Fa; Yuan, Qian-Hua; Pei, Xin-Wu

    2015-01-01

    The perennial O. rufipogon (common wild rice), which is considered to be the ancestor of Asian cultivated rice species, contains many useful genetic resources, including drought resistance genes. However, few studies have identified the drought resistance and tissue-specific genes in common wild rice. In this study, transcriptome sequencing libraries were constructed, including drought-treated roots (DR) and control leaves (CL) and roots (CR). Using Illumina sequencing technology, we generated 16.75 million bases of high-quality sequence data for common wild rice and conducted de novo assembly and annotation of genes without prior genome information. These reads were assembled into 119,332 unigenes with an average length of 715 bp. A total of 88,813 distinct sequences (74.42% of unigenes) significantly matched known genes in the NCBI NT database. Differentially expressed gene (DEG) analysis showed that 3617 genes were up-regulated and 4171 genes were down-regulated in the CR library compared with the CL library. Among the DEGs, 535 genes were expressed in roots but not in shoots. A similar comparison between the DR and CR libraries showed that 1393 genes were up-regulated and 315 genes were down-regulated in the DR library compared with the CR library. Finally, 37 genes that were specifically expressed in roots were screened after comparing the DEGs identified in the above-described analyses. This study provides a transcriptome sequence resource for common wild rice plants and establishes a digital gene expression profile of wild rice plants under drought conditions using the assembled transcriptome data as a reference. Several tissue-specific and drought-stress-related candidate genes were identified, representing a fully characterized transcriptome and providing a valuable resource for genetic and genomic studies in plants.

  18. Comparison Study of MS-HRM and Pyrosequencing Techniques for Quantification of APC and CDKN2A Gene Methylation

    Science.gov (United States)

    Migheli, Francesca; Stoccoro, Andrea; Coppedè, Fabio; Wan Omar, Wan Adnan; Failli, Alessandra; Consolini, Rita; Seccia, Massimo; Spisni, Roberto; Miccoli, Paolo; Mathers, John C.; Migliore, Lucia

    2013-01-01

    There is increasing interest in the development of cost-effective techniques for the quantification of DNA methylation biomarkers. We analyzed 90 samples of surgically resected colorectal cancer tissues for APC and CDKN2A promoter methylation using methylation sensitive-high resolution melting (MS-HRM) and pyrosequencing. MS-HRM is a less expensive technique compared with pyrosequencing but is usually more limited because it gives a range of methylation estimates rather than a single value. Here, we developed a method for deriving single estimates, rather than a range, of methylation using MS-HRM and compared the values obtained in this way with those obtained using the gold standard quantitative method of pyrosequencing. We derived an interpolation curve using standards of known methylated/unmethylated ratio (0%, 12.5%, 25%, 50%, 75%, and 100% of methylation) to obtain the best estimate of the extent of methylation for each of our samples. We observed similar profiles of methylation and a high correlation coefficient between the two techniques. Overall, our new approach allows MS-HRM to be used as a quantitative assay which provides results which are comparable with those obtained by pyrosequencing. PMID:23326336

  19. Combining flow cytometry and 16S rRNA gene pyrosequencing: a promising approach for drinking water monitoring and characterization.

    Science.gov (United States)

    Prest, E I; El-Chakhtoura, J; Hammes, F; Saikaly, P E; van Loosdrecht, M C M; Vrouwenvelder, J S

    2014-10-15

    The combination of flow cytometry (FCM) and 16S rRNA gene pyrosequencing data was investigated for the purpose of monitoring and characterizing microbial changes in drinking water distribution systems. High frequency sampling (5 min intervals for 1 h) was performed at the outlet of a treatment plant and at one location in the full-scale distribution network. In total, 52 bulk water samples were analysed with FCM, pyrosequencing and conventional methods (adenosine-triphosphate, ATP; heterotrophic plate count, HPC). FCM and pyrosequencing results individually showed that changes in the microbial community occurred in the water distribution system, which was not detected with conventional monitoring. FCM data showed an increase in the total bacterial cell concentrations (from 345 ± 15 × 10(3) to 425 ± 35 × 10(3) cells mL(-1)) and in the percentage of intact bacterial cells (from 39 ± 3.5% to 53 ± 4.4%) during water distribution. This shift was also observed in the FCM fluorescence fingerprints, which are characteristic of each water sample. A similar shift was detected in the microbial community composition as characterized with pyrosequencing, showing that FCM and genetic fingerprints are congruent. FCM and pyrosequencing data were subsequently combined for the calculation of cell concentration changes for each bacterial phylum. The results revealed an increase in cell concentrations of specific bacterial phyla (e.g., Proteobacteria), along with a decrease in other phyla (e.g., Actinobacteria), which could not be concluded from the two methods individually. The combination of FCM and pyrosequencing methods is a promising approach for future drinking water quality monitoring and for advanced studies on drinking water distribution pipeline ecology. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. Using expected sequence features to improve basecalling accuracy of amplicon pyrosequencing data

    DEFF Research Database (Denmark)

    Rask, Thomas Salhøj; Petersen, Bent; Chen, Donald S.

    2016-01-01

    . The new basecalling method described here, named Multipass, implements a probabilistic framework for working with the raw flowgrams obtained by pyrosequencing. For each sequence variant Multipass calculates the likelihood and nucleotide sequence of several most likely sequences given the flowgram data....... This probabilistic approach enables integration of basecalling into a larger model where other parameters can be incorporated, such as the likelihood for observing a full-length open reading frame at the targeted region. We apply the method to 454 amplicon pyrosequencing data obtained from a malaria virulence gene...... family, where Multipass generates 20 % more error-free sequences than current state of the art methods, and provides sequence characteristics that allow generation of a set of high confidence error-free sequences. This novel method can be used to increase accuracy of existing and future amplicon...

  1. A sweetpotato gene index established by de novo assembly of pyrosequencing and Sanger sequences and mining for gene-based microsatellite markers

    Directory of Open Access Journals (Sweden)

    Solis Julio

    2010-10-01

    Full Text Available Abstract Background Sweetpotato (Ipomoea batatas (L. Lam., a hexaploid outcrossing crop, is an important staple and food security crop in developing countries in Africa and Asia. The availability of genomic resources for sweetpotato is in striking contrast to its importance for human nutrition. Previously existing sequence data were restricted to around 22,000 expressed sequence tag (EST sequences and ~ 1,500 GenBank sequences. We have used 454 pyrosequencing to augment the available gene sequence information to enhance functional genomics and marker design for this plant species. Results Two quarter 454 pyrosequencing runs used two normalized cDNA collections from stems and leaves from drought-stressed sweetpotato clone Tanzania and yielded 524,209 reads, which were assembled together with 22,094 publically available expressed sequence tags into 31,685 sets of overlapping DNA segments and 34,733 unassembled sequences. Blastx comparisons with the UniRef100 database allowed annotation of 23,957 contigs and 15,342 singletons resulting in 24,657 putatively unique genes. Further, 27,119 sequences had no match to protein sequences of UniRef100database. On the basis of this gene index, we have identified 1,661 gene-based microsatellite sequences, of which 223 were selected for testing and 195 were successfully amplified in a test panel of 6 hexaploid (I. batatas and 2 diploid (I. trifida accessions. Conclusions The sweetpotato gene index is a useful source for functionally annotated sweetpotato gene sequences that contains three times more gene sequence information for sweetpotato than previous EST assemblies. A searchable version of the gene index, including a blastn function, is available at http://www.cipotato.org/sweetpotato_gene_index.

  2. A sensitive issue: Pyrosequencing as a valuable forensic SNP typing platform

    DEFF Research Database (Denmark)

    Harrison, C.; Musgrave-Brown, E.; Bender, K.

    2006-01-01

    Analysing minute amounts of DNA is a routine challenge in forensics in part due to the poor sensitivity of an instrument and its inability to detect results from forensic samples. In this study, the sensitivity of the Pyrosequencing method is investigated using varying concentrations of DNA and f...

  3. Characterization of bacterial populations in Danish raw milk cheeses made with different starter cultures by denaturating gradient gel electrophoresis and pyrosequencing

    DEFF Research Database (Denmark)

    Masoud, Wafa Mahmoud Hasan; Takamiya, Monica K Wik; Vogensen, Finn Kvist

    2011-01-01

    ripening. Other bacteria like Corynebacterium, Halomonas, Pediococcus, Micrococcus and Staphylococcus, which were encountered in some cheese samples at low percentages compared with the total bacterial populations, were only detected by pyrosequencing. 16S rRNA gene pyrosequencing is an efficient method...

  4. Intra-tumor heterogeneity in breast cancer has limited impact on transcriptomic-based molecular profiling.

    Science.gov (United States)

    Karthik, Govindasamy-Muralidharan; Rantalainen, Mattias; Stålhammar, Gustav; Lövrot, John; Ullah, Ikram; Alkodsi, Amjad; Ma, Ran; Wedlund, Lena; Lindberg, Johan; Frisell, Jan; Bergh, Jonas; Hartman, Johan

    2017-11-29

    Transcriptomic profiling of breast tumors provides opportunity for subtyping and molecular-based patient stratification. In diagnostic applications the specimen profiled should be representative of the expression profile of the whole tumor and ideally capture properties of the most aggressive part of the tumor. However, breast cancers commonly exhibit intra-tumor heterogeneity at molecular, genomic and in phenotypic level, which can arise during tumor evolution. Currently it is not established to what extent a random sampling approach may influence molecular breast cancer diagnostics. In this study we applied RNA-sequencing to quantify gene expression in 43 pieces (2-5 pieces per tumor) from 12 breast tumors (Cohort 1). We determined molecular subtype and transcriptomic grade for all tumor pieces and analysed to what extent pieces originating from the same tumors are concordant or discordant with each other. Additionally, we validated our finding in an independent cohort consisting of 19 pieces (2-6 pieces per tumor) from 6 breast tumors (Cohort 2) profiled using microarray technique. Exome sequencing was also performed on this cohort, to investigate the extent of intra-tumor genomic heterogeneity versus the intra-tumor molecular subtype classifications. Molecular subtyping was consistent in 11 out of 12 tumors and transcriptomic grade assignments were consistent in 11 out of 12 tumors as well. Molecular subtype predictions revealed consistent subtypes in four out of six patients in this cohort 2. Interestingly, we observed extensive intra-tumor genomic heterogeneity in these tumor pieces but not in their molecular subtype classifications. Our results suggest that macroscopic intra-tumoral transcriptomic heterogeneity is limited and unlikely to have an impact on molecular diagnostics for most patients.

  5. Global meta-analysis of transcriptomics studies.

    Directory of Open Access Journals (Sweden)

    José Caldas

    Full Text Available Transcriptomics meta-analysis aims at re-using existing data to derive novel biological hypotheses, and is motivated by the public availability of a large number of independent studies. Current methods are based on breaking down studies into multiple comparisons between phenotypes (e.g. disease vs. healthy, based on the studies' experimental designs, followed by computing the overlap between the resulting differential expression signatures. While useful, in this methodology each study yields multiple independent phenotype comparisons, and connections are established not between studies, but rather between subsets of the studies corresponding to phenotype comparisons. We propose a rank-based statistical meta-analysis framework that establishes global connections between transcriptomics studies without breaking down studies into sets of phenotype comparisons. By using a rank product method, our framework extracts global features from each study, corresponding to genes that are consistently among the most expressed or differentially expressed genes in that study. Those features are then statistically modelled via a term-frequency inverse-document frequency (TF-IDF model, which is then used for connecting studies. Our framework is fast and parameter-free; when applied to large collections of Homo sapiens and Streptococcus pneumoniae transcriptomics studies, it performs better than similarity-based approaches in retrieving related studies, using a Medical Subject Headings gold standard. Finally, we highlight via case studies how the framework can be used to derive novel biological hypotheses regarding related studies and the genes that drive those connections. Our proposed statistical framework shows that it is possible to perform a meta-analysis of transcriptomics studies with arbitrary experimental designs by deriving global expression features rather than decomposing studies into multiple phenotype comparisons.

  6. The Human Pancreas Proteome Defined by Transcriptomics and Antibody-Based Profiling

    Science.gov (United States)

    Fagerberg, Linn; Hallström, Björn M.; Schwenk, Jochen M.; Uhlén, Mathias; Korsgren, Olle; Lindskog, Cecilia

    2014-01-01

    The pancreas is composed of both exocrine glands and intermingled endocrine cells to execute its diverse functions, including enzyme production for digestion of nutrients and hormone secretion for regulation of blood glucose levels. To define the molecular constituents with elevated expression in the human pancreas, we employed a genome-wide RNA sequencing analysis of the human transcriptome to identify genes with elevated expression in the human pancreas. This quantitative transcriptomics data was combined with immunohistochemistry-based protein profiling to allow mapping of the corresponding proteins to different compartments and specific cell types within the pancreas down to the single cell level. Analysis of whole pancreas identified 146 genes with elevated expression levels, of which 47 revealed a particular higher expression as compared to the other analyzed tissue types, thus termed pancreas enriched. Extended analysis of in vitro isolated endocrine islets identified an additional set of 42 genes with elevated expression in these specialized cells. Although only 0.7% of all genes showed an elevated expression level in the pancreas, this fraction of transcripts, in most cases encoding secreted proteins, constituted 68% of the total mRNA in pancreas. This demonstrates the extreme specialization of the pancreas for production of secreted proteins. Among the elevated expression profiles, several previously not described proteins were identified, both in endocrine cells (CFC1, FAM159B, RBPJL and RGS9) and exocrine glandular cells (AQP12A, DPEP1, GATM and ERP27). In summary, we provide a global analysis of the pancreas transcriptome and proteome with a comprehensive list of genes and proteins with elevated expression in pancreas. This list represents an important starting point for further studies of the molecular repertoire of pancreatic cells and their relation to disease states or treatment effects. PMID:25546435

  7. Transcriptomic effects-based monitoring for endocrine active chemicals: Assessing relative contribution of treated wastewater to downstream pollution

    Science.gov (United States)

    Martinovic-Weigelt, Dalma; Mehinto, Alvine C.; Ankley, Gerald T.; Denslow, Nancy D.; Barber, Larry B.; Lee, Kathy E.; King, Ryan J.; Schoenfuss, Heiko L.; Schroeder, Anthony L.; Villeneuve, Daniel L.

    2014-01-01

    The present study investigated whether a combination of targeted analytical chemistry information with unsupervised, data-rich biological methodology (i.e., transcriptomics) could be utilized to evaluate relative contributions of wastewater treatment plant (WWTP) effluents to biological effects. The effects of WWTP effluents on fish exposed to ambient, receiving waters were studied at three locations with distinct WWTP and watershed characteristics. At each location, 4 d exposures of male fathead minnows to the WWTP effluent and upstream and downstream ambient waters were conducted. Transcriptomic analyses were performed on livers using 15 000 feature microarrays, followed by a canonical pathway and gene set enrichment analyses. Enrichment of gene sets indicative of teleost brain–pituitary–gonadal–hepatic (BPGH) axis function indicated that WWTPs serve as an important source of endocrine active chemicals (EACs) that affect the BPGH axis (e.g., cholesterol and steroid metabolism were altered). The results indicated that transcriptomics may even pinpoint pertinent adverse outcomes (i.e., liver vacuolization) and groups of chemicals that preselected chemical analytes may miss. Transcriptomic Effects-Based monitoring was capable of distinguishing sites, and it reflected chemical pollution gradients, thus holding promise for assessment of relative contributions of point sources to pollution and the efficacy of pollution remediation.

  8. Illumina–based de novo transcriptome sequencing and analysis of ...

    Indian Academy of Sciences (India)

    Administrator

    2017-10-25

    Oct 25, 2017 ... (Shanghai, China) following manufacturer's protocols (Illumina, San .... suggests that pathways involved in musk production are expressed at a ..... Strickler S. R., Aureliano B. and Mueller L. A. 2012 Designing a transcriptome.

  9. A molecular gram stain using broad range PCR and pyrosequencing technology: a potentially useful tool for diagnosing orthopaedic infections.

    Science.gov (United States)

    Kobayashi, Naomi; Bauer, Thomas W; Togawa, Daisuke; Lieberman, Isador H; Sakai, Hiroshige; Fujishiro, Takaaki; Tuohy, Marion J; Procop, Gary W

    2005-06-01

    The bacteria associated with orthopaedic infections are usually common gram-positive and gram-negative bacteria. This fundamental grouping of bacteria is a necessary first step in the selection of appropriate antibiotics. Since polymerase chain reaction (PCR) is more rapid and may be more sensitive than culture, we developed a postamplification pyrosequencing method to subcategorize bacteria based on a few nucleotide polymorphisms in the 16S rRNA gene. We validated this method using well-characterized strains of bacteria and applied it to specimens from spinal surgery cases with suspected infections. Lysates of 114 bacteria including 75 species were created following standard cultivation to obtain DNA. The DNA was amplified by a broad-range real-time PCR. The amplicons were evaluated by pyrosequencing and were classified as gram-positive, gram-negative, or acid-fast bacilli based on the first three to five nucleotides sequenced. In addition, clinical cases of suspected infection were obtained from spinal surgery. The results of the "molecular Gram stain" were compared with the results of traditional Gram stain and culture. The lysates of 107 (93.9%) of the bacteria extracts tested were appropriately categorized as gram-positive and gram-negative or as acid-fast bacilli on the basis of this assay. The sensitivity and specificity of this assay were 100% and 97.4% for gram-positive and 88.3% and 100% for gram-negative isolates. All of the five clinical samples were appropriately categorized as containing gram-positive or gram-negative bacteria with this assay. This study demonstrates that high sensitivity and specificity of a molecular gram stain may be achieved using broad-range real-time PCR and pyrosequencing.

  10. Rumen bacterial community evaluated by 454 pyrosequencing and terminal restriction fragment length polymorphism analyses in dairy sheep fed marine algae.

    Science.gov (United States)

    Castro-Carrera, T; Toral, P G; Frutos, P; McEwan, N R; Hervás, G; Abecia, L; Pinloche, E; Girdwood, S E; Belenguer, A

    2014-03-01

    Developing novel strategies to increase the content of bioactive unsaturated fatty acids (FA) in ruminant-derived products requires a deeper understanding of rumen biohydrogenation and bacteria involved in this process. Although high-throughput pyrosequencing may allow for a great coverage of bacterial diversity, it has hardly been used to investigate the microbiology of ruminal FA metabolism. In this experiment, 454 pyrosequencing and a molecular fingerprinting technique (terminal restriction fragment length polymorphism; T-RFLP) were used concurrently to assess the effect of diet supplementation with marine algae (MA) on the rumen bacterial community of dairy sheep. Eleven lactating ewes were divided in 2 lots and offered a total mixed ration based on alfalfa hay and concentrate (40:60), supplemented with 0 (control) or 8 (MA) g of MA/kg of dry matter. After 54 d on treatments, animals were slaughtered and samples of rumen content and fluid were collected separately for microbial analysis. Pyrosequencing yielded a greater coverage of bacterial diversity than T-RFLP and allowed the identification of low abundant populations. Conversely, both molecular approaches pointed to similar conclusions and showed that relevant changes due to MA addition were observed within the major ruminal phyla, namely Bacteroidetes, Firmicutes, and Proteobacteria. Decreases in the abundance of unclassified Bacteroidales, Porphyromonadaceae, and Ruminococcaceae and increases in as-yet uncultured species of the family Succinivibrionaceae, might be related to a potential role of these groups in different pathways of rumen FA metabolism. Diet supplementation with MA, however, had no effect on the relative abundance of Butyrivibrio and Pseudobutyrivibrio genera. In addition, results from both 454 pyrosequencing and T-RFLP indicate that the effect of MA was rather consistent in rumen content or fluid samples, despite inherent differences between these fractions in their bacterial composition

  11. Sequencing and characterization of the guppy (Poecilia reticulata transcriptome

    Directory of Open Access Journals (Sweden)

    Rodd F Helen

    2011-04-01

    Full Text Available Abstract Background Next-generation sequencing is providing researchers with a relatively fast and affordable option for developing genomic resources for organisms that are not among the traditional genetic models. Here we present a de novo assembly of the guppy (Poecilia reticulata transcriptome using 454 sequence reads, and we evaluate potential uses of this transcriptome, including detection of sex-specific transcripts and deployment as a reference for gene expression analysis in guppies and a related species. Guppies have been model organisms in ecology, evolutionary biology, and animal behaviour for over 100 years. An annotated transcriptome and other genomic tools will facilitate understanding the genetic and molecular bases of adaptation and variation in a vertebrate species with a uniquely well known natural history. Results We generated approximately 336 Mbp of mRNA sequence data from male brain, male body, female brain, and female body. The resulting 1,162,670 reads assembled into 54,921 contigs, creating a reference transcriptome for the guppy with an average read depth of 28×. We annotated nearly 40% of this reference transcriptome by searching protein and gene ontology databases. Using this annotated transcriptome database, we identified candidate genes of interest to the guppy research community, putative single nucleotide polymorphisms (SNPs, and male-specific expressed genes. We also showed that our reference transcriptome can be used for RNA-sequencing-based analysis of differential gene expression. We identified transcripts that, in juveniles, are regulated differently in the presence and absence of an important predator, Rivulus hartii, including two genes implicated in stress response. For each sample in the RNA-seq study, >50% of high-quality reads mapped to unique sequences in the reference database with high confidence. In addition, we evaluated the use of the guppy reference transcriptome for gene expression analyses in

  12. Proteome Profiling Outperforms Transcriptome Profiling for Coexpression Based Gene Function Prediction

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Jing; Ma, Zihao; Carr, Steven A.; Mertins, Philipp; Zhang, Hui; Zhang, Zhen; Chan, Daniel W.; Ellis, Matthew J. C.; Townsend, R. Reid; Smith, Richard D.; McDermott, Jason E.; Chen, Xian; Paulovich, Amanda G.; Boja, Emily S.; Mesri, Mehdi; Kinsinger, Christopher R.; Rodriguez, Henry; Rodland, Karin D.; Liebler, Daniel C.; Zhang, Bing

    2016-11-11

    Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies

  13. Bio-crude transcriptomics: Gene discovery and metabolic network reconstruction for the biosynthesis of the terpenome of the hydrocarbon oil-producing green alga, Botryococcus braunii race B (Showa*

    Directory of Open Access Journals (Sweden)

    Molnár István

    2012-10-01

    Full Text Available Abstract Background Microalgae hold promise for yielding a biofuel feedstock that is sustainable, carbon-neutral, distributed, and only minimally disruptive for the production of food and feed by traditional agriculture. Amongst oleaginous eukaryotic algae, the B race of Botryococcus braunii is unique in that it produces large amounts of liquid hydrocarbons of terpenoid origin. These are comparable to fossil crude oil, and are sequestered outside the cells in a communal extracellular polymeric matrix material. Biosynthetic engineering of terpenoid bio-crude production requires identification of genes and reconstruction of metabolic pathways responsible for production of both hydrocarbons and other metabolites of the alga that compete for photosynthetic carbon and energy. Results A de novo assembly of 1,334,609 next-generation pyrosequencing reads form the Showa strain of the B race of B. braunii yielded a transcriptomic database of 46,422 contigs with an average length of 756 bp. Contigs were annotated with pathway, ontology, and protein domain identifiers. Manual curation allowed the reconstruction of pathways that produce terpenoid liquid hydrocarbons from primary metabolites, and pathways that divert photosynthetic carbon into tetraterpenoid carotenoids, diterpenoids, and the prenyl chains of meroterpenoid quinones and chlorophyll. Inventories of machine-assembled contigs are also presented for reconstructed pathways for the biosynthesis of competing storage compounds including triacylglycerol and starch. Regeneration of S-adenosylmethionine, and the extracellular localization of the hydrocarbon oils by active transport and possibly autophagy are also investigated. Conclusions The construction of an annotated transcriptomic database, publicly available in a web-based data depository and annotation tool, provides a foundation for metabolic pathway and network reconstruction, and facilitates further omics studies in the absence of a genome

  14. Clinical Neuropathology practice news 1-2014: Pyrosequencing meets clinical and analytical performance criteria for routine testing of MGMT promoter methylation status in glioblastoma

    Science.gov (United States)

    Preusser, Matthias; Berghoff, Anna S.; Manzl, Claudia; Filipits, Martin; Weinhäusel, Andreas; Pulverer, Walter; Dieckmann, Karin; Widhalm, Georg; Wöhrer, Adelheid; Knosp, Engelbert; Marosi, Christine; Hainfellner, Johannes A.

    2014-01-01

    Testing of the MGMT promoter methylation status in glioblastoma is relevant for clinical decision making and research applications. Two recent and independent phase III therapy trials confirmed a prognostic and predictive value of the MGMT promoter methylation status in elderly glioblastoma patients. Several methods for MGMT promoter methylation testing have been proposed, but seem to be of limited test reliability. Therefore, and also due to feasibility reasons, translation of MGMT methylation testing into routine use has been protracted so far. Pyrosequencing after prior DNA bisulfite modification has emerged as a reliable, accurate, fast and easy-to-use method for MGMT promoter methylation testing in tumor tissues (including formalin-fixed and paraffin-embedded samples). We performed an intra- and inter-laboratory ring trial which demonstrates a high analytical performance of this technique. Thus, pyrosequencing-based assessment of MGMT promoter methylation status in glioblastoma meets the criteria of high analytical test performance and can be recommended for clinical application, provided that strict quality control is performed. Our article summarizes clinical indications, practical instructions and open issues for MGMT promoter methylation testing in glioblastoma using pyrosequencing. PMID:24359605

  15. Transcriptome analysis in Concholepas concholepas (Gastropoda, Muricidae): mining and characterization of new genomic and molecular markers.

    Science.gov (United States)

    Cárdenas, Leyla; Sánchez, Roland; Gomez, Daniela; Fuenzalida, Gonzalo; Gallardo-Escárate, Cristián; Tanguy, Arnaud

    2011-09-01

    The marine gastropod Concholepas concholepas, locally known as the "loco", is the main target species of the benthonic Chilean fisheries. Genetic and genomic tools are necessary to study the genome of this species in order to understand the molecular basis of its development, growth, and other key traits to improve the management strategies and to identify local adaptation to prevent loss of biodiversity. Here, we use pyrosequencing technologies to generate the first transcriptomic database from adult specimens of the loco. After trimming, a total of 140,756 Expressed Sequence Tag sequences were achieved. Clustering and assembly analysis identified 19,219 contigs and 105,435 singleton sequences. BlastN analysis showed a significant identity with Expressed Sequence Tags of different gastropod species available in public databases. Similarly, BlastX results showed that only 895 out of the total 124,654 had significant hits and may represent novel genes for marine gastropods. From this database, simple sequence repeat motifs were also identified and a total of 38 primer pairs were designed and tested to assess their potential as informative markers and to investigate their cross-species amplification in different related gastropod species. This dataset represents the first publicly available 454 data for a marine gastropod endemic to the southeastern Pacific coast, providing a valuable transcriptomic resource for future efforts of gene discovery and development of functional markers in other marine gastropods. Copyright © 2011 Elsevier B.V. All rights reserved.

  16. Illumina-based de novo transcriptome sequencing and analysis

    Indian Academy of Sciences (India)

    In the present study, we used Illumina HiSeq technology to perform de novo assembly of heart and musk gland transcriptomes from the Chinese forest musk deer. A total of 239,383 transcripts and 176,450 unigenes were obtained, of which 37,329 unigenes were matched to known sequences in the NCBI nonredundant ...

  17. Use of Non-Normalized, Non-Amplified cDNA for 454-Based RNA Sequencing of Fleshy Melon Fruit

    Directory of Open Access Journals (Sweden)

    Vitaly Portnoy

    2011-03-01

    Full Text Available The melon ( L. fruit is an important crop and model system for the genomic study of both fleshy fruit development and the Cucurbitaceae family. To obtain an accurate representation of the melon fruit transcriptome based on expressed sequence tag (EST abundance in 454-pyrosequencing data, we prepared double-stranded complementary DNA (cDNA of melon without the usual amplification and normalization steps. A purification step was also included to eliminate small fragments. Complementary DNAs were obtained from 14 individual fruit libraries derived from two genotypes, separated into flesh and peel tissues, and sampled throughout fruit development. Pyrosequencing was performed using Genome Sequencer FLX (GS FLX technology, resulting in 1,215,359 reads, with mean length of >200 nucleotides. The global digital expression data was validated by comparative reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR of 40 selected genes and expression patterns were similar for the two methods. The results indicate that high-quality, nonbiased cDNA for next-generation sequencing can be prepared from mature, fleshy fruit, which are notorious for difficulties in ribonucleic acid (RNA preparation.

  18. Pyrosequencing data reveals tissue-specific expression of lineage-specific transcripts in chickpea

    OpenAIRE

    Garg, Rohini; Jain, Mukesh

    2011-01-01

    Chickpea is a very important crop legume plant, which provides a protein-rich supplement to cereal-based diets and has the ability to fix atmospheric nitrogen. Despite its economic importance, the functional genomic resources for chickpea are very limited. Recently, we reported the complete transcriptome of chickpea using next generation sequencing technologies. We analyzed the tissue-specific expression of chickpea transcripts based on RNA-seq data. In addition, we identified two sets of lin...

  19. Potential human pathogenic bacteria in a mixed urban watershed as revealed by pyrosequencing.

    Directory of Open Access Journals (Sweden)

    A Mark Ibekwe

    Full Text Available Current microbial source tracking (MST methods for water depend on testing for fecal indicator bacterial counts or specific marker gene sequences to identify fecal contamination where potential human pathogenic bacteria could be present. In this study, we applied 454 high-throughput pyrosequencing to identify bacterial pathogen DNA sequences, including those not traditionally monitored by MST and correlated their abundances to specific sources of contamination such as urban runoff and agricultural runoff from concentrated animal feeding operations (CAFOs, recreation park area, waste-water treatment plants, and natural sites with little or no human activities. Samples for pyrosequencing were surface water, and sediment collected from 19 sites. A total of 12,959 16S rRNA gene sequences with average length of ≤400 bp were obtained, and were assigned to corresponding taxonomic ranks using ribosomal database project (RDP, Classifier and Greengenes databases. The percent of total potential pathogens were highest in urban runoff water (7.94%, agricultural runoff sediment (6.52%, and Prado Park sediment (6.00%, respectively. Although the numbers of DNA sequence tags from pyrosequencing were very high for the natural site, corresponding percent potential pathogens were very low (3.78-4.08%. Most of the potential pathogenic bacterial sequences identified were from three major phyla, namely, Proteobacteria, Bacteroidetes, and Firmicutes. The use of deep sequencing may provide improved and faster methods for the identification of pathogen sources in most watersheds so that better risk assessment methods may be developed to enhance public health.

  20. A comprehensive reference transcriptome resource for the common house spider Parasteatoda tepidariorum.

    Directory of Open Access Journals (Sweden)

    Nico Posnien

    Full Text Available Parasteatoda tepidariorum is an increasingly popular model for the study of spider development and the evolution of development more broadly. However, fully understanding the regulation and evolution of P. tepidariorum development in comparison to other animals requires a genomic perspective. Although research on P. tepidariorum has provided major new insights, gene analysis to date has been limited to candidate gene approaches. Furthermore, the few available EST collections are based on embryonic transcripts, which have not been systematically annotated and are unlikely to contain transcripts specific to post-embryonic stages of development. We therefore generated cDNA from pooled embryos representing all described embryonic stages, as well as post-embryonic stages including nymphs, larvae and adults, and using Illumina HiSeq technology obtained a total of 625,076,514 100-bp paired end reads. We combined these data with 24,360 ESTs available in GenBank, and 1,040,006 reads newly generated from 454 pyrosequencing of a mixed-stage embryo cDNA library. The combined sequence data were assembled using a custom de novo assembly strategy designed to optimize assembly product length, number of predicted transcripts, and proportion of raw reads incorporated into the assembly. The de novo assembly generated 446,427 contigs with an N50 of 1,875 bp. These sequences obtained 62,799 unique BLAST hits against the NCBI non-redundant protein data base, including putative orthologs to 8,917 Drosophila melanogaster genes based on best reciprocal BLAST hit identity compared with the D. melanogaster proteome. Finally, we explored the utility of the transcriptome for RNA-Seq studies, and showed that this resource can be used as a mapping scaffold to detect differential gene expression in different cDNA libraries. This resource will therefore provide a platform for future genomic, gene expression and functional approaches using P. tepidariorum.

  1. Transcriptome profiling of Elettaria cardamomum (L. Maton (small cardamom

    Directory of Open Access Journals (Sweden)

    F. Nadiya

    2017-03-01

    Full Text Available Elettaria cardamomum (L. Maton, known as ‘queen of spices, is a perennial herbaceous monocot of the family Zingiberaceae, native to southern India. Cardamom is an economically valuable spice crop and used widely in culinary and medicinal purposes. In the present study, using Ion Proton RNA sequencing technology, we performed transcriptome sequencing and de novo transcriptome assembly of a wild and five cultivar genotypes of cardamom. RNA-seq generated a total of 22,811,983 (92 base and 24,889,197 (75 base raw reads accounting for approximately 8.21GB and 7.65GB of sequence data for wild and cultivar genotypes of cardamom respectively. The raw data were submitted to SRA database of NCBI under the accession numbers SRX1141272 (wild and SRX1141276 (cultivars. The raw reads were quality filtered and assembled using MIRA assembler resulted with 112,208 and 264,161contigs having N50 value 616 and 664 for wild and cultivar cardamom respectively. The assembled unigenes were functionally annotated using several databases including PlantCyc for pathway annotation. This work represents the first report on cardamom transcriptome sequencing. In order to generate a comprehensive reference transcriptome, we further assembled the raw reads of wild and cultivar genotypes which might enrich the plant transcriptome database and trigger advanced research in cardamom genomics.

  2. Use of pyrosequencing and DNA barcodes to monitor variations in Firmicutes and Bacteroidetes communities in the gut microbiota of obese humans

    Directory of Open Access Journals (Sweden)

    Raoult Didier

    2008-12-01

    Full Text Available Abstract Background Recent studies of 16S rRNA genes in the mammalian gut microbiota distinguished a higher Firmicutes/Bacteroidetes ratio in obese individuals compared to lean individuals. This ratio was estimated using a clonal Sanger sequencing approach which is time-consuming and requires laborious data analysis. In contrast, new high-throughput pyrosequencing technology offers an inexpensive alternative to clonal Sanger sequencing and would significantly advance our understanding of obesity via the development of a clinical diagnostic method. Here we present a cost-effective method that combines 16S rRNA pyrosequencing and DNA barcodes of the Firmicutes and Bacteroidetes 16S rRNA genes to determine the Firmicutes/Bacteroidetes ratio in the gut microbiota of obese humans. Results The main result was the identification of DNA barcodes targeting the Firmicutes and Bacteroidetes phyla. These barcodes were validated using previously published 16S rRNA gut microbiota clone libraries. In addition, an accurate F/B ratio was found when the DNA barcodes were applied to short pyrosequencing reads of published gut metagenomes. Finally, the barcodes were utilized to define the F/B ratio of 16S rRNA pyrosequencing data generated from brain abscess pus and cystic fibrosis sputum. Conclusion Using DNA barcodes of Bacteroidetes and Firmicutes 16S rRNA genes combined with pyrosequencing is a cost-effective method for monitoring relevant changes in the relative abundance of Firmicutes and Bacteroidetes bacterial communities in microbial ecosystems.

  3. Comparison of a teratogenic transcriptome-based predictive test based on human embryonic versus inducible pluripotent stem cells.

    Science.gov (United States)

    Shinde, Vaibhav; Perumal Srinivasan, Sureshkumar; Henry, Margit; Rotshteyn, Tamara; Hescheler, Jürgen; Rahnenführer, Jörg; Grinberg, Marianna; Meisig, Johannes; Blüthgen, Nils; Waldmann, Tanja; Leist, Marcel; Hengstler, Jan Georg; Sachinidis, Agapios

    2016-12-30

    Human embryonic stem cells (hESCs) partially recapitulate early embryonic three germ layer development, allowing testing of potential teratogenic hazards. Because use of hESCs is ethically debated, we investigated the potential for human induced pluripotent stem cells (hiPSCs) to replace hESCs in such tests. Three cell lines, comprising hiPSCs (foreskin and IMR90) and hESCs (H9) were differentiated for 14 days. Their transcriptome profiles were obtained on day 0 and day 14 and analyzed by comprehensive bioinformatics tools. The transcriptomes on day 14 showed that more than 70% of the "developmental genes" (regulated genes with > 2-fold change on day 14 compared to day 0) exhibited variability among cell lines. The developmental genes belonging to all three cell lines captured biological processes and KEGG pathways related to all three germ layer embryonic development. In addition, transcriptome profiles were obtained after 14 days of exposure to teratogenic valproic acid (VPA) during differentiation. Although the differentially regulated genes between treated and untreated samples showed more than 90% variability among cell lines, VPA clearly antagonized the expression of developmental genes in all cell lines: suppressing upregulated developmental genes, while inducing downregulated ones. To quantify VPA-disturbed development based on developmental genes, we estimated the "developmental potency" (D p ) and "developmental index" (D i ). Despite differences in genes deregulated by VPA, uniform D i values were obtained for all three cell lines. Given that the D i values for VPA were similar for hESCs and hiPSCs, D i can be used for robust hazard identification, irrespective of whether hESCs or hiPSCs are used in the test systems.

  4. De novo assembly and characterization of tissue specific transcriptomes in the emerald notothen, Trematomus bernacchii.

    Science.gov (United States)

    Huth, Troy J; Place, Sean P

    2013-11-20

    The notothenioids comprise a diverse group of fishes that rapidly radiated after isolation by the Antarctic Circumpolar Current approximately 14-25 million years ago. Given that evolutionary adaptation has led to finely tuned traits with narrow physiological limits in these organisms, this system provides a unique opportunity to examine physiological trade-offs and limits of adaptive responses to environmental perturbation. As such, notothenioids have a rich history with respect to studies attempting to understand the vulnerability of polar ecosystems to the negative impacts associated with global climate change. Unfortunately, despite being a model system for understanding physiological adaptations to extreme environments, we still lack fundamental molecular tools for much of the Nototheniidae family. Specimens of the emerald notothen, Trematomus bernacchii, were acclimated for 28 days in flow-through seawater tanks maintained near ambient seawater temperatures (-1.5°C) or at +4°C. Following acclimation, tissue specific cDNA libraries for liver, gill and brain were created by pooling RNA from n = 5 individuals per temperature treatment. The tissue specific libraries were bar-coded and used for 454 pyrosequencing, which yielded over 700 thousand sequencing reads. A de novo assembly and annotation of these reads produced a functional transcriptome library of T. bernacchii containing 30,107 unigenes, 13,003 of which possessed significant homology to a known protein product. Digital gene expression analysis of these extremely cold adapted fish reinforced the loss of an inducible heat shock response and allowed the preliminary exploration into other elements of the cellular stress response. Preliminary exploration of the transcriptome of T. bernacchii under elevated temperatures enabled a semi-quantitative comparison to prior studies aimed at characterizing the thermal response of this endemic fish whose size, abundance and distribution has established it as a

  5. Transcriptome sequencing of different narrow-leafed lupin tissue types provides a comprehensive uni-gene assembly and extensive gene-based molecular markers

    Science.gov (United States)

    Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B

    2015-01-01

    Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816

  6. Pyrosequencing analysis of the microbial diversity of airag, khoormog and tarag, traditional fermented dairy products of mongolia.

    Science.gov (United States)

    Oki, Kaihei; Dugersuren, Jamyan; Demberel, Shirchin; Watanabe, Koichi

    2014-01-01

    Here, we used pyrosequencing to obtain a detailed analysis of the microbial diversities of traditional fermented dairy products of Mongolia. From 22 Airag (fermented mare's milk), 5 Khoormog (fermented camel's milk) and 26 Tarag (fermented milk of cows, goats and yaks) samples collected in the Mongolian provinces of Arhangai, Bulgan, Dundgobi, Tov, Uburhangai and Umnugobi, we obtained a total of 81 operational taxonomic units, which were assigned to 15 families, 21 genera and 41 species in 3 phyla. The genus Lactobacillus is a core bacterial component of Mongolian fermented milks, and Lactobacillus helveticus, Lactobacillus kefiranofaciens and Lactobacillus delbrueckii were the predominant species of lactic acid bacteria (LAB) in the Airag, Khoormog and Tarag samples, respectively. By using this pyrosequencing approach, we successfully detected most LAB species that have been isolated as well as seven LAB species that have not been found in our previous culture-based study. A subsequent analysis of the principal components of the samples revealed that L. delbrueckii, L. helveticus, L. kefiranofaciens and Streptococcus thermophilus were the main factors influencing the microbial diversity of these Mongolian traditional fermented dairy products and that this diversity correlated with the animal species from which the milk was sourced.

  7. Transcriptome mining of immune-related genes in the muricid snail Concholepas concholepas.

    Science.gov (United States)

    Détrée, Camille; López-Landavery, Edgar; Gallardo-Escárate, Cristian; Lafarga-De la Cruz, Fabiola

    2017-12-01

    The population of the Chilean endemic marine gastropod Concholepas concholepas locally called "loco" has dramatically decreased in the past 50 years as a result of intense activity of local fisheries and high environmental variability observed along the Chilean coast, including episodes of hypoxia, changes in sea surface temperature, ocean acidification and diseases. In this study, we set out to explore the molecular basis of C. concholepas to cope with biotic stressors such as exposure to the pathogenic bacterium Vibrio anguillarum. Here, 454pyrosequencing was conducted and 61 transcripts related to the immune response in this muricid species were identified. Among these, the expression of six genes (CcNFκβ, CcIκβ, CcLITAF, CcTLR, CcCas8 and CcCath) involved in the regulation of inflammatory, apoptotic and immune processes upon stimuli, were evaluated during the first 33 h post challenge (hpc). The results showed that CcTLR, CcCas8 and CcCath have an initial response at 4 hpc, evidencing an up-regulation from 4 to 24 hpc. Notably, the response of CcNFKB occurred 2 h later with a statistically significant up-regulation at 6 hpc and 10 hpc. Furthermore, the challenge with V. anguillarum induced a statistically significant down-regulation of CcIKB between 2 and 10 hpc as well as a down-regulation of CcLITAF between 2 and 4 hpc followed in both cases by an up-regulation between 24 and 33 hpc. This work describes the first transcriptomic effort to characterize the immune response of C. concholepas and constitutes a valuable transcriptomic resource for future efforts to develop sustainable aquaculture and conservations tools for this endemic marine snail species. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. Shotgun pyrosequencing metagenomic analyses of dusts from swine confinement and grain facilities.

    Science.gov (United States)

    Boissy, Robert J; Romberger, Debra J; Roughead, William A; Weissenburger-Moser, Lisa; Poole, Jill A; LeVan, Tricia D

    2014-01-01

    Inhalation of agricultural dusts causes inflammatory reactions and symptoms such as headache, fever, and malaise, which can progress to chronic airway inflammation and associated diseases, e.g. asthma, chronic bronchitis, chronic obstructive pulmonary disease, and hypersensitivity pneumonitis. Although in many agricultural environments feed particles are the major constituent of these dusts, the inflammatory responses that they provoke are likely attributable to particle-associated bacteria, archaebacteria, fungi, and viruses. In this study, we performed shotgun pyrosequencing metagenomic analyses of DNA from dusts from swine confinement facilities or grain elevators, with comparisons to dusts from pet-free households. DNA sequence alignment showed that 19% or 62% of shotgun pyrosequencing metagenomic DNA sequence reads from swine facility or household dusts, respectively, were of swine or human origin, respectively. In contrast only 2% of such reads from grain elevator dust were of mammalian origin. These metagenomic shotgun reads of mammalian origin were excluded from our analyses of agricultural dust microbiota. The ten most prevalent bacterial taxa identified in swine facility compared to grain elevator or household dust were comprised of 75%, 16%, and 42% gram-positive organisms, respectively. Four of the top five swine facility dust genera were assignable (Clostridium, Lactobacillus, Ruminococcus, and Eubacterium, ranging from 4% to 19% relative abundance). The relative abundances of these four genera were lower in dust from grain elevators or pet-free households. These analyses also highlighted the predominance in swine facility dust of Firmicutes (70%) at the phylum level, Clostridia (44%) at the Class level, and Clostridiales at the Order level (41%). In summary, shotgun pyrosequencing metagenomic analyses of agricultural dusts show that they differ qualitatively and quantitatively at the level of microbial taxa present, and that the bioinformatic analyses

  9. Shotgun pyrosequencing metagenomic analyses of dusts from swine confinement and grain facilities.

    Directory of Open Access Journals (Sweden)

    Robert J Boissy

    Full Text Available Inhalation of agricultural dusts causes inflammatory reactions and symptoms such as headache, fever, and malaise, which can progress to chronic airway inflammation and associated diseases, e.g. asthma, chronic bronchitis, chronic obstructive pulmonary disease, and hypersensitivity pneumonitis. Although in many agricultural environments feed particles are the major constituent of these dusts, the inflammatory responses that they provoke are likely attributable to particle-associated bacteria, archaebacteria, fungi, and viruses. In this study, we performed shotgun pyrosequencing metagenomic analyses of DNA from dusts from swine confinement facilities or grain elevators, with comparisons to dusts from pet-free households. DNA sequence alignment showed that 19% or 62% of shotgun pyrosequencing metagenomic DNA sequence reads from swine facility or household dusts, respectively, were of swine or human origin, respectively. In contrast only 2% of such reads from grain elevator dust were of mammalian origin. These metagenomic shotgun reads of mammalian origin were excluded from our analyses of agricultural dust microbiota. The ten most prevalent bacterial taxa identified in swine facility compared to grain elevator or household dust were comprised of 75%, 16%, and 42% gram-positive organisms, respectively. Four of the top five swine facility dust genera were assignable (Clostridium, Lactobacillus, Ruminococcus, and Eubacterium, ranging from 4% to 19% relative abundance. The relative abundances of these four genera were lower in dust from grain elevators or pet-free households. These analyses also highlighted the predominance in swine facility dust of Firmicutes (70% at the phylum level, Clostridia (44% at the Class level, and Clostridiales at the Order level (41%. In summary, shotgun pyrosequencing metagenomic analyses of agricultural dusts show that they differ qualitatively and quantitatively at the level of microbial taxa present, and that the

  10. Comparative analysis of pyrosequencing and a phylogenetic microarray for exploring microbial community structures in the human distal intestine.

    Directory of Open Access Journals (Sweden)

    Marcus J Claesson

    Full Text Available BACKGROUND: Variations in the composition of the human intestinal microbiota are linked to diverse health conditions. High-throughput molecular technologies have recently elucidated microbial community structure at much higher resolution than was previously possible. Here we compare two such methods, pyrosequencing and a phylogenetic array, and evaluate classifications based on two variable 16S rRNA gene regions. METHODS AND FINDINGS: Over 1.75 million amplicon sequences were generated from the V4 and V6 regions of 16S rRNA genes in bacterial DNA extracted from four fecal samples of elderly individuals. The phylotype richness, for individual samples, was 1,400-1,800 for V4 reads and 12,500 for V6 reads, and 5,200 unique phylotypes when combining V4 reads from all samples. The RDP-classifier was more efficient for the V4 than for the far less conserved and shorter V6 region, but differences in community structure also affected efficiency. Even when analyzing only 20% of the reads, the majority of the microbial diversity was captured in two samples tested. DNA from the four samples was hybridized against the Human Intestinal Tract (HIT Chip, a phylogenetic microarray for community profiling. Comparison of clustering of genus counts from pyrosequencing and HITChip data revealed highly similar profiles. Furthermore, correlations of sequence abundance and hybridization signal intensities were very high for lower-order ranks, but lower at family-level, which was probably due to ambiguous taxonomic groupings. CONCLUSIONS: The RDP-classifier consistently assigned most V4 sequences from human intestinal samples down to genus-level with good accuracy and speed. This is the deepest sequencing of single gastrointestinal samples reported to date, but microbial richness levels have still not leveled out. A majority of these diversities can also be captured with five times lower sampling-depth. HITChip hybridizations and resulting community profiles correlate

  11. Transcriptomic survey of the midgut of Anthonomus grandis (Coleoptera: Curculionidae).

    Science.gov (United States)

    Salvador, Ricardo; Príncipi, Darío; Berretta, Marcelo; Fernández, Paula; Paniego, Norma; Sciocco-Cap, Alicia; Hopp, Esteban

    2014-01-01

    Anthonomus grandis Boheman is a key pest in cotton crops in the New World. Its larval stage develops within the flower bud using it as food and as protection against its predators. This behavior limits the effectiveness of its control using conventional insecticide applications and biocontrol techniques. In spite of its importance, little is known about its genome sequence and, more important, its specific expression in key organs like the midgut. Total mRNA isolated from larval midguts was used for pyrosequencing. Sequence reads were assembled and annotated to generate a unigene data set. In total, 400,000 reads from A. grandis midgut with an average length of 237 bp were assembled and combined into 20,915 contigs. The assembled reads fell into 6,621 genes models. BlastX search using the NCBI-NR database showed that 3,006 unigenes had significant matches to known sequences. Gene Ontology (GO) mapping analysis evidenced that A. grandis is able to transcripts coding for proteins involved in catalytic processing of macromolecules that allows its adaptation to very different feeding source scenarios. Furthermore, transcripts encoding for proteins involved in detoxification mechanisms such as p450 genes, glutathione-S-transferase, and carboxylesterases are also expressed. This is the first report of a transcriptomic study in A. grandis and the largest set of sequence data reported for this species. These data are valuable resources to expand the knowledge of this insect group and could be used in the design of new control strategies based in molecular information. © The Author 2014. Published by Oxford University Press on behalf of the Entomological Society of America.

  12. The diversity and structure of marine protists in the coastal waters of China revealed by morphological observation and 454 pyrosequencing

    Science.gov (United States)

    Liu, Yun; Song, Shuqun; Chen, Tiantian; Li, Caiwen

    2017-04-01

    Pyrosequencing of the 18S rRNA gene has been widely adopted to study the eukaryotic diversity in various types of environments, and has an advantage over traditional morphology methods in exploring unknown microbial communities. To comprehensively assess the diversity and community composition of marine protists in the coastal waters of China, we applied both morphological observations and high-throughput sequencing of the V2 and V3 regions of 18S rDNA simultaneously to analyze samples collected from the surface layer of the Yellow and East China Seas. Dinoflagellates, diatoms and ciliates were the three dominant protistan groups as revealed by the two methods. Diatoms were the first dominant protistan group in the microscopic observations, with Skeletonema mainly distributed in the nearshore eutrophic waters and Chaetoceros in higher temperature and higher pH waters. The mixotrophic dinoflagellates, Gymnodinium and Gyrodinium, were more competitive in the oligotrophic waters. The pyrosequencing method revealed an extensive diversity of dinoflagellates. Chaetoceros was the only dominant diatom group in the pyrosequencing dataset. Gyrodinium represented the most abundant reads and dominated the offshore oligotrophic protistan community as they were in the microscopic observations. The dominance of parasitic dinoflagellates in the pyrosequencing dataset, which were overlooked in the morphological observations, indicates more attention should be paid to explore the potential role of this group. Both methods provide coherent clustering of samples. Nutrient levels, salinity and pH were the main factors influencing the distribution of protists. This study demonstrates that different primer pairs used in the pyrosequencing will indicate different protistan community structures. A suitable marker may reveal more comprehensive composition of protists and provide valuable information on environmental drivers.

  13. The maternal and early embryonic transcriptome of the milkweed bug Oncopeltus fasciatus.

    Science.gov (United States)

    Ewen-Campen, Ben; Shaner, Nathan; Panfilio, Kristen A; Suzuki, Yuichiro; Roth, Siegfried; Extavour, Cassandra G

    2011-01-25

    Most evolutionary developmental biology ("evo-devo") studies of emerging model organisms focus on small numbers of candidate genes cloned individually using degenerate PCR. However, newly available sequencing technologies such as 454 pyrosequencing have recently begun to allow for massive gene discovery in animals without sequenced genomes. Within insects, although large volumes of sequence data are available for holometabolous insects, developmental studies of basally branching hemimetabolous insects typically suffer from low rates of gene discovery. We used 454 pyrosequencing to sequence over 500 million bases of cDNA from the ovaries and embryos of the milkweed bug Oncopeltus fasciatus, which lacks a sequenced genome. This indirectly developing insect occupies an important phylogenetic position, branching basal to Diptera (including fruit flies) and Hymenoptera (including honeybees), and is an experimentally tractable model for short-germ development. 2,087,410 reads from both normalized and non-normalized cDNA assembled into 21,097 sequences (isotigs) and 112,531 singletons. The assembled sequences fell into 16,617 unique gene models, and included predictions of splicing isoforms, which we examined experimentally. Discovery of new genes plateaued after assembly of ~1.5 million reads, suggesting that we have sequenced nearly all transcripts present in the cDNA sampled. Many transcripts have been assembled at close to full length, and there is a net gain of sequence data for over half of the pre-existing O. fasciatus accessions for developmental genes in GenBank. We identified 10,775 unique genes, including members of all major conserved metazoan signaling pathways and genes involved in several major categories of early developmental processes. We also specifically address the effects of cDNA normalization on gene discovery in de novo transcriptome analyses. Our sequencing, assembly and annotation framework provide a simple and effective way to achieve high

  14. The maternal and early embryonic transcriptome of the milkweed bug Oncopeltus fasciatus

    Directory of Open Access Journals (Sweden)

    Roth Siegfried

    2011-01-01

    Full Text Available Abstract Background Most evolutionary developmental biology ("evo-devo" studies of emerging model organisms focus on small numbers of candidate genes cloned individually using degenerate PCR. However, newly available sequencing technologies such as 454 pyrosequencing have recently begun to allow for massive gene discovery in animals without sequenced genomes. Within insects, although large volumes of sequence data are available for holometabolous insects, developmental studies of basally branching hemimetabolous insects typically suffer from low rates of gene discovery. Results We used 454 pyrosequencing to sequence over 500 million bases of cDNA from the ovaries and embryos of the milkweed bug Oncopeltus fasciatus, which lacks a sequenced genome. This indirectly developing insect occupies an important phylogenetic position, branching basal to Diptera (including fruit flies and Hymenoptera (including honeybees, and is an experimentally tractable model for short-germ development. 2,087,410 reads from both normalized and non-normalized cDNA assembled into 21,097 sequences (isotigs and 112,531 singletons. The assembled sequences fell into 16,617 unique gene models, and included predictions of splicing isoforms, which we examined experimentally. Discovery of new genes plateaued after assembly of ~1.5 million reads, suggesting that we have sequenced nearly all transcripts present in the cDNA sampled. Many transcripts have been assembled at close to full length, and there is a net gain of sequence data for over half of the pre-existing O. fasciatus accessions for developmental genes in GenBank. We identified 10,775 unique genes, including members of all major conserved metazoan signaling pathways and genes involved in several major categories of early developmental processes. We also specifically address the effects of cDNA normalization on gene discovery in de novo transcriptome analyses. Conclusions Our sequencing, assembly and annotation framework

  15. Reptilian Transcriptomes v2.0: An Extensive Resource for Sauropsida Genomics and Transcriptomics.

    Science.gov (United States)

    Tzika, Athanasia C; Ullate-Agote, Asier; Grbic, Djordje; Milinkovitch, Michel C

    2015-07-01

    Despite the availability of deep-sequencing techniques, genomic and transcriptomic data remain unevenly distributed across phylogenetic groups. For example, reptiles are poorly represented in sequence databases, hindering functional evolutionary and developmental studies in these lineages substantially more diverse than mammals. In addition, different studies use different assembly and annotation protocols, inhibiting meaningful comparisons. Here, we present the "Reptilian Transcriptomes Database 2.0," which provides extensive annotation of transcriptomes and genomes from species covering the major reptilian lineages. To this end, we sequenced normalized complementary DNA libraries of multiple adult tissues and various embryonic stages of the leopard gecko and the corn snake and gathered published reptilian sequence data sets from representatives of the four extant orders of reptiles: Squamata (snakes and lizards), the tuatara, crocodiles, and turtles. The LANE runner 2.0 software was implemented to annotate all assemblies within a single integrated pipeline. We show that this approach increases the annotation completeness of the assembled transcriptomes/genomes. We then built large concatenated protein alignments of single-copy genes and inferred phylogenetic trees that support the positions of turtles and the tuatara as sister groups of Archosauria and Squamata, respectively. The Reptilian Transcriptomes Database 2.0 resource will be updated to include selected new data sets as they become available, thus making it a reference for differential expression studies, comparative genomics and transcriptomics, linkage mapping, molecular ecology, and phylogenomic analyses involving reptiles. The database is available at www.reptilian-transcriptomes.org and can be enquired using a wwwblast server installed at the University of Geneva. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  16. Pyrosequencing of prey DNA in reptile faeces: analysis of earthworm consumption by slow worms.

    Science.gov (United States)

    Brown, David S; Jarman, Simon N; Symondson, William O C

    2012-03-01

    Little quantitative ecological information exists on the diets of most invertebrate feeding reptiles, particularly nocturnal or elusive species that are difficult to observe. In the UK and elsewhere, reptiles are legally required to be relocated before land development can proceed, but without knowledge of their dietary requirements, the suitability of receptor sites cannot be known. Here, we tested the ability of non-invasive DNA-based molecular diagnostics (454 pyrosequencing) to analyse reptile diets, with the specific aims of determining which earthworm species are exploited by slow worms (the legless lizard Anguis fragilis) and whether they feed on the deeper-living earthworm species that only come to the surface at night. Slow worm faecal samples from four different habitats were analysed using earthworm-specific PCR primers. We found that 86% of slow worms (N=80) had eaten earthworms. In lowland heath and marshy/acid grassland, Lumbricus rubellus, a surface-dwelling epigeic species, dominated slow worm diet. In two other habitats, riverside pasture and calciferous coarse grassland, diet was dominated by deeper-living anecic and endogeic species. We conclude that all species of earthworm are exploited by these reptiles and lack of specialization allows slow worms to thrive in a wide variety of habitats. Pyrosequencing of prey DNA in faeces showed promise as a practical, rapid and relatively inexpensive means of obtaining detailed and valuable ecological information on the diets of reptiles. © 2011 Blackwell Publishing Ltd.

  17. Pyrosequencing reveals highly diverse and species-specific microbial communities in sponges from the Red Sea

    KAUST Repository

    Lee, Onon; Wang, Yong; Yang, Jiangke; Lafi, Feras Fawzi; Al-Suwailem, Abdulaziz M.; Qian, Peiyuan

    2010-01-01

    Marine sponges are associated with a remarkable array of microorganisms. Using a tag pyrosequencing technology, this study was the first to investigate in depth the microbial communities associated with three Red Sea sponges, Hyrtios erectus

  18. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  19. Web services for transcriptomics

    NARCIS (Netherlands)

    Neerincx, P.

    2009-01-01

    Transcriptomics is part of a family of disciplines focussing on high throughput molecular biology experiments. In the case of transcriptomics, scientists study the expression of genes resulting in transcripts. These transcripts can either perform a biological function themselves or function as

  20. Pathogenic bacteria in sewage treatment plants as revealed by 454 pyrosequencing.

    Science.gov (United States)

    Ye, Lin; Zhang, Tong

    2011-09-01

    This study applied 454 high-throughput pyrosequencing to analyze potentially pathogenic bacteria in activated sludge from 14 municipal wastewater treatment plants (WWTPs) across four countries (China, U.S., Canada, and Singapore), plus the influent and effluent of one of the 14 WWTPs. A total of 370,870 16S rRNA gene sequences with average length of 207 bps were obtained and all of them were assigned to corresponding taxonomic ranks by using RDP classifier and MEGAN. It was found that the most abundant potentially pathogenic bacteria in the WWTPs were affiliated with the genera of Aeromonas and Clostridium. Aeromonas veronii, Aeromonas hydrophila, and Clostridium perfringens were species most similar to the potentially pathogenic bacteria found in this study. Some sequences highly similar (>99%) to Corynebacterium diphtheriae were found in the influent and activated sludge samples from a saline WWTP. Overall, the percentage of the sequences closely related (>99%) to known pathogenic bacteria sequences was about 0.16% of the total sequences. Additionally, a platform-independent Java application (BAND) was developed for graphical visualization of the data of microbial abundance generated by high-throughput pyrosequencing. The approach demonstrated in this study could examine most of the potentially pathogenic bacteria simultaneously instead of one-by-one detection by other methods.

  1. Transcriptomes analysis of Aeromonas molluscorum Av27 cells exposed to tributyltin (TBT): Unravelling the effects from the molecular level to the organism

    Science.gov (United States)

    Cruz, Andreia; Rodrigues, Raquel; Pinheiro, Miguel; Mendo, Sónia

    2015-01-01

    Aeromonas molluscorum Av27 cells were exposed to 0, 5 and 50 μM of TBT and the respective transcriptomes were obtained by pyrosequencing. Gene Ontology revealed that exposure to 5 μM TBT results in a higher number of repressed genes in contrast with 50 μM of TBT, where the number of over-expressed genes is greater. At both TBT concentrations, higher variations in gene expression were found in the functional categories associated with enzymatic activities, transport/binding and oxidation-reduction. A number of proteins are affected by TBT, such as the acriflavin resistance protein, several transcription-related proteins, several Hsps, ABC transporters, CorA and ZntB and other outer membrane efflux proteins, all of these involved in cellular metabolic processes, important to maintain overall cell viability. Using the STRING tool, several proteins with unknown function were related with others involved in degradation processes, such as the pyoverdine chromophore biosynthetic protein, that has been described as playing a role in the Sn–C cleavage of organotins. This approach has allowed a better understanding of the molecular effects of exposure of bacterial cells to TBT. Furthermore it contributes to the knowledge of the functional genomic aspects of bacteria exposed to this pollutant. Furthermore, the transcriptomic data gathered, and now publically available, constitute a valuable resource for comparative genome analysis. PMID:26171931

  2. Pyrosequencing as a tool for the detection of Phytophthora species: error rate and risk of false Molecular Operational Taxonomic Units.

    Science.gov (United States)

    Vettraino, A M; Bonants, P; Tomassini, A; Bruni, N; Vannini, A

    2012-11-01

    To evaluate the accuracy of pyrosequencing for the description of Phytophthora communities in terms of taxa identification and risk of assignment for false Molecular Operational Taxonomic Units (MOTUs). Pyrosequencing of Internal Transcribed Spacer 1 (ITS1) amplicons was used to describe the structure of a DNA mixture comprising eight Phytophthora spp. and Pythium vexans. Pyrosequencing resulted in 16 965 reads, detecting all species in the template DNA mixture. Reducing the ITS1 sequence identity threshold resulted in a decrease in numbers of unmatched reads but a concomitant increase in the numbers of false MOTUs. The total error rate was 0·63% and comprised mainly mismatches (0·25%) Pyrosequencing of ITS1 region is an efficient and accurate technique for the detection and identification of Phytophthora spp. in environmental samples. However, the risk of allocating false MOTUs, even when demonstrated to be low, may require additional validation with alternative detection methods. Phytophthora spp. are considered among the most destructive groups of invasive plant pathogens, affecting thousands of cultivated and wild plants worldwide. Simultaneous early detection of Phytophthora complexes in environmental samples offers an unique opportunity for the interception of known and unknown species along pathways of introduction, along with the identification of these organisms in invaded environments. © 2012 The Authors Letters in Applied Microbiology © 2012 The Society for Applied Microbiology.

  3. The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE

    Directory of Open Access Journals (Sweden)

    Steinhauer Diana

    2011-02-01

    Full Text Available Abstract Background The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop chickpea (Cicer arietinum L.. While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example for such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas on the transcriptome level. Results We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26 bp deepSuperSAGE tags (UniTags from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26 bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold under salt stress in both organs, witnessing a differential organ-specific response to stress. Profiting from recent pioneer works on massive cDNA sequencing in chickpea, more than 9,400 Uni

  4. The ambrosia symbiosis is specific in some species and promiscuous in others: evidence from community pyrosequencing

    Czech Academy of Sciences Publication Activity Database

    Kostovčík, Martin; Bateman, C.C.; Kolařík, Miroslav; Stelinski, L.L.; Jordal, B.H.; Hulcr, J.

    2015-01-01

    Roč. 9, č. 1 (2015), s. 126-138 ISSN 1751-7362 Institutional support: RVO:61388971 Keywords : ambrosia symbiosis * pyrosequencing Subject RIV: EE - Microbiology, Virology Impact factor: 9.328, year: 2015

  5. Analysis of the scallop microbiota by means of 16S rRNA gene pyrosequencing

    Directory of Open Access Journals (Sweden)

    Alex Mira

    2014-06-01

    Pyrosequencing of the samples resulted in a total of 18520 sequences (3000 per sample, approximately with an average length of 325 bp (base pairs. The taxonomic assignment of sequences allowed the identification to the genus level, being observed a large bacterial diversity with over 110 genera. The most prevalent genera in the samples were Hydrotalea, Acinetobacter, Delftia, Sediminibacter and Pseudomonas, among others. Differences in the microbial communities were observed among the samples, and the PCoA analysis allowed their separation by means on their gender and if they proceed from sampling before or after the spawning. Nevertheless, the rarefaction curves obtained for each sample failed to reach a saturation phase, indicating that more sequencing effort would be necessary.

  6. EcoBrowser: a web-based tool for visualizing transcriptome data of Escherichia coli

    Directory of Open Access Journals (Sweden)

    Jia Peng

    2011-10-01

    Full Text Available Abstract Background Escherichia coli has been extensively studied as a prokaryotic model organism whose whole genome was determined in 1997. However, it is difficult to identify all the gene products involved in diverse functions by using whole genome sequencesalone. The high-resolution transcriptome mapping using tiling arrays has proved effective to improve the annotation of transcript units and discover new transcripts of ncRNAs. While abundant tiling array data have been generated, the lack of appropriate visualization tools to accommodate and integrate multiple sources of data has emerged. Findings EcoBrowser is a web-based tool for visualizing genome annotations and transcriptome data of E. coli. Important tiling array data of E. coli from different experimental platforms are collected and processed for query. An AJAX based genome browser is embedded for visualization. Thus, genome annotations can be compared with transcript profiling and genome occupancy profiling from independent experiments, which will be helpful in discovering new transcripts including novel mRNAs and ncRNAs, generating a detailed description of the transcription unit architecture, further providing clues for investigation of prokaryotic transcriptional regulation that has proved to be far more complex than previously thought. Conclusions With the help of EcoBrowser, users can get a systemic view both from the vertical and parallel sides, as well as inspirations for the design of new experiments which will expand our understanding of the regulation mechanism.

  7. Evaluation of Automated Ribosomal Intergenic Spacer Analysis for Bacterial Fingerprinting of Rumen Microbiome Compared to Pyrosequencing Technology

    Directory of Open Access Journals (Sweden)

    Elie Jami

    2014-01-01

    Full Text Available The mammalian gut houses a complex microbial community which is believed to play a significant role in host physiology. In recent years, several microbial community analysis methods have been implemented to study the whole gut microbial environment, in contrast to classical microbiological methods focusing on bacteria which can be cultivated. One of these is automated ribosomal intergenic spacer analysis (ARISA, an inexpensive and popular way of analyzing bacterial diversity and community fingerprinting in ecological samples. ARISA uses the natural variability in length of the DNA fragment found between the 16S and 23S genes in different bacterial lineages to infer diversity. This method is now being supplanted by affordable next-generation sequencing technologies that can also simultaneously annotate operational taxonomic units for taxonomic identification. We compared ARISA and pyrosequencing of samples from the rumen microbiome of cows, previously sampled at different stages of development and varying in microbial complexity using several ecological parameters. We revealed close agreement between ARISA and pyrosequencing outputs, especially in their ability to discriminate samples from different ecological niches. In contrast, the ARISA method seemed to underestimate sample richness. The good performance of the relatively inexpensive ARISA makes it relevant for straightforward use in bacterial fingerprinting analysis as well as for quick cross-validation of pyrosequencing data.

  8. Identification and genotyping of molluscum contagiosum virus from genital swab samples by real-time PCR and Pyrosequencing.

    Science.gov (United States)

    Trama, Jason P; Adelson, Martin E; Mordechai, Eli

    2007-12-01

    Laboratory diagnosis of molluscum contagiosum virus (MCV) is important as lesions can be confused with those caused by Cryptococcus neoformans, herpes simplex virus, human papillomavirus, and varicella-zoster virus. To develop a rapid method for identifying patients infected with MCV via swab sampling. Two dual-labeled probe real-time PCR assays, one homologous to the p43K gene and one to the MC080R gene, were designed. The p43K PCR was designed to be used in conjunction with Pyrosequencing for confirmation of PCR products and discrimination between MCV1 and MCV2. Both PCR assays were optimized with respect to reaction components, thermocycling parameters, and primer and probe concentrations. The specificities of both PCR assays were confirmed by non-amplification of 38 known human pathogens. Sensitivity assays demonstrated detection of as few as 10 copies per reaction. Testing 703 swabs, concordance between the two real-time PCR assays was 99.9%. Under the developed conditions, Pyrosequencing of the p43K PCR product was capable of providing enough nucleotide sequence to definitively differentiate MCV1 and MCV2. These real-time PCR assays can be used for the rapid, sensitive, and specific detection of MCV and, when combined with Pyrosequencing, can further discriminate between MCV1 and MCV2.

  9. Transcriptome-based characterization of interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus in lactose-grown chemostat cocultures.

    Science.gov (United States)

    Mendes, Filipa; Sieuwerts, Sander; de Hulster, Erik; Almering, Marinka J H; Luttik, Marijke A H; Pronk, Jack T; Smid, Eddy J; Bron, Peter A; Daran-Lapujade, Pascale

    2013-10-01

    Mixed populations of Saccharomyces cerevisiae yeasts and lactic acid bacteria occur in many dairy, food, and beverage fermentations, but knowledge about their interactions is incomplete. In the present study, interactions between Saccharomyces cerevisiae and Lactobacillus delbrueckii subsp. bulgaricus, two microorganisms that co-occur in kefir fermentations, were studied during anaerobic growth on lactose. By combining physiological and transcriptome analysis of the two strains in the cocultures, five mechanisms of interaction were identified. (i) Lb. delbrueckii subsp. bulgaricus hydrolyzes lactose, which cannot be metabolized by S. cerevisiae, to galactose and glucose. Subsequently, galactose, which cannot be metabolized by Lb. delbrueckii subsp. bulgaricus, is excreted and provides a carbon source for yeast. (ii) In pure cultures, Lb. delbrueckii subsp. bulgaricus grows only in the presence of increased CO2 concentrations. In anaerobic mixed cultures, the yeast provides this CO2 via alcoholic fermentation. (iii) Analysis of amino acid consumption from the defined medium indicated that S. cerevisiae supplied alanine to the bacterium. (iv) A mild but significant low-iron response in the yeast transcriptome, identified by DNA microarray analysis, was consistent with the chelation of iron by the lactate produced by Lb. delbrueckii subsp. bulgaricus. (v) Transcriptome analysis of Lb. delbrueckii subsp. bulgaricus in mixed cultures showed an overrepresentation of transcripts involved in lipid metabolism, suggesting either a competition of the two microorganisms for fatty acids or a response to the ethanol produced by S. cerevisiae. This study demonstrates that chemostat-based transcriptome analysis is a powerful tool to investigate microbial interactions in mixed populations.

  10. Integration of deep transcriptome and proteome analyses reveals the components of alkaloid metabolism in opium poppy cell cultures.

    Science.gov (United States)

    Desgagné-Penix, Isabel; Khan, Morgan F; Schriemer, David C; Cram, Dustin; Nowak, Jacek; Facchini, Peter J

    2010-11-18

    Papaver somniferum (opium poppy) is the source for several pharmaceutical benzylisoquinoline alkaloids including morphine, the codeine and sanguinarine. In response to treatment with a fungal elicitor, the biosynthesis and accumulation of sanguinarine is induced along with other plant defense responses in opium poppy cell cultures. The transcriptional induction of alkaloid metabolism in cultured cells provides an opportunity to identify components of this process via the integration of deep transcriptome and proteome databases generated using next-generation technologies. A cDNA library was prepared for opium poppy cell cultures treated with a fungal elicitor for 10 h. Using 454 GS-FLX Titanium pyrosequencing, 427,369 expressed sequence tags (ESTs) with an average length of 462 bp were generated. Assembly of these sequences yielded 93,723 unigenes, of which 23,753 were assigned Gene Ontology annotations. Transcripts encoding all known sanguinarine biosynthetic enzymes were identified in the EST database, 5 of which were represented among the 50 most abundant transcripts. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) of total protein extracts from cell cultures treated with a fungal elicitor for 50 h facilitated the identification of 1,004 proteins. Proteins were fractionated by one-dimensional SDS-PAGE and digested with trypsin prior to LC-MS/MS analysis. Query of an opium poppy-specific EST database substantially enhanced peptide identification. Eight out of 10 known sanguinarine biosynthetic enzymes and many relevant primary metabolic enzymes were represented in the peptide database. The integration of deep transcriptome and proteome analyses provides an effective platform to catalogue the components of secondary metabolism, and to identify genes encoding uncharacterized enzymes. The establishment of corresponding transcript and protein databases generated by next-generation technologies in a system with a well-defined metabolite profile facilitates

  11. Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

    Directory of Open Access Journals (Sweden)

    Marais Gabriel AB

    2011-07-01

    Full Text Available Abstract Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO terms, and thousands of single-nucleotide polymorphisms (SNPs were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49% that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to

  12. Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

    Science.gov (United States)

    2011-01-01

    Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a

  13. MGMT promoter methylation determined by HRM in comparison to MSP and pyrosequencing for predicting high-grade glioma response.

    Science.gov (United States)

    Switzeny, Olivier J; Christmann, Markus; Renovanz, Mirjam; Giese, Alf; Sommer, Clemens; Kaina, Bernd

    2016-01-01

    The DNA repair protein O(6)-methylguanine-DNA methyltransferase (MGMT) causes resistance of cancer cells to alkylating agents and, therefore, is a well-established predictive marker for high-grade gliomas that are routinely treated with alkylating drugs. Since MGMT is highly epigenetically regulated, the MGMT promoter methylation status is taken as an indicator of MGMT silencing, predicting the outcome of glioma therapy. MGMT promoter methylation is usually determined by methylation specific PCR (MSP), which is a labor intensive and error-prone method often used semi-quantitatively. Searching for alternatives, we used closed-tube high resolution melt (HRM) analysis, which is a quantitative method, and compared it with MSP and pyrosequencing regarding its predictive value. We analyzed glioblastoma cell lines with known MGMT activity and formalin-fixed samples from IDH1 wild-type high-grade glioma patients (WHO grade III/IV) treated with radiation and temozolomide by HRM, MSP, and pyrosequencing. The data were compared as to progression-free survival (PFS) and overall survival (OS) of patients exhibiting the methylated and unmethylated MGMT status. A promoter methylation cut-off level relevant for PFS and OS was determined. In a multivariate Cox regression model, methylation of MGMT promoter of high-grade gliomas analyzed by HRM, but not MSP, was found to be an independent predictive marker for OS. Univariate Kaplan-Meier analyses revealed for PFS and OS a significant and better discrimination between methylated and unmethylated tumors when quantitative HRM was used instead of MSP. Compared to MSP and pyrosequencing, the HRM method is simple, cost effective, highly accurate and fast. HRM is at least equivalent to pyrosequencing in quantifying the methylation level. It is superior in predicting PFS and OS of high-grade glioma patients compared to MSP and, therefore, can be recommended being used routinely for determination of the MGMT status of gliomas.

  14. Diversity and structure of soil bacterial communities in the Fildes Region (maritime Antarctica as revealed by 454 pyrosequencing

    Directory of Open Access Journals (Sweden)

    Neng Fei eWang

    2015-10-01

    Full Text Available This study assessed the diversity and composition of bacterial communities in four different soils (human-, penguin-, seal-colony impacted soils and pristine soil in the Fildes Region (King George Island, Antarctica using 454 pyrosequencing with bacterial-specific primers targeting the 16S rRNA gene. Proteobacteria, Actinobacteria, Acidobacteria, and Verrucomicrobia were abundant phyla in almost all the soil samples. The four types of soils were significantly different in geochemical properties and bacterial community structure. Thermotogae, Cyanobacteria, Fibrobacteres, Deinococcus-Thermus, and Chlorobi obviously varied in their abundance among the 4 soil types. Considering all the samples together, members of the genera Gaiella, Chloracidobacterium, Nitrospira, Polaromonas, Gemmatimonas, Sphingomonas and Chthoniobacter were found to predominate, whereas members of the genera Chamaesiphon, Herbaspirillum, Hirschia, Nevskia, Nitrosococcus, Rhodococcus, Rhodomicrobium, and Xanthomonas varied obviously in their abundance among the four soil types. Distance-based redundancy analysis revealed that pH (p < 0.01, phosphate phosphorus (p < 0.01, organic carbon (p < 0.05, and organic nitrogen (p < 0.05 were the most significant factors that correlated with the community distribution of soil bacteria. To our knowledge, this is the first study to explore the soil bacterial communities in human-, penguin-, and seal- colony impacted soils from ice-free areas in maritime Antarctica using high-throughput pyrosequencing.

  15. Transcriptome of interstitial cells of Cajal reveals unique and selective gene signatures.

    Directory of Open Access Journals (Sweden)

    Moon Young Lee

    Full Text Available Transcriptome-scale data can reveal essential clues into understanding the underlying molecular mechanisms behind specific cellular functions and biological processes. Transcriptomics is a continually growing field of research utilized in biomarker discovery. The transcriptomic profile of interstitial cells of Cajal (ICC, which serve as slow-wave electrical pacemakers for gastrointestinal (GI smooth muscle, has yet to be uncovered. Using copGFP-labeled ICC mice and flow cytometry, we isolated ICC populations from the murine small intestine and colon and obtained their transcriptomes. In analyzing the transcriptome, we identified a unique set of ICC-restricted markers including transcription factors, epigenetic enzymes/regulators, growth factors, receptors, protein kinases/phosphatases, and ion channels/transporters. This analysis provides new and unique insights into the cellular and biological functions of ICC in GI physiology. Additionally, we constructed an interactive ICC genome browser (http://med.unr.edu/physio/transcriptome based on the UCSC genome database. To our knowledge, this is the first online resource that provides a comprehensive library of all known genetic transcripts expressed in primary ICC. Our genome browser offers a new perspective into the alternative expression of genes in ICC and provides a valuable reference for future functional studies.

  16. Pyrosequencing, a method approved to detect the two major EGFR mutations for anti EGFR therapy in NSCLC

    Directory of Open Access Journals (Sweden)

    Richard Marie-Jeanne

    2011-05-01

    Full Text Available Abstract Background Epidermal Growth Factor Receptor (EGFR mutations, especially in-frame deletions in exon 19 (ΔLRE and a point mutation in exon 21 (L858R predict gefitinib sensitivity in patients with non-small cell lung cancer. Several methods are currently described for their detection but the gold standard for tissue samples remains direct DNA sequencing, which requires samples containing at least 50% of tumor cells. Methods We designed a pyrosequencing assay based on nested PCR for the characterization of theses mutations on formalin-fixed and paraffin-embedded tumor tissue. Results This method is highly specific and permits precise characterization of all the exon 19 deletions. Its sensitivity is higher than that of "BigDye terminator" sequencing and enabled detection of 3 additional mutations in the 58 NSCLC tested. The concordance between the two methods was very good (97.4%. In the prospective analysis of 213 samples, 7 (3.3% samples were not analyzed and EGFR mutations were detected in 18 (8.7% patients. However, we observed a deficit of mutation detection when the samples were very poor in tumor cells. Conclusions pyrosequencing is then a highly accurate method for detecting ΔLRE and L858R EGFR mutations in patients with NSCLC when the samples contain at least 20% of tumor cells.

  17. Characterizing the transcriptome and molecular markers information ...

    Indian Academy of Sciences (India)

    2008). Analyses of the genetic structure .... The annotations and classifications for the transcriptome ... Based on the Pfam classification, the predic- ..... J. Lipid. Res. 33, 251–262. Eckert C. G., Samis K. E. and Lougheed S. C. 2008 Genetic vari-.

  18. Detection by real-time PCR and pyrosequencing of the cry1Ab and cry1Ac genes introduced in genetically modified (GM) constructs.

    Science.gov (United States)

    Debode, Frederic; Janssen, Eric; Bragard, Claude; Berben, Gilbert

    2017-08-01

    The presence of genetically modified organisms (GMOs) in food and feed is mainly detected by the use of targets focusing on promoters and terminators. As some genes are frequently used in genetically modified (GM) construction, they also constitute excellent screening elements and their use is increasing. In this paper we propose a new target for the detection of cry1Ab and cry1Ac genes by real-time polymerase chain reaction (PCR) and pyrosequencing. The specificity, sensitivity and robustness of the real-time PCR method were tested following the recommendations of international guidelines and the method met the expected performance criteria. This paper also shows how the robustness testing was assessed. This new cry1Ab/Ac method can provide a positive signal with a larger number of GM events than do the other existing methods using double dye-probes. The method permits the analysis of results with less ambiguity than the SYBRGreen method recommended by the European Reference Laboratory (EURL) GM Food and Feed (GMFF). A pyrosequencing method was also developed to gain additional information thanks to the sequence of the amplicon. This method of sequencing-by-synthesis can determine the sequence between the primers used for PCR. Pyrosequencing showed that the sequences internal to the primers present differences following the GM events considered and three different sequences were observed. The sensitivity of the pyrosequencing was tested on reference flours with a low percentage GM content and different copy numbers. Improvements in the pyrosequencing protocol provided correct sequences with 50 copies of the target. Below this copy number, the quality of the sequence was more random.

  19. Impact of Transcriptomics on Our Understanding of Pulmonary Fibrosis

    Science.gov (United States)

    Vukmirovic, Milica; Kaminski, Naftali

    2018-01-01

    Idiopathic pulmonary fibrosis (IPF) is a lethal fibrotic lung disease characterized by aberrant remodeling of the lung parenchyma with extensive changes to the phenotypes of all lung resident cells. The introduction of transcriptomics, genome scale profiling of thousands of RNA transcripts, caused a significant inversion in IPF research. Instead of generating hypotheses based on animal models of disease, or biological plausibility, with limited validation in humans, investigators were able to generate hypotheses based on unbiased molecular analysis of human samples and then use animal models of disease to test their hypotheses. In this review, we describe the insights made from transcriptomic analysis of human IPF samples. We describe how transcriptomic studies led to identification of novel genes and pathways involved in the human IPF lung such as: matrix metalloproteinases, WNT pathway, epithelial genes, role of microRNAs among others, as well as conceptual insights such as the involvement of developmental pathways and deep shifts in epithelial and fibroblast phenotypes. The impact of lung and transcriptomic studies on disease classification, endotype discovery, and reproducible biomarkers is also described in detail. Despite these impressive achievements, the impact of transcriptomic studies has been limited because they analyzed bulk tissue and did not address the cellular and spatial heterogeneity of the IPF lung. We discuss new emerging technologies and applications, such as single-cell RNAseq and microenvironment analysis that may address cellular and spatial heterogeneity. We end by making the point that most current tissue collections and resources are not amenable to analysis using the novel technologies. To take advantage of the new opportunities, we need new efforts of sample collections, this time focused on access to all the microenvironments and cells in the IPF lung. PMID:29670881

  20. Transcriptomes analysis of Aeromonas molluscorum Av27 cells exposed to tributyltin (TBT): Unravelling the effects from the molecular level to the organism.

    Science.gov (United States)

    Cruz, Andreia; Rodrigues, Raquel; Pinheiro, Miguel; Mendo, Sónia

    2015-08-01

    Aeromonas molluscorum Av27 cells were exposed to 0, 5 and 50 μM of TBT and the respective transcriptomes were obtained by pyrosequencing. Gene Ontology revealed that exposure to 5 μM TBT results in a higher number of repressed genes in contrast with 50 μM of TBT, where the number of over-expressed genes is greater. At both TBT concentrations, higher variations in gene expression were found in the functional categories associated with enzymatic activities, transport/binding and oxidation-reduction. A number of proteins are affected by TBT, such as the acriflavin resistance protein, several transcription-related proteins, several Hsps, ABC transporters, CorA and ZntB and other outer membrane efflux proteins, all of these involved in cellular metabolic processes, important to maintain overall cell viability. Using the STRING tool, several proteins with unknown function were related with others involved in degradation processes, such as the pyoverdine chromophore biosynthetic protein, that has been described as playing a role in the Sn-C cleavage of organotins. This approach has allowed a better understanding of the molecular effects of exposure of bacterial cells to TBT. Furthermore it contributes to the knowledge of the functional genomic aspects of bacteria exposed to this pollutant. Furthermore, the transcriptomic data gathered, and now publically available, constitute a valuable resource for comparative genome analysis. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  1. Development and characterization of microsatellite markers from the humivorous termite Cavitermes tuberosus (Isoptera: Termitinae) using pyrosequencing technology

    Czech Academy of Sciences Publication Activity Database

    Fournier, D.; Hanus, Robert; Roisin, Y.

    2015-01-01

    Roč. 7, č. 2 (2015), s. 521-524 ISSN 1877-7252 Institutional support: RVO:61388963 Keywords : Cavitermes tuberosus * termite * microsatellite * pyrosequencing * population genetics Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 0.446, year: 2015

  2. Microplanktonic community structure in a coastal system relative to a Phaeocystis bloom inferred from morphological and tag pyrosequencing methods.

    Directory of Open Access Journals (Sweden)

    Sébastien Monchy

    Full Text Available BACKGROUND: Massive phytoplankton blooms, like the recurrent Phaeocystis proliferation observed every year in the Eastern English Channel (EEC, have a significant influence on the overall planktonic community structure and their food web dynamics. As well as being an important area for local fisheries, the EEC is an ideal ecosystem for work on microbial diversity. This is because, although its environmental context is relatively complex, it is reasonably well understood due to several years of monitoring and morphological observations of its planktonic organisms. The objective of our study was to better understand the under-explored microbial eukaryotic diversity relative to the Phaeocystis bloom. METHODOLOGY AND PRINCIPAL FINDINGS: The community structure of microplankton (diatoms, haptophytes, ciliates and dinoflagellates was studied through morphological observations and tag pyrosequencing. During the annual Phaeocystis spring bloom, the phytoplankton biomass increased by 34-fold, while the microzooplankton biomass showed a 4-fold increase, representing on average about 4.6% of the biomass of their phytoplankton prey. Tag pyrosequencing unveiled an extensive diversity of Gymnodiniaceae, with G. spirale and G. fusiformis representing the most abundant reads. An extended diversity of Phaeocystales, with partial 18S rDNA genes sequence identity as low as 85% was found, with taxa corresponding to P. globosa, but also to unknown Phaeocystaceae. CONCLUSIONS: Morphological analyses and pyrosequencing were generally in accordance with capturing frequency shifts of abundant taxa. Tag pyrosequencing allowed highlighting the maintenance of microplankton diversity during the Phaeocystis bloom and the increase of the taxa presenting low number of reads (minor taxa along with the dominant ones in response to biotic and/or abiotic changing conditions. Although molecular approaches have enhanced our perception on diversity, it has come to light that the

  3. 16S rRNA gene pyrosequencing reveals bacterial dysbiosis in the duodenum of dogs with idiopathic inflammatory bowel disease.

    Science.gov (United States)

    Suchodolski, Jan S; Dowd, Scot E; Wilke, Vicky; Steiner, Jörg M; Jergens, Albert E

    2012-01-01

    Canine idiopathic inflammatory bowel disease (IBD) is believed to be caused by a complex interaction of genetic, immunologic, and microbial factors. While mucosa-associated bacteria have been implicated in the pathogenesis of canine IBD, detailed studies investigating the enteric microbiota using deep sequencing techniques are lacking. The objective of this study was to evaluate mucosa-adherent microbiota in the duodenum of dogs with spontaneous idiopathic IBD using 16 S rRNA gene pyrosequencing. Biopsy samples of small intestinal mucosa were collected endoscopically from healthy dogs (n = 6) and dogs with moderate IBD (n = 7) or severe IBD (n = 7) as assessed by a clinical disease activity index. Total RNA was extracted from biopsy specimens and 454-pyrosequencing of the 16 S rRNA gene was performed on aliquots of cDNA from each dog. Intestinal inflammation was associated with significant differences in the composition of the intestinal microbiota when compared to healthy dogs. PCoA plots based on the unweighted UniFrac distance metric indicated clustering of samples between healthy dogs and dogs with IBD (ANOSIM, pmicrobial groups, which bear resemblance to dysbiosis reported in humans with chronic intestinal inflammation. These bacterial groups may serve as useful targets for monitoring intestinal inflammation.

  4. Comparative genomic and transcriptomic analysis revealed genetic characteristics related to solvent formation and xylose utilization in Clostridium acetobutylicum EA 2018

    Directory of Open Access Journals (Sweden)

    Wang Shengyue

    2011-02-01

    Full Text Available Abstract Background Clostridium acetobutylicum, a gram-positive and spore-forming anaerobe, is a major strain for the fermentative production of acetone, butanol and ethanol. But a previously isolated hyper-butanol producing strain C. acetobutylicum EA 2018 does not produce spores and has greater capability of solvent production, especially for butanol, than the type strain C. acetobutylicum ATCC 824. Results Complete genome of C. acetobutylicum EA 2018 was sequenced using Roche 454 pyrosequencing. Genomic comparison with ATCC 824 identified many variations which may contribute to the hyper-butanol producing characteristics in the EA 2018 strain, including a total of 46 deletion sites and 26 insertion sites. In addition, transcriptomic profiling of gene expression in EA 2018 relative to that of ATCC824 revealed expression-level changes of several key genes related to solvent formation. For example, spo0A and adhEII have higher expression level, and most of the acid formation related genes have lower expression level in EA 2018. Interestingly, the results also showed that the variation in CEA_G2622 (CAC2613 in ATCC 824, a putative transcriptional regulator involved in xylose utilization, might accelerate utilization of substrate xylose. Conclusions Comparative analysis of C. acetobutylicum hyper-butanol producing strain EA 2018 and type strain ATCC 824 at both genomic and transcriptomic levels, for the first time, provides molecular-level understanding of non-sporulation, higher solvent production and enhanced xylose utilization in the mutant EA 2018. The information could be valuable for further genetic modification of C. acetobutylicum for more effective butanol production.

  5. Transcriptomic network analysis of micronuclei-related genes: a case study

    DEFF Research Database (Denmark)

    van Leeuwen, D. M.; Pedersen, Marie; Knudsen, Lisbeth E.

    2011-01-01

    checkpoint and aneuploidy. The MN-related gene network was tested against a transcriptomics case study associated with MN measurements. In this case study, transcriptomic data from children and adults differentially exposed to ambient air pollution in the Czech Republic were analysed and visualised......Mechanistically relevant information on responses of humans to xenobiotic exposure in relation to chemically induced biological effects, such as micronuclei (MN) formation can be obtained through large-scale transcriptomics studies. Network analysis may enhance the analysis and visualisation...... of such data. Therefore, this study aimed to develop a 'MN formation' network based on a priori knowledge, by using the pathway tool MetaCore. The gene network contained 27 genes and three gene complexes that are related to processes involved in MN formation, e.g. spindle assembly checkpoint, cell cycle...

  6. Transcriptome signature of the adult mouse choroid plexus

    Directory of Open Access Journals (Sweden)

    Marques Fernanda

    2011-01-01

    Full Text Available Abstract Background Although the gene expression profile of several tissues in humans and in rodent animal models has been explored, analysis of the complete choroid plexus (CP transcriptome is still lacking. A better characterization of the CP transcriptome can provide key insights into its functions as one of the barriers that separate the brain from the periphery and in the production of cerebrospinal fluid. Methods This work extends further what is known about the mouse CP transcriptome through a microarray analysis of CP tissue from normal mice under physiological conditions. Results We found that the genes most highly expressed are those implicated in energy metabolism (oxidative phosphorylation, glycolysis/gluconeogenesis and in ribosomal function, which is in agreement with the secretory nature of the CP. On the other hand, genes encoding for immune mediators are among those with lower expression in basal conditions. In addition, we found genes known to be relevant during brain development, and not previously identified to be expressed in the CP, including those encoding for various axonal guidance and angiogenesis molecules and for growth factors. Some of these are known to influence the neural stem cell niche in the subventricular zone, highlighting the involvement of the CP as a likely modulator of neurogenesis. Interestingly, our observations confirm that the CP transcriptome is unique, displaying low homology with that of other tissues. Of note, we describe here that the closest similarity is with the transcriptome of the endothelial cells of the blood-brain barrier. Conclusions Based on the data presented here, it will now be possible to further explore the function of particular proteins of the CP secretome in health and in disease.

  7. RNAseq-based transcriptome comparison of Saccharomyces cerevisiae strains isolated from diverse fermentative environments.

    Science.gov (United States)

    Ibáñez, Clara; Pérez-Torrado, Roberto; Morard, Miguel; Toft, Christina; Barrio, Eladio; Querol, Amparo

    2017-09-18

    Transcriptome analyses play a central role in unraveling the complexity of gene expression regulation in Saccharomyces cerevisiae. This species, one of the most important microorganisms for humans given its industrial applications, shows an astonishing degree of genetic and phenotypic variability among different strains adapted to specific environments. In order to gain novel insights into the Saccharomyces cerevisiae biology of strains adapted to different fermentative environments, we analyzed the whole transcriptome of three strains isolated from wine, flor wine or mezcal fermentations. An RNA-seq transcriptome comparison of the different yeasts in the samples obtained during synthetic must fermentation highlighted the differences observed in the genes that encode mannoproteins, and in those involved in aroma, sugar transport, glycerol and alcohol metabolism, which are important under alcoholic fermentation conditions. These differences were also observed in the physiology of the strains after mannoprotein and aroma determinations. This study offers an essential foundation for understanding how gene expression variations contribute to the fermentation differences of the strains adapted to unequal fermentative environments. Such knowledge is crucial to make improvements in fermentation processes and to define targets for the genetic improvement or selection of wine yeasts. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. Comparison of transcriptomic landscapes of bovine embryos using RNA-Seq

    Directory of Open Access Journals (Sweden)

    Khatib Hasan

    2010-12-01

    Full Text Available Abstract Background Advances in sequencing technologies have opened a new era of high throughput investigations. Although RNA-seq has been demonstrated in many organisms, no study has provided a comprehensive investigation of the bovine transcriptome using RNA-seq. Results In this study, we provide a deep survey of the bovine embryonic transcriptomes, the first application of RNA-seq in cattle. Embryos cultured in vitro were used as models to study early embryonic development in cattle. RNA amplified from limited amounts of starting total RNA were sequenced and mapped to the reference genome to obtain digital gene expression at single base resolution. In particular, gene expression estimates from more than 1.6 million unannotated bases in 1785 novel transcribed units were obtained. We compared the transcriptomes of embryos showing distinct developmental statuses and found genes that showed differential overall expression as well as alternative splicing. Conclusion Our study demonstrates the power of RNA-seq and provides further understanding of bovine preimplantation embryonic development at a fine scale.

  9. Analysis of oral microbiota in children with dental caries by PCR-DGGE and barcoded pyrosequencing.

    Science.gov (United States)

    Ling, Zongxin; Kong, Jianming; Jia, Peng; Wei, Chaochun; Wang, Yuezhu; Pan, Zhiwen; Huang, Wujing; Li, Lanjuan; Chen, Hui; Xiang, Charlie

    2010-10-01

    Oral microbiota plays a vital role in maintaining the homeostasis of oral cavity. Dental caries are among the most common oral diseases in children and pathogenic bacteria contribute to the development of the disease. However, the overall structure of bacterial communities in the oral cavity from children with dental caries has not been explored deeply heretofore. We used high-throughput barcoded pyrosequencing and PCR-denaturing gradient gel electrophoresis (DGGE) to examine bacterial diversity of oral microbiota in saliva and supragingival plaques from 60 children aged 3 to 6 years old with and without dental caries from China. The multiplex barcoded pyrosequencing was performed in a single run, with multiple samples tagged uniquely by multiplex identifiers. As PCR-DGGE analysis is a conventional molecular ecological approach, this analysis was also performed on the same samples and the results of both approaches were compared. A total of 186,787 high-quality sequences were obtained for evaluating bacterial diversity and 41,905 unique sequences represented all phylotypes. We found that the oral microbiota in children was far more diverse than previous studies reported, and more than 200 genera belonging to ten phyla were found in the oral cavity. The phylotypes in saliva and supragingival plaques were significantly different and could be divided into two distinct clusters (p oral microbiome analyzed by PCR-DGGE and barcoded pyrosequencing was employed to cross validate the data sets. The genera of Streptococcus, Veillonella, Actinomyces, Granulicatella, Leptotrichia, and Thiomonas in plaques were significantly associated with dental caries (p oral microbiota allowed for a better understanding of oral microecosystem, and these pathogenic populations in plaque provide new insights into the etiology of dental caries and suggest new targets for interventions of the disease.

  10. Unexpected associated microalgal diversity in the lichen Ramalina farinacea is uncovered by pyrosequencing analyses.

    Directory of Open Access Journals (Sweden)

    Patricia Moya

    Full Text Available The current literature reveals that the intrathalline coexistence of multiple microalgal taxa in lichens is more common than previously thought, and additional complexity is supported by the coexistence of bacteria and basidiomycete yeasts in lichen thalli. This replaces the old paradigm that lichen symbiosis occurs between a fungus and a single photobiont. The lichen Ramalina farinacea has proven to be a suitable model to study the multiplicity of microalgae in lichen thalli due to the constant coexistence of Trebouxia sp. TR9 and T. jamesii in long-distance populations. To date, studies involving phycobiont diversity within entire thalli are based on Sanger sequencing, but this method seems to underestimate the diversity. Here, we aim to analyze both the microalgal diversity and its community structure in a single thallus of the lichen R. farinacea by applying a 454 pyrosequencing approach coupled with a careful ad hoc-performed protocol for lichen sample processing prior to DNA extraction. To ascertain the reliability of the pyrosequencing results and the applied bioinformatics pipeline results, the thalli were divided into three sections (apical, middle and basal zones, and a mock community sample was used. The developed methodology allowed 40448 filtered algal reads to be obtained from a single lichen thallus, which encompassed 31 OTUs representative of different microalgae genera. In addition to corroborating the coexistence of the two Trebouxia sp. TR9 and T. jamesii taxa in the same thallus, this study showed a much higher microalgal diversity associated with the lichen. Along the thallus ramifications, we also detected variations in phycobiont distribution that might correlate with different microenvironmental conditions. These results highlight R. farinacea as a suitable material for studying microalgal diversity and further strengthen the concept of lichens as multispecies microecosystems. Future analyses will be relevant to

  11. Pyrosequencing-based comparative genome analysis of the nosocomial pathogen Enterococcus faecium and identification of a large transferable pathogenicity island

    Directory of Open Access Journals (Sweden)

    Bonten Marc JM

    2010-04-01

    Full Text Available Abstract Background The Gram-positive bacterium Enterococcus faecium is an important cause of nosocomial infections in immunocompromized patients. Results We present a pyrosequencing-based comparative genome analysis of seven E. faecium strains that were isolated from various sources. In the genomes of clinical isolates several antibiotic resistance genes were identified, including the vanA transposon that confers resistance to vancomycin in two strains. A functional comparison between E. faecium and the related opportunistic pathogen E. faecalis based on differences in the presence of protein families, revealed divergence in plant carbohydrate metabolic pathways and oxidative stress defense mechanisms. The E. faecium pan-genome was estimated to be essentially unlimited in size, indicating that E. faecium can efficiently acquire and incorporate exogenous DNA in its gene pool. One of the most prominent sources of genomic diversity consists of bacteriophages that have integrated in the genome. The CRISPR-Cas system, which contributes to immunity against bacteriophage infection in prokaryotes, is not present in the sequenced strains. Three sequenced isolates carry the esp gene, which is involved in urinary tract infections and biofilm formation. The esp gene is located on a large pathogenicity island (PAI, which is between 64 and 104 kb in size. Conjugation experiments showed that the entire esp PAI can be transferred horizontally and inserts in a site-specific manner. Conclusions Genes involved in environmental persistence, colonization and virulence can easily be aquired by E. faecium. This will make the development of successful treatment strategies targeted against this organism a challenge for years to come.

  12. Transcriptomic signatures of ash (Fraxinus spp. phloem.

    Directory of Open Access Journals (Sweden)

    Xiaodong Bai

    2011-01-01

    Full Text Available Ash (Fraxinus spp. is a dominant tree species throughout urban and forested landscapes of North America (NA. The rapid invasion of NA by emerald ash borer (Agrilus planipennis, a wood-boring beetle endemic to Eastern Asia, has resulted in the death of millions of ash trees and threatens billions more. Larvae feed primarily on phloem tissue, which girdles and kills the tree. While NA ash species including black (F. nigra, green (F. pennsylvannica and white (F. americana are highly susceptible, the Asian species Manchurian ash (F. mandshurica is resistant to A. planipennis perhaps due to their co-evolutionary history. Little is known about the molecular genetics of ash. Hence, we undertook a functional genomics approach to identify the repertoire of genes expressed in ash phloem.Using 454 pyrosequencing we obtained 58,673 high quality ash sequences from pooled phloem samples of green, white, black, blue and Manchurian ash. Intriguingly, 45% of the deduced proteins were not significantly similar to any sequences in the GenBank non-redundant database. KEGG analysis of the ash sequences revealed a high occurrence of defense related genes. Expression analysis of early regulators potentially involved in plant defense (i.e. transcription factors, calcium dependent protein kinases and a lipoxygenase 3 revealed higher mRNA levels in resistant ash compared to susceptible ash species. Lastly, we predicted a total of 1,272 single nucleotide polymorphisms and 980 microsatellite loci, among which seven microsatellite loci showed polymorphism between different ash species.The current transcriptomic data provide an invaluable resource for understanding the genetic make-up of ash phloem, the target tissue of A. planipennis. These data along with future functional studies could lead to the identification/characterization of defense genes involved in resistance of ash to A. planipennis, and in future ash breeding programs for marker development.

  13. Pyrosequencing-based analysis of the microbiome associated with the horn fly, Haematobia irritans.

    Directory of Open Access Journals (Sweden)

    Azhahianambi Palavesam

    Full Text Available The horn fly, Haematobia irritans, is one of the most economically important pests of cattle. Insecticides have been a major element of horn fly management programs. Growing concerns with insecticide resistance, insecticide residues on farm products, and non-availability of new generation insecticides, are serious issues for the livestock industry. Alternative horn fly control methods offer the promise to decrease the use of insecticides and reduce the amount of insecticide residues on livestock products and give an impetus to the organic livestock farming segment. The horn fly, an obligatory blood feeder, requires the help of microflora to supply additional nutrients and metabolize the blood meal. Recent advancements in DNA sequencing methodologies enable researchers to examine the microflora diversity independent of culture methods. We used the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP method to carry out the classification analysis of bacterial flora in adult female and male horn flies and horn fly eggs. The bTEFAP method identified 16S rDNA sequences in our samples which allowed the identification of various prokaryotic taxa associated with the life stage examined. This is the first comprehensive report of bacterial flora associated with the horn fly using a culture-independent method. Several rumen, environmental, symbiotic and pathogenic bacteria associated with the horn fly were identified and quantified. This is the first report of the presence of Wolbachia in horn flies of USA origin and is the first report of the presence of Rikenella in an obligatory blood feeding insect.

  14. Deep sequencing analysis of the transcriptomes of peanut aerial and subterranean young pods identifies candidate genes related to early embryo abortion.

    Science.gov (United States)

    Chen, Xiaoping; Zhu, Wei; Azam, Sarwar; Li, Heying; Zhu, Fanghe; Li, Haifen; Hong, Yanbin; Liu, Haiyan; Zhang, Erhua; Wu, Hong; Yu, Shanlin; Zhou, Guiyuan; Li, Shaoxiong; Zhong, Ni; Wen, Shijie; Li, Xingyu; Knapp, Steve J; Ozias-Akins, Peggy; Varshney, Rajeev K; Liang, Xuanqiang

    2013-01-01

    The failure of peg penetration into the soil leads to seed abortion in peanut. Knowledge of genes involved in these processes is comparatively deficient. Here, we used RNA-seq to gain insights into transcriptomes of aerial and subterranean pods. More than 2 million transcript reads with an average length of 396 bp were generated from one aerial (AP) and two subterranean (SP1 and SP2) pod libraries using pyrosequencing technology. After assembly, sets of 49 632, 49 952 and 50 494 from a total of 74 974 transcript assembly contigs (TACs) were identified in AP, SP1 and SP2, respectively. A clear linear relationship in the gene expression level was observed between these data sets. In brief, 2194 differentially expressed TACs with a 99.0% true-positive rate were identified, among which 859 and 1068 TACs were up-regulated in aerial and subterranean pods, respectively. Functional analysis showed that putative function based on similarity with proteins catalogued in UniProt and gene ontology term classification could be determined for 59 342 (79.2%) and 42 955 (57.3%) TACs, respectively. A total of 2968 TACs were mapped to 174 KEGG pathways, of which 168 were shared by aerial and subterranean transcriptomes. TACs involved in photosynthesis were significantly up-regulated and enriched in the aerial pod. In addition, two senescence-associated genes were identified as significantly up-regulated in the aerial pod, which potentially contribute to embryo abortion in aerial pods, and in turn, to cessation of swelling. The data set generated in this study provides evidence for some functional genes as robust candidates underlying aerial and subterranean pod development and contributes to an elucidation of the evolutionary implications resulting from fruit development under light and dark conditions. © 2012 The Authors Plant Biotechnology Journal © 2012 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.

  15. The developmental transcriptome of Drosophila melanogaster

    Energy Technology Data Exchange (ETDEWEB)

    University of Connecticut; Graveley, Brenton R.; Brooks, Angela N.; Carlson, Joseph W.; Duff, Michael O.; Landolin, Jane M.; Yang, Li; Artieri, Carlo G.; van Baren, Marijke J.; Boley, Nathan; Booth, Benjamin W.; Brown, James B.; Cherbas, Lucy; Davis, Carrie A.; Dobin, Alex; Li, Renhua; Lin, Wei; Malone, John H.; Mattiuzzo, Nicolas R.; Miller, David; Sturgill, David; Tuch, Brian B.; Zaleski, Chris; Zhang, Dayu; Blanchette, Marco; Dudoit, Sandrine; Eads, Brian; Green, Richard E.; Hammonds, Ann; Jiang, Lichun; Kapranov, Phil; Langton, Laura; Perrimon, Norbert; Sandler, Jeremy E.; Wan, Kenneth H.; Willingham, Aarron; Zhang, Yu; Zou, Yi; Andrews, Justen; Bicke, Peter J.; Brenner, Steven E.; Brent, Michael R.; Cherbas, Peter; Gingeras, Thomas R.; Hoskins, Roger A.; Kaufman, Thomas C.; Oliver, Brian; Celniker, Susan E.

    2010-12-02

    Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development. Drosophila melanogaster is an important non-mammalian model system that has had a critical role in basic biological discoveries, such as identifying chromosomes as the carriers of genetic information and uncovering the role of genes in development. Because it shares a substantial genic content with humans, Drosophila is increasingly used as a translational model for human development, homeostasis and disease. High-quality maps are needed for all functional genomic elements. Previous studies demonstrated that a rich collection of genes is deployed during the life cycle of the fly. Although expression profiling using microarrays has revealed the expression of, 13,000 annotated genes, it is difficult to map splice junctions and individual base modifications generated by RNA editing using such approaches. Single-base resolution is essential to define precisely the elements that comprise the Drosophila transcriptome. Estimates of the number of transcript isoforms are less accurate than estimates of the number of genes

  16. Digital gene expression analysis based on integrated de novo transcriptome assembly of sweet potato [Ipomoea batatas (L. Lam].

    Directory of Open Access Journals (Sweden)

    Xiang Tao

    Full Text Available BACKGROUND: Sweet potato (Ipomoea batatas L. [Lam.] ranks among the top six most important food crops in the world. It is widely grown throughout the world with high and stable yield, strong adaptability, rich nutrient content, and multiple uses. However, little is known about the molecular biology of this important non-model organism due to lack of genomic resources. Hence, studies based on high-throughput sequencing technologies are needed to get a comprehensive and integrated genomic resource and better understanding of gene expression patterns in different tissues and at various developmental stages. METHODOLOGY/PRINCIPAL FINDINGS: Illumina paired-end (PE RNA-Sequencing was performed, and generated 48.7 million of 75 bp PE reads. These reads were de novo assembled into 128,052 transcripts (≥ 100 bp, which correspond to 41.1 million base pairs, by using a combined assembly strategy. Transcripts were annotated by Blast2GO and 51,763 transcripts got BLASTX hits, in which 39,677 transcripts have GO terms and 14,117 have ECs that are associated with 147 KEGG pathways. Furthermore, transcriptome differences of seven tissues were analyzed by using Illumina digital gene expression (DGE tag profiling and numerous differentially and specifically expressed transcripts were identified. Moreover, the expression characteristics of genes involved in viral genomes, starch metabolism and potential stress tolerance and insect resistance were also identified. CONCLUSIONS/SIGNIFICANCE: The combined de novo transcriptome assembly strategy can be applied to other organisms whose reference genomes are not available. The data provided here represent the most comprehensive and integrated genomic resources for cloning and identifying genes of interest in sweet potato. Characterization of sweet potato transcriptome provides an effective tool for better understanding the molecular mechanisms of cellular processes including development of leaves and storage roots

  17. De novo assembly of the perennial ryegrass transcriptome using an RNA-Seq strategy.

    Directory of Open Access Journals (Sweden)

    Jacqueline D Farrell

    Full Text Available Perennial ryegrass is a highly heterozygous outbreeding grass species used for turf and forage production. Heterozygosity can affect de-Bruijn graph assembly making de novo transcriptome assembly of species such as perennial ryegrass challenging. Creating a reference transcriptome from a homozygous perennial ryegrass genotype can circumvent the challenge of heterozygosity. The goals of this study were to perform RNA-sequencing on multiple tissues from a highly inbred genotype to develop a reference transcriptome. This was complemented with RNA-sequencing of a highly heterozygous genotype for SNP calling.De novo transcriptome assembly of the inbred genotype created 185,833 transcripts with an average length of 830 base pairs. Within the inbred reference transcriptome 78,560 predicted open reading frames were found of which 24,434 were predicted as complete. Functional annotation found 50,890 transcripts with a BLASTp hit from the Swiss-Prot non-redundant database, 58,941 transcripts with a Pfam protein domain and 1,151 transcripts encoding putative secreted peptides. To evaluate the reference transcriptome we targeted the high-affinity K+ transporter gene family and found multiple orthologs. Using the longest unique open reading frames as the reference sequence, 64,242 single nucleotide polymorphisms were found. One thousand sixty one open reading frames from the inbred genotype contained heterozygous sites, confirming the high degree of homozygosity.Our study has developed an annotated, comprehensive transcriptome reference for perennial ryegrass that can aid in determining genetic variation, expression analysis, genome annotation, and gene mapping.

  18. Fish-T1K (Transcriptomes of 1,000 Fishes) Project: large-scale transcriptome data for fish evolution studies.

    Science.gov (United States)

    Sun, Ying; Huang, Yu; Li, Xiaofeng; Baldwin, Carole C; Zhou, Zhuocheng; Yan, Zhixiang; Crandall, Keith A; Zhang, Yong; Zhao, Xiaomeng; Wang, Min; Wong, Alex; Fang, Chao; Zhang, Xinhui; Huang, Hai; Lopez, Jose V; Kilfoyle, Kirk; Zhang, Yong; Ortí, Guillermo; Venkatesh, Byrappa; Shi, Qiong

    2016-01-01

    Ray-finned fishes (Actinopterygii) represent more than 50 % of extant vertebrates and are of great evolutionary, ecologic and economic significance, but they are relatively underrepresented in 'omics studies. Increased availability of transcriptome data for these species will allow researchers to better understand changes in gene expression, and to carry out functional analyses. An international project known as the "Transcriptomes of 1,000 Fishes" (Fish-T1K) project has been established to generate RNA-seq transcriptome sequences for 1,000 diverse species of ray-finned fishes. The first phase of this project has produced transcriptomes from more than 180 ray-finned fishes, representing 142 species and covering 51 orders and 109 families. Here we provide an overview of the goals of this project and the work done so far.

  19. Developmental Transcriptome for a Facultatively Eusocial Bee, Megalopta genalis

    OpenAIRE

    Jones, Beryl M.; Wcislo, William T.; Robinson, Gene E.

    2015-01-01

    Transcriptomes provide excellent foundational resources for mechanistic and evolutionary analyses of complex traits. We present a developmental transcriptome for the facultatively eusocial bee Megalopta genalis, which represents a potential transition point in the evolution of eusociality. A de novo transcriptome assembly of Megalopta genalis was generated using paired-end Illumina sequencing and the Trinity assembler. Males and females of all life stages were aligned to this transcriptome fo...

  20. Integration of deep transcriptome and proteome analyses reveals the components of alkaloid metabolism in opium poppy cell cultures

    Directory of Open Access Journals (Sweden)

    Schriemer David C

    2010-11-01

    Full Text Available Abstract Background Papaver somniferum (opium poppy is the source for several pharmaceutical benzylisoquinoline alkaloids including morphine, the codeine and sanguinarine. In response to treatment with a fungal elicitor, the biosynthesis and accumulation of sanguinarine is induced along with other plant defense responses in opium poppy cell cultures. The transcriptional induction of alkaloid metabolism in cultured cells provides an opportunity to identify components of this process via the integration of deep transcriptome and proteome databases generated using next-generation technologies. Results A cDNA library was prepared for opium poppy cell cultures treated with a fungal elicitor for 10 h. Using 454 GS-FLX Titanium pyrosequencing, 427,369 expressed sequence tags (ESTs with an average length of 462 bp were generated. Assembly of these sequences yielded 93,723 unigenes, of which 23,753 were assigned Gene Ontology annotations. Transcripts encoding all known sanguinarine biosynthetic enzymes were identified in the EST database, 5 of which were represented among the 50 most abundant transcripts. Liquid chromatography-tandem mass spectrometry (LC-MS/MS of total protein extracts from cell cultures treated with a fungal elicitor for 50 h facilitated the identification of 1,004 proteins. Proteins were fractionated by one-dimensional SDS-PAGE and digested with trypsin prior to LC-MS/MS analysis. Query of an opium poppy-specific EST database substantially enhanced peptide identification. Eight out of 10 known sanguinarine biosynthetic enzymes and many relevant primary metabolic enzymes were represented in the peptide database. Conclusions The integration of deep transcriptome and proteome analyses provides an effective platform to catalogue the components of secondary metabolism, and to identify genes encoding uncharacterized enzymes. The establishment of corresponding transcript and protein databases generated by next-generation technologies in a

  1. Spatial and Species Variations in Bacterial Communities Associated with Corals from the Red Sea as Revealed by Pyrosequencing

    KAUST Repository

    Lee, O. O.; Yang, J.; Bougouffa, S.; Wang, Y.; Batang, Zenon B.; Tian, R.; Al-Suwailem, A.; Qian, P.-Y.

    2012-01-01

    -pyrosequencing technique to investigate the bacterial communities associated with three stony Scleractinea and two soft Octocorallia corals from three locations in the Red Sea. Our results revealed highly diverse bacterial communities in the Red Sea corals, with more than

  2. Pyrosequencing Analysis of Subgingival Microbiota in Distinct Periodontal Conditions.

    Science.gov (United States)

    Park, O-J; Yi, H; Jeon, J H; Kang, S-S; Koo, K-T; Kum, K-Y; Chun, J; Yun, C-H; Han, S H

    2015-07-01

    Subgingival microorganisms are potentially associated with periodontal diseases. However, changes in the subgingival microbiota during the progress of periodontal diseases are poorly understood. In this study, we analyzed bacterial communities in the subgingival paper point samples from 32 Korean individuals with no sign of disease, gingivitis, or periodontitis using 454 FLX Titanium pyrosequencing. A total of 256,113 reads representing 26 phyla, 433 genera, and 1,016 species were detected. Bacteroidetes, Fusobacteria, Synergistetes, and Spirochaetes were the abundant phyla in periodontitis subjects, whereas Firmicutes and Proteobacteria were identified as the dominant phyla in the gingivitis and healthy subjects, respectively. Although high levels of Porphyromonas, Fusobacterium, Fretibacterium, Rothia, Filifactor, and Treponema genera were observed in the periodontitis subjects, Streptococcus, Capnocytophaga, Leptotrichia, and Haemophilus genera were found at high frequency in the gingivitis subjects. Species including Porphyromonas gingivalis, Fusobacterium nucleatum, and Fretibacterium fastidiosum were significantly increased in periodontitis subjects. On the other hand, Streptococcus pseudopneumoniae, Haemophilus parainfluenzae, and Leptotrichia hongkongensis were preferentially observed in the gingivitis subjects. Intriguingly, the halophile Halomonas hamiltonii was revealed as a predominant species in the healthy subjects. Based on Fast UniFrac analysis, distinctive bacterial clusters were classified for the healthy, gingivitis, and periodontitis state. The current findings might be useful for understanding the pathogenesis, diagnosis, and treatment of periodontal diseases. © International & American Associations for Dental Research 2015.

  3. 454 pyrosequencing to describe microbial eukaryotic community composition, diversity and relative abundance: a test for marine haptophytes.

    Directory of Open Access Journals (Sweden)

    Elianne Egge

    Full Text Available Next generation sequencing of ribosomal DNA is increasingly used to assess the diversity and structure of microbial communities. Here we test the ability of 454 pyrosequencing to detect the number of species present, and assess the relative abundance in terms of cell numbers and biomass of protists in the phylum Haptophyta. We used a mock community consisting of equal number of cells of 11 haptophyte species and compared targeting DNA and RNA/cDNA, and two different V4 SSU rDNA haptophyte-biased primer pairs. Further, we tested four different bioinformatic filtering methods to reduce errors in the resulting sequence dataset. With sequencing depth of 11000-20000 reads and targeting cDNA with Haptophyta specific primers Hap454 we detected all 11 species. A rarefaction analysis of expected number of species recovered as a function of sampling depth suggested that minimum 1400 reads were required here to recover all species in the mock community. Relative read abundance did not correlate to relative cell numbers. Although the species represented with the largest biomass was also proportionally most abundant among the reads, there was generally a weak correlation between proportional read abundance and proportional biomass of the different species, both with DNA and cDNA as template. The 454 sequencing generated considerable spurious diversity, and more with cDNA than DNA as template. With initial filtering based only on match with barcode and primer we observed 100-fold more operational taxonomic units (OTUs at 99% similarity than the number of species present in the mock community. Filtering based on quality scores, or denoising with PyroNoise resulted in ten times more OTU99% than the number of species. Denoising with AmpliconNoise reduced the number of OTU99% to match the number of species present in the mock community. Based on our analyses, we propose a strategy to more accurately depict haptophyte diversity using 454 pyrosequencing.

  4. The Escherichia coli transcriptome linked to growth fitness

    Directory of Open Access Journals (Sweden)

    Bei-Wen Ying

    2016-03-01

    Full Text Available A series of Escherichia coli strains with varied genomic sequences were subjected to high-density microarray analyses to elucidate the fitness-correlated transcriptomes. Fitness, which is commonly evaluated by the growth rate during the exponential phase, is not only determined by the genome but is also linked to growth conditions, e.g., temperature. We previously reported genetic and environmental contributions to E. coli transcriptomes and evolutionary transcriptome changes in thermal adaptation. Here, we describe experimental details on how to prepare microarray samples that truly represent the growth fitness of the E. coli cells. A step-by-step record of sample preparation procedures that correspond to growing cells and transcriptome data sets that are deposited at the GEO database (GSE33212, GSE52770, GSE61739 are also provided for reference. Keywords: Transcriptome, Growth fitness, Escherichia coli, Microarray

  5. Integrative investigation of metabolic and transcriptomic data

    Directory of Open Access Journals (Sweden)

    Önsan Z İlsen

    2006-04-01

    Full Text Available Abstract Background New analysis methods are being developed to integrate data from transcriptome, proteome, interactome, metabolome, and other investigative approaches. At the same time, existing methods are being modified to serve the objectives of systems biology and permit the interpretation of the huge datasets currently being generated by high-throughput methods. Results Transcriptomic and metabolic data from chemostat fermentors were collected with the aim of investigating the relationship between these two data sets. The variation in transcriptome data in response to three physiological or genetic perturbations (medium composition, growth rate, and specific gene deletions was investigated using linear modelling, and open reading-frames (ORFs whose expression changed significantly in response to these perturbations were identified. Assuming that the metabolic profile is a function of the transcriptome profile, expression levels of the different ORFs were used to model the metabolic variables via Partial Least Squares (Projection to Latent Structures – PLS using PLS toolbox in Matlab. Conclusion The experimental design allowed the analyses to discriminate between the effects which the growth medium, dilution rate, and the deletion of specific genes had on the transcriptome and metabolite profiles. Metabolite data were modelled as a function of the transcriptome to determine their congruence. The genes that are involved in central carbon metabolism of yeast cells were found to be the ORFs with the most significant contribution to the model.

  6. High-throughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel Bathymodiolus azoricus

    Directory of Open Access Journals (Sweden)

    Gomes Paula

    2010-10-01

    Full Text Available Abstract Background Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments at the bottom of the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in the deep-sea mussel innate immunity we carried out a high-throughput sequence analysis of freshly collected B. azoricus transcriptome using gills tissues as the primary source of immune transcripts given its strategic role in filtering the surrounding waterborne potentially infectious microorganisms. Additionally, a substantial EST data set was produced and from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent" the first deep-sea vent animal transcriptome database based on the 454 pyrosequencing technology. Results A normalized cDNA library from gills tissue was sequenced in a full 454 GS-FLX run, producing 778,996 sequencing reads. Assembly of the high quality reads resulted in 75,407 contigs of which 3,071 were singletons. A total of 39,425 transcripts were conceptually translated into amino-sequences of which 22,023 matched known proteins in the NCBI non-redundant protein database, 15,839 revealed conserved protein domains through InterPro functional classification and 9,584 were assigned with Gene Ontology terms. Queries conducted within the database enabled the identification of genes putatively involved in immune and inflammatory reactions which had not been previously evidenced in the vent mussel. Their physical counterpart was confirmed by semi-quantitative quantitative Reverse-Transcription-Polymerase Chain Reactions (RT-PCR and their RNA transcription level by quantitative PCR (q

  7. Next generation sequencing based transcriptome analysis of septic-injury responsive genes in the beetle Tribolium castaneum.

    Directory of Open Access Journals (Sweden)

    Boran Altincicek

    Full Text Available Beetles (Coleoptera are the most diverse animal group on earth and interact with numerous symbiotic or pathogenic microbes in their environments. The red flour beetle Tribolium castaneum is a genetically tractable model beetle species and its whole genome sequence has recently been determined. To advance our understanding of the molecular basis of beetle immunity here we analyzed the whole transcriptome of T. castaneum by high-throughput next generation sequencing technology. Here, we demonstrate that the Illumina/Solexa sequencing approach of cDNA samples from T. castaneum including over 9.7 million reads with 72 base pairs (bp length (approximately 700 million bp sequence information with about 30× transcriptome coverage confirms the expression of most predicted genes and enabled subsequent qualitative and quantitative transcriptome analysis. This approach recapitulates our recent quantitative real-time PCR studies of immune-challenged and naïve T. castaneum beetles, validating our approach. Furthermore, this sequencing analysis resulted in the identification of 73 differentially expressed genes upon immune-challenge with statistical significance by comparing expression data to calculated values derived by fitting to generalized linear models. We identified up regulation of diverse immune-related genes (e.g. Toll receptor, serine proteinases, DOPA decarboxylase and thaumatin and of numerous genes encoding proteins with yet unknown functions. Of note, septic-injury resulted also in the elevated expression of genes encoding heat-shock proteins or cytochrome P450s supporting the view that there is crosstalk between immune and stress responses in T. castaneum. The present study provides a first comprehensive overview of septic-injury responsive genes in T. castaneum beetles. Identified genes advance our understanding of T. castaneum specific gene expression alteration upon immune-challenge in particular and may help to understand beetle immunity

  8. Global transcriptome analysis of developing chickpea (Cicer arietinum L.) seeds.

    Science.gov (United States)

    Pradhan, Seema; Bandhiwal, Nitesh; Shah, Niraj; Kant, Chandra; Gaur, Rashmi; Bhatia, Sabhyata

    2014-01-01

    Understanding developmental processes, especially in non-model crop plants, is extremely important in order to unravel unique mechanisms regulating development. Chickpea (C. arietinum L.) seeds are especially valued for their high carbohydrate and protein content. Therefore, in order to elucidate the mechanisms underlying seed development in chickpea, deep sequencing of transcriptomes from four developmental stages was undertaken. In this study, next generation sequencing platform was utilized to sequence the transcriptome of four distinct stages of seed development in chickpea. About 1.3 million reads were generated which were assembled into 51,099 unigenes by merging the de novo and reference assemblies. Functional annotation of the unigenes was carried out using the Uniprot, COG and KEGG databases. RPKM based digital expression analysis revealed specific gene activities at different stages of development which was validated using Real time PCR analysis. More than 90% of the unigenes were found to be expressed in at least one of the four seed tissues. DEGseq was used to determine differentially expressing genes which revealed that only 6.75% of the unigenes were differentially expressed at various stages. Homology based comparison revealed 17.5% of the unigenes to be putatively seed specific. Transcription factors were predicted based on HMM profiles built using TF sequences from five legume plants and analyzed for their differential expression during progression of seed development. Expression analysis of genes involved in biosynthesis of important secondary metabolites suggested that chickpea seeds can serve as a good source of antioxidants. Since transcriptomes are a valuable source of molecular markers like simple sequence repeats (SSRs), about 12,000 SSRs were mined in chickpea seed transcriptome and few of them were validated. In conclusion, this study will serve as a valuable resource for improved chickpea breeding.

  9. Global transcriptome analysis of developing chickpea (Cicer arietinum L. seeds

    Directory of Open Access Journals (Sweden)

    Seema ePradhan

    2014-12-01

    Full Text Available Understanding developmental processes, especially in non-model crop plants, is extremely important in order to unravel unique mechanisms regulating development. Chickpea (C. arietinum L. seeds are especially valued for their high carbohydrate and protein content. Therefore, in order to elucidate the mechanisms underlying seed development in chickpea, deep sequencing of transcriptomes from four developmental stages was undertaken. In this study, next generation sequencing platform was utilised to sequence the transcriptome of four distinct stages of seed development in chickpea. About 1.3 million reads were generated which were assembled into 51,099 unigenes by merging the de novo and reference assemblies. Functional annotation of the unigenes was carried out using the Uniprot, COG and KEGG databases. RPKM based digital expression analysis revealed specific gene activities at different stages of development which was validated using Real time PCR analysis. More than 90% of the unigenes were found to be expressed in at least one of the four seed tissues. DEGseq was used to determine differentially expressing genes which revealed that only 6.75% of the unigenes were differentially expressed at various stages. Homology based comparison revealed 17.5% of the unigenes to be putatively seed specific. Transcription factors were predicted based on HMM profiles built using TF sequences from five legume plants and analysed for their differential expression during progression of seed development. Expression analysis of genes involved in biosynthesis of important secondary metabolites suggested that chickpea seeds can serve as a good source of antioxidants. Since transcriptomes are a valuable source of molecular markers like simple sequence repeats (SSRs, about 12,000 SSRs were mined in chickpea seed transcriptome and few of them were validated. In conclusion, this study will serve as a valuable resource for improved chickpea breeding.

  10. Pyrosequencing Reveals Fungal Communities in the Rhizosphere of Xinjiang Jujube

    Directory of Open Access Journals (Sweden)

    Peng Liu

    2015-01-01

    Full Text Available Fungi are important soil components as both decomposers and plant symbionts and play a major role in ecological and biogeochemical processes. However, little is known about the richness and structure of fungal communities. DNA sequencing technologies allow for the direct estimation of microbial community diversity, avoiding culture-based biases. We therefore used 454 pyrosequencing to investigate the fungal communities in the rhizosphere of Xinjiang jujube. We obtained no less than 40,488 internal transcribed spacer (ITS rDNA reads, the number of each sample was 6943, 6647, 6584, 6550, 6860, and 6904, and we used bioinformatics and multivariate statistics to analyze the results. The index of diversity showed greater richness in the rhizosphere fungal community of a 3-year-old jujube than in that of an 8-year-old jujube. Most operational taxonomic units belonged to Ascomycota, and taxonomic analyses identified Hypocreales as the dominant fungal order. Our results demonstrated that the fungal orders are present in different proportions in different sampling areas. Redundancy analysis (RDA revealed a significant correlation between soil properties and the abundance of fungal phyla. Our results indicated lower fungal diversity in the rhizosphere of Xinjiang jujube than that reported in other studies, and we hope our findings provide a reference for future research.

  11. Transcriptome analysis of yellow horn (Xanthoceras sorbifolia Bunge: a potential oil-rich seed tree for biodiesel in China.

    Directory of Open Access Journals (Sweden)

    Yulin Liu

    Full Text Available BACKGROUND: Yellow horn (Xanthoceras sorbifolia Bunge is an oil-rich seed shrub that grows well in cold, barren environments and has great potential for biodiesel production in China. However, the limited genetic data means that little information about the key genes involved in oil biosynthesis is available, which limits further improvement of this species. In this study, we describe sequencing and de novo transcriptome assembly to produce the first comprehensive and integrated genomic resource for yellow horn and identify the pathways and key genes related to oil accumulation. In addition, potential molecular markers were identified and compiled. METHODOLOGY/PRINCIPAL FINDINGS: Total RNA was isolated from 30 plants from two regions, including buds, leaves, flowers and seeds. Equal quantities of RNA from these tissues were pooled to construct a cDNA library for 454 pyrosequencing. A total of 1,147,624 high-quality reads with total and average lengths of 530.6 Mb and 462 bp, respectively, were generated. These reads were assembled into 51,867 unigenes, corresponding to a total of 36.1 Mb with a mean length, N50 and median of 696, 928 and 570 bp, respectively. Of the unigenes, 17,541 (33.82% were unmatched in any public protein databases. We identified 281 unigenes that may be involved in de novo fatty acid (FA and triacylglycerol (TAG biosynthesis and metabolism. Furthermore, 6,707 SSRs, 16,925 SNPs and 6,201 InDels with high-confidence were also identified in this study. CONCLUSIONS: This transcriptome represents a new functional genomics resource and a foundation for further studies on the metabolic engineering of yellow horn to increase oil content and modify oil composition. The potential molecular markers identified in this study provide a basis for polymorphism analysis of Xanthoceras, and even Sapindaceae; they will also accelerate the process of breeding new varieties with better agronomic characteristics.

  12. Pyrosequencing reveals the effect of mobilizing agents and lignocellulosic substrate amendment on microbial community composition in a real industrial PAH-polluted soil

    Energy Technology Data Exchange (ETDEWEB)

    Lladó, S., E-mail: llado@biomed.cas.cz [Department of Microbiology, University of Barcelona, Diagonal 645, E-08028 Barcelona (Spain); Institute of Microbiology, Academy of Sciences of the Czech Republic, Vídenská 1083, 142 20 Prague 4 (Czech Republic); Covino, S., E-mail: covino@biomed.cas.cz [Institute of Microbiology, Academy of Sciences of the Czech Republic, Vídenská 1083, 142 20 Prague 4 (Czech Republic); Solanas, A.M., E-mail: asolanas@ub.edu [Department of Microbiology, University of Barcelona, Diagonal 645, E-08028 Barcelona (Spain); Petruccioli, M., E-mail: petrucci@unitus.it [Department for Innovation in Biological, Agro-Food and Forest Systems [DIBAF], University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo (Italy); D’annibale, A., E-mail: dannib@unitus.it [Department for Innovation in Biological, Agro-Food and Forest Systems [DIBAF], University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo (Italy); Viñas, M., E-mail: marc.vinas@irta.cat [GIRO Joint Research Unit IRTA-UPC, Institute of Research and Technology Food and Agriculture [IRTA], Torre Marimon, E-08140 Caldes de Montbui (Spain)

    2015-02-11

    Highlights: • Soil microbial community assessment through classical (MPN) and molecular tools (DGGE and pyrosequencing) is provided. • A failure of exogenous white rot fungi to colonize the polluted soil is shown by DGGE and pyrosequencing. • Surfactant Brij 30 hampers 4-ring PAHs degradation due to toxicity over Actinobacteria and Bacteroidetes populations. • A high prevalence of Fusarium and Scedosporium populations is revealed during soil bioremediation. • Cupriavidus, Mycobacterium and Chithinophagaceae are potential HMW–PAH degraders in the soil. - Abstract: Bacterial and fungal biodiversity throughout different biostimulation and bioaugmentation treatments applied to an industrial creosote-polluted soil were analyzed by means of polyphasic approach in order to gain insight into the microbial community structure and dynamics. Pyrosequencing data obtained from initial creosote polluted soil (after a biopiling step) revealed that Alpha and Gammaproteobacteria were the most abundant bacterial groups, whereas Fusarium and Scedosporium were the main fungal genera in the contaminated soil. At the end of 60-days laboratory scale bioremediation assays, pyrosequencing and DGGE data showed that (i) major bacterial community shifts were caused by the type of mobilizing agent added to the soil and, to a lesser extent, by the addition of lignocellulosic substrate; and (ii) the presence of the non-ionic surfactant (Brij 30) hampered the proliferation of Actinobacteria (Mycobacteriaceae) and Bacteroidetes (Chitinophagaceae) and, in the absence of lignocellulosic substrate, also impeded polycyclic aromatic hydrocarbons (PAHs) degradation. The results show the importance of implementing bioremediation experiments combined with microbiome assessment to gain insight on the effect of crucial parameters (e.g. use of additives) over the potential functions of complex microbial communities harbored in polluted soils, essential for bioremediation success.

  13. Pyrosequencing reveals the effect of mobilizing agents and lignocellulosic substrate amendment on microbial community composition in a real industrial PAH-polluted soil

    International Nuclear Information System (INIS)

    Lladó, S.; Covino, S.; Solanas, A.M.; Petruccioli, M.; D’annibale, A.; Viñas, M.

    2015-01-01

    Highlights: • Soil microbial community assessment through classical (MPN) and molecular tools (DGGE and pyrosequencing) is provided. • A failure of exogenous white rot fungi to colonize the polluted soil is shown by DGGE and pyrosequencing. • Surfactant Brij 30 hampers 4-ring PAHs degradation due to toxicity over Actinobacteria and Bacteroidetes populations. • A high prevalence of Fusarium and Scedosporium populations is revealed during soil bioremediation. • Cupriavidus, Mycobacterium and Chithinophagaceae are potential HMW–PAH degraders in the soil. - Abstract: Bacterial and fungal biodiversity throughout different biostimulation and bioaugmentation treatments applied to an industrial creosote-polluted soil were analyzed by means of polyphasic approach in order to gain insight into the microbial community structure and dynamics. Pyrosequencing data obtained from initial creosote polluted soil (after a biopiling step) revealed that Alpha and Gammaproteobacteria were the most abundant bacterial groups, whereas Fusarium and Scedosporium were the main fungal genera in the contaminated soil. At the end of 60-days laboratory scale bioremediation assays, pyrosequencing and DGGE data showed that (i) major bacterial community shifts were caused by the type of mobilizing agent added to the soil and, to a lesser extent, by the addition of lignocellulosic substrate; and (ii) the presence of the non-ionic surfactant (Brij 30) hampered the proliferation of Actinobacteria (Mycobacteriaceae) and Bacteroidetes (Chitinophagaceae) and, in the absence of lignocellulosic substrate, also impeded polycyclic aromatic hydrocarbons (PAHs) degradation. The results show the importance of implementing bioremediation experiments combined with microbiome assessment to gain insight on the effect of crucial parameters (e.g. use of additives) over the potential functions of complex microbial communities harbored in polluted soils, essential for bioremediation success

  14. De novo-based transcriptome profiling of male-sterile and fertile watermelon lines.

    Science.gov (United States)

    Rhee, Sun-Ju; Kwon, Taehyung; Seo, Minseok; Jang, Yoon Jeong; Sim, Tae Yong; Cho, Seoae; Han, Sang-Wook; Lee, Gung Pyo

    2017-01-01

    The whole-genome sequence of watermelon (Citrullus lanatus (Thunb.) Matsum. & Nakai), a valuable horticultural crop worldwide, was released in 2013. Here, we compared a de novo-based approach (DBA) to a reference-based approach (RBA) using RNA-seq data, to aid in efforts to improve the annotation of the watermelon reference genome and to obtain biological insight into male-sterility in watermelon. We applied these techniques to available data from two watermelon lines: the male-sterile line DAH3615-MS and the male-fertile line DAH3615. Using DBA, we newly annotated 855 watermelon transcripts, and found gene functional clusters predicted to be related to stimulus responses, nucleic acid binding, transmembrane transport, homeostasis, and Golgi/vesicles. Among the DBA-annotated transcripts, 138 de novo-exclusive differentially-expressed genes (DEDEGs) related to male sterility were detected. Out of 33 randomly selected newly annotated transcripts and DEDEGs, 32 were validated by RT-qPCR. This study demonstrates the usefulness and reliability of the de novo transcriptome assembly in watermelon, and provides new insights for researchers exploring transcriptional blueprints with regard to the male sterility.

  15. Detailed transcriptome description of the neglected cestode Taenia multiceps.

    Science.gov (United States)

    Wu, Xuhang; Fu, Yan; Yang, Deying; Zhang, Runhui; Zheng, Wanpeng; Nie, Huaming; Xie, Yue; Yan, Ning; Hao, Guiying; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yang, Guangyou

    2012-01-01

    The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS) of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. We obtained a total of 31,282 unigenes (mean length 920 bp) using Illumina paired-end sequencing technology and a new Trinity de novo assembler without a referenced genome. Individual transcription molecules were determined by sequence-based annotations and/or domain-based annotations against public databases (Nr, UniprotKB/Swiss-Prot, COG, KEGG, UniProtKB/TrEMBL, InterPro and Pfam). We identified 26,110 (83.47%) unigenes and inferred 20,896 (66.8%) coding sequences (CDS). Further comparative transcripts analysis with other cestodes (Taenia pisiformis, Taenia solium, Echincoccus granulosus and Echincoccus multilocularis) and intestinal parasites (Trichinella spiralis, Ancylostoma caninum and Ascaris suum) showed that 5,100 common genes were shared among three Taenia tapeworms, 261 conserved genes were detected among five Taeniidae cestodes, and 109 common genes were found in four zoonotic intestinal parasites. Some of the common genes were genes required for parasite survival, involved in parasite-host interactions. In addition, we amplified two full-length CDS of unigenes from the common genes using RT-PCR. This study provides an extensive transcriptome of the adult stage of T. multiceps, and demonstrates that comparative transcriptomic investigations deserve to be further studied. This transcriptome dataset forms a substantial public information platform to achieve a fundamental understanding of the biology of T. multiceps, and helps in the identification of drug targets and

  16. Detailed transcriptome description of the neglected cestode Taenia multiceps.

    Directory of Open Access Journals (Sweden)

    Xuhang Wu

    Full Text Available BACKGROUND: The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. METHODOLOGY/PRINCIPAL FINDINGS: We obtained a total of 31,282 unigenes (mean length 920 bp using Illumina paired-end sequencing technology and a new Trinity de novo assembler without a referenced genome. Individual transcription molecules were determined by sequence-based annotations and/or domain-based annotations against public databases (Nr, UniprotKB/Swiss-Prot, COG, KEGG, UniProtKB/TrEMBL, InterPro and Pfam. We identified 26,110 (83.47% unigenes and inferred 20,896 (66.8% coding sequences (CDS. Further comparative transcripts analysis with other cestodes (Taenia pisiformis, Taenia solium, Echincoccus granulosus and Echincoccus multilocularis and intestinal parasites (Trichinella spiralis, Ancylostoma caninum and Ascaris suum showed that 5,100 common genes were shared among three Taenia tapeworms, 261 conserved genes were detected among five Taeniidae cestodes, and 109 common genes were found in four zoonotic intestinal parasites. Some of the common genes were genes required for parasite survival, involved in parasite-host interactions. In addition, we amplified two full-length CDS of unigenes from the common genes using RT-PCR. CONCLUSIONS/SIGNIFICANCE: This study provides an extensive transcriptome of the adult stage of T. multiceps, and demonstrates that comparative transcriptomic investigations deserve to be further studied. This transcriptome dataset forms a substantial public information platform to achieve a fundamental understanding of

  17. De novo Genome Assembly and Single Nucleotide Variations for Soybean Mosaic Virus Using Soybean Seed Transcriptome Data

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2017-10-01

    Full Text Available Soybean is the most important legume crop in the world. Several diseases in soybean lead to serious yield losses in major soybean-producing countries. Moreover, soybean can be infected by diverse viruses. Recently, we carried out a large-scale screening to identify viruses infecting soybean using available soybean transcriptome data. Of the screened transcriptomes, a soybean transcriptome for soybean seed development analysis contains several virus-associated sequences. In this study, we identified five viruses, including soybean mosaic virus (SMV, infecting soybean by de novo transcriptome assembly followed by blast search. We assembled a nearly complete consensus genome sequence of SMV China using transcriptome data. Based on phylogenetic analysis, the consensus genome sequence of SMV China was closely related to SMV isolates from South Korea. We examined single nucleotide variations (SNVs for SMVs in the soybean seed transcriptome revealing 780 SNVs, which were evenly distributed on the SMV genome. Four SNVs, C-U, U-C, A-G, and G-A, were frequently identified. This result demonstrated the quasispecies variation of the SMV genome. Taken together, this study carried out bioinformatics analyses to identify viruses using soybean transcriptome data. In addition, we demonstrated the application of soybean transcriptome data for virus genome assembly and SNV analysis.

  18. ST Spot Detector: a web-based application for automatic spot and tissue detection for Spatial Transcriptomics image data sets.

    Science.gov (United States)

    Wong, Kim; Fernández Navarro, José; Bergenstråhle, Ludvig; Ståhl, Patrik L; Lundeberg, Joakim

    2018-01-17

    Spatial transcriptomics (ST) is a method which combines high resolution tissue imaging with high throughput transcriptome sequencing data. This data must be aligned with the images for correct visualisation, a process that involves several manual steps. Here we present ST Spot Detector, a web tool that automates and facilitates this alignment through a user friendly interface. Open source under the MIT license, available from https://github.com/SpatialTranscriptomicsResearch/st_spot_detector. jose.fernandez.navarro@scilifelab.se. Supplementary data are available at Bioinformatics online. © The Author(s) 2018. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  19. De Novo Adult Transcriptomes of Two European Brittle Stars: Spotlight on Opsin-Based Photoreception.

    Directory of Open Access Journals (Sweden)

    Jérôme Delroisse

    Full Text Available Next generation sequencing (NGS technology allows to obtain a deeper and more complete view of transcriptomes. For non-model or emerging model marine organisms, NGS technologies offer a great opportunity for rapid access to genetic information. In this study, paired-end Illumina HiSeqTM technology has been employed to analyse transcriptomes from the arm tissues of two European brittle star species, Amphiura filiformis and Ophiopsila aranea. About 48 million Illumina reads were generated and 136,387 total unigenes were predicted from A. filiformis arm tissues. For O. aranea arm tissues, about 47 million reads were generated and 123,324 total unigenes were obtained. Twenty-four percent of the total unigenes from A. filiformis show significant matches with sequences present in reference online databases, whereas, for O. aranea, this percentage amounts to 23%. In both species, around 50% of the predicted annotated unigenes were significantly similar to transcripts from the purple sea urchin, the closest species to date that has undergone complete genome sequencing and annotation. GO, COG and KEGG analyses were performed on predicted brittle star unigenes. We focused our analyses on the phototransduction actors involved in light perception. Firstly, two new echinoderm opsins were identified in O. aranea: one rhabdomeric opsin (homologous to vertebrate melanopsin and one RGR opsin. The RGR-opsin is supposed to be involved in retinal regeneration while the r-opsin is suspected to play a role in visual-like behaviour. Secondly, potential phototransduction actors were identified in both transcriptomes using the fly (rhabdomeric and mammal (ciliary classical phototransduction pathways as references. Finally, the sensitivity of O.aranea to monochromatic light was investigated to complement data available for A. filiformis. The presence of microlens-like structures at the surface of dorsal arm plate of O. aranea could potentially explain phototactic

  20. Transcriptome complexity in a genome-reduced bacterium

    DEFF Research Database (Denmark)

    Güell, Marc; van Noort, Vera; Yus, Eva

    2009-01-01

    To study basic principles of transcriptome organization in bacteria, we analyzed one of the smallest self-replicating organisms, Mycoplasma pneumoniae. We combined strand-specific tiling arrays, complemented by transcriptome sequencing, with more than 252 spotted arrays. We detected 117 previousl...

  1. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants.

    Science.gov (United States)

    Li, Xinguo; Wu, Harry X; Southerton, Simon G

    2010-06-21

    Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution.

  2. Utility of RNA Sequencing for Analysis of Maize Reproductive Transcriptomes

    Directory of Open Access Journals (Sweden)

    Rebecca M. Davidson

    2011-11-01

    Full Text Available Transcriptome sequencing is a powerful method for studying global expression patterns in large, complex genomes. Evaluation of sequence-based expression profiles during reproductive development would provide functional annotation to genes underlying agronomic traits. We generated transcriptome profiles for 12 diverse maize ( L. reproductive tissues representing male, female, developing seed, and leaf tissues using high throughput transcriptome sequencing. Overall, ∼80% of annotated genes were expressed. Comparative analysis between sequence and hybridization-based methods demonstrated the utility of ribonucleic acid sequencing (RNA-seq for expression determination and differentiation of paralagous genes (∼85% of maize genes. Analysis of 4975 gene families across reproductive tissues revealed expression divergence is proportional to family size. In all pairwise comparisons between tissues, 7 (pre- vs. postemergence cobs to 48% (pollen vs. ovule of genes were differentially expressed. Genes with expression restricted to a single tissue within this study were identified with the highest numbers observed in leaves, endosperm, and pollen. Coexpression network analysis identified 17 gene modules with complex and shared expression patterns containing many previously described maize genes. The data and analyses in this study provide valuable tools through improved gene annotation, gene family characterization, and a core set of candidate genes to further characterize maize reproductive development and improve grain yield potential.

  3. 454-Pyrosequencing Analysis of Bacterial Communities from Autotrophic Nitrogen Removal Bioreactors Utilizing Universal Primers: Effect of Annealing Temperature.

    Science.gov (United States)

    Gonzalez-Martinez, Alejandro; Rodriguez-Sanchez, Alejandro; Rodelas, Belén; Abbas, Ben A; Martinez-Toledo, Maria Victoria; van Loosdrecht, Mark C M; Osorio, F; Gonzalez-Lopez, Jesus

    2015-01-01

    Identification of anaerobic ammonium oxidizing (anammox) bacteria by molecular tools aimed at the evaluation of bacterial diversity in autotrophic nitrogen removal systems is limited by the difficulty to design universal primers for the Bacteria domain able to amplify the anammox 16S rRNA genes. A metagenomic analysis (pyrosequencing) of total bacterial diversity including anammox population in five autotrophic nitrogen removal technologies, two bench-scale models (MBR and Low Temperature CANON) and three full-scale bioreactors (anammox, CANON, and DEMON), was successfully carried out by optimization of primer selection and PCR conditions (annealing temperature). The universal primer 530F was identified as the best candidate for total bacteria and anammox bacteria diversity coverage. Salt-adjusted optimum annealing temperature of primer 530F was calculated (47°C) and hence a range of annealing temperatures of 44-49°C was tested. Pyrosequencing data showed that annealing temperature of 45°C yielded the best results in terms of species richness and diversity for all bioreactors analyzed.

  4. Bacterial communities potentially involved in iron-cycling in Baltic Sea and North Sea sediments revealed by pyrosequencing

    DEFF Research Database (Denmark)

    Reyes, Carlen; Dellwig, Olaf; Dähnke, K.

    2016-01-01

    To gain insight into the bacterial communities involved in iron-(Fe) cycling under marine conditions, we analysed sediments with Fe-contents (0.5-1.5 wt %) from the suboxic zone at a marine site in the Skagerrak (SK) and a brackish site in the Bothnian Bay (BB) using 16S rRNA gene pyrosequencing....

  5. Using transcriptomic profiles in the diatom Phaeodactylum tricornutum to identify and prioritize stressors

    International Nuclear Information System (INIS)

    Osborn, Hannah L.; Hook, Sharon E.

    2013-01-01

    Highlights: •Exposure to stressors with different modes of action generated unique gene expression profiles in the diatom Phaeodactylum tricornutum. •The gene expression profile generated by a multiple stressor exposure reflected exposure to individual components of the mixture. •Quantitative PCR assays were generated that could be used to identify exposure to individual stressors. -- Abstract: The transcriptomic profile of the marine diatom, Phaeodactylum tricornutum, exposed to several ecologically relevant stressors, was used to develop toxicity identification evaluation (TIE)-like gene expression assays. Algal growth inhibition was measured by flow cytometry to determine exposure concentrations that elicited a sublethal toxic response. P. tricornutum was exposed to concentrations of copper (2 μg L −1 ), cadmium (5 μg L −1 ), silver (20 μg L −1 ), simazine (75 μg L −1 ), the water accommodated fraction (WAF) of weathered crude oil (5 mg L −1 ), 50 μg L −1 ammonia, a decreased salinity treatment (15‰), and a mixture exposure of ammonia, decreased salinity and cadmium (10 μg L −1 ). Analysis of the gene expression via microarray indicated that unique transcriptomic signals were generated for each of the individual treatments. Transcriptomic profiles of ammonia and the mixture treatment overlapped substantially. Photosynthesis related transcripts were altered in the simazine (herbicide) treatment. A transcript involved in degrading hydrocarbons, dioxygenase, had increased abundance after crude oil exposure. Overall, transcriptomic responses in the different treatments were associated with stress responses, membrane transport, transcription and translation and could be linked to contaminant mode of action. The transcriptomic profiles were used to design real-time (quantitative) polymerase chain reaction (qPCR) assays that would link changes in transcript abundance to a particular stressor in a TIE-based approach. At least one transcript

  6. Using transcriptomic profiles in the diatom Phaeodactylum tricornutum to identify and prioritize stressors

    Energy Technology Data Exchange (ETDEWEB)

    Osborn, Hannah L., E-mail: Hannah.Osborn@csiro.au; Hook, Sharon E., E-mail: Sharon.Hook@csiro.au

    2013-08-15

    Highlights: •Exposure to stressors with different modes of action generated unique gene expression profiles in the diatom Phaeodactylum tricornutum. •The gene expression profile generated by a multiple stressor exposure reflected exposure to individual components of the mixture. •Quantitative PCR assays were generated that could be used to identify exposure to individual stressors. -- Abstract: The transcriptomic profile of the marine diatom, Phaeodactylum tricornutum, exposed to several ecologically relevant stressors, was used to develop toxicity identification evaluation (TIE)-like gene expression assays. Algal growth inhibition was measured by flow cytometry to determine exposure concentrations that elicited a sublethal toxic response. P. tricornutum was exposed to concentrations of copper (2 μg L{sup −1}), cadmium (5 μg L{sup −1}), silver (20 μg L{sup −1}), simazine (75 μg L{sup −1}), the water accommodated fraction (WAF) of weathered crude oil (5 mg L{sup −1}), 50 μg L{sup −1} ammonia, a decreased salinity treatment (15‰), and a mixture exposure of ammonia, decreased salinity and cadmium (10 μg L{sup −1}). Analysis of the gene expression via microarray indicated that unique transcriptomic signals were generated for each of the individual treatments. Transcriptomic profiles of ammonia and the mixture treatment overlapped substantially. Photosynthesis related transcripts were altered in the simazine (herbicide) treatment. A transcript involved in degrading hydrocarbons, dioxygenase, had increased abundance after crude oil exposure. Overall, transcriptomic responses in the different treatments were associated with stress responses, membrane transport, transcription and translation and could be linked to contaminant mode of action. The transcriptomic profiles were used to design real-time (quantitative) polymerase chain reaction (qPCR) assays that would link changes in transcript abundance to a particular stressor in a TIE-based

  7. 454 pyrosequencing analyses of bacterial and archaeal richness in 21 full-scale biogas digesters.

    Science.gov (United States)

    Sundberg, Carina; Al-Soud, Waleed A; Larsson, Madeleine; Alm, Erik; Yekta, Sepehr S; Svensson, Bo H; Sørensen, Søren J; Karlsson, Anna

    2013-09-01

    The microbial community of 21 full-scale biogas reactors was examined using 454 pyrosequencing of 16S rRNA gene sequences. These reactors included seven (six mesophilic and one thermophilic) digesting sewage sludge (SS) and 14 (ten mesophilic and four thermophilic) codigesting (CD) various combinations of wastes from slaughterhouses, restaurants, households, etc. The pyrosequencing generated more than 160,000 sequences representing 11 phyla, 23 classes, and 95 genera of Bacteria and Archaea. The bacterial community was always both more abundant and more diverse than the archaeal community. At the phylum level, the foremost populations in the SS reactors included Actinobacteria, Proteobacteria, Chloroflexi, Spirochetes, and Euryarchaeota, while Firmicutes was the most prevalent in the CD reactors. The main bacterial class in all reactors was Clostridia. Acetoclastic methanogens were detected in the SS, but not in the CD reactors. Their absence suggests that methane formation from acetate takes place mainly via syntrophic acetate oxidation in the CD reactors. A principal component analysis of the communities at genus level revealed three clusters: SS reactors, mesophilic CD reactors (including one thermophilic CD and one SS), and thermophilic CD reactors. Thus, the microbial composition was mainly governed by the substrate differences and the process temperature. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  8. Pyrosequencing assessment of rhizosphere fungal communities from a soybean field.

    Science.gov (United States)

    Sugiyama, Akifumi; Ueda, Yoshikatsu; Takase, Hisabumi; Yazaki, Kazufumi

    2014-10-01

    Soil fungal communities play essential roles in soil ecosystems, affecting plant growth and health. Rhizosphere bacterial communities have been shown to undergo dynamic changes during plant growth. This study utilized 454 pyrosequencing to analyze rhizosphere fungal communities during soybean growth. Members of the Ascomycota and Basiodiomycota dominated in all soils. There were no statistically significant changes at the phylum level among growth stages or between bulk and rhizosphere soils. In contrast, the relative abundance of small numbers of operational taxonomic units, 4 during growth and 28 between bulk and rhizosphere soils, differed significantly. Clustering analysis revealed that rhizosphere fungal communities were different from bulk fungal communities during growth stages of soybeans. Taken together, these results suggest that in contrast to rhizosphere bacterial communities, most constituents of rhizosphere fungal communities remained stable during soybean growth.

  9. Histological chorioamnionitis shapes the neonatal transcriptomic immune response.

    Science.gov (United States)

    Weitkamp, Jörn-Hendrik; Guthrie, Scott O; Wong, Hector R; Moldawer, Lyle L; Baker, Henry V; Wynn, James L

    2016-07-01

    Histologic chorioamnionitis (HCA) is commonly associated with preterm birth and deleterious post-natal outcomes including sepsis and necrotizing enterocolitis. Transcriptomic analysis has been used to uncover gene signatures that permit diagnosis and prognostication, show new therapeutic targets, and reveal mechanisms that underlie differential outcomes with other complex disease states in neonates such as sepsis. To define the transcriptomic and inflammatory protein response in peripheral blood among infants with exposure to histologic chorioamnionitis. Prospective, observational study. Uninfected preterm neonates retrospectively categorized based on placental pathology with no HCA exposure (n=18) or HCA exposure (n=15). We measured the transcriptomic and inflammatory mediator response in prospectively collected whole blood. We found 488 significant (p<0.001), differentially expressed genes in whole blood samples among uninfected neonates with HCA exposure that collectively represented activated innate and adaptive immune cellular pathways and revealed a potential regulatory role for the pleotropic microRNA molecule miR-155. Differentially secreted plasma cytokines in patients with HCA exposure compared to patients without HCA included MCP-1, MPO, and MMP-9 (p<0.05). Exposure to HCA distinctively activates the neonatal immune system in utero with potentially long-term health consequences. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  10. Determining the optimal number of independent components for reproducible transcriptomic data analysis.

    Science.gov (United States)

    Kairov, Ulykbek; Cantini, Laura; Greco, Alessandro; Molkenov, Askhat; Czerwinska, Urszula; Barillot, Emmanuel; Zinovyev, Andrei

    2017-09-11

    Independent Component Analysis (ICA) is a method that models gene expression data as an action of a set of statistically independent hidden factors. The output of ICA depends on a fundamental parameter: the number of components (factors) to compute. The optimal choice of this parameter, related to determining the effective data dimension, remains an open question in the application of blind source separation techniques to transcriptomic data. Here we address the question of optimizing the number of statistically independent components in the analysis of transcriptomic data for reproducibility of the components in multiple runs of ICA (within the same or within varying effective dimensions) and in multiple independent datasets. To this end, we introduce ranking of independent components based on their stability in multiple ICA computation runs and define a distinguished number of components (Most Stable Transcriptome Dimension, MSTD) corresponding to the point of the qualitative change of the stability profile. Based on a large body of data, we demonstrate that a sufficient number of dimensions is required for biological interpretability of the ICA decomposition and that the most stable components with ranks below MSTD have more chances to be reproduced in independent studies compared to the less stable ones. At the same time, we show that a transcriptomics dataset can be reduced to a relatively high number of dimensions without losing the interpretability of ICA, even though higher dimensions give rise to components driven by small gene sets. We suggest a protocol of ICA application to transcriptomics data with a possibility of prioritizing components with respect to their reproducibility that strengthens the biological interpretation. Computing too few components (much less than MSTD) is not optimal for interpretability of the results. The components ranked within MSTD range have more chances to be reproduced in independent studies.

  11. Transcriptomics-based identification of developmental toxicants through their interference with cardiomyocyte differentiation of embryonic stem cells

    International Nuclear Information System (INIS)

    Dartel, Dorien A.M. van; Pennings, Jeroen L.A.; Schooten, Frederik J. van; Piersma, Aldert H.

    2010-01-01

    The embryonic stem cell test (EST) predicts developmental toxicity based on the inhibition of cardiomyocyte differentiation of embryonic stem cells (ESC). The subjective endpoint, the long culture duration together with the undefined applicability domain and related predictivity need further improvement to facilitate implementation of the EST into regulatory strategies. These aspects may be improved by studying gene expression changes in the ESC differentiation cultures and their modulation by compound exposure using transcriptomics. Here, we tested the developmental toxicants monobutyl phthalate and 6-aminonicotinamide. ESC were allowed to differentiated, and cardiomyocyte differentiation was assessed after 10 days of culture. RNA of solvent controls was collected after 0, 24, 48, 72 and 96 h of exposure, and RNA of developmental-toxicant-exposed cultures was collected after 24 and 96 h. Samples were hybridized to DNA microarrays, and 1355 genes were found differentially expressed among the unexposed experimental groups. These regulated genes were involved in differentiation-related processes, and Principal Component Analysis (PCA) based on these genes showed that the unexposed experimental groups appeared in chronological order in the PCA, which can therefore be regarded as a continuous representation of the differentiation track. The developmental-toxicant-exposed cultures appeared to deviate significantly from this differentiation track, which confirms the compound-modulating effects on the differentiation process. The incorporation of transcriptomics in the EST is expected to provide a more informative and improved endpoint in the EST as compared with morphology, allowing early detection of differentiation modulation. Furthermore, this approach may improve the definition of the applicability domain and predictivity of the EST.

  12. Assessment of bacterial diversity in the cattle tick Rhipicephalus (Boophilus microplus through tag-encoded pyrosequencing

    Directory of Open Access Journals (Sweden)

    Bendele Kylie G

    2011-01-01

    Full Text Available Abstract Background Ticks are regarded as the most relevant vectors of disease-causing pathogens in domestic and wild animals. The cattle tick, Rhipicephalus (Boophilus microplus, hinders livestock production in tropical and subtropical parts of the world where it is endemic. Tick microbiomes remain largely unexplored. The objective of this study was to explore the R. microplus microbiome by applying the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP technique to characterize its bacterial diversity. Pyrosequencing was performed on adult males and females, eggs, and gut and ovary tissues from adult females derived from samples of R. microplus collected during outbreaks in southern Texas. Results Raw data from bTEFAP were screened and trimmed based upon quality scores and binned into individual sample collections. Bacteria identified to the species level include Staphylococcus aureus, Staphylococcus chromogenes, Streptococcus dysgalactiae, Staphylococcus sciuri, Serratia marcescens, Corynebacterium glutamicum, and Finegoldia magna. One hundred twenty-one bacterial genera were detected in all the life stages and tissues sampled. The total number of genera identified by tick sample comprised: 53 in adult males, 61 in adult females, 11 in gut tissue, 7 in ovarian tissue, and 54 in the eggs. Notable genera detected in the cattle tick include Wolbachia, Coxiella, and Borrelia. The molecular approach applied in this study allowed us to assess the relative abundance of the microbiota associated with R. microplus. Conclusions This report represents the first survey of the bacteriome in the cattle tick using non-culture based molecular approaches. Comparisons of our results with previous bacterial surveys provide an indication of geographic variation in the assemblages of bacteria associated with R. microplus. Additional reports on the identification of new bacterial species maintained in nature by R. microplus that may be

  13. Assessment of bacterial diversity in the cattle tick Rhipicephalus (Boophilus) microplus through tag-encoded pyrosequencing

    Science.gov (United States)

    2011-01-01

    Background Ticks are regarded as the most relevant vectors of disease-causing pathogens in domestic and wild animals. The cattle tick, Rhipicephalus (Boophilus) microplus, hinders livestock production in tropical and subtropical parts of the world where it is endemic. Tick microbiomes remain largely unexplored. The objective of this study was to explore the R. microplus microbiome by applying the bacterial 16S tag-encoded FLX-titanium amplicon pyrosequencing (bTEFAP) technique to characterize its bacterial diversity. Pyrosequencing was performed on adult males and females, eggs, and gut and ovary tissues from adult females derived from samples of R. microplus collected during outbreaks in southern Texas. Results Raw data from bTEFAP were screened and trimmed based upon quality scores and binned into individual sample collections. Bacteria identified to the species level include Staphylococcus aureus, Staphylococcus chromogenes, Streptococcus dysgalactiae, Staphylococcus sciuri, Serratia marcescens, Corynebacterium glutamicum, and Finegoldia magna. One hundred twenty-one bacterial genera were detected in all the life stages and tissues sampled. The total number of genera identified by tick sample comprised: 53 in adult males, 61 in adult females, 11 in gut tissue, 7 in ovarian tissue, and 54 in the eggs. Notable genera detected in the cattle tick include Wolbachia, Coxiella, and Borrelia. The molecular approach applied in this study allowed us to assess the relative abundance of the microbiota associated with R. microplus. Conclusions This report represents the first survey of the bacteriome in the cattle tick using non-culture based molecular approaches. Comparisons of our results with previous bacterial surveys provide an indication of geographic variation in the assemblages of bacteria associated with R. microplus. Additional reports on the identification of new bacterial species maintained in nature by R. microplus that may be pathogenic to its vertebrate hosts

  14. Evaluation of persistence of resistant variants with ultra-deep pyrosequencing in chronic hepatitis C patients treated with telaprevir.

    Directory of Open Access Journals (Sweden)

    Xiomara V Thomas

    Full Text Available BACKGROUND & AIMS: Telaprevir, a hepatitis C virus NS3/4A protease inhibitor has significantly improved sustained viral response rates when given in combination with pegylated interferon alfa-2a and ribavirin, compared with current standard of care in hepatitis C virus genotype 1 infected patients. In patients with a failed sustained response, the emergence of drug-resistant variants during treatment has been reported. It is unclear to what extent these variants persist in untreated patients. The aim of this study was to assess using ultra-deep pyrosequencing, whether after 4 years follow-up, the frequency of resistant variants is increased compared to pre-treatment frequencies following 14 days of telaprevir treatment. METHODS: Fifteen patients from 2 previous telaprevir phase 1 clinical studies (VX04-950-101 and VX05-950-103 were included. These patients all received telaprevir monotherapy for 14 days, and 2 patients subsequently received standard of care. Variants at previously well-characterized NS3 protease positions V36, T54, R155 and A156 were assessed at baseline and after a follow-up of 4±1.2 years by ultra-deep pyrosequencing. The prevalence of resistant variants at follow-up was compared to baseline. RESULTS: Resistance associated mutations were detectable at low frequency at baseline. In general, prevalence of resistance mutations at follow-up was not increased compared to baseline. Only one patient had a small, but statistically significant, increase in the number of V36M and T54S variants 4 years after telaprevir-dosing. CONCLUSION: In patients treated for 14 days with telaprevir monotherapy, ultra-deep pyrosequencing indicates that long-term persistence of resistant variants is rare.

  15. Bacterial communities in the gut and reproductive organs of Bactrocera minax (Diptera: Tephritidae) based on 454 pyrosequencing.

    Science.gov (United States)

    Wang, Ailin; Yao, Zhichao; Zheng, Weiwei; Zhang, Hongyu

    2014-01-01

    The citrus fruit fly Bactrocera minax is associated with diverse bacterial communities. We used a 454 pyrosequencing technology to study in depth the microbial communities associated with gut and reproductive organs of Bactrocera minax. Our dataset consisted of 100,749 reads with an average length of 400 bp. The saturated rarefaction curves and species richness indices indicate that the sampling was comprehensive. We found highly diverse bacterial communities, with individual sample containing approximately 361 microbial operational taxonomic units (OTUs). A total of 17 bacterial phyla were obtained from the flies. A phylogenetic analysis of 16S rDNA revealed that Proteobacteria was dominant in all samples (75%-95%). Actinobacteria and Firmicutes were also commonly found in the total clones. Klebsiella, Citrobacter, Enterobacter, and Serratia were the major genera. However, bacterial diversity (Chao1, Shannon and Simpson indices) and community structure (PCA analysis) varied across samples. Female ovary has the most diverse bacteria, followed by male testis, and the bacteria diversity of reproductive organs is richer than that of the gut. The observed variation can be caused by sex and tissue, possibly to meet the host's physiological demands.

  16. Bacterial communities in the gut and reproductive organs of Bactrocera minax (Diptera: Tephritidae based on 454 pyrosequencing.

    Directory of Open Access Journals (Sweden)

    Ailin Wang

    Full Text Available The citrus fruit fly Bactrocera minax is associated with diverse bacterial communities. We used a 454 pyrosequencing technology to study in depth the microbial communities associated with gut and reproductive organs of Bactrocera minax. Our dataset consisted of 100,749 reads with an average length of 400 bp. The saturated rarefaction curves and species richness indices indicate that the sampling was comprehensive. We found highly diverse bacterial communities, with individual sample containing approximately 361 microbial operational taxonomic units (OTUs. A total of 17 bacterial phyla were obtained from the flies. A phylogenetic analysis of 16S rDNA revealed that Proteobacteria was dominant in all samples (75%-95%. Actinobacteria and Firmicutes were also commonly found in the total clones. Klebsiella, Citrobacter, Enterobacter, and Serratia were the major genera. However, bacterial diversity (Chao1, Shannon and Simpson indices and community structure (PCA analysis varied across samples. Female ovary has the most diverse bacteria, followed by male testis, and the bacteria diversity of reproductive organs is richer than that of the gut. The observed variation can be caused by sex and tissue, possibly to meet the host's physiological demands.

  17. Transcriptome characterization and sequencing-based identification of salt-responsive genes in Millettia pinnata, a semi-mangrove plant.

    Science.gov (United States)

    Huang, Jianzi; Lu, Xiang; Yan, Hao; Chen, Shouyi; Zhang, Wanke; Huang, Rongfeng; Zheng, Yizhi

    2012-04-01

    Semi-mangroves form a group of transitional species between glycophytes and halophytes, and hold unique potential for learning molecular mechanisms underlying plant salt tolerance. Millettia pinnata is a semi-mangrove plant that can survive a wide range of saline conditions in the absence of specialized morphological and physiological traits. By employing the Illumina sequencing platform, we generated ~192 million short reads from four cDNA libraries of M. pinnata and processed them into 108,598 unisequences with a high depth of coverage. The mean length and total length of these unisequences were 606 bp and 65.8 Mb, respectively. A total of 54,596 (50.3%) unisequences were assigned Nr annotations. Functional classification revealed the involvement of unisequences in various biological processes related to metabolism and environmental adaptation. We identified 23,815 candidate salt-responsive genes with significantly differential expression under seawater and freshwater treatments. Based on the reverse transcription-polymerase chain reaction (RT-PCR) and real-time PCR analyses, we verified the changes in expression levels for a number of candidate genes. The functional enrichment analyses for the candidate genes showed tissue-specific patterns of transcriptome remodelling upon salt stress in the roots and the leaves. The transcriptome of M. pinnata will provide valuable gene resources for future application in crop improvement. In addition, this study sets a good example for large-scale identification of salt-responsive genes in non-model organisms using the sequencing-based approach.

  18. Simultaneous pyrosequencing of the 16S rRNA, IncP-1 trfA, and merA genes

    DEFF Research Database (Denmark)

    Holmsgaard, Peter Nikolai; Sørensen, Søren Johannes; Hansen, Lars H.

    2013-01-01

    The use of amplicon pyrosequencing makes it possible to produce thousands of sequences of the same gene at relatively low costs. Here we show that it is possible to simultaneously sequence the 16S rRNA gene, IncP-1 trfA gene and mercury reductase gene (merA) as a way for screening the diversity...

  19. Plant transcriptomics and responses to environmental stress: an ...

    Indian Academy of Sciences (India)

    3Centre for Environmental Research, Near East University, 33010, Lefkosha, Turkish Republic of the Northern Cyprus. 4Department of ...... Transcriptomic analysis of sense and antisense strands of .... 2008 Stem cell transcriptome profiling via.

  20. Sputum microbiota in tuberculosis as revealed by 16S rRNA pyrosequencing.

    Directory of Open Access Journals (Sweden)

    Man Kit Cheung

    Full Text Available BACKGROUND: Tuberculosis (TB remains a global threat in the 21st century. Traditional studies of the disease are focused on the single pathogen Mycobacterium tuberculosis. Recent studies have revealed associations of some diseases with an imbalance in the microbial community. Characterization of the TB microbiota could allow a better understanding of the disease. METHODOLOGY/PRINCIPAL FINDINGS: Here, the sputum microbiota in TB infection was examined by using 16S rRNA pyrosequencing. A total of 829,873 high-quality sequencing reads were generated from 22 TB and 14 control sputum samples. Firmicutes, Proteobacteria, Bacteroidetes, Actinobacteria, and Fusobacteria were the five major bacterial phyla recovered, which together composed over 98% of the microbial community. Proteobacteria and Bacteroidetes were more represented in the TB samples and Firmicutes was more predominant in the controls. Sixteen major bacterial genera were recovered. Streptococcus, Neisseria and Prevotella were the most predominant genera, which were dominated by several operational taxonomic units grouped at a 97% similarity level. Actinomyces, Fusobacterium, Leptotrichia, Prevotella, Streptococcus, and Veillonella were found in all TB samples, possibly representing the core genera in TB sputum microbiota. The less represented genera Mogibacterium, Moryella and Oribacterium were enriched statistically in the TB samples, while a genus belonging to the unclassified Lactobacillales was enriched in the controls. The diversity of microbiota was similar in the TB and control samples. CONCLUSIONS/SIGNIFICANCE: The composition and diversity of sputum microbiota in TB infection was characterized for the first time by using high-throughput pyrosequencing. It lays the framework for examination of potential roles played by the diverse microbiota in TB pathogenesis and progression, and could ultimately facilitate advances in TB treatment.

  1. Pyrosequencing analysis of oral microbiota in children with severe early childhood dental caries.

    Science.gov (United States)

    Jiang, Wen; Zhang, Jie; Chen, Hui

    2013-11-01

    Severe early childhood caries are a prevalent public health problem among preschool children throughout the world. However, little is known about the microbiota found in association with severe early childhood caries. Our study aimed to explore the bacterial microbiota of dental plaques to study the etiology of severe early childhood caries through pyrosequencing analysis based on 16S rRNA gene V1-V3 hypervariable regions. Forty participants were enrolled in the study, and we obtained twenty samples of supragingival plaque from caries-free subjects and twenty samples from subjects with severe early childhood caries. A total of 175,918 reads met the quality control standards, and the bacteria found belonged to fourteen phyla and sixty-three genera. Our results show the overall structure and microbial composition of oral bacterial communities, and they suggest that these bacteria may present a core microbiome in the dental plaque microbiota. Three genera, Streptococcus, Granulicatella, and Actinomyces, were increased significantly in children with severe dental cavities. These data may facilitate improvements in the prevention and treatment of severe early childhood caries.

  2. Pyrosequencing survey of intestinal microbiota diversity in cultured sea bass (Dicentrarchus labrax) fed functional diets.

    OpenAIRE

    Carda Diéguez, Miguel; Mira, Alex; Fouz Rodríguez, Belén

    2014-01-01

    The routine use of chemotherapy to control bacterial diseases in aquatic populations has resulted in the development and spread of antibiotic resistance. The inclusion of immunostimulants in fish diets (functional diets) is one of the main strategies to solve this threat. This study aimed to analyse the intestinal microbiota of cultured European sea bass (Dicentrarchus labrax) fed two functional diets applying pyrosequencing of PCR-amplified 16S rRNA gene. Quality-filtered reads were assigned...

  3. Strategic and Operational Plan for Integrating Transcriptomics ...

    Science.gov (United States)

    Plans for incorporating high throughput transcriptomics into the current high throughput screening activities at NCCT; the details are in the attached slide presentation presentation on plans for incorporating high throughput transcriptomics into the current high throughput screening activities at NCCT, given at the OECD meeting on June 23, 2016

  4. A glance at quality score: implication for de novo transcriptome reconstruction of Illumina reads

    Directory of Open Access Journals (Sweden)

    Stanley Kimbung Mbandi

    2014-02-01

    Full Text Available Downstream analyses of short-reads from next-generation sequencing platforms are often preceded by a pre-processing step that removes uncalled and wrongly called bases. Standard approaches rely on their associated base quality scores to retain the read or a portion of it when the score is above a predefined threshold. It is difficult to differentiate sequencing error from biological variation without a reference using quality scores. The effects of quality score based trimming have not been systematically studied in de novo transcriptome assembly. Using RNA-Seq data produced from Illumina, we teased out the effects of quality score base filtering or trimming on de novo transcriptome reconstruction. We showed that assemblies produced from reads subjected to different quality score thresholds contain truncated and missing transfrags when compared to those from untrimmed reads. Our data supports the fact that de novo assembling of untrimmed data is challenging for de Bruijn graph assemblers. However, our results indicates that comparing the assemblies from untrimmed and trimmed read subsets can suggest appropriate filtering parameters and enable selection of the optimum de novo transcriptome assembly in non-model organisms.

  5. Evaluation of the bacterial diversity of pressure ulcers using bTEFAP pyrosequencing.

    Science.gov (United States)

    Smith, Drake M; Snow, David E; Rees, Eric; Zischkau, Ann M; Hanson, J Delton; Wolcott, Randall D; Sun, Yan; White, Jennifer; Kumar, Shashi; Dowd, Scot E

    2010-09-21

    Decubitus ulcers, also known as bedsores or pressure ulcers, affect millions of hospitalized patients each year. The microflora of chronic wounds such as ulcers most commonly exist in the biofilm phenotype and have been known to significantly impair normal healing trajectories. Bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP), a universal bacterial identification method, was used to identify bacterial populations in 49 decubitus ulcers. Diversity estimators were utilized and wound community compositions analyzed in relation to metadata such as Age, race, gender, and comorbidities. Decubitus ulcers are shown to be polymicrobial in nature with no single bacterium exclusively colonizing the wounds. The microbial community among such ulcers is highly variable. While there are between 3 and 10 primary populations in each wound there can be hundreds of different species present many of which are in trace amounts. There is no clearly significant differences in the microbial ecology of decubitus ulcer in relation to metadata except when considering diabetes. The microbial populations and composition in the decubitus ulcers of diabetics may be significantly different from the communities in non-diabetics. Based upon the continued elucidation of chronic wound bioburdens as polymicrobial infections, it is recommended that, in addition to traditional biofilm-based wound care strategies, an antimicrobial/antibiofilm treatment program can be tailored to each patient's respective wound microflora.

  6. Evaluation of the bacterial diversity of Pressure ulcers using bTEFAP pyrosequencing

    Directory of Open Access Journals (Sweden)

    Wolcott Randall D

    2010-09-01

    Full Text Available Abstract Background Decubitus ulcers, also known as bedsores or pressure ulcers, affect millions of hospitalized patients each year. The microflora of chronic wounds such as ulcers most commonly exist in the biofilm phenotype and have been known to significantly impair normal healing trajectories. Methods Bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP, a universal bacterial identification method, was used to identify bacterial populations in 49 decubitus ulcers. Diversity estimators were utilized and wound community compositions analyzed in relation to metadata such as Age, race, gender, and comorbidities. Results Decubitus ulcers are shown to be polymicrobial in nature with no single bacterium exclusively colonizing the wounds. The microbial community among such ulcers is highly variable. While there are between 3 and 10 primary populations in each wound there can be hundreds of different species present many of which are in trace amounts. There is no clearly significant differences in the microbial ecology of decubitus ulcer in relation to metadata except when considering diabetes. The microbial populations and composition in the decubitus ulcers of diabetics may be significantly different from the communities in non-diabetics. Conclusions Based upon the continued elucidation of chronic wound bioburdens as polymicrobial infections, it is recommended that, in addition to traditional biofilm-based wound care strategies, an antimicrobial/antibiofilm treatment program can be tailored to each patient's respective wound microflora.

  7. The Human Blood Metabolome-Transcriptome Interface.

    Directory of Open Access Journals (Sweden)

    Jörg Bartel

    2015-06-01

    Full Text Available Biological systems consist of multiple organizational levels all densely interacting with each other to ensure function and flexibility of the system. Simultaneous analysis of cross-sectional multi-omics data from large population studies is a powerful tool to comprehensively characterize the underlying molecular mechanisms on a physiological scale. In this study, we systematically analyzed the relationship between fasting serum metabolomics and whole blood transcriptomics data from 712 individuals of the German KORA F4 cohort. Correlation-based analysis identified 1,109 significant associations between 522 transcripts and 114 metabolites summarized in an integrated network, the 'human blood metabolome-transcriptome interface' (BMTI. Bidirectional causality analysis using Mendelian randomization did not yield any statistically significant causal associations between transcripts and metabolites. A knowledge-based interpretation and integration with a genome-scale human metabolic reconstruction revealed systematic signatures of signaling, transport and metabolic processes, i.e. metabolic reactions mainly belonging to lipid, energy and amino acid metabolism. Moreover, the construction of a network based on functional categories illustrated the cross-talk between the biological layers at a pathway level. Using a transcription factor binding site enrichment analysis, this pathway cross-talk was further confirmed at a regulatory level. Finally, we demonstrated how the constructed networks can be used to gain novel insights into molecular mechanisms associated to intermediate clinical traits. Overall, our results demonstrate the utility of a multi-omics integrative approach to understand the molecular mechanisms underlying both normal physiology and disease.

  8. The Human Blood Metabolome-Transcriptome Interface

    Science.gov (United States)

    Schramm, Katharina; Adamski, Jerzy; Gieger, Christian; Herder, Christian; Carstensen, Maren; Peters, Annette; Rathmann, Wolfgang; Roden, Michael; Strauch, Konstantin; Suhre, Karsten; Kastenmüller, Gabi; Prokisch, Holger; Theis, Fabian J.

    2015-01-01

    Biological systems consist of multiple organizational levels all densely interacting with each other to ensure function and flexibility of the system. Simultaneous analysis of cross-sectional multi-omics data from large population studies is a powerful tool to comprehensively characterize the underlying molecular mechanisms on a physiological scale. In this study, we systematically analyzed the relationship between fasting serum metabolomics and whole blood transcriptomics data from 712 individuals of the German KORA F4 cohort. Correlation-based analysis identified 1,109 significant associations between 522 transcripts and 114 metabolites summarized in an integrated network, the ‘human blood metabolome-transcriptome interface’ (BMTI). Bidirectional causality analysis using Mendelian randomization did not yield any statistically significant causal associations between transcripts and metabolites. A knowledge-based interpretation and integration with a genome-scale human metabolic reconstruction revealed systematic signatures of signaling, transport and metabolic processes, i.e. metabolic reactions mainly belonging to lipid, energy and amino acid metabolism. Moreover, the construction of a network based on functional categories illustrated the cross-talk between the biological layers at a pathway level. Using a transcription factor binding site enrichment analysis, this pathway cross-talk was further confirmed at a regulatory level. Finally, we demonstrated how the constructed networks can be used to gain novel insights into molecular mechanisms associated to intermediate clinical traits. Overall, our results demonstrate the utility of a multi-omics integrative approach to understand the molecular mechanisms underlying both normal physiology and disease. PMID:26086077

  9. Transcriptome sequences resolve deep relationships of the grape family.

    Science.gov (United States)

    Wen, Jun; Xiong, Zhiqiang; Nie, Ze-Long; Mao, Likai; Zhu, Yabing; Kan, Xian-Zhao; Ickert-Bond, Stefanie M; Gerrath, Jean; Zimmer, Elizabeth A; Fang, Xiao-Dong

    2013-01-01

    Previous phylogenetic studies of the grape family (Vitaceae) yielded poorly resolved deep relationships, thus impeding our understanding of the evolution of the family. Next-generation sequencing now offers access to protein coding sequences very easily, quickly and cost-effectively. To improve upon earlier work, we extracted 417 orthologous single-copy nuclear genes from the transcriptomes of 15 species of the Vitaceae, covering its phylogenetic diversity. The resulting transcriptome phylogeny provides robust support for the deep relationships, showing the phylogenetic utility of transcriptome data for plants over a time scale at least since the mid-Cretaceous. The pros and cons of transcriptome data for phylogenetic inference in plants are also evaluated.

  10. Perspectives on the use of transcriptomics to advance biofuels

    Directory of Open Access Journals (Sweden)

    Siseon Lee

    2015-11-01

    Full Text Available As a field within the energy research sector, bioenergy is continuously expanding. Although much has been achieved and the yields of both ethanol and butanol have been improved, many avenues of research to further increase these yields still remain. This review covers current research related with transcriptomics and the application of this high-throughput analytical tool to engineer both microbes and plants with the penultimate goal being better biofuel production and yields. The initial focus is given to the responses of fermentative microbes during the fermentative production of acids, such as butyric acid, and solvents, including ethanol and butanol. As plants offer the greatest natural renewable source of fermentable sugars within the form of lignocellulose, the second focus area is the transcriptional responses of microbes when exposed to plant hydrolysates and lignin-related compounds. This is of particular importance as the acid/base hydrolysis methods commonly employed to make the plant-based cellulose available for enzymatic hydrolysis to sugars also generates significant amounts of lignin-derivatives that are inhibitory to fermentative bacteria and microbes. The article then transitions to transcriptional analyses of lignin-degrading organisms, such as Phanerochaete chrysosporium, as an alternative to acid/base hydrolysis. The final portion of this article will discuss recent transcriptome analyses of plants and, in particular, the genes involved in lignin production. The rationale behind these studies is to eventually reduce the lignin content present within these plants and, consequently, the amount of inhibitors generated during the acid/base hydrolysis of the lignocelluloses. All four of these topics represent key areas where transcriptomic research is currently being conducted to identify microbial genes and their responses to products and inhibitors as well as those related with lignin degradation/formation.

  11. The utility of transcriptomics in fish conservation.

    Science.gov (United States)

    Connon, Richard E; Jeffries, Ken M; Komoroske, Lisa M; Todgham, Anne E; Fangue, Nann A

    2018-01-29

    There is growing recognition of the need to understand the mechanisms underlying organismal resilience (i.e. tolerance, acclimatization) to environmental change to support the conservation management of sensitive and economically important species. Here, we discuss how functional genomics can be used in conservation biology to provide a cellular-level understanding of organismal responses to environmental conditions. In particular, the integration of transcriptomics with physiological and ecological research is increasingly playing an important role in identifying functional physiological thresholds predictive of compensatory responses and detrimental outcomes, transforming the way we can study issues in conservation biology. Notably, with technological advances in RNA sequencing, transcriptome-wide approaches can now be applied to species where no prior genomic sequence information is available to develop species-specific tools and investigate sublethal impacts that can contribute to population declines over generations and undermine prospects for long-term conservation success. Here, we examine the use of transcriptomics as a means of determining organismal responses to environmental stressors and use key study examples of conservation concern in fishes to highlight the added value of transcriptome-wide data to the identification of functional response pathways. Finally, we discuss the gaps between the core science and policy frameworks and how thresholds identified through transcriptomic evaluations provide evidence that can be more readily used by resource managers. © 2018. Published by The Company of Biologists Ltd.

  12. Improving production of ?-lactam antibiotics by Penicillium chrysogenum : Metabolic engineering based on transcriptome analysis

    NARCIS (Netherlands)

    Veiga, T.

    2012-01-01

    In Chapters 2-5 of this thesis, the applicability of transcriptome analysis to guide metabolic engineering strategies in P. chrysogenum is explored by investigating four cellular processes that are of potential relevance for industrial production of ?-lactam antibiotics: - Regulation of secondary

  13. SSR marker development and intraspecific genetic divergence exploration of Chrysanthemum indicum based on transcriptome analysis.

    Science.gov (United States)

    Han, Zhengzhou; Ma, Xinye; Wei, Min; Zhao, Tong; Zhan, Ruoting; Chen, Weiwen

    2018-04-25

    Chrysanthemum indicum L., an important ancestral species of the flowering plant chrysanthemum, can be used as medicine and for functional food development. Due to the lack of hereditary information for this species and the difficulty of germplasm identification, we herein provide new genetic insight from the perspective of intraspecific transcriptome comparison and present single sequence repeat (SSR) molecular marker recognition technology. Through the study of a diploid germplasm (DIWNT) and a tetraploid germplasm (DIWT), the following outcome were obtained. (1) A significant difference in Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations for specific homologous genes was observed using the OrthoMCL method for the identification of homologous gene families between the two cytotypes. Ka/Ks analysis of common, single-copy homologous family members also revealed a greater difference among genes that experienced positive selection than among those experiencing positive selection. (2) Of more practical value, 2575 SSR markers were predicted and partly verified. We used TaxonGap as a visual tool to inspect genotype uniqueness and screen for high-performance molecular loci; we recommend four primers of 65 randomly selected primers with a combined identification success rate of 88.6% as priorities for further development of DNA fingerprinting of C. indicum germplasm. The SSR technology based on next-generation sequencing was proved to be successful in the identification of C. indicum germplasms. And the information on the intraspecfic genetic divergence generated by transcriptome comparison deepened the understanding of this complex species' nature.

  14. 3rd International Conference on Transcriptomics

    OpenAIRE

    John A Daniel

    2017-01-01

    Conference Series has been instrumental in conducting international Biochemistry meetings for seven years, and very excited to expand Europe, America and Asia Pacific continents. Previous meetings were held in major cities like Philadelphia, Orlando with success the meetings again scheduled in three continents. 3rd International Conference on Transcriptomics to be held during October 30 - November 01, 2017 at Bangkok, Thailand The Global Transcriptomics business sector to develop at a C...

  15. Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.

    Science.gov (United States)

    Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong

    2014-05-01

    We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in mango cp genome, 91 found to be protein coding. Sequence analysis revealed cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in mango cp genome supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.

  16. Comparative Transcriptomics to Identify Novel Genes and Pathways in Dinoflagellates

    Science.gov (United States)

    Ryan, D.

    2016-02-01

    The unarmored dinoflagellate Karenia brevis is among the most prominent harmful, bloom-forming phytoplankton species in the Gulf of Mexico. During blooms, the polyketides PbTx-1 and PbTx-2 (brevetoxins) are produced by K. brevis. Brevetoxins negatively impact human health and the Gulf shellfish harvest. However, the genes underlying brevetoxin synthesis are currently unknown. Because the K. brevis genome is extremely large ( 1 × 1011 base pairs long), and with a high proportion of repetitive, non-coding DNA, it has not been sequenced. In fact, large, repetitive genomes are common among the dinoflagellate group. High-throughput RNA sequencing technology enabled us to assemble Karenia transcriptomes de novo and investigate potential genes in the brevetoxin pathway through comparative transcriptomics. The brevetoxin profile varies among K. brevis clonal cultures. For example, well-documented Wilson-CCFWC268 typically produces 8-10 pg PbTx per cell, whereas SP1 produces differences in gene expression. Of the 85,000 transcripts in the K. brevis transcriptome, 4,600 transcripts, including novel unannotated orthologs and putative polyketide synthases (PKSs), were only expressed by brevetoxin-producing K. brevis and K. papilionacea, not K. mikimotoi. Examination of gene expression between the typical- and low-toxin Wilson clones identified about 3,500 genes with significantly different expression levels, including 2 putative PKSs. One of the 2 PKSs was only found in the brevetoxin-producing Karenia species. These transcriptomes could not have been characterized without high-throughput RNA sequencing.

  17. Transcriptome sequences resolve deep relationships of the grape family.

    Directory of Open Access Journals (Sweden)

    Jun Wen

    Full Text Available Previous phylogenetic studies of the grape family (Vitaceae yielded poorly resolved deep relationships, thus impeding our understanding of the evolution of the family. Next-generation sequencing now offers access to protein coding sequences very easily, quickly and cost-effectively. To improve upon earlier work, we extracted 417 orthologous single-copy nuclear genes from the transcriptomes of 15 species of the Vitaceae, covering its phylogenetic diversity. The resulting transcriptome phylogeny provides robust support for the deep relationships, showing the phylogenetic utility of transcriptome data for plants over a time scale at least since the mid-Cretaceous. The pros and cons of transcriptome data for phylogenetic inference in plants are also evaluated.

  18. The Human Transcriptome: An Unfinished Story

    Directory of Open Access Journals (Sweden)

    Mihaela Pertea

    2012-06-01

    Full Text Available Despite recent technological advances, the study of the human transcriptome is still in its early stages. Here we provide an overview of the complex human transcriptomic landscape, present the bioinformatics challenges posed by the vast quantities of transcriptomic data, and discuss some of the studies that have tried to determine how much of the human genome is transcribed. Recent evidence has suggested that more than 90% of the human genome is transcribed into RNA. However, this view has been strongly contested by groups of scientists who argued that many of the observed transcripts are simply the result of transcriptional noise. In this review, we conclude that the full extent of transcription remains an open question that will not be fully addressed until we decipher the complete range and biological diversity of the transcribed genomic sequences.

  19. Comparisons of the fungal and protistan communities among different marine sponge holobionts by pyrosequencing.

    Science.gov (United States)

    He, Liming; Liu, Fang; Karuppiah, Valliappan; Ren, Yi; Li, Zhiyong

    2014-05-01

    To date, the knowledge of eukaryotic communities associated with sponges remains limited compared with prokaryotic communities. In a manner similar to prokaryotes, it could be hypothesized that sponge holobionts have phylogenetically diverse eukaryotic symbionts, and the eukaryotic community structures in different sponge holobionts were probably different. In order to test this hypothesis, the communities of eukaryota associated with 11 species of South China Sea sponges were compared with the V4 region of 18S ribosomal ribonucleic acid gene using 454 pyrosequencing. Consequently, 135 and 721 unique operational taxonomic units (OTUs) of fungi and protists were obtained at 97 % sequence similarity, respectively. These sequences were assigned to 2 phyla of fungi (Ascomycota and Basidiomycota) and 9 phyla of protists including 5 algal phyla (Chlorophyta, Haptophyta, Streptophyta, Rhodophyta, and Stramenopiles) and 4 protozoal phyla (Alveolata, Cercozoa, Haplosporidia, and Radiolaria) including 47 orders (12 fungi, 35 protists). Entorrhizales of fungi and 18 orders of protists were detected in marine sponges for the first time. Particularly, Tilletiales of fungi and Chlorocystidales of protists were detected for the first time in marine habitats. Though Ascomycota, Alveolata, and Radiolaria were detected in all the 11 sponge species, sponge holobionts have different fungi and protistan communities according to OTU comparison and principal component analysis at the order level. This study provided the first insights into the fungal and protistan communities associated with different marine sponge holobionts using pyrosequencing, thus further extending the knowledge on sponge-associated eukaryotic diversity.

  20. Transcriptomic changes during maize roots development responsive to Cadmium (Cd) pollution using comparative RNAseq-based approach

    International Nuclear Information System (INIS)

    Peng, Hua; He, Xiujing; Gao, Jian; Ma, Haixia; Zhang, Zhiming; Shen, Yaou; Pan, Guangtang; Lin, Haijian

    2015-01-01

    The heavy metal cadmium (Cd), acts as a widespread environmental contaminant, which has shown to adversely affect human health, food safety and ecosystem safety in recent years. However, research on how plant respond to various kinds of heavy metal stress is scarcely reported, especially for understanding of complex molecular regulatory mechanisms and elucidating the gene networks of plant respond to Cd stress. Here, transcriptomic changes during Mo17 and B73 seedlings development responsive to Cd pollution were investigated and comparative RNAseq-based approach in both genotypes were performed. 115 differential expression genes (DEGs) with significant alteration in expression were found co-modulated in both genotypes during the maize seedling development; of those, most of DGEs were found comprised of stress and defense responses proteins, transporters, as well as transcription factors, such as thaumatin-like protein, ZmOPR2 and ZmOPR5. More interestingly, genotype-specific transcriptional factors changes induced by Cd stress were found contributed to the regulatory mechanism of Cd sensitivity in both different genotypes. Moreover, 12 co-expression modules associated with specific biological processes or pathways (M1 to M12) were identified by consensus co-expression network. These results will expand our understanding of complex molecular mechanism of response and defense to Cd exposure in maize seedling roots. - Highlights: • Transcriptomic changes responsive to Cd pollution using comparative RNAseq-based approach. • 115 differential expression genes (DEGs) were found co-modulated in both genotypes. • Most of DGEs belong to stress and defense responses proteins, transporters, transcription factors. • 12 co-expression modules associated with specific biological processes or pathways. • Genotype-specific transcriptional factors changes induced by Cd stress were found

  1. Transcriptomic changes during maize roots development responsive to Cadmium (Cd) pollution using comparative RNAseq-based approach

    Energy Technology Data Exchange (ETDEWEB)

    Peng, Hua [Key Laboratory of Biology and Genetic Improvement of Maize in Southwest Region, Ministry of Agriculture, Maize Research Institute, Sichuan Agricultural University, Wenjiang, Sichuan, 611130 (China); Sichuan Tourism College, Chengdu, 610000, Sichuan (China); He, Xiujing [Key Laboratory of Biology and Genetic Improvement of Maize in Southwest Region, Ministry of Agriculture, Maize Research Institute, Sichuan Agricultural University, Wenjiang, Sichuan, 611130 (China); Gao, Jian [Institute of Pathology and Southwest Cancer Center, Southwest Hospital, Third Military Medical University, Key Laboratory of Tumor Immunopathology, Ministry of Education of China, Chongqing (China); Ma, Haixia; Zhang, Zhiming; Shen, Yaou [Key Laboratory of Biology and Genetic Improvement of Maize in Southwest Region, Ministry of Agriculture, Maize Research Institute, Sichuan Agricultural University, Wenjiang, Sichuan, 611130 (China); Pan, Guangtang, E-mail: pangt@sicau.edu.cn [Key Laboratory of Biology and Genetic Improvement of Maize in Southwest Region, Ministry of Agriculture, Maize Research Institute, Sichuan Agricultural University, Wenjiang, Sichuan, 611130 (China); Lin, Haijian, E-mail: linhj521@gmail.com [Key Laboratory of Biology and Genetic Improvement of Maize in Southwest Region, Ministry of Agriculture, Maize Research Institute, Sichuan Agricultural University, Wenjiang, Sichuan, 611130 (China)

    2015-09-04

    The heavy metal cadmium (Cd), acts as a widespread environmental contaminant, which has shown to adversely affect human health, food safety and ecosystem safety in recent years. However, research on how plant respond to various kinds of heavy metal stress is scarcely reported, especially for understanding of complex molecular regulatory mechanisms and elucidating the gene networks of plant respond to Cd stress. Here, transcriptomic changes during Mo17 and B73 seedlings development responsive to Cd pollution were investigated and comparative RNAseq-based approach in both genotypes were performed. 115 differential expression genes (DEGs) with significant alteration in expression were found co-modulated in both genotypes during the maize seedling development; of those, most of DGEs were found comprised of stress and defense responses proteins, transporters, as well as transcription factors, such as thaumatin-like protein, ZmOPR2 and ZmOPR5. More interestingly, genotype-specific transcriptional factors changes induced by Cd stress were found contributed to the regulatory mechanism of Cd sensitivity in both different genotypes. Moreover, 12 co-expression modules associated with specific biological processes or pathways (M1 to M12) were identified by consensus co-expression network. These results will expand our understanding of complex molecular mechanism of response and defense to Cd exposure in maize seedling roots. - Highlights: • Transcriptomic changes responsive to Cd pollution using comparative RNAseq-based approach. • 115 differential expression genes (DEGs) were found co-modulated in both genotypes. • Most of DGEs belong to stress and defense responses proteins, transporters, transcription factors. • 12 co-expression modules associated with specific biological processes or pathways. • Genotype-specific transcriptional factors changes induced by Cd stress were found.

  2. 454-Pyrosequencing Analysis of Bacterial Communities from Autotrophic Nitrogen Removal Bioreactors Utilizing Universal Primers: Effect of Annealing Temperature

    Directory of Open Access Journals (Sweden)

    Alejandro Gonzalez-Martinez

    2015-01-01

    Full Text Available Identification of anaerobic ammonium oxidizing (anammox bacteria by molecular tools aimed at the evaluation of bacterial diversity in autotrophic nitrogen removal systems is limited by the difficulty to design universal primers for the Bacteria domain able to amplify the anammox 16S rRNA genes. A metagenomic analysis (pyrosequencing of total bacterial diversity including anammox population in five autotrophic nitrogen removal technologies, two bench-scale models (MBR and Low Temperature CANON and three full-scale bioreactors (anammox, CANON, and DEMON, was successfully carried out by optimization of primer selection and PCR conditions (annealing temperature. The universal primer 530F was identified as the best candidate for total bacteria and anammox bacteria diversity coverage. Salt-adjusted optimum annealing temperature of primer 530F was calculated (47°C and hence a range of annealing temperatures of 44–49°C was tested. Pyrosequencing data showed that annealing temperature of 45°C yielded the best results in terms of species richness and diversity for all bioreactors analyzed.

  3. Phylogenetic characterization of fecal microbial communities of dogs fed diets with or without supplemental dietary fiber using 454 pyrosequencing.

    Directory of Open Access Journals (Sweden)

    Ingmar S Middelbos

    Full Text Available BACKGROUND: Dogs suffer from many of the same maladies as humans that may be affected by the gut microbiome, but knowledge of the canine microbiome is incomplete. This work aimed to use 16S rDNA tag pyrosequencing to phylogenetically characterize hindgut microbiome in dogs and determine how consumption of dietary fiber affects community structure. PRINCIPAL FINDINGS: Six healthy adult dogs were used in a crossover design. A control diet without supplemental fiber and a beet pulp-supplemented (7.5% diet were fed. Fecal DNA was extracted and the V3 hypervariable region of the microbial 16S rDNA gene amplified using primers suitable for 454-pyrosequencing. Microbial diversity was assessed on random 2000-sequence subsamples of individual and pooled DNA samples by diet. Our dataset comprised 77,771 reads with an average length of 141 nt. Individual samples contained approximately 129 OTU, with Fusobacteria (23-40% of reads, Firmicutes (14-28% of reads and Bacteroidetes (31-34% of reads being co-dominant phyla. Feeding dietary fiber generally decreased Fusobacteria and increased Firmicutes, but these changes were not equally apparent in all dogs. UniFrac analysis revealed that structure of the gut microbiome was affected by diet and Firmicutes appeared to play a strong role in by-diet clustering. CONCLUSIONS: Our data suggest three co-dominant bacterial phyla in the canine hindgut. Furthermore, a relatively small amount of dietary fiber changed the structure of the gut microbiome detectably. Our data are among the first to characterize the healthy canine gut microbiome using pyrosequencing and provide a basis for studies focused on devising dietary interventions for microbiome-associated diseases.

  4. De Novo Assembly of the Pea (Pisum sativum L. Nodule Transcriptome

    Directory of Open Access Journals (Sweden)

    Vladimir A. Zhukov

    2015-01-01

    Full Text Available The large size and complexity of the garden pea (Pisum sativum L. genome hamper its sequencing and the discovery of pea gene resources. Although transcriptome sequencing provides extensive information about expressed genes, some tissue-specific transcripts can only be identified from particular organs under appropriate conditions. In this study, we performed RNA sequencing of polyadenylated transcripts from young pea nodules and root tips on an Illumina GAIIx system, followed by de novo transcriptome assembly using the Trinity program. We obtained more than 58,000 and 37,000 contigs from “Nodules” and “Root Tips” assemblies, respectively. The quality of the assemblies was assessed by comparison with pea expressed sequence tags and transcriptome sequencing project data available from NCBI website. The “Nodules” assembly was compared with the “Root Tips” assembly and with pea transcriptome sequencing data from projects indicating tissue specificity. As a result, approximately 13,000 nodule-specific contigs were found and annotated by alignment to known plant protein-coding sequences and by Gene Ontology searching. Of these, 581 sequences were found to possess full CDSs and could thus be considered as novel nodule-specific transcripts of pea. The information about pea nodule-specific gene sequences can be applied for gene-based markers creation, polymorphism studies, and real-time PCR.

  5. KONAGAbase: a genomic and transcriptomic database for the diamondback moth, Plutella xylostella.

    Science.gov (United States)

    Jouraku, Akiya; Yamamoto, Kimiko; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Narukawa, Junko; Miyamoto, Kazuhisa; Kurita, Kanako; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Noda, Hiroaki

    2013-07-09

    The diamondback moth (DBM), Plutella xylostella, is one of the most harmful insect pests for crucifer crops worldwide. DBM has rapidly evolved high resistance to most conventional insecticides such as pyrethroids, organophosphates, fipronil, spinosad, Bacillus thuringiensis, and diamides. Therefore, it is important to develop genomic and transcriptomic DBM resources for analysis of genes related to insecticide resistance, both to clarify the mechanism of resistance of DBM and to facilitate the development of insecticides with a novel mode of action for more effective and environmentally less harmful insecticide rotation. To contribute to this goal, we developed KONAGAbase, a genomic and transcriptomic database for DBM (KONAGA is the Japanese word for DBM). KONAGAbase provides (1) transcriptomic sequences of 37,340 ESTs/mRNAs and 147,370 RNA-seq contigs which were clustered and assembled into 84,570 unigenes (30,695 contigs, 50,548 pseudo singletons, and 3,327 singletons); and (2) genomic sequences of 88,530 WGS contigs with 246,244 degenerate contigs and 106,455 singletons from which 6,310 de novo identified repeat sequences and 34,890 predicted gene-coding sequences were extracted. The unigenes and predicted gene-coding sequences were clustered and 32,800 representative sequences were extracted as a comprehensive putative gene set. These sequences were annotated with BLAST descriptions, Gene Ontology (GO) terms, and Pfam descriptions, respectively. KONAGAbase contains rich graphical user interface (GUI)-based web interfaces for easy and efficient searching, browsing, and downloading sequences and annotation data. Five useful search interfaces consisting of BLAST search, keyword search, BLAST result-based search, GO tree-based search, and genome browser are provided. KONAGAbase is publicly available from our website (http://dbm.dna.affrc.go.jp/px/) through standard web browsers. KONAGAbase provides DBM comprehensive transcriptomic and draft genomic sequences with

  6. Barcoding lichen-forming fungi using 454 pyrosequencing is challenged by artifactual and biological sequence variation.

    Science.gov (United States)

    Mark, Kristiina; Cornejo, Carolina; Keller, Christine; Flück, Daniela; Scheidegger, Christoph

    2016-09-01

    Although lichens (lichen-forming fungi) play an important role in the ecological integrity of many vulnerable landscapes, only a minority of lichen-forming fungi have been barcoded out of the currently accepted ∼18 000 species. Regular Sanger sequencing can be problematic when analyzing lichens since saprophytic, endophytic, and parasitic fungi live intimately admixed, resulting in low-quality sequencing reads. Here, high-throughput, long-read 454 pyrosequencing in a GS FLX+ System was tested to barcode the fungal partner of 100 epiphytic lichen species from Switzerland using fungal-specific primers when amplifying the full internal transcribed spacer region (ITS). The present study shows the potential of DNA barcoding using pyrosequencing, in that the expected lichen fungus was successfully sequenced for all samples except one. Alignment solutions such as BLAST were found to be largely adequate for the generated long reads. In addition, the NCBI nucleotide database-currently the most complete database for lichen-forming fungi-can be used as a reference database when identifying common species, since the majority of analyzed lichens were identified correctly to the species or at least to the genus level. However, several issues were encountered, including a high sequencing error rate, multiple ITS versions in a genome (incomplete concerted evolution), and in some samples the presence of mixed lichen-forming fungi (possible lichen chimeras).

  7. Systematic Identification and Assessment of Therapeutic Targets for Breast Cancer Based on Genome-Wide RNA Interference Transcriptomes

    Directory of Open Access Journals (Sweden)

    Yang Liu

    2017-02-01

    Full Text Available With accumulating public omics data, great efforts have been made to characterize the genetic heterogeneity of breast cancer. However, identifying novel targets and selecting the best from the sizeable lists of candidate targets is still a key challenge for targeted therapy, largely owing to the lack of economical, efficient and systematic discovery and assessment to prioritize potential therapeutic targets. Here, we describe an approach that combines the computational evaluation and objective, multifaceted assessment to systematically identify and prioritize targets for biological validation and therapeutic exploration. We first establish the reference gene expression profiles from breast cancer cell line MCF7 upon genome-wide RNA interference (RNAi of a total of 3689 genes, and the breast cancer query signatures using RNA-seq data generated from tissue samples of clinical breast cancer patients in the Cancer Genome Atlas (TCGA. Based on gene set enrichment analysis, we identified a set of 510 genes that when knocked down could significantly reverse the transcriptome of breast cancer state. We then perform multifaceted assessment to analyze the gene set to prioritize potential targets for gene therapy. We also propose drug repurposing opportunities and identify potentially druggable proteins that have been poorly explored with regard to the discovery of small-molecule modulators. Finally, we obtained a small list of candidate therapeutic targets for four major breast cancer subtypes, i.e., luminal A, luminal B, HER2+ and triple negative breast cancer. This RNAi transcriptome-based approach can be a helpful paradigm for relevant researches to identify and prioritize candidate targets for experimental validation.

  8. Pyrosequencing reveals diverse microbial community associated with the zoanthid Palythoa australiae from the South China Sea.

    Science.gov (United States)

    Sun, Wei; Zhang, Fengli; He, Liming; Li, Zhiyong

    2014-05-01

    Diverse sessile organisms inhabit the coral reef ecosystems, including corals, sponges, and sea anemones. In the past decades, scleractinian corals (Cnidaria, Anthozoa, Scleractinia) and their associated microorganisms have attracted much attention. Zoanthids (Cnidaria, Anthozoa, Zoanthidea) are commonly found in coral reefs. However, little is known about the community structure of zoanthid-associated microbiota. In this study, the microbial community associated with the zoanthid Palythoa australiae in the South China Sea was investigated by 454 pyrosequencing. As a result, 2,353 bacterial, 583 archaeal, and 36 eukaryotic microbial ribotypes were detected, respectively. A total of 22 bacterial phyla (16 formally described phyla and six candidate phyla) were recovered. Proteobacteria was the most abundant group, followed by Chloroflexi and Actinobacteria. High-abundance Rhizobiales and diverse Chloroflexi were observed in the bacterial community. The archaeal population was composed of Crenarchaeota and Euryarchaeota, with Marine Group I as the dominant lineage. In particular, Candidatus Nitrosopumilus dominated the archaeal community. Besides bacteria and archaea, the zoanthid harbored eukaryotic microorganisms including fungi and algae though their diversity was very low. This study provided the first insights into the microbial community associated with P. australiae by 454 pyrosequencing, consequently laid a basis for the understanding of the association of P. australiae-microbes symbioses.

  9. Transcriptome analysis of tube foot and large scale marker discovery in sea cucumber, Apostichopus japonicus.

    Science.gov (United States)

    Zhou, Xiaoxu; Wang, Hongdi; Cui, Jun; Qiu, Xuemei; Chang, Yaqing; Wang, Xiuli

    2016-12-01

    Tube foot as one of the ambulacral appendages types in Aspidochirote holothurioids, is known for their functions in locomotion, feeding, chemoreception, light sensitivity and respiration. In this study, we explored the characteristic of transcriptome in the tube foot of sea cucumber (Apostichopus japonicus). Our results showed that among 390 unigenes which specifically expressed in the tube foot, 190 of them were annotated. Based on the assembly transcriptome, we found 219,860 SNPs from 34,749 unigenes, 97,683, 53,624, 27,767 and 40,786 were located in CDSs, 5'-UTRs, 3'-UTRs and non-CDS separately. Furthermore, 12,114 SSRs were detected from 7394 unigenes. Target genes of four specifically expressed miRNAs (miR-29a, miR-29b, miR-278-3p and miR-2005) in tube foot were also predicted based on the transcriptome, which contain immune-related factors (MBL, VLRA, AjC3, MyD88, CFB), skin pigmentation (MITF), candidate regeneration factor (TRP) and holothurians autolysis-related factor (CL). These results develop a relatively large number of molecular markers and transcriptome resources, and will provide a foundation for further analyses on the function and molecular mechanisms underlying A. japonicas tube foot. Copyright © 2016 Elsevier Inc. All rights reserved.

  10. Barcoded pyrosequencing analysis of the microbial community in a simulator of the human gastrointestinal tract showed a colon region-specific microbiota modulation for two plant-derived polysaccharide blends.

    Science.gov (United States)

    Marzorati, Massimo; Maignien, Lois; Verhelst, An; Luta, Gabriela; Sinnott, Robert; Kerckhof, Frederiek Maarten; Boon, Nico; Van de Wiele, Tom; Possemiers, Sam

    2013-02-01

    The combination of a Simulator of the Human Intestinal Microbial Ecosystem with ad hoc molecular techniques (i.e. pyrosequencing, denaturing gradient gel electrophoresis and quantitative PCR) allowed an evaluation of the extent to which two plant polysaccharide supplements could modify a complex gut microbial community. The presence of Aloe vera gel powder and algae extract in product B as compared to the standard blend (product A) improved its fermentation along the entire simulated colon. The potential extended effect of product B in the simulated distal colon, as compared to product A, was confirmed by: (i) the separate clustering of the samples before and after the treatment in the phylogenetic-based dendrogram and OTU-based PCoA plot only for product B; (ii) a higher richness estimator (+33 vs. -36 % of product A); and (iii) a higher dynamic parameter (21 vs. 13 %). These data show that the combination of well designed in vitro simulators with barcoded pyrosequencing is a powerful tool for characterizing changes occurring in the gut microbiota following a treatment. However, for the quantification of low-abundance species-of interest because of their relationship to potential positive health effects (i.e. bifidobacteria or lactobacilli)-conventional molecular ecological approaches, such as PCR-DGGE and qPCR, still remain a very useful complementary tool.

  11. Chromosomal clustering of a human transcriptome reveals regulatory background

    Directory of Open Access Journals (Sweden)

    Purmann Antje

    2005-09-01

    Full Text Available Abstract Background There has been much evidence recently for a link between transcriptional regulation and chromosomal gene order, but the relationship between genomic organization, regulation and gene function in higher eukaryotes remains to be precisely defined. Results Here, we present evidence for organization of a large proportion of a human transcriptome into gene clusters throughout the genome, which are partly regulated by the same transcription factors, share biological functions and are characterized by non-housekeeping genes. This analysis was based on the cardiac transcriptome identified by our genome-wide array analysis of 55 human heart samples. We found 37% of these genes to be arranged mainly in adjacent pairs or triplets. A significant number of pairs of adjacent genes are putatively regulated by common transcription factors (p = 0.02. Furthermore, these gene pairs share a significant number of GO functional classification terms. We show that the human cardiac transcriptome is organized into many small clusters across the whole genome, rather than being concentrated in a few larger clusters. Conclusion Our findings suggest that genes expressed in concert are organized in a linear arrangement for coordinated regulation. Determining the relationship between gene arrangement, regulation and nuclear organization as well as gene function will have broad biological implications.

  12. Bacterial and diazotrophic diversities of endophytes in Dendrobium catenatum determined through barcoded pyrosequencing.

    Science.gov (United States)

    Li, Ou; Xiao, Rong; Sun, Lihua; Guan, Chenglin; Kong, Dedong; Hu, Xiufang

    2017-01-01

    As an epiphyte orchid, Dendrobium catenatum relies on microorganisms for requisite nutrients. Metagenome pyrosequencing based on 16S rRNA and nifH genes was used to characterize the bacterial and diazotrophic communities associated with D. catenatum collected from 5 districts in China. Based on Meta-16S rRNA sequencing, 22 bacterial phyla and 699 genera were identified, distributed as 125 genera from 8 phyla and 319 genera from 10 phyla shared by all the planting bases and all the tissues, respectively. The predominant Proteobacteria varied from 71.81% (GZ) to 96.08% (YN), and Delftia (10.39-38.42%), Burkholderia (2.71-15.98%), Escherichia/Shigella (4.90-25.12%), Pseudomonas (2.68-30.72%) and Sphingomonas (1.83-2.05%) dominated in four planting bases. Pseudomonas (17.94-22.06%), Escherichia/Shigella (6.59-11.59%), Delftia (9.65-22.14%) and Burkholderia (3.12-11.05%) dominated in all the tissues. According to Meta-nifH sequencing, 4 phyla and 45 genera were identified, while 17 genera and 24 genera from 4 phyla were shared by all the planting bases and all the tissues, respectively. Burkholderia and Bradyrhizobium were the most popular in the planting bases, followed by Methylovirgula and Mesorhizobium. Mesorhizobium was the most popular in different tissues, followed by Beijerinckia, Xanthobacter, and Burkholderia. Among the genera, 39 were completely overlapped with the results based on the 16S rRNA gene. In conclusion, abundant bacteria and diazotrophs were identified in common in different tissues of D. catenatum from five planting bases, which might play a great role in the supply of nutrients such as nitrogen. The exact abundance of phylum and genus on the different tissues from different planting bases need deeper sequencing with more samples.

  13. Bacterial and diazotrophic diversities of endophytes in Dendrobium catenatum determined through barcoded pyrosequencing.

    Directory of Open Access Journals (Sweden)

    Ou Li

    Full Text Available As an epiphyte orchid, Dendrobium catenatum relies on microorganisms for requisite nutrients. Metagenome pyrosequencing based on 16S rRNA and nifH genes was used to characterize the bacterial and diazotrophic communities associated with D. catenatum collected from 5 districts in China. Based on Meta-16S rRNA sequencing, 22 bacterial phyla and 699 genera were identified, distributed as 125 genera from 8 phyla and 319 genera from 10 phyla shared by all the planting bases and all the tissues, respectively. The predominant Proteobacteria varied from 71.81% (GZ to 96.08% (YN, and Delftia (10.39-38.42%, Burkholderia (2.71-15.98%, Escherichia/Shigella (4.90-25.12%, Pseudomonas (2.68-30.72% and Sphingomonas (1.83-2.05% dominated in four planting bases. Pseudomonas (17.94-22.06%, Escherichia/Shigella (6.59-11.59%, Delftia (9.65-22.14% and Burkholderia (3.12-11.05% dominated in all the tissues. According to Meta-nifH sequencing, 4 phyla and 45 genera were identified, while 17 genera and 24 genera from 4 phyla were shared by all the planting bases and all the tissues, respectively. Burkholderia and Bradyrhizobium were the most popular in the planting bases, followed by Methylovirgula and Mesorhizobium. Mesorhizobium was the most popular in different tissues, followed by Beijerinckia, Xanthobacter, and Burkholderia. Among the genera, 39 were completely overlapped with the results based on the 16S rRNA gene. In conclusion, abundant bacteria and diazotrophs were identified in common in different tissues of D. catenatum from five planting bases, which might play a great role in the supply of nutrients such as nitrogen. The exact abundance of phylum and genus on the different tissues from different planting bases need deeper sequencing with more samples.

  14. Bacterial and diazotrophic diversities of endophytes in Dendrobium catenatum determined through barcoded pyrosequencing

    Science.gov (United States)

    Li, Ou; Sun, Lihua; Guan, Chenglin; Kong, Dedong

    2017-01-01

    As an epiphyte orchid, Dendrobium catenatum relies on microorganisms for requisite nutrients. Metagenome pyrosequencing based on 16S rRNA and nifH genes was used to characterize the bacterial and diazotrophic communities associated with D. catenatum collected from 5 districts in China. Based on Meta-16S rRNA sequencing, 22 bacterial phyla and 699 genera were identified, distributed as 125 genera from 8 phyla and 319 genera from 10 phyla shared by all the planting bases and all the tissues, respectively. The predominant Proteobacteria varied from 71.81% (GZ) to 96.08% (YN), and Delftia (10.39–38.42%), Burkholderia (2.71–15.98%), Escherichia/Shigella (4.90–25.12%), Pseudomonas (2.68–30.72%) and Sphingomonas (1.83–2.05%) dominated in four planting bases. Pseudomonas (17.94–22.06%), Escherichia/Shigella (6.59–11.59%), Delftia (9.65–22.14%) and Burkholderia (3.12–11.05%) dominated in all the tissues. According to Meta-nifH sequencing, 4 phyla and 45 genera were identified, while 17 genera and 24 genera from 4 phyla were shared by all the planting bases and all the tissues, respectively. Burkholderia and Bradyrhizobium were the most popular in the planting bases, followed by Methylovirgula and Mesorhizobium. Mesorhizobium was the most popular in different tissues, followed by Beijerinckia, Xanthobacter, and Burkholderia. Among the genera, 39 were completely overlapped with the results based on the 16S rRNA gene. In conclusion, abundant bacteria and diazotrophs were identified in common in different tissues of D. catenatum from five planting bases, which might play a great role in the supply of nutrients such as nitrogen. The exact abundance of phylum and genus on the different tissues from different planting bases need deeper sequencing with more samples. PMID:28931073

  15. Transcriptomics-based analysis using RNA-Seq of the coconut (Cocos nucifera) leaf in response to yellow decline phytoplasma infection.

    Science.gov (United States)

    Nejat, Naghmeh; Cahill, David M; Vadamalai, Ganesan; Ziemann, Mark; Rookes, James; Naderali, Neda

    2015-10-01

    Invasive phytoplasmas wreak havoc on coconut palms worldwide, leading to high loss of income, food insecurity and extreme poverty of farmers in producing countries. Phytoplasmas as strictly biotrophic insect-transmitted bacterial pathogens instigate distinct changes in developmental processes and defence responses of the infected plants and manipulate plants to their own advantage; however, little is known about the cellular and molecular mechanisms underlying host-phytoplasma interactions. Further, phytoplasma-mediated transcriptional alterations in coconut palm genes have not yet been identified. This study evaluated the whole transcriptome profiles of naturally infected leaves of Cocos nucifera ecotype Malayan Red Dwarf in response to yellow decline phytoplasma from group 16SrXIV, using RNA-Seq technique. Transcriptomics-based analysis reported here identified genes involved in coconut innate immunity. The number of down-regulated genes in response to phytoplasma infection exceeded the number of genes up-regulated. Of the 39,873 differentially expressed unigenes, 21,860 unigenes were suppressed and 18,013 were induced following infection. Comparative analysis revealed that genes associated with defence signalling against biotic stimuli were significantly overexpressed in phytoplasma-infected leaves versus healthy coconut leaves. Genes involving cell rescue and defence, cellular transport, oxidative stress, hormone stimulus and metabolism, photosynthesis reduction, transcription and biosynthesis of secondary metabolites were differentially represented. Our transcriptome analysis unveiled a core set of genes associated with defence of coconut in response to phytoplasma attack, although several novel defence response candidate genes with unknown function have also been identified. This study constitutes valuable sequence resource for uncovering the resistance genes and/or susceptibility genes which can be used as genetic tools in disease resistance breeding.

  16. Deciphering the Developmental Dynamics of the Mouse Liver Transcriptome.

    Directory of Open Access Journals (Sweden)

    Sumedha S Gunewardena

    Full Text Available During development, liver undergoes a rapid transition from a hematopoietic organ to a major organ for drug metabolism and nutrient homeostasis. However, little is known on a transcriptome level of the genes and RNA-splicing variants that are differentially regulated with age, and which up-stream regulators orchestrate age-specific biological functions in liver. We used RNA-Seq to interrogate the developmental dynamics of the liver transcriptome in mice at 12 ages from late embryonic stage (2-days before birth to maturity (60-days after birth. Among 21,889 unique NCBI RefSeq-annotated genes, 9,641 were significantly expressed in at least one age, 7,289 were differently regulated with age, and 859 had multiple (> = 2 RNA splicing-variants. Factor analysis showed that the dynamics of hepatic genes fall into six distinct groups based on their temporal expression. The average expression of cytokines, ion channels, kinases, phosphatases, transcription regulators and translation regulators decreased with age, whereas the average expression of peptidases, enzymes and transmembrane receptors increased with age. The average expression of growth factors peak between Day-3 and Day-10, and decrease thereafter. We identified critical biological functions, upstream regulators, and putative transcription modules that seem to govern age-specific gene expression. We also observed differential ontogenic expression of known splicing variants of certain genes, and 1,455 novel splicing isoform candidates. In conclusion, the hepatic ontogeny of the transcriptome ontogeny has unveiled critical networks and up-stream regulators that orchestrate age-specific biological functions in liver, and suggest that age contributes to the complexity of the alternative splicing landscape of the hepatic transcriptome.

  17. Comparative glandular trichome transcriptome-based gene characterization reveals reasons for differential (-)-menthol biosynthesis in Mentha species.

    Science.gov (United States)

    Akhtar, Md Qussen; Qamar, Nida; Yadav, Pallavi; Kulkarni, Pallavi; Kumar, Ajay; Shasany, Ajit Kumar

    2017-06-01

    The genes involved in menthol biosynthesis are reported earlier in Mentha × piperita. But the information on these genes is not available in Mentha arvensis. To bridge the gap in knowledge on differential biosynthesis of monoterpenes leading to compositional variation in the essential oil of these species, a comparative transcriptome analysis of the glandular trichome (GT) was carried out. In addition to the mevalonic acid (MVA) and methylerythritol phosphate (MEP) pathway genes, about 210 and 196 different terpene synthases (TPSs) transcripts were identified from annotation in M. arvensis and M. × piperita, respectively, and correlated to several monoterpenes present in the essential oil. Six isoforms of (-)-menthol dehydrogenases (MD), the last enzyme of the menthol biosynthetic pathway, were identified, cloned and characterized from the transcriptome data (three from each species). Varied expression levels and differential enzyme kinetics of these isoforms indicated the nature and composition of the product, as these isoforms generate both (-)-menthol and (+)-neomenthol from (-)-menthone and converts (-)-menthol to (-)-menthone in the reverse reaction, and hence together determine the quantity of (-)-menthol in the essential oil in these two species. Several genes for high value minor monoterpenes could also be identified from the transcriptome data. © 2017 Scandinavian Plant Physiology Society.

  18. Cardiovascular risk protection from the Mediterranean diet and olive oil. A transcriptomic update in humans

    International Nuclear Information System (INIS)

    Carrion, S.; Torres, L.; Castañer, O.

    2016-01-01

    This review highlights the human studies that explore the benefits of the Mediterranean diet and olive oil, based on gene expression analysis. We summarized consistent human transcriptomic studies on cardiovascular risk, based on TMD and olive oil interventions, with real life doses and conditions. A literature review was carried out leading up to February 2016. The results show that the TMD, specially supplemented with virgin olive oil, produces beneficial changes in the transcriptomic response of relevant genes in cardiovascular risk such as CAT, GPX1 and SIRT2. p65 and MCP-1, IL1B, IL6, CXCL1, INF-γ, ARHGAP15 and IL7R, which are involved in inflammation; and ABCA1, SR-B1, PPARBP, PPARα, PPARγ, PPARδ, CD-36 and COX-1, which play an important role in cholesterol efflux. The available data illustrate a transcriptomic effect on atherosclerosis, inflammation and oxidative stress pathways as well as the mentioned genes. [es

  19. Cardiovascular risk protection from the Mediterranean diet and olive oil. A transcriptomic update in humans

    Directory of Open Access Journals (Sweden)

    S. Carrión

    2016-12-01

    Full Text Available This review highlights the human studies that explore the benefits of the Mediterranean diet and olive oil, based on gene expression analysis. We summarized consistent human transcriptomic studies on cardiovascular risk, based on TMD and olive oil interventions, with real life doses and conditions. A literature review was carried out leading up to February 2016. The results show that the TMD, specially supplemented with virgin olive oil, produces beneficial changes in the transcriptomic response of relevant genes in cardiovascular risk such as CAT, GPX1 and SIRT2. p65 and MCP-1, IL1B, IL6, CXCL1, INF-γ, ARHGAP15 and IL7R, which are involved in inflammation; and ABCA1, SR-B1, PPARBP, PPARα, PPARγ, PPARδ, CD-36 and COX-1, which play an important role in cholesterol efflux. The available data illustrate a transcriptomic effect on atherosclerosis, inflammation and oxidative stress pathways as well as the mentioned genes.

  20. Developmental Transcriptome for a Facultatively Eusocial Bee, Megalopta genalis.

    Science.gov (United States)

    Jones, Beryl M; Wcislo, William T; Robinson, Gene E

    2015-08-14

    Transcriptomes provide excellent foundational resources for mechanistic and evolutionary analyses of complex traits. We present a developmental transcriptome for the facultatively eusocial bee Megalopta genalis, which represents a potential transition point in the evolution of eusociality. A de novo transcriptome assembly of Megalopta genalis was generated using paired-end Illumina sequencing and the Trinity assembler. Males and females of all life stages were aligned to this transcriptome for analysis of gene expression profiles throughout development. Gene Ontology analysis indicates that stage-specific genes are involved in ion transport, cell-cell signaling, and metabolism. A number of distinct biological processes are upregulated in each life stage, and transitions between life stages involve shifts in dominant functional processes, including shifts from transcriptional regulation in embryos to metabolism in larvae, and increased lipid metabolism in adults. We expect that this transcriptome will provide a useful resource for future analyses to better understand the molecular basis of the evolution of eusociality and, more generally, phenotypic plasticity. Copyright © 2015 Jones et al.

  1. Characterization of a male reproductive transcriptome for Peromyscus eremicus (Cactus mouse

    Directory of Open Access Journals (Sweden)

    Lauren L. Kordonowy

    2016-10-01

    Full Text Available Rodents of the genus Peromyscus have become increasingly utilized models for investigations into adaptive biology. This genus is particularly powerful for research linking genetics with adaptive physiology or behaviors, and recent research has capitalized on the unique opportunities afforded by the ecological diversity of these rodents. Well characterized genomic and transcriptomic data is intrinsic to explorations of the genetic architecture responsible for ecological adaptations. Therefore, this study characterizes the transcriptome of three male reproductive tissues (testes, epididymis and vas deferens of Peromyscus eremicus (Cactus mouse, a desert specialist. The transcriptome assembly process was optimized in order to produce a high quality and substantially complete annotated transcriptome. This composite transcriptome was generated to characterize the expressed transcripts in the male reproductive tract of P. eremicus, which will serve as a crucial resource for future research investigating our hypothesis that the male Cactus mouse possesses an adaptive reproductive phenotype to mitigate water-loss from ejaculate. This study reports genes under positive selection in the male Cactus mouse reproductive transcriptome relative to transcriptomes from Peromyscus maniculatus (deer mouse and Mus musculus. Thus, this study expands upon existing genetic research in this species, and we provide a high quality transcriptome to enable further explorations of our proposed hypothesis for male Cactus mouse reproductive adaptations to minimize seminal fluid loss.

  2. De novo transcriptome assembly of ‘Angeleno’ and ‘Lamoon’ Japanese plum cultivars (Prunus salicina

    Directory of Open Access Journals (Sweden)

    Máximo González

    2016-09-01

    De novo transcriptome assembly was performed using CLC Genome Workbench software and a total of 54,584 unique contigs were generated, with an N50 of 1343 base pair (bp and a mean length of 829 bp. This work contributed with a specific Japanese plum skin transcriptome, providing two libraries of contrasting fruit skin color phenotype (yellow and red and increasing substantially the GB of raw data available until now for this specie.

  3. De novo transcriptome assembly of a sour cherry cultivar, Schattenmorelle

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2015-12-01

    Full Text Available Sour cherry (Prunus cerasus in the genus Prunus in the family Rosaceae is one of the most popular stone fruit trees worldwide. Of known sour cherry cultivars, the Schattenmorelle is a famous old sour cherry with a high amount of fruit production. The Schattenmorelle was selected before 1650 and described in the 1800s. This cultivar was named after gardens of the Chateau de Moreille in which the cultivar was initially found. In order to identify new genes and to develop genetic markers for sour cherry, we performed a transcriptome analysis of a sour cherry. We selected the cultivar Schattenmorelle, which is among commercially important cultivars in Europe and North America. We obtained 2.05 GB raw data from the Schattenmorelle (NCBI accession number: SRX1187170. De novo transcriptome assembly using Trinity identified 61,053 transcripts in which N50 was 611 bp. Next, we identified 25,585 protein coding sequences using TransDecoder. The identified proteins were blasted against NCBI's non-redundant database for annotation. Based on blast search, we taxonomically classified the obtained sequences. As a result, we provide the transcriptome of sour cherry cultivar Schattenmorelle using next generation sequencing.

  4. Pyrosequencing the Canine Faecal Microbiota: Breadth and Depth of Biodiversity

    Science.gov (United States)

    Hand, Daniel; Wallis, Corrin; Colyer, Alison; Penn, Charles W.

    2013-01-01

    Mammalian intestinal microbiota remain poorly understood despite decades of interest and investigation by culture-based and other long-established methodologies. Using high-throughput sequencing technology we now report a detailed analysis of canine faecal microbiota. The study group of animals comprised eleven healthy adult miniature Schnauzer dogs of mixed sex and age, some closely related and all housed in kennel and pen accommodation on the same premises with similar feeding and exercise regimes. DNA was extracted from faecal specimens and subjected to PCR amplification of 16S rDNA, followed by sequencing of the 5′ region that included variable regions V1 and V2. Barcoded amplicons were sequenced by Roche-454 FLX high-throughput pyrosequencing. Sequences were assigned to taxa using the Ribosomal Database Project Bayesian classifier and revealed dominance of Fusobacterium and Bacteroidetes phyla. Differences between animals in the proportions of different taxa, among 10,000 reads per animal, were clear and not supportive of the concept of a “core microbiota”. Despite this variability in prominent genera, littermates were shown to have a more similar faecal microbial composition than unrelated dogs. Diversity of the microbiota was also assessed by assignment of sequence reads into operational taxonomic units (OTUs) at the level of 97% sequence identity. The OTU data were then subjected to rarefaction analysis and determination of Chao1 richness estimates. The data indicated that faecal microbiota comprised possibly as many as 500 to 1500 OTUs. PMID:23382835

  5. Pyrosequencing the canine faecal microbiota: breadth and depth of biodiversity.

    Directory of Open Access Journals (Sweden)

    Daniel Hand

    Full Text Available Mammalian intestinal microbiota remain poorly understood despite decades of interest and investigation by culture-based and other long-established methodologies. Using high-throughput sequencing technology we now report a detailed analysis of canine faecal microbiota. The study group of animals comprised eleven healthy adult miniature Schnauzer dogs of mixed sex and age, some closely related and all housed in kennel and pen accommodation on the same premises with similar feeding and exercise regimes. DNA was extracted from faecal specimens and subjected to PCR amplification of 16S rDNA, followed by sequencing of the 5' region that included variable regions V1 and V2. Barcoded amplicons were sequenced by Roche-454 FLX high-throughput pyrosequencing. Sequences were assigned to taxa using the Ribosomal Database Project Bayesian classifier and revealed dominance of Fusobacterium and Bacteroidetes phyla. Differences between animals in the proportions of different taxa, among 10,000 reads per animal, were clear and not supportive of the concept of a "core microbiota". Despite this variability in prominent genera, littermates were shown to have a more similar faecal microbial composition than unrelated dogs. Diversity of the microbiota was also assessed by assignment of sequence reads into operational taxonomic units (OTUs at the level of 97% sequence identity. The OTU data were then subjected to rarefaction analysis and determination of Chao1 richness estimates. The data indicated that faecal microbiota comprised possibly as many as 500 to 1500 OTUs.

  6. A Long-Read Transcriptome Assembly of Cotton (Gossypium hirsutum L. and Intraspecific Single Nucleotide Polymorphism Discovery

    Directory of Open Access Journals (Sweden)

    Hamid Ashrafi

    2015-07-01

    Full Text Available Upland cotton ( L. has a narrow germplasm base, which constrains marker development and hampers intraspecific breeding. A pressing need exists for high-throughput single nucleotide polymorphism (SNP markers that can be readily applied to germplasm in breeding and breeding-related research programs. Despite progress made in developing new sequencing technologies during the past decade, the cost of sequencing remains substantial when one is dealing with numerous samples and large genomes. Several strategies have been proposed to lower the cost of sequencing for multiple genotypes of large-genome species like cotton, such as transcriptome sequencing and reduced-representation DNA sequencing. This paper reports the development of a transcriptome assembly of the inbred line Texas Marker-1 (TM-1, a genetic standard for cotton, its usefulness as a reference for RNA sequencing (RNA-seq-based SNP identification, and the availability of transcriptome sequences of four other cotton cultivars. An assembly of TM-1 was made using Roche 454 transcriptome reads combined with an assembly of all available public expressed sequence tag (EST sequences of TM-1. The TM-1 assembly consists of 72,450 contigs with a total of 70 million bp. Functional predictions of the transcripts were estimated by alignment to selected protein databases. Transcriptome sequences of the five lines, including TM-1, were obtained using an Illumina Genome Analyzer-II, and the short reads were mapped to the TM-1 assembly to discover SNPs among the five lines. We identified >14,000 unfiltered allelic SNPs, of which ∼3,700 SNPs were retained for assay development after applying several rigorous filters. This paper reports availability of the reference transcriptome assembly and shows its utility in developing intraspecific SNP markers in upland cotton.

  7. TCW: transcriptome computational workbench.

    Directory of Open Access Journals (Sweden)

    Carol Soderlund

    Full Text Available BACKGROUND: The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. METHODOLOGY: The Transcriptome Computational Workbench (TCW provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms. The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina or assembling long sequences (e.g. Sanger, 454, transcripts, annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. CONCLUSION: It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the

  8. TCW: transcriptome computational workbench.

    Science.gov (United States)

    Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R

    2013-01-01

    The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. TCW is freely available from www.agcol.arizona.edu/software/tcw.

  9. Characterization of microsatellite loci from two-spotted octopus Octopus bimaculatus Verrill 1883 from pyrosequencing reads

    Science.gov (United States)

    Domínguez-Contreras, J. F.; Munguía-Vega, A.; Ceballos-Vázquez, B. P.; Arellano-Martínez, M.; Culver, Melanie

    2014-01-01

    We characterized 22 novel microsatellite loci in the two-spotted octopus Octopus bimaculatus using 454 pyrosequencing reads. All loci were polymorphic and will be used in studies of marine connectivity aimed at increasing sustainability of the resource. The mean number alleles per locus was 13.09 (range 7–19) and observed heterozygosities ranged from 0.50 to 1.00. Four loci pairs were linked and three deviated from Hardy–Weinberg equilibrium. Eighteen and 12 loci were polymorphic in Octopus bimaculoides and Octopus hubbsorum, respectively.

  10. A pipeline for the de novo assembly of the Themira biloba (Sepsidae: Diptera) transcriptome using a multiple k-mer length approach.

    Science.gov (United States)

    Melicher, Dacotah; Torson, Alex S; Dworkin, Ian; Bowsher, Julia H

    2014-03-12

    The Sepsidae family of flies is a model for investigating how sexual selection shapes courtship and sexual dimorphism in a comparative framework. However, like many non-model systems, there are few molecular resources available. Large-scale sequencing and assembly have not been performed in any sepsid, and the lack of a closely related genome makes investigation of gene expression challenging. Our goal was to develop an automated pipeline for de novo transcriptome assembly, and to use that pipeline to assemble and analyze the transcriptome of the sepsid Themira biloba. Our bioinformatics pipeline uses cloud computing services to assemble and analyze the transcriptome with off-site data management, processing, and backup. It uses a multiple k-mer length approach combined with a second meta-assembly to extend transcripts and recover more bases of transcript sequences than standard single k-mer assembly. We used 454 sequencing to generate 1.48 million reads from cDNA generated from embryo, larva, and pupae of T. biloba and assembled a transcriptome consisting of 24,495 contigs. Annotation identified 16,705 transcripts, including those involved in embryogenesis and limb patterning. We assembled transcriptomes from an additional three non-model organisms to demonstrate that our pipeline assembled a higher-quality transcriptome than single k-mer approaches across multiple species. The pipeline we have developed for assembly and analysis increases contig length, recovers unique transcripts, and assembles more base pairs than other methods through the use of a meta-assembly. The T. biloba transcriptome is a critical resource for performing large-scale RNA-Seq investigations of gene expression patterns, and is the first transcriptome sequenced in this Dipteran family.

  11. De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata

    Directory of Open Access Journals (Sweden)

    Neeraja Cherukupalli

    2016-08-01

    Full Text Available Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information of this plant is made available in the public data base, this study mainly deals with the sequencing of RNA from A. paniculata leaf using Illumina HiSeqTM 2000 platform followed by the de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 1,70,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on the similarity searches against plant nonredundant protein database, gene ontology and eukaryotic orthologous groups, 49,363 transcripts were annotated constituting upto 58.91% of the identified unigenes. Annotation of transcripts − using kyoto encyclopedia of genes and genomes database − revealed 5,606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6,767 unique transcripts belonging to 97 different transcription factor families. A total number of 124 CYP450 transcripts belonging to seven divergent clans have been identified. Transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs in 23,168 transcripts. Assembled sequences of transcriptome of A.paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analyses besides identification of key enzymes involved in the various pathways of secondary metabolism.

  12. De novo Assembly of Leaf Transcriptome in the Medicinal Plant Andrographis paniculata

    Science.gov (United States)

    Cherukupalli, Neeraja; Divate, Mayur; Mittapelli, Suresh R.; Khareedu, Venkateswara R.; Vudem, Dashavantha R.

    2016-01-01

    Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information of this plant is made available in the public data base, this study mainly deals with the sequencing of RNA from A. paniculata leaf using Illumina HiSeq™ 2000 platform followed by the de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 1,70,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on the similarity searches against plant non-redundant protein database, gene ontology, and eukaryotic orthologous groups, 49,363 transcripts were annotated constituting upto 58.91% of the identified unigenes. Annotation of transcripts—using kyoto encyclopedia of genes and genomes database—revealed 5606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6767 unique transcripts belonging to 97 different transcription factor families. A total number of 124 CYP450 transcripts belonging to seven divergent clans have been identified. Transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. Assembled sequences of transcriptome of A. paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analysis besides identification of key enzymes involved in the various pathways of secondary metabolism. PMID:27582746

  13. High Throughput Transcriptomics @ USEPA (Toxicology ...

    Science.gov (United States)

    The ideal chemical testing approach will provide complete coverage of all relevant toxicological responses. It should be sensitive and specific It should identify the mechanism/mode-of-action (with dose-dependence). It should identify responses relevant to the species of interest. Responses should ideally be translated into tissue-, organ-, and organism-level effects. It must be economical and scalable. Using a High Throughput Transcriptomics platform within US EPA provides broader coverage of biological activity space and toxicological MOAs and helps fill the toxicological data gap. Slide presentation at the 2016 ToxForum on using High Throughput Transcriptomics at US EPA for broader coverage biological activity space and toxicological MOAs.

  14. Bacterial tag encoded FLX titanium amplicon pyrosequencing (bTEFAP based assessment of prokaryotic diversity in metagenome of Lonar soda lake, India

    Directory of Open Access Journals (Sweden)

    Pravin Dudhagara

    2015-06-01

    Full Text Available Bacterial diversity and archaeal diversity in metagenome of the Lonar soda lake sediment were assessed by bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP. Metagenome comprised 5093 sequences with 2,531,282 bp and 53 ± 2% G + C content. Metagenome sequence data are available at NCBI under the Bioproject database with accession no. PRJNA218849. Metagenome sequence represented the presence of 83.1% bacterial and 10.5% archaeal origin. A total of 14 different bacteria demonstrating 57 species were recorded with dominating species like Coxiella burnetii (17%, Fibrobacter intestinalis (12% and Candidatus Cloacamonas acidaminovorans (11%. Occurrence of two archaeal phyla representing 24 species, among them Methanosaeta harundinacea (35%, Methanoculleus chikugoensis (12% and Methanolinea tarda (11% were dominating species. Significant presence of 11% sequences as an unclassified indicated the possibilities for unknown novel prokaryotes from the metagenome.

  15. Microbial Diversity of Bovine Mastitic Milk as Described by Pyrosequencing of Metagenomic 16s rDNA

    OpenAIRE

    Oikonomou, Georgios; Machado, Vinicius Silva; Santisteban, Carlos; Schukken, Ynte Hein; Bicalho, Rodrigo Carvalho

    2012-01-01

    Dairy cow mastitis is an important disease in the dairy industry. Different microbial species have been identified as causative agents in mastitis, and are traditionally diagnosed by bacterial culture. The objective of this study was to use metagenomic pyrosequencing of bacterial 16S rRNA genes to investigate bacterial DNA diversity in milk samples of mastitic and healthy dairy cows and compare the results with those obtained by classical bacterial culture. One hundred and thirty-six milk sam...

  16. Time-based comparative transcriptomics in engineered xylose-utilizing Saccharomyces cerevisiae identifies temperature-responsive genes during ethanol production.

    Science.gov (United States)

    Ismail, Ku Syahidah Ku; Sakamoto, Takatoshi; Hasunuma, Tomohisa; Kondo, Akihiko

    2013-09-01

    Agricultural residues comprising lignocellulosic materials are excellent sources of pentose sugar, which can be converted to ethanol as fuel. Ethanol production via consolidated bioprocessing requires a suitable microorganism to withstand the harsh fermentation environment of high temperature, high ethanol concentration, and exposure to inhibitors. We genetically enhanced an industrial Saccharomyces cerevisiae strain, sun049, enabling it to uptake xylose as the sole carbon source at high fermentation temperature. This strain was able to produce 13.9 g/l ethanol from 50 g/l xylose at 38 °C. To better understand the xylose consumption ability during long-term, high-temperature conditions, we compared by transcriptomics two fermentation conditions: high temperature (38 °C) and control temperature (30 °C) during the first 12 h of fermentation. This is the first long-term, time-based transcriptomics approach, and it allowed us to discover the role of heat-responsive genes when xylose is the sole carbon source. The results suggest that genes related to amino acid, cell wall, and ribosomal protein synthesis are down-regulated under heat stress. To allow cell stability and continuous xylose uptake in order to produce ethanol, hexose transporter HXT5, heat shock proteins, ubiquitin proteins, and proteolysis were all induced at high temperature. We also speculate that the strong relationship between high temperature and increased xylitol accumulation represents the cell's mechanism to protect itself from heat degradation.

  17. Screening and Validation of Highly-Efficient Insecticidal Conotoxins from a Transcriptome-Based Dataset of Chinese Tubular Cone Snail

    Directory of Open Access Journals (Sweden)

    Bingmiao Gao

    2017-07-01

    Full Text Available Most previous studies have focused on analgesic and anti-cancer activities for the conotoxins identified from piscivorous and molluscivorous cone snails, but little attention has been devoted to insecticidal activity of conotoxins from the dominant vermivorous species. As a representative vermivorous cone snail, the Chinese tubular cone snail (Conus betulinus is the dominant Conus species inhabiting the South China Sea. We sequenced related venom transcriptomes from C. betulinus using both the next-generation sequencing and traditional Sanger sequencing technologies, and a comprehensive library of 215 conotoxin transcripts was constructed. In our current study, six conotoxins with potential insecticidal activity were screened out from our conotoxin library by homologous search with a reported positive control (alpha-conotoxin ImI from C. imperialis as the query. Subsequently, these conotoxins were synthesized by chemical solid-phase and oxidative folding for further insecticidal activity validation, such as MTT assay, insect bioassay and homology modeling. The final results proved insecticidal activities of our achieved six conotoxins from the transcriptome-based dataset. Interestingly, two of them presented a lot of high insecticidal activity, which supports their usefulness for a trial as insecticides in field investigations. In summary, our present work provides a good example for high throughput development of biological insecticides on basis of the accumulated genomic resources.

  18. Bioinformatic prediction of G protein-coupled receptor encoding sequences from the transcriptome of the foreleg, including the Haller's organ, of the cattle tick, Rhipicephalus australis.

    Directory of Open Access Journals (Sweden)

    Sergio Munoz

    Full Text Available The cattle tick of Australia, Rhipicephalus australis, is a vector for microbial parasites that cause serious bovine diseases. The Haller's organ, located in the tick's forelegs, is crucial for host detection and mating. To facilitate the development of new technologies for better control of this agricultural pest, we aimed to sequence and annotate the transcriptome of the R. australis forelegs and associated tissues, including the Haller's organ. As G protein-coupled receptors (GPCRs are an important family of eukaryotic proteins studied as pharmaceutical targets in humans, we prioritized the identification and classification of the GPCRs expressed in the foreleg tissues. The two forelegs from adult R. australis were excised, RNA extracted, and pyrosequenced with 454 technology. Reads were assembled into unigenes and annotated by sequence similarity. Python scripts were written to find open reading frames (ORFs from each unigene. These ORFs were analyzed by different GPCR prediction approaches based on sequence alignments, support vector machines, hidden Markov models, and principal component analysis. GPCRs consistently predicted by multiple methods were further studied by phylogenetic analysis and 3D homology modeling. From 4,782 assembled unigenes, 40,907 possible ORFs were predicted. Using Blastp, Pfam, GPCRpred, TMHMM, and PCA-GPCR, a basic set of 46 GPCR candidates were compiled and a phylogenetic tree was constructed. With further screening of tertiary structures predicted by RaptorX, 6 likely GPCRs emerged and the strongest candidate was classified by PCA-GPCR to be a GABAB receptor.

  19. Transcriptome analysis of the Asian honey bee Apis cerana cerana.

    Directory of Open Access Journals (Sweden)

    Zi Long Wang

    Full Text Available BACKGROUND: The Eastern hive honey bee, Apis cerana cerana is a native and widely bred honey bee species in China. Molecular biology research about this honey bee species is scarce, and genomic information for A. c. cerana is not currently available. Transcriptome and expression profiling data for this species are therefore important resources needed to better understand the biological mechanisms of A. c. cerana. In this study, we obtained the transcriptome information of A. c. cerana by RNA-sequencing and compared gene expression differences between queens and workers of A. c. cerana by digital gene expression (DGE analysis. RESULTS: Using high-throughput Illumina RNA sequencing we obtained 51,581,510 clean reads corresponding to 4.64 Gb total nucleotides from a single run. These reads were assembled into 46,999 unigenes with a mean length of 676 bp. Based on a sequence similarity search against the five public databases (NR, Swissport, GO, COG, KEGG with a cut-off E-value of 10(-5 using BLASTX, a total of 24,630 unigenes were annotated with gene descriptions, gene ontology terms, or metabolic pathways. Using these transcriptome data as references we analyzed the gene expression differences between the queens and workers of A. c. cerana using a tag-based digital gene expression method. We obtained 5.96 and 5.66 million clean tags from the queen and worker samples, respectively. A total of 414 genes were differentially expressed between them, with 189 up-regulated and 225 down-regulated in queens. CONCLUSIONS: Our transcriptome data provide a comprehensive sequence resource for future A. c. cerana study, establishing an important public information platform for functional genomic studies in A. c. cerana. Furthermore, the DGE data provide comprehensive gene expression information for the queens and workers, which will facilitate our understanding of the molecular mechanisms of the different physiological aspects of the two castes.

  20. 16S rRNA gene pyrosequencing reveals bacterial dysbiosis in the duodenum of dogs with idiopathic inflammatory bowel disease.

    Directory of Open Access Journals (Sweden)

    Jan S Suchodolski

    Full Text Available BACKGROUND: Canine idiopathic inflammatory bowel disease (IBD is believed to be caused by a complex interaction of genetic, immunologic, and microbial factors. While mucosa-associated bacteria have been implicated in the pathogenesis of canine IBD, detailed studies investigating the enteric microbiota using deep sequencing techniques are lacking. The objective of this study was to evaluate mucosa-adherent microbiota in the duodenum of dogs with spontaneous idiopathic IBD using 16 S rRNA gene pyrosequencing. METHODOLOGY/PRINCIPAL FINDINGS: Biopsy samples of small intestinal mucosa were collected endoscopically from healthy dogs (n = 6 and dogs with moderate IBD (n = 7 or severe IBD (n = 7 as assessed by a clinical disease activity index. Total RNA was extracted from biopsy specimens and 454-pyrosequencing of the 16 S rRNA gene was performed on aliquots of cDNA from each dog. Intestinal inflammation was associated with significant differences in the composition of the intestinal microbiota when compared to healthy dogs. PCoA plots based on the unweighted UniFrac distance metric indicated clustering of samples between healthy dogs and dogs with IBD (ANOSIM, p<0.001. Proportions of Fusobacteria (p = 0.010, Bacteroidaceae (p = 0.015, Prevotellaceae (p = 0.022, and Clostridiales (p = 0.019 were significantly more abundant in healthy dogs. In contrast, specific bacterial genera within Proteobacteria, including Diaphorobacter (p = 0.044 and Acinetobacter (p = 0.040, were either more abundant or more frequently identified in IBD dogs. CONCLUSIONS/SIGNIFICANCE: In conclusion, dogs with spontaneous IBD exhibit alterations in microbial groups, which bear resemblance to dysbiosis reported in humans with chronic intestinal inflammation. These bacterial groups may serve as useful targets for monitoring intestinal inflammation.

  1. Genome-Wide Constitutively Expressed Gene Analysis and New Reference Gene Selection Based on Transcriptome Data: A Case Study from Poplar/Canker Disease Interaction

    Directory of Open Access Journals (Sweden)

    Jiaping Zhao

    2017-10-01

    Full Text Available A number of transcriptome datasets for differential expression (DE genes have been widely used for understanding organismal biology, but these datasets also contain untapped information that can be used to develop more precise analytical tools. With the use of transcriptome data generated from poplar/canker disease interaction system, we describe a methodology to identify candidate reference genes from high-throughput sequencing data. This methodology will improve the accuracy of RT-qPCR and will lead to better standards for the normalization of expression data. Expression stability analysis from xylem and phloem of Populus bejingensis inoculated with the fungal canker pathogen Botryosphaeria dothidea revealed that 729 poplar transcripts (1.11% were stably expressed, at a threshold level of coefficient of variance (CV of FPKM < 20% and maximum fold change (MFC of FPKM < 2.0. Expression stability and bioinformatics analysis suggested that commonly used house-keeping (HK genes were not the most appropriate internal controls: 70 of the 72 commonly used HK genes were not stably expressed, 45 of the 72 produced multiple isoform transcripts, and some of their reported primers produced unspecific amplicons in PCR amplification. RT-qPCR analysis to compare and evaluate the expression stability of 10 commonly used poplar HK genes and 20 of the 729 newly-identified stably expressed transcripts showed that some of the newly-identified genes (such as SSU_S8e, LSU_L5e, and 20S_PSU had higher stability ranking than most of commonly used HK genes. Based on these results, we recommend a pipeline for deriving reference genes from transcriptome data. An appropriate candidate gene should have a unique transcript, constitutive expression, CV value of expression < 20% (or possibly 30% and MFC value of expression <2, and an expression level of 50–1,000 units. Lastly, when four of the newly identified HK genes were used in the normalization of expression data for 20

  2. Modular organization of the white spruce (Picea glauca) transcriptome reveals functional organization and evolutionary signatures.

    Science.gov (United States)

    Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J

    2015-07-01

    Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristerm (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  3. The testes transcriptome derived from the New World Screwworm, Cochliomyia hominivorax TSA

    Science.gov (United States)

    In a collaboration with National Center for Genome Resources researchers, we sequenced and assembled the testes transcriptome derived from the Pacora, Panama, production plant strain of the New World Screwworm, Cochliomyia hominivorax. This transcriptome contains 4,149 unigenes and the Transcriptome...

  4. Deep sequencing-based analysis of the Cymbidium ensifolium floral transcriptome.

    Directory of Open Access Journals (Sweden)

    Xiaobai Li

    Full Text Available Cymbidium ensifolium is a Chinese Cymbidium with an elegant shape, beautiful appearance, and a fragrant aroma. C. ensifolium has a long history of cultivation in China and it has excellent commercial value as a potted plant and cut flower. The development of C. ensifolium genomic resources has been delayed because of its large genome size. Taking advantage of technical and cost improvement of RNA-Seq, we extracted total mRNA from flower buds and mature flowers and obtained a total of 9.52 Gb of filtered nucleotides comprising 98,819,349 filtered reads. The filtered reads were assembled into 101,423 isotigs, representing 51,696 genes. Of the 101,423 isotigs, 41,873 were putative homologs of annotated sequences in the public databases, of which 158 were associated with floral development and 119 were associated with flowering. The isotigs were categorized according to their putative functions. In total, 10,212 of the isotigs were assigned into 25 eukaryotic orthologous groups (KOGs, 41,690 into 58 gene ontology (GO terms, and 9,830 into 126 Arabidopsis Kyoto Encyclopedia of Genes and Genomes (KEGG pathways, and 9,539 isotigs into 123 rice pathways. Comparison of the isotigs with those of the two related orchid species P. equestris and C. sinense showed that 17,906 isotigs are unique to C. ensifolium. In addition, a total of 7,936 SSRs and 16,676 putative SNPs were identified. To our knowledge, this transcriptome database is the first major genomic resource for C. ensifolium and the most comprehensive transcriptomic resource for genus Cymbidium. These sequences provide valuable information for understanding the molecular mechanisms of floral development and flowering. Sequences predicted to be unique to C. ensifolium would provide more insights into C. ensifolium gene diversity. The numerous SNPs and SSRs identified in the present study will contribute to marker development for C. ensifolium.

  5. Analysis of insecticide resistance-related genes of the Carmine spider mite Tetranychus cinnabarinus based on a de novo assembled transcriptome.

    Science.gov (United States)

    Xu, Zhifeng; Zhu, Wenyi; Liu, Yanchao; Liu, Xing; Chen, Qiushuang; Peng, Miao; Wang, Xiangzun; Shen, Guangmao; He, Lin

    2014-01-01

    The carmine spider mite (CSM), Tetranychus cinnabarinus, is an important pest mite in agriculture, because it can develop insecticide resistance easily. To gain valuable gene information and molecular basis for the future insecticide resistance study of CSM, the first transcriptome analysis of CSM was conducted. A total of 45,016 contigs and 25,519 unigenes were generated from the de novo transcriptome assembly, and 15,167 unigenes were annotated via BLAST querying against current databases, including nr, SwissProt, the Clusters of Orthologous Groups (COGs), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO). Aligning the transcript to Tetranychus urticae genome, the 19255 (75.45%) of the transcripts had significant (e-value insecticide resistance in arthropod were generated from CSM transcriptome, including 53 P450-, 22 GSTs-, 23 CarEs-, 1 AChE-, 7 GluCls-, 9 nAChRs-, 8 GABA receptor-, 1 sodium channel-, 6 ATPase- and 12 Cyt b genes. We developed significant molecular resources for T. cinnabarinus putatively involved in insecticide resistance. The transcriptome assembly analysis will significantly facilitate our study on the mechanism of adapting environmental stress (including insecticide) in CSM at the molecular level, and will be very important for developing new control strategies against this pest mite.

  6. SNP discovery in the transcriptome of white Pacific shrimp Litopenaeus vannamei by next generation sequencing.

    Directory of Open Access Journals (Sweden)

    Yang Yu

    Full Text Available The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies.

  7. De novo transcriptome assembly of two contrasting pumpkin cultivars

    Directory of Open Access Journals (Sweden)

    Aliki Xanthopoulou

    2016-03-01

    Full Text Available Cucurbita pepo (squash, pumpkin, gourd, a worldwide-cultivated vegetable of American origin, is extremely variable in fruit characteristics. However, the information associated with genes and genetic markers for pumpkin is very limited. In order to identify new genes and to develop genetic markers, we performed a transcriptome analysis (RNA-Seq of two contrasting pumpkin cultivars. Leaves and female flowers of cultivars, ‘Big Moose’ with large round fruits and ‘Munchkin’ with small round fruits, were harvested for total RNA extraction. We obtained a total of 6 GB (Big Moose; http://www.ncbi.nlm.nih.gov/Traces/sra/?run=SRR3056882 and 5 GB (Munchkin; http://www.ncbi.nlm.nih.gov/Traces/sra/?run=SRR3056883 sequence data (NCBI SRA database SRX1502732 and SRX1502735, respectively, which correspond to 18,055,786 and 14,824,292 150-base reads. After quality assessment, the clean sequences where 17,995,932 and 14,774,486 respectively. The numbers of total transcripts for ‘Big Moose’ and ‘Munchkin’ were 84,727 and 68,051, respectively. TransDecoder identified possible coding regions in assembled transcripts. This study provides transcriptome data for two contrasting pumpkin cultivars, which might be useful for genetic marker development and comparative transcriptome analyses. Keywords: RNA-Seq, Pumpkin, Contrasting cultivars, Cucurbita pepo

  8. Pyrosequencing analysis yields comprehensive assessment of microbial communities in pilot-scale two-stage membrane biofilm reactors.

    Science.gov (United States)

    Ontiveros-Valencia, Aura; Tang, Youneng; Zhao, He-Ping; Friese, David; Overstreet, Ryan; Smith, Jennifer; Evans, Patrick; Rittmann, Bruce E; Krajmalnik-Brown, Rosa

    2014-07-01

    We studied the microbial community structure of pilot two-stage membrane biofilm reactors (MBfRs) designed to reduce nitrate (NO3(-)) and perchlorate (ClO4(-)) in contaminated groundwater. The groundwater also contained oxygen (O2) and sulfate (SO4(2-)), which became important electron sinks that affected the NO3(-) and ClO4(-) removal rates. Using pyrosequencing, we elucidated how important phylotypes of each "primary" microbial group, i.e., denitrifying bacteria (DB), perchlorate-reducing bacteria (PRB), and sulfate-reducing bacteria (SRB), responded to changes in electron-acceptor loading. UniFrac, principal coordinate analysis (PCoA), and diversity analyses documented that the microbial community of biofilms sampled when the MBfRs had a high acceptor loading were phylogenetically distant from and less diverse than the microbial community of biofilm samples with lower acceptor loadings. Diminished acceptor loading led to SO4(2-) reduction in the lag MBfR, which allowed Desulfovibrionales (an SRB) and Thiothrichales (sulfur-oxidizers) to thrive through S cycling. As a result of this cooperative relationship, they competed effectively with DB/PRB phylotypes such as Xanthomonadales and Rhodobacterales. Thus, pyrosequencing illustrated that while DB, PRB, and SRB responded predictably to changes in acceptor loading, a decrease in total acceptor loading led to important shifts within the "primary" groups, the onset of other members (e.g., Thiothrichales), and overall greater diversity.

  9. Denaturing gradient gel electrophoresis and barcoded pyrosequencing reveal unprecedented archaeal diversity in mangrove sediment and rhizosphere samples.

    Science.gov (United States)

    Pires, Ana C C; Cleary, Daniel F R; Almeida, Adelaide; Cunha, Angela; Dealtry, Simone; Mendonça-Hagler, Leda C S; Smalla, Kornelia; Gomes, Newton C M

    2012-08-01

    Mangroves are complex ecosystems that regulate nutrient and sediment fluxes to the open sea. The importance of bacteria and fungi in regulating nutrient cycles has led to an interest in their diversity and composition in mangroves. However, very few studies have assessed Archaea in mangroves, and virtually nothing is known about whether mangrove rhizospheres affect archaeal diversity and composition. Here, we studied the diversity and composition of Archaea in mangrove bulk sediment and the rhizospheres of two mangrove trees, Rhizophora mangle and Laguncularia racemosa, using denaturing gradient gel electrophoresis (DGGE) and pyrosequencing of archaeal 16S rRNA genes with a nested-amplification approach. DGGE profiles revealed significant structural differences between bulk sediment and rhizosphere samples, suggesting that roots of both mangrove species influence the sediment archaeal community. Nearly all of the detected sequences obtained with pyrosequencing were identified as Archaea, but most were unclassified at the level of phylum or below. Archaeal richness was, furthermore, the highest in the L. racemosa rhizosphere, intermediate in bulk sediment, and the lowest in the R. mangle rhizosphere. This study shows that rhizosphere microhabitats of R. mangle and L. racemosa, common plants in subtropical mangroves located in Rio de Janeiro, Brazil, hosted distinct archaeal assemblages.

  10. Cyanobacterial composition and spatial distribution based on pyrosequencing data in the Gurbantunggut Desert, Northwestern China.

    Science.gov (United States)

    Zhang, Bingchang; Li, Renhui; Xiao, Peng; Su, Yangui; Zhang, Yuanming

    2016-03-01

    Cyanobacteria are the primary colonizers and form a dominant component of soil photosynthetic communities in biological soil crusts. They are crucial in improving soil environments, namely accumulating soil carbon and nitrogen. Many classical studies have examined cyanobacterial diversity in desert crusts, but relatively few comprehensive molecular surveys have been conducted. We used 454 pyrosequencing of 16S rRNA to investigate cyanobacterial composition and distribution on regional scales in the Gurbantunggut Desert. The relationship between cyanobacterial distribution and environmental factors was also explored. A total of 24,973 cyanobacteria partial 16S rRNA gene sequences were obtained, and 507OTUs were selected, as most OTUs had very few reads. Among these, 347 OTU sequences were of cyanobacteria origin, belonging to Oscillatoriales, Nostocales, Chroococcales, and uncultured cyanobacterium clone, respectively. Microcoleus vaginatus, Chroococcidiopsis spp. and M. steenstrupii were the dominant species in most areas of the Gurbantunggut Desert. Compared with other desert, the Gurbantunggut Desert differed in the prominence of Chroococcidiopsis spp. and lack of Pseudanabaenales. Species composition and abundance of cyanobacteria also showed distinct variations. Soil texture, precipitation, and nutrients and salt levels affected cyanobacterial distribution. Increased precipitation was helpful in improving cyanobacterial diversity. A higher content of coarse sand promoted the colonization and growth of Oscillatoriales and some phylotypes of Chroococcales. The fine-textured soil with higher nutrients and salts supported more varied populations of cyanobacteria, namely some heterocystous cyanobacteria. The results suggested that the Gurbantunggut Desert was rich in cyanobacteria and that precipitation was a primary regulating factor for cyanobacterial composition on a regional scale. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. Pyrosequencing as a tool for the identification of common isolates of Mycobacterium sp.

    Science.gov (United States)

    Tuohy, Marion J; Hall, Gerri S; Sholtis, Mary; Procop, Gary W

    2005-04-01

    Pyrosequencing technology, sequencing by addition, was evaluated for categorization of mycobacterial isolates. One hundred and eighty-nine isolates, including 18 ATCC and Trudeau Mycobacterial Culture Collection (TMC) strains, were studied. There were 38 Mycobacterium tuberculosis complex, 27 M. kansasii, 27 MAI complex, 21 M. marinum, 14 M. gordonae, 20 M. chelonae-abscessus group, 10 M. fortuitum, 5 M. xenopi, 3 M. celatum, 2 M. terrae complex, 20 M. mucogenicum, and 2 M. scrofulaceum. Nucleic acid extracts were prepared from solid media or MGIT broth. Traditional PCR was performed with one of the primers biotinylated; the assay targeted a portion of the 16S rRNA gene that contains a hypervariable region, which has been previously shown to be useful for the identification of mycobacteria. The PSQ Sample Preparation Kit was used, and the biotinylated PCR product was processed to a single-stranded DNA template. The sequencing primer was hybridized to the DNA template in a PSQ96 plate. Incorporation of the complementary nucleotides resulted in light generation peaks, forming a pyrogram, which was evaluated by the instrument software. Thirty basepairs were used for isolate categorization. Manual interpretation of the sequences was performed if the quality of the 30-bp sequence was in doubt or if more than 4 bp homopolymers were recognized. Sequences with more than 5 bp of bad quality were deemed unacceptable. When blasted against GenBank, 179 of 189 sequences (94.7%) assigned isolates to the correct molecular genus or group. Ten M. gordonae isolates had more than 5 bp of bad quality sequence and were not accepted. Pyrosequencing of this hypervariable region afforded rapid and acceptable characterization of common, routinely isolated clinical Mycobacterium sp. Algorithms are recommended for further differentiation with an additional sequencing primer or additional biochemicals.

  12. Transcriptome and proteomic analysis of mango (Mangifera indica Linn) fruits.

    Science.gov (United States)

    Wu, Hong-xia; Jia, Hui-min; Ma, Xiao-wei; Wang, Song-biao; Yao, Quan-sheng; Xu, Wen-tian; Zhou, Yi-gang; Gao, Zhong-shan; Zhan, Ru-lin

    2014-06-13

    Here we used Illumina RNA-seq technology for transcriptome sequencing of a mixed fruit sample from 'Zill' mango (Mangifera indica Linn) fruit pericarp and pulp during the development and ripening stages. RNA-seq generated 68,419,722 sequence reads that were assembled into 54,207 transcripts with a mean length of 858bp, including 26,413 clusters and 27,794 singletons. A total of 42,515(78.43%) transcripts were annotated using public protein databases, with a cut-off E-value above 10(-5), of which 35,198 and 14,619 transcripts were assigned to gene ontology terms and clusters of orthologous groups respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes database identified 23,741(43.79%) transcripts which were mapped to 128 pathways. These pathways revealed many previously unknown transcripts. We also applied mass spectrometry-based transcriptome data to characterize the proteome of ripe fruit. LC-MS/MS analysis of the mango fruit proteome was using tandem mass spectrometry (MS/MS) in an LTQ Orbitrap Velos (Thermo) coupled online to the HPLC. This approach enabled the identification of 7536 peptides that matched 2754 proteins. Our study provides a comprehensive sequence for a systemic view of transcriptome during mango fruit development and the most comprehensive fruit proteome to date, which are useful for further genomics research and proteomic studies. Our study provides a comprehensive sequence for a systemic view of both the transcriptome and proteome of mango fruit, and a valuable reference for further research on gene expression and protein identification. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. Transcriptome analysis of the response of Burmese python to digestion.

    Science.gov (United States)

    Duan, Jinjie; Sanggaard, Kristian Wejse; Schauser, Leif; Lauridsen, Sanne Enok; Enghild, Jan J; Schierup, Mikkel Heide; Wang, Tobias

    2017-08-01

    Exceptional and extreme feeding behaviour makes the Burmese python (Python bivittatus) an interesting model to study physiological remodelling and metabolic adaptation in response to refeeding after prolonged starvation. In this study, we used transcriptome sequencing of 5 visceral organs during fasting as well as 24 hours and 48 hours after ingestion of a large meal to unravel the postprandial changes in Burmese pythons. We first used the pooled data to perform a de novo assembly of the transcriptome and supplemented this with a proteomic survey of enzymes in the plasma and gastric fluid. We constructed a high-quality transcriptome with 34 423 transcripts, of which 19 713 (57%) were annotated. Among highly expressed genes (fragments per kilo base per million sequenced reads > 100 in 1 tissue), we found that the transition from fasting to digestion was associated with differential expression of 43 genes in the heart, 206 genes in the liver, 114 genes in the stomach, 89 genes in the pancreas, and 158 genes in the intestine. We interrogated the function of these genes to test previous hypotheses on the response to feeding. We also used the transcriptome to identify 314 secreted proteins in the gastric fluid of the python. Digestion was associated with an upregulation of genes related to metabolic processes, and translational changes therefore appear to support the postprandial rise in metabolism. We identify stomach-related proteins from a digesting individual and demonstrate that the sensitivity of modern liquid chromatography/tandem mass spectrometry equipment allows the identification of gastric juice proteins that are present during digestion. © The Authors 2017. Published by Oxford University Press.

  14. Comprehensive transcriptome analysis provides new insights into nutritional strategies and phylogenetic relationships of chrysophytes

    Directory of Open Access Journals (Sweden)

    Daniela Beisser

    2017-01-01

    Full Text Available Background Chrysophytes are protist model species in ecology and ecophysiology and important grazers of bacteria-sized microorganisms and primary producers. However, they have not yet been investigated in detail at the molecular level, and no genomic and only little transcriptomic information is available. Chrysophytes exhibit different trophic modes: while phototrophic chrysophytes perform only photosynthesis, mixotrophs can gain carbon from bacterial food as well as from photosynthesis, and heterotrophs solely feed on bacteria-sized microorganisms. Recent phylogenies and megasystematics demonstrate an immense complexity of eukaryotic diversity with numerous transitions between phototrophic and heterotrophic organisms. The question we aim to answer is how the diverse nutritional strategies, accompanied or brought about by a reduction of the plasmid and size reduction in heterotrophic strains, affect physiology and molecular processes. Results We sequenced the mRNA of 18 chrysophyte strains on the Illumina HiSeq platform and analysed the transcriptomes to determine relations between the trophic mode (mixotrophic vs. heterotrophic and gene expression. We observed an enrichment of genes for photosynthesis, porphyrin and chlorophyll metabolism for phototrophic and mixotrophic strains that can perform photosynthesis. Genes involved in nutrient absorption, environmental information processing and various transporters (e.g., monosaccharide, peptide, lipid transporters were present or highly expressed only in heterotrophic strains that have to sense, digest and absorb bacterial food. We furthermore present a transcriptome-based alignment-free phylogeny construction approach using transcripts assembled from short reads to determine the evolutionary relationships between the strains and the possible influence of nutritional strategies on the reconstructed phylogeny. We discuss the resulting phylogenies in comparison to those from established approaches

  15. Comparative Transcriptomes and EVO-DEVO Studies Depending on Next Generation Sequencing

    Directory of Open Access Journals (Sweden)

    Tiancheng Liu

    2015-01-01

    Full Text Available High throughput technology has prompted the progressive omics studies, including genomics and transcriptomics. We have reviewed the improvement of comparative omic studies, which are attributed to the high throughput measurement of next generation sequencing technology. Comparative genomics have been successfully applied to evolution analysis while comparative transcriptomics are adopted in comparison of expression profile from two subjects by differential expression or differential coexpression, which enables their application in evolutionary developmental biology (EVO-DEVO studies. EVO-DEVO studies focus on the evolutionary pressure affecting the morphogenesis of development and previous works have been conducted to illustrate the most conserved stages during embryonic development. Old measurements of these studies are based on the morphological similarity from macro view and new technology enables the micro detection of similarity in molecular mechanism. Evolutionary model of embryo development, which includes the “funnel-like” model and the “hourglass” model, has been evaluated by combination of these new comparative transcriptomic methods with prior comparative genomic information. Although the technology has promoted the EVO-DEVO studies into a new era, technological and material limitation still exist and further investigations require more subtle study design and procedure.

  16. Transcriptome Analysis of an Insecticide Resistant Housefly Strain: Insights about SNPs and Regulatory Elements in Cytochrome P450 Genes.

    Science.gov (United States)

    Mahmood, Khalid; Højland, Dorte H; Asp, Torben; Kristensen, Michael

    2016-01-01

    Insecticide resistance in the housefly, Musca domestica, has been investigated for more than 60 years. It will enter a new era after the recent publication of the housefly genome and the development of multiple next generation sequencing technologies. The genetic background of the xenobiotic response can now be investigated in greater detail. Here, we investigate the 454-pyrosequencing transcriptome of the spinosad-resistant 791spin strain in relation to the housefly genome with focus on P450 genes. The de novo assembly of clean reads gave 35,834 contigs consisting of 21,780 sequences of the spinosad resistant strain. The 3,648 sequences were annotated with an enzyme code EC number and were mapped to 124 KEGG pathways with metabolic processes as most highly represented pathway. One hundred and twenty contigs were annotated as P450s covering 44 different P450 genes of housefly. Eight differentially expressed P450s genes were identified and investigated for SNPs, CpG islands and common regulatory motifs in promoter and coding regions. Functional annotation clustering of metabolic related genes and motif analysis of P450s revealed their association with epigenetic, transcription and gene expression related functions. The sequence variation analysis resulted in 12 SNPs and eight of them found in cyp6d1. There is variation in location, size and frequency of CpG islands and specific motifs were also identified in these P450s. Moreover, identified motifs were associated to GO terms and transcription factors using bioinformatic tools. Transcriptome data of a spinosad resistant strain provide together with genome data fundamental support for future research to understand evolution of resistance in houseflies. Here, we report for the first time the SNPs, CpG islands and common regulatory motifs in differentially expressed P450s. Taken together our findings will serve as a stepping stone to advance understanding of the mechanism and role of P450s in xenobiotic detoxification.

  17. Principle considerations for the use of transcriptomics in doping research.

    Science.gov (United States)

    Neuberger, Elmo W I; Moser, Dirk A; Simon, Perikles

    2011-10-01

    Over the course of the past decade, technical progress has enabled scientists to investigate genome-wide RNA expression using microarray platforms. This transcriptomic approach represents a promising tool for the discovery of basic gene expression patterns and for identification of cellular signalling pathways under various conditions. Since doping substances have been shown to influence mRNA expression, it has been suggested that these changes can be detected by screening the blood transcriptome. In this review, we critically discuss the potential but also the pitfalls of this application as a tool in doping research. Transcriptomic approaches were considered to potentially provide researchers with a unique gene expression signature or with a specific biomarker for various physiological and pathophysiological conditions. Since transcriptomic approaches are considerably prone to biological and technical confounding factors that act on study subjects or samples, very strict guidelines for the use of transcriptomics in human study subjects have been developed. Typical field conditions associated with doping controls limit the feasibility of following these strict guidelines as there are too many variables counteracting a standardized procedure. After almost a decade of research using transcriptomic tools, it still remains a matter of future technological progress to identify the ultimate biomarker using technologies and/or methodologies that are sufficiently robust against typical biological and technical bias and that are valid in a court of law. Copyright © 2011 John Wiley & Sons, Ltd.

  18. Characterisation Of The Porcine Lung Transcriptome Using High-Throughput Pyrosequencing

    DEFF Research Database (Denmark)

    Panitz, Frank; Nielsen, Rasmus Ory; Andersen, Pernille K

    abundance. Our objective was to investigate animals previously not affected by lung disease and those that had been affected. To this end lung tissue samples were collected, separately pooled and tagged before sequencing using the Roche/454 FLX platform. We sequenced about one million reads that were...... clustered and mapped to the current pig genome reference sequence. Identified genes or clusters were annotated for functional classes and mined for singe nucleotide polymorphisms. In addition, we compared gene expression between sample groups in order to investigate possible changes in the lung...

  19. Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing.

    Science.gov (United States)

    Zhang, Jin; Ruhlman, Tracey A; Mower, Jeffrey P; Jansen, Robert K

    2013-12-29

    Organelle genomes of Geraniaceae exhibit several unusual evolutionary phenomena compared to other angiosperm families including accelerated nucleotide substitution rates, widespread gene loss, reduced RNA editing, and extensive genomic rearrangements. Since most organelle-encoded proteins function in multi-subunit complexes that also contain nuclear-encoded proteins, it is likely that the atypical organellar phenomena affect the evolution of nuclear genes encoding organellar proteins. To begin to unravel the complex co-evolutionary interplay between organellar and nuclear genomes in this family, we sequenced nuclear transcriptomes of two species, Geranium maderense and Pelargonium x hortorum. Normalized cDNA libraries of G. maderense and P. x hortorum were used for transcriptome sequencing. Five assemblers (MIRA, Newbler, SOAPdenovo, SOAPdenovo-trans [SOAPtrans], Trinity) and two next-generation technologies (454 and Illumina) were compared to determine the optimal transcriptome sequencing approach. Trinity provided the highest quality assembly of Illumina data with the deepest transcriptome coverage. An analysis to determine the amount of sequencing needed for de novo assembly revealed diminishing returns of coverage and quality with data sets larger than sixty million Illumina paired end reads for both species. The G. maderense and P. x hortorum transcriptomes contained fewer transcripts encoding the PLS subclass of PPR proteins relative to other angiosperms, consistent with reduced mitochondrial RNA editing activity in Geraniaceae. In addition, transcripts for all six plastid targeted sigma factors were identified in both transcriptomes, suggesting that one of the highly divergent rpoA-like ORFs in the P. x hortorum plastid genome is functional. The findings support the use of the Illumina platform and assemblers optimized for transcriptome assembly, such as Trinity or SOAPtrans, to generate high-quality de novo transcriptomes with broad coverage. In addition

  20. De Novo Assembly and Characterization of the Transcriptome of Grasshopper Shirakiacris shirakii

    Directory of Open Access Journals (Sweden)

    Zhongying Qiu

    2016-07-01

    Full Text Available Background: The grasshopper Shirakiacris shirakii is an important agricultural pest and feeds mainly on gramineous plants, thereby causing economic damage to a wide range of crops. However, genomic information on this species is extremely limited thus far, and transcriptome data relevant to insecticide resistance and pest control are also not available. Methods: The transcriptome of S. shirakii was sequenced using the Illumina HiSeq platform, and we de novo assembled the transcriptome. Results: Its sequencing produced a total of 105,408,878 clean reads, and the de novo assembly revealed 74,657 unigenes with an average length of 680 bp and N50 of 1057 bp. A total of 28,173 unigenes were annotated for the NCBI non-redundant protein sequences (Nr, NCBI non-redundant nucleotide sequences (Nt, a manually-annotated and reviewed protein sequence database (Swiss-Prot, Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG databases. Based on the Nr annotation results, we manually identified 79 unigenes encoding cytochrome P450 monooxygenases (P450s, 36 unigenes encoding carboxylesterases (CarEs and 36 unigenes encoding glutathione S-transferases (GSTs in S. shirakii. Core RNAi components relevant to miroRNA, siRNA and piRNA pathways, including Pasha, Loquacious, Argonaute-1, Argonaute-2, Argonaute-3, Zucchini, Aubergine, enhanced RNAi-1 and Piwi, were expressed in S. shirakii. We also identified five unigenes that were homologous to the Sid-1 gene. In addition, the analysis of differential gene expressions revealed that a total of 19,764 unigenes were up-regulated and 4185 unigenes were down-regulated in larvae. In total, we predicted 7504 simple sequence repeats (SSRs from 74,657 unigenes. Conclusions: The comprehensive de novo transcriptomic data of S. shirakii will offer a series of valuable molecular resources for better studying insecticide resistance, RNAi and molecular marker discovery in the transcriptome.

  1. Blood transcriptomics and metabolomics for personalized medicine.

    Science.gov (United States)

    Li, Shuzhao; Todor, Andrei; Luo, Ruiyan

    2016-01-01

    Molecular analysis of blood samples is pivotal to clinical diagnosis and has been intensively investigated since the rise of systems biology. Recent developments have opened new opportunities to utilize transcriptomics and metabolomics for personalized and precision medicine. Efforts from human immunology have infused into this area exquisite characterizations of subpopulations of blood cells. It is now possible to infer from blood transcriptomics, with fine accuracy, the contribution of immune activation and of cell subpopulations. In parallel, high-resolution mass spectrometry has brought revolutionary analytical capability, detecting > 10,000 metabolites, together with environmental exposure, dietary intake, microbial activity, and pharmaceutical drugs. Thus, the re-examination of blood chemicals by metabolomics is in order. Transcriptomics and metabolomics can be integrated to provide a more comprehensive understanding of the human biological states. We will review these new data and methods and discuss how they can contribute to personalized medicine.

  2. The Drosophila surface glia transcriptome: evolutionary conserved blood-brain barrier processes.

    Directory of Open Access Journals (Sweden)

    Michael K DeSalvo

    2014-11-01

    Full Text Available AbstractCentral nervous system (CNS function is dependent on the stringent regulation of metabolites, drugs, cells, and pathogens exposed to the CNS space. Cellular blood-brain barrier (BBB structures are highly specific checkpoints governing entry and exit of all small molecules to and from the brain interstitial space, but the precise mechanisms that regulate the BBB are not well understood. In addition, the BBB has long been a challenging obstacle to the pharmacologic treatment of CNS diseases; thus model systems that can parse the functions of the BBB are highly desirable. In this study, we sought to define the transcriptome of the adult Drosophila melanogaster BBB by isolating the BBB surface glia with FACS and profiling their gene expression with microarrays. By comparing the transcriptome of these surface glia to that of all brain glia, brain neurons, and whole brains, we present a catalog of transcripts that are selectively enriched at the Drosophila BBB. We found that the fly surface glia show high expression of many ABC and SLC transporters, cell adhesion molecules, metabolic enzymes, signaling molecules, and components of xenobiotic metabolism pathways. Using gene sequence-based alignments, we compare the Drosophila and Murine BBB transcriptomes and discover many shared chemoprotective and small molecule control pathways, thus affirming the relevance of invertebrate models for studying evolutionary conserved BBB properties. The Drosophila BBB transcriptome is valuable to vertebrate and insect biologists alike as a resource for studying proteins underlying diffusion barrier development and maintenance, glial biology, and regulation of drug transport at tissue barriers.

  3. The Drosophila surface glia transcriptome: evolutionary conserved blood-brain barrier processes.

    Science.gov (United States)

    DeSalvo, Michael K; Hindle, Samantha J; Rusan, Zeid M; Orng, Souvinh; Eddison, Mark; Halliwill, Kyle; Bainton, Roland J

    2014-01-01

    Central nervous system (CNS) function is dependent on the stringent regulation of metabolites, drugs, cells, and pathogens exposed to the CNS space. Cellular blood-brain barrier (BBB) structures are highly specific checkpoints governing entry and exit of all small molecules to and from the brain interstitial space, but the precise mechanisms that regulate the BBB are not well understood. In addition, the BBB has long been a challenging obstacle to the pharmacologic treatment of CNS diseases; thus model systems that can parse the functions of the BBB are highly desirable. In this study, we sought to define the transcriptome of the adult Drosophila melanogaster BBB by isolating the BBB surface glia with fluorescence activated cell sorting (FACS) and profiling their gene expression with microarrays. By comparing the transcriptome of these surface glia to that of all brain glia, brain neurons, and whole brains, we present a catalog of transcripts that are selectively enriched at the Drosophila BBB. We found that the fly surface glia show high expression of many ATP-binding cassette (ABC) and solute carrier (SLC) transporters, cell adhesion molecules, metabolic enzymes, signaling molecules, and components of xenobiotic metabolism pathways. Using gene sequence-based alignments, we compare the Drosophila and Murine BBB transcriptomes and discover many shared chemoprotective and small molecule control pathways, thus affirming the relevance of invertebrate models for studying evolutionary conserved BBB properties. The Drosophila BBB transcriptome is valuable to vertebrate and insect biologists alike as a resource for studying proteins underlying diffusion barrier development and maintenance, glial biology, and regulation of drug transport at tissue barriers.

  4. Analysis of soil fungal communities by amplicon pyrosequencing: current approaches to data analysis and the introduction of the pipeline SEED

    Czech Academy of Sciences Publication Activity Database

    Větrovský, Tomáš; Baldrian, Petr

    2013-01-01

    Roč. 49, č. 8 (2013), s. 1027-1037 ISSN 0178-2762 R&D Projects: GA MŠk LD12050; GA MŠk LD12048; GA ČR GAP504/12/0709 Institutional support: RVO:61388971 Keywords : Fungal community * Internal transcribed spacer * Pyrosequencing pipeline Subject RIV: EE - Microbiology, Virology Impact factor: 3.396, year: 2013

  5. Introduction of the hybcell-based compact sequencing technology and comparison to state-of-the-art methodologies for KRAS mutation detection.

    Science.gov (United States)

    Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus

    2015-03-01

    The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.

  6. Analysis of RET promoter CpG island methylation using methylation-specific PCR (MSP), pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM): impact on stage II colon cancer patient outcome.

    Science.gov (United States)

    Draht, Muriel X G; Smits, Kim M; Jooste, Valérie; Tournier, Benjamin; Vervoort, Martijn; Ramaekers, Chantal; Chapusot, Caroline; Weijenberg, Matty P; van Engeland, Manon; Melotte, Veerle

    2016-01-01

    Already since the 1990s, promoter CpG island methylation markers have been considered promising diagnostic, prognostic, and predictive cancer biomarkers. However, so far, only a limited number of DNA methylation markers have been introduced into clinical practice. One reason why the vast majority of methylation markers do not translate into clinical applications is lack of independent validation of methylation markers, often caused by differences in methylation analysis techniques. We recently described RET promoter CpG island methylation as a potential prognostic marker in stage II colorectal cancer (CRC) patients of two independent series. In the current study, we analyzed the RET promoter CpG island methylation of 241 stage II colon cancer patients by direct methylation-specific PCR (MSP), nested-MSP, pyrosequencing, and methylation-sensitive high-resolution melting (MS-HRM). All primers were designed as close as possible to the same genomic region. In order to investigate the effect of different DNA methylation assays on patient outcome, we assessed the clinical sensitivity and specificity as well as the association of RET methylation with overall survival for three and five years of follow-up. Using direct-MSP and nested-MSP, 12.0 % (25/209) and 29.6 % (71/240) of the patients showed RET promoter CpG island methylation. Methylation frequencies detected by pyrosequencing were related to the threshold for positivity that defined RET methylation. Methylation frequencies obtained by pyrosequencing (threshold for positivity at 20 %) and MS-HRM were 13.3 % (32/240) and 13.8 % (33/239), respectively. The pyrosequencing threshold for positivity of 20 % showed the best correlation with MS-HRM and direct-MSP results. Nested-MSP detected RET promoter CpG island methylation in deceased patients with a higher sensitivity (33.1 %) compared to direct-MSP (10.7 %), pyrosequencing (14.4 %), and MS-HRM (15.4 %). While RET methylation frequencies detected by nested

  7. Analysis of a human brain transcriptome map

    Directory of Open Access Journals (Sweden)

    Greene Jonathan R

    2002-04-01

    Full Text Available Abstract Background Genome wide transcriptome maps can provide tools to identify candidate genes that are over-expressed or silenced in certain disease tissue and increase our understanding of the structure and organization of the genome. Expressed Sequence Tags (ESTs from the public dbEST and proprietary Incyte LifeSeq databases were used to derive a transcript map in conjunction with the working draft assembly of the human genome sequence. Results Examination of ESTs derived from brain tissues (excluding brain tumor tissues suggests that these genes are distributed on chromosomes in a non-random fashion. Some regions on the genome are dense with brain-enriched genes while some regions lack brain-enriched genes, suggesting a significant correlation between distribution of genes along the chromosome and tissue type. ESTs from brain tumor tissues have also been mapped to the human genome working draft. We reveal that some regions enriched in brain genes show a significant decrease in gene expression in brain tumors, and, conversely that some regions lacking in brain genes show an increased level of gene expression in brain tumors. Conclusions This report demonstrates a novel approach for tissue specific transcriptome mapping using EST-based quantitative assessment.

  8. Cutoffs and k-mers: implications from a transcriptome study in allopolyploid plants

    Directory of Open Access Journals (Sweden)

    Gruenheit Nicole

    2012-03-01

    Full Text Available Abstract Background Transcriptome analysis is increasingly being used to study the evolutionary origins and ecology of non-model plants. One issue for both transcriptome assembly and differential gene expression analyses is the common occurrence in plants of hybridisation and whole genome duplication (WGD and hybridization resulting in allopolyploidy. The divergence of duplicated genes following WGD creates near identical homeologues that can be problematic for de novo assembly and also reference based assembly protocols that use short reads (35 - 100 bp. Results Here we report a successful strategy for the assembly of two transcriptomes made using 75 bp Illumina reads from Pachycladon fastigiatum and Pachycladon cheesemanii. Both are allopolyploid plant species (2n = 20 that originated in the New Zealand Alps about 0.8 million years ago. In a systematic analysis of 19 different coverage cutoffs and 20 different k-mer sizes we showed that i none of the genes could be assembled across all of the parameter space ii assembly of each gene required an optimal set of parameter values and iii these parameter values could be explained in part by different gene expression levels and different degrees of similarity between genes. Conclusions To obtain optimal transcriptome assemblies for allopolyploid plants, k-mer size and k-mer coverage need to be considered simultaneously across a broad parameter space. This is important for assembling a maximum number of full length ESTs and for avoiding chimeric assemblies of homeologous and paralogous gene copies.

  9. Pyrosequencing Analysis of the Microbial Diversity of Airag, Khoormog and Tarag, Traditional Fermented Dairy Products of Mongolia

    OpenAIRE

    OKI, Kaihei; DUGERSUREN, Jamyan; DEMBEREL, Shirchin; WATANABE, Koichi

    2014-01-01

    Here, we used pyrosequencing to obtain a detailed analysis of the microbial diversities of traditional fermented dairy products of Mongolia. From 22 Airag (fermented mare’s milk), 5 Khoormog (fermented camel’s milk) and 26 Tarag (fermented milk of cows, goats and yaks) samples collected in the Mongolian provinces of Arhangai, Bulgan, Dundgobi, Tov, Uburhangai and Umnugobi, we obtained a total of 81 operational taxonomic units, which were assigned to 15 families, 21 genera and 41 species in 3 ...

  10. De novo assembling and primary analysis of genome and transcriptome of gray whale Eschrichtius robustus.

    Science.gov (United States)

    Moskalev, Alexey А; Kudryavtseva, Anna V; Graphodatsky, Alexander S; Beklemisheva, Violetta R; Serdyukova, Natalya A; Krutovsky, Konstantin V; Sharov, Vadim V; Kulakovskiy, Ivan V; Lando, Andrey S; Kasianov, Artem S; Kuzmin, Dmitry A; Putintseva, Yuliya A; Feranchuk, Sergey I; Shaposhnikov, Mikhail V; Fraifeld, Vadim E; Toren, Dmitri; Snezhkina, Anastasia V; Sitnik, Vasily V

    2017-12-28

    Gray whale, Eschrichtius robustus (E. robustus), is a single member of the family Eschrichtiidae, which is considered to be the most primitive in the class Cetacea. Gray whale is often described as a "living fossil". It is adapted to extreme marine conditions and has a high life expectancy (77 years). The assembly of a gray whale genome and transcriptome will allow to carry out further studies of whale evolution, longevity, and resistance to extreme environment. In this work, we report the first de novo assembly and primary analysis of the E. robustus genome and transcriptome based on kidney and liver samples. The presented draft genome assembly is complete by 55% in terms of a total genome length, but only by 24% in terms of the BUSCO complete gene groups, although 10,895 genes were identified. Transcriptome annotation and comparison with other whale species revealed robust expression of DNA repair and hypoxia-response genes, which is expected for whales. This preliminary study of the gray whale genome and transcriptome provides new data to better understand the whale evolution and the mechanisms of their adaptation to the hypoxic conditions.

  11. Pyrosequencing Analysis of Norovirus Genogroup II Distribution in Sewage and Oysters: First Detection of GII.17 Kawasaki 2014 in Oysters.

    Science.gov (United States)

    Pu, Jian; Kazama, Shinobu; Miura, Takayuki; Azraini, Nabila Dhyan; Konta, Yoshimitsu; Ito, Hiroaki; Ueki, You; Cahyaningrum, Ermaya Eka; Omura, Tatsuo; Watanabe, Toru

    2016-12-01

    Norovirus GII.3, GII.4, and GII.17 were detected using pyrosequencing in sewage and oysters in January and February 2015, in Japan. The strains in sewage and oyster samples were genetically identical or similar, predominant strains belonging to GII.17 Kawasaki 2014 lineage. This is the first report of GII.17 Kawasaki 2014 in oysters.

  12. Analysis of the transcriptome of Isodon rubescens and key enzymes involved in terpenoid biosynthesis

    Directory of Open Access Journals (Sweden)

    Xiuhong Su

    2016-05-01

    Full Text Available Isodon rubescens is an important medicinal plant in China that has been shown to reduce tumour growth due to the presence of the compound oridonin. In an effort to facilitate molecular research on oridonin biosynthesis, we reported the use of next generation massively parallel sequencing technologies and de novo transcriptome assembly to gain a comprehensive overview of I. rubescens transcriptome. In our study, a total of 50,934,276 clean reads, 101,640 transcripts and 44,626 unigenes were generated through de novo transcriptome assembly. A number of unigenes – 23,987, 10,263, 7359, 18,245, 17,683, 19,485, 9361 – were annotated in the National Center for Biotechnology Information (NCBI non-redundant protein (Nr, NCBI nucleotide sequences (Nt, Kyoto Encyclopedia of Genes and Genomes (KEGG Orthology (KO, Swiss-Prot, protein family (Pfam, gene ontology (GO, eukaryotic ortholog groups (KOG databases, respectively. Furthermore, the annotated unigenes were functionally classified according to the GO, KOG and KEGG. Based on these results, candidate genes encoding enzymes involved in terpenoids backbone biosynthesis were detected. Our data provided the most comprehensive sequence resource available for the study on I. rubescens, as well as demonstrated the effective use of Illumina sequencing and de novo transcriptome assembly on a species lacking genomic information.

  13. The Embryonic Transcriptome of the Red-Eared Slider Turtle (Trachemys scripta.

    Directory of Open Access Journals (Sweden)

    Nicholas J Kaplinsky

    Full Text Available The bony shell of the turtle is an evolutionary novelty not found in any other group of animals, however, research into its formation has suggested that it has evolved through modification of conserved developmental mechanisms. Although these mechanisms have been extensively characterized in model organisms, the tools for characterizing them in non-model organisms such as turtles have been limited by a lack of genomic resources. We have used a next generation sequencing approach to generate and assemble a transcriptome from stage 14 and 17 Trachemys scripta embryos, stages during which important events in shell development are known to take place. The transcriptome consists of 231,876 sequences with an N50 of 1,166 bp. GO terms and EC codes were assigned to the 61,643 unique predicted proteins identified in the transcriptome sequences. All major GO categories and metabolic pathways are represented in the transcriptome. Transcriptome sequences were used to amplify several cDNA fragments designed for use as RNA in situ probes. One of these, BMP5, was hybridized to a T. scripta embryo and exhibits both conserved and novel expression patterns. The transcriptome sequences should be of broad use for understanding the evolution and development of the turtle shell and for annotating any future T. scripta genome sequences.

  14. The Embryonic Transcriptome of the Red-Eared Slider Turtle (Trachemys scripta).

    Science.gov (United States)

    Kaplinsky, Nicholas J; Gilbert, Scott F; Cebra-Thomas, Judith; Lilleväli, Kersti; Saare, Merly; Chang, Eric Y; Edelman, Hannah E; Frick, Melissa A; Guan, Yin; Hammond, Rebecca M; Hampilos, Nicholas H; Opoku, David S B; Sariahmed, Karim; Sherman, Eric A; Watson, Ray

    2013-01-01

    The bony shell of the turtle is an evolutionary novelty not found in any other group of animals, however, research into its formation has suggested that it has evolved through modification of conserved developmental mechanisms. Although these mechanisms have been extensively characterized in model organisms, the tools for characterizing them in non-model organisms such as turtles have been limited by a lack of genomic resources. We have used a next generation sequencing approach to generate and assemble a transcriptome from stage 14 and 17 Trachemys scripta embryos, stages during which important events in shell development are known to take place. The transcriptome consists of 231,876 sequences with an N50 of 1,166 bp. GO terms and EC codes were assigned to the 61,643 unique predicted proteins identified in the transcriptome sequences. All major GO categories and metabolic pathways are represented in the transcriptome. Transcriptome sequences were used to amplify several cDNA fragments designed for use as RNA in situ probes. One of these, BMP5, was hybridized to a T. scripta embryo and exhibits both conserved and novel expression patterns. The transcriptome sequences should be of broad use for understanding the evolution and development of the turtle shell and for annotating any future T. scripta genome sequences.

  15. A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae

    DEFF Research Database (Denmark)

    Nookaew, Intawat; Papini, Marta; Pornputtapong, Natapol

    2012-01-01

    RNA-seq, has recently become an attractive method of choice in the studies of transcriptomes, promising several advantages compared with microarrays. In this study, we sought to assess the contribution of the different analytical steps involved in the analysis of RNA-seq data generated with the I......RNA-seq, has recently become an attractive method of choice in the studies of transcriptomes, promising several advantages compared with microarrays. In this study, we sought to assess the contribution of the different analytical steps involved in the analysis of RNA-seq data generated...... gene expression identification derived from the different statistical methods, as well as their integrated analysis results based on gene ontology annotation are in good agreement. Overall, our study provides a useful and comprehensive comparison between the two platforms (RNA-seq and microrrays...

  16. 454-Pyrosequencing survey of microbiota in adult Spotted Wing Drosophila (SWD) corroborates a core microbiome and additional symbiotic and entomopathogenic bacterial associates

    Science.gov (United States)

    Complete surveys of insect endosymbionts including species of economic importance have until recently been hampered by a lack of high-throughput genetic assays. We used 454-pyrosequencing of the 16S rRNA gene amplicon of adult spotted wing Drosophila (SWD) Drosophila suzukii (Matsumura) from souther...

  17. Desiccation tolerance in bryophytes: The dehydration and rehydration transcriptomes in the desiccation-tolerant bryophyte Bryum argenteum.

    Science.gov (United States)

    Gao, Bei; Li, Xiaoshuang; Zhang, Daoyuan; Liang, Yuqing; Yang, Honglan; Chen, Moxian; Zhang, Yuanming; Zhang, Jianhua; Wood, Andrew J

    2017-08-08

    The desiccation tolerant bryophyte Bryum argenteum is an important component of desert biological soil crusts (BSCs) and is emerging as a model system for studying vegetative desiccation tolerance. Here we present and analyze the hydration-dehydration-rehydration transcriptomes in B. argenteum to establish a desiccation-tolerance transcriptomic atlas. B. argenteum gametophores representing five different hydration stages (hydrated (H0), dehydrated for 2 h (D2), 24 h (D24), then rehydrated for 2 h (R2) and 48 h (R48)), were sampled for transcriptome analyses. Illumina high throughput RNA-Seq technology was employed and generated more than 488.46 million reads. An in-house de novo transcriptome assembly optimization pipeline based on Trinity assembler was developed to obtain a reference Hydration-Dehydration-Rehydration (H-D-R) transcriptome comprising of 76,206 transcripts, with an N50 of 2,016 bp and average length of 1,222 bp. Comprehensive transcription factor (TF) annotation discovered 978 TFs in 62 families, among which 404 TFs within 40 families were differentially expressed upon dehydration-rehydration. Pfam term enrichment analysis revealed 172 protein families/domains were significantly associated with the H-D-R cycle and confirmed early rehydration (i.e. the R2 stage) as exhibiting the maximum stress-induced changes in gene expression.

  18. Use of pyrosequencing and denaturing gradient gel electrophoresis to examine the effects of probiotics and essential oil blends on digestive microflora in broilers under mixed Eimeria infection.

    Science.gov (United States)

    Hume, Michael E; Barbosa, Nei A; Dowd, Scot E; Sakomura, Nilva K; Nalian, Armen G; Martynova-Van Kley, Alexandra; Oviedo-Rondón, Edgar O

    2011-11-01

    A protective digestive microflora helps prevent and reduce broiler infection and colonization by enteropathogens. In the current experiment, broilers fed diets supplemented with probiotics and essential oil (EO) blends were infected with a standard mixed Eimeria spp. to determine effects of performance enhancers on ileal and cecal microbial communities (MCs). Eight treatment groups included four controls (uninfected-unmedicated [UU], unmedicated-infected, the antibiotic BMD plus the ionophore Coban as positive control, and the ionophore as negative control), and four treatments (probiotics BC-30 and Calsporin; and EO, Crina Poultry Plus, and Crina PoultryAF). Day-old broilers were raised to 14 days in floor pens on used litter and then were moved to Petersime batteries and inoculated at 15 days with mixed Eimeria spp. Ileal and cecal samples were collected at 14 days and 7 days postinfection. Digesta DNA was subjected to pyrosequencing for sequencing of individual cecal bacteria and denaturing gradient gel electrophoresis (DGGE) for determination of changes in ileal and cecal MC according to percentage similarity coefficient (%SC). Pyrosequencing is very sensitive detecting shifts in individual bacterial sequences, whereas DGGE is able to detect gross shifts in entire MC. These combined techniques offer versatility toward identifying feed additive and mild Eimeria infection modulation of broiler MC. Pyrosequencing detected 147 bacterial species sequences. Additionally, pyrosequencing revealed the presence of relatively low levels of the potential human enteropathogens Campylobacter sp. and four Shigella spp. as well as the potential poultry pathogen Clostridiun perfringens. Pre- and postinfection changes in ileal (56%SC) and cecal (78.5%SC) DGGE profiles resulted from the coccidia infection and with increased broiler age. Probiotics and EO changed MC from those seen in UU ilea and ceca. Results potentially reflect the performance enhancement above expectations in

  19. Nodeomics: pathogen detection in vertebrate lymph nodes using meta-transcriptomics.

    Directory of Open Access Journals (Sweden)

    Nicola E Wittekindt

    Full Text Available The ongoing emergence of human infections originating from wildlife highlights the need for better knowledge of the microbial community in wildlife species where traditional diagnostic approaches are limited. Here we evaluate the microbial biota in healthy mule deer (Odocoileus hemionus by analyses of lymph node meta-transcriptomes. cDNA libraries from five individuals and two pools of samples were prepared from retropharyngeal lymph node RNA enriched for polyadenylated RNA and sequenced using Roche-454 Life Sciences technology. Protein-coding and 16S ribosomal RNA (rRNA sequences were taxonomically profiled using protein and rRNA specific databases. Representatives of all bacterial phyla were detected in the seven libraries based on protein-coding transcripts indicating that viable microbiota were present in lymph nodes. Residents of skin and rumen, and those ubiquitous in mule deer habitat dominated classifiable bacterial species. Based on detection of both rRNA and protein-coding transcripts, we identified two new proteobacterial species; a Helicobacter closely related to Helicobacter cetorum in the Helicobacter pylori/Helicobacter acinonychis complex and an Acinetobacter related to Acinetobacter schindleri. Among viruses, a novel gamma retrovirus and other members of the Poxviridae and Retroviridae were identified. We additionally evaluated bacterial diversity by amplicon sequencing the hypervariable V6 region of 16S rRNA and demonstrate that overall taxonomic diversity is higher with the meta-transcriptomic approach. These data provide the most complete picture to date of the microbial diversity within a wildlife host. Our research advances the use of meta-transcriptomics to study microbiota in wildlife tissues, which will facilitate detection of novel organisms with pathogenic potential to human and animals.

  20. Cross-species transferability of SSR loci developed from transciptome sequencing in lodgepole pine.

    Science.gov (United States)

    Lesser, Mark R; Parchman, Thomas L; Buerkle, C Alex

    2012-05-01

    With the advent of next generation sequencing technologies, transcriptome level sequence collections are arising as prominent resources for the discovery of gene-based molecular markers. In a previous study more than 15,000 simple sequence repeats (SSRs) in expressed sequence tag (EST) sequences resulting from 454 pyrosequencing of Pinus contorta cDNA were identified. From these we developed PCR primers for approximately 4000 candidate SSRs. Here, we tested 184 of these SSRs for successful amplification across P. contorta and eight other pine species and examined patterns of polymorphism and allelic variability for a subset of these SSRs. Cross-species transferability was high, with high percentages of loci producing PCR products in all species tested. In addition, 50% of the loci we screened across panels of individuals from three of these species were polymorphic and allelically diverse. We examined levels of diversity in a subset of these SSRs by collecting genotypic data across several populations of Pinus ponderosa in northern Wyoming. Our results indicate the utility of mining pyrosequenced EST collections for gene-based SSRs and provide a source of molecular markers that should bolster evolutionary genetic investigations across the genus Pinus. © 2011 Blackwell Publishing Ltd.

  1. Chapter 4 genomics, transcriptomics, and epigenomics in traumatic brain injury research.

    Science.gov (United States)

    Puccio, Ava M; Alexander, Sheila

    2015-01-01

    The long-term effects and significant impact of the full spectrum of traumatic brain injury (TBI) has received increased attention in recent years. Despite increased research efforts, there has been little movement toward improving outcomes for the survivors of TBI. TBI is a heterogeneous condition with a complex biological response, and significant variability in human recovery contributes to the difficulty in identifying therapeutics that improve outcomes. Personalized medicine, identifying the best course of treatment for a given individual based on individual characteristics, has great potential to improve recovery for TBI survivors. The advances in medical genetics and genomics over the past 20 years have increased our understanding of many biological processes. A substantial amount of research has focused on the genomic, transcriptomic, and epigenomic profiles in many health and disease states, including recovery from TBI. The focus of this review chapter is to describe the current state of the science in genomic, transcriptomic, and epigenomic research in the TBI population. There have been some advancements toward understanding the genomic, transcriptomic, and epigenomic processes in humans, but much of this work remains at the preclinical stage. This current evidence does improve our understanding of TBI recovery, but also serves as an excellent platform upon which to build further study toward improved outcomes for this population.

  2. Improving transcriptome construction in non-model organisms: integrating manual and automated gene definition in Emiliania huxleyi.

    Science.gov (United States)

    Feldmesser, Ester; Rosenwasser, Shilo; Vardi, Assaf; Ben-Dor, Shifra

    2014-02-22

    The advent of Next Generation Sequencing technologies and corresponding bioinformatics tools allows the definition of transcriptomes in non-model organisms. Non-model organisms are of great ecological and biotechnological significance, and consequently the understanding of their unique metabolic pathways is essential. Several methods that integrate de novo assembly with genome-based assembly have been proposed. Yet, there are many open challenges in defining genes, particularly where genomes are not available or incomplete. Despite the large numbers of transcriptome assemblies that have been performed, quality control of the transcript building process, particularly on the protein level, is rarely performed if ever. To test and improve the quality of the automated transcriptome reconstruction, we used manually defined and curated genes, several of them experimentally validated. Several approaches to transcript construction were utilized, based on the available data: a draft genome, high quality RNAseq reads, and ESTs. In order to maximize the contribution of the various data, we integrated methods including de novo and genome based assembly, as well as EST clustering. After each step a set of manually curated genes was used for quality assessment of the transcripts. The interplay between the automated pipeline and the quality control indicated which additional processes were required to improve the transcriptome reconstruction. We discovered that E. huxleyi has a very high percentage of non-canonical splice junctions, and relatively high rates of intron retention, which caused unique issues with the currently available tools. While individual tools missed genes and artificially joined overlapping transcripts, combining the results of several tools improved the completeness and quality considerably. The final collection, created from the integration of several quality control and improvement rounds, was compared to the manually defined set both on the DNA and

  3. Genome and Transcriptome Sequencing of the Ostreid herpesvirus 1 From Tomales Bay, California

    Science.gov (United States)

    Burge, C. A.; Langevin, S.; Closek, C. J.; Roberts, S. B.; Friedman, C. S.

    2016-02-01

    Mass mortalities of larval and seed bivalve molluscs attributed to the Ostreid herpesvirus 1 (OsHV-1) occur globally. OsHV-1 was fully sequenced and characterized as a member of the Family Malacoherpesviridae. Multiple strains of OsHV-1 exist and may vary in virulence, i.e. OsHV-1 µvar. For most global variants of OsHV-1, sequence data is limited to PCR-based sequencing of segments, including two recent genomes. In the United States, OsHV-1 is limited to detection in adjacent embayments in California, Tomales and Drakes bays. Limited DNA sequence data of OsHV-1 infecting oysters in Tomales Bay indicates the virus detected in Tomales Bay is similar but not identical to any one global variant of OsHV-1. In order to better understand both strain variation and virulence of OsHV-1 infecting oysters in Tomales Bay, we used genomic and transcriptomic sequencing. Meta-genomic sequencing (Illumina MiSeq) was conducted from infected oysters (n=4 per year) collected in 2003, 2007, and 2014, where full OsHV-1 genome sequences and low overall microbial diversity were achieved from highly infected oysters. Increased microbial diversity was detected in three of four samples sequenced from 2003, where qPCR based genome copy numbers of OsHV-1 were lower. Expression analysis (SOLiD RNA sequencing) of OsHV-1 genes expressed in oyster larvae at 24 hours post exposure revealed a nearly complete transcriptome, with several highly expressed genes, which are similar to recent transcriptomic analyses of other OsHV-1 variants. Taken together, our results indicate that genome and transcriptome sequencing may be powerful tools in understanding both strain variation and virulence of non-culturable marine viruses.

  4. Developmental transcriptome of Aplysia californica'

    KAUST Repository

    Heyland, Andreas; Vue, Zer; Voolstra, Christian R.; Medina, Mó nica; Moroz, Leonid L.

    2010-01-01

    developmental transcriptome with similar studies in the zebra fish Danio rerio, the fruit fly Drosophila melanogaster, the nematode Caenorhabditis elegans, and other studies on molluscs suggests an overall highly divergent pattern of gene regulatory mechanisms

  5. Bacterial community composition of South China Sea sediments through pyrosequencing-based analysis of 16S rRNA genes.

    Science.gov (United States)

    Zhu, Daochen; Tanabe, Shoko-Hosoi; Yang, Chong; Zhang, Weimin; Sun, Jianzhong

    2013-01-01

    Subseafloor sediments accumulate large amounts of organic and inorganic materials that contain a highly diverse microbial ecosystem. The aim of this study was to survey the bacterial community of subseafloor sediments from the South China Sea. Pyrosequencing of over 265,000 amplicons of the V3 hypervariable region of the 16S ribosomal RNA gene was performed on 16 sediment samples collected from multiple locations in the northern region of the South China Sea from depths ranging from 35 to 4000 m. A total of 9,726 operational taxonomic units (OTUs; between 695 and 2819 unique OTUs per sample) at 97% sequence similarity level were generated. In total, 40 bacterial phyla including 22 formally described phyla and 18 candidate phyla, with Proteobacteria, Firmicutes, Planctomycetes, Actinobacteria and Chloroflexi being most diverse, were identified. The most abundant phylotype, accounting for 42.6% of all sequences, belonged to Gammaproteobacteria, which possessed absolute predominance in the samples analyzed. Among the 18 candidate phyla, 12 were found for the first time in the South China Sea. This study provided a novel insight into the composition of bacterial communities of the South China Sea subseafloor. Furthermore, abundances and community similarity analysis showed that the compositions of the bacterial communities are very similar at phylum level at different depths from 35-4000 m.

  6. Bacterial community composition of South China Sea sediments through pyrosequencing-based analysis of 16S rRNA genes.

    Directory of Open Access Journals (Sweden)

    Daochen Zhu

    Full Text Available BACKGROUND: Subseafloor sediments accumulate large amounts of organic and inorganic materials that contain a highly diverse microbial ecosystem. The aim of this study was to survey the bacterial community of subseafloor sediments from the South China Sea. METHODOLOGY/PRINCIPAL FINDINGS: Pyrosequencing of over 265,000 amplicons of the V3 hypervariable region of the 16S ribosomal RNA gene was performed on 16 sediment samples collected from multiple locations in the northern region of the South China Sea from depths ranging from 35 to 4000 m. A total of 9,726 operational taxonomic units (OTUs; between 695 and 2819 unique OTUs per sample at 97% sequence similarity level were generated. In total, 40 bacterial phyla including 22 formally described phyla and 18 candidate phyla, with Proteobacteria, Firmicutes, Planctomycetes, Actinobacteria and Chloroflexi being most diverse, were identified. The most abundant phylotype, accounting for 42.6% of all sequences, belonged to Gammaproteobacteria, which possessed absolute predominance in the samples analyzed. Among the 18 candidate phyla, 12 were found for the first time in the South China Sea. CONCLUSIONS: This study provided a novel insight into the composition of bacterial communities of the South China Sea subseafloor. Furthermore, abundances and community similarity analysis showed that the compositions of the bacterial communities are very similar at phylum level at different depths from 35-4000 m.

  7. Hepatic transcriptomic profiles from barramundi, Lates calcarifer, as a means of assessing organism health and identifying stressors in rivers in northern Queensland.

    Science.gov (United States)

    Hook, Sharon E; Kroon, Frederieke J; Greenfield, Paul A; Warne, Michael St J; Smith, Rachael A; Turner, Ryan D

    2017-08-01

    Resource managers need to differentiate between sites with and without contaminants and those where contaminants cause impacts. Potentially, transcriptomes could be used to evaluate sites where contaminant-induced effects may occur, to identify causative stressors of effects and potential adverse outcomes. To test this hypothesis, the hepatic transcriptomes in Barramundi, a perciforme teleost fish, (Lates calcarifer) from two reference sites, two agriculturally impacted sites sampled during the dry season, and an impacted site sampled during the wet season were compared. The hepatic transcriptome was profiled using RNA-Seq. Multivariate analysis showed that transcriptomes were clustered based on site and by inference water quality, but not sampling time. The largest differences in transcriptomic profile were between reference sites and a site sampled during high run-off, showing that impacted sites can be identified via RNA-Seq. Transcripts with altered abundance were linked to xenobiotic metabolism, peroxisome proliferation and stress responses, indicating putative stressors with the potential for adverse outcomes in barramundi. Copyright © 2017. Published by Elsevier Ltd.

  8. Characterization of the microbial communities along the gastrointestinal tract of sheep by 454 pyrosequencing analysis

    Directory of Open Access Journals (Sweden)

    Jin Wang

    2017-01-01

    Full Text Available Objective The gastrointestinal tract of sheep contain complex microbial communities that influence numerous aspects of the sheep’s health and development. The objective of this study was to analyze the composition and diversity of the microbiota in the gastrointestinal tract sections (rumen, reticulum, omasum, abomasum, duodenum, jejunum, ileum, cecum, colon, and rectum of sheep. Methods This analysis was performed by 454 pyrosequencing using the V3-V6 region of the 16S rRNA genes. Samples were collected from five healthy, small tailed Han sheep aged 10 months, obtained at market. The bacterial composition of sheep gastrointestinal microbiota was investigated at the phylum, class, order, family, genus, and species levels. Results The dominant bacterial phyla in the entire gastrointestinal sections were Firmicutes, Bacteroidetes, and Proteobacteria. In the stomach, the three most dominant genera in the sheep were Prevotella, unclassified Lachnospiraceae, and Butyrivibrio. In the small intestine, the three most dominant genera in the sheep were Escherichia, unclassified Lachnospiraceae, and Ruminococcus. In the large intestine, the three most dominant genera in the sheep were Ruminococcus, unclassified Ruminococcaceae, and Prevotella. R. flavefaciens, B. fibrisolvens, and S. ruminantium were three most dominant species in the sheep gastrointestinal tract. Principal Coordinates Analysis showed that the microbial communities from each gastrointestinal section could be separated into three groups according to similarity of community composition: stomach (rumen, reticulum, omasum, and abomasum, small intestine (duodenum, jejunum, and ileum, and large intestine (cecum, colon, and rectum. Conclusion This is the first study to characterize the entire gastrointestinal microbiota in sheep by use of 16S rRNA gene amplicon pyrosequencing, expanding our knowledge of the gastrointestinal bacterial community of sheep.

  9. Transcriptome-Based Modeling Reveals that Oxidative Stress Induces Modulation of the AtfA-Dependent Signaling Networks in Aspergillus nidulans

    Directory of Open Access Journals (Sweden)

    Erzsébet Orosz

    2017-01-01

    Full Text Available To better understand the molecular functions of the master stress-response regulator AtfA in Aspergillus nidulans, transcriptomic analyses of the atfA null mutant and the appropriate control strains exposed to menadione sodium bisulfite- (MSB-, t-butylhydroperoxide- and diamide-induced oxidative stresses were performed. Several elements of oxidative stress response were differentially expressed. Many of them, including the downregulation of the mitotic cell cycle, as the MSB stress-specific upregulation of FeS cluster assembly and the MSB stress-specific downregulation of nitrate reduction, tricarboxylic acid cycle, and ER to Golgi vesicle-mediated transport, showed AtfA dependence. To elucidate the potential global regulatory role of AtfA governing expression of a high number of genes with very versatile biological functions, we devised a model based on the comprehensive transcriptomic data. Our model suggests that an important function of AtfA is to modulate the transduction of stress signals. Although it may regulate directly only a limited number of genes, these include elements of the signaling network, for example, members of the two-component signal transduction systems. AtfA acts in a stress-specific manner, which may increase further the number and diversity of AtfA-dependent genes. Our model sheds light on the versatility of the physiological functions of AtfA and its orthologs in fungi.

  10. A consensus approach to vertebrate de novo transcriptome assembly from RNA-seq data: Assembly of the duck (Anas platyrhynchos transcriptome.

    Directory of Open Access Journals (Sweden)

    Joanna eMoreton

    2014-06-01

    Full Text Available For vertebrate organisms where a reference genome is not available, de novo transcriptome assembly enables a cost effective insight into the identification of tissue specific or differentially expressed genes and variation of the coding part of the genome. However, since there are a number of different tools and parameters that can be used to reconstruct transcripts, it is difficult to determine an optimal method. Here we suggest a pipeline based on (1 assessing the performance of three different assembly tools (2 using both single and multiple k-mer approaches (3 examining the influence of the number of reads used in the assembly (4 merging assemblies from different tools. We use an example dataset from the vertebrate Anas platyrhynchos domestica (Pekin duck. We find that taking a subset of data enables a robust assembly to be produced by multiple methods without the need for very high memory capacity. The use of reads mapped back to transcripts (RMBT and CEGMA (Core Eukaryotic Genes Mapping Approach provides useful metrics to determine the completeness of assembly obtained. For this dataset the use of multiple k-mers in the assembly generated a more complete assembly as measured by greater number of RMBT and CEGMA score. Merged single k-mer assemblies are generally smaller but consist of longer transcripts, suggesting an assembly consisting of fewer fragmented transcripts. We suggest that the use of a subset of reads during assembly allows the relatively rapid investigation of assembly characteristics and can guide the user to the most appropriate transcriptome for particular downstream use. Transcriptomes generated by the compared assembly methods and the final merged assembly are freely available for download at http://dx.doi.org/10.6084/m9.figshare.1032613.

  11. Transcriptomic alterations during ageing reflect the shift from cancer to degenerative diseases in the elderly.

    Science.gov (United States)

    Aramillo Irizar, Peer; Schäuble, Sascha; Esser, Daniela; Groth, Marco; Frahm, Christiane; Priebe, Steffen; Baumgart, Mario; Hartmann, Nils; Marthandan, Shiva; Menzel, Uwe; Müller, Julia; Schmidt, Silvio; Ast, Volker; Caliebe, Amke; König, Rainer; Krawczak, Michael; Ristow, Michael; Schuster, Stefan; Cellerino, Alessandro; Diekmann, Stephan; Englert, Christoph; Hemmerich, Peter; Sühnel, Jürgen; Guthke, Reinhard; Witte, Otto W; Platzer, Matthias; Ruppin, Eytan; Kaleta, Christoph

    2018-01-30

    Disease epidemiology during ageing shows a transition from cancer to degenerative chronic disorders as dominant contributors to mortality in the old. Nevertheless, it has remained unclear to what extent molecular signatures of ageing reflect this phenomenon. Here we report on the identification of a conserved transcriptomic signature of ageing based on gene expression data from four vertebrate species across four tissues. We find that ageing-associated transcriptomic changes follow trajectories similar to the transcriptional alterations observed in degenerative ageing diseases but are in opposite direction to the transcriptomic alterations observed in cancer. We confirm the existence of a similar antagonism on the genomic level, where a majority of shared risk alleles which increase the risk of cancer decrease the risk of chronic degenerative disorders and vice versa. These results reveal a fundamental trade-off between cancer and degenerative ageing diseases that sheds light on the pronounced shift in their epidemiology during ageing.

  12. Genome Annotation and Transcriptomics of Oil-Producing Algae

    Science.gov (United States)

    2015-03-16

    AFRL-OSR-VA-TR-2015-0103 GENOME ANNOTATION AND TRANSCRIPTOMICS OF OIL-PRODUCING ALGAE Sabeeha Merchant UNIVERSITY OF CALIFORNIA LOS ANGELES Final...2010 To 12-31-2014 4. TITLE AND SUBTITLE GENOME ANNOTATION AND TRANSCRIPTOMICS OF OIL-PRODUCING ALGAE 5a. CONTRACT NUMBER FA9550-10-1-0095 5b...NOTES 14. ABSTRACT Most algae accumulate triacylglycerols (TAGs) when they are starved for essential nutrients like N, S, P (or Si in the case of some

  13. Analysis of insecticide resistance-related genes of the Carmine spider mite Tetranychus cinnabarinus based on a de novo assembled transcriptome.

    Directory of Open Access Journals (Sweden)

    Zhifeng Xu

    Full Text Available The carmine spider mite (CSM, Tetranychus cinnabarinus, is an important pest mite in agriculture, because it can develop insecticide resistance easily. To gain valuable gene information and molecular basis for the future insecticide resistance study of CSM, the first transcriptome analysis of CSM was conducted. A total of 45,016 contigs and 25,519 unigenes were generated from the de novo transcriptome assembly, and 15,167 unigenes were annotated via BLAST querying against current databases, including nr, SwissProt, the Clusters of Orthologous Groups (COGs, Kyoto Encyclopedia of Genes and Genomes (KEGG and Gene Ontology (GO. Aligning the transcript to Tetranychus urticae genome, the 19255 (75.45% of the transcripts had significant (e-value <10-5 matches to T. urticae DNA genome, 19111 sequences matched to T. urticae proteome with an average protein length coverage of 42.55%. Core Eukaryotic Genes Mapping Approach (CEGMA analysis identified 435 core eukaryotic genes (CEGs in the CSM dataset corresponding to 95% coverage. Ten gene categories that relate to insecticide resistance in arthropod were generated from CSM transcriptome, including 53 P450-, 22 GSTs-, 23 CarEs-, 1 AChE-, 7 GluCls-, 9 nAChRs-, 8 GABA receptor-, 1 sodium channel-, 6 ATPase- and 12 Cyt b genes. We developed significant molecular resources for T. cinnabarinus putatively involved in insecticide resistance. The transcriptome assembly analysis will significantly facilitate our study on the mechanism of adapting environmental stress (including insecticide in CSM at the molecular level, and will be very important for developing new control strategies against this pest mite.

  14. De novo transcriptome assembly of shrimp Palaemon serratus

    Directory of Open Access Journals (Sweden)

    Alejandra Perina

    2017-03-01

    Full Text Available The shrimp Palaemon serratus is a coastal decapod crustacean with a high commercial value. It is harvested for human consumption. In this study, we used Illumina sequencing technology (HiSeq 2000 to sequence, assemble and annotate the transcriptome of P. serratus. RNA was isolated from muscle of adults individuals and, from a pool of larvae. A total number of 4 cDNA libraries were constructed, using the TruSeq RNA Sample Preparation Kit v2. The raw data in this study was deposited in NCBI SRA database with study accession number of SRP090769. The obtained data were subjected to de novo transcriptome assembly using Trinity software, and coding regions were predicted by TransDecoder. We used Blastp and Sma3s to annotate the identified proteins. The transcriptome data could provide some insight into the understanding of genes involved in the larval development and metamorphosis.

  15. Visual analysis of transcriptome data in the context of anatomical structures and biological networks

    Directory of Open Access Journals (Sweden)

    Astrid eJunker

    2012-11-01

    Full Text Available The complexity and temporal as well as spatial resolution of transcriptome datasets is constantly increasing due to extensive technological developments. Here we present methods for advanced visualization and intuitive exploration of transcriptomics data as necessary prerequisites in order to facilitate the gain of biological knowledge. Color-coding of structural images based on the expression level enables a fast visual data analysis in the background of the examined biological system. The network-based exploration of these visualizations allows for comparative analysis of genes with specific transcript patterns and supports the extraction of functional relationships even from large datasets. In order to illustrate the presented methods, the tool HIVE was applied for visualization and exploration of database-retrieved expression data for master regulators of Arabidopsis thaliana flower and seed development in the context of corresponding tissue-specific regulatory networks.

  16. Single cell transcriptome profiling of developing chick retinal cells.

    Science.gov (United States)

    Laboissonniere, Lauren A; Martin, Gregory M; Goetz, Jillian J; Bi, Ran; Pope, Brock; Weinand, Kallie; Ellson, Laura; Fru, Diane; Lee, Miranda; Wester, Andrea K; Liu, Peng; Trimarchi, Jeffrey M

    2017-08-15

    The vertebrate retina is a specialized photosensitive tissue comprised of six neuronal and one glial cell types, each of which develops in prescribed proportions at overlapping timepoints from a common progenitor pool. While each of these cells has a specific function contributing to proper vision in the mature animal, their differential representation in the retina as well as the presence of distinctive cellular subtypes makes identifying the transcriptomic signatures that lead to each retinal cell's fate determination and development challenging. We have analyzed transcriptomes from individual cells isolated from the chick retina throughout retinogenesis. While we focused our efforts on the retinal ganglion cells, our transcriptomes of developing chick cells also contained representation from multiple retinal cell types, including photoreceptors and interneurons at different stages of development. Most interesting was the identification of transcriptomes from individual mixed lineage progenitor cells in the chick as these cells offer a window into the cell fate decision-making process. Taken together, these data sets will enable us to uncover the most critical genes acting in the steps of cell fate determination and early differentiation of various retinal cell types. © 2017 Wiley Periodicals, Inc.

  17. Transcriptome Profiling in Human Diseases: New Advances and Perspectives.

    Science.gov (United States)

    Casamassimi, Amelia; Federico, Antonio; Rienzo, Monica; Esposito, Sabrina; Ciccodicola, Alfredo

    2017-07-29

    In the last decades, transcriptome profiling has been one of the most utilized approaches to investigate human diseases at the molecular level. Through expression studies, many molecular biomarkers and therapeutic targets have been found for several human pathologies. This number is continuously increasing thanks to total RNA sequencing. Indeed, this new technology has completely revolutionized transcriptome analysis allowing the quantification of gene expression levels and allele-specific expression in a single experiment, as well as to identify novel genes, splice isoforms, fusion transcripts, and to investigate the world of non-coding RNA at an unprecedented level. RNA sequencing has also been employed in important projects, like ENCODE (Encyclopedia of the regulatory elements) and TCGA (The Cancer Genome Atlas), to provide a snapshot of the transcriptome of dozens of cell lines and thousands of primary tumor specimens. Moreover, these studies have also paved the way to the development of data integration approaches in order to facilitate management and analysis of data and to identify novel disease markers and molecular targets to use in the clinics. In this scenario, several ongoing clinical trials utilize transcriptome profiling through RNA sequencing strategies as an important instrument in the diagnosis of numerous human pathologies.

  18. A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing.

    Science.gov (United States)

    Chen, Shi-Yi; Deng, Feilong; Jia, Xianbo; Li, Cao; Lai, Song-Jia

    2017-08-09

    It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, it still remains a huge challenge for obtaining full-length transcripts because of difficulties in the short read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). We totally obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not been annotated yet within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. Up to 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome, respectively. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of rabbit genome.

  19. Analysis of the Citrullus colocynthis transcriptome during water deficit stress.

    Directory of Open Access Journals (Sweden)

    Zhuoyu Wang

    Full Text Available Citrullus colocynthis is a very drought tolerant species, closely related to watermelon (C. lanatus var. lanatus, an economically important cucurbit crop. Drought is a threat to plant growth and development, and the discovery of drought inducible genes with various functions is of great importance. We used high throughput mRNA Illumina sequencing technology and bioinformatic strategies to analyze the C. colocynthis leaf transcriptome under drought treatment. Leaf samples at four different time points (0, 24, 36, or 48 hours of withholding water were used for RNA extraction and Illumina sequencing. qRT-PCR of several drought responsive genes was performed to confirm the accuracy of RNA sequencing. Leaf transcriptome analysis provided the first glimpse of the drought responsive transcriptome of this unique cucurbit species. A total of 5038 full-length cDNAs were detected, with 2545 genes showing significant changes during drought stress. Principle component analysis indicated that drought was the major contributing factor regulating transcriptome changes. Up regulation of many transcription factors, stress signaling factors, detoxification genes, and genes involved in phytohormone signaling and citrulline metabolism occurred under the water deficit conditions. The C. colocynthis transcriptome data highlight the activation of a large set of drought related genes in this species, thus providing a valuable resource for future functional analysis of candidate genes in defense of drought stress.

  20. Histological and Transcriptomic Analysis during Bulbil Formation in Lilium lancifolium

    Directory of Open Access Journals (Sweden)

    Panpan Yang

    2017-08-01

    Full Text Available Aerial bulbils are an important propagative organ, playing an important role in population expansion. However, the detailed gene regulatory patterns and molecular mechanism underlying bulbil formation remain unclear. Triploid Lilium lancifolium, which develops many aerial bulbils on the leaf axils of middle-upper stem, is a useful species for investigating bulbil formation. To investigate the mechanism of bulbil formation in triploid L. lancifolium, we performed histological and transcriptomic analyses using samples of leaf axils located in the upper and lower stem of triploid L. lancifolium during bulbil formation. Histological results indicated that the bulbils of triploid L. lancifolium are derived from axillary meristems that initiate de novo from cells on the adaxial side of the petiole base. Transcriptomic analysis generated ~650 million high-quality reads and 11,871 differentially expressed genes (DEGs. Functional analysis showed that the DEGs were significantly enriched in starch and sucrose metabolism and plant hormone signal transduction. Starch synthesis and accumulation likely promoted the initiation of upper bulbils in triploid L. lancifolium. Hormone-associated pathways exhibited distinct patterns of change in each sample. Auxin likely promoted the initiation of bulbils and then inhibited further bulbil formation. High biosynthesis and low degradation of cytokinin might have led to bulbil formation in the upper leaf axil. The present study achieved a global transcriptomic analysis focused on gene expression changes and pathways' enrichment during upper bulbil formation in triploid L. lancifolium, laying a solid foundation for future molecular studies on bulbil formation.

  1. A Transcriptome Map of Actinobacillus pleuropneumoniae at Single-Nucleotide Resolution Using Deep RNA-Seq.

    Directory of Open Access Journals (Sweden)

    Zhipeng Su

    Full Text Available Actinobacillus pleuropneumoniae is the pathogen of porcine contagious pleuropneumoniae, a highly contagious respiratory disease of swine. Although the genome of A. pleuropneumoniae was sequenced several years ago, limited information is available on the genome-wide transcriptional analysis to accurately annotate the gene structures and regulatory elements. High-throughput RNA sequencing (RNA-seq has been applied to study the transcriptional landscape of bacteria, which can efficiently and accurately identify gene expression regions and unknown transcriptional units, especially small non-coding RNAs (sRNAs, UTRs and regulatory regions. The aim of this study is to comprehensively analyze the transcriptome of A. pleuropneumoniae by RNA-seq in order to improve the existing genome annotation and promote our understanding of A. pleuropneumoniae gene structures and RNA-based regulation. In this study, we utilized RNA-seq to construct a single nucleotide resolution transcriptome map of A. pleuropneumoniae. More than 3.8 million high-quality reads (average length ~90 bp from a cDNA library were generated and aligned to the reference genome. We identified 32 open reading frames encoding novel proteins that were mis-annotated in the previous genome annotations. The start sites for 35 genes based on the current genome annotation were corrected. Furthermore, 51 sRNAs in the A. pleuropneumoniae genome were discovered, of which 40 sRNAs were never reported in previous studies. The transcriptome map also enabled visualization of 5'- and 3'-UTR regions, in which contained 11 sRNAs. In addition, 351 operons covering 1230 genes throughout the whole genome were identified. The RNA-Seq based transcriptome map validated annotated genes and corrected annotations of open reading frames in the genome, and led to the identification of many functional elements (e.g. regions encoding novel proteins, non-coding sRNAs and operon structures. The transcriptional units

  2. In silico approach towards H5N1 virus protein and transcriptomics ...

    African Journals Online (AJOL)

    H5N1 influenza A virus is a serious threat to human population. With a considerable mortality rate, strategies for coping with the infection are being developed. Our research group and some others investigated the potential therapeutic and preventive measures for tackling H5N1 infections. Protein based and transcriptomics ...

  3. Sequencing and de novo transcriptome assembly of the Chinese giant salamander (Andrias davidianus

    Directory of Open Access Journals (Sweden)

    Yong Huang

    2017-06-01

    Full Text Available Next-generation technologies for determination of genomics and transcriptomics composition have a wide range of applications. Andrias davidianus, has become an endangered amphibian species of salamander endemic in China. However, there is a lack of the molecular information. In this study, we obtained the RNA-Seq data from a pool of A. davidianus tissue including spleen, liver, muscle, kidney, skin, testis, gut and heart using Illumina HiSeq 2500 platform. A total of 15,398,997,600 bp were obtained, corresponding to 102,659,984 raw reads. A total of 102,659,984 reads were filtered after removing low-quality reads and trimming the adapter sequences. The Trinity program was used to de novo assemble 132,912 unigenes with an average length of 690 bp and N50 of 1263 bp. Unigenes were annotated through number of databases. These transcriptomic data of A. davidianus should open the door to molecular evolution studies based on the entire transcriptome or targeted genes of interest to sequence. The raw data in this study can be available in NCBI SRA database with accession number of SRP099564.

  4. Rnnotator: an automated de novo transcriptome assembly pipeline from stranded RNA-Seq reads

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Jeffrey; Bruno, Vincent M.; Fang, Zhide; Meng, Xiandong; Blow, Matthew; Zhang, Tao; Sherlock, Gavin; Snyder, Michael; Wang, Zhong

    2010-11-19

    Background: Comprehensive annotation and quantification of transcriptomes are outstanding problems in functional genomics. While high throughput mRNA sequencing (RNA-Seq) has emerged as a powerful tool for addressing these problems, its success is dependent upon the availability and quality of reference genome sequences, thus limiting the organisms to which it can be applied. Results: Here, we describe Rnnotator, an automated software pipeline that generates transcript models by de novo assembly of RNA-Seq data without the need for a reference genome. We have applied the Rnnotator assembly pipeline to two yeast transcriptomes and compared the results to the reference gene catalogs of these organisms. The contigs produced by Rnnotator are highly accurate (95percent) and reconstruct full-length genes for the majority of the existing gene models (54.3percent). Furthermore, our analyses revealed many novel transcribed regions that are absent from well annotated genomes, suggesting Rnnotator serves as a complementary approach to analysis based on a reference genome for comprehensive transcriptomics. Conclusions: These results demonstrate that the Rnnotator pipeline is able to reconstruct full-length transcripts in the absence of a complete reference genome.

  5. Comparative transcriptome analyses indicate molecular homology of zebrafish swimbladder and mammalian lung.

    Directory of Open Access Journals (Sweden)

    Weiling Zheng

    Full Text Available The fish swimbladder is a unique organ in vertebrate evolution and it functions for regulating buoyancy in most teleost species. It has long been postulated as a homolog of the tetrapod lung, but the molecular evidence is scarce. In order to understand the molecular function of swimbladder as well as its relationship with lungs in tetrapods, transcriptomic analyses of zebrafish swimbladder were carried out by RNA-seq. Gene ontology classification showed that genes in cytoskeleton and endoplasmic reticulum were enriched in the swimbladder. Further analyses depicted gene sets and pathways closely related to cytoskeleton constitution and regulation, cell adhesion, and extracellular matrix. Several prominent transcription factor genes in the swimbladder including hoxc4a, hoxc6a, hoxc8a and foxf1 were identified and their expressions in developing swimbladder during embryogenesis were confirmed. By comparison of enriched transcripts in the swimbladder with those in human and mouse lungs, we established the resemblance of transcriptome of the zebrafish swimbladder and mammalian lungs. Based on the transcriptomic data of zebrafish swimbladder, the predominant functions of swimbladder are in its epithelial and muscular tissues. Our comparative analyses also provide molecular evidence of the relatedness of the fish swimbladder and mammalian lung.

  6. De novo assembly of the perennial ryegrass transcriptome using an RNA-seq strategy

    DEFF Research Database (Denmark)

    Farrell, Jacqueline Danielle; Byrne, Stephen; Paina, Cristiana

    2014-01-01

    a homozygous perennial ryegrass genotype can circumvent the challenge of heterozygosity. The goals of this study were to perform RNA-sequencing on multiple tissues from a highly inbred genotype to develop a reference transcriptome. This was complemented with RNA-sequencing of a highly heterozygous genotype...... for SNP calling. Result De novo transcriptome assembly of the inbred genotype created 185,833 transcripts with an average length of 830 base pairs. Within the inbred reference transcriptome 78,560 predicted open reading frames were found of which 24,434 were predicted as complete. Functional annotation...... multiple orthologs. Using the longest unique open reading frames as the reference sequence, 64,242 single nucleotide polymorphisms were found. One thousand sixty one open reading frames from the inbred genotype contained heterozygous sites, confirming the high degree of homozygosity. Conclusion Our study...

  7. Transcriptomic dissection of sexual differences in Bemisia tabaci, an invasive agricultural pest worldwide.

    Science.gov (United States)

    Xie, Wen; Guo, Litao; Jiao, Xiaoguo; Yang, Nina; Yang, Xin; Wu, Qingjun; Wang, Shaoli; Zhou, Xuguo; Zhang, Youjun

    2014-02-14

    Sex difference involving chromosomes and gene expression has been extensively documented. In this study, the gender difference in the sweetpotato whitefly Bemisia tabaci was investigated using Illumina-based transcriptomic analysis. Gender-based RNAseq data produced 27 Gb reads, and subsequent de novo assembly generated 93,948 transcripts with a N50 of 1,853 bp. A total of 1,351 differentially expressed genes were identified between male and female B. tabaci, and majority of them were female-biased. Pathway and GO enrichment experiments exhibited a gender-specific expression, including enriched translation in females, and enhanced structural constituent of cuticle in male whiteflies. In addition, a putative transformer2 gene (tra2) was cloned, and the structural feature and expression profile of tra2 were investigated. Sexually dimorphic transcriptome is an uncharted territory for the agricultural insect pests. Molecular understanding of sex determination in B. tabaci, an emerging invasive insect pest worldwide, will provide potential molecular target(s) for genetic pest control alternatives.

  8. Predicting gene regulatory networks of soybean nodulation from RNA-Seq transcriptome data.

    Science.gov (United States)

    Zhu, Mingzhu; Dahmen, Jeremy L; Stacey, Gary; Cheng, Jianlin

    2013-09-22

    High-throughput RNA sequencing (RNA-Seq) is a revolutionary technique to study the transcriptome of a cell under various conditions at a systems level. Despite the wide application of RNA-Seq techniques to generate experimental data in the last few years, few computational methods are available to analyze this huge amount of transcription data. The computational methods for constructing gene regulatory networks from RNA-Seq expression data of hundreds or even thousands of genes are particularly lacking and urgently needed. We developed an automated bioinformatics method to predict gene regulatory networks from the quantitative expression values of differentially expressed genes based on RNA-Seq transcriptome data of a cell in different stages and conditions, integrating transcriptional, genomic and gene function data. We applied the method to the RNA-Seq transcriptome data generated for soybean root hair cells in three different development stages of nodulation after rhizobium infection. The method predicted a soybean nodulation-related gene regulatory network consisting of 10 regulatory modules common for all three stages, and 24, 49 and 70 modules separately for the first, second and third stage, each containing both a group of co-expressed genes and several transcription factors collaboratively controlling their expression under different conditions. 8 of 10 common regulatory modules were validated by at least two kinds of validations, such as independent DNA binding motif analysis, gene function enrichment test, and previous experimental data in the literature. We developed a computational method to reliably reconstruct gene regulatory networks from RNA-Seq transcriptome data. The method can generate valuable hypotheses for interpreting biological data and designing biological experiments such as ChIP-Seq, RNA interference, and yeast two hybrid experiments.

  9. Transcriptomic analysis of flower development in wintersweet (Chimonanthus praecox).

    Science.gov (United States)

    Liu, Daofeng; Sui, Shunzhao; Ma, Jing; Li, Zhineng; Guo, Yulong; Luo, Dengpan; Yang, Jianfeng; Li, Mingyang

    2014-01-01

    Wintersweet (Chimonanthus praecox) is familiar as a garden plant and woody ornamental flower. On account of its unique flowering time and strong fragrance, it has a high ornamental and economic value. Despite a long history of human cultivation, our understanding of wintersweet genetics and molecular biology remains scant, reflecting a lack of basic genomic and transcriptomic data. In this study, we assembled three cDNA libraries, from three successive stages in flower development, designated as the flower bud with displayed petal, open flower and senescing flower stages. Using the Illumina RNA-Seq method, we obtained 21,412,928, 26,950,404, 24,912,954 qualified Illumina reads, respectively, for the three successive stages. The pooled reads from all three libraries were then assembled into 106,995 transcripts, 51,793 of which were annotated in the NCBI non-redundant protein database. Of these annotated sequences, 32,649 and 21,893 transcripts were assigned to gene ontology categories and clusters of orthologous groups, respectively. We could map 15,587 transcripts onto 312 pathways using the Kyoto Encyclopedia of Genes and Genomes pathway database. Based on these transcriptomic data, we obtained a large number of candidate genes that were differentially expressed at the open flower and senescing flower stages. An analysis of differentially expressed genes involved in plant hormone signal transduction pathways indicated that although flower opening and senescence may be independent of the ethylene signaling pathway in wintersweet, salicylic acid may be involved in the regulation of flower senescence. We also succeeded in isolating key genes of floral scent biosynthesis and proposed a biosynthetic pathway for monoterpenes and sesquiterpenes in wintersweet flowers, based on the annotated sequences. This comprehensive transcriptomic analysis presents fundamental information on the genes and pathways which are involved in flower development in wintersweet. And our data

  10. De novo transcriptome assembly of two different peach cultivars grown in Korea

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2015-12-01

    Full Text Available Peach (Prunus persica is one of the most popular stone fruits worldwide. Next generation sequencing (NGS has facilitated genome and transcriptome analyses of several stone fruit trees. In this study, we conducted de novo transcriptome analyses of two peach cultivars grown in Korea. Leaves of two cultivars, referred to as Jangtaek and Mibaek, were harvested and used for library preparation. The two prepared libraries were paired-end sequenced by the HiSeq2000 system. We obtained 8.14 GB and 9.62 GB sequence data from Jangtaek and Mibaek (NCBI accession numbers: SRS1056585 and SRS1056587, respectively. The Trinity program was used to assemble two transcriptomes de novo, resulting in 110,477 (Jangtaek and 136,196 (Mibaek transcripts. TransDecoder identified possible coding regions in assembled transcripts. The identified proteins were subjected to BLASTP search against NCBI's non-redundant database for functional annotation. This study provides transcriptome data for two peach cultivars, which might be useful for genetic marker development and comparative transcriptome analyses.

  11. Transcriptomics resources of human tissues and organs

    DEFF Research Database (Denmark)

    Uhlén, Mathias; Hallström, Björn M.; Lindskog, Cecilia

    2016-01-01

    a framework for defining the molecular constituents of the human body as well as for generating comprehensive lists of proteins expressed across tissues or in a tissue-restricted manner. Here, we review publicly available human transcriptome resources and discuss body-wide data from independent genome......Quantifying the differential expression of genes in various human organs, tissues, and cell types is vital to understand human physiology and disease. Recently, several large-scale transcriptomics studies have analyzed the expression of protein-coding genes across tissues. These datasets provide...

  12. Diurnal Transcriptome and Gene Network Represented through Sparse Modeling in Brachypodium distachyon

    Directory of Open Access Journals (Sweden)

    Satoru Koda

    2017-11-01

    Full Text Available We report the comprehensive identification of periodic genes and their network inference, based on a gene co-expression analysis and an Auto-Regressive eXogenous (ARX model with a group smoothly clipped absolute deviation (SCAD method using a time-series transcriptome dataset in a model grass, Brachypodium distachyon. To reveal the diurnal changes in the transcriptome in B. distachyon, we performed RNA-seq analysis of its leaves sampled through a diurnal cycle of over 48 h at 4 h intervals using three biological replications, and identified 3,621 periodic genes through our wavelet analysis. The expression data are feasible to infer network sparsity based on ARX models. We found that genes involved in biological processes such as transcriptional regulation, protein degradation, and post-transcriptional modification and photosynthesis are significantly enriched in the periodic genes, suggesting that these processes might be regulated by circadian rhythm in B. distachyon. On the basis of the time-series expression patterns of the periodic genes, we constructed a chronological gene co-expression network and identified putative transcription factors encoding genes that might be involved in the time-specific regulatory transcriptional network. Moreover, we inferred a transcriptional network composed of the periodic genes in B. distachyon, aiming to identify genes associated with other genes through variable selection by grouping time points for each gene. Based on the ARX model with the group SCAD regularization using our time-series expression datasets of the periodic genes, we constructed gene networks and found that the networks represent typical scale-free structure. Our findings demonstrate that the diurnal changes in the transcriptome in B. distachyon leaves have a sparse network structure, demonstrating the spatiotemporal gene regulatory network over the cyclic phase transitions in B. distachyon diurnal growth.

  13. Characterization of Liaoning cashmere goat transcriptome: sequencing, de novo assembly, functional annotation and comparative analysis.

    Directory of Open Access Journals (Sweden)

    Hongliang Liu

    Full Text Available Liaoning cashmere goat is a famous goat breed for cashmere wool. In order to increase the transcriptome data and accelerate genetic improvement for this breed, we performed de novo transcriptome sequencing to generate the first expressed sequence tag dataset for the Liaoning cashmere goat, using next-generation sequencing technology.Transcriptome sequencing of Liaoning cashmere goat on a Roche 454 platform yielded 804,601 high-quality reads. Clustering and assembly of these reads produced a non-redundant set of 117,854 unigenes, comprising 13,194 isotigs and 104,660 singletons. Based on similarity searches with known proteins, 17,356 unigenes were assigned to 6,700 GO categories, and the terms were summarized into three main GO categories and 59 sub-categories. 3,548 and 46,778 unigenes had significant similarity to existing sequences in the KEGG and COG databases, respectively. Comparative analysis revealed that 42,254 unigenes were aligned to 17,532 different sequences in NCBI non-redundant nucleotide databases. 97,236 (82.51% unigenes were mapped to the 30 goat chromosomes. 35,551 (30.17% unigenes were matched to 11,438 reported goat protein-coding genes. The remaining non-matched unigenes were further compared with cattle and human reference genes, 67 putative new goat genes were discovered. Additionally, 2,781 potential simple sequence repeats were initially identified from all unigenes.The transcriptome of Liaoning cashmere goat was deep sequenced, de novo assembled, and annotated, providing abundant data to better understand the Liaoning cashmere goat transcriptome. The potential simple sequence repeats provide a material basis for future genetic linkage and quantitative trait loci analyses.

  14. Transcriptome architecture across tissues in the pig

    Directory of Open Access Journals (Sweden)

    Folch Josep M

    2008-04-01

    Full Text Available Abstract Background Artificial selection has resulted in animal breeds with extreme phenotypes. As an organism is made up of many different tissues and organs, each with its own genetic programme, it is pertinent to ask: How relevant is tissue in terms of total transcriptome variability? Which are the genes most distinctly expressed between tissues? Does breed or sex equally affect the transcriptome across tissues? Results In order to gain insight on these issues, we conducted microarray expression profiling of 16 different tissues from four animals of two extreme pig breeds, Large White and Iberian, two males and two females. Mixed model analysis and neighbor – joining trees showed that tissues with similar developmental origin clustered closer than those with different embryonic origins. Often a sound biological interpretation was possible for overrepresented gene ontology categories within differentially expressed genes between groups of tissues. For instance, an excess of nervous system or muscle development genes were found among tissues of ectoderm or mesoderm origins, respectively. Tissue accounted for ~11 times more variability than sex or breed. Nevertheless, we were able to confidently identify genes with differential expression across tissues between breeds (33 genes and between sexes (19 genes. The genes primarily affected by sex were overall different than those affected by breed or tissue. Interaction with tissue can be important for differentially expressed genes between breeds but not so much for genes whose expression differ between sexes. Conclusion Embryonic development leaves an enduring footprint on the transcriptome. The interaction in gene × tissue for differentially expressed genes between breeds suggests that animal breeding has targeted differentially each tissue's transcriptome.

  15. De novo RNA-Seq based transcriptome analysis of Papiliotrema laurentii strain RY1 under nitrogen starvation.

    Science.gov (United States)

    Sarkar, Soumyadev; Chakravorty, Somnath; Mukherjee, Avishek; Bhattacharya, Debanjana; Bhattacharya, Semantee; Gachhui, Ratan

    2018-03-01

    Nitrogen is a key nutrient for all cell forms. Most organisms respond to nitrogen scarcity by slowing down their growth rate. On the contrary, our previous studies have shown that Papiliotrema laurentii strain RY1 has a robust growth under nitrogen starvation. To understand the global regulation that leads to such an extraordinary response, we undertook a de novo approach for transcriptome analysis of the yeast. Close to 33 million sequence reads of high quality for nitrogen limited and enriched condition were generated using Illumina NextSeq500. Trinity analysis and clustered transcripts annotation of the reads produced 17,611 unigenes, out of which 14,157 could be annotated. Gene Ontology term analysis generated 44.92% cellular component terms, 39.81% molecular function terms and 15.24% biological process terms. The most over represented pathways in general were translation, carbohydrate metabolism, amino acid metabolism, general metabolism, folding, sorting, degradation followed by transport and catabolism, nucleotide metabolism, replication and repair, transcription and lipid metabolism. A total of 4256 Single Sequence Repeats were identified. Differential gene expression analysis detected 996 P-significant transcripts to reveal transmembrane transport, lipid homeostasis, fatty acid catabolism and translation as the enriched terms which could be essential for Papiliotrema laurentii strain RY1 to adapt during nitrogen deprivation. Transcriptome data was validated by quantitative real-time PCR analysis of twelve transcripts. To the best of our knowledge, this is the first report of Papiliotrema laurentii strain RY1 transcriptome which would play a pivotal role in understanding the biochemistry of the yeast under acute nitrogen stress and this study would be encouraging to initiate extensive investigations into this Papiliotrema system. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Condition-dependent transcriptome reveals high-level regulatory architecture in Bacillus subtilis

    DEFF Research Database (Denmark)

    Nicolas, Pierre; Mäder, Ulrike; Dervyn, Etienne

    2012-01-01

    Bacteria adapt to environmental stimuli by adjusting their transcriptomes in a complex manner, the full potential of which has yet to be established for any individual bacterial species. Here, we report the transcriptomes of Bacillus subtilis exposed to a wide range of environmental and nutrition...

  17. Condition-Dependent Transcriptome Reveals High-Level Regulatory Architecture in Bacillus subtilis

    NARCIS (Netherlands)

    Nicolas, Pierre; Maeder, Ulrike; Dervyn, Etienne; Rochat, Tatiana; Leduc, Aurelie; Pigeonneau, Nathalie; Bidnenko, Elena; Marchadier, Elodie; Hoebeke, Mark; Aymerich, Stephane; Becher, Doerte; Bisicchia, Paola; Botella, Eric; Delumeau, Olivier; Doherty, Geoff; Denham, Emma L.; Fogg, Mark J.; Fromion, Vincent; Goelzer, Anne; Hansen, Annette; Haertig, Elisabeth; Harwood, Colin R.; Homuth, Georg; Jarmer, Hanne; Jules, Matthieu; Klipp, Edda; Le Chat, Ludovic; Lecointe, Francois; Lewis, Peter; Liebermeister, Wolfram; March, Anika; Mars, Ruben A. T.; Nannapaneni, Priyanka; Noone, David; Pohl, Susanne; Rinn, Bernd; Ruegheimer, Frank; Sappa, Praveen K.; Samson, Franck; Schaffer, Marc; Schwikowski, Benno; Steil, Leif; Stuelke, Joerg; Wiegert, Thomas; Devine, Kevin M.; Wilkinson, Anthony J.; van Dijl, Jan Maarten; Hecker, Michael; Voelker, Uwe; Bessieres, Philippe; Noirot, Philippe

    2012-01-01

    Bacteria adapt to environmental stimuli by adjusting their transcriptomes in a complex manner, the full potential of which has yet to be established for any individual bacterial species. Here, we report the transcriptomes of Bacillus subtilis exposed to a wide range of environmental and nutritional

  18. Deep RNA Sequencing of the Skeletal Muscle Transcriptome in Swimming Fish

    NARCIS (Netherlands)

    Palstra, A.P.; Beltran, S.; Burgerhout, E.; Brittijn, S.A.; Magnoni, L.J.; Henkel, C.V.; Jansen, A.; Thillart, G.E.E.J.M.; Spaink, H.P.; Planas, J.V.

    2013-01-01

    Deep RNA sequencing (RNA-seq) was performed to provide an in-depth view of the transcriptome of red and white skeletal muscle of exercised and non-exercised rainbow trout (Oncorhynchus mykiss) with the specific objective to identify expressed genes and quantify the transcriptomic effects of

  19. Rapid and accurate pyrosequencing of angiosperm plastid genomes

    Science.gov (United States)

    Moore, Michael J; Dhingra, Amit; Soltis, Pamela S; Shaw, Regina; Farmerie, William G; Folta, Kevin M; Soltis, Douglas E

    2006-01-01

    Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20) System (454 Life Sciences Corporation), to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae) and Platanus occidentalis (Platanaceae). Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy observed in the GS 20 plastid

  20. Rapid and accurate pyrosequencing of angiosperm plastid genomes

    Directory of Open Access Journals (Sweden)

    Farmerie William G

    2006-08-01

    Full Text Available Abstract Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20 System (454 Life Sciences Corporation, to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae and Platanus occidentalis (Platanaceae. Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy

  1. Single-cell analysis of targeted transcriptome predicts drug sensitivity of single cells within human myeloma tumors.

    Science.gov (United States)

    Mitra, A K; Mukherjee, U K; Harding, T; Jang, J S; Stessman, H; Li, Y; Abyzov, A; Jen, J; Kumar, S; Rajkumar, V; Van Ness, B

    2016-05-01

    Multiple myeloma (MM) is characterized by significant genetic diversity at subclonal levels that have a defining role in the heterogeneity of tumor progression, clinical aggressiveness and drug sensitivity. Although genome profiling studies have demonstrated heterogeneity in subclonal architecture that may ultimately lead to relapse, a gene expression-based prediction program that can identify, distinguish and quantify drug response in sub-populations within a bulk population of myeloma cells is lacking. In this study, we performed targeted transcriptome analysis on 528 pre-treatment single cells from 11 myeloma cell lines and 418 single cells from 8 drug-naïve MM patients, followed by intensive bioinformatics and statistical analysis for prediction of proteasome inhibitor sensitivity in individual cells. Using our previously reported drug response gene expression profile signature at the single-cell level, we developed an R Statistical analysis package available at https://github.com/bvnlabSCATTome, SCATTome (single-cell analysis of targeted transcriptome), that restructures the data obtained from Fluidigm single-cell quantitative real-time-PCR analysis run, filters missing data, performs scaling of filtered data, builds classification models and predicts drug response of individual cells based on targeted transcriptome using an assortment of machine learning methods. Application of SCATT should contribute to clinically relevant analysis of intratumor heterogeneity, and better inform drug choices based on subclonal cellular responses.

  2. Transposable elements in the Anopheles funestus transcriptome.

    Science.gov (United States)

    Fernández-Medina, Rita D; Carareto, Claudia M A; Struchiner, Cláudio J; Ribeiro, José M C

    2017-06-01

    Transposable elements (TEs) are present in most of the eukaryotic genomes and their impact on genome evolution is increasingly recognized. Although there is extensive information on the TEs present in several eukaryotic genomes, less is known about the expression of these elements at the transcriptome level. Here we present a detailed analysis regarding the expression of TEs in Anopheles funestus, the second most important vector of human malaria in Africa. Several transcriptionally active TE families belonging both to Class I and II were identified and characterized. Interestingly, we have identified a full-length putative active element (including the presence of full length TIRs in the genomic sequence) belonging to the hAT superfamily, which presents active members in other insect genomes. This work contributes to a comprehensive understanding of the landscape of transposable elements in A. funestus transcriptome. Our results reveal that TEs are abundant and diverse in the mosquito and that most of the TE families found in the genome are represented in the mosquito transcriptome, a fact that could indicate activity of these elements.The vast diversity of TEs expressed in A. funestus suggests that there is ongoing amplification of several families in this organism.

  3. Insight into the bacterial diversity of fermentation woad dye vats as revealed by PCR-DGGE and pyrosequencing.

    Science.gov (United States)

    Milanović, Vesna; Osimani, Andrea; Taccari, Manuela; Garofalo, Cristiana; Butta, Alessandro; Clementi, Francesca; Aquilanti, Lucia

    2017-07-01

    The bacterial diversity in fermenting dye vats with woad (Isatis tinctoria L.) prepared and maintained in a functional state for approximately 12 months was examined using a combination of culture-dependent and -independent PCR-DGGE analyses and next-generation sequencing of 16S rRNA amplicons. An extremely complex ecosystem including taxa potentially contributing to both indigo reduction and formation, as well as indigo degradation was found. PCR-DGGE analyses revealed the presence of Paenibacillus lactis, Sporosarcina koreensis, Bacillus licheniformis, and Bacillus thermoamylovorans, while Bacillus thermolactis, Bacillus pumilus and Bacillus megaterium were also identified but with sequence identities lower than 97%. Dominant operational taxonomic units (OTUs) identified by pyrosequencing included Clostridium ultunense, Tissierella spp., Alcaligenes faecalis, Erysipelothrix spp., Enterococcus spp., Virgibacillus spp. and Virgibacillus panthothenicus, while sub-dominant OTUs included clostridia, alkaliphiles, halophiles, bacilli, moderately thermophilic bacteria, lactic acid bacteria, Enterobacteriaceae, aerobes, and even photosynthetic bacteria. Based on the current knowledge of indigo-reducing bacteria, it is considered that indigo-reducing bacteria constituted only a small fraction in the unique microcosm detected in the natural indigo dye vats.

  4. Transcriptome Profiling in Human Diseases: New Advances and Perspectives

    Directory of Open Access Journals (Sweden)

    Amelia Casamassimi

    2017-07-01

    Full Text Available In the last decades, transcriptome profiling has been one of the most utilized approaches to investigate human diseases at the molecular level. Through expression studies, many molecular biomarkers and therapeutic targets have been found for several human pathologies. This number is continuously increasing thanks to total RNA sequencing. Indeed, this new technology has completely revolutionized transcriptome analysis allowing the quantification of gene expression levels and allele-specific expression in a single experiment, as well as to identify novel genes, splice isoforms, fusion transcripts, and to investigate the world of non-coding RNA at an unprecedented level. RNA sequencing has also been employed in important projects, like ENCODE (Encyclopedia of the regulatory elements and TCGA (The Cancer Genome Atlas, to provide a snapshot of the transcriptome of dozens of cell lines and thousands of primary tumor specimens. Moreover, these studies have also paved the way to the development of data integration approaches in order to facilitate management and analysis of data and to identify novel disease markers and molecular targets to use in the clinics. In this scenario, several ongoing clinical trials utilize transcriptome profiling through RNA sequencing strategies as an important instrument in the diagnosis of numerous human pathologies.

  5. De novo assembly and annotation of the Antarctic copepod (Tigriopus kingsejongensis) transcriptome.

    Science.gov (United States)

    Kim, Hui-Su; Lee, Bo-Young; Han, Jeonghoon; Lee, Young Hwan; Min, Gi-Sik; Kim, Sanghee; Lee, Jae-Seong

    2016-08-01

    The whole transcriptome of the Antarctic copepod (Tigriopus kingsejongensis) was sequenced using Illumina RNA-seq. De novo assembly was performed with 64,785,098 raw reads using Trinity, which assembled into 81,653 contigs. TransDecoder found 38,250 candidate coding contigs which showed homology to other species by BLAST analysis. Functional gene annotation was performed by Gene Ontology (GO), InterProScan, and KEGG pathway analyses. Finally, we identified a number of expressed gene catalog for T. kingsejongensis that is a useful model animal for gene information-based polar research to uncover molecular mechanisms of environmental adaptation on harsh environments. In particular, we observed highly developing lipid metabolism in T. kingsejongensis directly compared to those of the Far East Pacific coast copepod Tigriopus japonicus at the transcriptome level. Copyright © 2016 Elsevier B.V. All rights reserved.

  6. Comparison of bacterial culture and 16S rRNA community profiling by clonal analysis and and pyrosequencing for the characterisation of the caries-associated microbiome

    Directory of Open Access Journals (Sweden)

    Kathrin eSchulze-Schweifing

    2014-11-01

    Full Text Available Culture-independent analyses have greatly expanded knowledge regarding the composition of complex bacterial communities including those associated with oral diseases. A consistent finding from such studies, however, has been the under-reporting of members of the phylum Actinobacteria. In this study, five pairs of broad range primers targeting 16S rRNA genes were used in clonal analysis of 6 samples collected from tooth lesions involving dentine in subjects with active caries. Samples were also subjected to cultural analysis and pyrosequencing by means of the 454 platform. A diverse bacterial community of 229 species-level taxa was revealed by culture and clonal analysis, dominated by representatives of the genera Prevotella, Lactobacillus, Selenomonas and Streptococcus. The five most abundant species were: Lactobacillus gasseri, Prevotella denticola, Alloprevotella tannerae, S. mutans and Streptococcus sp. HOT 070, which together made up 31.6 % of the sequences. Two samples were dominated by lactobacilli, while the remaining samples had low numbers of lactobacilli but significantly higher numbers of Prevotella species. The different primer pairs produced broadly similar data but proportions of the phylum Bacteroidetes were significantly higher when primer 1387R was used. All of the primer sets underestimated the proportion of Actinobacteria compared to culture. Pyrosequencing analysis of the samples was performed to a depth of sequencing of 4293 sequences per sample which were identified to 264 species-level taxa, and resulted in significantly higher coverage estimates than the clonal analysis. Pyrosequencing, however, also underestimated the relative abundance of Actinobacteria compared to culture.

  7. Mango (Mangifera indica L.) germplasm diversity based on single nucleotide polymorphisms derived from the transcriptome.

    Science.gov (United States)

    Sherman, Amir; Rubinstein, Mor; Eshed, Ravit; Benita, Miri; Ish-Shalom, Mazal; Sharabi-Schwager, Michal; Rozen, Ada; Saada, David; Cohen, Yuval; Ophir, Ron

    2015-11-14

    Germplasm collections are an important source for plant breeding, especially in fruit trees which have a long duration of juvenile period. Thus, efforts have been made to study the diversity of fruit tree collections. Even though mango is an economically important crop, most of the studies on diversity in mango collections have been conducted with a small number of genetic markers. We describe a de novo transcriptome assembly from mango cultivar 'Keitt'. Variation discovery was performed using Illumina resequencing of 'Keitt' and 'Tommy Atkins' cultivars identified 332,016 single-nucleotide polymorphisms (SNPs) and 1903 simple-sequence repeats (SSRs). Most of the SSRs (70.1%) were of trinucleotide with the preponderance of motif (GGA/AAG)n and only 23.5% were di-nucleotide SSRs with the mostly of (AT/AT)n motif. Further investigation of the diversity in the Israeli mango collection was performed based on a subset of 293 SNPs. Those markers have divided the Israeli mango collection into two major groups: one group included mostly mango accessions from Southeast Asia (Malaysia, Thailand, Indonesia) and India and the other with mainly of Floridian and Israeli mango cultivars. The latter group was more polymorphic (FS=-0.1 on the average) and was more of an admixture than the former group. A slight population differentiation was detected (FST=0.03), suggesting that if the mango accessions of the western world apparently was originated from Southeast Asia, as has been previously suggested, the duration of cultivation was not long enough to develop a distinct genetic background. Whole-transcriptome reconstruction was used to significantly broaden the mango's genetic variation resources, i.e., SNPs and SSRs. The set of SNP markers described in this study is novel. A subset of SNPs was sampled to explore the Israeli mango collection and most of them were polymorphic in many mango accessions. Therefore, we believe that these SNPs will be valuable as they recapitulate and

  8. The comparison of pyrosequencing molecular Gram stain, culture, and conventional Gram stain for diagnosing orthopaedic infections.

    Science.gov (United States)

    Kobayashi, Naomi; Bauer, Thomas W; Tuohy, Marion J; Lieberman, Isador H; Krebs, Viktor; Togawa, Daisuke; Fujishiro, Takaaki; Procop, Gary W

    2006-08-01

    We have developed a combined real-time PCR and pyrosequencing assay that successfully differentiated the vast majority of gram-positive and gram-negative bacteria when bacterial isolates were tested. The purpose of this study was to evaluate this assay on clinical specimens obtained from orthopedic surgeries, and to prospectively compare the results of "molecular Gram stain" with culture and conventional direct Gram stain. Forty-five surgical specimens were obtained from patients who underwent orthopedic surgery procedures. The DNA was extracted and a set of broad-range PCR primers that targeted a part of the 16S rDNA gene was used for pan-bacterial PCR. The amplicons were submitted for pyrosequencing and the resulting molecular Gram stain characteristics were recorded. Culture and direct Gram staining were performed using standard methods for all cases. Surgical specimens were reviewed histologically for all cases that had a discrepancy between culture and molecular results. There was an 86.7% (39/45) agreement between the traditional and molecular methods. In 12/14 (85.7%) culture-proven cases of bacterial infection, molecular Gram stain characteristics were in agreement with the culture results, while the conventional Gram stain result was in agreement only for five cases (35.7%). In the 31 culture negative cases, 27 cases were also PCR negative, whereas 4 were PCR positive. Three of these were characterized as gram negative and one as gram positive by this molecular method. Molecular determination of the Gram stain characteristics of bacteria that cause orthopedic infections may be achieved, in most instances, by this method. Further studies are necessary to understand the clinical importance of PCR-positive/culture-negative results.

  9. Development of high-throughput SNP-based genotyping in Acacia auriculiformis x A. mangium hybrids using short-read transcriptome data

    Directory of Open Access Journals (Sweden)

    Wong Melissa ML

    2012-12-01

    Full Text Available Abstract Background Next Generation Sequencing has provided comprehensive, affordable and high-throughput DNA sequences for Single Nucleotide Polymorphism (SNP discovery in Acacia auriculiformis and Acacia mangium. Like other non-model species, SNP detection and genotyping in Acacia are challenging due to lack of genome sequences. The main objective of this study is to develop the first high-throughput SNP genotyping assay for linkage map construction of A. auriculiformis x A. mangium hybrids. Results We identified a total of 37,786 putative SNPs by aligning short read transcriptome data from four parents of two Acacia hybrid mapping populations using Bowtie against 7,839 de novo transcriptome contigs. Given a set of 10 validated SNPs from two lignin genes, our in silico SNP detection approach is highly accurate (100% compared to the traditional in vitro approach (44%. Further validation of 96 SNPs using Illumina GoldenGate Assay gave an overall assay success rate of 89.6% and conversion rate of 37.5%. We explored possible factors lowering assay success rate by predicting exon-intron boundaries and paralogous genes of Acacia contigs using Medicago truncatula genome as reference. This assessment revealed that presence of exon-intron boundary is the main cause (50% of assay failure. Subsequent SNPs filtering and improved assay design resulted in assay success and conversion rate of 92.4% and 57.4%, respectively based on 768 SNPs genotyping. Analysis of clustering patterns revealed that 27.6% of the assays were not reproducible and flanking sequence might play a role in determining cluster compression. In addition, we identified a total of 258 and 319 polymorphic SNPs in A. auriculiformis and A. mangium natural germplasms, respectively. Conclusion We have successfully discovered a large number of SNP markers in A. auriculiformis x A. mangium hybrids using next generation transcriptome sequencing. By using a reference genome from the most closely

  10. Prerequisites for Amplicon Pyrosequencing of Microbial Methanol Utilizers in the Environment

    Directory of Open Access Journals (Sweden)

    Steffen eKolb

    2013-09-01

    Full Text Available The commercial availability of next generation sequencing (NGS technologies facilitated the assessment of functional groups of microorganisms in the environment with high coverage, resolution, and reproducibility. Soil methylotrophs were among the first microorganisms in the environment that were assessed with molecular tools, and nowadays, as well with NGS technologies. Studies in the past years re-attracted notice to the pivotal role of methylotrophs in global conversions of methanol, which mainly originates from plants, and is involved in oxidative reactions and ozone formation in the atmosphere. Aerobic methanol utilizers belong to Bacteria, yeasts, Ascomycota, and molds. Numerous bacterial methylotrophs are facultatively aerobic, and also contribute to anaerobic methanol oxidation in the environment, whereas strict anaerobic methanol utilizers belong to methanogens and acetogens. The diversity of enzymes catalyzing the initial oxidation of methanol is considerable, and comprises at least five different enzyme types in aerobes, and one in strict anaerobes. Only the gene of the large subunit of PQQ-dependent methanol dehydrogenase (mxaF has been analyzed by environmental pyrosequencing. To enable a comprehensive assessment of methanol utilizers in the environment, new primers targeting genes of the PQQ MDH in Methylibium (mdh2, of the NAD-dependent MDH (mdh, of the methanol oxidoreductase of Actinobacteria (mdo, of the fungal FAD-dependent alcohol oxidase (mod1, mod2, and homologues, and of the gene of the large subunit of the methanol:corrinoid methyltransferases (mtaC in methanogens and acetogens need to be developed. Combined stable isotope probing of nucleic acids or proteins with amplicon-based NGS are straightforward approaches to reveal insights into functions of certain methylotrophic taxa in the global methanol cycle.

  11. Field efficacy of four anthelmintics and confirmation of drug-resistant nematodes by controlled efficacy test and pyrosequencing on a sheep and goat farm in Denmark

    DEFF Research Database (Denmark)

    Pena-Espinoza, Miguel Angel; Thamsborg, Stig M.; Demeler, Janina

    2014-01-01

    and Trichostrongylus colubriformis isolated from adult goats on the farm. Recovered specimens of H. contortus were subjected to pyrosequencing for detection of single nucleotide polymorphisms (SNPs) related to benzimidazole (BZ) resistance. During the FECRT, FECs in untreated lambs dropped significantly by 47%. No FEC...

  12. N-of-1-pathways MixEnrich: advancing precision medicine via single-subject analysis in discovering dynamic changes of transcriptomes.

    Science.gov (United States)

    Li, Qike; Schissler, A Grant; Gardeux, Vincent; Achour, Ikbel; Kenost, Colleen; Berghout, Joanne; Li, Haiquan; Zhang, Hao Helen; Lussier, Yves A

    2017-05-24

    Transcriptome analytic tools are commonly used across patient cohorts to develop drugs and predict clinical outcomes. However, as precision medicine pursues more accurate and individualized treatment decisions, these methods are not designed to address single-patient transcriptome analyses. We previously developed and validated the N-of-1-pathways framework using two methods, Wilcoxon and Mahalanobis Distance (MD), for personal transcriptome analysis derived from a pair of samples of a single patient. Although, both methods uncover concordantly dysregulated pathways, they are not designed to detect dysregulated pathways with up- and down-regulated genes (bidirectional dysregulation) that are ubiquitous in biological systems. We developed N-of-1-pathways MixEnrich, a mixture model followed by a gene set enrichment test, to uncover bidirectional and concordantly dysregulated pathways one patient at a time. We assess its accuracy in a comprehensive simulation study and in a RNA-Seq data analysis of head and neck squamous cell carcinomas (HNSCCs). In presence of bidirectionally dysregulated genes in the pathway or in presence of high background noise, MixEnrich substantially outperforms previous single-subject transcriptome analysis methods, both in the simulation study and the HNSCCs data analysis (ROC Curves; higher true positive rates; lower false positive rates). Bidirectional and concordant dysregulated pathways uncovered by MixEnrich in each patient largely overlapped with the quasi-gold standard compared to other single-subject and cohort-based transcriptome analyses. The greater performance of MixEnrich presents an advantage over previous methods to meet the promise of providing accurate personal transcriptome analysis to support precision medicine at point of care.

  13. Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome.

    Science.gov (United States)

    Weisberg, Alexandra J; Kim, Gunjune; Westwood, James H; Jelesko, John G

    2017-11-10

    Contact with poison ivy plants is widely dreaded because they produce a natural product called urushiol that is responsible for allergenic contact delayed-dermatitis symptoms lasting for weeks. For this reason, the catchphrase most associated with poison ivy is "leaves of three, let it be", which serves the purpose of both identification and an appeal for avoidance. Ironically, despite this notoriety, there is a dearth of specific knowledge about nearly all other aspects of poison ivy physiology and ecology. As a means of gaining a more molecular-oriented understanding of poison ivy physiology and ecology, Next Generation DNA sequencing technology was used to develop poison ivy root and leaf RNA-seq transcriptome resources. De novo assembled transcriptomes were analyzed to generate a core set of high quality expressed transcripts present in poison ivy tissue. The predicted protein sequences were evaluated for similarity to SwissProt homologs and InterProScan domains, as well as assigned both GO terms and KEGG annotations. Over 23,000 simple sequence repeats were identified in the transcriptome, and corresponding oligo nucleotide primer pairs were designed. A pan-transcriptome analysis of existing Anacardiaceae transcriptomes revealed conserved and unique transcripts among these species.

  14. Probing the evolution, ecology and physiology of marine protists using transcriptomics.

    Science.gov (United States)

    Caron, David A; Alexander, Harriet; Allen, Andrew E; Archibald, John M; Armbrust, E Virginia; Bachy, Charles; Bell, Callum J; Bharti, Arvind; Dyhrman, Sonya T; Guida, Stephanie M; Heidelberg, Karla B; Kaye, Jonathan Z; Metzner, Julia; Smith, Sarah R; Worden, Alexandra Z

    2017-01-01

    Protists, which are single-celled eukaryotes, critically influence the ecology and chemistry of marine ecosystems, but genome-based studies of these organisms have lagged behind those of other microorganisms. However, recent transcriptomic studies of cultured species, complemented by meta-omics analyses of natural communities, have increased the amount of genetic information available for poorly represented branches on the tree of eukaryotic life. This information is providing insights into the adaptations and interactions between protists and other microorganisms and macroorganisms, but many of the genes sequenced show no similarity to sequences currently available in public databases. A better understanding of these newly discovered genes will lead to a deeper appreciation of the functional diversity and metabolic processes in the ocean. In this Review, we summarize recent developments in our understanding of the ecology, physiology and evolution of protists, derived from transcriptomic studies of cultured strains and natural communities, and discuss how these novel large-scale genetic datasets will be used in the future.

  15. Bacillus anthracis genome organization in light of whole transcriptome sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Jeffrey; Zhu, Wenhan; Passalacqua, Karla D.; Bergman, Nicholas; Borodovsky, Mark

    2010-03-22

    Emerging knowledge of whole prokaryotic transcriptomes could validate a number of theoretical concepts introduced in the early days of genomics. What are the rules connecting gene expression levels with sequence determinants such as quantitative scores of promoters and terminators? Are translation efficiency measures, e.g. codon adaptation index and RBS score related to gene expression? We used the whole transcriptome shotgun sequencing of a bacterial pathogen Bacillus anthracis to assess correlation of gene expression level with promoter, terminator and RBS scores, codon adaptation index, as well as with a new measure of gene translational efficiency, average translation speed. We compared computational predictions of operon topologies with the transcript borders inferred from RNA-Seq reads. Transcriptome mapping may also improve existing gene annotation. Upon assessment of accuracy of current annotation of protein-coding genes in the B. anthracis genome we have shown that the transcriptome data indicate existence of more than a hundred genes missing in the annotation though predicted by an ab initio gene finder. Interestingly, we observed that many pseudogenes possess not only a sequence with detectable coding potential but also promoters that maintain transcriptional activity.

  16. Transcriptome analysis and comparison reveal divergence between two invasive whitefly cryptic species

    Directory of Open Access Journals (Sweden)

    Xia Jun

    2011-09-01

    Full Text Available Abstract Background Invasive species are valuable model systems for examining the evolutionary processes and molecular mechanisms associated with their specific characteristics by comparison with closely related species. Over the past 20 years, two species of the whitefly Bemisia tabaci species complex, Middle East-Asia Minor 1 (MEAM1 and Mediterranean (MED, have both spread from their origin Middle East/Mediterranean to many countries despite their apparent differences in many life history parameters. Previously, we have sequenced the transcriptome of MED. In this study, we sequenced the transcriptome of MEAM1 and took a comparative genomic approach to investigate the transcriptome evolution and the genetic factors underlying the differences between MEAM1 and MED. Results Using Illumina sequencing technology, we generated 17 million sequencing reads for MEAM1. These reads were assembled into 57,741 unique sequences and 15,922 sequences were annotated with an E-value above 10-5. Compared with the MED transcriptome, we identified 3,585 pairs of high quality orthologous genes and inferred their sequence divergences. The average differences in coding, 5' untranslated and 3' untranslated region were 0.83%, 1.66% and 1.43%, respectively. The level of sequence divergence provides additional support to the proposition that MEAM1 and MED are two species. Based on the ratio of nonsynonymous and synonymous substitutions, we identified 24 sequences that have evolved in response to positive selection. Many of those genes are predicted to be involved in metabolism and insecticide resistance which might contribute to the divergence of the two whitefly species. Conclusions Our data present a comprehensive sequence comparison between the two invasive whitefly species. This study will provide a road map for future investigations on the molecular mechanisms underlying their biological differences.

  17. Characterizing Ancylostoma caninum transcriptome and exploring nematode parasitic adaptation

    Directory of Open Access Journals (Sweden)

    Hawdon John

    2010-05-01

    Full Text Available Abstract Background Hookworm infection is one of the most important neglected diseases in developing countries, with approximately 1 billion people infected worldwide. To better understand hookworm biology and nematode parasitism, the present study generated a near complete transcriptome of the canine hookworm Ancylostoma caninum to a very high coverage using high throughput technology, and compared it to those of the free-living nematode Caenorhabditis elegans and the parasite Brugia malayi. Results The generated transcripts from four developmental stages, infective L3, serum stimulated L3, adult male and adult female, covered 93% of the A. caninum transcriptome. The broad diversity among nematode transcriptomes was confirmed, and an impact of parasitic adaptation on transcriptome diversity was inferred. Intra-population analysis showed that A. caninum has higher coding sequence diversity than humans. Examining the developmental expression profiles of A. caninum revealed major transitions in gene expression from larval stages to adult. Adult males expressed the highest number of selectively expressed genes, but adult female expressed the highest number of selective parasitism-related genes. Genes related to parasitism adaptation and A. caninum specific genes exhibited more expression selectivity while those conserved in nematodes tend to be consistently expressed. Parasitism related genes were expressed more selectively in adult male and female worms. The comprehensive analysis of digital expression profiles along with transcriptome comparisons enabled identification of a set of parasitism genes encoding secretory proteins in animal parasitic nematode. Conclusions This study validated the usage of deep sequencing for gene expression profiling. Parasitic adaptation of the canine hookworm is related to its diversity and developmental dynamics. This comprehensive comparative genomic and expression study substantially improves our understanding of

  18. Microbiome and ecotypic adaption of Holcus lanatus (L.) to extremes of its soil pH range, investigated through transcriptome sequencing.

    Science.gov (United States)

    Young, Ellen; Carey, Manus; Meharg, Andrew A; Meharg, Caroline

    2018-03-20

    the most common genus in shoots, with Colletotrichum and Rhizophagus (AM fungi) most numerous in limestone soil roots. The latter coincided with upregulation of plant genes involved in AM symbiosis initiation and AM-based P acquisition in an environment where P availability is low. Meta-transcriptome analyses provided novel insights into H. lanatus transcriptome responses, associated eukaryotic microbiota functions and taxonomic community composition. Significant edaphic and plant ecotype effects were identified, demonstrating that meta-transcriptome-based functional analysis is a powerful tool for the study of natural plant-microbiome interactions.

  19. Structural covariance of brain region volumes is associated with both structural connectivity and transcriptomic similarity.

    Science.gov (United States)

    Yee, Yohan; Fernandes, Darren J; French, Leon; Ellegood, Jacob; Cahill, Lindsay S; Vousden, Dulcie A; Spencer Noakes, Leigh; Scholz, Jan; van Eede, Matthijs C; Nieman, Brian J; Sled, John G; Lerch, Jason P

    2018-05-18

    An organizational pattern seen in the brain, termed structural covariance, is the statistical association of pairs of brain regions in their anatomical properties. These associations, measured across a population as covariances or correlations usually in cortical thickness or volume, are thought to reflect genetic and environmental underpinnings. Here, we examine the biological basis of structural volume covariance in the mouse brain. We first examined large scale associations between brain region volumes using an atlas-based approach that parcellated the entire mouse brain into 318 regions over which correlations in volume were assessed, for volumes obtained from 153 mouse brain images via high-resolution MRI. We then used a seed-based approach and determined, for 108 different seed regions across the brain and using mouse gene expression and connectivity data from the Allen Institute for Brain Science, the variation in structural covariance data that could be explained by distance to seed, transcriptomic similarity to seed, and connectivity to seed. We found that overall, correlations in structure volumes hierarchically clustered into distinct anatomical systems, similar to findings from other studies and similar to other types of networks in the brain, including structural connectivity and transcriptomic similarity networks. Across seeds, this structural covariance was significantly explained by distance (17% of the variation, up to a maximum of 49% for structural covariance to the visceral area of the cortex), transcriptomic similarity (13% of the variation, up to maximum of 28% for structural covariance to the primary visual area) and connectivity (15% of the variation, up to a maximum of 36% for structural covariance to the intermediate reticular nucleus in the medulla) of covarying structures. Together, distance, connectivity, and transcriptomic similarity explained 37% of structural covariance, up to a maximum of 63% for structural covariance to the

  20. ROSMETER: a bioinformatic tool for the identification of transcriptomic imprints related to reactive oxygen species type and origin provides new insights into stress responses.

    Science.gov (United States)

    Rosenwasser, Shilo; Fluhr, Robert; Joshi, Janak Raj; Leviatan, Noam; Sela, Noa; Hetzroni, Amotz; Friedman, Haya

    2013-10-01

    The chemical identity of the reactive oxygen species (ROS) and its subcellular origin will leave a specific imprint on the transcriptome response. In order to facilitate the appreciation of ROS signaling, we developed a tool that is tuned to qualify this imprint. Transcriptome data from experiments in Arabidopsis (Arabidopsis thaliana) for which the ROS type and organelle origin are known were compiled into indices and made accessible by a Web-based interface called ROSMETER. The ROSMETER algorithm uses a vector-based algorithm to portray the ROS signature for a given transcriptome. The ROSMETER platform was applied to identify the ROS signatures profiles in transcriptomes of senescing plants and of those exposed to abiotic and biotic stresses. An unexpected highly significant ROS transcriptome signature of mitochondrial stress was detected during the early presymptomatic stages of leaf senescence, which was accompanied by the specific oxidation of mitochondria-targeted redox-sensitive green fluorescent protein probe. The ROSMETER analysis of diverse stresses revealed both commonalties and prominent differences between various abiotic stress conditions, such as salt, cold, ultraviolet light, drought, heat, and pathogens. Interestingly, early responses to the various abiotic stresses clustered together, independent of later responses, and exhibited negative correlations to several ROS indices. In general, the ROS transcriptome signature of abiotic stresses showed limited correlation to a few indices, while biotic stresses showed broad correlation with multiple indices. The ROSMETER platform can assist in formulating hypotheses to delineate the role of ROS in plant acclimation to environmental stress conditions and to elucidate the molecular mechanisms of the oxidative stress response in plants.

  1. A transcriptome-based assessment of the astrocytic dystrophin-associated complex in the developing human brain.

    Science.gov (United States)

    Simon, Matthew J; Murchison, Charles; Iliff, Jeffrey J

    2018-02-01

    Astrocytes play a critical role in regulating the interface between the cerebral vasculature and the central nervous system. Contributing to this is the astrocytic endfoot domain, a specialized structure that ensheathes the entirety of the vasculature and mediates signaling between endothelial cells, pericytes, and neurons. The astrocytic endfoot has been implicated as a critical element of the glymphatic pathway, and changes in protein expression profiles in this cellular domain are linked to Alzheimer's disease pathology. Despite this, basic physiological properties of this structure remain poorly understood including the developmental timing of its formation, and the protein components that localize there to mediate its functions. Here we use human transcriptome data from male and female subjects across several developmental stages and brain regions to characterize the gene expression profile of the dystrophin-associated complex (DAC), a known structural component of the astrocytic endfoot that supports perivascular localization of the astroglial water channel aquaporin-4. Transcriptomic profiling is also used to define genes exhibiting parallel expression profiles to DAC elements, generating a pool of candidate genes that encode gene products that may contribute to the physiological function of the perivascular astrocytic endfoot domain. We found that several genes encoding transporter proteins are transcriptionally associated with DAC genes. © 2017 Wiley Periodicals, Inc.

  2. Sequencing and de novo assembly of the transcriptome of the glassy-winged sharpshooter (Homalodisca vitripennis.

    Directory of Open Access Journals (Sweden)

    Raja Sekhar Nandety

    Full Text Available BACKGROUND: The glassy-winged sharpshooter Homalodisca vitripennis (Hemiptera: Cicadellidae, is a xylem-feeding leafhopper and important vector of the bacterium Xylella fastidiosa; the causal agent of Pierce's disease of grapevines. The functional complexity of the transcriptome of H. vitripennis has not been elucidated thus far. It is a necessary blueprint for an understanding of the development of H. vitripennis and for designing efficient biorational control strategies including those based on RNA interference. RESULTS: Here we elucidate and explore the transcriptome of adult H. vitripennis using high-throughput paired end deep sequencing and de novo assembly. A total of 32,803,656 paired-end reads were obtained with an average transcript length of 624 nucleotides. We assembled 32.9 Mb of the transcriptome of H. vitripennis that spanned across 47,265 loci and 52,708 transcripts. Comparison of our non-redundant database showed that 45% of the deduced proteins of H. vitripennis exhibit identity (e-value ≤1(-5 with known proteins. We assigned Gene Ontology (GO terms, Kyoto Encyclopedia of Genes and Genomes (KEGG annotations, and potential Pfam domains to each transcript isoform. In order to gain insight into the molecular basis of key regulatory genes of H. vitripennis, we characterized predicted proteins involved in the metabolism of juvenile hormone, and biogenesis of small RNAs (Dicer and Piwi sequences from the transcriptomic sequences. Analysis of transposable element sequences of H. vitripennis indicated that the genome is less expanded in comparison to many other insects with approximately 1% of the transcriptome carrying transposable elements. CONCLUSIONS: Our data significantly enhance the molecular resources available for future study and control of this economically important hemipteran. This transcriptional information not only provides a more nuanced understanding of the underlying biological and physiological mechanisms that

  3. Flower bud transcriptome analysis of Sapium sebiferum (Linn.) Roxb. and primary investigation of drought induced flowering: pathway construction and G-quadruplex prediction based on transcriptome.

    Science.gov (United States)

    Yang, Minglei; Wu, Ying; Jin, Shan; Hou, Jinyan; Mao, Yingji; Liu, Wenbo; Shen, Yangcheng; Wu, Lifang

    2015-01-01

    Sapium sebiferum (Linn.) Roxb. (Chinese Tallow Tree) is a perennial woody tree and its seeds are rich in oil which hold great potential for biodiesel production. Despite a traditional woody oil plant, our understanding on S. sebiferum genetics and molecular biology remains scant. In this study, the first comprehensive transcriptome of S. sebiferum flower has been generated by sequencing and de novo assembly. A total of 149,342 unigenes were generated from raw reads, of which 24,289 unigenes were successfully matched to public database. A total of 61 MADS box genes and putative pathways involved in S. sebiferum flower development have been identified. Abiotic stress response network was also constructed in this work, where 2,686 unigenes are involved in the pathway. As for lipid biosynthesis, 161 unigenes have been identified in fatty acid (FA) and triacylglycerol (TAG) biosynthesis. Besides, the G-Quadruplexes in RNA of S. sebiferum also have been predicted. An interesting finding is that the stress-induced flowering was observed in S. sebiferum for the first time. According to the results of semi-quantitative PCR, expression tendencies of flowering-related genes, GA1, AP2 and CRY2, accorded with stress-related genes, such as GRX50435 and PRXⅡ39562. This transcriptome provides functional genomic information for further research of S. sebiferum, especially for the genetic engineering to shorten the juvenile period and improve yield by regulating flower development. It also offers a useful database for the research of other Euphorbiaceae family plants.

  4. Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data.

    Science.gov (United States)

    Aliper, Alexander; Plis, Sergey; Artemov, Artem; Ulloa, Alvaro; Mamoshina, Polina; Zhavoronkov, Alex

    2016-07-05

    Deep learning is rapidly advancing many areas of science and technology with multiple success stories in image, text, voice and video recognition, robotics, and autonomous driving. In this paper we demonstrate how deep neural networks (DNN) trained on large transcriptional response data sets can classify various drugs to therapeutic categories solely based on their transcriptional profiles. We used the perturbation samples of 678 drugs across A549, MCF-7, and PC-3 cell lines from the LINCS Project and linked those to 12 therapeutic use categories derived from MeSH. To train the DNN, we utilized both gene level transcriptomic data and transcriptomic data processed using a pathway activation scoring algorithm, for a pooled data set of samples perturbed with different concentrations of the drug for 6 and 24 hours. In both pathway and gene level classification, DNN achieved high classification accuracy and convincingly outperformed the support vector machine (SVM) model on every multiclass classification problem, however, models based on pathway level data performed significantly better. For the first time we demonstrate a deep learning neural net trained on transcriptomic data to recognize pharmacological properties of multiple drugs across different biological systems and conditions. We also propose using deep neural net confusion matrices for drug repositioning. This work is a proof of principle for applying deep learning to drug discovery and development.

  5. Acclimation of Saccharomyces cerevisiae to low temperature: a chemostat-based transcriptome analysis.

    Science.gov (United States)

    Tai, Siew Leng; Daran-Lapujade, Pascale; Walsh, Michael C; Pronk, Jack T; Daran, Jean-Marc

    2007-12-01

    Effects of suboptimal temperatures on transcriptional regulation in yeast have been extensively studied in batch cultures. To eliminate indirect effects of specific growth rates that are inherent to batch-cultivation studies, genome-wide transcriptional responses to low temperatures were analyzed in steady-state chemostats, grown at a fixed specific growth rate (0.03 h(-1)). Although in vivo metabolic fluxes were essentially the same in cultures grown at 12 and at 30 degrees C, concentrations of the growth-limiting nutrients (glucose or ammonia) were higher at 12 degrees C. This difference was reflected by transcript levels of genes that encode transporters for the growth-limiting nutrients. Several transcriptional responses to low temperature occurred under both nutrient-limitation regimes. Increased transcription of ribosome-biogenesis genes emphasized the importance of adapting protein-synthesis capacity to low temperature. In contrast to observations in cold-shock and batch-culture studies, transcript levels of environmental stress response genes were reduced at 12 degrees C. Transcription of trehalose-biosynthesis genes and intracellular trehalose levels indicated that, in contrast to its role in cold-shock adaptation, trehalose is not involved in steady-state low-temperature adaptation. Comparison of the chemostat-based transcriptome data with literature data revealed large differences between transcriptional reprogramming during long-term low-temperature acclimation and the transcriptional responses to a rapid transition to low temperature.

  6. Major differences between human atopic dermatitis and murine models as determined by global transcriptomic profiling

    DEFF Research Database (Denmark)

    Ewald, David Adrian; Noda, Shinji; Oliva, Margeaux

    2017-01-01

    , and a comparison of these models with the human AD transcriptomic fingerprint is lacking. We sought to evaluate the transcriptomic profiles of six common murine models and determine how they relate to human AD skin. Transcriptomic profiling was performed using microarrays and qRT-PCR on biopsies from NC/Nga, flaky...

  7. Imaging-based surrogate markers of transcriptome subclasses and signatures in hepatocellular carcinoma. Preliminary results

    Energy Technology Data Exchange (ETDEWEB)

    Taouli, Bachir [Icahn School of Medicine at Mount Sinai, Department of Radiology, New York, NY (United States); Icahn School of Medicine at Mount Sinai, Translational and Molecular Imaging Institute, New York, NY (United States); Icahn School of Medicine at Mount Sinai, Liver Cancer Program, Tisch Cancer Institute, New York, NY (United States); Hoshida, Yujin; Chen, Xintong; Sun, Xiaochen; Kojima, Kensuke; Toffanin, Sara; Hirschfield, Hadassa [Icahn School of Medicine at Mount Sinai, Liver Cancer Program, Tisch Cancer Institute, New York, NY (United States); Icahn School of Medicine at Mount Sinai, Division of Liver Diseases, Department of Medicine, New York, NY (United States); Kakite, Suguru [Icahn School of Medicine at Mount Sinai, Translational and Molecular Imaging Institute, New York, NY (United States); Tottori University, Division of Radiology, Department of Pathophysiological and Therapeutic Science, Faculty of Medicine, Yonago City (Japan); Tan, Poh Seng [Icahn School of Medicine at Mount Sinai, Liver Cancer Program, Tisch Cancer Institute, New York, NY (United States); Icahn School of Medicine at Mount Sinai, Division of Liver Diseases, Department of Medicine, New York, NY (United States); National University Health System, Division of Gastroenterology and Hepatology, University Medicine Cluster, Singapore (Singapore); Kihira, Shingo [Icahn School of Medicine at Mount Sinai, Department of Radiology, New York, NY (United States); Fiel, M.I. [Icahn School of Medicine at Mount Sinai, Department of Pathology, New York, NY (United States); Wagner, Mathilde [Icahn School of Medicine at Mount Sinai, Translational and Molecular Imaging Institute, New York, NY (United States); Sorbonne Universites, UPMC, Department of Radiology, Hopital Pitie-Salpetriere, Paris (France); Llovet, Josep M. [Icahn School of Medicine at Mount Sinai, Liver Cancer Program, Tisch Cancer Institute, New York, NY (United States); Icahn School of Medicine at Mount Sinai, Division of Liver Diseases, Department of Medicine, New York, NY (United States); Universitat de Barcelona, HCC Translational Research Laboratory, Barcelona-Clinic Liver Cancer Group Institut d' Investigacions Biomediques August Pi i Sunyer (IDIBAPS), Hospital Clinic de Barcelona, Barcelona (Spain); Institucio Catalana de Recerca i Estudis Avancats, Barcelona (Spain)

    2017-11-15

    In this preliminary study, we examined whether imaging-based phenotypes are associated with reported predictive gene signatures in hepatocellular carcinoma (HCC). Thirty-eight patients (M/F 30/8, mean age 61 years) who underwent pre-operative CT or MR imaging before surgery as well as transcriptome profiling were included in this IRB-approved single-centre retrospective study. Eleven qualitative and four quantitative imaging traits (size, enhancement ratios, wash-out ratio, tumour-to-liver contrast ratios) were assessed by three observers and were correlated with 13 previously reported HCC gene signatures using logistic regression analysis. Thirty-nine HCC tumours (mean size 5.7 ± 3.2 cm) were assessed. Significant positive associations were observed between certain imaging traits and gene signatures of aggressive HCC phenotype (G3-Boyault, Proliferation-Chiang profiles, CK19-Villanueva, S1/S2-Hoshida) with odds ratios ranging from 4.44-12.73 (P <0.045). Infiltrative pattern at imaging was significantly associated with signatures of microvascular invasion and aggressive phenotype. Significant but weak associations were also observed between each enhancement ratio and tumour-to-liver contrast ratios and certain gene expression profiles. This preliminary study demonstrates a correlation between phenotypic imaging traits with gene signatures of aggressive HCC, which warrants further prospective validation to establish imaging-based surrogate markers of molecular phenotypes in HCC. (orig.)

  8. Imaging-based surrogate markers of transcriptome subclasses and signatures in hepatocellular carcinoma. Preliminary results

    International Nuclear Information System (INIS)

    Taouli, Bachir; Hoshida, Yujin; Chen, Xintong; Sun, Xiaochen; Kojima, Kensuke; Toffanin, Sara; Hirschfield, Hadassa; Kakite, Suguru; Tan, Poh Seng; Kihira, Shingo; Fiel, M.I.; Wagner, Mathilde; Llovet, Josep M.

    2017-01-01

    In this preliminary study, we examined whether imaging-based phenotypes are associated with reported predictive gene signatures in hepatocellular carcinoma (HCC). Thirty-eight patients (M/F 30/8, mean age 61 years) who underwent pre-operative CT or MR imaging before surgery as well as transcriptome profiling were included in this IRB-approved single-centre retrospective study. Eleven qualitative and four quantitative imaging traits (size, enhancement ratios, wash-out ratio, tumour-to-liver contrast ratios) were assessed by three observers and were correlated with 13 previously reported HCC gene signatures using logistic regression analysis. Thirty-nine HCC tumours (mean size 5.7 ± 3.2 cm) were assessed. Significant positive associations were observed between certain imaging traits and gene signatures of aggressive HCC phenotype (G3-Boyault, Proliferation-Chiang profiles, CK19-Villanueva, S1/S2-Hoshida) with odds ratios ranging from 4.44-12.73 (P <0.045). Infiltrative pattern at imaging was significantly associated with signatures of microvascular invasion and aggressive phenotype. Significant but weak associations were also observed between each enhancement ratio and tumour-to-liver contrast ratios and certain gene expression profiles. This preliminary study demonstrates a correlation between phenotypic imaging traits with gene signatures of aggressive HCC, which warrants further prospective validation to establish imaging-based surrogate markers of molecular phenotypes in HCC. (orig.)

  9. Insights into the development and evolution of exaggerated traits using de novo transcriptomes of two species of horned scarab beetles.

    Directory of Open Access Journals (Sweden)

    Ian A Warren

    Full Text Available Scarab beetles exhibit an astonishing variety of rigid exo-skeletal outgrowths, known as "horns". These traits are often sexually dimorphic and vary dramatically across species in size, shape, location, and allometry with body size. In many species, the horn exhibits disproportionate growth resulting in an exaggerated allometric relationship with body size, as compared to other traits, such as wings, that grow proportionately with body size. Depending on the species, the smallest males either do not produce a horn at all, or they produce a disproportionately small horn for their body size. While the diversity of horn shapes and their behavioural ecology have been reasonably well studied, we know far less about the proximate mechanisms that regulate horn growth. Thus, using 454 pyrosequencing, we generated transcriptome profiles, during horn growth and development, in two different scarab beetle species: the Asian rhinoceros beetle, Trypoxylus dichotomus, and the dung beetle, Onthophagus nigriventris. We obtained over half a million reads for each species that were assembled into over 6,000 and 16,000 contigs respectively. We combined these data with previously published studies to look for signatures of molecular evolution. We found a small subset of genes with horn-biased expression showing evidence for recent positive selection, as is expected with sexual selection on horn size. We also found evidence of relaxed selection present in genes that demonstrated biased expression between horned and horn-less morphs, consistent with the theory of developmental decoupling of phenotypically plastic traits.

  10. Insights into the development and evolution of exaggerated traits using de novo transcriptomes of two species of horned scarab beetles.

    Science.gov (United States)

    Warren, Ian A; Vera, J Cristobal; Johns, Annika; Zinna, Robert; Marden, James H; Emlen, Douglas J; Dworkin, Ian; Lavine, Laura C

    2014-01-01

    Scarab beetles exhibit an astonishing variety of rigid exo-skeletal outgrowths, known as "horns". These traits are often sexually dimorphic and vary dramatically across species in size, shape, location, and allometry with body size. In many species, the horn exhibits disproportionate growth resulting in an exaggerated allometric relationship with body size, as compared to other traits, such as wings, that grow proportionately with body size. Depending on the species, the smallest males either do not produce a horn at all, or they produce a disproportionately small horn for their body size. While the diversity of horn shapes and their behavioural ecology have been reasonably well studied, we know far less about the proximate mechanisms that regulate horn growth. Thus, using 454 pyrosequencing, we generated transcriptome profiles, during horn growth and development, in two different scarab beetle species: the Asian rhinoceros beetle, Trypoxylus dichotomus, and the dung beetle, Onthophagus nigriventris. We obtained over half a million reads for each species that were assembled into over 6,000 and 16,000 contigs respectively. We combined these data with previously published studies to look for signatures of molecular evolution. We found a small subset of genes with horn-biased expression showing evidence for recent positive selection, as is expected with sexual selection on horn size. We also found evidence of relaxed selection present in genes that demonstrated biased expression between horned and horn-less morphs, consistent with the theory of developmental decoupling of phenotypically plastic traits.

  11. Transcriptome alterations in zebrafish embryos after exposure to environmental estrogens and anti-androgens can reveal endocrine disruption.

    Science.gov (United States)

    Schiller, Viktoria; Wichmann, Arne; Kriehuber, Ralf; Schäfers, Christoph; Fischer, Rainer; Fenske, Martina

    2013-12-01

    Exposure to environmental chemicals known as endocrine disruptors (EDs) is in many cases associated with an unpredictable hazard for wildlife and human health. The identification of endocrine disruptive properties of chemicals certain to enter the aquatic environment relies on toxicity tests with fish, assessing adverse effects on reproduction and sexual development. The demand for quick, reliable ED assays favored the use of fish embryos as alternative test organisms. We investigated the application of a transcriptomics-based assay for estrogenic and anti-androgenic chemicals with zebrafish embryos. Two reference compounds, 17α-ethinylestradiol and flutamide, were tested to evaluate the effects on development and the transcriptome after 48h-exposures. Comparison of the transcriptome response with other estrogenic and anti-androgenic compounds (genistein, bisphenol A, methylparaben, linuron, prochloraz, propanil) showed commonalities and differences in regulated pathways, enabling us to classify the estrogenic and anti-androgenic potencies. This demonstrates that different mechanism of ED can be assessed already in fish embryos. Copyright © 2013 Elsevier Inc. All rights reserved.

  12. Transcriptome Dynamics during Maize Endosperm Development.

    Directory of Open Access Journals (Sweden)

    Jianzhou Qu

    Full Text Available The endosperm is a major organ of the seed that plays vital roles in determining seed weight and quality. However, genome-wide transcriptome patterns throughout maize endosperm development have not been comprehensively investigated to date. Accordingly, we performed a high-throughput RNA sequencing (RNA-seq analysis of the maize endosperm transcriptome at 5, 10, 15 and 20 days after pollination (DAP. We found that more than 11,000 protein-coding genes underwent alternative splicing (AS events during the four developmental stages studied. These genes were mainly involved in intracellular protein transport, signal transmission, cellular carbohydrate metabolism, cellular lipid metabolism, lipid biosynthesis, protein modification, histone modification, cellular amino acid metabolism, and DNA repair. Additionally, 7,633 genes, including 473 transcription factors (TFs, were differentially expressed among the four developmental stages. The differentially expressed TFs were from 50 families, including the bZIP, WRKY, GeBP and ARF families. Further analysis of the stage-specific TFs showed that binding, nucleus and ligand-dependent nuclear receptor activities might be important at 5 DAP, that immune responses, signalling, binding and lumen development are involved at 10 DAP, that protein metabolic processes and the cytoplasm might be important at 15 DAP, and that the responses to various stimuli are different at 20 DAP compared with the other developmental stages. This RNA-seq analysis provides novel, comprehensive insights into the transcriptome dynamics during early endosperm development in maize.

  13. The floral transcriptome of Eucalyptus grandis

    CSIR Research Space (South Africa)

    Vining, KJ

    2015-10-01

    Full Text Available As a step toward functional annotation of genes required for floral initiation and development within the Eucalyptus genome, we used short read sequencing to analyze transcriptomes of floral buds from early and late developmental stages...

  14. Researches on Transcriptome Sequencing in the Study of Traditional Chinese Medicine

    Science.gov (United States)

    Xin, Jie; Zhang, Rong-chao; Wang, Lei

    2017-01-01

    Due to its incomparable advantages, the application of transcriptome sequencing in the study of traditional Chinese medicine attracts more and more attention of researchers, which greatly promote the development of traditional Chinese medicine. In this paper, the applications of transcriptome sequencing in traditional Chinese medicine were summarized by reviewing recent related papers. PMID:28900463

  15. Taming Human Genetic Variability: Transcriptomic Meta-Analysis Guides the Experimental Design and Interpretation of iPSC-Based Disease Modeling

    Directory of Open Access Journals (Sweden)

    Pierre-Luc Germain

    2017-06-01

    Full Text Available Both the promises and pitfalls of the cell reprogramming research platform rest on human genetic variation, making the measurement of its impact one of the most urgent issues in the field. Harnessing large transcriptomics datasets of induced pluripotent stem cells (iPSC, we investigate the implications of this variability for iPSC-based disease modeling. In particular, we show that the widespread use of more than one clone per individual in combination with current analytical practices is detrimental to the robustness of the findings. We then proceed to identify methods to address this challenge and leverage multiple clones per individual. Finally, we evaluate the specificity and sensitivity of different sample sizes and experimental designs, presenting computational tools for power analysis. These findings and tools reframe the nature of replicates used in disease modeling and provide important resources for the design, analysis, and interpretation of iPSC-based studies.

  16. Tissue-Specific Transcriptomics of the Exotic Invasive Insect Pest Emerald Ash Borer (Agrilus planipennis)

    Science.gov (United States)

    Mittapalli, Omprakash; Bai, Xiaodong; Bonello, Pierluigi; Herms, Daniel A.

    2010-01-01

    Background The insect midgut and fat body represent major tissue interfaces that deal with several important physiological functions including digestion, detoxification and immune response. The emerald ash borer (Agrilus planipennis), is an exotic invasive insect pest that has killed millions of ash trees (Fraxinus spp.) primarily in the Midwestern United States and Ontario, Canada. However, despite its high impact status little knowledge exists for A. planipennis at the molecular level. Methodology and Principal Findings Newer-generation Roche-454 pyrosequencing was used to obtain 126,185 reads for the midgut and 240,848 reads for the fat body, which were assembled into 25,173 and 37,661 high quality expressed sequence tags (ESTs) for the midgut and the fat body of A. planipennis larvae, respectively. Among these ESTs, 36% of the midgut and 38% of the fat body sequences showed similarity to proteins in the GenBank nr database. A high number of the midgut sequences contained chitin-binding peritrophin (248)and trypsin (98) domains; while the fat body sequences showed high occurrence of cytochrome P450s (85) and protein kinase (123) domains. Further, the midgut transcriptome of A. planipennis revealed putative microbial transcripts encoding for cell-wall degrading enzymes such as polygalacturonases and endoglucanases. A significant number of SNPs (137 in midgut and 347 in fat body) and microsatellite loci (317 in midgut and 571 in fat body) were predicted in the A. planipennis transcripts. An initial assessment of cytochrome P450s belonging to various CYP clades revealed distinct expression patterns at the tissue level. Conclusions and Significance To our knowledge this study is one of the first to illuminate tissue-specific gene expression in an invasive insect of high ecological and economic consequence. These findings will lay the foundation for future gene expression and functional studies in A. planipennis. PMID:21060843

  17. ALTERATIONS IN THE DEVELOPING TESTIS TRANSCRIPTOME FOLLOWING EMBRYONIC VINCLOZOLIN EXPOSURE

    OpenAIRE

    Clement, Tracy M.; Savenkova, Marina I.; Settles, Matthew; Anway, Matthew D.; Skinner, Michael K.

    2010-01-01

    The current study investigates the direct effects of in utero vinclozolin exposure on the developing F1 generation rat testis transcriptome. Previous studies have demonstrated that exposure to vinclozolin during embryonic gonadal sex determination induces epigenetic modifications of the germ line and transgenerational adult onset disease states. Microarray analyses were performed to compare control and vinclozolin treated testis transcriptomes at embryonic day 13, 14 and 16. A total of 576 di...

  18. Transcriptome profiling of tobacco under water deficit conditions

    Directory of Open Access Journals (Sweden)

    Roel C. Rabara

    2015-09-01

    Full Text Available Drought is one of the limiting environmental factors that affect crop production. Understanding the molecular basis of how plants respond to this water deficit stress is key to developing drought tolerant crops. In this study we generated time course-based transcriptome profiles of tobacco plants under water deficit conditions using microarray technology. In this paper, we describe in detail the experimental procedures and analyses performed in our study. The data set we generated (available in the NCBI/GEO database under GSE67434 has been analysed to identify genes that are involved in the regulation of tobacco's responses to drought.

  19. The renal transcriptome in experimental hypertension

    NARCIS (Netherlands)

    Wesseling, S.

    2007-01-01

    The renal transcriptome in experimental hypertension The kidneys importantly determine blood pressure. Kidney dysfunction can result in hypertension, which in turn leads to renal damage. In primary hypertension the cause is unknown. The condition is polygenic, however, which genetic defects cause

  20. Brain transcriptome atlases : A computational perspective

    NARCIS (Netherlands)

    Mahfouz, A.M.E.T.A.; Huisman, S.M.H.; Lelieveldt, B.P.F.; Reinders, M.J.T.

    2017-01-01

    The immense complexity of the mammalian brain is largely reflected in the underlying molecular signatures of its billions of cells. Brain transcriptome atlases provide valuable insights into gene expression patterns across different brain areas throughout the course of development. Such atlases

  1. Comparison of the nodule vs. root transcriptome of the actinorhizal plant Datisca glomerata: actinorhizal nodules contain a specific class of defensins.

    Directory of Open Access Journals (Sweden)

    Irina V Demina

    Full Text Available Actinorhizal root nodule symbioses are very diverse, and the symbiosis of Datisca glomerata has previously been shown to have many unusual aspects. In order to gain molecular information on the infection mechanism, nodule development and nodule metabolism, we compared the transcriptomes of D. glomerata roots and nodules. Root and nodule libraries representing the 3'-ends of cDNAs were subjected to high-throughput parallel 454 sequencing. To identify the corresponding genes and to improve the assembly, Illumina sequencing of the nodule transcriptome was performed as well. The evaluation revealed 406 differentially regulated genes, 295 of which (72.7% could be assigned a function based on homology. Analysis of the nodule transcriptome showed that genes encoding components of the common symbiosis signaling pathway were present in nodules of D. glomerata, which in combination with the previously established function of SymRK in D. glomerata nodulation suggests that this pathway is also active in actinorhizal Cucurbitales. Furthermore, comparison of the D. glomerata nodule transcriptome with nodule transcriptomes from actinorhizal Fagales revealed a new subgroup of nodule-specific defensins that might play a role specific to actinorhizal symbioses. The D. glomerata members of this defensin subgroup contain an acidic C-terminal domain that was never found in plant defensins before.

  2. Amplicon-Based Pyrosequencing Reveals High Diversity of Protistan Parasites in Ships' Ballast Water: Implications for Biogeography and Infectious Diseases.

    Science.gov (United States)

    Pagenkopp Lohan, K M; Fleischer, R C; Carney, K J; Holzer, K K; Ruiz, G M

    2016-04-01

    Ships' ballast water (BW) commonly moves macroorganisms and microorganisms across the world's oceans and along coasts; however, the majority of these microbial transfers have gone undetected. We applied high-throughput sequencing methods to identify microbial eukaryotes, specifically emphasizing the protistan parasites, in ships' BW collected from vessels calling to the Chesapeake Bay (Virginia and Maryland, USA) from European and Eastern Canadian ports. We utilized tagged-amplicon 454 pyrosequencing with two general primer sets, amplifying either the V4 or V9 domain of the small subunit (SSU) of the ribosomal RNA (rRNA) gene complex, from total DNA extracted from water samples collected from the ballast tanks of bulk cargo vessels. We detected a diverse group of protistan taxa, with some known to contain important parasites in marine systems, including Apicomplexa (unidentified apicomplexans, unidentified gregarines, Cryptosporidium spp.), Dinophyta (Blastodinium spp., Euduboscquella sp., unidentified syndinids, Karlodinium spp., Syndinium spp.), Perkinsea (Parvilucifera sp.), Opisthokonta (Ichthyosporea sp., Pseudoperkinsidae, unidentified ichthyosporeans), and Stramenopiles (Labyrinthulomycetes). Further characterization of groups with parasitic taxa, consisting of phylogenetic analyses for four taxa (Cryptosporidium spp., Parvilucifera spp., Labyrinthulomycetes, and Ichthyosporea), revealed that sequences were obtained from both known and novel lineages. This study demonstrates that high-throughput sequencing is a viable and sensitive method for detecting parasitic protists when present and transported in the ballast water of ships. These data also underscore the potential importance of human-aided dispersal in the biogeography of these microbes and emerging diseases in the world's oceans.

  3. Analysis of a native whitefly transcriptome and its sequence divergence with two invasive whitefly species

    Directory of Open Access Journals (Sweden)

    Wang Xiao-Wei

    2012-10-01

    Full Text Available Abstract Background Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whiteflies species, Middle East Asia Minor 1 (MEAM1 and Mediterranean (MED, respectively. Results More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp and 16,596 sequences were annotated with an E-value above 10-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84% and much higher than that of MEAM1 and MED (0.83%. This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identify data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation. We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies which are helpful for

  4. Transcriptomic events involved in melon mature-fruit abscission comprise the sequential induction of cell-wall degrading genes coupled to a stimulation of endo and exocytosis.

    Directory of Open Access Journals (Sweden)

    Jorge Corbacho

    Full Text Available Mature-fruit abscission (MFA in fleshy-fruit is a genetically controlled process with mechanisms that, contrary to immature-fruit abscission, has not been fully characterized. Here, we use pyrosequencing to characterize the transcriptomes of melon abscission zone (AZ at three stages during AZ-cell separation in order to understand MFA control at an early stage of AZ-activation.The results show that by early induction of MFA, the melon AZ exhibits major gene induction, while by late induction of MFA, melon AZ shows major gene repression. Although some genes displayed similar regulation in both early and late induction of abscission, such as EXT1-EXT4, EGase1, IAA2, ERF1, AP2D15, FLC, MADS2, ERAF17, SAP5 and SCL13 genes, the majority had different expression patterns. This implies that time-specific events occur during MFA, and emphasizes the value of characterizing multiple time-specific abscission transcriptomes. Analysis of gene-expression from these AZs reveal that a sequential induction of cell-wall-degrading genes is associated with the upregulation of genes involved in endo and exocytosis, and a shift in plant-hormone metabolism and signaling genes during MFA. This is accompanied by transcriptional activity of small-GTPases and synthaxins together with tubulins, dynamins, V-type ATPases and kinesin-like proteins potentially involved in MFA signaling. Early events are potentially controlled by down-regulation of MADS-box, AP2/ERF and Aux/IAA transcription-factors, and up-regulation of homeobox, zinc finger, bZIP, and WRKY transcription-factors, while late events may be controlled by up-regulation of MYB transcription-factors.Overall, the data provide a comprehensive view on MFA in fleshy-fruit, identifying candidate genes and pathways associated with early induction of MFA. Our comprehensive gene-expression profile will be very useful for elucidating gene regulatory networks of the MFA in fleshy-fruit.

  5. Optimization of biostimulant for bioremediation of contaminated coastal sediment by response surface methodology (RSM) and evaluation of microbial diversity by pyrosequencing.

    Science.gov (United States)

    Subha, Bakthavachallam; Song, Young Chae; Woo, Jung Hui

    2015-09-15

    The present study aims to optimize the slow release biostimulant ball (BSB) for bioremediation of contaminated coastal sediment using response surface methodology (RSM). Different bacterial communities were evaluated using a pyrosequencing-based approach in contaminated coastal sediments. The effects of BSB size (1-5cm), distance (1-10cm) and time (1-4months) on changes in chemical oxygen demand (COD) and volatile solid (VS) reduction were determined. Maximum reductions of COD and VS, 89.7% and 78.8%, respectively, were observed at a 3cm ball size, 5.5cm distance and 4months; these values are the optimum conditions for effective treatment of contaminated coastal sediment. Most of the variance in COD and VS (0.9291 and 0.9369, respectively) was explained in our chosen models. BSB is a promising method for COD and VS reduction and enhancement of SRB diversity. Copyright © 2015 Elsevier Ltd. All rights reserved.

  6. The Transcriptome of Leishmania major Developmental Stages in Their Natural Sand Fly Vector.

    Science.gov (United States)

    Inbar, Ehud; Hughitt, V Keith; Dillon, Laura A L; Ghosh, Kashinath; El-Sayed, Najib M; Sacks, David L

    2017-04-04

    The life cycle of the Leishmania parasite in the sand fly vector involves differentiation into several distinctive forms, each thought to represent an adaptation to specific microenvironments in the midgut of the fly. Based on transcriptome sequencing (RNA-Seq) results, we describe the first high-resolution analysis of the transcriptome dynamics of four distinct stages of Leishmania major as they develop in a natural vector, Phlebotomus duboscqi The early transformation from tissue amastigotes to procyclic promastigotes in the blood-fed midgut was accompanied by the greatest number of differentially expressed genes, including the downregulation of amastins, and upregulation of multiple cell surface proteins, sugar and amino acid transporters, and genes related to glucose metabolism and cell cycle progression. The global changes accompanying post-blood meal differentiation of procyclic promastigotes to the nectomonad and metacyclic stages were less extensive, though each displayed a unique signature. The transcriptome of nectomonads, which has not been studied previously, revealed changes consistent with cell cycle arrest and the upregulation of genes associated with starvation and stress, including autophagic pathways of protein recycling. Maturation to the infective, metacyclic stage was accompanied by changes suggesting preadaptation to the intracellular environment of the mammalian host, demonstrated by the amastigote-like profiles of surface proteins and metabolism-related genes. Finally, a direct comparison between sand fly-derived and culture-derived metacyclics revealed a reassuring similarity between the two forms, with the in vivo forms distinguished mainly by a stronger upregulation of transcripts associated with nutrient stress. IMPORTANCE The life cycle of Leishmania parasites in the sand fly vector includes their growth and development as morphologically distinct forms of extracellular promastigotes found within the different microenvironments of the

  7. Cell type-specific responses to salinity - the epidermal bladder cell transcriptome of Mesembryanthemum crystallinum.

    Science.gov (United States)

    Oh, Dong-Ha; Barkla, Bronwyn J; Vera-Estrella, Rosario; Pantoja, Omar; Lee, Sang-Yeol; Bohnert, Hans J; Dassanayake, Maheshi

    2015-08-01

    Mesembryanthemum crystallinum (ice plant) exhibits extreme tolerance to salt. Epidermal bladder cells (EBCs), developing on the surface of aerial tissues and specialized in sodium sequestration and other protective functions, are critical for the plant's stress adaptation. We present the first transcriptome analysis of EBCs isolated from intact plants, to investigate cell type-specific responses during plant salt adaptation. We developed a de novo assembled, nonredundant EBC reference transcriptome. Using RNAseq, we compared the expression patterns of the EBC-specific transcriptome between control and salt-treated plants. The EBC reference transcriptome consists of 37 341 transcript-contigs, of which 7% showed significantly different expression between salt-treated and control samples. We identified significant changes in ion transport, metabolism related to energy generation and osmolyte accumulation, stress signalling, and organelle functions, as well as a number of lineage-specific genes of unknown function, in response to salt treatment. The salinity-induced EBC transcriptome includes active transcript clusters, refuting the view of EBCs as passive storage compartments in the whole-plant stress response. EBC transcriptomes, differing from those of whole plants or leaf tissue, exemplify the importance of cell type-specific resolution in understanding stress adaptive mechanisms. No claim to original US government works. New Phytologist © 2015 New Phytologist Trust.

  8. Transcriptome-Based Identification of the Desiccation Response Genes in Marine Red Algae Pyropia tenera (Rhodophyta) and Enhancement of Abiotic Stress Tolerance by PtDRG2 in Chlamydomonas.

    Science.gov (United States)

    Im, Sungoh; Lee, Ha-Nul; Jung, Hyun Shin; Yang, Sunghwan; Park, Eun-Jeong; Hwang, Mi Sook; Jeong, Won-Joong; Choi, Dong-Woog

    2017-06-01

    Pyropia tenera (Kjellman) are marine red algae that grow in the intertidal zone and lose more than 90% of water during hibernal low tides every day. In order to identify the desiccation response gene (DRG) in P. tenera, we generated 1,444,210 transcriptome sequences using the 454-FLX platform from the gametophyte under control and desiccation conditions. De novo assembly of the transcriptome reads generated 13,170 contigs, covering about 12 Mbp. We selected 1160 differentially expressed genes (DEGs) in response to desiccation stress based on reads per kilobase per million reads (RPKM) expression values. As shown in green higher plants, DEGs under desiccation are composed of two groups of genes for gene regulation networks and functional proteins for carbohydrate metabolism, membrane perturbation, compatible solutes, and specific proteins similar to higher plants. DEGs that show no significant homology with known sequences in public databases were selected as DRGs in P. tenera. PtDRG2 encodes a novel polypeptide of 159 amino acid residues locating chloroplast. When PtDRG2 was overexpressed in Chlamydomonas, the PtDRG2 confer mannitol and salt tolerance in transgenic cells. These results suggest that Pyropia may possess novel genes that differ from green plants, although the desiccation tolerance mechanism in red algae is similar to those of higher green plants. These transcriptome sequences will facilitate future studies to understand the common processes and novel mechanisms involved in desiccation stress tolerance in red algae.

  9. Transcriptome profiling of whole blood cells identifies PLEK2 and C1QB in human melanoma.

    Directory of Open Access Journals (Sweden)

    Yuchun Luo

    Full Text Available Developing analytical methodologies to identify biomarkers in easily accessible body fluids is highly valuable for the early diagnosis and management of cancer patients. Peripheral whole blood is a "nucleic acid-rich" and "inflammatory cell-rich" information reservoir and represents systemic processes altered by the presence of cancer cells.We conducted transcriptome profiling of whole blood cells from melanoma patients. To overcome challenges associated with blood-based transcriptome analysis, we used a PAXgene™ tube and NuGEN Ovation™ globin reduction system. The combined use of these systems in microarray resulted in the identification of 78 unique genes differentially expressed in the blood of melanoma patients. Of these, 68 genes were further analyzed by quantitative reverse transcriptase PCR using blood samples from 45 newly diagnosed melanoma patients (stage I to IV and 50 healthy control individuals. Thirty-nine genes were verified to be differentially expressed in blood samples from melanoma patients. A stepwise logit analysis selected eighteen 2-gene signatures that distinguish melanoma from healthy controls. Of these, a 2-gene signature consisting of PLEK2 and C1QB led to the best result that correctly classified 93.3% melanoma patients and 90% healthy controls. Both genes were upregulated in blood samples of melanoma patients from all stages. Further analysis using blood fractionation showed that CD45(- and CD45(+ populations were responsible for the altered expression levels of PLEK2 and C1QB, respectively.The current study provides the first analysis of whole blood-based transcriptome biomarkers for malignant melanoma. The expression of PLEK2, the strongest gene to classify melanoma patients, in CD45(- subsets illustrates the importance of analyzing whole blood cells for biomarker studies. The study suggests that transcriptome profiling of blood cells could be used for both early detection of melanoma and monitoring of patients

  10. Transcriptomic signatures shaped by cell proportions shed light on comparative developmental biology

    Czech Academy of Sciences Publication Activity Database

    Pantalacci, S.; Gueguen, L.; Petit, C.; Lambert, A.; Peterková, Renata; Sémon, E.

    2017-01-01

    Roč. 18, feb (2017), s. 29 ISSN 1474-760X R&D Projects: GA ČR(CZ) GB14-37368G Institutional support: RVO:68378041 Keywords : comparative transcriptomics * developmental biology * transcriptomic signature Subject RIV: EA - Cell Biology OBOR OECD: Developmental biology Impact factor: 11.908, year: 2016

  11. Deep sequencing-based transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus reveals insight into the immune-relevant genes in marine fish

    Directory of Open Access Journals (Sweden)

    Xiang Li-xin

    2010-08-01

    Full Text Available Abstract Background Systematic research on fish immunogenetics is indispensable in understanding the origin and evolution of immune systems. This has long been a challenging task because of the limited number of deep sequencing technologies and genome backgrounds of non-model fish available. The newly developed Solexa/Illumina RNA-seq and Digital gene expression (DGE are high-throughput sequencing approaches and are powerful tools for genomic studies at the transcriptome level. This study reports the transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus using RNA-seq and DGE in an attempt to gain insights into the immunogenetics of marine fish. Results RNA-seq analysis generated 169,950 non-redundant consensus sequences, among which 48,987 functional transcripts with complete or various length encoding regions were identified. More than 52% of these transcripts are possibly involved in approximately 219 known metabolic or signalling pathways, while 2,673 transcripts were associated with immune-relevant genes. In addition, approximately 8% of the transcripts appeared to be fish-specific genes that have never been described before. DGE analysis revealed that the host transcriptome profile of Vibrio harveyi-challenged L. japonicus is considerably altered, as indicated by the significant up- or down-regulation of 1,224 strong infection-responsive transcripts. Results indicated an overall conservation of the components and transcriptome alterations underlying innate and adaptive immunity in fish and other vertebrate models. Analysis suggested the acquisition of numerous fish-specific immune system components during early vertebrate evolution. Conclusion This study provided a global survey of host defence gene activities against bacterial challenge in a non-model marine fish. Results can contribute to the in-depth study of candidate genes in marine fish immunity, and help improve current understanding of host

  12. Spatial and Species Variations in Bacterial Communities Associated with Corals from the Red Sea as Revealed by Pyrosequencing

    KAUST Repository

    Lee, O. O.

    2012-08-03

    Microbial associations with corals are common and are most likely symbiotic, although their diversity and relationships with environmental factors and host species remain unclear. In this study, we adopted a 16S rRNA gene tag-pyrosequencing technique to investigate the bacterial communities associated with three stony Scleractinea and two soft Octocorallia corals from three locations in the Red Sea. Our results revealed highly diverse bacterial communities in the Red Sea corals, with more than 600 ribotypes detected and up to 1,000 species estimated from a single coral species. Altogether, 21 bacterial phyla were recovered from the corals, of which Gammaproteobacteria was the most dominant group, and Chloroflexi, Chlamydiae, and the candidate phylum WS3 were reported in corals for the first time. The associated bacterial communities varied greatly with location, where environmental conditions differed significantly. Corals from disturbed areas appeared to share more similar bacterial communities, but larger variations in community structures were observed between different coral species from pristine waters. Ordination methods identified salinity and depth as the most influential parameters affecting the abundance of Vibrio, Pseudoalteromonas, Serratia, Stenotrophomonas, Pseudomonas, and Achromobacter in the corals. On the other hand, bacteria such as Chloracidobacterium and Endozoicomonas were more sensitive to the coral species, suggesting that the host species type may be influential in the associated bacterial community, as well. The combined influences of the coral host and environmental factors on the associated microbial communities are discussed. This study represents the first comparative study using tag-pyrosequencing technology to investigate the bacterial communities in Red Sea corals.

  13. Flower bud transcriptome analysis of Sapium sebiferum (Linn. Roxb. and primary investigation of drought induced flowering: pathway construction and G-quadruplex prediction based on transcriptome.

    Directory of Open Access Journals (Sweden)

    Minglei Yang

    Full Text Available Sapium sebiferum (Linn. Roxb. (Chinese Tallow Tree is a perennial woody tree and its seeds are rich in oil which hold great potential for biodiesel production. Despite a traditional woody oil plant, our understanding on S. sebiferum genetics and molecular biology remains scant. In this study, the first comprehensive transcriptome of S. sebiferum flower has been generated by sequencing and de novo assembly. A total of 149,342 unigenes were generated from raw reads, of which 24,289 unigenes were successfully matched to public database. A total of 61 MADS box genes and putative pathways involved in S. sebiferum flower development have been identified. Abiotic stress response network was also constructed in this work, where 2,686 unigenes are involved in the pathway. As for lipid biosynthesis, 161 unigenes have been identified in fatty acid (FA and triacylglycerol (TAG biosynthesis. Besides, the G-Quadruplexes in RNA of S. sebiferum also have been predicted. An interesting finding is that the stress-induced flowering was observed in S. sebiferum for the first time. According to the results of semi-quantitative PCR, expression tendencies of flowering-related genes, GA1, AP2 and CRY2, accorded with stress-related genes, such as GRX50435 and PRXⅡ39562. This transcriptome provides functional genomic information for further research of S. sebiferum, especially for the genetic engineering to shorten the juvenile period and improve yield by regulating flower development. It also offers a useful database for the research of other Euphorbiaceae family plants.

  14. Haematobia irritans dataset of raw sequence reads from Illumina-based transcriptome sequencing of specific tissues and life stages

    Science.gov (United States)

    Illumina HiSeq technology was used to sequence the transcriptome from various dissected tissues and life stages from the horn fly, Haematobia irritans. These samples include eggs (0, 2, 4, and 9 hours post-oviposition), adult fly gut, adult fly legs, adult fly malpighian tubule, adult fly ovary, adu...

  15. A SAGE based approach to human glomerular endothelium : defining the transcriptome, finding a novel molecule and highlighting endothelial diversity

    NARCIS (Netherlands)

    Sengoelge, Guerkan; Winnicki, Wolfgang; Kupczok, Anne; von Haeseler, Arndt; Schuster, Michael; Pfaller, Walter; Jennings, Paul; Weltermann, Ansgar; Blake, Sophia; Sunder-Plassmann, Gere

    2014-01-01

    BACKGROUND: Large scale transcript analysis of human glomerular microvascular endothelial cells (HGMEC) has never been accomplished. We designed this study to define the transcriptome of HGMEC and facilitate a better characterization of these endothelial cells with unique features. Serial analysis

  16. 454-Pyrosequencing: A Molecular Battiscope for Freshwater Viral Ecology

    Directory of Open Access Journals (Sweden)

    David J. Rooks

    2010-07-01

    Full Text Available Viruses, the most abundant biological entities on the planet, are capable of infecting organisms from all three branches of life, although the majority infect bacteria where the greatest degree of cellular diversity lies. However, the characterization and assessment of viral diversity in natural environments is only beginning to become a possibility. Through the development of a novel technique for the harvest of viral DNA and the application of 454 pyrosequencing, a snapshot of the diversity of the DNA viruses harvested from a standing pond on a cattle farm has been obtained. A high abundance of viral genotypes (785 were present within the virome. The absolute numbers of lambdoid and Shiga toxin (Stx encoding phages detected suggested that the depth of sequencing had enabled recovery of only ca. 8% of the total virus population, numbers that agreed within less than an order of magnitude with predictions made by rarefaction analysis. The most abundant viral genotypes in the pond were bacteriophages (93.7%. The predominant viral genotypes infecting higher life forms found in association with the farm were pathogens that cause disease in cattle and humans, e.g. members of the Herpesviridae. The techniques and analysis described here provide a fresh approach to the monitoring of viral populations in the aquatic environment, with the potential to become integral to the development of risk analysis tools for monitoring the dissemination of viral agents of animal, plant and human diseases.

  17. Workflow and web application for annotating NCBI BioProject transcriptome data.

    Science.gov (United States)

    Vera Alvarez, Roberto; Medeiros Vidal, Newton; Garzón-Martínez, Gina A; Barrero, Luz S; Landsman, David; Mariño-Ramírez, Leonardo

    2017-01-01

    The volume of transcriptome data is growing exponentially due to rapid improvement of experimental technologies. In response, large central resources such as those of the National Center for Biotechnology Information (NCBI) are continually adapting their computational infrastructure to accommodate this large influx of data. New and specialized databases, such as Transcriptome Shotgun Assembly Sequence Database (TSA) and Sequence Read Archive (SRA), have been created to aid the development and expansion of centralized repositories. Although the central resource databases are under continual development, they do not include automatic pipelines to increase annotation of newly deposited data. Therefore, third-party applications are required to achieve that aim. Here, we present an automatic workflow and web application for the annotation of transcriptome data. The workflow creates secondary data such as sequencing reads and BLAST alignments, which are available through the web application. They are based on freely available bioinformatics tools and scripts developed in-house. The interactive web application provides a search engine and several browser utilities. Graphical views of transcript alignments are available through SeqViewer, an embedded tool developed by NCBI for viewing biological sequence data. The web application is tightly integrated with other NCBI web applications and tools to extend the functionality of data processing and interconnectivity. We present a case study for the species Physalis peruviana with data generated from BioProject ID 67621. URL: http://www.ncbi.nlm.nih.gov/projects/physalis/. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.

  18. Massively parallel amplicon sequencing reveals isotype-specific variability of antimicrobial peptide transcripts in Mytilus galloprovincialis.

    Directory of Open Access Journals (Sweden)

    Umberto Rosani

    Full Text Available BACKGROUND: Effective innate responses against potential pathogens are essential in the living world and possibly contributed to the evolutionary success of invertebrates. Taken together, antimicrobial peptide (AMP precursors of defensin, mytilin, myticin and mytimycin can represent about 40% of the hemocyte transcriptome in mussels injected with viral-like and bacterial preparations, and unique profiles of myticin C variants are expressed in single mussels. Based on amplicon pyrosequencing, we have ascertained and compared the natural and Vibrio-induced diversity of AMP transcripts in mussel hemocytes from three European regions. METHODOLOGY/PRINCIPAL FINDINGS: Hemolymph was collected from mussels farmed in the coastal regions of Palavas (France, Vigo (Spain and Venice (Italy. To represent the AMP families known in M. galloprovincialis, nine transcript sequences have been selected, amplified from hemocyte RNA and subjected to pyrosequencing. Hemolymph from farmed (offshore and wild (lagoon Venice mussels, both injected with 10(7 Vibrio cells, were similarly processed. Amplicon pyrosequencing emphasized the AMP transcript diversity, with Single Nucleotide Changes (SNC minimal for mytilin B/C and maximal for arthropod-like defensin and myticin C. Ratio of non-synonymous vs. synonymous changes also greatly differed between AMP isotypes. Overall, each amplicon revealed similar levels of nucleotidic variation across geographical regions, with two main sequence patterns confirmed for mytimycin and no substantial changes after immunostimulation. CONCLUSIONS/SIGNIFICANCE: Barcoding and bidirectional pyrosequencing allowed us to map and compare the transcript diversity of known mussel AMPs. Though most of the genuine cds variation was common to the analyzed samples we could estimate from 9 to 106 peptide variants in hemolymph pools representing 100 mussels, depending on the AMP isoform and sampling site. In this study, no prevailing SNC patterns related

  19. A whole-blood transcriptome meta-analysis identifies gene expression signatures of cigarette smoking

    NARCIS (Netherlands)

    Huan, T. (Tianxiao); R. Joehanes (Roby); C. Schurmann (Claudia); K. Schramm (Katharina); L.C. Pilling (Luke); M.J. Peters (Marjolein); R. Mägi (Reedik); D.L. Demeo (Dawn L.); G.T. O'Connor (George); L. Ferrucci (Luigi); A. Teumer (Alexander); G. Homuth (Georg); R. Biffar (Reiner); U. Völker (Uwe); C. Herder (Christian); M. Waldenberger (Melanie); A. Peters (Annette); S. Zeilinger (Sonja); A. Metspalu (Andres); A. Hofman (Albert); A.G. Uitterlinden (André); D.G. Hernandez (Dena); A. Singleton (Andrew); S. Bandinelli (Stefania); P.J. Munson (Peter); H. Lin (Honghuang); E.J. Benjamin (Emelia); T. Esko (Tõnu); H.J. Grabe (Hans Jörgen); H. Prokisch (Holger); J.B.J. van Meurs (Joyce); D. Melzer (David); D. Levy (Daniel)

    2016-01-01

    textabstractCigarette smoking is a leading modifiable cause of death worldwide. We hypothesized that cigarette smoking induces extensive transcriptomic changes that lead to target-organ damage and smoking-related diseases. We performed a metaanalysis of transcriptome-wide gene expression using whole

  20. A comparison of the Giardia lamblia trophozoite and cyst transcriptome using microarrays

    Directory of Open Access Journals (Sweden)

    Widmer Giovanni

    2011-05-01

    Full Text Available Abstract Background Compared with many protists, Giardia lamblia has a simple life cycle alternating between cyst and trophozoite. Most research on the molecular biology of Giardia parasites has focused on trophozoites and the processes of excystation and encystation, whereas cysts have attracted less interest. The striking morphological differences between the dormant cyst and the rapidly dividing and motile trophozoite implies profound changes in the metabolism as the parasite encysts in the host's intestine and excysts upon ingestion by a new host. Results To investigate the magnitude of the transcriptional changes occurring during the G. lamblia life cycle we compared the transcriptome of G. lamblia trophozoites and cysts using single-color oligonucleotide microarrays. Cysts were found to possess a much smaller transcriptome, both in terms of mRNA diversity and abundance. Genes encoding proteins related to ribosomal functions are highly over-represented. The comparison of the transcriptome of cysts generated in culture or extracted from feces revealed little overlap, raising the possibility of significant biological differences between the two types of cysts. Conclusions The comparison of the G. lamblia cyst and trophozoite transcriptome showed that transcripts of most genes are present at a lower level in cysts. This global view of the cyst and trophozoite transcriptome complements studies focused on the expression of selected genes during trophozoite multiplication, encystation and excystation.

  1. Improving amphibian genomic resources: a multitissue reference transcriptome of an iconic invader.

    Science.gov (United States)

    Richardson, Mark F; Sequeira, Fernando; Selechnik, Daniel; Carneiro, Miguel; Vallinoto, Marcelo; Reid, Jack G; West, Andrea J; Crossland, Michael R; Shine, Richard; Rollins, Lee A

    2018-01-01

    Cane toads (Rhinella marina) are an iconic invasive species introduced to 4 continents and well utilized for studies of rapid evolution in introduced environments. Despite the long introduction history of this species, its profound ecological impacts, and its utility for demonstrating evolutionary principles, genetic information is sparse. Here we produce a de novo transcriptome spanning multiple tissues and life stages to enable investigation of the genetic basis of previously identified rapid phenotypic change over the introduced range. Using approximately 1.9 billion reads from developing tadpoles and 6 adult tissue-specific cDNA libraries, as well as a transcriptome assembly pipeline encompassing 100 separate de novo assemblies, we constructed 62 202 transcripts, of which we functionally annotated ∼50%. Our transcriptome assembly exhibits 90% full-length completeness of the Benchmarking Universal Single-Copy Orthologs data set. Robust assembly metrics and comparisons with several available anuran transcriptomes and genomes indicate that our cane toad assembly is one of the most complete anuran genomic resources available. This comprehensive anuran transcriptome will provide a valuable resource for investigation of genes under selection during invasion in cane toads, but will also greatly expand our general knowledge of anuran genomes, which are underrepresented in the literature. The data set is publically available in NCBI and GigaDB to serve as a resource for other researchers. © The Authors 2017. Published by Oxford University Press.

  2. Transcriptome complexity in cardiac development and diseases--an expanding universe between genome and phenome.

    Science.gov (United States)

    Gao, Chen; Wang, Yibin

    2014-01-01

    With the advancement of transcriptome profiling by micro-arrays and high-throughput RNA-sequencing, transcriptome complexity and its dynamics are revealed at different levels in cardiovascular development and diseases. In this review, we will highlight the recent progress in our knowledge of cardiovascular transcriptome complexity contributed by RNA splicing, RNA editing and noncoding RNAs. The emerging importance of many of these previously under-explored aspects of gene regulation in cardiovascular development and pathology will be discussed.

  3. Deep Sequencing Reveals Uncharted Isoform Heterogeneity of the Protein-Coding Transcriptome in Cerebral Ischemia.

    Science.gov (United States)

    Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh

    2018-06-03

    Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.

  4. An Insight into the Transcriptome of the Digestive Tract of the Bloodsucking Bug, Rhodnius prolixus

    Science.gov (United States)

    Ribeiro, José M. C.; Genta, Fernando A.; Sorgine, Marcos H. F.; Logullo, Raquel; Mesquita, Rafael D.; Paiva-Silva, Gabriela O.; Majerowicz, David; Medeiros, Marcelo; Koerich, Leonardo; Terra, Walter R.; Ferreira, Clélia; Pimentel, André C.; Bisch, Paulo M.; Leite, Daniel C.; Diniz, Michelle M. P.; Junior, João Lídio da S. G. V.; Da Silva, Manuela L.; Araujo, Ricardo N.; Gandara, Ana Caroline P.; Brosson, Sébastien; Salmon, Didier; Bousbata, Sabrina; González-Caballero, Natalia; Silber, Ariel Mariano; Alves-Bezerra, Michele; Gondim, Katia C.; Silva-Neto, Mário Alberto C.; Atella, Georgia C.; Araujo, Helena; Dias, Felipe A.; Polycarpo, Carla; Vionette-Amaral, Raquel J.; Fampa, Patrícia; Melo, Ana Claudia A.; Tanaka, Aparecida S.; Balczun, Carsten; Oliveira, José Henrique M.; Gonçalves, Renata L. S.; Lazoski, Cristiano; Rivera-Pomar, Rolando; Diambra, Luis; Schaub, Günter A.; Garcia, Elói S.; Azambuja, Patrícia; Braz, Glória R. C.; Oliveira, Pedro L.

    2014-01-01

    The bloodsucking hemipteran Rhodnius prolixus is a vector of Chagas' disease, which affects 7–8 million people today in Latin America. In contrast to other hematophagous insects, the triatomine gut is compartmentalized into three segments that perform different functions during blood digestion. Here we report analysis of transcriptomes for each of the segments using pyrosequencing technology. Comparison of transcript frequency in digestive libraries with a whole-body library was used to evaluate expression levels. All classes of digestive enzymes were highly expressed, with a predominance of cysteine and aspartic proteinases, the latter showing a significant expansion through gene duplication. Although no protein digestion is known to occur in the anterior midgut (AM), protease transcripts were found, suggesting secretion as pro-enzymes, being possibly activated in the posterior midgut (PM). As expected, genes related to cytoskeleton, protein synthesis apparatus, protein traffic, and secretion were abundantly transcribed. Despite the absence of a chitinous peritrophic membrane in hemipterans - which have instead a lipidic perimicrovillar membrane lining over midgut epithelia - several gut-specific peritrophin transcripts were found, suggesting that these proteins perform functions other than being a structural component of the peritrophic membrane. Among immunity-related transcripts, while lysozymes and lectins were the most highly expressed, several genes belonging to the Toll pathway - found at low levels in the gut of most insects - were identified, contrasting with a low abundance of transcripts from IMD and STAT pathways. Analysis of transcripts related to lipid metabolism indicates that lipids play multiple roles, being a major energy source, a substrate for perimicrovillar membrane formation, and a source for hydrocarbons possibly to produce the wax layer of the hindgut. Transcripts related to amino acid metabolism showed an unanticipated priority for

  5. An insight into the transcriptome of the digestive tract of the bloodsucking bug, Rhodnius prolixus.

    Directory of Open Access Journals (Sweden)

    José M C Ribeiro

    Full Text Available The bloodsucking hemipteran Rhodnius prolixus is a vector of Chagas' disease, which affects 7-8 million people today in Latin America. In contrast to other hematophagous insects, the triatomine gut is compartmentalized into three segments that perform different functions during blood digestion. Here we report analysis of transcriptomes for each of the segments using pyrosequencing technology. Comparison of transcript frequency in digestive libraries with a whole-body library was used to evaluate expression levels. All classes of digestive enzymes were highly expressed, with a predominance of cysteine and aspartic proteinases, the latter showing a significant expansion through gene duplication. Although no protein digestion is known to occur in the anterior midgut (AM, protease transcripts were found, suggesting secretion as pro-enzymes, being possibly activated in the posterior midgut (PM. As expected, genes related to cytoskeleton, protein synthesis apparatus, protein traffic, and secretion were abundantly transcribed. Despite the absence of a chitinous peritrophic membrane in hemipterans - which have instead a lipidic perimicrovillar membrane lining over midgut epithelia - several gut-specific peritrophin transcripts were found, suggesting that these proteins perform functions other than being a structural component of the peritrophic membrane. Among immunity-related transcripts, while lysozymes and lectins were the most highly expressed, several genes belonging to the Toll pathway - found at low levels in the gut of most insects - were identified, contrasting with a low abundance of transcripts from IMD and STAT pathways. Analysis of transcripts related to lipid metabolism indicates that lipids play multiple roles, being a major energy source, a substrate for perimicrovillar membrane formation, and a source for hydrocarbons possibly to produce the wax layer of the hindgut. Transcripts related to amino acid metabolism showed an unanticipated

  6. Comparative de novo transcriptome analysis of male and female Sea buckthorn.

    Science.gov (United States)

    Bansal, Ankush; Salaria, Mehul; Sharma, Tashil; Stobdan, Tsering; Kant, Anil

    2018-02-01

    Sea buckthorn is a dioecious medicinal plant found at high altitude. The plant has both male and female reproductive organs in separate individuals. In this article, whole transcriptome de novo assemblies of male and female flower bud samples were carried out using Illumina NextSeq 500 platform to determine the role of the genes involved in sex determination. Moreover, genes with differential expression in male and female transcriptomes were identified to understand the underlying sex determination mechanism. The current study showed 63,904 and 62,272 coding sequences (CDS) in female and male transcriptome data sets, respectively. 16,831 common CDS were screened out from both transcriptomes, out of which 625 were upregulated and 491 were found to be downregulated. To understand the potential regulatory roles of differentially expressed genes in metabolic networks and biosynthetic pathways: KEGG mapping, gene ontology, and co-expression network analysis were performed. Comparison with Flowering Interactive Database (FLOR-ID) resulted in eight differentially expressed genes viz. CHD3-type chromatin-remodeling factor PICKLE ( PKL ), phytochrome-associated serine/threonine-protein phosphatase ( FYPP ), protein TOPLESS ( TPL ), sensitive to freezing 6 ( SFR6 ), lysine-specific histone demethylase 1 homolog 1 ( LDL1 ), pre-mRNA-processing-splicing factor 8A ( PRP8A ), sucrose synthase 4 ( SUS4 ), ubiquitin carboxyl-terminal hydrolase 12 ( UBP12 ), known to be broadly involved in flowering, photoperiodism, embryo development, and cold response pathways. Male and female flower bud transcriptome data of Sea buckthorn may provide comprehensive information at genomic level for the identification of genetic regulation involved in sex determination.

  7. Investigating the Correspondence Between Transcriptomic and Proteomic Expression Profiles Using Coupled Cluster Models

    International Nuclear Information System (INIS)

    Rogers, Simon; Girolami, Mark; Kolch, Walter; Waters, Katrina M.; Liu, Tao; Thrall, Brian D.; Wiley, H. S.

    2008-01-01

    Modern transcriptomics and proteomics enable us to survey the expression of RNAs and proteins at large scales. While these data are usually generated and analyzed separately, there is an increasing interest in comparing and co-analyzing transcriptome and proteome expression data. A major open question is whether transcriptome and proteome expression is linked and how it is coordinated. Results: Here we have developed a probabilistic clustering model that permits analysis of the links between transcriptomic and proteomic profiles in a sensible and flexible manner. Our coupled mixture model defines a prior probability distribution over the component to which a protein profile should be assigned conditioned on which component the associated mRNA profile belongs to. By providing probabilistic assignments this approach sits between the two extremes of concatenating the data on the assumption that mRNA and protein clusters would have a one-to-one relationship, and independent clustering where the mRNA profile provides no information on the protein profile and vice-versa. We apply this approach to a large dataset of quantitative transcriptomic and proteomic expression data obtained from a human breast epithelial cell line (HMEC) stimulated by epidermal growth factor (EGF) over a series of timepoints corresponding to one cell cycle. The results reveal a complex relationship between transcriptome and proteome with most mRNA clusters linked to at least two protein clusters, and vice versa. A more detailed analysis incorporating information on gene function from the gene ontology database shows that a high correlation of mRNA and protein expression is limited to the components of some molecular machines, such as the ribosome, cell adhesion complexes and the TCP-1 chaperonin involved in protein folding. Conclusions: The dynamic regulation of the transcriptome and proteome in mammalian cells in response to an acute mitogenic stimulus appears largely independent with very little

  8. Deep sequencing-based transcriptome analysis of Plutella xylostella larvae parasitized by Diadegma semiclausum

    Science.gov (United States)

    2011-01-01

    Background Parasitoid insects manipulate their hosts' physiology by injecting various factors into their host upon parasitization. Transcriptomic approaches provide a powerful approach to study insect host-parasitoid interactions at the molecular level. In order to investigate the effects of parasitization by an ichneumonid wasp (Diadegma semiclausum) on the host (Plutella xylostella), the larval transcriptome profile was analyzed using a short-read deep sequencing method (Illumina). Symbiotic polydnaviruses (PDVs) associated with ichneumonid parasitoids, known as ichnoviruses, play significant roles in host immune suppression and developmental regulation. In the current study, D. semiclausum ichnovirus (DsIV) genes expressed in P. xylostella were identified and their sequences compared with other reported PDVs. Five of these genes encode proteins of unknown identity, that have not previously been reported. Results De novo assembly of cDNA sequence data generated 172,660 contigs between 100 and 10000 bp in length; with 35% of > 200 bp in length. Parasitization had significant impacts on expression levels of 928 identified insect host transcripts. Gene ontology data illustrated that the majority of the differentially expressed genes are involved in binding, catalytic activity, and metabolic and cellular processes. In addition, the results show that transcription levels of antimicrobial peptides, such as gloverin, cecropin E and lysozyme, were up-regulated after parasitism. Expression of ichnovirus genes were detected in parasitized larvae with 19 unique sequences identified from five PDV gene families including vankyrin, viral innexin, repeat elements, a cysteine-rich motif, and polar residue rich protein. Vankyrin 1 and repeat element 1 genes showed the highest transcription levels among the DsIV genes. Conclusion This study provides detailed information on differential expression of P. xylostella larval genes following parasitization, DsIV genes expressed in the

  9. The use of genus-specific amplicon pyrosequencing to assess phytophthora species diversity using eDNA from soil and water in Northern Spain.

    Science.gov (United States)

    Català, Santiago; Pérez-Sierra, Ana; Abad-Campos, Paloma

    2015-01-01

    Phytophthora is one of the most important and aggressive plant pathogenic genera in agriculture and forestry. Early detection and identification of its pathways of infection and spread are of high importance to minimize the threat they pose to natural ecosystems. eDNA was extracted from soil and water from forests and plantations in the north of Spain. Phytophthora-specific primers were adapted for use in high-throughput Sequencing (HTS). Primers were tested in a control reaction containing eight Phytophthora species and applied to water and soil eDNA samples from northern Spain. Different score coverage threshold values were tested for optimal Phytophthora species separation in a custom-curated database and in the control reaction. Clustering at 99% was the optimal criteria to separate most of the Phytophthora species. Multiple Molecular Operational Taxonomic Units (MOTUs) corresponding to 36 distinct Phytophthora species were amplified in the environmental samples. Pyrosequencing of amplicons from soil samples revealed low Phytophthora diversity (13 species) in comparison with the 35 species detected in water samples. Thirteen of the MOTUs detected in rivers and streams showed no close match to sequences in international sequence databases, revealing that eDNA pyrosequencing is a useful strategy to assess Phytophthora species diversity in natural ecosystems.

  10. Transcriptome data on maternal RNA of 24 individual zebrafish eggs from five sibling mothers

    Directory of Open Access Journals (Sweden)

    Johanna F.B. Pagano

    2016-09-01

    Full Text Available Maternal mRNA that is present in the mature oocyte plays an important role in the proper development of the early embryo. To elucidate the role of the maternal transcriptome we recently reported a microarray study on individual zebrafish eggs from five different clutches from sibling mothers and showed differences in maternal RNA abundance between and within clutches, “Mother-specific signature in the maternal transcriptome composition of mature, unfertilized Eggs” [1]. Here we provide in detail the applied preprocessing method as well as the R-code to identify expressed and non-expressed genes in the associated transcriptome dataset. Additionally, we provide a website that allows a researcher to search for the expression of their gene of interest in this experiment. Keywords: Zebrafish, Danio rerio, Egg transcriptome, Single egg

  11. Human Transcriptome and Chromatin Modifications: An ENCODE Perspective

    Directory of Open Access Journals (Sweden)

    Li Shen

    2013-06-01

    Full Text Available A decade-long project, led by several international research groups, called the Encyclopedia of DNA Elements (ENCODE, recently released an unprecedented amount of data. The ambitious project covers transcriptome, cistrome, epigenome, and interactome data from more than 1,600 sets of experiments in human. To make use of this valuable resource, it is important to understand the information it represents and the techniques that were used to generate these data. In this review, we introduce the data that ENCODE generated, summarize the observations from the data analysis, and revisit a computational approach that ENCODE used to predict gene expression, with a focus on the human transcriptome and its association with chromatin modifications.

  12. Unique transcriptomic response to sepsis is observed among patients of different age groups.

    Science.gov (United States)

    Raymond, Steven L; López, María Cecilia; Baker, Henry V; Larson, Shawn D; Efron, Philip A; Sweeney, Timothy E; Khatri, Purvesh; Moldawer, Lyle L; Wynn, James L

    2017-01-01

    Sepsis is a major cause of morbidity and mortality, especially at the extremes of age. To understand the human age-specific transcriptomic response to sepsis, a multi-cohort, pooled analysis was conducted on adults, children, infants, and neonates with and without sepsis. Nine public whole-blood gene expression datasets (636 patients) were employed. Age impacted the transcriptomic host response to sepsis. Gene expression from septic neonates and adults was more dissimilar whereas infants and children were more similar. Neonates showed reductions in inflammatory recognition and signaling pathways compared to all other age groups. Likewise, adults demonstrated decreased pathogen sensing, inflammation, and myeloid cell function, as compared to children. This may help to explain the increased incidence of sepsis-related organ failure and death in adults. The number of dysregulated genes in septic patients was proportional to age and significantly differed among septic adults, children, infants, and neonates. Overall, children manifested a greater transcriptomic intensity to sepsis as compared to the other age groups. The transcriptomic magnitude for adults and neonates was dramatically reduced as compared to children and infants. These findings suggest that the transcriptomic response to sepsis is age-dependent, and diagnostic and therapeutic efforts to identify and treat sepsis will have to consider age as an important variable.

  13. Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing.

    Science.gov (United States)

    Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li

    2010-08-01

    Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resolution and sensitivity afforded by RNA-Seq, we were able to identify a substantial number of novel transcripts, new exons, untranslated regions, alternative upstream initiation codons and upstream open reading frames, which provide remarkable insight into the A. oryzae transcriptome. We were also able to assess the alternative mRNA isoforms in A. oryzae and found a large number of genes undergoing alternative splicing. Many genes and pathways that might be involved in higher levels of protein production in solid-state culture than in liquid culture were identified by comparing gene expression levels between different cultures. Our analysis indicated that the transcriptome of A. oryzae is much more complex than previously anticipated, and these results may provide a blueprint for further study of the A. oryzae transcriptome.

  14. Exploring triacylglycerol biosynthetic pathway in developing seeds of Chia (Salvia hispanica L.: a transcriptomic approach.

    Directory of Open Access Journals (Sweden)

    Sreedhar R V

    Full Text Available Chia (Salvia hispanica L., a member of the mint family (Lamiaceae, is a rediscovered crop with great importance in health and nutrition and is also the highest known terrestrial plant source of heart-healthy omega-3 fatty acid, alpha linolenic acid (ALA. At present, there is no public genomic information or database available for this crop, hindering research on its genetic improvement through genomics-assisted breeding programs. The first comprehensive analysis of the global transcriptome profile of developing Salvia hispanica L. seeds, with special reference to lipid biosynthesis is presented in this study. RNA from five different stages of seed development was extracted and sequenced separately using the Illumina GAIIx platform. De novo assembly of processed reads in the pooled transcriptome using Trinity yielded 76,014 transcripts. The total transcript length was 66,944,462 bases (66.9 Mb, with an average length of approximately 880 bases. In the molecular functions category of Gene Ontology (GO terms, ATP binding and nucleotide binding were found to be the most abundant and in the biological processes category, the metabolic process and the regulation of transcription-DNA-dependent and oxidation-reduction process were abundant. From the EuKaryotic Orthologous Groups of proteins (KOG classification, the major category was "Metabolism" (31.97%, of which the most prominent class was 'carbohydrate metabolism and transport' (5.81% of total KOG classifications followed by 'secondary metabolite biosynthesis transport and catabolism' (5.34% and 'lipid metabolism' (4.57%. A majority of the candidate genes involved in lipid biosynthesis and oil accumulation were identified. Furthermore, 5596 simple sequence repeats (SSRs were identified. The transcriptome data was further validated through confirmative PCR and qRT-PCR for select lipid genes. Our study provides insight into the complex transcriptome and will contribute to further genome-wide research

  15. Shedding light on the microbial community of the macropod foregut using 454-amplicon pyrosequencing.

    Directory of Open Access Journals (Sweden)

    Lisa-Maree Gulino

    Full Text Available Twenty macropods from five locations in Queensland, Australia, grazing on a variety of native pastures were surveyed and the bacterial community of the foregut was examined using 454-amplicon pyrosequencing. Specifically, the V3/V4 region of 16S rRNA gene was examined. A total of 5040 OTUs were identified in the data set (post filtering. Thirty-two OTUs were identified as 'shared' OTUS (i.e. present in all samples belonging to either Firmicutes or Bacteroidetes (Clostridiales/Bacteroidales. These phyla predominated the general microbial community in all macropods. Genera represented within the shared OTUs included: unclassified Ruminococcaceae, unclassified Lachnospiraceae, unclassified Clostridiales, Peptococcus sp. Coprococcus spp., Streptococcus spp., Blautia sp., Ruminoccocus sp., Eubacterium sp., Dorea sp., Oscillospira sp. and Butyrivibrio sp. The composition of the bacterial community of the foregut samples of each the host species (Macropus rufus, Macropus giganteus and Macropus robustus was significantly different allowing differentiation between the host species based on alpha and beta diversity measures. Specifically, eleven dominant OTUs that separated the three host species were identified and classified as: unclassified Ruminococcaceae, unclassified Bacteroidales, Prevotella spp. and a Syntrophococcus sucromutans. Putative reductive acetogens and fibrolytic bacteria were also identified in samples. Future work will investigate the presence and role of fibrolytics and acetogens in these ecosystems. Ideally, the isolation and characterization of these organisms will be used for enhanced feed efficiency in cattle, methane mitigation and potentially for other industries such as the biofuel industry.

  16. Shedding light on the microbial community of the macropod foregut using 454-amplicon pyrosequencing.

    Science.gov (United States)

    Gulino, Lisa-Maree; Ouwerkerk, Diane; Kang, Alicia Y H; Maguire, Anita J; Kienzle, Marco; Klieve, Athol V

    2013-01-01

    Twenty macropods from five locations in Queensland, Australia, grazing on a variety of native pastures were surveyed and the bacterial community of the foregut was examined using 454-amplicon pyrosequencing. Specifically, the V3/V4 region of 16S rRNA gene was examined. A total of 5040 OTUs were identified in the data set (post filtering). Thirty-two OTUs were identified as 'shared' OTUS (i.e. present in all samples) belonging to either Firmicutes or Bacteroidetes (Clostridiales/Bacteroidales). These phyla predominated the general microbial community in all macropods. Genera represented within the shared OTUs included: unclassified Ruminococcaceae, unclassified Lachnospiraceae, unclassified Clostridiales, Peptococcus sp. Coprococcus spp., Streptococcus spp., Blautia sp., Ruminoccocus sp., Eubacterium sp., Dorea sp., Oscillospira sp. and Butyrivibrio sp. The composition of the bacterial community of the foregut samples of each the host species (Macropus rufus, Macropus giganteus and Macropus robustus) was significantly different allowing differentiation between the host species based on alpha and beta diversity measures. Specifically, eleven dominant OTUs that separated the three host species were identified and classified as: unclassified Ruminococcaceae, unclassified Bacteroidales, Prevotella spp. and a Syntrophococcus sucromutans. Putative reductive acetogens and fibrolytic bacteria were also identified in samples. Future work will investigate the presence and role of fibrolytics and acetogens in these ecosystems. Ideally, the isolation and characterization of these organisms will be used for enhanced feed efficiency in cattle, methane mitigation and potentially for other industries such as the biofuel industry.

  17. Deep sequencing-based transcriptome analysis of chicken spleen in response to avian pathogenic Escherichia coli (APEC infection.

    Directory of Open Access Journals (Sweden)

    Qinghua Nie

    Full Text Available Avian pathogenic Escherichia coli (APEC leads to economic losses in poultry production and is also a threat to human health. The goal of this study was to characterize the chicken spleen transcriptome and to identify candidate genes for response and resistance to APEC infection using Solexa sequencing. We obtained 14422935, 14104324, and 14954692 Solexa read pairs for non-challenged (NC, challenged-mild pathology (MD, and challenged-severe pathology (SV, respectively. A total of 148197 contigs and 98461 unigenes were assembled, of which 134949 contigs and 91890 unigenes match the chicken genome. In total, 12272 annotated unigenes take part in biological processes (11664, cellular components (11927, and molecular functions (11963. Summing three specific contrasts, 13650 significantly differentially expressed unigenes were found in NC Vs. MD (6844, NC Vs. SV (7764, and MD Vs. SV (2320. Some unigenes (e.g. CD148, CD45 and LCK were involved in crucial pathways, such as the T cell receptor (TCR signaling pathway and microbial metabolism in diverse environments. This study facilitates understanding of the genetic architecture of the chicken spleen transcriptome, and has identified candidate genes for host response to APEC infection.

  18. CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics

    OpenAIRE

    Verma, Mohit; Kumar, Vinay; Patel, Ravi K.; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Chickpea is an important grain legume used as a rich source of protein in human diet. The narrow genetic diversity and limited availability of genomic resources are the major constraints in implementing breeding strategies and biotechnological interventions for genetic enhancement of chickpea. We developed an integrated Chickpea Transcriptome Database (CTDB), which provides the comprehensive web interface for visualization and easy retrieval of transcriptome data in chickpea. The database fea...

  19. De novo transcriptome assembly of mangosteen (Garcinia mangostana L. fruit

    Directory of Open Access Journals (Sweden)

    Deden Derajat Matra

    2016-12-01

    Full Text Available Garcinia mangostana L. (Mangosteen, of the family Clusiaceae, is one of the economically important tropical fruits in Indonesia. In the present study, we performed de novo transcriptomic analysis of Garcinia mangostana L. through RNA-Seq technology. We obtained the raw data from 12 libraries through Ion Proton System. Clean reads of 191,735,809 were obtained from 307,634,890 raw reads. The raw data obtained in this study can be accessible in DDBJ database with accession number of DRA005014 with bioproject accession number of PRJDB5091. We obtained 268,851 transcripts as well as 155,850 unigenes, having N50 value of 555 and 433 bp, respectively. Transcript/unigene length ranged from 201 to 5916 bp. The unigenes were annotated with two main databases from NCBI and UniProtKB, respectively having annotated-sequences of 73,287 and 73,107, respectively. These transcriptomic data will be beneficial for studying transcriptome of Garcinia mangostana L.

  20. Coevolutionary genetic variation in the legume-rhizobium transcriptome.

    Science.gov (United States)

    Heath, Katy D; Burke, Patricia V; Stinchcombe, John R

    2012-10-01

    Coevolutionary change requires reciprocal selection between interacting species, where the partner genotypes that are favoured in one species depend on the genetic composition of the interacting species. Coevolutionary genetic variation is manifested as genotype × genotype (G × G) interactions for fitness in interspecific interactions. Although quantitative genetic approaches have revealed abundant evidence for G × G interactions in symbioses, the molecular basis of this variation remains unclear. Here we study the molecular basis of G × G interactions in a model legume-rhizobium mutualism using gene expression microarrays. We find that, like quantitative traits such as fitness, variation in the symbiotic transcriptome may be partitioned into additive and interactive genetic components. Our results suggest that plant genetic variation had the largest influence on nodule gene expression and that plant genotype and the plant genotype × rhizobium genotype interaction determine global shifts in rhizobium gene expression that in turn feedback to influence plant fitness benefits. Moreover, the transcriptomic variation we uncover implicates regulatory changes in both species as drivers of symbiotic gene expression variation. Our study is the first to partition genetic variation in a symbiotic transcriptome and illuminates potential molecular routes of coevolutionary change. © 2012 Blackwell Publishing Ltd.

  1. Effects of the total replacement of fish-based diet with plant-based diet on the hepatic transcriptome of two European sea bass (Dicentrarchus labrax half-sibfamilies showing different growth rates with the plant-based diet

    Directory of Open Access Journals (Sweden)

    Geay Florian

    2011-10-01

    Full Text Available Abstract Background Efforts towards utilisation of diets without fish meal (FM or fish oil (FO in finfish aquaculture have been being made for more than two decades. Metabolic responses to substitution of fishery products have been shown to impact growth performance and immune system of fish as well as their subsequent nutritional value, particularly in marine fish species, which exhibit low capacity for biosynthesis of long-chain poly-unsaturated fatty acids (LC-PUFA. The main objective of the present study was to analyse the effects of a plant-based diet on the hepatic transcriptome of European sea bass (Dicentrarchus labrax. Results We report the first results obtained using a transcriptomic approach on the liver of two half-sibfamilies of the European sea bass that exhibit similar growth rates when fed a fish-based diet (FD, but significantly different growth rates when fed an all-plant diet (VD. Overall gene expression was analysed using oligo DNA microarrays (GPL9663. Statistical analysis identified 582 unique annotated genes differentially expressed between groups of fish fed the two diets, 199 genes regulated by genetic factors, and 72 genes that exhibited diet-family interactions. The expression of several genes involved in the LC-PUFA and cholesterol biosynthetic pathways was found to be up-regulated in fish fed VD, suggesting a stimulation of the lipogenic pathways. No significant diet-family interaction for the regulation of LC-PUFA biosynthesis pathways could be detected by microarray analysis. This result was in agreement with LC-PUFA profiles, which were found to be similar in the flesh of the two half-sibfamilies. In addition, the combination of our transcriptomic data with an analysis of plasmatic immune parameters revealed a stimulation of complement activity associated with an immunodeficiency in the fish fed VD, and different inflammatory status between the two half-sibfamilies. Biological processes related to protein

  2. Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms.

    Science.gov (United States)

    Speiser, Daniel I; Pankey, M Sabrina; Zaharoff, Alexander K; Battelle, Barbara A; Bracken-Grissom, Heather D; Breinholt, Jesse W; Bybee, Seth M; Cronin, Thomas W; Garm, Anders; Lindgren, Annie R; Patel, Nipam H; Porter, Megan L; Protas, Meredith E; Rivera, Ajna S; Serb, Jeanne M; Zigler, Kirk S; Crandall, Keith A; Oakley, Todd H

    2014-11-19

    Tools for high throughput sequencing and de novo assembly make the analysis of transcriptomes (i.e. the suite of genes expressed in a tissue) feasible for almost any organism. Yet a challenge for biologists is that it can be difficult to assign identities to gene sequences, especially from non-model organisms. Phylogenetic analyses are one useful method for assigning identities to these sequences, but such methods tend to be time-consuming because of the need to re-calculate trees for every gene of interest and each time a new data set is analyzed. In response, we employed existing tools for phylogenetic analysis to produce a computationally efficient, tree-based approach for annotating transcriptomes or new genomes that we term Phylogenetically-Informed Annotation (PIA), which places uncharacterized genes into pre-calculated phylogenies of gene families. We generated maximum likelihood trees for 109 genes from a Light Interaction Toolkit (LIT), a collection of genes that underlie the function or development of light-interacting structures in metazoans. To do so, we searched protein sequences predicted from 29 fully-sequenced genomes and built trees using tools for phylogenetic analysis in the Osiris package of Galaxy (an open-source workflow management system). Next, to rapidly annotate transcriptomes from organisms that lack sequenced genomes, we repurposed a maximum likelihood-based Evolutionary Placement Algorithm (implemented in RAxML) to place sequences of potential LIT genes on to our pre-calculated gene trees. Finally, we implemented PIA in Galaxy and used it to search for LIT genes in 28 newly-sequenced transcriptomes from the light-interacting tissues of a range of cephalopod mollusks, arthropods, and cubozoan cnidarians. Our new trees for LIT genes are available on the Bitbucket public repository ( http://bitbucket.org/osiris_phylogenetics/pia/ ) and we demonstrate PIA on a publicly-accessible web server ( http://galaxy-dev.cnsi.ucsb.edu/pia/ ). Our new

  3. Comparative whole genome transcriptome and metabolome analyses of five Klebsiella pneumonia strains.

    Science.gov (United States)

    Lee, Soojin; Kim, Borim; Yang, Jeongmo; Jeong, Daun; Park, Soohyun; Shin, Sang Heum; Kook, Jun Ho; Yang, Kap-Seok; Lee, Jinwon

    2015-11-01

    The integration of transcriptomics and metabolomics can provide precise information on gene-to-metabolite networks for identifying the function of novel genes. The goal of this study was to identify novel gene functions involved in 2,3-butanediol (2,3-BDO) biosynthesis by a comprehensive analysis of the transcriptome and metabolome of five mutated Klebsiella pneumonia strains (∆wabG = SGSB100, ∆wabG∆budA = SGSB106, ∆wabG∆budB = SGSB107, ∆wabG∆budC = SGSB108, ∆wabG∆budABC = SGSB109). First, the transcriptomes of all five mutants were analyzed and the genes exhibiting reproducible changes in expression were determined. The transcriptome was well conserved among the five strains, and differences in gene expression occurred mainly in genes coding for 2,3-BDO biosynthesis (budA, budB, and budC) and the genes involved in the degradation of reactive oxygen, biosynthesis and transport of arginine, cysteine biosynthesis, sulfur metabolism, oxidoreductase reaction, and formate dehydrogenase reaction. Second, differences in the metabolome (estimated by carbon distribution, CO2 emission, and redox balance) among the five mutant strains due to gene alteration of the 2,3-BDO operon were detected. The functional genomics approach integrating metabolomics and transcriptomics in K. Pneumonia presented here provides an innovative means of identifying novel gene functions involved in 2,3-BDO biosynthesis metabolism and whole cell metabolism.

  4. Transcriptomic resources for the medicinal legume Mucuna pruriens: de novo transcriptome assembly, annotation, identification and validation of EST-SSR markers.

    Science.gov (United States)

    Sathyanarayana, N; Pittala, Ranjith Kumar; Tripathi, Pankaj Kumar; Chopra, Ratan; Singh, Heikham Russiachand; Belamkar, Vikas; Bhardwaj, Pardeep Kumar; Doyle, Jeff J; Egan, Ashley N

    2017-05-25

    The medicinal legume Mucuna pruriens (L.) DC. has attracted attention worldwide as a source of the anti-Parkinson's drug L-Dopa. It is also a popular green manure cover crop that offers many agronomic benefits including high protein content, nitrogen fixation and soil nutrients. The plant currently lacks genomic resources and there is limited knowledge on gene expression, metabolic pathways, and genetics of secondary metabolite production. Here, we present transcriptomic resources for M. pruriens, including a de novo transcriptome assembly and annotation, as well as differential transcript expression analyses between root, leaf, and pod tissues. We also develop microsatellite markers and analyze genetic diversity and population structure within a set of Indian germplasm accessions. One-hundred ninety-one million two hundred thirty-three thousand two hundred forty-two bp cleaned reads were assembled into 67,561 transcripts with mean length of 626 bp and N50 of 987 bp. Assembled sequences were annotated using BLASTX against public databases with over 80% of transcripts annotated. We identified 7,493 simple sequence repeat (SSR) motifs, including 787 polymorphic repeats between the parents of a mapping population. 134 SSRs from expressed sequenced tags (ESTs) were screened against 23 M. pruriens accessions from India, with 52 EST-SSRs retained after quality control. Population structure analysis using a Bayesian framework implemented in fastSTRUCTURE showed nearly similar groupings as with distance-based (neighbor-joining) and principal component analyses, with most of the accessions clustering per geographical origins. Pair-wise comparison of transcript expression in leaves, roots and pods identified 4,387 differentially expressed transcripts with the highest number occurring between roots and leaves. Differentially expressed transcripts were enriched with transcription factors and transcripts annotated as belonging to secondary metabolite pathways. The M

  5. Genomotyping of Pseudomonas putida strains using P. putida KT2440-based high-density DNA microarrays: Implications for transcriptomics studies

    NARCIS (Netherlands)

    Ballerstedt, H.; Volkers, R.J.M.; Mars, A.E.; Hallsworth, J.E.; Santos, V.A.M.D.; Puchalka, J.; Duuren, J. van; Eggink, G.; Timmis, K.N.; Bont, J.A.M. de; Wery, J.

    2007-01-01

    Pseudomonas putida KT2440 is the only fully sequenced P. putida strain. Thus, for transcriptomics and proteomics studies with other P. putida strains, the P. putida KT2440 genomic database serves as standard reference. The utility of KT2440 whole-genome, high-density oligonucleotide microarrays for

  6. Transcriptomic signatures in cartilage ageing

    Science.gov (United States)

    2013-01-01

    Introduction Age is an important factor in the development of osteoarthritis. Microarray studies provide insight into cartilage aging but do not reveal the full transcriptomic phenotype of chondrocytes such as small noncoding RNAs, pseudogenes, and microRNAs. RNA-Seq is a powerful technique for the interrogation of large numbers of transcripts including nonprotein coding RNAs. The aim of the study was to characterise molecular mechanisms associated with age-related changes in gene signatures. Methods RNA for gene expression analysis using RNA-Seq and real-time PCR analysis was isolated from macroscopically normal cartilage of the metacarpophalangeal joints of eight horses; four young donors (4 years old) and four old donors (>15 years old). RNA sequence libraries were prepared following ribosomal RNA depletion and sequencing was undertaken using the Illumina HiSeq 2000 platform. Differentially expressed genes were defined using Benjamini-Hochberg false discovery rate correction with a generalised linear model likelihood ratio test (P ageing cartilage. Conclusion There was an age-related dysregulation of matrix, anabolic and catabolic cartilage factors. This study has increased our knowledge of transcriptional networks in cartilage ageing by providing a global view of the transcriptome. PMID:23971731

  7. Selecting Superior De Novo Transcriptome Assemblies: Lessons Learned by Leveraging the Best Plant Genome.

    Directory of Open Access Journals (Sweden)

    Loren A Honaas

    Full Text Available Whereas de novo assemblies of RNA-Seq data are being published for a growing number of species across the tree of life, there are currently no broadly accepted methods for evaluating such assemblies. Here we present a detailed comparison of 99 transcriptome assemblies, generated with 6 de novo assemblers including CLC, Trinity, SOAP, Oases, ABySS and NextGENe. Controlled analyses of de novo assemblies for Arabidopsis thaliana and Oryza sativa transcriptomes provide new insights into the strengths and limitations of transcriptome assembly strategies. We find that the leading assemblers generate reassuringly accurate assemblies for the majority of transcripts. At the same time, we find a propensity for assemblers to fail to fully assemble highly expressed genes. Surprisingly, the instance of true chimeric assemblies is very low for all assemblers. Normalized libraries are reduced in highly abundant transcripts, but they also lack 1000s of low abundance transcripts. We conclude that the quality of de novo transcriptome assemblies is best assessed through consideration of a combination of metrics: 1 proportion of reads mapping to an assembly 2 recovery of conserved, widely expressed genes, 3 N50 length statistics, and 4 the total number of unigenes. We provide benchmark Illumina transcriptome data and introduce SCERNA, a broadly applicable modular protocol for de novo assembly improvement. Finally, our de novo assembly of the Arabidopsis leaf transcriptome revealed ~20 putative Arabidopsis genes lacking in the current annotation.

  8. Assessing hog lagoon waste contamination in the Cape Fear Watershed using Bacteroidetes 16S rRNA gene pyrosequencing.

    Science.gov (United States)

    Arfken, Ann M; Song, Bongkeun; Mallin, Michael A

    2015-09-01

    Hog lagoons can be major sources of waste and nutrient contamination to watersheds adjacent to pig farms. Fecal source tracking methods targeting Bacteroidetes 16S rRNA genes in pig fecal matter may underestimate or fail to detect hog lagoon contamination in riverine environments. In order to detect hog lagoon wastewater contamination in the Cape Fear Watershed, where a large number of hog farms are present, we conducted pyrosequencing analyses of Bacteroidetes 16S rRNA genes in hog lagoon waste and identified new hog lagoon-specific marker sequences. Additional pyrosequencing analyses of Bacteroidetes 16S rRNA genes were conducted with surface water samples collected at 4 sites during 5 months in the Cape Fear Watershed. Using an operational taxonomic unit (OTU) identity cutoff value of 97 %, these newly identified hog lagoon markers were found in 3 of the river samples, while only 1 sample contained the pig fecal marker. In the sample containing the pig fecal marker, there was a relatively high percentage (14.1 %) of the hog lagoon markers and a low pig fecal marker relative abundance of 0.4 % in the Bacteroidetes 16S rRNA gene sequences. This suggests that hog lagoon contamination must be somewhat significant in order for pig fecal markers to be detected, and low levels of hog lagoon contamination cannot be detected targeting only pig-specific fecal markers. Thus, new hog lagoon markers have a better detection capacity for lagoon waste contamination, and in conjunction with a pig fecal marker, provide a more comprehensive and accurate detection of hog lagoon waste contamination in susceptible watersheds.

  9. comparative transcriptomics between Synechococcus PCC 7942 and Synechocystis PCC 6803 provide insights into mechanisms of adaptation to stress.

    Energy Technology Data Exchange (ETDEWEB)

    Konstantinos, Billis [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); European Bioinformatics Inst., Hinxton, Cambridge (United Kingdom). European Molecular Biology Lab.; Aristotle Univ., Thessaloniki (Greece). Dept. of Genetics; Billini, Maria [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Max Planck Inst. for Terrestrial Microbiology, Marburg (Germany); Tripp, Harry J. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Kyrpides, Nikos C. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Mavrommatis, Konstantinos [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Celgene Corp, San Francisco, CA (United States)

    2014-03-21

    Background: Synechococcus sp. PCC 7942 and Synechocystis sp. PCC 6803 are model cyanobacteria from which the metabolism and adaptive responses of other cyanobacteria are inferred. Here we report the gene expression response of these two strains to a variety of nutrient and environmental stresses of varying duration, using transcriptomics. Our data comprise both stranded and 5? enriched libraries in order to elucidate many aspects of the transcriptome. Results: Both organisms were exposed to stress conditions due to nutrient deficiency (inorganic carbon) or change of environmental conditions (salinity, temperature, pH, light) sampled at 1 and 24 hours after the application of stress. The transcriptome profile of each strain revealed similarities and differences in gene expression for photosynthetic and respiratory electron transport chains and carbon fixation. Transcriptome profiles also helped us improve the structural annotation of the genome and identify possible missed genes (including anti-sense) and determine transcriptional units (operons). Finally, we predicted association of proteins of unknown function biochemical pathways by associating them to well-characterized ones based on their transcript levels correlation. Conclusions: Overall, this study results an informative annotation of those species and the comparative analysis of the response of the two organisms revealed similarities but also significant changes in the way they respond to external stress and the duration of the response

  10. Transcriptome and proteome exploration to provide a resource for the study of Agrocybe aegerita.

    Directory of Open Access Journals (Sweden)

    Man Wang

    Full Text Available BACKGROUND: Agrocybe aegerita, the black poplar mushroom, has been highly valued as a functional food for its medicinal and nutritional benefits. Several bioactive extracts from A. aegerita have been found to exhibit antitumor and antioxidant activities. However, limited genetic resources for A. aegerita have hindered exploration of this species. METHODOLOGY/PRINCIPAL FINDINGS: To facilitate the research on A. aegerita, we established a deep survey of the transcriptome and proteome of this mushroom. We applied high-throughput sequencing technology (Illumina to sequence A. aegerita transcriptomes from mycelium and fruiting body. The raw clean reads were de novo assembled into a total of 36,134 expressed sequences tags (ESTs with an average length of 663 bp. These ESTs were annotated and classified according to Gene Ontology (GO, Clusters of Orthologous Groups (COG, and Kyoto Encyclopedia of Genes and Genomes (KEGG metabolic pathways. Gene expression profile analysis showed that 18,474 ESTs were differentially expressed, with 10,131 up-regulated in mycelium and 8,343 up-regulated in fruiting body. Putative genes involved in polysaccharide and steroid biosynthesis were identified from A. aegerita transcriptome, and these genes were differentially expressed at the two stages of A. aegerita. Based on one-dimensional gel electrophoresis (1-DGE coupled with electrospray ionization liquid chromatography tandem MS (LC-ESI-MS/MS, we identified a total of 309 non-redundant proteins. And many metabolic enzymes involved in glycolysis were identified in the protein database. CONCLUSIONS/SIGNIFICANCE: This is the first study on transcriptome and proteome analyses of A. aegerita. The data in this study serve as a resource of A. aegerita transcripts and proteins, and offer clues to the applications of this mushroom in nutrition, pharmacy and industry.

  11. Transcriptomic analysis of the oleaginous microalga Neochloris oleoabundans reveals metabolic insights into triacylglyceride accumulation

    Directory of Open Access Journals (Sweden)

    Rismani-Yazdi Hamid

    2012-09-01

    Full Text Available Abstract Background The lack of sequenced genomes for oleaginous microalgae limits our understanding of the mechanisms these organisms utilize to become enriched in triglycerides. Here we report the de novo transcriptome assembly and quantitative gene expression analysis of the oleaginous microalga Neochloris oleoabundans, with a focus on the complex interaction of pathways associated with the production of the triacylglycerol (TAG biofuel precursor. Results After growth under nitrogen replete and nitrogen limiting conditions, we quantified the cellular content of major biomolecules including total lipids, triacylglycerides, starch, protein, and chlorophyll. Transcribed genes were sequenced, the transcriptome was assembled de novo, and the expression of major functional categories, relevant pathways, and important genes was quantified through the mapping of reads to the transcriptome. Over 87 million, 77 base pair high quality reads were produced on the Illumina HiSeq sequencing platform. Metabolite measurements supported by genes and pathway expression results indicated that under the nitrogen-limiting condition, carbon is partitioned toward triglyceride production, which increased fivefold over the nitrogen-replete control. In addition to the observed overexpression of the fatty acid synthesis pathway, TAG production during nitrogen limitation was bolstered by repression of the β-oxidation pathway, up-regulation of genes encoding for the pyruvate dehydrogenase complex which funnels acetyl-CoA to lipid biosynthesis, activation of the pentose phosphate pathway to supply reducing equivalents to inorganic nitrogen assimilation and fatty acid biosynthesis, and the up-regulation of lipases—presumably to reconstruct cell membranes in order to supply additional fatty acids for TAG biosynthesis. Conclusions Our quantitative transcriptome study reveals a broad overview of how nitrogen stress results in excess TAG production in N. oleoabundans, and

  12. Relationships between drought, heat and air humidity responses revealed by transcriptome-metabolome co-analysis.

    Science.gov (United States)

    Georgii, Elisabeth; Jin, Ming; Zhao, Jin; Kanawati, Basem; Schmitt-Kopplin, Philippe; Albert, Andreas; Winkler, J Barbro; Schäffner, Anton R

    2017-07-10

    Elevated temperature and reduced water availability are frequently linked abiotic stresses that may provoke distinct as well as interacting molecular responses. Based on non-targeted metabolomic and transcriptomic measurements from Arabidopsis rosettes, this study aims at a systematic elucidation of relevant components in different drought and heat scenarios as well as relationships between molecular players of stress response. In combined drought-heat stress, the majority of single stress responses are maintained. However, interaction effects between drought and heat can be discovered as well; these relate to protein folding, flavonoid biosynthesis and growth inhibition, which are enhanced, reduced or specifically induced in combined stress, respectively. Heat stress experiments with and without supplementation of air humidity for maintenance of vapor pressure deficit suggest that decreased relative air humidity due to elevated temperature is an important component of heat stress, specifically being responsible for hormone-related responses to water deprivation. Remarkably, this "dry air effect" is the primary trigger of the metabolomic response to heat. In contrast, the transcriptomic response has a substantial temperature component exceeding the dry air component and including up-regulation of many transcription factors and protein folding-related genes. Data level integration independent of prior knowledge on pathways and condition labels reveals shared drought and heat responses between transcriptome and metabolome, biomarker candidates and co-regulation between genes and metabolic compounds, suggesting novel players in abiotic stress response pathways. Drought and heat stress interact both at transcript and at metabolite response level. A comprehensive, non-targeted view of this interaction as well as non-interacting processes is important to be taken into account when improving tolerance to abiotic stresses in breeding programs. Transcriptome and metabolome

  13. Single-cell entropy for accurate estimation of differentiation potency from a cell's transcriptome

    Science.gov (United States)

    Teschendorff, Andrew E.; Enver, Tariq

    2017-01-01

    The ability to quantify differentiation potential of single cells is a task of critical importance. Here we demonstrate, using over 7,000 single-cell RNA-Seq profiles, that differentiation potency of a single cell can be approximated by computing the signalling promiscuity, or entropy, of a cell's transcriptome in the context of an interaction network, without the need for feature selection. We show that signalling entropy provides a more accurate and robust potency estimate than other entropy-based measures, driven in part by a subtle positive correlation between the transcriptome and connectome. Signalling entropy identifies known cell subpopulations of varying potency and drug resistant cancer stem-cell phenotypes, including those derived from circulating tumour cells. It further reveals that expression heterogeneity within single-cell populations is regulated. In summary, signalling entropy allows in silico estimation of the differentiation potency and plasticity of single cells and bulk samples, providing a means to identify normal and cancer stem-cell phenotypes. PMID:28569836

  14. Sequencing and Characterization of Divergent Marbling Levels in the Beef Cattle ( Muscle Transcriptome

    Directory of Open Access Journals (Sweden)

    Dong Chen

    2015-02-01

    Full Text Available Marbling is an important trait regarding the quality of beef. Analysis of beef cattle transcriptome and its expression profile data are essential to extend the genetic information resources and would support further studies on beef cattle. RNA sequencing was performed in beef cattle using the Illumina High-Seq2000 platform. Approximately 251.58 million clean reads were generated from a high marbling (H group and low marbling (L group. Approximately 80.12% of the 19,994 bovine genes (protein coding were detected in all samples, and 749 genes exhibited differential expression between the H and L groups based on fold change (>1.5-fold, p<0.05. Multiple gene ontology terms and biological pathways were found significantly enriched among the differentially expressed genes. The transcriptome data will facilitate future functional studies on marbling formation in beef cattle and may be applied to improve breeding programs for cattle and closely related mammals.

  15. Whole transcriptome analysis using next-generation sequencing of model species Setaria viridis to support C4 photosynthesis research.

    Science.gov (United States)

    Xu, Jiajia; Li, Yuanyuan; Ma, Xiuling; Ding, Jianfeng; Wang, Kai; Wang, Sisi; Tian, Ye; Zhang, Hui; Zhu, Xin-Guang

    2013-09-01

    Setaria viridis is an emerging model species for genetic studies of C4 photosynthesis. Many basic molecular resources need to be developed to support for this species. In this paper, we performed a comprehensive transcriptome analysis from multiple developmental stages and tissues of S. viridis using next-generation sequencing technologies. Sequencing of the transcriptome from multiple tissues across three developmental stages (seed germination, vegetative growth, and reproduction) yielded a total of 71 million single end 100 bp long reads. Reference-based assembly using Setaria italica genome as a reference generated 42,754 transcripts. De novo assembly generated 60,751 transcripts. In addition, 9,576 and 7,056 potential simple sequence repeats (SSRs) covering S. viridis genome were identified when using the reference based assembled transcripts and the de novo assembled transcripts, respectively. This identified transcripts and SSR provided by this study can be used for both reverse and forward genetic studies based on S. viridis.

  16. De novo transcriptome sequencing and analysis of the cereal cyst nematode, Heterodera avenae.

    Directory of Open Access Journals (Sweden)

    Mukesh Kumar

    Full Text Available The cereal cyst nematode (CCN, Heterodera avenae is a major pest of wheat (Triticum spp that reduces crop yields in many countries. Cyst nematodes are obligate sedentary endoparasites that reproduce by amphimixis. Here, we report the first transcriptome analysis of two stages of H. avenae. After sequencing extracted RNA from pre parasitic infective juvenile and adult stages of the life cycle, 131 million Illumina high quality paired end reads were obtained which generated 27,765 contigs with N50 of 1,028 base pairs, of which 10,452 were annotated. Comparative analyses were undertaken to evaluate H. avenae sequences with those of other plant, animal and free living nematodes to identify differences in expressed genes. There were 4,431 transcripts common to H. avenae and the free living nematode Caenorhabditis elegans, and 9,462 in common with more closely related potato cyst nematode, Globodera pallida. Annotation of H. avenae carbohydrate active enzymes (CAZy revealed fewer glycoside hydrolases (GHs but more glycosyl transferases (GTs and carbohydrate esterases (CEs when compared to M. incognita. 1,280 transcripts were found to have secretory signature, presence of signal peptide and absence of transmembrane. In a comparison of genes expressed in the pre-parasitic juvenile and feeding female stages, expression levels of 30 genes with high RPKM (reads per base per kilo million value, were analysed by qRT-PCR which confirmed the observed differences in their levels of expression levels. In addition, we have also developed a user-friendly resource, Heterodera transcriptome database (HATdb for public access of the data generated in this study. The new data provided on the transcriptome of H. avenae adds to the genetic resources available to study plant parasitic nematodes and provides an opportunity to seek new effectors that are specifically involved in the H. avenae-cereal host interaction.

  17. De novo transcriptome sequencing and analysis of the cereal cyst nematode, Heterodera avenae.

    Science.gov (United States)

    Kumar, Mukesh; Gantasala, Nagavara Prasad; Roychowdhury, Tanmoy; Thakur, Prasoon Kumar; Banakar, Prakash; Shukla, Rohit N; Jones, Michael G K; Rao, Uma

    2014-01-01

    The cereal cyst nematode (CCN, Heterodera avenae) is a major pest of wheat (Triticum spp) that reduces crop yields in many countries. Cyst nematodes are obligate sedentary endoparasites that reproduce by amphimixis. Here, we report the first transcriptome analysis of two stages of H. avenae. After sequencing extracted RNA from pre parasitic infective juvenile and adult stages of the life cycle, 131 million Illumina high quality paired end reads were obtained which generated 27,765 contigs with N50 of 1,028 base pairs, of which 10,452 were annotated. Comparative analyses were undertaken to evaluate H. avenae sequences with those of other plant, animal and free living nematodes to identify differences in expressed genes. There were 4,431 transcripts common to H. avenae and the free living nematode Caenorhabditis elegans, and 9,462 in common with more closely related potato cyst nematode, Globodera pallida. Annotation of H. avenae carbohydrate active enzymes (CAZy) revealed fewer glycoside hydrolases (GHs) but more glycosyl transferases (GTs) and carbohydrate esterases (CEs) when compared to M. incognita. 1,280 transcripts were found to have secretory signature, presence of signal peptide and absence of transmembrane. In a comparison of genes expressed in the pre-parasitic juvenile and feeding female stages, expression levels of 30 genes with high RPKM (reads per base per kilo million) value, were analysed by qRT-PCR which confirmed the observed differences in their levels of expression levels. In addition, we have also developed a user-friendly resource, Heterodera transcriptome database (HATdb) for public access of the data generated in this study. The new data provided on the transcriptome of H. avenae adds to the genetic resources available to study plant parasitic nematodes and provides an opportunity to seek new effectors that are specifically involved in the H. avenae-cereal host interaction.

  18. Additive Effects of Millimeter Waves and 2-Deoxyglucose Co-Exposure on the Human Keratinocyte Transcriptome.

    Science.gov (United States)

    Soubere Mahamoud, Yonis; Aite, Meziane; Martin, Catherine; Zhadobov, Maxim; Sauleau, Ronan; Le Dréan, Yves; Habauzit, Denis

    2016-01-01

    Millimeter Waves (MMW) will be used in the next-generation of high-speed wireless technologies, especially in future Ultra-Broadband small cells in 5G cellular networks. Therefore, their biocompatibilities must be evaluated prior to their massive deployment. Using a microarray-based approach, we analyzed modifications to the whole genome of a human keratinocyte model that was exposed at 60.4 GHz-MMW at an incident power density (IPD) of 20 mW/cm2 for 3 hours in athermic conditions. No keratinocyte transcriptome modifications were observed. We tested the effects of MMWs on cell metabolism by co-treating MMW-exposed cells with a glycolysis inhibitor, 2-deoxyglucose (2dG, 20 mM for 3 hours), and whole genome expression was evaluated along with the ATP content. We found that the 2dG treatment decreased the cellular ATP content and induced a high modification in the transcriptome (632 coding genes). The affected genes were associated with transcriptional repression, cellular communication and endoplasmic reticulum homeostasis. The MMW/2dG co-treatment did not alter the keratinocyte ATP content, but it did slightly alter the transcriptome, which reflected the capacity of MMW to interfere with the bioenergetic stress response. The RT-PCR-based validation confirmed 6 MMW-sensitive genes (SOCS3, SPRY2, TRIB1, FAM46A, CSRNP1 and PPP1R15A) during the 2dG treatment. These 6 genes encoded transcription factors or inhibitors of cytokine pathways, which raised questions regarding the potential impact of long-term or chronic MMW exposure on metabolically stressed cells.

  19. Transcriptomic analysis of the rice white tip nematode, Aphelenchoides besseyi (Nematoda: Aphelenchoididae.

    Directory of Open Access Journals (Sweden)

    Feng Wang

    Full Text Available BACKGROUND: The rice white tip nematode Aphelenchoides besseyi, a devastating nematode whose genome has not been sequenced, is distributed widely throughout almost all the rice-growing regions of the world. The aims of the present study were to define the transcriptome of A. besseyi and to identify parasite-related, mortality-related or host resistance-overcoming genes in this nematode. METHODOLOGY AND PRINCIPAL FINDINGS: Using Solexa/Illumina sequencing, we profiled the transcriptome of mixed-stage populations of A. besseyi. A total of 51,270 transcripts without gaps were produced based on high-quality clean reads. Of all the A. besseyi transcripts, 9,132 KEGG Orthology assignments were annotated. Carbohydrate-active enzymes of glycoside hydrolases (GHs, glycosyltransferases (GTs, carbohydrate esterases (CEs and carbohydrate-binding modules (CBMs were identified. The presence of the A. besseyi GH45 cellulase gene was verified by in situ hybridization. Given that 13 unique A. besseyi potential effector genes were identified from 41 candidate effector homologs, further studies of these homologs are merited. Finally, comparative analyses were conducted between A. besseyi contigs and Caenorhabditis elegans genes to look for orthologs of RNAi phenotypes, neuropeptides and peptidases. CONCLUSIONS AND SIGNIFICANCE: The present results provide comprehensive insight into the genetic makeup of A. besseyi. Many of this species' genes are parasite related, nematode mortality-related or necessary to overcome host resistance. The generated transcriptome dataset of A. besseyi reported here lays the foundation for further studies of the molecular mechanisms related to parasitism and facilitates the development of new control strategies for this species.

  20. A high-resolution anatomical atlas of the transcriptome in the mouse embryo.

    Directory of Open Access Journals (Sweden)

    Graciana Diez-Roux

    Full Text Available Ascertaining when and where genes are expressed is of crucial importance to understanding or predicting the physiological role of genes and proteins and how they interact to form the complex networks that underlie organ development and function. It is, therefore, crucial to determine on a genome-wide level, the spatio-temporal gene expression profiles at cellular resolution. This information is provided by colorimetric RNA in situ hybridization that can elucidate expression of genes in their native context and does so at cellular resolution. We generated what is to our knowledge the first genome-wide transcriptome atlas by RNA in situ hybridization of an entire mammalian organism, the developing mouse at embryonic day 14.5. This digital transcriptome atlas, the Eurexpress atlas (http://www.eurexpress.org, consists of a searchable database of annotated images that can be interactively viewed. We generated anatomy-based expression profiles for over 18,000 coding genes and over 400 microRNAs. We identified 1,002 tissue-specific genes that are a source of novel tissue-specific markers for 37 different anatomical structures. The quality and the resolution of the data revealed novel molecular domains for several developing structures, such as the telencephalon, a novel organization for the hypothalamus, and insight on the Wnt network involved in renal epithelial differentiation during kidney development. The digital transcriptome atlas is a powerful resource to determine co-expression of genes, to identify cell populations and lineages, and to identify functional associations between genes relevant to development and disease.

  1. De novo transcriptome analysis of Sinapis alba in revealing the glucosinolate and phytochelatin pathways

    Directory of Open Access Journals (Sweden)

    Xiaohui eZhang

    2016-03-01

    Full Text Available Sinapis alba is an important condiment crop and can also be used as a phytoremediation plant. Though it has important economic and agronomic values, sequence data and the genetic tools are still rare in this plant. In the present study, a de novo transcriptome based on the transcriptions of leaves, stems and roots was assembled for S. alba for the first time. The transcriptome contains 47,972 unigenes with a mean length of 1,185 nt and an N50 of 1,672 nt. Among these unigenes, 46,535 (97% unigenes were annotated by at least one of the following databases: NCBI non-redundant (Nr, Swiss-Prot, Kyoto Encyclopedia of Genes and Genomes (KEGG pathway, Gene Ontology (GO, and Clusters of Orthologous Groups of proteins (COGs. The tissue expression pattern profiles revealed that 3,489, 1,361 and 8,482 unigenes were predominantly expressed in the leaves, stems and roots of S. alba, respectively. Genes predominantly expressed in the leaf were enriched in photosynthesis- and carbon fixation-related pathways. Genes predominantly expressed in the stem were enriched in not only pathways related to sugar, ether lipid and amino acid metabolisms but also plant hormone signal transduction and circadian rhythm pathways, while the root-dominant genes were enriched in pathways related to lignin and cellulose syntheses, involved in plant-pathogen interactions, and potentially responsible for heavy metal chelating and detoxification. Based on this transcriptome, 14,727 simple sequence repeats (SSRs were identified, and 12,830 pairs of primers were developed for 2,522 SSR-containing unigenes. Additionally, the glucosinolate (GSL and phytochelatin metabolic pathways, which give the characteristic flavor and the heavy metal tolerance of this plant, were intensively analyzed. The genes of aliphatic GSLs pathway were predominantly expressed in roots. The absence of aliphatic GSLs in leaf tissues was due to the shutdown of BCAT4, MAM1 and CYP79F1 expressions. Glutathione was

  2. Transcriptome dynamics of the microRNA inhibition response

    DEFF Research Database (Denmark)

    Wen, Jiayu; Leucci, Elenora; Vendramin, Roberto

    2015-01-01

    We report a high-resolution time series study of transcriptome dynamics following antimiR-mediated inhibition of miR-9 in a Hodgkin lymphoma cell-line-the first such dynamic study of the microRNA inhibition response-revealing both general and specific aspects of the physiological response. We show...... validate the key observations with independent time series qPCR and we experimentally validate key predicted miR-9 targets. Methodologically, we developed sensitive functional data analytic predictive methods to analyse the weak response inherent in microRNA inhibition experiments. The methods...... of this study will be applicable to similar high-resolution time series transcriptome analyses and provides the context for more accurate experimental design and interpretation of future microRNA inhibition studies....

  3. Suppression substractive hybridisation and NGS reveal differential transcriptome expression profiles in Wayfaring Tree (Viburnum lantana L. treated with ozone

    Directory of Open Access Journals (Sweden)

    Elena eGottardini

    2016-06-01

    Full Text Available Tropospheric ozone (O3 is a global air pollutant that causes high economical damages by decresing plant productivity. It entering leaves through the stomata, generating reactive oxygen species, which following decreases photosynthesis, plant growth, and biomass accumulation. In order to identify genes that are important for conferring O3 tolerance or sensitivity to plants, a suppression subtractive hybridization analysis was performed on the very sensitive woody shrub, Viburnum lantana, exposed to chronic O3 treatment (60 ppb, 5 h d-1 for 45 consecutive days. Transcript profiling and relative expression assessment were carried out in asymptomatic leaves, after 15 days of O3 exposure. At the end of the experiment symptoms were observed on all treated leaves and plants, with an injured leaf area per plant accounting for 4.2% of the total surface. Using 454-pyrosequencing, the transcriptome analysis of O3-responsive genes in leaves was performed, compiling a total of 38,800 and 12,495 high quality reads obtained in control and O3-treated libraries, respectively (average length of 319±156.7 and 255±107.4 bp. The Ensembl transcriptome yielded a total of 1241 unigenes with a total sequence length of 389,126 bp and an average length size of 389 bp (guanine-cytosine content = 49.9%. mRNA abundance was measured by reads per kilobase per million and 41 and 37 ensembl unigenes showed up- and down-regulation respectively. Photosynthetic performance of unigenes functionally associated to photosynthesis and carbon utilization was repressed, demonstrating the deleterious effect of O3 exposure. Unigenes functionally associated to heat-shock proteins and glutathione were concurrently induced, suggesting the role of thylakoid-localized proteins and antioxidant-detoxification pathways as an effective strategy for responding to O3. Gene Ontology analysis documented a differential expression of co-regulated transcripts for several functional categories, including

  4. Scrimer: designing primers from transcriptome data

    Czech Academy of Sciences Publication Activity Database

    Mořkovský, Libor; Pačes, Jan; Rídl, Jakub; Reifová, R.

    2015-01-01

    Roč. 15, č. 6 (2015), s. 1415-1420 ISSN 1755-098X R&D Projects: GA MŠk EE2.3.20.0303 Institutional support: RVO:68081766 ; RVO:68378050 Keywords : next-generation sequencing * primer design * SNaPshot * SNP genotyping * transcriptome Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 5.298, year: 2015

  5. Widespread uncoupling between transcriptome and translatome variations after a stimulus in mammalian cells

    Directory of Open Access Journals (Sweden)

    Tebaldi Toma

    2012-06-01

    Full Text Available Abstract Background The classical view on eukaryotic gene expression proposes the scheme of a forward flow for which fluctuations in mRNA levels upon a stimulus contribute to determine variations in mRNA availability for translation. Here we address this issue by simultaneously profiling with microarrays the total mRNAs (the transcriptome and the polysome-associated mRNAs (the translatome after EGF treatment of human cells, and extending the analysis to other 19 different transcriptome/translatome comparisons in mammalian cells following different stimuli or undergoing cell programs. Results Triggering of the EGF pathway results in an early induction of transcriptome and translatome changes, but 90% of the significant variation is limited to the translatome and the degree of concordant changes is less than 5%. The survey of other 19 different transcriptome/translatome comparisons shows that extensive uncoupling is a general rule, in terms of both RNA movements and inferred cell activities, with a strong tendency of translation-related genes to be controlled purely at the translational level. By different statistical approaches, we finally provide evidence of the lack of dependence between changes at the transcriptome and translatome levels. Conclusions We propose a model of diffused independency between variation in transcript abundances and variation in their engagement on polysomes, which implies the existence of specific mechanisms to couple these two ways of regulating gene expression.

  6. De novo transcriptome sequencing and assembly from apomictic and sexual Eragrostis curvula genotypes.

    Directory of Open Access Journals (Sweden)

    Ingrid Garbus

    Full Text Available A long-standing goal in plant breeding has been the ability to confer apomixis to agriculturally relevant species, which would require a deeper comprehension of the molecular basis of apomictic regulatory mechanisms. Eragrostis curvula (Schrad. Nees is a perennial grass that includes both sexual and apomictic cytotypes. The availability of a reference transcriptome for this species would constitute a very important tool toward the identification of genes controlling key steps of the apomictic pathway. Here, we used Roche/454 sequencing technologies to generate reads from inflorescences of E. curvula apomictic and sexual genotypes that were de novo assembled into a reference transcriptome. Near 90% of the 49568 assembled isotigs showed sequence similarity to sequences deposited in the public databases. A gene ontology analysis categorized 27448 isotigs into at least one of the three main GO categories. We identified 11475 SSRs, and several of them were assayed in E curvula germoplasm using SSR-based primers, providing a valuable set of molecular markers that could allow direct allele selection. The differential contribution to each library of the spliced forms of several transcripts revealed the existence of several isotigs produced via alternative splicing of single genes. The reference transcriptome presented and validated in this work will be useful for the identification of a wide range of gene(s related to agronomic traits of E. curvula, including those controlling key steps of the apomictic pathway in this species, allowing the extrapolation of the findings to other plant species.

  7. Transcriptome analysis of the rhizosphere bacterium Azospirillum brasilense reveals an extensive auxin response.

    Science.gov (United States)

    Van Puyvelde, Sandra; Cloots, Lore; Engelen, Kristof; Das, Frederik; Marchal, Kathleen; Vanderleyden, Jos; Spaepen, Stijn

    2011-05-01

    The rhizosphere bacterium Azospirillum brasilense produces the auxin indole-3-acetic acid (IAA) through the indole-3-pyruvate pathway. As we previously demonstrated that transcription of the indole-3-pyruvate decarboxylase (ipdC) gene is positively regulated by IAA, produced by A. brasilense itself or added exogenously, we performed a microarray analysis to study the overall effects of IAA on the transcriptome of A. brasilense. The transcriptomes of A. brasilense wild-type and the ipdC knockout mutant, both cultured in the absence and presence of exogenously added IAA, were compared.Interfering with the IAA biosynthesis/homeostasis in A. brasilense through inactivation of the ipdC gene or IAA addition results in much broader transcriptional changes than anticipated. Based on the multitude of changes observed by comparing the different transcriptomes, we can conclude that IAA is a signaling molecule in A. brasilense. It appears that the bacterium, when exposed to IAA, adapts itself to the plant rhizosphere, by changing its arsenal of transport proteins and cell surface proteins. A striking example of adaptation to IAA exposure, as happens in the rhizosphere, is the upregulation of a type VI secretion system (T6SS) in the presence of IAA. The T6SS is described as specifically involved in bacterium-eukaryotic host interactions. Additionally, many transcription factors show an altered regulation as well, indicating that the regulatory machinery of the bacterium is changing.

  8. Assessment of pleiotropic transcriptome perturbations in Arabidopsis engineered for indirect insect defence.

    Science.gov (United States)

    Houshyani, Benyamin; van der Krol, Alexander R; Bino, Raoul J; Bouwmeester, Harro J

    2014-06-19

    Molecular characterization is an essential step of risk/safety assessment of genetically modified (GM) crops. Holistic approaches for molecular characterization using omics platforms can be used to confirm the intended impact of the genetic engineering, but can also reveal the unintended changes at the omics level as a first assessment of potential risks. The potential of omics platforms for risk assessment of GM crops has rarely been used for this purpose because of the lack of a consensus reference and statistical methods to judge the significance or importance of the pleiotropic changes in GM plants. Here we propose a meta data analysis approach to the analysis of GM plants, by measuring the transcriptome distance to untransformed wild-types. In the statistical analysis of the transcriptome distance between GM and wild-type plants, values are compared with naturally occurring transcriptome distances in non-GM counterparts obtained from a database. Using this approach we show that the pleiotropic effect of genes involved in indirect insect defence traits is substantially equivalent to the variation in gene expression occurring naturally in Arabidopsis. Transcriptome distance is a useful screening method to obtain insight in the pleiotropic effects of genetic modification.

  9. Comparison of next generation sequencing technologies for transcriptome characterization

    Directory of Open Access Journals (Sweden)

    Soltis Douglas E

    2009-08-01

    Full Text Available Abstract Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19. We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica and the magnoliid avocado (Persea americana using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB, 119,518 (88.7% mapped exactly to known exons, while 1,117 (0.8% mapped to introns, 11,524 (8.6% spanned annotated intron/exon boundaries, and 3,066 (2.3% extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc http://fgp.huck.psu.edu/NG_Sims/ngsim.pl, an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance

  10. Characterization of the heart transcriptome of the white shark (Carcharodon carcharias).

    Science.gov (United States)

    Richards, Vincent P; Suzuki, Haruo; Stanhope, Michael J; Shivji, Mahmood S

    2013-10-11

    The white shark (Carcharodon carcharias) is a globally distributed, apex predator possessing physical, physiological, and behavioral traits that have garnered it significant public attention. In addition to interest in the genetic basis of its form and function, as a representative of the oldest extant jawed vertebrate lineage, white sharks are also of conservation concern due to their small population size and threat from overfishing. Despite this, surprisingly little is known about the biology of white sharks, and genomic resources are unavailable. To address this deficit, we combined Roche-454 and Illumina sequencing technologies to characterize the first transciptome of any tissue for this species. From white shark heart cDNA we generated 665,399 Roche 454 reads (median length 387-bp) that were assembled into 141,626 contigs (mean length 503-bp). We also generated 78,566,588 Illumina reads, which we aligned to the 454 contigs producing 105,014 454/Illumina consensus sequences. To these, we added 3,432 non-singleton 454 contigs. By comparing these sequences to the UniProtKB/Swiss-Prot database we were able to annotate 21,019 translated open reading frames (ORFs) of ≥ 20 amino acids. Of these, 19,277 were additionally assigned Gene Ontology (GO) functional annotations. While acknowledging the limitations of our single tissue transcriptome, Fisher tests showed the white shark transcriptome to be significantly enriched for numerous metabolic GO terms compared to the zebra fish and human transcriptomes, with white shark showing more similarity to human than to zebra fish (i.e. fewer terms were significantly different). We also compared the transcriptome to other available elasmobranch sequences, for signatures of positive selection and identified several genes of putative adaptive significance on the white shark lineage. The white shark transcriptome also contained 8,404 microsatellites (dinucleotide, trinucleotide, or tetranucleotide motifs ≥ five perfect

  11. Adaptation and evolution of deep-sea scale worms (Annelida: Polynoidae): insights from transcriptome comparison with a shallow-water species

    Science.gov (United States)

    Zhang, Yanjie; Sun, Jin; Chen, Chong; Watanabe, Hiromi K.; Feng, Dong; Zhang, Yu; Chiu, Jill M.Y.; Qian, Pei-Yuan; Qiu, Jian-Wen

    2017-01-01

    Polynoid scale worms (Polynoidae, Annelida) invaded deep-sea chemosynthesis-based ecosystems approximately 60 million years ago, but little is known about their genetic adaptation to the extreme deep-sea environment. In this study, we reported the first two transcriptomes of deep-sea polynoids (Branchipolynoe pettiboneae, Lepidonotopodium sp.) and compared them with the transcriptome of a shallow-water polynoid (Harmothoe imbricata). We determined codon and amino acid usage, positive selected genes, highly expressed genes and putative duplicated genes. Transcriptome assembly produced 98,806 to 225,709 contigs in the three species. There were more positively charged amino acids (i.e., histidine and arginine) and less negatively charged amino acids (i.e., aspartic acid and glutamic acid) in the deep-sea species. There were 120 genes showing clear evidence of positive selection. Among the 10% most highly expressed genes, there were more hemoglobin genes with high expression levels in both deep-sea species. The duplicated genes related to DNA recombination and metabolism, and gene expression were only enriched in deep-sea species. Deep-sea scale worms adopted two strategies of adaptation to hypoxia in the chemosynthesis-based habitats (i.e., rapid evolution of tetra-domain hemoglobin in Branchipolynoe or high expression of single-domain hemoglobin in Lepidonotopodium sp.). PMID:28397791

  12. Ultra-high resolution HLA genotyping and allele discovery by highly multiplexed cDNA amplicon pyrosequencing

    Directory of Open Access Journals (Sweden)

    Lank Simon M

    2012-08-01

    Full Text Available Abstract Background High-resolution HLA genotyping is a critical diagnostic and research assay. Current methods rarely achieve unambiguous high-resolution typing without making population-specific frequency inferences due to a lack of locus coverage and difficulty in exon-phase matching. Achieving high-resolution typing is also becoming more challenging with traditional methods as the database of known HLA alleles increases. Results We designed a cDNA amplicon-based pyrosequencing method to capture 94% of the HLA class I open-reading-frame with only two amplicons per sample, and an analogous method for class II HLA genes, with a primary focus on sequencing the DRB loci. We present a novel Galaxy server-based analysis workflow for determining genotype. During assay validation, we performed two GS Junior sequencing runs to determine the accuracy of the HLA class I amplicons and DRB amplicon at different levels of multiplexing. When 116 amplicons were multiplexed, we unambiguously resolved 99%of class I alleles to four- or six-digit resolution, as well as 100% unambiguous DRB calls. The second experiment, with 271 multiplexed amplicons, missed some alleles, but generated high-resolution, concordant typing for 93% of class I alleles, and 96% for DRB1 alleles. In a third, preliminary experiment we attempted to sequence novel amplicons for other class II loci with mixed success. Conclusions The presented assay is higher-throughput and higher-resolution than existing HLA genotyping methods, and suitable for allele discovery or large cohort sampling. The validated class I and DRB primers successfully generated unambiguously high-resolution genotypes, while further work is needed to validate additional class II genotyping amplicons.

  13. Sequencing and analysis of the gastrula transcriptome of the brittle star Ophiocoma wendtii

    Directory of Open Access Journals (Sweden)

    Vaughn Roy

    2012-09-01

    Full Text Available Abstract Background The gastrula stage represents the point in development at which the three primary germ layers diverge. At this point the gene regulatory networks that specify the germ layers are established and the genes that define the differentiated states of the tissues have begun to be activated. These networks have been well-characterized in sea urchins, but not in other echinoderms. Embryos of the brittle star Ophiocoma wendtii share a number of developmental features with sea urchin embryos, including the ingression of mesenchyme cells that give rise to an embryonic skeleton. Notable differences are that no micromeres are formed during cleavage divisions and no pigment cells are formed during development to the pluteus larval stage. More subtle changes in timing of developmental events also occur. To explore the molecular basis for the similarities and differences between these two echinoderms, we have sequenced and characterized the gastrula transcriptome of O. wendtii. Methods Development of Ophiocoma wendtii embryos was characterized and RNA was isolated from the gastrula stage. A transcriptome data base was generated from this RNA and was analyzed using a variety of methods to identify transcripts expressed and to compare those transcripts to those expressed at the gastrula stage in other organisms. Results Using existing databases, we identified brittle star transcripts that correspond to 3,385 genes, including 1,863 genes shared with the sea urchin Strongylocentrotus purpuratus gastrula transcriptome. We characterized the functional classes of genes present in the transcriptome and compared them to those found in this sea urchin. We then examined those members of the germ-layer specific gene regulatory networks (GRNs of S. purpuratus that are expressed in the O. wendtii gastrula. Our results indicate that there is a shared ‘genetic toolkit’ central to the echinoderm gastrula, a key stage in embryonic development, though

  14. Integrated transcriptomic and proteomic evaluation of gentamicin nephrotoxicity in rats

    International Nuclear Information System (INIS)

    Com, Emmanuelle; Boitier, Eric; Marchandeau, Jean-Pierre; Brandenburg, Arnd; Schroeder, Susanne; Hoffmann, Dana; Mally, Angela; Gautier, Jean-Charles

    2012-01-01

    Gentamicin is an aminoglycoside antibiotic, which induces renal tubular necrosis in rats. In the context of the European InnoMed PredTox project, transcriptomic and proteomic studies were performed to provide new insights into the molecular mechanisms of gentamicin-induced nephrotoxicity. Male Wistar rats were treated with 25 and 75 mg/kg/day subcutaneously for 1, 3 and 14 days. Histopathology observations showed mild tubular degeneration/necrosis and regeneration and moderate mononuclear cell infiltrate after long-term treatment. Transcriptomic data indicated a strong treatment-related gene expression modulation in kidney and blood cells at the high dose after 14 days of treatment, with the regulation of 463 and 3241 genes, respectively. Of note, the induction of NF-kappa B pathway via the p38 MAPK cascade in the kidney, together with the activation of T-cell receptor signaling in blood cells were suggestive of inflammatory processes in relation with the recruitment of mononuclear cells in the kidney. Proteomic results showed a regulation of 163 proteins in kidney at the high dose after 14 days of treatment. These protein modulations were suggestive of a mitochondrial dysfunction with impairment of cellular energy production, induction of oxidative stress, an effect on protein biosynthesis and on cellular assembly and organization. Proteomic results also provided clues for potential nephrotoxicity biomarkers such as AGAT and PRBP4 which were strongly modulated in the kidney. Transcriptomic and proteomic data turned out to be complementary and their integration gave a more comprehensive insight into the putative mode of nephrotoxicity of gentamicin which was in accordance with histopathological findings. -- Highlights: ► Gentamicin induces renal tubular necrosis in rats. ► The mechanisms of gentamicin nephrotoxicity remain still elusive. ► Transcriptomic and proteomic analyses were performed to study this toxicity in rats. ► Transcriptomic and proteomic

  15. Somatic populations of PGT135-137 HIV-1-neutralizing antibodies identified by 454 pyrosequencing and bioinformatics

    Directory of Open Access Journals (Sweden)

    Jiang eZhu

    2012-09-01

    Full Text Available Select HIV-1-infected individuals develop sera capable of neutralizing diverse viral strains. The molecular basis of this neutralization is currently being deciphered by the isolation of HIV-1-neutralizing antibodies. In one infected donor, three neutralizing antibodies, PGT135-137, were identified by assessment of neutralization from individually sorted B cells and found to recognize an epitope containing an N-linked glycan at residue 332 on HIV-1 gp120. Here we use deep sequencing and bioinformatics methods to interrogate the B cell record of this donor to gain a more complete understanding of the humoral immune response. PGT135-137-gene family-specific primers were used to amplify heavy and light chain-variable domain sequences. 454 pyrosequencing produced 141,298 heavy-chain sequences of IGHV4-39 origin and 87,229 light-chain sequences of IGKV3-15 origin. A number of heavy and light chain sequences of ~90% identity to PGT137, several to PGT136, and none of high identity to PGT135 were identified. After expansion of these sequences to include close phylogenetic relatives, a total of 202 heavy-chain sequences and 72 light-chain sequences were identified. These sequences were clustered into populations of 95% identity comprising 15 for heavy chain and 10 for light chain, and a select sequence from each population was synthesized and reconstituted with a PGT137-partner chain. Reconstituted antibodies showed varied neutralization phenotypes for HIV-1 clade A and D isolates. Sequence diversity of the antibody population represented by these tested sequences was notably higher than observed with a 454 pyrosequencing-control analysis on 10 antibodies of defined sequence, suggesting that this diversity results primarily from somatic maturation. Our results thus provide an example of how pathogens like HIV-1 are opposed by a varied humoral immune response, derived from intrinsic mechanisms of antibody development, and embodied by somatic populations

  16. Impact of enrofloxacin on the human intestinal microbiota revealed by comparative molecular analysis.

    Science.gov (United States)

    Kim, Bong-Soo; Kim, Jong Nam; Yoon, Seok-Hwan; Chun, Jongsik; Cerniglia, Carl E

    2012-06-01

    The indigenous human intestinal microbiota could be disrupted by residues of antibiotics in foods as well as therapeutically administered antibiotics to humans. These disruptions may lead to adverse health outcomes. To observe the possible impact of residues of antibiotics at concentrations below therapeutic levels on human intestinal microbiota, we performed studies using in vitro cultures of fecal suspensions from three individuals with 10 different concentrations (0, 0.1, 0.5, 1, 5, 10, 15, 25, 50 and 150 μg/ml) of the fluoroquinolone, enrofloxacin. The bacterial communities of the control and enrofloxacin dosed fecal samples were analyzed by denaturing gradient gel electrophoresis (DGGE) and pyrosequencing. In addition, changes of functional gene expression were analyzed by a pyrosequencing-based random whole-community mRNA sequencing method. Although each individual had a unique microbial composition, the communities of all individuals were affected by enrofloxacin. The proportions of two phyla, namely, Bacteroidetes and Proteobacteria, were significantly reduced with increasing concentrations of enrofloxacin exposure, while the proportion of Firmicutes increased. Principal Coordinate Analysis (PCoA) using the Fast UniFrac indicated that the community structures of intestinal microbiota were shifted by enrofloxacin. Most of the mRNA transcripts and the anti-microbial drug resistance genes increased with increasing concentrations of enrofloxacin. 16S rRNA gene pyrosequencing of control and enrofloxacin treated fecal suspensions provided valuable information of affected bacterial taxa down to the species level, and the community transcriptomic analyses using mRNA revealed the functional gene expression responses of the changed bacterial communities by enrofloxacin. Published by Elsevier Ltd.

  17. RNA CoMPASS: a dual approach for pathogen and host transcriptome analysis of RNA-seq datasets.

    Directory of Open Access Journals (Sweden)

    Guorong Xu

    Full Text Available High-throughput RNA sequencing (RNA-seq has become an instrumental assay for the analysis of multiple aspects of an organism's transcriptome. Further, the analysis of a biological specimen's associated microbiome can also be performed using RNA-seq data and this application is gaining interest in the scientific community. There are many existing bioinformatics tools designed for analysis and visualization of transcriptome data. Despite the availability of an array of next generation sequencing (NGS analysis tools, the analysis of RNA-seq data sets poses a challenge for many biomedical researchers who are not familiar with command-line tools. Here we present RNA CoMPASS, a comprehensive RNA-seq analysis pipeline for the simultaneous analysis of transcriptomes and metatranscriptomes from diverse biological specimens. RNA CoMPASS leverages existing tools and parallel computing technology to facilitate the analysis of even very large datasets. RNA CoMPASS has a web-based graphical user interface with intrinsic queuing to control a distributed computational pipeline. RNA CoMPASS was evaluated by analyzing RNA-seq data sets from 45 B-cell samples. Twenty-two of these samples were derived from lymphoblastoid cell lines (LCLs generated by the infection of naïve B-cells with the Epstein Barr virus (EBV, while another 23 samples were derived from Burkitt's lymphomas (BL, some of which arose in part through infection with EBV. Appropriately, RNA CoMPASS identified EBV in all LCLs and in a fraction of the BLs. Cluster analysis of the human transcriptome component of the RNA CoMPASS output clearly separated the BLs (which have a germinal center-like phenotype from the LCLs (which have a blast-like phenotype with evidence of activated MYC signaling and lower interferon and NF-kB signaling in the BLs. Together, this analysis illustrates the utility of RNA CoMPASS in the simultaneous analysis of transcriptome and metatranscriptome data. RNA CoMPASS is freely

  18. Microglia Transcriptome Changes in a Model of Depressive Behavior after Immune Challenge.

    Directory of Open Access Journals (Sweden)

    Dianelys Gonzalez-Pena

    Full Text Available Depression symptoms following immune response to a challenge have been reported after the recovery from sickness. A RNA-Seq study of the dysregulation of the microglia transcriptome in a model of inflammation-associated depressive behavior was undertaken. The transcriptome of microglia from mice at day 7 after Bacille Calmette Guérin (BCG challenge was compared to that from unchallenged Control mice and to the transcriptome from peripheral macrophages from the same mice. Among the 562 and 3,851 genes differentially expressed between BCG-challenged and Control mice in microglia and macrophages respectively, 353 genes overlapped between these cells types. Among the most differentially expressed genes in the microglia, serum amyloid A3 (Saa3 and cell adhesion molecule 3 (Cadm3 were over-expressed and coiled-coil domain containing 162 (Ccdc162 and titin-cap (Tcap were under-expressed in BCG-challenged relative to Control. Many of the differentially expressed genes between BCG-challenged and Control mice were associated with neurological disorders encompassing depression symptoms. Across cell types, S100 calcium binding protein A9 (S100A9, interleukin 1 beta (Il1b and kynurenine 3-monooxygenase (Kmo were differentially expressed between challenged and control mice. Immune response, chemotaxis, and chemokine activity were among the functional categories enriched by the differentially expressed genes. Functional categories enriched among the 9,117 genes differentially expressed between cell types included leukocyte regulation and activation, chemokine and cytokine activities, MAP kinase activity, and apoptosis. More than 200 genes exhibited alternative splicing events between cell types including WNK lysine deficient protein kinase 1 (Wnk1 and microtubule-actin crosslinking factor 1(Macf1. Network visualization revealed the capability of microglia to exhibit transcriptome dysregulation in response to immune challenge still after resolution of sickness

  19. Field transcriptome revealed critical developmental and physiological transitions involved in the expression of growth potential in japonica rice

    Directory of Open Access Journals (Sweden)

    Kamatsuki Kaori

    2011-01-01

    Full Text Available Abstract Background Plant growth depends on synergistic interactions between internal and external signals, and yield potential of crops is a manifestation of how these complex factors interact, particularly at critical stages of development. As an initial step towards developing a systems-level understanding of the biological processes underlying the expression of overall agronomic potential in cereal crops, a high-resolution transcriptome analysis of rice was conducted throughout life cycle of rice grown under natural field conditions. Results A wide range of gene expression profiles based on 48 organs and tissues at various developmental stages identified 731 organ/tissue specific genes as well as 215 growth stage-specific expressed genes universally in leaf blade, leaf sheath, and root. Continuous transcriptome profiling of leaf from transplanting until harvesting further elucidated the growth-stage specificity of gene expression and uncovered two major drastic changes in the leaf transcriptional program. The first major change occurred before the panicle differentiation, accompanied by the expression of RFT1, a putative florigen gene in long day conditions, and the downregulation of the precursors of two microRNAs. This transcriptome change was also associated with physiological alterations including phosphate-homeostasis state as evident from the behavior of several key regulators such as miR399. The second major transcriptome change occurred just after flowering, and based on analysis of sterile mutant lines, we further revealed that the formation of strong sink, i.e., a developing grain, is not the major cause but is rather a promoter of this change. Conclusions Our study provides not only the genetic basis for functional genomics in rice but also new insight into understanding the critical physiological processes involved in flowering and seed development, that could lead to novel strategies for optimizing crop productivity.

  20. Transcriptomics and comparative analysis of three antarctic notothenioid fishes.

    Directory of Open Access Journals (Sweden)

    Seung Chul Shin

    Full Text Available For the past 10 to 13 million years, Antarctic notothenioid fish have undergone extraordinary periods of evolution and have adapted to a cold and highly oxygenated Antarctic marine environment. While these species are considered an attractive model with which to study physiology and evolutionary adaptation, they are poorly characterized at the molecular level, and sequence information is lacking. The transcriptomes of the Antarctic fishes Notothenia coriiceps, Chaenocephalus aceratus, and Pleuragramma antarcticum were obtained by 454 FLX Titanium sequencing of a normalized cDNA library. More than 1,900,000 reads were assembled in a total of 71,539 contigs. Overall, 40% of the contigs were annotated based on similarity to known protein or nucleotide sequences, and more than 50% of the predicted transcripts were validated as full-length or putative full-length cDNAs. These three Antarctic fishes shared 663 genes expressed in the brain and 1,557 genes expressed in the liver. In addition, these cold-adapted fish expressed more Ub-conjugated proteins compared to temperate fish; Ub-conjugated proteins are involved in maintaining proteins in their native state in the cold and thermally stable Antarctic environments. Our transcriptome analysis of Antarctic notothenioid fish provides an archive for future studies in molecular mechanisms of fundamental genetic questions, and can be used in evolution studies comparing other fish.