WorldWideScience

Sample records for h-dbas human-transcriptome database

  1. Oracle internals tips, tricks, and techniques for DBAs

    CERN Document Server

    Burleson, Donald K

    2001-01-01

    If you are a typical Oracle professional, you don't have the luxury of time to keep up with new technology and read all the new manuals to understand each new feature of the latest release from Oracle. You need a comprehensive source of information and in-depth tips and techniques for using the new technology. You need Oracle Internals: Tips, Tricks, and Techniques for DBAs. Oracle has evolved from a simple relational database into one of the most complex e-commerce platforms ever devised. It's not enough for you to understand just the Oracle database. You must also understand the components of

  2. CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics

    OpenAIRE

    Verma, Mohit; Kumar, Vinay; Patel, Ravi K.; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Chickpea is an important grain legume used as a rich source of protein in human diet. The narrow genetic diversity and limited availability of genomic resources are the major constraints in implementing breeding strategies and biotechnological interventions for genetic enhancement of chickpea. We developed an integrated Chickpea Transcriptome Database (CTDB), which provides the comprehensive web interface for visualization and easy retrieval of transcriptome data in chickpea. The database fea...

  3. TRAM (Transcriptome Mapper): database-driven creation and analysis of transcriptome maps from multiple sources

    Directory of Open Access Journals (Sweden)

    Danieli Gian

    2011-02-01

    Background: Several tools have been developed to perform global gene expression profile data analysis, to search for specific chromosomal regions whose features meet defined criteria, and to study neighbouring gene expression. However, most of these tools are tailored for a specific use in a particular context (e.g. they are species-specific, or limited to a particular data format) and they typically accept only gene lists as input. Results: TRAM (Transcriptome Mapper) is a new general tool that allows the simple generation and analysis of quantitative transcriptome maps, starting from any source listing gene expression values for a given gene set (e.g. expression microarrays), implemented as a relational database. It includes a parser able to assign univocal and updated gene symbols to gene identifiers from different data sources. Moreover, TRAM is able to perform intra-sample and inter-sample data normalization, including an original variant of quantile normalization (scaled quantile), useful to normalize data from platforms with highly different numbers of investigated genes. When in 'Map' mode, the software generates a quantitative representation of the transcriptome of a sample (or of a pool of samples) and identifies whether segments of defined lengths are over/under-expressed compared to the desired threshold. When in 'Cluster' mode, the software searches for a set of over/under-expressed consecutive genes. Statistical significance for all results is calculated with respect to genes localized on the same chromosome or to all genome genes. Transcriptome maps showing differential expression between two sample groups, relative to two different biological conditions, may be easily generated. We present the results of a biological model test, based on a meta-analysis comparison between a sample pool of human CD34+ hematopoietic progenitor cells and a sample pool of megakaryocytic cells. Biologically relevant chromosomal segments and gene
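
    The inter-sample normalization mentioned in this abstract can be illustrated with a minimal sketch of standard quantile normalization. This is not TRAM's "scaled quantile" variant, whose details are not given in the abstract; the function name, the dictionary input layout, and the requirement that all samples cover the same genes are assumptions made only for this illustration.

```python
from statistics import mean


def quantile_normalize(samples):
    """Standard quantile normalization (NOT TRAM's scaled variant).

    samples: dict mapping sample name -> list of expression values,
    one value per gene, same gene order and length in every sample
    (an assumption of this sketch).
    """
    names = list(samples)
    n_genes = len(samples[names[0]])
    if any(len(samples[s]) != n_genes for s in names):
        raise ValueError("all samples must cover the same genes")

    # Sort each sample, then average across samples rank by rank.
    sorted_cols = [sorted(samples[s]) for s in names]
    rank_means = [mean(col[r] for col in sorted_cols) for r in range(n_genes)]

    normalized = {}
    for s in names:
        # Gene indices ordered by value; ties are broken by position,
        # which is a simplification of the usual tie handling.
        order = sorted(range(n_genes), key=lambda i: samples[s][i])
        out = [0.0] * n_genes
        for rank, gene_idx in enumerate(order):
            out[gene_idx] = rank_means[rank]
        normalized[s] = out
    return normalized


if __name__ == "__main__":
    toy = {"sample_a": [5, 2, 3], "sample_b": [4, 1, 6]}
    print(quantile_normalize(toy))
```

    After normalization every sample shares the same set of values, so differences between samples reflect only the rank of each gene. A variant for platforms with very different gene counts, such as the one the abstract describes, would need an extra rescaling step that is not shown here.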

  4. Radiological consequence evaluation of DBAs with alternative source term method for a Chinese PWR

    International Nuclear Information System (INIS)

    Li, J.X.; Cao, X.W.; Tong, L.L.; Huang, G.F.

    2012-01-01

    Highlights: ► Radiological consequence evaluation of DBAs with the alternative source term method for a Chinese 900 MWe PWR has been investigated. ► Six typical DBA sequences are analyzed. ► The doses at the control room, EAB and outer boundary of the LPZ are acceptable. ► The differences between the AST method and the TID-14844 method are investigated. - Abstract: Since a large amount of fission products may be released into the environment during accident progression in nuclear power plants (NPPs), posing a potential hazard to the public, the radiological consequences should be evaluated in order to alleviate that hazard. In most Chinese NPPs, the TID-14844 method, which uses whole-body and thyroid doses as dose criteria, is currently adopted to evaluate the radiological consequences of design-basis accidents (DBAs). Because the alternative radiological source term (AST) method instead uses the total effective dose equivalent as its dose criterion, it is necessary to evaluate the radiological consequences of DBAs with the AST method and to discuss the differences between the two methods. Using an integral safety analysis code, an analytical model of the 900 MWe pressurized water reactor (PWR) is built, and the radiological consequences of DBAs at the control room (CR), exclusion area boundary (EAB) and low population zone (LPZ) are analyzed following the guidance of RG 1.183. The analysis covers LOCA and non-LOCA DBAs, such as fuel handling accident (FHA), rod ejection accident (REA), main steam line break (MSLB), steam generator tube rupture (SGTR) and locked rotor accident (LRA). The results show that the doses in the CR, EAB and LPZ are acceptable compared with the dose criteria in RG 1.183, and the differences between the AST method and the TID-14844 method are also discussed.

  5. CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics.

    Directory of Open Access Journals (Sweden)

    Mohit Verma

    Chickpea is an important grain legume used as a rich source of protein in human diet. The narrow genetic diversity and limited availability of genomic resources are the major constraints in implementing breeding strategies and biotechnological interventions for genetic enhancement of chickpea. We developed an integrated Chickpea Transcriptome Database (CTDB), which provides a comprehensive web interface for visualization and easy retrieval of transcriptome data in chickpea. The database features many tools for similarity search, functional annotation (putative function, PFAM domain and gene ontology) search and comparative gene expression analysis. The current release of CTDB (v2.0) hosts transcriptome datasets with high quality functional annotation from cultivated (desi and kabuli types) and wild chickpea. A catalog of transcription factor families and their expression profiles in chickpea is available in the database. The gene expression data have been integrated to study the expression profiles of chickpea transcripts in major tissues/organs and various stages of flower development. Utilities such as similarity search, ortholog identification and comparative gene expression have also been implemented in the database to facilitate comparative genomic studies among different legumes and Arabidopsis. Furthermore, the CTDB represents a resource for the discovery of functional molecular markers (microsatellites and single nucleotide polymorphisms) between different chickpea types. We anticipate that the integrated information content of this database will accelerate functional and applied genomic research for improvement of chickpea. The CTDB web service is freely available at http://nipgr.res.in/ctdb.html.

  6. DBGC: A Database of Human Gastric Cancer

    Science.gov (United States)

    Wang, Chao; Zhang, Jun; Cai, Mingdeng; Zhu, Zhenggang; Gu, Wenjie; Yu, Yingyan; Zhang, Xiaoyan

    2015-01-01

    The Database of Human Gastric Cancer (DBGC) is a comprehensive database that integrates various human gastric cancer-related data resources. Human gastric cancer-related transcriptomics projects, proteomics projects, mutations, biomarkers and drug-sensitive genes from different sources were collected and unified in this database. Moreover, epidemiological statistics of gastric cancer patients in China and clinicopathological information annotated with gastric cancer cases were also integrated into the DBGC. We believe that this database will greatly facilitate research regarding human gastric cancer in many fields. DBGC is freely available at http://bminfor.tongji.edu.cn/dbgc/index.do PMID:26566288

  7. Database Administrator

    Science.gov (United States)

    Moore, Pam

    2010-01-01

    The Internet and electronic commerce (e-commerce) generate lots of data. Data must be stored, organized, and managed. Database administrators, or DBAs, work with database software to find ways to do this. They identify user needs, set up computer databases, and test systems. They ensure that systems perform as they should and add people to the…

  8. Validation of the German version of the short form of the dysfunctional beliefs and attitudes about sleep scale (DBAS-16).

    Science.gov (United States)

    Lang, Christin; Brand, Serge; Holsboer-Trachsler, Edith; Pühse, Uwe; Colledge, Flora; Gerber, Markus

    2017-06-01

    Research shows that dysfunctional sleep-related cognitions play an important role in the development, maintenance and exacerbation of insomnia. This study examines the factorial validity, psychometric properties and both concurrent and predictive validity of the German version of the 16-item DBAS (dysfunctional beliefs and attitudes about sleep) scale. Data were collected from 864 vocational students from the German-speaking part of Switzerland (43% females, M age = 17.9 years). Data collection took place twice within a 10-month interval. The students completed a German translation of the DBAS-16, the Insomnia Severity Index (ISI), the Pittsburgh Sleep Quality Index (PSQI), and provided information about their psychological functioning. Descriptive statistics, factorial validity, internal consistency, gender differences, concurrent, and predictive validity were examined. Confirmatory factor analysis supported the 4-factor structure of the DBAS-16. All factors (consequences, worry/helplessness, expectations, medication) were positively correlated and had acceptable psychometric properties. Females reported higher scores across all DBAS measures. Weak-to-moderate correlations were found between dysfunctional sleep-related beliefs, insomnia and poor sleep quality. Dysfunctional sleep-related beliefs were also associated with decreased psychological functioning, and consistently predicted insomnia and poor psychological functioning at follow-up, even after controlling for socio-demographic background and baseline levels. The present study provides support for the validity and psychometric properties of the German version of the DBAS-16. Most importantly, it corroborates the relevance of cognitive-emotional factors in the onset and maintenance of insomnia and psychological symptoms among young people.

  9. PostgreSQL database performance optimization

    OpenAIRE

    Wang, Qiang

    2011-01-01

    The thesis was requested by Marlevo software Oy as a general description of the PostgreSQL database and its performance optimization techniques. Its purpose was to help new PostgreSQL users to quickly understand the system and to assist DBAs in improving database performance. The thesis was divided into two parts. The first part described PostgreSQL database optimization techniques in theory. In addition, popular tools were also introduced. This part was based on PostgreSQL documentation, r...

  10. Oracle Database 12c backup and recovery survival guide

    CERN Document Server

    Alvarez, Francisco Munoz

    2013-01-01

    The book follows a tutorial-based approach, covering all the best practices for backup and recovery. The book starts by introducing readers to the world of backup and recovery, then moves on to teach them the new features offered by Oracle 12c. The book is full of useful tips and best practices that are essential for any DBA to perform backup and recovery operations in an organization. This book is designed for Oracle DBAs and system administrators. The reader will have a basic working experience of administering Oracle databases.

  11. KrillDB: A de novo transcriptome database for the Antarctic krill (Euphausia superba).

    Directory of Open Access Journals (Sweden)

    Gabriele Sales

    Antarctic krill (Euphausia superba) is a key species in the Southern Ocean with an estimated biomass between 100 and 500 million tonnes. Changes in krill population viability would have a catastrophic effect on the Antarctic ecosystem. One looming threat due to elevated levels of anthropogenic atmospheric carbon dioxide (CO2) is ocean acidification (lowering of sea water pH by CO2 dissolving into the oceans). The genetics of Antarctic krill has long been of scientific interest, both for the analysis of population structure and for the analysis of functional genetics. However, the genetic resources available for the species are relatively modest. We have developed the most advanced genetic database on Euphausia superba, KrillDB, which includes comprehensive data sets of former and present transcriptome projects. In particular, we have built a de novo transcriptome assembly using more than 360 million Illumina sequence reads generated from larval krill, including individuals subjected to different CO2 levels. The database gives access to: 1) the full list of assembled genes and transcripts; 2) their level of similarity to transcripts and proteins from other species; 3) the predicted protein domains contained within each transcript; 4) their predicted GO terms; 5) the level of expression of each transcript in the different larval stages and CO2 treatments. All references to external entities (sequences, domains, GO terms) are equipped with a link to the appropriate source database. Moreover, the software implements a full-text search engine that makes it possible to submit free-form queries. KrillDB represents the first large-scale attempt at classifying and annotating the full krill transcriptome. For this reason, we believe it will constitute a cornerstone of future approaches devoted to physiological and molecular study of this key species in the Southern Ocean food web.

  12. Analysis of a human brain transcriptome map

    Directory of Open Access Journals (Sweden)

    Greene Jonathan R

    2002-04-01

    Background: Genome-wide transcriptome maps can provide tools to identify candidate genes that are over-expressed or silenced in certain disease tissue and increase our understanding of the structure and organization of the genome. Expressed Sequence Tags (ESTs) from the public dbEST and proprietary Incyte LifeSeq databases were used to derive a transcript map in conjunction with the working draft assembly of the human genome sequence. Results: Examination of ESTs derived from brain tissues (excluding brain tumor tissues) suggests that these genes are distributed on chromosomes in a non-random fashion. Some regions on the genome are dense with brain-enriched genes while some regions lack brain-enriched genes, suggesting a significant correlation between distribution of genes along the chromosome and tissue type. ESTs from brain tumor tissues have also been mapped to the human genome working draft. We reveal that some regions enriched in brain genes show a significant decrease in gene expression in brain tumors and, conversely, that some regions lacking in brain genes show an increased level of gene expression in brain tumors. Conclusions: This report demonstrates a novel approach for tissue-specific transcriptome mapping using EST-based quantitative assessment.

  13. In silico approach towards H5N1 virus protein and transcriptomics ...

    African Journals Online (AJOL)

    H5N1 influenza A virus is a serious threat to human population. With a considerable mortality rate, strategies for coping with the infection are being developed. Our research group and some others investigated the potential therapeutic and preventive measures for tackling H5N1 infections. Protein based and transcriptomics ...

  14. The Human Transcriptome: An Unfinished Story

    Directory of Open Access Journals (Sweden)

    Mihaela Pertea

    2012-06-01

    Despite recent technological advances, the study of the human transcriptome is still in its early stages. Here we provide an overview of the complex human transcriptomic landscape, present the bioinformatics challenges posed by the vast quantities of transcriptomic data, and discuss some of the studies that have tried to determine how much of the human genome is transcribed. Recent evidence has suggested that more than 90% of the human genome is transcribed into RNA. However, this view has been strongly contested by groups of scientists who argued that many of the observed transcripts are simply the result of transcriptional noise. In this review, we conclude that the full extent of transcription remains an open question that will not be fully addressed until we decipher the complete range and biological diversity of the transcribed genomic sequences.

  15. Database reliability engineering designing and operating resilient database systems

    CERN Document Server

    Campbell, Laine

    2018-01-01

    The infrastructure-as-code revolution in IT is also affecting database administration. With this practical book, developers, system administrators, and junior to mid-level DBAs will learn how the modern practice of site reliability engineering applies to the craft of database architecture and operations. Authors Laine Campbell and Charity Majors provide a framework for professionals looking to join the ranks of today’s database reliability engineers (DBRE). You’ll begin by exploring core operational concepts that DBREs need to master. Then you’ll examine a wide range of database persistence options, including how to implement key technologies to provide resilient, scalable, and performant data storage and retrieval. With a firm foundation in database reliability engineering, you’ll be ready to dive into the architecture and operations of any modern database. This book covers: service-level requirements and risk management; building and evolving an architecture for operational visibility; ...

  16. A transcriptome anatomy of human colorectal cancers

    International Nuclear Information System (INIS)

    Lü, Bingjian; Xu, Jing; Lai, Maode; Zhang, Hao; Chen, Jian

    2006-01-01

    Accumulating databases in human genome research have enabled integrated genome-wide study of complicated diseases such as cancers. A practical approach is to mine a global transcriptome profile of a disease from public databases. New concepts of these diseases might emerge by landscaping this profile. In this study, we clustered human colorectal normal mucosa (N), inflammatory bowel disease (IBD), adenoma (A) and cancer (T) related expressed sequence tags (ESTs) into UniGenes via an in-house GetUni software package and analyzed the transcriptome overview of these libraries by GOTree Machine (GOTM). Additionally, we downloaded UniGene-based cDNA libraries of colon and analyzed them by Xprofiler to cross-validate the efficiency of GetUni. Semi-quantitative RT-PCR was used to validate the expression of β-catenin and 7 novel genes in colorectal cancers. The efficiency of GetUni was successfully validated by Xprofiler and RT-PCR. Genes in libraries N, IBD and A were all found in library T. A total of 14,879 genes were identified, with 2,355 of them having at least 2 transcripts. Differences in gene enrichment among these libraries were statistically significant in 50 signal transduction pathways and Pfam protein domains by GOTM analysis (P < 0.01, hypergeometric test). Genes in two metabolic pathways, ribosome and glycolysis, were more enriched in the expression profiles of A and IBD than in N and T. Seven transmembrane receptor superfamily genes were typically abundant in cancers. Colorectal cancers are genetically heterogeneous. Transcription variants are common in them. Aberrations of the ribosome and glycolysis pathways might be early indicators of precursor lesions in colon cancers. The electronic gene expression profile could be used to highlight the integral molecular events in colorectal cancers.
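
    The hypergeometric test cited in this abstract can be sketched as an upper-tail probability: given a gene universe, the number of genes annotated to a pathway, and the size of a library, how likely is it to see at least the observed number of pathway genes in that library by chance? The function name and the toy numbers below are assumptions for illustration; they are not values from the study, and GOTM's own implementation may differ.

```python
from math import comb


def hypergeom_enrichment_p(total_genes, annotated_genes, sample_size, hits):
    """Upper-tail hypergeometric probability P(X >= hits).

    total_genes:     size of the gene universe
    annotated_genes: genes in the universe that belong to the pathway
    sample_size:     genes in the library being tested
    hits:            pathway genes observed in the library (assumed >= 0)
    """
    upper = min(annotated_genes, sample_size)
    denom = comb(total_genes, sample_size)
    # Sum the probability of every outcome at least as extreme as 'hits'.
    tail = sum(
        comb(annotated_genes, k) * comb(total_genes - annotated_genes, sample_size - k)
        for k in range(hits, upper + 1)
    )
    return tail / denom


if __name__ == "__main__":
    # Toy numbers only: 10 genes, 4 in the pathway, 3 sampled, 2 hits.
    print(hypergeom_enrichment_p(10, 4, 3, 2))
```

    A pathway would be called enriched when this probability falls below the chosen threshold, which the abstract reports as 0.01. Exact integer arithmetic via `math.comb` keeps the sketch dependency-free; for genome-scale inputs a statistics library would be the more usual choice.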

  17. The transcriptome of Legionella pneumophila-infected human monocyte-derived macrophages.

    Directory of Open Access Journals (Sweden)

    Christopher T D Price

    Legionella pneumophila is an intracellular bacterial pathogen that invades and replicates within alveolar macrophages through injection of ∼300 effector proteins by its Dot/Icm type IV translocation apparatus. The bona fide F-box protein, AnkB, is a nutritional virulence effector that triggers macrophages to generate a surplus of amino acids, which is essential for intravacuolar proliferation. Therefore, the ankB mutant represents a novel genetic tool to determine the transcriptional response of human monocyte-derived macrophages (hMDMs) to actively replicating L. pneumophila. Here, we utilized total human gene microarrays to determine the global transcriptional response of hMDMs to infection by wild type or the ankB mutant of L. pneumophila. The transcriptomes of hMDMs infected with either actively proliferating wild type or non-replicative ankB mutant bacteria were remarkably similar. The transcriptome of infected hMDMs was dominated by up-regulation of inflammatory pathways (IL-10 anti-inflammatory, interferon signaling and amphoterin signaling), anti-apoptosis, and down-regulation of protein synthesis pathways. In addition, L. pneumophila modulated diverse metabolic pathways, particularly those associated with bio-active lipid metabolism, and SLC amino acid transporter expression. Taken together, the hMDM transcriptional response to L. pneumophila is independent of intra-vacuolar replication of the bacteria and primarily involves modulation of the immune response and metabolic as well as nutritional pathways.

  18. The Co-regulation Data Harvester: Automating gene annotation starting from a transcriptome database

    Science.gov (United States)

    Tsypin, Lev M.; Turkewitz, Aaron P.

    Identifying co-regulated genes provides a useful approach for defining pathway-specific machinery in an organism. To be efficient, this approach relies on thorough genome annotation, a process much slower than genome sequencing per se. Tetrahymena thermophila, a unicellular eukaryote, has been a useful model organism and has a fully sequenced but sparsely annotated genome. One important resource for studying this organism has been an online transcriptomic database. We have developed an automated approach to gene annotation in the context of transcriptome data in T. thermophila, called the Co-regulation Data Harvester (CDH). Beginning with a gene of interest, the CDH identifies co-regulated genes by accessing the Tetrahymena transcriptome database. It then identifies their closely related genes (orthologs) in other organisms by using reciprocal BLAST searches. Finally, it collates the annotations of those orthologs' functions, which provides the user with information to help predict the cellular role of the initial query. The CDH, which is freely available, represents a powerful new tool for analyzing cell biological pathways in Tetrahymena. Moreover, to the extent that genes and pathways are conserved between organisms, the inferences obtained via the CDH should be relevant, and can be explored, in many other systems.
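
    The reciprocal BLAST step described in this abstract is commonly implemented as a reciprocal best hit check: gene A in one organism and gene B in another are treated as putative orthologs when each is the other's top-scoring hit. The sketch below assumes the BLAST results have already been parsed into dictionaries of bit scores; the function names, the input layout, and the gene identifiers are hypothetical, and the CDH's actual implementation is not described in the abstract.

```python
def best_hits(scores):
    """Map each query to its top-scoring subject.

    scores: dict mapping (query, subject) -> bit score, e.g. parsed from
    tabular BLAST output (the parsing itself is outside this sketch).
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return {gene_a: gene_b} pairs that are each other's best hit."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return {a: b for a, b in forward.items() if reverse.get(b) == a}


if __name__ == "__main__":
    # Hypothetical identifiers and scores, for illustration only.
    tetrahymena_vs_other = {("t1", "h1"): 200, ("t1", "h2"): 150, ("t2", "h2"): 90}
    other_vs_tetrahymena = {("h1", "t1"): 190, ("h2", "t1"): 140, ("h2", "t2"): 80}
    print(reciprocal_best_hits(tetrahymena_vs_other, other_vs_tetrahymena))
```

    In this toy input only the pair t1/h1 survives, because h2's best hit is t1 rather than t2. Requiring agreement in both directions trades sensitivity for fewer false ortholog calls.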

  19. Transcriptomics resources of human tissues and organs

    DEFF Research Database (Denmark)

    Uhlén, Mathias; Hallström, Björn M.; Lindskog, Cecilia

    2016-01-01

    Quantifying the differential expression of genes in various human organs, tissues, and cell types is vital to understand human physiology and disease. Recently, several large-scale transcriptomics studies have analyzed the expression of protein-coding genes across tissues. These datasets provide a framework for defining the molecular constituents of the human body as well as for generating comprehensive lists of proteins expressed across tissues or in a tissue-restricted manner. Here, we review publicly available human transcriptome resources and discuss body-wide data from independent genome...

  20. Passive Strategy with Integrated Passive Safety System (IPSS) for DBAs in SBO

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Sang Ho; Kim, Jihee; Choi, Jae Young; Jeon, Inseop; Chang, Soon Heung [Korea Advanced Institute of Science and Technology, Daejeon (Korea, Republic of)

    2013-10-15

    In this paper, strategies for coping with DBAs in SBO are proposed, based on a design with IPSS. Current nuclear power plants adopt emergency strategies that use fire trucks as a provision for steam generator cooling. However, this approach has many limitations, such as water inventory, preparedness and accessibility. With a passive strategy based on IPSS, faster actions and more efficient performance can be achieved. The application of IPSS implies the provision of a large water tank that can be used as water supply, heat sink and filtering medium. The proposed strategies are set under conservative conditions without AC power. In order to set a more realistic and acceptable strategy, the proposed passive strategy has to be combined with the current strategies. The combined strategies can avoid repetition and complexity during accidents. Accordingly, defining a set of operation modes that considers action priority and estimates specific conditions is the further work of this research. Removing decay heat is one of the most important issues in nuclear safety. In the Fukushima accident, the initial problem was the occurrence of a tsunami. It led to a station blackout (SBO), in which AC power was lost on site. Finally, SBO combined with human error caused the failure of decay heat removal. The occurrence of SBO and the failure of decay heat removal raise the question of how to solve them. In order to prevent and mitigate SBO, some solutions have been proposed after the Fukushima accident. First, physical protection is enhanced to prevent external risks. For example, the tsunami barrier was raised from 7.5 m to 10 m. The second is to add electrical redundancy to prevent a total loss of electrical power. AAC diesel generators and movable diesel generators are examples of supplying AC power on site in emergency conditions. The bunker concept, which was proposed in Europe, is a representative example. The bunker concept was analyzed to be applied in

  1. Dynamics of the transcriptome response of cultured human embryonic stem cells to ionizing radiation exposure

    International Nuclear Information System (INIS)

    Sokolov, Mykyta V.; Panyutin, Irina V.; Panyutin, Igor G.; Neumann, Ronald D.

    2011-01-01

    One of the key consequences of exposure of human cells to genotoxic agents is the activation of DNA damage responses (DDR). While the mechanisms underpinning DDR in fully differentiated somatic human cells have been studied extensively, molecular signaling events and pathways involved in DDR in pluripotent human embryonic stem cells (hESC) remain largely unexplored. We studied changes in the human genome-wide transcriptome of H9 hESC line following exposures to 1 Gy of gamma-radiation at 2 h and 16 h post-irradiation. Quantitative real-time PCR was performed to verify the expression data for a subset of genes. In parallel, the cell growth, DDR kinetics, and expression of pluripotency markers in irradiated hESC were monitored. The changes in gene expression in hESC after exposure to ionizing radiation (IR) are substantially different from those observed in somatic human cell lines. Gene expression patterns at 2 h post-IR showed almost an exclusively p53-dependent, predominantly pro-apoptotic, signature with a total of only 30 up-regulated genes. In contrast, the gene expression patterns at 16 h post-IR showed 354 differentially expressed genes, mostly involved in pro-survival pathways, such as increased expression of metallothioneins, ubiquitin cycle, and general metabolism signaling. Cell growth data paralleled trends in gene expression changes. DDR in hESC followed the kinetics reported for human somatic differentiated cells. The expression of pluripotency markers characteristic of undifferentiated hESC was not affected by exposure to IR during the time course of our analysis. Our data on dynamics of transcriptome response of irradiated hESCs may provide a valuable tool to screen for markers of IR exposure of human cells in their most naive state; thus unmasking the key elements of DDR; at the same time, avoiding the complexity of interpreting distinct cell type-dependent genotoxic stress responses of terminally differentiated cells.

  2. Dynamics of the transcriptome response of cultured human embryonic stem cells to ionizing radiation exposure

    Energy Technology Data Exchange (ETDEWEB)

    Sokolov, Mykyta V. (sokolovm@mail.nih.gov); Panyutin, Irina V. (ipanyutinv@mail.nih.gov); Panyutin, Igor G. (igorp@helix.nih.gov); Neumann, Ronald D. (rneumann@mail.nih.gov) [Nuclear Medicine Division, Department of Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892 (United States)]

    2011-05-10

    One of the key consequences of exposure of human cells to genotoxic agents is the activation of DNA damage responses (DDR). While the mechanisms underpinning DDR in fully differentiated somatic human cells have been studied extensively, molecular signaling events and pathways involved in DDR in pluripotent human embryonic stem cells (hESC) remain largely unexplored. We studied changes in the human genome-wide transcriptome of H9 hESC line following exposures to 1 Gy of gamma-radiation at 2 h and 16 h post-irradiation. Quantitative real-time PCR was performed to verify the expression data for a subset of genes. In parallel, the cell growth, DDR kinetics, and expression of pluripotency markers in irradiated hESC were monitored. The changes in gene expression in hESC after exposure to ionizing radiation (IR) are substantially different from those observed in somatic human cell lines. Gene expression patterns at 2 h post-IR showed almost an exclusively p53-dependent, predominantly pro-apoptotic, signature with a total of only 30 up-regulated genes. In contrast, the gene expression patterns at 16 h post-IR showed 354 differentially expressed genes, mostly involved in pro-survival pathways, such as increased expression of metallothioneins, ubiquitin cycle, and general metabolism signaling. Cell growth data paralleled trends in gene expression changes. DDR in hESC followed the kinetics reported for human somatic differentiated cells. The expression of pluripotency markers characteristic of undifferentiated hESC was not affected by exposure to IR during the time course of our analysis. Our data on dynamics of transcriptome response of irradiated hESCs may provide a valuable tool to screen for markers of IR exposure of human cells in their most naive state; thus unmasking the key elements of DDR; at the same time, avoiding the complexity of interpreting distinct cell type-dependent genotoxic stress responses of terminally differentiated cells.

  3. GeNNet: an integrated platform for unifying scientific workflows and graph databases for transcriptome data analysis

    Directory of Open Access Journals (Sweden)

    Raquel L. Costa

    2017-07-01

    analyzed. The results are integrated into GeNNet-DB, a database about genes, clusters, experiments and their properties and relationships. The resulting graph database is explored with queries that demonstrate the expressiveness of this data model for reasoning about gene interaction networks. GeNNet is the first platform to integrate the analytical process of transcriptome data with graph databases. It provides a comprehensive set of tools that would otherwise be challenging for non-expert users to install and use. Developers can add new functionality to components of GeNNet. The derived data allows for testing previous hypotheses about an experiment and exploring new ones through the interactive graph database environment. It enables the analysis of different data on humans, rhesus, mice and rat coming from Affymetrix platforms. GeNNet is available as an open source platform at https://github.com/raquele/GeNNet and can be retrieved as a software container with the command docker pull quelopes/gennet.

  4. Early Transcriptomic Response to LDL and oxLDL in Human Vascular Smooth Muscle Cells.

    Directory of Open Access Journals (Sweden)

    Salvador Damián-Zamacona

    Although it is now well known that the human transcriptome can vary substantially with external or environmental conditions, this concept has not been thoroughly applied to the study of oxidative stress and its direct relationship with gene expression profiles during atherogenesis. The ability to analyze genome-wide gene expression through transcriptomics has shown that the genome responds dynamically to diverse stimuli. Here, we describe the transcriptome of human vascular smooth muscle cells (hVSMC) stimulated by native and oxidized low-density lipoprotein (nLDL and oxLDL, respectively), with the aim of assessing the early molecular changes that induce a response in this cell type resulting in a transcriptomic transformation. This expression has been demonstrated in atherosclerotic plaques in vivo and in vitro, particularly in the light of the oxidative modification hypothesis of atherosclerosis. Total RNA was isolated with TRIzol reagent (Life Technologies) and its quality estimated using an Agilent 2100 bioanalyzer. The transcriptome of hVSMC under different experimental conditions (1, 5 and 24 hours for nLDL and oxLDL) was obtained using the GeneChip Human Gene 1.0 ST (Affymetrix), designed to measure gene expression of 28,869 well-annotated genes. A fixed fold-change cut-off corresponding to ± 2 was used to identify genes exhibiting the most significant variation and statistical significance (P < 0.05), and 8 genes were validated by qPCR using TaqMan probes. Ten molecular processes were significantly affected in hVSMC: apoptosis and cell cycle, extracellular matrix remodeling, DNA repair, cholesterol efflux, cGMP biosynthesis, endocytic mechanisms, calcium homeostasis, redox balance, membrane trafficking and, finally, the immune response to inflammation. The evidence we present supporting the hypothesis for the involvement of oxidative modification of several processes and metabolic pathways in atherosclerosis is
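    The selection rule described in this abstract (a fold-change cut-off of ± 2 combined with P < 0.05) can be illustrated with a minimal Python sketch. The function name, the record layout and the expression values below are hypothetical and are not taken from the study; the abstract does not say which software applied the filter.

```python
import math

def select_genes(records, fold_cutoff=2.0, p_cutoff=0.05):
    """Return names of genes whose fold change is at least `fold_cutoff`
    in either direction and whose p-value is below `p_cutoff`.

    Each record is (name, treated_signal, control_signal, p_value).
    """
    threshold = math.log2(fold_cutoff)
    selected = []
    for name, treated, control, p_value in records:
        # Working on the log2 scale makes up- and down-regulation symmetric.
        log2_fc = math.log2(treated / control)
        if abs(log2_fc) >= threshold and p_value < p_cutoff:
            selected.append(name)
    return selected

# Hypothetical expression values (arbitrary units), not data from the study.
example = [
    ("geneA", 400.0, 100.0, 0.01),   # 4-fold up, significant
    ("geneB", 50.0, 200.0, 0.03),    # 4-fold down, significant
    ("geneC", 150.0, 100.0, 0.001),  # 1.5-fold, below the fold-change cut-off
    ("geneD", 300.0, 100.0, 0.20),   # 3-fold, but not significant
]
print(select_genes(example))
```

    Only geneA and geneB pass both criteria in this toy input.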

  5. Transcriptome Profiling in Human Diseases: New Advances and Perspectives.

    Science.gov (United States)

    Casamassimi, Amelia; Federico, Antonio; Rienzo, Monica; Esposito, Sabrina; Ciccodicola, Alfredo

    2017-07-29

    In the last decades, transcriptome profiling has been one of the most utilized approaches to investigate human diseases at the molecular level. Through expression studies, many molecular biomarkers and therapeutic targets have been found for several human pathologies. This number is continuously increasing thanks to total RNA sequencing. Indeed, this new technology has completely revolutionized transcriptome analysis allowing the quantification of gene expression levels and allele-specific expression in a single experiment, as well as to identify novel genes, splice isoforms, fusion transcripts, and to investigate the world of non-coding RNA at an unprecedented level. RNA sequencing has also been employed in important projects, like ENCODE (Encyclopedia of the regulatory elements) and TCGA (The Cancer Genome Atlas), to provide a snapshot of the transcriptome of dozens of cell lines and thousands of primary tumor specimens. Moreover, these studies have also paved the way to the development of data integration approaches in order to facilitate management and analysis of data and to identify novel disease markers and molecular targets to use in the clinics. In this scenario, several ongoing clinical trials utilize transcriptome profiling through RNA sequencing strategies as an important instrument in the diagnosis of numerous human pathologies.

  6. Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes Using EyeSAGE

    Science.gov (United States)

    Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.

    2009-01-01

    Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438

  7. Transcriptome Profiling in Human Diseases: New Advances and Perspectives

    Directory of Open Access Journals (Sweden)

    Amelia Casamassimi

    2017-07-01

    In the last decades, transcriptome profiling has been one of the most utilized approaches to investigate human diseases at the molecular level. Through expression studies, many molecular biomarkers and therapeutic targets have been found for several human pathologies. This number is continuously increasing thanks to total RNA sequencing. Indeed, this new technology has completely revolutionized transcriptome analysis allowing the quantification of gene expression levels and allele-specific expression in a single experiment, as well as to identify novel genes, splice isoforms, fusion transcripts, and to investigate the world of non-coding RNA at an unprecedented level. RNA sequencing has also been employed in important projects, like ENCODE (Encyclopedia of the regulatory elements) and TCGA (The Cancer Genome Atlas), to provide a snapshot of the transcriptome of dozens of cell lines and thousands of primary tumor specimens. Moreover, these studies have also paved the way to the development of data integration approaches in order to facilitate management and analysis of data and to identify novel disease markers and molecular targets to use in the clinics. In this scenario, several ongoing clinical trials utilize transcriptome profiling through RNA sequencing strategies as an important instrument in the diagnosis of numerous human pathologies.

  8. Genome-wide binding and transcriptome analysis of human farnesoid X receptor in primary human hepatocytes.

    Directory of Open Access Journals (Sweden)

    Le Zhan

    Farnesoid X receptor (FXR, NR1H4) is a ligand-activated transcription factor belonging to the nuclear receptor superfamily. FXR is highly expressed in the liver and is essential in regulating bile acid homeostasis. FXR deficiency is implicated in numerous liver diseases, and mice with modulation of FXR have been used as animal models to study liver physiology and pathology. We have reported genome-wide binding of FXR in mice by chromatin immunoprecipitation - deep sequencing (ChIP-seq), with results indicating that FXR may be involved in regulating diverse pathways in liver. However, limited information exists on the functions of human FXR and the suitability of using murine models to study human FXR functions. In the current study, we performed ChIP-seq in primary human hepatocytes (PHHs) treated with a synthetic FXR agonist, GW4064, or DMSO control. In parallel, RNA deep sequencing (RNA-seq) and RNA microarray were performed for GW4064 or control treated PHHs and wild type mouse livers, respectively. ChIP-seq showed similar profiles of genome-wide FXR binding in humans and mice in terms of motif analysis and pathway prediction. However, RNA-seq and microarray showed greater differences in transcriptome profiles between PHHs and mouse livers upon GW4064 treatment. In summary, we have established genome-wide human FXR binding and transcriptome profiles. These results will aid in determining the human FXR functions, as well as judging to what extent the mouse models could be used to study human FXR functions.

  9. Comparison of a teratogenic transcriptome-based predictive test based on human embryonic versus inducible pluripotent stem cells.

    Science.gov (United States)

    Shinde, Vaibhav; Perumal Srinivasan, Sureshkumar; Henry, Margit; Rotshteyn, Tamara; Hescheler, Jürgen; Rahnenführer, Jörg; Grinberg, Marianna; Meisig, Johannes; Blüthgen, Nils; Waldmann, Tanja; Leist, Marcel; Hengstler, Jan Georg; Sachinidis, Agapios

    2016-12-30

    Human embryonic stem cells (hESCs) partially recapitulate early embryonic three germ layer development, allowing testing of potential teratogenic hazards. Because use of hESCs is ethically debated, we investigated the potential for human induced pluripotent stem cells (hiPSCs) to replace hESCs in such tests. Three cell lines, comprising hiPSCs (foreskin and IMR90) and hESCs (H9) were differentiated for 14 days. Their transcriptome profiles were obtained on day 0 and day 14 and analyzed by comprehensive bioinformatics tools. The transcriptomes on day 14 showed that more than 70% of the "developmental genes" (regulated genes with > 2-fold change on day 14 compared to day 0) exhibited variability among cell lines. The developmental genes belonging to all three cell lines captured biological processes and KEGG pathways related to all three germ layer embryonic development. In addition, transcriptome profiles were obtained after 14 days of exposure to teratogenic valproic acid (VPA) during differentiation. Although the differentially regulated genes between treated and untreated samples showed more than 90% variability among cell lines, VPA clearly antagonized the expression of developmental genes in all cell lines: suppressing upregulated developmental genes, while inducing downregulated ones. To quantify VPA-disturbed development based on developmental genes, we estimated the "developmental potency" (D_p) and "developmental index" (D_i). Despite differences in genes deregulated by VPA, uniform D_i values were obtained for all three cell lines. Given that the D_i values for VPA were similar for hESCs and hiPSCs, D_i can be used for robust hazard identification, irrespective of whether hESCs or hiPSCs are used in the test systems.

  10. Human Transcriptome and Chromatin Modifications: An ENCODE Perspective

    Directory of Open Access Journals (Sweden)

    Li Shen

    2013-06-01

    A decade-long project, led by several international research groups, called the Encyclopedia of DNA Elements (ENCODE), recently released an unprecedented amount of data. The ambitious project covers transcriptome, cistrome, epigenome, and interactome data from more than 1,600 sets of experiments in human. To make use of this valuable resource, it is important to understand the information it represents and the techniques that were used to generate these data. In this review, we introduce the data that ENCODE generated, summarize the observations from the data analysis, and revisit a computational approach that ENCODE used to predict gene expression, with a focus on the human transcriptome and its association with chromatin modifications.

  11. Reptilian Transcriptomes v2.0: An Extensive Resource for Sauropsida Genomics and Transcriptomics.

    Science.gov (United States)

    Tzika, Athanasia C; Ullate-Agote, Asier; Grbic, Djordje; Milinkovitch, Michel C

    2015-07-01

    Despite the availability of deep-sequencing techniques, genomic and transcriptomic data remain unevenly distributed across phylogenetic groups. For example, reptiles are poorly represented in sequence databases, hindering functional evolutionary and developmental studies in these lineages substantially more diverse than mammals. In addition, different studies use different assembly and annotation protocols, inhibiting meaningful comparisons. Here, we present the "Reptilian Transcriptomes Database 2.0," which provides extensive annotation of transcriptomes and genomes from species covering the major reptilian lineages. To this end, we sequenced normalized complementary DNA libraries of multiple adult tissues and various embryonic stages of the leopard gecko and the corn snake and gathered published reptilian sequence data sets from representatives of the four extant orders of reptiles: Squamata (snakes and lizards), the tuatara, crocodiles, and turtles. The LANE runner 2.0 software was implemented to annotate all assemblies within a single integrated pipeline. We show that this approach increases the annotation completeness of the assembled transcriptomes/genomes. We then built large concatenated protein alignments of single-copy genes and inferred phylogenetic trees that support the positions of turtles and the tuatara as sister groups of Archosauria and Squamata, respectively. The Reptilian Transcriptomes Database 2.0 resource will be updated to include selected new data sets as they become available, thus making it a reference for differential expression studies, comparative genomics and transcriptomics, linkage mapping, molecular ecology, and phylogenomic analyses involving reptiles. The database is available at www.reptilian-transcriptomes.org and can be enquired using a wwwblast server installed at the University of Geneva. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  12. Functional organization of the transcriptome in human brain

    Science.gov (United States)

    Oldham, Michael C; Konopka, Genevieve; Iwamoto, Kazuya; Langfelder, Peter; Kato, Tadafumi; Horvath, Steve; Geschwind, Daniel H

    2009-01-01

    The enormous complexity of the human brain ultimately derives from a finite set of molecular instructions encoded in the human genome. These instructions can be directly studied by exploring the organization of the brain’s transcriptome through systematic analysis of gene coexpression relationships. We analyzed gene coexpression relationships in microarray data generated from specific human brain regions and identified modules of coexpressed genes that correspond to neurons, oligodendrocytes, astrocytes and microglia. These modules provide an initial description of the transcriptional programs that distinguish the major cell classes of the human brain and indicate that cell type–specific information can be obtained from whole brain tissue without isolating homogeneous populations of cells. Other modules corresponded to additional cell types, organelles, synaptic function, gender differences and the subventricular neurogenic niche. We found that subventricular zone astrocytes, which are thought to function as neural stem cells in adults, have a distinct gene expression pattern relative to protoplasmic astrocytes. Our findings provide a new foundation for neurogenetic inquiries by revealing a robust and previously unrecognized organization to the human brain transcriptome. PMID:18849986

  13. Major differences between human atopic dermatitis and murine models as determined by global transcriptomic profiling

    DEFF Research Database (Denmark)

    Ewald, David Adrian; Noda, Shinji; Oliva, Margeaux

    2017-01-01

    , and a comparison of these models with the human AD transcriptomic fingerprint is lacking. We sought to evaluate the transcriptomic profiles of six common murine models and determine how they relate to human AD skin. Transcriptomic profiling was performed using microarrays and qRT-PCR on biopsies from NC/Nga, flaky...

  14. KONAGAbase: a genomic and transcriptomic database for the diamondback moth, Plutella xylostella.

    Science.gov (United States)

    Jouraku, Akiya; Yamamoto, Kimiko; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Narukawa, Junko; Miyamoto, Kazuhisa; Kurita, Kanako; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Noda, Hiroaki

    2013-07-09

    The diamondback moth (DBM), Plutella xylostella, is one of the most harmful insect pests for crucifer crops worldwide. DBM has rapidly evolved high resistance to most conventional insecticides such as pyrethroids, organophosphates, fipronil, spinosad, Bacillus thuringiensis, and diamides. Therefore, it is important to develop genomic and transcriptomic DBM resources for analysis of genes related to insecticide resistance, both to clarify the mechanism of resistance of DBM and to facilitate the development of insecticides with a novel mode of action for more effective and environmentally less harmful insecticide rotation. To contribute to this goal, we developed KONAGAbase, a genomic and transcriptomic database for DBM (KONAGA is the Japanese word for DBM). KONAGAbase provides (1) transcriptomic sequences of 37,340 ESTs/mRNAs and 147,370 RNA-seq contigs which were clustered and assembled into 84,570 unigenes (30,695 contigs, 50,548 pseudo singletons, and 3,327 singletons); and (2) genomic sequences of 88,530 WGS contigs with 246,244 degenerate contigs and 106,455 singletons from which 6,310 de novo identified repeat sequences and 34,890 predicted gene-coding sequences were extracted. The unigenes and predicted gene-coding sequences were clustered and 32,800 representative sequences were extracted as a comprehensive putative gene set. These sequences were annotated with BLAST descriptions, Gene Ontology (GO) terms, and Pfam descriptions, respectively. KONAGAbase contains rich graphical user interface (GUI)-based web interfaces for easy and efficient searching, browsing, and downloading sequences and annotation data. Five useful search interfaces consisting of BLAST search, keyword search, BLAST result-based search, GO tree-based search, and genome browser are provided. KONAGAbase is publicly available from our website (http://dbm.dna.affrc.go.jp/px/) through standard web browsers. 
KONAGAbase provides DBM comprehensive transcriptomic and draft genomic sequences with

  15. Chromosomal clustering of a human transcriptome reveals regulatory background

    Directory of Open Access Journals (Sweden)

    Purmann Antje

    2005-09-01

    Background There has been much evidence recently for a link between transcriptional regulation and chromosomal gene order, but the relationship between genomic organization, regulation and gene function in higher eukaryotes remains to be precisely defined. Results Here, we present evidence for organization of a large proportion of a human transcriptome into gene clusters throughout the genome, which are partly regulated by the same transcription factors, share biological functions and are characterized by non-housekeeping genes. This analysis was based on the cardiac transcriptome identified by our genome-wide array analysis of 55 human heart samples. We found 37% of these genes to be arranged mainly in adjacent pairs or triplets. A significant number of pairs of adjacent genes are putatively regulated by common transcription factors (p = 0.02). Furthermore, these gene pairs share a significant number of GO functional classification terms. We show that the human cardiac transcriptome is organized into many small clusters across the whole genome, rather than being concentrated in a few larger clusters. Conclusion Our findings suggest that genes expressed in concert are organized in a linear arrangement for coordinated regulation. Determining the relationship between gene arrangement, regulation and nuclear organization as well as gene function will have broad biological implications.
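    The notion of expressed genes arranged "in adjacent pairs or triplets" can be made concrete with a small Python sketch that scans each chromosome's gene order for runs of consecutive expressed genes. This is only an illustration under assumed inputs: the function name, the gene identifiers and the data layout are hypothetical, and the statistical testing used by the authors is not reproduced here.

```python
from itertools import groupby

def adjacent_clusters(gene_order, expressed, min_size=2):
    """Find runs of at least `min_size` consecutive expressed genes.

    gene_order: dict mapping chromosome name -> list of gene IDs sorted
                by chromosomal position.
    expressed:  set of gene IDs detected in the transcriptome.
    """
    clusters = []
    for chrom, genes in gene_order.items():
        # groupby splits the ordered gene list into maximal runs of
        # expressed / non-expressed genes.
        for is_expressed, run in groupby(genes, key=lambda g: g in expressed):
            run = list(run)
            if is_expressed and len(run) >= min_size:
                clusters.append((chrom, run))
    return clusters

# Toy gene order and expression set (hypothetical identifiers).
gene_order = {
    "chr1": ["g1", "g2", "g3", "g4", "g5"],
    "chr2": ["g6", "g7", "g8"],
}
expressed = {"g1", "g2", "g4", "g7", "g8"}
print(adjacent_clusters(gene_order, expressed))
```

    In the toy input, g1/g2 and g7/g8 form adjacent pairs, while g4 is expressed but isolated and is therefore not reported.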

  16. TCW: transcriptome computational workbench.

    Directory of Open Access Journals (Sweden)

    Carol Soderlund

    BACKGROUND: The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. METHODOLOGY: The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. CONCLUSION: It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the

  17. TCW: transcriptome computational workbench.

    Science.gov (United States)

    Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R

    2013-01-01

    The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. 
TCW is freely available from www.agcol.arizona.edu/software/tcw.
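    One of the clustering options named in this abstract is computing a transitive closure. A minimal sketch of that general idea in Python, using a union-find structure over pairwise links, is shown below. It is not TCW's implementation (TCW is written in Java and its internals are not described here); the function name and the transcript identifiers are hypothetical.

```python
def transitive_closure_clusters(pairs):
    """Group items so that any two items connected by a chain of
    pairwise links end up in the same cluster."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two clusters

    clusters = {}
    for item in parent:
        clusters.setdefault(find(item), set()).add(item)
    # Sort members and clusters so the output is deterministic.
    return sorted((sorted(c) for c in clusters.values()), key=lambda c: c[0])

# Hypothetical pairwise similarity hits between transcripts.
hits = [("t1", "t2"), ("t2", "t3"), ("t4", "t5")]
print(transitive_closure_clusters(hits))
```

    Here t1 and t3 share a cluster even though they were never directly linked, which is the defining property of a transitive closure.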

  18. A meta-analysis of human embryonic stem cells transcriptome integrated into a web-based expression atlas.

    Science.gov (United States)

    Assou, Said; Le Carrour, Tanguy; Tondeur, Sylvie; Ström, Susanne; Gabelle, Audrey; Marty, Sophie; Nadal, Laure; Pantesco, Véronique; Réme, Thierry; Hugnot, Jean-Philippe; Gasca, Stéphan; Hovatta, Outi; Hamamah, Samir; Klein, Bernard; De Vos, John

    2007-04-01

    Microarray technology provides a unique opportunity to examine gene expression patterns in human embryonic stem cells (hESCs). We performed a meta-analysis of 38 original studies reporting on the transcriptome of hESCs. We determined that 1,076 genes were found to be overexpressed in hESCs by at least three studies when compared to differentiated cell types, thus composing a "consensus hESC gene list." Only one gene was reported by all studies: the homeodomain transcription factor POU5F1/OCT3/4. The list comprised other genes critical for pluripotency such as the transcription factors NANOG and SOX2, and the growth factors TDGF1/CRIPTO and Galanin. We show that CD24 and SEMA6A, two cell surface protein-coding genes from the top of the consensus hESC gene list, display a strong and specific membrane protein expression on hESCs. Moreover, CD24 labeling permits the purification by flow cytometry of hESCs cocultured on human fibroblasts. The consensus hESC gene list also included the FZD7 WNT receptor, the G protein-coupled receptor GPR19, and the HELLS helicase, which could play an important role in hESCs biology. Conversely, we identified 783 genes downregulated in hESCs and reported in at least three studies. This "consensus differentiation gene list" included the IL6ST/GP130 LIF receptor. We created an online hESC expression atlas, http://amazonia.montp.inserm.fr, to provide an easy access to this public transcriptome dataset. Expression histograms comparing hESCs to a broad collection of fetal and adult tissues can be retrieved with this web tool for more than 15,000 genes.
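    The "consensus" criterion in this abstract (a gene must be reported by at least three studies) amounts to counting, for each gene, the number of studies that list it. A minimal Python sketch follows; the function name is hypothetical and the four toy study lists are invented for illustration, not drawn from the 38 studies in the meta-analysis.

```python
from collections import Counter

def consensus_genes(study_gene_lists, min_studies=3):
    """Return genes reported by at least `min_studies` of the given studies."""
    counts = Counter()
    for genes in study_gene_lists:
        counts.update(set(genes))  # count each gene at most once per study
    return sorted(g for g, n in counts.items() if n >= min_studies)

# Toy study lists (invented for illustration only).
studies = [
    ["POU5F1", "NANOG", "SOX2"],
    ["POU5F1", "NANOG"],
    ["POU5F1", "SOX2", "CD24"],
    ["POU5F1", "NANOG", "CD24"],
]
print(consensus_genes(studies))
```

    Converting each list to a set first prevents a gene that appears twice in one study from being counted twice.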

  19. Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes

    Directory of Open Access Journals (Sweden)

    Wincker Patrick

    2010-06-01

    Background The medicinal leech, Hirudo medicinalis, is an important model system for the study of nervous system structure, function, development, regeneration and repair. It is also a unique species in being presently approved for use in medical procedures, such as clearing of pooled blood following certain surgical procedures. It is a current, and potentially also future, source of medically useful molecular factors, such as anticoagulants and antibacterial peptides, which may have evolved as a result of its parasitizing large mammals, including humans. Despite the broad focus of research on this system, little has been done at the genomic or transcriptomic levels and there is a paucity of openly available sequence data. To begin to address this problem, we constructed whole embryo and adult central nervous system (CNS) EST libraries and created a clustered sequence database of the Hirudo transcriptome that is available to the scientific community. Results A total of ~133,000 EST clones from two directionally-cloned cDNA libraries, one constructed from mRNA derived from whole embryos at several developmental stages and the other from adult CNS cords, were sequenced in one or both directions by three different groups: Genoscope (French National Sequencing Center), the University of Iowa Sequencing Facility and the DOE Joint Genome Institute. These were assembled using the phrap software package into 31,232 unique contigs and singletons, with an average length of 827 nt. The assembled transcripts were then translated in all six frames and compared to proteins in NCBI's non-redundant (NR) and to the Gene Ontology (GO) protein sequence databases, resulting in 15,565 matches to 11,236 proteins in NR and 13,935 matches to 8,073 proteins in GO. Searching the database for transcripts of genes homologous to those thought to be involved in the innate immune responses of vertebrates and other invertebrates yielded a set of nearly one hundred

  20. Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes.

    Science.gov (United States)

    Macagno, Eduardo R; Gaasterland, Terry; Edsall, Lee; Bafna, Vineet; Soares, Marcelo B; Scheetz, Todd; Casavant, Thomas; Da Silva, Corinne; Wincker, Patrick; Tasiemski, Aurélie; Salzet, Michel

    2010-06-25

    The medicinal leech, Hirudo medicinalis, is an important model system for the study of nervous system structure, function, development, regeneration and repair. It is also a unique species in being presently approved for use in medical procedures, such as clearing of pooled blood following certain surgical procedures. It is a current, and potentially also future, source of medically useful molecular factors, such as anticoagulants and antibacterial peptides, which may have evolved as a result of its parasitizing large mammals, including humans. Despite the broad focus of research on this system, little has been done at the genomic or transcriptomic levels and there is a paucity of openly available sequence data. To begin to address this problem, we constructed whole embryo and adult central nervous system (CNS) EST libraries and created a clustered sequence database of the Hirudo transcriptome that is available to the scientific community. A total of approximately 133,000 EST clones from two directionally-cloned cDNA libraries, one constructed from mRNA derived from whole embryos at several developmental stages and the other from adult CNS cords, were sequenced in one or both directions by three different groups: Genoscope (French National Sequencing Center), the University of Iowa Sequencing Facility and the DOE Joint Genome Institute. These were assembled using the phrap software package into 31,232 unique contigs and singletons, with an average length of 827 nt. The assembled transcripts were then translated in all six frames and compared to proteins in NCBI's non-redundant (NR) and to the Gene Ontology (GO) protein sequence databases, resulting in 15,565 matches to 11,236 proteins in NR and 13,935 matches to 8,073 proteins in GO. Searching the database for transcripts of genes homologous to those thought to be involved in the innate immune responses of vertebrates and other invertebrates yielded a set of nearly one hundred evolutionarily conserved sequences
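    The step "translated in all six frames" refers to translating a nucleotide sequence at three offsets on the forward strand and three on the reverse complement. The Python sketch below illustrates this with the standard genetic code. It is an assumption-laden illustration: the abstract does not name the tool the authors used, the function names are hypothetical, and codons containing characters other than A, C, G, T are simply rendered as "X".

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]

def translate(seq):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )

def six_frame_translation(seq):
    """Return a dict mapping frame labels (+1..+3, -1..-3) to protein strings."""
    seq = seq.upper()
    frames = {}
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(strand_seq[offset:])
    return frames

# Toy sequence: start codon, one alanine codon, stop codon.
for label, protein in six_frame_translation("ATGGCCTAA").items():
    print(label, protein)
```

    Each of the six protein strings would then be compared against a protein database, which is the part of the pipeline this sketch does not cover.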

  1. H2DB: a heritability database across multiple species by annotating trait-associated genomic loci.

    Science.gov (United States)

    Kaminuma, Eli; Fujisawa, Takatomo; Tanizawa, Yasuhiro; Sakamoto, Naoko; Kurata, Nori; Shimizu, Tokurou; Nakamura, Yasukazu

    2013-01-01

    H2DB (http://tga.nig.ac.jp/h2db/), an annotation database of genetic heritability estimates for humans and other species, has been developed as a knowledge database to connect trait-associated genomic loci. Heritability estimates have been investigated for individual species, particularly in human twin studies and plant/animal breeding studies. However, there appears to be no comprehensive heritability database for both humans and other species. Here, we introduce an annotation database for genetic heritabilities of various species that was annotated by manually curating online public resources in PUBMED abstracts and journal contents. The proposed heritability database contains attribute information for trait descriptions, experimental conditions, trait-associated genomic loci and broad- and narrow-sense heritability specifications. Annotated trait-associated genomic loci, for which most are single-nucleotide polymorphisms derived from genome-wide association studies, may be valuable resources for experimental scientists. In addition, we assigned phenotype ontologies to the annotated traits for the purposes of discussing heritability distributions based on phenotypic classifications.
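For readers unfamiliar with the two heritability types named in the record above, the usual textbook definitions are given below. They are standard quantitative-genetics conventions rather than anything taken from the H2DB abstract, and they assume the phenotypic variance decomposes additively, with no genotype-environment covariance or interaction.

```latex
% V_P: phenotypic variance, V_G: genetic variance, V_E: environmental variance
% V_A: additive, V_D: dominance, V_I: epistatic (interaction) genetic variance
\begin{align}
  V_P &= V_G + V_E, \qquad V_G = V_A + V_D + V_I \\
  H^2 &= \frac{V_G}{V_P} \quad \text{(broad-sense heritability)} \\
  h^2 &= \frac{V_A}{V_P} \quad \text{(narrow-sense heritability)}
\end{align}
```

Since $V_A \le V_G$, the narrow-sense estimate for a trait cannot exceed the broad-sense estimate under this decomposition.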

  2. Human transporter database: comprehensive knowledge and discovery tools in the human transporter genes.

    Directory of Open Access Journals (Sweden)

    Adam Y Ye

    Full Text Available Transporters are essential in homeostatic exchange of endogenous and exogenous substances at the systematic, organic, cellular, and subcellular levels. Gene mutations of transporters are often related to pharmacogenetic traits. Recent developments in high-throughput technologies in genomics, transcriptomics and proteomics allow in-depth studies of transporter genes in normal cellular processes and diverse disease conditions. The flood of high-throughput data has resulted in an urgent need for an updated knowledgebase with curated, organized, and annotated human transporters in an easily accessible form. Using a pipeline that combines automated keyword queries, sequence similarity search and manual curation of transporters, we collected 1,555 human non-redundant transporter genes to develop the Human Transporter Database (HTD) (http://htd.cbi.pku.edu.cn). Based on the extensive annotations, global properties of the transporter genes were illustrated, such as expression patterns and polymorphisms in relationships with their ligands. We noted that the human transporters were enriched in many fundamental biological processes such as oxidative phosphorylation and cardiac muscle contraction, and significantly associated with Mendelian and complex diseases such as epilepsy and sudden infant death syndrome. Overall, HTD provides a well-organized interface that helps research communities search detailed molecular and genetic information on transporters for the development of personalized medicine.

  3. The human airway epithelial basal cell transcriptome.

    Directory of Open Access Journals (Sweden)

    Neil R Hackett

    2011-05-01

    Full Text Available The human airway epithelium consists of 4 major cell types: ciliated, secretory, columnar and basal cells. During natural turnover and in response to injury, the airway basal cells function as stem/progenitor cells for the other airway cell types. The objective of this study is to better understand human airway epithelial basal cell biology by defining the gene expression signature of this cell population. Bronchial brushing was used to obtain airway epithelium from healthy nonsmokers. Microarrays were used to assess the transcriptome of basal cells purified from the airway epithelium in comparison to the transcriptome of the differentiated airway epithelium. This analysis identified the "human airway basal cell signature" as 1,161 unique genes with >5-fold higher expression level in basal cells compared to differentiated epithelium. The basal cell signature was suppressed when the basal cells differentiated into a ciliated airway epithelium in vitro. The basal cell signature displayed overlap with genes expressed in basal-like cells from other human tissues and with that of murine airway basal cells. Consistent with self-modulation as well as signaling to other airway cell types, the human airway basal cell signature was characterized by genes encoding extracellular matrix components, growth factors and growth factor receptors, including genes related to the EGF and VEGF pathways. Interestingly, while the basal cell signature overlaps that of basal-like cells of other organs, the human airway basal cell signature has features not previously associated with this cell type, including a unique pattern of genes encoding extracellular matrix components, G protein-coupled receptors, neuroactive ligands and receptors, and ion channels. The human airway epithelial basal cell signature identified in the present study provides novel insights into the molecular phenotype and biology of the stem/progenitor cells of the human airway epithelium.
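The ">5-fold higher expression" criterion in the record above amounts to a ratio filter over two expression profiles. The sketch below is only an illustration of that idea: the function name, the dictionary-based input format and the gene labels are hypothetical, and the study's actual normalization and statistical testing are not reproduced.

```python
def signature_genes(basal, differentiated, min_fold=5.0):
    """Return genes whose basal/differentiated expression ratio exceeds min_fold.

    Both arguments map gene identifiers to linear-scale (not log) expression
    values. Genes missing from the reference profile, or with a non-positive
    reference value, are skipped rather than guessed at.
    """
    selected = []
    for gene, basal_level in basal.items():
        reference = differentiated.get(gene)
        if reference is None or reference <= 0:
            continue
        if basal_level / reference > min_fold:
            selected.append(gene)
    return sorted(selected)


if __name__ == "__main__":
    # Made-up values for illustration only.
    basal = {"GENE_A": 600.0, "GENE_B": 50.0, "GENE_C": 10.0, "GENE_D": 5.0}
    differentiated = {"GENE_A": 100.0, "GENE_B": 10.0, "GENE_C": 200.0}
    print(signature_genes(basal, differentiated))
```

The comparison is strict, so a gene with exactly a 5-fold ratio is excluded, matching the ">5-fold" wording.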

  4. Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

    Directory of Open Access Journals (Sweden)

    Marais Gabriel AB

    2011-07-01

    Full Text Available Abstract Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to

  5. Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

    Science.gov (United States)

    2011-01-01

    Background The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a

  6. Oracle Database 11gR2 Performance Tuning Cookbook

    CERN Document Server

    Fiorillo, Ciro

    2012-01-01

    In this book you will find both examples and theoretical concepts covered. Every recipe is based on a script/procedure explained step-by-step, with screenshots, while theoretical concepts are explained in the context of the recipe, to explain why a solution performs better than another. This book is aimed at software developers, software and data architects, and DBAs who are using or are planning to use the Oracle Database, who have some experience and want to solve performance problems faster and in a rigorous way. If you are an architect who wants to design better applications, a DBA who is

  7. Human Performance Event Database

    International Nuclear Information System (INIS)

    Trager, E. A.

    1998-01-01

    The purpose of this paper is to describe several aspects of a Human Performance Event Database (HPED) that is being developed by the Nuclear Regulatory Commission. These include the background, the database structure and basis for the structure, the process for coding and entering event records, the results of preliminary analyses of information in the database, and plans for the future. In 1992, the Office for Analysis and Evaluation of Operational Data (AEOD) within the NRC decided to develop a database for information on human performance during operating events. The database was needed to help classify and categorize the information and to help feed back operating experience information to licensees and others. An NRC interoffice working group prepared a list of human performance information that should be reported for events; the list was based on the Human Performance Investigation Process (HPIP) that had been developed by the NRC as an aid in investigating events. The structure of the HPED was based on that list. The HPED currently includes data on events described in augmented inspection team (AIT) and incident investigation team (IIT) reports from 1990 through 1996, AEOD human performance studies from 1990 through 1993, recent NRR special team inspections, and licensee event reports (LERs) that were prepared for the events. (author)

  8. Analysis of the Transcriptome of the Infective Stage of the Beet Cyst Nematode, H. schachtii.

    Directory of Open Access Journals (Sweden)

    John Fosu-Nyarko

    Full Text Available The beet cyst nematode, Heterodera schachtii, is a major root pest that significantly impacts the yield of sugar beet, brassicas and related species. There has been limited molecular characterisation of this important plant pathogen: to identify target genes for its control the transcriptome of the pre-parasitic J2 stage of H. schachtii was sequenced using Roche GS FLX. Ninety seven percent of reads (i.e., 387,668) with an average PHRED score > 22 were assembled with CAP3 and CLC Genomics Workbench into 37,345 and 47,263 contigs, respectively. The transcripts were annotated by comparing with gene and genomic sequences of other nematodes and annotated proteins on public databases. The annotated transcripts were much more similar to sequences of Heterodera glycines than to those of Globodera pallida and root knot nematodes (Meloidogyne spp.). Analysis of these transcripts showed that a subset of 2,918 transcripts was common to free-living and plant parasitic nematodes suggesting that this subset is involved in general nematode metabolism and development. A set of 148 contigs and 183 singletons encoding putative homologues of effectors previously characterised for plant parasitic nematodes were also identified: these are known to be important for parasitism of host plants during migration through tissues or feeding from cells or are thought to be involved in evasion or modulation of host defences. In addition, the presence of sequences from a nematode virus is suggested. The sequencing and annotation of this transcriptome significantly adds to the genetic data available for H. schachtii, and identifies genes primed to undertake required roles in the critical pre-parasitic and early post-parasitic J2 stages. These data provide new information for identifying potential gene targets for future protection of susceptible crops against H. schachtii.
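The read-quality cutoff quoted in the record above (average PHRED score > 22) can be made concrete with the standard PHRED definition, Q = -10 log10(P), where P is the estimated base-call error probability. The sketch below is an illustration only: the function names are hypothetical, and taking the arithmetic mean of per-base scores is an assumption, since the abstract does not say how the average was computed.

```python
def phred_to_error_prob(q):
    """Convert a PHRED quality score Q to an error probability, P = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def mean_phred(qualities):
    """Arithmetic mean of per-base PHRED scores (an assumed averaging rule)."""
    return sum(qualities) / len(qualities)


def passes_quality(qualities, threshold=22):
    """True if the read is non-empty and its mean PHRED score exceeds threshold."""
    return bool(qualities) and mean_phred(qualities) > threshold


if __name__ == "__main__":
    read = [30, 20, 25]  # made-up per-base scores
    print(passes_quality(read), phred_to_error_prob(mean_phred(read)))
```

Under this definition a score of 20 corresponds to a 1% error probability, so a threshold of 22 admits reads whose average per-base error estimate is somewhat below 1%.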

  9. Mining biological databases for candidate disease genes

    Science.gov (United States)

    Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

    2001-07-01

    The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome in a preliminary 'draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of these data identified approximately 30,000 - 40,000 human 'genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high-speed cluster of Linux workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedl Syndrome (BBS).

  10. Major differences between human atopic dermatitis and murine models, as determined by using global transcriptomic profiling

    DEFF Research Database (Denmark)

    Ewald, David A.; Noda, Shinji; Oliva, Margeaux

    2017-01-01

    , and a comparison of these models with the human AD transcriptomic fingerprint is lacking. Objective We sought to evaluate the transcriptomic profiles of 6 common murine models and determine how they relate to human AD skin. Methods Transcriptomic profiling was performed by using microarrays and quantitative RT......-PCR on biopsy specimens from NC/Nga, flaky tail, Flg-mutated, ovalbumin-challenged, oxazolone-challenged, and IL-23–injected mice. Gene expression data of patients with AD, psoriasis, and contact dermatitis were obtained from previous patient cohorts. Criteria of a fold change of 2 or greater and a false...... discovery rate of 0.05 or less were used for gene arrays. Results IL-23–injected, NC/Nga, and oxazolone-challenged mice show the largest homology with our human meta-analysis–derived AD transcriptome (37%, 18%, 17%, respectively). Similar to human AD, robust TH1, TH2, and also TH17 activation are seen in IL...

  11. De novo transcriptome assembly of shrimp Palaemon serratus

    Directory of Open Access Journals (Sweden)

    Alejandra Perina

    2017-03-01

    Full Text Available The shrimp Palaemon serratus is a coastal decapod crustacean with a high commercial value. It is harvested for human consumption. In this study, we used Illumina sequencing technology (HiSeq 2000) to sequence, assemble and annotate the transcriptome of P. serratus. RNA was isolated from the muscle of adult individuals and from a pool of larvae. A total of 4 cDNA libraries were constructed using the TruSeq RNA Sample Preparation Kit v2. The raw data from this study were deposited in the NCBI SRA database under study accession number SRP090769. The obtained data were subjected to de novo transcriptome assembly using Trinity software, and coding regions were predicted by TransDecoder. We used Blastp and Sma3s to annotate the identified proteins. The transcriptome data could provide some insight into the genes involved in larval development and metamorphosis.

  12. The Human Blood Metabolome-Transcriptome Interface.

    Directory of Open Access Journals (Sweden)

    Jörg Bartel

    2015-06-01

    Full Text Available Biological systems consist of multiple organizational levels all densely interacting with each other to ensure function and flexibility of the system. Simultaneous analysis of cross-sectional multi-omics data from large population studies is a powerful tool to comprehensively characterize the underlying molecular mechanisms on a physiological scale. In this study, we systematically analyzed the relationship between fasting serum metabolomics and whole blood transcriptomics data from 712 individuals of the German KORA F4 cohort. Correlation-based analysis identified 1,109 significant associations between 522 transcripts and 114 metabolites summarized in an integrated network, the 'human blood metabolome-transcriptome interface' (BMTI). Bidirectional causality analysis using Mendelian randomization did not yield any statistically significant causal associations between transcripts and metabolites. A knowledge-based interpretation and integration with a genome-scale human metabolic reconstruction revealed systematic signatures of signaling, transport and metabolic processes, i.e. metabolic reactions mainly belonging to lipid, energy and amino acid metabolism. Moreover, the construction of a network based on functional categories illustrated the cross-talk between the biological layers at a pathway level. Using a transcription factor binding site enrichment analysis, this pathway cross-talk was further confirmed at a regulatory level. Finally, we demonstrated how the constructed networks can be used to gain novel insights into molecular mechanisms associated to intermediate clinical traits. Overall, our results demonstrate the utility of a multi-omics integrative approach to understand the molecular mechanisms underlying both normal physiology and disease.

  13. The Human Blood Metabolome-Transcriptome Interface

    Science.gov (United States)

    Schramm, Katharina; Adamski, Jerzy; Gieger, Christian; Herder, Christian; Carstensen, Maren; Peters, Annette; Rathmann, Wolfgang; Roden, Michael; Strauch, Konstantin; Suhre, Karsten; Kastenmüller, Gabi; Prokisch, Holger; Theis, Fabian J.

    2015-01-01

    Biological systems consist of multiple organizational levels all densely interacting with each other to ensure function and flexibility of the system. Simultaneous analysis of cross-sectional multi-omics data from large population studies is a powerful tool to comprehensively characterize the underlying molecular mechanisms on a physiological scale. In this study, we systematically analyzed the relationship between fasting serum metabolomics and whole blood transcriptomics data from 712 individuals of the German KORA F4 cohort. Correlation-based analysis identified 1,109 significant associations between 522 transcripts and 114 metabolites summarized in an integrated network, the ‘human blood metabolome-transcriptome interface’ (BMTI). Bidirectional causality analysis using Mendelian randomization did not yield any statistically significant causal associations between transcripts and metabolites. A knowledge-based interpretation and integration with a genome-scale human metabolic reconstruction revealed systematic signatures of signaling, transport and metabolic processes, i.e. metabolic reactions mainly belonging to lipid, energy and amino acid metabolism. Moreover, the construction of a network based on functional categories illustrated the cross-talk between the biological layers at a pathway level. Using a transcription factor binding site enrichment analysis, this pathway cross-talk was further confirmed at a regulatory level. Finally, we demonstrated how the constructed networks can be used to gain novel insights into molecular mechanisms associated to intermediate clinical traits. Overall, our results demonstrate the utility of a multi-omics integrative approach to understand the molecular mechanisms underlying both normal physiology and disease. PMID:26086077

  14. Transcriptome variations among human embryonic stem cell lines are associated with their differentiation propensity.

    Directory of Open Access Journals (Sweden)

    Changbin Sun

    Full Text Available Human embryonic stem cells (hESCs) have the potential to form any cell type in the body, making them attractive cell sources for drug screening, regenerative medicine, and the modeling of disease and developmental processes. However, not all hESC lines have equal potency to generate desired cell types in vitro. Significant variations have been observed in the differentiation efficiency of various hESC lines. The precise underlying molecular mechanisms are still unclear. In this work, we compared transcriptome variations of four hESC lines: H7, HUES1, HUES8 and HUES9. We found that hESC lines have different gene expression profiles, and these differentially expressed genes (DEGs) are significantly enriched in developmental processes, such as ectodermal, mesodermal and endodermal development. The enrichment difference between hESC lines was consistent with their lineage bias. Among these DEGs, some pluripotency factors and genes involved in signal transduction showed great variations as well. The pleiotropic functions of these genes in controlling hESC identity and early lineage specification imply that different hESC lines may utilize distinct balance mechanisms to maintain the pluripotent state. When the balance is broken in a certain environment, gene expression variation between them could affect their lineage specification behavior.

  15. Transcriptome adaptation of group B Streptococcus to growth in human amniotic fluid.

    Directory of Open Access Journals (Sweden)

    Izabela Sitkiewicz

    Full Text Available BACKGROUND: Streptococcus agalactiae (group B Streptococcus) is a bacterial pathogen that causes severe intrauterine infections leading to fetal morbidity and mortality. The pathogenesis of GBS infection in this environment is poorly understood, in part because we lack a detailed understanding of the adaptation of this pathogen to growth in amniotic fluid. To address this knowledge deficit, we characterized the transcriptome of GBS grown in human amniotic fluid (AF) and compared it with the transcriptome in rich laboratory medium. METHODS: GBS was grown in Todd Hewitt-yeast extract medium and human AF. Bacteria were collected at mid-logarithmic, late-logarithmic and stationary growth phase. We performed global expression microarray analysis using a custom-made Affymetrix GeneChip. The normalized hybridization values derived from three biological replicates at each growth point were obtained. AF/THY transcript ratios representing greater than a 2-fold change and P-value exceeding 0.05 were considered to be statistically significant. PRINCIPAL FINDINGS: We have discovered that GBS significantly remodels its transcriptome in response to exposure to human amniotic fluid. GBS grew rapidly in human AF and did not exhibit a global stress response. The majority of changes in GBS transcripts in AF compared to THY medium were related to genes mediating metabolism of amino acids, carbohydrates, and nucleotides. The majority of the observed changes in transcripts affect genes involved in basic bacterial metabolism and are connected to AF composition and nutritional requirements of the bacterium. Importantly, the response to growth in human AF included significant changes in transcripts of multiple virulence genes such as adhesins, capsule, hemolysin and IL-8 proteinase, which might have consequences for the outcome of host-pathogen interactions.
CONCLUSIONS/SIGNIFICANCE: Our work provides extensive new information about how the transcriptome of GBS responds

  16. HCSD: the human cancer secretome database

    DEFF Research Database (Denmark)

    Feizi, Amir; Banaei-Esfahani, Amir; Nielsen, Jens

    2015-01-01

    The human cancer secretome database (HCSD) is a comprehensive database for human cancer secretome data. The cancer secretome describes proteins secreted by cancer cells and structuring information about the cancer secretome will enable further analysis of how this is related to tumor biology...... database is limiting the ability to query the increasing community knowledge. We therefore developed the Human Cancer Secretome Database (HCSD) to fill this gap. HCSD contains >80 000 measurements for about 7000 nonredundant human proteins collected from up to 35 high-throughput studies on 17 cancer...

  17. De novo transcriptome assembly of Sorghum bicolor variety Taejin

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2016-06-01

    Full Text Available Sorghum (Sorghum bicolor), also known as great millet, is one of the most popular cultivated grass species in the world. Sorghum is frequently consumed as food for humans and animals as well as used for ethanol production. In this study, we conducted de novo transcriptome assembly for sorghum variety Taejin by next-generation sequencing, obtaining 8.748 GB of raw data. The raw data from this study are available in the NCBI SRA database under accession number SRX1715644. Using the Trinity program, we identified 222,161 transcripts from sorghum variety Taejin. We further predicted coding regions within the assembled transcripts with the TransDecoder program, resulting in a total of 148,531 proteins. We carried out BLASTP against the Swiss-Prot protein sequence database to annotate the functions of the identified proteins. To our knowledge, this is the first transcriptome data for a sorghum variety derived from Korea, and it can be usefully applied to the generation of genetic markers.

  18. Genome-wide RNA-seq analysis of human and mouse platelet transcriptomes

    Science.gov (United States)

    Rowley, Jesse W.; Oler, Andrew J.; Tolley, Neal D.; Hunter, Benjamin N.; Low, Elizabeth N.; Nix, David A.; Yost, Christian C.; Zimmerman, Guy A.

    2011-01-01

    Inbred mice are a useful tool for studying the in vivo functions of platelets. Nonetheless, the mRNA signature of mouse platelets is not known. Here, we use paired-end next-generation RNA sequencing (RNA-seq) to characterize the polyadenylated transcriptomes of human and mouse platelets. We report that RNA-seq provides unprecedented resolution of mRNAs that are expressed across the entire human and mouse genomes. Transcript expression and abundance are often conserved between the 2 species. Several mRNAs, however, are differentially expressed in human and mouse platelets. Moreover, previously described functional disparities between mouse and human platelets are reflected in differences at the transcript level, including protease activated receptor-1, protease activated receptor-3, platelet activating factor receptor, and factor V. This suggests that RNA-seq is a useful tool for predicting differences in platelet function between mice and humans. Our next-generation sequencing analysis provides new insights into the human and murine platelet transcriptomes. The sequencing dataset will be useful in the design of mouse models of hemostasis and a catalyst for discovery of new functions of platelets. Access to the dataset is found in the “Introduction.” PMID:21596849

  19. Group B streptococcus activates transcriptomic pathways related to premature birth in human extraplacental membranes in vitro.

    Science.gov (United States)

    Park, Hae-Ryung; Harris, Sean M; Boldenow, Erica; McEachin, Richard C; Sartor, Maureen; Chames, Mark; Loch-Caruso, Rita

    2018-03-01

    Streptococcus agalactiae (group B streptococcus [GBS]) infection in pregnant women is the leading cause of infectious neonatal morbidity and mortality in the United States. Although inflammation during infection has been associated with preterm birth, the contribution of GBS to preterm birth is less certain. Moreover, the early mechanisms by which GBS interacts with the gestational tissue to affect adverse pregnancy outcomes are poorly understood. We hypothesized that short-term GBS inoculation activates pathways related to inflammation and premature birth in human extraplacental membranes. We tested this hypothesis using GBS-inoculated human extraplacental membranes in vitro. In agreement with our hypothesis, a microarray-based transcriptomics analysis of gene expression changes in GBS-inoculated membranes revealed that GBS activated pathways related to inflammation and preterm birth with significant gene expression changes occurring as early as 4 h postinoculation. In addition, pathways related to DNA replication and repair were downregulated with GBS treatment. Conclusions based on our transcriptomics data were further supported by responses of prostaglandin E2 (PGE2), and matrix metalloproteinases 1 (MMP1) and 3 (MMP3), all of which are known to be involved in parturition and premature rupture of membranes. These results support our initial hypothesis and provide new information on molecular targets of GBS infection in human extraplacental membranes.

  20. The H-mode operational window as determined from the ITER H-mode database

    International Nuclear Information System (INIS)

    Ryter, F.; Kardaun, O.J.W.F.; Stroth, U.

    1994-01-01

    The H-mode is a promising regime for fusion reactors and it is essential to be able to predict its operational window in future devices. The 'H-Mode Database Working Group' started in 1992 to gather, analyze and compare H-mode threshold data from several divertor tokamaks so that predictions could be made. The database and first results were presented and the threshold database has been improved and extended since. The work has two objectives: 1) to predict the minimum heating power necessary to reach the H-mode in future devices, 2) to contribute to physics studies of the L-H transition. (author) 11 refs., 2 figs

  1. Sensitive detection of viral transcripts in human tumor transcriptomes.

    Directory of Open Access Journals (Sweden)

    Sven-Eric Schelhorn

    Full Text Available In excess of 12% of human cancer incidents have a viral cofactor. Epidemiological studies of idiopathic human cancers indicate that additional tumor viruses remain to be discovered. Recent advances in sequencing technology have enabled systematic screenings of human tumor transcriptomes for viral transcripts. However, technical problems such as low abundances of viral transcripts in large volumes of sequencing data, viral sequence divergence, and homology between viral and human factors significantly confound identification of tumor viruses. We have developed a novel computational approach for detecting viral transcripts in human cancers that takes the aforementioned confounding factors into account and is applicable to a wide variety of viruses and tumors. We apply the approach to conducting the first systematic search for viruses in neuroblastoma, the most common cancer in infancy. The diverse clinical progression of this disease as well as related epidemiological and virological findings are highly suggestive of a pathogenic cofactor. However, a viral etiology of neuroblastoma is currently contested. We mapped 14 transcriptomes of neuroblastoma as well as positive and negative controls to the human and all known viral genomes in order to detect both known and unknown viruses. Analysis of controls, comparisons with related methods, and statistical estimates demonstrate the high sensitivity of our approach. Detailed investigation of putative viral transcripts within neuroblastoma samples did not provide evidence for the existence of any known human viruses. Likewise, de-novo assembly and analysis of chimeric transcripts did not result in expression signatures associated with novel human pathogens. 
While confounding factors such as sample dilution or viral clearance in progressed tumors may mask viral cofactors in the data, in principle, this is rendered less likely by the high sensitivity of our approach and the number of biological replicates

  2. The DExH/D protein family database.

    Science.gov (United States)

    Jankowsky, E; Jankowsky, A

    2000-01-01

    DExH/D proteins are essential for all aspects of cellular RNA metabolism and processing, in the replication of many viruses and in DNA replication. DExH/D proteins are subject to current biological, biochemical and biophysical research which provides a continuous wealth of data. The DExH/D protein family database compiles this information and makes it available over the WWW (http://www.columbia.edu/~ej67/dbhome.htm). The database can be fully searched by text-based queries, facilitating fast access to specific information about this important class of enzymes.

  3. Quantitative proteomics and transcriptomics reveals metabolic differences in attracting and non-attracting human-in-mouse glioma stem cell xenografts and stromal cells

    Directory of Open Access Journals (Sweden)

    Norelle C. Wildburger

    2015-09-01

    Full Text Available Bone marrow-derived human mesenchymal stem cells (BM-hMSCs) show promise as cell-based delivery vehicles for anti-glioma therapeutics, due to innate tropism for gliomas. However, in clinically relevant human-in-mouse glioma stem cell xenograft (GSCX) models, BM-hMSC tropism is variable. We compared the proteomic profile of cancer and stromal cells in GSCXs that attract BM-hMSCs (“attractors”) with those that do not (“non-attractors”) to identify pathways that may modulate BM-hMSC homing, followed by targeted transcriptomics. The results provide the first link between fatty acid metabolism, glucose metabolism, ROS, and N-glycosylation patterns in attractors. Reciprocal expression of these pathways in the stromal cells suggests microenvironmental cross-talk.

  4. Sequencing and de novo assembly of the transcriptome of the glassy-winged sharpshooter (Homalodisca vitripennis).

    Directory of Open Access Journals (Sweden)

    Raja Sekhar Nandety

    Full Text Available BACKGROUND: The glassy-winged sharpshooter Homalodisca vitripennis (Hemiptera: Cicadellidae) is a xylem-feeding leafhopper and important vector of the bacterium Xylella fastidiosa, the causal agent of Pierce's disease of grapevines. The functional complexity of the transcriptome of H. vitripennis has not been elucidated thus far. It is a necessary blueprint for an understanding of the development of H. vitripennis and for designing efficient biorational control strategies including those based on RNA interference. RESULTS: Here we elucidate and explore the transcriptome of adult H. vitripennis using high-throughput paired end deep sequencing and de novo assembly. A total of 32,803,656 paired-end reads were obtained with an average transcript length of 624 nucleotides. We assembled 32.9 Mb of the transcriptome of H. vitripennis that spanned across 47,265 loci and 52,708 transcripts. Comparison of our non-redundant database showed that 45% of the deduced proteins of H. vitripennis exhibit identity (e-value ≤ 1e-5) with known proteins. We assigned Gene Ontology (GO) terms, Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations, and potential Pfam domains to each transcript isoform. In order to gain insight into the molecular basis of key regulatory genes of H. vitripennis, we characterized predicted proteins involved in the metabolism of juvenile hormone, and biogenesis of small RNAs (Dicer and Piwi sequences) from the transcriptomic sequences. Analysis of transposable element sequences of H. vitripennis indicated that the genome is less expanded in comparison to many other insects with approximately 1% of the transcriptome carrying transposable elements. CONCLUSIONS: Our data significantly enhance the molecular resources available for future study and control of this economically important hemipteran. This transcriptional information not only provides a more nuanced understanding of the underlying biological and physiological mechanisms that

  5. Insight into the transcriptome of Arthrobotrys conoides using high throughput sequencing.

    Science.gov (United States)

    Ramesh, Pandit; Reena, Patel; Amitbikram, Mohapatra; Chaitanya, Joshi; Anju, Kunjadia

    2015-12-01

    Arthrobotrys conoides is a nematode-trapping fungus belonging to the Orbiliales, Ascomycota group, and traps prey nematodes by means of an adhesive network. The fungus has the potential to be used as a biocontrol agent against plant parasitic nematodes. In the present study, we characterized the transcriptome of A. conoides using high-throughput sequencing technology and characterized its virulence unigenes. A total of 7,255 cDNA contigs with an average length of 425 bp were generated and 6184 (61.81%) transcripts were functionally annotated and characterized. The majority of unigenes were found analogous to the genes of plant pathogenic fungi. A total of 1749 transcripts were found to be orthologous with eukaryotic proteins of the KOG database. Several carbohydrate active enzymes and peptidases were identified. We also analyzed classically and nonclassically secreted proteins and confirmed them by BLASTP against a fungal secretome database. A total of 916 contigs were analogous to 556 unique proteins of the Pathogen Host Interaction (PHI) database. Further, we identified 91 unigenes homologous to the database of fungal virulence factors (DFVF). A total of 104 putative protein kinase coding transcripts, which are major players in signaling pathways, were identified by BLASTP against the KinBase database. This study provides a comprehensive look at the transcriptome of A. conoides and the identified unigenes might have a role in catching and killing prey nematodes by A. conoides. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  6. The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database

    Energy Technology Data Exchange (ETDEWEB)

    Tagmount, Abderrahmane; Wang, Mei; Lindquist, Erika; Tanaka, Yoshihiro; Teranishi, Kristen S.; Sunagawa, Shinichi; Wong, Mike; Stillman, Jonathon H.

    2010-01-27

    Background: With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences is important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes. Methodology/Principal Findings: A set of ~30K unique sequences (UniSeqs) representing ~19K clusters were generated from ~98K high quality ESTs from a set of tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66% of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases. Conclusions/Significance: The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is as diverse as the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. 
Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in

  7. In Vivo Human Somitogenesis Guides Somite Development from hPSCs

    Directory of Open Access Journals (Sweden)

    Haibin Xi

    2017-02-01

    Full Text Available Somites form during embryonic development and give rise to unique cell and tissue types, such as skeletal muscles and bones and cartilage of the vertebrae. Using somitogenesis-stage human embryos, we performed transcriptomic profiling of human presomitic mesoderm as well as nascent and developed somites. In addition to conserved pathways such as WNT-β-catenin, we also identified BMP and transforming growth factor β (TGF-β) signaling as major regulators unique to human somitogenesis. This information enabled us to develop an efficient protocol to derive somite cells in vitro from human pluripotent stem cells (hPSCs). Importantly, the in-vitro-differentiating cells progressively expressed markers of the distinct developmental stages that are known to occur during in vivo somitogenesis. Furthermore, when subjected to lineage-specific differentiation conditions, the hPSC-derived somite cells were multipotent in generating somite derivatives, including skeletal myocytes, osteocytes, and chondrocytes. This work improves our understanding of human somitogenesis and may enhance our ability to treat diseases affecting somite derivatives.

  8. Transcriptome profiling of Elettaria cardamomum (L.) Maton (small cardamom)

    Directory of Open Access Journals (Sweden)

    F. Nadiya

    2017-03-01

    Full Text Available Elettaria cardamomum (L.) Maton, known as the ‘queen of spices’, is a perennial herbaceous monocot of the family Zingiberaceae, native to southern India. Cardamom is an economically valuable spice crop and is used widely for culinary and medicinal purposes. In the present study, using Ion Proton RNA sequencing technology, we performed transcriptome sequencing and de novo transcriptome assembly of a wild and five cultivar genotypes of cardamom. RNA-seq generated a total of 22,811,983 (92 base) and 24,889,197 (75 base) raw reads accounting for approximately 8.21GB and 7.65GB of sequence data for wild and cultivar genotypes of cardamom respectively. The raw data were submitted to the SRA database of NCBI under the accession numbers SRX1141272 (wild) and SRX1141276 (cultivars). The raw reads were quality filtered and assembled using the MIRA assembler, resulting in 112,208 and 264,161 contigs having N50 values of 616 and 664 for wild and cultivar cardamom respectively. The assembled unigenes were functionally annotated using several databases including PlantCyc for pathway annotation. This work represents the first report on cardamom transcriptome sequencing. In order to generate a comprehensive reference transcriptome, we further assembled the raw reads of wild and cultivar genotypes which might enrich the plant transcriptome database and trigger advanced research in cardamom genomics.
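
    The N50 values quoted in this record summarize how contiguous an assembly is: N50 is the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the assembled bases. The sketch below illustrates that standard definition only; it is not the MIRA implementation, and the function name `n50` and the toy contig lengths are assumptions made for illustration, not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Illustrative helper (hypothetical name): sort contigs from longest to
    shortest and walk down the list until the running total reaches half
    of the summed length; the contig reached at that point is the N50.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy contig lengths, invented for this example (not from the cardamom assembly).
toy_contigs = [100, 200, 300, 400, 500]
print(n50(toy_contigs))
```

    Because N50 depends on the total assembled length, values from assemblies of very different sizes, such as the wild and cultivar assemblies here, are best compared alongside the contig counts.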

  9. Transcriptome analysis of the desert locust central nervous system: production and annotation of a Schistocerca gregaria EST database.

    Science.gov (United States)

    Badisco, Liesbeth; Huybrechts, Jurgen; Simonet, Gert; Verlinden, Heleen; Marchal, Elisabeth; Huybrechts, Roger; Schoofs, Liliane; De Loof, Arnold; Vanden Broeck, Jozef

    2011-03-21

    The desert locust (Schistocerca gregaria) displays a fascinating type of phenotypic plasticity, designated as 'phase polyphenism'. Depending on environmental conditions, one genome can be translated into two highly divergent phenotypes, termed the solitarious and gregarious (swarming) phase. Although many of the underlying molecular events remain elusive, the central nervous system (CNS) is expected to play a crucial role in the phase transition process. Locusts have also proven to be interesting model organisms in a physiological and neurobiological research context. However, molecular studies in locusts are hampered by the fact that genome/transcriptome sequence information available for this branch of insects is still limited. We have generated 34,672 raw expressed sequence tags (EST) from the CNS of desert locusts in both phases. These ESTs were assembled in 12,709 unique transcript sequences and nearly 4,000 sequences were functionally annotated. Moreover, the obtained S. gregaria EST information is highly complementary to the existing orthopteran transcriptomic data. Since many novel transcripts encode neuronal signaling and signal transduction components, this paper includes an overview of these sequences. Furthermore, several transcripts being differentially represented in solitarious and gregarious locusts were retrieved from this EST database. The findings highlight the involvement of the CNS in the phase transition process and indicate that this novel annotated database may also add to the emerging knowledge of concomitant neuronal signaling and neuroplasticity events. In summary, we met the need for novel sequence data from desert locust CNS. To our knowledge, we hereby also present the first insect EST database that is derived from the complete CNS. The obtained S. gregaria EST data constitute an important new source of information that will be instrumental in further unraveling the molecular principles of phase polyphenism, in further establishing

  10. Transcriptome analysis of the desert locust central nervous system: production and annotation of a Schistocerca gregaria EST database.

    Directory of Open Access Journals (Sweden)

    Liesbeth Badisco

    Full Text Available BACKGROUND: The desert locust (Schistocerca gregaria) displays a fascinating type of phenotypic plasticity, designated as 'phase polyphenism'. Depending on environmental conditions, one genome can be translated into two highly divergent phenotypes, termed the solitarious and gregarious (swarming) phase. Although many of the underlying molecular events remain elusive, the central nervous system (CNS) is expected to play a crucial role in the phase transition process. Locusts have also proven to be interesting model organisms in a physiological and neurobiological research context. However, molecular studies in locusts are hampered by the fact that genome/transcriptome sequence information available for this branch of insects is still limited. METHODOLOGY: We have generated 34,672 raw expressed sequence tags (EST) from the CNS of desert locusts in both phases. These ESTs were assembled in 12,709 unique transcript sequences and nearly 4,000 sequences were functionally annotated. Moreover, the obtained S. gregaria EST information is highly complementary to the existing orthopteran transcriptomic data. Since many novel transcripts encode neuronal signaling and signal transduction components, this paper includes an overview of these sequences. Furthermore, several transcripts being differentially represented in solitarious and gregarious locusts were retrieved from this EST database. The findings highlight the involvement of the CNS in the phase transition process and indicate that this novel annotated database may also add to the emerging knowledge of concomitant neuronal signaling and neuroplasticity events. CONCLUSIONS: In summary, we met the need for novel sequence data from desert locust CNS. To our knowledge, we hereby also present the first insect EST database that is derived from the complete CNS. The obtained S. gregaria EST data constitute an important new source of information that will be instrumental in further unraveling the molecular

  11. Database security - how can developers and DBAs do it together and what can other Service Managers learn from it

    CERN Multimedia

    CERN. Geneva

    2014-01-01

    This talk gives an overview of security threats affecting databases, the preventive measures that we are taking at CERN and best practices in the industry. The presentation will describe how generic the threats are and how other service managers can profit from the database experience to protect other systems.

  12. DisFace: A Database of Human Facial Disorders

    Directory of Open Access Journals (Sweden)

    Paramjit Kaur

    2017-10-01

    Full Text Available The face is an integral part of the human body through which an individual communicates in society. Its importance is highlighted by the fact that a person deprived of a face cannot sustain themselves in the living world. In the past few decades, the human face has gained the attention of several researchers, whether in relation to facial anthropometry, facial disorders, face transplantation or face reconstruction. Several studies have also shown the correlation between neuropsychiatric disorders and the human face, and how face recognition abilities are correlated with these disorders. Currently, several databases exist which contain facial images of individuals captured from different sources. The advantage of these databases is that their images can be used for testing and training purposes. However, to date no database exists that provides not only facial images of individuals but also the literature concerning the human face, a list of genes controlling the human face, a list of facial disorders and various tools which work on facial images. Thus, the current research aims at developing a database of human facial disorders using a bioinformatics approach. The database will contain information about facial diseases, medications, symptoms, findings, etc. The information will be extracted from several other databases like OMIM, PubChem, Radiopedia, Medline Plus, FDA, etc. and links to them will also be provided. Initially, the diseases specific to the human face have been obtained from already published corpora of literature using a text mining approach. The Becas tool was used for this task. A dataset will be created and stored in the form of a database. It will contain a cross-referenced index of human facial diseases, medications, symptoms, signs, etc. Thus, a database on the human face with complete existing information about human facial disorders will be developed. The novelty of the

  13. Sequencing and characterization of the guppy (Poecilia reticulata) transcriptome

    Directory of Open Access Journals (Sweden)

    Rodd F Helen

    2011-04-01

    Full Text Available Abstract Background Next-generation sequencing is providing researchers with a relatively fast and affordable option for developing genomic resources for organisms that are not among the traditional genetic models. Here we present a de novo assembly of the guppy (Poecilia reticulata) transcriptome using 454 sequence reads, and we evaluate potential uses of this transcriptome, including detection of sex-specific transcripts and deployment as a reference for gene expression analysis in guppies and a related species. Guppies have been model organisms in ecology, evolutionary biology, and animal behaviour for over 100 years. An annotated transcriptome and other genomic tools will facilitate understanding the genetic and molecular bases of adaptation and variation in a vertebrate species with a uniquely well known natural history. Results We generated approximately 336 Mbp of mRNA sequence data from male brain, male body, female brain, and female body. The resulting 1,162,670 reads assembled into 54,921 contigs, creating a reference transcriptome for the guppy with an average read depth of 28×. We annotated nearly 40% of this reference transcriptome by searching protein and gene ontology databases. Using this annotated transcriptome database, we identified candidate genes of interest to the guppy research community, putative single nucleotide polymorphisms (SNPs), and male-specific expressed genes. We also showed that our reference transcriptome can be used for RNA-sequencing-based analysis of differential gene expression. We identified transcripts that, in juveniles, are regulated differently in the presence and absence of an important predator, Rivulus hartii, including two genes implicated in stress response. For each sample in the RNA-seq study, >50% of high-quality reads mapped to unique sequences in the reference database with high confidence. 
In addition, we evaluated the use of the guppy reference transcriptome for gene expression analyses in

  14. Cardiovascular risk protection from the Mediterranean diet and olive oil. A transcriptomic update in humans

    International Nuclear Information System (INIS)

    Carrion, S.; Torres, L.; Castañer, O.

    2016-01-01

    This review highlights the human studies that explore the benefits of the Mediterranean diet and olive oil, based on gene expression analysis. We summarized consistent human transcriptomic studies on cardiovascular risk, based on traditional Mediterranean diet (TMD) and olive oil interventions, with real life doses and conditions. A literature review was carried out leading up to February 2016. The results show that the TMD, especially when supplemented with virgin olive oil, produces beneficial changes in the transcriptomic response of relevant genes in cardiovascular risk such as CAT, GPX1 and SIRT2; p65 and MCP-1, IL1B, IL6, CXCL1, INF-γ, ARHGAP15 and IL7R, which are involved in inflammation; and ABCA1, SR-B1, PPARBP, PPARα, PPARγ, PPARδ, CD-36 and COX-1, which play an important role in cholesterol efflux. The available data illustrate a transcriptomic effect on atherosclerosis, inflammation and oxidative stress pathways as well as the mentioned genes.

  15. Cardiovascular risk protection from the Mediterranean diet and olive oil. A transcriptomic update in humans

    Directory of Open Access Journals (Sweden)

    S. Carrión

    2016-12-01

    Full Text Available This review highlights the human studies that explore the benefits of the Mediterranean diet and olive oil, based on gene expression analysis. We summarized consistent human transcriptomic studies on cardiovascular risk, based on traditional Mediterranean diet (TMD) and olive oil interventions, with real life doses and conditions. A literature review was carried out leading up to February 2016. The results show that the TMD, especially when supplemented with virgin olive oil, produces beneficial changes in the transcriptomic response of relevant genes in cardiovascular risk such as CAT, GPX1 and SIRT2; p65 and MCP-1, IL1B, IL6, CXCL1, INF-γ, ARHGAP15 and IL7R, which are involved in inflammation; and ABCA1, SR-B1, PPARBP, PPARα, PPARγ, PPARδ, CD-36 and COX-1, which play an important role in cholesterol efflux. The available data illustrate a transcriptomic effect on atherosclerosis, inflammation and oxidative stress pathways as well as the mentioned genes.

  16. The Human Pancreas Proteome Defined by Transcriptomics and Antibody-Based Profiling

    Science.gov (United States)

    Fagerberg, Linn; Hallström, Björn M.; Schwenk, Jochen M.; Uhlén, Mathias; Korsgren, Olle; Lindskog, Cecilia

    2014-01-01

    The pancreas is composed of both exocrine glands and intermingled endocrine cells to execute its diverse functions, including enzyme production for digestion of nutrients and hormone secretion for regulation of blood glucose levels. To define the molecular constituents with elevated expression in the human pancreas, we employed a genome-wide RNA sequencing analysis of the human transcriptome to identify genes with elevated expression in the human pancreas. This quantitative transcriptomics data was combined with immunohistochemistry-based protein profiling to allow mapping of the corresponding proteins to different compartments and specific cell types within the pancreas down to the single cell level. Analysis of whole pancreas identified 146 genes with elevated expression levels, of which 47 revealed a particular higher expression as compared to the other analyzed tissue types, thus termed pancreas enriched. Extended analysis of in vitro isolated endocrine islets identified an additional set of 42 genes with elevated expression in these specialized cells. Although only 0.7% of all genes showed an elevated expression level in the pancreas, this fraction of transcripts, in most cases encoding secreted proteins, constituted 68% of the total mRNA in pancreas. This demonstrates the extreme specialization of the pancreas for production of secreted proteins. Among the elevated expression profiles, several previously not described proteins were identified, both in endocrine cells (CFC1, FAM159B, RBPJL and RGS9) and exocrine glandular cells (AQP12A, DPEP1, GATM and ERP27). In summary, we provide a global analysis of the pancreas transcriptome and proteome with a comprehensive list of genes and proteins with elevated expression in pancreas. This list represents an important starting point for further studies of the molecular repertoire of pancreatic cells and their relation to disease states or treatment effects. PMID:25546435

  17. Integration of deep transcriptome and proteome analyses reveals the components of alkaloid metabolism in opium poppy cell cultures.

    Science.gov (United States)

    Desgagné-Penix, Isabel; Khan, Morgan F; Schriemer, David C; Cram, Dustin; Nowak, Jacek; Facchini, Peter J

    2010-11-18

    Papaver somniferum (opium poppy) is the source for several pharmaceutical benzylisoquinoline alkaloids including morphine, codeine and sanguinarine. In response to treatment with a fungal elicitor, the biosynthesis and accumulation of sanguinarine is induced along with other plant defense responses in opium poppy cell cultures. The transcriptional induction of alkaloid metabolism in cultured cells provides an opportunity to identify components of this process via the integration of deep transcriptome and proteome databases generated using next-generation technologies. A cDNA library was prepared for opium poppy cell cultures treated with a fungal elicitor for 10 h. Using 454 GS-FLX Titanium pyrosequencing, 427,369 expressed sequence tags (ESTs) with an average length of 462 bp were generated. Assembly of these sequences yielded 93,723 unigenes, of which 23,753 were assigned Gene Ontology annotations. Transcripts encoding all known sanguinarine biosynthetic enzymes were identified in the EST database, 5 of which were represented among the 50 most abundant transcripts. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) of total protein extracts from cell cultures treated with a fungal elicitor for 50 h facilitated the identification of 1,004 proteins. Proteins were fractionated by one-dimensional SDS-PAGE and digested with trypsin prior to LC-MS/MS analysis. Query of an opium poppy-specific EST database substantially enhanced peptide identification. Eight out of 10 known sanguinarine biosynthetic enzymes and many relevant primary metabolic enzymes were represented in the peptide database. The integration of deep transcriptome and proteome analyses provides an effective platform to catalogue the components of secondary metabolism, and to identify genes encoding uncharacterized enzymes. 
The establishment of corresponding transcript and protein databases generated by next-generation technologies in a system with a well-defined metabolite profile facilitates

  18. Human Ageing Genomic Resources: new and updated databases

    Science.gov (United States)

    Tacutu, Robi; Thornton, Daniel; Johnson, Emily; Budovsky, Arie; Barardo, Diogo; Craig, Thomas; Diana, Eugene; Lehmann, Gilad; Toren, Dmitri; Wang, Jingwei; Fraifeld, Vadim E

    2018-01-01

    Abstract In spite of a growing body of research and data, human ageing remains a poorly understood process. Over 10 years ago we developed the Human Ageing Genomic Resources (HAGR), a collection of databases and tools for studying the biology and genetics of ageing. Here, we present HAGR’s main functionalities, highlighting new additions and improvements. HAGR consists of six core databases: (i) the GenAge database of ageing-related genes, in turn composed of a dataset of >300 human ageing-related genes and a dataset with >2000 genes associated with ageing or longevity in model organisms; (ii) the AnAge database of animal ageing and longevity, featuring >4000 species; (iii) the GenDR database with >200 genes associated with the life-extending effects of dietary restriction; (iv) the LongevityMap database of human genetic association studies of longevity with >500 entries; (v) the DrugAge database with >400 ageing or longevity-associated drugs or compounds; (vi) the CellAge database with >200 genes associated with cell senescence. All our databases are manually curated by experts and regularly updated to ensure high-quality data. Cross-links across our databases and to external resources help researchers locate and integrate relevant information. HAGR is freely available online (http://genomics.senescence.info/). PMID:29121237

  19. Human Thermal Model Evaluation Using the JSC Human Thermal Database

    Science.gov (United States)

    Bue, Grant; Makinen, Janice; Cognata, Thomas

    2012-01-01

    Human thermal modeling has considerable long term utility to human space flight. Such models provide a tool to predict crew survivability in support of vehicle design and to evaluate crew response in untested space environments. It is to the benefit of any such model not only to collect relevant experimental data to correlate it against, but also to maintain an experimental standard or benchmark for future development in a readily and rapidly searchable and software accessible format. The Human thermal database project is intended to do just that: to collect relevant data from literature and experimentation and to store the data in a database structure for immediate and future use as a benchmark to judge human thermal models against, in identifying model strengths and weaknesses, to support model development and improve correlation, and to statistically quantify a model's predictive quality. The human thermal database developed at the Johnson Space Center (JSC) is intended to evaluate a set of widely used human thermal models. This set includes the Wissler human thermal model, a model that has been widely used to predict the human thermoregulatory response to a variety of cold and hot environments. These models are statistically compared to the current database, which contains experiments of human subjects primarily in air from a literature survey ranging between 1953 and 2004 and from a suited experiment recently performed by the authors, for a quantitative study of relative strength and predictive quality of the models.

  20. Transcriptome sequencing from diverse human populations reveals differentiated regulatory architecture.

    Directory of Open Access Journals (Sweden)

    Alicia R Martin

    2014-08-01

    Full Text Available Large-scale sequencing efforts have documented extensive genetic variation within the human genome. However, our understanding of the origins, global distribution, and functional consequences of this variation is far from complete. While regulatory variation influencing gene expression has been studied within a handful of populations, the breadth of transcriptome differences across diverse human populations has not been systematically analyzed. To better understand the spectrum of gene expression variation, alternative splicing, and the population genetics of regulatory variation in humans, we have sequenced the genomes, exomes, and transcriptomes of EBV-transformed lymphoblastoid cell lines derived from 45 individuals in the Human Genome Diversity Panel (HGDP). The populations sampled span the geographic breadth of human migration history and include Namibian San, Mbuti Pygmies of the Democratic Republic of Congo, Algerian Mozabites, Pathan of Pakistan, Cambodians of East Asia, Yakut of Siberia, and Mayans of Mexico. We discover that approximately 25.0% of the variation in gene expression found amongst individuals can be attributed to population differences. However, we find few genes that are systematically differentially expressed among populations. Of this population-specific variation, 75.5% is due to expression rather than splicing variability, and we find few genes with strong evidence for differential splicing across populations. Allelic expression analyses indicate that previously mapped common regulatory variants identified in eight populations from the International Haplotype Map Phase 3 project have similar effects in our seven sampled HGDP populations, suggesting that the cellular effects of common variants are shared across diverse populations. Together, these results provide a resource for studies analyzing functional differences across populations by estimating the degree of shared gene expression, alternative splicing, and

  1. Ginseng Genome Database: an open-access platform for genomics of Panax ginseng.

    Science.gov (United States)

    Jayakodi, Murukarthick; Choi, Beom-Soon; Lee, Sang-Choon; Kim, Nam-Hoon; Park, Jee Young; Jang, Woojong; Lakshmanan, Meiyappan; Mohan, Shobhana V G; Lee, Dong-Yup; Yang, Tae-Jin

    2018-04-12

    Ginseng (Panax ginseng C.A. Meyer) is a perennial herbaceous plant that has been used in traditional oriental medicine for thousands of years. Ginsenosides, which have significant pharmacological effects on human health, are the foremost bioactive constituents in this plant. Given the importance of this plant to humans, an integrated omics resource is indispensable to facilitate genomic research, molecular breeding and pharmacological study of this herb. The first draft genome sequences of P. ginseng cultivar "Chunpoong" were reported recently. Here, using the draft genome, transcriptome, and functional annotation datasets of P. ginseng, we have constructed the Ginseng Genome Database (http://ginsengdb.snu.ac.kr/), the first open-access platform to provide comprehensive genomic resources of P. ginseng. The current version of this database provides the most up-to-date draft genome sequence (approximately 3000 Mbp of scaffold sequences) along with the structural and functional annotations for 59,352 genes and digital expression of genes based on transcriptome data from different tissues, growth stages and treatments. In addition, tools for visualization and the genomic data from various analyses are provided. All data in the database were manually curated and integrated within a user-friendly query page. This database provides valuable resources for a range of research fields related to P. ginseng and other species belonging to the Apiales order as well as for plant research communities in general. The Ginseng Genome Database can be accessed at http://ginsengdb.snu.ac.kr/.

  2. Comparative Transcriptomic Profiling and Gene Expression for Myxomatous Mitral Valve Disease in the Dog and Human

    Directory of Open Access Journals (Sweden)

    Greg R. Markby

    2017-07-01

    Full Text Available Myxomatous mitral valve disease is the single most important mitral valve disease in both dogs and humans. In the dog it is ubiquitous, such that all aged dogs will have some evidence of the disease. In humans it is known as Barlow’s disease and affects up to 3% of the population, with an expected increase in prevalence as the population ages. Disease in the two species shows many similarities and, while both have the classic myxomatous degeneration, only in humans is there extensive fibrosis. This dual pathology of the human disease markedly affects the valve transcriptome, and the difference between the dog and human is dominated by changes in genes associated with fibrosis. This review will briefly examine the comparative valve pathology and then, in more detail, the transcriptomic profiling and gene expression reported so far for both species.

  3. Legionella pneumophila transcriptome during intracellular multiplication in human macrophages

    Directory of Open Access Journals (Sweden)

    Sebastien P Faucher

    2011-04-01

    Full Text Available Legionella pneumophila is the causative agent of Legionnaires’ disease, an acute pulmonary infection. L. pneumophila is able to infect and multiply in both phagocytic protozoa, such as Acanthamoeba castellanii, and mammalian professional phagocytes. The best-known L. pneumophila virulence determinant is the Icm/Dot Type IVB secretion system (TFBSS), which is used to translocate more than 150 effector proteins to host cells. While the transcriptional response of Legionella to the intracellular environment of A. castellanii has been investigated, much less is known about the Legionella transcriptional response inside human macrophages. In this study, the transcriptome of L. pneumophila was monitored during exponential and post-exponential phase in rich AYE broth as well as during infection of human cultured macrophages. This was accomplished with microarrays and an RNA amplification procedure called SCOTS to detect small amounts of mRNA from low numbers of intracellular bacteria. Among the genes induced intracellularly are those involved in amino acid biosynthetic pathways leading to L-arginine, L-histidine and L-proline as well as many transport systems involved in amino acid and iron uptake. A gene involved in catabolism of glycerol is also induced during intracellular growth, suggesting that glycerol could be used as a carbon source. The genes encoding the Icm/Dot system are not differentially expressed inside cells compared to control bacteria grown in rich broth, but the genes encoding several translocated effectors are strongly induced. Moreover, we used the transcriptome data to predict previously unrecognized Icm/Dot effector genes based on their expression pattern and confirmed translocation for three candidates. This study provides a comprehensive view of how L. pneumophila responds to the human macrophage intracellular environment.

  4. De novo assembly and characterization of the transcriptome of seagrass Zostera marina using Illumina paired-end sequencing.

    Directory of Open Access Journals (Sweden)

    Fanna Kong

    Full Text Available BACKGROUND: The seagrass Zostera marina is a monocotyledonous angiosperm belonging to a polyphyletic group of plants that can live submerged in marine habitats. Zostera marina L. is one of the most common seagrasses and is considered a cornerstone of marine plant molecular ecology research and comparative studies. However, the mechanisms underlying its adaptation to the marine environment remain poorly understood due to limited transcriptomic and genomic data. PRINCIPAL FINDINGS: Here we explored the transcriptome of Z. marina leaves under different environmental conditions using Illumina paired-end sequencing. Approximately 55 million sequencing reads were obtained, representing 58,457 transcripts that correspond to 24,216 unigenes. A total of 14,389 (59.41%) unigenes were annotated by BLAST searches against the NCBI non-redundant protein database. 45.18% and 46.91% of the unigenes had significant similarity with proteins in the Swiss-Prot database and Pfam database, respectively. Among these, 13,897 unigenes were assigned to 57 Gene Ontology (GO) terms and 4,745 unigenes were identified and mapped to 233 pathways via functional annotation against the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database. We compared the orthologous gene families of the Z. marina transcriptome with those of Oryza sativa and Pyropia yezoensis and found that 11,667 orthologous gene families are specific to Z. marina. Furthermore, we identified the photoreceptors sensing red/far-red light and blue light. We also identified a large number of genes that encode ion transporters and channels, including Na+ efflux, K+ uptake, Cl- channels, and H+ pumping. CONCLUSIONS: Our study contains an extensive sequencing and gene-annotation analysis of Z. marina. This information represents a genetic resource for the discovery of genes related to light sensing and salt tolerance in this species. Our transcriptome can be further utilized in future studies on molecular adaptation to

  5. The Escherichia coli transcriptome linked to growth fitness

    Directory of Open Access Journals (Sweden)

    Bei-Wen Ying

    2016-03-01

    Full Text Available A series of Escherichia coli strains with varied genomic sequences were subjected to high-density microarray analyses to elucidate the fitness-correlated transcriptomes. Fitness, which is commonly evaluated by the growth rate during the exponential phase, is not only determined by the genome but is also linked to growth conditions, e.g., temperature. We previously reported genetic and environmental contributions to E. coli transcriptomes and evolutionary transcriptome changes in thermal adaptation. Here, we describe experimental details on how to prepare microarray samples that truly represent the growth fitness of the E. coli cells. A step-by-step record of sample preparation procedures that correspond to growing cells and transcriptome data sets that are deposited at the GEO database (GSE33212, GSE52770, GSE61739) are also provided for reference. Keywords: Transcriptome, Growth fitness, Escherichia coli, Microarray

  6. Transcriptomic responses to ocean acidification in larval sea urchins from a naturally variable pH environment.

    Science.gov (United States)

    Evans, Tyler G; Chan, Francis; Menge, Bruce A; Hofmann, Gretchen E

    2013-03-01

    Some marine ecosystems already experience natural declines in pH approximating those predicted with future anthropogenic ocean acidification (OA), the decline in seawater pH caused by the absorption of atmospheric CO2. The molecular mechanisms that allow organisms to inhabit these low pH environments, particularly those building calcium carbonate skeletons, are unknown. Also uncertain is whether an enhanced capacity to cope with present day pH variation will confer resistance to future OA. To address these issues, we monitored natural pH dynamics within an intertidal habitat in the Northeast Pacific, demonstrating that upwelling exposes resident species to pH regimes not predicted to occur elsewhere until 2100. Next, we cultured the progeny of adult purple sea urchins (Strongylocentrotus purpuratus) collected from this region in CO2-acidified seawater representing present day and near future ocean scenarios and monitored gene expression using transcriptomics. We hypothesized that persistent exposure to upwelling during evolutionary history will have selected for increased pH tolerance in this population and that their transcriptomic response to low pH seawater would provide insight into mechanisms underlying pH tolerance in a calcifying species. Resulting expression patterns revealed two important trends. Firstly, S. purpuratus larvae may alter the bioavailability of calcium and adjust skeletogenic pathways to sustain calcification in a low pH ocean. Secondly, larvae use different strategies for coping with different magnitudes of pH stress: initiating a robust transcriptional response to present day pH regimes but a muted response to near future conditions. Thus, an enhanced capacity to cope with present day pH variation may not translate into success in future oceans. © 2013 Blackwell Publishing Ltd.

  7. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

    DEFF Research Database (Denmark)

    Camargo, A A; Samaia, H P; Dias-Neto, E

    2001-01-01

    ,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most...

  8. RNA isolation for transcriptomics of human and mouse small skin biopsies

    Directory of Open Access Journals (Sweden)

    Breit Timo M

    2011-10-01

    Full Text Available Abstract Background Isolation of RNA from skin biopsies presents a challenge, due to the tough nature of skin tissue and a high presence of RNases. As we lacked the dedicated equipment, i.e. homogenizer or bead-beater, needed for the available methods for isolating RNA from skin, we adapted and tested our zebrafish single-embryo RNA-isolation protocol for RNA isolation from skin punch biopsies. Findings We tested our new RNA-isolation protocol in two experiments: a large-scale study with 97 human skin samples, and a small study with 16 mouse skin samples. Human skin was sampled with 4.0 mm biopsy punches, and for the mouse skin different punch diameters were tested: 1.0, 1.5, 2.0, and 2.5 mm. The average RNA yield in human samples was 1.5 μg with an average RNA quality RIN value of 8.1. For the mouse biopsies, the average RNA yield was 2.4 μg with an average RIN value of 7.5. For 96% of the human biopsies and 100% of the mouse biopsies we obtained enough high-quality RNA. The RNA samples were successfully tested in a transcriptomics analysis using the Affymetrix and Roche NimbleGen platforms. Conclusions Using our new RNA-isolation protocol, we were able to consistently isolate high-quality RNA, which is suitable for further transcriptomics analysis. Furthermore, this method is already usable on biopsy material obtained with a punch diameter as small as 1.5 mm.

  9. Integration of deep transcriptome and proteome analyses reveals the components of alkaloid metabolism in opium poppy cell cultures

    Directory of Open Access Journals (Sweden)

    Schriemer David C

    2010-11-01

    Full Text Available Abstract Background Papaver somniferum (opium poppy) is the source for several pharmaceutical benzylisoquinoline alkaloids including morphine, codeine and sanguinarine. In response to treatment with a fungal elicitor, the biosynthesis and accumulation of sanguinarine is induced along with other plant defense responses in opium poppy cell cultures. The transcriptional induction of alkaloid metabolism in cultured cells provides an opportunity to identify components of this process via the integration of deep transcriptome and proteome databases generated using next-generation technologies. Results A cDNA library was prepared for opium poppy cell cultures treated with a fungal elicitor for 10 h. Using 454 GS-FLX Titanium pyrosequencing, 427,369 expressed sequence tags (ESTs) with an average length of 462 bp were generated. Assembly of these sequences yielded 93,723 unigenes, of which 23,753 were assigned Gene Ontology annotations. Transcripts encoding all known sanguinarine biosynthetic enzymes were identified in the EST database, 5 of which were represented among the 50 most abundant transcripts. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) of total protein extracts from cell cultures treated with a fungal elicitor for 50 h facilitated the identification of 1,004 proteins. Proteins were fractionated by one-dimensional SDS-PAGE and digested with trypsin prior to LC-MS/MS analysis. Query of an opium poppy-specific EST database substantially enhanced peptide identification. Eight out of 10 known sanguinarine biosynthetic enzymes and many relevant primary metabolic enzymes were represented in the peptide database. Conclusions The integration of deep transcriptome and proteome analyses provides an effective platform to catalogue the components of secondary metabolism, and to identify genes encoding uncharacterized enzymes. The establishment of corresponding transcript and protein databases generated by next-generation technologies in a

  10. Characterization of Liaoning cashmere goat transcriptome: sequencing, de novo assembly, functional annotation and comparative analysis.

    Directory of Open Access Journals (Sweden)

    Hongliang Liu

    Full Text Available The Liaoning cashmere goat is a goat breed famous for its cashmere wool. In order to increase the transcriptome data and accelerate genetic improvement for this breed, we performed de novo transcriptome sequencing to generate the first expressed sequence tag dataset for the Liaoning cashmere goat, using next-generation sequencing technology. Transcriptome sequencing of Liaoning cashmere goat on a Roche 454 platform yielded 804,601 high-quality reads. Clustering and assembly of these reads produced a non-redundant set of 117,854 unigenes, comprising 13,194 isotigs and 104,660 singletons. Based on similarity searches with known proteins, 17,356 unigenes were assigned to 6,700 GO categories, and the terms were summarized into three main GO categories and 59 sub-categories. 3,548 and 46,778 unigenes had significant similarity to existing sequences in the KEGG and COG databases, respectively. Comparative analysis revealed that 42,254 unigenes were aligned to 17,532 different sequences in NCBI non-redundant nucleotide databases. 97,236 (82.51%) unigenes were mapped to the 30 goat chromosomes. 35,551 (30.17%) unigenes were matched to 11,438 reported goat protein-coding genes. The remaining non-matched unigenes were further compared with cattle and human reference genes, and 67 putative new goat genes were discovered. Additionally, 2,781 potential simple sequence repeats were initially identified from all unigenes. The transcriptome of Liaoning cashmere goat was deeply sequenced, de novo assembled, and annotated, providing abundant data to better understand the Liaoning cashmere goat transcriptome. The potential simple sequence repeats provide a material basis for future genetic linkage and quantitative trait loci analyses.

  11. High-protein and high-carbohydrate breakfasts differentially change the transcriptome of human blood cells

    NARCIS (Netherlands)

    Erk, M.J. van; Blom, W.A.M.; Ommen, B. van; Hendriks, H.F.J.

    2006-01-01

    Background: Application of transcriptomics technology in human nutrition intervention studies would allow for genome-wide screening of the effects of specific diets or nutrients and result in biomarker profiles. Objective: The aim was to evaluate the potential of gene expression profiling in blood

  12. DDBS DB Alert

    Data.gov (United States)

    Social Security Administration — Data store used by the database area for monitoring of database objects. It is used to generate alerts that the DBAs investigate to determine if any action needs to...

  13. Transcriptomic analysis of flower development in wintersweet (Chimonanthus praecox).

    Science.gov (United States)

    Liu, Daofeng; Sui, Shunzhao; Ma, Jing; Li, Zhineng; Guo, Yulong; Luo, Dengpan; Yang, Jianfeng; Li, Mingyang

    2014-01-01

    Wintersweet (Chimonanthus praecox) is familiar as a garden plant and woody ornamental flower. On account of its unique flowering time and strong fragrance, it has a high ornamental and economic value. Despite a long history of human cultivation, our understanding of wintersweet genetics and molecular biology remains scant, reflecting a lack of basic genomic and transcriptomic data. In this study, we assembled three cDNA libraries, from three successive stages in flower development, designated as the flower bud with displayed petal, open flower and senescing flower stages. Using the Illumina RNA-Seq method, we obtained 21,412,928, 26,950,404, 24,912,954 qualified Illumina reads, respectively, for the three successive stages. The pooled reads from all three libraries were then assembled into 106,995 transcripts, 51,793 of which were annotated in the NCBI non-redundant protein database. Of these annotated sequences, 32,649 and 21,893 transcripts were assigned to gene ontology categories and clusters of orthologous groups, respectively. We could map 15,587 transcripts onto 312 pathways using the Kyoto Encyclopedia of Genes and Genomes pathway database. Based on these transcriptomic data, we obtained a large number of candidate genes that were differentially expressed at the open flower and senescing flower stages. An analysis of differentially expressed genes involved in plant hormone signal transduction pathways indicated that although flower opening and senescence may be independent of the ethylene signaling pathway in wintersweet, salicylic acid may be involved in the regulation of flower senescence. We also succeeded in isolating key genes of floral scent biosynthesis and proposed a biosynthetic pathway for monoterpenes and sesquiterpenes in wintersweet flowers, based on the annotated sequences. This comprehensive transcriptomic analysis presents fundamental information on the genes and pathways which are involved in flower development in wintersweet. 
And our data

  14. Transcriptome and H3K27 tri-methylation profiling of Ezh2-deficient lung epithelium

    Directory of Open Access Journals (Sweden)

    Aliaksei Z. Holik

    2015-09-01

    Full Text Available The adaptation of the lungs to air breathing at birth requires the fine orchestration of different processes to control lung morphogenesis and progenitor cell differentiation. However, there is little understanding of the role that epigenetic modifiers play in the control of lung development. We found that the histone methyl transferase Ezh2 plays a critical role in lung lineage specification and survival at birth. We performed a genome-wide transcriptome study combined with a genome-wide analysis of the distribution of H3K27 tri-methylation marks to interrogate the role of Ezh2 in lung epithelial cells. Lung cells isolated from Ezh2-deficient and control mice at embryonic day E16.5 were sorted into epithelial and mesenchymal populations based on EpCAM expression. This enabled us to dissect the transcriptional and epigenetic changes induced by the loss of Ezh2 specifically in the lung epithelium. Here we provide a detailed description of the analysis of the RNA-seq and ChIP-seq data, including quality control, read mapping, differential expression and differential binding analyses, as well as visualisation methods used to present the data. These data can be accessed from the Gene Expression Omnibus database (super-series accession number GSE57393).

  15. Transcriptomic comparisons between cultured human adipose tissue-derived pericytes and mesenchymal stromal cells

    Directory of Open Access Journals (Sweden)

    Lindolfo da Silva Meirelles

    2016-03-01

    Full Text Available Mesenchymal stromal cells (MSCs), sometimes called mesenchymal stem cells, are cultured cells able to give rise to mature mesenchymal cells such as adipocytes, osteoblasts, and chondrocytes, and to secrete a wide range of trophic and immunomodulatory molecules. Evidence indicates that pericytes, cells that surround and maintain physical connections with endothelial cells in blood vessels, can give rise to MSCs (da Silva Meirelles et al., 2008 [1]; Caplan and Correa, 2011 [2]). We have compared the transcriptomes of highly purified, human adipose tissue pericytes subjected to culture-expansion in pericyte medium or MSC medium, with that of human adipose tissue MSCs isolated with traditional methods to test the hypothesis that their transcriptomes are similar (da Silva Meirelles et al., 2015 [3]). Here, we provide further information and analyses of microarray data from three pericyte populations cultured in pericyte medium, three pericyte populations cultured in MSC medium, and three adipose tissue MSC populations deposited in the Gene Expression Omnibus under accession number GSE67747. Keywords: Mesenchymal stromal cells, Mesenchymal stem cells, Pericytes, Microarrays

  16. Comparative transcriptomic profiling of hydrogen peroxide signaling networks in zebrafish and human keratinocytes: Implications toward conservation, migration and wound healing.

    Science.gov (United States)

    Lisse, Thomas S; King, Benjamin L; Rieger, Sandra

    2016-02-05

    Skin wounds need to be repaired rapidly after injury to restore proper skin barrier function. Hydrogen peroxide (H2O2) is a conserved signaling factor that has been shown to promote a variety of skin wound repair processes, including immune cell migration, angiogenesis and sensory axon repair. Despite growing research on H2O2 functions in wound repair, the downstream signaling pathways activated by this reactive oxygen species in the context of injury remain largely unknown. The goal of this study was to provide a comprehensive analysis of gene expression changes in the epidermis upon exposure to H2O2 concentrations known to promote wound repair. Comparative transcriptome analysis using RNA-seq data from larval zebrafish and previously reported microarray data from a human epidermal keratinocyte line shows that H2O2 activates conserved cell migration, adhesion, cytoprotective and anti-apoptotic programs in both zebrafish and human keratinocytes. Further assessment of expression characteristics and signaling pathways revealed the activation of three major H2O2-dependent pathways, EGF, FOXO1, and IKKα. This study expands on our current understanding of the clinical potential of low-level H2O2 for the promotion of epidermal wound repair and provides potential candidates in the treatment of wound healing deficits.

  17. Transposable elements in the Anopheles funestus transcriptome.

    Science.gov (United States)

    Fernández-Medina, Rita D; Carareto, Claudia M A; Struchiner, Cláudio J; Ribeiro, José M C

    2017-06-01

    Transposable elements (TEs) are present in most of the eukaryotic genomes and their impact on genome evolution is increasingly recognized. Although there is extensive information on the TEs present in several eukaryotic genomes, less is known about the expression of these elements at the transcriptome level. Here we present a detailed analysis regarding the expression of TEs in Anopheles funestus, the second most important vector of human malaria in Africa. Several transcriptionally active TE families belonging both to Class I and II were identified and characterized. Interestingly, we have identified a full-length putative active element (including the presence of full length TIRs in the genomic sequence) belonging to the hAT superfamily, which presents active members in other insect genomes. This work contributes to a comprehensive understanding of the landscape of transposable elements in A. funestus transcriptome. Our results reveal that TEs are abundant and diverse in the mosquito and that most of the TE families found in the genome are represented in the mosquito transcriptome, a fact that could indicate activity of these elements. The vast diversity of TEs expressed in A. funestus suggests that there is ongoing amplification of several families in this organism.

  18. De novo transcriptome sequencing and analysis of the cereal cyst nematode, Heterodera avenae.

    Directory of Open Access Journals (Sweden)

    Mukesh Kumar

    Full Text Available The cereal cyst nematode (CCN, Heterodera avenae) is a major pest of wheat (Triticum spp) that reduces crop yields in many countries. Cyst nematodes are obligate sedentary endoparasites that reproduce by amphimixis. Here, we report the first transcriptome analysis of two stages of H. avenae. After sequencing extracted RNA from pre-parasitic infective juvenile and adult stages of the life cycle, 131 million high-quality Illumina paired-end reads were obtained, which generated 27,765 contigs with an N50 of 1,028 base pairs, of which 10,452 were annotated. Comparative analyses were undertaken to evaluate H. avenae sequences with those of other plant, animal and free-living nematodes to identify differences in expressed genes. There were 4,431 transcripts common to H. avenae and the free-living nematode Caenorhabditis elegans, and 9,462 in common with the more closely related potato cyst nematode, Globodera pallida. Annotation of H. avenae carbohydrate active enzymes (CAZy) revealed fewer glycoside hydrolases (GHs) but more glycosyl transferases (GTs) and carbohydrate esterases (CEs) when compared to M. incognita. 1,280 transcripts were found to have a secretory signature (presence of a signal peptide and absence of a transmembrane domain). In a comparison of genes expressed in the pre-parasitic juvenile and feeding female stages, expression levels of 30 genes with high RPKM (reads per kilobase per million mapped reads) values were analysed by qRT-PCR, which confirmed the observed differences in their expression levels. In addition, we have also developed a user-friendly resource, the Heterodera transcriptome database (HATdb), for public access to the data generated in this study. The new data provided on the transcriptome of H. avenae add to the genetic resources available to study plant parasitic nematodes and provide an opportunity to seek new effectors that are specifically involved in the H. avenae-cereal host interaction.
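
    The N50 quoted in the abstract above is a contiguity statistic: the contig length L such that contigs of length L or longer together contain at least half of all assembled bases. A minimal Python sketch of that definition follows; the contig lengths are made-up toy values, not the H. avenae assembly, and the function name is an illustrative assumption.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (in bp)."""
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)   # longest contigs first
    half_total = sum(ordered) / 2             # half of all assembled bases
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:             # first contig to cross the halfway mark
            return length


# Toy example (hypothetical lengths): total = 5,500 bp, halfway = 2,750 bp.
toy_contigs = [2000, 1500, 1000, 500, 300, 200]
print(n50(toy_contigs))  # 1500
```

    Sorting from longest to shortest and stopping at the halfway mark is what makes N50 less sensitive than the mean to large numbers of very short contigs.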

  19. De novo transcriptome sequencing and analysis of the cereal cyst nematode, Heterodera avenae.

    Science.gov (United States)

    Kumar, Mukesh; Gantasala, Nagavara Prasad; Roychowdhury, Tanmoy; Thakur, Prasoon Kumar; Banakar, Prakash; Shukla, Rohit N; Jones, Michael G K; Rao, Uma

    2014-01-01

    The cereal cyst nematode (CCN, Heterodera avenae) is a major pest of wheat (Triticum spp) that reduces crop yields in many countries. Cyst nematodes are obligate sedentary endoparasites that reproduce by amphimixis. Here, we report the first transcriptome analysis of two stages of H. avenae. After sequencing extracted RNA from pre-parasitic infective juvenile and adult stages of the life cycle, 131 million high-quality Illumina paired-end reads were obtained, which generated 27,765 contigs with an N50 of 1,028 base pairs, of which 10,452 were annotated. Comparative analyses were undertaken to evaluate H. avenae sequences with those of other plant, animal and free-living nematodes to identify differences in expressed genes. There were 4,431 transcripts common to H. avenae and the free-living nematode Caenorhabditis elegans, and 9,462 in common with the more closely related potato cyst nematode, Globodera pallida. Annotation of H. avenae carbohydrate active enzymes (CAZy) revealed fewer glycoside hydrolases (GHs) but more glycosyl transferases (GTs) and carbohydrate esterases (CEs) when compared to M. incognita. 1,280 transcripts were found to have a secretory signature (presence of a signal peptide and absence of a transmembrane domain). In a comparison of genes expressed in the pre-parasitic juvenile and feeding female stages, expression levels of 30 genes with high RPKM (reads per kilobase per million mapped reads) values were analysed by qRT-PCR, which confirmed the observed differences in their expression levels. In addition, we have also developed a user-friendly resource, the Heterodera transcriptome database (HATdb), for public access to the data generated in this study. The new data provided on the transcriptome of H. avenae add to the genetic resources available to study plant parasitic nematodes and provide an opportunity to seek new effectors that are specifically involved in the H. avenae-cereal host interaction.
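
    RPKM, the expression measure mentioned in the abstract above, normalizes a raw read count by transcript length (in kilobases) and by sequencing depth (in millions of mapped reads). The sketch below shows the standard formula in Python; the read counts and lengths are invented for illustration and are not taken from the study, and the function name is an assumption.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and library size must be positive")
    # reads / (length in kb) / (library size in millions)
    # = reads * 1e9 / (length in bp * total mapped reads)
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Hypothetical transcript: 500 reads on a 2,000 bp transcript in a
# library of 10 million mapped reads.
print(rpkm(500, 2000, 10_000_000))  # 25.0
```

    Because both normalizations are simple divisions, RPKM values are comparable across transcripts of different lengths within one library, which is why high-RPKM genes are a natural shortlist for qRT-PCR confirmation.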

  20. Novel mouse model recapitulates genome and transcriptome alterations in human colorectal carcinomas.

    Science.gov (United States)

    McNeil, Nicole E; Padilla-Nash, Hesed M; Buishand, Floryne O; Hue, Yue; Ried, Thomas

    2017-03-01

    Human colorectal carcinomas are defined by a nonrandom distribution of genomic imbalances that are characteristic for this disease. Often, these imbalances affect entire chromosomes. Understanding the role of these aneuploidies for carcinogenesis is of utmost importance. Currently, established transgenic mice do not recapitulate the pathognomonic genome aberration profile of human colorectal carcinomas. We have developed a novel model based on the spontaneous transformation of murine colon epithelial cells. During this process, cells progress through stages of pre-immortalization, immortalization and, finally, transformation, and result in tumors when injected into immunocompromised mice. We analyzed our model for genome and transcriptome alterations using ArrayCGH, spectral karyotyping (SKY), and array based gene expression profiling. ArrayCGH revealed a recurrent pattern of genomic imbalances. These results were confirmed by SKY. Comparing these imbalances with orthologous maps of human chromosomes revealed a remarkable overlap. We observed focal deletions of the tumor suppressor genes Trp53 and Cdkn2a/p16. High-level focal genomic amplification included the locus harboring the oncogene Mdm2, which was confirmed by FISH in the form of double minute chromosomes. Array-based global gene expression revealed distinct differences between the sequential steps of spontaneous transformation. Gene expression changes showed significant similarities with human colorectal carcinomas. Pathways most prominently affected included genes involved in chromosomal instability and in epithelial to mesenchymal transition. Our novel mouse model therefore recapitulates the most prominent genome and transcriptome alterations in human colorectal cancer, and might serve as a valuable tool for understanding the dynamic process of tumorigenesis, and for preclinical drug testing. © 2016 Wiley Periodicals, Inc.

  1. Development of high temperature property database for Alloy 800H

    International Nuclear Information System (INIS)

    Yokoyama, Norio; Watanabe, Katsutoshi; Tsuji, Hirokazu; Nakajima, Hajime.

    1993-07-01

    The JAERI Material Performance Database (JMPD) has been under development at JAERI since 1989 with a view to utilizing the various kinds of characteristic data of nuclear materials efficiently. Using the relational database management system PLANNER on a mainframe, the JMPD provides a retrieval support system and graphic and statistical analysis systems. By the end of March 1993, 7868 sets of characteristic data on metallic materials, including fatigue crack growth data, had been stored in the JMPD. Alloy 800H, an iron-based superalloy, is used as the structural material of the control rods of the High Temperature Engineering Test Reactor (HTTR). Thermal stress generated at a reactor scram might cause severe creep damage. The control rods therefore need to be designed with consideration of the fracture modes induced by creep deformation after neutron irradiation. The creep data (approximately 240 sets) and tensile data (approximately 100 sets) of Alloy 800H, including the effects of test environment, aging treatment and neutron irradiation, have been stored in the JMPD. Furthermore, a high temperature property database for Alloy 800H has been developed on a personal computer. The present report outlines the development of the high temperature property database for Alloy 800H. (author)

  2. De novo transcriptome assembly of mangosteen (Garcinia mangostana L.) fruit

    Directory of Open Access Journals (Sweden)

    Deden Derajat Matra

    2016-12-01

    Full Text Available Garcinia mangostana L. (Mangosteen), of the family Clusiaceae, is one of the economically important tropical fruits in Indonesia. In the present study, we performed de novo transcriptomic analysis of Garcinia mangostana L. through RNA-Seq technology. We obtained the raw data from 12 libraries through the Ion Proton System. A total of 191,735,809 clean reads were obtained from 307,634,890 raw reads. The raw data obtained in this study are accessible in the DDBJ database under accession number DRA005014, with bioproject accession number PRJDB5091. We obtained 268,851 transcripts and 155,850 unigenes, with N50 values of 555 and 433 bp, respectively. Transcript/unigene length ranged from 201 to 5916 bp. The unigenes were annotated against two main databases, NCBI and UniProtKB, yielding 73,287 and 73,107 annotated sequences, respectively. These transcriptomic data will be beneficial for studying the transcriptome of Garcinia mangostana L.
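    The N50 statistic quoted for the transcripts and unigenes can be sketched as follows. This is a minimal illustration of the usual definition (the length L such that sequences of length at least L contain at least half of the total assembled bases); the function name and the toy lengths are hypothetical and do not come from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are sorted from longest to shortest and their lengths
    accumulated; N50 is the length of the sequence at which the running
    total first reaches half of the total assembled bases.
    """
    if not lengths:
        raise ValueError("at least one sequence length is required")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running_total = 0
    for length in ordered:
        running_total += length
        if running_total >= half_total:
            return length


# Toy assembly with five contigs (hypothetical lengths in bp).
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths))
```

    Because N50 weights sequences by their length, it is much less sensitive to a large number of very short contigs than the mean length is.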

  3. Characterization of the heart transcriptome of the white shark (Carcharodon carcharias).

    Science.gov (United States)

    Richards, Vincent P; Suzuki, Haruo; Stanhope, Michael J; Shivji, Mahmood S

    2013-10-11

    The white shark (Carcharodon carcharias) is a globally distributed, apex predator possessing physical, physiological, and behavioral traits that have garnered it significant public attention. In addition to interest in the genetic basis of its form and function, as a representative of the oldest extant jawed vertebrate lineage, white sharks are also of conservation concern due to their small population size and threat from overfishing. Despite this, surprisingly little is known about the biology of white sharks, and genomic resources are unavailable. To address this deficit, we combined Roche-454 and Illumina sequencing technologies to characterize the first transcriptome of any tissue for this species. From white shark heart cDNA we generated 665,399 Roche 454 reads (median length 387-bp) that were assembled into 141,626 contigs (mean length 503-bp). We also generated 78,566,588 Illumina reads, which we aligned to the 454 contigs producing 105,014 454/Illumina consensus sequences. To these, we added 3,432 non-singleton 454 contigs. By comparing these sequences to the UniProtKB/Swiss-Prot database we were able to annotate 21,019 translated open reading frames (ORFs) of ≥ 20 amino acids. Of these, 19,277 were additionally assigned Gene Ontology (GO) functional annotations. While acknowledging the limitations of our single tissue transcriptome, Fisher tests showed the white shark transcriptome to be significantly enriched for numerous metabolic GO terms compared to the zebra fish and human transcriptomes, with white shark showing more similarity to human than to zebra fish (i.e. fewer terms were significantly different). We also compared the transcriptome to other available elasmobranch sequences, for signatures of positive selection and identified several genes of putative adaptive significance on the white shark lineage. The white shark transcriptome also contained 8,404 microsatellites (dinucleotide, trinucleotide, or tetranucleotide motifs ≥ five perfect

  4. Transcriptome profiling in conifers and the PiceaGenExpress database show patterns of diversification within gene families and interspecific conservation in vascular gene expression

    Directory of Open Access Journals (Sweden)

    Raherison Elie

    2012-08-01

    Full Text Available Abstract Background Conifers have very large genomes (13 to 30 Gigabases) that are mostly uncharacterized although extensive cDNA resources have recently become available. This report presents a global overview of transcriptome variation in a conifer tree and documents conservation and diversity of gene expression patterns among major vegetative tissues. Results An oligonucleotide microarray was developed from Picea glauca and P. sitchensis cDNA datasets. It represents 23,853 unique genes and was shown to be suitable for transcriptome profiling in several species. A comparison of secondary xylem and phelloderm tissues showed that preferential expression in these vascular tissues was highly conserved among Picea spp. RNA-Sequencing strongly confirmed tissue preferential expression and provided a robust validation of the microarray design. A small database of transcription profiles called PiceaGenExpress was developed from over 150 hybridizations spanning eight major tissue types. In total, transcripts were detected for 92% of the genes on the microarray, in at least one tissue. Non-annotated genes were predominantly expressed at low levels in fewer tissues than genes of known or predicted function. Diversity of expression within gene families may be rapidly assessed from PiceaGenExpress. In conifer trees, dehydrins and late embryogenesis abundant (LEA) osmotic regulation proteins occur in large gene families compared to angiosperms. Strong contrasts and low diversity were observed in the dehydrin family, while diverse patterns suggested a greater degree of diversification among LEAs. Conclusion Together, the oligonucleotide microarray and the PiceaGenExpress database represent the first resource of this kind for gymnosperm plants. The spruce transcriptome analysis reported here is expected to accelerate genetic studies in the large and important group comprised of conifer trees.

  5. Transcriptome profiling of human pre-implantation development.

    Directory of Open Access Journals (Sweden)

    Pu Zhang

    Full Text Available BACKGROUND: Preimplantation development is a crucial step in early human development. However, the molecular basis of human preimplantation development is not well known. METHODOLOGY: By applying microarray on 397 human oocytes and embryos at six developmental stages, we studied the transcription dynamics during human preimplantation development. PRINCIPAL FINDINGS: We found that the preimplantation development consisted of two main transitions: from metaphase-II oocyte to 4-cell embryo where mainly the maternal genes were expressed, and from 8-cell embryo to blastocyst with down-regulation of the maternal genes and up-regulation of embryonic genes. Human preimplantation development proved relatively autonomous. Genes predominantly expressed in oocytes and embryos are well conserved during evolution. SIGNIFICANCE: Our database and findings provide fundamental resources for understanding

  6. Getting the most out of parasitic helminth transcriptomes using HelmDB: implications for biology and biotechnology.

    Science.gov (United States)

    Mangiola, Stefano; Young, Neil D; Korhonen, Pasi; Mondal, Alinda; Scheerlinck, Jean-Pierre; Sternberg, Paul W; Cantacessi, Cinzia; Hall, Ross S; Jex, Aaron R; Gasser, Robin B

    2013-12-01

    Compounded by a massive global food shortage, many parasitic diseases have a devastating, long-term impact on animal and human health and welfare worldwide. Parasitic helminths (worms) affect the health of billions of animals. Unlocking the systems biology of these neglected pathogens will underpin the design of new and improved interventions against them. Currently, the functional annotation of genomic and transcriptomic sequence data for socio-economically important parasitic worms relies almost exclusively on comparative bioinformatic analyses using model organism- and other databases. However, many genes and gene products of parasitic helminths (often >50%) cannot be annotated using this approach, because they are specific to parasites and/or do not have identifiable homologs in other organisms for which sequence data are available. This inability to fully annotate transcriptomes and predicted proteomes is a major challenge and constrains our understanding of the biology of parasites, interactions with their hosts and of parasitism and the pathogenesis of disease on a molecular level. In the present article, we compiled transcriptomic data sets of key, socioeconomically important parasitic helminths, and constructed and validated a curated database, called HelmDB (www.helmdb.org). We demonstrate how this database can be used effectively for the improvement of functional annotation by employing data integration and clustering. Importantly, HelmDB provides a practical and user-friendly toolkit for sequence browsing and comparative analyses among divergent helminth groups (including nematodes and trematodes), and should be readily adaptable and applicable to a wide range of other organisms. This web-based, integrative database should assist 'systems biology' studies of parasitic helminths, and the discovery and prioritization of novel drug and vaccine targets. 
This focus provides a pathway toward developing new and improved approaches for the treatment and control

  7. Transcriptome

    Science.gov (United States)

    ... Also: Talking Glossary of Genetic Terms Definitions for genetic terms used on this page En Español: Transcriptoma Transcriptome What is a transcriptome? What can a transcriptome tell us? How can transcriptome data be used to explore gene function? What is ...

  8. Dual Transcriptome Profiling of Leishmania-Infected Human Macrophages Reveals Distinct Reprogramming Signatures.

    Science.gov (United States)

    Fernandes, Maria Cecilia; Dillon, Laura A L; Belew, Ashton Trey; Bravo, Hector Corrada; Mosser, David M; El-Sayed, Najib M

    2016-05-10

    Macrophages are mononuclear phagocytes that constitute a first line of defense against pathogens. While lethal to many microbes, they are the primary host cells of Leishmania spp. parasites, the obligate intracellular pathogens that cause leishmaniasis. We conducted transcriptomic profiling of two Leishmania species and the human macrophage over the course of intracellular infection by using high-throughput RNA sequencing to characterize the global gene expression changes and reprogramming events that underlie the interactions between the pathogen and its host. A systematic exclusion of the generic effects of large-particle phagocytosis revealed a vigorous, parasite-specific response of the human macrophage early in the infection that was greatly tempered at later time points. An analogous temporal expression pattern was observed with the parasite, suggesting that much of the reprogramming that occurs as parasites transform into intracellular forms generally stabilizes shortly after entry. Following that, the parasite establishes an intracellular niche within macrophages, with minimal communication between the parasite and the host cell later during the infection. No significant difference was observed between parasite species transcriptomes or in the transcriptional response of macrophages infected with each species. Our comparative analysis of gene expression changes that occur as mouse and human macrophages are infected by Leishmania spp. points toward a general signature of the Leishmania-macrophage infectome. Little is known about the transcriptional changes that occur within mammalian cells harboring intracellular pathogens. This study characterizes the gene expression signatures of Leishmania spp. parasites and the coordinated response of infected human macrophages as the pathogen enters and persists within them. 
After accounting for the generic effects of large-particle phagocytosis, we observed a parasite-specific response of the human macrophages early in

  9. The transcriptome of the Didelphis virginiana opossum kidney OK proximal tubule cell line.

    Science.gov (United States)

    Eshbach, Megan L; Sethi, Rahil; Avula, Raghunandan; Lamb, Janette; Hollingshead, Deborah J; Finegold, David N; Locker, Joseph D; Chandran, Uma R; Weisz, Ora A

    2017-09-01

    The OK cell line derived from the kidney of a female opossum Didelphis virginiana has proven to be a useful model in which to investigate the unique regulation of ion transport and membrane trafficking mechanisms in the proximal tubule (PT). Sequence data and comparison of the transcriptome of this cell line to eutherian mammal PTs would further broaden the utility of this culture model. However, the genomic sequence for D. virginiana is not available and although a draft genome sequence for the opossum Monodelphis domestica (sequenced in 2012 by the Broad Institute) exists, transcripts sequenced from both species show significant divergence. The M. domestica sequence is not highly annotated, and the majority of transcripts are predicted rather than experimentally validated. Using deep RNA sequencing of the D. virginiana OK cell line, we characterized its transcriptome via de novo transcriptome assembly and alignment to the M. domestica genome. The quality of the de novo assembled transcriptome was assessed by the extent of homology to sequences in nucleotide and protein databases. Gene expression levels in the OK cell line, from both the de novo transcriptome and genes aligned to the M. domestica genome, were compared with publicly available rat kidney nephron segment expression data. Our studies demonstrate the expression in OK cells of numerous PT-specific ion transporters and other key proteins relevant for rodent and human PT function. Additionally, the sequence and expression data reported here provide an important resource for genetic manipulation and other studies on PT cell function using these cells. Copyright © 2017 the American Physiological Society.

  10. Human more complex than mouse at cellular level.

    Directory of Open Access Journals (Sweden)

    Alexander E Vinogradov

    Full Text Available The family of transcription factors with the C2H2 zinc finger domain is expanding in the evolution of vertebrates, reaching its highest numbers in the mammals. The question arises whether the increased number of these transcription factors is related to embryogenesis, the nervous system, or pathology, or whether more of them are expressed in individual cells. Among mammals, the primates have a more complex anatomical structure than the rodents (e.g., brain). In this work, I show that a greater number of C2H2-ZF genes are expressed in the human cells than in the mouse cells. The effect is especially pronounced for C2H2-ZF genes accompanied with the KRAB domain. The relative difference between the numbers of C2H2-ZF(-KRAB) genes in the human and mouse cellular transcriptomes even exceeds their difference in the genomes (i.e. a greater subset of the genes existing in the genome is expressed in the human cellular transcriptomes compared to the mouse transcriptomes). The evolutionary turnover of C2H2-ZF(-KRAB) genes acts in the direction of the revealed phenomenon, i.e. gene duplication and loss enhances the difference in the relative number of C2H2-ZF(-KRAB) genes between human and mouse cellular transcriptomes. A higher number of these genes is expressed in the brain and embryonic cells (compared with other tissues), whereas a lower number is expressed in the cancer cells. It is specifically the C2H2-ZF transcription factors whose repertoire is poorer in the cancer and richer in the brain (other transcription factors taken together do not show this trend). These facts suggest that increase of anatomical complexity is accompanied by a more complex intracellular regulation involving these transcription factors. Malignization is associated with simplification of this regulation. These results agree with the known fact that human cells are more resistant to oncogenic transformation than mouse cells.
The list of C2H2-ZF genes whose suppression might be involved in malignization is provided.

  11. Nodeomics: pathogen detection in vertebrate lymph nodes using meta-transcriptomics.

    Directory of Open Access Journals (Sweden)

    Nicola E Wittekindt

    Full Text Available The ongoing emergence of human infections originating from wildlife highlights the need for better knowledge of the microbial community in wildlife species where traditional diagnostic approaches are limited. Here we evaluate the microbial biota in healthy mule deer (Odocoileus hemionus) by analyses of lymph node meta-transcriptomes. cDNA libraries from five individuals and two pools of samples were prepared from retropharyngeal lymph node RNA enriched for polyadenylated RNA and sequenced using Roche-454 Life Sciences technology. Protein-coding and 16S ribosomal RNA (rRNA) sequences were taxonomically profiled using protein and rRNA specific databases. Representatives of all bacterial phyla were detected in the seven libraries based on protein-coding transcripts indicating that viable microbiota were present in lymph nodes. Residents of skin and rumen, and those ubiquitous in mule deer habitat dominated classifiable bacterial species. Based on detection of both rRNA and protein-coding transcripts, we identified two new proteobacterial species; a Helicobacter closely related to Helicobacter cetorum in the Helicobacter pylori/Helicobacter acinonychis complex and an Acinetobacter related to Acinetobacter schindleri. Among viruses, a novel gamma retrovirus and other members of the Poxviridae and Retroviridae were identified. We additionally evaluated bacterial diversity by amplicon sequencing the hypervariable V6 region of 16S rRNA and demonstrate that overall taxonomic diversity is higher with the meta-transcriptomic approach. These data provide the most complete picture to date of the microbial diversity within a wildlife host. Our research advances the use of meta-transcriptomics to study microbiota in wildlife tissues, which will facilitate detection of novel organisms with pathogenic potential to human and animals.

  12. A trending database for human performance events

    International Nuclear Information System (INIS)

    Harrison, D.

    1993-01-01

    An effective Operations Experience program includes a standardized methodology for the investigation of unplanned events and a tool capable of retaining investigation data for the purpose of trending analysis. A database used in conjunction with a formalized investigation procedure for the purpose of trending unplanned event data is described. The database follows the structure of INPO's Human Performance Enhancement System (HPES) for investigations. The database screens duplicate the HPES evaluation forms on-line. All information pertaining to investigations is collected, retained and entered into the database using these forms. The database will be used for trending analysis to determine if any significant patterns exist, for tracking progress over time both within AECL and against industry standards, and for evaluating the success of corrective actions. Trending information will be used to help prevent similar occurrences.

  13. A SAGE based approach to human glomerular endothelium: defining the transcriptome, finding a novel molecule and highlighting endothelial diversity.

    Science.gov (United States)

    Sengoelge, Guerkan; Winnicki, Wolfgang; Kupczok, Anne; von Haeseler, Arndt; Schuster, Michael; Pfaller, Walter; Jennings, Paul; Weltermann, Ansgar; Blake, Sophia; Sunder-Plassmann, Gere

    2014-08-27

    Large scale transcript analysis of human glomerular microvascular endothelial cells (HGMEC) has never been accomplished. We designed this study to define the transcriptome of HGMEC and facilitate a better characterization of these endothelial cells with unique features. Serial analysis of gene expression (SAGE) was used for its unbiased approach to quantitative acquisition of transcripts. We generated a HGMEC SAGE library consisting of 68,987 transcript tags. Then taking advantage of large public databases and advanced bioinformatics we compared the HGMEC SAGE library with a SAGE library of non-cultured ex vivo human glomeruli (44,334 tags) which contained endothelial cells. The 823 tags common to both which would have the potential to be expressed in vivo were subsequently checked against 822,008 tags from 16 non-glomerular endothelial SAGE libraries. This resulted in 268 transcript tags differentially overexpressed in HGMEC compared to non-glomerular endothelia. These tags were filtered using a set of criteria: never before shown in kidney or any type of endothelial cell, absent in all nephron regions except the glomerulus, more highly expressed than statistically expected in HGMEC. Neurogranin, a direct target of thyroid hormone action which had been thought to be brain specific and never shown in endothelial cells before, fulfilled these criteria. Its expression in glomerular endothelium in vitro and in vivo was then verified by real-time-PCR, sequencing and immunohistochemistry. Our results represent an extensive molecular characterization of HGMEC beyond a mere database, underline the endothelial heterogeneity, and propose neurogranin as a potential link in the kidney-thyroid axis.
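    The tag-filtering steps described in this abstract (keep tags shared by the cultured and ex vivo libraries, then retain those overexpressed relative to non-glomerular endothelia) can be sketched with set operations. The abstract does not state which statistic was used, so the fold-change threshold, the function names and the toy tag counts below are all assumptions made for illustration only.

```python
def tags_per_million(counts):
    """Normalize raw SAGE tag counts to tags per million."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tag: n * 1_000_000 / total for tag, n in counts.items()}


def overexpressed_shared_tags(cultured, ex_vivo, reference, min_fold=2.0):
    """Tags present in both libraries and enriched versus a reference.

    cultured, ex_vivo, reference: dicts mapping tag sequence -> raw count.
    min_fold is a hypothetical cut-off, not a value from the study.
    """
    shared = set(cultured) & set(ex_vivo)           # step 1: common tags
    cultured_tpm = tags_per_million(cultured)
    reference_tpm = tags_per_million(reference)
    selected = []
    for tag in shared:                               # step 2: compare
        ref_level = reference_tpm.get(tag, 0.0)
        if ref_level == 0.0 or cultured_tpm[tag] / ref_level >= min_fold:
            selected.append(tag)
    return sorted(selected)


# Toy libraries (hypothetical tags and counts).
cultured_library = {"AAAA": 50, "CCCC": 30, "GGGG": 20}
ex_vivo_library = {"AAAA": 5, "CCCC": 5, "TTTT": 10}
reference_library = {"AAAA": 10, "CCCC": 80, "TTTT": 10}
print(overexpressed_shared_tags(cultured_library, ex_vivo_library,
                                reference_library))
```

    Normalizing to tags per million before comparing is needed because the libraries differ greatly in size (68,987 and 44,334 tags versus 822,008 pooled reference tags in the study).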

  14. RNA-seq analysis and de novo transcriptome assembly of Jerusalem artichoke (Helianthus tuberosus Linne).

    Science.gov (United States)

    Jung, Won Yong; Lee, Sang Sook; Kim, Chul Wook; Kim, Hyun-Soon; Min, Sung Ran; Moon, Jae Sun; Kwon, Suk-Yoon; Jeon, Jae-Heung; Cho, Hye Sun

    2014-01-01

    Jerusalem artichoke (Helianthus tuberosus L.) has long been cultivated as a vegetable and as a source of fructans (inulin) for pharmaceutical applications in diabetes and obesity prevention. However, transcriptomic and genomic data for Jerusalem artichoke remain scarce. In this study, Illumina RNA sequencing (RNA-Seq) was performed on samples from Jerusalem artichoke leaves, roots, stems and two different tuber tissues (early and late tuber development). Data were used for de novo assembly and characterization of the transcriptome. In total 206,215,632 paired-end reads were generated. These were assembled into 66,322 loci with 272,548 transcripts. Loci were annotated by querying against the NCBI non-redundant, Phytozome and UniProt databases, and 40,215 loci were homologous to existing database sequences. Gene Ontology terms were assigned to 19,848 loci, 15,434 loci were matched to 25 Clusters of Eukaryotic Orthologous Groups classifications, and 11,844 loci were classified into 142 Kyoto Encyclopedia of Genes and Genomes pathways. The assembled loci also contained 10,778 potential simple sequence repeats. The newly assembled transcriptome was used to identify loci with tissue-specific differential expression patterns. In total, 670 loci exhibited tissue-specific expression, and a subset of these were confirmed using RT-PCR and qRT-PCR. Gene expression related to inulin biosynthesis in tuber tissue was also investigated. Existing genetic and genomic data for H. tuberosus are scarce. The sequence resources developed in this study will enable the analysis of thousands of transcripts and will thus accelerate marker-assisted breeding studies and studies of inulin biosynthesis in Jerusalem artichoke.

  15. Investigation on structuring the human body function database; Shintai kino database no kochiku ni kansuru chosa kenkyu

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1995-03-01

    Based on the concept of a human life engineering database, a study was made of how to adapt such a database technically to old people in the aging society. It was proposed that an old people's human life engineering database should be prepared to serve the development and design of life technology for the aging society. A method of structuring the database was established using 'bathing' and 'going out' as case-study actions from the daily life of old people. As a result of the study, it was proposed that the old people's human body function database should be prepared as an R and D base for life technology in the aged society. Based on this proposal, a master plan was mapped out for structuring the database, and a concrete method for putting it into action was studied. In the first investigation stage, documentation was compiled using existing documentary databases, and enterprises were interviewed. About 500 documents pertaining to the functions of old people were extracted, with many vague points not yet clarified. The investigation will restart in the next fiscal year. 4 refs., 38 figs., 30 tabs.

  16. The dormant and the fully competent oocyte: comparing the transcriptome of human oocytes from primordial follicles and in metaphase II

    DEFF Research Database (Denmark)

    Grøndahl, Marie Louise; Borup, Rehannah; Vikeså, Jonas

    2013-01-01

    Oocytes become enclosed in primordial follicles during fetal life and remain dormant there until activation followed by growth and meiotic resumption. Current knowledge about the molecular pathways involved in oogenesis is incomplete. This study identifies the specific transcriptome of the human...... oocyte in the quiescent state and at the pinnacle of maturity at ovulation. In silico bioinformatic comparisons were made between the transcriptome of human oocytes from dormant primordial follicles and that of human metaphase II (MII) oocytes and granulosa cells and unique gene expression profiles were...... identified as well as functional and pathway enrichments associated with the oocytes from the two developmental hallmarks. A total of 729 genes were highly enriched in oocytes from primordial follicles and 1456 genes were highly enriched in MII oocytes (>10-fold, P...

  17. Transcriptome analysis and anthocyanin-related genes in red leaf lettuce.

    Science.gov (United States)

    Zhang, Y Z; Xu, S Z; Cheng, Y W; Ya, H Y; Han, J M

    2016-01-29

    This study aimed to analyze the transcriptome profile of red lettuce and identify the genes involved in anthocyanin accumulation. Red leaf lettuce is a popular vegetable, valued for its high anthocyanin content. However, there is limited information available about the genes involved in anthocyanin biosynthesis in this species. In this study, transcriptomes of 15-day-old seedlings and 40-day-old red lettuce leaves were analyzed using an Illumina HiSeq™ 2500 platform. A total of 10.6 GB of clean data were obtained and de novo assembled into 83,333 unigenes with an N50 of 1067. After annotation against public databases, 51,850 unigene sequences were identified, among which 46,087 were annotated in the NCBI non-redundant protein database, and 41,752 were annotated in the Swiss-Prot database. A total of 9125 unigenes were mapped into 163 pathways using the Kyoto Encyclopedia of Genes and Genomes database. Thirty-four structural genes were found to cover the main steps of the anthocyanin pathway, including chalcone synthase, chalcone isomerase, flavanone 3-hydroxylase, flavonoid 3'-hydroxylase, flavonoid 3',5'-hydroxylase, dihydroflavonol 4-reductase, and anthocyanidin synthase. Seven MYB, three bHLH, and two WD40 genes, considered anthocyanin regulatory genes, were also identified. In addition, 3607 simple sequence repeat (SSR) markers were identified from 2916 unigenes. This research uncovered the transcriptomic characteristics of red leaf lettuce seedlings and mature plants. The identified candidate genes related to anthocyanin biosynthesis and the detected SSRs provide useful tools for future molecular breeding studies.
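    SSR detection of the kind reported here can be sketched with a regular expression that looks for perfect tandem repeats. The abstract does not give the search criteria, so the motif lengths (2 to 4 bp), the minimum of five repeats, the function name and the toy sequence are assumptions for illustration; dedicated SSR tools apply more elaborate rules.

```python
import re


def find_ssrs(sequence, min_repeats=5, motif_lengths=(2, 3, 4)):
    """Return (start, motif, repeat_count) for perfect tandem repeats.

    Motifs that are themselves repeats of a shorter unit (e.g. "AA" or
    "ATAT") are skipped so each repeat is reported once, by its
    shortest unit.
    """
    seq = sequence.upper()
    hits = []
    for k in motif_lengths:
        # One motif of length k followed by at least min_repeats - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # A string is a repeat of a shorter unit exactly when it occurs
            # inside its own doubling with the first and last characters removed.
            if motif in (motif + motif)[1:-1]:
                continue
            hits.append((match.start(), motif, len(match.group(0)) // k))
    return sorted(hits)


# Toy sequence (hypothetical): an (AT)6 and a (CAG)5 repeat with spacers.
toy_sequence = "GGC" + "AT" * 6 + "CCT" + "CAG" * 5 + "T"
print(find_ssrs(toy_sequence))
```

    Reported start positions are 0-based offsets into the input sequence.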

  18. Transcriptome dynamics of human pluripotent stem cell-derived contracting cardiomyocytes using an embryoid body model with fetal bovine serum.

    Science.gov (United States)

    Jung, Kwang Bo; Son, Ye Seul; Lee, Hana; Jung, Cho-Rok; Kim, Janghwan; Son, Mi-Young

    2017-07-25

    Cardiomyocyte (CM) differentiation techniques for generating adult-like mature CMs remain imperfect, and the plausible underlying mechanisms remain unclear; however, there are a number of current protocols available. Here, to explore the mechanisms controlling cardiac differentiation, we analyzed the genome-wide transcription dynamics occurring during the differentiation of human pluripotent stem cells (hPSCs) into CMs using embryoid body (EB) formation. We optimized and updated the protocol to efficiently generate contracting CMs from hPSCs by adding fetal bovine serum (FBS) as a medium supplement, which could have a significant impact on the efficiency of cardiac differentiation. To identify genes, biological processes, and pathways involved in the cardiac differentiation of hPSCs, integrative and comparative analyses of the transcriptome profiles of differentiated CMs from hPSCs and of control CMs of the adult human heart (CM-AHH) were performed using gene ontology, functional annotation clustering, and pathway analyses. Several genes commonly regulated in the differentiated CMs and CM-AHH were enriched in pathways related to cell cycle and nucleotide metabolism. Strikingly, we found that current differentiation protocols did not promote sufficient expression of genes involved in oxidative phosphorylation to differentiate CMs from hPSCs compared to the expression levels in CM-AHH. Therefore, to obtain mature CMs similar to CM-AHH, these deficient pathways in CM differentiation, such as energy-related pathways, must be augmented prior to use for in vitro and in vivo applications. This approach opens up new avenues for facilitating the utilization of hPSC-derived CMs in biomedical research, drug evaluation, and clinical applications for patients with cardiac failure.

  19. Transcriptome analysis in cotton boll weevil (Anthonomus grandis) and RNA interference in insect pests.

    Directory of Open Access Journals (Sweden)

    Alexandre Augusto Pereira Firmino

    Full Text Available Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which have been applied in the last few years. For this insect, there is not much molecular information available in databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500,000 reads and a data set of 20,841 high-quality contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against the NCBI non-redundant protein database and 65.7% were similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families' data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in the RNAi mechanism were found. PAZ domain sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed live larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of the RNAi mechanism in insects.

  20. Transcriptome analysis in cotton boll weevil (Anthonomus grandis) and RNA interference in insect pests.

    Science.gov (United States)

    Firmino, Alexandre Augusto Pereira; Fonseca, Fernando Campos de Assis; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; Antonino de Souza, José Dijair; Togawa, Roberto Coiti; Silva-Junior, Orzenil Bonfim; Pappas, Georgios Joannis; da Silva, Maria Cristina Mattar; Engler, Gilbert; Grossi-de-Sa, Maria Fatima

    2013-01-01

    Cotton plants are subjected to the attack of several insect pests. In Brazil, the cotton boll weevil, Anthonomus grandis, is the most important cotton pest. The use of insecticidal proteins and gene silencing by interference RNA (RNAi) as techniques for insect control are promising strategies, which have been applied in the last few years. For this insect, there is not much molecular information available in databases. Using 454-pyrosequencing methodology, the transcriptome of all developmental stages of the insect pest, A. grandis, was analyzed. The A. grandis transcriptome analysis resulted in more than 500,000 reads and a data set of 20,841 high-quality contigs. After sequence assembly and annotation, around 10,600 contigs had at least one BLAST hit against the NCBI non-redundant protein database and 65.7% were similar to Tribolium castaneum sequences. A comparison of A. grandis, Drosophila melanogaster and Bombyx mori protein families' data showed higher similarity to dipteran than to lepidopteran sequences. Several contigs of genes encoding proteins involved in the RNAi mechanism were found. PAZ domain sequences extracted from the transcriptome showed high similarity and conservation for the most important functional and structural motifs when compared to PAZ domains from 5 species. Two SID-like contigs were phylogenetically analyzed and grouped with T. castaneum SID-like proteins. No RdRP gene was found. A contig matching chitin synthase 1 was mined from the transcriptome. dsRNA microinjection of a chitin synthase gene to A. grandis female adults resulted in normal oviposition of unviable eggs and malformed live larvae that were unable to develop in artificial diet. This is the first study that characterizes the transcriptome of the coleopteran, A. grandis. A new and representative transcriptome database for this insect pest is now available. All data support the state of the art of the RNAi mechanism in insects.

  1. Detailed transcriptome description of the neglected cestode Taenia multiceps.

    Science.gov (United States)

    Wu, Xuhang; Fu, Yan; Yang, Deying; Zhang, Runhui; Zheng, Wanpeng; Nie, Huaming; Xie, Yue; Yan, Ning; Hao, Guiying; Gu, Xiaobin; Wang, Shuxian; Peng, Xuerong; Yang, Guangyou

    2012-01-01

    The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS) of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. We obtained a total of 31,282 unigenes (mean length 920 bp) using Illumina paired-end sequencing technology and a new Trinity de novo assembler without a reference genome. Individual transcription molecules were determined by sequence-based annotations and/or domain-based annotations against public databases (Nr, UniprotKB/Swiss-Prot, COG, KEGG, UniProtKB/TrEMBL, InterPro and Pfam). We identified 26,110 (83.47%) unigenes and inferred 20,896 (66.8%) coding sequences (CDS). Further comparative transcripts analysis with other cestodes (Taenia pisiformis, Taenia solium, Echinococcus granulosus and Echinococcus multilocularis) and intestinal parasites (Trichinella spiralis, Ancylostoma caninum and Ascaris suum) showed that 5,100 common genes were shared among three Taenia tapeworms, 261 conserved genes were detected among five Taeniidae cestodes, and 109 common genes were found in four zoonotic intestinal parasites. Some of the common genes were genes required for parasite survival, involved in parasite-host interactions. In addition, we amplified two full-length CDS of unigenes from the common genes using RT-PCR. This study provides an extensive transcriptome of the adult stage of T. multiceps, and demonstrates that comparative transcriptomic investigations deserve to be further studied. This transcriptome dataset forms a substantial public information platform to achieve a fundamental understanding of the biology of T. multiceps, and helps in the identification of drug targets and

  2. Detailed transcriptome description of the neglected cestode Taenia multiceps.

    Directory of Open Access Journals (Sweden)

    Xuhang Wu

    Full Text Available BACKGROUND: The larval stage of Taenia multiceps, a global cestode, encysts in the central nervous system (CNS) of sheep and other livestock. This frequently leads to their death and huge socioeconomic losses, especially in developing countries. This parasite can also cause zoonotic infections in humans, but has been largely neglected due to a lack of diagnostic techniques and studies. Recent developments in next-generation sequencing provide an opportunity to explore the transcriptome of T. multiceps. METHODOLOGY/PRINCIPAL FINDINGS: We obtained a total of 31,282 unigenes (mean length 920 bp) using Illumina paired-end sequencing technology and a new Trinity de novo assembler without a reference genome. Individual transcription molecules were determined by sequence-based annotations and/or domain-based annotations against public databases (Nr, UniprotKB/Swiss-Prot, COG, KEGG, UniProtKB/TrEMBL, InterPro and Pfam). We identified 26,110 (83.47%) unigenes and inferred 20,896 (66.8%) coding sequences (CDS). Further comparative transcripts analysis with other cestodes (Taenia pisiformis, Taenia solium, Echinococcus granulosus and Echinococcus multilocularis) and intestinal parasites (Trichinella spiralis, Ancylostoma caninum and Ascaris suum) showed that 5,100 common genes were shared among three Taenia tapeworms, 261 conserved genes were detected among five Taeniidae cestodes, and 109 common genes were found in four zoonotic intestinal parasites. Some of the common genes were genes required for parasite survival, involved in parasite-host interactions. In addition, we amplified two full-length CDS of unigenes from the common genes using RT-PCR. CONCLUSIONS/SIGNIFICANCE: This study provides an extensive transcriptome of the adult stage of T. multiceps, and demonstrates that comparative transcriptomic investigations deserve to be further studied. This transcriptome dataset forms a substantial public information platform to achieve a fundamental understanding of

  3. Simulated night shift work induces circadian misalignment of the human peripheral blood mononuclear cell transcriptome.

    Science.gov (United States)

    Kervezee, Laura; Cuesta, Marc; Cermakian, Nicolas; Boivin, Diane B

    2018-05-22

    Misalignment of the endogenous circadian timing system leads to disruption of physiological rhythms and may contribute to the development of the deleterious health effects associated with night shift work. However, the molecular underpinnings remain to be elucidated. Here, we investigated the effect of a 4-day simulated night shift work protocol on the circadian regulation of the human transcriptome. Repeated blood samples were collected over two 24-hour measurement periods from eight healthy subjects under highly controlled laboratory conditions before and 4 days after a 10-hour delay of their habitual sleep period. RNA was extracted from peripheral blood mononuclear cells to obtain transcriptomic data. Cosinor analysis revealed a marked reduction of significantly rhythmic transcripts in the night shift condition compared with baseline at group and individual levels. Subsequent analysis using a mixed-effects model selection approach indicated that this decrease is mainly due to dampened rhythms rather than to a complete loss of rhythmicity: 73% of transcripts rhythmically expressed at baseline remained rhythmic during the night shift condition with a similar phase relative to habitual bedtimes, but with lower amplitudes. Functional analysis revealed that key biological processes are affected by the night shift protocol, most notably the natural killer cell-mediated immune response and Jun/AP1 and STAT pathways. These results show that 4 days of simulated night shifts lead to a loss in temporal coordination between the human circadian transcriptome and the external environment and impact biological processes related to the adverse health effects associated with night shift work.
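    For readers unfamiliar with the cosinor analysis mentioned in this abstract, the idea can be sketched as fitting y(t) = M + A·cos(2πt/T − φ) to a time series. The snippet below is only an illustrative single-component fit, not the study's pipeline: the function name `cosinor_even` is hypothetical, and it assumes the samples are evenly spaced over exactly one period, in which case the least-squares solution has a closed form.

```python
import math

def cosinor_even(values):
    """Single-component cosinor fit for evenly spaced samples.

    Assumption (hypothetical setup, not from the abstract): values[k] was
    measured at time k * T / n, i.e. n evenly spaced samples over one period T.
    Under that assumption the cosine and sine regressors are orthogonal, so the
    least-squares estimates reduce to simple sums.
    Returns (mesor, amplitude, acrophase_in_radians).
    """
    n = len(values)
    if n < 3:
        raise ValueError("need at least 3 samples per period")
    mesor = sum(values) / n
    # Coefficients of cos(wt) and sin(wt) in y = M + b*cos(wt) + g*sin(wt)
    beta = 2.0 / n * sum(v * math.cos(2 * math.pi * k / n) for k, v in enumerate(values))
    gamma = 2.0 / n * sum(v * math.sin(2 * math.pi * k / n) for k, v in enumerate(values))
    amplitude = math.hypot(beta, gamma)
    acrophase = math.atan2(gamma, beta)  # phase of the peak, in radians
    return mesor, amplitude, acrophase

# Synthetic example: 12 samples over a 24-hour period, peak at 6 h (phase pi/2)
samples = [10 + 2 * math.cos(2 * math.pi * k / 12 - math.pi / 2) for k in range(12)]
mesor, amplitude, acrophase = cosinor_even(samples)
peak_hour = acrophase / (2 * math.pi) * 24
```

    A dampened rhythm of the kind the abstract describes would show up here as a smaller fitted amplitude with a similar acrophase. Unevenly sampled data would need a general least-squares fit instead of these closed-form sums.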

  4. CyanoEXpress: A web database for exploration and visualisation of the integrated transcriptome of cyanobacterium Synechocystis sp. PCC6803.

    Science.gov (United States)

    Hernandez-Prieto, Miguel A; Futschik, Matthias E

    2012-01-01

    Synechocystis sp. PCC6803 is one of the best studied cyanobacteria and an important model organism for our understanding of photosynthesis. The early availability of its complete genome sequence initiated numerous transcriptome studies, which have generated a wealth of expression data. Analysis of the accumulated data can be a powerful tool to study transcription in a comprehensive manner and to reveal underlying regulatory mechanisms, as well as to annotate genes whose functions are yet unknown. However, use of divergent microarray platforms, as well as distributed data storage make meta-analyses of Synechocystis expression data highly challenging, especially for researchers with limited bioinformatic expertise and resources. To facilitate utilisation of the accumulated expression data for a wider research community, we have developed CyanoEXpress, a web database for interactive exploration and visualisation of transcriptional response patterns in Synechocystis. CyanoEXpress currently comprises expression data for 3073 genes and 178 environmental and genetic perturbations obtained in 31 independent studies. At present, CyanoEXpress constitutes the most comprehensive collection of expression data available for Synechocystis and can be freely accessed. The database is available for free at http://cyanoexpress.sysbiolab.eu.

  5. Transcriptome characterization of the South African abalone Haliotis midae using sequencing-by-synthesis

    Directory of Open Access Journals (Sweden)

    Roodt-Wilding Rouvay

    2011-03-01

    Full Text Available Abstract Background Worldwide, the genus Haliotis is represented by 56 extant species and several of these are commercially cultured. Among the six abalone species found in South Africa, Haliotis midae is the only aquacultured species. Despite its economic importance, genomic sequence resources for H. midae, and for abalone in general, are still scarce. Next generation sequencing technologies provide a fast and efficient tool to generate large sequence collections that can be used to characterize the transcriptome and identify expressed genes associated with economically important traits like growth and disease resistance. Results More than 25 million short reads generated by the Illumina Genome Analyzer were de novo assembled in 22,761 contigs with an average size of 260 bp. With a stringent E-value threshold of 10^-10, 3,841 contigs (16.8%) had a BLAST homologous match against the Genbank non-redundant (NR) protein database. Most of these sequences were annotated using the gene ontology (GO) and eukaryotic orthologous groups of proteins (KOG) databases and assigned to various functional categories. According to annotation results, many gene families involved in immune response were identified. Thousands of simple sequence repeats (SSR) and single nucleotide polymorphisms (SNP) were detected. Setting stringent parameters to ensure a high probability of amplification, 420 primer pairs in 181 contigs containing SSR loci were designed. Conclusion This data represents the most comprehensive genomic resource for the South African abalone H. midae to date. The amount of assembled sequences demonstrated the utility of the Illumina sequencing technology in the transcriptome characterization of a non-model species. It allowed the development of several markers and the identification of promising candidate genes for future studies on population and functional genomics in H. midae and in other abalone species.

  6. Transcriptome profiling of Curcuma longa L. cv. Suvarna

    Directory of Open Access Journals (Sweden)

    Ambika Sahoo

    2016-12-01

    Full Text Available Turmeric is an economically valued crop that, because of its utility in the food and pharmaceutical industries and in Ayurvedic medicine, attracts attention in many areas of research. In the present study, we executed resequencing through transcriptome assembly of the turmeric cultivar Suvarna (CL_Suv_10). Resequencing of the Suvarna variety generated 5 Gbases of raw data with 75 bp paired-end sequences. The raw data have been submitted to the SRA database of NCBI with accession number SRR4042181. Reads were assembled using the Cufflinks-2.2.1 tool, which yielded 42994 transcripts. The length of transcripts ranged from 83 to 15565, with an N50 value of 1216 and a median transcript length of 773. The transcripts were annotated through a number of databases. This is the first transcriptome profiling of cultivar Suvarna, which could help towards identification of single nucleotide polymorphisms (SNPs) between Suvarna and other turmeric cultivars for its authentic identification.
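    The N50 statistic quoted in this abstract is the length L such that sequences of length L or longer account for at least half of the total assembled bases. A minimal sketch of the calculation follows; the function name `n50` and the toy lengths are illustrative assumptions, not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sorts lengths from longest to shortest and returns the length at which
    the running total first reaches half of the summed length.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length

# Toy example (hypothetical lengths): total = 1500, half = 750;
# 500 + 400 = 900 >= 750, so the N50 is 400.
toy_n50 = n50([100, 200, 300, 400, 500])
```

    Because N50 weights long sequences more heavily, it can sit well above the median length, as with the N50 of 1216 and median of 773 reported above.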

  7. De novo assembly of maritime pine transcriptome: implications for forest breeding and biotechnology.

    Science.gov (United States)

    Canales, Javier; Bautista, Rocio; Label, Philippe; Gómez-Maldonado, Josefa; Lesur, Isabelle; Fernández-Pozo, Noe; Rueda-López, Marina; Guerrero-Fernández, Dario; Castro-Rodríguez, Vanessa; Benzekri, Hicham; Cañas, Rafael A; Guevara, María-Angeles; Rodrigues, Andreia; Seoane, Pedro; Teyssier, Caroline; Morel, Alexandre; Ehrenmann, François; Le Provost, Grégoire; Lalanne, Céline; Noirot, Céline; Klopp, Christophe; Reymond, Isabelle; García-Gutiérrez, Angel; Trontin, Jean-François; Lelu-Walter, Marie-Anne; Miguel, Celia; Cervera, María Teresa; Cantón, Francisco R; Plomion, Christophe; Harvengt, Luc; Avila, Concepción; Gonzalo Claros, M; Cánovas, Francisco M

    2014-04-01

    Maritime pine (Pinus pinaster Ait.) is a widely distributed conifer species in Southwestern Europe and one of the most advanced models for conifer research. In the current work, comprehensive characterization of the maritime pine transcriptome was performed using a combination of two different next-generation sequencing platforms, 454 and Illumina. De novo assembly of the transcriptome provided a catalogue of 26 020 unique transcripts in maritime pine trees and a collection of 9641 full-length cDNAs. Quality of the transcriptome assembly was validated by RT-PCR amplification of selected transcripts for structural and regulatory genes. Transcription factors and enzyme-encoding transcripts were annotated. Furthermore, the available sequencing data permitted the identification of polymorphisms and the establishment of robust single nucleotide polymorphism (SNP) and simple-sequence repeat (SSR) databases for genotyping applications and integration of translational genomics in maritime pine breeding programmes. All our data are freely available at SustainpineDB, the P. pinaster expressional database. Results reported here on the maritime pine transcriptome represent a valuable resource for future basic and applied studies on this ecologically and economically important pine species. © 2013 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  8. Transcriptome sequencing and comparative transcriptome analysis of the scleroglucan producer Sclerotium rolfsii

    Directory of Open Access Journals (Sweden)

    Stahl Ulf

    2010-05-01

    Full Text Available Abstract Background The plant pathogenic basidiomycete Sclerotium rolfsii produces the industrially exploited exopolysaccharide scleroglucan, a polymer that consists of (1 → 3)-β-linked glucose with a (1 → 6)-β-glycosyl branch on every third unit. Although the physicochemical properties of scleroglucan are well understood, almost nothing is known about the genetics of scleroglucan biosynthesis. Similarly, the biosynthetic pathway of oxalate, the main by-product during scleroglucan production, has not been elucidated yet. In order to provide a basis for genetic and metabolic engineering approaches, we studied scleroglucan and oxalate biosynthesis in S. rolfsii using different transcriptomic approaches. Results Two S. rolfsii transcriptomes obtained from scleroglucan-producing and scleroglucan-nonproducing conditions were pooled and sequenced using the 454 pyrosequencing technique yielding ~350,000 reads. These could be assembled into 21,937 contigs and 171,833 singletons, for which 6,951 had significant matches in public protein databases. Sequence data were used to obtain first insights into the genomics of scleroglucan and oxalate production and to predict putative proteins involved in the synthesis of both metabolites. Using comparative transcriptomics, namely Agilent microarray hybridization and suppression subtractive hybridization, we identified ~800 unigenes which are differently expressed under scleroglucan-producing and non-producing conditions. From these, candidate genes were identified which could represent potential leads for targeted modification of the S. rolfsii metabolism for increased scleroglucan yields. Conclusions The results presented in this paper provide for the first time genomic and transcriptomic data about S. rolfsii and demonstrate the power and usefulness of combined transcriptome sequencing and comparative microarray analysis. The data obtained allowed us to predict the biosynthetic pathways of scleroglucan and

  9. Development of transcriptomic resources for interrogating the biosynthesis of monoterpene indole alkaloids in medicinal plant species.

    Directory of Open Access Journals (Sweden)

    Elsa Góngora-Castillo

    Full Text Available The natural diversity of plant metabolism has long been a source for human medicines. One group of plant-derived compounds, the monoterpene indole alkaloids (MIAs), includes well-documented therapeutic agents used in the treatment of cancer (vinblastine, vincristine, camptothecin), hypertension (reserpine, ajmalicine), malaria (quinine), and as analgesics (7-hydroxymitragynine). Our understanding of the biochemical pathways that synthesize these commercially relevant compounds is incomplete due in part to a lack of molecular, genetic, and genomic resources for the identification of the genes involved in these specialized metabolic pathways. To address these limitations, we generated large-scale transcriptome sequence and expression profiles for three species of Asterids that produce medicinally important MIAs: Camptotheca acuminata, Catharanthus roseus, and Rauvolfia serpentina. Using next generation sequencing technology, we sampled the transcriptomes of these species across a diverse set of developmental tissues, and in the case of C. roseus, in cultured cells and roots following elicitor treatment. Through an iterative assembly process, we generated robust transcriptome assemblies for all three species with a substantial number of the assembled transcripts being full or near-full length. The majority of transcripts had a related sequence in either UniRef100, the Arabidopsis thaliana predicted proteome, or the Pfam protein domain database; however, we also identified transcripts that lacked similarity with entries in either database and thereby lack a known function. Representation of known genes within the MIA biosynthetic pathway was robust. As a diverse set of tissues and treatments were surveyed, expression abundances of transcripts in the three species could be estimated to reveal transcripts associated with development and response to elicitor treatment. Together, these transcriptomes and expression abundance matrices provide a rich resource

  10. Development of Transcriptomic Resources for Interrogating the Biosynthesis of Monoterpene Indole Alkaloids in Medicinal Plant Species

    Science.gov (United States)

    Góngora-Castillo, Elsa; Childs, Kevin L.; Fedewa, Greg; Hamilton, John P.; Liscombe, David K.; Magallanes-Lundback, Maria; Mandadi, Kranthi K.; Nims, Ezekiel; Runguphan, Weerawat; Vaillancourt, Brieanne; Varbanova-Herde, Marina; DellaPenna, Dean; McKnight, Thomas D.; O’Connor, Sarah; Buell, C. Robin

    2012-01-01

    The natural diversity of plant metabolism has long been a source for human medicines. One group of plant-derived compounds, the monoterpene indole alkaloids (MIAs), includes well-documented therapeutic agents used in the treatment of cancer (vinblastine, vincristine, camptothecin), hypertension (reserpine, ajmalicine), malaria (quinine), and as analgesics (7-hydroxymitragynine). Our understanding of the biochemical pathways that synthesize these commercially relevant compounds is incomplete due in part to a lack of molecular, genetic, and genomic resources for the identification of the genes involved in these specialized metabolic pathways. To address these limitations, we generated large-scale transcriptome sequence and expression profiles for three species of Asterids that produce medicinally important MIAs: Camptotheca acuminata, Catharanthus roseus, and Rauvolfia serpentina. Using next generation sequencing technology, we sampled the transcriptomes of these species across a diverse set of developmental tissues, and in the case of C. roseus, in cultured cells and roots following elicitor treatment. Through an iterative assembly process, we generated robust transcriptome assemblies for all three species with a substantial number of the assembled transcripts being full or near-full length. The majority of transcripts had a related sequence in either UniRef100, the Arabidopsis thaliana predicted proteome, or the Pfam protein domain database; however, we also identified transcripts that lacked similarity with entries in either database and thereby lack a known function. Representation of known genes within the MIA biosynthetic pathway was robust. As a diverse set of tissues and treatments were surveyed, expression abundances of transcripts in the three species could be estimated to reveal transcripts associated with development and response to elicitor treatment. Together, these transcriptomes and expression abundance matrices provide a rich resource for

  11. Transcriptome data - Initial stage of dough fermentation - DGBY | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Full Text Available DGBY Transcriptome data - Initial stage of dough fermentation Data detail Data name Transcri...ptome data - Initial stage of dough fermentation DOI 10.18908/lsdba.nbdc00953-002 Description of data conten...ts Gene expression profiles of baker's yeast during initial dough-fermentation were investigated using liquid fermentation...aptation mechanisms of baker's yeast. Results showed the onset of fermentation caused drastic changes in gen...f baker's yeast during dough-fermentation, and will thus help clarify genomic res

  12. Development of human protein reference database as an initial platform for approaching systems biology in humans

    DEFF Research Database (Denmark)

    Peri, Suraj; Navarro, J Daniel; Amanchy, Ramars

    2003-01-01

    Human Protein Reference Database (HPRD) is an object database that integrates a wealth of information relevant to the function of human proteins in health and disease. Data pertaining to thousands of protein-protein interactions, posttranslational modifications, enzyme/substrate relationships...

  13. HERVd: database of human endogenous retroviruses

    Czech Academy of Sciences Publication Activity Database

    Pačes, Jan; Pavlíček, Adam; Pačes, Václav

    2002-01-01

    Vol. 30, No. 1 (2002), pp. 205-206 ISSN 0305-1048 R&D Projects: GA MŠk LN00A079; GA ČR GA301/99/M023 Keywords: HERV * database * human genome Subject RIV: EB - Genetics; Molecular Biology Impact factor: 7.051, year: 2002

  14. Annotation of nerve cord transcriptome in earthworm Eisenia fetida

    Directory of Open Access Journals (Sweden)

    Vasanthakumar Ponesakki

    2017-12-01

    Full Text Available In annelid worms, the nerve cord serves as a crucial organ to control the sensory and behavioral physiology. The inadequate genome resource of earthworms has prioritized the comprehensive analysis of their transcriptome dataset to monitor the genes expressed in the nerve cord and predict their role in the neurotransmission and sensory perception of the species. The present study focuses on identifying the potential transcripts and predicting their functional features by annotating the transcriptome dataset of nerve cord tissues prepared by Gong et al., 2010 from the earthworm Eisenia fetida. In total, 9762 transcripts were successfully annotated against the NCBI nr database using the BLASTX algorithm, and among them 7680 transcripts were assigned to a total of 44,354 GO terms. The conserved domain analysis indicated the overrepresentation of the P-loop NTPase domain and the calcium-binding EF-hand domain. The COG functional annotation classified 5860 transcript sequences into 25 functional categories. Further, 4502 contig sequences were found to map to 124 KEGG pathways. The annotated contig dataset exhibited 22 crucial neuropeptides having considerable matches to the marine annelid Platynereis dumerilii, suggesting their possible role in neurotransmission and neuromodulation. In addition, 108 human stem cell marker homologs were identified, including the crucial epigenetic regulators, transcriptional repressors and cell cycle regulators, which may contribute to the neuronal and segmental regeneration. The complete functional annotation of this nerve cord transcriptome can be further utilized to interpret genetic and molecular mechanisms associated with neuronal development, nervous system regeneration and nerve cord function.

  15. Investigating the Correspondence Between Transcriptomic and Proteomic Expression Profiles Using Coupled Cluster Models

    International Nuclear Information System (INIS)

    Rogers, Simon; Girolami, Mark; Kolch, Walter; Waters, Katrina M.; Liu, Tao; Thrall, Brian D.; Wiley, H. S.

    2008-01-01

    Modern transcriptomics and proteomics enable us to survey the expression of RNAs and proteins at large scales. While these data are usually generated and analyzed separately, there is an increasing interest in comparing and co-analyzing transcriptome and proteome expression data. A major open question is whether transcriptome and proteome expression is linked and how it is coordinated. Results: Here we have developed a probabilistic clustering model that permits analysis of the links between transcriptomic and proteomic profiles in a sensible and flexible manner. Our coupled mixture model defines a prior probability distribution over the component to which a protein profile should be assigned conditioned on which component the associated mRNA profile belongs to. By providing probabilistic assignments this approach sits between the two extremes of concatenating the data on the assumption that mRNA and protein clusters would have a one-to-one relationship, and independent clustering where the mRNA profile provides no information on the protein profile and vice-versa. We apply this approach to a large dataset of quantitative transcriptomic and proteomic expression data obtained from a human breast epithelial cell line (HMEC) stimulated by epidermal growth factor (EGF) over a series of timepoints corresponding to one cell cycle. The results reveal a complex relationship between transcriptome and proteome with most mRNA clusters linked to at least two protein clusters, and vice versa. A more detailed analysis incorporating information on gene function from the gene ontology database shows that a high correlation of mRNA and protein expression is limited to the components of some molecular machines, such as the ribosome, cell adhesion complexes and the TCP-1 chaperonin involved in protein folding. 
Conclusions: The dynamic regulation of the transcriptome and proteome in mammalian cells in response to an acute mitogenic stimulus appears largely independent with very little

  16. Profiling the venom gland transcriptomes of Costa Rican snakes by 454 pyrosequencing

    Directory of Open Access Journals (Sweden)

    Sanz Libia

    2011-05-01

    Full Text Available Abstract Background A long term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of genome sequence, transcriptomes represent also valuable searchable databases for proteomic projects. Results The venom gland transcriptomes of 8 Costa Rican taxa from 5 genera (Crotalus, Bothrops, Atropoides, Cerrophidion, and Bothriechis) of pitvipers were investigated using high-throughput 454 pyrosequencing. 100,394 out of 330,010 masked reads produced significant hits in the available databases. 5,165,220 nucleotides (8.27%) were masked by RepeatMasker, the vast majority of which corresponded to class I (retroelements) and class II (DNA transposons) mobile elements. BLAST hits included 79,991 matches to entries of the taxonomic suborder Serpentes, of which 62,433 displayed similarity to documented venom proteins. Strong discrepancies between the transcriptome-computed and the proteome-gathered toxin compositions were obvious at first sight. Although the reasons underlying this discrepancy are elusive, since no clear trend within or between species is apparent, the data indicate that individual mRNA species may be translationally controlled in a species-dependent manner. The minimum number of genes from each toxin family transcribed into the venom gland transcriptome of each species was calculated from multiple alignments of reads matched to a full-length reference sequence of each toxin family. Reads encoding ORF regions of Kazal-type inhibitor-like proteins were uniquely found in Bothriechis schlegelii and B. lateralis transcriptomes, suggesting a genus-specific recruitment event during the early-Middle Miocene. A transcriptome-based cladogram supports the large

  17. Blood transcriptomics and metabolomics for personalized medicine.

    Science.gov (United States)

    Li, Shuzhao; Todor, Andrei; Luo, Ruiyan

    2016-01-01

    Molecular analysis of blood samples is pivotal to clinical diagnosis and has been intensively investigated since the rise of systems biology. Recent developments have opened new opportunities to utilize transcriptomics and metabolomics for personalized and precision medicine. Efforts from human immunology have infused into this area exquisite characterizations of subpopulations of blood cells. It is now possible to infer from blood transcriptomics, with fine accuracy, the contribution of immune activation and of cell subpopulations. In parallel, high-resolution mass spectrometry has brought revolutionary analytical capability, detecting > 10,000 metabolites, together with environmental exposure, dietary intake, microbial activity, and pharmaceutical drugs. Thus, the re-examination of blood chemicals by metabolomics is in order. Transcriptomics and metabolomics can be integrated to provide a more comprehensive understanding of the human biological states. We will review these new data and methods and discuss how they can contribute to personalized medicine.

  18. Transcriptome analysis of H2O2-treated wheat seedlings reveals a H2O2-responsive fatty acid desaturase gene participating in powdery mildew resistance.

    Directory of Open Access Journals (Sweden)

    Aili Li

    Full Text Available Hydrogen peroxide (H2O2) plays important roles in plant biotic and abiotic stress responses. However, information on the effect of H2O2 stress on the bread wheat transcriptome is still lacking. To investigate the cellular and metabolic responses triggered by H2O2, we performed an mRNA tag analysis of wheat seedlings under 10 mM H2O2 treatment for 6 hours in one powdery mildew (PM) resistant (PmA) and two susceptible (Cha and Han) lines. In total, 6,156, 6,875 and 3,276 transcripts were found to be differentially expressed in PmA, Han and Cha respectively. Among them, 260 genes exhibited consistent expression patterns in all three wheat lines and may represent a subset of basal H2O2 responsive genes that were associated with cell defense, signal transduction, photosynthesis, carbohydrate metabolism, lipid metabolism, redox homeostasis, and transport. Among genes specific to PmA, 'transport' activity was significantly enriched in Gene Ontology analysis. MapMan classification showed that, while both up- and down-regulations were observed for auxin, abscisic acid, and brassinolides signaling genes, the jasmonic acid and ethylene signaling pathway genes were all up-regulated, suggesting H2O2-enhanced JA/Et functions in PmA. To further study whether any of these genes were involved in wheat PM response, 19 H2O2-responsive putative defense related genes were assayed in wheat seedlings infected with Blumeria graminis f. sp. tritici (Bgt). Eight of these genes were found to be co-regulated by H2O2 and Bgt, among which a fatty acid desaturase gene TaFAD was then confirmed by virus induced gene silencing (VIGS) to be required for the PM resistance. Together, our data present the first global picture of the wheat transcriptome under H2O2 stress and uncover potential links between H2O2 and Bgt responses, hence providing important candidate genes for the PM resistance in wheat.

  19. Transcriptomics and proteomics show that selenium affects inflammation, cytoskeleton, and cancer pathways in human rectal biopsies.

    Science.gov (United States)

    Méplan, Catherine; Johnson, Ian T; Polley, Abigael C J; Cockell, Simon; Bradburn, David M; Commane, Daniel M; Arasaradnam, Ramesh P; Mulholland, Francis; Zupanic, Anze; Mathers, John C; Hesketh, John

    2016-08-01

    Epidemiologic studies highlight the potential role of dietary selenium (Se) in colorectal cancer prevention. Our goal was to elucidate whether expression of factors crucial for colorectal homoeostasis is affected by physiologic differences in Se status. Using transcriptomics and proteomics followed by pathway analysis, we identified pathways affected by Se status in rectal biopsies from 22 healthy adults, including 11 controls with optimal status (mean plasma Se = 1.43 μM) and 11 subjects with suboptimal status (mean plasma Se = 0.86 μM). We observed that 254 genes and 26 proteins implicated in cancer (80%), immune function and inflammatory response (40%), cell growth and proliferation (70%), cellular movement, and cell death (50%) were differentially expressed between the 2 groups. Expression of 69 genes, including selenoproteins W1 and K, which are genes involved in cytoskeleton remodelling and transcription factor NFκB signaling, correlated significantly with Se status. Integrating proteomics and transcriptomics datasets revealed reduced inflammatory and immune responses and cytoskeleton remodelling in the suboptimal Se status group. This is the first study combining omics technologies to describe the impact of differences in Se status on colorectal expression patterns, revealing that suboptimal Se status could alter inflammatory signaling and cytoskeleton in human rectal mucosa and so influence cancer risk.

  20. Analysis of Pigeon (Columba) Ovary Transcriptomes to Identify Genes Involved in Blue Light Regulation

    Science.gov (United States)

    Wang, Ying; Ding, Jia-tong; Yang, Hai-ming; Yan, Zheng-jie; Cao, Wei; Li, Yang-bai

    2015-01-01

    Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp) were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species. PMID:26599806

  1. Analysis of Pigeon (Columba) Ovary Transcriptomes to Identify Genes Involved in Blue Light Regulation.

    Directory of Open Access Journals (Sweden)

    Ying Wang

    Full Text Available Monochromatic light is widely applied to promote poultry reproductive performance, yet little is currently known regarding the mechanism by which light wavelengths affect pigeon reproduction. Recently, high-throughput sequencing technologies have been used to provide genomic information for solving this problem. In this study, we employed Illumina Hiseq 2000 to identify differentially expressed genes in ovary tissue from pigeons under blue and white light conditions and de novo transcriptome assembly to construct a comprehensive sequence database containing information on the mechanisms of follicle development. A total of 157,774 unigenes (mean length: 790 bp) were obtained by the Trinity program, and 35.83% of these unigenes were matched to genes in a non-redundant protein database. Gene description, gene ontology, and the clustering of orthologous group terms were performed to annotate the transcriptome assembly. Differentially expressed genes between blue and white light conditions included those related to oocyte maturation, hormone biosynthesis, and circadian rhythm. Furthermore, 17,574 SSRs and 533,887 potential SNPs were identified in this transcriptome assembly. This work is the first transcriptome analysis of the Columba ovary using Illumina technology, and the resulting transcriptome and differentially expressed gene data can facilitate further investigations into the molecular mechanism of the effect of blue light on follicle development and reproduction in pigeons and other bird species.

  2. Microbiome and ecotypic adaption of Holcus lanatus (L.) to extremes of its soil pH range, investigated through transcriptome sequencing.

    Science.gov (United States)

    Young, Ellen; Carey, Manus; Meharg, Andrew A; Meharg, Caroline

    2018-03-20

    the most common genus in shoots, with Colletotrichum and Rhizophagus (AM fungi) most numerous in limestone soil roots. The latter coincided with upregulation of plant genes involved in AM symbiosis initiation and AM-based P acquisition in an environment where P availability is low. Meta-transcriptome analyses provided novel insights into H. lanatus transcriptome responses, associated eukaryotic microbiota functions and taxonomic community composition. Significant edaphic and plant ecotype effects were identified, demonstrating that meta-transcriptome-based functional analysis is a powerful tool for the study of natural plant-microbiome interactions.

  3. Professional Microsoft SQL Server 2012 Administration

    CERN Document Server

    Jorgensen, Adam; LoForte, Ross; Knight, Brian

    2012-01-01

    An essential how-to guide for experienced DBAs on the most significant product release since 2005! Microsoft SQL Server 2012 will have major changes throughout the SQL Server and will impact how DBAs administer the database. With this book, a team of well-known SQL Server experts introduces the many new features of the most recent version of SQL Server and deciphers how these changes will affect the methods that administrators have been using for years. Loaded with unique tips, tricks, and workarounds for handling the most difficult SQL Server admin issues, this how-to guide deciphers topics s

  4. Development of Human Face Literature Database Using Text Mining Approach: Phase I.

    Science.gov (United States)

    Kaur, Paramjit; Krishan, Kewal; Sharma, Suresh K

    2018-06-01

    The face is an important part of the human body by which an individual communicates in society. Its importance can be highlighted by the fact that a person deprived of a face cannot sustain in the living world. The number of experiments being performed and the number of research papers being published in the domain of the human face have surged in the past few decades. Several scientific disciplines conduct research on the human face, including Medical Science, Anthropology, Information Technology (Biometrics, Robotics, Artificial Intelligence, etc.), Psychology, Forensic Science, and Neuroscience. This highlights the need to collect and manage data concerning the human face so that public and free access to it can be provided to the scientific community. This can be attained by developing databases and tools on the human face using a bioinformatics approach. The current research focuses on creating a database of literature data on the human face. The database can be accessed on the basis of specific keywords, journal name, date of publication, author's name, etc. The collected research papers will be stored in the form of a database. Hence, the database will be beneficial to the research community, as comprehensive information dedicated to the human face can be found in one place. Information related to facial morphologic features, facial disorders, facial asymmetry, facial abnormalities, and many other parameters can be extracted from this database. The front end has been developed using HyperText Markup Language (HTML) and Cascading Style Sheets (CSS). The back end has been developed using Hypertext Preprocessor (PHP). JavaScript has been used as the scripting language. MySQL (Structured Query Language) is used for database development as it is the most widely used relational database management system. XAMPP (X (cross platform), Apache, MySQL, PHP, Perl) open source web application software has been used as the server. The database is still under the
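    The record above describes a literature database searchable by keyword, journal name, publication date, and author. A minimal sketch of such a lookup follows. It uses Python with an in-memory SQLite table instead of the PHP/MySQL stack named in the abstract, and the table layout (`papers`), the `search` helper, and the two rows are hypothetical placeholders, not data from the actual database.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory table holding two made-up placeholder records."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE papers (title TEXT, journal TEXT, pub_date TEXT, author TEXT)"
    )
    conn.executemany(
        "INSERT INTO papers VALUES (?, ?, ?, ?)",
        [
            ("Facial asymmetry in adults", "Journal A", "2015-03-01", "Author One"),
            ("Biometric face recognition", "Journal B", "2017-06-15", "Author Two"),
        ],
    )
    return conn


def search(conn, keyword=None, journal=None, author=None):
    """Return titles matching every filter that was supplied, oldest first."""
    clauses, params = [], []
    if keyword:
        clauses.append("title LIKE ?")  # substring match on the title
        params.append(f"%{keyword}%")
    if journal:
        clauses.append("journal = ?")
        params.append(journal)
    if author:
        clauses.append("author = ?")
        params.append(author)
    where = " AND ".join(clauses) or "1"  # no filters: match every row
    rows = conn.execute(
        f"SELECT title FROM papers WHERE {where} ORDER BY pub_date", params
    )
    return [row[0] for row in rows]
```

    User input is passed only through `?` placeholders, so the query text itself is assembled from fixed fragments.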

  5. De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata

    Directory of Open Access Journals (Sweden)

    Neeraja Cherukupalli

    2016-08-01

    Full Text Available Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information for this plant is available in the public databases. This study mainly deals with the sequencing of RNA from A. paniculata leaf using the Illumina HiSeq 2000 platform followed by de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 170,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on similarity searches against the plant nonredundant protein database, gene ontology and eukaryotic orthologous groups, 49,363 transcripts were annotated, constituting up to 58.91% of the identified unigenes. Annotation of transcripts using the Kyoto Encyclopedia of Genes and Genomes database revealed 5,606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6,767 unique transcripts belonging to 97 different transcription factor families. A total of 124 CYP450 transcripts belonging to seven divergent clans have been identified. The transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids, of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. Assembled sequences of the transcriptome of A. paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analyses besides identification of key enzymes involved in the various pathways of secondary metabolism.
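    The abstract reports simple sequence repeats (SSRs) found in the assembled transcripts but does not say which tool or thresholds were used. As an illustration only, the sketch below finds perfect tandem repeats with regular-expression backreferences. The function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions chosen for the example, not the study's settings.

```python
import re

# Assumed thresholds: motif length -> minimum number of tandem copies.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq):
    """Return sorted (start, motif, copies) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # A motif of unit_len bases followed by at least min_rep - 1 more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            unit = match.group(1)
            # Skip motifs that are repeats of a shorter motif (e.g. "AA" as a dinucleotide).
            if any(
                unit == unit[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), unit, len(match.group(0)) // unit_len))
    return sorted(hits)
```

    Counting the transcripts for which `find_ssrs` returns a non-empty list would give a figure of the same kind as the "SSRs in N transcripts" statistic quoted above, under these assumed thresholds.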

  6. Analysis of the Macaca mulatta transcriptome and the sequence divergence between Macaca and human.

    Science.gov (United States)

    Magness, Charles L; Fellin, P Campion; Thomas, Matthew J; Korth, Marcus J; Agy, Michael B; Proll, Sean C; Fitzgibbon, Matthew; Scherer, Christina A; Miner, Douglas G; Katze, Michael G; Iadonato, Shawn P

    2005-01-01

    We report the initial sequencing and comparative analysis of the Macaca mulatta transcriptome. Cloned sequences from 11 tissues, nine animals, and three species (M. mulatta, M. fascicularis, and M. nemestrina) were sampled, resulting in the generation of 48,642 sequence reads. These data represent an initial sampling of the putative rhesus orthologs for 6,216 human genes. Mean nucleotide diversity within M. mulatta and sequence divergence among M. fascicularis, M. nemestrina, and M. mulatta are also reported.

  7. A SAGE based approach to human glomerular endothelium : defining the transcriptome, finding a novel molecule and highlighting endothelial diversity

    NARCIS (Netherlands)

    Sengoelge, Guerkan; Winnicki, Wolfgang; Kupczok, Anne; von Haeseler, Arndt; Schuster, Michael; Pfaller, Walter; Jennings, Paul; Weltermann, Ansgar; Blake, Sophia; Sunder-Plassmann, Gere

    2014-01-01

    BACKGROUND: Large scale transcript analysis of human glomerular microvascular endothelial cells (HGMEC) has never been accomplished. We designed this study to define the transcriptome of HGMEC and facilitate a better characterization of these endothelial cells with unique features. Serial analysis

  8. Subject Retrieval from Full-Text Databases in the Humanities

    Science.gov (United States)

    East, John W.

    2007-01-01

    This paper examines the problems involved in subject retrieval from full-text databases of secondary materials in the humanities. Ten such databases were studied and their search functionality evaluated, focusing on factors such as Boolean operators, document surrogates, limiting by subject area, proximity operators, phrase searching, wildcards,…

  9. Benchmarking Water Quality from Wastewater to Drinking Waters Using Reduced Transcriptome of Human Cells.

    Science.gov (United States)

    Xia, Pu; Zhang, Xiaowei; Zhang, Hanxin; Wang, Pingping; Tian, Mingming; Yu, Hongxia

    2017-08-15

    One of the major challenges in environmental science is monitoring and assessing the risk of complex environmental mixtures. In vitro bioassays with limited key toxicological end points have been shown to be suitable to evaluate mixtures of organic pollutants in wastewater and recycled water. Omics approaches such as transcriptomics can monitor biological effects at the genome scale. However, few studies have applied omics approaches in the assessment of mixtures of organic micropollutants. Here, an omics approach was developed for profiling bioactivity of 10 water samples ranging from wastewater to drinking water in human cells by a reduced human transcriptome (RHT) approach and dose-response modeling. Transcriptional expression of 1200 selected genes was measured by AmpliSeq technology in two cell lines, HepG2 and MCF7, that were exposed to eight serial dilutions of each sample. Concentration-effect models were used to identify differentially expressed genes (DEGs) and to calculate effect concentrations (ECs) of DEGs, which could be ranked to investigate low dose response. Furthermore, molecular pathways disrupted by different samples were evaluated by Gene Ontology (GO) enrichment analysis. The ability of RHT to represent bioactivity utilizing both HepG2 and MCF7 was shown to be comparable to the results of previous in vitro bioassays. Finally, the relative potencies of the mixtures indicated by RHT analysis were consistent with the chemical profiles of the samples. RHT analysis with human cells provides an efficient and cost-effective approach to benchmarking mixtures of micropollutants and may offer novel insight into the assessment of mixture toxicity in water.

  10. Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes.

    Science.gov (United States)

    Kumar, Vikas; Kutschera, Verena E; Nilsson, Maria A; Janke, Axel

    2015-08-07

    The genus Vulpes (true foxes) comprises numerous species that inhabit a wide range of habitats and climatic conditions, including one species, the Arctic fox (Vulpes lagopus) which is adapted to the arctic region. A close relative to the Arctic fox, the red fox (Vulpes vulpes), occurs in subarctic to subtropical habitats. To study the genetic basis of their adaptations to different environments, transcriptome sequences from two Arctic foxes and one red fox individual were generated and analyzed for signatures of positive selection. In addition, the data allowed for a phylogenetic analysis and divergence time estimate between the two fox species. The de novo assembly of reads resulted in more than 160,000 contigs/transcripts per individual. Approximately 17,000 homologous genes were identified using human and the non-redundant databases. Positive selection analyses revealed several genes involved in various metabolic and molecular processes such as energy metabolism, cardiac gene regulation, apoptosis and blood coagulation to be under positive selection in foxes. Branch site tests identified four genes to be under positive selection in the Arctic fox transcriptome, two of which are fat metabolism genes. In the red fox transcriptome eight genes are under positive selection, including molecular process genes, notably genes involved in ATP metabolism. Analysis of the three transcriptomes and five Sanger re-sequenced genes in additional individuals identified a lower genetic variability within Arctic foxes compared to red foxes, which is consistent with distribution range differences and demographic responses to past climatic fluctuations. A phylogenomic analysis estimated that the Arctic and red fox lineages diverged about three million years ago. Transcriptome data are an economic way to generate genomic resources for evolutionary studies. Despite not representing an entire genome, this transcriptome analysis identified numerous genes that are relevant to arctic

  11. Transcriptome of interstitial cells of Cajal reveals unique and selective gene signatures.

    Directory of Open Access Journals (Sweden)

    Moon Young Lee

    Full Text Available Transcriptome-scale data can reveal essential clues into understanding the underlying molecular mechanisms behind specific cellular functions and biological processes. Transcriptomics is a continually growing field of research utilized in biomarker discovery. The transcriptomic profile of interstitial cells of Cajal (ICC), which serve as slow-wave electrical pacemakers for gastrointestinal (GI) smooth muscle, has yet to be uncovered. Using copGFP-labeled ICC mice and flow cytometry, we isolated ICC populations from the murine small intestine and colon and obtained their transcriptomes. In analyzing the transcriptome, we identified a unique set of ICC-restricted markers including transcription factors, epigenetic enzymes/regulators, growth factors, receptors, protein kinases/phosphatases, and ion channels/transporters. This analysis provides new and unique insights into the cellular and biological functions of ICC in GI physiology. Additionally, we constructed an interactive ICC genome browser (http://med.unr.edu/physio/transcriptome) based on the UCSC genome database. To our knowledge, this is the first online resource that provides a comprehensive library of all known genetic transcripts expressed in primary ICC. Our genome browser offers a new perspective into the alternative expression of genes in ICC and provides a valuable reference for future functional studies.

  12. The human keratinocyte two-dimensional protein database (update 1994): towards an integrated approach to the study of cell proliferation, differentiation and skin diseases

    DEFF Research Database (Denmark)

    Celis, J E; Rasmussen, H H; Olsen, E

    1994-01-01

    The master two-dimensional (2-D) gel database of human keratinocytes currently lists 3087 cellular proteins (2168 isoelectric focusing, IEF; and 919 nonequilibrium pH gradient electrophoresis, NEPHGE), many of which correspond to posttranslational modifications, 890 polypeptides have been...... in the database. We also report a database of proteins recovered from the medium of noncultured, unfractionated keratinocytes. This database lists 398 polypeptides (309 IEF; 89 NEPHGE) of which 76 have been identified. The aim of the comprehensive databases is to gather, through a systematic study...

  13. De novo transcriptome assembly of two different peach cultivars grown in Korea

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2015-12-01

    Full Text Available Peach (Prunus persica) is one of the most popular stone fruits worldwide. Next generation sequencing (NGS) has facilitated genome and transcriptome analyses of several stone fruit trees. In this study, we conducted de novo transcriptome analyses of two peach cultivars grown in Korea. Leaves of two cultivars, referred to as Jangtaek and Mibaek, were harvested and used for library preparation. The two prepared libraries were paired-end sequenced by the HiSeq2000 system. We obtained 8.14 GB and 9.62 GB sequence data from Jangtaek and Mibaek (NCBI accession numbers: SRS1056585 and SRS1056587), respectively. The Trinity program was used to assemble two transcriptomes de novo, resulting in 110,477 (Jangtaek) and 136,196 (Mibaek) transcripts. TransDecoder identified possible coding regions in assembled transcripts. The identified proteins were subjected to BLASTP search against NCBI's non-redundant database for functional annotation. This study provides transcriptome data for two peach cultivars, which might be useful for genetic marker development and comparative transcriptome analyses.

  14. Transcriptome analysis of the Chinese giant salamander (Andrias davidianus) using RNA-sequencing

    Directory of Open Access Journals (Sweden)

    Yong Huang

    2017-12-01

    Full Text Available The Chinese giant salamander (Andrias davidianus) is an economically important animal with academic value. However, the genomic information of this species has been little studied. In our study, the transcripts of A. davidianus were obtained by RNA-seq to conduct a transcriptomic analysis. In total, 132,912 unigenes were generated with an average length of 690 bp and an N50 of 1263 bp by de novo assembly using Trinity software. Using a sequence similarity search against nine public databases (CDD, KOG, NR, NT, PFAM, Swiss-prot, TrEMBL, GO and KEGG), a total of 24,049, 18,406, 36,711, 15,858, 20,500, 27,515, 36,705, 28,879 and 10,958 unigenes were annotated in the respective databases. Of these, 6323 unigenes were annotated in all databases and 39,672 unigenes were annotated in at least one database. Against the KEGG pathway database, 10,958 unigenes were annotated and divided into 343 categories according to different pathways. In addition, we also identified 29,790 SSRs. This study provided a valuable resource for understanding transcriptomic information of A. davidianus and laid a foundation for further research on functional gene cloning, genomics, genetic diversity analysis and molecular marker exploitation in A. davidianus.
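    The abstract quotes an N50 of 1263 bp for the assembly. For readers unfamiliar with the statistic, the sketch below computes it under the usual definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembled bases. The function name `n50` is illustrative, and the input lengths in the usage note are toy values, not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the cumulative sum reaches half the total.
        if running >= half_total:
            return length
    raise ValueError("n50() needs at least one sequence length")
```

    For the toy lengths 2 through 10 (total 54 bases), the three longest sequences (10, 9, 8) sum to 27, exactly half, so `n50(range(2, 11))` returns 8.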

  15. Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing

    Directory of Open Access Journals (Sweden)

    Thierry-Mieg Danielle

    2009-06-01

    Full Text Available Background: Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC) reference RNA samples using Roche's 454 Genome Sequencer FLX. Results: We generated more than 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well annotated genes with e-values ≤ 10^-20. We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy and compared the results with DNA microarrays and Quantitative RT-PCR (QRTPCR) from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database. Conclusion: Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome including the discovery of thousands of new splice variants.
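    The abstract mentions a pipeline that translates read counts into gene expression levels by counting reads mapped to individual RefSeq genes, without giving its details. The sketch below shows one common and simple way to do this, scaling per-gene counts to reads per million mapped reads. It is not the authors' pipeline; the function name `reads_per_million` and the input layout (one gene identifier per read, `None` for unmapped reads) are assumptions.

```python
from collections import Counter


def reads_per_million(read_to_gene):
    """Map each gene to its read count scaled per million mapped reads.

    read_to_gene holds one gene identifier per read; None marks an unmapped read.
    """
    counts = Counter(gene for gene in read_to_gene if gene is not None)
    total_mapped = sum(counts.values())
    if total_mapped == 0:
        return {}
    return {gene: n * 1_000_000 / total_mapped for gene, n in counts.items()}
```

    Scaling by the number of mapped reads makes runs of different sequencing depth comparable, which matters when counts from multiple sequencing runs are evaluated side by side.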

  16. LINE FUSION GENES: a database of LINE expression in human genes

    Directory of Open Access Journals (Sweden)

    Park Hong-Seog

    2006-06-01

    Full Text Available Background: Long Interspersed Nuclear Elements (LINEs) are the most abundant retrotransposons in humans. About 79% of human genes are estimated to contain at least one segment of LINE per transcription unit. Recent studies have shown that LINE elements can affect protein sequences, splicing patterns and expression of human genes. Description: We have developed a database, LINE FUSION GENES, for elucidating LINE expression throughout the human gene database. We searched the 28,171 genes listed in the NCBI database for LINE elements and analyzed their structures and expression patterns. The results show that the mRNA sequences of 1,329 genes were affected by LINE expression. The LINE expression types were classified on the basis of LINEs in the 5' UTR, exon or 3' UTR sequences of the mRNAs. Our database provides further information, such as the tissue distribution and chromosomal location of the genes, and the domain structure that is changed by LINE integration. We have linked all the accession numbers to the NCBI data bank to provide mRNA sequences for subsequent users. Conclusion: We believe that our work will interest genome scientists and might help them to gain insight into the implications of LINE expression for human evolution and disease. Availability: http://www.primate.or.kr/line

  17. Impact of Transcriptomics on Our Understanding of Pulmonary Fibrosis

    Science.gov (United States)

    Vukmirovic, Milica; Kaminski, Naftali

    2018-01-01

    Idiopathic pulmonary fibrosis (IPF) is a lethal fibrotic lung disease characterized by aberrant remodeling of the lung parenchyma with extensive changes to the phenotypes of all lung resident cells. The introduction of transcriptomics, genome scale profiling of thousands of RNA transcripts, caused a significant inversion in IPF research. Instead of generating hypotheses based on animal models of disease, or biological plausibility, with limited validation in humans, investigators were able to generate hypotheses based on unbiased molecular analysis of human samples and then use animal models of disease to test their hypotheses. In this review, we describe the insights made from transcriptomic analysis of human IPF samples. We describe how transcriptomic studies led to identification of novel genes and pathways involved in the human IPF lung such as: matrix metalloproteinases, WNT pathway, epithelial genes, role of microRNAs among others, as well as conceptual insights such as the involvement of developmental pathways and deep shifts in epithelial and fibroblast phenotypes. The impact of lung and transcriptomic studies on disease classification, endotype discovery, and reproducible biomarkers is also described in detail. Despite these impressive achievements, the impact of transcriptomic studies has been limited because they analyzed bulk tissue and did not address the cellular and spatial heterogeneity of the IPF lung. We discuss new emerging technologies and applications, such as single-cell RNAseq and microenvironment analysis that may address cellular and spatial heterogeneity. We end by making the point that most current tissue collections and resources are not amenable to analysis using the novel technologies. To take advantage of the new opportunities, we need new efforts of sample collections, this time focused on access to all the microenvironments and cells in the IPF lung. PMID:29670881

  18. OPERA-a human performance database under simulated emergencies of nuclear power plants

    International Nuclear Information System (INIS)

    Park, Jinkyun; Jung, Wondea

    2007-01-01

    In complex systems such as those in the nuclear and chemical industries, the importance of human performance related problems is well recognized. Considerable effort has therefore been spent in this area, and one of the main approaches to unraveling human performance related problems is human reliability analysis (HRA). Unfortunately, a lack of prerequisite information has been pointed out as the most critical problem in conducting HRA. To address this need, the OPERA database, which provides operators' performance data obtained under simulated emergencies, has been developed. In this study, typical operators' performance data available from the OPERA database are briefly explained. Then, to verify the appropriateness of the OPERA database, its operators' performance data are compared with those of other studies and real events. The results suggest that the operators' performance data in the OPERA database are fairly comparable to those of other studies and real events. The OPERA database can therefore be expected to serve as a useful data source for scrutinizing human performance related problems, including HRA.

  19. HERVd: the Human Endogenous Retrovirus Database: update

    Czech Academy of Sciences Publication Activity Database

    Pačes, Jan; Pavlíček, A.; Zíka, Radek; Jurka, J.; Pačes, Václav

    2004-01-01

    Vol. 32, No. 1 (2004), p. 50 ISSN 0305-1048 R&D Projects: GA MŠk LN00A079 Institutional research plan: CEZ:AV0Z5052915 Keywords: human * endogenous retrovirus * database Subject RIV: EB - Genetics; Molecular Biology Impact factor: 7.260, year: 2004

  20. Exploring Human Cognition Using Large Image Databases.

    Science.gov (United States)

    Griffiths, Thomas L; Abbott, Joshua T; Hsu, Anne S

    2016-07-01

    Most cognitive psychology experiments evaluate models of human cognition using a relatively small, well-controlled set of stimuli. This approach stands in contrast to current work in neuroscience, perception, and computer vision, which have begun to focus on using large databases of natural images. We argue that natural images provide a powerful tool for characterizing the statistical environment in which people operate, for better evaluating psychological theories, and for bringing the insights of cognitive science closer to real applications. We discuss how some of the challenges of using natural images as stimuli in experiments can be addressed through increased sample sizes, using representations from computer vision, and developing new experimental methods. Finally, we illustrate these points by summarizing recent work using large image databases to explore questions about human cognition in four different domains: modeling subjective randomness, defining a quantitative measure of representativeness, identifying prior knowledge used in word learning, and determining the structure of natural categories. Copyright © 2016 Cognitive Science Society, Inc.

  1. A pathology atlas of the human cancer transcriptome

    DEFF Research Database (Denmark)

    Uhlén, Mathias; Zhang, Xi-Cheng; Lee, Sunjae

    2017-01-01

    Cancer is one of the leading causes of death, and there is great interest in understanding the underlying molecular mechanisms involved in the pathogenesis and progression of individual tumors. We used systems-level approaches to analyze the genome-wide transcriptome of the protein-coding genes o...

  2. Transcriptome and proteomic analysis of mango (Mangifera indica Linn) fruits.

    Science.gov (United States)

    Wu, Hong-xia; Jia, Hui-min; Ma, Xiao-wei; Wang, Song-biao; Yao, Quan-sheng; Xu, Wen-tian; Zhou, Yi-gang; Gao, Zhong-shan; Zhan, Ru-lin

    2014-06-13

    Here we used Illumina RNA-seq technology for transcriptome sequencing of a mixed fruit sample from 'Zill' mango (Mangifera indica Linn) fruit pericarp and pulp during the development and ripening stages. RNA-seq generated 68,419,722 sequence reads that were assembled into 54,207 transcripts with a mean length of 858 bp, including 26,413 clusters and 27,794 singletons. A total of 42,515 (78.43%) transcripts were annotated using public protein databases, with a cut-off E-value above 10^-5, of which 35,198 and 14,619 transcripts were assigned to gene ontology terms and clusters of orthologous groups, respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes database identified 23,741 (43.79%) transcripts, which were mapped to 128 pathways. These pathways revealed many previously unknown transcripts. We also applied mass spectrometry-based transcriptome data to characterize the proteome of ripe fruit. LC-MS/MS analysis of the mango fruit proteome was performed using tandem mass spectrometry (MS/MS) in an LTQ Orbitrap Velos (Thermo) coupled online to the HPLC. This approach enabled the identification of 7536 peptides that matched 2754 proteins. Our study provides a comprehensive sequence for a systemic view of transcriptome during mango fruit development and the most comprehensive fruit proteome to date, which are useful for further genomics research and proteomic studies. Our study provides a comprehensive sequence for a systemic view of both the transcriptome and proteome of mango fruit, and a valuable reference for further research on gene expression and protein identification. This article is part of a Special Issue entitled: Proteomics of non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.

  3. De novo sequencing and analysis of the transcriptome during the browning of fresh-cut Luffa cylindrica 'Fusi-3' fruits.

    Directory of Open Access Journals (Sweden)

    Haisheng Zhu

    Fresh-cut luffa (Luffa cylindrica) fruits commonly undergo browning. However, little is known about the molecular mechanisms regulating this process. We used the RNA-seq technique to analyze the transcriptomic changes occurring during the browning of fresh-cut fruits from luffa cultivar 'Fusi-3'. Over 90 million high-quality reads were assembled into 58,073 Unigenes, and 60.86% of these were annotated based on sequences in four public databases. We detected 35,282 Unigenes with significant hits to sequences in the NCBInr database, and 24,427 Unigenes encoded proteins with sequences that were similar to those of known proteins in the Swiss-Prot database. Additionally, 20,546 and 13,021 Unigenes were similar to existing sequences in the Eukaryotic Orthologous Groups of proteins and Kyoto Encyclopedia of Genes and Genomes databases, respectively. Furthermore, 27,301 Unigenes were differentially expressed during the browning of fresh-cut luffa fruits (i.e., after 1-6 h). Moreover, 11 genes from five gene families (i.e., PPO, PAL, POD, CAT, and SOD) identified as potentially associated with enzymatic browning as well as four WRKY transcription factors were observed to be differentially regulated in fresh-cut luffa fruits. With the assistance of rapid amplification of cDNA ends technology, we obtained the full-length sequences of the 15 Unigenes. We also confirmed these Unigenes were expressed by quantitative real-time polymerase chain reaction analysis. This study provides a comprehensive transcriptome sequence resource, and may facilitate further studies aimed at identifying genes affecting luffa fruit browning for the exploitation of the underlying mechanism.

  4. De novo sequencing and analysis of the transcriptome during the browning of fresh-cut Luffa cylindrica 'Fusi-3' fruits

    Science.gov (United States)

    Chen, Mindong; Wang, Bin; Zhang, Qianrong; Xue, Zhuzheng

    2017-01-01

    Fresh-cut luffa (Luffa cylindrica) fruits commonly undergo browning. However, little is known about the molecular mechanisms regulating this process. We used the RNA-seq technique to analyze the transcriptomic changes occurring during the browning of fresh-cut fruits from luffa cultivar ‘Fusi-3’. Over 90 million high-quality reads were assembled into 58,073 Unigenes, and 60.86% of these were annotated based on sequences in four public databases. We detected 35,282 Unigenes with significant hits to sequences in the NCBInr database, and 24,427 Unigenes encoded proteins with sequences that were similar to those of known proteins in the Swiss-Prot database. Additionally, 20,546 and 13,021 Unigenes were similar to existing sequences in the Eukaryotic Orthologous Groups of proteins and Kyoto Encyclopedia of Genes and Genomes databases, respectively. Furthermore, 27,301 Unigenes were differentially expressed during the browning of fresh-cut luffa fruits (i.e., after 1–6 h). Moreover, 11 genes from five gene families (i.e., PPO, PAL, POD, CAT, and SOD) identified as potentially associated with enzymatic browning as well as four WRKY transcription factors were observed to be differentially regulated in fresh-cut luffa fruits. With the assistance of rapid amplification of cDNA ends technology, we obtained the full-length sequences of the 15 Unigenes. We also confirmed these Unigenes were expressed by quantitative real-time polymerase chain reaction analysis. This study provides a comprehensive transcriptome sequence resource, and may facilitate further studies aimed at identifying genes affecting luffa fruit browning for the exploitation of the underlying mechanism. PMID:29145430

  5. Workflow and web application for annotating NCBI BioProject transcriptome data.

    Science.gov (United States)

    Vera Alvarez, Roberto; Medeiros Vidal, Newton; Garzón-Martínez, Gina A; Barrero, Luz S; Landsman, David; Mariño-Ramírez, Leonardo

    2017-01-01

    The volume of transcriptome data is growing exponentially due to rapid improvement of experimental technologies. In response, large central resources such as those of the National Center for Biotechnology Information (NCBI) are continually adapting their computational infrastructure to accommodate this large influx of data. New and specialized databases, such as Transcriptome Shotgun Assembly Sequence Database (TSA) and Sequence Read Archive (SRA), have been created to aid the development and expansion of centralized repositories. Although the central resource databases are under continual development, they do not include automatic pipelines to increase annotation of newly deposited data. Therefore, third-party applications are required to achieve that aim. Here, we present an automatic workflow and web application for the annotation of transcriptome data. The workflow creates secondary data such as sequencing reads and BLAST alignments, which are available through the web application. They are based on freely available bioinformatics tools and scripts developed in-house. The interactive web application provides a search engine and several browser utilities. Graphical views of transcript alignments are available through SeqViewer, an embedded tool developed by NCBI for viewing biological sequence data. The web application is tightly integrated with other NCBI web applications and tools to extend the functionality of data processing and interconnectivity. We present a case study for the species Physalis peruviana with data generated from BioProject ID 67621. URL: http://www.ncbi.nlm.nih.gov/projects/physalis/. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.

  6. KONAGAbase: a genomic and transcriptomic database for the diamondback moth, Plutella xylostella

    OpenAIRE

    Jouraku, Akiya; Yamamoto, Kimiko; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Narukawa, Junko; Miyamoto, Kazuhisa; Kurita, Kanako; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Noda, Hiroaki

    2013-01-01

    Background The diamondback moth (DBM), Plutella xylostella, is one of the most harmful insect pests for crucifer crops worldwide. DBM has rapidly evolved high resistance to most conventional insecticides such as pyrethroids, organophosphates, fipronil, spinosad, Bacillus thuringiensis, and diamides. Therefore, it is important to develop genomic and transcriptomic DBM resources for analysis of genes related to insecticide resistance, both to clarify the mechanism of resistance of DBM and to fa...

  7. Statistical Processing Algorithms for Human Population Databases

    Directory of Open Access Journals (Sweden)

    Camelia COLESCU

    2012-01-01

    The article describes some algorithms for statistical functions applied to a human population database. The samples are specific to the most interesting periods, when the evolution of the statistical data shows spectacular values. The article describes the most useful form of graphical presentation of the results.

  8. BGDB: a database of bivalent genes.

    Science.gov (United States)

    Li, Qingyan; Lian, Shuabin; Dai, Zhiming; Xiang, Qian; Dai, Xianhua

    2013-01-01

    A bivalent gene is a gene marked with both H3K4me3 and H3K27me3 epigenetic modifications in the same region, and is proposed to play a pivotal role in pluripotency in embryonic stem (ES) cells. Identifying these bivalent genes and understanding their functions are important for further research on lineage specification and embryo development. So far, a large amount of genome-wide histone modification data has been generated in mouse and human ES cells. These valuable data make it possible to identify bivalent genes, but no comprehensive data repositories or analysis tools are currently available for bivalent genes. In this work, we develop BGDB, the database of bivalent genes. The database contains 6897 bivalent genes in human and mouse ES cells, which were manually collected from the scientific literature. Each entry contains curated information, including genomic context, sequences, gene ontology and other relevant information. The web services of the BGDB database were implemented with PHP + MySQL + JavaScript, and provide diverse query functions. Database URL: http://dailab.sysu.edu.cn/bgdb/
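
    The definition above (both marks present in the same region) can be illustrated with a minimal sketch. It assumes each gene is represented by a single promoter interval and that peak calls for the two histone marks are already available as intervals; the gene names, coordinates and function names are hypothetical and are not taken from BGDB.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive


def overlaps(a: Interval, b: Interval) -> bool:
    """True if the two intervals share at least one base on the same chromosome."""
    return a.chrom == b.chrom and a.start < b.end and b.start < a.end


def bivalent_genes(promoters, k4me3_peaks, k27me3_peaks):
    """Return genes whose promoter overlaps at least one peak of each mark."""
    result = []
    for gene, region in promoters.items():
        has_k4 = any(overlaps(region, peak) for peak in k4me3_peaks)
        has_k27 = any(overlaps(region, peak) for peak in k27me3_peaks)
        if has_k4 and has_k27:
            result.append(gene)
    return sorted(result)


# Hypothetical toy data, for illustration only.
promoters = {
    "GENE_A": Interval("chr1", 1000, 3000),
    "GENE_B": Interval("chr1", 10000, 12000),
    "GENE_C": Interval("chr2", 500, 2500),
}
k4me3 = [Interval("chr1", 1500, 2000), Interval("chr1", 10500, 11000)]
k27me3 = [Interval("chr1", 2500, 3500), Interval("chr2", 600, 900)]

print(bivalent_genes(promoters, k4me3, k27me3))
```

    In this toy data only GENE_A carries both marks; GENE_B has H3K4me3 only and GENE_C has H3K27me3 only.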

  9. hSAGEing: an improved SAGE-based software for identification of human tissue-specific or common tumor markers and suppressors.

    Science.gov (United States)

    Yang, Cheng-Hong; Chuang, Li-Yeh; Shih, Tsung-Mu; Chang, Hsueh-Wei

    2010-12-17

    SAGE (serial analysis of gene expression) is a powerful method of analyzing gene expression for the entire transcriptome. There are currently many well-developed SAGE tools. However, the cross-comparison of different tissues is seldom addressed, thus limiting the identification of common- and tissue-specific tumor markers. To improve the SAGE mining methods, we propose a novel function for cross-tissue comparison of SAGE data by combining the mathematical set theory and logic with a unique "multi-pool method" that analyzes multiple pools of pair-wise case controls individually. When all the settings are in "inclusion", the common SAGE tag sequences are mined. When one tissue type is in "inclusion" and the other types of tissues are not in "inclusion", the selected tissue-specific SAGE tag sequences are generated. They are displayed in tags-per-million (TPM) and fold values, as well as visually displayed in four kinds of scales in a color gradient pattern. In the fold visualization display, the top scores of the SAGE tag sequences are provided, along with cluster plots. A user-defined matrix file is designed for cross-tissue comparison by selecting libraries from publicly available databases or user-defined libraries. The hSAGEing tool provides a combination of friendly cross-tissue analysis and an interface for comparing SAGE libraries for the first time. Some up- or down-regulated genes with tissue-specific or common tumor markers and suppressors are identified computationally. The tool is useful and convenient for in silico cancer transcriptomic studies and is freely available at http://bio.kuas.edu.tw/hSAGEing.
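
    The inclusion logic and the TPM scaling described above can be sketched with plain set operations. This is an illustration under stated assumptions, not the hSAGEing implementation: the tissue names, tag sequences and function names are hypothetical, and the fold values and multi-pool handling are not covered.

```python
def tags_per_million(counts):
    """Scale raw tag counts of one library to tags-per-million (TPM)."""
    total = sum(counts.values())
    return {tag: count * 1_000_000 / total for tag, count in counts.items()}


def mine_tags(tag_sets, included):
    """Tags present in every included tissue and absent from all other tissues.

    With every tissue included this yields the common tags; with a single
    tissue included it yields that tissue's specific tags.
    """
    if not included:
        raise ValueError("at least one tissue must be set to inclusion")
    selected = set.intersection(*(tag_sets[t] for t in included))
    for tissue, tags in tag_sets.items():
        if tissue not in included:
            selected -= tags
    return selected


# Hypothetical toy tag sets per tissue, for illustration only.
tag_sets = {
    "colon": {"AAA", "CCC", "GGG"},
    "lung": {"AAA", "CCC"},
    "breast": {"AAA", "TTT"},
}

common = mine_tags(tag_sets, included={"colon", "lung", "breast"})
colon_specific = mine_tags(tag_sets, included={"colon"})
tpm = tags_per_million({"AAA": 30, "CCC": 10})
print(common, colon_specific, tpm)
```

    Expressing each setting as membership in an "included" set keeps the common and tissue-specific cases in one function.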

  10. Integrated Transcriptomic and Epigenomic Analysis of Primary Human Lung Epithelial Cell Differentiation

    Science.gov (United States)

    Marconett, Crystal N.; Zhou, Beiyun; Rieger, Megan E.; Selamat, Suhaida A.; Dubourd, Mickael; Fang, Xiaohui; Lynch, Sean K.; Stueve, Theresa Ryan; Siegmund, Kimberly D.; Berman, Benjamin P.

    2013-01-01

    Elucidation of the epigenetic basis for cell-type specific gene regulation is key to gaining a full understanding of how the distinct phenotypes of differentiated cells are achieved and maintained. Here we examined how epigenetic changes are integrated with transcriptional activation to determine cell phenotype during differentiation. We performed epigenomic profiling in conjunction with transcriptomic profiling using in vitro differentiation of human primary alveolar epithelial cells (AEC). This model recapitulates an in vivo process in which AEC transition from one differentiated cell type to another during regeneration following lung injury. Interrogation of histone marks over time revealed enrichment of specific transcription factor binding motifs within regions of changing chromatin structure. Cross-referencing of these motifs with pathways showing transcriptional changes revealed known regulatory pathways of distal alveolar differentiation, such as the WNT and transforming growth factor beta (TGFB) pathways, and putative novel regulators of adult AEC differentiation including hepatocyte nuclear factor 4 alpha (HNF4A), and the retinoid X receptor (RXR) signaling pathways. Inhibition of the RXR pathway confirmed its functional relevance for alveolar differentiation. Our incorporation of epigenetic data allowed specific identification of transcription factors that are potential direct upstream regulators of the differentiation process, demonstrating the power of this approach. Integration of epigenomic data with transcriptomic profiling has broad application for the identification of regulatory pathways in other models of differentiation. PMID:23818859

  11. De novo transcriptome assembly associated with fumonisin production by the rice pathogen Fusarium fujikuroi

    Directory of Open Access Journals (Sweden)

    Keerthi S. Guruge

    2018-06-01

    The present study employed a next-generation sequencing method to assemble a de novo transcriptome database designed to distinguish gene expression changes exhibited by the fumonisin-producing fungus Fusarium fujikuroi when grown under ‘fumonisin-producing’ compared to ‘non-fumonisin-producing’ conditions. The raw data of this study have been deposited at DNA Data Bank of Japan (DDBJ) under the accession ID DRA006146. Keywords: Fusarium fujikuroi, Fumonisin, Next-generation sequencing, Transcriptome, Gene-expression

  12. Critical assessment of human metabolic pathway databases: a stepping stone for future integration

    Directory of Open Access Journals (Sweden)

    Stobbe Miranda D

    2011-10-01

    Background: Multiple pathway databases are available that describe the human metabolic network and have proven their usefulness in many applications, ranging from the analysis and interpretation of high-throughput data to their use as a reference repository. However, so far the various human metabolic networks described by these databases have not been systematically compared and contrasted, nor has the extent to which they differ been quantified. For a researcher using these databases for particular analyses of human metabolism, it is crucial to know the extent of the differences in content and their underlying causes. Moreover, the outcomes of such a comparison are important for ongoing integration efforts. Results: We compared the genes, EC numbers and reactions of five frequently used human metabolic pathway databases. The overlap is surprisingly low, especially on reaction level, where the databases agree on 3% of the 6968 reactions they have combined. Even for the well-established tricarboxylic acid cycle the databases agree on only 5 out of the 30 reactions in total. We identified the main causes for the lack of overlap. Importantly, the databases are partly complementary. Other explanations include the number of steps a conversion is described in and the number of possible alternative substrates listed. Missing metabolite identifiers and ambiguous names for metabolites also affect the comparison. Conclusions: Our results show that each of the five networks compared provides us with a valuable piece of the puzzle of the complete reconstruction of the human metabolic network. To enable integration of the networks, next to a need for standardizing the metabolite names and identifiers, the conceptual differences between the databases should be resolved. Considerable manual intervention is required to reach the ultimate goal of a unified and biologically accurate model for studying the systems biology of human metabolism. Our comparison...
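
    The agreement figure quoted above (reactions shared by all databases as a share of the combined set) can be sketched as follows. This assumes reactions have already been matched to common identifiers, which the abstract identifies as the hard part; the database names and reaction identifiers are hypothetical.

```python
def consensus_fraction(databases):
    """Share of the combined entries that every database contains."""
    sets = list(databases.values())
    combined = set().union(*sets)
    if not combined:
        raise ValueError("no entries to compare")
    agreed = set.intersection(*sets)
    return len(agreed) / len(combined)


# Hypothetical toy reaction sets, for illustration only.
toy_databases = {
    "db1": {"R1", "R2", "R3"},
    "db2": {"R1", "R3", "R4"},
    "db3": {"R1", "R5"},
}

print(consensus_fraction(toy_databases))
```

    In the toy data one of five combined reactions is shared by all three databases, so the consensus fraction is 0.2.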

  13. Transcriptome analysis of duck liver and identification of differentially expressed transcripts in response to duck hepatitis A virus genotype C infection.

    Science.gov (United States)

    Tang, Cheng; Lan, Daoliang; Zhang, Huanrong; Ma, Jing; Yue, Hua

    2013-01-01

    Duck is an economically important poultry and animal model for human viral hepatitis B. However, the molecular mechanisms underlying host-virus interaction remain unclear because of limited information on the duck genome. This study aims to characterize the duck normal liver transcriptome and to identify the differentially expressed transcripts at 24 h after duck hepatitis A virus genotype C (DHAV-C) infection using Illumina-Solexa sequencing. After removal of low-quality sequences and assembly, a total of 52,757 unigenes was obtained from the normal liver group. Further blast analysis showed that 18,918 unigenes successfully matched the known genes in the database. GO analysis revealed that 25,116 unigenes took part in 61 categories of biological processes, cellular components, and molecular functions. Among the 25 clusters of orthologous group categories (COG), the cluster for "General function prediction only" represented the largest group, followed by "Transcription" and "Replication, recombination, and repair." KEGG analysis showed that 17,628 unigenes were involved in 301 pathways. Through comparison of normal and infected transcriptome data, we identified 20 significantly differentially expressed unigenes, which were further confirmed by real-time polymerase chain reaction. Of the 20 unigenes, nine matched the known genes in the database, including three up-regulated genes (virus replicase polyprotein, LRRC3B, and PCK1) and six down-regulated genes (CRP, AICL-like 2, L1CAM, CYB26A1, CHAC1, and ADAM32). The remaining 11 novel unigenes that did not match any known genes in the database may provide a basis for the discovery of new transcripts associated with infection. This study provided a gene expression pattern for normal duck liver and for the previously unrecognized changes in gene transcription that are altered during DHAV-C infection. Our data revealed useful information for future studies on the duck genome and provided new insights into the molecular...

  14. De novo assembly, characterization and functional annotation of pineapple fruit transcriptome through massively parallel sequencing.

    Science.gov (United States)

    Ong, Wen Dee; Voo, Lok-Yung Christopher; Kumar, Vijay Subbiah

    2012-01-01

    Pineapple (Ananas comosus var. comosus) is an important tropical non-climacteric fruit with high commercial potential. Understanding the mechanism and processes underlying fruit ripening would enable scientists to improve quality traits such as flavor, texture, appearance and fruit sweetness. Although the pineapple is an important fruit, there is insufficient transcriptomic or genomic information available in public databases. Application of high throughput transcriptome sequencing to profile the pineapple fruit transcripts is therefore needed. To facilitate this, we have performed transcriptome sequencing of ripe yellow pineapple fruit flesh using Illumina technology. About 4.7 million Illumina paired-end reads were generated and assembled using the Velvet de novo assembler. The assembly produced 28,728 unique transcripts with a mean length of approximately 200 bp. Sequence similarity search against the non-redundant NCBI database identified a total of 16,932 unique transcripts (58.93%) with significant hits. Out of these, 15,507 unique transcripts were assigned to gene ontology terms. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes pathway database identified 13,598 unique transcripts (47.33%), which were mapped to 126 pathways. The assembly revealed many transcripts that were previously unknown. The unique transcripts derived from this work have rapidly increased the number of pineapple fruit mRNA transcripts now available in public databases. This information can be further utilized in gene expression, genomics and other functional genomics studies in pineapple.

  15. Desiccation tolerance in bryophytes: The dehydration and rehydration transcriptomes in the desiccation-tolerant bryophyte Bryum argenteum.

    Science.gov (United States)

    Gao, Bei; Li, Xiaoshuang; Zhang, Daoyuan; Liang, Yuqing; Yang, Honglan; Chen, Moxian; Zhang, Yuanming; Zhang, Jianhua; Wood, Andrew J

    2017-08-08

    The desiccation-tolerant bryophyte Bryum argenteum is an important component of desert biological soil crusts (BSCs) and is emerging as a model system for studying vegetative desiccation tolerance. Here we present and analyze the hydration-dehydration-rehydration transcriptomes in B. argenteum to establish a desiccation-tolerance transcriptomic atlas. B. argenteum gametophores representing five different hydration stages (hydrated (H0), dehydrated for 2 h (D2), 24 h (D24), then rehydrated for 2 h (R2) and 48 h (R48)), were sampled for transcriptome analyses. Illumina high throughput RNA-Seq technology was employed and generated more than 488.46 million reads. An in-house de novo transcriptome assembly optimization pipeline based on the Trinity assembler was developed to obtain a reference Hydration-Dehydration-Rehydration (H-D-R) transcriptome comprising 76,206 transcripts, with an N50 of 2,016 bp and an average length of 1,222 bp. Comprehensive transcription factor (TF) annotation discovered 978 TFs in 62 families, among which 404 TFs within 40 families were differentially expressed upon dehydration-rehydration. Pfam term enrichment analysis revealed 172 protein families/domains were significantly associated with the H-D-R cycle and confirmed early rehydration (i.e. the R2 stage) as exhibiting the maximum stress-induced changes in gene expression.
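
    The N50 and average length reported for assemblies such as this one follow the usual definitions: N50 is the length of the shortest transcript in the smallest set of longest transcripts that together cover at least half of the assembled bases. A minimal sketch of that calculation, using hypothetical lengths rather than any data from the study:

```python
def n50(lengths):
    """Length L such that transcripts of length >= L hold at least half of all bases."""
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def mean_length(lengths):
    """Arithmetic mean of the sequence lengths."""
    return sum(lengths) / len(lengths)


# Hypothetical transcript lengths in bp, for illustration only.
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths), mean_length(toy_lengths))
```

    Because N50 is weighted by length, it normally exceeds the mean when the assembly contains many short transcripts, as in the figures quoted above.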

  16. BISQUE: locus- and variant-specific conversion of genomic, transcriptomic and proteomic database identifiers.

    Science.gov (United States)

    Meyer, Michael J; Geske, Philip; Yu, Haiyuan

    2016-05-15

    Biological sequence databases are integral to efforts to characterize and understand biological molecules and share biological data. However, when analyzing these data, scientists are often left holding disparate biological currency: molecular identifiers from different databases. For downstream applications that require converting the identifiers themselves, there are many resources available, but analyzing associated loci and variants can be cumbersome if data is not given in a form amenable to particular analyses. Here we present BISQUE, a web server and customizable command-line tool for converting molecular identifiers and their contained loci and variants between different database conventions. BISQUE uses a graph traversal algorithm to generalize the conversion process for residues in the human genome, genes, transcripts and proteins, allowing for conversion across classes of molecules and in all directions through an intuitive web interface and a URL-based web service. BISQUE is freely available via the web using any major web browser (http://bisque.yulab.org/). Source code is available in a public GitHub repository (https://github.com/hyulab/BISQUE). Contact: haiyuan.yu@cornell.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
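
    The idea of converting identifiers by graph traversal can be sketched as a breadth-first search over identifier types, chaining the direct mapping tables along the path found. This is an illustration of the general technique only, not BISQUE's algorithm or data: the mapping tables, identifiers and function names are hypothetical, and locus or variant conversion is not covered.

```python
from collections import deque

# Hypothetical direct mappings between identifier types, for illustration only.
MAPPINGS = {
    ("gene", "transcript"): {"GENE1": ["TX1", "TX2"]},
    ("transcript", "protein"): {"TX1": ["PROT1"], "TX2": ["PROT2"]},
}


def build_graph(mappings):
    """Build a graph of identifier types with lookup tables in both directions."""
    graph = {}
    for (src, dst), table in mappings.items():
        graph.setdefault(src, {})[dst] = table
        reverse = {}
        for key, values in table.items():
            for value in values:
                reverse.setdefault(value, []).append(key)
        graph.setdefault(dst, {})[src] = reverse
    return graph


def find_path(graph, source_type, target_type):
    """Breadth-first search for the shortest chain of identifier types."""
    queue = deque([[source_type]])
    seen = {source_type}
    while queue:
        path = queue.popleft()
        if path[-1] == target_type:
            return path
        for nxt in graph.get(path[-1], {}):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None


def convert(graph, identifier, source_type, target_type):
    """Follow the type path, applying each lookup table in turn."""
    path = find_path(graph, source_type, target_type)
    if path is None:
        return []
    current = {identifier}
    for a, b in zip(path, path[1:]):
        table = graph[a][b]
        current = {out for ident in current for out in table.get(ident, [])}
    return sorted(current)


graph = build_graph(MAPPINGS)
print(convert(graph, "GENE1", "gene", "protein"))
print(convert(graph, "PROT2", "protein", "gene"))
```

    Storing each table in both directions is what lets a single traversal handle conversion "in all directions" between molecule classes.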

  17. De Novo Assembly and Characterization of the Transcriptome of Grasshopper Shirakiacris shirakii

    Directory of Open Access Journals (Sweden)

    Zhongying Qiu

    2016-07-01

    Background: The grasshopper Shirakiacris shirakii is an important agricultural pest and feeds mainly on gramineous plants, thereby causing economic damage to a wide range of crops. However, genomic information on this species is extremely limited thus far, and transcriptome data relevant to insecticide resistance and pest control are also not available. Methods: The transcriptome of S. shirakii was sequenced using the Illumina HiSeq platform, and we de novo assembled the transcriptome. Results: Its sequencing produced a total of 105,408,878 clean reads, and the de novo assembly revealed 74,657 unigenes with an average length of 680 bp and N50 of 1057 bp. A total of 28,173 unigenes were annotated for the NCBI non-redundant protein sequences (Nr), NCBI non-redundant nucleotide sequences (Nt), a manually-annotated and reviewed protein sequence database (Swiss-Prot), Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Based on the Nr annotation results, we manually identified 79 unigenes encoding cytochrome P450 monooxygenases (P450s), 36 unigenes encoding carboxylesterases (CarEs) and 36 unigenes encoding glutathione S-transferases (GSTs) in S. shirakii. Core RNAi components relevant to microRNA, siRNA and piRNA pathways, including Pasha, Loquacious, Argonaute-1, Argonaute-2, Argonaute-3, Zucchini, Aubergine, enhanced RNAi-1 and Piwi, were expressed in S. shirakii. We also identified five unigenes that were homologous to the Sid-1 gene. In addition, the analysis of differential gene expressions revealed that a total of 19,764 unigenes were up-regulated and 4185 unigenes were down-regulated in larvae. In total, we predicted 7504 simple sequence repeats (SSRs) from 74,657 unigenes. Conclusions: The comprehensive de novo transcriptomic data of S. shirakii will offer a series of valuable molecular resources for better studying insecticide resistance, RNAi and molecular marker discovery in the transcriptome.

  18. Sequencing and de novo transcriptome assembly of the Chinese giant salamander (Andrias davidianus)

    Directory of Open Access Journals (Sweden)

    Yong Huang

    2017-06-01

    Next-generation technologies for determining genomic and transcriptomic composition have a wide range of applications. Andrias davidianus, a salamander endemic to China, has become an endangered amphibian species. However, molecular information on this species is lacking. In this study, we obtained RNA-Seq data from a pool of A. davidianus tissues, including spleen, liver, muscle, kidney, skin, testis, gut and heart, using the Illumina HiSeq 2500 platform. A total of 15,398,997,600 bp were obtained, corresponding to 102,659,984 raw reads. A total of 102,659,984 reads were filtered after removing low-quality reads and trimming the adapter sequences. The Trinity program was used to de novo assemble 132,912 unigenes with an average length of 690 bp and an N50 of 1263 bp. Unigenes were annotated against a number of databases. These transcriptomic data of A. davidianus should open the door to molecular evolution studies based on the entire transcriptome or on targeted genes of interest. The raw data from this study are available in the NCBI SRA database under accession number SRP099564.

  19. A comprehensive aligned nifH gene database: a multipurpose tool for studies of nitrogen-fixing bacteria.

    Science.gov (United States)

    Gaby, John Christian; Buckley, Daniel H

    2014-01-01

    We describe a nitrogenase gene sequence database that facilitates analysis of the evolution and ecology of nitrogen-fixing organisms. The database contains 32 954 aligned nitrogenase nifH sequences linked to phylogenetic trees and associated sequence metadata. The database includes 185 linked multigene entries including full-length nifH, nifD, nifK and 16S ribosomal RNA (rRNA) gene sequences. Evolutionary analyses enabled by the multigene entries support an ancient horizontal transfer of nitrogenase genes between Archaea and Bacteria and provide evidence that nifH has a different history of horizontal gene transfer from the nifDK enzyme core. Further analyses show that lineages in nitrogenase cluster I and cluster III have different rates of substitution within nifD, suggesting that nifD is under different selection pressure in these two lineages. Finally, we find that the genetic divergence of nifH and 16S rRNA genes does not correlate well at sequence dissimilarity values used commonly to define microbial species, as strains having <3% sequence dissimilarity in their 16S rRNA genes can have up to 23% dissimilarity in nifH. The nifH database has a number of uses including phylogenetic and evolutionary analyses, the design and assessment of primers/probes and the evaluation of nitrogenase sequence diversity. Database URL: http://www.css.cornell.edu/faculty/buckley/nifh.htm.

  20. Comparison of Gene Expression in Human Embryonic Stem Cells, hESC-Derived Mesenchymal Stem Cells and Human Mesenchymal Stem Cells.

    Science.gov (United States)

    Barbet, Romain; Peiffer, Isabelle; Hatzfeld, Antoinette; Charbord, Pierre; Hatzfeld, Jacques A

    2011-01-01

    We present a strategy to identify developmental/differentiation and plasma membrane marker genes of the most primitive human Mesenchymal Stem Cells (hMSCs). Using sensitive and quantitative TaqMan Low Density Arrays (TLDA) methodology, we compared the expression of 381 genes in human Embryonic Stem Cells (hESCs), hESC-derived MSCs (hES-MSCs), and hMSCs. Analysis of differentiation genes indicated that hES-MSCs express the sarcomeric muscle lineage in addition to the classical mesenchymal lineages, suggesting they are more primitive than hMSCs. Transcript analysis of membrane antigens suggests that IL1R1(low), BMPR1B(low), FLT4(low), LRRC32(low), and CD34 may be good candidates for the detection and isolation of the most primitive hMSCs. The expression in hMSCs of cytokine genes, such as IL6, IL8, or FLT3LG, without expression of the corresponding receptor, suggests a role for these cytokines in the paracrine control of stem cell niches. Our database may be shared with other laboratories in order to explore the considerable clinical potential of hES-MSCs, which appear to represent an intermediate developmental stage between hESCs and hMSCs.

  1. An in-depth comparison of the porcine, murine and human inflammasomes; lessons from the porcine genome and transcriptome.

    Science.gov (United States)

    Dawson, Harry D; Smith, Allen D; Chen, Celine; Urban, Joseph F

    2017-04-01

    Emerging evidence suggests that swine are a scientifically acceptable intermediate species between rodents and humans to model immune function relevant to humans. The swine genome has recently been sequenced and several preliminary structural and functional analyses of the porcine immunome have been published. Herein we provide an expanded in silico analysis using an improved assembly of the porcine transcriptome that provides an in-depth analysis of genes that are related to inflammasomes, responses to Toll-like receptor ligands, and M1 macrophage polarization and Escherichia coli as a model organism. Comparisons of the expansion or contraction of orthologous gene families indicated more similar rates and classes of genes in humans and pigs than in mice; however, several novel porcine or artiodactyl-specific paralogs or pseudogenes were identified. Conservation of homology and structural motifs of orthologs revealed that the overall similarity to human proteins was significantly higher for pigs than for mice. Despite these similarities, two out of four canonical inflammasome pathways, Absent in melanoma 2 (AIM2) and NLR family and CARD domain containing 4 (NLRC4), were found to be missing in pigs. Pig M1 Mφ polarization in response to interferon-γ (IFN-γ) and lipopolysaccharide (LPS) was assessed, via the transcriptome, using next generation sequencing. Our analysis revealed predominantly human-like responses; however, some mouse-like responses were observed, as well as induction of numerous pig or artiodactyl-specific genes. This work supports using swine to model both human immunological and inflammatory responses to infection. However, caution must be exercised as pigs differ from humans in several fundamental pathways. Published by Elsevier B.V.

  2. Transcriptome alterations in zebrafish embryos after exposure to environmental estrogens and anti-androgens can reveal endocrine disruption.

    Science.gov (United States)

    Schiller, Viktoria; Wichmann, Arne; Kriehuber, Ralf; Schäfers, Christoph; Fischer, Rainer; Fenske, Martina

    2013-12-01

    Exposure to environmental chemicals known as endocrine disruptors (EDs) is in many cases associated with an unpredictable hazard for wildlife and human health. The identification of endocrine disruptive properties of chemicals certain to enter the aquatic environment relies on toxicity tests with fish, assessing adverse effects on reproduction and sexual development. The demand for quick, reliable ED assays favored the use of fish embryos as alternative test organisms. We investigated the application of a transcriptomics-based assay for estrogenic and anti-androgenic chemicals with zebrafish embryos. Two reference compounds, 17α-ethinylestradiol and flutamide, were tested to evaluate the effects on development and the transcriptome after 48 h exposures. Comparison of the transcriptome response with other estrogenic and anti-androgenic compounds (genistein, bisphenol A, methylparaben, linuron, prochloraz, propanil) showed commonalities and differences in regulated pathways, enabling us to classify the estrogenic and anti-androgenic potencies. This demonstrates that different mechanisms of ED can already be assessed in fish embryos. Copyright © 2013 Elsevier Inc. All rights reserved.

  3. A systematically improved high quality genome and transcriptome of the human blood fluke Schistosoma mansoni.

    Directory of Open Access Journals (Sweden)

    Anna V Protasio

    2012-01-01

    Schistosomiasis is one of the most prevalent parasitic diseases, affecting millions of people in developing countries. Amongst the human-infective species, Schistosoma mansoni is also the most commonly used in the laboratory and here we present the systematic improvement of its draft genome. We used Sanger capillary and deep-coverage Illumina sequencing from clonal worms to upgrade the highly fragmented draft 380 Mb genome to one with only 885 scaffolds and more than 81% of the bases organised into chromosomes. We have also used transcriptome sequencing (RNA-seq) from four time points in the parasite's life cycle to refine gene predictions and profile their expression. More than 45% of predicted genes have been extensively modified and the total number has been reduced from 11,807 to 10,852. Using the new version of the genome, we identified trans-splicing events occurring in at least 11% of genes and identified clear cases where it is used to resolve polycistronic transcripts. We have produced a high-resolution map of temporal changes in expression for 9,535 genes, covering an unprecedented dynamic range for this organism. All of these data have been consolidated into a searchable format within the GeneDB (www.genedb.org) and SchistoDB (www.schistodb.net) databases. With further transcriptional profiling and genome sequencing increasingly accessible, the upgraded genome will form a fundamental dataset to underpin further advances in schistosome research.

  4. Human Variome Project Quality Assessment Criteria for Variation Databases.

    Science.gov (United States)

    Vihinen, Mauno; Hancock, John M; Maglott, Donna R; Landrum, Melissa J; Schaafsma, Gerard C P; Taschner, Peter

    2016-06-01

    Numerous databases containing information about DNA, RNA, and protein variations are available. Gene-specific variant databases (locus-specific variation databases, LSDBs) are typically curated and maintained for single genes or groups of genes for a certain disease(s). These databases are widely considered as the most reliable information source for a particular gene/protein/disease, but it should also be made clear they may have widely varying contents, infrastructure, and quality. Quality is very important to evaluate because these databases may affect health decision-making, research, and clinical practice. The Human Variome Project (HVP) established a Working Group for Variant Database Quality Assessment. The basic principle was to develop a simple system that nevertheless provides a good overview of the quality of a database. The HVP quality evaluation criteria that resulted are divided into four main components: data quality, technical quality, accessibility, and timeliness. This report elaborates on the developed quality criteria and how implementation of the quality scheme can be achieved. Examples are provided for the current status of the quality items in two different databases, BTKbase, an LSDB, and ClinVar, a central archive of submissions about variants and their clinical significance. © 2016 WILEY PERIODICALS, INC.

  5. Transcriptomic Analysis and Meta-Analysis of Human Granulosa and Cumulus Cells.

    Directory of Open Access Journals (Sweden)

    Tanja Burnik Papler

    Specific gene expression in oocytes and their surrounding cumulus (CC) and granulosa (GC) cells is needed for successful folliculogenesis and oocyte maturation. The aim of the present study was to compare genome-wide gene expression and biological functions of human GC and CC. Individual GC and CC were derived from 37 women undergoing IVF procedures. Gene expression analysis was performed using microarrays, followed by a meta-analysis. Results were validated using quantitative real-time PCR. There were 6029 differentially expressed genes (q < 10^-4), of which 650 genes had a log2 FC ≥ 2. After the meta-analysis there were 3156 differentially expressed genes. Among these were genes that have not previously been reported in human somatic follicular cells, such as prokineticin 2 (PROK2), more highly expressed in GC, and pregnancy up-regulated nonubiquitous CaM kinase (PNCK), more highly expressed in CC. Pathways such as inflammatory response and angiogenesis were enriched in GC, whereas in CC, cell differentiation and multicellular organismal development were among the enriched pathways. In conclusion, the transcriptomes of GC and CC, as well as their biological functions, are distinctive for each cell subpopulation. By describing novel genes such as PROK2 and PNCK, expressed in GC and CC, we upgraded the existing data on human follicular biology.
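    The two thresholds quoted in the record above (a q-value cutoff and a log2 fold-change cutoff) can be combined into a simple filter. The sketch below is only an illustration under stated assumptions: the gene names and expression values are invented, the fold change is taken as the ratio of two mean expression values, and the cutoff is applied to the absolute log2 fold change, which the abstract does not specify.

```python
from math import log2


def is_strongly_changed(expr_a, expr_b, q_value,
                        q_cutoff=1e-4, min_abs_log2fc=2.0):
    """Return True if a gene passes both the q-value and log2 FC cutoffs.

    expr_a and expr_b are mean expression values in two cell types;
    applying the cutoff to the absolute log2 ratio is an assumption.
    """
    log2fc = log2(expr_a / expr_b)
    return q_value < q_cutoff and abs(log2fc) >= min_abs_log2fc


# Hypothetical records: name -> (expression in A, expression in B, q-value).
toy_genes = {
    "gene_a": (400.0, 100.0, 1e-6),  # log2 FC = 2, significant
    "gene_b": (150.0, 100.0, 1e-6),  # significant, but small change
    "gene_c": (25.0, 100.0, 1e-5),   # log2 FC = -2, significant
    "gene_d": (800.0, 100.0, 0.01),  # large change, not significant
}

selected = sorted(
    name for name, (a, b, q) in toy_genes.items()
    if is_strongly_changed(a, b, q)
)
print(selected)
```

    Real microarray pipelines estimate fold changes and q-values with dedicated statistical models; this filter only shows how the two reported cutoffs interact.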

  6. Deep Insight into the Ganoderma lucidum by Comprehensive Analysis of Its Transcriptome

    Science.gov (United States)

    Yu, Guo-Jun; Wang, Man; Huang, Jie; Yin, Ya-Lin; Chen, Yi-Jie; Jiang, Shuai; Jin, Yan-Xia; Lan, Xian-Qing; Wong, Barry Hon Cheung; Liang, Yi; Sun, Hui

    2012-01-01

    Background: Ganoderma lucidum is a basidiomycete white rot fungus and is of medicinal importance in China, Japan and other countries in the Asiatic region. To date, much research has been performed in identifying the medicinal ingredients in Ganoderma lucidum. Despite its important therapeutic effects in disease, little is known about Ganoderma lucidum at the genomic level. In order to gain a molecular understanding of this fungus, we utilized Illumina high-throughput technology to sequence and analyze the transcriptome of Ganoderma lucidum. Methodology/Principal Findings: We obtained 6,439,690 and 6,416,670 high-quality reads from the mycelium and fruiting body of Ganoderma lucidum, and these were assembled to form 18,892 and 27,408 unigenes, respectively. A similarity search was performed against the NCBI non-redundant nucleotide database and a customized database composed of five fungal genomes. 11,098 and 8,775 unigenes were matched to the NCBI non-redundant nucleotide database and our customized database, respectively. All unigenes were subjected to annotation by Gene Ontology, Eukaryotic Orthologous Group terms and Kyoto Encyclopedia of Genes and Genomes. Differentially expressed genes from the Ganoderma lucidum mycelium and fruiting body stage were analyzed, resulting in the identification of 13 unigenes which are involved in the terpenoid backbone biosynthesis pathway. Quantitative real-time PCR was used to confirm the expression levels of these unigenes. Ganoderma lucidum was also studied for wood degrading activity and a total of 22 putative FOLymes (fungal oxidative lignin enzymes) and 120 CAZymes (carbohydrate-active enzymes) were predicted from our Ganoderma lucidum transcriptome. Conclusions: Our study provides comprehensive gene expression information on Ganoderma lucidum at the transcriptional level, which will form the foundation for functional genomics studies in this fungus.
The use of Illumina sequencing technology has made de novo transcriptome

  7. Transcriptome sequencing and De Novo analysis of Youngia japonica using the illumina platform.

    Directory of Open Access Journals (Sweden)

    Yulan Peng

    Youngia japonica, a weed species distributed worldwide, has been widely used in traditional Chinese medicine. It is an ideal plant for studying the evolution of Asteraceae plants because of its short life history and abundant source. However, little is known about its evolution and genetic diversity. In this study, de novo transcriptome sequencing was conducted for the first time for the comprehensive analysis of the genetic diversity of Y. japonica. The Y. japonica transcriptome was sequenced using Illumina paired-end sequencing technology. We produced 21,847,909 high-quality reads for Y. japonica and assembled them into contigs. A total of 51,850 unigenes were identified, among which 46,087 were annotated in the NCBI non-redundant protein database and 41,752 were annotated in the Swiss-Prot database. We mapped 9,125 unigenes onto 163 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database. In addition, 3,648 simple sequence repeats (SSRs) were detected. Our data provide the most comprehensive transcriptome resource currently available for Y. japonica. C4 photosynthesis unigenes were found in the biological process of Y. japonica. There were 5596 unigenes related to defense response and 1344 unigenes related to signal transduction mechanisms (10.95%). These data provide insights into the genetic diversity of Y. japonica. Numerous SSRs contributed to the development of novel markers. These data may serve as a new valuable resource for genomic studies on Youngia and, more generally, Cichoraceae.

  8. Global transcriptome analysis of Huperzia serrata and identification of critical genes involved in the biosynthesis of huperzine A.

    Science.gov (United States)

    Yang, Mengquan; You, Wenjing; Wu, Shiwen; Fan, Zhen; Xu, Baofu; Zhu, Mulan; Li, Xuan; Xiao, Youli

    2017-03-22

    Huperzia serrata (H. serrata) is an economically important traditional Chinese herb with notable medicinal value. As a representative member of the Lycopodiaceae family, H. serrata produces various types of bioactive lycopodium alkaloids, especially huperzine A (HupA), which is a promising drug for Alzheimer's disease. Despite their medicinal importance, public genomic and transcriptomic resources are very limited and the biosynthesis of HupA is largely unknown. Previous studies comparing 454-ESTs from H. serrata and Phlegmariurus carinatus predicted putative genes involved in lycopodium alkaloid biosynthesis, such as lysine decarboxylase like (LDC-like) protein and some CYP450s. However, these gene annotations were not followed up with further biochemical characterization. To understand the biosynthesis of HupA and its regulation in H. serrata, a global transcriptome analysis of H. serrata tissues was performed. In this study, we used the Illumina HiSeq 4000 platform to generate a substantial RNA sequencing dataset of H. serrata. A total of 40.1 Gb clean data was generated from four different tissues (root, stem, leaf, and sporangia) and assembled into 181,141 unigenes. The total length, average length, N50 and GC content of unigenes were 219,520,611 bp, 1,211 bp, 2,488 bp and 42.51%, respectively. Among them, 105,516 unigenes (58.25%) were annotated by seven public databases (NR, NT, Swiss-Prot, KEGG, COG, Interpro, GO), and 54 GO terms and 3,391 transcription factors (TFs) were functionally classified, respectively. KEGG pathway analysis revealed that 72,230 unigenes were classified into 21 functional pathways. Three types of candidate enzymes, LDC, CAO and PKS, responsible for the biosynthesis of precursors of HupA were all identified in the transcripts. Four hundred and fifty-seven CYP450 genes in H. serrata were also analyzed and compared with tissue-specific gene expression. Moreover, two key classes of CYP450 genes BBE

  9. De Novo Assembly and Characterization of Fruit Transcriptome in Black Pepper (Piper nigrum).

    Science.gov (United States)

    Hu, Lisong; Hao, Chaoyun; Fan, Rui; Wu, Baoduo; Tan, Lehe; Wu, Huasong

    2015-01-01

    Black pepper is one of the most popular and oldest spices in the world and is valued for its pungent constituent alkaloids. Piperine is the main bioactive compound in pepper alkaloids, which perform unique physiological functions. However, the mechanisms of piperine synthesis are poorly understood. This study is the first to describe the fruit transcriptome of black pepper by sequencing on the Illumina HiSeq 2000 platform. A total of 56,281,710 raw reads were obtained and assembled. From these raw reads, 44,061 unigenes with an average length of 1,345 nt were generated. During functional annotation, 40,537 unigenes were annotated in Gene Ontology categories, Kyoto Encyclopedia of Genes and Genomes pathways, Swiss-Prot database, and Nucleotide Collection (NR/NT) database. In addition, 8,196 simple sequence repeats (SSRs) were detected. In a detailed analysis of the transcriptome, housekeeping genes for quantitative polymerase chain reaction internal control, polymorphic SSRs, and lysine/ornithine metabolism-related genes were identified. These results validated the availability of our database. Our study could provide useful data for further research on piperine synthesis in black pepper.

  10. Transcriptome Analysis and Comparison of Marmota monax and Marmota himalayana.

    Directory of Open Access Journals (Sweden)

    Yanan Liu

    The Eastern woodchuck (Marmota monax) is a classical animal model for studying hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC) in humans. Recently, we found that Marmota himalayana, an Asian animal species closely related to Marmota monax, is susceptible to woodchuck hepatitis virus (WHV) infection and can be used as a new mammalian model for HBV infection. However, the lack of genomic sequence information for both Marmota models strongly limited the breadth and depth of their application. To address this major obstacle of the Marmota models, we utilized Illumina RNA-Seq technology to sequence the cDNA libraries of liver and spleen samples of two Marmota monax and four Marmota himalayana. In total, over 13 billion nucleotide bases were sequenced and approximately 1.5 billion clean reads were obtained. Following assembly, 106,496 consensus sequences of Marmota monax and 78,483 consensus sequences of Marmota himalayana were detected. For functional annotation, in total 73,603 Unigenes of Marmota monax and 78,483 Unigenes of Marmota himalayana were identified using different databases (NR, NT, Swiss-Prot, KEGG, COG, GO). The Unigenes were aligned by blastx to protein databases to determine the coding DNA sequences (CDS) and in total 41,247 CDS of Marmota monax and 34,033 CDS of Marmota himalayana were predicted. The single nucleotide polymorphisms (SNPs) and the simple sequence repeats (SSRs) were also analyzed for all Unigenes obtained. Moreover, a large-scale transcriptome comparison was performed and revealed a high similarity in transcriptome sequences between the two Marmota species. Our study provides an extensive amount of novel sequence information for Marmota monax and Marmota himalayana.
This information may serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the identification and characterization of functional genes that are involved in WHV infection and HCC

  11. Additive Effects of Millimeter Waves and 2-Deoxyglucose Co-Exposure on the Human Keratinocyte Transcriptome.

    Science.gov (United States)

    Soubere Mahamoud, Yonis; Aite, Meziane; Martin, Catherine; Zhadobov, Maxim; Sauleau, Ronan; Le Dréan, Yves; Habauzit, Denis

    2016-01-01

    Millimeter Waves (MMW) will be used in the next generation of high-speed wireless technologies, especially in future Ultra-Broadband small cells in 5G cellular networks. Therefore, their biocompatibilities must be evaluated prior to their massive deployment. Using a microarray-based approach, we analyzed modifications to the whole genome of a human keratinocyte model that was exposed to 60.4 GHz MMW at an incident power density (IPD) of 20 mW/cm^2 for 3 hours in athermic conditions. No keratinocyte transcriptome modifications were observed. We tested the effects of MMWs on cell metabolism by co-treating MMW-exposed cells with a glycolysis inhibitor, 2-deoxyglucose (2dG, 20 mM for 3 hours), and whole genome expression was evaluated along with the ATP content. We found that the 2dG treatment decreased the cellular ATP content and induced a large modification in the transcriptome (632 coding genes). The affected genes were associated with transcriptional repression, cellular communication and endoplasmic reticulum homeostasis. The MMW/2dG co-treatment did not alter the keratinocyte ATP content, but it did slightly alter the transcriptome, which reflected the capacity of MMW to interfere with the bioenergetic stress response. The RT-PCR-based validation confirmed 6 MMW-sensitive genes (SOCS3, SPRY2, TRIB1, FAM46A, CSRNP1 and PPP1R15A) during the 2dG treatment. These 6 genes encoded transcription factors or inhibitors of cytokine pathways, which raised questions regarding the potential impact of long-term or chronic MMW exposure on metabolically stressed cells.

  12. Transcriptome sequencing and positive selected genes analysis of Bombyx mandarina.

    Directory of Open Access Journals (Sweden)

    Tingcai Cheng

    The wild silkworm Bombyx mandarina is widely believed to be an ancestor of the domesticated silkworm, Bombyx mori. Silkworms are often used as a model for studying the mechanism of species domestication. Here, we performed transcriptome sequencing of the wild silkworm using an Illumina HiSeq2000 platform. We produced 100,004,078 high-quality reads and assembled them into 50,773 contigs with an N50 length of 1764 bp and a mean length of 941.62 bp. A total of 33,759 unigenes were identified, with 12,805 annotated in the Nr database, 8273 in the Pfam database, and 9093 in the Swiss-Prot database. Expression profile analysis found significant differential expression of 1308 unigenes between the middle silk gland (MSG) and posterior silk gland (PSG). Three sericin genes (sericin 1, sericin 2, and sericin 3) were expressed specifically in the MSG and three fibroin genes (fibroin-H, fibroin-L, and fibroin/P25) were expressed specifically in the PSG. In addition, 32,297 single-nucleotide polymorphisms (SNPs) and 361 insertion-deletions (INDELs) were detected. Comparison with the domesticated silkworm p50/Dazao identified 5,295 orthologous genes, among which 400 might have experienced or be experiencing positive selection according to Ka/Ks analysis. The data and analyses presented here provide insights into silkworm domestication and an invaluable resource for wild silkworm genomics research.

  13. Developmental gene discovery in a hemimetabolous insect: de novo assembly and annotation of a transcriptome for the cricket Gryllus bimaculatus.

    Directory of Open Access Journals (Sweden)

    Victor Zeng

    Most genomic resources available for insects represent the Holometabola, which are insects that undergo complete metamorphosis like beetles and flies. In contrast, the Hemimetabola (direct-developing insects), representing the basal branches of the insect tree, have very few genomic resources. We have therefore created a large and publicly available transcriptome for the hemimetabolous insect Gryllus bimaculatus (cricket), a well-developed laboratory model organism whose potential for functional genetic experiments is currently limited by the absence of genomic resources. cDNA was prepared using mRNA obtained from adult ovaries containing all stages of oogenesis, and from embryo samples on each day of embryogenesis. Using 454 Titanium pyrosequencing, we sequenced over four million raw reads, and assembled them into 21,512 isotigs (predicted transcripts) and 120,805 singletons with an average coverage per base pair of 51.3. We annotated the transcriptome manually for over 400 conserved genes involved in embryonic patterning, gametogenesis, and signaling pathways. BLAST comparison of the transcriptome against the NCBI non-redundant protein database (nr) identified significant similarity to nr sequences for 55.5% of transcriptome sequences, and suggested that the transcriptome may contain 19,874 unique transcripts. For predicted transcripts without significant similarity to known sequences, we assessed their similarity to other orthopteran sequences, and determined that these transcripts contain recognizable protein domains, largely of unknown function. We created a searchable, web-based database to allow public access to all raw, assembled and annotated data. This database is to our knowledge the largest de novo assembled and annotated transcriptome resource available for any hemimetabolous insect. We therefore anticipate that these data will contribute significantly to more effective and higher-throughput deployment of molecular analysis tools in

  14. Characterization and analysis of a de novo transcriptome from the pygmy grasshopper Tetrix japonica.

    Science.gov (United States)

    Qiu, Zhongying; Liu, Fei; Lu, Huimeng; Huang, Yuan

    2017-05-01

    The pygmy grasshopper Tetrix japonica is a common insect distributed throughout the world, and it has the potential for use in studies of body colour polymorphism, genomics and the biology of Tetrigoidea (Insecta: Orthoptera). However, limited biological information is available for this insect. Here, we conducted a de novo transcriptome study of adult and larval T. japonica to provide a better understanding of its gene expression and develop genomic resources for future work. We sequenced and explored the characteristics of the de novo transcriptome of T. japonica using the Illumina HiSeq 2000 platform. A total of 107 608 206 paired-end clean reads were assembled into 61 141 unigenes using the Trinity software; the mean unigene size was 771 bp, and the N50 length was 1238 bp. A total of 29 225 unigenes were functionally annotated to the NCBI nonredundant protein sequences (Nr), NCBI nonredundant nucleotide sequences (Nt), a manually annotated and reviewed protein sequence database (Swiss-Prot), Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. A large number of putative genes that are potentially involved in pigment pathways, juvenile hormone (JH) metabolism and signalling pathways were identified in the T. japonica transcriptome. Additionally, 165 769 and 156 796 putative single nucleotide polymorphisms occurred in the adult and larval transcriptomes, respectively, and a total of 3162 simple sequence repeats were detected in this assembly. These comprehensive transcriptomic data for T. japonica will provide a usable resource for gene predictions, signalling pathway investigations and molecular marker development for this species and other pygmy grasshoppers. © 2016 John Wiley & Sons Ltd.

  15. Data set for transcriptome analysis of the Chinese giant salamander (Andrias davidianus)

    Directory of Open Access Journals (Sweden)

    Xuemei Jiang

    2016-03-01

    The Chinese giant salamander (Andrias davidianus) occupies a notable position in phylogeny and the process of species evolution, which makes it an invaluable model for genetics; however, genetic information and gene sequences for the Chinese giant salamander in public databases are scanty. Hence, we aimed to perform transcriptome analysis with the help of high-throughput sequencing. In this data set, 61,317,940 raw reads were acquired from Chinese giant salamander mRNA using the Illumina paired-end sequencing platform. After de novo assembly, a total of 72,072 unigenes were obtained, of which 33,834 (46.95%) and 29,479 (40.91%) transcripts exhibited homology to sequences in the Nr database and Swiss-Prot database (E-value < 10^-5), respectively. Among the obtained unigenes, 18,019 (25%) transcripts were assigned at least one Gene Ontology term, of which 1218 (6.8%) transcripts were assigned to immune system processes. In addition, a total of 17,572 assembled sequences were assigned to 241 predicted KEGG metabolic pathways. Among these, 2552 (14.5%) transcripts were assigned to immune system relevant pathways and 5 transcripts were identified as potential antimicrobial peptides (AMPs). Keywords: Andrias davidianus, Transcriptome

  16. Next-generation transcriptome assembly

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Jeffrey A.; Wang, Zhong

    2011-09-01

    Transcriptomics studies often rely on partial reference transcriptomes that fail to capture the full catalog of transcripts and their variations. Recent advances in sequencing technologies and assembly algorithms have facilitated the reconstruction of the entire transcriptome by deep RNA sequencing (RNA-seq), even without a reference genome. However, transcriptome assembly from billions of RNA-seq reads, which are often very short, poses a significant informatics challenge. This Review summarizes the recent developments in transcriptome assembly approaches (reference-based, de novo and combined strategies), along with some perspectives on transcriptome assembly in the near future.

  17. De novo transcriptome assembly of Setatria italica variety Taejin

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2016-06-01

    Foxtail millet (Setaria italica), belonging to the family Poaceae, is an important millet that is widely cultivated in East Asia. Of the cultivated millets, the foxtail millet has the longest history and is one of the main food crops in South India and China. Moreover, foxtail millet is a model plant system for biofuel generation utilizing the C4 photosynthetic pathway. In this study, we carried out de novo transcriptome assembly for the foxtail millet variety Taejin collected from Korea using next-generation sequencing. We obtained a total of 8.676 GB raw data by paired-end sequencing. The raw data from this study are available in the NCBI SRA database under accession number SRR3406552. The Trinity program was used to de novo assemble 145,332 transcripts. Using the TransDecoder program, we predicted 82,925 putative proteins. BLASTP was performed against the Swiss-Prot protein sequence database to annotate the functions of identified proteins, resulting in 20,555 potentially novel proteins. Taken together, this study provides transcriptome data for the foxtail millet variety Taejin by RNA-Seq.

  18. The Transcriptome Analysis and Comparison Explorer--T-ACE: a platform-independent, graphical tool to process large RNAseq datasets of non-model organisms.

    Science.gov (United States)

    Philipp, E E R; Kraemer, L; Mountfort, D; Schilhabel, M; Schreiber, S; Rosenstiel, P

    2012-03-15

    Next generation sequencing (NGS) technologies allow a rapid and cost-effective compilation of large RNA sequence datasets in model and non-model organisms. However, the storage and analysis of transcriptome information from different NGS platforms is still a significant bottleneck, leading to a delay in data dissemination and subsequent biological understanding. Especially database interfaces with transcriptome analysis modules going beyond mere read counts are missing. Here, we present the Transcriptome Analysis and Comparison Explorer (T-ACE), a tool designed for the organization and analysis of large sequence datasets, and especially suited for transcriptome projects of non-model organisms with little or no a priori sequence information. T-ACE offers a TCL-based interface, which accesses a PostgreSQL database via a php-script. Within T-ACE, information belonging to single sequences or contigs, such as annotation or read coverage, is linked to the respective sequence and immediately accessible. Sequences and assigned information can be searched via keyword- or BLAST-search. Additionally, T-ACE provides within and between transcriptome analysis modules on the level of expression, GO terms, KEGG pathways and protein domains. Results are visualized and can be easily exported for external analysis. We developed T-ACE for laboratory environments, which have only a limited amount of bioinformatics support, and for collaborative projects in which different partners work on the same dataset from different locations or platforms (Windows/Linux/MacOS). For laboratories with some experience in bioinformatics and programming, the low complexity of the database structure and open-source code provides a framework that can be customized according to the different needs of the user and transcriptome project.

  19. De novo assembly, gene annotation and marker development using Illumina paired-end transcriptome sequences in celery (Apium graveolens L.).

    Directory of Open Access Journals (Sweden)

    Nan Fu

    BACKGROUND: Celery is an increasingly popular vegetable species, but limited transcriptome and genomic data hinder research on it. In addition, a lack of celery molecular markers limits the process of molecular genetic breeding. High-throughput transcriptome sequencing is an efficient method to generate a large transcriptome sequence dataset for gene discovery, molecular marker development and marker-assisted selection breeding. PRINCIPAL FINDINGS: Celery transcriptomes from four tissues were sequenced using Illumina paired-end sequencing technology. De novo assembly was performed to generate a collection of 42,280 unigenes (average length of 502.6 bp) that represent the first transcriptome of the species. 78.43% and 48.93% of the unigenes had significant similarity with proteins in the National Center for Biotechnology Information (NCBI) non-redundant protein database (Nr) and the Swiss-Prot database, respectively, and 10,473 (24.77%) unigenes were assigned to Clusters of Orthologous Groups (COG). 21,126 (49.97%) unigenes harboring InterPro domains were annotated, of which 15,409 (36.45%) were assigned to Gene Ontology (GO) categories. Additionally, 7,478 unigenes were mapped onto 228 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG). Large numbers of simple sequence repeats (SSRs) were identified, and the rates of successful amplification and polymorphism were then investigated among 31 celery accessions. CONCLUSIONS: This study demonstrates the feasibility of generating a large scale of sequence information by Illumina paired-end sequencing and efficient assembling. Our results provide a valuable resource for celery research. The developed molecular markers are the foundation of further genetic linkage analysis and gene localization, and they will be essential to accelerate the process of breeding.
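The annotation percentages quoted in the celery abstract can be re-derived from the reported counts. The short Python sketch below is only an arithmetic check, not part of the study's pipeline; the dictionary keys and the helper name `percent_of_total` are illustrative choices. It assumes every percentage is taken relative to the 42,280 assembled unigenes, which is what the reported figures imply (including the GO figure, which is a share of all unigenes rather than of the InterPro-annotated subset).

```python
# Counts quoted in the abstract; the names below are illustrative only.
TOTAL_UNIGENES = 42_280

annotated = {
    "COG": 10_473,       # unigenes assigned to Clusters of Orthologous Groups
    "InterPro": 21_126,  # unigenes harboring InterPro domains
    "GO": 15_409,        # unigenes assigned to Gene Ontology categories
}


def percent_of_total(count: int, total: int = TOTAL_UNIGENES) -> float:
    """Return `count` as a percentage of `total`, rounded to two decimals."""
    return round(100.0 * count / total, 2)


percentages = {name: percent_of_total(n) for name, n in annotated.items()}

for name, pct in percentages.items():
    print(f"{name}: {pct}% of {TOTAL_UNIGENES} unigenes")
```

Under that assumption the computed values match the 24.77%, 49.97% and 36.45% given in the abstract.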

  20. De novo assembly of the perennial ryegrass transcriptome using an RNA-Seq strategy.

    Directory of Open Access Journals (Sweden)

    Jacqueline D Farrell

    Perennial ryegrass is a highly heterozygous outbreeding grass species used for turf and forage production. Heterozygosity can affect de Bruijn graph assembly, making de novo transcriptome assembly of species such as perennial ryegrass challenging. Creating a reference transcriptome from a homozygous perennial ryegrass genotype can circumvent the challenge of heterozygosity. The goals of this study were to perform RNA-sequencing on multiple tissues from a highly inbred genotype to develop a reference transcriptome. This was complemented with RNA-sequencing of a highly heterozygous genotype for SNP calling. De novo transcriptome assembly of the inbred genotype created 185,833 transcripts with an average length of 830 base pairs. Within the inbred reference transcriptome 78,560 predicted open reading frames were found, of which 24,434 were predicted as complete. Functional annotation found 50,890 transcripts with a BLASTp hit from the Swiss-Prot non-redundant database, 58,941 transcripts with a Pfam protein domain and 1,151 transcripts encoding putative secreted peptides. To evaluate the reference transcriptome we targeted the high-affinity K+ transporter gene family and found multiple orthologs. Using the longest unique open reading frames as the reference sequence, 64,242 single nucleotide polymorphisms were found. One thousand sixty one open reading frames from the inbred genotype contained heterozygous sites, confirming the high degree of homozygosity. Our study has developed an annotated, comprehensive transcriptome reference for perennial ryegrass that can aid in determining genetic variation, expression analysis, genome annotation, and gene mapping.

  1. hSAGEing: an improved SAGE-based software for identification of human tissue-specific or common tumor markers and suppressors.

    Directory of Open Access Journals (Sweden)

    Cheng-Hong Yang

    BACKGROUND: SAGE (serial analysis of gene expression) is a powerful method of analyzing gene expression for the entire transcriptome. There are currently many well-developed SAGE tools. However, the cross-comparison of different tissues is seldom addressed, thus limiting the identification of common and tissue-specific tumor markers. METHODOLOGY/PRINCIPAL FINDINGS: To improve the SAGE mining methods, we propose a novel function for cross-tissue comparison of SAGE data by combining mathematical set theory and logic with a unique "multi-pool method" that analyzes multiple pools of pair-wise case controls individually. When all the settings are in "inclusion", the common SAGE tag sequences are mined. When one tissue type is in "inclusion" and the other types of tissues are not in "inclusion", the selected tissue-specific SAGE tag sequences are generated. They are displayed in tags-per-million (TPM) and fold values, as well as visually displayed in four kinds of scales in a color gradient pattern. In the fold visualization display, the top scores of the SAGE tag sequences are provided, along with cluster plots. A user-defined matrix file is designed for cross-tissue comparison by selecting libraries from publicly available databases or user-defined libraries. CONCLUSIONS/SIGNIFICANCE: The hSAGEing tool provides a combination of friendly cross-tissue analysis and an interface for comparing SAGE libraries for the first time. Some up- or down-regulated genes with tissue-specific or common tumor markers and suppressors are identified computationally. The tool is useful and convenient for in silico cancer transcriptomic studies and is freely available at http://bio.kuas.edu.tw/hSAGEing.
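The inclusion logic described in the hSAGEing abstract maps naturally onto set operations. The Python sketch below is a minimal illustration of that idea and is not the hSAGEing implementation: the function names, the tissue names and the toy tag sequences are all hypothetical, and it assumes each tissue's set already holds the tags that passed that tissue's own case-control filter. Common tags are taken as the intersection across all included tissues, tissue-specific tags as the set difference, and TPM as a count scaled to one million tags.

```python
from typing import Dict, Set


def common_tags(pools: Dict[str, Set[str]]) -> Set[str]:
    """Tags found in every tissue set to 'inclusion' (set intersection)."""
    tag_sets = list(pools.values())
    return set.intersection(*tag_sets) if tag_sets else set()


def tissue_specific_tags(tissue: str, pools: Dict[str, Set[str]]) -> Set[str]:
    """Tags in the chosen tissue that appear in none of the other tissues."""
    others: Set[str] = set().union(
        *(tags for name, tags in pools.items() if name != tissue)
    )
    return pools[tissue] - others


def tags_per_million(count: int, library_total: int) -> float:
    """Scale a raw tag count to tags-per-million (TPM)."""
    return count * 1_000_000 / library_total


# Hypothetical toy data: tag sequences that passed each tissue's filter.
pools = {
    "breast": {"AAAAAAAAAA", "CCCCCCCCCC", "GGGGGGGGGG"},
    "colon": {"AAAAAAAAAA", "CCCCCCCCCC", "TTTTTTTTTT"},
    "lung": {"AAAAAAAAAA", "ACGTACGTAC"},
}

shared = common_tags(pools)
breast_only = tissue_specific_tags("breast", pools)
tpm = tags_per_million(25, 50_000)

print("common:", sorted(shared))
print("breast-specific:", sorted(breast_only))
print("TPM:", tpm)
```

Plain sets keep the sketch close to the set-theory wording of the abstract; a real tool would also need to carry counts and fold values alongside each tag.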

  2. Fine-structure resolved rotational transitions and database for CN+H2 collisions

    Science.gov (United States)

    Burton, Hannah; Mysliwiec, Ryan; Forrey, Robert C.; Yang, B. H.; Stancil, P. C.; Balakrishnan, N.

    2018-06-01

    Cross sections and rate coefficients for CN+H2 collisions are calculated using the coupled states (CS) approximation. The calculations are benchmarked against more accurate close-coupling (CC) calculations for transitions between low-lying rotational states. Comparisons are made between the two formulations for collision energies greater than 10 cm⁻¹. The CS approximation is used to construct a database which includes highly excited rotational states that are beyond the practical limitations of the CC method. The database includes fine-structure resolved rotational quenching transitions for v = 0 and j ≤ 40, where v and j are the vibrational and rotational quantum numbers of the initial state of the CN molecule. Rate coefficients are computed for both para-H2 and ortho-H2 colliders. The results are shown to be in good agreement with previous calculations; however, the rates are substantially different from mass-scaled CN+He rates that are often used in astrophysical models.

  3. Haemophilus ducreyi Seeks Alternative Carbon Sources and Adapts to Nutrient Stress and Anaerobiosis during Experimental Infection of Human Volunteers.

    Science.gov (United States)

    Gangaiah, Dharanesh; Zhang, Xinjun; Baker, Beth; Fortney, Kate R; Gao, Hongyu; Holley, Concerta L; Munson, Robert S; Liu, Yunlong; Spinola, Stanley M

    2016-05-01

    Haemophilus ducreyi causes the sexually transmitted disease chancroid in adults and cutaneous ulcers in children. In humans, H. ducreyi resides in an abscess and must adapt to a variety of stresses. Previous studies (D. Gangaiah, M. Labandeira-Rey, X. Zhang, K. R. Fortney, S. Ellinger, B. Zwickl, B. Baker, Y. Liu, D. M. Janowicz, B. P. Katz, C. A. Brautigam, R. S. Munson, Jr., E. J. Hansen, and S. M. Spinola, mBio 5:e01081-13, 2014, http://dx.doi.org/10.1128/mBio.01081-13) suggested that H. ducreyi encounters growth conditions in human lesions resembling those found in stationary phase. However, how H. ducreyi transcriptionally responds to stress during human infection is unknown. Here, we determined the H. ducreyi transcriptome in biopsy specimens of human lesions and compared it to the transcriptomes of bacteria grown to mid-log, transition, and stationary phases. Multidimensional scaling showed that the in vivo transcriptome is distinct from those of in vitro growth. Compared to the inoculum (mid-log-phase bacteria), H. ducreyi harvested from pustules differentially expressed ∼93 genes, of which 62 were upregulated. The upregulated genes encode homologs of proteins involved in nutrient transport, alternative carbon pathways (l-ascorbate utilization and metabolism), growth arrest response, heat shock response, DNA recombination, and anaerobiosis. H. ducreyi upregulated few genes (hgbA, flp-tad, and lspB-lspA2) encoding virulence determinants required for human infection. Most genes regulated by CpxRA, RpoE, Hfq, (p)ppGpp, and DksA, which control the expression of virulence determinants and adaptation to a variety of stresses, were not differentially expressed in vivo, suggesting that these systems are cycling on and off during infection. Taken together, these data suggest that the in vivo transcriptome is distinct from those of in vitro growth and that adaptation to nutrient stress and anaerobiosis is crucial for H. ducreyi survival in humans. Copyright © 2016

  4. Comparative transcriptome reconstruction of four Hypericum species focused on hypericin biosynthesis

    Directory of Open Access Journals (Sweden)

    Miroslav Soták

    2016-07-01

    Next generation sequencing technology (NGS) rapidly developed research applications in the field of plant functional genomics. Several Hypericum spp. were analyzed with the aim of generating and enhancing gene annotations, especially for genes coding the enzymes supposedly included in biosynthesis of valuable bioactive compounds. The first de novo transcriptome profiling of H. annulatum Moris, H. tomentosum L., H. kalmianum L. and H. androsaemum L. leaves cultivated in vitro was accomplished. All four species with only limited genomic information were selected on the basis of differences in ability to synthesize hypericins and presence of dark nodules accumulating these metabolites, with the purpose of enriching the genomic background of Hypericum spp. H. annulatum was chosen because of the high number of dark nodules and high content of hypericin. H. tomentosum leaves are typical for the presence of only 1-2 dark nodules localized in the apical part. Both H. kalmianum and H. androsaemum lack hypericin and have no dark nodules. Four separate datasets of paired-end reads were gathered and used for de novo assembly by the Trinity program. Assembled transcriptomes were annotated to the public databases Swiss-Prot and the non-redundant protein database (NCBI-nr). Gene ontology analysis was performed. Differences of expression levels in the marginal tissues with dark nodules and the inner part of leaves lacking these nodules indicate a potential genetic background for hypericin formation, as the presumed site of hypericin biosynthesis is in the cells adjacent to these structures. Altogether 165 contigs in H. annulatum and 100 contigs in H. tomentosum were detected as significantly differentially expressed (P<0.05) and upregulated in the leaf rim tissues containing the dark nodules. New sequences homologous to octaketide synthase and enzymes catalyzing phenolic oxidative coupling reactions indispensable for hypericin biosynthesis were discovered. The presented transcriptomic sequence data will

  5. Fiscal 1998 research report. Construction model project of the human sensory database; 1998 nendo ningen kankaku database kochiku model jigyo seika hokokusho

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2000-03-01

    This report summarizes the fiscal 1998 research results on construction of the human sensory database. The human sensory database for evaluating working environments was constructed on the basis of measurements of human sensory data (stress and fatigue) from 400 examinees in the field (transport field, control room and office) and in a laboratory. Using the newly developed standard measurement protocol for evaluating summer clothing (shirt, slacks and underwear), a database was constructed from the results of evaluation experiments and comparative experiments on human physiological and sensory data of aged and young people. The database features easy retrieval of various related information according to task requirements and purposes of use. To evaluate large volumes of data with large time variation according to the purpose of use in each scene, a data detection support technique was adopted that pays attention to physical and psychological variable phases and to mind and body events. The meaning of the reaction and a hint for necessary measures are shown for every phase and event. (NEDO)

  6. Consolidated Human Activity Database (CHAD) for use in human exposure and health studies and predictive models

    Science.gov (United States)

    EPA scientists have compiled detailed data on human behavior from 22 separate exposure and time-use studies into CHAD. The database includes more than 54,000 individual study days of detailed human behavior.

  7. Transcriptomic analysis of human polarized macrophages: more than one role of alternative activation?

    Directory of Open Access Journals (Sweden)

    Eleonora Derlindati

    Macrophages are a heterogeneous cell population which in response to the cytokine milieu polarize in either classically activated macrophages (M1) or alternatively activated macrophages (M2). This plasticity makes macrophages essential in regulating inflammation, immune response and tissue remodeling and a novel therapeutic target in inflammatory diseases such as atherosclerosis. The aim of the study was to describe the transcriptomic profiles of differently polarized human macrophages to generate new hypotheses on the biological function of the different macrophage subtypes. Polarization of circulating monocytes/macrophages of blood donors was induced in vitro by IFN-γ and LPS (M1), by IL-4 (M2a), and by IL-10 (M2c). Unstimulated cells (RM) served as time controls. Gene expression profile of M1, M2a, M2c and RM was assessed at 6, 12 and 24h after polarization with Whole Human Genome Agilent Microarray technique. When compared to RM, M1 significantly upregulated pathways involved in immunity and inflammation, whereas M2a did the opposite. Conversely, decreased and increased expression of mitochondrial metabolism, consistent with insulin resistant and insulin sensitive patterns, was seen in M1 and M2a, respectively. The time sequence in the expression of some pathways appeared to have some specific bearing on M1 function. Finally, canonical and non-canonical Wnt genes and gene groups, promoting inflammation and tissue remodeling, were upregulated in M2a compared to RM. Our data in in vitro polarized human macrophages: 1. confirm and extend known inflammatory and anti-inflammatory gene expression patterns; 2. demonstrate changes in mitochondrial metabolism associated to insulin resistance and insulin sensitivity in M1 and M2a, respectively; 3. highlight the potential relevance of gene expression timing in M1 function; 4. unveil enhanced expression of Wnt pathways in M2a suggesting a potential dual (pro-inflammatory and anti-inflammatory) role of M2a in

  8. Evaluation of de novo assembly technique in the South African abalone Haliotis midae transcriptome: A comparison from Illumina and 454 systems

    Directory of Open Access Journals (Sweden)

    Barbara Picone

    2016-12-01

    Next generation sequencing platforms have recently been used to rapidly characterize transcriptome sequences from a number of non-model organisms. The present study compares two of the most frequently used platforms, the Roche 454-pyrosequencing and the Illumina sequencing-by-synthesis (SBS), on the same RNA sample obtained from an intertidal gastropod mollusc species, Haliotis midae. All the sequencing reads were deposited in the Short Read Archive (SRA) database of NCBI and are retrievable under the accession numbers [SRR071314 (Illumina Genome Analyzer II)] and [SRR1737738, SRR1737737, SRR1737735, SRR1737734 (454 GS FLX)]. Three transcriptomes, composed of either pure 454 or Illumina reads or a mixture of read types (Hybrid), were assembled using CLC Genomics Workbench software. Illumina assemblies performed the best de novo transcriptome characterization in terms of contig length, whereas the 454 assemblies tended to improve the complete assembly of gene transcripts. Both the Hybrid and Illumina assemblies produced longer contigs covering more of the transcriptome than 454 assemblies. However, the addition of 454 reads significantly increased the number of genes annotated.

  9. De novo Assembly of Leaf Transcriptome in the Medicinal Plant Andrographis paniculata

    Science.gov (United States)

    Cherukupalli, Neeraja; Divate, Mayur; Mittapelli, Suresh R.; Khareedu, Venkateswara R.; Vudem, Dashavantha R.

    2016-01-01

    Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information for this plant is available in public databases. This study mainly deals with the sequencing of RNA from A. paniculata leaf using the Illumina HiSeq™ 2000 platform followed by de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 170,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on the similarity searches against the plant non-redundant protein database, gene ontology, and eukaryotic orthologous groups, 49,363 transcripts were annotated, constituting up to 58.91% of the identified unigenes. Annotation of transcripts using the Kyoto Encyclopedia of Genes and Genomes database revealed 5606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6767 unique transcripts belonging to 97 different transcription factor families. A total of 124 CYP450 transcripts belonging to seven divergent clans have been identified. The transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids, of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. Assembled sequences of the transcriptome of A. paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analysis besides identification of key enzymes involved in the various pathways of secondary metabolism. PMID:27582746

  10. Sugarcane giant borer transcriptome analysis and identification of genes related to digestion.

    Science.gov (United States)

    Fonseca, Fernando Campos de Assis; Firmino, Alexandre Augusto Pereira; de Macedo, Leonardo Lima Pepino; Coelho, Roberta Ramos; de Souza Júnior, José Dijair Antonino; de Sousa Júnior, José Dijair Antonino; Silva-Junior, Orzenil Bonfim; Togawa, Roberto Coiti; Pappas, Georgios Joannis; de Góis, Luiz Avelar Brandão; da Silva, Maria Cristina Mattar; Grossi-de-Sá, Maria Fátima

    2015-01-01

    Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus), a lepidopteran that has a long life cycle and against which pesticide-based control efforts have been inefficient. Despite its economic relevance, only a few DNA sequences are available for this species in GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality score and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number, compared to the B. mori genome. From the sugarcane giant borer (SGB) transcriptome, a number of aminopeptidase N (APN) cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect's biology and to guide the development of new strategies for insect-pest control.

  11. Sugarcane giant borer transcriptome analysis and identification of genes related to digestion.

    Directory of Open Access Journals (Sweden)

    Fernando Campos de Assis Fonseca

    Sugarcane is a widely cultivated plant that serves primarily as a source of sugar and ethanol. Its annual yield can be significantly reduced by the action of several insect pests including the sugarcane giant borer (Telchin licus licus), a lepidopteran that has a long life cycle and against which pesticide-based control efforts have been inefficient. Despite its economic relevance, only a few DNA sequences are available for this species in GenBank. Pyrosequencing technology was used to investigate the transcriptome of several developmental stages of the insect. To maximize transcript diversity, a pool of total RNA was extracted from whole body insects and used to construct a normalized cDNA database. Sequencing produced over 650,000 reads, which were de novo assembled to generate a reference library of 23,824 contigs. After quality score and annotation, 43% of the contigs had at least one BLAST hit against the NCBI non-redundant database, and 40% showed similarities with the lepidopteran Bombyx mori. In a further analysis, we conducted a comparison with Manduca sexta midgut sequences to identify transcripts of genes involved in digestion. Of these transcripts, many presented an expansion or depletion in gene number, compared to the B. mori genome. From the sugarcane giant borer (SGB) transcriptome, a number of aminopeptidase N (APN) cDNAs were characterized based on homology to those reported as Cry toxin receptors. This is the first report that provides a large-scale EST database for the species. Transcriptome analysis will certainly be useful to identify novel developmental genes, to better understand the insect's biology and to guide the development of new strategies for insect-pest control.

  12. Additive Effects of Millimeter Waves and 2-Deoxyglucose Co-Exposure on the Human Keratinocyte Transcriptome.

    Directory of Open Access Journals (Sweden)

    Yonis Soubere Mahamoud

    Millimeter Waves (MMW) will be used in the next generation of high-speed wireless technologies, especially in future Ultra-Broadband small cells in 5G cellular networks. Therefore, their biocompatibility must be evaluated prior to their massive deployment. Using a microarray-based approach, we analyzed modifications to the whole genome of a human keratinocyte model that was exposed at 60.4 GHz-MMW at an incident power density (IPD) of 20 mW/cm² for 3 hours in athermic conditions. No keratinocyte transcriptome modifications were observed. We tested the effects of MMWs on cell metabolism by co-treating MMW-exposed cells with a glycolysis inhibitor, 2-deoxyglucose (2dG, 20 mM for 3 hours), and whole genome expression was evaluated along with the ATP content. We found that the 2dG treatment decreased the cellular ATP content and induced a high modification in the transcriptome (632 coding genes). The affected genes were associated with transcriptional repression, cellular communication and endoplasmic reticulum homeostasis. The MMW/2dG co-treatment did not alter the keratinocyte ATP content, but it did slightly alter the transcriptome, which reflected the capacity of MMW to interfere with the bioenergetic stress response. The RT-PCR-based validation confirmed 6 MMW-sensitive genes (SOCS3, SPRY2, TRIB1, FAM46A, CSRNP1 and PPP1R15A) during the 2dG treatment. These 6 genes encoded transcription factors or inhibitors of cytokine pathways, which raised questions regarding the potential impact of long-term or chronic MMW exposure on metabolically stressed cells.

  13. A host transcriptional signature for presymptomatic detection of infection in humans exposed to influenza H1N1 or H3N2.

    Directory of Open Access Journals (Sweden)

    Christopher W Woods

    There is great potential for host-based gene expression analysis to impact the early diagnosis of infectious diseases. In particular, the influenza pandemic of 2009 highlighted the challenges and limitations of traditional pathogen-based testing for suspected upper respiratory viral infection. We inoculated human volunteers with either influenza A (A/Brisbane/59/2007 (H1N1) or A/Wisconsin/67/2005 (H3N2)), and assayed the peripheral blood transcriptome every 8 hours for 7 days. Of 41 inoculated volunteers, 18 (44%) developed symptomatic infection. Using unbiased sparse latent factor regression analysis, we generated a gene signature (or factor) for symptomatic influenza capable of detecting 94% of infected cases. This gene signature is detectable as early as 29 hours post-exposure and achieves maximal accuracy on average 43 hours (p = 0.003, H1N1) and 38 hours (p-value = 0.005, H3N2) before peak clinical symptoms. In order to test the relevance of these findings in naturally acquired disease, a composite influenza A signature built from these challenge studies was applied to Emergency Department patients, where it discriminates between swine-origin influenza A/H1N1 (2009) infected and non-infected individuals with 92% accuracy. The host genomic response to influenza infection is robust and may provide the means for detection before typical clinical symptoms are apparent.

  14. A comparative transcriptomic analysis of astrocytes differentiation from human neural progenitor cells.

    Science.gov (United States)

    Magistri, Marco; Khoury, Nathalie; Mazza, Emilia Maria Cristina; Velmeshev, Dmitry; Lee, Jae K; Bicciato, Silvio; Tsoulfas, Pantelis; Faghihi, Mohammad Ali

    2016-11-01

    Astrocytes are a morphologically and functionally heterogeneous population of cells that play critical roles in neurodevelopment and in the regulation of central nervous system homeostasis. Studies of human astrocytes have been hampered by the lack of specific molecular markers and by the difficulties associated with purifying and culturing astrocytes from adult human brains. Human neural progenitor cells (NPCs) with self-renewal and multipotent properties represent an appealing model system to gain insight into the developmental genetics and function of human astrocytes, but a comprehensive molecular characterization that confirms the validity of this cellular system is still missing. Here we used an unbiased transcriptomic analysis to characterize in vitro culture of human NPCs and to define the gene expression programs activated during the differentiation of these cells into astrocytes using FBS or the combination of CNTF and BMP4. Our results demonstrate that in vitro cultures of human NPCs isolated during the gliogenic phase of neurodevelopment mainly consist of radial glial cells (RGCs) and glia-restricted progenitor cells. In these cells the combination of CNTF and BMP4 activates the JAK/STAT and SMAD signaling cascades, leading to the inhibition of oligodendrocytes lineage commitment and activation of astrocytes differentiation. On the other hand, FBS-derived astrocytes have properties of reactive astrocytes. Our work suggests that in vitro culture of human NPCs represents a valuable cellular system to study human disorders characterized by impairment of astrocytes development and function. Our datasets represent an important resource for researchers studying human astrocytes development and might set the basis for the discovery of novel human-specific astrocyte markers. © 2016 The Authors. European Journal of Neuroscience published by Federation of European Neuroscience Societies and John Wiley & Sons Ltd.

  15. Meningococcal factor H-binding protein vaccines with decreased binding to human complement factor H have enhanced immunogenicity in human factor H transgenic mice.

    Science.gov (United States)

    Rossi, Raffaella; Granoff, Dan M; Beernink, Peter T

    2013-11-04

    Factor H-binding protein (fHbp) is a component of a meningococcal vaccine recently licensed in Europe for prevention of serogroup B disease, and a second vaccine in clinical development. The protein specifically binds human factor H (fH), which down-regulates complement activation and enhances resistance to bactericidal activity. There are conflicting data from studies in human fH transgenic mice on whether binding of human fH to fHbp vaccines decreases immunogenicity, and whether mutant fHbp vaccines with decreased fH binding have enhanced immunogenicity. fHbp can be classified into two sub-families based on sequence divergence and immunologic cross-reactivity. Previous studies of mutant fHbp vaccines with low fH binding were from sub-family B, which account for approximately 60% of serogroup B case isolates. In the present study, we evaluated the immunogenicity of two mutant sub-family A fHbp vaccines containing single substitutions, T221A or D211A, which resulted in 15- or 30-fold lower affinity for human fH, respectively, than the corresponding control wild-type fHbp vaccine. In transgenic mice with high serum concentrations of human fH, both mutant vaccines elicited significantly higher IgG titers and higher serum bactericidal antibody responses than the control fHbp vaccine that bound human fH. Thus, mutations introduced into a sub-family A fHbp antigen to decrease fH binding can increase protective antibody responses in human fH transgenic mice. Collectively the data suggest that mutant fHbp antigens with decreased fH binding will result in superior vaccines in humans. Copyright © 2013 Elsevier Ltd. All rights reserved.

  16. De Novo Assembly of the Donkey White Blood Cell Transcriptome and a Comparative Analysis of Phenotype-Associated Genes between Donkeys and Horses.

    Science.gov (United States)

    Xie, Feng-Yun; Feng, Yu-Long; Wang, Hong-Hui; Ma, Yun-Feng; Yang, Yang; Wang, Yin-Chao; Shen, Wei; Pan, Qing-Jie; Yin, Shen; Sun, Yu-Jiang; Ma, Jun-Yu

    2015-01-01

    Prior to the mechanization of agriculture and labor-intensive tasks, humans used donkeys (Equus africanus asinus) for farm work and packing. However, as mechanization increased, donkeys have been increasingly raised for meat, milk, and fur in China. To maintain the development of the donkey industry, breeding programs should focus on traits related to these new uses. Compared to conventional marker-assisted breeding plans, genome- and transcriptome-based selection methods are more efficient and effective. To analyze the coding genes of the donkey genome, we assembled the transcriptome of donkey white blood cells de novo. Using transcriptomic deep-sequencing data, we identified 264,714 distinct donkey unigenes and predicted 38,949 protein fragments. We annotated the donkey unigenes by BLAST searches against the non-redundant (NR) protein database. We also compared the donkey protein sequences with those of the horse (E. caballus) and wild horse (E. przewalskii), and linked the donkey protein fragments with mammalian phenotypes. As the outer ear sizes of donkeys and horses are obviously different, we compared the outer ear size-associated proteins in donkeys and horses. We identified three ear size-associated proteins, HIC1, PRKRA, and KMT2A, with sequence differences among the donkey, horse, and wild horse loci. Since the donkey genome sequence has not been released, the de novo assembled donkey transcriptome is helpful for preliminary investigations of donkey cultivars and for genetic improvement.

  17. Transcriptomic profiling of primary alveolar epithelial cell differentiation in human and rat

    Directory of Open Access Journals (Sweden)

    Crystal N. Marconett

    2014-12-01

    Cell-type specific gene regulation is a key to gaining a full understanding of how the distinct phenotypes of differentiated cells are achieved and maintained. Here we examined how changes in transcriptional activation during alveolar epithelial cell (AEC) differentiation determine phenotype. We performed transcriptomic profiling using in vitro differentiation of human and rat primary AEC. This model recapitulates in vitro an in vivo process in which AEC transition from alveolar type 2 (AT2) cells to alveolar type 1 (AT1) cells during normal maintenance and regeneration following lung injury. Here we describe in detail the quality control, preprocessing, and normalization of microarray data presented within the associated study (Marconett et al., 2013). We also include R code for reproducibility of the referenced data and easily accessible processed data tables.

  18. Mapping of the Co-Transcriptomes of UPEC-Infected Macrophages Reveals New Insights into the Molecular Basis of Host-Pathogen Interactions in Human and Mouse

    KAUST Repository

    Mavromatis, Charalampos Harris

    2014-01-01

    Urinary tract infections (UTI) are among the most common infections in humans. Uropathogenic Escherichia coli (UPEC), the main causative agent of UTIs, can invade and replicate within bladder epithelial cells, and recent evidence demonstrated that some UPEC strains also survive within macrophages. To understand the mechanisms of host subversion that enable UPEC to survive within macrophages, and the contribution of macrophages to UPEC-mediated pathology, I performed host-pathogen co-transcriptome analyses using RNA sequencing. I developed an effective computational framework that simultaneously separated, annotated, and quantified the mammalian and bacterial transcriptomes. First, mouse bone marrow-derived macrophages (BMM) were challenged over a 24 h time course with UPEC reference strains, UTI89 (cystitis strain), 83972 and VR50 (asymptomatic bacteriuria strains) that possess contrasting intramacrophage phenotypes. My results showed that BMM responded to the three different UPEC strains with broadly similar gene expression programs. In contrast to the conserved pattern of BMM responses, the transcriptional responses of the different UPEC strains diverged markedly from each other. Hypothesizing that genes upregulated at 24 h post-infection may contribute to intramacrophage survival, I identified UTI89 genes upregulated at this time point, and showed that deletion of one of these genes (pspA) compromised intramacrophage survival of UPEC strain UTI89. Second, human monocyte-derived macrophages (HMDM) and BMM were challenged over a 24 h course with the UPEC strain EC958, a globally disseminated, multi-drug resistant strain. My analysis identified extensive divergence in UPEC-regulated orthologous gene expression between HMDM and BMM, and I validated both known and novel genes in the context of differential regulation. On the contrary, the transcriptional response of EC958 showed a broad conservation across both mammalian intramacrophage environments. My study thus

  19. TranscriptomeBrowser 3.0: introducing a new compendium of molecular interactions and a new visualization tool for the study of gene regulatory networks.

    Science.gov (United States)

    Lepoivre, Cyrille; Bergon, Aurélie; Lopez, Fabrice; Perumal, Narayanan B; Nguyen, Catherine; Imbert, Jean; Puthier, Denis

    2012-01-31

    Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various evidences scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users draw new hypotheses without being overwhelmed by the density of the subsequent graph. We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i) predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices), (ii) potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii) regulatory interactions curated from the literature, (iv) predicted post-transcriptional regulation by micro-RNA, (v) protein kinase-substrate interactions and (vi) physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed InteractomeBrowser, a graph-based knowledge browser that comes as a plug-in for TranscriptomeBrowser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration of heterogeneous biological information

  20. Distribution of ADAT-Dependent Codons in the Human Transcriptome

    Directory of Open Access Journals (Sweden)

    Àlbert Rafels-Ybern

    2015-07-01

    Nucleotide modifications in the anticodons of transfer RNAs (tRNA) play a central role in translation efficiency, fidelity, and regulation of translation, but, for most of these modifications, the details of their function remain unknown. The heterodimeric adenosine deaminases acting on tRNAs (ADAT2-ADAT3, or ADAT) are enzymes present in eukaryotes that convert adenine (A) to inosine (I) in the first anticodon base (position 34) by hydrolytic deamination. To explore the influence of ADAT activity on mammalian translation, we have characterized the human transcriptome and proteome in terms of frequency and distribution of ADAT-related codons. Eight different tRNAs can be modified by ADAT and, once modified, these tRNAs will recognize NNC, NNU and NNA codons, but not NNG codons. We find that transcripts coding for proteins highly enriched in these eight amino acids (ADAT-aa) are specifically enriched in NNC, NNU and NNA codons. We also show that the proteins most enriched in ADAT-aa are composed preferentially of threonine, alanine, proline, and serine (TAPS). We propose that the enrichment in ADAT-codons in these proteins is due to the similarities in the codons that correspond to TAPS.
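
    The codon tally described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline. It assumes an in-frame coding sequence and counts only the four-codon boxes for the TAPS amino acids named above (ACN, GCN, CCN, UCN); the other four ADAT amino acids are not named in the abstract and are left out. The function names are hypothetical.

```python
from collections import Counter

# Four-codon boxes for Thr, Ala, Pro and Ser (TAPS), written as DNA.
# Assumption: only these boxes are tallied; serine's AGU/AGC codons and the
# remaining ADAT amino acids are deliberately ignored in this sketch.
TAPS_BOXES = {"AC": "Thr", "GC": "Ala", "CC": "Pro", "TC": "Ser"}


def third_base_counts(cds):
    """Count third-position bases of TAPS-box codons in an in-frame CDS."""
    cds = cds.upper().replace("U", "T")
    counts = Counter()
    # Walk the sequence codon by codon, dropping any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if codon[:2] in TAPS_BOXES:
            counts[codon[2]] += 1
    return counts


def adat_codon_fraction(cds):
    """Fraction of TAPS-box codons that end in C, U or A rather than G."""
    counts = third_base_counts(cds)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return (total - counts["G"]) / total
```

    For the toy sequence ACC GCT CCA TCG ATG, three of the four TAPS-box codons end in C, U or A, so `adat_codon_fraction("ACCGCTCCATCGATG")` gives 0.75; the final ATG falls outside the tallied boxes.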

  1. Sequence comparison of prefrontal cortical brain transcriptome from a tame and an aggressive silver fox (Vulpes vulpes)

    Science.gov (United States)

    2011-01-01

    Background: Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed. Results: cDNA from mRNA from pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (>2.5 million reads & 0.9 Gbase of tame fox sequence; >3.3 million reads & 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using SwissProt and the dog RefSeq databases. An at least 2-fold expression difference between the two samples (p […] fox transcriptome. Conclusions: Transcriptome sequencing significantly expanded genomic resources available for the fox, a species without a sequenced genome. In a very cost efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex; expression differences between the two fox samples; and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information. PMID:21967120

  2. Dataset of the first transcriptome assembly of the tree crop “yerba mate” (Ilex paraguariensis) and systematic characterization of protein coding genes

    Directory of Open Access Journals (Sweden)

    Patricia M. Aguilera

    2018-04-01

    This contribution contains data associated with the research article entitled “Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly” (Debat et al., 2014) [1]. By means of a bioinformatic approach involving extensive NGS data analyses, we provide a resource encompassing the full transcriptome assembly of yerba mate, the first available reference for the Ilex L. genus. This dataset (Supplementary files 1 and 2) consolidates the transcriptome-wide assembled sequences of I. paraguariensis with further comprehensive annotation of the protein coding genes of yerba mate via the integration of Arabidopsis thaliana databases. The generated data is pivotal for the characterization of agronomically relevant genes in the tree crop yerba mate, a non-model species, and related taxa in Ilex. The raw sequencing data dissected here is available at the DDBJ/ENA/GenBank (NCBI Resource Coordinators, 2016) [2] Sequence Read Archive (SRA) under the accession SRP043293 and the assembled sequences have been deposited at the Transcriptome Shotgun Assembly Sequence Database (TSA) under the accession GFHV00000000.

  3. Assessment of pleiotropic transcriptome perturbations in Arabidopsis engineered for indirect insect defence.

    Science.gov (United States)

    Houshyani, Benyamin; van der Krol, Alexander R; Bino, Raoul J; Bouwmeester, Harro J

    2014-06-19

    Molecular characterization is an essential step of risk/safety assessment of genetically modified (GM) crops. Holistic approaches for molecular characterization using omics platforms can be used to confirm the intended impact of the genetic engineering, but can also reveal the unintended changes at the omics level as a first assessment of potential risks. The potential of omics platforms for risk assessment of GM crops has rarely been used for this purpose because of the lack of a consensus reference and statistical methods to judge the significance or importance of the pleiotropic changes in GM plants. Here we propose a meta data analysis approach to the analysis of GM plants, by measuring the transcriptome distance to untransformed wild-types. In the statistical analysis of the transcriptome distance between GM and wild-type plants, values are compared with naturally occurring transcriptome distances in non-GM counterparts obtained from a database. Using this approach we show that the pleiotropic effect of genes involved in indirect insect defence traits is substantially equivalent to the variation in gene expression occurring naturally in Arabidopsis. Transcriptome distance is a useful screening method to obtain insight into the pleiotropic effects of genetic modification.
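
    The comparison outlined in this abstract (a GM-to-wild-type distance judged against naturally occurring distances) might be sketched as follows. The abstract does not state which distance measure or statistic the authors used, so the Euclidean metric, the empirical fraction, the function names and all numbers below are assumptions chosen for illustration only.

```python
import math


def transcriptome_distance(profile_a, profile_b):
    """Euclidean distance between two expression profiles over shared genes.

    Assumption: profiles are dicts of gene -> log-scale expression value.
    """
    shared = profile_a.keys() & profile_b.keys()
    return math.sqrt(sum((profile_a[g] - profile_b[g]) ** 2 for g in shared))


def fraction_at_least(observed, natural_distances):
    """Share of natural (non-GM) distances that are >= the observed distance."""
    if not natural_distances:
        raise ValueError("need at least one reference distance")
    hits = sum(d >= observed for d in natural_distances)
    return hits / len(natural_distances)


# Hypothetical log2 expression values, not taken from the study.
wild_type = {"geneA": 5.0, "geneB": 2.0, "geneC": 7.0}
gm_line = {"geneA": 5.0, "geneB": 5.0, "geneC": 3.0}
observed = transcriptome_distance(wild_type, gm_line)

# Hypothetical distances between non-GM accessions from a reference database.
natural = [2.0, 4.0, 6.0, 8.0]
share = fraction_at_least(observed, natural)
```

    With these toy numbers the observed distance is 5.0 and half of the reference distances are at least that large, which would place the GM line inside the natural range under this simple reading.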

  4. Transcriptome Analysis in Sheepgrass (Leymus chinensis). A Dominant Perennial Grass of the Eurasian Steppe

    Energy Technology Data Exchange (ETDEWEB)

    Chen, Shuangyan [Chinese Academy of Sciences (CAS), Institute of Botany (IB), Beijing; Huang, Xin [Chinese Academy of Sciences (CAS), Institute of Botany (IB), Beijing; Yang, Xiaohan [Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Liu, Gongshe [Chinese Academy of Sciences (CAS), Institute of Botany (IB), Beijing

    2013-07-04

    BACKGROUND: Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. RESULTS: The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. CONCLUSIONS: This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.
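
    The EST-SSR identification mentioned above can be illustrated with a small regular-expression scan. This is a sketch under stated assumptions, not the tool the authors used: the abstract gives no search criteria, so the motif length (2 to 6 bases) and the minimum of five tandem copies are hypothetical thresholds, and `find_ssrs` is a hypothetical name.

```python
import re

# Assumption: a motif of 2-6 bases followed by at least four more copies of
# itself (five tandem copies in total). The lazy quantifier makes the regex
# try the shortest motif first.
SSR_PATTERN = re.compile(r"([ACGT]{2,6}?)\1{4,}")


def find_ssrs(seq):
    """Return (start, motif, repeat_count) for each simple sequence repeat."""
    seq = seq.upper()
    hits = []
    for match in SSR_PATTERN.finditer(seq):
        motif = match.group(1)
        # Skip homopolymer runs such as AAAAAAAAAA reported as motif "AA".
        if len(set(motif)) == 1:
            continue
        repeats = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, repeats))
    return hits
```

    For example, `find_ssrs("TT" + "GA" * 6 + "CC")` reports one dinucleotide repeat, `(2, "GA", 6)`, while a sequence with only two and a half copies of ACGT yields no hits.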

  5. Transcriptome analysis in sheepgrass (Leymus chinensis): a dominant perennial grass of the Eurasian Steppe.

    Science.gov (United States)

    Chen, Shuangyan; Huang, Xin; Yan, Xueqing; Liang, Ye; Wang, Yuezhu; Li, Xiaofeng; Peng, Xianjun; Ma, Xingyong; Zhang, Lexin; Cai, Yueyue; Ma, Tian; Cheng, Liqin; Qi, Dongmei; Zheng, Huajun; Yang, Xiaohan; Li, Xiaoxia; Liu, Gongshe

    2013-01-01

    Sheepgrass [Leymus chinensis (Trin.) Tzvel.] is an important perennial forage grass across the Eurasian Steppe and is known for its adaptability to various environmental conditions. However, insufficient data resources in public databases for sheepgrass limited our understanding of the mechanism of environmental adaptations, gene discovery and molecular marker development. The transcriptome of sheepgrass was sequenced using Roche 454 pyrosequencing technology. We assembled 952,328 high-quality reads into 87,214 unigenes, including 32,416 contigs and 54,798 singletons. There were 15,450 contigs over 500 bp in length. BLAST searches of our database against Swiss-Prot and NCBI non-redundant protein sequences (nr) databases resulted in the annotation of 54,584 (62.6%) of the unigenes. Gene Ontology (GO) analysis assigned 89,129 GO term annotations for 17,463 unigenes. We identified 11,675 core Poaceae-specific and 12,811 putative sheepgrass-specific unigenes by BLAST searches against all plant genome and transcriptome databases. A total of 2,979 specific freezing-responsive unigenes were found from this RNAseq dataset. We identified 3,818 EST-SSRs in 3,597 unigenes, and some SSRs contained unigenes that were also candidates for freezing-response genes. Characterizations of nucleotide repeats and dominant motifs of SSRs in sheepgrass were also performed. Similarity and phylogenetic analysis indicated that sheepgrass is closely related to barley and wheat. This research has greatly enriched sheepgrass transcriptome resources. The identified stress-related genes will help us to decipher the genetic basis of the environmental and ecological adaptations of this species and will be used to improve wheat and barley crops through hybridization or genetic transformation. The EST-SSRs reported here will be a valuable resource for future gene-phenotype studies and for the molecular breeding of sheepgrass and other Poaceae species.
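
    The assembly figures quoted in this abstract are internally consistent, which a few lines of Python can confirm. The numbers are copied from the abstract; the variable names are only illustrative.

```python
# Counts as reported in the abstract.
contigs = 32_416
singletons = 54_798
unigenes = 87_214
annotated = 54_584

# Contigs plus singletons should account for every unigene.
assert contigs + singletons == unigenes

# Share of unigenes annotated against Swiss-Prot and nr, to one decimal place.
annotated_pct = round(100 * annotated / unigenes, 1)
print(annotated_pct)  # 62.6
```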

  6. The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine.

    Science.gov (United States)

    Stenson, Peter D; Mort, Matthew; Ball, Edward V; Shaw, Katy; Phillips, Andrew; Cooper, David N

    2014-01-01

    The Human Gene Mutation Database (HGMD®) is a comprehensive collection of germline mutations in nuclear genes that underlie, or are associated with, human inherited disease. By June 2013, the database contained over 141,000 different lesions detected in over 5,700 different genes, with new mutation entries currently accumulating at a rate exceeding 10,000 per annum. HGMD was originally established in 1996 for the scientific study of mutational mechanisms in human genes. However, it has since acquired a much broader utility as a central unified disease-oriented mutation repository utilized by human molecular geneticists, genome scientists, molecular biologists, clinicians and genetic counsellors as well as by those specializing in biopharmaceuticals, bioinformatics and personalized genomics. The public version of HGMD (http://www.hgmd.org) is freely available to registered users from academic institutions/non-profit organizations whilst the subscription version (HGMD Professional) is available to academic, clinical and commercial users under license via BIOBASE GmbH.

  7. Sequence protein identification by randomized sequence database and transcriptome mass spectrometry (SPIDER-TMS): from manual to automatic application of a 'de novo sequencing' approach.

    Science.gov (United States)

    Pascale, Raffaella; Grossi, Gerarda; Cruciani, Gabriele; Mecca, Giansalvatore; Santoro, Donatello; Sarli Calace, Renzo; Falabella, Patrizia; Bianco, Giuliana

    Sequence protein identification by a randomized sequence database and transcriptome mass spectrometry software package has been developed at the University of Basilicata in Potenza (Italy) and designed to facilitate the determination of the amino acid sequence of a peptide as well as an unequivocal identification of proteins in a high-throughput manner with enormous advantages of time, economical resource and expertise. The software package is a valid tool for the automation of a de novo sequencing approach, overcoming the main limits and a versatile platform useful in the proteomic field for an unequivocal identification of proteins, starting from tandem mass spectrometry data. The strength of this software is that it is a user-friendly and non-statistical approach, so protein identification can be considered unambiguous.

  8. A curated transcriptome dataset collection to investigate the functional programming of human hematopoietic cells in early life.

    Science.gov (United States)

    Rahman, Mahbuba; Boughorbel, Sabri; Presnell, Scott; Quinn, Charlie; Cugno, Chiara; Chaussabel, Damien; Marr, Nico

    2016-01-01

    Compendia of large-scale datasets made available in public repositories provide an opportunity to identify and fill gaps in biomedical knowledge. But first, these data need to be made readily accessible to research investigators for interpretation. Here we make available a collection of transcriptome datasets to investigate the functional programming of human hematopoietic cells in early life. Thirty-two datasets were retrieved from the NCBI Gene Expression Omnibus (GEO) and loaded in a custom web application called the Gene Expression Browser (GXB), which was designed for interactive query and visualization of integrated large-scale data. Quality control checks were performed. Multiple sample groupings and gene rank lists were created allowing users to reveal age-related differences in transcriptome profiles, changes in the gene expression of neonatal hematopoietic cells to a variety of immune stimulators and modulators, as well as during cell differentiation. Available demographic, clinical, and cell phenotypic information can be overlaid with the gene expression data and used to sort samples. Web links to customized graphical views can be generated and subsequently inserted in manuscripts to report novel findings. GXB also enables browsing of a single gene across projects, thereby providing new perspectives on age- and developmental stage-specific expression of a given gene across the human hematopoietic system. This dataset collection is available at: http://developmentalimmunology.gxbsidra.org/dm3/geneBrowser/list.

  9. Genomic and transcriptome profiling identified both human and HBV genetic variations and their interactions in Chinese hepatocellular carcinoma

    Directory of Open Access Journals (Sweden)

    Hua Dong

    2015-12-01

    Interaction between HBV and host genome integrations in hepatocellular carcinoma (HCC) development is a complex process and the mechanism is still unclear. Here we describe in detail the quality controls and data mining of aCGH and transcriptome sequencing data on 50 HCC samples from Chinese patients, published by Dong et al. (2015) (GEO#: GSE65486). In addition to the HBV-MLL4 integration discovered, we also investigated the genetic aberrations of HBV and host genes as well as their genetic interactions. We report human genome copy number changes and frequent transcriptome variations (e.g. TP53, CTNNB1 mutation, especially MLL family mutations) in this cohort of patients. For HBV genotype C, we identified a novel linkage disequilibrium region covering HBV replication regulatory elements, including basal core promoter, DR1, epsilon and poly-A regions, which is associated with HBV core antigen over-expression and almost exclusive to HBV-MLL4 integration.

  10. RAD9 deficiency enhances radiation induced bystander DNA damage and transcriptomal response

    International Nuclear Information System (INIS)

    Ghandhi, Shanaz A; Ponnaiya, Brian; Panigrahi, Sunil K; Hopkins, Kevin M; Cui, Qingping; Hei, Tom K; Amundson, Sally A; Lieberman, Howard B

    2014-01-01

    Radiation induced bystander effects are an important component of the overall response of cells to irradiation and are associated with human health risks. The mechanism responsible includes intra-cellular and inter-cellular signaling by which the bystander response is propagated. However, details of the signaling mechanism are not well defined. We measured the bystander response of Mrad9 +/+ and Mrad9 −/− mouse embryonic stem cells, as well as human H1299 cells with inherent or RNA interference-mediated reduced RAD9 levels after exposure to 1 Gy α particles, by scoring chromosomal aberrations and micronuclei formation, respectively. In addition, we used microarray gene expression analyses to profile the transcriptome of directly irradiated and bystander H1299 cells. We demonstrated that Mrad9 null enhances chromatid aberration frequency induced by radiation in bystander mouse embryonic stem cells. In addition, we found that H1299 cells with reduced RAD9 protein levels showed a higher frequency of radiation induced bystander micronuclei formation, compared with parental cells containing inherent levels of RAD9. The enhanced bystander response in human cells was associated with a unique transcriptomic profile. In unirradiated cells, RAD9 reduction broadly affected stress response pathways at the mRNA level; there was reduction in transcript levels corresponding to genes encoding multiple members of the UVA-MAPK and p38MAPK families, such as STAT1 and PARP1, suggesting that these signaling mechanisms may not function optimally when RAD9 is reduced. Using network analysis, we found that differential activation of the SP1 and NUPR1 transcriptional regulators was predicted in directly irradiated and bystander H1299 cells. Transcription factor prediction analysis also implied that HIF1α (Hypoxia induced factor 1 alpha) activation by protein stabilization in irradiated cells could be a negative predictor of the bystander response, suggesting that local hypoxic stress

  11. HMDB 3.0--The Human Metabolome Database in 2013.

    Science.gov (United States)

    Wishart, David S; Jewison, Timothy; Guo, An Chi; Wilson, Michael; Knox, Craig; Liu, Yifeng; Djoumbou, Yannick; Mandal, Rupasri; Aziat, Farid; Dong, Edison; Bouatra, Souhaila; Sinelnikov, Igor; Arndt, David; Xia, Jianguo; Liu, Philip; Yallou, Faizath; Bjorndahl, Trent; Perez-Pineiro, Rolando; Eisner, Roman; Allen, Felicity; Neveu, Vanessa; Greiner, Russ; Scalbert, Augustin

    2013-01-01

    The Human Metabolome Database (HMDB) (www.hmdb.ca) is a resource dedicated to providing scientists with the most current and comprehensive coverage of the human metabolome. Since its first release in 2007, the HMDB has been used to facilitate research for nearly 1000 published studies in metabolomics, clinical biochemistry and systems biology. The most recent release of HMDB (version 3.0) has been significantly expanded and enhanced over the 2009 release (version 2.0). In particular, the number of annotated metabolite entries has grown from 6500 to more than 40,000 (a 600% increase). This enormous expansion is a result of the inclusion of both 'detected' metabolites (those with measured concentrations or experimental confirmation of their existence) and 'expected' metabolites (those for which biochemical pathways are known or human intake/exposure is frequent but the compound has yet to be detected in the body). The latest release also has greatly increased the number of metabolites with biofluid or tissue concentration data, the number of compounds with reference spectra and the number of data fields per entry. In addition to this expansion in data quantity, new database visualization tools and new data content have been added or enhanced. These include better spectral viewing tools, more powerful chemical substructure searches, an improved chemical taxonomy and better, more interactive pathway maps. This article describes these enhancements to the HMDB, which was previously featured in the 2009 NAR Database Issue. (Note to referees, HMDB 3.0 will go live on 18 September 2012.).

  12. De novo transcriptome assembly of two Vigna angularis varieties collected from Korea

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2016-06-01

    The adzuki bean (Vigna angularis), a member of the family Fabaceae, is widely grown in Asia, from East Asia to the Himalayas. The adzuki bean is known as an ingredient that adds sweetness to diverse desserts made in Eastern Asian countries. Libraries prepared from two V. angularis varieties referred to as Taejin Black and Taejin Red were paired-end sequenced using the Illumina HiSeq 2000 system. The raw data from this study are available in the NCBI SRA database under accession numbers SRR3406660 and SRR3406553. After de novo transcriptome assembly using Trinity, we obtained 324,219 and 280,056 transcripts from Taejin Black and Taejin Red, respectively. We predicted a total of 238,321 proteins and 179,519 proteins for Taejin Black and Taejin Red, respectively, by the TransDecoder program. We carried out BLASTP on the predicted proteins against the Swiss-Prot protein sequence database to predict the putative functions of identified proteins. Taken together, we provide transcriptomes of two adzuki bean varieties by RNA-Seq, which might be usefully applied to generate molecular markers.

  13. Bioinformatics analysis of transcriptome dynamics during growth in angus cattle longissimus muscle.

    Science.gov (United States)

    Moisá, Sonia J; Shike, Daniel W; Graugnard, Daniel E; Rodriguez-Zas, Sandra L; Everts, Robin E; Lewin, Harris A; Faulkner, Dan B; Berger, Larry L; Loor, Juan J

    2013-01-01

    Transcriptome dynamics in the longissimus muscle (LM) of young Angus cattle were evaluated at 0, 60, 120, and 220 days from early-weaning. Bioinformatic analysis was performed using the dynamic impact approach (DIA) by means of Kyoto Encyclopedia of Genes and Genomes (KEGG) and Database for Annotation, Visualization and Integrated Discovery (DAVID) databases. Between 0 and 120 days (growing phase) most of the highly-impacted pathways (e.g., ascorbate and aldarate metabolism, drug metabolism, cytochrome P450 and Retinol metabolism) were inhibited. The phase between 120 and 220 days (finishing phase) was characterized by the most striking differences with 3,784 differentially expressed genes (DEGs). Analysis of those DEGs revealed that the most impacted KEGG canonical pathway was glycosylphosphatidylinositol (GPI)-anchor biosynthesis, which was inhibited. Furthermore, inhibition of calpastatin and activation of tyrosine aminotransferase ubiquitination at 220 days promote proteasomal degradation, while the concurrent activation of ribosomal proteins promotes protein synthesis. Therefore, the balance of these processes likely results in a steady-state of protein turnover during the finishing phase. Results underscore the importance of transcriptome dynamics in LM during growth.

  14. A transcriptome-based assessment of the astrocytic dystrophin-associated complex in the developing human brain.

    Science.gov (United States)

    Simon, Matthew J; Murchison, Charles; Iliff, Jeffrey J

    2018-02-01

    Astrocytes play a critical role in regulating the interface between the cerebral vasculature and the central nervous system. Contributing to this is the astrocytic endfoot domain, a specialized structure that ensheathes the entirety of the vasculature and mediates signaling between endothelial cells, pericytes, and neurons. The astrocytic endfoot has been implicated as a critical element of the glymphatic pathway, and changes in protein expression profiles in this cellular domain are linked to Alzheimer's disease pathology. Despite this, basic physiological properties of this structure remain poorly understood including the developmental timing of its formation, and the protein components that localize there to mediate its functions. Here we use human transcriptome data from male and female subjects across several developmental stages and brain regions to characterize the gene expression profile of the dystrophin-associated complex (DAC), a known structural component of the astrocytic endfoot that supports perivascular localization of the astroglial water channel aquaporin-4. Transcriptomic profiling is also used to define genes exhibiting parallel expression profiles to DAC elements, generating a pool of candidate genes that encode gene products that may contribute to the physiological function of the perivascular astrocytic endfoot domain. We found that several genes encoding transporter proteins are transcriptionally associated with DAC genes. © 2017 Wiley Periodicals, Inc.

  15. GenMapDB: a database of mapped human BAC clones

    OpenAIRE

    Morley, Michael; Arcaro, Melissa; Burdick, Joshua; Yonescu, Raluca; Reid, Thomas; Kirsch, Ilan R.; Cheung, Vivian G.

    2001-01-01

    GenMapDB (http://genomics.med.upenn.edu/genmapdb) is a repository of human bacterial artificial chromosome (BAC) clones mapped by our laboratory to sequence-tagged site markers. Currently, GenMapDB contains over 3000 mapped clones that span 19 chromosomes, chromosomes 2, 4, 5, 9–22, X and Y. This database provides positional information about human BAC clones from the RPCI-11 human male BAC library. It also contains restriction fragment analysis data and end sequen...

  16. Comparative analysis of transcriptomes in aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing

    Directory of Open Access Journals (Sweden)

    Taketo Okada

    2016-12-01

    Ephedra plants are taxonomically classified as gymnosperms, and are medicinally important as the botanical origin of crude drugs and as bioresources that contain pharmacologically active chemicals. Here we show a comparative analysis of the transcriptomes of aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing by RNA-Seq. De novo assembly of short cDNA sequence reads generated 23,358, 13,373, and 28,579 contigs longer than 200 bases from aerial stems, roots, or both aerial stems and roots, respectively. The presumed functions encoded by these contig sequences were annotated by BLAST (blastx). Subsequently, these contigs were classified based on gene ontology slims, Enzyme Commission numbers, and the InterPro database. Furthermore, comparative gene expression analysis was performed between aerial stems and roots. These transcriptome analyses revealed differences and similarities between the transcriptomes of aerial stems and roots in E. sinica. Deep transcriptome sequencing of Ephedra should open the door to molecular biological studies based on the entire transcriptome, tissue- or organ-specific transcriptomes, or targeted genes of interest.

  17. Transcriptome Analysis of Two Different Developmental Stages of Paeonia lactiflora Seeds

    Directory of Open Access Journals (Sweden)

    Yonglei Ma

    2017-01-01

    Paeonia lactiflora is a herbaceous flower in the family Paeoniaceae with both hypocotyl and epicotyl dormant seeds. We used high-throughput transcriptome sequencing on two different developmental stages of P. lactiflora seeds to identify seed dormancy and germination-related genes. We performed de novo assembly and annotated a total of 123,577 unigenes, which encoded 24,688 putative proteins with 47 GO categories. A total of 10,714 unigenes were annotated in the KEGG database, and 258 pathways were involved in the annotations. A total of 1795 genes were differentially expressed in the functional enrichment analysis. The key genes for seed germination and dormancy, such as GAI1 and ARF, were confirmed by quantitative reverse transcription-polymerase chain reaction analysis. This is the first report of sequencing the P. lactiflora seed transcriptome. Our results provide a fundamental framework and technical support for further selective breeding and cultivation of Paeonia. Our transcriptomic data also serve as the basis for future genetics and genomics research on Paeonia and its closely related species.

  18. The SACADA database for human reliability and human performance

    International Nuclear Information System (INIS)

    James Chang, Y.; Bley, Dennis; Criscione, Lawrence; Kirwan, Barry; Mosleh, Ali; Madary, Todd; Nowell, Rodney; Richards, Robert; Roth, Emilie M.; Sieben, Scott; Zoulis, Antonios

    2014-01-01

    Lack of appropriate and sufficient human performance data has been identified as a key factor affecting human reliability analysis (HRA) quality especially in the estimation of human error probability (HEP). The Scenario Authoring, Characterization, and Debriefing Application (SACADA) database was developed by the U.S. Nuclear Regulatory Commission (NRC) to address this data need. An agreement between NRC and the South Texas Project Nuclear Operating Company (STPNOC) was established to support the SACADA development with aims to make the SACADA tool suitable for implementation in the nuclear power plants' operator training program to collect operator performance information. The collected data would support the STPNOC's operator training program and be shared with the NRC for improving HRA quality. This paper discusses the SACADA data taxonomy, the theoretical foundation, the prospective data to be generated from the SACADA raw data to inform human reliability and human performance, and the considerations on the use of simulator data for HRA. Each SACADA data point consists of two information segments: context and performance results. Context is a characterization of the performance challenges to task success. The performance results are the results of performing the task. The data taxonomy uses a macrocognitive functions model for the framework. At a high level, information is classified according to the macrocognitive functions of detecting the plant abnormality, understanding the abnormality, deciding the response plan, executing the response plan, and team related aspects (i.e., communication, teamwork, and supervision). The data are expected to be useful for analyzing the relations between context, error modes and error causes in human performance

  19. Global analysis of transcriptome responses and gene expression profiles to cold stress of Jatropha curcas L.

    Science.gov (United States)

    Wang, Haibo; Zou, Zhurong; Wang, Shasha; Gong, Ming

    2013-01-01

    Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of crucial genes for genetically enhancing cold resistance

  20. Database Description - SKIP Stemcell Database | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Full Text Available List Contact us SKIP Stemcell Database Database Description General information of database Database name SKIP Stemcell Database...rsity Journal Search: Contact address http://www.skip.med.keio.ac.jp/en/contact/ Database classification Human Genes and Diseases Dat...abase classification Stemcell Article Organism Taxonomy Name: Homo sapiens Taxonomy ID: 9606 Database...ks: Original website information Database maintenance site Center for Medical Genetics, School of medicine, ...lable Web services Not available URL of Web services - Need for user registration Not available About This Database Database

  1. De novo sequencing, assembly and characterization of antennal transcriptome of Anomala corpulenta Motschulsky (Coleoptera: Rutelidae).

    Directory of Open Access Journals (Sweden)

    Haoliang Chen

    Anomala corpulenta is an important insect pest and can cause enormous economic losses in agriculture, horticulture and forestry. It is widely distributed in China, and both larvae and adults can cause serious damage. It is difficult to control this pest because the larvae live underground. Any new control strategy should exploit alternatives to heavily and frequently used chemical insecticides. However, little genetic research has been carried out on A. corpulenta due to the lack of genomic resources. Genomic resources could be produced by next generation sequencing technologies at low cost and in a short time. In this study, we performed de novo sequencing, assembly and characterization of the antennal transcriptome of A. corpulenta. Illumina sequencing technology was used to sequence the antennal transcriptome of A. corpulenta. Approximately 76.7 million total raw reads and about 68.9 million total clean reads were obtained, and then 35,656 unigenes were assembled. Of these unigenes, 21,463 could be annotated in the NCBI nr database, and, among the annotated unigenes, 11,154 and 6,625 unigenes could be assigned to GO and COG, respectively. Additionally, 16,350 unigenes could be annotated in the Swiss-Prot database, and 14,499 unigenes could map onto 258 pathways in the KEGG Pathway database. We also found 24 unigenes related to OBPs, 6 to CSPs, and in total 167 unigenes related to chemodetection. We analyzed 4 OBP and 3 CSP sequences, and their RT-qPCR results agreed well with their FPKM values. We produced the first large-scale antennal transcriptome of A. corpulenta, which is a species that has little genomic information in public databases. The identified chemodetection unigenes can promote the molecular mechanistic study of behavior in A. corpulenta. These findings provide a general sequence resource for molecular genetics research on A. corpulenta.

  2. Transcriptomic profiles of human foreskin fibroblast cells in response to orf virus.

    Science.gov (United States)

    Chen, Daxiang; Long, Mingjian; Xiao, Bin; Xiong, Yufeng; Chen, Huiqin; Chen, Yu; Kuang, Zhenzhan; Li, Ming; Wu, Yingsong; Rock, Daniel L; Gong, Daoyuan; Wang, Yong; He, Haijian; Liu, Fang; Luo, Shuhong; Hao, Wenbo

    2017-08-29

    Orf virus has been utilized as a safe and efficient viral vector against not only diverse infectious diseases, but also against tumors. However, the nature of the genes triggered by the vector in human cells is poorly characterized. Using RNA sequencing technology, we compared specific changes in the transcriptomic profiles in human foreskin fibroblast cells following infection by the orf virus. The results indicated that orf virus upregulates or downregulates expression of a variety of genes, including genes involved in antiviral immune response, apoptosis, cell cycle and a series of signaling pathways, such as the IFN and p53-signaling pathways. The orf virus stimulates or inhibits immune gene expression such as chemokines, chemokine receptors, cytokines, cytokine receptors, and molecules involved in antigen uptake and processing after infection. Expression of pro-apoptotic genes increased at 8 hours post-infection. The p53 signaling pathway was activated to induce apoptosis at the same time. However, the cell cycle program was promoted after infection, which may be due to the immunomodulatory genes of the orf virus. This presents the first description of transcription profile changes in human foreskin fibroblast cells after orf virus infection and provides an in-depth analysis of the interaction between the host and orf virus. These data offer new insights into the understanding of the mechanisms of infection by orf virus and identify potential targets for future studies.

  3. Structural Design of HRA Database using generic task for Quantitative Analysis of Human Performance

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Seung Hwan; Kim, Yo Chan; Choi, Sun Yeong; Park, Jin Kyun; Jung Won Dea [KAERI, Daejeon (Korea, Republic of)]

    2016-05-15

    This paper describes a design of generic task based HRA database for quantitative analysis of human performance in order to estimate the number of task conductions. The estimation method to get the total task conduction number using direct counting is not easy to realize and maintain its data collection framework. To resolve this problem, this paper suggests an indirect method and a database structure using generic task that makes it possible to estimate the total number of task conductions based on instructions of operating procedures of nuclear power plants. In order to reduce human errors, therefore, all information on the human errors taken by operators in the power plant should be systematically collected and examined in its management. Korea Atomic Energy Research Institute (KAERI) is carrying out research to develop a data collection framework to establish a Human Reliability Analysis (HRA) database that could be employed as technical bases to generate human error probabilities (HEPs) and performance shaping factors (PSFs). As a result of the study, the essential table schema was designed for the generic task database, which stores generic tasks, procedure lists and task tree structures, and other supporting tables. The number of task conductions based on the operating procedures for HEP estimation was enabled through the generic task database and framework. To verify the framework applicability, a case study for the simulated experiments was performed and analyzed using graphic user interfaces developed in this study.

  4. Structural Design of HRA Database using generic task for Quantitative Analysis of Human Performance

    International Nuclear Information System (INIS)

    Kim, Seung Hwan; Kim, Yo Chan; Choi, Sun Yeong; Park, Jin Kyun; Jung Won Dea

    2016-01-01

    This paper describes a design of generic task based HRA database for quantitative analysis of human performance in order to estimate the number of task conductions. The estimation method to get the total task conduction number using direct counting is not easy to realize and maintain its data collection framework. To resolve this problem, this paper suggests an indirect method and a database structure using generic task that makes it possible to estimate the total number of task conductions based on instructions of operating procedures of nuclear power plants. In order to reduce human errors, therefore, all information on the human errors taken by operators in the power plant should be systematically collected and examined in its management. Korea Atomic Energy Research Institute (KAERI) is carrying out research to develop a data collection framework to establish a Human Reliability Analysis (HRA) database that could be employed as technical bases to generate human error probabilities (HEPs) and performance shaping factors (PSFs). As a result of the study, the essential table schema was designed for the generic task database, which stores generic tasks, procedure lists and task tree structures, and other supporting tables. The number of task conductions based on the operating procedures for HEP estimation was enabled through the generic task database and framework. To verify the framework applicability, a case study for the simulated experiments was performed and analyzed using graphic user interfaces developed in this study.

  5. An integrated genomic and transcriptomic survey of mucormycosis-causing fungi

    Science.gov (United States)

    Chibucos, Marcus C.; Soliman, Sameh; Gebremariam, Teclegiorgis; Lee, Hongkyu; Daugherty, Sean; Orvis, Joshua; Shetty, Amol C.; Crabtree, Jonathan; Hazen, Tracy H.; Etienne, Kizee A.; Kumari, Priti; O'Connor, Timothy D.; Rasko, David A.; Filler, Scott G.; Fraser, Claire M.; Lockhart, Shawn R.; Skory, Christopher D.; Ibrahim, Ashraf S.; Bruno, Vincent M.

    2016-01-01

    Mucormycosis is a life-threatening infection caused by Mucorales fungi. Here we sequence 30 fungal genomes, and perform transcriptomics with three representative Rhizopus and Mucor strains and with human airway epithelial cells during fungal invasion, to reveal key host and fungal determinants contributing to pathogenesis. Analysis of the host transcriptional response to Mucorales reveals platelet-derived growth factor receptor B (PDGFRB) signaling as part of a core response to divergent pathogenic fungi; inhibition of PDGFRB reduces Mucorales-induced damage to host cells. The unique presence of CotH invasins in all invasive Mucorales, and the correlation between CotH gene copy number and clinical prevalence, are consistent with an important role for these proteins in mucormycosis pathogenesis. Our work provides insight into the evolution of this medically and economically important group of fungi, and identifies several molecular pathways that might be exploited as potential therapeutic targets. PMID:27447865

  6. Integrated transcriptome and binding sites analysis implicates E2F in the regulation of self-renewal in human pluripotent stem cells.

    Directory of Open Access Journals (Sweden)

    Hock Chuan Yeo

    Rapid cellular growth and multiplication, limited replicative senescence, calibrated sensitivity to apoptosis, and a capacity to differentiate into almost any cell type are major properties that underlie the self-renewal capabilities of human pluripotent stem cells (hPSCs). We developed an integrated bioinformatics pipeline to understand the gene regulation and functions involved in maintaining such self-renewal properties of hPSCs compared to matched fibroblasts. An initial genome-wide screening of transcription factor activity using in silico binding-site and gene expression microarray data newly identified E2F as one of the major candidate factors, revealing their significant regulation of the transcriptome. This is underscored by an elevated level of its transcription factor activity and expression in all tested pluripotent stem cell lines. Subsequent analysis of functional gene groups demonstrated the importance of the TFs to self-renewal in the pluripotency-coupled context; E2F directly targets the global signaling (e.g. self-renewal associated WNT and FGF pathways) and metabolic network (e.g. energy generation pathways, molecular transports and fatty acid metabolism) to promote its canonical functions that are driving the self-renewal of hPSCs. In addition, we proposed a core self-renewal module of regulatory interplay between E2F and the WNT and FGF pathways in these cells. Thus, we conclude that E2F plays a significant role in influencing the self-renewal capabilities of hPSCs.

  7. Generation of iPSC lines from primary human chorionic villi cells

    Directory of Open Access Journals (Sweden)

    Björn Lichtner

    2015-11-01

    Primary human chorionic villi (CV) cells were used to generate the iPSC line by retroviral transduction of the four Yamanaka factors OCT4, SOX2, KLF4 and c-MYC. Pluripotency was confirmed both in vivo and in vitro. The transcriptomes of the CV-derived iPSC lines and the human embryonic stem cell lines H1 and H9 have Pearson correlations of 0.929 and 0.943, respectively.
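    A Pearson correlation between two transcriptomes, as quoted above, can be illustrated with a minimal Python sketch. The record does not say how the expression values were preprocessed, so the function name `pearson` and the short expression vectors below are hypothetical and serve only to show the calculation.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-deviations and sums of squared deviations.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical log-scale expression values for the same five genes
# in two cell lines (illustrative numbers, not data from the record).
line_a = [2.1, 5.4, 7.9, 3.3, 6.0]
line_b = [2.4, 5.1, 8.2, 3.0, 6.3]
print(pearson(line_a, line_b))
```

    Values close to 1 indicate that genes highly expressed in one line tend to be highly expressed in the other.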

  8. HIP2: An online database of human plasma proteins from healthy individuals

    Directory of Open Access Journals (Sweden)

    Shen Changyu

    2008-04-01

    Abstract Background With the introduction of increasingly powerful mass spectrometry (MS) techniques for clinical research, several recent large-scale MS proteomics studies have sought to characterize the entire human plasma proteome with a general objective of identifying thousands of proteins leaked from tissues in the circulating blood. Understanding the basic constituents, diversity, and variability of the human plasma proteome is essential to the development of sensitive molecular diagnosis and treatment monitoring solutions for future biomedical applications. Biomedical researchers today, however, do not have an integrated online resource in which they can search for plasma proteins collected from different mass spectrometry platforms, experimental protocols, and search software for healthy individuals. The lack of such a resource for comparisons has made it difficult to interpret proteomics profile changes in patients' plasma and to design protein biomarker discovery experiments. Description To aid future protein biomarker studies of disease and health from human plasma, we developed an online database, HIP2 (Healthy Human Individual's Integrated Plasma Proteome). The current version contains 12,787 protein entries linked to 86,831 peptide entries identified using different MS platforms. Conclusion This web-based database will be useful to biomedical researchers involved in biomarker discovery research. This database has been developed to be the comprehensive collection of healthy human plasma proteins, and has protein data captured in a relational database schema built to contain mappings of supporting peptide evidence from several high-quality and high-throughput mass-spectrometry (MS) experimental data sets. Users can search for plasma protein/peptide annotations, peptide/protein alignments, and experimental/sample conditions with options for filter-based retrieval to achieve greater analytical power for discovery and validation.

  9. De Novo Assembly of the Donkey White Blood Cell Transcriptome and a Comparative Analysis of Phenotype-Associated Genes between Donkeys and Horses.

    Directory of Open Access Journals (Sweden)

    Feng-Yun Xie

    Prior to the mechanization of agriculture and labor-intensive tasks, humans used donkeys (Equus africanus asinus) for farm work and packing. However, as mechanization increased, donkeys have been increasingly raised for meat, milk, and fur in China. To maintain the development of the donkey industry, breeding programs should focus on traits related to these new uses. Compared to conventional marker-assisted breeding plans, genome- and transcriptome-based selection methods are more efficient and effective. To analyze the coding genes of the donkey genome, we assembled the transcriptome of donkey white blood cells de novo. Using transcriptomic deep-sequencing data, we identified 264,714 distinct donkey unigenes and predicted 38,949 protein fragments. We annotated the donkey unigenes by BLAST searches against the non-redundant (NR) protein database. We also compared the donkey protein sequences with those of the horse (E. caballus) and wild horse (E. przewalskii), and linked the donkey protein fragments with mammalian phenotypes. As the outer ear size of donkeys and horses are obviously different, we compared the outer ear size-associated proteins in donkeys and horses. We identified three ear size-associated proteins, HIC1, PRKRA, and KMT2A, with sequence differences among the donkey, horse, and wild horse loci. Since the donkey genome sequence has not been released, the de novo assembled donkey transcriptome is helpful for preliminary investigations of donkey cultivars and for genetic improvement.

  10. Transcriptome Profiling of the Abdominal Skin of Larimichthys crocea in Light Stress

    Science.gov (United States)

    Han, Zhaofang; Lv, Changhuan; Xiao, Shijun; Ye, Kun; Zhang, Dongling; Tsai, Huai Jen; Wang, Zhiyong

    2018-04-01

    Large yellow croaker (Larimichthys crocea), one of the most important marine fish species in China, can change its abdominal skin color when it is shifted from light to dark or from dark to light, providing an opportunity to investigate the molecular response mechanism of teleosts to light stress. The gene expression profile of fish under light stress is rarely documented. In this research, the transcriptome profiles of the abdominal skin of L. crocea exposed to light or dark for 0 h, 0.5 h and 2 h were produced by next-generation sequencing (NGS). The cluster results demonstrated that stress period, rather than light intensity (e.g., light or dark), is the major influencing factor. Differentially expressed genes (DEGs) were identified between 0 h and 0.5 h groups, between 0 h and 2 h groups, between 0.5 h light and 0.5 h dark, and between 2 h light and 2 h dark, respectively. The gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotation revealed that the genes relating to immunity, energy metabolism, and cytoskeletal protein binding were significantly enriched. The detailed analysis of transcriptome profiles also revealed regular gene expression trends, indicating that elaborate gene regulation networks underlie the molecular responses of the fish to light stress. This transcriptome analysis suggested that systematic and complicated regulatory cascades were functionally activated in response to external stress, and coloration change caused by light stress was mainly attributed to the change in the density of chromatophores for L. crocea. This study also provided valuable information for skin coloration or light stress research on other marine fish species.

  11. De novo transcriptome assembly of a sour cherry cultivar, Schattenmorelle

    Directory of Open Access Journals (Sweden)

    Yeonhwa Jo

    2015-12-01

    Sour cherry (Prunus cerasus) in the genus Prunus in the family Rosaceae is one of the most popular stone fruit trees worldwide. Of known sour cherry cultivars, the Schattenmorelle is a famous old sour cherry with a high amount of fruit production. The Schattenmorelle was selected before 1650 and described in the 1800s. This cultivar was named after gardens of the Chateau de Moreille in which the cultivar was initially found. In order to identify new genes and to develop genetic markers for sour cherry, we performed a transcriptome analysis of a sour cherry. We selected the cultivar Schattenmorelle, which is among commercially important cultivars in Europe and North America. We obtained 2.05 GB raw data from the Schattenmorelle (NCBI accession number: SRX1187170). De novo transcriptome assembly using Trinity identified 61,053 transcripts in which N50 was 611 bp. Next, we identified 25,585 protein coding sequences using TransDecoder. The identified proteins were blasted against NCBI's non-redundant database for annotation. Based on blast search, we taxonomically classified the obtained sequences. As a result, we provide the transcriptome of sour cherry cultivar Schattenmorelle using next generation sequencing.
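    The N50 statistic quoted above can be illustrated with a short Python sketch. It uses the common definition (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembled length); the record does not state which tool or variant produced the 611 bp figure, and the function name `n50` and the example lengths are hypothetical.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sort lengths from longest to shortest, accumulate them, and return
    the length at which the running total first reaches half of the
    total assembled length.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical transcript lengths in bp (not data from the record).
example_lengths = [200, 300, 400, 500, 600]
print(n50(example_lengths))
```

    A larger N50 generally indicates that more of the assembly sits in long transcripts, although for transcriptomes it is influenced by isoform redundancy as well as assembly quality.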

  12. A two term model of the confinement in Elmy H-modes using the global confinement and pedestal databases

    International Nuclear Information System (INIS)

    2003-01-01

    Two different physical models of the H-mode pedestal are tested against the joint pedestal-core database. These models are then combined with models for the core and shown to give a good fit to the ELMy H-mode database. Predictions are made for the next step tokamaks ITER and FIRE. (author)

  13. Comparative de novo transcriptome analysis of male and female Sea buckthorn.

    Science.gov (United States)

    Bansal, Ankush; Salaria, Mehul; Sharma, Tashil; Stobdan, Tsering; Kant, Anil

    2018-02-01

    Sea buckthorn is a dioecious medicinal plant found at high altitude. The plant has male and female reproductive organs in separate individuals. In this article, whole transcriptome de novo assemblies of male and female flower bud samples were carried out using the Illumina NextSeq 500 platform to determine the role of the genes involved in sex determination. Moreover, genes with differential expression in male and female transcriptomes were identified to understand the underlying sex determination mechanism. The current study showed 63,904 and 62,272 coding sequences (CDS) in female and male transcriptome data sets, respectively. 16,831 common CDS were screened out from both transcriptomes, out of which 625 were upregulated and 491 were found to be downregulated. To understand the potential regulatory roles of differentially expressed genes in metabolic networks and biosynthetic pathways, KEGG mapping, gene ontology, and co-expression network analysis were performed. Comparison with Flowering Interactive Database (FLOR-ID) resulted in eight differentially expressed genes, viz. CHD3-type chromatin-remodeling factor PICKLE (PKL), phytochrome-associated serine/threonine-protein phosphatase (FYPP), protein TOPLESS (TPL), sensitive to freezing 6 (SFR6), lysine-specific histone demethylase 1 homolog 1 (LDL1), pre-mRNA-processing-splicing factor 8A (PRP8A), sucrose synthase 4 (SUS4), and ubiquitin carboxyl-terminal hydrolase 12 (UBP12), known to be broadly involved in flowering, photoperiodism, embryo development, and cold response pathways. Male and female flower bud transcriptome data of Sea buckthorn may provide comprehensive information at genomic level for the identification of genetic regulation involved in sex determination.

  14. Proteogenomics Dashboard for the Human Proteome Project.

    Science.gov (United States)

    Tabas-Madrid, Daniel; Alves-Cruzeiro, Joao; Segura, Victor; Guruceaga, Elizabeth; Vialas, Vital; Prieto, Gorka; García, Carlos; Corrales, Fernando J; Albar, Juan Pablo; Pascual-Montano, Alberto

    2015-09-04

    dasHPPboard is a novel proteomics-based dashboard that collects and reports the experiments produced by the Spanish Human Proteome Project consortium (SpHPP) and aims to help HPP to map the entire human proteome. We have followed the strategy of analog genomics projects like the Encyclopedia of DNA Elements (ENCODE), which provides a vast amount of data on human cell lines experiments. The dashboard includes results of shotgun and selected reaction monitoring proteomics experiments, post-translational modifications information, as well as proteogenomics studies. We have also processed the transcriptomics data from the ENCODE and Human Body Map (HBM) projects for the identification of specific gene expression patterns in different cell lines and tissues, taking special interest in those genes having little proteomic evidence available (missing proteins). Peptide databases have been built using single nucleotide variants and novel junctions derived from RNA-Seq data that can be used in search engines for sample-specific protein identifications on the same cell lines or tissues. The dasHPPboard has been designed as a tool that can be used to share and visualize a combination of proteomic and transcriptomic data, providing at the same time easy access to resources for proteogenomics analyses. The dasHPPboard can be freely accessed at: http://sphppdashboard.cnb.csic.es.

  15. The testis and ovary transcriptomes of the rock bream (Oplegnathus fasciatus): A bony fish with a unique neo Y chromosome

    Directory of Open Access Journals (Sweden)

    Dongdong Xu

    2016-03-01

    The rock bream (Oplegnathus fasciatus) is considered one of the most economically important marine fish in East Asia and has a unique neo-Y chromosome system that is a good model to study sex determination and differentiation in fish. In the present study, we used Illumina sequencing technology (HiSeq2000) to sequence, assemble and annotate the transcriptome of the testis and ovary tissues of rock bream. A total of 40,004,378 (NCBI SRA database SRX1406649) and 53,108,992 (NCBI SRA database SRX1406648) high quality reads were obtained from testis and ovary RNA sequencing, respectively, and 60,421 contigs (with average length of 1301 bp) were obtained after de novo assembling with Trinity software. Digital gene expression analysis reveals 14,036 contigs that show gender-enriched expressional profile with either testis-enriched (237 contigs) or ovary-enriched (581 contigs) with RPKM >100. There are 237 male- and 582 female-abundant expressed genes that show sex dimorphic expression. We hope that the gonad transcriptome and those gender-enriched transcripts of rock bream can provide some insight into the understanding of genome-wide transcriptome profile of teleost gonad tissue and give useful information in fish gonad development. Keywords: Gonad transcriptome, Testis, Ovary, Rock bream
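    The RPKM threshold mentioned above can be made concrete with a small Python sketch. It assumes the standard definition of RPKM (reads per kilobase of transcript per million mapped reads); the record does not describe the exact pipeline used, and the function name `rpkm` and the read counts below are hypothetical.

```python
def rpkm(read_count, length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads.

    RPKM = read_count * 1e9 / (length_bp * total_mapped_reads),
    i.e. the read count normalised by transcript length (in kb)
    and by sequencing depth (in millions of mapped reads).
    """
    if length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (length_bp * total_mapped_reads)


# Hypothetical contig: 6,000 reads on a 1,300 bp contig in a library
# with 40 million mapped reads (illustrative numbers only).
value = rpkm(read_count=6000, length_bp=1300, total_mapped_reads=40_000_000)
print(value > 100)  # would this contig pass an RPKM > 100 filter?
```

    Because RPKM divides by both contig length and library size, it allows contigs of different lengths and libraries of different depths to be compared against the same threshold.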

  16. Principle considerations for the use of transcriptomics in doping research.

    Science.gov (United States)

    Neuberger, Elmo W I; Moser, Dirk A; Simon, Perikles

    2011-10-01

    Over the course of the past decade, technical progress has enabled scientists to investigate genome-wide RNA expression using microarray platforms. This transcriptomic approach represents a promising tool for the discovery of basic gene expression patterns and for identification of cellular signalling pathways under various conditions. Since doping substances have been shown to influence mRNA expression, it has been suggested that these changes can be detected by screening the blood transcriptome. In this review, we critically discuss the potential but also the pitfalls of this application as a tool in doping research. Transcriptomic approaches were considered to potentially provide researchers with a unique gene expression signature or with a specific biomarker for various physiological and pathophysiological conditions. Since transcriptomic approaches are considerably prone to biological and technical confounding factors that act on study subjects or samples, very strict guidelines for the use of transcriptomics in human study subjects have been developed. Typical field conditions associated with doping controls limit the feasibility of following these strict guidelines as there are too many variables counteracting a standardized procedure. After almost a decade of research using transcriptomic tools, it still remains a matter of future technological progress to identify the ultimate biomarker using technologies and/or methodologies that are sufficiently robust against typical biological and technical bias and that are valid in a court of law. Copyright © 2011 John Wiley & Sons, Ltd.

  17. Whole transcriptome organisation in the dehydrated supraoptic nucleus

    Directory of Open Access Journals (Sweden)

    C.C.T. Hindmarch

    2013-12-01

    Full Text Available The supraoptic nucleus (SON) is part of the central osmotic circuitry that synthesises the hormone vasopressin (Avp) and transports it to terminals in the posterior lobe of the pituitary. Following osmotic stress such as dehydration, this tissue undergoes morphological, electrical and transcriptional changes to facilitate the appropriate regulation and release of Avp into the circulation where it conserves water at the level of the kidney. Here, the organisation of the whole transcriptome following dehydration is modelled to fit Zipf's law, a natural power law that holds true for all natural languages, which states that if the frequency of word usage is plotted against its rank, the slope of the log-log linear regression is -1. We have applied this model to our previously published euhydrated and dehydrated SON data to observe this trend and how it changes following dehydration. In accordance with other studies, our whole transcriptome data fit well with this model in the euhydrated SON microarrays, but interestingly, fit better in the dehydrated arrays. This trend was observed in a subset of differentially regulated genes and also following network reconstruction using a third-party database that mines public data. We make use of language as a metaphor that helps us philosophise about the role of the whole transcriptome in providing a suitable environment for the delivery of Avp following a survival threat like dehydration.
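
    The Zipf fit described above amounts to an ordinary least-squares regression of log(value) on log(rank). The sketch below uses only the Python standard library; the function name and the synthetic data are assumptions made for illustration and do not reproduce the authors' pipeline.

```python
import math


def zipf_slope(values):
    """Least-squares slope of log(value) against log(rank); rank 1 is the largest value."""
    ranked = sorted((v for v in values if v > 0), reverse=True)
    if len(ranked) < 2:
        raise ValueError("need at least two positive values")
    xs = [math.log(rank) for rank in range(1, len(ranked) + 1)]
    ys = [math.log(v) for v in ranked]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Synthetic expression values that follow value = 1000 / rank exactly,
# so the fitted slope should be very close to -1.
ideal_values = [1000.0 / rank for rank in range(1, 201)]
ideal_slope = zipf_slope(ideal_values)
```

    A dataset that "fits better" in this sense would show a slope nearer to -1, a smaller residual around the regression line, or both; the abstract does not say which measure was used.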

  18. Transcriptomic Analysis and the Expression of Disease-Resistant Genes in Oryza meyeriana under Native Condition.

    Directory of Open Access Journals (Sweden)

    Bin He

    Full Text Available Oryza meyeriana (O. meyeriana), with a GG genome type (2n = 24), has accumulated many excellent characteristics with respect to resistance to diseases such as rice shade and blast, and even immunity to bacterial blight. It is very important to know whether the disease-resistance genes exist and are expressed in this wild rice under native conditions. However, limited genomic or transcriptomic data of O. meyeriana are currently available. In this study, we present the first comprehensive characterization of the O. meyeriana transcriptome using RNA-seq and obtained 185,323 contigs with an average length of 1,692 bp and an N50 of 2,391 bp. Through differential expression analysis, it was found that roots had the most tissue-specifically expressed genes, followed by stems and leaves. By similarity search against protein databases, 146,450 had at least one significant alignment to existing gene models. Comparison with the Oryza sativa (japonica-type Nipponbare and indica-type 93-11) genomes revealed that 13% of the O. meyeriana contigs had not been detected in O. sativa. Many disease-resistance genes, such as bacterial blight resistant, blast resistant, rust resistant, fusarium resistant, cyst nematode resistant and downy mildew genes, were mined from the transcriptomic database. There are two kinds of rice bacterial blight-resistant genes (Xa1 and Xa26) differentially or specifically expressed in O. meyeriana. The 4 Xa1 contigs were all expressed only in root, while three of the Xa26 contigs have the highest expression level in leaves, two have the highest expression level in stems and one was expressed dominantly in roots. The transcriptomic database of O. meyeriana has been constructed and many disease-resistance genes were found to be expressed under native conditions, which provides a foundation for future discovery of a number of novel genes and provides a basis for studying the molecular mechanisms associated with disease

  19. TranscriptomeBrowser 3.0: introducing a new compendium of molecular interactions and a new visualization tool for the study of gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Lepoivre Cyrille

    2012-01-01

    Full Text Available Abstract Background Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various lines of evidence scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users draw new hypotheses without being overwhelmed by the density of the subsequent graph. Results We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i) predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices), (ii) potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii) regulatory interactions curated from the literature, (iv) predicted post-transcriptional regulation by micro-RNA, (v) protein kinase-substrate interactions and (vi) physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed InteractomeBrowser, a graph-based knowledge browser that comes as a plug-in for TranscriptomeBrowser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration

  20. THE EXTRAGALACTIC DISTANCE DATABASE: ALL DIGITAL H I PROFILE CATALOG

    International Nuclear Information System (INIS)

    Courtois, Helene M.; Bonhomme, Nicolas; Tully, R. Brent; Zavodny, Maximilian; Barnes, Austin; Fisher, J. Richard

    2009-01-01

    An important component of the Extragalactic Distance Database is a group of catalogs related to the measurement of H I line profile parameters. One of these is the All Digital H I catalog, which contains an amalgam of information from new data and old. The new data result from observations with Arecibo and Parkes Telescopes and with the Green Bank Telescope, including continuing input since the award of the NRAO Cosmic Flows Large Program. The old data have been collected from archives, wherever available, particularly the Cornell University Digital H I Archive, the Nancay Telescope extragalactic H I archive, and the Australia Telescope H I archive. The catalog currently contains information on ∼15,000 profiles relating to ∼13,000 galaxies. The channel-flux per channel files, from whatever source, are carried through a common pipeline. The derived parameter of greatest interest is $W_{m50}$, the profile width at 50% of the mean flux. After appropriate adjustment, the parameter $W_{mx}$ is derived, the line width that statistically approximates the peak-to-peak maximum rotation velocity before correction for inclination, $2V_{\max}\sin i$.

  1. Transcriptome Profiling of Louisiana iris Root and Identification of Genes Involved in Lead-Stress Response

    Directory of Open Access Journals (Sweden)

    Songqing Tian

    2015-11-01

    Full Text Available Louisiana iris is tolerant to and accumulates the heavy metal lead (Pb). However, there is limited knowledge of the molecular mechanisms behind this feature. We describe the transcriptome of Louisiana iris using Illumina sequencing technology. The root transcriptome of Louisiana iris under control and Pb-stress conditions was sequenced. Overall, 525,498 transcripts representing 313,958 unigenes were assembled using the clean raw reads. Among them, 43,015 unigenes were annotated and their functions classified using the euKaryotic Orthologous Groups (KOG) database. They were divided into 25 molecular families. In the Gene Ontology (GO) database, 50,174 unigenes were categorized into three GO trees (molecular function, cellular component and biological process). After analysis of differentially expressed genes, some Pb-stress-related genes were selected, including biosynthesis genes of chelating compounds, metal transporters, transcription factors and antioxidant-related genes. This study not only lays a foundation for further studies on differential genes under Pb stress, but also facilitates the molecular breeding of Louisiana iris.

  2. Transcriptome Profiling and In Silico Analysis of the Antimicrobial Peptides of the Grasshopper Oxya chinensis sinuosa.

    Science.gov (United States)

    Kim, In-Woo; Markkandan, Kesavan; Lee, Joon Ha; Subramaniyam, Sathiyamoorthy; Yoo, Seungil; Park, Junhyung; Hwang, Jae Sam

    2016-11-28

    Antimicrobial peptides/proteins (AMPs) are present in all types of organisms, from microbes and plants to vertebrates and invertebrates such as insects. The grasshopper Oxya chinensis sinuosa is an insect species that is widely consumed around the world for its broad medicinal value. However, the lack of available genetic information for this species is an obstacle to understanding the full potential of its AMPs. Analysis of the O. chinensis sinuosa transcriptome and expression profile is essential for extending the available genetic information resources. In this study, we determined the whole-body transcriptome of O. chinensis sinuosa and analyzed the potential AMPs induced by bacterial immunization. A high-throughput RNA-Seq approach generated 94,348 contigs and 66,555 unigenes. Of these unigenes, 36,032 (54.14%) matched known proteins in the NCBI database in a BLAST search. Functional analysis demonstrated that 38,219 unigenes were clustered into 5,499 gene ontology terms. In addition, 26 cDNAs encoding novel AMPs were identified by an in silico approach using public databases. Our transcriptome dataset and AMP profile greatly improve our understanding of O. chinensis sinuosa genetics and provide a huge number of gene sequences for further study, including genes of known importance and genes of unknown function.

  3. Identification of myogenic regulatory genes in the muscle transcriptome of beltfish (Trichiurus lepturus): A major commercial marine fish species with robust swimming ability

    Directory of Open Access Journals (Sweden)

    Hui Zhang

    2016-06-01

    Full Text Available The beltfish (Trichiurus lepturus) is considered one of the most economically important marine fish in East Asia. It is a top predator with a robust swimming ability that is a good model to study muscle physiology in fish. In the present study, we used Illumina sequencing technology (NextSeq500) to sequence, assemble and annotate the muscle transcriptome of juvenile beltfish. A total of 57,509,280 clean reads (deposited in NCBI SRA database with accession number of SRX1674471) were obtained from RNA sequencing and 26,811 unigenes (with N50 of 1033 bp) were obtained after de novo assembling with Trinity software. BLASTX against NR, GO, KEGG and eggNOG databases shows 100%, 49%, 31% and 96% annotation rates, respectively. By mining the beltfish muscle transcriptome, several key genes which play an essential role in regulating myogenesis, including pax3, pax7, myf5, myoD, mrf4/myf6, myogenin and myostatin, were identified with a low expression level. The muscle transcriptome of beltfish can provide some insight into the understanding of genome-wide transcriptome profile of teleost muscle tissue and give useful information to study myogenesis in juvenile/adult fish.
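
    The N50 statistic reported above summarizes assembly contiguity: it is the length L such that sequences of length L or longer hold at least half of all assembled bases. The sketch below shows the usual calculation; the function name and the toy lengths are illustrative assumptions and are unrelated to the beltfish assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or unigene lengths (in bp)."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no lengths given")
    half_total = sum(ordered) / 2
    running = 0
    # Accumulate from the longest sequence down until half the bases are covered.
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Made-up lengths: total is 1,500 bp, and 500 + 400 = 900 bp first reaches the halfway mark.
toy_lengths = [100, 200, 300, 400, 500]
toy_n50 = n50(toy_lengths)
```

    Because a few long sequences can dominate the sum, N50 usually differs from the average length, so the two statistics reported across these records are not interchangeable.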

  4. Feasibility of the salivary transcriptome as a novel biomarker in determining disease susceptibility.

    Science.gov (United States)

    Hidayat, M F H; Milne, T; Cullinan, M P; Seymour, G J

    2018-06-01

    The salivary transcriptome may present as a readily available and non-invasive source of potential biomarkers. The development of chronic periodontitis is determined by individual patient susceptibility; hence, the aim of this study was to determine the potential of the salivary transcriptome as a biomarker of disease susceptibility using chronic periodontitis as an example. Using an Oragene® RNA kit, the total RNA was purified from the saliva of 10 patients with chronic periodontitis and 10 patients without chronic periodontitis. The quantity and quality of the total RNA were determined, and a measure of gene expression via cDNA was undertaken using the Affymetrix microarray system. The microarray profiling result was further validated by real-time quantitative polymerase chain reaction. Spectrophotometric analysis showed the total RNA purified from each participant ranged from 0.92 μg/500 μL to 62.85 μg/500 μL. There was great variability in the quantity of total RNA obtained from the 2 groups in the study with a mean of 10.21 ± 12.71 μg/500 μL for the periodontitis group and 15.97 ± 23.47 μg/500 μL for the control group. Further, the RNA purity (based on the A260/A280 ratio) for the majority of participants (9 periodontitis and 6 controls) was within the acceptable limits for downstream analysis (2.0 ± 0.1). The study samples showed 2 distinct bands at 23S (3800 bp) and 16S (1500 bp) characteristic of bacterial rRNA. Preliminary microarray analysis was performed for 4 samples (P2, P6, H5 and H9). The percentage of genes present in each of the 4 samples was not consistent with about 1.8%-18.7% of genes being detected. Quantitative real-time polymerase chain reaction confirmed that the total RNA purified from each sample was mainly bacterial RNA (Uni 16S) with minimal human mRNA. This study showed that minimal amounts of human RNA were able to be isolated from the saliva of patients with periodontitis as well as controls. Further

  5. Transcriptome Analysis of Syringa oblata Lindl. Inflorescence Identifies Genes Associated with Pigment Biosynthesis and Scent Metabolism.

    Directory of Open Access Journals (Sweden)

    Jian Zheng

    Full Text Available Syringa oblata Lindl. is a woody ornamental plant with high economic value and characteristics that include early flowering, multiple flower colors, and strong fragrance. Despite a long history of cultivation, the genetics and molecular biology of S. oblata are poorly understood. Transcriptome and expression profiling data are needed to identify genes and to better understand the biological mechanisms of floral pigments and scents in this species. Nine cDNA libraries were obtained from three replicates of three developmental stages: inflorescence with enlarged flower buds not protruded, inflorescence with corolla lobes not displayed, and inflorescence with flowers fully opened and emitting strong fragrance. Using the Illumina RNA-Seq technique, 319,425,972 clean reads were obtained and were assembled into 104,691 final unigenes (average length of 853 bp), 41.75% of which were annotated in the NCBI non-redundant protein database. Among the annotated unigenes, 36,967 were assigned to gene ontology categories and 19,956 were assigned to eukaryotic orthologous groups. Using the Kyoto Encyclopedia of Genes and Genomes pathway database, 12,388 unigenes were sorted into 286 pathways. Based on these transcriptomic data, we obtained a large number of candidate genes that were differentially expressed at different flower stages and that were related to floral pigment biosynthesis and fragrance metabolism. This comprehensive transcriptomic analysis provides fundamental information on the genes and pathways involved in flower secondary metabolism and development in S. oblata, providing a useful database for further research on S. oblata and other plants of genus Syringa.

  6. Transcriptomic network analysis of micronuclei-related genes: a case study

    DEFF Research Database (Denmark)

    van Leeuwen, D. M.; Pedersen, Marie; Knudsen, Lisbeth E.

    2011-01-01

    Mechanistically relevant information on responses of humans to xenobiotic exposure in relation to chemically induced biological effects, such as micronuclei (MN) formation, can be obtained through large-scale transcriptomics studies. Network analysis may enhance the analysis and visualisation of such data. Therefore, this study aimed to develop a 'MN formation' network based on a priori knowledge, by using the pathway tool MetaCore. The gene network contained 27 genes and three gene complexes that are related to processes involved in MN formation, e.g. spindle assembly checkpoint, cell cycle checkpoint and aneuploidy. The MN-related gene network was tested against a transcriptomics case study associated with MN measurements. In this case study, transcriptomic data from children and adults differentially exposed to ambient air pollution in the Czech Republic were analysed and visualised...

  7. In Silico Functional Networks Identified in Fish Nucleated Red Blood Cells by Means of Transcriptomic and Proteomic Profiling.

    Science.gov (United States)

    Puente-Marin, Sara; Nombela, Iván; Ciordia, Sergio; Mena, María Carmen; Chico, Verónica; Coll, Julio; Ortega-Villaizan, María Del Mar

    2018-04-09

    Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted, in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal of the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome- sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation.

  8. Characterization of mango (Mangifera indica L.) transcriptome and chloroplast genome.

    Science.gov (United States)

    Azim, M Kamran; Khan, Ishtaiq A; Zhang, Yong

    2014-05-01

    We characterized mango leaf transcriptome and chloroplast genome using next generation DNA sequencing. The RNA-seq output of mango transcriptome generated >12 million reads (total nucleotides sequenced >1 Gb). De novo transcriptome assembly generated 30,509 unigenes with lengths in the range of 300 to ≥3,000 nt and 67× depth of coverage. Blast searching against nonredundant nucleotide databases and several Viridiplantae genomic datasets annotated 24,593 mango unigenes (80% of total) and identified Citrus sinensis as closest neighbor of mango with 9,141 (37%) matched sequences. The annotation with gene ontology and Clusters of Orthologous Group terms categorized unigene sequences into 57 and 25 classes, respectively. More than 13,500 unigenes were assigned to 293 KEGG pathways. Besides major plant biology related pathways, KEGG based gene annotation pointed out active presence of an array of biochemical pathways involved in (a) biosynthesis of bioactive flavonoids, flavones and flavonols, (b) biosynthesis of terpenoids and lignins and (c) plant hormone signal transduction. The mango transcriptome sequences revealed 235 proteases belonging to five catalytic classes of proteolytic enzymes. The draft genome of mango chloroplast (cp) was obtained by a combination of Sanger and next generation sequencing. The draft mango cp genome size is 151,173 bp with a pair of inverted repeats of 27,093 bp separated by small and large single copy regions, respectively. Out of 139 genes in the mango cp genome, 91 were found to be protein coding. Sequence analysis revealed the cp genome of C. sinensis as closest neighbor of mango. We found 51 short repeats in the mango cp genome that are supposed to be associated with extensive rearrangements. This is the first report of transcriptome and chloroplast genome analysis of any Anacardiaceae family member.

  9. The Peripheral Whole Blood Transcriptome of Acute Pyelonephritis in Human Pregnancy

    Science.gov (United States)

    Madan, Ichchha; Than, Nandor Gabor; Romero, Roberto; Chaemsaithong, Piya; Miranda, Jezid; Tarca, Adi L.; Bhatti, Gaurav; Draghici, Sorin; Yeo, Lami; Mazor, Moshe; Hassan, Sonia S.; Chaiworapongsa, Tinnakorn

    2018-01-01

    Objective Human pregnancy is characterized by activation of the innate immune response and suppression of adaptive immunity. The former is thought to provide protection against infection to the mother, and the latter, tolerance against paternal antigens expressed in fetal cells. Acute pyelonephritis is associated with an increased risk of acute respiratory distress syndrome and sepsis in pregnant (vs. nonpregnant) women. The objective of this study was to describe the gene expression profile (transcriptome) of maternal whole blood in acute pyelonephritis. Method A case-control study was conducted to include pregnant women with acute pyelonephritis (n=15) and women with a normal pregnancy (n=34). Affymetrix HG-U133 Plus 2.0 arrays (Affymetrix, Santa Clara, CA, USA) were used for gene expression profiling. A linear model was used to test the association between the presence of pyelonephritis and gene expression levels while controlling for white blood cell count and gestational age. A fold change of 1.5 was considered significant at a false discovery rate of 0.1. A subset of differentially expressed genes (n=56) was tested with real-time quantitative reverse transcription-polymerase chain reaction (qRT-PCR) (cases, n=19; controls, n=59). Gene ontology and pathway analysis were applied. Results A total of 983 genes were differentially expressed in acute pyelonephritis: 457 were up-regulated and 526 were down-regulated. Significant enrichment of 300 biological processes and 63 molecular functions was found in pyelonephritis. Significantly impacted pathways in pyelonephritis included a) cytokine-cytokine receptor interaction; b) T-cell receptor signaling; c) Jak-STAT signaling; and d) complement and coagulation cascades. Of 56 genes tested by qRT-PCR, 48 (85.7%) had confirmation of differential expression. Conclusion This is the first study of the transcriptomic signature of whole blood in pregnant women with acute pyelonephritis. Acute infection during pregnancy is
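
    The selection rule quoted above combines a fold-change cut-off of 1.5 with a false discovery rate of 0.1. The abstract does not name the FDR procedure, so the sketch below assumes the commonly used Benjamini-Hochberg adjustment; the function names and the p-values are hypothetical and are not taken from the study.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values (q-values) in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def is_significant(fold_change, q_value, min_fold=1.5, max_fdr=0.1):
    """Apply both cut-offs; fold_change is a positive ratio, so 0.5 counts as a 2-fold decrease."""
    magnitude = max(fold_change, 1.0 / fold_change)
    return magnitude >= min_fold and q_value < max_fdr


# Hypothetical p-values for four genes.
toy_q = benjamini_hochberg([0.01, 0.04, 0.03, 0.5])
```

    Treating the fold change symmetrically lets one rule cover both the up-regulated and the down-regulated gene sets reported in the results.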

  10. Functional annotation of the human retinal pigment epithelium transcriptome

    Directory of Open Access Journals (Sweden)

    Gorgels Theo GMF

    2009-04-01

    Full Text Available Abstract Background To determine level, variability and functional annotation of gene expression of the human retinal pigment epithelium (RPE), the key tissue involved in retinal diseases like age-related macular degeneration and retinitis pigmentosa. Macular RPE cells from six selected healthy human donor eyes (aged 63–78 years) were laser dissected and used for 22k microarray studies (Agilent technologies). Data were analyzed with Rosetta Resolver, the web tool DAVID and Ingenuity software. Results In total, we identified 19,746 array entries with significant expression in the RPE. Gene expression was analyzed according to expression levels, interindividual variability and functionality. A group of highly (n = 2,194) expressed RPE genes showed an overrepresentation of genes of the oxidative phosphorylation, ATP synthesis and ribosome pathways. In the group of moderately expressed genes (n = 8,776), genes of the phosphatidylinositol signaling system and aminosugars metabolism were overrepresented. As expected, the top 10 percent (n = 2,194) of genes with the highest interindividual differences in expression showed functional overrepresentation of the complement cascade, essential in inflammation in age-related macular degeneration, and other signaling pathways. Surprisingly, this same category also includes the genes involved in Bruch's membrane (BM) composition. Among the top 10 percent of genes with low interindividual differences, there was an overrepresentation of genes involved in local glycosaminoglycan turnover. Conclusion Our study expands current knowledge of the RPE transcriptome by assigning new genes, and adding data about expression level and interindividual variation. Functional annotation suggests that the RPE has high levels of protein synthesis, strong energy demands, and is exposed to high levels of oxidative stress and a variable degree of inflammation. Our data sheds new light on the molecular composition of BM, adjacent to the

  11. Two Search Techniques within a Human Pedigree Database

    OpenAIRE

    Gersting, J. M.; Conneally, P. M.; Rogers, K.

    1982-01-01

    This paper presents the basic features of two search techniques from MEGADATS-2 (MEdical Genetics Acquisition and DAta Transfer System), a system for collecting, storing, retrieving and plotting human family pedigrees. The individual search provides a quick method for locating an individual in the pedigree database. This search uses a modified soundex coding and an inverted file structure based on a composite key. The navigational search uses a set of pedigree traversal operations (individual...
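
    The individual search described above rests on a phonetic code plus an inverted file. The abstract does not specify how MEGADATS-2 modifies Soundex or what the composite key contains, so the sketch below shows only the standard American Soundex rules and a hypothetical inverted index keyed on the code alone; all names and records are made up for illustration.

```python
from collections import defaultdict

# Standard Soundex digit groups; vowels, H, W and Y carry no digit.
_CODES = {}
for _digit, _letters in {"1": "BFPV", "2": "CGJKQSXZ", "3": "DT",
                         "4": "L", "5": "MN", "6": "R"}.items():
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(name: str) -> str:
    """Standard four-character Soundex code (not the modified MEGADATS-2 variant)."""
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    previous = _CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != previous:
            digits.append(code)
        previous = code  # a vowel resets the previous code to ""
    return (first + "".join(digits) + "000")[:4]


def build_index(records):
    """Inverted index: Soundex code -> list of record ids (hypothetical key layout)."""
    index = defaultdict(list)
    for record_id, surname in records:
        index[soundex(surname)].append(record_id)
    return index


toy_index = build_index([(1, "Robert"), (2, "Rupert"), (3, "Ashcraft")])
```

    Looking up `toy_index[soundex("Rupert")]` returns every record whose surname shares the code, which is how a phonetic key tolerates spelling variants when locating an individual.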

  12. The Activin A-Peroxisome Proliferator-Activated Receptor Gamma Axis Contributes to the Transcriptome of GM-CSF-Conditioned Human Macrophages.

    Science.gov (United States)

    Nieto, Concha; Bragado, Rafael; Municio, Cristina; Sierra-Filardi, Elena; Alonso, Bárbara; Escribese, María M; Domínguez-Andrés, Jorge; Ardavín, Carlos; Castrillo, Antonio; Vega, Miguel A; Puig-Kröger, Amaya; Corbí, Angel L

    2018-01-01

    GM-CSF promotes the functional maturation of lung alveolar macrophages (A-MØ), whose differentiation is dependent on the peroxisome proliferator-activated receptor gamma (PPARγ) transcription factor. In fact, blockade of GM-CSF-initiated signaling or deletion of the PPARγ-encoding gene PPARG leads to functionally defective A-MØ and the onset of pulmonary alveolar proteinosis. In vitro, macrophages generated in the presence of GM-CSF display potent proinflammatory, immunogenic and tumor growth-limiting activities. Since GM-CSF upregulates PPARγ expression, we hypothesized that PPARγ might contribute to the gene signature and functional profile of human GM-CSF-conditioned macrophages. To verify this hypothesis, PPARγ expression and activity was assessed in human monocyte-derived macrophages generated in the presence of GM-CSF [proinflammatory GM-CSF-conditioned human monocyte-derived macrophages (GM-MØ)] or M-CSF (anti-inflammatory M-MØ), as well as in ex vivo isolated human A-MØ. GM-MØ showed higher PPARγ expression than M-MØ, and the expression of PPARγ in GM-MØ was found to largely depend on activin A. Ligand-induced activation of PPARγ also resulted in distinct transcriptional and functional outcomes in GM-MØ and M-MØ. Moreover, and in the absence of exogenous activating ligands, PPARγ knockdown significantly altered the GM-MØ transcriptome, causing a global upregulation of proinflammatory genes and significantly modulating the expression of genes involved in cell proliferation and migration. Similar effects were observed in ex vivo isolated human A-MØ, where PPARγ silencing led to enhanced expression of genes coding for growth factors and chemokines and downregulation of cell surface pathogen receptors. Therefore, PPARγ shapes the transcriptome of GM-CSF-dependent human macrophages (in vitro derived GM-MØ and ex vivo isolated A-MØ) in the absence of exogenous activating ligands, and its expression is primarily regulated by activin A

  13. De novo transcriptome assembly of two contrasting pumpkin cultivars

    Directory of Open Access Journals (Sweden)

    Aliki Xanthopoulou

    2016-03-01

    Full Text Available Cucurbita pepo (squash, pumpkin, gourd), a worldwide-cultivated vegetable of American origin, is extremely variable in fruit characteristics. However, the information associated with genes and genetic markers for pumpkin is very limited. In order to identify new genes and to develop genetic markers, we performed a transcriptome analysis (RNA-Seq) of two contrasting pumpkin cultivars. Leaves and female flowers of cultivars, ‘Big Moose’ with large round fruits and ‘Munchkin’ with small round fruits, were harvested for total RNA extraction. We obtained a total of 6 GB (Big Moose; http://www.ncbi.nlm.nih.gov/Traces/sra/?run=SRR3056882) and 5 GB (Munchkin; http://www.ncbi.nlm.nih.gov/Traces/sra/?run=SRR3056883) sequence data (NCBI SRA database SRX1502732 and SRX1502735, respectively), which correspond to 18,055,786 and 14,824,292 150-base reads. After quality assessment, the clean sequences were 17,995,932 and 14,774,486 respectively. The numbers of total transcripts for ‘Big Moose’ and ‘Munchkin’ were 84,727 and 68,051, respectively. TransDecoder identified possible coding regions in assembled transcripts. This study provides transcriptome data for two contrasting pumpkin cultivars, which might be useful for genetic marker development and comparative transcriptome analyses. Keywords: RNA-Seq, Pumpkin, Contrasting cultivars, Cucurbita pepo

  14. Identification of Novel Placentally Expressed Aspartic Proteinase in Humans

    Directory of Open Access Journals (Sweden)

    Marta Majewska

    2017-06-01

    Full Text Available This study presents pioneering data concerning the human pregnancy-associated glycoprotein-like family, identified in the genome, of the term placental transcriptome and proteome. RNA-seq allowed the identification of 1364 bp hPAG-L/pep cDNA with at least 56.5% homology with other aspartic proteinases (APs). In silico analyses revealed 388 amino acids (aa) of full-length hPAG-L polypeptide precursor, with 15 aa-signal peptide, 47 aa-blocking peptide and 326 aa-mature protein, and two Asp residues (D), specific for a catalytic cleft of the APs (VVFDTGSSNLWV91-102 and AIVDTGTSLLTG274-285). Capillary sequencing identified 9330 bp of the hPAG-L gene (GenBank Acc. No. KX533473), composed of nine exons and eight introns. Heterologous Western blotting revealed the presence of one dominant 60 kDa isoform of the hPAG-L amongst cellular placental proteins. Detection with anti-pPAG-P and anti-Rec pPAG2 polyclonals allowed identification of the hPAG-L proteins located within regions of chorionic villi, especially within the syncytiotrophoblast of term singleton placentas. Our novel data extend the present knowledge about the human genome, as well as placental transcriptome and proteome during term pregnancy. Presumably, this may contribute to establishing a new diagnostic tool for examination of some disturbances during human pregnancy, as well as growing interest from both scientific and clinical perspectives.

  15. Transcriptomic analysis of endangered Chinese salamander: identification of immune, sex and reproduction-related genes and genetic markers.

    Directory of Open Access Journals (Sweden)

    Rongbo Che

    Full Text Available The Chinese salamander (Hynobius chinensis), an endangered amphibian species endemic to China, has attracted much attention because of its value for studying paleontology and evolutionary history and because of its decreasing population size. Despite increasing interest in the Hynobius chinensis genome, genomic resources for the species are still very limited. A comprehensive transcriptome of Hynobius chinensis, which will provide a resource for genome annotation, candidate gene identification and molecular marker development, should be generated to supplement it. We performed a de novo assembly of the Hynobius chinensis transcriptome by Illumina sequencing. A total of 148,510 nonredundant unigenes with an average length of approximately 580 bp were obtained. In all, 60,388 (40.66%) unigenes showed homologous matches in at least one database and 33,537 (22.58%) unigenes were annotated by all four databases. In total, 41,553 unigenes were categorized into 62 sub-categories by BLAST2GO search, and 19,468 transcripts were assigned to 140 KEGG pathways. A large number of unigenes involved in immune system, local adaptation, reproduction and sex determination were identified, as well as 31,982 simple sequence repeats (SSRs) and 460,923 putative single nucleotide polymorphisms (SNPs). This dataset represents the first transcriptome analysis of the Chinese salamander (Hynobius chinensis), an endangered species, and also the first for the family Hynobiidae. The transcriptome will provide a valuable resource for further research in discovery of new genes, protection of population, adaptive evolution and survey of various pathways, as well as development of molecular markers in the Chinese salamander, and reference information for closely related species.

  16. Construction of an Ostrea edulis database from genomic and expressed sequence tags (ESTs) obtained from Bonamia ostreae infected haemocytes: Development of an immune-enriched oligo-microarray.

    Science.gov (United States)

    Pardo, Belén G; Álvarez-Dios, José Antonio; Cao, Asunción; Ramilo, Andrea; Gómez-Tato, Antonio; Planas, Josep V; Villalba, Antonio; Martínez, Paulino

    2016-12-01

    The flat oyster, Ostrea edulis, is one of the main farmed oysters, not only in Europe but also in the United States and Canada. Bonamiosis due to the parasite Bonamia ostreae has been associated with high mortality episodes in this species. This parasite is an intracellular protozoan that infects haemocytes, the main cells involved in oyster defence. Due to the economical and ecological importance of flat oyster, genomic data are badly needed for genetic improvement of the species, but they are still very scarce. The objective of this study is to develop a sequence database, OedulisDB, with new genomic and transcriptomic resources, providing new data and convenient tools to improve our knowledge of the oyster's immune mechanisms. Transcriptomic and genomic sequences were obtained using 454 pyrosequencing and compiled into an O. edulis database, OedulisDB, consisting of two sets of 10,318 and 7159 unique sequences that represent the oyster's genome (WG) and de novo haemocyte transcriptome (HT), respectively. The flat oyster transcriptome was obtained from two strains (naïve and tolerant) challenged with B. ostreae, and from their corresponding non-challenged controls. Approximately 78.5% of 5619 HT unique sequences were successfully annotated by Blast search using public databases. A total of 984 sequences were identified as being related to immune response and several key immune genes were identified for the first time in flat oyster. Additionally, transcriptome information was used to design and validate the first oligo-microarray in flat oyster enriched with immune sequences from haemocytes. Our transcriptomic and genomic sequencing and subsequent annotation have largely increased the scarce resources available for this economically important species and have enabled us to develop an OedulisDB database and accompanying tools for gene expression analysis. This study represents the first attempt to characterize in depth the O. edulis haemocyte transcriptome in

  17. CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.

    Science.gov (United States)

    Li, Pei; Ji, Guoli; Dong, Min; Schmidt, Emily; Lenox, Douglas; Chen, Liangliang; Liu, Qi; Liu, Lin; Zhang, Jie; Liang, Chun

    2012-09-15

    To address the impending need for exploring rapidly increased transcriptomics data generated for non-model organisms, we developed CBrowse, an AJAX-based web browser for visualizing and analyzing transcriptome assemblies and contigs. Designed in a standard three-tier architecture with a data pre-processing pipeline, CBrowse is essentially a Rich Internet Application that offers many seamlessly integrated web interfaces and allows users to navigate, sort, filter, search and visualize data smoothly. The pre-processing pipeline takes the contig sequence file in FASTA format and its relevant SAM/BAM file as the input; detects putative polymorphisms, simple sequence repeats and sequencing errors in contigs; and generates image, JSON and database-compatible CSV text files that are directly utilized by different web interfaces. CBrowse is a generic visualization and analysis tool that facilitates close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors in transcriptome sequencing projects. CBrowse is distributed under the GNU General Public License, available at http://bioinfolab.muohio.edu/CBrowse/. Contact: liangc@muohio.edu or liangc.mu@gmail.com; glji@xmu.edu.cn. Supplementary data are available at Bioinformatics online.
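    The input/output shape of a pre-processing step like the one described in this record (contig FASTA plus SAM in, database-compatible CSV out) can be illustrated with a minimal sketch. This is not CBrowse's code: the function names, the CSV columns and the toy data are all hypothetical, and only plain-text SAM (not BAM) is handled. The SAM column positions (FLAG in column 2, RNAME in column 3) and the 0x4 "unmapped" flag follow the SAM format itself.

```python
import csv
import io


def read_fasta(handle):
    """Yield (name, sequence) pairs from a FASTA text handle."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Keep only the identifier before the first whitespace.
            name, chunks = line[1:].split()[0], []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def count_mapped_reads(sam_handle):
    """Count mapped alignment records per reference name in SAM text."""
    counts = {}
    for line in sam_handle:
        if line.startswith("@"):  # header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # SAM has 11 mandatory columns
            continue
        flag, rname = int(fields[1]), fields[2]
        if flag & 0x4 or rname == "*":  # read is unmapped
            continue
        counts[rname] = counts.get(rname, 0) + 1
    return counts


def contig_summary_csv(fasta_handle, sam_handle):
    """Return CSV text with one row per contig (hypothetical columns)."""
    counts = count_mapped_reads(sam_handle)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["contig", "length", "mapped_reads"])
    for name, seq in read_fasta(fasta_handle):
        writer.writerow([name, len(seq), counts.get(name, 0)])
    return out.getvalue()


# Toy inputs, invented for illustration only.
TOY_FASTA = ">c1\nACGT\nAC\n>c2\nGG\n"
TOY_SAM = (
    "@HD\tVN:1.6\n"
    "r1\t0\tc1\t1\t60\t4M\t*\t0\t0\tACGT\tIIII\n"
    "r2\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII\n"
    "r3\t16\tc1\t2\t60\t4M\t*\t0\t0\tCGTA\tIIII\n"
)
print(contig_summary_csv(io.StringIO(TOY_FASTA), io.StringIO(TOY_SAM)))
```

    A real pipeline would also need the polymorphism, repeat and error detection steps named in the abstract, which are not attempted here.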

  18. Large-Scale Transcriptome Analysis in Faba Bean (Vicia faba L.) under Ascochyta fabae Infection.

    Directory of Open Access Journals (Sweden)

    Sara Ocaña

    Full Text Available Faba bean is an important food crop worldwide. However, progress in faba bean genomics lags far behind that of model systems due to limited availability of genetic and genomic information. Using the Illumina platform, the faba bean transcriptome from leaves of two lines (29H and Vf136) subjected to Ascochyta fabae infection has been characterized. De novo transcriptome assembly provided a total of 39,185 different transcripts that were functionally annotated, and among these, 13,266 were assigned to gene ontology against Arabidopsis. Quality of the assembly was validated by RT-qPCR amplification of selected differentially expressed transcripts. Comparison of faba bean transcripts with those of better-characterized plant genomes such as Arabidopsis thaliana, Medicago truncatula and Cicer arietinum revealed a sequence similarity of 68.3%, 72.8% and 81.27%, respectively. Moreover, 39,060 single nucleotide polymorphisms (SNPs) and 3,669 InDels were identified for genotyping applications. Mapping of the sequence reads generated onto the assembled transcripts showed that 393 and 457 transcripts were overexpressed in the resistant (29H) and susceptible (Vf136) genotypes, respectively. Transcripts involved in plant-pathogen interactions such as leucine rich proteins (LRR) or plant growth regulators involved in plant adaptation to abiotic and biotic stresses were found to be differently expressed in the resistant line. The results reported here represent the most comprehensive transcript database developed so far in faba bean, providing valuable information that could be used to gain insight into the pathways involved in the resistance mechanism against A. fabae and to identify potential resistance genes to be further used in marker assisted selection.

  19. Small RNA and transcriptome deep sequencing proffers insight into floral gene regulation in Rosa cultivars

    Directory of Open Access Journals (Sweden)

    Kim Jungeun

    2012-11-01

    Full Text Available Abstract Background Roses (Rosa sp.), which belong to the family Rosaceae, are the most economically important ornamental plants, making up 30% of the floriculture market. However, despite high demand for roses, rose breeding programs are limited in molecular resources that could greatly enhance and speed breeding efforts. A better understanding of important genes that contribute to floral development and desired phenotypes will lead to improved rose cultivars. For this study, we analyzed rose miRNAs and the rose flower transcriptome in order to generate a database to expound upon current knowledge regarding regulation of important floral characteristics. A rose genetic database will enable comprehensive analysis of gene expression and regulation via miRNA among different Rosa cultivars. Results We produced more than 0.5 million reads from expressed sequences, totalling more than 110 million bp. From these, we generated 35,657, 31,434, 34,725, and 39,722 flower unigenes from Rosa hybrid: ‘Vital’, ‘Maroussia’, and ‘Sympathy’ and Rosa rugosa Thunb., respectively. The unigenes were assigned functional annotations, domains, metabolic pathways, Gene Ontology (GO) terms, Plant Ontology (PO) terms, and MIPS Functional Catalogue (FunCat) terms. Rose flower transcripts were compared with genes from whole genome sequences of Rosaceae members (apple, strawberry, and peach) and grape. We also produced approximately 40 million small RNA reads from flower tissue for Rosa, representing 267 unique miRNA tags. Among identified miRNAs, 25 of them were novel and 242 of them were conserved miRNAs. Statistical analyses of miRNA profiles revealed both shared and species-specific miRNAs, which presumably affect flower development and phenotypes. Conclusions In this study, we constructed a Rose miRNA and transcriptome database, and we analyzed the miRNAs and transcriptome generated from the flower tissues of four Rosa cultivars. The database provides a

  20. De novo transcriptome assembly of the mycoheterotrophic plant Monotropa hypopitys

    Directory of Open Access Journals (Sweden)

    Alexey V. Beletsky

    2017-03-01

    Full Text Available Monotropa hypopitys (pinesap) is a non-photosynthetic obligately mycoheterotrophic plant of the family Ericaceae. It obtains carbon and other nutrients from the roots of surrounding autotrophic trees through the associated mycorrhizal fungi. In order to understand the evolutionary changes in the plant genome associated with transition to a heterotrophic lifestyle, we performed de novo transcriptomic analysis of M. hypopitys using next-generation sequencing. We obtained the RNA-Seq data from flowers, flower bracts and roots with haustoria using the Illumina HiSeq2500 platform. The raw data obtained in this study are available in the NCBI SRA database under accession number SRP069226. A total of 10.3 GB raw sequence data were obtained, corresponding to 103,357,809 raw reads. A total of 103,025,683 reads remained after removing low-quality reads and trimming the adapter sequences. The Trinity program was used to de novo assemble 98,349 unigenes with an N50 of 1342 bp. Using the TransDecoder program, we predicted 43,505 putative proteins. 38,416 unigenes were annotated in the Swiss-Prot protein sequence database using BLASTX. The obtained transcriptomic data will be useful for further studies of the evolution of plant genomes upon transition to a non-photosynthetic lifestyle and the loss of photosynthesis-related functions.
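    The N50 statistic quoted in this record can be illustrated with a short sketch. It uses the common definition (the length L such that contigs of length at least L together cover half or more of the total assembled bases) and invented toy lengths; the function name is hypothetical and this is not the calculation performed by Trinity. The read-retention figure assumes the 103,025,683 reads are those kept after filtering.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are walked from longest to shortest; the N50 is the length of
    the contig at which the running total first reaches half of all bases.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("n50() needs at least one contig length")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy assembly (invented lengths): total 1500 bp, half is 750 bp.
# 500 + 400 = 900 >= 750, so the N50 is 400.
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths))

# Share of raw reads kept after filtering, from the counts in the abstract
# (assumption: the second count is the number of retained reads).
raw_reads = 103_357_809
clean_reads = 103_025_683
retained_pct = 100 * clean_reads / raw_reads
print(f"{retained_pct:.2f}% of raw reads retained")
```

    N50 is sensitive to how many short contigs are kept, so values from different assemblies are only comparable when the same length cutoff was applied.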

  1. SQUAT: A web tool to mine human, murine and avian SAGE data

    Directory of Open Access Journals (Sweden)

    Besson Jérémy

    2008-09-01

    Full Text Available Abstract Background There is an increasing need in transcriptome research for gene expression data and pattern warehouses. It is important to integrate in these warehouses both raw transcriptomic data and some properties encoded in these data, like local patterns. Description We have developed an application called SQUAT (SAGE Querying and Analysis Tools), which is available at: http://bsmc.insa-lyon.fr/squat/. This database gives access to both raw SAGE data and patterns mined from these data, for three species (human, mouse and chicken). This database allows simple queries like "In which biological situations is my favorite gene expressed?" as well as much more complex queries like "What are the genes that are frequently co-over-expressed with my gene of interest in given biological situations?". Connections with external web databases enrich biological interpretations, and enable sophisticated queries. To illustrate the power of SQUAT, we show and analyze the results of three different queries, one of which led to a biological hypothesis that was experimentally validated. Conclusion SQUAT is a user-friendly information retrieval platform, which aims at bringing some of the state-of-the-art mining tools to biologists.

  2. Comparative Analysis of the Arabidopsis Pollen Transcriptome

    Czech Academy of Sciences Publication Activity Database

    Honys, David; Twell, D.

    2003-01-01

    Roč. 132, - (2003), s. 640-652 ISSN 0032-0889 R&D Projects: GA AV ČR IAA5038207 Grant - others:Royal Society(GB) NATO Postdoctoral Fellowship (to D.H.) Institutional research plan: CEZ:AV0Z5038910; CEZ:MSM 113100003 Keywords : transcriptome profiling * Arabidopsis pollen * male gametophyte Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 5.634, year: 2003

  3. Update History of This Database - Yeast Interacting Proteins Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Yeast Interacting Proteins Database Update History of This Database Date Update contents 201...0/03/29 Yeast Interacting Proteins Database English archive site is opened. 2000/12/4 Yeast Interacting Proteins Database...( http://itolab.cb.k.u-tokyo.ac.jp/Y2H/ ) is released. About This Database Database Description... Download License Update History of This Database Site Policy | Contact Us Update History of This Database... - Yeast Interacting Proteins Database | LSDB Archive ...

  4. Developmental Transcriptome Analysis and Identification of Genes Involved in Larval Metamorphosis of the Razor Clam, Sinonovacula constricta.

    Science.gov (United States)

    Niu, Donghong; Wang, Fei; Xie, Shumei; Sun, Fanyue; Wang, Ze; Peng, Maoxiao; Li, Jiale

    2016-04-01

    The razor clam Sinonovacula constricta is an important commercial species. The lack of developmental transcriptomic data is becoming a bottleneck for further research on the mechanisms underlying settlement and metamorphosis in early development. In this study, de novo transcriptome sequencing was performed for S. constricta at different early developmental stages by using Illumina HiSeq 2000 paired-end (PE) sequencing technology. A total of 112,209,077 PE clean reads were generated. De novo assembly generated 249,795 contigs with an average length of 585 bp. Gene annotation resulted in the identification of 22,870 unigene hits against the NCBI database. Eight unique sequences related to metamorphosis were identified and analyzed using real-time PCR. The razor clam reference transcriptome would provide useful information on early developmental and metamorphosis mechanisms and could be used in the genetic breeding of shellfish.

  5. Saudi anti-human cancer plants database (SACPD): A collection of plants with anti-human cancer activities.

    Science.gov (United States)

    Al-Zahrani, Ateeq Ahmed

    2018-01-30

    Several anticancer drugs have been developed from natural products such as plants. Successful experiments in inhibiting the growth of human cancer cell lines using Saudi plants were published over the last three decades. To date, there is no Saudi anticancer plants database serving as a comprehensive source for the interesting data generated from these experiments. Therefore, there was a need for creating a database to collect, organize, search and retrieve such data. As a result, the current paper describes the generation of the Saudi anti-human cancer plants database (SACPD). The database contains most of the reported information about the naturally growing Saudi anticancer plants. SACPD comprises the scientific and local names of 91 plant species that grow naturally in Saudi Arabia. These species belong to 38 different taxonomic families. In addition, 18 species that represent 16 families of medicinal plants and are intensively sold in the local markets in Saudi Arabia were added to the database. The website provides interesting details, including the plant part containing the anticancer bioactive compounds, plant locations and the cancer/cell type against which they exhibit their anticancer activity. Our survey revealed that breast, liver and leukemia were the most studied cancer cell lines in Saudi Arabia with percentages of 27%, 19% and 15%, respectively. The current SACPD represents a nucleus around which more development efforts can expand to accommodate all future submissions about new Saudi plant species with anticancer activities. SACPD will provide an excellent starting point for researchers and pharmaceutical companies who are interested in developing new anticancer drugs. SACPD is available online at https://teeqrani1.wixsite.com/sapd.

  6. The Consolidated Human Activity Database — Master Version (CHAD-Master) Technical Memorandum

    Science.gov (United States)

    This technical memorandum contains information about the Consolidated Human Activity Database -- Master version, including CHAD contents, inventory of variables: Questionnaire files and Event files, CHAD codes, and references.

  7. Analysis of the transcriptome of Isodon rubescens and key enzymes involved in terpenoid biosynthesis

    Directory of Open Access Journals (Sweden)

    Xiuhong Su

    2016-05-01

    Full Text Available Isodon rubescens is an important medicinal plant in China that has been shown to reduce tumour growth due to the presence of the compound oridonin. In an effort to facilitate molecular research on oridonin biosynthesis, we report the use of next generation massively parallel sequencing technologies and de novo transcriptome assembly to gain a comprehensive overview of the I. rubescens transcriptome. In our study, a total of 50,934,276 clean reads, 101,640 transcripts and 44,626 unigenes were generated through de novo transcriptome assembly. A number of unigenes – 23,987, 10,263, 7359, 18,245, 17,683, 19,485, 9361 – were annotated in the National Center for Biotechnology Information (NCBI) non-redundant protein (Nr), NCBI nucleotide sequences (Nt), Kyoto Encyclopedia of Genes and Genomes (KEGG) Orthology (KO), Swiss-Prot, protein family (Pfam), gene ontology (GO) and eukaryotic ortholog groups (KOG) databases, respectively. Furthermore, the annotated unigenes were functionally classified according to the GO, KOG and KEGG. Based on these results, candidate genes encoding enzymes involved in terpenoid backbone biosynthesis were detected. Our data provide the most comprehensive sequence resource available for the study of I. rubescens, and demonstrate the effective use of Illumina sequencing and de novo transcriptome assembly on a species lacking genomic information.

  8. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of the transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available from public data resources. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is more closely related to physiological condition than to tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we first apply an improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (HK genes specific to the cancer group but not the normal group) are expressed at higher and more variable levels in the cancer condition than in the normal condition. Cancer-associated HK genes are preferentially AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in the cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867

  9. CLIP-seq analysis of multi-mapped reads discovers novel functional RNA regulatory sites in the human transcriptome.

    Science.gov (United States)

    Zhang, Zijun; Xing, Yi

    2017-09-19

    Crosslinking or RNA immunoprecipitation followed by sequencing (CLIP-seq or RIP-seq) allows transcriptome-wide discovery of RNA regulatory sites. As CLIP-seq/RIP-seq reads are short, existing computational tools focus on uniquely mapped reads, while reads mapped to multiple loci are discarded. We present CLAM (CLIP-seq Analysis of Multi-mapped reads). CLAM uses an expectation-maximization algorithm to assign multi-mapped reads and calls peaks combining uniquely and multi-mapped reads. To demonstrate the utility of CLAM, we applied it to a wide range of public CLIP-seq/RIP-seq datasets involving numerous splicing factors, microRNAs and m6A RNA methylation. CLAM recovered a large number of novel RNA regulatory sites inaccessible by uniquely mapped reads. The functional significance of these sites was demonstrated by consensus motif patterns and association with alternative splicing (splicing factors), transcript abundance (AGO2) and mRNA half-life (m6A). CLAM provides a useful tool to discover novel protein-RNA interactions and RNA modification sites from CLIP-seq and RIP-seq data, and reveals the significant contribution of repetitive elements to the RNA regulatory landscape of the human transcriptome. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
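    The idea of using expectation-maximization to distribute multi-mapped reads can be sketched on toy data. This is a generic illustration and not CLAM's actual model or code: the function name, the pseudocount used for initialization and the two-locus example are assumptions made for this sketch. Each multi-mapped read is split among its candidate loci in proportion to current locus abundance (E-step), and abundance is then recomputed from unique counts plus the fractional assignments (M-step).

```python
def em_assign_multireads(unique_counts, multi_reads, n_iter=50):
    """Fractionally assign multi-mapped reads to loci with a simple EM loop.

    unique_counts: dict mapping locus name to its uniquely mapped read count.
    multi_reads: list of tuples, each listing the candidate loci of one read
                 (every candidate must be a key of unique_counts).
    Returns (abundance, weights); weights[i] maps each candidate locus of
    read i to the fraction of that read assigned to it.
    """
    # Start from unique counts plus a pseudocount so no locus starts at zero.
    abundance = {locus: count + 1.0 for locus, count in unique_counts.items()}
    weights = []
    for _ in range(n_iter):
        # E-step: split each read in proportion to current locus abundance.
        weights = []
        for candidates in multi_reads:
            total = sum(abundance[c] for c in candidates)
            weights.append({c: abundance[c] / total for c in candidates})
        # M-step: abundance = unique reads + fractional multi-mapped reads.
        abundance = {locus: float(count) for locus, count in unique_counts.items()}
        for read_weights in weights:
            for locus, fraction in read_weights.items():
                abundance[locus] += fraction
    return abundance, weights


# Toy example (invented numbers): locus A has 90 unique reads, locus B has 10,
# and 10 further reads map equally well to both.
toy_abundance, toy_weights = em_assign_multireads(
    {"A": 90, "B": 10}, [("A", "B")] * 10
)
print(toy_abundance)   # converges towards A = 99, B = 11
print(toy_weights[0])  # each ambiguous read goes roughly 90% to A, 10% to B
```

    In this toy case the loop converges to the fixed point where each ambiguous read is split 0.9 to 0.1, which is why discarding multi-mapped reads would undercount both loci.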

  10. Transcriptome profiling of tobacco under water deficit conditions

    Directory of Open Access Journals (Sweden)

    Roel C. Rabara

    2015-09-01

    Full Text Available Drought is one of the limiting environmental factors that affect crop production. Understanding the molecular basis of how plants respond to this water deficit stress is key to developing drought tolerant crops. In this study we generated time course-based transcriptome profiles of tobacco plants under water deficit conditions using microarray technology. In this paper, we describe in detail the experimental procedures and analyses performed in our study. The data set we generated (available in the NCBI/GEO database under GSE67434 has been analysed to identify genes that are involved in the regulation of tobacco's responses to drought.

  11. De novo assembly of pen shell (Atrina pectinata) transcriptome and screening of its genic microsatellites

    Science.gov (United States)

    Sun, Xiujun; Li, Dongming; Liu, Zhihong; Zhou, Liqing; Wu, Biao; Yang, Aiguo

    2017-10-01

    The pen shell (Atrina pectinata) is a large wedge-shaped bivalve, which belongs to the family Pinnidae. Due to its large and nutritious adductor muscle, it is a popular seafood with high commercial value in Asia-Pacific countries. However, limited genomic and transcriptomic data have hampered its genetic investigation. In this study, the transcriptome of A. pectinata was deeply sequenced using Illumina paired-end sequencing technology. After assembly, a total of 127263 unigenes were obtained. Functional annotation indicated that the highest percentage of unigenes (18.60%) was annotated in the GO database, followed by 18.44% in the PFAM database and 17.04% in the NR database. There were 270 biological pathways matched with those in the KEGG database. Furthermore, a total of 23452 potential simple sequence repeats (SSRs) were identified, of which the most abundant type was mono-nucleotide repeats (12902, 55.01%), followed by di-nucleotide (8132, 34.68%), tri-nucleotide (2010, 8.57%), tetra-nucleotide (401, 1.71%), and penta-nucleotide (7, 0.03%) repeats. Sixty SSRs were selected for validating and developing genic SSR markers, of which 23 showed polymorphism in a cultured population with average observed and expected heterozygosities of 0.412 and 0.579, respectively. In this study, we established the first comprehensive transcript dataset of A. pectinata genes. Our results demonstrated that RNA-Seq is a fast and cost-effective method for genic SSR development in non-model species.
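    The SSR breakdown reported in this record can be checked with a few lines of arithmetic. The counts are taken from the abstract; the variable names are ours, and the polymorphic-marker percentage is a derived figure that the abstract does not state.

```python
# SSR counts by repeat unit length, as reported in the abstract.
ssr_counts = {
    "mono": 12902,
    "di": 8132,
    "tri": 2010,
    "tetra": 401,
    "penta": 7,
}

# The per-type counts should add up to the reported total of 23452 SSRs.
total_ssrs = sum(ssr_counts.values())

# Percentage of each repeat type, rounded to two decimals as in the abstract.
ssr_percentages = {
    kind: round(100 * count / total_ssrs, 2) for kind, count in ssr_counts.items()
}

# Share of tested SSR markers that were polymorphic (23 of 60); derived here.
polymorphic_pct = round(100 * 23 / 60, 2)

print(total_ssrs)
print(ssr_percentages)
print(polymorphic_pct)
```

    The recomputed values agree with the percentages printed in the abstract, and the five categories account for all 23452 reported SSRs.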

  12. A SNP-centric database for the investigation of the human genome

    Directory of Open Access Journals (Sweden)

    Kohane Isaac S

    2004-03-01

    Full Text Available Abstract Background Single Nucleotide Polymorphisms (SNPs) are an increasingly important tool for genetic and biomedical research. Although current genomic databases contain information on several million SNPs and are growing at a very fast rate, the true value of a SNP in this context is a function of the quality of the annotations that characterize it. Retrieving and analyzing such data for a large number of SNPs often represents a major bottleneck in the design of large-scale association studies. Description SNPper is a web-based application designed to facilitate the retrieval and use of human SNPs for high-throughput research purposes. It provides a rich local database generated by combining SNP data with the Human Genome sequence and with several other data sources, and offers the user a variety of querying, visualization and data export tools. In this paper we describe the structure and organization of the SNPper database, we review the available data export and visualization options, and we describe how the architecture of SNPper and its specialized data structures support high-volume SNP analysis. Conclusions The rich annotation database and the powerful data manipulation and presentation facilities it offers make SNPper a very useful online resource for SNP research. Its success proves the great need for integrated and interoperable resources in the field of computational biology, and shows how such systems may play a critical role in supporting the large-scale computational analysis of our genome.

  13. Transcriptome signature of the adult mouse choroid plexus

    Directory of Open Access Journals (Sweden)

    Marques Fernanda

    2011-01-01

    Full Text Available Abstract Background Although the gene expression profile of several tissues in humans and in rodent animal models has been explored, analysis of the complete choroid plexus (CP) transcriptome is still lacking. A better characterization of the CP transcriptome can provide key insights into its functions as one of the barriers that separate the brain from the periphery and in the production of cerebrospinal fluid. Methods This work extends further what is known about the mouse CP transcriptome through a microarray analysis of CP tissue from normal mice under physiological conditions. Results We found that the genes most highly expressed are those implicated in energy metabolism (oxidative phosphorylation, glycolysis/gluconeogenesis) and in ribosomal function, which is in agreement with the secretory nature of the CP. On the other hand, genes encoding for immune mediators are among those with lower expression in basal conditions. In addition, we found genes known to be relevant during brain development, and not previously identified to be expressed in the CP, including those encoding for various axonal guidance and angiogenesis molecules and for growth factors. Some of these are known to influence the neural stem cell niche in the subventricular zone, highlighting the involvement of the CP as a likely modulator of neurogenesis. Interestingly, our observations confirm that the CP transcriptome is unique, displaying low homology with that of other tissues. Of note, we describe here that the closest similarity is with the transcriptome of the endothelial cells of the blood-brain barrier. Conclusions Based on the data presented here, it will now be possible to further explore the function of particular proteins of the CP secretome in health and in disease.

  14. Global analysis of transcriptome responses and gene expression profiles to cold stress of Jatropha curcas L.

    Directory of Open Access Journals (Sweden)

    Haibo Wang

    Full Text Available BACKGROUND: Jatropha curcas L., also called the Physic nut, is an oil-rich shrub with multiple uses, including biodiesel production, and is currently exploited as a renewable energy resource in many countries. Nevertheless, because of its origin from the tropical MidAmerican zone, J. curcas confers an inherent but undesirable characteristic (low cold resistance) that may seriously restrict its large-scale popularization. This adaptive flaw can be genetically improved by elucidating the mechanisms underlying plant tolerance to cold temperatures. The newly developed Illumina Hiseq™ 2000 RNA-seq and Digital Gene Expression (DGE) are deep high-throughput approaches for gene expression analysis at the transcriptome level, using which we carefully investigated the gene expression profiles in response to cold stress to gain insight into the molecular mechanisms of cold response in J. curcas. RESULTS: In total, 45,251 unigenes were obtained by assembly of clean data generated by RNA-seq analysis of the J. curcas transcriptome. A total of 33,363 and 912 complete or partial coding sequences (CDSs) were determined by protein database alignments and ESTScan prediction, respectively. Among these unigenes, more than 41.52% were involved in approximately 128 known metabolic or signaling pathways, and 4,185 were possibly associated with cold resistance. DGE analysis was used to assess the changes in gene expression when exposed to cold condition (12°C) for 12, 24, and 48 h. The results showed that 3,178 genes were significantly upregulated and 1,244 were downregulated under cold stress. These genes were then functionally annotated based on the transcriptome data from RNA-seq analysis. CONCLUSIONS: This study provides a global view of transcriptome response and gene expression profiling of J. curcas in response to cold stress. The results can help improve our current understanding of the mechanisms underlying plant cold resistance and favor the screening of

  15. Transcriptomic analysis reveals key genes related to betalain biosynthesis in pulp coloration of Hylocereus polyrhizus

    Directory of Open Access Journals (Sweden)

    Qingzhu Hua

    2016-01-01

    Full Text Available Betalains have high nutritional value and bioactivities. Red pulp pitaya (Hylocereus polyrhizus) is the only fruit containing abundant betalains for consumers. However, no information is available about genes involved in betalain biosynthesis in H. polyrhizus. Herein, two cDNA libraries of pitaya pulps at two different coloration stages (white and red pulp stages) of Guanhuahong (H. polyrhizus) were constructed. A total of about 12 Gb of raw RNA-Seq data was generated and de novo assembled into 122,677 transcripts with an average length of 1,183 bp and an N50 value of 2008. Approximately 99.99% of all transcripts were annotated based on seven public databases. A total of 8,871 transcripts were significantly regulated. Thirty-three candidate transcripts related to betalain biosynthesis were obtained from the transcriptome data. Transcripts encoding enzymes involved in betalain biosynthesis were analyzed using RT-qPCR across the whole pulp coloration stages of H. polyrhizus (7-1) and H. undatus (132-4). Nine key transcripts of betalain biosynthesis were identified. They were assigned to four kinds of genes in the betalain biosynthetic pathway, including tyrosinase, 4,5-DOPA dioxygenase extradiol, cytochrome P450 and glucosyltransferase. Ultimately, a preliminary betalain biosynthetic pathway for pitaya was proposed based on betalain analyses and gene expression profiles.

  16. A comprehensive two-dimensional gel protein database of noncultured unfractionated normal human epidermal keratinocytes: towards an integrated approach to the study of cell proliferation, differentiation and skin diseases

    DEFF Research Database (Denmark)

    Celis, J E; Madsen, Peder; Rasmussen, H H

    1991-01-01

    A two-dimensional (2-D) gel database of cellular proteins from noncultured, unfractionated normal human epidermal keratinocytes has been established. A total of 2651 [35S]methionine-labeled cellular proteins (1868 isoelectric focusing, 783 nonequilibrium pH gradient electrophoresis) were resolved...

  17. High Quality Unigenes and Microsatellite Markers from Tissue Specific Transcriptome and Development of a Database in Clusterbean (Cyamopsis tetragonoloba, L. Taub)

    Directory of Open Access Journals (Sweden)

    Hukam C. Rawal

    2017-11-01

    Full Text Available Clusterbean (Cyamopsis tetragonoloba L. Taub) is an important industrial, vegetable and forage crop. This crop owes its commercial importance to the presence of guar gum (galactomannans) in its endosperm, which is used as a lubricant in a range of industries. Despite its relevance to agriculture and industry, genomic resources available for this crop are limited. Therefore, the present study was undertaken to generate an RNA-Seq based transcriptome from leaf, shoot, and flower tissues. A total of 145 million high quality Illumina reads were assembled using Trinity into 127,706 transcripts and 48,007 non-redundant high quality (HQ) unigenes. We annotated 79% of the unigenes against Plant Genes from the National Center for Biotechnology Information (NCBI), Swiss-Prot, Pfam, gene ontology (GO) and KEGG databases. Among the annotated unigenes, 30,020 were assigned 116,964 GO terms, 9984 were assigned EC numbers and 6111 were assigned to 137 KEGG pathways. At different fragments per kilobase of transcript per million fragments sequenced (FPKM) levels, genes were found to be expressed higher in flower tissue, followed by shoot and leaf. Additionally, we identified 8687 potential simple sequence repeats (SSRs) with an average frequency of one SSR per 8.75 kb. A total of 28 amplified SSRs in 21 clusterbean genotypes resulted in polymorphism in 13 markers with an average polymorphic information content (PIC) of 0.21. We also constructed a database named ‘ClustergeneDB’ for easy retrieval of unigenes and the microsatellite markers. The tissue specific genes identified and the molecular marker resources developed in this study are expected to aid in genetic improvement of clusterbean for its end use.

  18. Deep Sequencing Reveals Uncharted Isoform Heterogeneity of the Protein-Coding Transcriptome in Cerebral Ischemia.

    Science.gov (United States)

    Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh

    2018-06-03

    Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.

  19. Database Description - SAHG | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Full Text Available base Description General information of database Database name SAHG Alternative nam...h: Contact address Chie Motono Tel : +81-3-3599-8067 E-mail : Database classification Structure Databases - ...e databases - Protein properties Organism Taxonomy Name: Homo sapiens Taxonomy ID: 9606 Database description... Links: Original website information Database maintenance site The Molecular Profiling Research Center for D...stration Not available About This Database Database Description Download License Update History of This Database Site Policy | Contact Us Database Description - SAHG | LSDB Archive ...

  20. Use of SQL Databases to Support Human Resource Management

    OpenAIRE

    Zeman, Jan

    2011-01-01

    This bachelor's thesis focuses on the design of an SQL database to support human resource management and its subsequent creation in MS SQL Server. A

  1. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

    Science.gov (United States)

    Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.

    2001-01-01

    Open reading frame expressed sequence tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022

  2. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

    Science.gov (United States)

    Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M

    2001-10-09

    Open reading frame expressed sequence tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.

  3. Combining next-generation sequencing and online databases for microsatellite development in non-model organisms.

    Science.gov (United States)

    Rico, Ciro; Normandeau, Eric; Dion-Côté, Anne-Marie; Rico, María Inés; Côté, Guillaume; Bernatchez, Louis

    2013-12-03

    Next-generation sequencing (NGS) is revolutionising marker development, and the rapidly increasing number of transcriptomes published across a wide variety of taxa is providing valuable sequence databases for the identification of genetic markers without the need to generate new sequences. Microsatellites are still the most important source of polymorphic markers in ecology and evolution. Motivated by our long-term interest in the adaptive radiation of a non-model species complex of whitefishes (Coregonus spp.), in this study we focus on microsatellite characterisation and multiplex optimisation using transcriptome sequences generated by Illumina® and Roche-454, as well as online databases of Expressed Sequence Tags (EST), for the study of whitefish evolution and demographic history. We identified and optimised 40 polymorphic loci in multiplex PCR reactions and validated the robustness of our analyses by testing several population genetics and phylogeographic predictions using 494 fish from five lakes and two distinct ecotypes.

  4. Sequencing and Characterization of the Invasive Sycamore Lace Bug Corythucha ciliata (Hemiptera: Tingidae) Transcriptome

    Science.gov (United States)

    Qu, Cheng; Fu, Ningning; Xu, Yihua

    2016-01-01

    The sycamore lace bug, Corythucha ciliata (Hemiptera: Tingidae), is an invasive forestry pest rapidly expanding in many countries. This pest poses a considerable threat to the urban forestry ecosystem, especially to Platanus spp. However, its molecular biology and biochemistry are poorly understood. This study reports the first C. ciliata transcriptome, encompassing three different life stages (nymphs, adult females (AF) and adult males (AM)). In total, 26.53 GB of clean data and 60,879 unigenes were obtained from three RNA-seq libraries. These unigenes were annotated and classified by Nr (NCBI non-redundant protein sequences), Nt (NCBI non-redundant nucleotide sequences), Pfam (Protein family), KOG/COG (Clusters of Orthologous Groups of proteins), Swiss-Prot (a manually annotated and reviewed protein sequence database), and KO (KEGG Ortholog database). After all pairwise comparisons between these three different samples, a large number of differentially expressed genes were revealed. Dramatic differences in global gene expression profiles were found between distinct life stages (nymphs and AF, nymphs and AM) and between sexes (AF and AM), with some of the significantly differentially expressed genes (DEGs) being related to metamorphosis, digestion, immunity and sex differences. The differential expression of unigenes was validated through quantitative Real-Time PCR (qRT-PCR) for 16 randomly selected unigenes. In addition, 17,462 potential simple sequence repeat molecular markers were identified in these transcriptome resources. This comprehensive C. ciliata transcriptomic information can be utilized to promote the development of environmentally friendly methodologies to disrupt the processes of metamorphosis, digestion, immunity and sex differentiation. PMID:27494615

  5. Transcriptome sequence analysis of an ornamental plant, Ananas comosus var. bracteatus, revealed the potential unigenes involved in terpenoid and phenylpropanoid biosynthesis.

    Science.gov (United States)

    Ma, Jun; Kanakala, S; He, Yehua; Zhang, Junli; Zhong, Xiaolan

    2015-01-01

    Ananas comosus var. bracteatus (Red Pineapple) is an important ornamental plant for its colorful leaves and decorative red fruits. Because of its complex genome, it is difficult to understand the molecular mechanisms involved in its growth and development. Thus high-throughput transcriptome sequencing of Ananas comosus var. bracteatus is necessary to generate large quantities of transcript sequences for the purpose of gene discovery and functional genomic studies. The Ananas comosus var. bracteatus transcriptome was sequenced by the Illumina paired-end sequencing technology. We obtained a total of 23.5 million high quality sequencing reads, 1,555,808 contigs and 41,052 unigenes. Of the total 41,052 unigenes of Ananas comosus var. bracteatus, 23,275 unigenes were annotated in the NCBI non-redundant protein database and 23,134 unigenes were annotated in the Swiss-Prot database. Out of these, 17,748 and 8,505 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes Pathway database identified 5,825 unigenes which were mapped to 117 pathways. The assembly predicted many unigenes that were previously unknown. The annotated unigenes were compared against pineapple, rice, maize, Arabidopsis, and sorghum. Unigenes that did not match any of those five sequence datasets are considered to be unique to Ananas comosus var. bracteatus. We predicted unigenes encoding enzymes involved in terpenoid and phenylpropanoid biosynthesis. The sequence data provide the most comprehensive transcriptomic resource currently available for Ananas comosus var. bracteatus. To our knowledge, this is the first report on the de novo transcriptome sequencing of Ananas comosus var. bracteatus. Unigenes obtained in this study may help improve future gene expression, genetic and genomics studies in Ananas comosus var. bracteatus.

  6. Transcriptome sequence analysis of an ornamental plant, Ananas comosus var. bracteatus, revealed the potential unigenes involved in terpenoid and phenylpropanoid biosynthesis.

    Directory of Open Access Journals (Sweden)

    Jun Ma

    Full Text Available Ananas comosus var. bracteatus (Red Pineapple) is an important ornamental plant for its colorful leaves and decorative red fruits. Because of its complex genome, it is difficult to understand the molecular mechanisms involved in its growth and development. Thus high-throughput transcriptome sequencing of Ananas comosus var. bracteatus is necessary to generate large quantities of transcript sequences for the purpose of gene discovery and functional genomic studies. The Ananas comosus var. bracteatus transcriptome was sequenced by the Illumina paired-end sequencing technology. We obtained a total of 23.5 million high quality sequencing reads, 1,555,808 contigs and 41,052 unigenes. Of the total 41,052 unigenes of Ananas comosus var. bracteatus, 23,275 unigenes were annotated in the NCBI non-redundant protein database and 23,134 unigenes were annotated in the Swiss-Prot database. Out of these, 17,748 and 8,505 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. Functional annotation against the Kyoto Encyclopedia of Genes and Genomes Pathway database identified 5,825 unigenes which were mapped to 117 pathways. The assembly predicted many unigenes that were previously unknown. The annotated unigenes were compared against pineapple, rice, maize, Arabidopsis, and sorghum. Unigenes that did not match any of those five sequence datasets are considered to be unique to Ananas comosus var. bracteatus. We predicted unigenes encoding enzymes involved in terpenoid and phenylpropanoid biosynthesis. The sequence data provide the most comprehensive transcriptomic resource currently available for Ananas comosus var. bracteatus. To our knowledge, this is the first report on the de novo transcriptome sequencing of Ananas comosus var. bracteatus. Unigenes obtained in this study may help improve future gene expression, genetic and genomics studies in Ananas comosus var. bracteatus.

  7. In Silico Functional Networks Identified in Fish Nucleated Red Blood Cells by Means of Transcriptomic and Proteomic Profiling

    Directory of Open Access Journals (Sweden)

    Sara Puente-Marin

    2018-04-01

    Full Text Available Nucleated red blood cells (RBCs) of fish have, in the last decade, been implicated in several immune-related functions, such as antiviral response, phagocytosis or cytokine-mediated signaling. RNA-sequencing (RNA-seq) and label-free shotgun proteomic analyses were carried out for in silico functional pathway profiling of rainbow trout RBCs. For RNA-seq, a de novo assembly was conducted in order to create a transcriptome database for RBCs. For proteome profiling, we developed a proteomic method that combined: (a) fractionation into cytosolic and membrane fractions, (b) hemoglobin removal from the cytosolic fraction, (c) protein digestion, and (d) a novel step with pH reversed-phase peptide fractionation and final Liquid Chromatography Electrospray Ionization Tandem Mass Spectrometric (LC ESI-MS/MS) analysis of each fraction. Combined transcriptome- and proteome-sequencing data identified, in silico, novel and striking immune functional networks for rainbow trout nucleated RBCs, which are mainly linked to innate and adaptive immunity. Functional pathways related to regulation of hematopoietic cell differentiation, antigen presentation via major histocompatibility complex class II (MHCII), leukocyte differentiation and regulation of leukocyte activation were identified. These preliminary findings further implicate nucleated RBCs in immune function, such as antigen presentation and leukocyte activation.

  8. Identification of Genes Relevant to Pesticides and Biology from Global Transcriptome Data of Monochamus alternatus Hope (Coleoptera: Cerambycidae) Larvae.

    Directory of Open Access Journals (Sweden)

    Songqing Wu

    Full Text Available Monochamus alternatus Hope is the main vector in China of the Pine Wilt Disease caused by the pine wood nematode Bursaphelenchus xylophilus. Although chemical control is traditionally used to prevent pine wilt disease, new strategies based on biological control are promising ways to manage the disease. However, there is no deep sequence analysis of Monochamus alternatus Hope that describes the transcriptome, and no information is available about gene function in this insect vector. We used next generation sequencing technology to sequence the whole fourth instar larva transcriptome of Monochamus alternatus Hope and successfully built a Monochamus alternatus Hope transcriptome database. In total, 105,612 unigenes were assigned Gene Ontology (GO) terms, information for 16,730 classified unigenes was obtained in the Clusters of Orthologous Groups (COGs) database, and 13,024 unigenes matched with 224 predicted pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG). In addition, putative insecticide resistance-related genes and genes related to RNAi, the Bt receptor, intestinal digestive enzymes, possible future insect control targets and immune-related molecules are described. This study provides valuable basic information that can be used as a gateway to develop new molecular tools for Monochamus alternatus Hope control strategies.

  9. A human friendly reporting and database system for brain PET analysis

    International Nuclear Information System (INIS)

    Jamzad, M.; Ishii, Kenji; Toyama, Hinako; Senda, Michio

    1996-01-01

    We have developed a human friendly reporting and database system for clinical brain PET (Positron Emission Tomography) scans, which enables statistical data analysis on qualitative information obtained from image interpretation. Our system consists of a Brain PET Data (Input) Tool and a Report Writing Tool. In the Brain PET Data Tool, findings and interpretations are input by selecting menu icons in a window panel instead of writing free text. This method of input enables on-line data entry into and update of the database by means of pre-defined consistent words, which facilitates statistical data analysis. The Report Writing Tool generates a one-page report of natural English sentences semi-automatically by using the above input information and the patient information obtained from our PET center's main database. It also has a function for selecting keywords from the report text so that we can save a set of keywords in the database for further analysis. By means of this system, we can store the data related to patient information and visual interpretation of the PET examination while writing clinical reports in daily work. The database files in our system can be accessed by means of commercially available databases. We have used the 4th Dimension database that runs on a Macintosh computer and analyzed 95 cases of 18F-FDG brain PET studies. The results showed high specificity of parietal hypometabolism for Alzheimer's patients. (author)

  10. Web services for transcriptomics

    NARCIS (Netherlands)

    Neerincx, P.

    2009-01-01

    Transcriptomics is part of a family of disciplines focussing on high throughput molecular biology experiments. In the case of transcriptomics, scientists study the expression of genes resulting in transcripts. These transcripts can either perform a biological function themselves or function as

  11. Database Description - DGBY | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Full Text Available base Description General information of database Database name DGBY Alternative name Database...EL: +81-29-838-8066 E-mail: Database classification Microarray Data and other Gene Expression Databases Orga...nism Taxonomy Name: Saccharomyces cerevisiae Taxonomy ID: 4932 Database descripti...-called phenomics). We uploaded these data on this website which is designated DGBY(Database for Gene expres...ma J, Ando A, Takagi H. Journal: Yeast. 2008 Mar;25(3):179-90. External Links: Original website information Database

  12. Preliminary analysis of Psoroptes ovis transcriptome in different developmental stages

    Directory of Open Access Journals (Sweden)

    Man-Li He

    2016-11-01

    Full Text Available Abstract Background: Psoroptic mange is a chronic, refractory, contagious and infectious disease mainly caused by the mange mite Psoroptes ovis, which can infect horses, sheep, buffaloes, rabbits, other domestic animals, deer, wild camels, foxes, minks, lemurs, alpacas, elks and other wild animals. Features of the disease include intense pruritus and dermatitis, depilation and hyperkeratosis, which ultimately result in emaciation or death caused by secondary bacterial infections. The infestation is usually transmitted by close contact between animals. Psoroptic mange is widespread in the world. In this paper, the transcriptome of P. ovis is described following sequencing and analysis of transcripts from samples of larvae (i.e. the Pso_L group) and nymphs and adults (i.e. the Pso_N_A group). The study describes differentially expressed genes (DEGs) and genes encoding allergens, which help in understanding the biology of P. ovis and lay foundations for the development of vaccine antigens and drug target screening. Methods: The transcriptome of P. ovis was assembled and analyzed using bioinformatic tools. The unigenes of P. ovis from each developmental stage and the unigenes differentially expressed between developmental stages were compared with allergen protein sequences contained in the allergen database website to predict potential allergens. Results: We identified 38,836 unigenes, whose mean length was 825 bp. On the basis of sequence similarity with seven databases, a total of 17,366 unigenes were annotated. A total of 1,316 DEGs were identified, including 496 upregulated and 820 downregulated in the Pso_L group compared with the Pso_N_A group. We predicted 205 allergen genes in the two developmental stages similar to genes from other mites and ticks; of these, 14 were among the upregulated DEGs and 26 among the downregulated DEGs. Conclusion: This study provides a reference transcriptome of P. ovis in the absence of a reference genome. The analysis of DEGs and

  13. Differential Transcriptome Analysis between Paulownia fortunei and Its Synthesized Autopolyploid

    Directory of Open Access Journals (Sweden)

    Xiaoshen Zhang

    2014-03-01

    Full Text Available Paulownia fortunei is an ecologically and economically important tree species that is widely used as timber and chemical pulp. Its autotetraploid, which carries a number of valuable traits, was successfully induced with colchicine. To identify differences in gene expression between P. fortunei and its synthesized autotetraploid, we performed transcriptome sequencing using an Illumina Genome Analyzer IIx (GAIIx). About 94.8 million reads were generated and assembled into 383,056 transcripts, including 18,984 transcripts with a complete open reading frame. A Basic Local Alignment Search Tool (BLAST) search indicated that 16,004 complete transcripts had significant hits in the National Center for Biotechnology Information (NCBI) non-redundant database. The complete transcripts were given functional assignments using three public protein databases. A total of 1,158 differentially expressed complete transcripts were screened through a digital abundance analysis, including transcripts involved in energy metabolism and epigenetic regulation. Finally, the expression levels of several transcripts were confirmed by quantitative real-time PCR. Our results suggested that polyploidization caused epigenetic-related changes, which subsequently resulted in gene expression variation between diploid and autotetraploid P. fortunei. This might be the main mechanism affected by the polyploidization. Our results represent an extensive survey of the P. fortunei transcriptome and will facilitate subsequent functional genomics research in P. fortunei. Moreover, the gene expression profiles of P. fortunei and its autopolyploid will provide a valuable resource for the study of polyploidization.

  14. DenHunt - A Comprehensive Database of the Intricate Network of Dengue-Human Interactions.

    Directory of Open Access Journals (Sweden)

    Prashanthi Karyala

    2016-09-01

    Dengue virus (DENV) is a human pathogen and its etiology has been widely established. There are many interactions between DENV and human proteins that have been reported in literature. However, no publicly accessible resource for efficiently retrieving the information is yet available. In this study, we mined all publicly available dengue-human interactions that have been reported in the literature into a database called DenHunt. We retrieved 682 direct interactions of human proteins with dengue viral components, 382 indirect interactions and 4120 differentially expressed human genes in dengue infected cell lines and patients. We have illustrated the importance of DenHunt by mapping the dengue-human interactions on to the host interactome and observed that the virus targets multiple host functional complexes of important cellular processes such as metabolism, immune system and signaling pathways suggesting a potential role of these interactions in viral pathogenesis. We also observed that 7 percent of the dengue virus interacting human proteins are also associated with other infectious and non-infectious diseases. Finally, the understanding that comes from such analyses could be used to design better strategies to counteract the diseases caused by dengue virus. The whole dataset has been catalogued in a searchable database, called DenHunt (http://proline.biochem.iisc.ernet.in/DenHunt/).

  15. DenHunt - A Comprehensive Database of the Intricate Network of Dengue-Human Interactions.

    Science.gov (United States)

    Karyala, Prashanthi; Metri, Rahul; Bathula, Christopher; Yelamanchi, Syam K; Sahoo, Lipika; Arjunan, Selvam; Sastri, Narayan P; Chandra, Nagasuma

    2016-09-01

    Dengue virus (DENV) is a human pathogen and its etiology has been widely established. There are many interactions between DENV and human proteins that have been reported in literature. However, no publicly accessible resource for efficiently retrieving the information is yet available. In this study, we mined all publicly available dengue-human interactions that have been reported in the literature into a database called DenHunt. We retrieved 682 direct interactions of human proteins with dengue viral components, 382 indirect interactions and 4120 differentially expressed human genes in dengue infected cell lines and patients. We have illustrated the importance of DenHunt by mapping the dengue-human interactions on to the host interactome and observed that the virus targets multiple host functional complexes of important cellular processes such as metabolism, immune system and signaling pathways suggesting a potential role of these interactions in viral pathogenesis. We also observed that 7 percent of the dengue virus interacting human proteins are also associated with other infectious and non-infectious diseases. Finally, the understanding that comes from such analyses could be used to design better strategies to counteract the diseases caused by dengue virus. The whole dataset has been catalogued in a searchable database, called DenHunt (http://proline.biochem.iisc.ernet.in/DenHunt/).

  16. The draft genome and transcriptome of Cannabis sativa.

    Science.gov (United States)

    van Bakel, Harm; Stout, Jake M; Cote, Atina G; Tallon, Carling M; Sharpe, Andrew G; Hughes, Timothy R; Page, Jonathan E

    2011-10-20

    Cannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored. We sequenced genomic DNA and RNA from the marijuana strain Purple Kush using short-read approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp. The availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics.

  17. CyanOmics: an integrated database of omics for the model cyanobacterium Synechococcus sp. PCC 7002.

    Science.gov (United States)

    Yang, Yaohua; Feng, Jie; Li, Tao; Ge, Feng; Zhao, Jindong

    2015-01-01

    Cyanobacteria are an important group of organisms that carry out oxygenic photosynthesis and play vital roles in both the carbon and nitrogen cycles of the Earth. The annotated genome of Synechococcus sp. PCC 7002, as an ideal model cyanobacterium, is available. A series of transcriptomic and proteomic studies of Synechococcus sp. PCC 7002 cells grown under different conditions have been reported. However, no database of such integrated omics studies has been constructed. Here we present CyanOmics, a database based on the results of Synechococcus sp. PCC 7002 omics studies. CyanOmics comprises one genomic dataset, 29 transcriptomic datasets and one proteomic dataset and should prove useful for systematic and comprehensive analysis of all those data. Powerful browsing and searching tools are integrated to help users directly access information of interest with enhanced visualization of the analytical results. Furthermore, Blast is included for sequence-based similarity searching, and Cluster 3.0 and the R hclust function are provided for cluster analyses, to increase CyanOmics's usefulness. To the best of our knowledge, it is the first integrated omics analysis database for cyanobacteria. This database should further understanding of the transcriptional patterns and proteomic profiling of Synechococcus sp. PCC 7002 and other cyanobacteria. Additionally, the entire database framework is applicable to any sequenced prokaryotic genome and could be applied to other integrated omics analysis projects. Database URL: http://lag.ihb.ac.cn/cyanomics. © The Author(s) 2015. Published by Oxford University Press.

  18. Transcriptome response to copper heavy metal stress in hard-shelled mussel (Mytilus coruscus)

    Directory of Open Access Journals (Sweden)

    Meiying Xu

    2016-03-01

    The hard-shelled mussel (Mytilus coruscus) is considered one of the most economically important marine shellfish worldwide and has long been regarded as a good invertebrate model for ecotoxicity studies. In the present study, we used Illumina sequencing technology (HiSeq2000) to sequence, assemble and annotate the transcriptome of hard-shelled mussels challenged with copper pollution. A total of 21,723,913 paired-end clean reads (NCBI SRA database SRX1411195) were generated from the HiSeq2000 sequencer and 96,403 contigs (with N50 = 1118 bp) were obtained after de novo assembly with Trinity software. Digital gene expression analysis revealed that 1156 unigenes were upregulated and 1681 unigenes were downregulated when challenged with copper. By KEGG pathway enrichment analysis, we found that unigenes in four KEGG pathways (aminoacyl-tRNA biosynthesis, apoptosis, DNA replication and mismatch repair) were significantly differentially expressed between the control and copper-treated groups. We hope that the gill transcriptome of copper-treated hard-shelled mussels can provide useful information for understanding how the mussel copes with heavy metal stress at the molecular level. Keywords: Hard-shelled mussel, Heavy metal, Transcriptome, Ecotoxicity
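
    The N50 figure quoted for the assembly is a standard contiguity statistic: the length of the contig at which the running total of contig lengths, sorted from longest to shortest, first reaches half of the total assembled length. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name and the toy contig lengths are invented for illustration.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (in bp)."""
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)   # longest contig first
    half_total = sum(ordered) / 2             # half of the assembled bases
    running = 0
    for length in ordered:
        running += length
        # The first contig that pushes the running sum to >= 50% is the N50.
        if running >= half_total:
            return length


# Toy example with invented contig lengths (not data from the study).
toy_contigs = [2000, 1500, 1118, 900, 400, 300]
print(n50(toy_contigs))
```

    Because N50 depends only on the length distribution, it says nothing about whether the contigs are correct; it is usually reported alongside annotation or mapping statistics.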

  19. De novo transcriptomic analysis of cowpea (Vigna unguiculata L. Walp.) for genic SSR marker development.

    Science.gov (United States)

    Chen, Honglin; Wang, Lixia; Liu, Xiaoyan; Hu, Liangliang; Wang, Suhua; Cheng, Xuzhen

    2017-07-11

    Cowpea [Vigna unguiculata (L.) Walp.] is one of the most important legumes in tropical and semi-arid regions. However, there is relatively little genomic information available for genetic research on and breeding of cowpea. The objectives of this study were to analyse the cowpea transcriptome and develop genic molecular markers for future genetic studies of this genus. Approximately 54 million high-quality cDNA sequence reads were obtained from cowpea based on Illumina paired-end sequencing technology and were de novo assembled to generate 47,899 unigenes with an N50 length of 1534 bp. Sequence similarity analysis revealed 36,289 unigenes (75.8%) with significant similarity to known proteins in the non-redundant (Nr) protein database, 23,471 unigenes (49.0%) with BLAST hits in the Swiss-Prot database, and 20,654 unigenes (43.1%) with high similarity in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Further analysis identified 5560 simple sequence repeats (SSRs) as potential genic molecular markers. Validating a random set of 500 SSR markers yielded 54 polymorphic markers among 32 cowpea accessions. This transcriptomic analysis of cowpea provided a valuable set of genomic data for characterizing genes with important agronomic traits in Vigna unguiculata and a new set of genic SSR markers for further genetic studies and breeding in cowpea and related Vigna species.
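
    The abstract does not say which tool or thresholds were used to identify the SSRs. As a rough illustration of how perfect microsatellites can be located in a transcript sequence, the sketch below uses a regular expression with a back-reference. The minimum repeat counts in MIN_REPEATS, the function name, and the test sequence are all assumptions made for this example, not the criteria applied in the study.

```python
import re

# Hypothetical minimum repeat counts per motif length (1-6 bp);
# the study's actual thresholds are not given in the abstract.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in MIN_REPEATS.items():
        # Group 2 captures the motif; \2{n,} requires n further copies.
        pattern = re.compile(
            r"(([ACGT]{%d})\2{%d,})" % (motif_len, min_rep - 1)
        )
        for match in pattern.finditer(seq):
            motif = match.group(2)
            # Skip motifs that are just a shorter unit repeated (e.g. "AGAG").
            if any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(1)) // motif_len))
    return sorted(hits)


# Invented sequence: an (AG)7 repeat and a (CTT)5 repeat with short flanks.
example = "GGC" + "AG" * 7 + "TTC" + "CTT" * 5 + "GA"
for start, motif, count in find_ssrs(example):
    print(start, motif, count)
```

    Only perfect repeats are reported here; compound or interrupted SSRs, which dedicated tools typically handle, are outside the scope of this sketch.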

  20. De novo Transcriptome Assembly of Common Wild Rice (Oryza rufipogon Griff.) and Discovery of Drought-Response Genes in Root Tissue Based on Transcriptomic Data.

    Science.gov (United States)

    Tian, Xin-Jie; Long, Yan; Wang, Jiao; Zhang, Jing-Wen; Wang, Yan-Yan; Li, Wei-Min; Peng, Yu-Fa; Yuan, Qian-Hua; Pei, Xin-Wu

    2015-01-01

    The perennial O. rufipogon (common wild rice), which is considered to be the ancestor of Asian cultivated rice species, contains many useful genetic resources, including drought resistance genes. However, few studies have identified the drought resistance and tissue-specific genes in common wild rice. In this study, transcriptome sequencing libraries were constructed, including drought-treated roots (DR) and control leaves (CL) and roots (CR). Using Illumina sequencing technology, we generated 16.75 million bases of high-quality sequence data for common wild rice and conducted de novo assembly and annotation of genes without prior genome information. These reads were assembled into 119,332 unigenes with an average length of 715 bp. A total of 88,813 distinct sequences (74.42% of unigenes) significantly matched known genes in the NCBI NT database. Differentially expressed gene (DEG) analysis showed that 3617 genes were up-regulated and 4171 genes were down-regulated in the CR library compared with the CL library. Among the DEGs, 535 genes were expressed in roots but not in shoots. A similar comparison between the DR and CR libraries showed that 1393 genes were up-regulated and 315 genes were down-regulated in the DR library compared with the CR library. Finally, 37 genes that were specifically expressed in roots were screened after comparing the DEGs identified in the above-described analyses. This study provides a transcriptome sequence resource for common wild rice plants and establishes a digital gene expression profile of wild rice plants under drought conditions using the assembled transcriptome data as a reference. Several tissue-specific and drought-stress-related candidate genes were identified, representing a fully characterized transcriptome and providing a valuable resource for genetic and genomic studies in plants.

  1. Probing the evolution, ecology and physiology of marine protists using transcriptomics.

    Science.gov (United States)

    Caron, David A; Alexander, Harriet; Allen, Andrew E; Archibald, John M; Armbrust, E Virginia; Bachy, Charles; Bell, Callum J; Bharti, Arvind; Dyhrman, Sonya T; Guida, Stephanie M; Heidelberg, Karla B; Kaye, Jonathan Z; Metzner, Julia; Smith, Sarah R; Worden, Alexandra Z

    2017-01-01

    Protists, which are single-celled eukaryotes, critically influence the ecology and chemistry of marine ecosystems, but genome-based studies of these organisms have lagged behind those of other microorganisms. However, recent transcriptomic studies of cultured species, complemented by meta-omics analyses of natural communities, have increased the amount of genetic information available for poorly represented branches on the tree of eukaryotic life. This information is providing insights into the adaptations and interactions between protists and other microorganisms and macroorganisms, but many of the genes sequenced show no similarity to sequences currently available in public databases. A better understanding of these newly discovered genes will lead to a deeper appreciation of the functional diversity and metabolic processes in the ocean. In this Review, we summarize recent developments in our understanding of the ecology, physiology and evolution of protists, derived from transcriptomic studies of cultured strains and natural communities, and discuss how these novel large-scale genetic datasets will be used in the future.

  2. High probability of avian influenza virus (H7N7) transmission from poultry to humans active in disease control on infected farms

    NARCIS (Netherlands)

    M.E.H. Bos (Marian); D.E. te Beest (Dennis); M. van Boven (Michiel); M.R.D.R.B. van Holle; A. Meijer (Adam); A. Bosman (Arnold); Y.M. Mulder (Yonne); M.P.G. Koopmans D.V.M. (Marion); A. Stegeman (Arjan)

    2010-01-01

    An epizootic of avian influenza (H7N7) caused a large number of human infections in The Netherlands in 2003. We used data from this epizootic to estimate infection probabilities for persons involved in disease control on infected farms. Analyses were based on databases containing

  3. Transcriptomic response of the Antarctic pteropod Limacina helicina antarctica to ocean acidification.

    Science.gov (United States)

    Johnson, Kevin M; Hofmann, Gretchen E

    2017-10-23

    Ocean acidification (OA), a change in ocean chemistry due to the absorption of atmospheric CO2 into surface oceans, challenges biogenic calcification in many marine organisms. Ocean acidification is expected to rapidly progress in polar seas, with regions of the Southern Ocean expected to experience severe OA within decades. Biologically, the consequences of OA challenge calcification processes and impose an energetic cost. In order to better characterize the response of a polar calcifier to conditions of OA, we assessed differential gene expression in the Antarctic pteropod, Limacina helicina antarctica. Experimental levels of pCO2 were chosen to create both contemporary pH conditions, and to mimic future pH expected in OA scenarios. Significant changes in the transcriptome were observed when juvenile L. h. antarctica were acclimated for 21 days to low-pH (7.71), mid-pH (7.9) or high-pH (8.13) conditions. Differential gene expression analysis of individuals maintained in the low-pH treatment identified down-regulation of genes involved in cytoskeletal structure, lipid transport, and metabolism. High pH exposure led to increased expression and enrichment for genes involved in shell formation, calcium ion binding, and DNA binding. Significant differential gene expression was observed in four major cellular and physiological processes: shell formation, the cellular stress response, metabolism, and neural function. Across these functional groups, exposure to conditions that mimic ocean acidification led to rapid suppression of gene expression. Results of this study demonstrated that the transcriptome of the juvenile pteropod, L. h. antarctica, was dynamic and changed in response to different levels of pCO2. In a global change context, exposure of L. h. antarctica to the low pH, high pCO2 OA conditions resulted in a suppression of transcripts for genes involved in key physiological processes: calcification, metabolism, and the cellular stress response. The

  4. Flower bud transcriptome analysis of Sapium sebiferum (Linn.) Roxb. and primary investigation of drought induced flowering: pathway construction and G-quadruplex prediction based on transcriptome.

    Science.gov (United States)

    Yang, Minglei; Wu, Ying; Jin, Shan; Hou, Jinyan; Mao, Yingji; Liu, Wenbo; Shen, Yangcheng; Wu, Lifang

    2015-01-01

    Sapium sebiferum (Linn.) Roxb. (Chinese Tallow Tree) is a perennial woody tree whose seeds are rich in oil and hold great potential for biodiesel production. Although it is a traditional woody oil plant, our understanding of S. sebiferum genetics and molecular biology remains scant. In this study, the first comprehensive transcriptome of the S. sebiferum flower was generated by sequencing and de novo assembly. A total of 149,342 unigenes were generated from raw reads, of which 24,289 unigenes were successfully matched to public databases. A total of 61 MADS box genes and putative pathways involved in S. sebiferum flower development were identified. An abiotic stress response network was also constructed in this work, in which 2,686 unigenes are involved. As for lipid biosynthesis, 161 unigenes were identified in fatty acid (FA) and triacylglycerol (TAG) biosynthesis. In addition, the G-quadruplexes in the RNA of S. sebiferum were predicted. An interesting finding is that stress-induced flowering was observed in S. sebiferum for the first time. According to the results of semi-quantitative PCR, the expression tendencies of the flowering-related genes GA1, AP2 and CRY2 accorded with those of stress-related genes, such as GRX50435 and PRXⅡ39562. This transcriptome provides functional genomic information for further research on S. sebiferum, especially for genetic engineering to shorten the juvenile period and improve yield by regulating flower development. It also offers a useful database for research on other plants of the Euphorbiaceae family.

  5. Reprogrammed Functional Brown Adipocytes Ameliorate Insulin Resistance and Dyslipidemia in Diet-Induced Obesity and Type 2 Diabetes

    Directory of Open Access Journals (Sweden)

    Tsunao Kishida

    2015-10-01

    Brown adipocytes (BAs) play important roles in body temperature regulation, energy balance, and carbohydrate and lipid metabolism. Activities of BAs are remarkably diminished in obese and diabetic patients, providing possibilities of transplanting functional BAs resulting in therapeutic benefit. Here, we show generation of functional BAs by cellular reprogramming procedures. Transduction of the PRDM16 gene into iPSC-derived embryoid bodies induced BA phenotypes (iBAs). Moreover, normal human fibroblasts were directly converted into BAs (dBAs) by C/EBP-β and C-MYC gene transduction. Approximately 90% of the fibroblasts were successfully converted within 12 days. The dBAs were highly active in mitochondrial biogenesis and oxidative metabolism. Mouse dBAs were induced by Prdm16, C/ebp-β, and L-myc genes, and after transplantation, they significantly reduced diet-induced obesity and insulin resistance in an UCP1-dependent manner. Thus, highly functional BAs can be generated by cellular reprogramming, suggesting a promising tailor-made cell therapy against metabolic disorders including type 2 diabetes mellitus.

  6. Reprogrammed Functional Brown Adipocytes Ameliorate Insulin Resistance and Dyslipidemia in Diet-Induced Obesity and Type 2 Diabetes.

    Science.gov (United States)

    Kishida, Tsunao; Ejima, Akika; Yamamoto, Kenta; Tanaka, Seiji; Yamamoto, Toshiro; Mazda, Osam

    2015-10-13

    Brown adipocytes (BAs) play important roles in body temperature regulation, energy balance, and carbohydrate and lipid metabolism. Activities of BAs are remarkably diminished in obese and diabetic patients, providing possibilities of transplanting functional BAs resulting in therapeutic benefit. Here, we show generation of functional BAs by cellular reprogramming procedures. Transduction of the PRDM16 gene into iPSC-derived embryoid bodies induced BA phenotypes (iBAs). Moreover, normal human fibroblasts were directly converted into BAs (dBAs) by C/EBP-β and C-MYC gene transduction. Approximately 90% of the fibroblasts were successfully converted within 12 days. The dBAs were highly active in mitochondrial biogenesis and oxidative metabolism. Mouse dBAs were induced by Prdm16, C/ebp-β, and L-myc genes, and after transplantation, they significantly reduced diet-induced obesity and insulin resistance in an UCP1-dependent manner. Thus, highly functional BAs can be generated by cellular reprogramming, suggesting a promising tailor-made cell therapy against metabolic disorders including type 2 diabetes mellitus. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  7. RNA sequencing of the human milk fat layer transcriptome reveals distinct gene expression profiles at three stages of lactation.

    Directory of Open Access Journals (Sweden)

    Danielle G Lemay

    Aware of the important benefits of human milk, most U.S. women initiate breastfeeding but difficulties with milk supply lead some to quit earlier than intended. Yet, the contribution of maternal physiology to lactation difficulties remains poorly understood. Human milk fat globules, by enveloping cell contents during their secretion into milk, are a rich source of mammary cell RNA. Here, we pair this non-invasive mRNA source with RNA-sequencing to probe the milk fat layer transcriptome during three stages of lactation: colostral, transitional, and mature milk production. The resulting transcriptomes paint an exquisite portrait of human lactation. The resulting transcriptional profiles cluster not by postpartum day, but by milk Na:K ratio, indicating that women sampled during similar postpartum time frames could be at markedly different stages of gene expression. Each stage of lactation is characterized by a dynamic range (10^5-fold) in transcript abundances not previously observed with microarray technology. We discovered that transcripts for isoferritins and cathepsins are strikingly abundant during colostrum production, highlighting the potential importance of these proteins for neonatal health. Two transcripts, encoding β-casein (CSN2) and α-lactalbumin (LALBA), make up 45% of the total pool of mRNA in mature lactation. Genes significantly expressed across all stages of lactation are associated with making, modifying, transporting, and packaging milk proteins. Stage-specific transcripts are associated with immune defense during the colostral stage, up-regulation of the machinery needed for milk protein synthesis during the transitional stage, and the production of lipids during mature lactation. We observed strong modulation of key genes involved in lactose synthesis and insulin signaling. In particular, protein tyrosine phosphatase, receptor type, F (PTPRF) may serve as a biomarker linking insulin resistance with insufficient milk supply.
This

  8. Chapter 4 genomics, transcriptomics, and epigenomics in traumatic brain injury research.

    Science.gov (United States)

    Puccio, Ava M; Alexander, Sheila

    2015-01-01

    The long-term effects and significant impact of the full spectrum of traumatic brain injury (TBI) have received increased attention in recent years. Despite increased research efforts, there has been little movement toward improving outcomes for the survivors of TBI. TBI is a heterogeneous condition with a complex biological response, and significant variability in human recovery contributes to the difficulty in identifying therapeutics that improve outcomes. Personalized medicine, identifying the best course of treatment for a given individual based on individual characteristics, has great potential to improve recovery for TBI survivors. The advances in medical genetics and genomics over the past 20 years have increased our understanding of many biological processes. A substantial amount of research has focused on the genomic, transcriptomic, and epigenomic profiles in many health and disease states, including recovery from TBI. The focus of this review chapter is to describe the current state of the science in genomic, transcriptomic, and epigenomic research in the TBI population. There have been some advancements toward understanding the genomic, transcriptomic, and epigenomic processes in humans, but much of this work remains at the preclinical stage. This current evidence does improve our understanding of TBI recovery, but also serves as an excellent platform upon which to build further study toward improved outcomes for this population.

  9. Targeted disruption in mice of a neural stem cell-maintaining, KRAB-Zn finger-encoding gene that has rapidly evolved in the human lineage.

    Directory of Open Access Journals (Sweden)

    Huan-Chieh Chien

    Understanding the genetic basis of the physical and behavioral traits that separate humans from other primates is a challenging but intriguing topic. The adaptive functions of the expansion and/or reduction in human brain size have long been explored. From a brain transcriptome project we have identified a KRAB-Zn finger protein-encoding gene (M003-A06) that has rapidly evolved since the human-chimpanzee separation. Quantitative RT-PCR analysis of different human tissues indicates that M003-A06 expression is enriched in the human fetal brain in addition to the fetal heart. Furthermore, analysis using immunofluorescence staining, neurosphere culturing and Western blotting indicates that the mouse ortholog of M003-A06, Zfp568, is expressed mainly in embryonic stem (ES) cells and in fetal as well as adult neural stem cells (NSCs). Conditional gene knockout experiments in mice demonstrate that Zfp568 is both an NSC-maintaining and a brain size-regulating gene. Significantly, molecular genetic analyses show that human M003-A06 consists of 2 equilibrated allelic types, H and C, one of which (H) is human-specific. Combined contemporary genotyping and database mining have revealed interesting genetic associations between the different genotypes of M003-A06 and human head sizes. We propose that M003-A06 is likely one of the genes contributing to the uniqueness of the human brain in comparison to other higher primates.

  10. PATRIC, the bacterial bioinformatics database and analysis resource.

    Science.gov (United States)

    Wattam, Alice R; Abraham, David; Dalay, Oral; Disz, Terry L; Driscoll, Timothy; Gabbard, Joseph L; Gillespie, Joseph J; Gough, Roger; Hix, Deborah; Kenyon, Ronald; Machi, Dustin; Mao, Chunhong; Nordberg, Eric K; Olson, Robert; Overbeek, Ross; Pusch, Gordon D; Shukla, Maulik; Schulman, Julie; Stevens, Rick L; Sullivan, Daniel E; Vonstein, Veronika; Warren, Andrew; Will, Rebecca; Wilson, Meredith J C; Yoo, Hyun Seung; Zhang, Chengdong; Zhang, Yan; Sobral, Bruno W

    2014-01-01

    The Pathosystems Resource Integration Center (PATRIC) is the all-bacterial Bioinformatics Resource Center (BRC) (http://www.patricbrc.org). A joint effort by two of the original National Institute of Allergy and Infectious Diseases-funded BRCs, PATRIC provides researchers with an online resource that stores and integrates a variety of data types [e.g. genomics, transcriptomics, protein-protein interactions (PPIs), three-dimensional protein structures and sequence typing data] and associated metadata. Datatypes are summarized for individual genomes and across taxonomic levels. All genomes in PATRIC, currently more than 10,000, are consistently annotated using RAST, the Rapid Annotations using Subsystems Technology. Summaries of different data types are also provided for individual genes, where comparisons of different annotations are available, and also include available transcriptomic data. PATRIC provides a variety of ways for researchers to find data of interest and a private workspace where they can store both genomic and gene associations, and their own private data. Both private and public data can be analyzed together using a suite of tools to perform comparative genomic or transcriptomic analysis. PATRIC also includes integrated information related to disease and PPIs. All the data and integrated analysis and visualization tools are freely available. This manuscript describes updates to the PATRIC since its initial report in the 2007 NAR Database Issue.

  11. Licensing topical report: application of probabilistic risk assessment in the selection of design basis accidents

    International Nuclear Information System (INIS)

    Houghton, W.J.

    1980-06-01

    A probabilistic risk assessment (PRA) approach is proposed to be used to scrutinize selection of accident sequences. A technique is described in this Licensing Topical Report to identify candidates for Design Basis Accidents (DBAs) utilizing the risk assessment results. As a part of this technique, it is proposed that events with frequencies below a specified limit would not be candidates. The use of the methodology described is supplementary to the traditional, deterministic approach and may result, in some cases, in the selection of multiple failure sequences as DBAs; it may also provide a basis for not considering some traditionally postulated events as being DBAs. A process is then described for selecting a list of DBAs based on the candidates from PRA as supplementary to knowledge and judgments from past licensing practice. These DBAs would be the events considered in Chapter 15 of Safety Analysis Reports of high-temperature gas-cooled reactors
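
    The frequency screening step described in this abstract can be pictured as a simple cut-off filter over the accident sequences produced by a PRA. The sketch below is purely illustrative: the class, the function, the sequence names, the frequencies and the limit are all invented, and the report's actual criteria (which also draw on risk results and licensing judgment) are not reproduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AccidentSequence:
    """Hypothetical record for one accident sequence from a PRA."""
    name: str
    frequency_per_year: float


def dba_candidates(sequences, frequency_limit):
    """Keep sequences at or above the limit; rarer events are screened out."""
    return [s for s in sequences if s.frequency_per_year >= frequency_limit]


# Invented sequences and limit, for illustration only.
sequences = [
    AccidentSequence("sequence A", 1e-3),
    AccidentSequence("sequence B", 5e-6),
    AccidentSequence("sequence C", 2e-8),
]
for seq in dba_candidates(sequences, frequency_limit=1e-6):
    print(seq.name, seq.frequency_per_year)
```

    In this toy data set only the sequence below the assumed limit is dropped; the remaining candidates would still be reviewed against deterministic licensing practice, as the abstract notes.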

  12. Changes in the transcriptome of the human endometrial Ishikawa cancer cell line induced by estrogen, progesterone, tamoxifen, and mifepristone (RU486) as detected by RNA-sequencing.

    Directory of Open Access Journals (Sweden)

    Karin Tamm-Rosenstein

    BACKGROUND: Estrogen (E2) and progesterone (P4) are key players in the maturation of the human endometrium. The corresponding steroid hormone modulators, tamoxifen (TAM) and mifepristone (RU486), are widely used in breast cancer therapy and for contraception purposes, respectively. METHODOLOGY/PRINCIPAL FINDINGS: Gene expression profiling of the human endometrial Ishikawa cancer cell line treated with E2 and P4 for 3 h and 12 h, and TAM and RU486 for 12 h, was performed using RNA-sequencing. High levels of mRNA were detected for genes including PSAP, ATP5G2, ATP5H, and GNB2L1 following E2 or P4 treatment. A total of 82 biomarkers for endometrial biology were identified among E2-induced genes, and 93 among P4-responsive genes. Identified biomarkers included EZH2, MDK, MUC1, SLIT2, and IL6ST, which are genes previously associated with endometrial receptivity. Moreover, 98.8% and 98.6% of E2- and P4-responsive genes in Ishikawa cells, respectively, were also detected in two human mid-secretory endometrial biopsy samples. TAM treatment exhibited both antagonistic and agonistic effects of E2, and also regulated a subset of genes independently. The cell cycle regulator cyclin D1 (CCND1) showed significant up-regulation following treatment with TAM. RU486 did not appear to act as a pure antagonist of P4, and a functional analysis of the RU486 response identified genes related to adhesion and apoptosis, including down-regulated genes associated with cell-cell contacts and adhesion such as CTNND1, JUP, CDH2, IQGAP1, and COL2A1. CONCLUSIONS: Significant changes in gene expression by the Ishikawa cell line were detected after treatments with E2, P4, TAM, and RU486. These transcriptome data provide valuable insight into potential biomarkers related to endometrial receptivity, and also facilitate an understanding of the molecular changes that take place in the endometrium in the early stages of breast cancer treatment and contraception usage.

  13. Global transcriptome analysis of developing chickpea (Cicer arietinum L.) seeds.

    Science.gov (United States)

    Pradhan, Seema; Bandhiwal, Nitesh; Shah, Niraj; Kant, Chandra; Gaur, Rashmi; Bhatia, Sabhyata

    2014-01-01

    Understanding developmental processes, especially in non-model crop plants, is extremely important in order to unravel unique mechanisms regulating development. Chickpea (C. arietinum L.) seeds are especially valued for their high carbohydrate and protein content. Therefore, in order to elucidate the mechanisms underlying seed development in chickpea, deep sequencing of transcriptomes from four developmental stages was undertaken. In this study, next generation sequencing platform was utilized to sequence the transcriptome of four distinct stages of seed development in chickpea. About 1.3 million reads were generated which were assembled into 51,099 unigenes by merging the de novo and reference assemblies. Functional annotation of the unigenes was carried out using the Uniprot, COG and KEGG databases. RPKM based digital expression analysis revealed specific gene activities at different stages of development which was validated using Real time PCR analysis. More than 90% of the unigenes were found to be expressed in at least one of the four seed tissues. DEGseq was used to determine differentially expressing genes which revealed that only 6.75% of the unigenes were differentially expressed at various stages. Homology based comparison revealed 17.5% of the unigenes to be putatively seed specific. Transcription factors were predicted based on HMM profiles built using TF sequences from five legume plants and analyzed for their differential expression during progression of seed development. Expression analysis of genes involved in biosynthesis of important secondary metabolites suggested that chickpea seeds can serve as a good source of antioxidants. Since transcriptomes are a valuable source of molecular markers like simple sequence repeats (SSRs), about 12,000 SSRs were mined in chickpea seed transcriptome and few of them were validated. In conclusion, this study will serve as a valuable resource for improved chickpea breeding.
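
    The RPKM measure mentioned in this abstract normalises a read count by transcript length and sequencing depth. A minimal sketch of the standard formula follows; the function name and the example numbers are illustrative assumptions and are not taken from the study.

```python
def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads Per Kilobase of transcript per Million mapped reads.

    RPKM = reads * 1e9 / (gene length in bp * total mapped reads)
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and total mapped reads must be positive")
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


# Hypothetical unigene: 500 reads on a 2,000 bp transcript,
# in a library of 1,000,000 mapped reads.
example_value = rpkm(500, 2000, 1_000_000)
print(example_value)
```

    Because both length and depth are divided out, values computed this way can be compared between unigenes of different lengths and between libraries of different sizes, which is what makes stage-by-stage expression comparisons possible.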

  14. Global transcriptome analysis of developing chickpea (Cicer arietinum L.) seeds

    Directory of Open Access Journals (Sweden)

    Seema Pradhan

    2014-12-01

    Full Text Available Understanding developmental processes, especially in non-model crop plants, is extremely important in order to unravel unique mechanisms regulating development. Chickpea (C. arietinum L.) seeds are especially valued for their high carbohydrate and protein content. Therefore, in order to elucidate the mechanisms underlying seed development in chickpea, deep sequencing of transcriptomes from four developmental stages was undertaken. In this study, next generation sequencing platform was utilised to sequence the transcriptome of four distinct stages of seed development in chickpea. About 1.3 million reads were generated which were assembled into 51,099 unigenes by merging the de novo and reference assemblies. Functional annotation of the unigenes was carried out using the Uniprot, COG and KEGG databases. RPKM based digital expression analysis revealed specific gene activities at different stages of development which was validated using Real time PCR analysis. More than 90% of the unigenes were found to be expressed in at least one of the four seed tissues. DEGseq was used to determine differentially expressing genes which revealed that only 6.75% of the unigenes were differentially expressed at various stages. Homology based comparison revealed 17.5% of the unigenes to be putatively seed specific. Transcription factors were predicted based on HMM profiles built using TF sequences from five legume plants and analysed for their differential expression during progression of seed development. Expression analysis of genes involved in biosynthesis of important secondary metabolites suggested that chickpea seeds can serve as a good source of antioxidants. Since transcriptomes are a valuable source of molecular markers like simple sequence repeats (SSRs), about 12,000 SSRs were mined in chickpea seed transcriptome and few of them were validated. In conclusion, this study will serve as a valuable resource for improved chickpea breeding.

  15. Genomotyping of Pseudomonas putida strains using P. putida KT2440-based high-density DNA microarrays: Implications for transcriptomics studies

    NARCIS (Netherlands)

    Ballerstedt, H.; Volkers, R.J.M.; Mars, A.E.; Hallsworth, J.E.; Santos, V.A.M.D.; Puchalka, J.; Duuren, J. van; Eggink, G.; Timmis, K.N.; Bont, J.A.M. de; Wery, J.

    2007-01-01

    Pseudomonas putida KT2440 is the only fully sequenced P. putida strain. Thus, for transcriptomics and proteomics studies with other P. putida strains, the P. putida KT2440 genomic database serves as standard reference. The utility of KT2440 whole-genome, high-density oligonucleotide microarrays for

  16. Transcriptome Sequencing in a Tibetan Barley Landrace with High Resistance to Powdery Mildew

    Directory of Open Access Journals (Sweden)

    Xing-Quan Zeng

    2014-01-01

    Full Text Available Hulless barley is an important cereal crop worldwide, especially in Tibet, China. However, this crop is usually susceptible to powdery mildew caused by Blumeria graminis f. sp. hordei. In this study, we aimed to understand the functions and pathways of genes involved in the disease resistance by transcriptome sequencing of a Tibetan barley landrace with high resistance to powdery mildew. A total of 831 significant differentially expressed genes were found in the infected seedlings, covering 19 functions. The terms “cell,” “cell part,” and “extracellular region” in the cellular component category, “binding” and “catalytic” in the molecular function category, and “metabolic process” and “cellular process” in the biological process category together indicated that these functions may be involved in the resistance of the hulless barley to powdery mildew. In addition, 330 KEGG pathways were found using BLASTx with an E-value cut-off of <10^-5. Among them, three pathways, namely, “photosynthesis,” “plant-pathogen interaction,” and “photosynthesis-antenna proteins” had significant matches in the database. Significant expressions of the three pathways were detected at 24 h, 48 h, and 96 h after infection, respectively. These results indicated a complex process of barley response to powdery mildew infection.
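
    Several records in this listing apply an E-value cut-off to BLAST hits. As a hedged illustration only, the sketch below filters hits in BLAST tabular format (the 12-column layout in which the E-value is the 11th field). Whether any of the studies used this output format is an assumption, and the sequence identifiers in the example are hypothetical.

```python
def filter_hits(lines, max_evalue=1e-5):
    """Keep (query, subject, evalue) for hits with evalue strictly below max_evalue.

    Assumes tab-separated BLAST tabular output with the default 12 columns:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
    """
    kept = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        evalue = float(fields[10])  # 11th column holds the E-value
        if evalue < max_evalue:
            kept.append((fields[0], fields[1], evalue))
    return kept


# Two hypothetical hits: one strong, one too weak to pass the cut-off.
example_lines = [
    "\t".join(["unigene_1", "protA", "92.1", "200", "15", "0",
               "1", "600", "1", "200", "2e-30", "250"]),
    "\t".join(["unigene_2", "protB", "35.0", "60", "39", "0",
               "1", "180", "5", "64", "0.003", "38"]),
]
print(filter_hits(example_lines))
```

    The comparison is strict (`<`) to mirror the "<10^-5" wording above; a study reporting "≤ 1e-5" would use `<=` instead.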

  17. Pyrosequencing the Bemisia tabaci transcriptome reveals a highly diverse bacterial community and a robust system for insecticide resistance.

    Directory of Open Access Journals (Sweden)

    Wen Xie

    Full Text Available BACKGROUND: Bemisia tabaci (Gennadius) is a phloem-feeding insect poised to become one of the major insect pests in open field and greenhouse production systems throughout the world. The high level of resistance to insecticides is a main factor that hinders continued use of insecticides for suppression of B. tabaci. Despite its prevalence, little is known about B. tabaci at the genome level. To fill this gap, an invasive B. tabaci B biotype was subjected to pyrosequencing-based transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes. METHODOLOGY AND PRINCIPAL FINDINGS: Using Roche 454 pyrosequencing, 857,205 reads containing approximately 340 megabases were obtained from the B. tabaci transcriptome. De novo assembly generated 178,669 unigenes including 30,980 from insects, 17,881 from bacteria, and 129,808 from the no-hit category. A total of 50,835 (28.45%) unigenes showed similarity to the non-redundant database in GenBank with a cut-off E-value of 10^-5. Among them, 40,611 unigenes were assigned to one or more GO terms and 6,917 unigenes were assigned to 288 known pathways. De novo metatranscriptome analysis revealed highly diverse bacterial symbionts in B. tabaci, and demonstrated the host-symbiont cooperation in amino acid production. In-depth transcriptome analysis identified putative molecular markers, and genes potentially involved in insecticide resistance and nutrient digestion. The utility of this transcriptome was validated by a thiamethoxam resistance study, in which annotated cytochrome P450 genes were significantly overexpressed in the resistant B. tabaci in comparison to its susceptible counterparts. CONCLUSIONS: This transcriptome/metatranscriptome analysis sheds light on the molecular understanding of symbiosis and insecticide resistance in an agriculturally important phloem-feeding insect pest, and lays the foundation for future functional genomics research of the

  18. Functional and Transcriptomic Characterization of Peritoneal Immune-Modulation by Addition of Alanyl-Glutamine to Dialysis Fluid.

    Science.gov (United States)

    Herzog, Rebecca; Kuster, Lilian; Becker, Julia; Gluexam, Tobias; Pils, Dietmar; Spittler, Andreas; Bhasin, Manoj K; Alper, Seth L; Vychytil, Andreas; Aufricht, Christoph; Kratochwill, Klaus

    2017-07-24

    Peritonitis remains a major cause of morbidity and mortality during chronic peritoneal dialysis (PD). Glucose-based PD fluids reduce immunological defenses in the peritoneal cavity. Low concentrations of peritoneal extracellular glutamine during PD may contribute to this immune deficit. For these reasons we have developed a clinical assay to measure the function of the immune-competent cells in PD effluent from PD patients. We then applied this assay to test the impact on peritoneal immune-competence of PD fluid supplementation with alanyl-glutamine (AlaGln) in 6 patients in an open-label, randomized, crossover pilot trial (EudraCT 2012-004004-36), and related the functional results to transcriptome changes in PD effluent cells. Ex-vivo stimulation of PD effluent peritoneal cells increased release of interleukin (IL) 6 and tumor necrosis factor (TNF) α. Both IL-6 and TNF-α were lower at 1 h than at 4 h of the peritoneal equilibration test but the reductions in cytokine release were attenuated in AlaGln-supplemented samples. AlaGln-supplemented samples exhibited priming of IL-6-related pathways and downregulation of TNF-α upstream elements. Results from measurement of cytokine release and transcriptome analysis in this pilot clinical study support the conclusion that suppression of PD effluent cell immune function in human subjects by standard PD fluid is attenuated by AlaGln supplementation.

  19. Novel Polymerase Gene Mutations for Human Adaptation in Clinical Isolates of Avian H5N1 Influenza Viruses.

    Directory of Open Access Journals (Sweden)

    Yasuha Arai

    2016-04-01

    Full Text Available A major determinant in the change of the avian influenza virus host range to humans is the E627K substitution in the PB2 polymerase protein. However, the polymerase activity of avian influenza viruses with a single PB2-E627K mutation is still lower than that of seasonal human influenza viruses, implying that avian viruses require polymerase mutations in addition to PB2-627K for human adaptation. Here, we used a database search of H5N1 clade 2.2.1 virus sequences with the PB2-627K mutation to identify other polymerase adaptation mutations that have been selected in infected patients. Several of the mutations identified acted cooperatively with PB2-627K to increase viral growth in human airway epithelial cells and mouse lungs. These mutations were in multiple domains of the polymerase complex other than the PB2-627 domain, highlighting a complicated avian-to-human adaptation pathway of avian influenza viruses. Thus, H5N1 viruses could rapidly acquire multiple polymerase mutations that function cooperatively with PB2-627K in infected patients for optimal human adaptation.

  20. DENdb: database of integrated human enhancers

    KAUST Repository

    Ashoor, Haitham

    2015-09-05

    Enhancers are cis-acting DNA regulatory regions that play a key role in distal control of transcriptional activities. Identification of enhancers, coupled with a comprehensive functional analysis of their properties, could improve our understanding of complex gene transcription mechanisms and gene regulation processes in general. We developed DENdb, a centralized on-line repository of predicted enhancers derived from multiple human cell-lines. DENdb integrates enhancers predicted by five different methods generating an enriched catalogue of putative enhancers for each of the analysed cell-lines. DENdb provides information about the overlap of enhancers with DNase I hypersensitive regions, ChIP-seq regions of a number of transcription factors and transcription factor binding motifs, means to explore enhancer interactions with DNA using several chromatin interaction assays and enhancer neighbouring genes. DENdb is designed as a relational database that facilitates fast and efficient searching, browsing and visualization of information.

  1. DENdb: database of integrated human enhancers

    KAUST Repository

    Ashoor, Haitham; Kleftogiannis, Dimitrios A.; Radovanovic, Aleksandar; Bajic, Vladimir B.

    2015-01-01

    Enhancers are cis-acting DNA regulatory regions that play a key role in distal control of transcriptional activities. Identification of enhancers, coupled with a comprehensive functional analysis of their properties, could improve our understanding of complex gene transcription mechanisms and gene regulation processes in general. We developed DENdb, a centralized on-line repository of predicted enhancers derived from multiple human cell-lines. DENdb integrates enhancers predicted by five different methods generating an enriched catalogue of putative enhancers for each of the analysed cell-lines. DENdb provides information about the overlap of enhancers with DNase I hypersensitive regions, ChIP-seq regions of a number of transcription factors and transcription factor binding motifs, means to explore enhancer interactions with DNA using several chromatin interaction assays and enhancer neighbouring genes. DENdb is designed as a relational database that facilitates fast and efficient searching, browsing and visualization of information.
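
    The overlap information described here (enhancers versus DNase I hypersensitive regions) rests on a simple interval test. The sketch below shows that test under stated assumptions: half-open coordinates, a hypothetical `Region` type, and invented example coordinates. It is an illustration, not DENdb's actual schema or implementation.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    """A genomic interval with half-open coordinates [start, end)."""
    chrom: str
    start: int
    end: int


def overlaps(a: Region, b: Region) -> bool:
    """Two regions overlap if they share a chromosome and at least one base."""
    return a.chrom == b.chrom and a.start < b.end and b.start < a.end


def enhancers_with_overlap(enhancers: List[Region],
                           other_regions: List[Region]) -> List[Region]:
    """Return the enhancers that overlap at least one of the other regions."""
    return [e for e in enhancers
            if any(overlaps(e, r) for r in other_regions)]


# Hypothetical coordinates for illustration only.
enhancers = [Region("chr1", 100, 200), Region("chr1", 500, 600),
             Region("chr2", 100, 200)]
dhs_sites = [Region("chr1", 150, 180), Region("chr2", 300, 400)]
print(enhancers_with_overlap(enhancers, dhs_sites))
```

    This pairwise scan is quadratic in the number of regions; a relational database such as the one described would more plausibly rely on indexed range queries, and sorted-interval or interval-tree approaches are the usual choice at genome scale.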

  2. A first insight into Pycnoporus sanguineus BAFC 2126 transcriptome.

    Directory of Open Access Journals (Sweden)

    Cristian O Rohr

    Full Text Available Fungi of the genus Pycnoporus are white-rot basidiomycetes widely studied because of their ability to synthesize high added-value compounds and enzymes of industrial interest. Here we report the sequencing, assembly and analysis of the transcriptome of Pycnoporus sanguineus BAFC 2126 grown at stationary phase, in media supplemented with copper sulfate. Using the 454 pyrosequencing platform we obtained a total of 226,336 reads (88,779,843 bases) that were filtered and de novo assembled to generate a reference transcriptome of 7,303 transcripts. Putative functions were assigned for 4,732 transcripts by searching similarities of six-frame translated sequences against a customized protein database and by the presence of conserved protein domains. Through the analysis of translated sequences we identified transcripts encoding 178 putative carbohydrate active enzymes, including representatives of 15 families with roles in lignocellulose degradation. Furthermore, we found many transcripts encoding enzymes related to lignin hydrolysis and modification, including laccases and peroxidases, as well as GMC oxidoreductases, copper radical oxidases and other enzymes involved in the generation of extracellular hydrogen peroxide and iron homeostasis. Finally, we identified the transcripts encoding all of the enzymes involved in terpenoid backbone biosynthesis pathway, various terpene synthases related to the biosynthesis of sesquiterpenoids and triterpenoids precursors, and also cytochrome P450 monooxygenases, glutathione S-transferases and epoxide hydrolases with potential functions in the biodegradation of xenobiotics and the enantioselective biosynthesis of biologically active drugs. To our knowledge this is the first report of a transcriptome of genus Pycnoporus and a resource for future molecular studies in P. sanguineus.

  3. Distribution of cellular HSV-1 receptor expression in human brain.

    Science.gov (United States)

    Lathe, Richard; Haas, Juergen G

    2017-06-01

    Herpes simplex virus type 1 (HSV-1) is a neurotropic virus linked to a range of acute and chronic neurological disorders affecting distinct regions of the brain. Unusually, HSV-1 entry into cells requires the interaction of viral proteins glycoprotein D (gD) and glycoprotein B (gB) with distinct cellular receptor proteins. Several different gD and gB receptors have been identified, including TNFRSF14/HVEM and PVRL1/nectin 1 as gD receptors and PILRA, MAG, and MYH9 as gB receptors. We investigated the expression of these receptor molecules in different areas of the adult and developing human brain using online transcriptome databases. Whereas all HSV-1 receptors showed distinct expression patterns in different brain areas, the Allen Brain Atlas (ABA) reported increased expression of both gD and gB receptors in the hippocampus. Specifically, for PVRL1, TNFRSF14, and MYH9, the differential z scores for hippocampal expression, a measure of relative levels of increased expression, rose to 2.9, 2.9, and 2.5, respectively, comparable to the z score for the archetypical hippocampus-enriched mineralocorticoid receptor (NR3C2, z = 3.1). These data were confirmed at the Human Brain Transcriptome (HBT) database, but HBT data indicate that MAG expression is also enriched in hippocampus. The HBT database allowed the developmental pattern of expression to be investigated; we report that all HSV-1 receptors markedly increase in expression levels between gestation and the postnatal/adult periods. These results suggest that differential receptor expression levels of several HSV-1 gD and gB receptors in the adult hippocampus are likely to underlie the susceptibility of this brain region to HSV-1 infection.
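
    The z scores quoted in this abstract express how far expression in one region sits from the average across regions, in units of standard deviation. The abstract does not say how the atlas derives its differential z scores, so the sketch below only shows the generic definition z = (x - mean) / sd; the region names and expression values are invented for illustration.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Return the z score of each value relative to the whole list.

    Uses the population standard deviation; raises ValueError when all
    values are identical, because the z score is undefined in that case.
    """
    mu = mean(values)
    sd = pstdev(values)
    if sd == 0:
        raise ValueError("z scores are undefined when all values are equal")
    return [(v - mu) / sd for v in values]


# Hypothetical expression levels of one gene across four brain regions.
regions = ["cortex", "cerebellum", "hippocampus", "thalamus"]
expression = [4.0, 3.5, 7.0, 4.5]
for name, z in zip(regions, z_scores(expression)):
    print(f"{name}: z = {z:.2f}")
```

    Under this definition a region with a clearly positive z score, such as the invented hippocampus value here, is one where the gene is expressed well above its cross-region average.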

  4. Analysis of transcriptome differences between resistant and susceptible strains of the citrus red mite Panonychus citri (Acari: Tetranychidae).

    Directory of Open Access Journals (Sweden)

    Bin Liu

    Full Text Available BACKGROUND: The citrus red mite is a worldwide citrus pest and a common sensitizing allergen of asthma and rhinitis. It has developed strong resistance to many registered acaricides. However, the molecular mechanisms of resistance remain unknown. We therefore used next generation sequencing technology to compare the global transcriptomes of resistant strains (RS) and susceptible strains (SS). RESULTS: We obtained 34,159, 30,466 and 32,217 unigenes by assembling the SS reads, RS reads and SS&RS reads, respectively. A total of 17,581 unigenes from the SS&RS reads were annotated by BLAST searches against the nr, Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases with an E-value ≤ 1e-5; of these, 7,075 unigenes were annotated in the COG database, 12,712 unigenes were found in the KEGG database and 3,812 unigenes were assigned to Gene Ontology (GO) terms. Moreover, 2,701 unigenes were judged to be differentially expressed genes (DEGs) based on the uniquely mapped reads. There are 219 pathways in all annotated unigenes and 198 pathways in DEGs that mapped to the KEGG database. We identified 211 metabolism genes and target genes related to general insecticide resistance such as P450 and Cytochrome b, and further compared their differences between RS and SS. Meanwhile, we identified 105 and 194 genes related to growth and reproduction, respectively, based on the mode of action of Hexythiazox. After further analyses, we found variation in sequences but not in gene expression related to mite growth and reproduction between different strains. CONCLUSION: To our knowledge, this is the first comparative transcriptome study to discover candidate genes involved in phytophagous mite resistance. This study identified differential unigenes related to general pesticide resistance and organism growth and reproduction in P. citri. The assembled, annotated transcriptomes provide a valuable genomic resource for further understanding

  5. Unique transcriptomic response to sepsis is observed among patients of different age groups.

    Science.gov (United States)

    Raymond, Steven L; López, María Cecilia; Baker, Henry V; Larson, Shawn D; Efron, Philip A; Sweeney, Timothy E; Khatri, Purvesh; Moldawer, Lyle L; Wynn, James L

    2017-01-01

    Sepsis is a major cause of morbidity and mortality, especially at the extremes of age. To understand the human age-specific transcriptomic response to sepsis, a multi-cohort, pooled analysis was conducted on adults, children, infants, and neonates with and without sepsis. Nine public whole-blood gene expression datasets (636 patients) were employed. Age impacted the transcriptomic host response to sepsis. Gene expression from septic neonates and adults was more dissimilar whereas infants and children were more similar. Neonates showed reductions in inflammatory recognition and signaling pathways compared to all other age groups. Likewise, adults demonstrated decreased pathogen sensing, inflammation, and myeloid cell function, as compared to children. This may help to explain the increased incidence of sepsis-related organ failure and death in adults. The number of dysregulated genes in septic patients was proportional to age and significantly differed among septic adults, children, infants, and neonates. Overall, children manifested a greater transcriptomic intensity to sepsis as compared to the other age groups. The transcriptomic magnitude for adults and neonates was dramatically reduced as compared to children and infants. These findings suggest that the transcriptomic response to sepsis is age-dependent, and diagnostic and therapeutic efforts to identify and treat sepsis will have to consider age as an important variable.

  6. Tissue-specific transcriptome profiling of Plutella xylostella third instar larval midgut.

    Science.gov (United States)

    Xie, Wen; Lei, Yanyuan; Fu, Wei; Yang, Zhongxia; Zhu, Xun; Guo, Zhaojiang; Wu, Qingjun; Wang, Shaoli; Xu, Baoyun; Zhou, Xuguo; Zhang, Youjun

    2012-01-01

    The larval midgut of diamondback moth, Plutella xylostella, is a dynamic tissue that interfaces with a diverse array of physiological and toxicological processes, including nutrient digestion and allocation, xenobiotic detoxification, innate and adaptive immune response, and pathogen defense. Despite its enormous agricultural importance, the genomic resources for P. xylostella are surprisingly scarce. In this study, a Bt resistant P. xylostella strain was subjected to the in-depth transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes in the P. xylostella larval midgut. Using Illumina deep sequencing, we obtained roughly 40 million reads containing approximately 3.6 gigabases of sequence data. De novo assembly generated 63,312 ESTs with an average read length of 416 bp, and approximately half of the P. xylostella sequences (45.4%, 28,768) showed similarity to the non-redundant database in GenBank with a cut-off E-value below 10^-5. Among them, 11,092 unigenes were assigned to one or multiple GO terms and 16,732 unigenes were assigned to 226 specific pathways. In-depth analysis identified genes putatively involved in insecticide resistance, nutrient digestion, and innate immune defense. Besides conventional detoxification enzymes and insecticide targets, novel genes, including 28 chymotrypsins and 53 ABC transporters, have been uncovered in the P. xylostella larval midgut transcriptome, which are potentially linked to the Bt toxicity and resistance. Furthermore, an unexpectedly high number of ESTs, including 46 serpins and 7 lysozymes, were predicted to be involved in the immune defense. As the first tissue-specific transcriptome analysis of P. xylostella, this study sheds light on the molecular understanding of insecticide resistance, especially Bt resistance in an agriculturally important insect pest, and lays the foundation for future functional genomics research. In addition, current

  7. Tissue-Specific Transcriptome Profiling of Plutella Xylostella Third Instar Larval Midgut

    Science.gov (United States)

    Xie, Wen; Lei, Yanyuan; Fu, Wei; Yang, Zhongxia; Zhu, Xun; Guo, Zhaojiang; Wu, Qingjun; Wang, Shaoli; Xu, Baoyun; Zhou, Xuguo; Zhang, Youjun

    2012-01-01

    The larval midgut of diamondback moth, Plutella xylostella, is a dynamic tissue that interfaces with a diverse array of physiological and toxicological processes, including nutrient digestion and allocation, xenobiotic detoxification, innate and adaptive immune response, and pathogen defense. Despite its enormous agricultural importance, the genomic resources for P. xylostella are surprisingly scarce. In this study, a Bt resistant P. xylostella strain was subjected to the in-depth transcriptome analysis to identify genes and gene networks putatively involved in various physiological and toxicological processes in the P. xylostella larval midgut. Using Illumina deep sequencing, we obtained roughly 40 million reads containing approximately 3.6 gigabases of sequence data. De novo assembly generated 63,312 ESTs with an average read length of 416 bp, and approximately half of the P. xylostella sequences (45.4%, 28,768) showed similarity to the non-redundant database in GenBank with a cut-off E-value below 10^-5. Among them, 11,092 unigenes were assigned to one or multiple GO terms and 16,732 unigenes were assigned to 226 specific pathways. In-depth analysis identified genes putatively involved in insecticide resistance, nutrient digestion, and innate immune defense. Besides conventional detoxification enzymes and insecticide targets, novel genes, including 28 chymotrypsins and 53 ABC transporters, have been uncovered in the P. xylostella larval midgut transcriptome, which are potentially linked to the Bt toxicity and resistance. Furthermore, an unexpectedly high number of ESTs, including 46 serpins and 7 lysozymes, were predicted to be involved in the immune defense. As the first tissue-specific transcriptome analysis of P. xylostella, this study sheds light on the molecular understanding of insecticide resistance, especially Bt resistance in an agriculturally important insect pest, and lays the foundation for future functional genomics research. In addition, current

  8. The transcriptome of Trichuris suis--first molecular insights into a parasite with curative properties for key immune diseases of humans.

    Directory of Open Access Journals (Sweden)

    Cinzia Cantacessi

    Full Text Available Iatrogenic infection of humans with Trichuris suis (a parasitic nematode of swine) is being evaluated or promoted as a biological, curative treatment of immune diseases, such as inflammatory bowel disease (IBD) and ulcerative colitis, in humans. Although it is understood that short-term T. suis infection in people with such diseases usually induces a modified Th2-immune response, nothing is known about the molecules in the parasite that induce this response. As a first step toward filling the gaps in our knowledge of the molecular biology of T. suis, we characterised the transcriptome of the adult stage of this nematode employing next-generation sequencing and bioinformatic techniques. A total of ∼65,000,000 reads were generated and assembled into ∼20,000 contiguous sequences (contigs); ∼17,000 peptides were predicted and classified based on homology searches, protein motifs and gene ontology and biological pathway mapping. These analyses provided interesting insights into a number of molecular groups, particularly predicted excreted/secreted molecules (n = 1,288), likely to be involved in the parasite-host interactions, and also various molecules (n = 120) linked to chemokine, T-cell receptor and TGF-β signalling as well as leukocyte transendothelial migration and natural killer cell-mediated cytotoxicity, which are likely to be immuno-regulatory or -modulatory in the infected host. This information provides a conceptual framework within which to test the immunobiological basis for the curative effect of T. suis infection in humans against some immune diseases. Importantly, the T. suis transcriptome characterised herein provides a curated resource for detailed studies of the immuno-molecular biology of this parasite, and will underpin future genomic and proteomic explorations.

  9. The developmental transcriptome of Drosophila melanogaster

    Energy Technology Data Exchange (ETDEWEB)

    University of Connecticut; Graveley, Brenton R.; Brooks, Angela N.; Carlson, Joseph W.; Duff, Michael O.; Landolin, Jane M.; Yang, Li; Artieri, Carlo G.; van Baren, Marijke J.; Boley, Nathan; Booth, Benjamin W.; Brown, James B.; Cherbas, Lucy; Davis, Carrie A.; Dobin, Alex; Li, Renhua; Lin, Wei; Malone, John H.; Mattiuzzo, Nicolas R.; Miller, David; Sturgill, David; Tuch, Brian B.; Zaleski, Chris; Zhang, Dayu; Blanchette, Marco; Dudoit, Sandrine; Eads, Brian; Green, Richard E.; Hammonds, Ann; Jiang, Lichun; Kapranov, Phil; Langton, Laura; Perrimon, Norbert; Sandler, Jeremy E.; Wan, Kenneth H.; Willingham, Aarron; Zhang, Yu; Zou, Yi; Andrews, Justen; Bicke, Peter J.; Brenner, Steven E.; Brent, Michael R.; Cherbas, Peter; Gingeras, Thomas R.; Hoskins, Roger A.; Kaufman, Thomas C.; Oliver, Brian; Celniker, Susan E.

    2010-12-02

    Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development. Drosophila melanogaster is an important non-mammalian model system that has had a critical role in basic biological discoveries, such as identifying chromosomes as the carriers of genetic information and uncovering the role of genes in development. Because it shares a substantial genic content with humans, Drosophila is increasingly used as a translational model for human development, homeostasis and disease. High-quality maps are needed for all functional genomic elements. Previous studies demonstrated that a rich collection of genes is deployed during the life cycle of the fly. Although expression profiling using microarrays has revealed the expression of ~13,000 annotated genes, it is difficult to map splice junctions and individual base modifications generated by RNA editing using such approaches. Single-base resolution is essential to define precisely the elements that comprise the Drosophila transcriptome. Estimates of the number of transcript isoforms are less accurate than estimates of the number of genes

  10. Human error and the associated recovery probabilities for soft control being used in the advanced MCRs of NPPs

    International Nuclear Information System (INIS)

    Jang, Inseok; Jung, Wondea; Seong, Poong Hyun

    2016-01-01

    Highlights: • The operating environment of MCRs in NPPs has changed with the adoption of digital HSIs. • Most current HRA databases are not explicitly designed to deal with digital HSIs. • An empirical analysis for a new HRA DB is carried out using an advanced MCR mockup. • It is expected that the results can be used for advanced MCR HRA. - Abstract: Since the Three Mile Island (TMI)-2 accident, human error has been recognized as one of the main causes of Nuclear Power Plant (NPP) accidents, and numerous studies related to Human Reliability Analysis (HRA) have been carried out. Most of these studies focused on the conventional Main Control Room (MCR) environment. However, the operating environment of MCRs in NPPs has changed with the adoption of new human-system interfaces (HSIs) largely based on up-to-date digital technologies. The MCRs that include these digital and computer technologies, such as large display panels, computerized procedures, and soft controls, are called advanced MCRs. Among the many features of advanced MCRs, soft controls are particularly important because operating actions in advanced MCRs are performed by soft control. Due to the difference in interfaces between soft controls and hardwired conventional controls, different human error probabilities (HEPs) should be used in the HRA for advanced MCRs. Unfortunately, most current HRA databases deal with operations in conventional MCRs and are not explicitly designed to deal with digital HSIs. For this reason, empirical human error and the associated error recovery probabilities were collected from the mockup of an advanced MCR equipped with soft controls. To this end, small-scale experiments were conducted with 48 graduate students from the Department of Nuclear Engineering at the Korea Advanced Institute of Science and Technology (KAIST), and accident scenarios were designed with respect to the typical Design Basis Accidents (DBAs) in NPPs, such as Steam Generator Tube Rupture

  11. Characterizing Ancylostoma caninum transcriptome and exploring nematode parasitic adaptation

    Directory of Open Access Journals (Sweden)

    Hawdon John

    2010-05-01

    Full Text Available Abstract Background Hookworm infection is one of the most important neglected diseases in developing countries, with approximately 1 billion people infected worldwide. To better understand hookworm biology and nematode parasitism, the present study generated a near-complete transcriptome of the canine hookworm Ancylostoma caninum at very high coverage using high-throughput technology, and compared it to those of the free-living nematode Caenorhabditis elegans and the parasite Brugia malayi. Results The transcripts generated from four developmental stages (infective L3, serum-stimulated L3, adult male and adult female) covered 93% of the A. caninum transcriptome. The broad diversity among nematode transcriptomes was confirmed, and an impact of parasitic adaptation on transcriptome diversity was inferred. Intra-population analysis showed that A. caninum has higher coding sequence diversity than humans. Examining the developmental expression profiles of A. caninum revealed major transitions in gene expression from larval stages to adult. Adult males expressed the highest number of selectively expressed genes, but adult females expressed the highest number of selective parasitism-related genes. Genes related to parasitic adaptation and A. caninum-specific genes exhibited more expression selectivity, while those conserved in nematodes tended to be consistently expressed. Parasitism-related genes were expressed more selectively in adult male and female worms. The comprehensive analysis of digital expression profiles, along with transcriptome comparisons, enabled identification of a set of parasitism genes encoding secretory proteins in animal-parasitic nematodes. Conclusions This study validated the use of deep sequencing for gene expression profiling. Parasitic adaptation of the canine hookworm is related to its diversity and developmental dynamics. This comprehensive comparative genomic and expression study substantially improves our understanding of

  12. Seminal plasma induces global transcriptomic changes associated with cell migration, proliferation and viability in endometrial epithelial cells and stromal fibroblasts.

    Science.gov (United States)

    Chen, Joseph C; Johnson, Brittni A; Erikson, David W; Piltonen, Terhi T; Barragan, Fatima; Chu, Simon; Kohgadai, Nargis; Irwin, Juan C; Greene, Warner C; Giudice, Linda C; Roan, Nadia R

    2014-06-01

    How does seminal plasma (SP) affect the transcriptome of human primary endometrial epithelial cells (eEC) and stromal fibroblasts (eSF)? Exposure of eEC and eSF to SP in vitro increases expression of genes and secreted proteins associated with cellular migration, proliferation, viability and inhibition of cell death. Studies in both humans and animals suggest that SP can access and induce physiological changes in the upper female reproductive tract (FRT), which may participate in promoting reproductive success. This is a cross sectional study involving control samples versus treatment. SP (pooled from twenty donors) was first tested for dose- and time-dependent cytotoxic effects on eEC and eSF (n = 4). As exposure of eEC or eSF to 1% SP for 6 h proved to be non-toxic, a second set of eEC/eSF samples (n = 4) was treated under these conditions for transcriptome, protein and functional analysis. With a third set of samples (n = 3), we further compared the transcriptional response of the cells to SP versus fresh semen. eEC and eSF were isolated from endometrial biopsies from women of reproductive age undergoing benign gynecologic procedures and maintained in vitro. RNA was isolated and processed for microarray studies to analyze global transcriptomic changes. Secreted factors in conditioned media from SP-treated cells were analyzed by Luminex and for the ability to stimulate migration of CD14+ monocytes and CD4+ T cells. Pathway identifications were determined using the Z-scoring system in Ingenuity Pathways Analysis (Z scores ≥|1.5|). SP induced transcriptomic changes (P reproductive success, female reproductive health and susceptibility to sexually transmitted diseases. The gene list provided by the transcriptome analysis reported here should prove a valuable resource for understanding the response of the upper FRT to SP exposure. This project was supported by NIH AI083050-04 (W.C.G./L.C.G.); NIH U54HD 055764 (L.C.G.); NIH 1F32HD074423-02 (J.C.C.); DOD W81XWH-11

  13. Visual analysis of transcriptome data in the context of anatomical structures and biological networks

    Directory of Open Access Journals (Sweden)

    Astrid Junker

    2012-11-01

    Full Text Available The complexity and temporal as well as spatial resolution of transcriptome datasets is constantly increasing due to extensive technological developments. Here we present methods for advanced visualization and intuitive exploration of transcriptomics data as necessary prerequisites in order to facilitate the gain of biological knowledge. Color-coding of structural images based on the expression level enables a fast visual data analysis in the background of the examined biological system. The network-based exploration of these visualizations allows for comparative analysis of genes with specific transcript patterns and supports the extraction of functional relationships even from large datasets. In order to illustrate the presented methods, the tool HIVE was applied for visualization and exploration of database-retrieved expression data for master regulators of Arabidopsis thaliana flower and seed development in the context of corresponding tissue-specific regulatory networks.

  14. Application of transcriptomics in Chinese herbal medicine studies

    Directory of Open Access Journals (Sweden)

    Hsin-Yi Lo

    2012-04-01

    Full Text Available Transcriptomics using DNA microarrays has become a practical and popular tool for herbal medicine studies because of its high throughput, sensitivity, accuracy, specificity, and reproducibility. This article therefore gives an overview of DNA microarray technology and its application in Chinese herbal medicine studies. To understand the number and the objectives of articles utilizing DNA microarrays for herbal medicine studies, we surveyed 297 frequently used Chinese medicinal herbs listed by the Pharmacopoeia Commission of the People’s Republic of China. We classified these medicinal herbs into 109 families and then ran PubMed searches using “microarray” and each individual herbal family as keywords. Although thousands of papers applying DNA microarrays in Chinese herbal studies have been published since 1998, most of the articles focus on elucidating the mechanisms of certain biological effects of herbs. A bioactivity database containing large-scale gene expression profiles of quality-controlled herbs could be constructed in the future and used to analyze the biological events induced by herbs, predict the therapeutic potential of herbs, evaluate the safety of herbs, and identify drug candidates from herbs. Moreover, the linkage of systems biology tools, such as functional genomics, transcriptomics, proteomics, metabolomics, pharmacogenomics and toxicogenomics, will become a new translational platform between Western medicine and Chinese herbal medicine.

  15. Metabolomic Dynamic Analysis of Hypoxia in MDA-MB-231 and the Comparison with Inferred Metabolites from Transcriptomics Data

    Energy Technology Data Exchange (ETDEWEB)

    Tsai, I-Lin [Department of Pharmacy, National Taiwan University, No. 1, Jen-Ai Road, Section 1 Taipei 10051, Taiwan (China); The Metabolomics Group, National Taiwan University, Taipei 106, Taiwan (China); Center for Genomic Medicine, National Taiwan University, Taipei 10051, Taiwan (China); Kuo, Tien-Chueh [The Metabolomics Group, National Taiwan University, Taipei 106, Taiwan (China); Graduate Institute of Biomedical Electronic and Bioinformatics, National Taiwan University, Room 410 BL Building, No. 1, Roosevelt Road, Sec. 4, Taipei 106, Taiwan (China); Ho, Tsung-Jung [The Metabolomics Group, National Taiwan University, Taipei 106, Taiwan (China); Department of Computer Science and Information Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., Taipei 10617, Taiwan (China); Harn, Yeu-Chern [The Metabolomics Group, National Taiwan University, Taipei 106, Taiwan (China); Graduate Institute of Networking and Multimedia, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., Taipei 10617, Taiwan (China); Wang, San-Yuan [The Metabolomics Group, National Taiwan University, Taipei 106, Taiwan (China); Department of Computer Science and Information Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., Taipei 10617, Taiwan (China); Fu, Wen-Mei [Department of Pharmacology, National Taiwan University, 11 F No. 1 Sec. 1, Ren-ai Rd., Taipei 10051, Taiwan (China); Kuo, Ching-Hua, E-mail: kuoch@ntu.edu.tw [Department of Pharmacy, National Taiwan University, No. 1, Jen-Ai Road, Section 1 Taipei 10051, Taiwan (China); The Metabolomics Group, National Taiwan University, Taipei 106, Taiwan (China); Center for Genomic Medicine, National Taiwan University, Taipei 10051, Taiwan (China); Tseng, Yufeng Jane, E-mail: kuoch@ntu.edu.tw [Department of Pharmacy, National Taiwan University, No. 1, Jen-Ai Road, Section 1 Taipei 10051, Taiwan (China); The Metabolomics Group, National Taiwan University, Taipei 106, Taiwan (China); Center for Genomic Medicine, National Taiwan University, Taipei 10051, Taiwan (China); Graduate Institute of Biomedical Electronic and Bioinformatics, National Taiwan University, Room 410 BL Building, No. 1, Roosevelt Road, Sec. 4, Taipei 106, Taiwan (China); Department of Computer Science and Information Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., Taipei 10617, Taiwan (China)

    2013-05-03

    Hypoxia affects the tumor microenvironment and is considered important to metastasis progression and therapy resistance. Thus far, the majority of global analyses of tumor hypoxia responses have been limited to just a single omics level. Combining multiple omics data can broaden our understanding of tumor hypoxia. Here, we investigate the temporal change of the metabolite composition with gene expression data from literature to provide a more comprehensive insight into the system level in response to hypoxia. Nuclear magnetic resonance spectroscopy was used to perform metabolomic profiling on the MDA-MB-231 breast cancer cell line under hypoxic conditions. Multivariate statistical analysis revealed that the metabolic difference between hypoxia and normoxia was similar over 24 h, but became distinct over 48 h. Time dependent microarray data from the same cell line in the literature displayed different gene expressions under hypoxic and normoxic conditions mostly at 12 h or earlier. The direct metabolomic profiles show a large overlap with theoretical metabolic profiles deduced from previous transcriptomic studies. Consistent pathways are glycolysis/gluconeogenesis, pyruvate, purine and arginine and proline metabolism. Ten metabolic pathways revealed by metabolomics were not covered by the downstream of the known transcriptomic profiles, suggesting new metabolic phenotypes. These results confirm previous transcriptomics understanding and expand the knowledge from existing models on correlation and co-regulation between transcriptomic and metabolomics profiles, which demonstrates the power of integrated omics analysis.

  16. Metabolomic Dynamic Analysis of Hypoxia in MDA-MB-231 and the Comparison with Inferred Metabolites from Transcriptomics Data

    International Nuclear Information System (INIS)

    Tsai, I-Lin; Kuo, Tien-Chueh; Ho, Tsung-Jung; Harn, Yeu-Chern; Wang, San-Yuan; Fu, Wen-Mei; Kuo, Ching-Hua; Tseng, Yufeng Jane

    2013-01-01

    Hypoxia affects the tumor microenvironment and is considered important to metastasis progression and therapy resistance. Thus far, the majority of global analyses of tumor hypoxia responses have been limited to just a single omics level. Combining multiple omics data can broaden our understanding of tumor hypoxia. Here, we investigate the temporal change of the metabolite composition with gene expression data from literature to provide a more comprehensive insight into the system level in response to hypoxia. Nuclear magnetic resonance spectroscopy was used to perform metabolomic profiling on the MDA-MB-231 breast cancer cell line under hypoxic conditions. Multivariate statistical analysis revealed that the metabolic difference between hypoxia and normoxia was similar over 24 h, but became distinct over 48 h. Time dependent microarray data from the same cell line in the literature displayed different gene expressions under hypoxic and normoxic conditions mostly at 12 h or earlier. The direct metabolomic profiles show a large overlap with theoretical metabolic profiles deduced from previous transcriptomic studies. Consistent pathways are glycolysis/gluconeogenesis, pyruvate, purine and arginine and proline metabolism. Ten metabolic pathways revealed by metabolomics were not covered by the downstream of the known transcriptomic profiles, suggesting new metabolic phenotypes. These results confirm previous transcriptomics understanding and expand the knowledge from existing models on correlation and co-regulation between transcriptomic and metabolomics profiles, which demonstrates the power of integrated omics analysis

  17. Transcriptome datasets of oil palm pathogen Ganoderma boninense

    Directory of Open Access Journals (Sweden)

    Irene Liza Isaac

    2018-04-01

    Full Text Available Ganoderma boninense is known to be the causal agent of basal stem rot (BSR), which affects the oil palm industry worldwide and causes high economic losses every year. Several reports have shown that a compatible monokaryon pair needs to mate, producing dikaryotic mycelia, to initiate infection of the oil palm. However, the molecular events that occur during the mating process are not well understood. We performed transcriptome sequencing using Illumina RNA-seq technology and de novo assembly of the transcripts from monokaryon, mating junction and dikaryon mycelia of G. boninense. Raw reads from these three libraries were deposited in the NCBI database with accession numbers SRR1745787, SRR1745773 and SRR1745777, respectively.

  18. Flower bud transcriptome analysis of Sapium sebiferum (Linn.) Roxb. and primary investigation of drought induced flowering: pathway construction and G-quadruplex prediction based on transcriptome.

    Directory of Open Access Journals (Sweden)

    Minglei Yang

    Full Text Available Sapium sebiferum (Linn.) Roxb. (Chinese Tallow Tree) is a perennial woody tree whose seeds are rich in oil and hold great potential for biodiesel production. Although it is a traditional woody oil plant, our understanding of S. sebiferum genetics and molecular biology remains scant. In this study, the first comprehensive transcriptome of the S. sebiferum flower was generated by sequencing and de novo assembly. A total of 149,342 unigenes were generated from raw reads, of which 24,289 unigenes were successfully matched to public databases. A total of 61 MADS box genes and putative pathways involved in S. sebiferum flower development were identified. An abiotic stress response network was also constructed in this work, in which 2,686 unigenes are involved. As for lipid biosynthesis, 161 unigenes were identified in fatty acid (FA) and triacylglycerol (TAG) biosynthesis. In addition, the G-quadruplexes in the RNA of S. sebiferum were predicted. An interesting finding is that stress-induced flowering was observed in S. sebiferum for the first time. According to the results of semi-quantitative PCR, the expression tendencies of the flowering-related genes GA1, AP2 and CRY2 accorded with those of stress-related genes such as GRX50435 and PRXⅡ39562. This transcriptome provides functional genomic information for further research on S. sebiferum, especially for genetic engineering to shorten the juvenile period and improve yield by regulating flower development. It also offers a useful database for research on other plants of the Euphorbiaceae family.

  19. Sequence comparison of prefrontal cortical brain transcriptome from a tame and an aggressive silver fox (Vulpes vulpes)

    Directory of Open Access Journals (Sweden)

    Sun Qi

    2011-10-01

    Full Text Available Abstract Background Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression towards humans. To understand the genetic differences underlying these behavioral phenotypes, fox-specific genomic resources are needed. Results cDNA from mRNA from the pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (>2.5 million reads and 0.9 Gbase of tame fox sequence; >3.3 million reads and 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high-confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using the SwissProt and dog RefSeq databases. An at least 2-fold expression difference between the two samples (p Conclusions Transcriptome sequencing significantly expanded the genomic resources available for the fox, a species without a sequenced genome. In a very cost-efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex, expression differences between the two fox samples, and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information.

  20. HIV Structural Database

    Science.gov (United States)

    SRD 102 HIV Structural Database (Web, free access)   The HIV Protease Structural Database is an archive of experimentally determined 3-D structures of Human Immunodeficiency Virus 1 (HIV-1), Human Immunodeficiency Virus 2 (HIV-2) and Simian Immunodeficiency Virus (SIV) Proteases and their complexes with inhibitors or products of substrate cleavage.

  1. HEROD: a human ethnic and regional specific omics database.

    Science.gov (United States)

    Zeng, Xian; Tao, Lin; Zhang, Peng; Qin, Chu; Chen, Shangying; He, Weidong; Tan, Ying; Xia Liu, Hong; Yang, Sheng Yong; Chen, Zhe; Jiang, Yu Yang; Chen, Yu Zong

    2017-10-15

    Genetic and gene expression variations within and between populations and across geographical regions have substantial effects on the biological phenotypes, diseases, and therapeutic response. The development of precision medicines can be facilitated by the OMICS studies of the patients of specific ethnicity and geographic region. However, there is an inadequate facility for broadly and conveniently accessing the ethnic and regional specific OMICS data. Here, we introduced a new free database, HEROD, a human ethnic and regional specific OMICS database. Its first version contains the gene expression data of 53 070 patients of 169 diseases in seven ethnic populations from 193 cities/regions in 49 nations curated from the Gene Expression Omnibus (GEO), the ArrayExpress Archive of Functional Genomics Data (ArrayExpress), the Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC). Geographic region information of curated patients was mainly manually extracted from referenced publications of each original study. These data can be accessed and downloaded via keyword search, World map search, and menu-bar search of disease name, the international classification of disease code, geographical region, location of sample collection, ethnic population, gender, age, sample source organ, patient type (patient or healthy), sample type (disease or normal tissue) and assay type on the web interface. The HEROD database is freely accessible at http://bidd2.nus.edu.sg/herod/index.php. The database and web interface are implemented in MySQL, PHP and HTML with all major browsers supported. Contact: phacyz@nus.edu.sg.

  2. A Long-Read Transcriptome Assembly of Cotton (Gossypium hirsutum L.) and Intraspecific Single Nucleotide Polymorphism Discovery

    Directory of Open Access Journals (Sweden)

    Hamid Ashrafi

    2015-07-01

    Full Text Available Upland cotton (Gossypium hirsutum L.) has a narrow germplasm base, which constrains marker development and hampers intraspecific breeding. A pressing need exists for high-throughput single nucleotide polymorphism (SNP) markers that can be readily applied to germplasm in breeding and breeding-related research programs. Despite progress made in developing new sequencing technologies during the past decade, the cost of sequencing remains substantial when one is dealing with numerous samples and large genomes. Several strategies have been proposed to lower the cost of sequencing for multiple genotypes of large-genome species like cotton, such as transcriptome sequencing and reduced-representation DNA sequencing. This paper reports the development of a transcriptome assembly of the inbred line Texas Marker-1 (TM-1), a genetic standard for cotton; its usefulness as a reference for RNA sequencing (RNA-seq)-based SNP identification; and the availability of transcriptome sequences of four other cotton cultivars. An assembly of TM-1 was made using Roche 454 transcriptome reads combined with an assembly of all available public expressed sequence tag (EST) sequences of TM-1. The TM-1 assembly consists of 72,450 contigs with a total of 70 million bp. Functional predictions of the transcripts were estimated by alignment to selected protein databases. Transcriptome sequences of the five lines, including TM-1, were obtained using an Illumina Genome Analyzer-II, and the short reads were mapped to the TM-1 assembly to discover SNPs among the five lines. We identified >14,000 unfiltered allelic SNPs, of which ∼3,700 SNPs were retained for assay development after applying several rigorous filters. This paper reports the availability of the reference transcriptome assembly and shows its utility in developing intraspecific SNP markers in upland cotton.

  3. Analysis of Viral Genetics for Estimating Diffusion of Influenza A H6N1

    OpenAIRE

    Scotch, Matthew; Suchard, Marc A.; Rabinowitz, Peter M.

    2015-01-01

    H6N1 influenza A is an avian virus but in 2013 infected a human in Taiwan. We studied the phylogeography of avian origin H6N1 viruses in the Influenza Research Database and the Global Initiative on Sharing Avian Influenza Data EpiFlu Database in order to characterize their recent evolutionary spread. Our results suggest that the H6N1 virus that infected a human in Taiwan is derived from a diversity of avian strains of H6N1 that have circulated for at least seven years in this region. Understa...

  4. The "GeneTrustee": a universal identification system that ensures privacy and confidentiality for human genetic databases.

    Science.gov (United States)

    Burnett, Leslie; Barlow-Stewart, Kris; Proos, Anné L; Aizenberg, Harry

    2003-05-01

    This article describes a generic model for access to samples and information in human genetic databases. The model utilises a "GeneTrustee", a third-party intermediary independent of the subjects and of the investigators or database custodians. The GeneTrustee model has been implemented successfully in various community genetics screening programs and has facilitated research access to genetic databases while protecting the privacy and confidentiality of research subjects. The GeneTrustee model could also be applied to various types of non-conventional genetic databases, including neonatal screening Guthrie card collections, and to forensic DNA samples.

  5. Transcriptome Characterization for Non-Model Endangered Lycaenids, Protantigius superans and Spindasis takanosis, Using Illumina HiSeq 2500 Sequencing

    Directory of Open Access Journals (Sweden)

    Bharat Bhusan Patnaik

    2015-12-01

    Full Text Available The Lycaenidae butterflies, Protantigius superans and Spindasis takanosis, are endangered insects in Korea known for their symbiotic association with ants. However, necessary genomic and transcriptomics data are lacking in these species, limiting conservation efforts. In this study, the P. superans and S. takanosis transcriptomes were deciphered using Illumina HiSeq 2500 sequencing. The P. superans and S. takanosis transcriptome data included a total of 254,340,693 and 245,110,582 clean reads assembled into 159,074 and 170,449 contigs and 107,950 and 121,140 unigenes, respectively. BLASTX hits (E-value of 1.0 × 10⁻⁵) against the known protein databases annotated a total of 46,754 and 51,908 transcripts for P. superans and S. takanosis. Approximately 41.25% and 38.68% of the unigenes for P. superans and S. takanosis found homologous sequences in Protostome DB (PANM-DB). BLAST2GO analysis confirmed 18,611 unigenes representing Gene Ontology (GO) terms and a total of 5259 unigenes assigned to 116 pathways for P. superans. For S. takanosis, a total of 6697 unigenes were assigned to 119 pathways using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database. Additionally, 382,164 and 390,516 Simple Sequence Repeats (SSRs) were compiled from the unigenes of P. superans and S. takanosis, respectively. This is the first report to record new genes and their utilization for conservation of lycaenid species population and as a reference information for closely related species.
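
    The E-value cutoff mentioned in this abstract can be illustrated with a small sketch. It assumes the BLASTX results were saved in the standard 12-column tabular layout (BLAST+ `-outfmt 6`); the record does not say which output format or selection rule the authors used, so the function names `best_hits` and `annotated_fraction`, the "keep the highest bit score per query" rule, and the three example rows are hypothetical.

```python
import csv
import io

# Default column order of BLAST+ tabular output (-outfmt 6).
FIELDS = ["qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
          "qstart", "qend", "sstart", "send", "evalue", "bitscore"]


def best_hits(tabular_text, max_evalue=1e-5):
    """Return {query_id: (subject_id, evalue, bitscore)}.

    Keeps, for each query, the hit with the highest bit score among hits
    whose E-value is at most max_evalue (assumed selection rule).
    """
    best = {}
    reader = csv.reader(io.StringIO(tabular_text), delimiter="\t")
    for row in reader:
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        rec = dict(zip(FIELDS, row))
        evalue = float(rec["evalue"])
        bitscore = float(rec["bitscore"])
        if evalue > max_evalue:
            continue  # hit is weaker than the cutoff
        current = best.get(rec["qseqid"])
        if current is None or bitscore > current[2]:
            best[rec["qseqid"]] = (rec["sseqid"], evalue, bitscore)
    return best


def annotated_fraction(best, total_unigenes):
    """Fraction of unigenes that received at least one accepted hit."""
    return len(best) / total_unigenes if total_unigenes else 0.0


# Made-up rows for illustration only; not data from the study.
EXAMPLE = "\n".join([
    "unigene1\tprotA\t85.0\t120\t18\t0\t1\t360\t5\t124\t2e-40\t160",
    "unigene1\tprotB\t60.0\t100\t40\t0\t1\t300\t9\t108\t1e-12\t70.1",
    "unigene2\tprotC\t35.0\t50\t32\t1\t10\t160\t2\t51\t0.5\t25.0",
])
hits = best_hits(EXAMPLE)
print(hits, annotated_fraction(hits, 2))
```

    In this toy input, unigene2 is dropped because its only hit has an E-value above the cutoff, so one of two unigenes counts as annotated.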

  6. PATRIC, the bacterial bioinformatics database and analysis resource

    Science.gov (United States)

    Wattam, Alice R.; Abraham, David; Dalay, Oral; Disz, Terry L.; Driscoll, Timothy; Gabbard, Joseph L.; Gillespie, Joseph J.; Gough, Roger; Hix, Deborah; Kenyon, Ronald; Machi, Dustin; Mao, Chunhong; Nordberg, Eric K.; Olson, Robert; Overbeek, Ross; Pusch, Gordon D.; Shukla, Maulik; Schulman, Julie; Stevens, Rick L.; Sullivan, Daniel E.; Vonstein, Veronika; Warren, Andrew; Will, Rebecca; Wilson, Meredith J.C.; Yoo, Hyun Seung; Zhang, Chengdong; Zhang, Yan; Sobral, Bruno W.

    2014-01-01

    The Pathosystems Resource Integration Center (PATRIC) is the all-bacterial Bioinformatics Resource Center (BRC) (http://www.patricbrc.org). A joint effort by two of the original National Institute of Allergy and Infectious Diseases-funded BRCs, PATRIC provides researchers with an online resource that stores and integrates a variety of data types [e.g. genomics, transcriptomics, protein–protein interactions (PPIs), three-dimensional protein structures and sequence typing data] and associated metadata. Datatypes are summarized for individual genomes and across taxonomic levels. All genomes in PATRIC, currently more than 10 000, are consistently annotated using RAST, the Rapid Annotations using Subsystems Technology. Summaries of different data types are also provided for individual genes, where comparisons of different annotations are available, and also include available transcriptomic data. PATRIC provides a variety of ways for researchers to find data of interest and a private workspace where they can store both genomic and gene associations, and their own private data. Both private and public data can be analyzed together using a suite of tools to perform comparative genomic or transcriptomic analysis. PATRIC also includes integrated information related to disease and PPIs. All the data and integrated analysis and visualization tools are freely available. This manuscript describes updates to the PATRIC since its initial report in the 2007 NAR Database Issue. PMID:24225323

  7. Transcriptome profiling of pumpkin (Cucurbita moschata Duch.) leaves infected with powdery mildew.

    Directory of Open Access Journals (Sweden)

    Wei-Li Guo

    Full Text Available Cucurbit powdery mildew (PM) is one of the most severe fungal diseases, but the molecular mechanisms underlying PM resistance remain largely unknown, especially in pumpkin (Cucurbita moschata Duch.). The goal of this study was to identify gene expression differences in PM-treated plants (harvested at 24 h and 48 h after inoculation) and untreated (control) plants of inbred line "112-2" using RNA sequencing (RNA-Seq). The inbred line "112-2" has been purified over 8 consecutive generations of self-pollination and shows high resistance to PM. More than 7600 transcripts were examined in pumpkin leaves, and 3129 and 3080 differentially expressed genes (DEGs) were identified in inbred line "112-2" at 24 and 48 hours post inoculation (hpi), respectively. Based on the KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway database and GO (Gene Ontology) database, a complex regulatory network for PM resistance that may involve hormone signal transduction pathways, transcription factors and defense responses was revealed at the transcription level. In addition, the expression profiles of 16 selected genes were analyzed using quantitative RT-PCR. Among these genes, the transcript levels of 6 DEGs, including bHLH87 (Basic Helix-loop-helix transcription factor), ERF014 (Ethylene response factor), WRKY21 (WRKY domain), HSF (heat stress transcription factor A), MLO3 (Mildew Locus O), and SGT1 (Suppressor of G-Two Allele of Skp1), in PM-resistant "112-2" were found to be significantly up- or down-regulated both before 9 hpi and at 24 hpi or 48 hpi; this behavior differed from that observed in the PM-susceptible material (cultivar "Jiujiangjiaoding"). The transcriptome data provide novel insights into the response of Cucurbita moschata to PM stress and are expected to be highly useful for dissecting PM defense mechanisms in this major vegetable and for improving pumpkin breeding with enhanced resistance to PM.

  8. Transcriptome profiling of pumpkin (Cucurbita moschata Duch.) leaves infected with powdery mildew.

    Science.gov (United States)

    Guo, Wei-Li; Chen, Bi-Hua; Chen, Xue-Jin; Guo, Yan-Yan; Yang, He-Lian; Li, Xin-Zheng; Wang, Guang-Yin

    2018-01-01

    Cucurbit powdery mildew (PM) is one of the most severe fungal diseases, but the molecular mechanisms underlying PM resistance remain largely unknown, especially in pumpkin (Cucurbita moschata Duch.). The goal of this study was to identify gene expression differences in PM-treated plants (harvested at 24 h and 48 h after inoculation) and untreated (control) plants of inbred line "112-2" using RNA sequencing (RNA-Seq). The inbred line "112-2" has been purified over 8 consecutive generations of self-pollination and shows high resistance to PM. More than 7600 transcripts were examined in pumpkin leaves, and 3129 and 3080 differentially expressed genes (DEGs) were identified in inbred line "112-2" at 24 and 48 hours post inoculation (hpi), respectively. Based on the KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway database and GO (Gene Ontology) database, a complex regulatory network for PM resistance that may involve hormone signal transduction pathways, transcription factors and defense responses was revealed at the transcription level. In addition, the expression profiles of 16 selected genes were analyzed using quantitative RT-PCR. Among these genes, the transcript levels of 6 DEGs, including bHLH87 (Basic Helix-loop-helix transcription factor), ERF014 (Ethylene response factor), WRKY21 (WRKY domain), HSF (heat stress transcription factor A), MLO3 (Mildew Locus O), and SGT1 (Suppressor of G-Two Allele of Skp1), in PM-resistant "112-2" were found to be significantly up- or down-regulated both before 9 hpi and at 24 hpi or 48 hpi; this behavior differed from that observed in the PM-susceptible material (cultivar "Jiujiangjiaoding"). The transcriptome data provide novel insights into the response of Cucurbita moschata to PM stress and are expected to be highly useful for dissecting PM defense mechanisms in this major vegetable and for improving pumpkin breeding with enhanced resistance to PM.

  9. Identification of Human H1N2 and Human-Swine Reassortant H1N2 and H1N1 Influenza A Viruses among Pigs in Ontario, Canada (2003 to 2005)†

    OpenAIRE

    Karasin, Alexander I.; Carman, Suzanne; Olsen, Christopher W.

    2006-01-01

    Since 2003, three novel genotypes of H1 influenza viruses have been recovered from Canadian pigs, including a wholly human H1N2 virus and human-swine reassortants. These isolates demonstrate that human-lineage H1N2 viruses are infectious for pigs and that viruses with a human PB1/swine PA/swine PB2 polymerase complex can replicate in pigs.

  10. A Comparative Transcriptomic Analysis Reveals Conserved Features of Stem Cell Pluripotency in Planarians and Mammals

    Science.gov (United States)

    Labbé, Roselyne M.; Irimia, Manuel; Currie, Ko W.; Lin, Alexander; Zhu, Shu Jun; Brown, David D.R.; Ross, Eric J.; Voisin, Veronique; Bader, Gary D.; Blencowe, Benjamin J.; Pearson, Bret J.

    2014-01-01

    Many long-lived species of animals require the function of adult stem cells throughout their lives. However, the transcriptomes of stem cells in invertebrates and vertebrates have not been compared, and consequently, ancestral regulatory circuits that control stem cell populations remain poorly defined. In this study, we have used data from high-throughput RNA sequencing to compare the transcriptomes of pluripotent adult stem cells from planarians with the transcriptomes of human and mouse pluripotent embryonic stem cells. From a stringently defined set of 4,432 orthologs shared between planarians, mice and humans, we identified 123 conserved genes that are ≥5-fold differentially expressed in stem cells from all three species. Guided by this gene set, we used RNAi screening in adult planarians to discover novel stem cell regulators, which we found to affect the stem cell-associated functions of tissue homeostasis, regeneration, and stem cell maintenance. Examples of genes that disrupted these processes included the orthologs of TBL3, PSD12, TTC27, and RACK1. From these analyses, we concluded that by comparing stem cell transcriptomes from diverse species, it is possible to uncover conserved factors that function in stem cell biology. These results provide insights into which genes comprised the ancestral circuitry underlying the control of stem cell self-renewal and pluripotency. PMID:22696458
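
    The cross-species filter described in this abstract (orthologs that are ≥5-fold differentially expressed in stem cells of all three species) can be illustrated with a minimal sketch. The gene names, fold-change values, and the function `conserved_genes` below are hypothetical and are not taken from the study. The sketch also assumes that a fold change in either direction counts and that the direction need not agree across species; the abstract does not state either point.

```python
# Minimal sketch, not the authors' pipeline: keep orthologs whose
# stem-cell fold change meets a threshold in every listed species.
# Fold changes are ratios (stem cell / reference), so values <= 1/threshold
# count as down-regulated by at least the same factor (an assumption).

def conserved_genes(fold_changes, species, threshold=5.0):
    """Return sorted ortholog IDs that pass the threshold in all species.

    fold_changes: dict mapping ortholog ID -> dict of species -> ratio (> 0).
    species: iterable of species names that must all be present.
    """
    kept = []
    for gene, per_species in fold_changes.items():
        # An ortholog missing from any species cannot be called conserved.
        if any(s not in per_species for s in species):
            continue
        if all(
            per_species[s] >= threshold or per_species[s] <= 1.0 / threshold
            for s in species
        ):
            kept.append(gene)
    return sorted(kept)


# Hypothetical example data (illustrative values only).
example = {
    "orthologA": {"planarian": 6.0, "mouse": 5.0, "human": 10.0},
    "orthologB": {"planarian": 6.0, "mouse": 4.9, "human": 10.0},
    "orthologC": {"planarian": 0.1, "mouse": 0.15, "human": 0.05},
    "orthologD": {"planarian": 8.0, "mouse": 9.0},  # no human value
}
hits = conserved_genes(example, ("planarian", "mouse", "human"))
```

    With these illustrative values, only orthologA and orthologC pass: orthologB falls just under the threshold in one species, and orthologD has no value for human.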

  11. pH distribution in human tumors

    International Nuclear Information System (INIS)

    Thistlethwaite, A.J.; Leeper, D.B.; Moylan, D.J.; Nerlinger, R.E.

    1984-01-01

    pH distribution in human tumors is being determined to evaluate this parameter as a prognostic indicator of hyperthermia response. pH is measured by a modified glass pH electrode (21g, model MI 408, Microelectrodes, Inc., Londonderry, NH) inserted through an 18g open-ended Angiocath. Eight tumors have been evaluated to date; and of those, 3 were also assayed after the first heat treatment coincident with determination of blood flow. Tumors were between 2 and 5 cm, of various histologies, and of primary, recurrent, or metastatic origin. 2-4 measurements were made per tumor. Pretreatment readings were between 6.4 and 7.2 pH units. As tumor blood flow increased after 1 hr heating (41.5-43 °C), pH rose 0.1-0.3 units. Normal rat muscle yields pH readings of 7.35-7.45. Although there was considerable heterogeneity of pH within tumors, accuracy and drift were not a problem. 5-15 min were required for pH stabilization after catheter insertion and <5 min after electrode insertion. A saline wheal was used for anesthesia to preclude modification of pH by anesthetics. Patient tolerance has not been a problem. This study suggests that human tumor tissue has a preponderance of areas more acidic than normal tissue. This may serve to sensitize tumor cells to hyperthermia and provide a prognostic indicator of tumor response.

  12. Human proton/oligopeptide transporter (POT) genes

    DEFF Research Database (Denmark)

    Botka, C. W.; Wittig, T. W.; Graul, R. C.

    2000-01-01

    The proton-dependent oligopeptide transporters (POT) gene family currently consists of approximately 70 cloned cDNAs derived from diverse organisms. In mammals, two genes encoding peptide transporters, PepT1 and PepT2 have been cloned in several species including humans, in addition to a rat...... histidine/peptide transporter (rPHT1). Because the Caenorhabditis elegans genome contains five putative POT genes, we searched the available protein and nucleic acid databases for additional mammalian/human POT genes, using iterative BLAST runs and the human expressed sequence tags (EST) database. The apparent...... and introns of the likely human orthologue (termed hPHT2). Northern analyses with EST clones indicated that hPHT1 is primarily expressed in skeletal muscle and spleen, whereas hPHT2 is found in spleen, placenta, lung, leukocytes, and heart. These results suggest considerable complexity of the human POT gene...

  13. Kalium: a database of potassium channel toxins from scorpion venom.

    Science.gov (United States)

    Kuzmenkov, Alexey I; Krylov, Nikolay A; Chugunov, Anton O; Grishin, Eugene V; Vassilevski, Alexander A

    2016-01-01

    Kalium (http://kaliumdb.org/) is a manually curated database that accumulates data on potassium channel toxins purified from scorpion venom (KTx). This database is an open-access resource, and provides easy access to pages of other databases of interest, such as UniProt, PDB, NCBI Taxonomy Browser, and PubMed. General achievements of Kalium are a strict and easy regulation of KTx classification based on the unified nomenclature supported by researchers in the field, removal of peptides with partial sequence and entries supported by transcriptomic information only, classification of β-family toxins, and addition of a novel λ-family. Molecules presented in the database can be processed by the Clustal Omega server using a one-click option. Molecular masses of mature peptides are calculated and available activity data are compiled for all KTx. We believe that Kalium is not only of high interest to professional toxinologists, but also of general utility to the scientific community. Database URL: http://kaliumdb.org/. © The Author(s) 2016. Published by Oxford University Press.

  14. Gene expression data from acetaminophen-induced toxicity in human hepatic in vitro systems and clinical liver samples

    Directory of Open Access Journals (Sweden)

    Robim M. Rodrigues

    2016-06-01

    Full Text Available This data set is composed of transcriptomics analyses of (i) liver samples from patients suffering from acetaminophen-induced acute liver failure (ALF) and (ii) hepatic cell systems exposed to acetaminophen and their respective controls. The in vitro systems include widely employed cell lines, i.e. HepaRG and HepG2 cells, as well as a novel stem cell-derived model, i.e. human skin-precursors-derived hepatocyte-like cells (hSKP-HPC). Data from primary human hepatocytes was also added to the data set “Open TG-GATEs: a large-scale toxicogenomics database” (Igarashi et al., 2015 [1]). Changes in gene expression due to acetaminophen intoxication as well as comparative information between human in vivo and in vitro samples are provided. The microarray data have been deposited in NCBI's Gene Expression Omnibus and are accessible through GEO Series accession number GEO: GSE74000. The provided data is used to evaluate the predictive capacity of each hepatic in vitro system and can be directly compared with large-scale publicly available toxicogenomics databases. Further interpretation and discussion of these data feature in the corresponding research article “Toxicogenomics-based prediction of acetaminophen-induced liver injury using human hepatic cell systems” (Rodrigues et al., 2016 [2]).

  15. Generation of iPSC line epiHUVEC from human umbilical vein endothelial cells

    Directory of Open Access Journals (Sweden)

    Peggy Matz

    2015-11-01

    Full Text Available Human umbilical vein endothelial cells (HUVECs) were used to generate the iPSC line epiHUVEC employing a combination of three episomal-based plasmids expressing OCT4, SOX2, NANOG, LIN28, c-MYC and KLF4. Pluripotency was confirmed both in vivo and in vitro. The transcriptome profiles of epiHUVEC and the human embryonic stem cell line H1 have a Pearson correlation of 0.899.
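
    For readers unfamiliar with the similarity measure quoted in this abstract, the following is a minimal sketch of how a Pearson correlation between two expression profiles can be computed. The function name `pearson` and the example vectors are hypothetical; the abstract does not say which software or normalization was used.

```python
import math

# Minimal sketch of the Pearson correlation coefficient between two
# equally long expression profiles (one value per gene, same gene order).
# Not the authors' code; the input values below are illustrative only.

def pearson(x, y):
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of co-deviations and the two sums of squared deviations.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / math.sqrt(ss_x * ss_y)


# Hypothetical log-scale expression values for five genes in two samples.
sample_a = [2.0, 5.5, 7.1, 3.3, 9.0]
sample_b = [2.4, 5.0, 7.5, 2.9, 8.2]
r = pearson(sample_a, sample_b)
```

    A value close to 1 indicates that genes highly expressed in one sample tend to be highly expressed in the other.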

  16. LeishCyc: a biochemical pathways database for Leishmania major

    Directory of Open Access Journals (Sweden)

    Doyle Maria A

    2009-06-01

    biochemical networks and is a tool for analysis, interpretation, and visualization of Leishmania Omics data (transcriptomics, proteomics, metabolomics) in the context of metabolic pathways. LeishCyc is the first such database for the Trypanosomatidae family, which includes a number of other important human parasites. Flexible query/visualization capabilities are provided by the Pathway Tools software and its Web interface. The LeishCyc database is made freely available over the Internet at http://www.leishcyc.org.

  17. Identification of strong promoters based on the transcriptome of Bacillus licheniformis.

    Science.gov (United States)

    Liu, Xin; Yang, Haiyan; Zheng, Junwei; Ye, Yanrui; Pan, Li

    2017-06-01

    To expand the repertoire of strong promoters for high level expression of proteins based on the transcriptome of Bacillus licheniformis. The transcriptome of B. licheniformis ATCC14580 grown to the early stationary phase was analyzed, and the top 10 highly expressed genes/operons out of the 3959 genes and 1249 operons identified were chosen to study promoter activity. Using the beta-galactosidase gene as a reporter, the candidate promoter pBL9 exhibited the strongest activity, which was comparable to that of the widely used strong promoter p43. Furthermore, the pro-transglutaminase from Streptomyces mobaraensis (pro-MTG) was expressed under the control of promoter pBL9 and the activity of pro-MTG reached 82 U/ml after 36 h, which is 23% higher than that of promoter p43 (66.8 U/ml). In our analyses of the transcriptome of B. licheniformis, we have identified a strong promoter pBL9, which could be adapted for high level expression of proteins in the host Bacillus subtilis.

  18. Widespread uncoupling between transcriptome and translatome variations after a stimulus in mammalian cells

    Directory of Open Access Journals (Sweden)

    Tebaldi Toma

    2012-06-01

    Full Text Available Abstract Background The classical view on eukaryotic gene expression proposes the scheme of a forward flow for which fluctuations in mRNA levels upon a stimulus contribute to determine variations in mRNA availability for translation. Here we address this issue by simultaneously profiling with microarrays the total mRNAs (the transcriptome) and the polysome-associated mRNAs (the translatome) after EGF treatment of human cells, and extending the analysis to 19 other transcriptome/translatome comparisons in mammalian cells following different stimuli or undergoing cell programs. Results Triggering of the EGF pathway results in an early induction of transcriptome and translatome changes, but 90% of the significant variation is limited to the translatome and the degree of concordant changes is less than 5%. The survey of the 19 other transcriptome/translatome comparisons shows that extensive uncoupling is a general rule, in terms of both RNA movements and inferred cell activities, with a strong tendency of translation-related genes to be controlled purely at the translational level. By different statistical approaches, we finally provide evidence of the lack of dependence between changes at the transcriptome and translatome levels. Conclusions We propose a model of diffused independency between variation in transcript abundances and variation in their engagement on polysomes, which implies the existence of specific mechanisms to couple these two ways of regulating gene expression.

  19. Human health risk assessment database, "the NHSRC toxicity value database": supporting the risk assessment process at US EPA's National Homeland Security Research Center.

    Science.gov (United States)

    Moudgal, Chandrika J; Garrahan, Kevin; Brady-Roberts, Eletha; Gavrelis, Naida; Arbogast, Michelle; Dun, Sarah

    2008-11-15

    The toxicity value database of the United States Environmental Protection Agency's (EPA) National Homeland Security Research Center has been in development since 2004. The toxicity value database includes a compilation of agent property, toxicity, dose-response, and health effects data for 96 agents: 84 chemical and radiological agents and 12 biotoxins. The database is populated with multiple toxicity benchmark values and agent property information from secondary sources, with web links to the secondary sources, where available. A selected set of primary literature citations and associated dose-response data are also included. The toxicity value database offers a powerful means to quickly and efficiently gather pertinent toxicity and dose-response data for a number of agents that are of concern to the nation's security. This database, in conjunction with other tools, will play an important role in understanding human health risks, and will provide a means for risk assessors and managers to make quick and informed decisions on the potential health risks and determine appropriate responses (e.g., cleanup) to agent release. A final, stand-alone MS Access working version of the toxicity value database was completed in November 2007.

  20. Human health risk assessment database, 'the NHSRC toxicity value database': Supporting the risk assessment process at US EPA's National Homeland Security Research Center

    International Nuclear Information System (INIS)

    Moudgal, Chandrika J.; Garrahan, Kevin; Brady-Roberts, Eletha; Gavrelis, Naida; Arbogast, Michelle; Dun, Sarah

    2008-01-01

    The toxicity value database of the United States Environmental Protection Agency's (EPA) National Homeland Security Research Center has been in development since 2004. The toxicity value database includes a compilation of agent property, toxicity, dose-response, and health effects data for 96 agents: 84 chemical and radiological agents and 12 biotoxins. The database is populated with multiple toxicity benchmark values and agent property information from secondary sources, with web links to the secondary sources, where available. A selected set of primary literature citations and associated dose-response data are also included. The toxicity value database offers a powerful means to quickly and efficiently gather pertinent toxicity and dose-response data for a number of agents that are of concern to the nation's security. This database, in conjunction with other tools, will play an important role in understanding human health risks, and will provide a means for risk assessors and managers to make quick and informed decisions on the potential health risks and determine appropriate responses (e.g., cleanup) to agent release. A final, stand-alone MS Access working version of the toxicity value database was completed in November 2007.

  1. The pokeweed leaf mRNA transcriptome and its regulation by jasmonic acid.

    Directory of Open Access Journals (Sweden)

    Kira C.M. Neller

    2016-03-01

    Full Text Available The American pokeweed plant, Phytolacca americana, is recognized for synthesizing pokeweed antiviral protein (PAP), a ribosome inactivating protein (RIP) that inhibits the replication of several plant and animal viruses. The plant is also a heavy metal accumulator with applications in soil remediation. However, little is known about pokeweed stress responses, as large-scale sequencing projects have not been performed for this species. Here, we sequenced the mRNA transcriptome of pokeweed in the presence and absence of jasmonic acid (JA), a hormone mediating plant defense. Trinity-based de novo assembly of mRNA from leaf tissue and BLASTx homology searches against public sequence databases resulted in the annotation of 59 096 transcripts. Differential expression analysis identified JA-responsive genes that may be involved in defense against pathogen infection and herbivory. We confirmed the existence of several PAP isoforms and cloned a potentially novel isoform of PAP. Expression analysis indicated that PAP isoforms are differentially responsive to JA, perhaps indicating specialized roles within the plant. Finally, we identified 52 305 natural antisense transcript pairs, four of which comprised PAP isoforms, suggesting a novel form of RIP gene regulation. This transcriptome-wide study of a Phytolaccaceae family member provides a source of new genes that may be involved in stress tolerance in this plant. The sequences generated in our study have been deposited in the SRA database under project # SRP069141.

  2. Transcriptome analysis in Concholepas concholepas (Gastropoda, Muricidae): mining and characterization of new genomic and molecular markers.

    Science.gov (United States)

    Cárdenas, Leyla; Sánchez, Roland; Gomez, Daniela; Fuenzalida, Gonzalo; Gallardo-Escárate, Cristián; Tanguy, Arnaud

    2011-09-01

    The marine gastropod Concholepas concholepas, locally known as the "loco", is the main target species of the benthonic Chilean fisheries. Genetic and genomic tools are necessary to study the genome of this species in order to understand the molecular basis of its development, growth, and other key traits to improve the management strategies and to identify local adaptation to prevent loss of biodiversity. Here, we use pyrosequencing technologies to generate the first transcriptomic database from adult specimens of the loco. After trimming, a total of 140,756 Expressed Sequence Tag sequences were achieved. Clustering and assembly analysis identified 19,219 contigs and 105,435 singleton sequences. BlastN analysis showed a significant identity with Expressed Sequence Tags of different gastropod species available in public databases. Similarly, BlastX results showed that only 895 out of the total 124,654 had significant hits and may represent novel genes for marine gastropods. From this database, simple sequence repeat motifs were also identified and a total of 38 primer pairs were designed and tested to assess their potential as informative markers and to investigate their cross-species amplification in different related gastropod species. This dataset represents the first publicly available 454 data for a marine gastropod endemic to the southeastern Pacific coast, providing a valuable transcriptomic resource for future efforts of gene discovery and development of functional markers in other marine gastropods. Copyright © 2011 Elsevier B.V. All rights reserved.

  3. MetIDB: A Publicly Accessible Database of Predicted and Experimental 1H NMR Spectra of Flavonoids

    NARCIS (Netherlands)

    Mihaleva, V.V.; Beek, te T.A.; Zimmeren, van F.; Moco, S.I.A.; Laatikainen, R.; Niemitz, M.; Korhonen, S.P.; Driel, van M.A.; Vervoort, J.

    2013-01-01

    Identification of natural compounds, especially secondary metabolites, has been hampered by the lack of easy to use and accessible reference databases. Nuclear magnetic resonance (NMR) spectroscopy is the most selective technique for identification of unknown metabolites. High quality 1H NMR (proton

  4. Deep mRNA sequencing of the Tritonia diomedea brain transcriptome provides access to gene homologues for neuronal excitability, synaptic transmission and peptidergic signalling.

    Directory of Open Access Journals (Sweden)

    Adriano Senatore

    Full Text Available The sea slug Tritonia diomedea (Mollusca, Gastropoda, Nudibranchia) has a simple and highly accessible nervous system, making it useful for studying neuronal and synaptic mechanisms underlying behavior. Although many important contributions have been made using Tritonia, until now, a lack of genetic information has impeded exploration at the molecular level. We performed Illumina sequencing of central nervous system mRNAs from Tritonia, generating 133.1 million 100 base pair, paired-end reads. De novo reconstruction of the RNA-Seq data yielded a total of 185,546 contigs, which partitioned into 123,154 non-redundant gene clusters (unigenes). BLAST comparison with RefSeq and Swiss-Prot protein databases, as well as mRNA data from other invertebrates (gastropod molluscs: Aplysia californica, Lymnaea stagnalis and Biomphalaria glabrata; cnidarian: Nematostella vectensis) revealed that up to 76,292 unigenes in the Tritonia transcriptome have putative homologues in other databases, 18,246 of which are below a more stringent E-value cut-off of 1 × 10⁻⁶. In silico prediction of secreted proteins from the Tritonia transcriptome shotgun assembly (TSA) produced a database of 579 unique sequences of secreted proteins, which also exhibited markedly higher expression levels compared to other genes in the TSA. Our efforts greatly expand the availability of gene sequences available for Tritonia diomedea. We were able to extract full length protein sequences for most queried genes, including those involved in electrical excitability, synaptic vesicle release and neurotransmission, thus confirming that the transcriptome will serve as a useful tool for probing the molecular correlates of behavior in this species. We also generated a neurosecretome database that will serve as a useful tool for probing peptidergic signalling systems in the Tritonia brain.

  5. Transcriptome Analysis of Barbarea vulgaris Infested with Diamondback Moth (Plutella xylostella) Larvae

    Science.gov (United States)

    Shen, Di; Wang, Haiping; Wu, Qingjun; Lu, Peng; Qiu, Yang; Song, Jiangping; Zhang, Youjun; Li, Xixiang

    2013-01-01

    Background The diamondback moth (DBM, Plutella xylostella) is a crucifer-specific pest that causes significant crop losses worldwide. Barbarea vulgaris (Brassicaceae) can resist DBM and other herbivorous insects by producing feeding-deterrent triterpenoid saponins. Plant breeders have long aimed to transfer this insect resistance to other crops. However, a lack of knowledge on the biosynthetic pathways and regulatory networks of these insecticidal saponins has hindered their practical application. A pyrosequencing-based transcriptome analysis of B. vulgaris during DBM larval feeding was performed to identify genes and gene networks responsible for saponin biosynthesis and its regulation at the genome level. Principal Findings Approximately 1.22, 1.19, 1.16, 1.23, 1.16, 1.20, and 2.39 giga base pairs of clean nucleotides were generated from B. vulgaris transcriptomes sampled 1, 4, 8, 12, 24, and 48 h after onset of P. xylostella feeding and from non-inoculated controls, respectively. De novo assembly using all data of the seven transcriptomes generated 39,531 unigenes. A total of 37,780 (95.57%) unigenes were annotated, 14,399 of which were assigned to one or more gene ontology terms and 19,620 of which were assigned to 126 known pathways. Expression profiles revealed 2,016–4,685 up-regulated and 557–5,188 down-regulated transcripts. Secondary metabolic pathways, such as those of terpenoids, glucosinolates, and phenylpropanoids, and their related regulators were elevated. Candidate genes for the triterpene saponin pathway were found in the transcriptome. Orthological analysis of the transcriptome with four other crucifer transcriptomes identified 592 B. vulgaris-specific gene families with a P-value cutoff of 1e−5. Conclusion This study presents the first comprehensive transcriptome analysis of B. vulgaris subjected to a series of DBM feedings. The biosynthetic and regulatory pathways of triterpenoid saponins and other DBM deterrent metabolites in this plant were

  6. Comparative Transcriptomics to Identify Novel Genes and Pathways in Dinoflagellates

    Science.gov (United States)

    Ryan, D.

    2016-02-01

    The unarmored dinoflagellate Karenia brevis is among the most prominent harmful, bloom-forming phytoplankton species in the Gulf of Mexico. During blooms, the polyketides PbTx-1 and PbTx-2 (brevetoxins) are produced by K. brevis. Brevetoxins negatively impact human health and the Gulf shellfish harvest. However, the genes underlying brevetoxin synthesis are currently unknown. Because the K. brevis genome is extremely large (1 × 10¹¹ base pairs long), and with a high proportion of repetitive, non-coding DNA, it has not been sequenced. In fact, large, repetitive genomes are common among the dinoflagellate group. High-throughput RNA sequencing technology enabled us to assemble Karenia transcriptomes de novo and investigate potential genes in the brevetoxin pathway through comparative transcriptomics. The brevetoxin profile varies among K. brevis clonal cultures. For example, well-documented Wilson-CCFWC268 typically produces 8-10 pg PbTx per cell, whereas SP1 produces differences in gene expression. Of the 85,000 transcripts in the K. brevis transcriptome, 4,600 transcripts, including novel unannotated orthologs and putative polyketide synthases (PKSs), were only expressed by brevetoxin-producing K. brevis and K. papilionacea, not K. mikimotoi. Examination of gene expression between the typical- and low-toxin Wilson clones identified about 3,500 genes with significantly different expression levels, including 2 putative PKSs. One of the 2 PKSs was only found in the brevetoxin-producing Karenia species. These transcriptomes could not have been characterized without high-throughput RNA sequencing.

  7. De novo assembly and comparison of the ovarian transcriptomes of the common Chinese cuttlefish (Sepiella japonica) with different gonadal development

    Directory of Open Access Journals (Sweden)

    Zhenming Lü

    2016-03-01

    Full Text Available The common Chinese cuttlefish (Sepiella japonica) has been considered one of the most economically important marine Cephalopod species in East Asia, and seed breeding technology has been established for massive aquaculture and stock enhancement. In the present study, we used Illumina HiSeq2000 to sequence, assemble and annotate the transcriptome of the ovary tissues of S. japonica for the first time. A total of 53,116,650 and 53,446,640 reads were obtained from the immature and matured ovaries, respectively (NCBI SRA database SRX1409472 and SRX1409473), and 70,039 contigs (N50 = 1443 bp) were obtained after de novo assembly with Trinity software. Digital gene expression analysis reveals that 47,288 contigs show differential expression profiles and 793 contigs are highly expressed in the immature ovary, while 38 contigs are highly expressed in the mature ovary with FPKM >100. We hope that the ovarian transcriptome and those stage-enriched transcripts of S. japonica can provide some insight into the understanding of genome-wide transcriptome profile of cuttlefish gonad tissue and give useful information in cuttlefish gonad development. Keywords: Cuttlefish, Gonad development, Transcriptome
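
    The N50 statistic quoted for this assembly follows the standard definition: the length of the contig at which the running total of contig lengths, taken from longest to shortest, first reaches half of the total assembly length. A minimal sketch is given below; the function name `n50` and the contig lengths are hypothetical, and this is not the calculation performed by Trinity or by the authors.

```python
# Minimal sketch of the N50 assembly statistic under its common definition.
# Input: an iterable of contig lengths in base pairs (illustrative values).

def n50(lengths):
    lengths = sorted(lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")
    total = sum(lengths)
    running = 0
    for length in lengths:
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Hypothetical contig lengths (bp): total = 54, half = 27.
# Running totals from the longest contig: 10, 19, 27 -> N50 is 8.
contigs = [2, 3, 4, 5, 6, 7, 8, 9, 10]
value = n50(contigs)
```

    A larger N50 generally indicates a more contiguous assembly, although it says nothing about correctness on its own.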

  8. Pepper EST database: comprehensive in silico tool for analyzing the chili pepper (Capsicum annuum) transcriptome

    Directory of Open Access Journals (Sweden)

    Kim Woo Taek

    2008-10-01

    Full Text Available Abstract Background There is no dedicated database available for Expressed Sequence Tags (ESTs) of the chili pepper (Capsicum annuum), although the interest in a chili pepper EST database is increasing internationally due to the nutritional, economic, and pharmaceutical value of the plant. Recent advances in high-throughput sequencing of the ESTs of chili pepper cv. Bukang have produced hundreds of thousands of complementary DNA (cDNA) sequences. Therefore, a chili pepper EST database was designed and constructed to enable comprehensive analysis of chili pepper gene expression in response to biotic and abiotic stresses. Results We built the Pepper EST database to mine the complexity of chili pepper ESTs. The database was built on 122,582 sequenced ESTs and 116,412 refined ESTs from 21 pepper EST libraries. The ESTs were clustered and assembled into virtual consensus cDNAs and the cDNAs were assigned to metabolic pathway, Gene Ontology (GO), and MIPS Functional Catalogue (FunCat). The Pepper EST database is designed to provide a workbench for (i) identifying unigenes in pepper plants, (ii) analyzing expression patterns in different developmental tissues and under conditions of stress, and (iii) comparing the ESTs with those of other members of the Solanaceae family. The Pepper EST database is freely available at http://genepool.kribb.re.kr/pepper/. Conclusion The Pepper EST database is expected to provide a high-quality resource, which will contribute to gaining a systemic understanding of plant diseases and facilitate genetics-based population studies. The database is also expected to contribute to analysis of gene synteny as part of the chili pepper sequencing project by mapping ESTs to the genome.

  9. Metabolomic Dynamic Analysis of Hypoxia in MDA-MB-231 and the Comparison with Inferred Metabolites from Transcriptomics Data

    Directory of Open Access Journals (Sweden)

    Yufeng Jane Tseng

    2013-05-01

    Full Text Available Hypoxia affects the tumor microenvironment and is considered important to metastasis progression and therapy resistance. Thus far, the majority of global analyses of tumor hypoxia responses have been limited to just a single omics level. Combining multiple omics data can broaden our understanding of tumor hypoxia. Here, we investigate the temporal change of the metabolite composition with gene expression data from literature to provide a more comprehensive insight into the system level in response to hypoxia. Nuclear magnetic resonance spectroscopy was used to perform metabolomic profiling on the MDA-MB-231 breast cancer cell line under hypoxic conditions. Multivariate statistical analysis revealed that the metabolic difference between hypoxia and normoxia was similar over 24 h, but became distinct over 48 h. Time dependent microarray data from the same cell line in the literature displayed different gene expressions under hypoxic and normoxic conditions mostly at 12 h or earlier. The direct metabolomic profiles show a large overlap with theoretical metabolic profiles deduced from previous transcriptomic studies. Consistent pathways are glycolysis/gluconeogenesis, pyruvate, purine and arginine and proline metabolism. Ten metabolic pathways revealed by metabolomics were not covered by the downstream of the known transcriptomic profiles, suggesting new metabolic phenotypes. These results confirm previous transcriptomics understanding and expand the knowledge from existing models on correlation and co-regulation between transcriptomic and metabolomics profiles, which demonstrates the power of integrated omics analysis.

  10. Cross-Tissue Transcriptomic Analysis of Human Secondary Lymphoid Organ-Residing ILC3s Reveals a Quiescent State in the Absence of Inflammation

    Directory of Open Access Journals (Sweden)

    Yotam E. Bar-Ephraim

    2017-10-01

    Full Text Available A substantial number of human and mouse group 3 innate lymphoid cells (ILC3s) reside in secondary lymphoid organs, yet the phenotype and function of these ILC3s is incompletely understood. Here, we employed an unbiased cross-tissue transcriptomic approach to compare human ILC3s from non-inflamed lymph nodes and spleen to their phenotypic counterparts in inflamed tonsils and from circulation. These analyses revealed that, in the absence of inflammation, lymphoid organ-residing ILC3s lack transcription of cytokines associated with classical ILC3 functions. This was independent of expression of the natural cytotoxicity receptor NKp44. However, and in contrast to ILC3s from peripheral blood, lymphoid organ-residing ILC3s express activating cytokine receptors and have acquired the ability to be recruited into immune responses by inflammatory cytokines. This comprehensive cross-tissue dataset will allow for identification of functional changes in human lymphoid organ ILC3s associated with human disease.

  11. Massively parallel sequencing and analysis of the Necator americanus transcriptome.

    Directory of Open Access Journals (Sweden)

    Cinzia Cantacessi

    2010-05-01

    Full Text Available The blood-feeding hookworm Necator americanus infects hundreds of millions of people worldwide. In order to elucidate fundamental molecular biological aspects of this hookworm, the transcriptome of the adult stage of Necator americanus was explored using next-generation sequencing and bioinformatic analyses. A total of 19,997 contigs were assembled from the sequence data; 6,771 of these contigs had known orthologues in the free-living nematode Caenorhabditis elegans, and most of them encoded proteins with WD40 repeats (10.6%), proteinase inhibitors (7.8%) or calcium-binding EF-hand proteins (6.7%). Bioinformatic analyses inferred that the C. elegans homologues are involved mainly in biological pathways linked to ribosome biogenesis (70%), oxidative phosphorylation (63%) and/or proteases (60%); most of these molecules were predicted to be involved in more than one biological pathway. Comparative analyses of the transcriptomes of N. americanus and the canine hookworm, Ancylostoma caninum, revealed qualitative and quantitative differences. For instance, proteinase inhibitors were inferred to be highly represented in the former species, whereas SCP/Tpx-1/Ag5/PR-1/Sc7 proteins (= SCP/TAPS or Ancylostoma-secreted proteins) were predominant in the latter. In N. americanus, essential molecules were predicted using a combination of orthology mapping and functional data available for C. elegans. Further analyses allowed the prioritization of 18 predicted drug targets which did not have homologues in the human host. These candidate targets were inferred to be linked to mitochondrial (e.g., processing proteins) or amino acid metabolism (e.g., asparagine t-RNA synthetase). This study has provided detailed insights into the transcriptome of the adult stage of N. americanus and examines similarities and differences between this species and A. caninum. Future efforts should focus on comparative transcriptomic and proteomic investigations of the other predominant human

  12. Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale Unigene Assembly and SSR Marker Discovery

    Science.gov (United States)

    Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi

    2013-01-01

    Background: Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only a few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings: Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in the NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotype analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance: SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799

  13. Transcriptome analysis of the Asian honey bee Apis cerana cerana.

    Directory of Open Access Journals (Sweden)

    Zi Long Wang

    Full Text Available BACKGROUND: The Eastern hive honey bee, Apis cerana cerana, is a native and widely bred honey bee species in China. Molecular biology research about this honey bee species is scarce, and genomic information for A. c. cerana is not currently available. Transcriptome and expression profiling data for this species are therefore important resources needed to better understand the biological mechanisms of A. c. cerana. In this study, we obtained the transcriptome information of A. c. cerana by RNA-sequencing and compared gene expression differences between queens and workers of A. c. cerana by digital gene expression (DGE) analysis. RESULTS: Using high-throughput Illumina RNA sequencing we obtained 51,581,510 clean reads corresponding to 4.64 Gb total nucleotides from a single run. These reads were assembled into 46,999 unigenes with a mean length of 676 bp. Based on a sequence similarity search against the five public databases (NR, Swiss-Prot, GO, COG, KEGG) with a cut-off E-value of 10^-5 using BLASTX, a total of 24,630 unigenes were annotated with gene descriptions, gene ontology terms, or metabolic pathways. Using these transcriptome data as references we analyzed the gene expression differences between the queens and workers of A. c. cerana using a tag-based digital gene expression method. We obtained 5.96 and 5.66 million clean tags from the queen and worker samples, respectively. A total of 414 genes were differentially expressed between them, with 189 up-regulated and 225 down-regulated in queens. CONCLUSIONS: Our transcriptome data provide a comprehensive sequence resource for future A. c. cerana study, establishing an important public information platform for functional genomic studies in A. c. cerana. Furthermore, the DGE data provide comprehensive gene expression information for the queens and workers, which will facilitate our understanding of the molecular mechanisms of the different physiological aspects of the two castes.

  14. Liver transcriptome profile in pigs with extreme phenotypes of intramuscular fatty acid composition

    Directory of Open Access Journals (Sweden)

    Ramayo-Caldas Yuliaxis

    2012-10-01

    Full Text Available Abstract Background: New advances in high-throughput technologies have allowed for the massive analysis of genomic data, providing new opportunities for the characterization of the transcriptome architectures. Recent studies in pigs have employed RNA-Seq to explore the transcriptome of different tissues in a reduced number of animals. The main goal of this study was the identification of differentially-expressed genes in the liver of Iberian x Landrace crossbred pigs showing extreme phenotypes for intramuscular fatty acid composition using RNA-Seq. Results: The liver transcriptomes of two female groups (H and L) with phenotypically extreme intramuscular fatty acid composition were sequenced using RNA-Seq. A total of 146 and 180 unannotated protein-coding genes were identified in intergenic regions for the L and H groups, respectively. In addition, a range of 5.8 to 7.3% of repetitive elements was found, with SINEs being the most abundant elements. The expression in liver of 186 (L) and 270 (H) lncRNAs was also detected. The high reproducibility of the RNA-Seq data was validated by RT-qPCR and porcine expression microarrays, therefore showing a strong correlation between RT-qPCR and RNA-Seq data (ranging from 0.79 to 0.96), as well as between microarrays and RNA-Seq (r=0.72). A differential expression analysis between H and L animals identified 55 genes differentially-expressed between groups. Pathways analysis revealed that these genes belong to biological functions, canonical pathways and three gene networks related to lipid and fatty acid metabolism. In concordance with the phenotypic classification, the pathways analysis inferred that linolenic and arachidonic acids metabolism was altered between extreme individuals. In addition, a connection was observed among the top three networks, hence suggesting that these genes are interconnected and play an important role in lipid and fatty acid metabolism. Conclusions: In the present study RNA-Seq was used

  15. Transcriptome patterns from primary cutaneous Leishmania braziliensis infections associate with eventual development of mucosal disease in humans.

    Directory of Open Access Journals (Sweden)

    Ana Claudia Maretti-Mira

    Full Text Available INTRODUCTION: Localized Cutaneous Leishmaniasis (LCL) and Mucosal Leishmaniasis (ML) are two extreme clinical forms of American Tegumentary Leishmaniasis that usually begin as solitary primary cutaneous lesions. Host and parasite factors that influence the progression of LCL to ML are not completely understood. In this manuscript, we compare the gene expression profiles of primary cutaneous lesions from patients who eventually developed ML to those that did not. METHODS: Using RNA-seq, we analyzed both the human and Leishmania transcriptomes in primary cutaneous lesions. RESULTS: A limited number of reads mapping to Leishmania transcripts was obtained. For human transcripts, compared to ML patients, lesions from LCL patients displayed a general multi-polarization of the adaptive immune response and showed up-regulation of genes involved in chemoattraction of innate immune cells and in antigen presentation. We also identified a potential transcriptional signature in the primary lesions that may predict long-term disease outcome. CONCLUSIONS: We were able to simultaneously sequence both human and Leishmania mRNA transcripts in primary cutaneous leishmaniasis lesions. Our results suggest an intrinsic difference in the immune capacity of LCL and ML patients. The findings correlate the complete cure of L. braziliensis infection with a controlled inflammatory response and a balanced activation of innate and adaptive immunity.

  16. Exploring Triacylglycerol Biosynthetic Pathway in Developing Seeds of Chia (Salvia hispanica L.): A Transcriptomic Approach

    OpenAIRE

    R. V., Sreedhar; Kumari, Priya; Rupwate, Sunny D.; Rajasekharan, Ram; Srinivasan, Malathi

    2015-01-01

    Chia (Salvia hispanica L.), a member of the mint family (Lamiaceae), is a rediscovered crop with great importance in health and nutrition and is also the highest known terrestrial plant source of heart-healthy omega-3 fatty acid, alpha linolenic acid (ALA). At present, there is no public genomic information or database available for this crop, hindering research on its genetic improvement through genomics-assisted breeding programs. The first comprehensive analysis of the global transcriptome...

  17. Simultaneous sequencing of coding and noncoding RNA reveals a human transcriptome dominated by a small number of highly expressed noncoding genes.

    Science.gov (United States)

    Boivin, Vincent; Deschamps-Francoeur, Gabrielle; Couture, Sonia; Nottingham, Ryan M; Bouchard-Bourelle, Philia; Lambowitz, Alan M; Scott, Michelle S; Abou-Elela, Sherif

    2018-07-01

    Comparing the abundance of one RNA molecule to another is crucial for understanding cellular functions but most sequencing techniques can target only specific subsets of RNA. In this study, we used a new fragmented ribodepleted TGIRT sequencing method that uses a thermostable group II intron reverse transcriptase (TGIRT) to generate a portrait of the human transcriptome depicting the quantitative relationship of all classes of nonribosomal RNA longer than 60 nt. Comparison between different sequencing methods indicated that FRT is more accurate in ranking both mRNA and noncoding RNA than viral reverse transcriptase-based sequencing methods, even those that specifically target these species. Measurements of RNA abundance in different cell lines using this method correlate with biochemical estimates, confirming tRNA as the most abundant nonribosomal RNA biotype. However, the single most abundant transcript is 7SL RNA, a component of the signal recognition particle. Structured noncoding RNAs (sncRNAs) associated with the same biological process are expressed at similar levels, with the exception of RNAs with multiple functions like U1 snRNA. In general, sncRNAs forming RNPs are hundreds to thousands of times more abundant than their mRNA counterparts. Surprisingly, only 50 sncRNA genes produce half of the non-rRNA transcripts detected in two different cell lines. Together the results indicate that the human transcriptome is dominated by a small number of highly expressed sncRNAs specializing in functions related to translation and splicing. © 2018 Boivin et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  18. The Future of Asset Management for Human Space Exploration: Supply Classification and an Integrated Database

    Science.gov (United States)

    Shull, Sarah A.; Gralla, Erica L.; deWeck, Olivier L.; Shishko, Robert

    2006-01-01

    One of the major logistical challenges in human space exploration is asset management. This paper presents observations on the practice of asset management in support of human space flight to date and discusses a functional-based supply classification and a framework for an integrated database that could be used to improve asset management and logistics for human missions to the Moon, Mars and beyond.

  19. Coupled Transcriptome and Proteome Analysis of Human Lymphotropic Tumor Viruses: Insights on the Detection and Discovery of Viral Genes

    Energy Technology Data Exchange (ETDEWEB)

    Dresang, Lindsay R.; Teuton, Jeremy R.; Feng, Huichen; Jacobs, Jon M.; Camp, David G.; Purvine, Samuel O.; Gritsenko, Marina A.; Li, Zhihua; Smith, Richard D.; Sugden, Bill; Moore, Patrick S.; Chang, Yuan

    2011-12-20

    Kaposi's sarcoma-associated herpesvirus (KSHV) and Epstein-Barr virus (EBV) are related human tumor viruses that cause primary effusion lymphomas (PEL) and Burkitt's lymphomas (BL), respectively. Viral genes expressed in naturally-infected cancer cells contribute to disease pathogenesis; knowing which viral genes are expressed is critical in understanding how these viruses cause cancer. To evaluate the expression of viral genes, we used high-resolution separation and mass spectrometry coupled with custom tiling arrays to align the viral proteomes and transcriptomes of three PEL and two BL cell lines under latent and lytic culture conditions. Results: The majority of viral genes were efficiently detected at the transcript and/or protein level on manipulating the viral life cycle. Overall the correlation of expressed viral proteins and transcripts was highly complementary in both validating and providing orthogonal data with latent/lytic viral gene expression. Our approach also identified novel viral genes in both KSHV and EBV, and extends viral genome annotation. Several previously uncharacterized genes were validated at both transcript and protein levels. Conclusions: This systems biology approach coupling proteome and transcriptome measurements provides a comprehensive view of viral gene expression that could not have been attained using each methodology independently. Detection of viral proteins in combination with viral transcripts is a potentially powerful method for establishing virus-disease relationships.

  20. Transcriptome sequencing and annotation for the Jamaican fruit bat (Artibeus jamaicensis).

    Directory of Open Access Journals (Sweden)

    Timothy I Shaw

    Full Text Available The Jamaican fruit bat (Artibeus jamaicensis) is one of the most common bats in the tropical Americas. It is thought to be a potential reservoir host of Tacaribe virus, an arenavirus closely related to the South American hemorrhagic fever viruses. We performed transcriptome sequencing and annotation from lung, kidney and spleen tissues using 454 and Illumina platforms to develop this species as an animal model. More than 100,000 contigs were assembled, with 25,000 genes that were functionally annotated. Of the remaining unannotated contigs, 80% were found within bat genomes or transcriptomes. Annotated genes are involved in a broad range of activities ranging from cellular metabolism to genome regulation through ncRNAs. Reciprocal BLAST best hits yielded 8,785 sequences that are orthologous to mouse, rat, cattle, horse and human. Species tree analysis of sequences from 2,378 loci was used to achieve 95% bootstrap support for the placement of bat as sister to the clade containing horse, dog, and cattle. Through substitution rate estimation between bat and human, 32 genes were identified with evidence for positive selection. We also identified 466 immune-related genes, which may be useful for studying Tacaribe virus infection of this species. The Jamaican fruit bat transcriptome dataset is a resource that should provide additional candidate markers for studying bat evolution and ecology, and tools for analysis of the host response and pathology of disease.

  1. Characterization of gonadal transcriptomes from the turbot (Scophthalmus maximus).

    Science.gov (United States)

    Hu, Yulong; Huang, Meng; Wang, Weiji; Guan, Jiantao; Kong, Jie

    2016-01-01

    The mechanisms underlying sexual reproduction and sex ratio determination remain unclear in turbot, a flatfish of great commercial value. In addition, there is limited information in the turbot database regarding genes related to the reproductive system. Here, we conducted high-throughput transcriptome profiling of turbot gonad tissues to better understand their reproductive functions and to supply essential gene sequence information for marker-assisted selection programs in the turbot industry. In this study, 454 pyrosequencing of two gonad libraries representing sex differences in Scophthalmus maximus yielded 453 818 high-quality reads that were assembled into 24 611 contigs and 33 713 singletons, 13 936 contigs and singletons (CS) of which were annotated using BLASTx. GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analyses revealed that various biological functions and processes were associated with many of the annotated CS. Expression analyses showed that 510 genes were differentially expressed in males versus females; 80% of these genes were annotated. In addition, 6484 and 6036 single nucleotide polymorphisms (SNPs) were identified in male and female libraries, respectively. This transcriptome resource will serve as the foundation for cDNA or SNP microarray construction, gene expression characterization, and sex-specific linkage mapping in turbot.

  2. Integrating Environmental and Human Health Databases in the Great Lakes Basin: Themes, Challenges and Future Directions

    Directory of Open Access Journals (Sweden)

    Kate L. Bassil

    2015-03-01

    Full Text Available Many government, academic and research institutions collect environmental data that are relevant to understanding the relationship between environmental exposures and human health. Integrating these data with health outcome data presents new challenges that are important to consider to improve our effective use of environmental health information. Our objective was to identify the common themes related to the integration of environmental and health data, and suggest ways to address the challenges and make progress toward more effective use of data already collected, to further our understanding of environmental health associations in the Great Lakes region. Environmental and human health databases were identified and reviewed using literature searches and a series of one-on-one and group expert consultations. Databases identified were predominantly environmental stressors databases, with fewer found for health outcomes and human exposure. Nine themes or factors that impact integration were identified: data availability, accessibility, harmonization, stakeholder collaboration, policy and strategic alignment, resource adequacy, environmental health indicators, and data exchange networks. The use and cost effectiveness of data currently collected could be improved by strategic changes to data collection and access systems to provide better opportunities to identify and study environmental exposures that may impact human health.

  3. Transcriptomic immune response of Tenebrio molitor pupae to parasitization by Scleroderma guani.

    Directory of Open Access Journals (Sweden)

    Jia-Ying Zhu

    Full Text Available BACKGROUND: Host and parasitoid interaction is one of the most fascinating relationships of insects, which is currently receiving increasing interest. Understanding the mechanisms evolved by the parasitoids to evade or suppress the host immune system is important for dissecting this interaction, but these mechanisms remain poorly understood. In order to gain insight into the immune response of Tenebrio molitor to parasitization by Scleroderma guani, the transcriptome of T. molitor pupae was sequenced with a focus on immune-related genes, and the non-parasitized and parasitized T. molitor pupae were analyzed by digital gene expression (DGE) analysis with special emphasis on parasitoid-induced immune-related genes using Illumina sequencing. METHODOLOGY/PRINCIPAL FINDINGS: In a single run, 264,698 raw reads were obtained. De novo assembly generated 71,514 unigenes with mean length of 424 bp. Of those unigenes, 37,373 (52.26%) showed similarity to the known proteins in the NCBI nr database. Via analysis of the transcriptome data in depth, 430 unigenes related to immunity were identified. DGE analysis revealed that parasitization by S. guani had considerable impacts on the transcriptome profile of T. molitor pupae, as indicated by the significant up- or down-regulation of 3,431 parasitism-responsive transcripts. The expression of a total of 74 unigenes involved in immune response of T. molitor was significantly altered after parasitization. CONCLUSIONS/SIGNIFICANCE: The obtained T. molitor transcriptome, in addition to establishing a fundamental resource for further research on functional genomics, has allowed the discovery of a large group of immune genes that might provide a meaningful framework to better understand the immune response in this species and other beetles. The DGE profiling data provide comprehensive T. molitor immune gene expression information at the transcriptional level following parasitization, and shed valuable light on the molecular

  4. Transcriptomic immune response of Tenebrio molitor pupae to parasitization by Scleroderma guani.

    Science.gov (United States)

    Zhu, Jia-Ying; Yang, Pu; Zhang, Zhong; Wu, Guo-Xing; Yang, Bin

    2013-01-01

    Host and parasitoid interaction is one of the most fascinating relationships of insects, which is currently receiving increasing interest. Understanding the mechanisms evolved by the parasitoids to evade or suppress the host immune system is important for dissecting this interaction, but these mechanisms remain poorly understood. In order to gain insight into the immune response of Tenebrio molitor to parasitization by Scleroderma guani, the transcriptome of T. molitor pupae was sequenced with a focus on immune-related genes, and the non-parasitized and parasitized T. molitor pupae were analyzed by digital gene expression (DGE) analysis with special emphasis on parasitoid-induced immune-related genes using Illumina sequencing. In a single run, 264,698 raw reads were obtained. De novo assembly generated 71,514 unigenes with mean length of 424 bp. Of those unigenes, 37,373 (52.26%) showed similarity to the known proteins in the NCBI nr database. Via analysis of the transcriptome data in depth, 430 unigenes related to immunity were identified. DGE analysis revealed that parasitization by S. guani had considerable impacts on the transcriptome profile of T. molitor pupae, as indicated by the significant up- or down-regulation of 3,431 parasitism-responsive transcripts. The expression of a total of 74 unigenes involved in immune response of T. molitor was significantly altered after parasitization. The obtained T. molitor transcriptome, in addition to establishing a fundamental resource for further research on functional genomics, has allowed the discovery of a large group of immune genes that might provide a meaningful framework to better understand the immune response in this species and other beetles. The DGE profiling data provide comprehensive T. molitor immune gene expression information at the transcriptional level following parasitization, and shed valuable light on the molecular understanding of the host-parasitoid interaction.

  5. Transcriptomic Immune Response of Tenebrio molitor Pupae to Parasitization by Scleroderma guani

    Science.gov (United States)

    Zhu, Jia-Ying; Yang, Pu; Zhang, Zhong; Wu, Guo-Xing; Yang, Bin

    2013-01-01

    Background: Host and parasitoid interaction is one of the most fascinating relationships of insects, which is currently receiving increasing interest. Understanding the mechanisms evolved by the parasitoids to evade or suppress the host immune system is important for dissecting this interaction, but these mechanisms remain poorly understood. In order to gain insight into the immune response of Tenebrio molitor to parasitization by Scleroderma guani, the transcriptome of T. molitor pupae was sequenced with a focus on immune-related genes, and the non-parasitized and parasitized T. molitor pupae were analyzed by digital gene expression (DGE) analysis with special emphasis on parasitoid-induced immune-related genes using Illumina sequencing. Methodology/Principal Findings: In a single run, 264,698 raw reads were obtained. De novo assembly generated 71,514 unigenes with mean length of 424 bp. Of those unigenes, 37,373 (52.26%) showed similarity to the known proteins in the NCBI nr database. Via analysis of the transcriptome data in depth, 430 unigenes related to immunity were identified. DGE analysis revealed that parasitization by S. guani had considerable impacts on the transcriptome profile of T. molitor pupae, as indicated by the significant up- or down-regulation of 3,431 parasitism-responsive transcripts. The expression of a total of 74 unigenes involved in immune response of T. molitor was significantly altered after parasitization. Conclusions/Significance: The obtained T. molitor transcriptome, in addition to establishing a fundamental resource for further research on functional genomics, has allowed the discovery of a large group of immune genes that might provide a meaningful framework to better understand the immune response in this species and other beetles. The DGE profiling data provide comprehensive T. molitor immune gene expression information at the transcriptional level following parasitization, and shed valuable light on the molecular understanding of the host

  6. Transcriptome analysis of Neisseria meningitidis in human whole blood and mutagenesis studies identify virulence factors involved in blood survival.

    Directory of Open Access Journals (Sweden)

    Hebert Echenique-Rivera

    2011-05-01

    Full Text Available During infection Neisseria meningitidis (Nm) encounters multiple environments within the host, which makes rapid adaptation a crucial factor for meningococcal survival. Despite the importance of invasion into the bloodstream in the meningococcal disease process, little is known about how Nm adapts to permit survival and growth in blood. To address this, we performed a time-course transcriptome analysis using an ex vivo model of human whole blood infection. We observed that Nm alters the expression of ≈30% of ORFs of the genome and major dynamic changes were observed in the expression of transcriptional regulators, transport and binding proteins, energy metabolism, and surface-exposed virulence factors. In particular, we found that the gene encoding the regulator Fur, as well as all genes encoding iron uptake systems, were significantly up-regulated. Analysis of regulated genes encoding surface-exposed proteins involved in Nm pathogenesis allowed us to better understand mechanisms used to circumvent host defenses. During blood infection, Nm activates genes encoding the factor H binding proteins, fHbp and NspA, genes encoding detoxifying enzymes such as SodC, Kat and AniA, as well as several less characterized surface-exposed proteins that might have a role in blood survival. Through mutagenesis studies of a subset of up-regulated genes we were able to identify new proteins important for survival in human blood and also to identify additional roles of previously known virulence factors in aiding survival in blood. Nm mutant strains lacking the genes encoding the hypothetical protein NMB1483 and the surface-exposed proteins NalP, Mip and NspA, the Fur regulator, the transferrin binding protein TbpB, and the L-lactate permease LctP were sensitive to killing by human blood. This increased knowledge of how Nm responds to adaptation in blood could also be helpful to develop diagnostic and therapeutic strategies to control the devastating

  7. Transcriptome analysis and its application in identifying genes associated with fruiting body development in basidiomycete Hypsizygus marmoreus.

    Directory of Open Access Journals (Sweden)

    Jinjing Zhang

    Full Text Available To elucidate the mechanisms of fruit body development in H. marmoreus, a total of 43609521 high-quality RNA-seq reads were obtained from four developmental stages, including the mycelial knot (H-M), mycelial pigmentation (H-V), primordium (H-P) and fruiting body (H-F) stages. These reads were assembled to obtain 40568 unigenes with an average length of 1074 bp. A total of 26800 (66.06%) unigenes were annotated and analyzed with the Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Ontology (GO), and Eukaryotic Orthologous Group (KOG) databases. Differentially expressed genes (DEGs) from the four transcriptomes were analyzed. The KEGG enrichment analysis revealed that the mycelium pigmentation stage was associated with the MAPK, cAMP, and blue light signal transduction pathways. In addition, expression of the two-component system members changed with the transition from H-M to H-V, suggesting that light affected the expression of genes related to fruit body initiation in H. marmoreus. During the transition from H-V to H-P, stress signals associated with MAPK, cAMP and ROS signals might be the most important inducers. Our data suggested that nitrogen starvation might be one of the most important factors in promoting fruit body maturation, and nitrogen metabolism and mTOR signaling pathway were associated with this process. In addition, 30 genes of interest were analyzed by quantitative real-time PCR to verify their expression profiles at the four developmental stages. This study advances our understanding of the molecular mechanism of fruiting body development in H. marmoreus by identifying a wealth of new genes that may play important roles in mushroom morphogenesis.

  8. Pathway aberrations of murine melanoma cells observed in Paired-End diTag transcriptomes

    Directory of Open Access Journals (Sweden)

    Liu Edison

    2007-06-01

    Full Text Available Abstract Background: Melanoma is the major cause of skin cancer deaths and melanoma incidence doubles every 10 to 20 years. However, little is known about melanoma pathway aberrations. Here we applied the robust Gene Identification Signature Paired End diTag (GIS-PET) approach to investigate the melanoma transcriptome and characterize the global pathway aberrations. Methods: GIS-PET technology directly links 5' mRNA signatures with their corresponding 3' signatures to generate, and then concatenate, PETs for efficient sequencing. We annotated PETs to pathways of KEGG database and compared the murine B16F1 melanoma transcriptome with three non-melanoma murine transcriptomes (Melan-a2 melanocytes, E14 embryonic stem cells, and E17.5 embryo). Gene expression levels as represented by PET counts were compared across melanoma and melanocyte libraries to identify the most significantly altered pathways and investigate the expression levels of crucial cancer genes. Results: Melanin biosynthesis genes were solely expressed in the cells of melanocytic origin, indicating the feasibility of using the PET approach for transcriptome comparison. The most significantly altered pathways were metabolic pathways, including upregulated pathways: purine metabolism, aminophosphonate metabolism, tyrosine metabolism, selenoamino acid metabolism, galactose utilization, nitrobenzene degradation, and bisphenol A degradation; and downregulated pathways: oxidative phosphorylation, ATPase synthesis, TCA cycle, pyruvate metabolism, and glutathione metabolism. The downregulated pathways concurrently indicated a slowdown of mitochondrial activities. Mitochondrial permeability was also significantly altered, as indicated by transcriptional activation of ATP/ADP, citrate/malate, Mg++, fatty acid and amino acid transporters, and transcriptional repression of zinc and metal ion transporters. Upregulation of cell cycle progression, MAPK, and PI3K/Akt pathways were more limited to certain

  9. mRNA Transcriptomics of Galectins Unveils Heterogeneous Organization in Mouse and Human Brain

    Directory of Open Access Journals (Sweden)

    Sebastian John

    2016-12-01

    Full Text Available Background: Galectins, a family of non-classically secreted, β-galactoside binding proteins, are involved in several brain disorders; however, no systematic knowledge on the normal neuroanatomical distribution and functions of galectins exists. Hence, the major purpose of this study was to understand spatial distribution and predict functions of galectins in brain and also compare the degree of conservation vs. divergence between mouse and human species. The latter objective was required to determine the relevance and appropriateness of studying galectins in mouse brain which may ultimately enable us to extrapolate the findings to human brain physiology and pathologies. Results: In order to fill this crucial gap in our understanding of brain galectins, we analyzed the in situ hybridization (ISH) and microarray data of adult mouse and human brain respectively, from the Allen Brain Atlas, to resolve each galectin-subtype’s spatial distribution across distinct brain cytoarchitecture. Next, transcription factors (TFs) that may regulate galectins were identified using TRANSFAC software and the list obtained was further curated to sort TFs on their confirmed transcript expression in the adult brain. Galectin-TF cluster analysis, gene-ontology annotations and co-expression networks were then extrapolated to predict distinct functional relevance of each galectin in the neuronal processes. The data show that galectins have highly heterogeneous expression within and across brain sub-structures and are predicted to be the crucial targets of brain enriched TFs. Lgals9 had maximal spatial distribution across mouse brain with inferred predominant roles in neurogenesis while LGALS1 was ubiquitously expressed in human. Limbic region associated with learning, memory and emotions and substantia nigra associated with motor movements showed strikingly high expression of LGALS1 and LGALS8 in human vs. mouse brain. The overall expression profile of galectin-8 was most

  10. The co-transcriptome of uropathogenic Escherichia coli-infected mouse macrophages reveals new insights into host-pathogen interactions

    KAUST Repository

    Mavromatis, Charalampos Harris; Bokil, Nilesh J.; Totsika, Makrina; Kakkanat, Asha; Schaale, Kolja; Cannistraci, Carlo V.; Ryu, Tae Woo; Beatson, Scott A.; Ulett, Glen C.; Schembri, Mark A.; Sweet, Matthew J.; Ravasi, Timothy

    2015-01-01

    Urinary tract infections (UTI) are among the most common infections in humans. Uropathogenic Escherichia coli (UPEC) can invade and replicate within bladder epithelial cells, and some UPEC strains can also survive within macrophages. To understand the UPEC transcriptional programme associated with intramacrophage survival, we performed host–pathogen co-transcriptome analyses using RNA sequencing. Mouse bone marrow-derived macrophages (BMMs) were challenged over a 24 h time course with two UPEC reference strains that possess contrasting intramacrophage phenotypes: UTI89, which survives in BMMs, and 83972, which is killed by BMMs. Neither of these strains caused significant BMM cell death at the low multiplicity of infection that was used in this study. We developed an effective computational framework that simultaneously separated, annotated and quantified the mammalian and bacterial transcriptomes. Bone marrow-derived macrophages responded to the two UPEC strains with a broadly similar gene expression programme. In contrast, the transcriptional responses of the UPEC strains diverged markedly from each other. We identified UTI89 genes up-regulated at 24 h post-infection, and hypothesized that some may contribute to intramacrophage survival. Indeed, we showed that deletion of one such gene (pspA) significantly reduced UTI89 survival within BMMs. Our study provides a technological framework for simultaneously capturing global changes at the transcriptional level in co-cultures, and has generated new insights into the mechanisms that UPEC use to persist within the intramacrophage environment.

  11. The co-transcriptome of uropathogenic Escherichia coli-infected mouse macrophages reveals new insights into host-pathogen interactions

    KAUST Repository

    Mavromatis, Charalampos Harris

    2015-01-24

    Urinary tract infections (UTI) are among the most common infections in humans. Uropathogenic Escherichia coli (UPEC) can invade and replicate within bladder epithelial cells, and some UPEC strains can also survive within macrophages. To understand the UPEC transcriptional programme associated with intramacrophage survival, we performed host–pathogen co-transcriptome analyses using RNA sequencing. Mouse bone marrow-derived macrophages (BMMs) were challenged over a 24 h time course with two UPEC reference strains that possess contrasting intramacrophage phenotypes: UTI89, which survives in BMMs, and 83972, which is killed by BMMs. Neither of these strains caused significant BMM cell death at the low multiplicity of infection that was used in this study. We developed an effective computational framework that simultaneously separated, annotated and quantified the mammalian and bacterial transcriptomes. Bone marrow-derived macrophages responded to the two UPEC strains with a broadly similar gene expression programme. In contrast, the transcriptional responses of the UPEC strains diverged markedly from each other. We identified UTI89 genes up-regulated at 24 h post-infection, and hypothesized that some may contribute to intramacrophage survival. Indeed, we showed that deletion of one such gene (pspA) significantly reduced UTI89 survival within BMMs. Our study provides a technological framework for simultaneously capturing global changes at the transcriptional level in co-cultures, and has generated new insights into the mechanisms that UPEC use to persist within the intramacrophage environment.

  12. PGG.Population: a database for understanding the genomic diversity and genetic ancestry of human populations.

    Science.gov (United States)

    Zhang, Chao; Gao, Yang; Liu, Jiaojiao; Xue, Zhe; Lu, Yan; Deng, Lian; Tian, Lei; Feng, Qidi; Xu, Shuhua

    2018-01-04

    There are a growing number of studies focusing on delineating genetic variations that are associated with complex human traits and diseases due to recent advances in next-generation sequencing technologies. However, identifying and prioritizing disease-associated causal variants relies on understanding the distribution of genetic variations within and among populations. The PGG.Population database documents 7122 genomes representing 356 global populations from 107 countries and provides essential information for researchers to understand human genomic diversity and genetic ancestry. These data and information can facilitate the design of research studies and the interpretation of results of both evolutionary and medical studies involving human populations. The database is carefully maintained and constantly updated when new data are available. We included miscellaneous functions and a user-friendly graphical interface for visualization of genomic diversity, population relationships (genetic affinity), ancestral makeup, footprints of natural selection, and population history etc. Moreover, PGG.Population provides a useful feature for users to analyze data and visualize results in a dynamic style via online illustration. The long-term ambition of the PGG.Population, together with the joint efforts from other researchers who contribute their data to our database, is to create a comprehensive depository of geographic and ethnic variation of human genome, as well as a platform bringing influence on future practitioners of medicine and clinical investigators. PGG.Population is available at https://www.pggpopulation.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Comparative genomics and transcriptome analysis of Aspergillus niger and metabolic engineering for citrate production

    Science.gov (United States)

    Yin, Xian; Shin, Hyun-dong; Li, Jianghua; Du, Guocheng; Liu, Long; Chen, Jian

    2017-01-01

    Despite a long and successful history of citrate production in Aspergillus niger, the molecular mechanism of citrate accumulation is only partially understood. In this study, we used comparative genomics and transcriptome analysis of citrate-producing strains—namely, A. niger H915-1 (citrate titer: 157 g L−1), A1 (117 g L−1), and L2 (76 g L−1)—to gain a genome-wide view of the mechanism of citrate accumulation. Compared with A. niger A1 and L2, A. niger H915-1 contained 92 mutated genes, including a succinate-semialdehyde dehydrogenase in the γ-aminobutyric acid shunt pathway and an aconitase family protein involved in citrate synthesis. Furthermore, transcriptome analysis of A. niger H915-1 revealed that the transcription levels of 479 genes changed between the cell growth stage (6 h) and the citrate synthesis stage (12 h, 24 h, 36 h, and 48 h). In the glycolysis pathway, triosephosphate isomerase was up-regulated, whereas pyruvate kinase was down-regulated. Two cytosol ATP-citrate lyases, which take part in the cycle of citrate synthesis, were up-regulated, and may coordinate with the alternative oxidases in the alternative respiratory pathway for energy balance. Finally, deletion of the oxaloacetate acetylhydrolase gene in H915-1 eliminated oxalate formation but neither influence on pH decrease nor difference in citrate production were observed. PMID:28106122

  14. RAID: a comprehensive resource for human RNA-associated (RNA–RNA/RNA–protein) interaction

    Science.gov (United States)

    Zhang, Xiaomeng; Wu, Deng; Chen, Liqun; Li, Xiang; Yang, Jinxurong; Fan, Dandan; Dong, Tingting; Liu, Mingyue; Tan, Puwen; Xu, Jintian; Yi, Ying; Wang, Yuting; Zou, Hua; Hu, Yongfei; Fan, Kaili; Kang, Juanjuan; Huang, Yan; Miao, Zhengqiang; Bi, Miaoman; Jin, Nana; Li, Kongning; Li, Xia; Xu, Jianzhen; Wang, Dong

    2014-01-01

    Transcriptomic analyses have revealed an unexpected complexity in the eukaryote transcriptome, which includes not only protein-coding transcripts but also an expanding catalog of noncoding RNAs (ncRNAs). Diverse coding and noncoding RNAs (ncRNAs) perform functions through interaction with each other in various cellular processes. In this project, we have developed RAID (http://www.rna-society.org/raid), an RNA-associated (RNA–RNA/RNA–protein) interaction database. RAID intends to provide the scientific community with all-in-one resources for efficient browsing and extraction of the RNA-associated interactions in human. This version of RAID contains more than 6100 RNA-associated interactions obtained by manually reviewing more than 2100 published papers, including 4493 RNA–RNA interactions and 1619 RNA–protein interactions. Each entry contains detailed information on an RNA-associated interaction, including RAID ID, RNA/protein symbol, RNA/protein categories, validated method, expressing tissue, literature references (Pubmed IDs), and detailed functional description. Users can query, browse, analyze, and manipulate RNA-associated (RNA–RNA/RNA–protein) interaction. RAID provides a comprehensive resource of human RNA-associated (RNA–RNA/RNA–protein) interaction network. Furthermore, this resource will help in uncovering the generic organizing principles of cellular function network. PMID:24803509

  15. Cloning and characterization of a novel human zinc finger gene, hKid3, from a C2H2-ZNF enriched human embryonic cDNA library

    International Nuclear Information System (INIS)

    Gao Li; Sun Chong; Qiu Hongling; Liu Hui; Shao Huanjie; Wang Jun; Li Wenxin

    2004-01-01

    To investigate the zinc finger genes involved in human embryonic development, we constructed a C2H2-ZNF enriched human embryonic cDNA library, from which a novel human gene named hKid3 was identified. The hKid3 cDNA encodes a 554 amino acid protein with an amino-terminal KRAB domain and 11 carboxyl-terminal C2H2 zinc finger motifs. Northern blot analysis indicates that two hKid3 transcripts of 6 and 8.5 kb are expressed in human fetal brain and kidney. The 6 kb transcript can also be detected in human adult brain, heart, and skeletal muscle while the 8.5 kb transcript appears to be embryo-specific. GFP-fused hKid3 protein is localized to nuclei and the ZF domain is necessary and sufficient for nuclear localization. To explore the DNA-binding specificity of hKid3, an oligonucleotide library was selected by GST fusion protein of hKid3 ZF domain, and the consensus core sequence 5'-CCAC-3' was evaluated by competitive electrophoretic mobility shift assay. Moreover, the KRAB domain of hKid3 exhibits transcription repressor activity when tested in GAL4 fusion protein assay. These results indicate that hKid3 may function as a transcription repressor with regulated expression pattern during human development of brain and kidney.

  16. Comparative study of the hemagglutinin and neuraminidase genes of influenza A virus H3N2, H9N2, and H5N1 subtypes using bioinformatics techniques.

    Science.gov (United States)

    Ahn, Insung; Son, Hyeon S

    2007-07-01

    To investigate the genomic patterns of influenza A virus subtypes, such as H3N2, H9N2, and H5N1, we collected 1842 sequences of the hemagglutinin and neuraminidase genes from the NCBI database and parsed them into 7 categories: accession number, host species, sampling year, country, subtype, gene name, and sequence. The sequences that were isolated from the human, avian, and swine populations were extracted and stored in a MySQL database for intensive analysis. The GC content and relative synonymous codon usage (RSCU) values were calculated using JAVA codes. As a result, correspondence analysis of the RSCU values yielded the unique codon usage pattern (CUP) of each subtype and revealed no extreme differences among the human, avian, and swine isolates. H5N1 subtype viruses exhibited little variation in CUPs compared with other subtypes, suggesting that the H5N1 CUP has not yet undergone significant changes within each host species. Moreover, some observations may be relevant to CUP variation that has occurred over time among the H3N2 subtype viruses isolated from humans. All the sequences were divided into 3 groups over time, and each group seemed to have preferred synonymous codon patterns for each amino acid, especially for arginine, glycine, leucine, and valine. The bioinformatics technique we introduce in this study may be useful in predicting the evolutionary patterns of pandemic viruses.
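    The two per-sequence statistics named in the abstract above, GC content and relative synonymous codon usage (RSCU), can be sketched as follows. This is a minimal illustration in Python, not the authors' Java code: the function names, the use of the standard genetic code, and the choice to skip incomplete or ambiguous codons are assumptions made here. RSCU for a codon is taken as its observed count divided by the count expected if all synonymous codons for that amino acid were used equally.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption;
# the abstract does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def rscu(seq):
    """RSCU per codon for every amino acid observed in an in-frame coding sequence."""
    seq = seq.upper().replace("U", "T")
    # Split into in-frame codons; a trailing partial codon and any codon
    # containing ambiguous bases are ignored.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group synonymous codons by the amino acid they encode.
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or total == 0:
            continue  # skip stop codons and amino acids absent from the sequence
        for codon in family:
            values[codon] = counts[codon] * len(family) / total
    return values
```

    For example, `rscu("GGTGGTGGCGGA")` covers only glycine, which has four synonymous codons; GGT appears twice among four glycine codons, so its RSCU is 2.0, while the unused GGG scores 0.0.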

  17. Human Ageing Genomic Resources: Integrated databases and tools for the biology and genetics of ageing

    Science.gov (United States)

    Tacutu, Robi; Craig, Thomas; Budovsky, Arie; Wuttke, Daniel; Lehmann, Gilad; Taranukha, Dmitri; Costa, Joana; Fraifeld, Vadim E.; de Magalhães, João Pedro

    2013-01-01

    The Human Ageing Genomic Resources (HAGR, http://genomics.senescence.info) is a freely available online collection of research databases and tools for the biology and genetics of ageing. HAGR features now several databases with high-quality manually curated data: (i) GenAge, a database of genes associated with ageing in humans and model organisms; (ii) AnAge, an extensive collection of longevity records and complementary traits for >4000 vertebrate species; and (iii) GenDR, a newly incorporated database, containing both gene mutations that interfere with dietary restriction-mediated lifespan extension and consistent gene expression changes induced by dietary restriction. Since its creation about 10 years ago, major efforts have been undertaken to maintain the quality of data in HAGR, while further continuing to develop, improve and extend it. This article briefly describes the content of HAGR and details the major updates since its previous publications, in terms of both structure and content. The completely redesigned interface, more intuitive and more integrative of HAGR resources, is also presented. Altogether, we hope that through its improvements, the current version of HAGR will continue to provide users with the most comprehensive and accessible resources available today in the field of biogerontology. PMID:23193293

  18. Transcriptome analysis of soiny mullet (Liza haematocheila) spleen in response to Streptococcus dysgalactiae.

    Science.gov (United States)

    Qi, Zhitao; Wu, Ping; Zhang, Qihuan; Wei, Youchuan; Wang, Zisheng; Qiu, Ming; Shao, Rong; Li, Yao; Gao, Qian

    2016-02-01

    Soiny mullet (Liza haematocheila) is becoming an economically important aquaculture mugilid species in China and other Asian countries. However, the increasing incidence of bacterial pathogenic diseases has greatly hampered the production of the soiny mullet. A deeper understanding of the soiny mullet immune system and its related genes in response to bacterial infections is necessary for disease control in this species. In this study, the transcriptomic profile of spleen from soiny mullet challenged with Streptococcus dysgalactiae was analyzed by Illumina-based paired-end sequencing method. After assembly, 86,884 unique transcript fragments (unigenes) were obtained, with an average length of 991 bp. Approximately 41,795 (48.1%) unigenes were annotated in the nr NCBI database and 57.9% of the unigenes were similar to those of the Nile tilapia. A total of 24,299 unigenes were categorized into three Gene Ontology (GO) categories (molecular function, cellular component and biological process), 13,570 unigenes into 25 functional Clusters of Orthologous Groups of proteins (COG) categories, and 30,547 unigenes were grouped into 258 known pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Following S. dysgalactiae infection, 11,461 differentially expressed unigenes were identified including 4658 up-regulated unigenes and 6803 down-regulated unigenes. Significant enrichment analysis of these differentially expressed unigenes identified major immune related pathways, including the Toll-like receptor, complement and coagulation cascades, T cell receptor signaling pathway and B cell receptor signaling pathway. In addition, 24,813 simple sequence repeats (SSRs) and 127,503 candidate single nucleotide polymorphisms (SNPs) were identified from the mullet spleen transcriptome. To this date, this study has globally analyzed the transcriptome profile from the spleen of L. haematocheila after S. dysgalactiae infection. Therefore, the results of our study

  19. Identification of genes potentially regulated by human polynucleotide phosphorylase (hPNPase old-35) using melanoma as a model.

    Directory of Open Access Journals (Sweden)

    Upneet K Sokhi

    Full Text Available Human Polynucleotide Phosphorylase (hPNPase(old-35) or PNPT1) is an evolutionarily conserved 3'→ 5' exoribonuclease implicated in the regulation of numerous physiological processes including maintenance of mitochondrial homeostasis, mtRNA import and aging-associated inflammation. From an RNase perspective, little is known about the RNA or miRNA species it targets for degradation or whose expression it regulates, except for c-myc and miR-221. To further elucidate the functional implications of hPNPase(old-35) in cellular physiology, we knocked-down and overexpressed hPNPase(old-35) in human melanoma cells and performed gene expression analyses to identify differentially expressed transcripts. Ingenuity Pathway Analysis indicated that knockdown of hPNPase(old-35) resulted in significant gene expression changes associated with mitochondrial dysfunction and cholesterol biosynthesis, whereas overexpression of hPNPase(old-35) caused global changes in cell-cycle related functions. Additionally, comparative gene expression analyses between our hPNPase(old-35) knockdown and overexpression datasets allowed us to identify 77 potential "direct" and 61 potential "indirect" targets of hPNPase(old-35) which formed correlated networks enriched for cell-cycle and wound healing functional association, respectively. These results provide a comprehensive database of genes responsive to hPNPase(old-35) expression levels, along with the identification of new potential candidate genes offering fresh insight into cellular pathways regulated by PNPT1 and which may be used in the future for possible therapeutic intervention in mitochondrial- or inflammation-associated disease phenotypes.

  20. Meta-Transcriptomic Analysis of a Chromate-Reducing Aquifer Microbial Community

    Science.gov (United States)

    Beller, H. R.; Brodie, E. L.; Han, R.; Karaoz, U.

    2010-12-01

    A major challenge for microbial ecology that has become more tractable with the advent of new molecular techniques is characterizing gene expression in complex microbial communities. We are using meta-transcriptomic analysis to characterize functional changes in an aquifer-derived, chromate-reducing microbial community as it transitions through various electron-accepting conditions. We inoculated anaerobic microcosms with groundwater from the Cr-contaminated Hanford 100H site and supplemented them with lactate and electron acceptors present at the site, namely, nitrate, sulfate, and Fe(III). The microcosms progressed successively through various electron-accepting conditions (e.g., denitrifying, sulfate-reducing, and ferric iron-reducing conditions, as well as nitrate-dependent, chemolithotrophic Fe(II)-oxidizing conditions). Cr(VI) was rapidly reduced initially and again upon further Cr(VI) amendments. Extensive geochemical sampling and analysis (e.g., lactate, acetate, chloride, nitrate, nitrite, sulfate, dissolved Cr(VI), total Fe(II)), RNA/DNA harvesting, and PhyloChip analyses were conducted. Methods were developed for removal of rRNA from total RNA in preparation for meta-transcriptome sequencing. To date, samples representing denitrifying and fermentative/sulfate-reducing conditions have been sequenced using 454 Titanium technology. Of the non-rRNA related reads for the denitrifying sample (which was also actively reducing chromate), ca. 8% were associated with denitrification and ca. 0.9% were associated with chromate resistance/transport, in contrast to the fermentative/sulfate-reducing sample (in which chromate had already been reduced), which had zero reads associated with either of these categories but many predicted proteins associated with sulfate-reducing bacteria. We observed sequences for key functional transcripts that were unique at the nucleotide level compared to the GenBank non-redundant database [such as L-lactate dehydrogenase (iron

  1. The Human Communication Research Centre dialogue database.

    Science.gov (United States)

    Anderson, A H; Garrod, S C; Clark, A; Boyle, E; Mullin, J

    1992-10-01

    The HCRC dialogue database consists of over 700 transcribed and coded dialogues from pairs of speakers aged from seven to fourteen. The speakers are recorded while tackling co-operative problem-solving tasks and the same pairs of speakers are recorded over two years tackling 10 different versions of our two tasks. In addition there are over 200 dialogues recorded between pairs of undergraduate speakers engaged on versions of the same tasks. Access to the database, and to its accompanying custom-built search software, is available electronically over the JANET system by contacting liz@psy.glasgow.ac.uk, from whom further information about the database and a user's guide to the database can be obtained.

  2. The University of Surrey database of elemental composition of human hair

    International Nuclear Information System (INIS)

    Altaf, W.J.; Akanle, O.A.; Admans, L.L.; Beasley, D.; Butler, C.; Spyrou, N.M.

    2004-01-01

    The elemental composition of human hair obtained from different studies at Surrey University over a period of 25 years has been recorded and forms part of a database, for biological and environmental samples, which is being developed. Instrumental neutron activation analysis (INAA), using reactor neutrons, was the principal method employed and from which reported data are presented. Elemental concentrations of Br, Ca, Ce, Cl, Co, Cr, Cs, F, Fe, Hf, K, Mg, Mn, Na, Rb, Sb, Sc, Se, V and Zn were obtained and recorded in the database. Chronological variations in two sets of subjects separated by a period of time of 16 years are also given. Variations in the concentration values of some elements related to the state of health and disease were reported for hair samples collected from subjects suffering from manic depression, senile dementia and breast cancer. Concentration values of some elements with relation to the nationality of subjects from Bulgaria, England, Kenya, Nigeria and Wales are presented and compared. This study is part of on-going research in the analysis of biomedical and bioenvironmental materials. The database is still in its infancy. (author)

  3. Two-dimensional gel human protein databases offer a systematic approach to the study of cell proliferation and differentiation

    DEFF Research Database (Denmark)

    Celis, Julio E.; Gesser, Borbala; Dejgaard, Kurt

    1989-01-01

    Human cellular protein databases have been established using computer-analyzed 2D gel electrophoresis. These databases, which include information on various properties of proteins, offer a global approach to the study of regulation of cell proliferation and differentiation. Furthermore, thanks...

  4. Two dimensional gel human protein databases offer a systematic approach to the study of cell proliferation and differentiation

    DEFF Research Database (Denmark)

    Celis, J E; Gesser, B; Dejgaard, K

    1989-01-01

    Human cellular protein databases have been established using computer-analyzed 2D gel electrophoresis. These databases, which include information on various properties of proteins, offer a global approach to the study of regulation of cell proliferation and differentiation. Furthermore, thanks to...

  5. Transcriptome profiling of whole blood cells identifies PLEK2 and C1QB in human melanoma.

    Directory of Open Access Journals (Sweden)

    Yuchun Luo

    Full Text Available Developing analytical methodologies to identify biomarkers in easily accessible body fluids is highly valuable for the early diagnosis and management of cancer patients. Peripheral whole blood is a "nucleic acid-rich" and "inflammatory cell-rich" information reservoir and represents systemic processes altered by the presence of cancer cells. We conducted transcriptome profiling of whole blood cells from melanoma patients. To overcome challenges associated with blood-based transcriptome analysis, we used a PAXgene™ tube and NuGEN Ovation™ globin reduction system. The combined use of these systems in microarray resulted in the identification of 78 unique genes differentially expressed in the blood of melanoma patients. Of these, 68 genes were further analyzed by quantitative reverse transcriptase PCR using blood samples from 45 newly diagnosed melanoma patients (stage I to IV) and 50 healthy control individuals. Thirty-nine genes were verified to be differentially expressed in blood samples from melanoma patients. A stepwise logit analysis selected eighteen 2-gene signatures that distinguish melanoma from healthy controls. Of these, a 2-gene signature consisting of PLEK2 and C1QB led to the best result that correctly classified 93.3% melanoma patients and 90% healthy controls. Both genes were upregulated in blood samples of melanoma patients from all stages. Further analysis using blood fractionation showed that CD45(-) and CD45(+) populations were responsible for the altered expression levels of PLEK2 and C1QB, respectively. The current study provides the first analysis of whole blood-based transcriptome biomarkers for malignant melanoma. The expression of PLEK2, the strongest gene to classify melanoma patients, in CD45(-) subsets illustrates the importance of analyzing whole blood cells for biomarker studies. The study suggests that transcriptome profiling of blood cells could be used for both early detection of melanoma and monitoring of patients

  6. Examination of Triacylglycerol Biosynthetic Pathways via De Novo Transcriptomic and Proteomic Analyses in an Unsequenced Microalga

    Science.gov (United States)

    2011-10-17

    and none of the TAG enzymatic components. Conversely, utilization of the C. vulgaris transcriptome as a search database allowed us to identify all...for conversion to biodiesel or renewable diesel and jet fuel [1,2,3]. Many of these species can also grow rapidly under a large range of environmental...overnight. Approximately 5 mg of dry biomass was suspended in chloroform-methanol (2:1, v/v), and glycerolipids were transesterified in HCl-methanol (5

  7. Sequencing and Characterization of Divergent Marbling Levels in the Beef Cattle ( Muscle Transcriptome

    Directory of Open Access Journals (Sweden)

    Dong Chen

    2015-02-01

    Full Text Available Marbling is an important trait regarding the quality of beef. Analysis of beef cattle transcriptome and its expression profile data are essential to extend the genetic information resources and would support further studies on beef cattle. RNA sequencing was performed in beef cattle using the Illumina HiSeq 2000 platform. Approximately 251.58 million clean reads were generated from a high marbling (H) group and low marbling (L) group. Approximately 80.12% of the 19,994 bovine genes (protein coding) were detected in all samples, and 749 genes exhibited differential expression between the H and L groups based on fold change (>1.5-fold, p<0.05). Multiple gene ontology terms and biological pathways were found significantly enriched among the differentially expressed genes. The transcriptome data will facilitate future functional studies on marbling formation in beef cattle and may be applied to improve breeding programs for cattle and closely related mammals.
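    The selection rule quoted in the abstract above (fold change greater than 1.5 with p < 0.05) can be sketched as a simple filter. This is a hypothetical illustration in Python: the function name, the tuple layout, and the gene names and values in the usage note are invented for the example, and treating the fold change symmetrically (up or down) is an assumption, since the abstract does not say how the statistic was computed.

```python
def differentially_expressed(genes, min_fold=1.5, max_p=0.05):
    """Return names of genes passing a fold-change and p-value cutoff.

    `genes` is an iterable of (name, mean_h, mean_l, p_value) tuples, where
    mean_h and mean_l are expression levels in the H and L groups.
    All names and values here are hypothetical.
    """
    selected = []
    for name, mean_h, mean_l, p_value in genes:
        if mean_h <= 0 or mean_l <= 0:
            continue  # a ratio is undefined without positive expression in both groups
        # Symmetric fold change: always >= 1, whichever group is higher.
        fold = max(mean_h / mean_l, mean_l / mean_h)
        if fold > min_fold and p_value < max_p:
            selected.append(name)
    return selected
```

    With the made-up input `[("geneA", 30.0, 10.0, 0.01), ("geneB", 10.0, 9.0, 0.001), ("geneC", 5.0, 20.0, 0.04), ("geneD", 40.0, 10.0, 0.2)]`, the filter keeps geneA and geneC: geneB fails the fold-change cutoff and geneD fails the p-value cutoff.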

  8. Transcriptome of the freshwater amphipod Gammarus pulex hepatopancreas

    Directory of Open Access Journals (Sweden)

    E. Gismondi

    2016-06-01

    Full Text Available So far, ecotoxicological studies have used biomarkers of exposure or of effects in order to investigate the impacts of contaminated areas on biota (Peakall, 1994 [6]). However, although these results are important in the ecotoxicological risk assessment, biomarkers are very specific and only provide information on the biological processes or physiological pathways targeted by the biomarkers experimenters choose to test (Monsinjon and Knigge, 2007 [5]). In recent years, proteomics have become a major tool in ecotoxicology, as they provide a global insight into the mechanism of action of pollutants without the need of hypothesis testing or any preconception on the biological processes likely impacted (Gismondi et al., 2015; Trapp et al., 2015 [7]; Truebano, 2016 [8]). However, the analysis of proteomic results is often limited due to the lack of databases, especially for non-model organisms, such as Gammarus sp., commonly used as biological models in ecotoxicology (Sornom et al., 2012 [11]; Vellinger et al., 2013 [9]; Gismondi and Thomé, 2014 [1]; Lebrun et al., 2014 [3]). Here, we applied Illumina HiSeq sequencing to total RNA isolated from the hepatopancreas (i.e. detoxification tissue) of Gammarus pulex males and females coming from an uncontaminated river and a contaminated river (e.g. PCB, benzo(a)pyrene). Approximately 290 M paired-end reads were assembled, filtered and sorted into 39,801 contigs, of which 10,878 were similar to proteins available in databases. The assembled contigs could represent a reference hepatopancreas transcriptome for G. pulex, and constitute an important resource for future investigations on the impacts of pollutants on invertebrate biota, since it would improve the understanding of the mechanisms of action involved in toxicity. In addition, the hepatopancreas transcriptome will also allow the identification of new potential biomarkers for the ecotoxicological risk assessments. Assembled contigs were deposited in the European

  9. A database for human performance under simulated emergencies of nuclear power plants

    International Nuclear Information System (INIS)

    Park, Jin Kyun; Jung, Won Dea

    2005-01-01

    Reliable human performance is a prerequisite for securing the safety of complicated process systems such as nuclear power plants. However, the knowledge available to explain why operators deviate from an expected performance level is very limited because real accidents are infrequent. Therefore, in this study, a database containing useful information extracted from simulated emergencies was developed in order to provide important clues for understanding the change of operators' performance under stressful conditions (i.e., real accidents). The database was developed in the Microsoft Windows™ environment using Microsoft Access 97™ and Microsoft Visual Basic 6.0™. In the database, operators' performance data obtained from the analysis of over 100 audio-visual records of simulated emergencies were stored using twenty kinds of distinctive data fields. A total of ten kinds of operators' performance data are available from the developed database. Although it is still difficult to predict operators' performance under stressful conditions based on the results of simulated emergencies, simulation studies remain the most feasible way to scrutinize performance. Accordingly, it is expected that the performance data of this study will provide a concrete foundation for understanding the change of operators' performance in emergency situations.

  10. Human events reference for ATHEANA (HERA) database description and preliminary user's manual

    International Nuclear Information System (INIS)

    Auflick, J.L.; Hahn, H.A.; Pond, D.J.

    1998-01-01

    The Technique for Human Error Analysis (ATHEANA) is a newly developed human reliability analysis (HRA) methodology that aims to facilitate better representation and integration of human performance into probabilistic risk assessment (PRA) modeling and quantification by analyzing risk-significant operating experience in the context of existing behavioral science models. The fundamental premise of ATHEANA is that error-forcing contexts (EFCs), which refer to combinations of equipment/material conditions and performance shaping factors (PSFs), set up or create the conditions under which unsafe actions (UAs) can occur. Because ATHEANA relies heavily on the analysis of operational events that have already occurred as a mechanism for generating creative thinking about possible EFCs, a database, called the Human Events Reference for ATHEANA (HERA), has been developed to support the methodology. This report documents the initial development efforts for HERA

  11. Human Events Reference for ATHEANA (HERA) Database Description and Preliminary User's Manual

    International Nuclear Information System (INIS)

    Auflick, J.L.

    1999-01-01

    The Technique for Human Error Analysis (ATHEANA) is a newly developed human reliability analysis (HRA) methodology that aims to facilitate better representation and integration of human performance into probabilistic risk assessment (PRA) modeling and quantification by analyzing risk-significant operating experience in the context of existing behavioral science models. The fundamental premise of ATHEANA is that error forcing contexts (EFCs), which refer to combinations of equipment/material conditions and performance shaping factors (PSFs), set up or create the conditions under which unsafe actions (UAs) can occur. Because ATHEANA relies heavily on the analysis of operational events that have already occurred as a mechanism for generating creative thinking about possible EFCs, a database (db) of analytical operational events, called the Human Events Reference for ATHEANA (HERA), has been developed to support the methodology. This report documents the initial development efforts for HERA

  12. De novo assembly, gene annotation, and marker discovery in stored-product pest Liposcelis entomophila (Enderlein) using transcriptome sequences.

    Directory of Open Access Journals (Sweden)

    Dan-Dan Wei

    Full Text Available BACKGROUND: As a major stored-product pest insect, Liposcelis entomophila has developed high levels of resistance to various insecticides in grain storage systems. However, the molecular mechanisms underlying resistance and environmental stress have not been characterized. To date, there is a lack of genomic information for this species. Therefore, studies aimed at profiling the L. entomophila transcriptome would provide a better understanding of the biological functions at the molecular level. METHODOLOGY/PRINCIPAL FINDINGS: We applied Illumina sequencing technology to sequence the transcriptome of L. entomophila. A total of 54,406,328 clean reads were obtained and de novo assembled into 54,220 unigenes, with an average length of 571 bp. Through a similarity search, 33,404 (61.61%) unigenes were matched to known proteins in the NCBI non-redundant (Nr) protein database. These unigenes were further functionally annotated with the gene ontology (GO), cluster of orthologous groups of proteins (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. A large number of genes potentially involved in insecticide resistance were manually curated, including 68 putative cytochrome P450 genes, 37 putative glutathione S-transferase (GST) genes, 19 putative carboxyl/cholinesterase (CCE) genes, and 126 other transcripts containing target site sequences or encoding detoxification genes representing eight types of resistance enzymes. Furthermore, to gain insight into the molecular basis of the response of L. entomophila to thermal stresses, 25 heat shock protein (Hsp) genes were identified. In addition, 1,100 SSRs and 57,757 SNPs were detected and 231 pairs of SSR primers were designed for investigating genetic diversity in the future. CONCLUSIONS/SIGNIFICANCE: We developed a comprehensive transcriptomic database for L. entomophila. These sequences and putative molecular markers would further promote our understanding of the molecular mechanisms underlying

  13. Human grasping database for activities of daily living with depth, color and kinematic data streams.

    Science.gov (United States)

    Saudabayev, Artur; Rysbek, Zhanibek; Khassenova, Raykhan; Varol, Huseyin Atakan

    2018-05-29

    This paper presents a grasping database collected from multiple human subjects for activities of daily living in unstructured environments. The main strength of this database is the use of three different sensing modalities: color images from a head-mounted action camera, distance data from a depth sensor on the dominant arm and upper body kinematic data acquired from an inertial motion capture suit. 3826 grasps were identified in the data collected during 9-hours of experiments. The grasps were grouped according to a hierarchical taxonomy into 35 different grasp types. The database contains information related to each grasp and associated sensor data acquired from the three sensor modalities. We also provide our data annotation software written in Matlab as an open-source tool. The size of the database is 172 GB. We believe this database can be used as a stepping stone to develop big data and machine learning techniques for grasping and manipulation with potential applications in rehabilitation robotics and intelligent automation.

  14. Ovary transcriptome profiling via artificial intelligence reveals a transcriptomic fingerprint predicting egg quality in striped bass, Morone saxatilis.

    Directory of Open Access Journals (Sweden)

    Robert W Chapman

    Full Text Available Inherited gene transcripts deposited in oocytes direct early embryonic development in all vertebrates, but transcript profiles indicative of embryo developmental competence have not previously been identified. We employed artificial intelligence to model profiles of maternal ovary gene expression and their relationship to egg quality, evaluated as production of viable mid-blastula stage embryos, in the striped bass (Morone saxatilis), a farmed species with serious egg quality problems. In models developed using artificial neural networks (ANNs) and supervised machine learning, collective changes in the expression of a limited suite of genes (233) accounted for 90% of the eventual variance in embryo survival. Egg quality related to minor changes in gene expression (<0.2-fold), with most individual transcripts making a small contribution (<1%) to the overall prediction of egg quality. These findings indicate that the predictive power of the transcriptome as regards egg quality resides not in levels of individual genes, but rather in the collective, coordinated expression of a suite of transcripts constituting a transcriptomic "fingerprint". Correlation analyses of the corresponding candidate genes indicated that dysfunction of the ubiquitin-26S proteasome, COP9 signalosome, and subsequent control of the cell cycle engenders embryonic developmental incompetence. The affected gene networks are centrally involved in regulation of early development in all vertebrates, including humans. By assessing collective levels of the relevant ovarian transcripts via ANNs we were able, for the first time in any vertebrate, to accurately predict the subsequent embryo developmental potential of eggs from individual females. Our results show that the transcriptomic fingerprint evidencing developmental dysfunction is highly predictive of, and therefore likely to regulate, egg quality, a biologically complex trait crucial to reproductive fitness.

  15. Estrogen and high-fat diet induced alterations in C57BL/6 mice endometrial transcriptome profile

    Directory of Open Access Journals (Sweden)

    Yali Cheng

    2017-12-01

    Full Text Available Unopposed estrogen stimulation and insulin resistance are known to play important roles in endometrial cancer (EC), but the interaction between these two factors and how they contribute to endometrial lesions are not completely elucidated. To investigate the endometrial transcriptome profile and the associated molecular pathway alterations, we established an ovariectomized C57BL/6 mouse model treated with subcutaneous implantation of a 17-β estradiol (E2) pellet and/or a high-fat diet (HFD) for 12 weeks to mimic sustained estrogen stimulation and insulin resistance. Histomorphologically, we found that both the E2 and E2 + HFD groups showed a markedly enlarged uterus and an increased number of endometrial glands. The endometrium samples were collected for microarray assay. GO and KEGG analysis showed that genes regulated by E2 and/or HFD are mainly responsible for immune response, inflammatory response and metabolic pathways. Further IPA analysis demonstrated that the acute phase response signaling, NF-κB signaling, leukocyte extravasation signaling, PPAR signaling and LXR/RXR activation pathways are mainly involved in the pathways above. In addition, the genes modulated reciprocally by E2 and/or HFD were also analyzed, and their crosstalk mainly focuses on enhancing one another’s activity. The combined analysis of microarray data and the TCGA database provided potential diagnostic or therapeutic targets for EC. Further validation was performed in mouse endometrium and human EC cell lines. In conclusion, this study unraveled the endometrial transcriptome profile alterations affected by E2 and/or HFD that may disturb endometrial homeostasis and contribute to the development of endometrial hyperplasia.

  16. Transcriptome-Derived Tetranucleotide Microsatellites and Their Associated Genes from the Giant Panda (Ailuropoda melanoleuca).

    Science.gov (United States)

    Song, Xuhao; Shen, Fujun; Huang, Jie; Huang, Yan; Du, Lianming; Wang, Chengdong; Fan, Zhenxin; Hou, Rong; Yue, Bisong; Zhang, Xiuyue

    2016-09-01

    Recently, an increasing number of microsatellites or simple sequence repeats (SSRs) have been found and characterized from transcriptomes. Such SSRs can be employed as putative functional markers to easily tag corresponding genes, which play an important role in biomedical studies and genetic analysis. However, transcriptome-derived SSRs for the giant panda (Ailuropoda melanoleuca) are not yet available. In this work, we identified and characterized 20 tetranucleotide microsatellite loci from a transcript database generated from the blood of the giant panda. Furthermore, we assigned their predicted transcriptome locations: 16 loci were assigned to untranslated regions (UTRs) and 4 loci were assigned to coding regions (CDSs). Gene identities of 14 transcripts containing the corresponding microsatellites were determined, which provides useful information for studying the potential contribution of SSRs to gene regulation in the giant panda. The polymorphic information content (PIC) values ranged from 0.293 to 0.789 with an average of 0.603 for the 16 UTR-derived SSRs. Interestingly, the 4 CDS-derived microsatellites developed in our study were also polymorphic, and the instability of these 4 CDS-derived SSRs was further validated by re-genotyping and sequencing. The genes containing these 4 CDS-derived SSRs were embedded with various types of repeat motifs. The interaction of all the length-changing SSRs might provide a safeguard against coding region frameshifts caused by microsatellite instability. We hope these new gene-associated biomarkers will pave the way for genetic and biomedical studies of the giant panda in the future. In sum, this set of transcriptome-derived markers complements the genetic resources available for the giant panda.
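    The polymorphic information content (PIC) values quoted in the record above are conventionally derived from the allele frequencies at each locus. The sketch below assumes the standard formula of Botstein et al. (1980), PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2; the record does not state which formula or software the authors used, and the function name `pic` and the example frequencies are hypothetical.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphic information content for one locus.

    Assumes the Botstein et al. (1980) definition; allele_freqs must sum to 1.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Probability that two randomly drawn alleles are identical.
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction for matings that are uninformative despite heterozygosity.
    cross_term = sum(2 * (p * q) ** 2 for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - cross_term


# Hypothetical locus with four equally frequent alleles.
print(round(pic([0.25, 0.25, 0.25, 0.25]), 6))
```

    Under this definition a locus with two equally frequent alleles scores 0.375, so values above 0.5, such as the reported average of 0.603, require more than two alleles.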

  17. Analysis of insecticide resistance-related genes of the Carmine spider mite Tetranychus cinnabarinus based on a de novo assembled transcriptome.

    Science.gov (United States)

    Xu, Zhifeng; Zhu, Wenyi; Liu, Yanchao; Liu, Xing; Chen, Qiushuang; Peng, Miao; Wang, Xiangzun; Shen, Guangmao; He, Lin

    2014-01-01

    The carmine spider mite (CSM), Tetranychus cinnabarinus, is an important pest mite in agriculture because it can develop insecticide resistance easily. To gain valuable gene information and a molecular basis for future insecticide resistance studies of CSM, the first transcriptome analysis of CSM was conducted. A total of 45,016 contigs and 25,519 unigenes were generated from the de novo transcriptome assembly, and 15,167 unigenes were annotated via BLAST querying against current databases, including nr, SwissProt, the Clusters of Orthologous Groups (COGs), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO). Aligning the transcript to Tetranychus urticae genome, the 19,255 (75.45%) of the transcripts had significant (e-value insecticide resistance in arthropod were generated from CSM transcriptome, including 53 P450-, 22 GSTs-, 23 CarEs-, 1 AChE-, 7 GluCls-, 9 nAChRs-, 8 GABA receptor-, 1 sodium channel-, 6 ATPase- and 12 Cyt b genes. We developed significant molecular resources for T. cinnabarinus putatively involved in insecticide resistance. The transcriptome assembly analysis will significantly facilitate our study of the mechanism of adaptation to environmental stress (including insecticide) in CSM at the molecular level, and will be very important for developing new control strategies against this pest mite.

  18. Current Knowledge and Recent Advances in Marine Dinoflagellate Transcriptomic Research

    Directory of Open Access Journals (Sweden)

    Muhamad Afiq Akbar

    2018-02-01

    Full Text Available Dinoflagellates are essential components in marine ecosystems, and they possess two dissimilar flagella to facilitate movement. Dinoflagellates are major components of marine food webs and of extreme importance in balancing the ecosystem energy flux in oceans. They have been reported to be the primary cause of harmful algal bloom (HAB) events around the world, causing seafood poisoning and therefore having a direct impact on human health. Interestingly, dinoflagellates in the genus Symbiodinium are major components of coral reef foundations. Knowledge regarding their genes and genome organization is currently limited due to their large genome size and other genetic and cytological characteristics that hinder whole genome sequencing of dinoflagellates. Transcriptomic approaches and genetic analyses have been employed to unravel the physiological and metabolic characteristics of dinoflagellates and their complexity. In this review, we summarize the current knowledge and findings from transcriptomic studies to understand the cell growth, effects of environmental stress, toxin biosynthesis, dynamics of HABs, phylogeny and endosymbiosis of dinoflagellates. With the advancement of high-throughput sequencing technologies and the lower cost of sequencing, transcriptomic approaches will likely deepen our understanding of other aspects of dinoflagellates’ molecular biology such as gene functional analysis, systems biology and development of model organisms.

  19. Fish-T1K (Transcriptomes of 1,000 Fishes) Project: large-scale transcriptome data for fish evolution studies.

    Science.gov (United States)

    Sun, Ying; Huang, Yu; Li, Xiaofeng; Baldwin, Carole C; Zhou, Zhuocheng; Yan, Zhixiang; Crandall, Keith A; Zhang, Yong; Zhao, Xiaomeng; Wang, Min; Wong, Alex; Fang, Chao; Zhang, Xinhui; Huang, Hai; Lopez, Jose V; Kilfoyle, Kirk; Zhang, Yong; Ortí, Guillermo; Venkatesh, Byrappa; Shi, Qiong

    2016-01-01

    Ray-finned fishes (Actinopterygii) represent more than 50 % of extant vertebrates and are of great evolutionary, ecologic and economic significance, but they are relatively underrepresented in 'omics studies. Increased availability of transcriptome data for these species will allow researchers to better understand changes in gene expression, and to carry out functional analyses. An international project known as the "Transcriptomes of 1,000 Fishes" (Fish-T1K) project has been established to generate RNA-seq transcriptome sequences for 1,000 diverse species of ray-finned fishes. The first phase of this project has produced transcriptomes from more than 180 ray-finned fishes, representing 142 species and covering 51 orders and 109 families. Here we provide an overview of the goals of this project and the work done so far.

  20. Deep sequencing-based analysis of the Cymbidium ensifolium floral transcriptome.

    Directory of Open Access Journals (Sweden)

    Xiaobai Li

    Full Text Available Cymbidium ensifolium is a Chinese Cymbidium with an elegant shape, beautiful appearance, and a fragrant aroma. C. ensifolium has a long history of cultivation in China and it has excellent commercial value as a potted plant and cut flower. The development of C. ensifolium genomic resources has been delayed because of its large genome size. Taking advantage of the technical and cost improvements of RNA-Seq, we extracted total mRNA from flower buds and mature flowers and obtained a total of 9.52 Gb of filtered nucleotides comprising 98,819,349 filtered reads. The filtered reads were assembled into 101,423 isotigs, representing 51,696 genes. Of the 101,423 isotigs, 41,873 were putative homologs of annotated sequences in the public databases, of which 158 were associated with floral development and 119 were associated with flowering. The isotigs were categorized according to their putative functions. In total, 10,212 of the isotigs were assigned to 25 eukaryotic orthologous groups (KOGs), 41,690 to 58 gene ontology (GO) terms, 9,830 to 126 Arabidopsis Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and 9,539 to 123 rice pathways. Comparison of the isotigs with those of the two related orchid species P. equestris and C. sinense showed that 17,906 isotigs are unique to C. ensifolium. In addition, a total of 7,936 SSRs and 16,676 putative SNPs were identified. To our knowledge, this transcriptome database is the first major genomic resource for C. ensifolium and the most comprehensive transcriptomic resource for the genus Cymbidium. These sequences provide valuable information for understanding the molecular mechanisms of floral development and flowering. Sequences predicted to be unique to C. ensifolium would provide more insights into C. ensifolium gene diversity. The numerous SNPs and SSRs identified in the present study will contribute to marker development for C. ensifolium.
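    SSR counts such as the one reported above come from scanning assembled sequences for short tandem repeats. The record does not say which tool or thresholds were used, so the following is only a minimal regular-expression sketch of the general idea; the function name `find_ssrs`, the default motif length, and the minimum repeat count are all illustrative assumptions.

```python
import re


def find_ssrs(seq, unit_len=2, min_repeats=6):
    """Return (start, motif, repeat_count) for perfect tandem repeats.

    Simplification: a motif such as 'ATAT' is reported as a tetranucleotide
    repeat even though it is itself a repeated dinucleotide.
    """
    # One captured motif followed by at least (min_repeats - 1) more copies.
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        if len(set(motif)) == 1:
            continue  # skip homopolymer runs such as AAAAAA...
        hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return hits


# Hypothetical transcript fragment containing an (AT)7 repeat.
print(find_ssrs("GG" + "AT" * 7 + "CC"))
```

    Dedicated SSR tools additionally handle compound and imperfect repeats, which this sketch ignores.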

  1. Human eosinophils - potential pharmacological model applied in human histamine H4 receptor research.

    Science.gov (United States)

    Grosicki, Marek; Kieć-Kononowicz, Katarzyna

    2015-01-01

    Histamine and histamine receptors are well known for their immunomodulatory role in inflammation. In this review we describe the role of histamine and the histamine H4 receptor in human eosinophils. In the first part of the article we provide a short summary of the role of histamine and histamine receptors in physiology and of the histamine-related therapeutics used in clinics. We briefly describe the human histamine H4 receptor and its ligands, as well as human eosinophils. In the second part of the review we provide a detailed description of the known effects of histamine on eosinophils, including intracellular calcium concentration flux, actin polymerization, cellular shape change, upregulation of adhesion proteins and cellular chemotaxis. We provide evidence that these effects are mainly connected with the activation of the histamine H4 receptor. When examining experimental data we discuss the controversial results and limitations of the studies performed on isolated eosinophils. In conclusion, we believe that studies of the histamine H4 receptor on human eosinophils can provide interesting new biomarkers for use in clinical studies of histamine receptors, which in future might result in the development of new strategies for the treatment of chronic inflammatory conditions, such as asthma or allergy, in which eosinophils are involved.

  2. Developmental Transcriptome for a Facultatively Eusocial Bee, Megalopta genalis

    OpenAIRE

    Jones, Beryl M.; Wcislo, William T.; Robinson, Gene E.

    2015-01-01

    Transcriptomes provide excellent foundational resources for mechanistic and evolutionary analyses of complex traits. We present a developmental transcriptome for the facultatively eusocial bee Megalopta genalis, which represents a potential transition point in the evolution of eusociality. A de novo transcriptome assembly of Megalopta genalis was generated using paired-end Illumina sequencing and the Trinity assembler. Males and females of all life stages were aligned to this transcriptome fo...

  3. Sequencing, De Novo Assembly, and Annotation of the Transcriptome of the Endangered Freshwater Pearl Bivalve, Cristaria plicata, Provides Novel Insights into Functional Genes and Marker Discovery.

    Directory of Open Access Journals (Sweden)

    Bharat Bhusan Patnaik

    Full Text Available The freshwater mussel Cristaria plicata (Bivalvia: Eulamellibranchia: Unionidae) is an economically important species in molluscan aquaculture due to its use in pearl farming. The species has been listed as endangered in South Korea due to the loss of natural habitats caused by anthropogenic activities. The decreasing population and a lack of genomic information on the species are concerning for environmentalists and conservationists. In this study, we conducted a de novo transcriptome sequencing and annotation analysis of C. plicata using Illumina HiSeq 2500 next-generation sequencing (NGS) technology, the Trinity assembler, and bioinformatics databases to prepare a sustainable resource for the identification of candidate genes involved in immunity, defense, and reproduction. The C. plicata transcriptome analysis included a total of 286,152,584 raw reads and 281,322,837 clean reads. The de novo assembly identified a total of 453,931 contigs and 374,794 non-redundant unigenes with average lengths of 731.2 and 737.1 bp, respectively. Furthermore, 100% coverage of C. plicata mitochondrial genes within two unigenes supported the quality of the assembler. In total, 84,274 unigenes showed homology to entries in at least one database, and 23,246 unigenes were allocated to one or more Gene Ontology (GO) terms. The most prominent GO biological process, cellular component, and molecular function categories (level 2) were cellular process, membrane, and binding, respectively. A total of 4,776 unigenes were mapped to 123 biological pathways in the KEGG database. Based on the GO terms and KEGG annotation, the unigenes were suggested to be involved in immunity, stress responses, sex-determination, and reproduction. A total of 17,251 cDNA simple sequence repeats (cSSRs) were identified from 61,141 unigenes (size of >1 kb), with the most abundant being dinucleotide repeats. This dataset represents the first transcriptome analysis of the endangered mollusc, C. plicata

  4. Saccharomyces genome database informs human biology

    OpenAIRE

    Skrzypek, Marek S; Nash, Robert S; Wong, Edith D; MacPherson, Kevin A; Hellerstedt, Sage T; Engel, Stacia R; Karra, Kalpana; Weng, Shuai; Sheppard, Travis K; Binkley, Gail; Simison, Matt; Miyasato, Stuart R; Cherry, J Michael

    2017-01-01

    The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is an expertly curated database of literature-derived functional information for the model organism budding yeast, Saccharomyces cerevisiae. SGD constantly strives to synergize new types of experimental data and bioinformatics predictions with existing data, and to organize them into a comprehensive and up-to-date information resource. The primary mission of SGD is to facilitate research into the biology of yeast and...

  5. GestuRe and ACtion Exemplar (GRACE) video database: stimuli for research on manners of human locomotion and iconic gestures.

    Science.gov (United States)

    Aussems, Suzanne; Kwok, Natasha; Kita, Sotaro

    2018-06-01

    Human locomotion is a fundamental class of events, and manners of locomotion (e.g., how the limbs are used to achieve a change of location) are commonly encoded in language and gesture. To our knowledge, there is no openly accessible database containing normed human locomotion stimuli. Therefore, we introduce the GestuRe and ACtion Exemplar (GRACE) video database, which contains 676 videos of actors performing novel manners of human locomotion (i.e., moving from one location to another in an unusual manner) and videos of a female actor producing iconic gestures that represent these actions. The usefulness of the database was demonstrated across four norming experiments. First, our database contains clear matches and mismatches between iconic gesture videos and action videos. Second, the male actors and female actors whose action videos matched the gestures in the best possible way, perform the same actions in very similar manners and different actions in highly distinct manners. Third, all the actions in the database are distinct from each other. Fourth, adult native English speakers were unable to describe the 26 different actions concisely, indicating that the actions are unusual. This normed stimuli set is useful for experimental psychologists working in the language, gesture, visual perception, categorization, memory, and other related domains.

  6. De novo assembly and characterization of the transcriptome, and development of SSR markers in wax gourd (Benicasa hispida).

    Directory of Open Access Journals (Sweden)

    Biao Jiang

    Full Text Available BACKGROUND: Wax gourd is a widely used vegetable of the Cucurbitaceae, and also has important medicinal and health value. However, genomic resources for wax gourd are scarce, and only a few nucleotide sequences can be obtained from public databases. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we examined the transcriptome of wax gourd. More than 44 million high-quality reads were generated from five different tissues of wax gourd using Illumina paired-end sequencing technology. Approximately 4 Gbp of data were generated and de novo assembled into 65,059 unigenes, with an N50 of 1,132 bp. Based on a sequence similarity search against known protein databases, 36,070 (55.4%) showed significant similarity to known proteins in the Nr database, and 24,969 (38.4%) had BLAST hits in the Swiss-Prot database. Among the annotated unigenes, 14,994 were assigned GO term annotations, and 23,977 were found to have COG classifications. In addition, a total of 18,713 unigenes were assigned to 281 KEGG pathways. Furthermore, 6,242 microsatellites (simple sequence repeats) were detected as potential molecular markers in wax gourd. Two hundred primer pairs for SSRs were designed for validation of amplification and polymorphism. The results showed that 170 of the 200 primer pairs amplified successfully and 49 (28.8%) of them exhibited polymorphisms. CONCLUSION/SIGNIFICANCE: Our study enriches the genomic resources of wax gourd and provides valuable information for future studies. The availability of this ample amount of information about the transcriptome and SSRs in wax gourd could serve as a valuable basis for studies on the physiology, biochemistry, molecular genetics and molecular breeding of this important vegetable crop.
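    The N50 quoted in the record above is a length-weighted summary of an assembly: the length L such that sequences of length L or longer together contain at least half of all assembled bases. The sketch below shows that common definition on made-up lengths; the function name `n50` and the toy values are illustrative and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or unigene lengths."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    # Walk from the longest sequence down until half the bases are covered.
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical unigene lengths in bp; total is 1,500 bp.
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths))
```

    Because the statistic is weighted by length, it usually exceeds the mean sequence length of the same assembly.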

  7. Leading edge analysis of transcriptomic changes during pseudorabies virus infection.

    Science.gov (United States)

    Fleming, Damarius S; Miller, Laura C

    2016-12-01

    Eight RNA samples taken from the tracheobronchial lymph nodes (TBLN) of pigs that were either infected or non-infected with a feral isolate of porcine pseudorabies virus (PRV) were used to investigate changes in gene expression related to the pathogen. The RNA was processed into fastq files for each library prior to being analyzed using Illumina Digital Gene Expression Tag Profiling sequences (DGETP) which were used as the downstream measure of differential expression. Analyzed tags consisted of 21 base pair sequences taken from time points 1, 3, 6, and 14 days' post infection (dpi) that generated 1,927,547 unique tag sequences. Tag sequences were analyzed for differential transcript expression and gene set enrichment analysis (GSEA) to uncover transcriptomic changes related to PRV pathology progression. In conjunction with the DGETP and GSEA, the study also incorporated use of leading edge analysis to help link the TBLN transcriptome data to clinical progression of PRV at each of the sampled time points. The purpose of this manuscript is to provide useful background on applying the leading edge analysis to GSEA and expression data to help identify genes considered to be of high biological interest. The data in the form of fastq files has been uploaded to the NCBI Gene Expression Omnibus (GEO) (GSE74473) database.

  8. Transcriptome Sequencing of Chemically Induced Aquilaria sinensis to Identify Genes Related to Agarwood Formation.

    Science.gov (United States)

    Ye, Wei; Wu, Hongqing; He, Xin; Wang, Lei; Zhang, Weimin; Li, Haohua; Fan, Yunfei; Tan, Guohui; Liu, Taomei; Gao, Xiaoxia

    2016-01-01

    Agarwood is a traditional Chinese medicine used as a clinical sedative, carminative, and antiemetic drug. Agarwood is formed in Aquilaria sinensis when A. sinensis trees are threatened by external physical or chemical injury or by endophytic fungal irritation. However, the mechanism of agarwood formation via chemical induction remains unclear. In this study, we characterized the transcriptome of different parts of a chemically induced A. sinensis trunk sample with agarwood. The Illumina sequencing platform was used to identify the genes involved in agarwood formation. A five-year-old Aquilaria sinensis tree treated with formic acid was selected. The white wood part (B1 sample), the transition part between agarwood and white wood (W2 sample), the agarwood part (J3 sample), and the rotten wood part (F5 sample) were collected for transcriptome sequencing. Accordingly, 54,685,634 clean reads, which were assembled into 83,467 unigenes, were obtained with a Q20 value of 97.5%. A total of 50,565 unigenes were annotated using the Nr, Nt, SWISS-PROT, KEGG, COG, and GO databases. In particular, 171,331,352 unigenes were annotated by various pathways, including the sesquiterpenoid (ko00909) and plant-pathogen interaction (ko03040) pathways. These pathways were related to sesquiterpenoid biosynthesis and defensive responses to chemical stimulation. The transcriptome data of the different parts of the chemically induced A. sinensis trunk provide a rich source of materials for discovering and identifying the genes involved in sesquiterpenoid production and in defensive responses to chemical stimulation. This study is the first to use de novo sequencing and transcriptome assembly for different parts of chemically induced A. sinensis. The results demonstrate that the sesquiterpenoid biosynthesis pathway and WRKY transcription factors play important roles in agarwood formation via chemical induction. The comparative analysis of the transcriptome data of agarwood and A. sinensis lays the foundation

  9. Combining independent de novo assemblies optimizes the coding transcriptome for nonconventional model eukaryotic organisms.

    Science.gov (United States)

    Cerveau, Nicolas; Jackson, Daniel J

    2016-12-09

    Next-generation sequencing (NGS) technologies are arguably the most revolutionary technical development to join the list of tools available to molecular biologists since PCR. For researchers working with nonconventional model organisms one major problem with the currently dominant NGS platform (Illumina) stems from the obligatory fragmentation of nucleic acid material that occurs prior to sequencing during library preparation. This step creates a significant bioinformatic challenge for accurate de novo assembly of novel transcriptome data. This challenge becomes apparent when a variety of modern assembly tools (of which there is no shortage) are applied to the same raw NGS dataset. With the same assembly parameters these tools can generate markedly different assembly outputs. In this study we present an approach that generates an optimized consensus de novo assembly of eukaryotic coding transcriptomes. This approach does not represent a new assembler, rather it combines the outputs of a variety of established assembly packages, and removes redundancy via a series of clustering steps. We test and validate our approach using Illumina datasets from six phylogenetically diverse eukaryotes (three metazoans, two plants and a yeast) and two simulated datasets derived from metazoan reference genome annotations. All of these datasets were assembled using three currently popular assembly packages (CLC, Trinity and IDBA-tran). In addition, we experimentally demonstrate that transcripts unique to one particular assembly package are likely to be bioinformatic artefacts. For all eight datasets our pipeline generates more concise transcriptomes that in fact possess more unique annotatable protein domains than any of the three individual assemblers we employed. Another measure of assembly completeness (using the purpose built BUSCO databases) also confirmed that our approach yields more information. Our approach yields coding transcriptome assemblies that are more likely to be
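    The idea of pooling the outputs of several assemblers and then removing redundancy can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the abstract does not specify the clustering steps, so exact containment (including the reverse complement) is swapped in as a much simpler stand-in, and the function names, assembler labels and toy contigs are all hypothetical.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq))


def merge_assemblies(assemblies: dict[str, list[str]]) -> list[str]:
    """Pool contigs from several assemblers and drop redundant ones.

    A contig is treated as redundant if it, or its reverse complement,
    is contained in a longer contig that has already been kept.
    This is a stand-in for real clustering, not the published method.
    """
    pooled = {seq.upper() for contigs in assemblies.values() for seq in contigs}
    kept: list[str] = []
    # Longest first, so shorter contigs are compared against longer kept ones.
    for seq in sorted(pooled, key=lambda s: (-len(s), s)):
        rc = reverse_complement(seq)
        if any(seq in k or rc in k for k in kept):
            continue
        kept.append(seq)
    return kept


# Hypothetical toy input: three assemblers run on the same reads.
toy = {
    "assembler_a": ["ATGGCCATTGTAATGGGCCGC", "ATGGCC"],
    "assembler_b": ["GCGGCCCATTACAATGGCCAT", "TTTTGGGGCCCCAAAA"],
    "assembler_c": ["ATGGCCATTGTAATGGGCCGC"],
}
merged = merge_assemblies(toy)
print(merged)
```

    In this toy case the duplicate contig, its reverse complement, and the short fragment contained in it collapse into one sequence, which mirrors the "more concise transcriptome" outcome described above. A real workflow would cluster by sequence similarity rather than exact containment.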

  10. Transcriptomic response of maize primary roots to low temperatures at seedling emergence.

    Science.gov (United States)

    Di Fenza, Mauro; Hogg, Bridget; Grant, Jim; Barth, Susanne

    2017-01-01

    Maize (Zea mays) is a C4 tropical cereal and its adaptation to temperate climates can be problematic due to low soil temperatures at early stages of establishment. In the current study we have firstly investigated the physiological response of twelve maize varieties, from a chilling condition adapted gene pool, to sub-optimal growth temperature during seedling emergence. To identify transcriptomic markers of cold tolerance in already adapted maize genotypes, temperature conditions were set below the optimal growth range in both control and low temperature groups. The conditions were as follows; control (18 °C for 16 h and 12 °C for 8 h) and low temperature (12 °C for 16 h and 6 °C for 8 h). Four genotypes were identified from the condition adapted gene pool with significant contrasting chilling tolerance. Picker and PR39B29 were the more cold-tolerant lines and Fergus and Codisco were the less cold-tolerant lines. These four varieties were subjected to microarray analysis to identify differentially expressed genes under chilling conditions. Exposure to low temperature during establishment in the maize varieties Picker, PR39B29, Fergus and Codisco, was reflected at the transcriptomic level in the varieties Picker and PR39B29. No significant changes in expression were observed in Fergus and Codisco following chilling stress. A total number of 64 genes were differentially expressed in the two chilling tolerant varieties. These two varieties exhibited contrasting transcriptomic profiles, in which only four genes overlapped. We observed that maize varieties possessing an enhanced root growth ratio under low temperature were more tolerant, which could be an early and inexpensive measure for germplasm screening under controlled conditions. We have identified novel cold inducible genes in an already adapted maize breeding gene pool. This illustrates that further varietal selection for enhanced chilling tolerance is possible in an already preselected gene pool.

  11. Transcriptomic response of maize primary roots to low temperatures at seedling emergence

    Directory of Open Access Journals (Sweden)

    Mauro Di Fenza

    2017-01-01

    Full Text Available Background Maize (Zea mays) is a C4 tropical cereal and its adaptation to temperate climates can be problematic due to low soil temperatures at early stages of establishment. Methods In the current study we have firstly investigated the physiological response of twelve maize varieties, from a chilling condition adapted gene pool, to sub-optimal growth temperature during seedling emergence. To identify transcriptomic markers of cold tolerance in already adapted maize genotypes, temperature conditions were set below the optimal growth range in both control and low temperature groups. The conditions were as follows; control (18 °C for 16 h and 12 °C for 8 h) and low temperature (12 °C for 16 h and 6 °C for 8 h). Four genotypes were identified from the condition adapted gene pool with significant contrasting chilling tolerance. Results Picker and PR39B29 were the more cold-tolerant lines and Fergus and Codisco were the less cold-tolerant lines. These four varieties were subjected to microarray analysis to identify differentially expressed genes under chilling conditions. Exposure to low temperature during establishment in the maize varieties Picker, PR39B29, Fergus and Codisco, was reflected at the transcriptomic level in the varieties Picker and PR39B29. No significant changes in expression were observed in Fergus and Codisco following chilling stress. A total number of 64 genes were differentially expressed in the two chilling tolerant varieties. These two varieties exhibited contrasting transcriptomic profiles, in which only four genes overlapped. Discussion We observed that maize varieties possessing an enhanced root growth ratio under low temperature were more tolerant, which could be an early and inexpensive measure for germplasm screening under controlled conditions. We have identified novel cold inducible genes in an already adapted maize breeding gene pool. This illustrates that further varietal selection for enhanced chilling

  12. Characteristics of recombinantly expressed rat and human histamine H3 receptors.

    Science.gov (United States)

    Wulff, Birgitte S; Hastrup, Sven; Rimvall, Karin

    2002-10-18

    Human and rat histamine H(3) receptors were recombinantly expressed and characterized using receptor binding and a functional cAMP assay. Seven of nine agonists had similar affinities and potencies at the rat and human histamine H(3) receptor. S-alpha-methylhistamine had a significantly higher affinity and potency at the human than rat receptor, and for 4-[(1R*,2R*)-2-(5,5-dimethyl-1-hexynyl)cyclopropyl]-1H-imidazole (Perceptin) the preference was the reverse. Only two of six antagonists had similar affinities and potencies at the human and the rat histamine H(3) receptor. Ciproxifan, thioperamide and (1R*,2R*)-trans-2-imidazol-4 ylcyclopropyl) (cyclohexylmethoxy) carboxamide (GT2394) had significantly higher affinities and potencies at the rat than at the human histamine H(3) receptor, while for N-(4-chlorobenzyl)-N-(7-pyrrolodin-1-ylheptyl)guanidine (JB98064) the preference was the reverse. All antagonists also showed potent inverse agonism properties. Iodoproxyfan, Perceptin, proxyfan and GR175737, compounds previously described as histamine H(3) receptor antagonists, acted as full or partial agonists at both the rat and the human histamine H(3) receptor. Copyright 2002 Elsevier Science B.V.

  13. Defining the genomic signature of totipotency and pluripotency during early human development.

    Directory of Open Access Journals (Sweden)

    Amparo Galan

    Full Text Available The genetic mechanisms governing human pre-implantation embryo development and the in vitro counterparts, human embryonic stem cells (hESCs), still remain incomplete. Previous global genome studies demonstrated that totipotent blastomeres from day-3 human embryos and pluripotent inner cell masses (ICMs) from blastocysts, display unique and differing transcriptomes. Nevertheless, comparative gene expression analysis has revealed that no significant differences exist between hESCs derived from blastomeres versus those obtained from ICMs, suggesting that pluripotent hESCs involve a new developmental progression. To understand early human stages evolution, we developed an undifferentiation network signature (UNS) and applied it to a differential gene expression profile between single blastomeres from day-3 embryos, ICMs and hESCs. This allowed us to establish a unique signature composed of highly interconnected genes characteristic of totipotency (61 genes), in vivo pluripotency (20 genes), and in vitro pluripotency (107 genes), and which are also proprietary according to functional analysis. This systems biology approach has led to an improved understanding of the molecular and signaling processes governing human pre-implantation embryo development, as well as enabling us to comprehend how hESCs might adapt to in vitro culture conditions.

  14. Whole transcriptome sequencing enables discovery and analysis of viruses in archived primary central nervous system lymphomas.

    Directory of Open Access Journals (Sweden)

    Christopher DeBoever

    Full Text Available Primary central nervous system lymphomas (PCNSL) have a dramatically increased prevalence among persons living with AIDS and are known to be associated with human Epstein Barr virus (EBV) infection. Previous work suggests that in some cases, co-infection with other viruses may be important for PCNSL pathogenesis. Viral transcription in tumor samples can be measured using next generation transcriptome sequencing. We demonstrate the ability of transcriptome sequencing to identify viruses, characterize viral expression, and identify viral variants by sequencing four archived AIDS-related PCNSL tissue samples and analyzing raw sequencing reads. EBV was detected in all four PCNSL samples and cytomegalovirus (CMV), JC polyomavirus (JCV), and HIV were also discovered, consistent with clinical diagnoses. CMV was found to express three long non-coding RNAs recently reported as expressed during active infection. Single nucleotide variants were observed in each of the viruses observed and three indels were found in CMV. No viruses were found in several control tumor types including 32 diffuse large B-cell lymphoma samples. This study demonstrates the ability of next generation transcriptome sequencing to accurately identify viruses, including DNA viruses, in solid human cancer tissue samples.

  15. Transcriptome Changes of Escherichia coli, Enterococcus faecalis, and Escherichia coli O157:H7 Laboratory Strains in Response to Photo-Degraded DOM

    Directory of Open Access Journals (Sweden)

    Adelumola Oladeinde

    2018-05-01

    Full Text Available In this study, we investigated gene expression changes in three bacterial strains (Escherichia coli C3000, Escherichia coli O157:H7 B6914, and Enterococcus faecalis ATCC 29212), commonly used as indicators of water quality and as control strains in clinical, food, and water microbiology laboratories. Bacterial transcriptome responses from pure cultures were monitored in microcosms containing water amended with manure-derived dissolved organic matter (DOM), previously exposed to simulated sunlight for 12 h. We used RNA sequencing (RNA-seq) and quantitative real-time reverse transcriptase (qRT-PCR) to compare differentially expressed temporal transcripts between bacteria incubated in microcosms containing sunlight irradiated and non-irradiated DOM, for up to 24 h. In addition, we used whole genome sequencing simultaneously with RNA-seq to identify single nucleotide variants (SNV) acquired in bacterial populations during incubation. These results indicate that E. coli and E. faecalis have different mechanisms for removal of reactive oxygen species (ROS) produced from irradiated DOM. They are also able to produce micromolar concentrations of H2O2 from non-irradiated DOM, that should be detrimental to other bacteria present in the environment. Notably, this study provides an assessment of the role of two conjugative plasmids carried by the E. faecalis and highlights the differences in the overall survival dynamics of environmentally-relevant bacteria in the presence of naturally-produced ROS.

  16. Distinct RNA transcriptome patterns are potentially associated with angiogenesis in Tie2-expressing monocytes.

    Science.gov (United States)

    Wang, Xinjing; Dai, Zhiyuan; Wu, Xiaoli; Wang, Kai; Wang, Xipeng

    2016-04-10

    Tie2-expressing monocytes (TEMs) were previously identified as a novel subset of monocytes and were believed to have prominent pro-angiogenesis activities in human tumors. The molecular mechanism of the angiogenesis-promoting capacity of TEMs, however, remains unclear. RNA transcriptome patterns, including non-coding RNAs such as microRNA (miRNA) and long non-coding RNA (lncRNA), play an important role in cell differentiation and function. However, little is known about the transcriptome patterns of TEMs, including those non-coding RNAs. We explore the transcriptome of TEMs and the matched monocytes that do not express Tie2 (Tie2(-) monocytes) isolated from peripheral blood of healthy adults, employing the Agilent Human miRNA (8*60K, Design ID: 046064) microarray and the Agilent lncRNA Gene Expression (4*180K, Design ID: 042818) microarray. A total of 141 mRNAs, 142 lncRNAs and 75 miRNAs were found dysregulated in TEMs compared to Tie2(-) monocytes. TEMs have distinct RNA transcriptome patterns according to hierarchical clustering, and the gene expression patterns were confirmed by quantitative real-time reverse transcription polymerase chain reaction (qRT-PCR). Functional annotation by Gene Ontology (GO) analyses showed that the up-regulated mRNAs in TEMs were associated with blood vessel remodeling and positive regulation of epithelial cell proliferation, and the up-regulated insulin-like growth factor 1 (IGF1) mRNA was involved in both pathways. For functional analysis of the dysregulated non-coding RNAs, target genes of the miRNAs were predicted and cis/trans-regulation analysis of the lncRNAs was performed. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. De Novo Transcriptome Analysis of Two Seahorse Species (Hippocampus erectus and H. mohnikei) and the Development of Molecular Markers for Population Genetics.

    Directory of Open Access Journals (Sweden)

    Qiang Lin

    Full Text Available Seahorse conservation has been performed utilizing various strategies for many decades, and the deeper understanding of genomic information is necessary to more efficiently protect the germplasm resources of seahorse species. However, little genetic information about seahorses currently exists in the public databases. In this study, high-throughput RNA sequencing for two seahorse species, Hippocampus erectus and H. mohnikei, was carried out, and de novo assembly generated 37,506 unigenes for H. erectus and 36,113 unigenes for H. mohnikei. Among them, 17,338 (46.23%) unigenes for H. erectus and 17,900 (49.57%) for H. mohnikei were successfully annotated based on the information available from the public databases. Through comparing the unigenes of two seahorse species, 7,802 candidate orthologous genes were identified and 5,268 genes among them could be annotated. In addition, gene ontology analysis of two species was similarly performed on biological processes, cellular components, and molecular functions. Twenty-four and twenty-one unigenes in H. erectus and H. mohnikei were annotated in the biosynthesis of unsaturated fatty acids pathways, and both seahorses lacked the Δ12 and Δ15 desaturases. Total of 8,992 and 9,116 SSR loci were obtained from H. erectus and H. mohnikei unigenes, respectively. Dozens of SSR were developed and then applied to assess the population genetic diversity, as well as cross-amplified in a related species, H. trimaculatus. The HO and HE values of the tested populations for H. erectus, H. mohnikei, and H. trimaculatus were medium. These resources would facilitate the conservation of the species through a better understanding of the genomics and comparative genome analysis within the Hippocampus genus.

  18. De Novo Transcriptome Analysis of Two Seahorse Species (Hippocampus erectus and H. mohnikei) and the Development of Molecular Markers for Population Genetics.

    Science.gov (United States)

    Lin, Qiang; Luo, Wei; Wan, Shiming; Gao, Zexia

    2016-01-01

    Seahorse conservation has been performed utilizing various strategies for many decades, and the deeper understanding of genomic information is necessary to more efficiently protect the germplasm resources of seahorse species. However, little genetic information about seahorses currently exists in the public databases. In this study, high-throughput RNA sequencing for two seahorse species, Hippocampus erectus and H. mohnikei, was carried out, and de novo assembly generated 37,506 unigenes for H. erectus and 36,113 unigenes for H. mohnikei. Among them, 17,338 (46.23%) unigenes for H. erectus and 17,900 (49.57%) for H. mohnikei were successfully annotated based on the information available from the public databases. Through comparing the unigenes of two seahorse species, 7,802 candidate orthologous genes were identified and 5,268 genes among them could be annotated. In addition, gene ontology analysis of two species was similarly performed on biological processes, cellular components, and molecular functions. Twenty-four and twenty-one unigenes in H. erectus and H. mohnikei were annotated in the biosynthesis of unsaturated fatty acids pathways, and both seahorses lacked the Δ12 and Δ15 desaturases. Total of 8,992 and 9,116 SSR loci were obtained from H. erectus and H. mohnikei unigenes, respectively. Dozens of SSR were developed and then applied to assess the population genetic diversity, as well as cross-amplified in a related species, H. trimaculatus. The HO and HE values of the tested populations for H. erectus, H. mohnikei, and H. trimaculatus were medium. These resources would facilitate the conservation of the species through a better understanding of the genomics and comparative genome analysis within the Hippocampus genus.
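    The SSR (simple sequence repeat) loci mentioned above are tandem repeats of short motifs within unigene sequences. The abstract does not name the tool or thresholds used, so the following Python sketch is only an illustration of the general idea: it finds perfect di-, tri- and tetranucleotide repeats with regular expressions, and the `MIN_REPEATS` thresholds, the function name and the toy sequence are assumptions.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem repeats.
MIN_REPEATS = {2: 6, 3: 5, 4: 5}


def find_ssrs(seq: str) -> list[tuple[int, str, int]]:
    """Return (start, motif, repeat_count) for each perfect SSR in seq."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in MIN_REPEATS.items():
        # A motif followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. AAAA, AGAG),
            # so the same locus is not reported under two motif lengths.
            if any(
                motif == motif[:k] * (motif_len // k)
                for k in range(1, motif_len)
                if motif_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)


# Toy sequence (not real data): an (AG)8 and a (CAG)5 repeat with flanking bases.
unigene = "GGCT" + "AG" * 8 + "CCTTA" + "CAG" * 5 + "TTGC"
ssrs = find_ssrs(unigene)
print(ssrs)
```

    Flanking sequence around each reported start position is what primer design for marker development would then work from.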

  19. Human vaginal pH and microbiota: an update.

    Science.gov (United States)

    Godha, Keshav; Tucker, Kelly M; Biehl, Colton; Archer, David F; Mirkin, Sebastian

    2018-06-01

    A woman's vaginal pH has many implications for her health and it can be a useful tool in disease diagnosis and prevention. For that reason, the further examination of the relationship between the human vaginal pH and microbiota is imperative. In the past several decades, much has been learned about the physiological mechanisms modulating the vaginal pH, and exogenous/genetic factors that may influence it. A unified, coherent understanding of these concepts is presented to comprehend their interrelationships and their cumulative effect on a woman's health. In this review, we explore research on vaginal pH and microbiota throughout a woman's life, vaginal intermediate cell anaerobic metabolism and net proton secretion by the vaginal epithelium, and the way these factors interact to acidify the vaginal pH. This review provides foundational information about what a microbiota is and its relationship with human physiology and vaginal pH. We then evaluate the influence of physiological mechanisms and demographic factors, and propose ideas for the mechanisms behind their action on the vaginal pH.

  20. Integrative investigation of metabolic and transcriptomic data

    Directory of Open Access Journals (Sweden)

    Önsan Z İlsen

    2006-04-01

    Full Text Available Abstract Background New analysis methods are being developed to integrate data from transcriptome, proteome, interactome, metabolome, and other investigative approaches. At the same time, existing methods are being modified to serve the objectives of systems biology and permit the interpretation of the huge datasets currently being generated by high-throughput methods. Results Transcriptomic and metabolic data from chemostat fermentors were collected with the aim of investigating the relationship between these two data sets. The variation in transcriptome data in response to three physiological or genetic perturbations (medium composition, growth rate, and specific gene deletions) was investigated using linear modelling, and open reading-frames (ORFs) whose expression changed significantly in response to these perturbations were identified. Assuming that the metabolic profile is a function of the transcriptome profile, expression levels of the different ORFs were used to model the metabolic variables via Partial Least Squares (Projection to Latent Structures – PLS) using PLS toolbox in Matlab. Conclusion The experimental design allowed the analyses to discriminate between the effects which the growth medium, dilution rate, and the deletion of specific genes had on the transcriptome and metabolite profiles. Metabolite data were modelled as a function of the transcriptome to determine their congruence. The genes that are involved in central carbon metabolism of yeast cells were found to be the ORFs with the most significant contribution to the model.

  1. Transcriptome Sequencing and Analysis for Culm Elongation of the World's Largest Bamboo (Dendrocalamus sinicus).

    Directory of Open Access Journals (Sweden)

    Kai Cui

    Full Text Available Dendrocalamus sinicus is the world's largest bamboo species with strong woody culms, and known for its fast-growing culms. As an economic bamboo species, it was popularized for multi-functional applications including furniture, construction, and industrial paper pulp. To comprehensively elucidate the molecular processes involved in its culm elongation, Illumina paired-end sequencing was conducted. About 65.08 million high-quality reads were produced, and assembled into 81,744 unigenes with an average length of 723 bp. A total of 64,338 (79%) unigenes were annotated for their functions, of which, 56,587 were annotated in the NCBI non-redundant protein database and 35,262 were annotated in the Swiss-Prot database. Also, 42,508 and 21,009 annotated unigenes were allocated to gene ontology (GO) categories and clusters of orthologous groups (COG), respectively. By searching against the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG), 33,920 unigenes were assigned to 128 KEGG pathways. Meanwhile, 8,553 simple sequence repeats (SSRs) and 81,534 single-nucleotide polymorphism (SNPs) were identified, respectively. Additionally, 388 transcripts encoding lignin biosynthesis were detected, among which, 27 transcripts encoding Shikimate O-hydroxycinnamoyltransferase (HCT) specifically expressed in D. sinicus when compared to other bamboo species and rice. The phylogenetic relationship between D. sinicus and other plants was analyzed, suggesting functional diversity of HCT unigenes in D. sinicus. We conjectured that HCT might lead to the high lignin content and giant culm. Given that the leaves are not yet formed and culm is covered with sheaths during culm elongation, the existence of photosynthesis of bamboo culm is usually neglected. Surprisingly, 109 transcripts encoding photosynthesis were identified, including photosystem I and II, cytochrome b6/f complex, photosynthetic electron transport and F-type ATPase, and 24 transcripts were characterized

  2. Transcriptome Sequencing and Analysis for Culm Elongation of the World's Largest Bamboo (Dendrocalamus sinicus).

    Science.gov (United States)

    Cui, Kai; Wang, Haiying; Liao, Shengxi; Tang, Qi; Li, Li; Cui, Yongzhong; He, Yuan

    2016-01-01

    Dendrocalamus sinicus is the world's largest bamboo species with strong woody culms, and known for its fast-growing culms. As an economic bamboo species, it was popularized for multi-functional applications including furniture, construction, and industrial paper pulp. To comprehensively elucidate the molecular processes involved in its culm elongation, Illumina paired-end sequencing was conducted. About 65.08 million high-quality reads were produced, and assembled into 81,744 unigenes with an average length of 723 bp. A total of 64,338 (79%) unigenes were annotated for their functions, of which, 56,587 were annotated in the NCBI non-redundant protein database and 35,262 were annotated in the Swiss-Prot database. Also, 42,508 and 21,009 annotated unigenes were allocated to gene ontology (GO) categories and clusters of orthologous groups (COG), respectively. By searching against the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG), 33,920 unigenes were assigned to 128 KEGG pathways. Meanwhile, 8,553 simple sequence repeats (SSRs) and 81,534 single-nucleotide polymorphism (SNPs) were identified, respectively. Additionally, 388 transcripts encoding lignin biosynthesis were detected, among which, 27 transcripts encoding Shikimate O-hydroxycinnamoyltransferase (HCT) specifically expressed in D. sinicus when compared to other bamboo species and rice. The phylogenetic relationship between D. sinicus and other plants was analyzed, suggesting functional diversity of HCT unigenes in D. sinicus. We conjectured that HCT might lead to the high lignin content and giant culm. Given that the leaves are not yet formed and culm is covered with sheaths during culm elongation, the existence of photosynthesis of bamboo culm is usually neglected. Surprisingly, 109 transcripts encoding photosynthesis were identified, including photosystem I and II, cytochrome b6/f complex, photosynthetic electron transport and F-type ATPase, and 24 transcripts were characterized as antenna

  3. A comparative gene expression database for invertebrates

    Directory of Open Access Journals (Sweden)

    Ormestad Mattias

    2011-08-01

    Full Text Available Abstract Background As whole genome and transcriptome sequencing gets cheaper and faster, a great number of 'exotic' animal models are emerging, rapidly adding valuable data to the ever-expanding Evo-Devo field. All these new organisms serve as a fantastic resource for the research community, but the sheer amount of data, some published, some not, makes detailed comparison of gene expression patterns very difficult to summarize - a problem sometimes even noticeable within a single lab. The need to merge existing data with new information in an organized manner that is publicly available to the research community is now more necessary than ever. Description In order to offer a homogenous way of storing and handling gene expression patterns from a variety of organisms, we have developed the first web-based comparative gene expression database for invertebrates that allows species-specific as well as cross-species gene expression comparisons. The database can be queried by gene name, developmental stage and/or expression domains. Conclusions This database provides a unique tool for the Evo-Devo research community that allows the retrieval, analysis and comparison of gene expression patterns within or among species. In addition, this database enables a quick identification of putative syn-expression groups that can be used to initiate, among other things, gene regulatory network (GRN) projects.
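    Querying "by gene name, developmental stage and/or expression domains" amounts to filtering on any combination of those three fields. The sketch below shows one way such a lookup could work using Python's built-in `sqlite3` module. The table layout, column names, species labels and records are all hypothetical placeholders, not the schema or contents of the database described above.

```python
import sqlite3

# In-memory toy database with a hypothetical one-table schema.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE expression (species TEXT, gene TEXT, stage TEXT, domain TEXT)"
)
conn.executemany(
    "INSERT INTO expression VALUES (?, ?, ?, ?)",
    [
        ("species_A", "gene_1", "gastrula", "domain_x"),
        ("species_B", "gene_1", "gastrula", "domain_x"),
        ("species_A", "gene_2", "gastrula", "domain_y"),
        ("species_B", "gene_2", "larva", "domain_y"),
    ],
)


def query(conn, gene=None, stage=None, domain=None):
    """Filter expression records on any combination of the three criteria."""
    clauses, params = [], []
    for column, value in (("gene", gene), ("stage", stage), ("domain", domain)):
        if value is not None:
            clauses.append(f"{column} = ?")  # column names are fixed, values are bound
            params.append(value)
    where = " WHERE " + " AND ".join(clauses) if clauses else ""
    sql = (
        "SELECT species, gene, stage, domain FROM expression"
        + where
        + " ORDER BY species, gene"
    )
    return conn.execute(sql, params).fetchall()


# Cross-species comparison: where is gene_1 expressed in every species?
cross_species = query(conn, gene="gene_1")
# Combined criteria: gastrula-stage genes expressed in domain_y.
combined = query(conn, stage="gastrula", domain="domain_y")
print(cross_species)
print(combined)
```

    Genes returned together for the same stage and domain are the kind of grouping from which putative syn-expression groups could be shortlisted.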

  4. De novo transcriptome sequencing and assembly from apomictic and sexual Eragrostis curvula genotypes.

    Directory of Open Access Journals (Sweden)

    Ingrid Garbus

    Full Text Available A long-standing goal in plant breeding has been the ability to confer apomixis to agriculturally relevant species, which would require a deeper comprehension of the molecular basis of apomictic regulatory mechanisms. Eragrostis curvula (Schrad.) Nees is a perennial grass that includes both sexual and apomictic cytotypes. The availability of a reference transcriptome for this species would constitute a very important tool toward the identification of genes controlling key steps of the apomictic pathway. Here, we used Roche/454 sequencing technologies to generate reads from inflorescences of E. curvula apomictic and sexual genotypes that were de novo assembled into a reference transcriptome. Nearly 90% of the 49568 assembled isotigs showed sequence similarity to sequences deposited in the public databases. A gene ontology analysis categorized 27448 isotigs into at least one of the three main GO categories. We identified 11475 SSRs, and several of them were assayed in E. curvula germplasm using SSR-based primers, providing a valuable set of molecular markers that could allow direct allele selection. The differential contribution to each library of the spliced forms of several transcripts revealed the existence of several isotigs produced via alternative splicing of single genes. The reference transcriptome presented and validated in this work will be useful for the identification of a wide range of gene(s) related to agronomic traits of E. curvula, including those controlling key steps of the apomictic pathway in this species, allowing the extrapolation of the findings to other plant species.

  5. A third human retinoic acid receptor, hRAR-γ

    International Nuclear Information System (INIS)

    Krust, A.; Kastner, Ph.; Petkovich, M.; Zelent, A.; Chambon, P.

    1989-01-01

    Retinoic acid receptors (RARs) are retinoic acid (RA)-inducible enhancer factors belonging to the superfamily of steroid/thyroid nuclear receptors. The authors have previously characterized two human RAR (hRAR-α and hRAR-β) cDNAs and have recently cloned their murine cognates (mRAR-α and mRAR-β) together with a third RAR (mRAR-γ) whose RNA was detected predominantly in skin, a well-known target for RA. mRAR-γ cDNA was used here to clone its human counterpart (hRAR-γ) from a T47D breast cancer cell cDNA library. Using a transient transfection assay in HeLa cells and a reporter gene harboring a synthetic RA responsive element, they demonstrate that hRAR-γ cDNA indeed encodes a RA-inducible transcriptional trans-activator. Interestingly, comparisons of the amino acid sequences of all six human and mouse RARs indicate that the interspecies conservation of a given member of the RAR subfamily (either α, β, or γ) is much higher than the conservation of all three receptors within a given species. These observations indicate that RAR-α, -β, and -γ may perform specific functions. They show also that hRAR-γ RNA is the predominant RAR RNA species in human skin, which suggests that hRAR-γ mediates some of the retinoid effects in this tissue

  6. Let's talk about it: dialogues with multimedia databases Database support for human activity

    NARCIS (Netherlands)

    de Vries, A.P.; van der Veer, Gerrit C.; Blanken, Henk

    We describe two scenarios of user tasks in which access to multimedia data plays a significant role. Because current multimedia databases cannot support these tasks, we introduce three new requirements on multimedia databases: multimedia objects should be active objects, querying is an interaction

  7. Single-cell analysis of targeted transcriptome predicts drug sensitivity of single cells within human myeloma tumors.

    Science.gov (United States)

    Mitra, A K; Mukherjee, U K; Harding, T; Jang, J S; Stessman, H; Li, Y; Abyzov, A; Jen, J; Kumar, S; Rajkumar, V; Van Ness, B

    2016-05-01

    Multiple myeloma (MM) is characterized by significant genetic diversity at subclonal levels that have a defining role in the heterogeneity of tumor progression, clinical aggressiveness and drug sensitivity. Although genome profiling studies have demonstrated heterogeneity in subclonal architecture that may ultimately lead to relapse, a gene expression-based prediction program that can identify, distinguish and quantify drug response in sub-populations within a bulk population of myeloma cells is lacking. In this study, we performed targeted transcriptome analysis on 528 pre-treatment single cells from 11 myeloma cell lines and 418 single cells from 8 drug-naïve MM patients, followed by intensive bioinformatics and statistical analysis for prediction of proteasome inhibitor sensitivity in individual cells. Using our previously reported drug response gene expression profile signature at the single-cell level, we developed an R Statistical analysis package available at https://github.com/bvnlabSCATTome, SCATTome (single-cell analysis of targeted transcriptome), that restructures the data obtained from Fluidigm single-cell quantitative real-time-PCR analysis run, filters missing data, performs scaling of filtered data, builds classification models and predicts drug response of individual cells based on targeted transcriptome using an assortment of machine learning methods. Application of SCATT should contribute to clinically relevant analysis of intratumor heterogeneity, and better inform drug choices based on subclonal cellular responses.
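    The processing steps listed above (filter missing data, scale the filtered data, build a classification model, predict the response of individual cells) can be sketched in a few lines. The example below is written in Python rather than R and does not reproduce the SCATTome package or its API. The missing-data cutoff, the nearest-centroid classifier standing in for the "assortment of machine learning methods", the class labels and the toy expression values are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def filter_genes(cells, max_missing=0.5):
    """Return indices of genes missing (None) in at most max_missing of cells."""
    n_genes = len(cells[0])
    return [
        g for g in range(n_genes)
        if sum(c[g] is None for c in cells) / len(cells) <= max_missing
    ]


def scale(cells, keep):
    """Z-score each kept gene; remaining missing values become 0 (the gene mean)."""
    columns = []
    for g in keep:
        observed = [c[g] for c in cells if c[g] is not None]
        mu, sd = mean(observed), pstdev(observed) or 1.0
        columns.append([0.0 if c[g] is None else (c[g] - mu) / sd for c in cells])
    return [list(row) for row in zip(*columns)]


def nearest_centroid(train, labels, cell):
    """Predict the label whose class centroid is closest to the given cell."""
    centroids = {}
    for label in set(labels):
        members = [x for x, lab in zip(train, labels) if lab == label]
        centroids[label] = [mean(col) for col in zip(*members)]

    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(sorted(centroids), key=lambda lab: sq_dist(centroids[lab], cell))


# Toy single-cell matrix (rows = cells, columns = genes); None = failed assay.
cells = [
    [1.0, 8.0, None],
    [2.0, 9.0, None],
    [9.0, 1.0, None],
    [8.0, 2.0, 5.0],
    [8.5, None, None],  # cell with unknown drug response
]
labels = ["sensitive", "sensitive", "resistant", "resistant"]  # hypothetical

keep = filter_genes(cells)      # third gene is dropped: missing in 4 of 5 cells
scaled = scale(cells, keep)     # for brevity, scaled over all cells at once
prediction = nearest_centroid(scaled[:4], labels, scaled[4])
print(keep, prediction)
```

    In a real analysis the scaling parameters would be estimated on the training cells only and the classifier would be validated on held-out cells before any per-cell prediction is interpreted.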

  8. Novel functional view of the crocidolite asbestos-treated A549 human lung epithelial transcriptome reveals an intricate network of pathways with opposing functions

    Directory of Open Access Journals (Sweden)

    Stevens John R

    2008-08-01

    Full Text Available Abstract Background Although exposure to asbestos is now regulated, patients continue to be diagnosed with mesothelioma, asbestosis, fibrosis and lung carcinoma because of the long latent period between exposure and clinical disease. Asbestosis is observed in approximately 200,000 patients annually and asbestos-related deaths are estimated at 4,000 annually. Although advances have been made using single gene/gene product or pathway studies, the complexity of the response to asbestos and the many unanswered questions suggested the need for a systems biology approach. The objective of this study was to generate a comprehensive view of the transcriptional changes induced by crocidolite asbestos in A549 human lung epithelial cells. Results A statistically robust, comprehensive data set documenting the crocidolite-induced changes in the A549 transcriptome was collected. A systems biology approach involving global observations from gene ontological analyses coupled with functional network analyses was used to explore the effects of crocidolite in the context of known molecular interactions. The analyses uniquely document a transcriptome with function-based networks in cell death, cancer, cell cycle, cellular growth, proliferation, and gene expression. These functional modules show signs of a complex interplay between signaling pathways consisting of both novel and previously described asbestos-related genes/gene products. These networks allowed for the identification of novel, putative crocidolite-related genes, leading to several new hypotheses regarding genes that are important for the asbestos response. The global analysis revealed a transcriptome that bears signatures of both apoptosis/cell death and cell survival/proliferation.
Conclusion Our analyses demonstrate the power of combining a statistically robust, comprehensive dataset and a functional network genomics approach to 1 identify and explore relationships between genes of known importance

  9. De novo assembly and analysis of the Artemisia argyi transcriptome and identification of genes involved in terpenoid biosynthesis.

    Science.gov (United States)

    Liu, Miaomiao; Zhu, Jinhang; Wu, Shengbing; Wang, Chenkai; Guo, Xingyi; Wu, Jiawen; Zhou, Meiqi

    2018-04-11

    Artemisia argyi Lev. et Vant. (A. argyi) is widely utilized for moxibustion in Chinese medicine, and the mechanism underlying terpenoid biosynthesis in its leaves is suggested to play an important role in its medicinal use. However, the A. argyi transcriptome has not been sequenced. Herein, we performed RNA sequencing for A. argyi leaf, root and stem tissues to identify as many as possible of the transcribed genes. In total, 99,807 unigenes were assembled by analysing the expression profiles generated from the three tissue types, and 67,446 of those unigenes were annotated in public databases. We further performed differential gene expression analysis to compare leaf tissue with the other two tissue types and identified numerous genes that were specifically expressed or up-regulated in leaf tissue. Specifically, we identified multiple genes encoding significant enzymes or transcription factors related to terpenoid synthesis. This study serves as a valuable resource for transcriptome information, as many transcribed genes related to terpenoid biosynthesis were identified in the A. argyi transcriptome, providing a functional genomic basis for additional studies on molecular mechanisms underlying the medicinal use of A. argyi.

  10. A high-resolution anatomical atlas of the transcriptome in the mouse embryo.

    Directory of Open Access Journals (Sweden)

    Graciana Diez-Roux

    Full Text Available Ascertaining when and where genes are expressed is of crucial importance to understanding or predicting the physiological role of genes and proteins and how they interact to form the complex networks that underlie organ development and function. It is, therefore, crucial to determine on a genome-wide level, the spatio-temporal gene expression profiles at cellular resolution. This information is provided by colorimetric RNA in situ hybridization that can elucidate expression of genes in their native context and does so at cellular resolution. We generated what is to our knowledge the first genome-wide transcriptome atlas by RNA in situ hybridization of an entire mammalian organism, the developing mouse at embryonic day 14.5. This digital transcriptome atlas, the Eurexpress atlas (http://www.eurexpress.org), consists of a searchable database of annotated images that can be interactively viewed. We generated anatomy-based expression profiles for over 18,000 coding genes and over 400 microRNAs. We identified 1,002 tissue-specific genes that are a source of novel tissue-specific markers for 37 different anatomical structures. The quality and the resolution of the data revealed novel molecular domains for several developing structures, such as the telencephalon, a novel organization for the hypothalamus, and insight on the Wnt network involved in renal epithelial differentiation during kidney development. The digital transcriptome atlas is a powerful resource to determine co-expression of genes, to identify cell populations and lineages, and to identify functional associations between genes relevant to development and disease.

  11. Uptake of 3H-choline and synthesis of 3H-acetylcholine by human penile corpus cavernosum

    International Nuclear Information System (INIS)

    Blanco, R.; Saenz de Tejada, I.; Azadzoi, K.; Goldstein, I.; Krane, R.J.; Wotiz, H.H.; Cohen, R.A.

    1986-01-01

    The neuroeffectors which relax penile smooth muscle and lead to erection are unknown; physiological studies of human corpus cavernosum, in vitro, have suggested a significant role of cholinergic neurotransmission. To further characterize the importance of cholinergic nerves, biopsies of human corpus cavernosum were obtained at the time of penile prosthesis implantation. Tissues were incubated in 3H-choline (10^-5 M, 80 Ci/mmol) in oxygenated physiological salt solution at 37 °C, pH 7.4 for 1 hour. Radiolabelled compounds were extracted with perchloric acid (0.4 M) and acetylcholine and choline were separated by HPLC; 14C-acetylcholine was used as internal standard. 3H-choline was accumulated by the tissues (20 +/- 1.9 fmol/mg), and 3H-acetylcholine was synthesized (4.0 +/- 1.1 fmol/mg). In control experiments, heating of the tissue blocked synthesis of 3H-acetylcholine. Inhibition of high affinity choline transport by hemicholinium-3 (10^-5 M) diminished tissue accumulation of 3H-choline and significantly reduced the synthesis of 3H-acetylcholine (0.5 +/- 0.2 fmol/mg, p < 0.05). These results provide direct evidence of neuronal accumulation of choline and enzymatic conversion to acetylcholine in human corpus cavernosum. Taken together with the physiological studies, it can be concluded that cholinergic neurotransmission in human corpus cavernosum plays a role in penile erection.

  12. Tentacle Transcriptome and Venom Proteome of the Pacific Sea Nettle, Chrysaora fuscescens (Cnidaria: Scyphozoa)

    Directory of Open Access Journals (Sweden)

    Dalia Ponce

    2016-04-01

    Full Text Available Jellyfish venoms are rich sources of toxins designed to capture prey or deter predators, but they can also elicit harmful effects in humans. In this study, an integrated transcriptomic and proteomic approach was used to identify putative toxins and their potential role in the venom of the scyphozoan jellyfish Chrysaora fuscescens. A de novo tentacle transcriptome, containing more than 23,000 contigs, was constructed and used in proteomic analysis of C. fuscescens venom to identify potential toxins. From a total of 163 proteins identified in the venom proteome, 27 were classified as putative toxins and grouped into six protein families: proteinases, venom allergens, C-type lectins, pore-forming toxins, glycoside hydrolases and enzyme inhibitors. Other putative toxins identified in the transcriptome, but not the proteome, included additional proteinases as well as lipases and deoxyribonucleases. Sequence analysis also revealed the presence of ShKT domains in two putative venom proteins from the proteome and an additional 15 from the transcriptome, suggesting potential ion channel blockade or modulatory activities. Comparison of these potential toxins to those from other cnidarians provided insight into their possible roles in C. fuscescens venom and an overview of the diversity of potential toxin families in cnidarian venoms.

  13. Tentacle Transcriptome and Venom Proteome of the Pacific Sea Nettle, Chrysaora fuscescens (Cnidaria: Scyphozoa)

    Science.gov (United States)

    Ponce, Dalia; Brinkman, Diane L.; Potriquet, Jeremy; Mulvenna, Jason

    2016-01-01

    Jellyfish venoms are rich sources of toxins designed to capture prey or deter predators, but they can also elicit harmful effects in humans. In this study, an integrated transcriptomic and proteomic approach was used to identify putative toxins and their potential role in the venom of the scyphozoan jellyfish Chrysaora fuscescens. A de novo tentacle transcriptome, containing more than 23,000 contigs, was constructed and used in proteomic analysis of C. fuscescens venom to identify potential toxins. From a total of 163 proteins identified in the venom proteome, 27 were classified as putative toxins and grouped into six protein families: proteinases, venom allergens, C-type lectins, pore-forming toxins, glycoside hydrolases and enzyme inhibitors. Other putative toxins identified in the transcriptome, but not the proteome, included additional proteinases as well as lipases and deoxyribonucleases. Sequence analysis also revealed the presence of ShKT domains in two putative venom proteins from the proteome and an additional 15 from the transcriptome, suggesting potential ion channel blockade or modulatory activities. Comparison of these potential toxins to those from other cnidarians provided insight into their possible roles in C. fuscescens venom and an overview of the diversity of potential toxin families in cnidarian venoms. PMID:27058558

  14. Multi-tissue RNA-seq and transcriptome characterisation of the spiny dogfish shark (Squalus acanthias) provides a molecular tool for biological research and reveals new genes involved in osmoregulation.

    Directory of Open Access Journals (Sweden)

    Andres Chana-Munoz

    Full Text Available The spiny dogfish shark (Squalus acanthias) is one of the most commonly used cartilaginous fishes in biological research, especially in the fields of nitrogen metabolism, ion transporters and osmoregulation. Nonetheless, transcriptomic data for this organism is scarce. In the present study, a multi-tissue RNA-seq experiment and de novo transcriptome assembly was performed in four different spiny dogfish tissues (brain, liver, kidney and ovary), providing an annotated sequence resource. The characterization of the transcriptome greatly increases the scarce sequence information for shark species. Reads were assembled with the Trinity de novo assembler both within each tissue and across all tissues combined resulting in 362,690 transcripts in the combined assembly which represent 289,515 Trinity genes. BUSCO analysis determined a level of 87% completeness for the combined transcriptome. In total, 123,110 proteins were predicted of which 78,679 and 83,164 had significant hits against the SwissProt and Uniref90 protein databases, respectively. Additionally, 61,215 proteins aligned to known protein domains, 7,208 carried a signal peptide and 15,971 possessed at least one transmembrane region. Based on the annotation, 81,582 transcripts were assigned to gene ontology terms and 42,078 belong to known clusters of orthologous groups (eggNOG). To demonstrate the value of our molecular resource, we show that the improved transcriptome data enhances the current possibilities of osmoregulation research in spiny dogfish by utilizing the novel gene and protein annotations to investigate a set of genes involved in urea synthesis and urea, ammonia and water transport, all of them crucial in osmoregulation. We describe the presence of different gene copies and isoforms of key enzymes involved in this process, including arginases and transporters of urea and ammonia, for which sequence information is currently absent in the databases for this model species. The

  15. Multi-tissue RNA-seq and transcriptome characterisation of the spiny dogfish shark (Squalus acanthias) provides a molecular tool for biological research and reveals new genes involved in osmoregulation.

    Science.gov (United States)

    Chana-Munoz, Andres; Jendroszek, Agnieszka; Sønnichsen, Malene; Kristiansen, Rune; Jensen, Jan K; Andreasen, Peter A; Bendixen, Christian; Panitz, Frank

    2017-01-01

    The spiny dogfish shark (Squalus acanthias) is one of the most commonly used cartilaginous fishes in biological research, especially in the fields of nitrogen metabolism, ion transporters and osmoregulation. Nonetheless, transcriptomic data for this organism is scarce. In the present study, a multi-tissue RNA-seq experiment and de novo transcriptome assembly was performed in four different spiny dogfish tissues (brain, liver, kidney and ovary), providing an annotated sequence resource. The characterization of the transcriptome greatly increases the scarce sequence information for shark species. Reads were assembled with the Trinity de novo assembler both within each tissue and across all tissues combined resulting in 362,690 transcripts in the combined assembly which represent 289,515 Trinity genes. BUSCO analysis determined a level of 87% completeness for the combined transcriptome. In total, 123,110 proteins were predicted of which 78,679 and 83,164 had significant hits against the SwissProt and Uniref90 protein databases, respectively. Additionally, 61,215 proteins aligned to known protein domains, 7,208 carried a signal peptide and 15,971 possessed at least one transmembrane region. Based on the annotation, 81,582 transcripts were assigned to gene ontology terms and 42,078 belong to known clusters of orthologous groups (eggNOG). To demonstrate the value of our molecular resource, we show that the improved transcriptome data enhances the current possibilities of osmoregulation research in spiny dogfish by utilizing the novel gene and protein annotations to investigate a set of genes involved in urea synthesis and urea, ammonia and water transport, all of them crucial in osmoregulation. We describe the presence of different gene copies and isoforms of key enzymes involved in this process, including arginases and transporters of urea and ammonia, for which sequence information is currently absent in the databases for this model species. The transcriptome

  16. Symbiodinium transcriptomes: genome insights into the dinoflagellate symbionts of reef-building corals.

    KAUST Repository

    Bayer, Till

    2012-04-18

    Dinoflagellates are unicellular algae that are ubiquitously abundant in aquatic environments. Species of the genus Symbiodinium form symbiotic relationships with reef-building corals and other marine invertebrates. Despite their ecologic importance, little is known about the genetics of dinoflagellates in general and Symbiodinium in particular. Here, we used 454 sequencing to generate transcriptome data from two Symbiodinium species from different clades (clade A and clade B). With more than 56,000 assembled sequences per species, these data represent the largest transcriptomic resource for dinoflagellates to date. Our results corroborate previous observations that dinoflagellates possess the complete nucleosome machinery. We found a complete set of core histones as well as several H3 variants and H2A.Z in one species. Furthermore, transcriptome analysis points toward a low number of transcription factors in Symbiodinium spp. that also differ in the distribution of DNA-binding domains relative to other eukaryotes. In particular the cold shock domain was predominant among transcription factors. Additionally, we found a high number of antioxidative genes in comparison to non-symbiotic but evolutionarily related organisms. These findings might be of relevance in the context of the role that Symbiodinium spp. play as coral symbionts. Our data represent the most comprehensive dinoflagellate EST data set to date. This study provides a comprehensive resource to further analyze the genetic makeup, metabolic capacities, and gene repertoire of Symbiodinium and dinoflagellates. Overall, our findings indicate that Symbiodinium possesses some unique characteristics, in particular the transcriptional regulation in Symbiodinium may differ from the currently known mechanisms of eukaryotic gene regulation.

  17. Transcriptomic analysis of Petunia hybrida in response to salt stress using high throughput RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Gonzalo H Villarino

    Full Text Available Salinity and drought stress are the primary cause of crop losses worldwide. In sodic saline soils sodium chloride (NaCl) disrupts normal plant growth and development. The complex interactions of plant systems with abiotic stress have made RNA sequencing a more holistic and appealing approach to study transcriptome level responses in a single cell and/or tissue. In this work, we determined the Petunia transcriptome response to NaCl stress by sequencing leaf samples and assembling 196 million Illumina reads with Trinity software. Using our reference transcriptome we identified more than 7,000 genes that were differentially expressed within 24 h of acute NaCl stress. The proposed transcriptome can also be used as an excellent tool for biological and bioinformatics in the absence of an available Petunia genome and it is available at the SOL Genomics Network (SGN) http://solgenomics.net. Genes related to regulation of reactive oxygen species, transport, and signal transductions as well as novel and undescribed transcripts were among those differentially expressed in response to salt stress. The candidate genes identified in this study can be applied as markers for breeding or to genetically engineer plants to enhance salt tolerance. Gene Ontology analyses indicated that most of the NaCl damage happened at 24 h inducing genotoxicity, affecting transport and organelles due to the high concentration of Na+ ions. Finally, we report a modification to the library preparation protocol whereby cDNA samples were bar-coded with non-HPLC purified primers, without affecting the quality and quantity of the RNA-seq data. The methodological improvement presented here could substantially reduce the cost of sample preparation for future high-throughput RNA sequencing experiments.

  18. Transcriptomic analysis of Petunia hybrida in response to salt stress using high throughput RNA sequencing.

    Science.gov (United States)

    Villarino, Gonzalo H; Bombarely, Aureliano; Giovannoni, James J; Scanlon, Michael J; Mattson, Neil S

    2014-01-01

    Salinity and drought stress are the primary cause of crop losses worldwide. In sodic saline soils sodium chloride (NaCl) disrupts normal plant growth and development. The complex interactions of plant systems with abiotic stress have made RNA sequencing a more holistic and appealing approach to study transcriptome level responses in a single cell and/or tissue. In this work, we determined the Petunia transcriptome response to NaCl stress by sequencing leaf samples and assembling 196 million Illumina reads with Trinity software. Using our reference transcriptome we identified more than 7,000 genes that were differentially expressed within 24 h of acute NaCl stress. The proposed transcriptome can also be used as an excellent tool for biological and bioinformatics in the absence of an available Petunia genome and it is available at the SOL Genomics Network (SGN) http://solgenomics.net. Genes related to regulation of reactive oxygen species, transport, and signal transductions as well as novel and undescribed transcripts were among those differentially expressed in response to salt stress. The candidate genes identified in this study can be applied as markers for breeding or to genetically engineer plants to enhance salt tolerance. Gene Ontology analyses indicated that most of the NaCl damage happened at 24 h inducing genotoxicity, affecting transport and organelles due to the high concentration of Na+ ions. Finally, we report a modification to the library preparation protocol whereby cDNA samples were bar-coded with non-HPLC purified primers, without affecting the quality and quantity of the RNA-seq data. The methodological improvement presented here could substantially reduce the cost of sample preparation for future high-throughput RNA sequencing experiments.

  19. Transcriptome Analysis of Dendrobium officinale and its Application to the Identification of Genes Associated with Polysaccharide Synthesis

    Science.gov (United States)

    Zhang, Jianxia; He, Chunmei; Wu, Kunlin; Teixeira da Silva, Jaime A.; Zeng, Songjun; Zhang, Xinhua; Yu, Zhenming; Xia, Haoqiang; Duan, Jun

    2016-01-01

    Dendrobium officinale is one of the most important Chinese medicinal herbs. Polysaccharides are one of the main active ingredients of D. officinale. To identify the genes that may be related to polysaccharide synthesis, two cDNA libraries were prepared from juvenile and adult D. officinale, and were named Dendrobium-1 and Dendrobium-2, respectively. Illumina sequencing for Dendrobium-1 generated 102 million high quality reads that were assembled into 93,881 unigenes with an average sequence length of 790 base pairs. The sequencing for Dendrobium-2 generated 86 million reads that were assembled into 114,098 unigenes with an average sequence length of 695 base pairs. Two transcriptome databases were integrated and assembled into a total of 145,791 unigenes. Among them, 17,281 unigenes were assigned to 126 KEGG pathways while 135 unigenes were involved in fructose and mannose metabolism. Gene Ontology analysis revealed that the majority of genes were associated with metabolic and cellular processes. Furthermore, 430 glycosyltransferase and 89 cellulose synthase genes were identified. Comparative analysis of both transcriptome databases revealed a total of 32,794 differential expression genes (DEGs), including 22,051 up-regulated and 10,743 down-regulated genes in Dendrobium-2 compared to Dendrobium-1. Furthermore, a total of 1142 and 7918 unigenes showed unique expression in Dendrobium-1 and Dendrobium-2, respectively. These DEGs were mainly correlated with metabolic pathways and the biosynthesis of secondary metabolites. In addition, 170 DEGs belonged to glycosyltransferase genes, 37 DEGs were related to cellulose synthase genes and 627 DEGs encoded transcription factors. This study substantially expands the transcriptome information for D. officinale and provides valuable clues for identifying candidate genes involved in polysaccharide biosynthesis and elucidating the mechanism of polysaccharide biosynthesis. PMID:26904032

  20. Transcriptome differences between enrofloxacin-resistant and enrofloxacin-susceptible strains of Aeromonas hydrophila.

    Science.gov (United States)

    Zhu, Fengjiao; Yang, Zongying; Zhang, Yiliu; Hu, Kun; Fang, Wenhong

    2017-01-01

    Enrofloxacin is the most commonly used antibiotic to control diseases in aquatic animals caused by A. hydrophila. This study conducted de novo transcriptome sequencing and compared the global transcriptomes of enrofloxacin-resistant and enrofloxacin-susceptible strains. A total of 4,714 unigenes were assembled. Of these, 4,122 were annotated. A total of 3,280 unigenes were assigned to GO, 3,388 unigenes were classified into Cluster of Orthologous Groups of proteins (COG) using BLAST and BLAST2GO software, and 2,568 were mapped onto pathways using the Kyoto Encyclopedia of Gene and Genomes Pathway database. Furthermore, 218 unigenes were deemed to be DEGs. After enrofloxacin treatment, 135 genes were upregulated and 83 genes were downregulated. The GO terms biological process (126 genes) and metabolic process (136 genes) were the most enriched, and the terms for protein folding, response to stress, and SOS response were also significantly enriched. This study identified that enrofloxacin treatment affects multiple biological functions of A. hydrophila. Enrofloxacin resistance in A. hydrophila is closely related to the reduction of intracellular drug accumulation caused by ABC transporters and increased expression of topoisomerase IV.

  1. Three mutations switch H7N9 influenza to human-type receptor specificity

    Energy Technology Data Exchange (ETDEWEB)

    de Vries, Robert P.; Peng, Wenjie; Grant, Oliver C.; Thompson, Andrew J.; Zhu, Xueyong; Bouwman, Kim M.; de la Pena, Alba T. Torrents; van Breemen, Marielle J.; Ambepitiya Wickramasinghe, Iresha N.; de Haan, Cornelis A. M.; Yu, Wenli; McBride, Ryan; Sanders, Rogier W.; Woods, Robert J.; Verheije, Monique H.; Wilson, Ian A.; Paulson, James C.; Fernandez-Sesma, Ana

    2017-06-15

    The avian H7N9 influenza outbreak in 2013 resulted from an unprecedented incidence of influenza transmission to humans from infected poultry. The majority of human H7N9 isolates contained a hemagglutinin (HA) mutation (Q226L) that has previously been associated with a switch in receptor specificity from avian-type (NeuAcα2-3Gal) to human-type (NeuAcα2-6Gal), as documented for the avian progenitors of the 1957 (H2N2) and 1968 (H3N2) human influenza pandemic viruses. While this raised concern that the H7N9 virus was adapting to humans, the mutation was not sufficient to switch the receptor specificity of H7N9, and has not resulted in sustained transmission in humans. To determine if the H7 HA was capable of acquiring human-type receptor specificity, we conducted mutation analyses. Remarkably, three amino acid mutations conferred a switch in specificity for human-type receptors that resembled the specificity of the 2009 human H1 pandemic virus, and promoted binding to human trachea epithelial cells.

  2. Three mutations switch H7N9 influenza to human-type receptor specificity.

    Directory of Open Access Journals (Sweden)

    Robert P de Vries

    2017-06-01

    Full Text Available The avian H7N9 influenza outbreak in 2013 resulted from an unprecedented incidence of influenza transmission to humans from infected poultry. The majority of human H7N9 isolates contained a hemagglutinin (HA) mutation (Q226L) that has previously been associated with a switch in receptor specificity from avian-type (NeuAcα2-3Gal) to human-type (NeuAcα2-6Gal), as documented for the avian progenitors of the 1957 (H2N2) and 1968 (H3N2) human influenza pandemic viruses. While this raised concern that the H7N9 virus was adapting to humans, the mutation was not sufficient to switch the receptor specificity of H7N9, and has not resulted in sustained transmission in humans. To determine if the H7 HA was capable of acquiring human-type receptor specificity, we conducted mutation analyses. Remarkably, three amino acid mutations conferred a switch in specificity for human-type receptors that resembled the specificity of the 2009 human H1 pandemic virus, and promoted binding to human trachea epithelial cells.

  3. Cardiac fibroblast transcriptome analyses support a role for interferogenic, profibrotic, and inflammatory genes in anti-SSA/Ro-associated congenital heart block.

    Science.gov (United States)

    Clancy, Robert M; Markham, Androo J; Jackson, Tanisha; Rasmussen, Sara E; Blumenberg, Miroslav; Buyon, Jill P

    2017-09-01

    The signature lesion of SSA/Ro autoantibody-associated congenital heart block (CHB) is fibrosis and a macrophage infiltrate, supporting an experimental focus on cues influencing the fibroblast component. The transcriptomes of human fetal cardiac fibroblasts were analyzed using two complementary approaches. Cardiac injury conditions were simulated in vitro by incubating human fetal cardiac fibroblasts with supernatants from macrophages transfected with the SSA/Ro-associated noncoding Y ssRNA. The top 10 upregulated transcripts in the stimulated fibroblasts reflected a type I interferon (IFN) response [e.g., IFN-induced protein 44-like (IFI44L), of MX dynamin-like GTPase (MX)1, MX2, and radical S-adenosyl methionine domain containing 2 (Rsad2)]. Within the fibrotic pathway, transcript levels of endothelin-1 (EDN1), phosphodiesterase (PDE)4D, chemokine (C-X-C motif) ligand (CXCL)2, and CXCL3 were upregulated, while others, including adenomedullin, RAP guanine nucleotide exchange factor 3 (RAPGEF3), tissue inhibitor of metalloproteinase (TIMP)1, TIMP3, and dual specificity phosphatase 1, were downregulated. Agnostic Database for Annotation, Visualization and Integrated Discovery analysis revealed a significant increase in inflammatory genes, including complement C3A receptor 1 (C3AR1), F2R-like thrombin/trypsin receptor 3, and neutrophil cytosolic factor 2. In addition, stimulated fibroblasts expressed high levels of phospho-MADS box transcription enhancer factor 2 [a substrate of MAPK5 (ERK5)], which was inhibited by BIX-02189, a specific inhibitor of ERK5. Translation to human disease leveraged an unprecedented opportunity to interrogate the transcriptome of fibroblasts freshly isolated and cell sorted without stimulation from a fetal heart with CHB and a matched healthy heart. Consistent with the in vitro data, five IFN response genes were among the top 10 most highly expressed transcripts in CHB fibroblasts. In addition, the expression of matrix-related genes

  4. A comprehensive database of the geographic spread of past human Ebola outbreaks.

    Science.gov (United States)

    Mylne, Adrian; Brady, Oliver J; Huang, Zhi; Pigott, David M; Golding, Nick; Kraemer, Moritz U G; Hay, Simon I

    2014-01-01

    Ebola is a zoonotic filovirus that has the potential to cause outbreaks of variable magnitude in human populations. This database collates our existing knowledge of all known human outbreaks of Ebola for the first time by extracting details of their suspected zoonotic origin and subsequent human-to-human spread from a range of published and non-published sources. In total, 22 unique Ebola outbreaks were identified, composed of 117 unique geographic transmission clusters. Details of the index case and geographic spread of secondary and imported cases were recorded as well as summaries of patient numbers and case fatality rates. A brief text summary describing suspected routes and means of spread for each outbreak was also included. While we cannot yet include the ongoing Guinea and DRC outbreaks until they are over, these data and compiled maps can be used to gain an improved understanding of the initial spread of past Ebola outbreaks and help evaluate surveillance and control guidelines for limiting the spread of future epidemics.

  5. Development of a reference database for assessing dietary nitrate in vegetables.

    Science.gov (United States)

    Blekkenhorst, Lauren C; Prince, Richard L; Ward, Natalie C; Croft, Kevin D; Lewis, Joshua R; Devine, Amanda; Shinde, Sujata; Woodman, Richard J; Hodgson, Jonathan M; Bondonno, Catherine P

    2017-08-01

    Nitrate from vegetables improves vascular health with short-term intake. Whether this translates into improved long-term health outcomes has yet to be investigated. To enable reliable analysis of nitrate intake from food records, there is a strong need for a comprehensive nitrate content of vegetables database. A systematic literature search (1980-2016) was performed using Medline, Agricola and Commonwealth Agricultural Bureaux abstracts databases. The nitrate content of vegetables database contains 4237 records from 255 publications with data on 178 vegetables and 22 herbs and spices. The nitrate content of individual vegetables ranged from Chinese flat cabbage (median; range: 4240; 3004-6310 mg/kg FW) to corn (median; range: 12; 5-1091 mg/kg FW). The database was applied to estimate vegetable nitrate intake using 24-h dietary recalls (24-HDRs) and food frequency questionnaires (FFQs). Significant correlations were observed between urinary nitrate excretion and 24-HDR (r = 0.4, P = 0.013), between 24-HDR and 12 month FFQs (r = 0.5, P vegetables. It can be applied to dietary records to explore the associations between nitrate intake and health outcomes in human studies. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  6. Gene expression and functional annotation of the human and mouse choroid plexus epithelium.

    Directory of Open Access Journals (Sweden)

    Sarah F Janssen

    Full Text Available BACKGROUND: The choroid plexus epithelium (CPE) is a lobed neuro-epithelial structure that forms the outer blood-brain barrier. The CPE protrudes into the brain ventricles and produces the cerebrospinal fluid (CSF), which is crucial for brain homeostasis. Malfunction of the CPE is possibly implicated in disorders like Alzheimer disease, hydrocephalus or glaucoma. To study human genetic diseases and potential new therapies, mouse models are widely used. This requires a detailed knowledge of similarities and differences in gene expression and functional annotation between the species. The aim of this study is to analyze and compare gene expression and functional annotation of healthy human and mouse CPE. METHODS: We performed 44k Agilent microarray hybridizations with RNA derived from laser dissected healthy human and mouse CPE cells. We functionally annotated and compared the gene expression data of human and mouse CPE using the knowledge database Ingenuity. We searched for common and species-specific gene expression patterns and functions between human and mouse CPE. We also made a comparison with previously published CPE human and mouse gene expression data. RESULTS: Overall, the human and mouse CPE transcriptomes are very similar. Their major functionalities included epithelial junctions, transport, energy production, neuro-endocrine signaling, as well as immunological, neurological and hematological functions and disorders. The mouse CPE presented two additional functions not found in the human CPE: carbohydrate metabolism and a more extensive list of (neural) developmental functions. We found three genes specifically expressed in the mouse CPE compared to human CPE (ACE, PON1 and TRIM3) and no genes specifically expressed in the human CPE compared to mouse CPE. CONCLUSION: Human and mouse CPE transcriptomes are very similar, and display many common functionalities. Nonetheless, we also identified a few genes and pathways which suggest that the CPE

  7. Transcriptome and proteome exploration to provide a resource for the study of Agrocybe aegerita.

    Directory of Open Access Journals (Sweden)

    Man Wang

    Full Text Available BACKGROUND: Agrocybe aegerita, the black poplar mushroom, has been highly valued as a functional food for its medicinal and nutritional benefits. Several bioactive extracts from A. aegerita have been found to exhibit antitumor and antioxidant activities. However, limited genetic resources for A. aegerita have hindered exploration of this species. METHODOLOGY/PRINCIPAL FINDINGS: To facilitate research on A. aegerita, we established a deep survey of the transcriptome and proteome of this mushroom. We applied high-throughput sequencing technology (Illumina) to sequence A. aegerita transcriptomes from mycelium and fruiting body. The raw clean reads were de novo assembled into a total of 36,134 expressed sequence tags (ESTs) with an average length of 663 bp. These ESTs were annotated and classified according to Gene Ontology (GO), Clusters of Orthologous Groups (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways. Gene expression profile analysis showed that 18,474 ESTs were differentially expressed, with 10,131 up-regulated in mycelium and 8,343 up-regulated in fruiting body. Putative genes involved in polysaccharide and steroid biosynthesis were identified from the A. aegerita transcriptome, and these genes were differentially expressed at the two stages of A. aegerita. Based on one-dimensional gel electrophoresis (1-DGE) coupled with electrospray ionization liquid chromatography tandem MS (LC-ESI-MS/MS), we identified a total of 309 non-redundant proteins. Many metabolic enzymes involved in glycolysis were also identified in the protein database. CONCLUSIONS/SIGNIFICANCE: This is the first study on transcriptome and proteome analyses of A. aegerita. The data in this study serve as a resource of A. aegerita transcripts and proteins, and offer clues to the applications of this mushroom in nutrition, pharmacy and industry.

  8. De novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome to identify putative genes involved in the aquatic adaptation and immune response.

    Directory of Open Access Journals (Sweden)

    Duan Gui

    Full Text Available BACKGROUND: The Indo-Pacific humpback dolphin (Sousa chinensis), a marine mammal species inhabiting the waters of Southeast Asia, South Africa and Australia, has attracted much attention because of the dramatic decline in population size in the past decades, which raises the concern of extinction. So far, this species is poorly characterized at the molecular level because little sequence information is available in public databases. Recent advances in large-scale RNA sequencing provide an efficient approach to generate abundant sequences for functional genomic analyses in species with un-sequenced genomes. PRINCIPAL FINDINGS: We performed a de novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome by Illumina sequencing. 108,751 high quality sequences from 47,840,388 paired-end reads were generated, and 48,868 and 46,587 unigenes were functionally annotated by BLAST search against the NCBI non-redundant and Swiss-Prot protein databases (E-value < 10^-5), respectively. In total, 16,467 unigenes were clustered into 25 functional categories by searching against the COG database, and BLAST2GO search assigned 37,976 unigenes to 61 GO terms. In addition, 36,345 unigenes were grouped into 258 KEGG pathways. We also identified 9,906 simple sequence repeats and 3,681 putative single nucleotide polymorphisms as potential molecular markers in our assembled sequences. A large number of unigenes were predicted to be involved in immune response, and many genes were predicted to be relevant to adaptive evolution and cetacean-specific traits. CONCLUSION: This study represents the first transcriptome analysis of the Indo-Pacific humpback dolphin, an endangered species. The de novo transcriptome analysis of the unique transcripts will provide valuable sequence information for discovery of new genes, characterization of gene expression, investigation of various pathways and adaptive evolution, as well as identification of genetic markers.
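
    The E-value cutoff used for annotation can be pictured with a minimal sketch. It assumes BLAST hits exported in the standard 12-column tabular format, in which the E-value is the eleventh field; the abstract does not say which output format the authors used, and the function name and the two example rows are hypothetical.

```python
def filter_hits(lines, max_evalue=1e-5):
    """Keep (query, subject, evalue) for tabular BLAST hits below the cutoff."""
    kept = []
    for line in lines:
        # Skip blank lines and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        evalue = float(fields[10])  # eleventh column holds the E-value
        if evalue < max_evalue:
            kept.append((fields[0], fields[1], evalue))
    return kept


# Two invented hits: one strong match and one weak match.
example_rows = [
    "unigene_1\thypothetical_subject_A\t98.0\t200\t4\t0\t1\t200\t1\t200\t1e-50\t380",
    "unigene_2\thypothetical_subject_B\t35.0\t60\t39\t0\t1\t60\t1\t60\t0.002\t32",
]
for query, subject, evalue in filter_hits(example_rows):
    print(query, subject, evalue)
```

    Only the first invented hit passes the filter, since 0.002 is well above the 10^-5 threshold quoted in the abstract.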

  9. Transcriptome complexity in a genome-reduced bacterium

    DEFF Research Database (Denmark)

    Güell, Marc; van Noort, Vera; Yus, Eva

    2009-01-01

    To study basic principles of transcriptome organization in bacteria, we analyzed one of the smallest self-replicating organisms, Mycoplasma pneumoniae. We combined strand-specific tiling arrays, complemented by transcriptome sequencing, with more than 252 spotted arrays. We detected 117 previousl...

  10. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants.

    Science.gov (United States)

    Li, Xinguo; Wu, Harry X; Southerton, Simon G

    2010-06-21

    Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution.

  11. Detailed tail proteomic analysis of axolotl (Ambystoma mexicanum) using an mRNA-seq reference database.

    Science.gov (United States)

    Demircan, Turan; Keskin, Ilknur; Dumlu, Seda Nilgün; Aytürk, Nilüfer; Avşaroğlu, Mahmut Erhan; Akgün, Emel; Öztürk, Gürkan; Baykal, Ahmet Tarık

    2017-01-01

    The axolotl, a salamander, has been emerging as an important model for stem cell research due to its powerful regenerative capacity. Several advantages, such as its high capacity for advanced tissue, organ, and appendage regeneration, promote the axolotl as an ideal model system to extend our current understanding of the mechanisms of regeneration. Acknowledging the common molecular pathways between amphibians and mammals, there is great potential to translate findings from axolotl research to mammalian studies. However, the utilization of the axolotl is hindered by the lack of reference databases of genomic, transcriptomic, and proteomic data. Here, we introduce the proteome analysis of the axolotl tail section searched against an mRNA-seq database. We translated axolotl mRNA sequences to protein sequences and annotated these to process the LC-MS/MS data and identified 1001 nonredundant proteins. Functional classification of identified proteins was performed by gene ontology searches. The presence of some of the identified proteins was validated by in situ antibody labeling. Furthermore, we have analyzed the proteome expressional changes postamputation at three time points to evaluate the underlying mechanisms of the regeneration process. Taken together, this work expands the proteomics data of the axolotl to contribute to its establishment as a fully utilized model. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Structure of gene and pseudogenes of human apoferritin H

    Energy Technology Data Exchange (ETDEWEB)

    Costanzo, F; Colombo, M; Staempfli, S; Santoro, C; Marone, M; Frank, K; Delius, H; Cortese, R

    1986-01-24

    Ferritin is composed of two subunits, H and L. cDNAs coding for these proteins from human liver, lymphocytes and from the monocyte-like cell line U937 have been cloned and sequenced. Southern blot analysis on total human DNA reveals that there are many DNA segments hybridizing to the apoferritin H and L cDNA probes. In view of the tissue heterogeneity of ferritin molecules, it appeared possible that apoferritin molecules could be coded by a family of genes differentially expressed in various tissues. In this paper, the authors describe the cloning and sequencing of the gene coding for human apoferritin H. This gene has three introns; the exon sequence is identical to that of cDNAs isolated from human liver, lymphocytes, HeLa cells and endothelial cells. In addition they show that at least 15 intronless pseudogenes exist, with features suggesting that they originated by reverse transcription and insertion. On the basis of these results they conclude that only one gene is responsible for the synthesis of the majority of apoferritin H mRNA in the various tissues examined, and that probably all the other DNA segments hybridizing with apoferritin cDNA are pseudogenes.

  13. The de novo transcriptome and its analysis in the worldwide vegetable pest, Delia antiqua (Diptera: Anthomyiidae).

    Science.gov (United States)

    Zhang, Yu-Juan; Hao, Youjin; Si, Fengling; Ren, Shuang; Hu, Ganyu; Shen, Li; Chen, Bin

    2014-03-10

    The onion maggot Delia antiqua is a major insect pest of cultivated vegetables, especially the onion, and a good model to investigate the molecular mechanisms of diapause. To better understand the biology and diapause mechanism of the insect pest species, D. antiqua, the transcriptome was sequenced using Illumina paired-end sequencing technology. Approximately 54 million reads were obtained, trimmed, and assembled into 29,659 unigenes, with an average length of 607 bp and an N50 of 818 bp. Among these unigenes, 21,605 (72.8%) were annotated in the public databases. All unigenes were then compared against Drosophila melanogaster and Anopheles gambiae. Codon usage bias was analyzed and 332 simple sequence repeats (SSRs) were detected in this organism. These data represent the most comprehensive transcriptomic resource currently available for D. antiqua and will facilitate the study of genetics, genomics, diapause, and further pest control of D. antiqua. Copyright © 2014 Zhang et al.
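
    For readers unfamiliar with the N50 statistic quoted alongside the average length, the sketch below computes it under the usual definition: the length L such that sequences of length L or longer contain at least half of the total assembled bases. The toy lengths and the function name are hypothetical; only the two annotation counts in the last lines come from the abstract.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the running sum reaches half the total is the N50.
        if running >= half_total:
            return length


# Hypothetical toy assembly, not data from the study.
toy_lengths = [200, 300, 400, 500, 600, 1000]
print(n50(toy_lengths), sum(toy_lengths) / len(toy_lengths))

# Annotated fraction recomputed from the counts given in the abstract.
print(f"{21605 / 29659:.1%}")
```

    In the toy assembly the N50 exceeds the mean length, the same direction as the 818 bp versus 607 bp reported above, because a few long sequences carry a large share of the bases.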

  14. Uncovering the Complex Transcriptome Response of Mytilus chilensis against Saxitoxin: Implications of Harmful Algal Blooms on Mussel Populations

    Science.gov (United States)

    Detree, Camille; Núñez-Acuña, Gustavo; Roberts, Steven; Gallardo-Escárate, Cristian

    2016-01-01

    Saxitoxin (STX), a principal phycotoxin contributing to paralytic shellfish poisoning, is largely produced by marine microalgae of the genus Alexandrium. This toxin affects a wide range of species, inducing massive deaths in fish and other marine species. However, marine bivalves can resist and accumulate paralytic shellfish poisons. Despite numerous studies on the impact of STX in marine bivalves, knowledge regarding STX recognition at the molecular level by benthic species remains scarce. Therefore, the aim of this study was to identify novel genes that interact with STX in the Chilean mussel Mytilus chilensis. For this, RNA-seq and RT-qPCR approaches were used to evaluate the transcriptomic response of M. chilensis to a purified STX as well as to in vivo Alexandrium catenella exposure. Approximately 800 million reads were assembled, generating 138,883 contigs that were blasted against the UniProt Mollusca database. Pattern Recognition Receptors (PRRs) involved in mussel immunity, such as Toll-like receptors, tumor necrosis factor receptors, and scavenger-like receptors were found to be strongly upregulated at 8 and 16 h post-STX injection. These results suggest an involvement of PRRs in the response to STX and identify potential novel STX-interacting receptors in this Chilean mussel. This study is the first transcriptomic overview of the STX response in the edible species M. chilensis. However, the most significant contribution of this work is the identification of immune receptors and pathways potentially involved in the recognition of and defense against STX toxicity, and in the impact of harmful algal blooms on wild and cultivated mussel populations. PMID:27764234

  15. Time-resolved transcriptome and proteome landscape of human regulatory T cell (Treg) differentiation reveals novel regulators of FOXP3

    KAUST Repository

    Schmidt, Angelika

    2018-04-27

    Background: Regulatory T cells (Tregs) expressing the transcription factor FOXP3 are crucial mediators of self-tolerance, preventing autoimmune diseases but possibly hampering tumor rejection. Clinical manipulation of Tregs is of great interest, and first-in-man trials of Treg transfer have achieved promising outcomes. Yet, the mechanisms governing induced Treg (iTreg) differentiation and the regulation of FOXP3 are incompletely understood. Results: To gain a comprehensive and unbiased molecular understanding of FOXP3 induction, we performed time-series RNA sequencing (RNA-Seq) and proteomics profiling on the same samples during human iTreg differentiation. To enable the broad analysis of universal FOXP3-inducing pathways, we used five differentiation protocols in parallel. Integrative analysis of the transcriptome and proteome confirmed involvement of specific molecular processes, as well as overlap of a novel iTreg subnetwork with known Treg regulators and autoimmunity-associated genes. Importantly, we propose 37 novel molecules putatively involved in iTreg differentiation. Their relevance was validated by a targeted shRNA screen confirming a functional role in FOXP3 induction, discriminant analyses classifying iTregs accordingly, and comparable expression in an independent novel iTreg RNA-Seq dataset. Conclusion: The data generated by this novel approach facilitates understanding of the molecular mechanisms underlying iTreg generation as well as of the concomitant changes in the transcriptome and proteome. Our results provide a reference map exploitable for future discovery of markers and drug candidates governing control of Tregs, which has important implications for the treatment of cancer, autoimmune, and inflammatory diseases.

  16. Human events reference for ATHEANA (HERA) database description and preliminary user's manual

    Energy Technology Data Exchange (ETDEWEB)

    Auflick, J.L.; Hahn, H.A.; Pond, D.J.

    1998-05-27

    The Technique for Human Error Analysis (ATHEANA) is a newly developed human reliability analysis (HRA) methodology that aims to facilitate better representation and integration of human performance into probabilistic risk assessment (PRA) modeling and quantification by analyzing risk-significant operating experience in the context of existing behavioral science models. The fundamental premise of ATHEANA is that error-forcing contexts (EFCs), which refer to combinations of equipment/material conditions and performance shaping factors (PSFs), set up or create the conditions under which unsafe actions (UAs) can occur. Because ATHEANA relies heavily on the analysis of operational events that have already occurred as a mechanism for generating creative thinking about possible EFCs, a database, called the Human Events Reference for ATHEANA (HERA), has been developed to support the methodology. This report documents the initial development efforts for HERA.

  17. The first Chameleon transcriptome: comparative genomic analysis of the OXPHOS system reveals loss of COX8 in Iguanian lizards.

    Science.gov (United States)

    Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

    2013-01-01

    Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the 86 known human structural nDNA-encoded OXPHOS subunits (including isoforms). Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system.

  18. MortalityPredictors.org: a manually-curated database of published biomarkers of human all-cause mortality.

    Science.gov (United States)

    Peto, Maximus V; De la Guardia, Carlos; Winslow, Ksenia; Ho, Andrew; Fortney, Kristen; Morgen, Eric

    2017-08-31

    Biomarkers of all-cause mortality are of tremendous clinical and research interest. Because of the long potential duration of prospective human lifespan studies, such biomarkers can play a key role in quantifying human aging and quickly evaluating any potential therapies. Decades of research into mortality biomarkers have resulted in numerous associations documented across hundreds of publications. Here, we present MortalityPredictors.org, a manually-curated, publicly accessible database, housing published, statistically-significant relationships between biomarkers and all-cause mortality in population-based or generally healthy samples. To gather the information for this database, we searched PubMed for appropriate research papers and then manually curated relevant data from each paper. We manually curated 1,576 biomarker associations, involving 471 distinct biomarkers. Biomarkers ranged in type from hematologic (red blood cell distribution width) to molecular (DNA methylation changes) to physical (grip strength). Via the web interface, the resulting data can be easily browsed, searched, and downloaded for further analysis. MortalityPredictors.org provides comprehensive results on published biomarkers of human all-cause mortality that can be used to compare biomarkers, facilitate meta-analysis, assist with the experimental design of aging studies, and serve as a central resource for analysis. We hope that it will facilitate future research into human mortality and aging.

  19. Transcriptomic analysis of human retinal detachment reveals both inflammatory response and photoreceptor death.

    Directory of Open Access Journals (Sweden)

    Marie-Noëlle Delyfer

    Full Text Available BACKGROUND: Retinal detachment often leads to a severe and permanent loss of vision and its therapeutic management remains to this day exclusively surgical. We have used surgical specimens to perform a differential analysis of the transcriptome of human retinal tissues following detachment in order to identify new potential pharmacological targets that could be used in combination with surgery to further improve final outcome. METHODOLOGY/PRINCIPAL FINDINGS: Statistical analysis reveals major involvement of the immune response in the disease. Interestingly, using a novel approach relying on coordinated expression, the interindividual variation was monitored to unravel a second crucial aspect of the pathological process: the death of photoreceptor cells. Among the genes identified, the expression of the major histocompatibility complex I gene HLA-C enables diagnosis of the disease, while PKD2L1 and SLCO4A1 (both down-regulated) act synergistically to provide an estimate of the duration of the retinal detachment process. Our analysis thus reveals the two complementary cellular and molecular aspects linked to retinal detachment: an immune response and the degeneration of photoreceptor cells. We also reveal that the human specimens have a higher clinical value than artificial models, which point to IL6 and oxidative stress, neither of which is implicated in the surgical specimens studied here. CONCLUSIONS/SIGNIFICANCE: This systematic analysis confirmed the occurrence of both neurodegeneration and inflammation during retinal detachment, and further identifies precisely the modification of expression of the different genes implicated in these two phenomena. Our data give new insight into the disease process and provide a rationale for therapeutic strategies aimed at limiting inflammation and photoreceptor damage associated with retinal detachment and, in turn, improving visual prognosis after retinal surgery.

  20. Transcriptomic insights on the ABC transporter gene family in the salmon louse Caligus rogercresseyi.

    Science.gov (United States)

    Valenzuela-Muñoz, Valentina; Sturm, Armin; Gallardo-Escárate, Cristian

    2015-04-09

    The ATP-binding cassette (ABC) protein family comprises membrane proteins involved in the transport of various biomolecules through the cellular membrane. These proteins have been identified in all taxa and perform important physiological functions, including insecticide detoxification in arthropods. For that reason the ectoparasite Caligus rogercresseyi represents a model species for understanding the molecular underpinnings of insecticide drug resistance. Illumina sequencing was performed using sea lice exposed to 2 and 3 ppb of deltamethrin and azamethiphos. Contigs obtained from de novo assembly were annotated by Blastx. RNA-Seq analysis was performed and validated by qPCR analysis. From the transcriptome database of C. rogercresseyi, 57 putative ABC protein sequences were identified and phylogenetically classified into the eight subfamilies described for ABC transporters in arthropods. Transcriptomic profiles for ABC protein subfamilies were evaluated throughout C. rogercresseyi development. Moreover, RNA-Seq analysis was performed for adult male and female salmon lice exposed to the delousing drugs azamethiphos and deltamethrin. High transcript levels of the ABCB and ABCC subfamilies were evidenced. Furthermore, SNP mining was carried out for the ABC protein sequences, revealing pivotal genomic information. The present study gives a comprehensive transcriptome analysis of ABC proteins from C. rogercresseyi, providing relevant information about transporter roles during ontogeny and in relation to delousing drug responses in salmon lice. This genomic information represents a valuable tool for pest management in the Chilean salmon aquaculture industry.