WorldWideScience

Sample records for secretome database integrated

  1. HCSD: the human cancer secretome database

    DEFF Research Database (Denmark)

    Feizi, Amir; Banaei-Esfahani, Amir; Nielsen, Jens

    2015-01-01

    The human cancer secretome database (HCSD) is a comprehensive database for human cancer secretome data. The cancer secretome describes proteins secreted by cancer cells and structuring information about the cancer secretome will enable further analysis of how this is related with tumor biology...... database is limiting the ability to query the increasing community knowledge. We therefore developed the Human Cancer Secretome Database (HCSD) to fulfil this gap. HCSD contains >80 000 measurements for about 7000 nonredundant human proteins collected from up to 35 high-throughput studies on 17 cancer...

  2. VerSeDa: vertebrate secretome database.

    Science.gov (United States)

    Cortazar, Ana R; Oguiza, José A; Aransay, Ana M; Lavín, José L

    2017-01-01

    Based on the current tools, de novo secretome (full set of proteins secreted by an organism) prediction is a time consuming bioinformatic task that requires a multifactorial analysis in order to obtain reliable in silico predictions. Hence, to accelerate this process and offer researchers a reliable repository where secretome information can be obtained for vertebrates and model organisms, we have developed VerSeDa (Vertebrate Secretome Database). This freely available database stores information about proteins that are predicted to be secreted through the classical and non-classical mechanisms, for the wide range of vertebrate species deposited at the NCBI, UCSC and ENSEMBL sites. To our knowledge, VerSeDa is the only state-of-the-art database designed to store secretome data from multiple vertebrate genomes, thus, saving an important amount of time spent in the prediction of protein features that can be retrieved from this repository directly. VerSeDa is freely available at http://genomics.cicbiogune.es/VerSeDa/index.php. © The Author(s) 2017. Published by Oxford University Press.

  3. Creation of a Human Secretome: A Novel Composite Library of Human Secreted Proteins: Validation Using Ovarian Cancer Gene Expression Data and a Virtual Secretome Array.

    Science.gov (United States)

    Vathipadiekal, Vinod; Wang, Victoria; Wei, Wei; Waldron, Levi; Drapkin, Ronny; Gillette, Michael; Skates, Steven; Birrer, Michael

    2015-11-01

    To generate a comprehensive "Secretome" of proteins potentially found in the blood and derive a virtual Affymetrix array. To validate the utility of this database for the discovery of novel serum-based biomarkers using ovarian cancer transcriptomic data. The secretome was constructed by aggregating the data from databases of known secreted proteins, transmembrane or membrane proteins, signal peptides, G-protein coupled receptors, or proteins existing in the extracellular region, and the virtual array was generated by mapping them to Affymetrix probeset identifiers. Whole-genome microarray data from ovarian cancer, normal ovarian surface epithelium, and fallopian tube epithelium were used to identify transcripts upregulated in ovarian cancer. We established the secretome from eight public databases and a virtual array consisting of 16,521 Affymetrix U133 Plus 2.0 probesets. Using ovarian cancer transcriptomic data, we identified candidate blood-based biomarkers for ovarian cancer and performed bioinformatic validation by demonstrating rediscovery of known biomarkers including CA125 and HE4. Two novel top biomarkers (FGF18 and GPR172A) were validated in serum samples from an independent patient cohort. We present the secretome, comprising the most comprehensive resource available for protein products that are potentially found in the blood. The associated virtual array can be used to translate gene-expression data into cancer biomarker discovery. A list of blood-based biomarkers for ovarian cancer detection is reported and includes CA125 and HE4. FGF18 and GPR172A were identified and validated by ELISA as being differentially expressed in the serum of ovarian cancer patients compared with controls. ©2015 American Association for Cancer Research.

  4. Mononuclear cell secretome protects from experimental autoimmune myocarditis.

    Science.gov (United States)

    Hoetzenecker, Konrad; Zimmermann, Matthias; Hoetzenecker, Wolfram; Schweiger, Thomas; Kollmann, Dagmar; Mildner, Michael; Hegedus, Balazs; Mitterbauer, Andreas; Hacker, Stefan; Birner, Peter; Gabriel, Christian; Gyöngyösi, Mariann; Blyszczuk, Przemyslaw; Eriksson, Urs; Ankersmit, Hendrik Jan

    2015-03-14

    Supernatants of serum-free cultured mononuclear cells (MNC) contain a mix of immunomodulating factors (secretome), which have been shown to attenuate detrimental inflammatory responses following myocardial ischaemia. Inflammatory dilated cardiomyopathy (iDCM) is a common cause of heart failure in young patients. Experimental autoimmune myocarditis (EAM) is a CD4+ T cell-dependent model, which mirrors important pathogenic aspects of iDCM. The aim of this study was to determine the influence of MNC secretome on myocardial inflammation in the EAM model. BALB/c mice were immunized twice with an alpha myosin heavy chain peptide together with Complete Freund adjuvant. Supernatants from mouse mononuclear cells were collected, dialysed, and injected i.p. at Day 0, Day 7, or Day 14, respectively. Myocarditis severity, T cell responses, and autoantibody formation were assessed at Day 21. The impact of MNC secretome on CD4+ T cell function and viability was evaluated using in vitro proliferation and cell viability assays. A single high-dose application of MNC secretome, injected at Day 14 after the first immunization, effectively attenuated myocardial inflammation. Mechanistically, MNC secretome induced caspase-8-dependent apoptosis in autoreactive CD4+ T cells. MNC secretome abrogated myocardial inflammation in a CD4+ T cell-dependent animal model of autoimmune myocarditis. This anti-inflammatory effect of MNC secretome suggests a novel and simple potential treatment concept for inflammatory heart diseases. © The Author 2013. Published by Oxford University Press on behalf of the European Society of Cardiology.

  5. Proteomic Investigation of Rhizoctonia solani AG 4 Identifies Secretome and Mycelial Proteins with roles in Plant Cell Wall Degradation and Virulence

    KAUST Repository

    Lakshman, Dilip; Roberts, Daniel P.; Garrett, Wesley M.; Natarajan, Savithiry S.; Darwish, Omar; Alkharouf, Nadim; Pain, Arnab; Khan, Farooq; Jambhulkar, Prashant P.; Mitra, Amitava

    2016-01-01

    Rhizoctonia solani AG 4 is a soilborne necrotrophic fungal plant pathogen that causes economically important diseases on agronomic crops worldwide. Here we used a proteomics approach to characterize both intracellular proteins and the secretome of R. solani AG 4 isolate Rs23A under several growth conditions; the secretome being highly important in pathogenesis. From over 500 total secretome and soluble intracellular protein spots from 2-D gels, 457 protein spots were analyzed and 318 proteins positively matched with fungal proteins of known function by comparison with available R. solani genome databases specific for anastomosis groups 1-IA, 1-IB, and 3. These proteins were categorized to possible cellular locations and functional groups; and for some proteins their putative roles in plant cell wall degradation and virulence. The majority of the secreted proteins were grouped to extracellular regions and contain hydrolase activity.

  6. Proteomic Investigation of Rhizoctonia solani AG 4 Identifies Secretome and Mycelial Proteins with roles in Plant Cell Wall Degradation and Virulence

    KAUST Repository

    Lakshman, Dilip

    2016-03-28

    Rhizoctonia solani AG 4 is a soilborne necrotrophic fungal plant pathogen that causes economically important diseases on agronomic crops worldwide. Here we used a proteomics approach to characterize both intracellular proteins and the secretome of R. solani AG 4 isolate Rs23A under several growth conditions; the secretome being highly important in pathogenesis. From over 500 total secretome and soluble intracellular protein spots from 2-D gels, 457 protein spots were analyzed and 318 proteins positively matched with fungal proteins of known function by comparison with available R. solani genome databases specific for anastomosis groups 1-IA, 1-IB, and 3. These proteins were categorized to possible cellular locations and functional groups; and for some proteins their putative roles in plant cell wall degradation and virulence. The majority of the secreted proteins were grouped to extracellular regions and contain hydrolase activity.

  7. Investigation of the indigenous fungal community populating barley grains: Secretomes and xylanolytic potential.

    Science.gov (United States)

    Sultan, Abida; Frisvad, Jens C; Andersen, Birgit; Svensson, Birte; Finnie, Christine

    2017-10-03

    The indigenous fungal species populating cereal grains produce numerous plant cell wall-degrading enzymes including xylanases, which could play important role in plant-pathogen interactions and in adaptation of the fungi to varying carbon sources. To gain more insight into the grain surface-associated enzyme activity, members of the populating fungal community were isolated, and their secretomes and xylanolytic activities assessed. Twenty-seven different fungal species were isolated from grains of six barley cultivars over different harvest years and growing sites. The isolated fungi were grown on medium containing barley flour or wheat arabinoxylan as sole carbon source. Their secretomes and xylanase activities were analyzed using SDS-PAGE and enzyme assays and were found to vary according to species and carbon source. Secretomes were dominated by cell wall degrading enzymes with xylanases and xylanolytic enzymes being the most abundant. A 2-DE-based secretome analysis of Aspergillus niger and the less-studied pathogenic fungus Fusarium poae grown on barley flour and wheat arabinoxylan resulted in identification of 82 A. niger and 31 F. poae proteins many of which were hydrolytic enzymes, including xylanases. The microorganisms that inhabit the surface of cereal grains are specialized in production of enzymes such as xylanases, which depolymerize plant cell walls. Integration of gel-based proteomics approach with activity assays is a powerful tool for analysis and characterization of fungal secretomes and xylanolytic activities which can lead to identification of new enzymes with interesting properties, as well as provide insight into plant-fungal interactions, fungal pathogenicity and adaptation. Understanding the fungal response to host niche is of importance to uncover novel targets for potential symbionts, anti-fungal agents and biotechnical applications. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. The Ewing sarcoma secretome and its response to activation of Wnt/beta-catenin signaling.

    Science.gov (United States)

    Hawkins, Allegra G; Basrur, Venkatesha; da Veiga Leprevost, Felipe; Pedersen, Elisabeth; Sperring, Colin; Nesvizhskii, Alexey I; Lawlor, Elizabeth R

    2018-01-31

    Tumor: tumor microenvironment (TME) interactions are critical for tumor progression and the composition and structure of the local extracellular matrix (ECM) are key determinants of tumor metastasis. We recently reported that activation of Wnt/beta-catenin signaling in Ewing sarcoma cells induces widespread transcriptional changes that are associated with acquisition of a metastatic tumor phenotype. Significantly, ECM protein-encoding genes were found to be enriched among Wnt/beta-catenin induced transcripts, leading us to hypothesize that activation of canonical Wnt signaling might induce changes in the Ewing sarcoma secretome. To address this hypothesis, conditioned media from Ewing sarcoma cell lines cultured in the presence or absence of Wnt3a was collected for proteomic analysis. Label-free mass spectrometry was used to identify and quantify differentially secreted proteins. We then used in silico databases to identify only proteins annotated as secreted. Comparison of the secretomes of two Ewing sarcoma cell lines revealed numerous shared proteins, as well as a degree of heterogeneity, in both basal and Wnt-stimulated conditions. Gene set enrichment analysis of secreted proteins revealed that Wnt stimulation reproducibly resulted in increased secretion of proteins involved in ECM organization, ECM receptor interactions, and collagen formation. In particular, Wnt-stimulated Ewing sarcoma cells upregulated secretion of structural collagens, as well as matricellular proteins, such as the metastasis-associated protein, tenascin C (TNC). Interrogation of published databases confirmed reproducible correlations between Wnt/beta-catenin activation and TNC and COL1A1 expression in patient tumors. In summary, this first study of the Ewing sarcoma secretome reveals that Wnt/beta-catenin activated tumor cells upregulate secretion of ECM proteins. Such Wnt/beta-catenin mediated changes are likely to impact on tumor: TME interactions that contribute to metastatic

  9. Secretomic survey of Trichoderma harzianum grown on plant biomass substrates.

    Science.gov (United States)

    Gómez-Mendoza, Diana Paola; Junqueira, Magno; do Vale, Luis Henrique Ferreira; Domont, Gilberto Barbosa; Ferreira Filho, Edivaldo Ximenes; Sousa, Marcelo Valle de; Ricart, Carlos André Ornelas

    2014-04-04

    The present work aims at characterizing T. harzianum secretome when the fungus is grown in synthetic medium supplemented with one of the four substrates: glucose, cellulose, xylan, and sugarcane bagasse (SB). The characterization was done by enzymatic assays and proteomic analysis using 2-DE/MALDI-TOF and gel-free shotgun LC-MS/MS. The results showed that SB induced the highest cellulolytic and xylanolytic activities when compared with the other substrates, while remarkable differences in terms of number and distribution of protein spots in 2-DE gels were also observed among the samples. Additionally, treatment of the secretomes with PNGase F revealed that most spot trails in 2-DE gels corresponded to N-glycosylated proteoforms. The LC-MS/MS analysis of the samples identified 626 different protein groups, including carbohydrate-active enzymes and accessory, noncatalytic, and cell-wall-associated proteins. Although the SB-induced secretome displayed the highest cellulolytic and xylanolytic activities, it did not correspond to a higher proteome complexity because CM-cellulose-induced secretome was significantly more diverse. Among the identified proteins, 73% were exclusive to one condition, while only 5% were present in all samples. Therefore, this study disclosed the variation of T. harzianum secretome in response to different substrates and revealed the diversity of the fungus enzymatic toolbox.

  10. Secretome data from Trichoderma reesei and Aspergillus niger cultivated in submerged and sequential fermentation methods

    Directory of Open Access Journals (Sweden)

    Camila Florencio

    2016-09-01

    Full Text Available The cultivation procedure and the fungal strain applied for enzyme production may influence levels and profile of the proteins produced. The proteomic analysis data presented here provide critical information to compare proteins secreted by Trichoderma reesei and Aspergillus niger when cultivated through submerged and sequential fermentation processes, using steam-explosion sugarcane bagasse as inducer for enzyme production. The proteins were organized according to the families described in CAZy database as cellulases, hemicellulases, proteases/peptidases, cell-wall-protein, lipases, others (catalase, esterase, etc., glycoside hydrolases families, predicted and hypothetical proteins. Further detailed analysis of this data is provided in “Secretome analysis of Trichoderma reesei and Aspergillus niger cultivated by submerged and sequential fermentation process: enzyme production for sugarcane bagasse hydrolysis” C. Florencio, F.M. Cunha, A.C Badino, C.S. Farinas, E. Ximenes, M.R. Ladisch (2016 [1]. Keywords: Tricoderma reesei, Aspergillus Niger, Enzyme Production, Secretome

  11. Extending Database Integration Technology

    National Research Council Canada - National Science Library

    Buneman, Peter

    1999-01-01

    Formal approaches to the semantics of databases and database languages can have immediate and practical consequences in extending database integration technologies to include a vastly greater range...

  12. Secretome of Aspergillus oryzae in Shaoxing rice wine koji.

    Science.gov (United States)

    Zhang, Bo; Guan, Zheng-Bing; Cao, Yu; Xie, Guang-Fa; Lu, Jian

    2012-04-16

    Shaoxing rice wine is the most famous and representative Chinese rice wine. Aspergillus oryzae SU16 is used in the manufacture of koji, the Shaoxing rice wine starter culture. In the current study, a comprehensive analysis of the secretome profile of A. oryzae SU16 in Shaoxing rice wine koji was performed for the first time. The proteomic analysis for the identification of the secretory proteins was done using two-dimensional electrophoresis combined with matrix-assisted laser desorption/ionization-tandem time of flight mass spectrometry based on the annotated A. oryzae genome sequence. A total of 41 unique proteins were identified from the secretome. These proteins included 17 extracellular proteins following the classical secretory pathway, and 10 extracellular proteins putatively secreted by the non-classical secretory pathway. The present secretome profile greatly differed from previous reports on A. oryzae growing in other solid-state nutrient sources. Several new secretory or putative secretory proteins were also found. These proteomic data will significantly aid the advancement of research on the secretome of A. oryzae, especially in solid-state cultures, and in elucidating the production process mechanism of Shaoxing rice wine koji. The findings may promote the technological development and innovation of the Shaoxing rice wine industry. Copyright © 2012 Elsevier B.V. All rights reserved.

  13. The in vitro secretome of Mycosphaerella fijiensis induces cell death in banana leaves.

    Science.gov (United States)

    Chuc-Uc, José; Brito-Argáez, Ligia; Canto-Canché, Blondy; Tzec-Simá, Miguel; Rodríguez-García, Cecilia; Peraza-Echeverría, Leticia; Peraza-Echeverría, Santy; James-Kay, Andrew; Cruz-Cruz, Carlos Alberto; Peña-Rodríguez, Luis Manuel; Islas-Flores, Ignacio

    2011-06-01

    The hemibiotrophic filamentous fungus Mycosphaerella fijiensis causes the banana foliar disease known as black Sigatoka, responsible for major worldwide losses in the banana fruit industry. In this work the in vitro secretome of M. fijiensis was characterized. Native and denaturant polyacrylamide gel protease assays showed the M. fijiensis secretome contains protease activity capable of degrading gelatin. Necrotic lesions on leaves were produced by application of the in vitro secretome to the surface of one black Sigatoka-resistant banana wild species, one susceptible cultivar and the non-host plant Carica papaya. To distinguish if necrosis by the secretome is produced by phytotoxins or proteins, the latter ones were precipitated with ammonium sulfate and applied in native or denatured forms onto leaves of the same three plant species. Proteins applied in both preparations were able to produce necrotic lesions. Application of Pronase, a commercial bacterial protease suggested that the necrosis was, at least in part, caused by protease activity from the M. fijiensis secretome. The ability to cause necrotic lesions between M. fijiensis secreted- and ammonium sulfate-precipitated proteins, and purified lipophilic or hydrophilic phytotoxins, was compared. The results suggested that leaf necrosis arises from the combined action of non-host specific hydrolytic activities from the secreted proteins and the action of phytotoxins. This is the first characterization of the M. fijiensis protein secretome produced in vitro but, more importantly, it is also the first time the M. fijiensis secretome has been shown to contain virulence factors capable of causing necrosis to its natural host. Copyright © 2011 Elsevier Masson SAS. All rights reserved.

  14. Lignin depolymerization by fungal secretomes and a microbial sink

    Energy Technology Data Exchange (ETDEWEB)

    Salvachúa, Davinia; Katahira, Rui; Cleveland, Nicholas S.; Khanna, Payal; Resch, Michael G.; Black, Brenna A.; Purvine, Samuel O.; Zink, Erika M.; Prieto, Alicia; Martínez, María J.; Martínez, Angel T.; Simmons, Blake A.; Gladden, John M.; Beckham, Gregg T.

    2016-08-25

    In Nature, powerful oxidative enzymes secreted by white rot fungi and some bacteria catalyze lignin depolymerization and some microbes are able to catabolize the resulting aromatic compounds as carbon and energy sources. Taken together, these two processes offer a potential route for microbial valorization of lignin. However, many challenges remain in realizing this concept, including that oxidative enzymes responsible for lignin depolymerization also catalyze polymerization of low molecular weight (LMW) lignin. Here, multiple basidiomycete secretomes were screened for ligninolytic enzyme activities in the presence of a residual lignin solid stream from a corn stover biorefinery, dubbed DMR-EH (Deacetylation, Mechanical Refining, and Enzymatic Hydrolysis) lignin. Two selected fungal secretomes, with high levels of laccases and peroxidases, were utilized for DMR-EH lignin depolymerization assays. The secretome from Pleurotus eryngii, which exhibited the highest laccase activity, reduced the lignin average molecular weight by 63% and 75% at pH 7 compared to the Mw of the control treated at the same conditions and the initial DMR-EH lignin, respectively, and was applied in further depolymerization assays as a function of time. As repolymerization was observed after 3 days of incubation, an aromatic-catabolic microbe (Pseudomonas putida KT2440) was incubated with the fungal secretome and DMR-EH lignin. These experiments demonstrated that the presence of the bacterium enhances lignin depolymerization, likely due to bacterial catabolism of LMW lignin, which may partially prevent repolymerization. In addition, proteomics was also applied to the P. eryngii secretome to identify the enzymes present in the fungal cocktail utilized for the depolymerization assays, which highlighted a significant number of glucose/ methanol/choline (GMC) oxidoreductases and laccases. Overall, this study demonstrates that ligninolytic enzymes can be used to partially depolymerize a solid, high

  15. Comparative secretome analysis of rat stomach under different nutritional status

    Directory of Open Access Journals (Sweden)

    Lucia L. Senin

    2015-06-01

    Full Text Available The fact that gastric surgery is at the moment the most effective treatment to fight against obesity highlights the relevance of gastric derived proteins as potential targets to treat this pathology. Taking advantage of a previously established gastric explant model for endocrine studies, the proteomic analysis of gastric secretome was performed. To validate this gastric explant system for proteomic analysis, the identification of ghrelin, a classical gastric derived peptide, was performed by MS. In addition, the differential analysis of gastric secretomes under differential nutritional status (control feeding vs fasting vs re-feeding was performed. The MS identified proteins are showed in the present manuscript. The data supplied in this article is related to the research article entitled “Comparative secretome analysis of rat stomach under different nutritional status” [1].

  16. A Database Integrity Pattern Language

    Directory of Open Access Journals (Sweden)

    Octavian Paul ROTARU

    2004-08-01

    Full Text Available Patterns and Pattern Languages are ways to capture experience and make it re-usable for others, and describe best practices and good designs. Patterns are solutions to recurrent problems.This paper addresses the database integrity problems from a pattern perspective. Even if the number of vendors of database management systems is quite high, the number of available solutions to integrity problems is limited. They all learned from the past experience applying the same solutions over and over again.The solutions to avoid integrity threats applied to in database management systems (DBMS can be formalized as a pattern language. Constraints, transactions, locks, etc, are recurrent integrity solutions to integrity threats and therefore they should be treated accordingly, as patterns.

  17. Comparative secretome analysis suggests low plant cell wall degrading capacity in Frankia symbionts

    Directory of Open Access Journals (Sweden)

    Normand Philippe

    2008-01-01

    Full Text Available Abstract Background Frankia sp. strains, the nitrogen-fixing facultative endosymbionts of actinorhizal plants, have long been proposed to secrete hydrolytic enzymes such as cellulases, pectinases, and proteases that may contribute to plant root penetration and formation of symbiotic root nodules. These or other secreted proteins might logically be involved in the as yet unknown molecular interactions between Frankia and their host plants. We compared the genome-based secretomes of three Frankia strains representing diverse host specificities. Signal peptide detection algorithms were used to predict the individual secretomes of each strain, and the set of secreted proteins shared among the strains, termed the core Frankia secretome. Proteins in the core secretome may be involved in the actinorhizal symbiosis. Results The Frankia genomes have conserved Sec (general secretory and Tat (twin arginine translocase secretion systems. The potential secretome of each Frankia strain comprised 4–5% of the total proteome, a lower percentage than that found in the genomes of other actinobacteria, legume endosymbionts, and plant pathogens. Hydrolytic enzymes made up only a small fraction of the total number of predicted secreted proteins in each strain. Surprisingly, polysaccharide-degrading enzymes were few in number, especially in strain CcI3, with more esterolytic, lipolytic and proteolytic enzymes having signal peptides. A total of 161 orthologous proteins belong to the core Frankia secretome. Of these, 52 also lack homologs in closely related actinobacteria, and are termed "Frankia-specific." The genes encoding these conserved secreted proteins are often clustered near secretion machinery genes. Conclusion The predicted secretomes of Frankia sp. are relatively small and include few hydrolases, which could reflect adaptation to a symbiotic lifestyle. There are no well-conserved secreted polysaccharide-degrading enzymes present in all three Frankia

  18. Comparative Analysis of Secretome Profiles of Manganese(II-Oxidizing Ascomycete Fungi.

    Directory of Open Access Journals (Sweden)

    Carolyn A Zeiner

    Full Text Available Fungal secretomes contain a wide range of hydrolytic and oxidative enzymes, including cellulases, hemicellulases, pectinases, and lignin-degrading accessory enzymes, that synergistically drive litter decomposition in the environment. While secretome studies of model organisms such as Phanerochaete chrysosporium and Aspergillus species have greatly expanded our knowledge of these enzymes, few have extended secretome characterization to environmental isolates or conducted side-by-side comparisons of diverse species. Thus, the mechanisms of carbon degradation by many ubiquitous soil fungi remain poorly understood. Here we use a combination of LC-MS/MS, genomic, and bioinformatic analyses to characterize and compare the protein composition of the secretomes of four recently isolated, cosmopolitan, Mn(II-oxidizing Ascomycetes (Alternaria alternata SRC1lrK2f, Stagonospora sp. SRC1lsM3a, Pyrenochaeta sp. DS3sAY3a, and Paraconiothyrium sporulosum AP3s5-JAC2a. We demonstrate that the organisms produce a rich yet functionally similar suite of extracellular enzymes, with species-specific differences in secretome composition arising from unique amino acid sequences rather than overall protein function. Furthermore, we identify not only a wide range of carbohydrate-active enzymes that can directly oxidize recalcitrant carbon, but also an impressive suite of redox-active accessory enzymes that suggests a role for Fenton-based hydroxyl radical formation in indirect, non-specific lignocellulose attack. Our findings highlight the diverse oxidative capacity of these environmental isolates and enhance our understanding of the role of filamentous Ascomycetes in carbon turnover in the environment.

  19. Comparative Analysis of Secretome Profiles of Manganese(II)-Oxidizing Ascomycete Fungi

    Energy Technology Data Exchange (ETDEWEB)

    Zeiner, Carolyn A.; Purvine, Samuel O.; Zink, Erika M.; Paša-Tolić, Ljiljana; Chaput, Dominique L.; Haridas, Sajeet; Wu, Si; LaButti, Kurt; Grigoriev, Igor V.; Henrissat, Bernard; Santelli, Cara M.; Hansel, Colleen M.; Pöggeler, Stefanie

    2016-07-19

    Fungal secretomes contain a wide range of hydrolytic and oxidative enzymes, including cellulases, hemicellulases, pectinases, and lignin-degrading accessory enzymes, that synergistically drive litter decomposition in the environment. While secretome studies of model organisms such as Phanerochaete chrysosporium and Aspergillus species have greatly expanded our knowledge of these enzymes, few have extended secretome characterization to environmental isolates or conducted side-by-side comparisons of diverse species. Thus, the mechanisms of carbon degradation by many ubiquitous soil fungi remain poorly understood. Here we use a combination of LC-MS/MS, genomic, and bioinformatic analyses to characterize and compare the protein composition of the secretomes of four recently isolated, cosmopolitan, Mn(II)-oxidizing Ascomycetes (Alternaria alternata SRC1lrK2f, Stagonospora sp. SRC1lsM3a, Pyrenochaeta sp. DS3sAY3a, and Paraconiothyrium sporulosum AP3s5-JAC2a). We demonstrate that the organisms produce a rich yet functionally similar suite of extracellular enzymes, with species-specific differences in secretome composition arising from unique amino acid sequences rather than overall protein function. Furthermore, we identify not only a wide range of carbohydrate-active enzymes that can directly oxidize recalcitrant carbon, but also an impressive suite of redox-active accessory enzymes that suggests a role for Fenton-based hydroxyl radical formation in indirect, non-specific lignocellulose attack. Our findings highlight the diverse oxidative capacity of these environmental isolates and enhance our understanding of the role of filamentous Ascomycetes in carbon turnover in the environment.

  20. Lytic Polysaccharide Monooxygenases - Studies of Fungal Secretomes and Enzyme Properties

    DEFF Research Database (Denmark)

    Nekiunaite, Laura

    degradation, were also identified upstream the LPMO genes, providing evidence for a co-regulatory mechanism of LPMOs and amylolytic hydrolases. The second part of the PhD thesis is focused on understanding the binding properties of LPMOs to starch and starch mimic substrate. It was shown that LPMOs possessing...... to different substrates at the protein level. It could help to design better enzyme cocktails that increase efficiency of biomass degradation. The secretomes of A. nidulans revealed differences in growth and secretion of enzymes, depending on the type and properties of starches. A common characteristic...... conversion as they produce a wide diversity of degrading enzymes. In the first part of this PhD thesis, the secretomes of the well-known fungus Aspergillus nidulans grown on cereal and legume starches were analyzed. Secretomics is a powerful tool to unravel secretion patterns of fungi and their response...

  1. Secretome analysis of the fungus Trichoderma harzianum grown on cellulose.

    Science.gov (United States)

    Do Vale, Luis H F; Gómez-Mendoza, Diana P; Kim, Min-Sik; Pandey, Akhilesh; Ricart, Carlos A O; Ximenes F Filho, Edivaldo; Sousa, Marcelo V

    2012-08-01

    Trichoderma harzianum is a mycoparasitic filamentous fungus that produces and secretes a wide range of extracellular hydrolytic enzymes used in cell wall degradation. Due to its potential in biomass conversion, T. harzianum draws great attention from biofuel and biocontrol industries and research. Here, we report an extensive secretome analysis of T. harzianum. The fungus was grown on cellulose medium, and its secretome was analyzed by a combination of enzymology, 2DE, MALDI-MS and -MS/MS (Autoflex II), and LC-MS/MS (LTQ-Orbitrap XL). A total of 56 proteins were identified using high-resolution MS. Interestingly, although cellulases were found, the major hydrolytic enzymes secreted in the cellulose medium were chitinases and endochitinases, which may reflect the biocontrol feature of T. harzianum. The glycoside hydrolase family, including chitinases (EC 3.2.1.14), endo-N-acetylglucosaminidases (EC 3.2.1.96), hexosaminidases (EC 3.2.1.52), galactosidases (EC 3.2.1.23), xylanases (EC 3.2.1.8), exo-1,3-glucanases (EC 3.2.1.58), endoglucanases (EC 3.2.1.4), xylosidases (EC 3.2.1.37), α-L-arabinofuranosidase (EC 3.2.1.55), N-acetylhexosaminidases (EC 3.2.1.52), and other enzymes represented 51.36% of the total secretome. Few representatives were classified in the protease family (8.90%). Others (17.60%) are mostly intracellular proteins. A considerable part of the secretome was composed of hypothetical proteins (22.14%), probably because of the absence of an annotated T. harzianum genome. The T. harzianum secretome composition highlights the importance of this fungus as a rich source of hydrolytic enzymes for bioconversion and biocontrol applications. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Therapeutic Potential of Dental Pulp Stem Cell Secretome for Alzheimer’s Disease Treatment: An In Vitro Study

    Directory of Open Access Journals (Sweden)

    Nermeen El-Moataz Bellah Ahmed

    2016-01-01

    Full Text Available The secretome obtained from stem cell cultures contains an array of neurotrophic factors and cytokines that might have the potential to treat neurodegenerative conditions. Alzheimer’s disease (AD is one of the most common human late onset and sporadic neurodegenerative disorders. Here, we investigated the therapeutic potential of secretome derived from dental pulp stem cells (DPSCs to reduce cytotoxicity and apoptosis caused by amyloid beta (Aβ peptide. We determined whether DPSCs can secrete the Aβ-degrading enzyme, neprilysin (NEP, and evaluated the effects of NEP expression in vitro by quantitating Aβ-degrading activity. The results showed that DPSC secretome contains higher concentrations of VEGF, Fractalkine, RANTES, MCP-1, and GM-CSF compared to those of bone marrow and adipose stem cells. Moreover, treatment with DPSC secretome significantly decreased the cytotoxicity of Aβ peptide by increasing cell viability compared to nontreated cells. In addition, DPSC secretome stimulated the endogenous survival factor Bcl-2 and decreased the apoptotic regulator Bax. Furthermore, neprilysin enzyme was detected in DPSC secretome and succeeded in degrading Aβ1–42 in vitro in 12 hours. In conclusion, our study demonstrates that DPSCs may serve as a promising source for secretome-based treatment of Alzheimer’s disease.

  3. Human Cytomegalovirus Secretome Contains Factors That Induce Angiogenesis and Wound Healing

    Energy Technology Data Exchange (ETDEWEB)

    Dumortier, Jerome; Streblow, Daniel N.; Moses, Ashlee V.; Jacobs, Jon M.; Kreklywich, Craig N.; Camp, David G.; Smith, Richard D.; Orloff, Susan L.; Nelson, Jay

    2008-07-01

    Human cytomegalovirus (HCMV) is implicated in the acceleration of a number of vascular diseases including transplant vascular sclerosis (TVS), the lesion associated with chronic rejection (CR) of solid organ transplants. Although the virus persists in the allograft throughout the course of disease, few cells are directly infected by CMV. This observation is in contrast to the global effects that CMV has on the acceleration of TVS/CR, suggesting that CMV infection indirectly promotes the vascular disease process. Recent transcriptome analysis of CMV-infected heart allografts indicates that the virus induces cytokines and growth factors associated with angiogenesis (AG) and wound healing (WH), suggesting that CMV may accelerate TVS/CR through the induction and secretion of AG/WH factors from infected cells. We analyzed virus-free supernatants from HCMV-infected cells (HCMV secretomes) for growth factors, by mass spectrometry and immunoassays, and found that the HCMV secretome contains over 1,000 cellular proteins, many of which are involved in AG/WH. Importantly, functional assays demonstrated that CMV but not herpes simplex virus secretomes not only induce AG/WH but also promote neovessel stabilization and endothelial cell survival for 2 weeks. These findings suggest that CMV acceleration of TVS occurs through virus-induced growth factors and cytokines in the CMV secretome.

  4. Comparative and bioinformatics analyses of pathogenic bacterial secretomes identified by mass spectrometry in Burkholderia species.

    Science.gov (United States)

    Nguyen, Thao Thi; Chon, Tae-Soo; Kim, Jaehan; Seo, Young-Su; Heo, Muyoung

    2017-07-01

    Secreted proteins (secretomes) play crucial roles during bacterial pathogenesis in both plant and human hosts. The identification and characterization of secretomes in the two plant pathogens Burkholderia glumae BGR1 and B. gladioli BSR3, which cause diseases in rice such as seedling blight, panicle blight, and grain rot, are important steps to not only understand the disease-causing mechanisms but also find remedies for the diseases. Here, we identified two datasets of secretomes in B. glumae BGR1 and B. gladioli BSR3, which consist of 118 and 111 proteins, respectively, using mass spectrometry approach and literature curation. Next, we characterized the functional properties, potential secretion pathways and sequence information properties of secretomes of two plant pathogens in a comparative analysis by various computational approaches. The ratio of potential non-classically secreted proteins (NCSPs) to classically secreted proteins (CSPs) in B. glumae BGR1 was greater than that in B. gladioli BSR3. For CSPs, the putative hydrophobic regions (PHRs) which are essential for secretion process of CSPs were screened in detail at their N-terminal sequences using hidden Markov model (HMM)-based method. Total 31 pairs of homologous proteins in two bacterial secretomes were indicated based on the global alignment (identity ≥ 70%). Our results may facilitate the understanding of the species-specific features of secretomes in two plant pathogenic Burkholderia species.

  5. Construction and Screening of a Lentiviral Secretome Library.

    Science.gov (United States)

    Liu, Tao; Jia, Panpan; Ma, Huailei; Reed, Sean A; Luo, Xiaozhou; Larman, H Benjamin; Schultz, Peter G

    2017-06-22

    Over 2,000 human proteins are predicted to be secreted, but the biological function of the many of these proteins is still unknown. Moreover, a number of these proteins may act as new therapeutic agents or be targets for the development of therapeutic antibodies. To further explore the extracellular proteome, we have developed a secretome-enriched open reading frame (ORF) library that can be readily screened for autocrine activity in cell-based phenotypic or reporter assays. Next-generation sequencing (NGS) and database analysis predict that the library contains approximately 900 ORFs encoding known secreted proteins (accounting for 77.8% of the library), as well as genes encoding potentially unknown secreted proteins. In a proof-of-principle study, human TF-1 cells were screened for proliferative factors, and the known cytokine GMCSF was identified as a dominant hit. This library offers a relatively low-cost and straightforward approach for functional autocrine screens of secreted proteins. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. Direct identification of the Meloidogyne incognita secretome reveals proteins with host cell reprogramming potential.

    Directory of Open Access Journals (Sweden)

    Stéphane Bellafiore

    2008-10-01

    Full Text Available The root knot nematode, Meloidogyne incognita, is an obligate parasite that causes significant damage to a broad range of host plants. Infection is associated with secretion of proteins surrounded by proliferating cells. Many parasites are known to secrete effectors that interfere with plant innate immunity, enabling infection to occur; they can also release pathogen-associated molecular patterns (PAMPs, e.g., flagellin that trigger basal immunity through the nematode stylet into the plant cell. This leads to suppression of innate immunity and reprogramming of plant cells to form a feeding structure containing multinucleate giant cells. Effectors have generally been discovered using genetics or bioinformatics, but M. incognita is non-sexual and its genome sequence has not yet been reported. To partially overcome these limitations, we have used mass spectrometry to directly identify 486 proteins secreted by M. incognita. These proteins contain at least segmental sequence identity to those found in our 3 reference databases (published nematode proteins; unpublished M. incognita ESTs; published plant proteins. Several secreted proteins are homologous to plant proteins, which they may mimic, and they contain domains that suggest known effector functions (e.g., regulating the plant cell cycle or growth. Others have regulatory domains that could reprogram cells. Using in situ hybridization we observed that most secreted proteins were produced by the subventral glands, but we found that phasmids also secreted proteins. We annotated the functions of the secreted proteins and classified them according to roles they may play in the development of root knot disease. Our results show that parasite secretomes can be partially characterized without cognate genomic DNA sequence. We observed that the M. incognita secretome overlaps the reported secretome of mammalian parasitic nematodes (e.g., Brugia malayi, suggesting a common parasitic behavior and a possible

  7. Dataset for the proteomic inventory and quantitative analysis of the breast cancer hypoxic secretome associated with osteotropism

    Directory of Open Access Journals (Sweden)

    Thomas R. Cox

    2015-12-01

    Full Text Available The cancer secretome includes all of the macromolecules secreted by cells into their microenvironment. Cancer cell secretomes are significantly different to that of normal cells reflecting the changes that normal cells have undergone during their transition to malignancy. More importantly, cancer secretomes are known to be active mediators of both local and distant host cells and play an important role in the progression and dissemination of cancer. Here we have quantitatively profiled both the composition of breast cancer secretomes associated with osteotropism, and their modulation under normoxic and hypoxic conditions. We detect and quantify 162 secretome proteins across all conditions which show differential hypoxic induction and association with osteotropism. Mass Spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the dataset identifier PXD000397 and the complete proteomic, bioinformatic and biological analyses are reported in Cox et al. (2015 [1].

  8. Harnessing the Helminth Secretome for Therapeutic Immunomodulators

    Directory of Open Access Journals (Sweden)

    Dana Ditgen

    2014-01-01

    Full Text Available Helminths are the largest and most complex pathogens to invade and live within the human body. Since they are not able to outpace the immune system by rapid antigen variation or faster cell division or retreat into protective niches not accessible to immune effector mechanisms, their long-term survival depends on influencing and regulating the immune responses away from the mode of action most damaging to them. Immunologists have focused on the excretory and secretory products that are released by the helminths, since they can change the host environment by modulating the immune system. Here we give a brief overview of the helminth-associated immune response and the currently available helminth secretome data. We introduce some major secretome-derived immunomodulatory molecules and describe their potential mode of action. Finally, the applicability of helminth-derived therapeutic proteins in the treatment of allergic and autoimmune inflammatory disease is discussed.

  9. Characterization of the secretomes of two vibrios pathogenic to mollusks.

    Directory of Open Access Journals (Sweden)

    Stéphanie Madec

    Full Text Available Vibrio tapetis causes the brown ring disease in the Japanese clam Ruditapes philippinarum while Vibrio aestuarianus is associated with massive oyster mortalities. As extracellular proteins are often associated with the virulence of pathogenic bacteria, we undertook a proteomic approach to characterize the secretomes of both vibrios. The extracellular proteins (ECPs of both species were fractionated by SEC-FPLC and in vitro assays were performed to measure the effects of each fraction on hemocyte cellular parameters (phagocytosis and adhesion. Fractions showing a significant effect were subjected to SDS-PAGE, and proteins were identified by nano LC-MS/MS. 45 proteins were identified for V. aestuarianus and 87 for V. tapetis. Most of them belonged to outer membrane or were periplasmic, including porins or adhesins that were already described as virulence factors in other bacterial species. Others were transporter components, flagella proteins, or proteins of unknown function (14 and 15 respectively. Interestingly, for V. aestuarianus, we noted the secretion of 3 extracellular enzymes including the Vam metalloprotease and two other enzymes (one putative lipase and one protease. For V. tapetis, we identified five extracellular enymes, i.e. two different endochitinases, one protease, one lipase and an adhesin. A comparison of both secretomes also showed that only the putative extracellular lipase was common to both secretomes, underscoring the difference in pathogenicity mechanisms between these two species. Overall, these results characterize for the first time the secretomes of these two marine pathogenic vibrios and constitute a useful working basis to further analyze the contribution of specific proteins in the virulence mechanisms of these species.

  10. Quantitative Secretome Analysis of Activated Jurkat Cells Using Click Chemistry-Based Enrichment of Secreted Glycoproteins.

    Science.gov (United States)

    Witzke, Kathrin E; Rosowski, Kristin; Müller, Christian; Ahrens, Maike; Eisenacher, Martin; Megger, Dominik A; Knobloch, Jürgen; Koch, Andrea; Bracht, Thilo; Sitek, Barbara

    2017-01-06

    Quantitative secretome analyses are a high-performance tool for the discovery of physiological and pathophysiological changes in cellular processes. However, serum supplements in cell culture media limit secretome analyses, but serum depletion often leads to cell starvation and consequently biased results. To overcome these limiting factors, we investigated a model of T cell activation (Jurkat cells) and performed an approach for the selective enrichment of secreted proteins from conditioned medium utilizing metabolic marking of newly synthesized glycoproteins. Marked glycoproteins were labeled via bioorthogonal click chemistry and isolated by affinity purification. We assessed two labeling compounds conjugated with either biotin or desthiobiotin and the respective secretome fractions. 356 proteins were quantified using the biotin probe and 463 using desthiobiotin. 59 proteins were found differentially abundant (adjusted p-value ≤0.05, absolute fold change ≥1.5) between inactive and activated T cells using the biotin method and 86 using the desthiobiotin approach, with 31 mutual proteins cross-verified by independent experiments. Moreover, we analyzed the cellular proteome of the same model to demonstrate the benefit of secretome analyses and provide comprehensive data sets of both. 336 proteins (61.3%) were quantified exclusively in the secretome. Data are available via ProteomeXchange with identifier PXD004280.

  11. Plasmodium falciparum secretome in erythrocyte and beyond

    Directory of Open Access Journals (Sweden)

    Rani eSoni

    2016-02-01

    Full Text Available Plasmodium falciparum is the causative agent of deadly malaria disease. It is an intracellular eukaryote and completes its multi-stage life cycle spanning the two hosts viz, mosquito and human. In order to habituate within host environment, parasite conform several strategies to evade host immune responses such as surface antigen polymorphism or modulation of host immune system and it is mediated by secretion of proteins from parasite to the host erythrocyte and beyond, collectively known as, malaria secretome. In this review, we will discuss about the deployment of parasitic secretory protein in mechanism implicated for immune evasion, protein trafficking, providing virulence, changing permeability and cyto-adherence of infected erythrocyte. We will be covering the possibilities of developing malaria secretome as a drug/vaccine target. This gathered information will be worthwhile in depicting a well-organized picture for host-pathogen interplay during the malaria infection and may also provide some clues for development of novel anti-malarial therapies.

  12. Integration of Biodiversity Databases in Taiwan and Linkage to Global Databases

    Directory of Open Access Journals (Sweden)

    Kwang-Tsao Shao

    2007-03-01

    Full Text Available The biodiversity databases in Taiwan were dispersed to various institutions and colleges with limited amount of data by 2001. The Natural Resources and Ecology GIS Database sponsored by the Council of Agriculture, which is part of the National Geographic Information System planned by the Ministry of Interior, was the most well established biodiversity database in Taiwan. But thisThis database was, however, mainly collectingcollected the distribution data of terrestrial animals and plants within the Taiwan area. In 2001, GBIF was formed, and Taiwan joined as one of the an Associate Participant and started, starting the establishment and integration of animal and plant species databases; therefore, TaiBIF was able to co-operate with GBIF. The information of Catalog of Life, specimens, and alien species were integrated by the Darwin core. The standard. These metadata standards allowed the biodiversity information of Taiwan to connect with global databases.

  13. Investigation of the indigenous fungal community populating barley grains: Secretomes and xylanolytic potential

    DEFF Research Database (Denmark)

    Sultan, Abida; Frisvad, Jens Christian; Andersen, Birgit

    2017-01-01

    The indigenous fungal species populating cereal grains produce numerous plant cell wall-degrading enzymes including xylanases, which could play important role in plant-pathogen interactions and in adaptation of the fungi to varying carbon sources. To gain more insight into the grain surface......-associated enzyme activity, members of the populating fungal community were isolated, and their secretomes and xylanolytic activities assessed. Twenty-seven different fungal species were isolated from grains of six barley cultivars over different harvest years and growing sites. The isolated fungi were grown...... on medium containing barley flour or wheat arabinoxylan as sole carbon source. Their secretomes and xylanase activities were analyzed using SDS-PAGE and enzyme assays and were found to vary according to species and carbon source. Secretomes were dominated by cell wall degrading enzymes with xylanases...

  14. Determination of optimized oxygen partial pressure to maximize the liver regenerative potential of the secretome obtained from adipose-derived stem cells.

    Science.gov (United States)

    Lee, Sang Chul; Kim, Kee-Hwan; Kim, Ok-Hee; Lee, Sang Kuon; Hong, Ha-Eun; Won, Seong Su; Jeon, Sang-Jin; Choi, Byung Jo; Jeong, Wonjun; Kim, Say-June

    2017-08-03

    A hypoxic-preconditioned secretome from stem cells reportedly promotes the functional and regenerative capacity of the liver more effectively than a control secretome. However, the optimum oxygen partial pressure (pO 2 ) in the cell culture system that maximizes the therapeutic potential of the secretome has not yet been determined. We first determined the cellular alterations in adipose tissue-derived stem cells (ASCs) cultured under different pO 2 (21%, 10%, 5%, and 1%). Subsequently, partially hepatectomized mice were injected with the secretome of ASCs cultured under different pO 2 , and then sera and liver specimens were obtained for analyses. Of all AML12 cells cultured under different pO 2 , the AML12 cells cultured under 1% pO 2 showed the highest mRNA expression of proliferation-associated markers (IL-6, HGF, and VEGF). In the cell proliferation assay, the AML12 cells cultured with the secretome of 1% pO 2 showed the highest cell proliferation, followed by the cells cultured with the secretome of 21%, 10%, and 5% pO 2 , in that order. When injected into the partially hepatectomized mice, the 1% pO 2 secretome most significantly increased the number of Ki67-positive cells, reduced serum levels of proinflammatory mediators (IL-6 and TNF-α), and reduced serum levels of liver transaminases. In addition, analysis of the liver specimens indicated that injection with the 1% pO 2 secretome maximized the expression of the intermediate molecules of the PIP3/Akt and IL-6/STAT3 signaling pathways, all of which are known to promote liver regeneration. The data of this study suggest that the secretome of ASCs cultured under 1% pO 2 has the highest liver reparative and regenerative potential of all the secretomes tested here.

  15. Analysis of secretome of breast cancer cell line with an optimized semi-shotgun method

    International Nuclear Information System (INIS)

    Tang Xiaorong; Yao Ling; Chen Keying; Hu Xiaofang; Xu Lisa; Fan Chunhai

    2009-01-01

    Secretome, the totality of secreted proteins, is viewed as a promising pool of candidate cancer biomarkers. Simple and reliable methods for identifying secreted proteins are highly desired. We used an optimized semi-shotgun liquid chromatography followed by tandem mass spectrometry (LC-MS/MS) method to analyze the secretome of breast cancer cell line MDA-MB-231. A total of 464 proteins were identified. About 63% of the proteins were classified as secreted proteins, including many promising breast cancer biomarkers, which were thought to be correlated with tumorigenesis, tumor development and metastasis. These results suggest that the optimized method may be a powerful strategy for cell line secretome profiling, and can be used to find potential cancer biomarkers with great clinical significance. (authors)

  16. Dataset for the proteomic inventory and quantitative analysis of the breast cancer hypoxic secretome associated with osteotropism

    DEFF Research Database (Denmark)

    Cox, T.R.; Schoof, Erwin; Gartland, A.

    2015-01-01

    secretomes are known to be active mediators of both local and distant host cells and play an important role in the progression and dissemination of cancer. Here we have quantitatively profiled both the composition of breast cancer secretomes associated with osteotropism, and their modulation under normoxic...... and hypoxic conditions. We detect and quantify 162 secretome proteins across all conditions which show differential hypoxic induction and association with osteotropism. Mass Spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the dataset identifier PXD000397...

  17. Secretome-based Manganese(II) Oxidation by Filamentous Ascomycete Fungi

    Science.gov (United States)

    Zeiner, C. A.; Purvine, S.; Zink, E.; Paša-Tolić, L.; Chaput, D.; Wu, S.; Santelli, C. M.; Hansel, C. M.

    2017-12-01

    Manganese (Mn) oxides are among the strongest oxidants in the environment, and Mn(II) oxidation to Mn(III/IV) (hydr)oxides includes both abiotic and microbially-mediated processes. While white-rot Basidiomycete fungi oxidize Mn(II) using laccases and Mn peroxidases in association with lignocellulose degradation, the mechanisms by which filamentous Ascomycete fungi oxidize Mn(II) and a physiological role for Mn(II) oxidation in these organisms remain poorly understood. Through a combination of chemical and in-gel assays, bulk mass spectrometry, and iTRAQ proteomics, we demonstrate enzymatic Mn(II) oxidation in the secretomes of three phylogenetically diverse Ascomycetes that were isolated from Mn-laden sediments. Candidate Mn(II)-oxidizing enzymes were species-specific and included bilirubin oxidase and tyrosinase in Stagonospora sp. SRC1lsM3a, GMC oxidoreductase in Paraconiothyrium sporulosum AP3s5-JAC2a, and FAD-binding oxidoreductases in Pyrenochaeta sp. DS3sAY3a. These findings were supported by full proteomic characterization of the secretomes, which revealed a lack of Mn, lignin, and versatile peroxidases in these Ascomycetes but a substantially higher proportion of LMCOs and GMC oxidoreductases compared to wood-rot Basidiomycetes. We also identified the potential for indirect enzymatic Mn(II) oxidation by hydroxyl radical, as the secretomes were rich in diverse lignocellulose-degrading enzymes that could participate in Fenton chemistry. A link between Mn(II) oxidation and carbon oxidation analogous to white-rot Basidiomycetes remains unknown in these Ascomycetes. Interestingly, growth rates on rich medium were unaffected by the presence of Mn(II), and the production of Mn(II)-oxidizing proteins in the secretome was constitutive and not inducible by Mn(II). Thus, no physiological benefit of Mn(II) oxidation in these Ascomycetes has yet been identified, and Mn(II) oxidation appears to be a side reaction. Future work will explore the lignin-degrading capacity of

  18. Exploring Trichoderma and Aspergillus secretomes: Proteomics approaches for the identification of enzymes of biotechnological interest.

    Science.gov (United States)

    Cologna, Nicholas de Mojana di; Gómez-Mendoza, Diana Paola; Zanoelo, Fabiana Fonseca; Giannesi, Giovana Cristina; Guimarães, Nelciele Cavalieri de Alencar; Moreira, Leonora Rios de Souza; Filho, Edivaldo Ximenes Ferreira; Ricart, Carlos André Ornelas

    2018-02-01

    Filamentous fungal secretomes comprise highly dynamic sets of proteins, including multiple carbohydrate active enzymes (CAZymes) which are able to hydrolyze plant biomass polysaccharides into products of biotechnological interest such as fermentable sugars. In recent years, proteomics has been used to identify and quantify enzymatic and non-enzymatic polypeptides present in secretomes of several fungi species. The resulting data have widened the scientific understanding of the way filamentous fungi perform biomass degradation and offered novel perspectives for biotechnological applications. The present review discusses proteomics approaches that have been applied to the study of fungal secretomes, focusing on two of the most studied filamentous fungi genera: Trichoderma and Aspergillus. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. Secretomes of Mycoplasma hyopneumoniae and Mycoplasma flocculare reveal differences associated to pathogenesis.

    Science.gov (United States)

    Paes, Jéssica A; Lorenzatto, Karina R; de Moraes, Sofia N; Moura, Hercules; Barr, John R; Ferreira, Henrique B

    2017-02-10

    Mycoplasma hyopneumoniae and Mycoplasma flocculare cohabit the porcine respiratory tract. However, M. hyopneumoniae causes the porcine enzootic pneumonia, while M. flocculare is a commensal bacterium. Comparative analyses demonstrated high similarity between these species, which includes the sharing of all predicted virulence factors. Nevertheless, studies related to soluble secretomes of mycoplasmas were little known, although they are important for bacterial-host interactions. The aim of this study was to perform a comparative analysis between the soluble secreted proteins repertoires of the pathogenic Mycoplasma hyopneumoniae and its closely related commensal Mycoplasma flocculare. For that, bacteria were cultured in medium with reduced serum concentration and secreted proteins were identified by a LC-MS/MS proteomics approach. Altogether, 62 and 26 proteins were identified as secreted by M. hyopneumoniae and M. flocculare, respectively, being just seven proteins shared between these bacteria. In M. hyopneumoniae secretome, 15 proteins described as virulence factors were found; while four putative virulence factors were identified in M. flocculare secretome. For the first time, clear differences related to virulence were found between these species, helping to elucidate the pathogenic nature of M. hyopneumoniae to swine hosts. For the first time, the secretomes of two porcine respiratory mycoplasmas, namely the pathogenic M. hyopneumoniae and the commensal M. flocculare were compared. The presented results revealed previously unknown differences between these two genetically related species, some of which are associated to the M. hyopneumoniae ability to cause porcine enzootic pneumonia. Copyright © 2016 Elsevier B.V. All rights reserved.

  20. Ontology based heterogeneous materials database integration and semantic query

    Science.gov (United States)

    Zhao, Shuai; Qian, Quan

    2017-10-01

    Materials digital data, high throughput experiments and high throughput computations are regarded as three key pillars of materials genome initiatives. With the fast growth of materials data, the integration and sharing of data is very urgent, that has gradually become a hot topic of materials informatics. Due to the lack of semantic description, it is difficult to integrate data deeply in semantic level when adopting the conventional heterogeneous database integration approaches such as federal database or data warehouse. In this paper, a semantic integration method is proposed to create the semantic ontology by extracting the database schema semi-automatically. Other heterogeneous databases are integrated to the ontology by means of relational algebra and the rooted graph. Based on integrated ontology, semantic query can be done using SPARQL. During the experiments, two world famous First Principle Computational databases, OQMD and Materials Project are used as the integration targets, which show the availability and effectiveness of our method.

  1. Three-Dimensional Bioprinting Nanotechnologies towards Clinical Application of Stem Cells and Their Secretome in Salivary Gland Regeneration

    Directory of Open Access Journals (Sweden)

    Joao N. Ferreira

    2016-01-01

    Full Text Available Salivary gland (SG functional damage and severe dry mouth (or xerostomia are commonly observed in a wide range of medical conditions from autoimmune to metabolic disorders as well as after radiotherapy to treat specific head and neck cancers. No effective therapy has been developed to completely restore the SG functional damage on the long-term and reverse the poor quality of life of xerostomia patients. Cell- and secretome-based strategies are currently being tested in vitro and in vivo for the repair and/or regeneration of the damaged SG using (1 epithelial SG stem/progenitor cells from salispheres or explant cultures as well as (2 nonepithelial stem cell types and/or their bioactive secretome. These strategies will be the focus of our review. Herein, innovative 3D bioprinting nanotechnologies for the generation of organotypic cultures and SG organoids/mini-glands will also be discussed. These bioprinting technologies will allow researchers to analyze the secretome components and extracellular matrix production, as well as their biofunctional effects in 3D mini-glands ex vivo. Improving our understanding of the SG secretome is critical to develop effective secretome-based therapies towards the regeneration and/or repair of all SG compartments for proper restoration of saliva secretion and flow into the oral cavity.

  2. [A web-based integrated clinical database for laryngeal cancer].

    Science.gov (United States)

    E, Qimin; Liu, Jialin; Li, Yong; Liang, Chuanyu

    2014-08-01

    To establish an integrated database for laryngeal cancer, and to provide an information platform for laryngeal cancer in clinical and fundamental researches. This database also meet the needs of clinical and scientific use. Under the guidance of clinical expert, we have constructed a web-based integrated clinical database for laryngeal carcinoma on the basis of clinical data standards, Apache+PHP+MySQL technology, laryngeal cancer specialist characteristics and tumor genetic information. A Web-based integrated clinical database for laryngeal carcinoma had been developed. This database had a user-friendly interface and the data could be entered and queried conveniently. In addition, this system utilized the clinical data standards and exchanged information with existing electronic medical records system to avoid the Information Silo. Furthermore, the forms of database was integrated with laryngeal cancer specialist characteristics and tumor genetic information. The Web-based integrated clinical database for laryngeal carcinoma has comprehensive specialist information, strong expandability, high feasibility of technique and conforms to the clinical characteristics of laryngeal cancer specialties. Using the clinical data standards and structured handling clinical data, the database can be able to meet the needs of scientific research better and facilitate information exchange, and the information collected and input about the tumor sufferers are very informative. In addition, the user can utilize the Internet to realize the convenient, swift visit and manipulation on the database.

  3. Comparative secretome analysis of rat stomach under different nutritional status.

    Science.gov (United States)

    Senin, Lucia L; Roca-Rivada, Arturo; Castelao, Cecilia; Alonso, Jana; Folgueira, Cintia; Casanueva, Felipe F; Pardo, Maria; Seoane, Luisa M

    2015-02-26

    Obesity is a major public health threat for many industrialised countries. Bariatric surgery is the most effective treatment against obesity, suggesting that gut derived signals are crucial for energy balance regulation. Several descriptive studies have proven the presence of gastric endogenous systems that modulate energy homeostasis; however, these systems and the interactions between them are still not well known. In the present study, we show for the first time the comparative 2-DE gastric secretome analysis under different nutritional status. We have identified 38 differently secreted proteins by comparing stomach secretomes from tissue explant cultures of rats under feeding, fasting and re-feeding conditions. Among the proteins identified, glyceraldehyde-3-phosphate dehydrogenase was found to be more abundant in gastric secretome and plasma after re-feeding, and downregulated in obesity. Additionally, two calponin-1 species were decreased in feeding state, and other were modulated by nutritional and metabolic conditions. These and other secreted proteins identified in this work may be considered as potential gastrokines implicated in food intake regulation. The present work has an important impact in the field of obesity, especially in the regulation of body weight maintenance by the stomach. Nowadays, the most effective treatment in the fight against obesity is bariatric surgery, which suggests that stomach derived signals might be crucial for the regulation of the energy homeostasis. However, until now, the knowledge about the gastrokines and its mechanism of action has been poorly elucidated. In the present work, we had updated a previously validated explant secretion model for proteomic studies; this analysis allowed us, for the first time, to study the gastric secretome without interferences from other organs. We had identified 38 differently secreted proteins comparing ex vivo cultured stomachs from rats under feeding, fasting and re-feeding regimes

  4. Database specification for the Worldwide Port System (WPS) Regional Integrated Cargo Database (ICDB)

    Energy Technology Data Exchange (ETDEWEB)

    Faby, E.Z.; Fluker, J.; Hancock, B.R.; Grubb, J.W.; Russell, D.L. [Univ. of Tennessee, Knoxville, TN (United States); Loftis, J.P.; Shipe, P.C.; Truett, L.F. [Oak Ridge National Lab., TN (United States)

    1994-03-01

    This Database Specification for the Worldwide Port System (WPS) Regional Integrated Cargo Database (ICDB) describes the database organization and storage allocation, provides the detailed data model of the logical and physical designs, and provides information for the construction of parts of the database such as tables, data elements, and associated dictionaries and diagrams.

  5. Data for iTRAQ secretomic analysis of Aspergillus fumigatus in response to different carbon sources

    OpenAIRE

    Sunil S. Adav; Anita Ravindran; Siu Kwan Sze

    2015-01-01

    Here, we provide data related to the research article entitled ?Quantitative proteomics study of Aspergillus fumigatus secretome revealed deamidation of secretory enzymes? by Adav et al. (J. Proteomics (2015) [1]). Aspergillus sp. plays an important role in lignocellulosic biomass recycling. To explore biomass hydrolyzing enzymes of A. fumigatus, we profiled secretome under different carbon sources such as glucose, cellulose, xylan and starch by high throughput quantitative proteomics using i...

  6. Proteomic Profile of Unstable Atheroma Plaque: Increased Neutrophil Defensin 1, Clusterin, and Apolipoprotein E Levels in Carotid Secretome.

    Science.gov (United States)

    Aragonès, Gemma; Auguet, Teresa; Guiu-Jurado, Esther; Berlanga, Alba; Curriu, Marta; Martinez, Salomé; Alibalic, Ajla; Aguilar, Carmen; Hernández, Esteban; Camara, María-Luisa; Canela, Núria; Herrero, Pol; Ruyra, Xavier; Martín-Paredero, Vicente; Richart, Cristóbal

    2016-03-04

    Because of the clinical significance of carotid atherosclerosis, the search for novel biomarkers has become a priority. The aim of the present study was to compare the protein secretion profile of the carotid atherosclerotic plaque (CAP, n = 12) and nonatherosclerotic mammary artery (MA, n = 10) secretomes. We used a nontargeted proteomic approach that incorporated tandem immunoaffinity depletion, iTRAQ labeling, and nanoflow liquid chromatography coupled to high-resolution mass spectrometry. In total, 162 proteins were quantified, of which 25 showed statistically significant differences in secretome levels between carotid atherosclerotic plaque and nondiseased mammary artery. We found increased levels of neutrophil defensin 1, apolipoprotein E, clusterin, and zinc-alpha-2-glycoprotein in CAP secretomes. Results were validated by ELISA assays. Also, differentially secreted proteins are involved in pathways such as focal adhesion and leukocyte transendothelial migration. In conclusion, this study provides a subset of identified proteins that are differently expressed in secretomes of clinical significance.

  7. Controlled Inhibition of the Mesenchymal Stromal Cell Pro-inflammatory Secretome via Microparticle Engineering

    Directory of Open Access Journals (Sweden)

    Sudhir H. Ranganath

    2016-06-01

    Full Text Available Mesenchymal stromal cells (MSCs are promising therapeutic candidates given their potent immunomodulatory and anti-inflammatory secretome. However, controlling the MSC secretome post-transplantation is considered a major challenge that hinders their clinical efficacy. To address this, we used a microparticle-based engineering approach to non-genetically modulate pro-inflammatory pathways in human MSCs (hMSCs under simulated inflammatory conditions. Here we show that microparticles loaded with TPCA-1, a small-molecule NF-κB inhibitor, when delivered to hMSCs can attenuate secretion of pro-inflammatory factors for at least 6 days in vitro. Conditioned medium (CM derived from TPCA-1-loaded hMSCs also showed reduced ability to attract human monocytes and prevented differentiation of human cardiac fibroblasts to myofibroblasts, compared with CM from untreated or TPCA-1-preconditioned hMSCs. Thus, we provide a broadly applicable bioengineering solution to facilitate intracellular sustained release of agents that modulate signaling. We propose that this approach could be harnessed to improve control over MSC secretome post-transplantation, especially to prevent adverse remodeling post-myocardial infarction.

  8. Emission & Generation Resource Integrated Database (eGRID)

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Emissions & Generation Resource Integrated Database (eGRID) is an integrated source of data on environmental characteristics of electric power generation....

  9. Integr8: enhanced inter-operability of European molecular biology databases.

    Science.gov (United States)

    Kersey, P J; Morris, L; Hermjakob, H; Apweiler, R

    2003-01-01

    The increasing production of molecular biology data in the post-genomic era, and the proliferation of databases that store it, require the development of an integrative layer in database services to facilitate the synthesis of related information. The solution of this problem is made more difficult by the absence of universal identifiers for biological entities, and the breadth and variety of available data. Integr8 was modelled using UML (Universal Modelling Language). Integr8 is being implemented as an n-tier system using a modern object-oriented programming language (Java). An object-relational mapping tool, OJB, is being used to specify the interface between the upper layers and an underlying relational database. The European Bioinformatics Institute is launching the Integr8 project. Integr8 will be an automatically populated database in which we will maintain stable identifiers for biological entities, describe their relationships with each other (in accordance with the central dogma of biology), and store equivalences between identified entities in the source databases. Only core data will be stored in Integr8, with web links to the source databases providing further information. Integr8 will provide the integrative layer of the next generation of bioinformatics services from the EBI. Web-based interfaces will be developed to offer gene-centric views of the integrated data, presenting (where known) the links between genome, proteome and phenotype.

  10. Investigating Aspergillus nidulans secretome during colonisation of cork cell walls.

    Science.gov (United States)

    Martins, Isabel; Garcia, Helga; Varela, Adélia; Núñez, Oscar; Planchon, Sébastien; Galceran, Maria Teresa; Renaut, Jenny; Rebelo, Luís P N; Silva Pereira, Cristina

    2014-02-26

    Cork, the outer bark of Quercus suber, shows a unique compositional structure, a set of remarkable properties, including high recalcitrance. Cork colonisation by Ascomycota remains largely overlooked. Herein, Aspergillus nidulans secretome on cork was analysed (2DE). Proteomic data were further complemented by microscopic (SEM) and spectroscopic (ATR-FTIR) evaluation of the colonised substrate and by targeted analysis of lignin degradation compounds (UPLC-HRMS). Data showed that the fungus formed an intricate network of hyphae around the cork cell walls, which enabled polysaccharides and lignin superficial degradation, but probably not of suberin. The degradation of polysaccharides was suggested by the identification of few polysaccharide degrading enzymes (β-glucosidases and endo-1,5-α-l-arabinosidase). Lignin degradation, which likely evolved throughout a Fenton-like mechanism relying on the activity of alcohol oxidases, was supported by the identification of small aromatic compounds (e.g. cinnamic acid and veratrylaldehyde) and of several putative high molecular weight lignin degradation products. In addition, cork recalcitrance was corroborated by the identification of several protein species which are associated with autolysis. Finally, stringent comparative proteomics revealed that A. nidulans colonisation of cork and wood share a common set of enzymatic mechanisms. However the higher polysaccharide accessibility in cork might explain the increase of β-glucosidase in cork secretome. Cork degradation by fungi remains largely overlook. Herein we aimed at understanding how A. nidulans colonise cork cell walls and how this relates to wood colonisation. To address this, the protein species consistently present in the secretome were analysed, as well as major alterations occurring in the substrate, including lignin degradation compounds being released. The obtained data demonstrate that this fungus has superficially attacked the cork cell walls apparently by

  11. A rapidly evolving secretome builds and patterns a sea shell

    Directory of Open Access Journals (Sweden)

    Green Kathryn

    2006-11-01

    Full Text Available Abstract Background Instructions to fabricate mineralized structures with distinct nanoscale architectures, such as seashells and coral and vertebrate skeletons, are encoded in the genomes of a wide variety of animals. In mollusks, the mantle is responsible for the extracellular production of the shell, directing the ordered biomineralization of CaCO3 and the deposition of architectural and color patterns. The evolutionary origins of the ability to synthesize calcified structures across various metazoan taxa remain obscure, with only a small number of protein families identified from molluskan shells. The recent sequencing of a wide range of metazoan genomes coupled with the analysis of gene expression in non-model animals has allowed us to investigate the evolution and process of biomineralization in gastropod mollusks. Results Here we show that over 25% of the genes expressed in the mantle of the vetigastropod Haliotis asinina encode secreted proteins, indicating that hundreds of proteins are likely to be contributing to shell fabrication and patterning. Almost 85% of the secretome encodes novel proteins; remarkably, only 19% of these have identifiable homologues in the full genome of the patellogastropod Lottia scutum. The spatial expression profiles of mantle genes that belong to the secretome is restricted to discrete mantle zones, with each zone responsible for the fabrication of one of the structural layers of the shell. Patterned expression of a subset of genes along the length of the mantle is indicative of roles in shell ornamentation. For example, Has-sometsuke maps precisely to pigmentation patterns in the shell, providing the first case of a gene product to be involved in molluskan shell pigmentation. We also describe the expression of two novel genes involved in nacre (mother of pearl deposition. Conclusion The unexpected complexity and evolvability of this secretome and the modular design of the molluskan mantle enables

  12. Nuclear integrated database and design advancement system

    International Nuclear Information System (INIS)

    Ha, Jae Joo; Jeong, Kwang Sub; Kim, Seung Hwan; Choi, Sun Young.

    1997-01-01

    The objective of NuIDEAS is to computerize design processes through an integrated database by eliminating the current work style of delivering hardcopy documents and drawings. The major research contents of NuIDEAS are the advancement of design processes by computerization, the establishment of design database and 3 dimensional visualization of design data. KSNP (Korea Standard Nuclear Power Plant) is the target of legacy database and 3 dimensional model, so that can be utilized in the next plant design. In the first year, the blueprint of NuIDEAS is proposed, and its prototype is developed by applying the rapidly revolutionizing computer technology. The major results of the first year research were to establish the architecture of the integrated database ensuring data consistency, and to build design database of reactor coolant system and heavy components. Also various softwares were developed to search, share and utilize the data through networks, and the detailed 3 dimensional CAD models of nuclear fuel and heavy components were constructed, and walk-through simulation using the models are developed. This report contains the major additions and modifications to the object oriented database and associated program, using methods and Javascript.. (author). 36 refs., 1 tab., 32 figs

  13. Proteomic techniques for characterisation of mesenchymal stem cell secretome.

    Czech Academy of Sciences Publication Activity Database

    Kupcová Skalníková, Helena

    2013-01-01

    Roč. 95, č. 12 (2013), s. 2196-2211 ISSN 0300-9084 R&D Projects: GA MŠk ED2.1.00/03.0124; GA TA ČR TA01011466 Institutional support: RVO:67985904 Keywords : mesenchymal stem cells * secretome * exosome * conditioned medium * proteomics Subject RIV: CE - Biochemistry Impact factor: 3.123, year: 2013

  14. Optimal database locks for efficient integrity checking

    DEFF Research Database (Denmark)

    Martinenghi, Davide

    2004-01-01

    In concurrent database systems, correctness of update transactions refers to the equivalent effects of the execution schedule and some serial schedule over the same set of transactions. Integrity constraints add further semantic requirements to the correctness of the database states reached upon...... the execution of update transactions. Several methods for efficient integrity checking and enforcing exist. We show in this paper how to apply one such method to automatically extend update transactions with locks and simplified consistency tests on the locked entities. All schedules produced in this way...

  15. Loopedia, a database for loop integrals

    Science.gov (United States)

    Bogner, C.; Borowka, S.; Hahn, T.; Heinrich, G.; Jones, S. P.; Kerner, M.; von Manteuffel, A.; Michel, M.; Panzer, E.; Papara, V.

    2018-04-01

    Loopedia is a new database at loopedia.org for information on Feynman integrals, intended to provide both bibliographic information as well as results made available by the community. Its bibliometry is complementary to that of INSPIRE or arXiv in the sense that it admits searching for integrals by graph-theoretical objects, e.g. its topology.

  16. Functional integration of automated system databases by means of artificial intelligence

    Science.gov (United States)

    Dubovoi, Volodymyr M.; Nikitenko, Olena D.; Kalimoldayev, Maksat; Kotyra, Andrzej; Gromaszek, Konrad; Iskakova, Aigul

    2017-08-01

    The paper presents approaches for functional integration of automated system databases by means of artificial intelligence. The peculiarities of turning to account the database in the systems with the usage of a fuzzy implementation of functions were analyzed. Requirements for the normalization of such databases were defined. The question of data equivalence in conditions of uncertainty and collisions in the presence of the databases functional integration is considered and the model to reveal their possible occurrence is devised. The paper also presents evaluation method of standardization of integrated database normalization.

  17. Investigating the secretome : Lessons about the cells that comprise the heart

    Czech Academy of Sciences Publication Activity Database

    Šťastná, Miroslava; Van Eyk, J.E.

    2012-01-01

    Roč. 5, č. 1 (2012), o8-o18 ISSN 1942-325X Institutional research plan: CEZ:AV0Z40310501 Keywords : secretomes * proteomics * cardiovascular diseases Subject RIV: CB - Analytical Chemistry, Separation Impact factor: 6.728, year: 2012

  18. Secretome analysis of Aspergillus fumigatus reveals Asp-hemolysin as a major secreted protein.

    Science.gov (United States)

    Wartenberg, Dirk; Lapp, Katrin; Jacobsen, Ilse D; Dahse, Hans-Martin; Kniemeyer, Olaf; Heinekamp, Thorsten; Brakhage, Axel A

    2011-11-01

    Surface-associated and secreted proteins represent primarily exposed components of Aspergillus fumigatus during host infection. Several secreted proteins are known to be involved in defense mechanisms or immune evasion, thus, probably contributing to pathogenicity. Furthermore, several secreted antigens were identified as possible biomarkers for the verification of diseases caused by Aspergillus species. Nevertheless, there is only limited knowledge about the composition of the secretome and about molecular functions of particular proteins. To identify secreted proteins potentially essential for virulence, the core secretome of A. fumigatus grown in minimal medium was determined. Two-dimensional gel electrophoretic separation and subsequent MALDI-TOF-MS/MS analyses resulted in the identification of 64 different proteins. Additionally, secretome analyses of A. fumigatus utilizing elastin, collagen or keratin as main carbon and nitrogen source were performed. Thereby, the alkaline serine protease Alp1 was identified as the most abundant protein and hence presumably represents an important protease during host infection. Interestingly, the Asp-hemolysin (Asp-HS), which belongs to the protein family of aegerolysins and which was often suggested to be involved in fungal virulence, was present in the secretome under all growth conditions tested. In addition, a second, non-secreted protein with an aegerolysin domain annotated as Asp-hemolysin-like (HS-like) protein can be found to be encoded in the genome of A. fumigatus. Generation and analysis of Asp-HS and HS-like deletion strains revealed no differences in phenotype compared to the corresponding wild-type strain. Furthermore, hemolysis and cytotoxicity was not altered in both single-deletion and double-deletion mutants lacking both aegerolysin genes. All mutant strains showed no attenuation in virulence in a mouse infection model for invasive pulmonary aspergillosis. Overall, this study provides a comprehensive

  19. Secretome within the bone marrow microenvironment: A basis for mesenchymal stem cell treatment and role in cancer dormancy.

    Science.gov (United States)

    Eltoukhy, Hussam S; Sinha, Garima; Moore, Caitlyn; Gergues, Marina; Rameshwar, Pranela

    2018-05-31

    The secretome produced by cells within the bone marrow is significant to homeostasis. The bone marrow, a well-studied organ, has multiple niches with distinct roles for supporting stem cell functions. Thus, an understanding of mediators involved in the regulation of stem cells could serve as a model for clinical problems and solutions such as tissue repair and regeneration. The exosome secretome of bone marrow stem cells is a developing area of research with respect to the regenerative potential by bone marrow cell, particularly the mesenchymal stem cells. The bone marrow niche regulates endogenous processes such as hematopoiesis but could also support the survival of tumors such as facilitating the cancer stem cells to exist in dormancy for decades. The bone marrow-derived secretome will be critical to future development of therapeutic strategies for oncologic diseases, in addition to regenerative medicine. This article discusses the importance for parallel studies to determine how the same secretome may compromise safety during the use of stem cells in regenerative medicine. Copyright © 2018 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.

  20. Stem cell secretome-rich nanoclay hydrogel: a dual action therapy for cardiovascular regeneration

    Science.gov (United States)

    Waters, Renae; Pacelli, Settimio; Maloney, Ryan; Medhi, Indrani; Ahmed, Rafeeq P. H.; Paul, Arghya

    2016-03-01

    A nanocomposite hydrogel with photocrosslinkable micro-porous networks and a nanoclay component was successfully prepared to control the release of growth factor-rich stem cell secretome. The proven pro-angiogenic and cardioprotective potential of this new bioactive system provides a valuable therapeutic platform for cardiac tissue repair and regeneration.A nanocomposite hydrogel with photocrosslinkable micro-porous networks and a nanoclay component was successfully prepared to control the release of growth factor-rich stem cell secretome. The proven pro-angiogenic and cardioprotective potential of this new bioactive system provides a valuable therapeutic platform for cardiac tissue repair and regeneration. Electronic supplementary information (ESI) available. See DOI: 10.1039/c5nr07806g

  1. Development of an integrated database management system to evaluate integrity of flawed components of nuclear power plant

    International Nuclear Information System (INIS)

    Mun, H. L.; Choi, S. N.; Jang, K. S.; Hong, S. Y.; Choi, J. B.; Kim, Y. J.

    2001-01-01

    The object of this paper is to develop an NPP-IDBMS(Integrated DataBase Management System for Nuclear Power Plants) for evaluating the integrity of components of nuclear power plant using relational data model. This paper describes the relational data model, structure and development strategy for the proposed NPP-IDBMS. The NPP-IDBMS consists of database, database management system and interface part. The database part consists of plant, shape, operating condition, material properties and stress database, which are required for the integrity evaluation of each component in nuclear power plants. For the development of stress database, an extensive finite element analysis was performed for various components considering operational transients. The developed NPP-IDBMS will provide efficient and accurate way to evaluate the integrity of flawed components

  2. GBM secretome induces transient transformation of human neural precursor cells.

    Science.gov (United States)

    Venugopal, Chitra; Wang, X Simon; Manoranjan, Branavan; McFarlane, Nicole; Nolte, Sara; Li, Meredith; Murty, Naresh; Siu, K W Michael; Singh, Sheila K

    2012-09-01

    Glioblastoma (GBM) is the most aggressive primary brain tumor in humans, with a uniformly poor prognosis. The tumor microenvironment is composed of both supportive cellular substrates and exogenous factors. We hypothesize that exogenous factors secreted by brain tumor initiating cells (BTICs) could predispose normal neural precursor cells (NPCs) to transformation. When NPCs are grown in GBM-conditioned media, and designated as "tumor-conditioned NPCs" (tcNPCs), they become highly proliferative and exhibit increased stem cell self-renewal, or the unique ability of stem cells to asymmetrically generate another stem cell and a daughter cell. tcNPCs also show an increased transcript level of stem cell markers such as CD133 and ALDH and growth factor receptors such as VEGFR1, VEGFR2, EGFR and PDGFRα. Media analysis by ELISA of GBM-conditioned media reveals an elevated secretion of growth factors such as EGF, VEGF and PDGF-AA when compared to normal neural stem cell-conditioned media. We also demonstrate that tcNPCs require prolonged or continuous exposure to the GBM secretome in vitro to retain GBM BTIC characteristics. Our in vivo studies reveal that tcNPCs are unable to form tumors, confirming that irreversible transformation events may require sustained or prolonged presence of the GBM secretome. Analysis of GBM-conditioned media by mass spectrometry reveals the presence of secreted proteins Chitinase-3-like 1 (CHI3L1) and H2A histone family member H2AX. Collectively, our data suggest that GBM-secreted factors are capable of transiently altering normal NPCs, although for retention of the transformed phenotype, sustained or prolonged secretome exposure or additional transformation events are likely necessary.

  3. Computational prediction of secretion systems and secretomes of Brucella: identification of novel type IV effectors and their interaction with the host.

    Science.gov (United States)

    Sankarasubramanian, Jagadesan; Vishnu, Udayakumar S; Dinakaran, Vasudevan; Sridhar, Jayavel; Gunasekaran, Paramasamy; Rajendhran, Jeyaprakash

    2016-01-01

    Brucella spp. are facultative intracellular pathogens that cause brucellosis in various mammals including humans. Brucella survive inside the host cells by forming vacuoles and subverting host defence systems. This study was aimed to predict the secretion systems and the secretomes of Brucella spp. from 39 complete genome sequences available in the databases. Furthermore, an attempt was made to identify the type IV secretion effectors and their interactions with host proteins. We predicted the secretion systems of Brucella by the KEGG pathway and SecReT4. Brucella secretomes and type IV effectors (T4SEs) were predicted through genome-wide screening using JVirGel and S4TE, respectively. Protein-protein interactions of Brucella T4SEs with their hosts were analyzed by HPIDB 2.0. Genes coding for Sec and Tat pathways of secretion and type I (T1SS), type IV (T4SS) and type V (T5SS) secretion systems were identified and they are conserved in all the species of Brucella. In addition to the well-known VirB operon coding for the type IV secretion system (T4SS), we have identified the presence of additional genes showing homology with T4SS of other organisms. On the whole, 10.26 to 14.94% of total proteomes were found to be either secreted (secretome) or membrane associated (membrane proteome). Approximately, 1.7 to 3.0% of total proteomes were identified as type IV secretion effectors (T4SEs). Prediction of protein-protein interactions showed 29 and 36 host-pathogen specific interactions between Bos taurus (cattle)-B. abortus and Ovis aries (sheep)-B. melitensis, respectively. Functional characterization of the predicted T4SEs and their interactions with their respective hosts may reveal the secrets of host specificity of Brucella.

  4. A database of immunoglobulins with integrated tools: DIGIT.

    KAUST Repository

    Chailyan, Anna; Tramontano, Anna; Marcatili, Paolo

    2011-01-01

    The DIGIT (Database of ImmunoGlobulins with Integrated Tools) database (http://biocomputing.it/digit) is an integrated resource storing sequences of annotated immunoglobulin variable domains and enriched with tools for searching and analyzing them. The annotations in the database include information on the type of antigen, the respective germline sequences and on pairing information between light and heavy chains. Other annotations, such as the identification of the complementarity determining regions, assignment of their structural class and identification of mutations with respect to the germline, are computed on the fly and can also be obtained for user-submitted sequences. The system allows customized BLAST searches and automatic building of 3D models of the domains to be performed.

  5. A database of immunoglobulins with integrated tools: DIGIT.

    KAUST Repository

    Chailyan, Anna

    2011-11-10

    The DIGIT (Database of ImmunoGlobulins with Integrated Tools) database (http://biocomputing.it/digit) is an integrated resource storing sequences of annotated immunoglobulin variable domains and enriched with tools for searching and analyzing them. The annotations in the database include information on the type of antigen, the respective germline sequences and on pairing information between light and heavy chains. Other annotations, such as the identification of the complementarity determining regions, assignment of their structural class and identification of mutations with respect to the germline, are computed on the fly and can also be obtained for user-submitted sequences. The system allows customized BLAST searches and automatic building of 3D models of the domains to be performed.

  6. Proteomic Studies of Cholangiocarcinoma and Hepatocellular Carcinoma Cell Secretomes

    Directory of Open Access Journals (Sweden)

    Chantragan Srisomsap

    2010-01-01

    Full Text Available Cholangiocarcinoma (CCA and hepatocellular carcinoma (HCC occur with relatively high incidence in Thailand. The secretome, proteins secreted from cancer cells, are potentially useful as biomarkers of the diseases. Proteomic analysis was performed on the secreted proteins of cholangiocarcinoma (HuCCA-1 and hepatocellular carcinoma (HCC-S102, HepG2, SK-Hep-1, and Alexander cell lines. The secretomes of the five cancer cell lines were analyzed by SDS-PAGE combined with LC/MS/MS. Sixty-eight proteins were found to be expressed only in HuCCA-1. Examples include neutrophil gelatinase-associated lipocalin (lipocalin 2, laminin 5 beta 3, cathepsin D precursor, desmoplakin, annexin IV variant, and annexin A5. Immunoblotting was used to confirm the presence of lipocalin 2 in conditioned media and cell lysate of 5 cell lines. The results showed that lipocalin 2 was a secreted protein which is expressed only in the conditioned media of the cholangiocarcinoma cell line. Study of lipocalin 2 expression in different types of cancer and normal tissues from cholangiocarcinoma patients showed that lipocalin 2 was expressed only in the cancer tissues. We suggest that lipocalin 2 may be a potential biomarker for cholangiocarcinoma.

  7. Control of Secreted Protein Gene Expression and the Mammalian Secretome by the Metabolic Regulator PGC-1α.

    Science.gov (United States)

    Minsky, Neri; Roeder, Robert G

    2017-01-06

    Secreted proteins serve pivotal roles in the development of multicellular organisms, acting as structural matrix, extracellular enzymes, and signal molecules. However, how the secretome is regulated remains incompletely understood. Here we demonstrate, unexpectedly, that peroxisome proliferator-activated receptor γ coactivator 1-α (PGC-1α), a critical transcriptional co-activator of metabolic gene expression, functions to down-regulate the expression of diverse genes encoding secreted molecules and extracellular matrix components to modulate the secretome. Using cell lines, primary cells, and mice, we show that both endogenous and exogenous PGC-1α down-regulate the expression of numerous genes encoding secreted molecules. Mechanistically, results obtained using mRNA stability measurements as well as intronic RNA expression analysis are consistent with a transcriptional effect of PGC-1α on the expression of genes encoding secreted proteins. Interestingly, PGC-1α requires the central heat shock response regulator heat shock factor protein 1 (HSF1) to affect some of its targets, and both factors co-reside on several target genes encoding secreted molecules in cells. Finally, using a mass spectrometric analysis of secreted proteins, we demonstrate that PGC-1α modulates the secretome of mouse embryonic fibroblasts. Our results define a link between a key pathway controlling metabolic regulation and the regulation of the mammalian secretome. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  8. Secretome of Aggregated Embryonic Stem Cell-Derived Mesenchymal Stem Cell Modulates the Release of Inflammatory Factors in Lipopolysaccharide-Induced Peripheral Blood Mononuclear Cells

    Science.gov (United States)

    Mohammadi Ghahhari, Nastaran; Maghsood, Faezeh; Jahandideh, Saeed; Lotfinia, Majid; Lak, Shirin; Johari, Behrooz; Azarnezhad, Asaad; Kadivar, Mehdi

    2018-07-01

    Bone marrow mesenchymal stem cells (BM-MSCs) have emerged as a potential therapy for various inflammatory diseases. Because of some limitations, several recent studies have suggested the use of embryonic stem cell-derived MSCs (ESC-MSCs) as an alternative for BM-MSCs. Some of the therapeutic effects of the ESC-MSCs are related to the secretion of a broad array of cytokines and growth factors, known as secretome. Harnessing this secretome for therapeutic applications requires the optimization of production of secretary molecules. It has been shown that aggregation of MSCs into 3D spheroids, as a preconditioning strategy, can enhance immunomodulatory potential of such cells. In this study, we investigated the effect of secretome derived from human ESC-MSCs (hESC-MSCs) spheroids on secretion of IL-1β, IL-10, and tumor necrosis factor α (TNF-α) from lipopolysaccharide (LPS)-induced peripheral blood mononuclear cells (PBMCs). In the present study, after immunophenotyping and considering mesodermal differentiation of hESC-MSCs, the cells were non-adherently grown to prepare 3D aggregates, and then conditioned medium or secretome was extracted from the cultures. Afterwards, the anti-inflammatory effects of the secretome were assessed in an in vitro model of inflammation. Results from this study showed that aggregate-prepared secretome from hESC-MSCs was able to significantly decrease the secretion of TNF-α (301.7 ± 5.906, p strategy to increase immunomodulatory characteristics of hESC-MSCs.

  9. Secretome analysis to elucidate metalloprotease-dependent ectodomain shedding of glycoproteins during neuronal differentiation.

    Science.gov (United States)

    Tsumagari, Kazuya; Shirakabe, Kyoko; Ogura, Mayu; Sato, Fuminori; Ishihama, Yasushi; Sehara-Fujisawa, Atsuko

    2017-02-01

    Many membrane proteins are subjected to limited proteolyses at their juxtamembrane regions, processes referred to as ectodomain shedding. Shedding ectodomains of membrane-bound ligands results in activation of downstream signaling pathways, whereas shedding those of cell adhesion molecules causes loss of cell-cell contacts. Secreted proteomics (secretomics) using high-resolution mass spectrometry would be strong tools for both comprehensive identification and quantitative measurement of membrane proteins that undergo ectodomain shedding. In this study, to elucidate the ectodomain shedding events that occur during neuronal differentiation, we establish a strategy for quantitative secretomics of glycoproteins released from differentiating neuroblastoma cells into culture medium with or without GM6001, a broad-spectrum metalloprotease inhibitor. Considering that most of transmembrane and secreted proteins are N-glycosylated, we include a process of N-glycosylated peptides enrichment as well as isotope tagging in our secretomics workflow. Our results show that differentiating N1E-115 neurons secrete numerous glycosylated polypeptides in metalloprotease-dependent manners. They are derived from cell adhesion molecules such as NCAM1, CADM1, L1CAM, various transporters and receptor proteins. These results show the landscape of ectodomain shedding and other secretory events in differentiating neurons and/or during axon elongation, which should help elucidate the mechanism of neurogenesis and the pathogenesis of neurological disorders. © 2017 Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.

  10. Acidithiobacillus thiooxidans secretome containing a newly described lipoprotein Licanantase enhances chalcopyrite bioleaching rate

    Science.gov (United States)

    Bobadilla Fazzini, Roberto A.; Levican, Gloria

    2010-01-01

    The nature of the mineral–bacteria interphase where electron and mass transfer processes occur is a key element of the bioleaching processes of sulfide minerals. This interphase is composed of proteins, metabolites, and other compounds embedded in extracellular polymeric substances mainly consisting of sugars and lipids (Gehrke et al., Appl Environ Microbiol 64(7):2743–2747, 1998). On this respect, despite Acidithiobacilli—a ubiquitous bacterial genera in bioleaching processes (Rawlings, Microb Cell Fact 4(1):13, 2005)—has long been recognized as secreting bacteria (Jones and Starkey, J Bacteriol 82:788–789, 1961; Schaeffer and Umbreit, J Bacteriol 85:492–493, 1963), few studies have been carried out in order to clarify the nature and the role of the secreted protein component: the secretome. This work characterizes for the first time the sulfur (meta)secretome of Acidithiobacillus thiooxidans strain DSM 17318 in pure and mixed cultures with Acidithiobacillus ferrooxidans DSM 16786, identifying the major component of these secreted fractions as a single lipoprotein named here as Licanantase. Bioleaching assays with the addition of Licanantase-enriched concentrated secretome fractions show that this newly found lipoprotein as an active protein additive exerts an increasing effect on chalcopyrite bioleaching rate. Electronic supplementary material The online version of this article (doi:10.1007/s00253-010-3063-8) contains supplementary material, which is available to authorized users. PMID:21191788

  11. Comparative analysis of the in vitro and in planta secretomes from Mycosphaerella fijiensis isolates.

    Science.gov (United States)

    Escobar-Tovar, Lina; Guzmán-Quesada, Mauricio; Sandoval-Fernández, Jorge A; Gómez-Lim, Miguel A

    2015-06-01

    Black Sigatoka, a devastating disease of bananas and plantains worldwide, is caused by the fungus Mycosphaerella fijiensis. Several banana cultivars such as 'Yangambi Km 5' and Calcutta IV, have been known to be resistant to the fungus, but the resistance has been broken in 'Yangambi Km 5' in Costa Rica. Since the resistance of this variety still persists in Mexico, the aim of this study was to compare the in vitro and in planta secretomes from two avirulent and virulent M. fijiensis isolates using proteomics and bioinformatics approaches. We aimed to identify differentially expressed proteins in fungal isolates that differ in pathogenicity and that might be responsible for breaking the resistance in 'Yangambi Km 5'. We were able to identify 90 protein spots in the secretomes of fungal isolates encoding 42 unique proteins and 35 differential spots between them. Proteins involved in carbohydrate transport and metabolism were more prevalent. Several proteases, pathogenicity-related, ROS detoxification and unknown proteins were also highly or specifically expressed by the virulent isolate in vitro or during in planta infection. An unknown protein representing a virulence factor candidate was also identified. These results demonstrated that the secretome reflects major differences between both M. fijiensis isolates. Copyright © 2015 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  12. On Simplification of Database Integrity Constraints

    DEFF Research Database (Denmark)

    Christiansen, Henning; Martinenghi, Davide

    2006-01-01

    Without proper simplification techniques, database integrity checking can be prohibitively time consuming. Several methods have been developed for producing simplified incremental checks for each update but none until now of sufficient quality and generality for providing a true practical impact,...

  13. Differential proteomic analysis of the secretome of Irpex lacteus and other white-rot fungi during wheat straw pretreatment.

    Science.gov (United States)

    Salvachúa, Davinia; Martínez, Angel T; Tien, Ming; López-Lucendo, María F; García, Francisco; de Los Ríos, Vivian; Martínez, María Jesús; Prieto, Alicia

    2013-08-10

    Identifying new high-performance enzymes or enzyme complexes to enhance biomass degradation is the key for the development of cost-effective processes for ethanol production. Irpex lacteus is an efficient microorganism for wheat straw pretreatment, yielding easily hydrolysable products with high sugar content. Thus, this fungus was selected to investigate the enzymatic system involved in lignocellulose decay, and its secretome was compared to those from Phanerochaete chrysosporium and Pleurotus ostreatus which produced different degradation patterns when growing on wheat straw. Extracellular enzymes were analyzed through 2D-PAGE, nanoLC/MS-MS, and homology searches against public databases. In wheat straw, I. lacteus secreted proteases, dye-decolorizing and manganese-oxidizing peroxidases, and H2O2 producing-enzymes but also a battery of cellulases and xylanases, excluding those implicated in cellulose and hemicellulose degradation to their monosaccharides, making these sugars poorly available for fungal consumption. In contrast, a significant increase of β-glucosidase production was observed when I. lacteus grew in liquid cultures. P. chrysosporium secreted more enzymes implicated in the total hydrolysis of the polysaccharides and P. ostreatus produced, in proportion, more oxidoreductases. The protein pattern secreted during I. lacteus growth in wheat straw plus the differences observed among the different secretomes, justify the fitness of I. lacteus for biopretreatment processes in 2G-ethanol production. Furthermore, all these data give insight into the biological degradation of lignocellulose and suggest new enzyme mixtures interesting for its efficient hydrolysis.

  14. Integrating heterogeneous databases in clustered medic care environments using object-oriented technology

    Science.gov (United States)

    Thakore, Arun K.; Sauer, Frank

    1994-05-01

    The organization of modern medical care environments into disease-related clusters, such as a cancer center, a diabetes clinic, etc., has the side-effect of introducing multiple heterogeneous databases, often containing similar information, within the same organization. This heterogeneity fosters incompatibility and prevents the effective sharing of data amongst applications at different sites. Although integration of heterogeneous databases is now feasible, in the medical arena this is often an ad hoc process, not founded on proven database technology or formal methods. In this paper we illustrate the use of a high-level object- oriented semantic association method to model information found in different databases into an integrated conceptual global model that integrates the databases. We provide examples from the medical domain to illustrate an integration approach resulting in a consistent global view, without attacking the autonomy of the underlying databases.

  15. Heterogeneous Biomedical Database Integration Using a Hybrid Strategy: A p53 Cancer Research Database

    Directory of Open Access Journals (Sweden)

    Vadim Y. Bichutskiy

    2006-01-01

    Full Text Available Complex problems in life science research give rise to multidisciplinary collaboration, and hence, to the need for heterogeneous database integration. The tumor suppressor p53 is mutated in close to 50% of human cancers, and a small drug-like molecule with the ability to restore native function to cancerous p53 mutants is a long-held medical goal of cancer treatment. The Cancer Research DataBase (CRDB was designed in support of a project to find such small molecules. As a cancer informatics project, the CRDB involved small molecule data, computational docking results, functional assays, and protein structure data. As an example of the hybrid strategy for data integration, it combined the mediation and data warehousing approaches. This paper uses the CRDB to illustrate the hybrid strategy as a viable approach to heterogeneous data integration in biomedicine, and provides a design method for those considering similar systems. More efficient data sharing implies increased productivity, and, hopefully, improved chances of success in cancer research. (Code and database schemas are freely downloadable, http://www.igb.uci.edu/research/research.html.

  16. Secretome weaponries of Cochliobolus lunatus interacting with potato leaf at different temperature regimes reveal a CL[xxxx]LHM - motif.

    Science.gov (United States)

    Louis, Bengyella; Waikhom, Sayanika Devi; Roy, Pranab; Bhardwaj, Pardeep Kumar; Singh, Mohendro Wakambam; Goyari, Sailendra; Sharma, Chandradev K; Talukdar, Narayan Chandra

    2014-03-20

    Plant and animal pathogenic fungus Cochliobolus lunatus cause great economic damages worldwide every year. C. lunatus displays an increased temperature dependent-virulence to a wide range of hosts. Nonetheless, this phenomenon is poorly understood due to lack of insights on the coordinated secretome weaponries produced by C. lunatus under heat-stress conditions on putative hosts. To understand the mechanism better, we dissected the secretome of C. lunatus interacting with potato (Solanum tuberosum L.) leaf at different temperature regimes. C. lunatus produced melanized colonizing hyphae in and on potato leaf, finely modulated the ambient pH as a function of temperature and secreted diverse set of proteins. Using two dimensional gel electrophoresis (2-D) and mass spectrometry (MS) technology, we observed discrete secretomes at 20°C, 28°C and 38°C. A total of 21 differentially expressed peptide spots and 10 unique peptide spots (that did not align on the gels) matched with 28 unique protein models predicted from C. lunatus m118 v.2 genome peptides. Furthermore, C. lunatus secreted peptides via classical and non-classical pathways related to virulence, proteolysis, nucleic acid metabolism, carbohydrate metabolism, heat stress, signal trafficking and some with unidentified catalytic domains. We have identified a set of 5 soluble candidate effectors of unknown function from C. lunatus secretome weaponries against potato crop at different temperature regimes. Our findings demonstrate that C. lunatus has a repertoire of signature secretome which mediates thermo-pathogenicity and share a leucine rich "CL[xxxx]LHM"-motif. Considering the rapidly evolving temperature dependent-virulence and host diversity of C. lunatus, this data will be useful for designing new protection strategies.

  17. KaBOB: ontology-based semantic integration of biomedical databases.

    Science.gov (United States)

    Livingston, Kevin M; Bada, Michael; Baumgartner, William A; Hunter, Lawrence E

    2015-04-23

    The ability to query many independent biological databases using a common ontology-based semantic model would facilitate deeper integration and more effective utilization of these diverse and rapidly growing resources. Despite ongoing work moving toward shared data formats and linked identifiers, significant problems persist in semantic data integration in order to establish shared identity and shared meaning across heterogeneous biomedical data sources. We present five processes for semantic data integration that, when applied collectively, solve seven key problems. These processes include making explicit the differences between biomedical concepts and database records, aggregating sets of identifiers denoting the same biomedical concepts across data sources, and using declaratively represented forward-chaining rules to take information that is variably represented in source databases and integrating it into a consistent biomedical representation. We demonstrate these processes and solutions by presenting KaBOB (the Knowledge Base Of Biomedicine), a knowledge base of semantically integrated data from 18 prominent biomedical databases using common representations grounded in Open Biomedical Ontologies. An instance of KaBOB with data about humans and seven major model organisms can be built using on the order of 500 million RDF triples. All source code for building KaBOB is available under an open-source license. KaBOB is an integrated knowledge base of biomedical data representationally based in prominent, actively maintained Open Biomedical Ontologies, thus enabling queries of the underlying data in terms of biomedical concepts (e.g., genes and gene products, interactions and processes) rather than features of source-specific data schemas or file formats. KaBOB resolves many of the issues that routinely plague biomedical researchers intending to work with data from multiple data sources and provides a platform for ongoing data integration and development and for

  18. Data for iTRAQ secretomic analysis of Aspergillus fumigatus in response to different carbon sources

    Directory of Open Access Journals (Sweden)

    Sunil S. Adav

    2015-06-01

    Full Text Available Here, we provide data related to the research article entitled “Quantitative proteomics study of Aspergillus fumigatus secretome revealed deamidation of secretory enzymes” by Adav et al. (J. Proteomics (2015 [1]. Aspergillus sp. plays an important role in lignocellulosic biomass recycling. To explore biomass hydrolyzing enzymes of A. fumigatus, we profiled secretome under different carbon sources such as glucose, cellulose, xylan and starch by high throughput quantitative proteomics using isobaric tags for relative and absolute quantification (iTRAQ. The data presented here represents the detailed comparative abundances of diverse groups of biomass hydrolyzing enzymes including cellulases, hemicellulases, lignin degrading enzymes, and peptidases and proteases; and their post translational modification like deamidation.

  19. Secretome of fungus-infected aphids documents high pathogen activity and weak host response

    DEFF Research Database (Denmark)

    Grell, Morten Nedergaard; Jensen, Annette Bruun; Olsen, Peter B.

    2011-01-01

    Discovery of novel secretome proteins contributes to the understanding of host-pathogen interactions. Here we report a rich diversity of secreted proteins from the interaction between grain aphids (host, insect order Hemiptera) and fungi of the order Entomophthorales (insect pathogens), made...

  20. Human astrocytes: secretome profiles of cytokines and chemokines.

    Directory of Open Access Journals (Sweden)

    Sung S Choi

    Full Text Available Astrocytes play a key role in maintenance of neuronal functions in the central nervous system by producing various cytokines, chemokines, and growth factors, which act as a molecular coordinator of neuron-glia communication. At the site of neuroinflammation, astrocyte-derived cytokines and chemokines play both neuroprotective and neurotoxic roles in brain lesions of human neurological diseases. At present, the comprehensive profile of human astrocyte-derived cytokines and chemokines during inflammation remains to be fully characterized. We investigated the cytokine secretome profile of highly purified human astrocytes by using a protein microarray. Non-stimulated human astrocytes in culture expressed eight cytokines, including G-CSF, GM-CSF, GROα (CXCL1, IL-6, IL-8 (CXCL8, MCP-1 (CCL2, MIF and Serpin E1. Following stimulation with IL-1β and TNF-α, activated astrocytes newly produced IL-1β, IL-1ra, TNF-α, IP-10 (CXCL10, MIP-1α (CCL3 and RANTES (CCL5, in addition to the induction of sICAM-1 and complement component 5. Database search indicated that most of cytokines and chemokines produced by non-stimulated and activated astrocytes are direct targets of the transcription factor NF-kB. These results indicated that cultured human astrocytes express a distinct set of NF-kB-target cytokines and chemokines in resting and activated conditions, suggesting that the NF-kB signaling pathway differentially regulates gene expression of cytokines and chemokines in human astrocytes under physiological and inflammatory conditions.

  1. A secretome analysis reveals that PPARα is upregulated by fractionated-dose γ-irradiation in three-dimensional keratinocyte cultures

    International Nuclear Information System (INIS)

    Lee, Jee Yong; Kim, Hyun Ji; Yi, Jae Youn

    2016-01-01

    A three-dimensional (3D) environment composed of properly interconnected and differentiated cells that allows communication and cooperation among cells via secreted molecules would be expected to more accurately reflect cellular responses. Here, we investigated γ-irradiation-induced changes in the secretome of 3D-cultured keratinocytes. An analysis of keratinocyte secretome profiles following fractionated-dose γ-irradiation revealed changes in genes involved in cell adhesion, angiogenesis, and the immune system. Notably, peroxisome proliferator-activated receptor-(PPARα) was upregulated in response to fractionated-dose γ-irradiation. This upregulation was associated with an increase in the transcription of known PPARα target genes, including angiopoietin-like protein 4, dermokine and kallikrein-related peptide 12, which were differentially regulated by fractionated-dose γ-irradiation. Collectively, our data imply a mechanism linking γ-irradiation and secretome changes, and suggest that these changes could play a significant role in the coordinated cellular responses to harmful ionizing radiation, such as those associated with radiation therapy. This extension of our understanding of γ-irradiation-induced secretome changes has the potential to improve radiation therapy strategies. Control of inflammatory waves, improved wound healing, and stabilization of the skin barrier are imperative for minimizing such injuries. Therefore, PPARα agonists and antagonists have the potential to become important therapeutic agents for the treatment of γ-irradiation induced skin damage. Specifically, our analysis suggests that the undesirable consequences of long-term exposure to ionizing radiation could be alleviated by PPARα agonists

  2. A secretome analysis reveals that PPARα is upregulated by fractionated-dose γ-irradiation in three-dimensional keratinocyte cultures

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Jee Yong; Kim, Hyun Ji; Yi, Jae Youn [Korea Institute of Radiation and Medical Sciences, Daejeon (Korea, Republic of)

    2016-05-15

    A three-dimensional (3D) environment composed of properly interconnected and differentiated cells that allows communication and cooperation among cells via secreted molecules would be expected to more accurately reflect cellular responses. Here, we investigated γ-irradiation-induced changes in the secretome of 3D-cultured keratinocytes. An analysis of keratinocyte secretome profiles following fractionated-dose γ-irradiation revealed changes in genes involved in cell adhesion, angiogenesis, and the immune system. Notably, peroxisome proliferator-activated receptor-(PPARα) was upregulated in response to fractionated-dose γ-irradiation. This upregulation was associated with an increase in the transcription of known PPARα target genes, including angiopoietin-like protein 4, dermokine and kallikrein-related peptide 12, which were differentially regulated by fractionated-dose γ-irradiation. Collectively, our data imply a mechanism linking γ-irradiation and secretome changes, and suggest that these changes could play a significant role in the coordinated cellular responses to harmful ionizing radiation, such as those associated with radiation therapy. This extension of our understanding of γ-irradiation-induced secretome changes has the potential to improve radiation therapy strategies. Control of inflammatory waves, improved wound healing, and stabilization of the skin barrier are imperative for minimizing such injuries. Therefore, PPARα agonists and antagonists have the potential to become important therapeutic agents for the treatment of γ-irradiation induced skin damage. Specifically, our analysis suggests that the undesirable consequences of long-term exposure to ionizing radiation could be alleviated by PPARα agonists.

  3. Integrated spent nuclear fuel database system

    International Nuclear Information System (INIS)

    Henline, S.P.; Klingler, K.G.; Schierman, B.H.

    1994-01-01

    The Distributed Information Systems software Unit at the Idaho National Engineering Laboratory has designed and developed an Integrated Spent Nuclear Fuel Database System (ISNFDS), which maintains a computerized inventory of all US Department of Energy (DOE) spent nuclear fuel (SNF). Commercial SNF is not included in the ISNFDS unless it is owned or stored by DOE. The ISNFDS is an integrated, single data source containing accurate, traceable, and consistent data and provides extensive data for each fuel, extensive facility data for every facility, and numerous data reports and queries

  4. SINBAD: Shielding integral benchmark archive and database

    International Nuclear Information System (INIS)

    Hunter, H.T.; Ingersoll, D.T.; Roussin, R.W.

    1996-01-01

    SINBAD is a new electronic database developed to store a variety of radiation shielding benchmark data so that users can easily retrieve and incorporate the data into their calculations. SINBAD is an excellent data source for users who require the quality assurance necessary in developing cross-section libraries or radiation transport codes. The future needs of the scientific community are best served by the electronic database format of SINBAD and its user-friendly interface, combined with its data accuracy and integrity

  5. Defining the predicted protein secretome of the fungal wheat leaf pathogen Mycosphaerella graminicola.

    Directory of Open Access Journals (Sweden)

    Alexandre Morais do Amaral

    Full Text Available The Dothideomycete fungus Mycosphaerella graminicola is the causal agent of Septoria tritici blotch, a devastating disease of wheat leaves that causes dramatic decreases in yield. Infection involves an initial extended period of symptomless intercellular colonisation prior to the development of visible necrotic disease lesions. Previous functional genomics and gene expression profiling studies have implicated the production of secreted virulence effector proteins as key facilitators of the initial symptomless growth phase. In order to identify additional candidate virulence effectors, we re-analysed and catalogued the predicted protein secretome of M. graminicola isolate IPO323, which is currently regarded as the reference strain for this species. We combined several bioinformatic approaches in order to increase the probability of identifying truly secreted proteins with either a predicted enzymatic function or an as yet unknown function. An initial secretome of 970 proteins was predicted, whilst further stringent selection criteria predicted 492 proteins. Of these, 321 possess some functional annotation, the composition of which may reflect the strictly intercellular growth habit of this pathogen, leaving 171 with no functional annotation. This analysis identified a protein family encoding secreted peroxidases/chloroperoxidases (PF01328 which is expanded within all members of the family Mycosphaerellaceae. Further analyses were done on the non-annotated proteins for size and cysteine content (effector protein hallmarks, and then by studying the distribution of homologues in 17 other sequenced Dothideomycete fungi within an overall total of 91 predicted proteomes from fungal, oomycete and nematode species. This detailed M. graminicola secretome analysis provides the basis for further functional and comparative genomics studies.

  6. Using XML technology for the ontology-based semantic integration of life science databases.

    Science.gov (United States)

    Philippi, Stephan; Köhler, Jacob

    2004-06-01

    Several hundred internet accessible life science databases with constantly growing contents and varying areas of specialization are publicly available via the internet. Database integration, consequently, is a fundamental prerequisite to be able to answer complex biological questions. Due to the presence of syntactic, schematic, and semantic heterogeneities, large scale database integration at present takes considerable efforts. As there is a growing apprehension of extensible markup language (XML) as a means for data exchange in the life sciences, this article focuses on the impact of XML technology on database integration in this area. In detail, a general architecture for ontology-driven data integration based on XML technology is introduced, which overcomes some of the traditional problems in this area. As a proof of concept, a prototypical implementation of this architecture based on a native XML database and an expert system shell is described for the realization of a real world integration scenario.

  7. Mesenchymal Stem Cell Secretome: A Potential Tool for the Prevention of Muscle Degenerative Changes Associated With Chronic Rotator Cuff Tears.

    Science.gov (United States)

    Sevivas, Nuno; Teixeira, Fábio Gabriel; Portugal, Raquel; Araújo, Luís; Carriço, Luís Filipe; Ferreira, Nuno; Vieira da Silva, Manuel; Espregueira-Mendes, João; Anjo, Sandra; Manadas, Bruno; Sousa, Nuno; Salgado, António J

    2016-08-08

    Massive rotator cuff tears (MRCTs) are usually chronic lesions with pronounced degenerative changes, where advanced fatty degeneration and atrophy can make the tear irreparable. Human mesenchymal stem cells (hMSCs) secrete a range of growth factors and vesicular systems, known as secretome, that mediates regenerative processes in tissues undergoing degeneration. To study the effect of hMSC secretome on muscular degenerative changes and shoulder function on a rat MRCT model. Controlled laboratory study. A bilateral 2-tendon (supraspinatus and infraspinatus) section was performed to create an MRCT in a rat model. Forty-four Wistar-Han rats were randomly assigned to 6 groups: control group (sham surgery), lesion control group (MRCT), and 4 treated-lesion groups according to the site and periodicity of hMSC secretome injection: single local injection, multiple local injections, single systemic injection, and multiple systemic injections. Forelimb function was analyzed with the staircase test. Atrophy and fatty degeneration of the muscle were evaluated at 8 and 16 weeks after injury. A proteomic analysis was conducted to identify the molecules present in the hMSC secretome that can be associated with muscular degeneration prevention. When untreated for 8 weeks, the MRCT rats exhibited a significantly higher fat content (0.73% ± 0.19%) compared with rats treated with a single local injection (0.21% ± 0.04%; P muscle atrophy, 8 weeks after injury, only the single local injection group (0.0993% ± 0.0036%) presented a significantly higher muscle mass than that of the untreated MRCT group (0.0794% ± 0.0047%; P muscle regeneration, namely, pigment epithelium-derived factor and follistatin. The study data suggest that hMSC secretome effectively decreases the fatty degeneration and atrophy of the rotator cuff muscles. We describe a new approach for decreasing the characteristic muscle degeneration associated with chronic rotator cuff tears. This strategy is particularly

  8. Characterization of Regenerative Phenotype of Unrestricted Somatic Stem Cells (USSC) from Human Umbilical Cord Blood (hUCB) by Functional Secretome Analysis*

    Science.gov (United States)

    Schira, Jessica; Falkenberg, Heiner; Hendricks, Marion; Waldera-Lupa, Daniel M.; Kögler, Gesine; Meyer, Helmut E.; Müller, Hans Werner; Stühler, Kai

    2015-01-01

    Stem cell transplantation is a promising therapeutic strategy to enhance axonal regeneration after spinal cord injury. Unrestricted somatic stem cells (USSC) isolated from human umbilical cord blood is an attractive stem cell population available at GMP grade without any ethical concerns. It has been shown that USSC transplantation into acute injured rat spinal cords leads to axonal regrowth and significant locomotor recovery, yet lacking cell replacement. Instead, USSC secrete trophic factors enhancing neurite growth of primary cortical neurons in vitro. Here, we applied a functional secretome approach characterizing proteins secreted by USSC for the first time and validated candidate neurite growth promoting factors using primary cortical neurons in vitro. By mass spectrometric analysis and exhaustive bioinformatic interrogation we identified 1156 proteins representing the secretome of USSC. Using Gene Ontology we revealed that USSC secretome contains proteins involved in a number of relevant biological processes of nerve regeneration such as cell adhesion, cell motion, blood vessel formation, cytoskeleton organization and extracellular matrix organization. We found for instance that 31 well-known neurite growth promoting factors like, e.g. neuronal growth regulator 1, NDNF, SPARC, and PEDF span the whole abundance range of USSC secretome. By the means of primary cortical neurons in vitro assays we verified SPARC and PEDF as significantly involved in USSC mediated neurite growth and therewith underline their role in improved locomotor recovery after transplantation. From our data we are convinced that USSC are a valuable tool in regenerative medicine as USSC's secretome contains a comprehensive network of trophic factors supporting nerve regeneration not only by a single process but also maintained its regenerative phenotype by a multitude of relevant biological processes. PMID:26183719

  9. Data integration for plant genomics--exemplars from the integration of Arabidopsis thaliana databases.

    Science.gov (United States)

    Lysenko, Artem; Lysenko, Atem; Hindle, Matthew Morritt; Taubert, Jan; Saqi, Mansoor; Rawlings, Christopher John

    2009-11-01

    The development of a systems based approach to problems in plant sciences requires integration of existing information resources. However, the available information is currently often incomplete and dispersed across many sources and the syntactic and semantic heterogeneity of the data is a challenge for integration. In this article, we discuss strategies for data integration and we use a graph based integration method (Ondex) to illustrate some of these challenges with reference to two example problems concerning integration of (i) metabolic pathway and (ii) protein interaction data for Arabidopsis thaliana. We quantify the degree of overlap for three commonly used pathway and protein interaction information sources. For pathways, we find that the AraCyc database contains the widest coverage of enzyme reactions and for protein interactions we find that the IntAct database provides the largest unique contribution to the integrated dataset. For both examples, however, we observe a relatively small amount of data common to all three sources. Analysis and visual exploration of the integrated networks was used to identify a number of practical issues relating to the interpretation of these datasets. We demonstrate the utility of these approaches to the analysis of groups of coexpressed genes from an individual microarray experiment, in the context of pathway information and for the combination of coexpression data with an integrated protein interaction network.

  10. Bacillus anthracis secretome time course under host-simulated conditions and identification of immunogenic proteins

    Directory of Open Access Journals (Sweden)

    Whittington Jessica

    2007-07-01

    Full Text Available Abstract Background The secretion time course of Bacillus anthracis strain RA3R (pXO1+/pXO2- during early, mid, and late log phase were investigated under conditions that simulate those encountered in the host. All of the identified proteins were analyzed by different software algorithms to characterize their predicted mode of secretion and cellular localization. In addition, immunogenic proteins were identified using sera from humans with cutaneous anthrax. Results A total of 275 extracellular proteins were identified by a combination of LC MS/MS and MALDI-TOF MS. All of the identified proteins were analyzed by SignalP, SecretomeP, PSORT, LipoP, TMHMM, and PROSITE to characterize their predicted mode of secretion, cellular localization, and protein domains. Fifty-three proteins were predicted by SignalP to harbor the cleavable N-terminal signal peptides and were therefore secreted via the classical Sec pathway. Twenty-three proteins were predicted by SecretomeP for secretion by the alternative Sec pathway characterized by the lack of typical export signal. In contrast to SignalP and SecretomeP predictions, PSORT predicted 171 extracellular proteins, 7 cell wall-associated proteins, and 6 cytoplasmic proteins. Moreover, 51 proteins were predicted by LipoP to contain putative Sec signal peptides (38 have SpI sites, lipoprotein signal peptides (13 have SpII sites, and N-terminal membrane helices (9 have transmembrane helices. The TMHMM algorithm predicted 25 membrane-associated proteins with one to ten transmembrane helices. Immunogenic proteins were also identified using sera from patients who have recovered from anthrax. The charge variants (83 and 63 kDa of protective antigen (PA were the most immunodominant secreted antigens, followed by charge variants of enolase and transketolase. Conclusion This is the first description of the time course of protein secretion for the pathogen Bacillus anthracis. Time course studies of protein secretion and

  11. On the applicability of schema integration techniques to database interoperation

    NARCIS (Netherlands)

    Vermeer, Mark W.W.; Apers, Peter M.G.

    1996-01-01

    We discuss the applicability of schema integration techniques developed for tightly-coupled database interoperation to interoperation of databases stemming from different modelling contexts. We illustrate that in such an environment, it is typically quite difficult to infer the real-world semantics

  12. Design of Integrated Database on Mobile Information System: A Study of Yogyakarta Smart City App

    Science.gov (United States)

    Nurnawati, E. K.; Ermawati, E.

    2018-02-01

    An integration database is a database which acts as the data store for multiple applications and thus integrates data across these applications (in contrast to an Application Database). An integration database needs a schema that takes all its client applications into account. The benefit of the schema that sharing data among applications does not require an extra layer of integration services on the applications. Any changes to data made in a single application are made available to all applications at the time of database commit - thus keeping the applications’ data use better synchronized. This study aims to design and build an integrated database that can be used by various applications in a mobile device based system platforms with the based on smart city system. The built-in database can be used by various applications, whether used together or separately. The design and development of the database are emphasized on the flexibility, security, and completeness of attributes that can be used together by various applications to be built. The method used in this study is to choice of the appropriate database logical structure (patterns of data) and to build the relational-database models (Design Databases). Test the resulting design with some prototype apps and analyze system performance with test data. The integrated database can be utilized both of the admin and the user in an integral and comprehensive platform. This system can help admin, manager, and operator in managing the application easily and efficiently. This Android-based app is built based on a dynamic clientserver where data is extracted from an external database MySQL. So if there is a change of data in the database, then the data on Android applications will also change. This Android app assists users in searching of Yogyakarta (as smart city) related information, especially in culture, government, hotels, and transportation.

  13. Secretome Analysis Identifies Potential Pathogenicity/Virulence Factors of Tilletia indica, a Quarantined Fungal Pathogen Inciting Karnal Bunt Disease in Wheat.

    Science.gov (United States)

    Pandey, Vishakha; Singh, Manoj; Pandey, Dinesh; Marla, Soma; Kumar, Anil

    2018-04-01

    Tilletia indica is a smut fungus that incites Karnal bunt in wheat. It has been considered as quarantine pest in more than 70 countries. Despite its quarantine significance, there is meager knowledge regarding the molecular mechanisms of disease pathogenesis. Moreover, various disease management strategies have proven futile. Development of effective disease management strategy requires identification of pathogenicity/virulence factors. With this aim, the present study was conducted to compare the secretomes of T. indica isolates, that is, highly (TiK) and low (TiP) virulent isolates. About 120 and 95 protein spots were detected reproducibly in TiK and TiP secretome gel images. Nineteen protein spots, which were consistently observed as upregulated/differential in the secretome of TiK isolate, were selected for their identification by MALDI-TOF/TOF. Identified proteins exhibited homology with fungal proteins playing important role in fungal adhesion, penetration, invasion, protection against host-derived reactive oxygen species, production of virulence factors, cellular signaling, and degradation of host cell wall proteins and antifungal proteins. These results were complemented with T. indica genome sequence leading to identification of candidate pathogenicity/virulence factors homologs that were further subjected to sequence- and structure-based functional annotation. Thus, present study reports the first comparative secretome analysis of T. indica for identification of pathogenicity/virulence factors. This would provide insights into pathogenic mechanisms of T. indica and aid in devising effective disease management strategies. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. The Penicillium echinulatum secretome on sugar cane bagasse.

    Directory of Open Access Journals (Sweden)

    Daniela A Ribeiro

    Full Text Available Plant feedstocks are at the leading front of the biofuel industry based on the potential to promote economical, social and environmental development worldwide through sustainable scenarios related to energy production. Penicillium echinulatum is a promising strain for the bioethanol industry based on its capacity to produce large amounts of cellulases at low cost. The secretome profile of P. echinulatum after grown on integral sugarcane bagasse, microcrystalline cellulose and three types of pretreated sugarcane bagasse was evaluated using shotgun proteomics. The comprehensive chemical characterization of the biomass used as the source of fungal nutrition, as well as biochemical activity assays using a collection of natural polysaccharides, were also performed. Our study revealed that the enzymatic repertoire of P. echinulatum is geared mainly toward producing enzymes from the cellulose complex (endogluganases, cellobiohydrolases and β-glucosidases. Glycoside hydrolase (GH family members, important to biomass-to-biofuels conversion strategies, were identified, including endoglucanases GH5, 7, 6, 12, 17 and 61, β-glycosidase GH3, xylanases GH10 and GH11, as well as debranching hemicellulases from GH43, GH62 and CE2 and pectinanes from GH28. Collectively, the approach conducted in this study gave new insights on the better comprehension of the composition and degradation capability of an industrial cellulolytic strain, from which a number of applied technologies, such as biofuel production, can be generated.

  15. The stem cell secretome and its role in brain repair.

    Science.gov (United States)

    Drago, Denise; Cossetti, Chiara; Iraci, Nunzio; Gaude, Edoardo; Musco, Giovanna; Bachi, Angela; Pluchino, Stefano

    2013-12-01

    Compelling evidence exists that non-haematopoietic stem cells, including mesenchymal (MSCs) and neural/progenitor stem cells (NPCs), exert a substantial beneficial and therapeutic effect after transplantation in experimental central nervous system (CNS) disease models through the secretion of immune modulatory or neurotrophic paracrine factors. This paracrine hypothesis has inspired an alternative outlook on the use of stem cells in regenerative neurology. In this paradigm, significant repair of the injured brain may be achieved by injecting the biologics secreted by stem cells (secretome), rather than implanting stem cells themselves for direct cell replacement. The stem cell secretome (SCS) includes cytokines, chemokines and growth factors, and has gained increasing attention in recent years because of its multiple implications for the repair, restoration or regeneration of injured tissues. Thanks to recent improvements in SCS profiling and manipulation, investigators are now inspired to harness the SCS as a novel alternative therapeutic option that might ensure more efficient outcomes than current stem cell-based therapies for CNS repair. This review discusses the most recent identification of MSC- and NPC-secreted factors, including those that are trafficked within extracellular membrane vesicles (EVs), and reflects on their potential effects on brain repair. It also examines some of the most convincing advances in molecular profiling that have enabled mapping of the SCS. Copyright © 2013 The Authors. Published by Elsevier Masson SAS.. All rights reserved.

  16. Construction of an integrated database to support genomic sequence analysis

    Energy Technology Data Exchange (ETDEWEB)

    Gilbert, W.; Overbeek, R.

    1994-11-01

    The central goal of this project is to develop an integrated database to support comparative analysis of genomes including DNA sequence data, protein sequence data, gene expression data and metabolism data. In developing the logic-based system GenoBase, a broader integration of available data was achieved due to assistance from collaborators. Current goals are to easily include new forms of data as they become available and to easily navigate through the ensemble of objects described within the database. This report comments on progress made in these areas.

  17. Fungal Secretome Analysis via PepSAVI-MS: Identification of the Bioactive Peptide KP4 from Ustilago maydis

    Science.gov (United States)

    Kirkpatrick, Christine L.; Parsley, Nicole C.; Bartges, Tessa E.; Cooke, Madeline E.; Evans, Wilaysha S.; Heil, Lilian R.; Smith, Thomas J.; Hicks, Leslie M.

    2018-05-01

    Fungal secondary metabolites represent a rich and largely untapped source for bioactive molecules, including peptides with substantial structural diversity and pharmacological potential. As methods proceed to take a deep dive into fungal genomes, complimentary methods to identify bioactive components are required to keep pace with the expanding fungal repertoire. We developed PepSAVI-MS to expedite the search for natural product bioactive peptides and herein demonstrate proof-of-principle applicability of the pipeline for the discovery of bioactive peptides from fungal secretomes via identification of the antifungal killer toxin KP4 from Ustilago maydis P4. This work opens the door to investigating microbial secretomes with a new lens, and could have broad applications across human health, agriculture, and food safety. [Figure not available: see fulltext.

  18. Comparative transcriptome and secretome analysis of wood decay fungi Postia placenta and Phanerochaete chrysosporium

    Science.gov (United States)

    Amber J. Vanden Wymelenberg; Jill Gaskell; Michael Mozuch; Grzegorz Sabat; John Ralph; Oleksandr Skyba; Shawn D Mansfield; Robert A. Blanchette; Diego Martinez; Igor Grigoriev; Philip J Kersten; Daniel Cullen

    2010-01-01

    Cellulose degradation by brown rot fungi, such as Postia placenta, is poorly understood relative to the phylogenetically related white rot basidiomycete, Phanerochaete chrysosporium. To elucidate the number, structure, and regulation of genes involved in lignocellulosic cell wall attack, secretome and transcriptome analyses were performed on both wood decay fungi...

  19. MitBASE : a comprehensive and integrated mitochondrial DNA database. The present status

    NARCIS (Netherlands)

    Attimonelli, M.; Altamura, N.; Benne, R.; Brennicke, A.; Cooper, J. M.; D'Elia, D.; Montalvo, A.; Pinto, B.; de Robertis, M.; Golik, P.; Knoop, V.; Lanave, C.; Lazowska, J.; Licciulli, F.; Malladi, B. S.; Memeo, F.; Monnerot, M.; Pasimeni, R.; Pilbout, S.; Schapira, A. H.; Sloof, P.; Saccone, C.

    2000-01-01

    MitBASE is an integrated and comprehensive database of mitochondrial DNA data which collects, under a single interface, databases for Plant, Vertebrate, Invertebrate, Human, Protist and Fungal mtDNA and a Pilot database on nuclear genes involved in mitochondrial biogenesis in Saccharomyces

  20. Building an integrated neurodegenerative disease database at an academic health center.

    Science.gov (United States)

    Xie, Sharon X; Baek, Young; Grossman, Murray; Arnold, Steven E; Karlawish, Jason; Siderowf, Andrew; Hurtig, Howard; Elman, Lauren; McCluskey, Leo; Van Deerlin, Vivianna; Lee, Virginia M-Y; Trojanowski, John Q

    2011-07-01

    It is becoming increasingly important to study common and distinct etiologies, clinical and pathological features, and mechanisms related to neurodegenerative diseases such as Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, and frontotemporal lobar degeneration. These comparative studies rely on powerful database tools to quickly generate data sets that match diverse and complementary criteria set by them. In this article, we present a novel integrated neurodegenerative disease (INDD) database, which was developed at the University of Pennsylvania (Penn) with the help of a consortium of Penn investigators. Because the work of these investigators are based on Alzheimer's disease, Parkinson's disease, amyotrophic lateral sclerosis, and frontotemporal lobar degeneration, it allowed us to achieve the goal of developing an INDD database for these major neurodegenerative disorders. We used the Microsoft SQL server as a platform, with built-in "backwards" functionality to provide Access as a frontend client to interface with the database. We used PHP Hypertext Preprocessor to create the "frontend" web interface and then used a master lookup table to integrate individual neurodegenerative disease databases. We also present methods of data entry, database security, database backups, and database audit trails for this INDD database. Using the INDD database, we compared the results of a biomarker study with those using an alternative approach by querying individual databases separately. We have demonstrated that the Penn INDD database has the ability to query multiple database tables from a single console with high accuracy and reliability. The INDD database provides a powerful tool for generating data sets in comparative studies on several neurodegenerative diseases. Copyright © 2011 The Alzheimer's Association. Published by Elsevier Inc. All rights reserved.

  1. Deletion of flbA results in increased secretome complexity and reduced secretion heterogeneity in colonies of Aspergillus niger.

    Science.gov (United States)

    Krijgsheld, Pauline; Nitsche, Benjamin M; Post, Harm; Levin, Ana M; Müller, Wally H; Heck, Albert J R; Ram, Arthur F J; Altelaar, A F Maarten; Wösten, Han A B

    2013-04-05

    Aspergillus niger is a cell factory for the production of enzymes. This fungus secretes proteins in the central part and at the periphery of the colony. The sporulating zone of the colony overlapped with the nonsecreting subperipheral zone, indicating that sporulation inhibits protein secretion. Indeed, strain ΔflbA that is affected early in the sporulation program secreted proteins throughout the colony. In contrast, the ΔbrlA strain that initiates but not completes sporulation did not show altered spatial secretion. The secretome of 5 concentric zones of xylose-grown ΔflbA colonies was assessed by quantitative proteomics. In total 138 proteins with a signal sequence for secretion were identified in the medium of ΔflbA colonies. Of these, 18 proteins had never been reported to be part of the secretome of A. niger, while 101 proteins had previously not been identified in the culture medium of xylose-grown wild type colonies. Taken together, inactivation of flbA results in spatial changes in secretion and in a more complex secretome. The latter may be explained by the fact that strain ΔflbA has a thinner cell wall compared to the wild type, enabling efficient release of proteins. These results are of interest to improve A. niger as a cell factory.

  2. Integration of Oracle and Hadoop: Hybrid Databases Affordable at Scale

    Science.gov (United States)

    Canali, L.; Baranowski, Z.; Kothuri, P.

    2017-10-01

    This work reports on the activities aimed at integrating Oracle and Hadoop technologies for the use cases of CERN database services and in particular on the development of solutions for offloading data and queries from Oracle databases into Hadoop-based systems. The goal and interest of this investigation is to increase the scalability and optimize the cost/performance footprint for some of our largest Oracle databases. These concepts have been applied, among others, to build offline copies of CERN accelerator controls and logging databases. The tested solution allows to run reports on the controls data offloaded in Hadoop without affecting the critical production database, providing both performance benefits and cost reduction for the underlying infrastructure. Other use cases discussed include building hybrid database solutions with Oracle and Hadoop, offering the combined advantages of a mature relational database system with a scalable analytics engine.

  3. Deep Time Data Infrastructure: Integrating Our Current Geologic and Biologic Databases

    Science.gov (United States)

    Kolankowski, S. M.; Fox, P. A.; Ma, X.; Prabhu, A.

    2016-12-01

    As our knowledge of Earth's geologic and mineralogical history grows, we require more efficient methods of sharing immense amounts of data. Databases across numerous disciplines have been utilized to offer extensive information on very specific Epochs of Earth's history up to its current state, i.e. Fossil record, rock composition, proteins, etc. These databases could be a powerful force in identifying previously unseen correlations such as relationships between minerals and proteins. Creating a unifying site that provides a portal to these databases will aid in our ability as a collaborative scientific community to utilize our findings more effectively. The Deep-Time Data Infrastructure (DTDI) is currently being defined as part of a larger effort to accomplish this goal. DTDI will not be a new database, but an integration of existing resources. Current geologic and related databases were identified, documentation of their schema was established and will be presented as a stage by stage progression. Through conceptual modeling focused around variables from their combined records, we will determine the best way to integrate these databases using common factors. The Deep-Time Data Infrastructure will allow geoscientists to bridge gaps in data and further our understanding of our Earth's history.

  4. INE: a rice genome database with an integrated map view.

    Science.gov (United States)

    Sakata, K; Antonio, B A; Mukai, Y; Nagasaki, H; Sakai, Y; Makino, K; Sasaki, T

    2000-01-01

    The Rice Genome Research Program (RGP) launched a large-scale rice genome sequencing in 1998 aimed at decoding all genetic information in rice. A new genome database called INE (INtegrated rice genome Explorer) has been developed in order to integrate all the genomic information that has been accumulated so far and to correlate these data with the genome sequence. A web interface based on Java applet provides a rapid viewing capability in the database. The first operational version of the database has been completed which includes a genetic map, a physical map using YAC (Yeast Artificial Chromosome) clones and PAC (P1-derived Artificial Chromosome) contigs. These maps are displayed graphically so that the positional relationships among the mapped markers on each chromosome can be easily resolved. INE incorporates the sequences and annotations of the PAC contig. A site on low quality information ensures that all submitted sequence data comply with the standard for accuracy. As a repository of rice genome sequence, INE will also serve as a common database of all sequence data obtained by collaborating members of the International Rice Genome Sequencing Project (IRGSP). The database can be accessed at http://www. dna.affrc.go.jp:82/giot/INE. html or its mirror site at http://www.staff.or.jp/giot/INE.html

  5. Methodologies and Perspectives of Proteomics Applied to Filamentous Fungi: From Sample Preparation to Secretome Analysis

    Science.gov (United States)

    Bianco, Linda; Perrotta, Gaetano

    2015-01-01

    Filamentous fungi possess the extraordinary ability to digest complex biomasses and mineralize numerous xenobiotics, as consequence of their aptitude to sensing the environment and regulating their intra and extra cellular proteins, producing drastic changes in proteome and secretome composition. Recent advancement in proteomic technologies offers an exciting opportunity to reveal the fluctuations of fungal proteins and enzymes, responsible for their metabolic adaptation to a large variety of environmental conditions. Here, an overview of the most commonly used proteomic strategies will be provided; this paper will range from sample preparation to gel-free and gel-based proteomics, discussing pros and cons of each mentioned state-of-the-art technique. The main focus will be kept on filamentous fungi. Due to the biotechnological relevance of lignocellulose degrading fungi, special attention will be finally given to their extracellular proteome, or secretome. Secreted proteins and enzymes will be discussed in relation to their involvement in bio-based processes, such as biomass deconstruction and mycoremediation. PMID:25775160

  6. Methodologies and perspectives of proteomics applied to filamentous fungi: from sample preparation to secretome analysis.

    Science.gov (United States)

    Bianco, Linda; Perrotta, Gaetano

    2015-03-12

    Filamentous fungi possess the extraordinary ability to digest complex biomasses and mineralize numerous xenobiotics, as consequence of their aptitude to sensing the environment and regulating their intra and extra cellular proteins, producing drastic changes in proteome and secretome composition. Recent advancement in proteomic technologies offers an exciting opportunity to reveal the fluctuations of fungal proteins and enzymes, responsible for their metabolic adaptation to a large variety of environmental conditions. Here, an overview of the most commonly used proteomic strategies will be provided; this paper will range from sample preparation to gel-free and gel-based proteomics, discussing pros and cons of each mentioned state-of-the-art technique. The main focus will be kept on filamentous fungi. Due to the biotechnological relevance of lignocellulose degrading fungi, special attention will be finally given to their extracellular proteome, or secretome. Secreted proteins and enzymes will be discussed in relation to their involvement in bio-based processes, such as biomass deconstruction and mycoremediation.

  7. An integrated web medicinal materials DNA database: MMDBD (Medicinal Materials DNA Barcode Database

    Directory of Open Access Journals (Sweden)

    But Paul

    2010-06-01

    Full Text Available Abstract Background Thousands of plants and animals possess pharmacological properties and there is an increased interest in using these materials for therapy and health maintenance. Efficacies of the application is critically dependent on the use of genuine materials. For time to time, life-threatening poisoning is found because toxic adulterant or substitute is administered. DNA barcoding provides a definitive means of authentication and for conducting molecular systematics studies. Owing to the reduced cost in DNA authentication, the volume of the DNA barcodes produced for medicinal materials is on the rise and necessitates the development of an integrated DNA database. Description We have developed an integrated DNA barcode multimedia information platform- Medicinal Materials DNA Barcode Database (MMDBD for data retrieval and similarity search. MMDBD contains over 1000 species of medicinal materials listed in the Chinese Pharmacopoeia and American Herbal Pharmacopoeia. MMDBD also contains useful information of the medicinal material, including resources, adulterant information, medical parts, photographs, primers used for obtaining the barcodes and key references. MMDBD can be accessed at http://www.cuhk.edu.hk/icm/mmdbd.htm. Conclusions This work provides a centralized medicinal materials DNA barcode database and bioinformatics tools for data storage, analysis and exchange for promoting the identification of medicinal materials. MMDBD has the largest collection of DNA barcodes of medicinal materials and is a useful resource for researchers in conservation, systematic study, forensic and herbal industry.

  8. Proteomic analysis of mouse astrocytes and their secretome by a combination of FASP and StageTip-based, high pH, reversed-phase fractionation.

    Science.gov (United States)

    Han, Dohyun; Jin, Jonghwa; Woo, Jongmin; Min, Hophil; Kim, Youngsoo

    2014-07-01

    Astrocytes are the most abundant cells in the CNS, but their function remains largely unknown. Characterization of the whole-cell proteome and secretome in astrocytes would facilitate the study of their functions in various neurodegenerative diseases and astrocyte-neuron communication. To build a reference proteome, we established a C8-D1A astrocyte proteome to a depth of 7265 unique protein groups using a novel strategy that combined two-step digestion, filter-aided sample preparation, StageTip-based high pH fractionation, and high-resolution MS. Nearly, 6000 unique protein groups were identified from conditioned media of astrocyte cultures, constituting the largest astrocyte secretome that has been reported. High-confidence whole-cell proteomes and secretomes are valuable resources in studying astrocyte function by label-free quantitation and bioinformatics analysis. All MS data have been deposited in the ProteomeXchange with identifier PXD000501 (http://proteomecentral.proteomexchange.org/dataset/PXD000501). © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Eavesdropping on altered cell-to-cell signaling in cancer by secretome profiling.

    Science.gov (United States)

    Klinke, David J

    2016-01-01

    In the past decade, cumulative clinical experiences with molecular targeted therapies and immunotherapies for cancer have promoted a shift in our conceptual understanding of cancer. This view shifted from viewing solid tumors as a homogeneous mass of malignant cells to viewing tumors as heterogeneous structures that are dynamically shaped by intercellular interactions among the variety of stromal, immune, and malignant cells present within the tumor microenvironment. As in any dynamic system, identifying how cells communicate to maintain homeostasis and how this communication is altered during oncogenesis are key hurdles for developing therapies to restore normal tissue homeostasis. Here, I discuss tissues as dynamic systems, using the mammary gland as an example, and the evolutionary concepts applied to oncogenesis. Drawing from these concepts, I present 2 competing hypotheses for how intercellular communication might be altered during oncogenesis. As an initial test of these competing hypotheses, a recent secretome comparison between normal human mammary and HER2+ breast cancer cell lines suggested that the particular proteins secreted by the malignant cells reflect a convergent evolutionary path associated with oncogenesis in a specific anatomical niche, despite arising in different individuals. Overall, this study illustrates the emerging power of secretome proteomics to probe, in an unbiased way, how intercellular communication changes during oncogenesis.

  10. A perspective for biomedical data integration: Design of databases for flow cytometry

    Directory of Open Access Journals (Sweden)

    Lakoumentas John

    2008-02-01

    Full Text Available Abstract Background The integration of biomedical information is essential for tackling medical problems. We describe a data model in the domain of flow cytometry (FC allowing for massive management, analysis and integration with other laboratory and clinical information. The paper is concerned with the proper translation of the Flow Cytometry Standard (FCS into a relational database schema, in a way that facilitates end users at either doing research on FC or studying specific cases of patients undergone FC analysis Results The proposed database schema provides integration of data originating from diverse acquisition settings, organized in a way that allows syntactically simple queries that provide results significantly faster than the conventional implementations of the FCS standard. The proposed schema can potentially achieve up to 8 orders of magnitude reduction in query complexity and up to 2 orders of magnitude reduction in response time for data originating from flow cytometers that record 256 colours. This is mainly achieved by managing to maintain an almost constant number of data-mining procedures regardless of the size and complexity of the stored information. Conclusion It is evident that using single-file data storage standards for the design of databases without any structural transformations significantly limits the flexibility of databases. Analysis of the requirements of a specific domain for integration and massive data processing can provide the necessary schema modifications that will unlock the additional functionality of a relational database.

  11. High-integrity databases for helicopter operations

    Science.gov (United States)

    Pschierer, Christian; Schiefele, Jens; Lüthy, Juerg

    2009-05-01

    Helicopter Emergency Medical Service missions (HEMS) impose a high workload on pilots due to short preparation time, operations in low level flight, and landings in unknown areas. The research project PILAS, a cooperation between Eurocopter, Diehl Avionics, DLR, EADS, Euro Telematik, ESG, Jeppesen, the Universities of Darmstadt and Munich, and funded by the German government, approached this problem by researching a pilot assistance system which supports the pilots during all phases of flight. The databases required for the specified helicopter missions include different types of topological and cultural data for graphical display on the SVS system, AMDB data for operations at airports and helipads, and navigation data for IFR segments. The most critical databases for the PILAS system however are highly accurate terrain and obstacle data. While RTCA DO-276 specifies high accuracies and integrities only for the areas around airports, HEMS helicopters typically operate outside of these controlled areas and thus require highly reliable terrain and obstacle data for their designated response areas. This data has been generated by a LIDAR scan of the specified test region. Obstacles have been extracted into a vector format. This paper includes a short overview of the complete PILAS system and then focus on the generation of the required high quality databases.

  12. CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics.

    Directory of Open Access Journals (Sweden)

    Mohit Verma

    Full Text Available Chickpea is an important grain legume used as a rich source of protein in human diet. The narrow genetic diversity and limited availability of genomic resources are the major constraints in implementing breeding strategies and biotechnological interventions for genetic enhancement of chickpea. We developed an integrated Chickpea Transcriptome Database (CTDB, which provides the comprehensive web interface for visualization and easy retrieval of transcriptome data in chickpea. The database features many tools for similarity search, functional annotation (putative function, PFAM domain and gene ontology search and comparative gene expression analysis. The current release of CTDB (v2.0 hosts transcriptome datasets with high quality functional annotation from cultivated (desi and kabuli types and wild chickpea. A catalog of transcription factor families and their expression profiles in chickpea are available in the database. The gene expression data have been integrated to study the expression profiles of chickpea transcripts in major tissues/organs and various stages of flower development. The utilities, such as similarity search, ortholog identification and comparative gene expression have also been implemented in the database to facilitate comparative genomic studies among different legumes and Arabidopsis. Furthermore, the CTDB represents a resource for the discovery of functional molecular markers (microsatellites and single nucleotide polymorphisms between different chickpea types. We anticipate that integrated information content of this database will accelerate the functional and applied genomic research for improvement of chickpea. The CTDB web service is freely available at http://nipgr.res.in/ctdb.html.

  13. KAIKObase: An integrated silkworm genome database and data mining tool

    Directory of Open Access Journals (Sweden)

    Nagaraju Javaregowda

    2009-10-01

    Full Text Available Abstract Background The silkworm, Bombyx mori, is one of the most economically important insects in many developing countries owing to its large-scale cultivation for silk production. With the development of genomic and biotechnological tools, B. mori has also become an important bioreactor for production of various recombinant proteins of biomedical interest. In 2004, two genome sequencing projects for B. mori were reported independently by Chinese and Japanese teams; however, the datasets were insufficient for building long genomic scaffolds which are essential for unambiguous annotation of the genome. Now, both the datasets have been merged and assembled through a joint collaboration between the two groups. Description Integration of the two data sets of silkworm whole-genome-shotgun sequencing by the Japanese and Chinese groups together with newly obtained fosmid- and BAC-end sequences produced the best continuity (~3.7 Mb in N50 scaffold size among the sequenced insect genomes and provided a high degree of nucleotide coverage (88% of all 28 chromosomes. In addition, a physical map of BAC contigs constructed by fingerprinting BAC clones and a SNP linkage map constructed using BAC-end sequences were available. In parallel, proteomic data from two-dimensional polyacrylamide gel electrophoresis in various tissues and developmental stages were compiled into a silkworm proteome database. Finally, a Bombyx trap database was constructed for documenting insertion positions and expression data of transposon insertion lines. Conclusion For efficient usage of genome information for functional studies, genomic sequences, physical and genetic map information and EST data were compiled into KAIKObase, an integrated silkworm genome database which consists of 4 map viewers, a gene viewer, and sequence, keyword and position search systems to display results and data at the level of nucleotide sequence, gene, scaffold and chromosome. Integration of the

  14. Integrated olfactory receptor and microarray gene expression databases

    Directory of Open Access Journals (Sweden)

    Crasto Chiquito J

    2007-06-01

    Full Text Available Abstract Background Gene expression patterns of olfactory receptors (ORs are an important component of the signal encoding mechanism in the olfactory system since they determine the interactions between odorant ligands and sensory neurons. We have developed the Olfactory Receptor Microarray Database (ORMD to house OR gene expression data. ORMD is integrated with the Olfactory Receptor Database (ORDB, which is a key repository of OR gene information. Both databases aim to aid experimental research related to olfaction. Description ORMD is a Web-accessible database that provides a secure data repository for OR microarray experiments. It contains both publicly available and private data; accessing the latter requires authenticated login. The ORMD is designed to allow users to not only deposit gene expression data but also manage their projects/experiments. For example, contributors can choose whether to make their datasets public. For each experiment, users can download the raw data files and view and export the gene expression data. For each OR gene being probed in a microarray experiment, a hyperlink to that gene in ORDB provides access to genomic and proteomic information related to the corresponding olfactory receptor. Individual ORs archived in ORDB are also linked to ORMD, allowing users access to the related microarray gene expression data. Conclusion ORMD serves as a data repository and project management system. It facilitates the study of microarray experiments of gene expression in the olfactory system. In conjunction with ORDB, ORMD integrates gene expression data with the genomic and functional data of ORs, and is thus a useful resource for both olfactory researchers and the public.

  15. Database modeling to integrate macrobenthos data in Spatial Data Infrastructure

    Directory of Open Access Journals (Sweden)

    José Alberto Quintanilha

    2012-08-01

    Full Text Available Coastal zones are complex areas that include marine and terrestrial environments. Besides its huge environmental wealth, they also attracts humans because provides food, recreation, business, and transportation, among others. Some difficulties to manage these areas are related with their complexity, diversity of interests and the absence of standardization to collect and share data to scientific community, public agencies, among others. The idea to organize, standardize and share this information based on Web Atlas is essential to support planning and decision making issues. The construction of a spatial database integrating the environmental business, to be used on Spatial Data Infrastructure (SDI is illustrated by a bioindicator that indicates the quality of the sediments. The models show the phases required to build Macrobenthos spatial database based on Santos Metropolitan Region as a reference. It is concluded that, when working with environmental data the structuring of knowledge in a conceptual model is essential for their subsequent integration into the SDI. During the modeling process it can be noticed that methodological issues related to the collection process may obstruct or prejudice the integration of data from different studies of the same area. The development of a database model, as presented in this study, can be used as a reference for further research with similar goals.

  16. Quantitative iTRAQ-based secretome analysis reveals species-specific and temporal shifts in carbon utilization strategies among manganese(II)-oxidizing Ascomycete fungi

    Energy Technology Data Exchange (ETDEWEB)

    Zeiner, Carolyn A.; Purvine, Samuel O.; Zink, Erika M.; Paša-Tolić, Ljiljana; Chaput, Dominique L.; Wu, Si; Santelli, Cara M.; Hansel, Colleen M.

    2017-09-01

    Fungi generate a wide range of extracellular hydrolytic and oxidative enzymes and reactive metabolites, collectively known as the secretome, that synergistically drive plant litter decomposition in the environment. While secretome studies of model organisms have greatly expanded our knowledge of these enzymes, few have extended secretome characterization to environmental isolates or directly compared temporal patterns of enzyme utilization among diverse species. Thus, the mechanisms of carbon (C) degradation by many ubiquitous soil fungi remain poorly understood. Here we use a combination of iTRAQ proteomics and custom bioinformatic analyses to compare the protein composition of the secretomes of four manganese(II)-oxidizing Ascomycete fungi over a three-week time course. We demonstrate that although the fungi produce a similar suite of extracellular enzymes, they exhibit striking differences in the regulation of these enzymes among species and over time, revealing species-specific and temporal shifts in C utilization strategies as they degrade the same substrate. Specifically, our findings suggest that Paraconiothyrium sporulosum AP3s5-JAC2a and Alternaria alternata SRC1lrK2f employ sequential enzyme secretion patterns concomitant with decreasing resource availability, Stagonospora sp. SRC1lsM3a preferentially degrades proteinaceous substrate before switching to carbohydrates, and Pyrenochaeta sp. DS3sAY3a utilizes primarily peptidases to aggressively attack carbon sources in a concentrated burst. This work highlights the diversity of operative metabolic strategies among cellulose-degrading Ascomycetes and enhances our understanding of their role in C turnover in the environment.

  17. An inventory of the Aspergillus niger secretome by combining in silico predictions with shotgun proteomics data

    NARCIS (Netherlands)

    Braaksma, M.; Martens-Uzunova, E.S.; Punt, P.J.; Schaap, P.J.

    2010-01-01

    Background: The ecological niche occupied by a fungal species, its pathogenicity and its usefulness as a microbial cell factory to a large degree depends on its secretome. Protein secretion usually requires the presence of a N-terminal signal peptide (SP) and by scanning for this feature using

  18. An inventory of the Aspergillus niger secretome by combining in silico predictions with shotgun proteomics data

    NARCIS (Netherlands)

    Braaksma, M.; Martens-Uzunova, E.S.; Punt, P.J.; Schaap, P.J.

    2010-01-01

    BACKGROUND: The ecological niche occupied by a fungal species, its pathogenicity and its usefulness as a microbial cell factory to a large degree depends on its secretome. Protein secretion usually requires the presence of a N-terminal signal peptide (SP) and by scanning for this feature using

  19. Integrated database for rapid mass movements in Norway

    Directory of Open Access Journals (Sweden)

    C. Jaedicke

    2009-03-01

    Full Text Available Rapid gravitational slope mass movements include all kinds of short term relocation of geological material, snow or ice. Traditionally, information about such events is collected separately in different databases covering selected geographical regions and types of movement. In Norway the terrain is susceptible to all types of rapid gravitational slope mass movements ranging from single rocks hitting roads and houses to large snow avalanches and rock slides where entire mountainsides collapse into fjords creating flood waves and endangering large areas. In addition, quick clay slides occur in desalinated marine sediments in South Eastern and Mid Norway. For the authorities and inhabitants of endangered areas, the type of threat is of minor importance and mitigation measures have to consider several types of rapid mass movements simultaneously.

    An integrated national database for all types of rapid mass movements built around individual events has been established. Only three data entries are mandatory: time, location and type of movement. The remaining optional parameters enable recording of detailed information about the terrain, materials involved and damages caused. Pictures, movies and other documentation can be uploaded into the database. A web-based graphical user interface has been developed allowing new events to be entered, as well as editing and querying for all events. An integration of the database into a GIS system is currently under development.

    Datasets from various national sources like the road authorities and the Geological Survey of Norway were imported into the database. Today, the database contains 33 000 rapid mass movement events from the last five hundred years covering the entire country. A first analysis of the data shows that the most frequent type of recorded rapid mass movement is rock slides and snow avalanches followed by debris slides in third place. Most events are recorded in the steep fjord

  20. A Support Database System for Integrated System Health Management (ISHM)

    Science.gov (United States)

    Schmalzel, John; Figueroa, Jorge F.; Turowski, Mark; Morris, John

    2007-01-01

    The development, deployment, operation and maintenance of Integrated Systems Health Management (ISHM) applications require the storage and processing of tremendous amounts of low-level data. This data must be shared in a secure and cost-effective manner between developers, and processed within several heterogeneous architectures. Modern database technology allows this data to be organized efficiently, while ensuring the integrity and security of the data. The extensibility and interoperability of the current database technologies also allows for the creation of an associated support database system. A support database system provides additional capabilities by building applications on top of the database structure. These applications can then be used to support the various technologies in an ISHM architecture. This presentation and paper propose a detailed structure and application description for a support database system, called the Health Assessment Database System (HADS). The HADS provides a shared context for organizing and distributing data as well as a definition of the applications that provide the required data-driven support to ISHM. This approach provides another powerful tool for ISHM developers, while also enabling novel functionality. This functionality includes: automated firmware updating and deployment, algorithm development assistance and electronic datasheet generation. The architecture for the HADS has been developed as part of the ISHM toolset at Stennis Space Center for rocket engine testing. A detailed implementation has begun for the Methane Thruster Testbed Project (MTTP) in order to assist in developing health assessment and anomaly detection algorithms for ISHM. The structure of this implementation is shown in Figure 1. The database structure consists of three primary components: the system hierarchy model, the historical data archive and the firmware codebase. The system hierarchy model replicates the physical relationships between

  1. ChlamyCyc: an integrative systems biology database and web-portal for Chlamydomonas reinhardtii.

    Science.gov (United States)

    May, Patrick; Christian, Jan-Ole; Kempa, Stefan; Walther, Dirk

    2009-05-04

    The unicellular green alga Chlamydomonas reinhardtii is an important eukaryotic model organism for the study of photosynthesis and plant growth. In the era of modern high-throughput technologies there is an imperative need to integrate large-scale data sets from high-throughput experimental techniques using computational methods and database resources to provide comprehensive information about the molecular and cellular organization of a single organism. In the framework of the German Systems Biology initiative GoFORSYS, a pathway database and web-portal for Chlamydomonas (ChlamyCyc) was established, which currently features about 250 metabolic pathways with associated genes, enzymes, and compound information. ChlamyCyc was assembled using an integrative approach combining the recently published genome sequence, bioinformatics methods, and experimental data from metabolomics and proteomics experiments. We analyzed and integrated a combination of primary and secondary database resources, such as existing genome annotations from JGI, EST collections, orthology information, and MapMan classification. ChlamyCyc provides a curated and integrated systems biology repository that will enable and assist in systematic studies of fundamental cellular processes in Chlamydomonas. The ChlamyCyc database and web-portal is freely available under http://chlamycyc.mpimp-golm.mpg.de.

  2. Distortion-Free Watermarking Approach for Relational Database Integrity Checking

    Directory of Open Access Journals (Sweden)

    Lancine Camara

    2014-01-01

    Full Text Available Nowadays, internet is becoming a suitable way of accessing the databases. Such data are exposed to various types of attack with the aim to confuse the ownership proofing or the content protection. In this paper, we propose a new approach based on fragile zero watermarking for the authentication of numeric relational data. Contrary to some previous databases watermarking techniques which cause some distortions in the original database and may not preserve the data usability constraints, our approach simply seeks to generate the watermark from the original database. First, the adopted method partitions the database relation into independent square matrix groups. Then, group-based watermarks are securely generated and registered in a trusted third party. The integrity verification is performed by computing the determinant and the diagonal’s minor for each group. As a result, tampering can be localized up to attribute group level. Theoretical and experimental results demonstrate that the proposed technique is resilient against tuples insertion, tuples deletion, and attributes values modification attacks. Furthermore, comparison with recent related effort shows that our scheme performs better in detecting multifaceted attacks.

  3. Integrated Strategic Tracking and Recruiting Database (iSTAR) Data Inventory

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Integrated Strategic Tracking and Recruiting Database (iSTAR) Data Inventory contains measured and modeled partnership and contact data. It is comprised of basic...

  4. An inventory of the Aspergillus niger secretome by combining in silico predictions with shotgun proteomics data

    Directory of Open Access Journals (Sweden)

    Martens-Uzunova Elena S

    2010-10-01

    Full Text Available Abstract Background The ecological niche occupied by a fungal species, its pathogenicity and its usefulness as a microbial cell factory to a large degree depends on its secretome. Protein secretion usually requires the presence of a N-terminal signal peptide (SP and by scanning for this feature using available highly accurate SP-prediction tools, the fraction of potentially secreted proteins can be directly predicted. However, prediction of a SP does not guarantee that the protein is actually secreted and current in silico prediction methods suffer from gene-model errors introduced during genome annotation. Results A majority rule based classifier that also evaluates signal peptide predictions from the best homologs of three neighbouring Aspergillus species was developed to create an improved list of potential signal peptide containing proteins encoded by the Aspergillus niger genome. As a complement to these in silico predictions, the secretome associated with growth and upon carbon source depletion was determined using a shotgun proteomics approach. Overall, some 200 proteins with a predicted signal peptide were identified to be secreted proteins. Concordant changes in the secretome state were observed as a response to changes in growth/culture conditions. Additionally, two proteins secreted via a non-classical route operating in A. niger were identified. Conclusions We were able to improve the in silico inventory of A. niger secretory proteins by combining different gene-model predictions from neighbouring Aspergilli and thereby avoiding prediction conflicts associated with inaccurate gene-models. The expected accuracy of signal peptide prediction for proteins that lack homologous sequences in the proteomes of related species is 85%. An experimental validation of the predicted proteome confirmed in silico predictions.

  5. An inventory of the Aspergillus niger secretome by combining in silico predictions with shotgun proteomics data.

    Science.gov (United States)

    Braaksma, Machtelt; Martens-Uzunova, Elena S; Punt, Peter J; Schaap, Peter J

    2010-10-19

    The ecological niche occupied by a fungal species, its pathogenicity and its usefulness as a microbial cell factory to a large degree depends on its secretome. Protein secretion usually requires the presence of a N-terminal signal peptide (SP) and by scanning for this feature using available highly accurate SP-prediction tools, the fraction of potentially secreted proteins can be directly predicted. However, prediction of a SP does not guarantee that the protein is actually secreted and current in silico prediction methods suffer from gene-model errors introduced during genome annotation. A majority rule based classifier that also evaluates signal peptide predictions from the best homologs of three neighbouring Aspergillus species was developed to create an improved list of potential signal peptide containing proteins encoded by the Aspergillus niger genome. As a complement to these in silico predictions, the secretome associated with growth and upon carbon source depletion was determined using a shotgun proteomics approach. Overall, some 200 proteins with a predicted signal peptide were identified to be secreted proteins. Concordant changes in the secretome state were observed as a response to changes in growth/culture conditions. Additionally, two proteins secreted via a non-classical route operating in A. niger were identified. We were able to improve the in silico inventory of A. niger secretory proteins by combining different gene-model predictions from neighbouring Aspergilli and thereby avoiding prediction conflicts associated with inaccurate gene-models. The expected accuracy of signal peptide prediction for proteins that lack homologous sequences in the proteomes of related species is 85%. An experimental validation of the predicted proteome confirmed in silico predictions.

  6. Host- and stage-dependent secretome of the arbuscular mycorrhizal fungus Rhizophagus irregularis.

    Science.gov (United States)

    Zeng, Tian; Holmer, Rens; Hontelez, Jan; Te Lintel-Hekkert, Bas; Marufu, Lucky; de Zeeuw, Thijs; Wu, Fangyuan; Schijlen, Elio; Bisseling, Ton; Limpens, Erik

    2018-05-01

    Arbuscular mycorrhizal fungi form the most wide-spread endosymbiosis with plants. There is very little host specificity in this interaction, however host preferences as well as varying symbiotic efficiencies have been observed. We hypothesize that secreted proteins (SPs) may act as fungal effectors to control symbiotic efficiency in a host-dependent manner. Therefore, we studied whether arbuscular mycorrhizal (AM) fungi adjust their secretome in a host- and stage-dependent manner to contribute to their extremely wide host range. We investigated the expression of SP-encoding genes of Rhizophagus irregularis in three evolutionary distantly related plant species, Medicago truncatula, Nicotiana benthamiana and Allium schoenoprasum. In addition we used laser microdissection in combination with RNA-seq to study SP expression at different stages of the interaction in Medicago. Our data indicate that most expressed SPs show roughly equal expression levels in the interaction with all three host plants. In addition, a subset shows significant differential expression depending on the host plant. Furthermore, SP expression is controlled locally in the hyphal network in response to host-dependent cues. Overall, this study presents a comprehensive analysis of the R. irregularis secretome, which now offers a solid basis to direct functional studies on the role of fungal SPs in AM symbiosis. © 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.

  7. Quantitative iTRAQ secretome analysis of Aspergillus niger reveals novel hydrolytic enzymes.

    Science.gov (United States)

    Adav, Sunil S; Li, An A; Manavalan, Arulmani; Punt, Peter; Sze, Siu Kwan

    2010-08-06

    The natural lifestyle of Aspergillus niger made them more effective secretors of hydrolytic proteins and becomes critical when this species were exploited as hosts for the commercial secretion of heterologous proteins. The protein secretion profile of A. niger and its mutant at different pH was explored using iTRAQ-based quantitative proteomics approach coupled with liquid chromatography-tandem mass spectrometry (LC-MS/MS). This study characterized 102 highly confident unique proteins in the secretome with zero false discovery rate based on decoy strategy. The iTRAQ technique identified and relatively quantified many hydrolyzing enzymes such as cellulases, hemicellulases, glycoside hydrolases, proteases, peroxidases, and protein translocating transporter proteins during fermentation. The enzymes have potential application in lignocellulosic biomass hydrolysis for biofuel production, for example, the cellulolytic and hemicellulolytic enzymes glucan 1,4-alpha-glucosidase, alpha-glucosidase C, endoglucanase, alpha l-arabinofuranosidase, beta-mannosidase, glycosyl hydrolase; proteases such as tripeptidyl-peptidase, aspergillopepsin, and other enzymes including cytochrome c oxidase, cytochrome c oxidase, glucose oxidase were highly expressed in A. niger and its mutant secretion. In addition, specific enzyme production can be stimulated by controlling pH of the culture medium. Our results showed comprehensive unique secretory protein profile of A. niger, its regulation at different pH, and the potential application of iTRAQ-based quantitative proteomics for the microbial secretome analysis.

  8. An Integrated Enterprise Accelerator Database for the SLC Control System

    International Nuclear Information System (INIS)

    2002-01-01

    Since its inception in the early 1980's, the SLC Control System has been driven by a highly structured memory-resident real-time database. While efficient, its rigid structure and file-based sources makes it difficult to maintain and extract relevant information. The goal of transforming the sources for this database into a relational form is to enable it to be part of a Control System Enterprise Database that is an integrated central repository for SLC accelerator device and Control System data with links to other associated databases. We have taken the concepts developed for the NLC Enterprise Database and used them to create and load a relational model of the online SLC Control System database. This database contains data and structure to allow querying and reporting on beamline devices, their associations and parameters. In the future this will be extended to allow generation of EPICS and SLC database files, setup of applications and links to other databases such as accelerator maintenance, archive data, financial and personnel records, cabling information, documentation etc. The database is implemented using Oracle 8i. In the short term it will be updated daily in batch from the online SLC database. In the longer term, it will serve as the primary source for Control System static data, an R and D platform for the NLC, and contribute to SLC Control System operations

  9. GDR (Genome Database for Rosaceae: integrated web resources for Rosaceae genomics and genetics research

    Directory of Open Access Journals (Sweden)

    Ficklin Stephen

    2004-09-01

    Full Text Available Abstract Background Peach is being developed as a model organism for Rosaceae, an economically important family that includes fruits and ornamental plants such as apple, pear, strawberry, cherry, almond and rose. The genomics and genetics data of peach can play a significant role in the gene discovery and the genetic understanding of related species. The effective utilization of these peach resources, however, requires the development of an integrated and centralized database with associated analysis tools. Description The Genome Database for Rosaceae (GDR is a curated and integrated web-based relational database. GDR contains comprehensive data of the genetically anchored peach physical map, an annotated peach EST database, Rosaceae maps and markers and all publicly available Rosaceae sequences. Annotations of ESTs include contig assembly, putative function, simple sequence repeats, and anchored position to the peach physical map where applicable. Our integrated map viewer provides graphical interface to the genetic, transcriptome and physical mapping information. ESTs, BACs and markers can be queried by various categories and the search result sites are linked to the integrated map viewer or to the WebFPC physical map sites. In addition to browsing and querying the database, users can compare their sequences with the annotated GDR sequences via a dedicated sequence similarity server running either the BLAST or FASTA algorithm. To demonstrate the utility of the integrated and fully annotated database and analysis tools, we describe a case study where we anchored Rosaceae sequences to the peach physical and genetic map by sequence similarity. Conclusions The GDR has been initiated to meet the major deficiency in Rosaceae genomics and genetics research, namely a centralized web database and bioinformatics tools for data storage, analysis and exchange. GDR can be accessed at http://www.genome.clemson.edu/gdr/.

  10. GDR (Genome Database for Rosaceae): integrated web resources for Rosaceae genomics and genetics research.

    Science.gov (United States)

    Jung, Sook; Jesudurai, Christopher; Staton, Margaret; Du, Zhidian; Ficklin, Stephen; Cho, Ilhyung; Abbott, Albert; Tomkins, Jeffrey; Main, Dorrie

    2004-09-09

    Peach is being developed as a model organism for Rosaceae, an economically important family that includes fruits and ornamental plants such as apple, pear, strawberry, cherry, almond and rose. The genomics and genetics data of peach can play a significant role in the gene discovery and the genetic understanding of related species. The effective utilization of these peach resources, however, requires the development of an integrated and centralized database with associated analysis tools. The Genome Database for Rosaceae (GDR) is a curated and integrated web-based relational database. GDR contains comprehensive data of the genetically anchored peach physical map, an annotated peach EST database, Rosaceae maps and markers and all publicly available Rosaceae sequences. Annotations of ESTs include contig assembly, putative function, simple sequence repeats, and anchored position to the peach physical map where applicable. Our integrated map viewer provides graphical interface to the genetic, transcriptome and physical mapping information. ESTs, BACs and markers can be queried by various categories and the search result sites are linked to the integrated map viewer or to the WebFPC physical map sites. In addition to browsing and querying the database, users can compare their sequences with the annotated GDR sequences via a dedicated sequence similarity server running either the BLAST or FASTA algorithm. To demonstrate the utility of the integrated and fully annotated database and analysis tools, we describe a case study where we anchored Rosaceae sequences to the peach physical and genetic map by sequence similarity. The GDR has been initiated to meet the major deficiency in Rosaceae genomics and genetics research, namely a centralized web database and bioinformatics tools for data storage, analysis and exchange. GDR can be accessed at http://www.genome.clemson.edu/gdr/.

  11. Spatially resolving the secretome within the mycelium of the cell factory Aspergillus niger.

    Science.gov (United States)

    Krijgsheld, Pauline; Altelaar, A F Maarten; Post, Harm; Ringrose, Jeffrey H; Müller, Wally H; Heck, Albert J R; Wösten, Han A B

    2012-05-04

    Aspergillus niger is an important cell factory for the industrial production of enzymes. These enzymes are released into the culture medium, from which they can be easily isolated. Here, we determined with stable isotope dimethyl labeling the secretome of five concentric zones of 7-day-old xylose-grown colonies of A. niger that had either or not been treated with cycloheximide. As expected, cycloheximide blocked secretion of proteins at the periphery of the colony. Unexpectedly, protein release was increased by cycloheximide in the intermediate and central zones of the mycelium when compared to nontreated colonies. Electron microscopy indicated that this is due to partial degradation of the cell wall. In total, 124 proteins were identified in cycloheximide-treated colonies, of which 19 secreted proteins had not been identified before. Within the pool of 124 proteins, 53 secreted proteins were absent in nontreated colonies, and additionally, 35 proteins were released ≥4-fold in the central and subperipheral zones of cycloheximide-treated colonies when compared to nontreated colonies. The composition of the secretome in each of the five concentric zones differed. This study thus describes spatial release of proteins in A. niger, which is instrumental in understanding how fungi degrade complex substrates in nature.

  12. Database of episode-integrated solar energetic proton fluences

    Science.gov (United States)

    Robinson, Zachary D.; Adams, James H.; Xapsos, Michael A.; Stauffer, Craig A.

    2018-04-01

    A new database of proton episode-integrated fluences is described. This database contains data from two different instruments on multiple satellites. The data are from instruments on the Interplanetary Monitoring Platform-8 (IMP8) and the Geostationary Operational Environmental Satellites (GOES) series. A method to normalize one set of data to one another is presented to create a seamless database spanning 1973 to 2016. A discussion of some of the characteristics that episodes exhibit is presented, including episode duration and number of peaks. As an example of what can be understood about episodes, the July 4, 2012 episode is examined in detail. The coronal mass ejections and solar flares that caused many of the fluctuations of the proton flux seen at Earth are associated with peaks in the proton flux during this episode. The reasoning for each choice is laid out to provide a reference for how CME and solar flares associations are made.

  13. Database of episode-integrated solar energetic proton fluences

    Directory of Open Access Journals (Sweden)

    Robinson Zachary D.

    2018-01-01

    Full Text Available A new database of proton episode-integrated fluences is described. This database contains data from two different instruments on multiple satellites. The data are from instruments on the Interplanetary Monitoring Platform-8 (IMP8 and the Geostationary Operational Environmental Satellites (GOES series. A method to normalize one set of data to one another is presented to create a seamless database spanning 1973 to 2016. A discussion of some of the characteristics that episodes exhibit is presented, including episode duration and number of peaks. As an example of what can be understood about episodes, the July 4, 2012 episode is examined in detail. The coronal mass ejections and solar flares that caused many of the fluctuations of the proton flux seen at Earth are associated with peaks in the proton flux during this episode. The reasoning for each choice is laid out to provide a reference for how CME and solar flares associations are made.

  14. LmSmdB: an integrated database for metabolic and gene regulatory network in Leishmania major and Schistosoma mansoni

    Directory of Open Access Journals (Sweden)

    Priyanka Patel

    2016-03-01

    Full Text Available A database that integrates all the information required for biological processing is essential to be stored in one platform. We have attempted to create one such integrated database that can be a one stop shop for the essential features required to fetch valuable result. LmSmdB (L. major and S. mansoni database is an integrated database that accounts for the biological networks and regulatory pathways computationally determined by integrating the knowledge of the genome sequences of the mentioned organisms. It is the first database of its kind that has together with the network designing showed the simulation pattern of the product. This database intends to create a comprehensive canopy for the regulation of lipid metabolism reaction in the parasite by integrating the transcription factors, regulatory genes and the protein products controlled by the transcription factors and hence operating the metabolism at genetic level. Keywords: L.major, S.mansoni, Regulatory networks, Transcription factors, Database

  15. N-glycoproteome analysis of the secretome of human metastatic hepatocellular carcinoma cell lines combining hydrazide chemistry, HILIC enrichment and mass spectrometry.

    Directory of Open Access Journals (Sweden)

    Xianyu Li

    Full Text Available Cancer cell metastasis is a major cause of cancer death. Unfortunately, the underlying molecular mechanisms remain unknown, which results in the lack of efficient diagnosis, therapy and prevention approaches. Nevertheless, the dysregulation of the cancer cell secretome is known to play key roles in tumor transformation and progression. The majority of proteins in the secretome are secretory proteins and membrane-released proteins, and, mostly, the glycosylated proteins. Until recently, few studies have explored protein N-glycosylation changes in the secretome, although protein glycosylation has received increasing attention in the study of tumor development processes. Here, the N-glycoproteins in the secretome of two human hepatocellular carcinoma (HCC cell lines with low (MHCC97L or high (HCCLM3 metastatic potential were investigated with a in-depth characterization of the N-glycosites by combining two general glycopeptide enrichment approaches, hydrazide chemistry and zwitterionic hydrophilic interaction chromatography (zic-HILIC, with mass spectrometry analysis. A total of 1,213 unique N-glycosites from 611 N-glycoproteins were confidently identified. These N-glycoproteins were primarily localized to the extracellular space and plasma membrane, supporting the important role of N-glycosylation in the secretory pathway. Coupling label-free quantification with a hierarchical clustering strategy, we determined the differential regulation of several N-glycoproteins that are related to metastasis, among which AFP, DKK1, FN1, CD151 and TGFβ2 were up-regulated in HCCLM3 cells. The inclusion of the well-known metastasis-related proteins AFP and DKK1 in this list provides solid supports for our study. Further western blotting experiments detecting FN1 and FAT1 confirmed our discovery. The glycoproteome strategy in this study provides an effective means to explore potential cancer biomarkers.

  16. Secretome profiling of oral squamous cell carcinoma-associated fibroblasts reveals organization and disassembly of extracellular matrix and collagen metabolic process signatures.

    Science.gov (United States)

    Bagordakis, Elizabete; Sawazaki-Calone, Iris; Macedo, Carolina Carneiro Soares; Carnielli, Carolina M; de Oliveira, Carine Ervolino; Rodrigues, Priscila Campioni; Rangel, Ana Lucia C A; Dos Santos, Jean Nunes; Risteli, Juha; Graner, Edgard; Salo, Tuula; Paes Leme, Adriana Franco; Coletta, Ricardo D

    2016-07-01

    An important role has been attributed to cancer-associated fibroblasts (CAFs) in the tumorigenesis of oral squamous cell carcinoma (OSCC), the most common tumor of the oral cavity. Previous studies demonstrated that CAF-secreted molecules promote the proliferation and invasion of OSCC cells, inducing a more aggressive phenotype. In this study, we searched for differences in the secretome of CAFs and normal oral fibroblasts (NOF) using mass spectrometry-based proteomics and biological network analysis. Comparison of the secretome profiles revealed that upregulated proteins involved mainly in extracellular matrix organization and disassembly and collagen metabolism. Among the upregulated proteins were fibronectin type III domain-containing 1 (FNDC1), serpin peptidase inhibitor type 1 (SERPINE1), and stanniocalcin 2 (STC2), the upregulation of which was validated by quantitative PCR and ELISA in an independent set of CAF cell lines. The transition of transforming growth factor beta 1 (TGF-β1)-mediating NOFs into CAFs was accompanied by significant upregulation of FNDC1, SERPINE1, and STC2, confirming the participation of these proteins in the CAF-derived secretome. Type I collagen, the main constituent of the connective tissue, was also associated with several upregulated biological processes. The immunoexpression of type I collagen N-terminal propeptide (PINP) was significantly correlated in vivo with CAFs in the tumor front and was associated with significantly shortened survival of OSCC patients. Presence of CAFs in the tumor stroma was also an independent prognostic factor for OSCC disease-free survival. These results demonstrate the value of secretome profiling for evaluating the role of CAFs in the tumor microenvironment and identify potential novel therapeutic targets such as FNDC1, SERPINE1, and STC2. Furthermore, type I collagen expression by CAFs, represented by PINP levels, may be a prognostic marker of OSCC outcome.

  17. ChlamyCyc: an integrative systems biology database and web-portal for Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Kempa Stefan

    2009-05-01

    Full Text Available Abstract Background The unicellular green alga Chlamydomonas reinhardtii is an important eukaryotic model organism for the study of photosynthesis and plant growth. In the era of modern high-throughput technologies there is an imperative need to integrate large-scale data sets from high-throughput experimental techniques using computational methods and database resources to provide comprehensive information about the molecular and cellular organization of a single organism. Results In the framework of the German Systems Biology initiative GoFORSYS, a pathway database and web-portal for Chlamydomonas (ChlamyCyc was established, which currently features about 250 metabolic pathways with associated genes, enzymes, and compound information. ChlamyCyc was assembled using an integrative approach combining the recently published genome sequence, bioinformatics methods, and experimental data from metabolomics and proteomics experiments. We analyzed and integrated a combination of primary and secondary database resources, such as existing genome annotations from JGI, EST collections, orthology information, and MapMan classification. Conclusion ChlamyCyc provides a curated and integrated systems biology repository that will enable and assist in systematic studies of fundamental cellular processes in Chlamydomonas. The ChlamyCyc database and web-portal is freely available under http://chlamycyc.mpimp-golm.mpg.de.

  18. Human protein secretory pathway genes are expressed in a tissue-specific pattern to match processing demands of the secretome

    DEFF Research Database (Denmark)

    Feizi, Amir; Gatto, Francesco; Uhlén, Mathias

    2017-01-01

    Protein secretory pathway in eukaryal cells is responsible for delivering functional secretory proteins. The dysfunction of this pathway causes a range of important human diseases from congenital disorders to cancer. Despite the piled-up knowledge on the molecular biology and biochemistry level...... in specific gene families of the secretory pathway. We also inspected the potential functional link between detected extreme genes and the corresponding tissues enriched secretome. As a result, the detected extreme genes showed correlation with the enrichment of the nature and number of specific post......-translational modifications in each tissue's secretome. Our findings conciliate both the housekeeping and tissue-specific nature of the protein secretory pathway, which we attribute to a fine-tuned regulation of defined gene families to support the diversity of secreted proteins and their modifications....

  19. SIRSALE: integrated video database management tools

    Science.gov (United States)

    Brunie, Lionel; Favory, Loic; Gelas, J. P.; Lefevre, Laurent; Mostefaoui, Ahmed; Nait-Abdesselam, F.

    2002-07-01

    Video databases became an active field of research during the last decade. The main objective in such systems is to provide users with capabilities to friendly search, access and playback distributed stored video data in the same way as they do for traditional distributed databases. Hence, such systems need to deal with hard issues : (a) video documents generate huge volumes of data and are time sensitive (streams must be delivered at a specific bitrate), (b) contents of video data are very hard to be automatically extracted and need to be humanly annotated. To cope with these issues, many approaches have been proposed in the literature including data models, query languages, video indexing etc. In this paper, we present SIRSALE : a set of video databases management tools that allow users to manipulate video documents and streams stored in large distributed repositories. All the proposed tools are based on generic models that can be customized for specific applications using ad-hoc adaptation modules. More precisely, SIRSALE allows users to : (a) browse video documents by structures (sequences, scenes, shots) and (b) query the video database content by using a graphical tool, adapted to the nature of the target video documents. This paper also presents an annotating interface which allows archivists to describe the content of video documents. All these tools are coupled to a video player integrating remote VCR functionalities and are based on active network technology. So, we present how dedicated active services allow an optimized video transport for video streams (with Tamanoir active nodes). We then describe experiments of using SIRSALE on an archive of news video and soccer matches. The system has been demonstrated to professionals with a positive feedback. Finally, we discuss open issues and present some perspectives.

  20. Distributed Access View Integrated Database (DAVID) system

    Science.gov (United States)

    Jacobs, Barry E.

    1991-01-01

    The Distributed Access View Integrated Database (DAVID) System, which was adopted by the Astrophysics Division for their Astrophysics Data System, is a solution to the system heterogeneity problem. The heterogeneous components of the Astrophysics problem is outlined. The Library and Library Consortium levels of the DAVID approach are described. The 'books' and 'kits' level is discussed. The Universal Object Typer Management System level is described. The relation of the DAVID project with the Small Business Innovative Research (SBIR) program is explained.

  1. Integrating the DLD dosimetry system into the Almaraz NPP Corporative Database

    International Nuclear Information System (INIS)

    Gonzalez Crego, E.; Martin Lopez-Suevos, C.

    1996-01-01

    The article discusses the experience acquired during the integration of a new MGP Instruments DLD Dosimetry System into the Almaraz NPP corporative database and general communications network, following a client-server philosophy and taking into account the computer standards of the Plant. The most important results obtained are: Integration of DLD dosimetry information into corporative databases, permitting the use of new applications Sharing of existing personnel information with the DLD dosimetry application, thereby avoiding the redundant work of introducing data and improving the quality of the information. Facilitation of maintenance, both software and hardware, of the DLD system. Maximum explotation, from the computer point of view, of the initial investment. Adaptation of the application to the applicable legislation. (Author)

  2. Integration of curated databases to identify genotype-phenotype associations

    Directory of Open Access Journals (Sweden)

    Li Jianrong

    2006-10-01

    Full Text Available Abstract Background The ability to rapidly characterize an unknown microorganism is critical in both responding to infectious disease and biodefense. To do this, we need some way of anticipating an organism's phenotype based on the molecules encoded by its genome. However, the link between molecular composition (i.e. genotype and phenotype for microbes is not obvious. While there have been several studies that address this challenge, none have yet proposed a large-scale method integrating curated biological information. Here we utilize a systematic approach to discover genotype-phenotype associations that combines phenotypic information from a biomedical informatics database, GIDEON, with the molecular information contained in National Center for Biotechnology Information's Clusters of Orthologous Groups database (NCBI COGs. Results Integrating the information in the two databases, we are able to correlate the presence or absence of a given protein in a microbe with its phenotype as measured by certain morphological characteristics or survival in a particular growth media. With a 0.8 correlation score threshold, 66% of the associations found were confirmed by the literature and at a 0.9 correlation threshold, 86% were positively verified. Conclusion Our results suggest possible phenotypic manifestations for proteins biochemically associated with sugar metabolism and electron transport. Moreover, we believe our approach can be extended to linking pathogenic phenotypes with functionally related proteins.

  3. Secretome of obligate intracellular Rickettsia

    Science.gov (United States)

    Gillespie, Joseph J.; Kaur, Simran J.; Rahman, M. Sayeedur; Rennoll-Bankert, Kristen; Sears, Khandra T.; Beier-Sexton, Magda; Azad, Abdu F.

    2014-01-01

    The genus Rickettsia (Alphaproteobacteria, Rickettsiales, Rickettsiaceae) is comprised of obligate intracellular parasites, with virulent species of interest both as causes of emerging infectious diseases and for their potential deployment as bioterrorism agents. Currently, there are no effective commercially available vaccines, with treatment limited primarily to tetracycline antibiotics, although others (e.g. josamycin, ciprofloxacin, chloramphenicol, and azithromycin) are also effective. Much of the recent research geared toward understanding mechanisms underlying rickettsial pathogenicity has centered on characterization of secreted proteins that directly engage eukaryotic cells. Herein, we review all aspects of the Rickettsia secretome, including six secretion systems, 19 characterized secretory proteins, and potential moonlighting proteins identified on surfaces of multiple Rickettsia species. Employing bioinformatics and phylogenomics, we present novel structural and functional insight on each secretion system. Unexpectedly, our investigation revealed that the majority of characterized secretory proteins have not been assigned to their cognate secretion pathways. Furthermore, for most secretion pathways, the requisite signal sequences mediating translocation are poorly understood. As a blueprint for all known routes of protein translocation into host cells, this resource will assist research aimed at uniting characterized secreted proteins with their apposite secretion pathways. Furthermore, our work will help in the identification of novel secreted proteins involved in rickettsial ‘life on the inside’. PMID:25168200

  4. Integrated Controlling System and Unified Database for High Throughput Protein Crystallography Experiments

    International Nuclear Information System (INIS)

    Gaponov, Yu.A.; Igarashi, N.; Hiraki, M.; Sasajima, K.; Matsugaki, N.; Suzuki, M.; Kosuge, T.; Wakatsuki, S.

    2004-01-01

    An integrated controlling system and a unified database for high throughput protein crystallography experiments have been developed. Main features of protein crystallography experiments (purification, crystallization, crystal harvesting, data collection, data processing) were integrated into the software under development. All information necessary to perform protein crystallography experiments is stored (except raw X-ray data that are stored in a central data server) in a MySQL relational database. The database contains four mutually linked hierarchical trees describing protein crystals, data collection of protein crystal and experimental data processing. A database editor was designed and developed. The editor supports basic database functions to view, create, modify and delete user records in the database. Two search engines were realized: direct search of necessary information in the database and object oriented search. The system is based on TCP/IP secure UNIX sockets with four predefined sending and receiving behaviors, which support communications between all connected servers and clients with remote control functions (creating and modifying data for experimental conditions, data acquisition, viewing experimental data, and performing data processing). Two secure login schemes were designed and developed: a direct method (using the developed Linux clients with secure connection) and an indirect method (using the secure SSL connection using secure X11 support from any operating system with X-terminal and SSH support). A part of the system has been implemented on a new MAD beam line, NW12, at the Photon Factory Advanced Ring for general user experiments

  5. CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics

    OpenAIRE

    Verma, Mohit; Kumar, Vinay; Patel, Ravi K.; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Chickpea is an important grain legume used as a rich source of protein in human diet. The narrow genetic diversity and limited availability of genomic resources are the major constraints in implementing breeding strategies and biotechnological interventions for genetic enhancement of chickpea. We developed an integrated Chickpea Transcriptome Database (CTDB), which provides the comprehensive web interface for visualization and easy retrieval of transcriptome data in chickpea. The database fea...

  6. Integration of the ATLAS tag database with data management and analysis components

    Energy Technology Data Exchange (ETDEWEB)

    Cranshaw, J; Malon, D [Argonne National Laboratory, Argonne, IL 60439 (United States); Doyle, A T; Kenyon, M J; McGlone, H; Nicholson, C [Department of Physics and Astronomy, University of Glasgow, Glasgow, G12 8QQ, Scotland (United Kingdom)], E-mail: c.nicholson@physics.gla.ac.uk

    2008-07-15

    The ATLAS Tag Database is an event-level metadata system, designed to allow efficient identification and selection of interesting events for user analysis. By making first-level cuts using queries on a relational database, the size of an analysis input sample could be greatly reduced and thus the time taken for the analysis reduced. Deployment of such a Tag database is underway, but to be most useful it needs to be integrated with the distributed data management (DDM) and distributed analysis (DA) components. This means addressing the issue that the DDM system at ATLAS groups files into datasets for scalability and usability, whereas the Tag Database points to events in files. It also means setting up a system which could prepare a list of input events and use both the DDM and DA systems to run a set of jobs. The ATLAS Tag Navigator Tool (TNT) has been developed to address these issues in an integrated way and provide a tool that the average physicist can use. Here, the current status of this work is presented and areas of future work are highlighted.

  7. Integrating the Allen Brain Institute Cell Types Database into Automated Neuroscience Workflow.

    Science.gov (United States)

    Stockton, David B; Santamaria, Fidel

    2017-10-01

    We developed software tools to download, extract features, and organize the Cell Types Database from the Allen Brain Institute (ABI) in order to integrate its whole cell patch clamp characterization data into the automated modeling/data analysis cycle. To expand the potential user base we employed both Python and MATLAB. The basic set of tools downloads selected raw data and extracts cell, sweep, and spike features, using ABI's feature extraction code. To facilitate data manipulation we added a tool to build a local specialized database of raw data plus extracted features. Finally, to maximize automation, we extended our NeuroManager workflow automation suite to include these tools plus a separate investigation database. The extended suite allows the user to integrate ABI experimental and modeling data into an automated workflow deployed on heterogeneous computer infrastructures, from local servers, to high performance computing environments, to the cloud. Since our approach is focused on workflow procedures our tools can be modified to interact with the increasing number of neuroscience databases being developed to cover all scales and properties of the nervous system.

  8. Integration of the ATLAS tag database with data management and analysis components

    International Nuclear Information System (INIS)

    Cranshaw, J; Malon, D; Doyle, A T; Kenyon, M J; McGlone, H; Nicholson, C

    2008-01-01

    The ATLAS Tag Database is an event-level metadata system, designed to allow efficient identification and selection of interesting events for user analysis. By making first-level cuts using queries on a relational database, the size of an analysis input sample could be greatly reduced and thus the time taken for the analysis reduced. Deployment of such a Tag database is underway, but to be most useful it needs to be integrated with the distributed data management (DDM) and distributed analysis (DA) components. This means addressing the issue that the DDM system at ATLAS groups files into datasets for scalability and usability, whereas the Tag Database points to events in files. It also means setting up a system which could prepare a list of input events and use both the DDM and DA systems to run a set of jobs. The ATLAS Tag Navigator Tool (TNT) has been developed to address these issues in an integrated way and provide a tool that the average physicist can use. Here, the current status of this work is presented and areas of future work are highlighted

  9. Techniques to Access Databases and Integrate Data for Hydrologic Modeling

    Energy Technology Data Exchange (ETDEWEB)

    Whelan, Gene; Tenney, Nathan D.; Pelton, Mitchell A.; Coleman, Andre M.; Ward, Duane L.; Droppo, James G.; Meyer, Philip D.; Dorow, Kevin E.; Taira, Randal Y.

    2009-06-17

    This document addresses techniques to access and integrate data for defining site-specific conditions and behaviors associated with ground-water and surface-water radionuclide transport applicable to U.S. Nuclear Regulatory Commission reviews. Environmental models typically require input data from multiple internal and external sources that may include, but are not limited to, stream and rainfall gage data, meteorological data, hydrogeological data, habitat data, and biological data. These data may be retrieved from a variety of organizations (e.g., federal, state, and regional) and source types (e.g., HTTP, FTP, and databases). Available data sources relevant to hydrologic analyses for reactor licensing are identified and reviewed. The data sources described can be useful to define model inputs and parameters, including site features (e.g., watershed boundaries, stream locations, reservoirs, site topography), site properties (e.g., surface conditions, subsurface hydraulic properties, water quality), and site boundary conditions, input forcings, and extreme events (e.g., stream discharge, lake levels, precipitation, recharge, flood and drought characteristics). Available software tools for accessing established databases, retrieving the data, and integrating it with models were identified and reviewed. The emphasis in this review was on existing software products with minimal required modifications to enable their use with the FRAMES modeling framework. The ability of four of these tools to access and retrieve the identified data sources was reviewed. These four software tools were the Hydrologic Data Acquisition and Processing System (HDAPS), Integrated Water Resources Modeling System (IWRMS) External Data Harvester, Data for Environmental Modeling Environmental Data Download Tool (D4EM EDDT), and the FRAMES Internet Database Tools. The IWRMS External Data Harvester and the D4EM EDDT were identified as the most promising tools based on their ability to access and

  10. Techniques to Access Databases and Integrate Data for Hydrologic Modeling

    International Nuclear Information System (INIS)

    Whelan, Gene; Tenney, Nathan D.; Pelton, Mitchell A.; Coleman, Andre M.; Ward, Duane L.; Droppo, James G.; Meyer, Philip D.; Dorow, Kevin E.; Taira, Randal Y.

    2009-01-01

    This document addresses techniques to access and integrate data for defining site-specific conditions and behaviors associated with ground-water and surface-water radionuclide transport applicable to U.S. Nuclear Regulatory Commission reviews. Environmental models typically require input data from multiple internal and external sources that may include, but are not limited to, stream and rainfall gage data, meteorological data, hydrogeological data, habitat data, and biological data. These data may be retrieved from a variety of organizations (e.g., federal, state, and regional) and source types (e.g., HTTP, FTP, and databases). Available data sources relevant to hydrologic analyses for reactor licensing are identified and reviewed. The data sources described can be useful to define model inputs and parameters, including site features (e.g., watershed boundaries, stream locations, reservoirs, site topography), site properties (e.g., surface conditions, subsurface hydraulic properties, water quality), and site boundary conditions, input forcings, and extreme events (e.g., stream discharge, lake levels, precipitation, recharge, flood and drought characteristics). Available software tools for accessing established databases, retrieving the data, and integrating it with models were identified and reviewed. The emphasis in this review was on existing software products with minimal required modifications to enable their use with the FRAMES modeling framework. The ability of four of these tools to access and retrieve the identified data sources was reviewed. These four software tools were the Hydrologic Data Acquisition and Processing System (HDAPS), Integrated Water Resources Modeling System (IWRMS) External Data Harvester, Data for Environmental Modeling Environmental Data Download Tool (D4EM EDDT), and the FRAMES Internet Database Tools. The IWRMS External Data Harvester and the D4EM EDDT were identified as the most promising tools based on their ability to access and

  11. GDR (Genome Database for Rosaceae): integrated web-database for Rosaceae genomics and genetics data.

    Science.gov (United States)

    Jung, Sook; Staton, Margaret; Lee, Taein; Blenda, Anna; Svancara, Randall; Abbott, Albert; Main, Dorrie

    2008-01-01

    The Genome Database for Rosaceae (GDR) is a central repository of curated and integrated genetics and genomics data of Rosaceae, an economically important family which includes apple, cherry, peach, pear, raspberry, rose and strawberry. GDR contains annotated databases of all publicly available Rosaceae ESTs, the genetically anchored peach physical map, Rosaceae genetic maps and comprehensively annotated markers and traits. The ESTs are assembled to produce unigene sets of each genus and the entire Rosaceae. Other annotations include putative function, microsatellites, open reading frames, single nucleotide polymorphisms, gene ontology terms and anchored map position where applicable. Most of the published Rosaceae genetic maps can be viewed and compared through CMap, the comparative map viewer. The peach physical map can be viewed using WebFPC/WebChrom, and also through our integrated GDR map viewer, which serves as a portal to the combined genetic, transcriptome and physical mapping information. ESTs, BACs, markers and traits can be queried by various categories and the search result sites are linked to the mapping visualization tools. GDR also provides online analysis tools such as a batch BLAST/FASTA server for the GDR datasets, a sequence assembly server and microsatellite and primer detection tools. GDR is available at http://www.rosaceae.org.

  12. PGSB/MIPS PlantsDB Database Framework for the Integration and Analysis of Plant Genome Data.

    Science.gov (United States)

    Spannagl, Manuel; Nussbaumer, Thomas; Bader, Kai; Gundlach, Heidrun; Mayer, Klaus F X

    2017-01-01

    Plant Genome and Systems Biology (PGSB), formerly Munich Institute for Protein Sequences (MIPS) PlantsDB, is a database framework for the integration and analysis of plant genome data, developed and maintained for more than a decade now. Major components of that framework are genome databases and analysis resources focusing on individual (reference) genomes providing flexible and intuitive access to data. Another main focus is the integration of genomes from both model and crop plants to form a scaffold for comparative genomics, assisted by specialized tools such as the CrowsNest viewer to explore conserved gene order (synteny). Data exchange and integrated search functionality with/over many plant genome databases is provided within the transPLANT project.

  13. Quality controls in integrative approaches to detect errors and inconsistencies in biological databases

    Directory of Open Access Journals (Sweden)

    Ghisalberti Giorgio

    2010-12-01

    Full Text Available Numerous biomolecular data are available, but they are scattered in many databases and only some of them are curated by experts. Most available data are computationally derived and include errors and inconsistencies. Effective use of available data in order to derive new knowledge hence requires data integration and quality improvement. Many approaches for data integration have been proposed. Data warehousing seams to be the most adequate when comprehensive analysis of integrated data is required. This makes it the most suitable also to implement comprehensive quality controls on integrated data. We previously developed GFINDer (http://www.bioinformatics.polimi.it/GFINDer/, a web system that supports scientists in effectively using available information. It allows comprehensive statistical analysis and mining of functional and phenotypic annotations of gene lists, such as those identified by high-throughput biomolecular experiments. GFINDer backend is composed of a multi-organism genomic and proteomic data warehouse (GPDW. Within the GPDW, several controlled terminologies and ontologies, which describe gene and gene product related biomolecular processes, functions and phenotypes, are imported and integrated, together with their associations with genes and proteins of several organisms. In order to ease maintaining updated the GPDW and to ensure the best possible quality of data integrated in subsequent updating of the data warehouse, we developed several automatic procedures. Within them, we implemented numerous data quality control techniques to test the integrated data for a variety of possible errors and inconsistencies. Among other features, the implemented controls check data structure and completeness, ontological data consistency, ID format and evolution, unexpected data quantification values, and consistency of data from single and multiple sources. We use the implemented controls to analyze the quality of data available from several

  14. Integration of Breast Cancer Secretomes with Clinical Data Elucidates Potential Serum Markers for Disease Detection, Diagnosis, and Prognosis.

    Science.gov (United States)

    Ziegler, Yvonne S; Moresco, James J; Yates, John R; Nardulli, Ann M

    2016-01-01

    Cancer cells secrete factors that influence adjacent cell behavior and can lead to enhanced proliferation and metastasis. To better understand the role of these factors in oncogenesis and disease progression, estrogen and progesterone receptor positive MCF-7 cells, triple negative breast cancer MDA-MB-231, DT22, and DT28 cells, and MCF-10A non-transformed mammary epithelial cells were grown in 3D cultures. A special emphasis was placed on triple negative breast cancer since these tumors are highly aggressive and no targeted treatments are currently available. The breast cancer cells secreted factors of variable potency that stimulated proliferation of the relatively quiescent MCF-10A cells. The conditioned medium from each cell line was subjected to mass spectrometry analysis and a variety of secreted proteins were identified including glycolytic enzymes, proteases, protease inhibitors, extracellular matrix proteins, and insulin-like growth factor binding proteins. An investigation of the secretome from each cell line yielded clues about strategies used for breast cancer proliferation and metastasis. Some of the proteins we identified may be useful in the development of a serum-based test for breast cancer detection, diagnosis, prognosis, and monitoring.

  15. CyanOmics: an integrated database of omics for the model cyanobacterium Synechococcus sp. PCC 7002.

    Science.gov (United States)

    Yang, Yaohua; Feng, Jie; Li, Tao; Ge, Feng; Zhao, Jindong

    2015-01-01

    Cyanobacteria are an important group of organisms that carry out oxygenic photosynthesis and play vital roles in both the carbon and nitrogen cycles of the Earth. The annotated genome of Synechococcus sp. PCC 7002, as an ideal model cyanobacterium, is available. A series of transcriptomic and proteomic studies of Synechococcus sp. PCC 7002 cells grown under different conditions have been reported. However, no database of such integrated omics studies has been constructed. Here we present CyanOmics, a database based on the results of Synechococcus sp. PCC 7002 omics studies. CyanOmics comprises one genomic dataset, 29 transcriptomic datasets and one proteomic dataset and should prove useful for systematic and comprehensive analysis of all those data. Powerful browsing and searching tools are integrated to help users directly access information of interest with enhanced visualization of the analytical results. Furthermore, Blast is included for sequence-based similarity searching and Cluster 3.0, as well as the R hclust function is provided for cluster analyses, to increase CyanOmics's usefulness. To the best of our knowledge, it is the first integrated omics analysis database for cyanobacteria. This database should further understanding of the transcriptional patterns, and proteomic profiling of Synechococcus sp. PCC 7002 and other cyanobacteria. Additionally, the entire database framework is applicable to any sequenced prokaryotic genome and could be applied to other integrated omics analysis projects. Database URL: http://lag.ihb.ac.cn/cyanomics. © The Author(s) 2015. Published by Oxford University Press.

  16. Toward an interactive article: integrating journals and biological databases

    Directory of Open Access Journals (Sweden)

    Marygold Steven J

    2011-05-01

    Full Text Available Abstract Background Journal articles and databases are two major modes of communication in the biological sciences, and thus integrating these critical resources is of urgent importance to increase the pace of discovery. Projects focused on bridging the gap between journals and databases have been on the rise over the last five years and have resulted in the development of automated tools that can recognize entities within a document and link those entities to a relevant database. Unfortunately, automated tools cannot resolve ambiguities that arise from one term being used to signify entities that are quite distinct from one another. Instead, resolving these ambiguities requires some manual oversight. Finding the right balance between the speed and portability of automation and the accuracy and flexibility of manual effort is a crucial goal to making text markup a successful venture. Results We have established a journal article mark-up pipeline that links GENETICS journal articles and the model organism database (MOD WormBase. This pipeline uses a lexicon built with entities from the database as a first step. The entity markup pipeline results in links from over nine classes of objects including genes, proteins, alleles, phenotypes and anatomical terms. New entities and ambiguities are discovered and resolved by a database curator through a manual quality control (QC step, along with help from authors via a web form that is provided to them by the journal. New entities discovered through this pipeline are immediately sent to an appropriate curator at the database. Ambiguous entities that do not automatically resolve to one link are resolved by hand ensuring an accurate link. This pipeline has been extended to other databases, namely Saccharomyces Genome Database (SGD and FlyBase, and has been implemented in marking up a paper with links to multiple databases. Conclusions Our semi-automated pipeline hyperlinks articles published in GENETICS to

  17. Dynamically Integrating OSM Data into a Borderland Database

    Directory of Open Access Journals (Sweden)

    Xiaoguang Zhou

    2015-09-01

    Full Text Available Spatial data are fundamental for borderland analyses of geography, natural resources, demography, politics, economy, and culture. As the spatial data used in borderland research usually cover the borderland regions of several neighboring countries, it is difficult for anyone research institution of government to collect them. Volunteered Geographic Information (VGI is a highly successful method for acquiring timely and detailed global spatial data at a very low cost. Therefore, VGI is a reasonable source of borderland spatial data. OpenStreetMap (OSM is known as the most successful VGI resource. However, OSM's data model is far different from the traditional geographic information model. Thus, the OSM data must be converted in the scientist’s customized data model. Because the real world changes rapidly, the converted data must be updated incrementally. Therefore, this paper presents a method used to dynamically integrate OSM data into the borderland database. In this method, a basic transformation rule base is formed by comparing the OSM Map Feature description document and the destination model definitions. Using the basic rules, the main features can be automatically converted to the destination model. A human-computer interaction model transformation and a rule/automatic-remember mechanism are developed to interactively transfer the unusual features that cannot be transferred by the basic rules to the target model and to remember the reusable rules automatically. To keep the borderland database current, the global OsmChange daily diff file is used to extract the change-only information for the research region. To extract the changed objects in the region under study, the relationship between the changed object and the research region is analyzed considering the evolution of the involved objects. In addition, five rules are determined to select the objects and integrate the changed objects with multi-versions over time. The objects

  18. Integrity Checking and Maintenance with Active Rules in XML Databases

    DEFF Research Database (Denmark)

    Christiansen, Henning; Rekouts, Maria

    2007-01-01

    While specification languages for integrity constraints for XML data have been considered in the literature, actual technologies and methodologies for checking and maintaining integrity are still in their infancy. Triggers, or active rules, which are widely used in previous technologies for the p...... updates, the method indicates trigger conditions and correctness criteria to be met by the trigger code supplied by a developer or possibly automatic methods. We show examples developed in the Sedna XML database system which provides a running implementation of XML triggers....

  19. Database Description - Arabidopsis Phenome Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Arabidopsis Phenome Database Database Description General information of database Database n... BioResource Center Hiroshi Masuya Database classification Plant databases - Arabidopsis thaliana Organism T...axonomy Name: Arabidopsis thaliana Taxonomy ID: 3702 Database description The Arabidopsis thaliana phenome i...heir effective application. We developed the new Arabidopsis Phenome Database integrating two novel database...seful materials for their experimental research. The other, the “Database of Curated Plant Phenome” focusing

  20. Dynamics of the Skeletal Muscle Secretome during Myoblast Differentiation

    DEFF Research Database (Denmark)

    Henningsen, Jeanette; Rigbolt, Kristoffer T G; Blagoev, Blagoy

    2010-01-01

    During recent years, increased efforts have focused on elucidating the secretory function of skeletal muscle. Through secreted molecules, skeletal muscle affects local muscle biology in an auto/paracrine manner as well as having systemic effects on other tissues. Here we used a quantitative...... proteomics platform to investigate the factors secreted during the differentiation of murine C2C12 skeletal muscle cells. Using triple encoding stable isotope labeling by amino acids in cell culture, we compared the secretomes at three different time points of muscle differentiation and followed the dynamics...... of the skeletal muscle as a prominent secretory organ. In addition to previously reported molecules, we identified many secreted proteins that have not previously been shown to be released from skeletal muscle cells nor shown to be differentially released during the process of myogenesis. We found 188...

  1. An object-oriented language-database integration model: The composition filters approach

    NARCIS (Netherlands)

    Aksit, Mehmet; Bergmans, Lodewijk; Vural, Sinan; Vural, S.

    1991-01-01

    This paper introduces a new model, based on so-called object-composition filters, that uniformly integrates database-like features into an object-oriented language. The focus is on providing persistent dynamic data structures, data sharing, transactions, multiple views and associative access,

  2. An Object-Oriented Language-Database Integration Model: The Composition-Filters Approach

    NARCIS (Netherlands)

    Aksit, Mehmet; Bergmans, Lodewijk; Vural, S.; Vural, Sinan; Lehrmann Madsen, O.

    1992-01-01

    This paper introduces a new model, based on so-called object-composition filters, that uniformly integrates database-like features into an object-oriented language. The focus is on providing persistent dynamic data structures, data sharing, transactions, multiple views and associative access,

  3. Integration of first-principles methods and crystallographic database searches for new ferroelectrics: Strategies and explorations

    International Nuclear Information System (INIS)

    Bennett, Joseph W.; Rabe, Karin M.

    2012-01-01

    In this concept paper, the development of strategies for the integration of first-principles methods with crystallographic database mining for the discovery and design of novel ferroelectric materials is discussed, drawing on the results and experience derived from exploratory investigations on three different systems: (1) the double perovskite Sr(Sb 1/2 Mn 1/2 )O 3 as a candidate semiconducting ferroelectric; (2) polar derivatives of schafarzikite MSb 2 O 4 ; and (3) ferroelectric semiconductors with formula M 2 P 2 (S,Se) 6 . A variety of avenues for further research and investigation are suggested, including automated structure type classification, low-symmetry improper ferroelectrics, and high-throughput first-principles searches for additional representatives of structural families with desirable functional properties. - Graphical abstract: Integration of first-principles methods with crystallographic database mining, for the discovery and design of novel ferroelectric materials, could potentially lead to new classes of multifunctional materials. Highlights: ► Integration of first-principles methods and database mining. ► Minor structural families with desirable functional properties. ► Survey of polar entries in the Inorganic Crystal Structural Database.

  4. Data Integration for Spatio-Temporal Patterns of Gene Expression of Zebrafish development: the GEMS database

    Directory of Open Access Journals (Sweden)

    Belmamoune Mounia

    2008-06-01

    Full Text Available The Gene Expression Management System (GEMS is a database system for patterns of gene expression. These patterns result from systematic whole-mount fluorescent in situ hybridization studies on zebrafish embryos. GEMS is an integrative platform that addresses one of the important challenges of developmental biology: how to integrate genetic data that underpin morphological changes during embryogenesis. Our motivation to build this system was by the need to be able to organize and compare multiple patterns of gene expression at tissue level. Integration with other developmental and biomolecular databases will further support our understanding of development. The GEMS operates in concert with a database containing a digital atlas of zebrafish embryo; this digital atlas of zebrafish development has been conceived prior to the expansion of the GEMS. The atlas contains 3D volume models of canonical stages of zebrafish development in which in each volume model element is annotated with an anatomical term. These terms are extracted from a formal anatomical ontology, i.e. the Developmental Anatomy Ontology of Zebrafish (DAOZ. In the GEMS, anatomical terms from this ontology together with terms from the Gene Ontology (GO are also used to annotate patterns of gene expression and in this manner providing mechanisms for integration and retrieval . The annotations are the glue for integration of patterns of gene expression in GEMS as well as in other biomolecular databases. At the one hand, zebrafish anatomy terminology allows gene expression data within GEMS to be integrated with phenotypical data in the 3D atlas of zebrafish development. At the other hand, GO terms extend GEMS expression patterns integration to a wide range of bioinformatics resources.

  5. Integrated Space Asset Management Database and Modeling

    Science.gov (United States)

    MacLeod, Todd; Gagliano, Larry; Percy, Thomas; Mason, Shane

    2015-01-01

    Effective Space Asset Management is one key to addressing the ever-growing issue of space congestion. It is imperative that agencies around the world have access to data regarding the numerous active assets and pieces of space junk currently tracked in orbit around the Earth. At the center of this issues is the effective management of data of many types related to orbiting objects. As the population of tracked objects grows, so too should the data management structure used to catalog technical specifications, orbital information, and metadata related to those populations. Marshall Space Flight Center's Space Asset Management Database (SAM-D) was implemented in order to effectively catalog a broad set of data related to known objects in space by ingesting information from a variety of database and processing that data into useful technical information. Using the universal NORAD number as a unique identifier, the SAM-D processes two-line element data into orbital characteristics and cross-references this technical data with metadata related to functional status, country of ownership, and application category. The SAM-D began as an Excel spreadsheet and was later upgraded to an Access database. While SAM-D performs its task very well, it is limited by its current platform and is not available outside of the local user base. Further, while modeling and simulation can be powerful tools to exploit the information contained in SAM-D, the current system does not allow proper integration options for combining the data with both legacy and new M&S tools. This paper provides a summary of SAM-D development efforts to date and outlines a proposed data management infrastructure that extends SAM-D to support the larger data sets to be generated. A service-oriented architecture model using an information sharing platform named SIMON will allow it to easily expand to incorporate new capabilities, including advanced analytics, M&S tools, fusion techniques and user interface for

  6. dbPAF: an integrative database of protein phosphorylation in animals and fungi.

    Science.gov (United States)

    Ullah, Shahid; Lin, Shaofeng; Xu, Yang; Deng, Wankun; Ma, Lili; Zhang, Ying; Liu, Zexian; Xue, Yu

    2016-03-24

    Protein phosphorylation is one of the most important post-translational modifications (PTMs) and regulates a broad spectrum of biological processes. Recent progresses in phosphoproteomic identifications have generated a flood of phosphorylation sites, while the integration of these sites is an urgent need. In this work, we developed a curated database of dbPAF, containing known phosphorylation sites in H. sapiens, M. musculus, R. norvegicus, D. melanogaster, C. elegans, S. pombe and S. cerevisiae. From the scientific literature and public databases, we totally collected and integrated 54,148 phosphoproteins with 483,001 phosphorylation sites. Multiple options were provided for accessing the data, while original references and other annotations were also present for each phosphoprotein. Based on the new data set, we computationally detected significantly over-represented sequence motifs around phosphorylation sites, predicted potential kinases that are responsible for the modification of collected phospho-sites, and evolutionarily analyzed phosphorylation conservation states across different species. Besides to be largely consistent with previous reports, our results also proposed new features of phospho-regulation. Taken together, our database can be useful for further analyses of protein phosphorylation in human and other model organisms. The dbPAF database was implemented in PHP + MySQL and freely available at http://dbpaf.biocuckoo.org.

  7. PharmDB-K: Integrated Bio-Pharmacological Network Database for Traditional Korean Medicine.

    Directory of Open Access Journals (Sweden)

    Ji-Hyun Lee

    Full Text Available Despite the growing attention given to Traditional Medicine (TM worldwide, there is no well-known, publicly available, integrated bio-pharmacological Traditional Korean Medicine (TKM database for researchers in drug discovery. In this study, we have constructed PharmDB-K, which offers comprehensive information relating to TKM-associated drugs (compound, disease indication, and protein relationships. To explore the underlying molecular interaction of TKM, we integrated fourteen different databases, six Pharmacopoeias, and literature, and established a massive bio-pharmacological network for TKM and experimentally validated some cases predicted from the PharmDB-K analyses. Currently, PharmDB-K contains information about 262 TKMs, 7,815 drugs, 3,721 diseases, 32,373 proteins, and 1,887 side effects. One of the unique sets of information in PharmDB-K includes 400 indicator compounds used for standardization of herbal medicine. Furthermore, we are operating PharmDB-K via phExplorer (a network visualization software and BioMart (a data federation framework for convenient search and analysis of the TKM network. Database URL: http://pharmdb-k.org, http://biomart.i-pharm.org.

  8. High throughput proteomic analysis of the secretome in an explant model of articular cartilage inflammation

    Science.gov (United States)

    Clutterbuck, Abigail L.; Smith, Julia R.; Allaway, David; Harris, Pat; Liddell, Susan; Mobasheri, Ali

    2011-01-01

    This study employed a targeted high-throughput proteomic approach to identify the major proteins present in the secretome of articular cartilage. Explants from equine metacarpophalangeal joints were incubated alone or with interleukin-1beta (IL-1β, 10 ng/ml), with or without carprofen, a non-steroidal anti-inflammatory drug, for six days. After tryptic digestion of culture medium supernatants, resulting peptides were separated by HPLC and detected in a Bruker amaZon ion trap instrument. The five most abundant peptides in each MS scan were fragmented and the fragmentation patterns compared to mammalian entries in the Swiss-Prot database, using the Mascot search engine. Tryptic peptides originating from aggrecan core protein, cartilage oligomeric matrix protein (COMP), fibronectin, fibromodulin, thrombospondin-1 (TSP-1), clusterin (CLU), cartilage intermediate layer protein-1 (CILP-1), chondroadherin (CHAD) and matrix metalloproteinases MMP-1 and MMP-3 were detected. Quantitative western blotting confirmed the presence of CILP-1, CLU, MMP-1, MMP-3 and TSP-1. Treatment with IL-1β increased MMP-1, MMP-3 and TSP-1 and decreased the CLU precursor but did not affect CILP-1 and CLU levels. Many of the proteins identified have well-established extracellular matrix functions and are involved in early repair/stress responses in cartilage. This high throughput approach may be used to study the changes that occur in the early stages of osteoarthritis. PMID:21354348

  9. DPTEdb, an integrative database of transposable elements in dioecious plants.

    Science.gov (United States)

    Li, Shu-Fen; Zhang, Guo-Jun; Zhang, Xue-Jin; Yuan, Jin-Hong; Deng, Chuan-Liang; Gu, Lian-Feng; Gao, Wu-Jun

    2016-01-01

    Dioecious plants usually harbor 'young' sex chromosomes, providing an opportunity to study the early stages of sex chromosome evolution. Transposable elements (TEs) are mobile DNA elements frequently found in plants and are suggested to play important roles in plant sex chromosome evolution. The genomes of several dioecious plants have been sequenced, offering an opportunity to annotate and mine the TE data. However, comprehensive and unified annotation of TEs in these dioecious plants is still lacking. In this study, we constructed a dioecious plant transposable element database (DPTEdb). DPTEdb is a specific, comprehensive and unified relational database and web interface. We used a combination of de novo, structure-based and homology-based approaches to identify TEs from the genome assemblies of previously published data, as well as our own. The database currently integrates eight dioecious plant species and a total of 31 340 TEs along with classification information. DPTEdb provides user-friendly web interfaces to browse, search and download the TE sequences in the database. Users can also use tools, including BLAST, GetORF, HMMER, Cut sequence and JBrowse, to analyze TE data. Given the role of TEs in plant sex chromosome evolution, the database will contribute to the investigation of TEs in structural, functional and evolutionary dynamics of the genome of dioecious plants. In addition, the database will supplement the research of sex diversification and sex chromosome evolution of dioecious plants.Database URL: http://genedenovoweb.ticp.net:81/DPTEdb/index.php. © The Author(s) 2016. Published by Oxford University Press.

  10. The Secretome of Bone Marrow and Wharton Jelly Derived Mesenchymal Stem Cells Induces Differentiation and Neurite Outgrowth in SH-SY5Y Cells

    Directory of Open Access Journals (Sweden)

    Ana O. Pires

    2014-01-01

    Full Text Available The goal of this study was to determine and compare the effects of the secretome of mesenchymal stem cells (MSCs isolated from human bone-marrow (BMSCs and the Wharton jelly surrounding the vein and arteries of the umbilical cord (human umbilical cord perivascular cells (HUCPVCs on the survival and differentiation of a human neuroblastoma cell line (SH-SY5Y. For this purpose, SH-SY5Y cells were differentiated with conditioned media (CM from the MSCs populations referred above. Retinoic acid cultured cells were used as control for neuronal differentiated SH-SY5Y cells. SH-SY5Y cells viability assessment revealed that the secretome of BMSCs and HUCPVCs, in the form of CM, was able to induce their survival. Moreover, immunocytochemical experiments showed that CM from both MSCs was capable of inducing neuronal differentiation of SH-SY5Y cells. Finally, neurite lengths assessment and quantitative real-time reverse-transcription polymerase chain reaction (RT-PCR analysis demonstrated that CM from BMSCs and HUCPVCs differently induced neurite outgrowth and mRNA levels of neuronal markers exhibited by SH-SY5Y cells. Overall, our results show that the secretome of both BMSCs and HUCPVCs was capable of supporting SH-SY5Y cells survival and promoting their differentiation towards a neuronal phenotype.

  11. Angiogenin induces modifications in the astrocyte secretome: relevance to amyotrophic lateral sclerosis.

    Science.gov (United States)

    Skorupa, Alexandra; Urbach, Serge; Vigy, Oana; King, Matthew A; Chaumont-Dubel, Séverine; Prehn, Jochen H M; Marin, Philippe

    2013-10-08

    Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease affecting lower and upper motoneurons. Recent studies have shown that both motor neurons and non-neuronal neighbouring cells such as astrocytes and microglia contribute to disease pathology. Loss-of-function mutations in the angiogenin (ANG) gene have been identified in ALS patients. Angiogenin is enriched in motor neurons and exerts neuroprotective effects in vitro and in vivo. We have recently shown that motoneurons secrete angiogenin, and that secreted angiogenin is exclusively taken up by astrocytes, suggesting a paracrine mechanism of neuroprotection. To gain insights into astrocyte effectors of angiogenin-induced neuroprotection, we examined alterations in the astrocyte secretome induced by angiogenin treatment using quantitative proteomics based on Stable Isotope Labelling by Amino Acids in Cell Culture (SILAC). We identified 2128 proteins in conditioned media from primary cultured mouse astrocytes, including 1247 putative secreted proteins. Of these, 60 proteins showed significant regulation of secretion in response to angiogenin stimulation. Regulated proteins include chemokines and cytokines, proteases and protease inhibitors as well as proteins involved in reorganising the extracellular matrix. In conclusion, this proteomic analysis increases our knowledge of the astrocyte secretome and reveals potential molecular substrates underlying the paracrine, neuroprotective effects of angiogenin. This study provides the most extensive list of astrocyte-secreted proteins available and reveals novel potential molecular substrates of astrocyte-neuron communication. It also identifies a set of astrocyte-derived proteins that might slow down ALS disease progression. It should be relevant to a large readership of neuroscientists and clinicians, in particular those with an interest in the physiological and pathological roles of astrocytes and in the molecular and cellular mechanisms underlying

  12. Integration of a clinical trial database with a PACS

    International Nuclear Information System (INIS)

    Van Herk, M

    2014-01-01

    Many clinical trials use Electronic Case Report Forms (ECRF), e.g., from OpenClinica. Trial data is augmented if DICOM scans, dose cubes, etc. from the Picture Archiving and Communication System (PACS) are included for data mining. Unfortunately, there is as yet no structured way to collect DICOM objects in trial databases. In this paper, we obtain a tight integration of ECRF and PACS using open source software. Methods: DICOM identifiers for selected images/series/studies are stored in associated ECRF events (e.g., baseline) as follows: 1) JavaScript added to OpenClinica communicates using HTML with a gateway server inside the hospitals firewall; 2) On this gateway, an open source DICOM server runs scripts to query and select the data, returning anonymized identifiers; 3) The scripts then collects, anonymizes, zips and transmits selected data to a central trial server; 4) Here data is stored in a DICOM archive which allows authorized ECRF users to view and download the anonymous images associated with each event. Results: All integration scripts are open source. The PACS administrator configures the anonymization script and decides to use the gateway in passive (receiving) mode or in an active mode going out to the PACS to gather data. Our ECRF centric approach supports automatic data mining by iterating over the cases in the ECRF database, providing the identifiers to load images and the clinical data to correlate with image analysis results. Conclusions: Using open source software and web technology, a tight integration has been achieved between PACS and ECRF.

  13. Integration of TGS and CTEN assays using the CTENFIT analysis and databasing program

    International Nuclear Information System (INIS)

    Estep, R.

    2000-01-01

    The CTEN F IT program, written for Windows 9x/NT in C++, performs databasing and analysis of combined thermal/epithermal neutron (CTEN) passive and active neutron assay data and integrates that with isotopics results and gamma-ray data from methods such as tomographic gamma scanning (TGS). The binary database is reflected in a companion Excel database that allows extensive customization via Visual Basic for Applications macros. Automated analysis options make the analysis of the data transparent to the assay system operator. Various record browsers and information displays simplified record keeping tasks

  14. Using ontology databases for scalable query answering, inconsistency detection, and data integration

    Science.gov (United States)

    Dou, Dejing

    2011-01-01

    An ontology database is a basic relational database management system that models an ontology plus its instances. To reason over the transitive closure of instances in the subsumption hierarchy, for example, an ontology database can either unfold views at query time or propagate assertions using triggers at load time. In this paper, we use existing benchmarks to evaluate our method—using triggers—and we demonstrate that by forward computing inferences, we not only improve query time, but the improvement appears to cost only more space (not time). However, we go on to show that the true penalties were simply opaque to the benchmark, i.e., the benchmark inadequately captures load-time costs. We have applied our methods to two case studies in biomedicine, using ontologies and data from genetics and neuroscience to illustrate two important applications: first, ontology databases answer ontology-based queries effectively; second, using triggers, ontology databases detect instance-based inconsistencies—something not possible using views. Finally, we demonstrate how to extend our methods to perform data integration across multiple, distributed ontology databases. PMID:22163378

  15. Critical assessment of human metabolic pathway databases: a stepping stone for future integration

    Directory of Open Access Journals (Sweden)

    Stobbe Miranda D

    2011-10-01

    Full Text Available Abstract Background Multiple pathway databases are available that describe the human metabolic network and have proven their usefulness in many applications, ranging from the analysis and interpretation of high-throughput data to their use as a reference repository. However, so far the various human metabolic networks described by these databases have not been systematically compared and contrasted, nor has the extent to which they differ been quantified. For a researcher using these databases for particular analyses of human metabolism, it is crucial to know the extent of the differences in content and their underlying causes. Moreover, the outcomes of such a comparison are important for ongoing integration efforts. Results We compared the genes, EC numbers and reactions of five frequently used human metabolic pathway databases. The overlap is surprisingly low, especially on reaction level, where the databases agree on 3% of the 6968 reactions they have combined. Even for the well-established tricarboxylic acid cycle the databases agree on only 5 out of the 30 reactions in total. We identified the main causes for the lack of overlap. Importantly, the databases are partly complementary. Other explanations include the number of steps a conversion is described in and the number of possible alternative substrates listed. Missing metabolite identifiers and ambiguous names for metabolites also affect the comparison. Conclusions Our results show that each of the five networks compared provides us with a valuable piece of the puzzle of the complete reconstruction of the human metabolic network. To enable integration of the networks, next to a need for standardizing the metabolite names and identifiers, the conceptual differences between the databases should be resolved. Considerable manual intervention is required to reach the ultimate goal of a unified and biologically accurate model for studying the systems biology of human metabolism. Our comparison

  16. Integrating stations from the North America Gravity Database into a local GPS-based land gravity survey

    Science.gov (United States)

    Shoberg, Thomas G.; Stoddard, Paul R.

    2013-01-01

    The ability to augment local gravity surveys with additional gravity stations from easily accessible national databases can greatly increase the areal coverage and spatial resolution of a survey. It is, however, necessary to integrate such data seamlessly with the local survey. One challenge to overcome in integrating data from national databases is that these data are typically of unknown quality. This study presents a procedure for the evaluation and seamless integration of gravity data of unknown quality from a national database with data from a local Global Positioning System (GPS)-based survey. The starting components include the latitude, longitude, elevation and observed gravity at each station location. Interpolated surfaces of the complete Bouguer anomaly are used as a means of quality control and comparison. The result is an integrated dataset of varying quality with many stations having GPS accuracy and other reliable stations of unknown origin, yielding a wider coverage and greater spatial resolution than either survey alone.

  17. IPAD: the Integrated Pathway Analysis Database for Systematic Enrichment Analysis.

    Science.gov (United States)

    Zhang, Fan; Drabier, Renee

    2012-01-01

    Next-Generation Sequencing (NGS) technologies and Genome-Wide Association Studies (GWAS) generate millions of reads and hundreds of datasets, and there is an urgent need for a better way to accurately interpret and distill such large amounts of data. Extensive pathway and network analysis allow for the discovery of highly significant pathways from a set of disease vs. healthy samples in the NGS and GWAS. Knowledge of activation of these processes will lead to elucidation of the complex biological pathways affected by drug treatment, to patient stratification studies of new and existing drug treatments, and to understanding the underlying anti-cancer drug effects. There are approximately 141 biological human pathway resources as of Jan 2012 according to the Pathguide database. However, most currently available resources do not contain disease, drug or organ specificity information such as disease-pathway, drug-pathway, and organ-pathway associations. Systematically integrating pathway, disease, drug and organ specificity together becomes increasingly crucial for understanding the interrelationships between signaling, metabolic and regulatory pathway, drug action, disease susceptibility, and organ specificity from high-throughput omics data (genomics, transcriptomics, proteomics and metabolomics). We designed the Integrated Pathway Analysis Database for Systematic Enrichment Analysis (IPAD, http://bioinfo.hsc.unt.edu/ipad), defining inter-association between pathway, disease, drug and organ specificity, based on six criteria: 1) comprehensive pathway coverage; 2) gene/protein to pathway/disease/drug/organ association; 3) inter-association between pathway, disease, drug, and organ; 4) multiple and quantitative measurement of enrichment and inter-association; 5) assessment of enrichment and inter-association analysis with the context of the existing biological knowledge and a "gold standard" constructed from reputable and reliable sources; and 6) cross-linking of

  18. KALIMER database development (database configuration and design methodology)

    International Nuclear Information System (INIS)

    Jeong, Kwan Seong; Kwon, Young Min; Lee, Young Bum; Chang, Won Pyo; Hahn, Do Hee

    2001-10-01

    KALIMER Database is an advanced database to utilize the integration management for Liquid Metal Reactor Design Technology Development using Web Applicatins. KALIMER Design database consists of Results Database, Inter-Office Communication (IOC), and 3D CAD database, Team Cooperation system, and Reserved Documents, Results Database is a research results database during phase II for Liquid Metal Reactor Design Technology Develpment of mid-term and long-term nuclear R and D. IOC is a linkage control system inter sub project to share and integrate the research results for KALIMER. 3D CAD Database is s schematic design overview for KALIMER. Team Cooperation System is to inform team member of research cooperation and meetings. Finally, KALIMER Reserved Documents is developed to manage collected data and several documents since project accomplishment. This report describes the features of Hardware and Software and the Database Design Methodology for KALIMER

  19. Development of integrated parameter database for risk assessment at the Rokkasho Reprocessing Plant

    International Nuclear Information System (INIS)

    Tamauchi, Yoshikazu

    2011-01-01

    A study to develop a parameter database for Probabilistic Safety Assessment (PSA) for the application of risk information on plant operation and maintenance activity is important because the transparency, consistency, and traceability of parameters are needed to explanation adequacy of the evaluation to third parties. Application of risk information for the plant operation and maintenance activity, equipment reliability data, human error rate, and 5 factors of 'five-factor formula' for estimation of the amount of radioactive material discharge (source term) are key inputs. As a part of the infrastructure development for the risk information application, we developed the integrated parameter database, 'R-POD' (Rokkasho reprocessing Plant Omnibus parameter Database) on the trial basis for the PSA of the Rokkasho Reprocessing Plant. This database consists primarily of the following 3 parts, 1) an equipment reliability database, 2) a five-factor formula database, and 3) a human reliability database. The underpinning for explaining the validity of the risk assessment can be improved by developing this database. Furthermore, this database is an important tool for the application of risk information, because it provides updated data by incorporating the accumulated operation experiences of the Rokkasho reprocessing plant. (author)

  20. The Center for Integrated Molecular Brain Imaging (Cimbi) database

    DEFF Research Database (Denmark)

    Knudsen, Gitte M.; Jensen, Peter S.; Erritzoe, David

    2016-01-01

    We here describe a multimodality neuroimaging containing data from healthy volunteers and patients, acquired within the Lundbeck Foundation Center for Integrated Molecular Brain Imaging (Cimbi) in Copenhagen, Denmark. The data is of particular relevance for neurobiological research questions rela...... currently contains blood and in some instances saliva samples from about 500 healthy volunteers and 300 patients with e.g., major depression, dementia, substance abuse, obesity, and impulsive aggression. Data continue to be added to the Cimbi database and biobank....

  1. Semantic-JSON: a lightweight web service interface for Semantic Web contents integrating multiple life science databases.

    Science.gov (United States)

    Kobayashi, Norio; Ishii, Manabu; Takahashi, Satoshi; Mochizuki, Yoshiki; Matsushima, Akihiro; Toyoda, Tetsuro

    2011-07-01

    Global cloud frameworks for bioinformatics research databases become huge and heterogeneous; solutions face various diametric challenges comprising cross-integration, retrieval, security and openness. To address this, as of March 2011 organizations including RIKEN published 192 mammalian, plant and protein life sciences databases having 8.2 million data records, integrated as Linked Open or Private Data (LOD/LPD) using SciNetS.org, the Scientists' Networking System. The huge quantity of linked data this database integration framework covers is based on the Semantic Web, where researchers collaborate by managing metadata across public and private databases in a secured data space. This outstripped the data query capacity of existing interface tools like SPARQL. Actual research also requires specialized tools for data analysis using raw original data. To solve these challenges, in December 2009 we developed the lightweight Semantic-JSON interface to access each fragment of linked and raw life sciences data securely under the control of programming languages popularly used by bioinformaticians such as Perl and Ruby. Researchers successfully used the interface across 28 million semantic relationships for biological applications including genome design, sequence processing, inference over phenotype databases, full-text search indexing and human-readable contents like ontology and LOD tree viewers. Semantic-JSON services of SciNetS.org are provided at http://semanticjson.org.

  2. KALIMER database development

    Energy Technology Data Exchange (ETDEWEB)

    Jeong, Kwan Seong; Lee, Yong Bum; Jeong, Hae Yong; Ha, Kwi Seok

    2003-03-01

    KALIMER database is an advanced database to utilize the integration management for liquid metal reactor design technology development using Web applications. KALIMER design database is composed of results database, Inter-Office Communication (IOC), 3D CAD database, and reserved documents database. Results database is a research results database during all phase for liquid metal reactor design technology development of mid-term and long-term nuclear R and D. IOC is a linkage control system inter sub project to share and integrate the research results for KALIMER. 3D CAD database is a schematic overview for KALIMER design structure. And reserved documents database is developed to manage several documents and reports since project accomplishment.

  3. KALIMER database development

    International Nuclear Information System (INIS)

    Jeong, Kwan Seong; Lee, Yong Bum; Jeong, Hae Yong; Ha, Kwi Seok

    2003-03-01

    KALIMER database is an advanced database to utilize the integration management for liquid metal reactor design technology development using Web applications. KALIMER design database is composed of results database, Inter-Office Communication (IOC), 3D CAD database, and reserved documents database. Results database is a research results database during all phase for liquid metal reactor design technology development of mid-term and long-term nuclear R and D. IOC is a linkage control system inter sub project to share and integrate the research results for KALIMER. 3D CAD database is a schematic overview for KALIMER design structure. And reserved documents database is developed to manage several documents and reports since project accomplishment

  4. An inventory of the Aspergillus niger secretome by combining in silico predictions with shotgun proteomics data

    OpenAIRE

    Braaksma, Machtelt; Martens-Uzunova, Elena S; Punt, Peter J; Schaap, Peter J

    2010-01-01

    Abstract Background The ecological niche occupied by a fungal species, its pathogenicity and its usefulness as a microbial cell factory to a large degree depends on its secretome. Protein secretion usually requires the presence of a N-terminal signal peptide (SP) and by scanning for this feature using available highly accurate SP-prediction tools, the fraction of potentially secreted proteins can be directly predicted. However, prediction of a SP does not guarantee that the protein is actuall...

  5. DENdb: database of integrated human enhancers

    KAUST Repository

    Ashoor, Haitham

    2015-09-05

    Enhancers are cis-acting DNA regulatory regions that play a key role in distal control of transcriptional activities. Identification of enhancers, coupled with a comprehensive functional analysis of their properties, could improve our understanding of complex gene transcription mechanisms and gene regulation processes in general. We developed DENdb, a centralized on-line repository of predicted enhancers derived from multiple human cell-lines. DENdb integrates enhancers predicted by five different methods generating an enriched catalogue of putative enhancers for each of the analysed cell-lines. DENdb provides information about the overlap of enhancers with DNase I hypersensitive regions, ChIP-seq regions of a number of transcription factors and transcription factor binding motifs, means to explore enhancer interactions with DNA using several chromatin interaction assays and enhancer neighbouring genes. DENdb is designed as a relational database that facilitates fast and efficient searching, browsing and visualization of information.

  6. DENdb: database of integrated human enhancers

    KAUST Repository

    Ashoor, Haitham; Kleftogiannis, Dimitrios A.; Radovanovic, Aleksandar; Bajic, Vladimir B.

    2015-01-01

    Enhancers are cis-acting DNA regulatory regions that play a key role in distal control of transcriptional activities. Identification of enhancers, coupled with a comprehensive functional analysis of their properties, could improve our understanding of complex gene transcription mechanisms and gene regulation processes in general. We developed DENdb, a centralized on-line repository of predicted enhancers derived from multiple human cell-lines. DENdb integrates enhancers predicted by five different methods generating an enriched catalogue of putative enhancers for each of the analysed cell-lines. DENdb provides information about the overlap of enhancers with DNase I hypersensitive regions, ChIP-seq regions of a number of transcription factors and transcription factor binding motifs, means to explore enhancer interactions with DNA using several chromatin interaction assays and enhancer neighbouring genes. DENdb is designed as a relational database that facilitates fast and efficient searching, browsing and visualization of information.

  7. Characterization of the cellulolytic secretome of Trichoderma harzianum during growth on sugarcane bagasse and analysis of the activity boosting effects of swollenin.

    Science.gov (United States)

    A L Rocha, Vanessa; N Maeda, Roberto; Pereira, Nei; F Kern, Marcelo; Elias, Luisa; Simister, Rachael; Steele-King, Clare; Gómez, Leonardo D; McQueen-Mason, Simon J

    2016-03-01

    This study demonstrates the production of an active enzyme cocktail produced by growing Trichoderma harzianum on sugarcane bagasse. The component enzymes were identified by LCMS-MS. Glycosyl hydrolases were the most abundant class of proteins, representing 67% of total secreted protein. Other carbohydrate active enzymes involved in cell wall deconstruction included lytic polysaccharide mono-oxygenases (AA9), carbohydrate-binding modules, carbohydrate esterases and swollenin, all present at levels of 1%. In total, proteases and lipases represented 5 and 1% of the total secretome, respectively, with the rest of the secretome being made up of proteins of unknown or putative function. This enzyme cocktail was efficient in catalysing the hydrolysis of sugarcane bagasse cellulolignin to fermentable sugars for potential use in ethanol production. Apart from mapping the secretome of T. harzianum, which is a very important tool to understand the catalytic performance of enzyme cocktails, the gene coding for T. harzianum swollenin was expressed in Aspergillus niger. This novel aspect in this work, allowed increasing the swollenin concentration by 95 fold. This is the first report about the heterologous expression of swollenin from T. harzianum, and the findings are of interest in enriching enzyme cocktail with this important accessory protein which takes part in the cellulose amorphogenesis. Despite lacking detectable glycoside activity, the addition of swollenin of T. harzianum increased by two-fold the hydrolysis efficiency of a commercial cellulase cocktail. © 2016 American Institute of Chemical Engineers Biotechnol. Prog., 32:327-336, 2016. © 2016 American Institute of Chemical Engineers.

  8. Integrating Variances into an Analytical Database

    Science.gov (United States)

    Sanchez, Carlos

    2010-01-01

    For this project, I enrolled in numerous SATERN courses that taught the basics of database programming. These include: Basic Access 2007 Forms, Introduction to Database Systems, Overview of Database Design, and others. My main job was to create an analytical database that can handle many stored forms and make it easy to interpret and organize. Additionally, I helped improve an existing database and populate it with information. These databases were designed to be used with data from Safety Variances and DCR forms. The research consisted of analyzing the database and comparing the data to find out which entries were repeated the most. If an entry happened to be repeated several times in the database, that would mean that the rule or requirement targeted by that variance has been bypassed many times already and so the requirement may not really be needed, but rather should be changed to allow the variance's conditions permanently. This project did not only restrict itself to the design and development of the database system, but also worked on exporting the data from the database to a different format (e.g. Excel or Word) so it could be analyzed in a simpler fashion. Thanks to the change in format, the data was organized in a spreadsheet that made it possible to sort the data by categories or types and helped speed up searches. Once my work with the database was done, the records of variances could be arranged so that they were displayed in numerical order, or one could search for a specific document targeted by the variances and restrict the search to only include variances that modified a specific requirement. A great part that contributed to my learning was SATERN, NASA's resource for education. Thanks to the SATERN online courses I took over the summer, I was able to learn many new things about computers and databases and also go more in depth into topics I already knew about.

  9. Integrated data acquisition, storage, retrieval and processing using the COMPASS DataBase (CDB)

    Energy Technology Data Exchange (ETDEWEB)

    Urban, J., E-mail: urban@ipp.cas.cz [Institute of Plasma Physics AS CR, v.v.i., Za Slovankou 3, 182 00 Praha 8 (Czech Republic); Pipek, J.; Hron, M. [Institute of Plasma Physics AS CR, v.v.i., Za Slovankou 3, 182 00 Praha 8 (Czech Republic); Janky, F.; Papřok, R.; Peterka, M. [Institute of Plasma Physics AS CR, v.v.i., Za Slovankou 3, 182 00 Praha 8 (Czech Republic); Department of Surface and Plasma Science, Faculty of Mathematics and Physics, Charles University in Prague, V Holešovičkách 2, 180 00 Praha 8 (Czech Republic); Duarte, A.S. [Instituto de Plasmas e Fusão Nuclear, Instituto Superior Técnico, Universidade Técnica de Lisboa, 1049-001 Lisboa (Portugal)

    2014-05-15

    Highlights: • CDB is used as a new data storage solution for the COMPASS tokamak. • The software is light weight, open, fast and easily extensible and scalable. • CDB seamlessly integrates with any data acquisition system. • Rich metadata are stored for physics signals. • Data can be processed automatically, based on dependence rules. - Abstract: We present a complex data handling system for the COMPASS tokamak, operated by IPP ASCR Prague, Czech Republic [1]. The system, called CDB (COMPASS DataBase), integrates different data sources as an assortment of data acquisition hardware and software from different vendors is used. Based on widely available open source technologies wherever possible, CDB is vendor and platform independent and it can be easily scaled and distributed. The data is directly stored and retrieved using a standard NAS (Network Attached Storage), hence independent of the particular technology; the description of the data (the metadata) is recorded in a relational database. Database structure is general and enables the inclusion of multi-dimensional data signals in multiple revisions (no data is overwritten). This design is inherently distributed as the work is off-loaded to the clients. Both NAS and database can be implemented and optimized for fast local access as well as secure remote access. CDB is implemented in Python language; bindings for Java, C/C++, IDL and Matlab are provided. Independent data acquisitions systems as well as nodes managed by FireSignal [2] are all integrated using CDB. An automated data post-processing server is a part of CDB. Based on dependency rules, the server executes, in parallel if possible, prescribed post-processing tasks.

  10. An information integration system for structured documents, Web, and databases

    OpenAIRE

    Morishima, Atsuyuki

    1998-01-01

    Rapid advance in computer network technology has changed the style of computer utilization. Distributed computing resources over world-wide computer networks are available from our local computers. They include powerful computers and a variety of information sources. This change is raising more advanced requirements. Integration of distributed information sources is one of such requirements. In addition to conventional databases, structured documents have been widely used, and have increasing...

  11. Comparative analysis of the predicted secretomes of Rosaceae scab pathogens Venturia inaequalis and V. pirina reveals expanded effector families and putative determinants of host range.

    Science.gov (United States)

    Deng, Cecilia H; Plummer, Kim M; Jones, Darcy A B; Mesarich, Carl H; Shiller, Jason; Taranto, Adam P; Robinson, Andrew J; Kastner, Patrick; Hall, Nathan E; Templeton, Matthew D; Bowen, Joanna K

    2017-05-02

    Fungal plant pathogens belonging to the genus Venturia cause damaging scab diseases of members of the Rosaceae. In terms of economic impact, the most important of these are V. inaequalis, which infects apple, and V. pirina, which is a pathogen of European pear. Given that Venturia fungi colonise the sub-cuticular space without penetrating plant cells, it is assumed that effectors that contribute to virulence and determination of host range will be secreted into this plant-pathogen interface. Thus the predicted secretomes of a range of isolates of Venturia with distinct host-ranges were interrogated to reveal putative proteins involved in virulence and pathogenicity. Genomes of Venturia pirina (one European pear scab isolate) and Venturia inaequalis (three apple scab, and one loquat scab, isolates) were sequenced and the predicted secretomes of each isolate identified. RNA-Seq was conducted on the apple-specific V. inaequalis isolate Vi1 (in vitro and infected apple leaves) to highlight virulence and pathogenicity components of the secretome. Genes encoding over 600 small secreted proteins (candidate effectors) were identified, most of which are novel to Venturia, with expansion of putative effector families a feature of the genus. Numerous genes with similarity to Leptosphaeria maculans AvrLm6 and the Verticillium spp. Ave1 were identified. Candidates for avirulence effectors with cognate resistance genes involved in race-cultivar specificity were identified, as were putative proteins involved in host-species determination. Candidate effectors were found, on average, to be in regions of relatively low gene-density and in closer proximity to repeats (e.g. transposable elements), compared with core eukaryotic genes. Comparative secretomics has revealed candidate effectors from Venturia fungal plant pathogens that attack pome fruit. Effectors that are putative determinants of host range were identified; both those that may be involved in race-cultivar and host

  12. Characterization of HSP27 phosphorylation sites in human atherosclerotic plaque secretome

    DEFF Research Database (Denmark)

    Durán, Mari-Carmen; Boeri-Erba, Elisabetta; Mohammed, Shabaz

    2007-01-01

    spectrometry (MS). Among the identified proteins, two isoforms of heat shock protein 27 (HSP27), a protein recently described as a potential biomarker of atherosclerosis, were detected. However, the putative mechanisms in which HSP27 isoforms could be involved in the atherosclerotic process are unknown. Thus......, the role that phosphorylated HSP27 could play in the atherosclerotic process is actually under study. The present work shows the strategies employed to characterize the phosphorylation in the HSP27 secreted by atheroma plaque samples. The application of liquid chromatography tandem mass spectrometry (MS......-lymphocytes). These interactions can be mediated by proteins secreted from these cells, which therefore exert an important role in the atherosclerotic process. We recently described a novel strategy for the characterization of the human atherosclerotic plaque secretome, combining two-dimensional gel electrophoresis and mass...

  13. Construction of an ortholog database using the semantic web technology for integrative analysis of genomic data.

    Science.gov (United States)

    Chiba, Hirokazu; Nishide, Hiroyo; Uchiyama, Ikuo

    2015-01-01

    Recently, various types of biological data, including genomic sequences, have been rapidly accumulating. To discover biological knowledge from such growing heterogeneous data, a flexible framework for data integration is necessary. Ortholog information is a central resource for interlinking corresponding genes among different organisms, and the Semantic Web provides a key technology for the flexible integration of heterogeneous data. We have constructed an ortholog database using the Semantic Web technology, aiming at the integration of numerous genomic data and various types of biological information. To formalize the structure of the ortholog information in the Semantic Web, we have constructed the Ortholog Ontology (OrthO). While the OrthO is a compact ontology for general use, it is designed to be extended to the description of database-specific concepts. On the basis of OrthO, we described the ortholog information from our Microbial Genome Database for Comparative Analysis (MBGD) in the form of Resource Description Framework (RDF) and made it available through the SPARQL endpoint, which accepts arbitrary queries specified by users. In this framework based on the OrthO, the biological data of different organisms can be integrated using the ortholog information as a hub. Besides, the ortholog information from different data sources can be compared with each other using the OrthO as a shared ontology. Here we show some examples demonstrating that the ortholog information described in RDF can be used to link various biological data such as taxonomy information and Gene Ontology. Thus, the ortholog database using the Semantic Web technology can contribute to biological knowledge discovery through integrative data analysis.

  14. Integrated database for identifying candidate genes for Aspergillus flavus resistance in maize.

    Science.gov (United States)

    Kelley, Rowena Y; Gresham, Cathy; Harper, Jonathan; Bridges, Susan M; Warburton, Marilyn L; Hawkins, Leigh K; Pechanova, Olga; Peethambaran, Bela; Pechan, Tibor; Luthe, Dawn S; Mylroie, J E; Ankala, Arunkanth; Ozkan, Seval; Henry, W B; Williams, W P

    2010-10-07

    Aspergillus flavus Link:Fr, an opportunistic fungus that produces aflatoxin, is pathogenic to maize and other oilseed crops. Aflatoxin is a potent carcinogen, and its presence markedly reduces the value of grain. Understanding and enhancing host resistance to A. flavus infection and/or subsequent aflatoxin accumulation is generally considered an efficient means of reducing grain losses to aflatoxin. Different proteomic, genomic and genetic studies of maize (Zea mays L.) have generated large data sets with the goal of identifying genes responsible for conferring resistance to A. flavus, or aflatoxin. In order to maximize the usage of different data sets in new studies, including association mapping, we have constructed a relational database with web interface integrating the results of gene expression, proteomic (both gel-based and shotgun), Quantitative Trait Loci (QTL) genetic mapping studies, and sequence data from the literature to facilitate selection of candidate genes for continued investigation. The Corn Fungal Resistance Associated Sequences Database (CFRAS-DB) (http://agbase.msstate.edu/) was created with the main goal of identifying genes important to aflatoxin resistance. CFRAS-DB is implemented using MySQL as the relational database management system running on a Linux server, using an Apache web server, and Perl CGI scripts as the web interface. The database and the associated web-based interface allow researchers to examine many lines of evidence (e.g. microarray, proteomics, QTL studies, SNP data) to assess the potential role of a gene or group of genes in the response of different maize lines to A. flavus infection and subsequent production of aflatoxin by the fungus. CFRAS-DB provides the first opportunity to integrate data pertaining to the problem of A. flavus and aflatoxin resistance in maize in one resource and to support queries across different datasets. The web-based interface gives researchers different query options for mining the database

  15. Comparative Secretome Analysis of Trichoderma reesei and Aspergillus niger during Growth on Sugarcane Biomass.

    Directory of Open Access Journals (Sweden)

    Gustavo Pagotto Borin

    Full Text Available Our dependence on fossil fuel sources and concern about the environment has generated a worldwide interest in establishing new sources of fuel and energy. Thus, the use of ethanol as a fuel is advantageous because it is an inexhaustible energy source and has minimal environmental impact. Currently, Brazil is the world's second largest producer of ethanol, which is produced from sugarcane juice fermentation. However, several studies suggest that Brazil could double its production per hectare by using sugarcane bagasse and straw, known as second-generation (2G bioethanol. Nevertheless, the use of this biomass presents a challenge because the plant cell wall structure, which is composed of complex sugars (cellulose and hemicelluloses, must be broken down into fermentable sugar, such as glucose and xylose. To achieve this goal, several types of hydrolytic enzymes are necessary, and these enzymes represent the majority of the cost associated with 2G bioethanol processing. Reducing the cost of the saccharification process can be achieved via a comprehensive understanding of the hydrolytic mechanisms and enzyme secretion of polysaccharide-hydrolyzing microorganisms. In many natural habitats, several microorganisms degrade lignocellulosic biomass through a set of enzymes that act synergistically. In this study, two fungal species, Aspergillus niger and Trichoderma reesei, were grown on sugarcane biomass with two levels of cell wall complexity, culm in natura and pretreated bagasse. The production of enzymes related to biomass degradation was monitored using secretome analyses after 6, 12 and 24 hours. Concurrently, we analyzed the sugars in the supernatant.Analyzing the concentration of monosaccharides in the supernatant, we observed that both species are able to disassemble the polysaccharides of sugarcane cell walls since 6 hours post-inoculation. The sugars from the polysaccharides such as arabinoxylan and β-glucan (that compose the most external

  16. Comparative Secretome Analysis of Trichoderma reesei and Aspergillus niger during Growth on Sugarcane Biomass

    Science.gov (United States)

    Borin, Gustavo Pagotto; Sanchez, Camila Cristina; de Souza, Amanda Pereira; de Santana, Eliane Silva; de Souza, Aline Tieppo; Leme, Adriana Franco Paes; Squina, Fabio Marcio; Buckeridge, Marcos; Goldman, Gustavo Henrique; Oliveira, Juliana Velasco de Castro

    2015-01-01

    Background Our dependence on fossil fuel sources and concern about the environment has generated a worldwide interest in establishing new sources of fuel and energy. Thus, the use of ethanol as a fuel is advantageous because it is an inexhaustible energy source and has minimal environmental impact. Currently, Brazil is the world's second largest producer of ethanol, which is produced from sugarcane juice fermentation. However, several studies suggest that Brazil could double its production per hectare by using sugarcane bagasse and straw, known as second-generation (2G) bioethanol. Nevertheless, the use of this biomass presents a challenge because the plant cell wall structure, which is composed of complex sugars (cellulose and hemicelluloses), must be broken down into fermentable sugar, such as glucose and xylose. To achieve this goal, several types of hydrolytic enzymes are necessary, and these enzymes represent the majority of the cost associated with 2G bioethanol processing. Reducing the cost of the saccharification process can be achieved via a comprehensive understanding of the hydrolytic mechanisms and enzyme secretion of polysaccharide-hydrolyzing microorganisms. In many natural habitats, several microorganisms degrade lignocellulosic biomass through a set of enzymes that act synergistically. In this study, two fungal species, Aspergillus niger and Trichoderma reesei, were grown on sugarcane biomass with two levels of cell wall complexity, culm in natura and pretreated bagasse. The production of enzymes related to biomass degradation was monitored using secretome analyses after 6, 12 and 24 hours. Concurrently, we analyzed the sugars in the supernatant. Results Analyzing the concentration of monosaccharides in the supernatant, we observed that both species are able to disassemble the polysaccharides of sugarcane cell walls since 6 hours post-inoculation. The sugars from the polysaccharides such as arabinoxylan and β-glucan (that compose the most external

  17. Brassica database (BRAD) version 2.0: integrating and mining Brassicaceae species genomic resources.

    Science.gov (United States)

    Wang, Xiaobo; Wu, Jian; Liang, Jianli; Cheng, Feng; Wang, Xiaowu

    2015-01-01

    The Brassica database (BRAD) was built initially to assist users apply Brassica rapa and Arabidopsis thaliana genomic data efficiently to their research. However, many Brassicaceae genomes have been sequenced and released after its construction. These genomes are rich resources for comparative genomics, gene annotation and functional evolutionary studies of Brassica crops. Therefore, we have updated BRAD to version 2.0 (V2.0). In BRAD V2.0, 11 more Brassicaceae genomes have been integrated into the database, namely those of Arabidopsis lyrata, Aethionema arabicum, Brassica oleracea, Brassica napus, Camelina sativa, Capsella rubella, Leavenworthia alabamica, Sisymbrium irio and three extremophiles Schrenkiella parvula, Thellungiella halophila and Thellungiella salsuginea. BRAD V2.0 provides plots of syntenic genomic fragments between pairs of Brassicaceae species, from the level of chromosomes to genomic blocks. The Generic Synteny Browser (GBrowse_syn), a module of the Genome Browser (GBrowse), is used to show syntenic relationships between multiple genomes. Search functions for retrieving syntenic and non-syntenic orthologs, as well as their annotation and sequences are also provided. Furthermore, genome and annotation information have been imported into GBrowse so that all functional elements can be visualized in one frame. We plan to continually update BRAD by integrating more Brassicaceae genomes into the database. Database URL: http://brassicadb.org/brad/. © The Author(s) 2015. Published by Oxford University Press.

  18. VaProS: a database-integration approach for protein/genome information retrieval

    KAUST Repository

    Gojobori, Takashi; Ikeo, Kazuho; Katayama, Yukie; Kawabata, Takeshi; Kinjo, Akira R.; Kinoshita, Kengo; Kwon, Yeondae; Migita, Ohsuke; Mizutani, Hisashi; Muraoka, Masafumi; Nagata, Koji; Omori, Satoshi; Sugawara, Hideaki; Yamada, Daichi; Yura, Kei

    2016-01-01

    Life science research now heavily relies on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein–protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that facilitates life science researchers for retrieving experts’ knowledge stored in the databases and for building a new hypothesis of the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as structural effect of the gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of search exemplified in quest of the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/.

  19. VaProS: a database-integration approach for protein/genome information retrieval

    KAUST Repository

    Gojobori, Takashi

    2016-12-24

    Life science research now heavily relies on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein–protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that facilitates life science researchers for retrieving experts’ knowledge stored in the databases and for building a new hypothesis of the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as structural effect of the gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of search exemplified in quest of the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/.

  20. IntPath--an integrated pathway gene relationship database for model organisms and important pathogens.

    Science.gov (United States)

    Zhou, Hufeng; Jin, Jingjing; Zhang, Haojun; Yi, Bo; Wozniak, Michal; Wong, Limsoon

    2012-01-01

    Pathway data are important for understanding the relationship between genes, proteins and many other molecules in living organisms. Pathway gene relationships are crucial information for guidance, prediction, reference and assessment in biochemistry, computational biology, and medicine. Many well-established databases--e.g., KEGG, WikiPathways, and BioCyc--are dedicated to collecting pathway data for public access. However, the effectiveness of these databases is hindered by issues such as incompatible data formats, inconsistent molecular representations, inconsistent molecular relationship representations, inconsistent referrals to pathway names, and incomprehensive data from different databases. In this paper, we overcome these issues through extraction, normalization and integration of pathway data from several major public databases (KEGG, WikiPathways, BioCyc, etc). We build a database that not only hosts our integrated pathway gene relationship data for public access but also maintains the necessary updates in the long run. This public repository is named IntPath (Integrated Pathway gene relationship database for model organisms and important pathogens). Four organisms--S. cerevisiae, M. tuberculosis H37Rv, H. Sapiens and M. musculus--are included in this version (V2.0) of IntPath. IntPath uses the "full unification" approach to ensure no deletion and no introduced noise in this process. Therefore, IntPath contains much richer pathway-gene and pathway-gene pair relationships and much larger number of non-redundant genes and gene pairs than any of the single-source databases. The gene relationships of each gene (measured by average node degree) per pathway are significantly richer. The gene relationships in each pathway (measured by average number of gene pairs per pathway) are also considerably richer in the integrated pathways. Moderate manual curation are involved to get rid of errors and noises from source data (e.g., the gene ID errors in WikiPathways and

  1. An integrated photogrammetric and spatial database management system for producing fully structured data using aerial and remote sensing images.

    Science.gov (United States)

    Ahmadi, Farshid Farnood; Ebadi, Hamid

    2009-01-01

    3D spatial data acquired from aerial and remote sensing images by photogrammetric techniques is one of the most accurate and economic data sources for GIS, map production, and spatial data updating. However, there are still many problems concerning storage, structuring and appropriate management of spatial data obtained using these techniques. According to the capabilities of spatial database management systems (SDBMSs); direct integration of photogrammetric and spatial database management systems can save time and cost of producing and updating digital maps. This integration is accomplished by replacing digital maps with a single spatial database. Applying spatial databases overcomes the problem of managing spatial and attributes data in a coupled approach. This management approach is one of the main problems in GISs for using map products of photogrammetric workstations. Also by the means of these integrated systems, providing structured spatial data, based on OGC (Open GIS Consortium) standards and topological relations between different feature classes, is possible at the time of feature digitizing process. In this paper, the integration of photogrammetric systems and SDBMSs is evaluated. Then, different levels of integration are described. Finally design, implementation and test of a software package called Integrated Photogrammetric and Oracle Spatial Systems (IPOSS) is presented.

  2. An Integrated Photogrammetric and Spatial Database Management System for Producing Fully Structured Data Using Aerial and Remote Sensing Images

    Directory of Open Access Journals (Sweden)

    Farshid Farnood Ahmadi

    2009-03-01

    Full Text Available 3D spatial data acquired from aerial and remote sensing images by photogrammetric techniques is one of the most accurate and economic data sources for GIS, map production, and spatial data updating. However, there are still many problems concerning storage, structuring and appropriate management of spatial data obtained using these techniques. According to the capabilities of spatial database management systems (SDBMSs; direct integration of photogrammetric and spatial database management systems can save time and cost of producing and updating digital maps. This integration is accomplished by replacing digital maps with a single spatial database. Applying spatial databases overcomes the problem of managing spatial and attributes data in a coupled approach. This management approach is one of the main problems in GISs for using map products of photogrammetric workstations. Also by the means of these integrated systems, providing structured spatial data, based on OGC (Open GIS Consortium standards and topological relations between different feature classes, is possible at the time of feature digitizing process. In this paper, the integration of photogrammetric systems and SDBMSs is evaluated. Then, different levels of integration are described. Finally design, implementation and test of a software package called Integrated Photogrammetric and Oracle Spatial Systems (IPOSS is presented.

  3. OECD/NEA data bank scientific and integral experiments databases in support of knowledge preservation and transfer

    International Nuclear Information System (INIS)

    Sartori, E.; Kodeli, I.; Mompean, F.J.; Briggs, J.B.; Gado, J.; Hasegawa, A.; D'hondt, P.; Wiesenack, W.; Zaetta, A.

    2004-01-01

    The OECD/Nuclear Energy Data Bank was established by its member countries as an institution to allow effective sharing of knowledge and its basic underlying information and data in key areas of nuclear science and technology. The activities as regards preserving and transferring knowledge consist of the: 1) Acquisition of basic nuclear data, computer codes and experimental system data needed over a wide range of nuclear and radiation applications; 2) Independent verification and validation of these data using quality assurance methods, adding value through international benchmark exercises, workshops and meetings and by issuing relevant reports with conclusions and recommendations, as well as by organising training courses to ensure their qualified and competent use; 3) Dissemination of the different products to authorised establishments in member countries and collecting and integrating user feedback. Of particular importance has been the establishment of basic and integral experiments databases and the methodology developed with the aim of knowledge preservation and transfer. Databases established thus far include: 1) IRPhE - International Reactor Physics Experimental Benchmarks Evaluations, 2) SINBAD - a radiation shielding experiments database (nuclear reactors, fusion neutronics and accelerators), 3) IFPE - International Fuel Performance Benchmark Experiments Database, 4) TDB - The Thermochemical Database Project, 5) ICSBE - International Nuclear Criticality Safety Benchmark Evaluations, 6) CCVM - CSNI Code Validation Matrix of Thermal-hydraulic Codes for LWR LOCA and Transients. This paper will concentrate on knowledge preservation and transfer concepts and methods related to some of the integral experiments and TDB. (author)

  4. MAGIC Database and Interfaces: An Integrated Package for Gene Discovery and Expression

    Directory of Open Access Journals (Sweden)

    Lee H. Pratt

    2006-03-01

    Full Text Available The rapidly increasing rate at which biological data is being produced requires a corresponding growth in relational databases and associated tools that can help laboratories contend with that data. With this need in mind, we describe here a Modular Approach to a Genomic, Integrated and Comprehensive (MAGIC Database. This Oracle 9i database derives from an initial focus in our laboratory on gene discovery via production and analysis of expressed sequence tags (ESTs, and subsequently on gene expression as assessed by both EST clustering and microarrays. The MAGIC Gene Discovery portion of the database focuses on information derived from DNA sequences and on its biological relevance. In addition to MAGIC SEQ-LIMS, which is designed to support activities in the laboratory, it contains several additional subschemas. The latter include MAGIC Admin for database administration, MAGIC Sequence for sequence processing as well as sequence and clone attributes, MAGIC Cluster for the results of EST clustering, MAGIC Polymorphism in support of microsatellite and single-nucleotide-polymorphism discovery, and MAGIC Annotation for electronic annotation by BLAST and BLAT. The MAGIC Microarray portion is a MIAME-compliant database with two components at present. These are MAGIC Array-LIMS, which makes possible remote entry of all information into the database, and MAGIC Array Analysis, which provides data mining and visualization. Because all aspects of interaction with the MAGIC Database are via a web browser, it is ideally suited not only for individual research laboratories but also for core facilities that serve clients at any distance.

  5. Database Description - RMOS | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available base Description General information of database Database name RMOS Alternative nam...arch Unit Shoshi Kikuchi E-mail : Database classification Plant databases - Rice Microarray Data and other Gene Expression Database...s Organism Taxonomy Name: Oryza sativa Taxonomy ID: 4530 Database description The Ric...19&lang=en Whole data download - Referenced database Rice Expression Database (RED) Rice full-length cDNA Database... (KOME) Rice Genome Integrated Map Database (INE) Rice Mutant Panel Database (Tos17) Rice Genome Annotation Database

  6. An Integrated Molecular Database on Indian Insects.

    Science.gov (United States)

    Pratheepa, Maria; Venkatesan, Thiruvengadam; Gracy, Gandhi; Jalali, Sushil Kumar; Rangheswaran, Rajagopal; Antony, Jomin Cruz; Rai, Anil

    2018-01-01

    MOlecular Database on Indian Insects (MODII) is an online database linking several databases like Insect Pest Info, Insect Barcode Information System (IBIn), Insect Whole Genome sequence, Other Genomic Resources of National Bureau of Agricultural Insect Resources (NBAIR), Whole Genome sequencing of Honey bee viruses, Insecticide resistance gene database and Genomic tools. This database was developed with a holistic approach for collecting information about phenomic and genomic information of agriculturally important insects. This insect resource database is available online for free at http://cib.res.in. http://cib.res.in/.

  7. Unraveling the in vitro secretome of the phytopathogen Botrytis cinerea to understand the interaction with its hosts

    Directory of Open Access Journals (Sweden)

    Raquel eGonzález-Fernández

    2015-10-01

    Full Text Available Botrytis cinerea is a necrotrophic fungus with high adaptability to different environments and hosts. It secretes a large number of extracellular proteins, which favor plant tissue penetration and colonization, thus contributing to virulence. Secretomics is a proteomics sub-discipline which study the secreted proteins and their secretion mechanisms, so-called secretome. By using proteomics as experimental approach, many secreted proteins by B. cinerea have been identified from in vitro experiments, and belonging to different functional categories: i cell wall-degrading enzymes such as pectinesterases, and endo-polygalacturonases; ii proteases involved in host protein degradation such as an aspartic protease; iii proteins related to the oxidative burst such as glyoxal oxidase; iv proteins which may induce the plant hypersensitive response such as a cerato-platanin domain-containing protein; and v proteins related to production and secretion of toxins such as malate dehydrogenase. In this mini-review, we made an overview of the proteomics contribution to the study and knowledge of the B. cinerea extracellular secreted proteins based on our current work carried out from in vitro experiments, and recent published papers both in vitro and in planta studies on this fungi. We hypothesize on the putative functions of these secreted proteins, and their connection to the biology of the B. cinerea interaction with its hosts.

  8. Oxidation of Wine Polyphenols by Secretomes of Wild Botrytis cinerea Strains from White and Red Grape Varieties and Determination of Their Specific Laccase Activity.

    Science.gov (United States)

    Zimdars, Sabrina; Hitschler, Julia; Schieber, Andreas; Weber, Fabian

    2017-12-06

    Processing of Botrytis cinerea-infected grapes leads to enhanced enzymatic browning reactions mainly caused by the enzyme laccase which is able to oxidize a wide range of phenolic compounds. The extent of color deterioration depends on the activity of the enzymes secreted by the fungus. The present study revealed significant differences in the oxidative properties of secretomes of several B. cinerea strains isolated from five grape varieties. The presumed laccase-containing secretomes varied in their catalytic activity toward six phenolic compounds present in grapes. All strains led to identical product profiles for five of six substrates, but two strains showed deviating product profiles during gallic acid oxidation. Fast oxidation of caffeic acid, ferulic acid, and malvidin 3-O-glucoside was observed. Product formation rates and relative product concentrations were determined. The results reflect the wide range of enzyme activity and the corresponding different impact on color deterioration by B. cinerea.

  9. High and Low Doses of Ionizing Radiation Induce Different Secretome Profiles in a Human Skin Model

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Qibin; Matzke, Melissa M.; Schepmoes, Athena A.; Moore, Ronald J.; Webb-Robertson, Bobbie-Jo M.; Hu, Zeping; Monroe, Matthew E.; Qian, Weijun; Smith, Richard D.; Morgan, William F.

    2014-03-18

    It is postulated that secreted soluble factors are important contributors of bystander effect and adaptive responses observed in low dose ionizing radiation. Using multidimensional liquid chromatography-mass spectrometry based proteomics, we quantified the changes of skin tissue secretome – the proteins secreted from a full thickness, reconstituted 3-dimensional skin tissue model 48 hr after exposure to 3, 10 and 200 cGy of X-rays. Overall, 135 proteins showed statistical significant difference between the sham (0 cGy) and any of the irradiated groups (3, 10 or 200 cGy) on the basis of Dunnett adjusted t-test; among these, 97 proteins showed a trend of downregulation and 9 proteins showed a trend of upregulation with increasing radiation dose. In addition, there were 21 and 8 proteins observed to have irregular trends with the 10 cGy irradiated group either having the highest or the lowest level among all three radiated doses. Moreover, two proteins, carboxypeptidase E and ubiquitin carboxyl-terminal hydrolase isozyme L1 were sensitive to ionizing radiation, but relatively independent of radiation dose. Conversely, proteasome activator complex subunit 2 protein appeared to be sensitive to the dose of radiation, as rapid upregulation of this protein was observed when radiation doses were increased from 3, to 10 or 200 cGy. These results suggest that different mechanisms of action exist at the secretome level for low and high doses of ionizing radiation.

  10. Analytical and computational approaches to define the Aspergillus niger secretome

    Energy Technology Data Exchange (ETDEWEB)

    Tsang, Adrian; Butler, Gregory D.; Powlowski, Justin; Panisko, Ellen A.; Baker, Scott E.

    2009-03-01

    We used computational and mass spectrometric approaches to characterize the Aspergillus niger secretome. The 11,200 gene models predicted in the genome of A. niger strain ATCC 1015 were the data source for the analysis. Depending on the computational methods used, 691 to 881 proteins were predicted to be secreted proteins. We cultured A. niger in six different media and analyzed the extracellular proteins produced using mass spectrometry. A total of 222 proteins were identified, with 39 proteins expressed under all six conditions and 74 proteins expressed under only one condition. The secreted proteins identified by mass spectrometry were used to guide the correction of about 20 gene models. Additional analysis focused on extracellular enzymes of interest for biomass processing. Of the 63 glycoside hydrolases predicted to be capable of hydrolyzing cellulose, hemicellulose or pectin, 94% of the exo-acting enzymes and only 18% of the endo-acting enzymes were experimentally detected.

  11. FY1995 transduction method and CAD database systems for integrated design; 1995 nendo transduction ho to CAD database togo sekkei shien system

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-03-01

    Transduction method developed by the research coordinator and Prof. Muroga is one of the most popular methods to design large-scale integrated circuits, and thus used by major design tool companies in USA and Japan. The major objectives of the research is to improve capability and utilize its reusable property by combining with CAD databases. Major results of the project is as follows, (1) Improvement of Transduction method : Efficiency, capability and the maximum circuit size are improved. Error compensation method is also improved. (2) Applications to new logic elements : Transduction method is modified to cope with wired logic and FPGAs. (3) CAD databases : One of the major advantages of Transduction methods is 'reusability' of already designed circuits. It is suitable to combine with CAD databases. We design CAD databases suitable for cooperative design using Transduction method. (4) Program development : Programs for Windows95 and developed for distribution. (NEDO)

  12. FY1995 transduction method and CAD database systems for integrated design; 1995 nendo transduction ho to CAD database togo sekkei shien system

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-03-01

    Transduction method developed by the research coordinator and Prof. Muroga is one of the most popular methods to design large-scale integrated circuits, and thus used by major design tool companies in USA and Japan. The major objectives of the research is to improve capability and utilize its reusable property by combining with CAD databases. Major results of the project is as follows, (1) Improvement of Transduction method : Efficiency, capability and the maximum circuit size are improved. Error compensation method is also improved. (2) Applications to new logic elements : Transduction method is modified to cope with wired logic and FPGAs. (3) CAD databases : One of the major advantages of Transduction methods is 'reusability' of already designed circuits. It is suitable to combine with CAD databases. We design CAD databases suitable for cooperative design using Transduction method. (4) Program development : Programs for Windows95 and developed for distribution. (NEDO)

  13. Global search tool for the Advanced Photon Source Integrated Relational Model of Installed Systems (IRMIS) database

    International Nuclear Information System (INIS)

    Quock, D.E.R.; Cianciarulo, M.B.

    2007-01-01

    The Integrated Relational Model of Installed Systems (IRMIS) is a relational database tool that has been implemented at the Advanced Photon Source to maintain an updated account of approximately 600 control system software applications, 400,000 process variables, and 30,000 control system hardware components. To effectively display this large amount of control system information to operators and engineers, IRMIS was initially built with nine Web-based viewers: Applications Organizing Index, IOC, PLC, Component Type, Installed Components, Network, Controls Spares, Process Variables, and Cables. However, since each viewer is designed to provide details from only one major category of the control system, the necessity for a one-stop global search tool for the entire database became apparent. The user requirements for extremely fast database search time and ease of navigation through search results led to the choice of Asynchronous JavaScript and XML (AJAX) technology in the implementation of the IRMIS global search tool. Unique features of the global search tool include a two-tier level of displayed search results, and a database data integrity validation and reporting mechanism.

  14. Secretome Analysis of Lipid-Induced Insulin Resistance in Skeletal Muscle Cells by a Combined Experimental and Bioinformatics Workflow

    DEFF Research Database (Denmark)

    Deshmukh, Atul S; Cox, Juergen; Jensen, Lars Juhl

    2015-01-01

    , in principle, allows an unbiased and comprehensive analysis of cellular secretomes; however, the distinction of bona fide secreted proteins from proteins released upon lysis of a small fraction of dying cells remains challenging. Here we applied highly sensitive MS and streamlined bioinformatics to analyze......-resistant conditions. Our study demonstrates an efficient combined experimental and bioinformatics workflow to identify putative secreted proteins from insulin-resistant skeletal muscle cells, which could easily be adapted to other cellular models....

  15. CPLA 1.0: an integrated database of protein lysine acetylation.

    Science.gov (United States)

    Liu, Zexian; Cao, Jun; Gao, Xinjiao; Zhou, Yanhong; Wen, Longping; Yang, Xiangjiao; Yao, Xuebiao; Ren, Jian; Xue, Yu

    2011-01-01

    As a reversible post-translational modification (PTM) discovered decades ago, protein lysine acetylation was known for its regulation of transcription through the modification of histones. Recent studies discovered that lysine acetylation targets broad substrates and especially plays an essential role in cellular metabolic regulation. Although acetylation is comparable with other major PTMs such as phosphorylation, an integrated resource still remains to be developed. In this work, we presented the compendium of protein lysine acetylation (CPLA) database for lysine acetylated substrates with their sites. From the scientific literature, we manually collected 7151 experimentally identified acetylation sites in 3311 targets. We statistically studied the regulatory roles of lysine acetylation by analyzing the Gene Ontology (GO) and InterPro annotations. Combined with protein-protein interaction information, we systematically discovered a potential human lysine acetylation network (HLAN) among histone acetyltransferases (HATs), substrates and histone deacetylases (HDACs). In particular, there are 1862 triplet relationships of HAT-substrate-HDAC retrieved from the HLAN, at least 13 of which were previously experimentally verified. The online services of CPLA database was implemented in PHP + MySQL + JavaScript, while the local packages were developed in JAVA 1.5 (J2SE 5.0). The CPLA database is freely available for all users at: http://cpla.biocuckoo.org.

  16. A semantic data dictionary method for database schema integration in CIESIN

    Science.gov (United States)

    Hinds, N.; Huang, Y.; Ravishankar, C.

    1993-08-01

    CIESIN (Consortium for International Earth Science Information Network) is funded by NASA to investigate the technology necessary to integrate and facilitate the interdisciplinary use of Global Change information. A clear of this mission includes providing a link between the various global change data sets, in particular the physical sciences and the human (social) sciences. The typical scientist using the CIESIN system will want to know how phenomena in an outside field affects his/her work. For example, a medical researcher might ask: how does air-quality effect emphysema? This and many similar questions will require sophisticated semantic data integration. The researcher who raised the question may be familiar with medical data sets containing emphysema occurrences. But this same investigator may know little, if anything, about the existance or location of air-quality data. It is easy to envision a system which would allow that investigator to locate and perform a ``join'' on two data sets, one containing emphysema cases and the other containing air-quality levels. No such system exists today. One major obstacle to providing such a system will be overcoming the heterogeneity which falls into two broad categories. ``Database system'' heterogeneity involves differences in data models and packages. ``Data semantic'' heterogeneity involves differences in terminology between disciplines which translates into data semantic issues, and varying levels of data refinement, from raw to summary. Our work investigates a global data dictionary mechanism to facilitate a merged data service. Specially, we propose using a semantic tree during schema definition to aid in locating and integrating heterogeneous databases.

  17. Resolving breast cancer heterogeneity by searching reliable protein cancer biomarkers in the breast fluid secretome

    International Nuclear Information System (INIS)

    Mannello, Ferdinando; Ligi, Daniela

    2013-01-01

    One of the major goals in cancer research is to find and evaluate the early presence of biomarkers in human fluids and tissues. To resolve the complex cell heterogeneity of a tumor mass, it will be useful to characterize the intricate biomolecular composition of tumor microenvironment (the so called cancer secretome), validating secreted proteins as early biomarkers of cancer initiation and progression. This approach is not broadly applicable because of the paucity of well validated and FDA-approved biomarkers and because most of the candidate biomarkers are mainly organ-specific rather than tumor-specific. For these reasons, there is an urgent need to identify and validate a panel of biomarker combinations for early detection of human tumors. This is especially important for breast cancer, the cancer spread most worldwide among women. It is well known that patients with early diagnosed breast cancer live longer, require less extensive treatment and fare better than patients with more aggressive and/or advanced disease. In the frame of searching breast cancer biomarkers (especially using nipple aspirate fluid mirroring breast microenvironment), studies have highlighted an optimal combination of well-known biomarkers: uPA + PAI-1 + TF. When individually investigated they did not show perfect accuracy in predicting the presence of breast cancer, whereas the triple combination has been demonstrated to be highly predictive of pre-cancer and/or cancerous conditions, approaching 97-100% accuracy. Despite the heterogeneous composition of breast cancer and the difficulties to find specific breast cancer biomolecules, the noninvasive analysis of the nipple aspirate fluid secretome may significantly improve the discovery of promising biomarkers, helping also the differentiation among benign and invasive breast diseases, opening new frontiers in early oncoproteomics

  18. Analysis of membrane proteome and secretome in cells over-expressing ADAM17 using quantitative proteomics

    International Nuclear Information System (INIS)

    Kawahara, R.; Simabuco, F.M.; Yokoo, S.; Paes Leme, A.F.; Sherman, N.

    2012-01-01

    Full text: A disintegrin and metalloproteinase (ADAM) protease is involved in proteolytic ectodomain shedding of several membrane-associated proteins and modulation of key cell signaling pathways in the tumor microenvironment. In this study, we examined the effect of over-expressing the full length human ADAM17 in membrane and secreted proteins. To this end, we constructed a stable Flp-In T-RExHEK293 cells expressing ADAM17 by tetracycline induction. These cells were grown in Dulbeccos modified Eagles medium containing light lysine, arginine or heavy, L-Arg-13C615N4 and L-Lys -13C615N2 (SILAC: stable isotope labeling with amino acid in cell culture) media and they were treated with an ADAM17 activator, phorbolester (PMA). Controls such as Flp-In T-RExHEK293 cell without PMA treatment and without ADAM17 cloned were cultivated in light medium. The ADAM17 overexpression was induced with tetracycline 500 ng/ml for 24 hours. Cells in a heavy condition were treated with PMA 50 ng/ml for 1 hour and vehicle DMSO was used as control in a light cell condition. The extracellular media were collected, concentrated and used to evaluate the secretome and a cell surface biotinylation-based approach was used to capture cell surface-associated proteins. The biotinylated proteins were eluted with dithiothreitol, alkylated with iodoacetamide and then digested with trypsin. The resulting peptides were subjected to LC-MS/MS analysis on an ETD enabled Orbitrap Velos instrument. The results showed different proteins up or down regulated in membrane and secretome analysis which might represent potential molecules involved in signaling or ADAM17 regulation events. (author)

  19. Analysis of the outer membrane proteome and secretome of Bacteroides fragilis reveals a multiplicity of secretion mechanisms.

    Directory of Open Access Journals (Sweden)

    Marlena M Wilson

    Full Text Available Bacteroides fragilis is a widely distributed member of the human gut microbiome and an opportunistic pathogen. Cell surface molecules produced by this organism likely play important roles in colonization, communication with other microbes, and pathogenicity, but the protein composition of the outer membrane (OM and the mechanisms used to transport polypeptides into the extracellular space are poorly characterized. Here we used LC-MS/MS to analyze the OM proteome and secretome of B. fragilis NCTC 9343 grown under laboratory conditions. Of the 229 OM proteins that we identified, 108 are predicted to be lipoproteins, and 61 are predicted to be TonB-dependent transporters. Based on their proximity to genes encoding TonB-dependent transporters, many of the lipoprotein genes likely encode proteins involved in nutrient or small molecule uptake. Interestingly, protease accessibility and biotinylation experiments indicated that an unusually large fraction of the lipoproteins are cell-surface exposed. We also identified three proteins that are members of a novel family of autotransporters, multiple potential type I protein secretion systems, and proteins that appear to be components of a type VI secretion apparatus. The secretome consisted of lipoproteins and other proteins that might be substrates of the putative type I or type VI secretion systems. Our proteomic studies show that B. fragilis differs considerably from well-studied Gram-negative bacteria such as Escherichia coli in both the spectrum of OM proteins that it produces and the range of secretion strategies that it utilizes.

  20. Facile synthesis of novel magnetic silica nanoparticles functionalized with layer-by-layer detonation nanodiamonds for secretome study.

    Science.gov (United States)

    Li, Hong; Wang, Yi; Zhang, Lei; Lu, Haojie; Zhou, Zhongjun; Wei, Liming; Yang, Pengyuan

    2015-12-07

    Novel magnetic silica nanoparticles functionalized with layer-by-layer detonation nanodiamonds (dNDs) were prepared by coating single submicron-size magnetite particles with silica and subsequently modified with dNDs. The resulting layer-by-layer dND functionalized magnetic silica microspheres (Fe3O4@SiO2@[dND]n) exhibit a well-defined magnetite-core-silica-shell structure and possess a high content of magnetite, which endow them with high dispersibility and excellent magnetic responsibility. Meanwhile, dNDs are known for their high affinity and biocompatibility towards peptides or proteins. Thus, a novel convenient, fast and efficient pretreatment approach of low-abundance peptides or proteins was successfully established with Fe3O4@SiO2@[dND]n microspheres. The signal intensity of low-abundance peptides was improved by at least two to three orders of magnitude in mass spectrometry analysis. The novel microsphere also showed good tolerance to salt. Even with a high concentration of salt, peptides or proteins could be isolated effectively from samples. Therefore, the convenient and efficient enrichment process of this novel layer-by-layer dND-functionalized microsphere makes it a promising candidate for isolation of protein in a large volume of culture supernatant for secretome analysis. In the application of Fe3O4@SiO2@[dND]n in the secretome of hepatoma cells, 1473 proteins were identified and covered a broad range of pI and molecular weight, including 377 low molecular weight proteins.

  1. Quantification of the N-glycosylated secretome by super-SILAC during breast cancer progression and in human blood Samples

    DEFF Research Database (Denmark)

    Boersema, P.J.; Geiger, T.; Wiśniewski, J.R.

    2013-01-01

    Cells secrete a large number of proteins to communicate with their surroundings. Furthermore, plasma membrane proteins and intracellular proteins can be released into the extracellular space by regulated or non-regulated processes. Here, we profiled the supernatant of 11 cell lines....... In total, 1398 unique N-glycosylation sites were identified and quantified. Enriching for N-glycosylated peptides focused the analysis on classically secreted and membrane proteins. N-glycosylated secretome profiles correctly clustered the different cell lines to their respective cancer stage, suggesting...

  2. GeNNet: an integrated platform for unifying scientific workflows and graph databases for transcriptome data analysis

    Directory of Open Access Journals (Sweden)

    Raquel L. Costa

    2017-07-01

    Full Text Available There are many steps in analyzing transcriptome data, from the acquisition of raw data to the selection of a subset of representative genes that explain a scientific hypothesis. The data produced can be represented as networks of interactions among genes and these may additionally be integrated with other biological databases, such as Protein-Protein Interactions, transcription factors and gene annotation. However, the results of these analyses remain fragmented, imposing difficulties, either for posterior inspection of results, or for meta-analysis by the incorporation of new related data. Integrating databases and tools into scientific workflows, orchestrating their execution, and managing the resulting data and its respective metadata are challenging tasks. Additionally, a great amount of effort is equally required to run in-silico experiments to structure and compose the information as needed for analysis. Different programs may need to be applied and different files are produced during the experiment cycle. In this context, the availability of a platform supporting experiment execution is paramount. We present GeNNet, an integrated transcriptome analysis platform that unifies scientific workflows with graph databases for selecting relevant genes according to the evaluated biological systems. It includes GeNNet-Wf, a scientific workflow that pre-loads biological data, pre-processes raw microarray data and conducts a series of analyses including normalization, differential expression inference, clusterization and gene set enrichment analysis. A user-friendly web interface, GeNNet-Web, allows for setting parameters, executing, and visualizing the results of GeNNet-Wf executions. To demonstrate the features of GeNNet, we performed case studies with data retrieved from GEO, particularly using a single-factor experiment in different analysis scenarios. As a result, we obtained differentially expressed genes for which biological functions were

  3. Automated granularity to integrate digital information: the "Antarctic Treaty Searchable Database" case study

    Directory of Open Access Journals (Sweden)

    Paul Arthur Berkman

    2006-06-01

    Full Text Available Access to information is necessary, but not sufficient in our digital era. The challenge is to objectively integrate digital resources based on user-defined objectives for the purpose of discovering information relationships that facilitate interpretations and decision making. The Antarctic Treaty Searchable Database (http://aspire.nvi.net, which is in its sixth edition, provides an example of digital integration based on the automated generation of information granules that can be dynamically combined to reveal objective relationships within and between digital information resources. This case study further demonstrates that automated granularity and dynamic integration can be accomplished simply by utilizing the inherent structure of the digital information resources. Such information integration is relevant to library and archival programs that require long-term preservation of authentic digital resources.

  4. Secretome analysis of Trichoderma reesei and Aspergillus niger cultivated by submerged and sequential fermentation processes: Enzyme production for sugarcane bagasse hydrolysis.

    Science.gov (United States)

    Florencio, Camila; Cunha, Fernanda M; Badino, Alberto C; Farinas, Cristiane S; Ximenes, Eduardo; Ladisch, Michael R

    2016-08-01

    Cellulases and hemicellulases from Trichoderma reesei and Aspergillus niger have been shown to be powerful enzymes for biomass conversion to sugars, but the production costs are still relatively high for commercial application. The choice of an effective microbial cultivation process employed for enzyme production is important, since it may affect titers and the profile of protein secretion. We used proteomic analysis to characterize the secretome of T. reesei and A. niger cultivated in submerged and sequential fermentation processes. The information gained was key to understand differences in hydrolysis of steam exploded sugarcane bagasse for enzyme cocktails obtained from two different cultivation processes. The sequential process for cultivating A. niger gave xylanase and β-glucosidase activities 3- and 8-fold higher, respectively, than corresponding activities from the submerged process. A greater protein diversity of critical cellulolytic and hemicellulolytic enzymes were also observed through secretome analyses. These results helped to explain the 3-fold higher yield for hydrolysis of non-washed pretreated bagasse when combined T. reesei and A. niger enzyme extracts from sequential fermentation were used in place of enzymes obtained from submerged fermentation. An enzyme loading of 0.7 FPU cellulase activity/g glucan was surprisingly effective when compared to the 5-15 times more enzyme loadings commonly reported for other cellulose hydrolysis studies. Analyses showed that more than 80% consisted of proteins other than cellulases whose role is important to the hydrolysis of a lignocellulose substrate. Our work combined proteomic analyses and enzymology studies to show that sequential and submerged cultivation methods differently influence both titers and secretion profile of key enzymes required for the hydrolysis of sugarcane bagasse. The higher diversity of feruloyl esterases, xylanases and other auxiliary hemicellulolytic enzymes observed in the enzyme

  5. Document control system as an integral part of RA documentation database application

    International Nuclear Information System (INIS)

    Steljic, M.M; Ljubenov, V.Lj. . E-mail address of corresponding author: milijanas@vin.bg.ac.yu; Steljic, M.M.)

    2005-01-01

    The decision about the final shutdown of the RA research reactor in Vinca Institute has been brought in 2002, and therefore the preparations for its decommissioning have begun. All activities are supervised by the International Atomic Energy Agency (IAEA), which also provides technical and experts' support. This paper describes the document control system is an integral part of the existing RA documentation database. (author)

  6. Identification of colonic fibroblast secretomes reveals secretory factors regulating colon cancer cell proliferation.

    Science.gov (United States)

    Chen, Sun-Xia; Xu, Xiao-En; Wang, Xiao-Qing; Cui, Shu-Jian; Xu, Lei-Lei; Jiang, Ying-Hua; Zhang, Yang; Yan, Hai-Bo; Zhang, Qian; Qiao, Jie; Yang, Peng-Yuan; Liu, Feng

    2014-10-14

    Stromal microenvironment influences tumor cell proliferation and migration. Fibroblasts represent the most abundant stromal constituents. Here, we established two pairs of normal fibroblast (NF) and cancer-associated fibroblast (CAF) cultures from colorectal adenocarcinoma tissues and the normal counterparts. The NFs and CAFs were stained positive for typical fibroblast markers and inhibited colon cancer (CC) cell proliferation in in vitro cocultures and in xenograft mouse models. The fibroblast conditioned media were analyzed using LC-MS and 227 proteins were identified at a false discovery rate of 1.3%, including 131 putative secretory and 20 plasma membrane proteins. These proteins were enriched for functional categories of extracellular matrix, adhesion, cell motion, inflammatory response, redox homeostasis and peptidase inhibitor. Secreted protein acidic and rich in cysteine, transgelin, follistatin-related protein 1 (FSTL1) and decorin was abundant in the fibroblast secretome as confirmed by Western blot. Silencing of FSTL1 and transgelin in colonic fibroblast cell line CCD-18Co induced an accelerated proliferation of CC cells in cocultures. Exogenous FSTL1 attenuates CC cell proliferation in a negative fashion. FSTL1 was upregulated in CC patient plasma and cancerous tissues but had no implication in prognosis. Our results provided novel insights into the molecular signatures and modulatory role of CC associated fibroblasts. In this study, a label-free LC-MS was performed to analyze the secretomes of two paired primary fibroblasts, which were isolated from fresh surgical specimen of colorectal adenocarcinoma and adjacent normal colonic tissues and exhibited negative modulatory activity for colon cancer cell growth in in vitro cocultures and in vivo xenograph mouse models. Follistatin-related protein 1 was further revealed to be one of the stroma-derived factors of potential suppression role for colon cancer cell proliferation. Our results provide novel

  7. Reactor core materials research and integrated material database establishment

    International Nuclear Information System (INIS)

    Ryu, Woo Seog; Jang, J. S.; Kim, D. W.

    2002-03-01

    Mainly two research areas were covered in this project. One is to establish the integrated database of nuclear materials, and the other is to study the behavior of reactor core materials, which are usually under the most severe condition in the operating plants. During the stage I of the project (for three years since 1999) in- and out of reactor properties of stainless steel, the major structural material for the core structures of PWR (Pressurized Water Reactor), were evaluated and specification of nuclear grade material was established. And the damaged core components from domestic power plants, e.g. orifice of CVCS, support pin of CRGT, etc. were investigated and the causes were revealed. To acquire more resistant materials to the nuclear environments, development of the alternative alloys was also conducted. For the integrated DB establishment, a task force team was set up including director of nuclear materials technology team, and projector leaders and relevant members from each project. The DB is now opened in public through the Internet

  8. Integrating Environmental and Human Health Databases in the Great Lakes Basin: Themes, Challenges and Future Directions

    Directory of Open Access Journals (Sweden)

    Kate L. Bassil

    2015-03-01

    Full Text Available Many government, academic and research institutions collect environmental data that are relevant to understanding the relationship between environmental exposures and human health. Integrating these data with health outcome data presents new challenges that are important to consider to improve our effective use of environmental health information. Our objective was to identify the common themes related to the integration of environmental and health data, and suggest ways to address the challenges and make progress toward more effective use of data already collected, to further our understanding of environmental health associations in the Great Lakes region. Environmental and human health databases were identified and reviewed using literature searches and a series of one-on-one and group expert consultations. Databases identified were predominantly environmental stressors databases, with fewer found for health outcomes and human exposure. Nine themes or factors that impact integration were identified: data availability, accessibility, harmonization, stakeholder collaboration, policy and strategic alignment, resource adequacy, environmental health indicators, and data exchange networks. The use and cost effectiveness of data currently collected could be improved by strategic changes to data collection and access systems to provide better opportunities to identify and study environmental exposures that may impact human health.

  9. Secretomics identifies Fusarium graminearum proteins involved in the interaction with barley and wheat

    DEFF Research Database (Denmark)

    Yang, Fen; Jensen, Jens D.; Svensson, Birte

    2012-01-01

    Fusarium graminearum is a phytopathogenic fungus primarily infecting small grain cereals, including barley and wheat. Secreted enzymes play important roles in the pathogenicity of many fungi. In order to access the secretome of F. graminearum, the fungus was grown in liquid culture with barley...... or wheat flour as the sole nutrient source to mimic the host–pathogen interaction. A gel‐based proteomics approach was employed to identify the proteins secreted into the culture medium. Sixty‐nine unique fungal proteins were identified in 154 protein spots, including enzymes involved in the degradation...... between wheat and barley flour medium were mainly involved in fungal cell wall remodelling and the degradation of plant cell walls, starch and proteins. The in planta expression of corresponding F. graminearum genes was confirmed by quantitative reverse transcriptase‐polymerase chain reaction in barley...

  10. Recent Advances in In Vitro Fertilization: Proteomics, Secretomics, Metabolomics and In Vitro Maturation

    Directory of Open Access Journals (Sweden)

    Ercan Baştu

    2013-03-01

    Full Text Available Since its first successful result in 1978, clinicians and researchers have been working on increasing the efficiency and safety of in vitro fertilization (IVF. As a result of advances in technology and understanding of human reproduction, IVF success rates have increased while high-order multiple pregnancy (triplets and more rates have decreased. On the other, there is opportunity for further improvement as many couples still face ‘unexplained infertility’ and high rates of twin pregnancies. Latest technologic and scientific improvements in IVF are promising. The aim of this review is to present the latest advances in the fields of proteomics, secretomics, metabolomics and oocyte culture, how they can potentially improve embryo selection and in vitro maturation (IVM and subsequently their possible impact on the safety and efficacy of IVF.

  11. Evolution and diversity of secretome genes in the apicomplexan parasite Theileria annulata

    Directory of Open Access Journals (Sweden)

    Shiels Brian R

    2010-01-01

    Full Text Available Abstract Background Little is known about how apicomplexan parasites have evolved to infect different host species and cell types. Theileria annulata and Theileria parva invade and transform bovine leukocytes but each species favours a different host cell lineage. Parasite-encoded proteins secreted from the intracellular macroschizont stage within the leukocyte represent a critical interface between host and pathogen systems. Genome sequencing has revealed that several Theileria-specific gene families encoding secreted proteins are positively selected at the inter-species level, indicating diversification between the species. We extend this analysis to the intra-species level, focusing on allelic diversity of two major secretome families. These families represent a well-characterised group of genes implicated in control of the host cell phenotype and a gene family of unknown function. To gain further insight into their evolution and function, this study investigates whether representative genes of these two families are diversifying or constrained within the T. annulata population. Results Strong evidence is provided that the sub-telomerically encoded SVSP family and the host-nucleus targeted TashAT family have evolved under contrasting pressures within natural T. annulata populations. SVSP genes were found to possess atypical codon usage and be evolving neutrally, with high levels of nucleotide substitutions and multiple indels. No evidence of geographical sub-structuring of allelic sequences was found. In contrast, TashAT family genes, implicated in control of host cell gene expression, are strongly conserved at the protein level and geographically sub-structured allelic sequences were identified among Tunisian and Turkish isolates. Although different copy numbers of DNA binding motifs were identified in alleles of TashAT proteins, motif periodicity was strongly maintained, implying conserved functional activity of these sites. Conclusions

  12. Genome, secretome and glucose transport highlight unique features of the protein production host Pichia pastoris

    Directory of Open Access Journals (Sweden)

    Mattanovich Diethard

    2009-06-01

    Full Text Available Abstract Background Pichia pastoris is widely used as a production platform for heterologous proteins and model organism for organelle proliferation. Without a published genome sequence available, strain and process development relied mainly on analogies to other, well studied yeasts like Saccharomyces cerevisiae. Results To investigate specific features of growth and protein secretion, we have sequenced the 9.4 Mb genome of the type strain DSMZ 70382 and analyzed the secretome and the sugar transporters. The computationally predicted secretome consists of 88 ORFs. When grown on glucose, only 20 proteins were actually secreted at detectable levels. These data highlight one major feature of P. pastoris, namely the low contamination of heterologous proteins with host cell protein, when applying glucose based expression systems. Putative sugar transporters were identified and compared to those of related yeast species. The genome comprises 2 homologs to S. cerevisiae low affinity transporters and 2 to high affinity transporters of other Crabtree negative yeasts. Contrary to other yeasts, P. pastoris possesses 4 H+/glycerol transporters. Conclusion This work highlights significant advantages of using the P. pastoris system with glucose based expression and fermentation strategies. As only few proteins and no proteases are actually secreted on glucose, it becomes evident that cell lysis is the relevant cause of proteolytic degradation of secreted proteins. The endowment with hexose transporters, dominantly of the high affinity type, limits glucose uptake rates and thus overflow metabolism as observed in S. cerevisiae. The presence of 4 genes for glycerol transporters explains the high specific growth rates on this substrate and underlines the suitability of a glycerol/glucose based fermentation strategy. Furthermore, we present an open access web based genome browser http://www.pichiagenome.org.

  13. MiCroKit 3.0: an integrated database of midbody, centrosome and kinetochore.

    Science.gov (United States)

    Ren, Jian; Liu, Zexian; Gao, Xinjiao; Jin, Changjiang; Ye, Mingliang; Zou, Hanfa; Wen, Longping; Zhang, Zhaolei; Xue, Yu; Yao, Xuebiao

    2010-01-01

    During cell division/mitosis, a specific subset of proteins is spatially and temporally assembled into protein super complexes in three distinct regions, i.e. centrosome/spindle pole, kinetochore/centromere and midbody/cleavage furrow/phragmoplast/bud neck, and modulates cell division process faithfully. Although many experimental efforts have been carried out to investigate the characteristics of these proteins, no integrated database was available. Here, we present the MiCroKit database (http://microkit.biocuckoo.org) of proteins that localize in midbody, centrosome and/or kinetochore. We collected into the MiCroKit database experimentally verified microkit proteins from the scientific literature that have unambiguous supportive evidence for subcellular localization under fluorescent microscope. The current version of MiCroKit 3.0 provides detailed information for 1489 microkit proteins from seven model organisms, including Saccharomyces cerevisiae, Schizasaccharomyces pombe, Caenorhabditis elegans, Drosophila melanogaster, Xenopus laevis, Mus musculus and Homo sapiens. Moreover, the orthologous information was provided for these microkit proteins, and could be a useful resource for further experimental identification. The online service of MiCroKit database was implemented in PHP + MySQL + JavaScript, while the local packages were developed in JAVA 1.5 (J2SE 5.0).

  14. Analysis and databasing software for integrated tomographic gamma scanner (TGS) and passive-active neutron (PAN) assay systems

    International Nuclear Information System (INIS)

    Estep, R.J.; Melton, S.G.; Buenafe, C.

    2000-01-01

    The CTEN-FIT program, written for Windows 9x/NT in C++,performs databasing and analysis of combined thermal/epithermal neutron (CTEN) passive and active neutron assay data and integrates that with isotopics results and gamma-ray data from methods such as tomographic gamma scanning (TGS). The binary database is reflected in a companion Excel database that allows extensive customization via Visual Basic for Applications macros. Automated analysis options make the analysis of the data transparent to the assay system operator. Various record browsers and information displays simplify record keeping tasks

  15. Development of SRS.php, a Simple Object Access Protocol-based library for data acquisition from integrated biological databases.

    Science.gov (United States)

    Barbosa-Silva, A; Pafilis, E; Ortega, J M; Schneider, R

    2007-12-11

    Data integration has become an important task for biological database providers. The current model for data exchange among different sources simplifies the manner that distinct information is accessed by users. The evolution of data representation from HTML to XML enabled programs, instead of humans, to interact with biological databases. We present here SRS.php, a PHP library that can interact with the data integration Sequence Retrieval System (SRS). The library has been written using SOAP definitions, and permits the programmatic communication through webservices with the SRS. The interactions are possible by invoking the methods described in WSDL by exchanging XML messages. The current functions available in the library have been built to access specific data stored in any of the 90 different databases (such as UNIPROT, KEGG and GO) using the same query syntax format. The inclusion of the described functions in the source of scripts written in PHP enables them as webservice clients to the SRS server. The functions permit one to query the whole content of any SRS database, to list specific records in these databases, to get specific fields from the records, and to link any record among any pair of linked databases. The case study presented exemplifies the library usage to retrieve information regarding registries of a Plant Defense Mechanisms database. The Plant Defense Mechanisms database is currently being developed, and the proposal of SRS.php library usage is to enable the data acquisition for the further warehousing tasks related to its setup and maintenance.

  16. Autism genetic database (AGD: a comprehensive database including autism susceptibility gene-CNVs integrated with known noncoding RNAs and fragile sites

    Directory of Open Access Journals (Sweden)

    Talebizadeh Zohreh

    2009-09-01

    Full Text Available Abstract Background Autism is a highly heritable complex neurodevelopmental disorder, therefore identifying its genetic basis has been challenging. To date, numerous susceptibility genes and chromosomal abnormalities have been reported in association with autism, but most discoveries either fail to be replicated or account for a small effect. Thus, in most cases the underlying causative genetic mechanisms are not fully understood. In the present work, the Autism Genetic Database (AGD was developed as a literature-driven, web-based, and easy to access database designed with the aim of creating a comprehensive repository for all the currently reported genes and genomic copy number variations (CNVs associated with autism in order to further facilitate the assessment of these autism susceptibility genetic factors. Description AGD is a relational database that organizes data resulting from exhaustive literature searches for reported susceptibility genes and CNVs associated with autism. Furthermore, genomic information about human fragile sites and noncoding RNAs was also downloaded and parsed from miRBase, snoRNA-LBME-db, piRNABank, and the MIT/ICBP siRNA database. A web client genome browser enables viewing of the features while a web client query tool provides access to more specific information for the features. When applicable, links to external databases including GenBank, PubMed, miRBase, snoRNA-LBME-db, piRNABank, and the MIT siRNA database are provided. Conclusion AGD comprises a comprehensive list of susceptibility genes and copy number variations reported to-date in association with autism, as well as all known human noncoding RNA genes and fragile sites. Such a unique and inclusive autism genetic database will facilitate the evaluation of autism susceptibility factors in relation to known human noncoding RNAs and fragile sites, impacting on human diseases. As a result, this new autism database offers a valuable tool for the research

  17. Cigarette smoke alters the secretome of lung epithelial cells.

    Science.gov (United States)

    Mossina, Alessandra; Lukas, Christina; Merl-Pham, Juliane; Uhl, Franziska E; Mutze, Kathrin; Schamberger, Andrea; Staab-Weijnitz, Claudia; Jia, Jie; Yildirim, Ali Ö; Königshoff, Melanie; Hauck, Stefanie M; Eickelberg, Oliver; Meiners, Silke

    2017-01-01

    Cigarette smoke is the most relevant risk factor for the development of lung cancer and chronic obstructive pulmonary disease. Many of its more than 4500 chemicals are highly reactive, thereby altering protein structure and function. Here, we used subcellular fractionation coupled to label-free quantitative MS to globally assess alterations in the proteome of different compartments of lung epithelial cells upon exposure to cigarette smoke extract. Proteomic profiling of the human alveolar derived cell line A549 revealed the most pronounced changes within the cellular secretome with preferential downregulation of proteins involved in wound healing and extracellular matrix organization. In particular, secretion of secreted protein acidic and rich in cysteine, a matricellular protein that functions in tissue response to injury, was consistently diminished by cigarette smoke extract in various pulmonary epithelial cell lines and primary cells of human and mouse origin as well as in mouse ex vivo lung tissue cultures. Our study reveals a previously unrecognized acute response of lung epithelial cells to cigarette smoke that includes altered secretion of proteins involved in extracellular matrix organization and wound healing. This may contribute to sustained alterations in tissue remodeling as observed in lung cancer and chronic obstructive pulmonary disease. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Mapping N-linked Glycosylation Sites in the Secretome and Whole Cells of Aspergillus niger Using Hydrazide Chemistry and Mass Spectrometry

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Lu; Aryal, Uma K.; Dai, Ziyu; Mason, Alisa C.; Monroe, Matthew E.; Tian, Zhixin; Zhou, Jianying; Su, Dian; Weitz, Karl K.; Liu, Tao; Camp, David G.; Smith, Richard D.; Baker, Scott E.; Qian, Weijun

    2012-01-01

    Protein glycosylation is known to play an essential role in both cellular functions and the secretory pathways; however, little information is available on the dynamics of glycosylated N-linked glycosites of fungi. Herein we present the first extensive mapping of glycosylated N-linked glycosites in industrial strain Aspergillus niger by applying an optimized solid phase enrichment of glycopeptide protocol using hydrazide modified magnetic beads. The enrichment protocol was initially optimized using mouse plasma and A. niger secretome samples, which was then applied to profile N-linked glycosites from both the secretome and whole cell lysates of A. niger. A total of 847 unique N-linked glycosites and 330 N-linked glycoproteins were confidently identified by LC-MS/MS. Based on gene ontology analysis, the identified N-linked glycoproteins in the whole cell lysate were primarily localized in the plasma membrane, endoplasmic reticulum, golgi apparatus, lysosome, and storage vacuoles. The identified N-linked glycoproteins are involved in a wide range of biological processes including gene regulation and signal transduction, protein folding and assembly, protein modification and carbohydrate metabolism. The extensive coverage of glycosylated N-linked glycosites along with identification of partial N-linked glycosylation in those enzymes involving in different biochemical pathways provide useful information for functional studies of N-linked glycosylation and their biotechnological applications in A. niger.

  19. Database Description - tRNADB-CE | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data List Contact us tRNAD...B-CE Database Description General information of database Database name tRNADB-CE Alter...CC BY-SA Detail Background and funding Name: MEXT Integrated Database Project Reference(s) Article title: tRNAD... 2009 Jan;37(Database issue):D163-8. External Links: Article title: tRNADB-CE 2011: tRNA gene database curat...n Download License Update History of This Database Site Policy | Contact Us Database Description - tRNADB-CE | LSDB Archive ...

  20. Network and Database Security: Regulatory Compliance, Network, and Database Security - A Unified Process and Goal

    OpenAIRE

    Errol A. Blake

    2007-01-01

    Database security has evolved; data security professionals have developed numerous techniques and approaches to assure data confidentiality, integrity, and availability. This paper will show that the Traditional Database Security, which has focused primarily on creating user accounts and managing user privileges to database objects are not enough to protect data confidentiality, integrity, and availability. This paper is a compilation of different journals, articles and classroom discussions ...

  1. Development of an Integrated Natural Barrier Database System for Site Evaluation of a Deep Geologic Repository in Korea - 13527

    International Nuclear Information System (INIS)

    Jung, Haeryong; Lee, Eunyong; Jeong, YiYeong; Lee, Jeong-Hwan

    2013-01-01

    Korea Radioactive-waste Management Corporation (KRMC) established in 2009 has started a new project to collect information on long-term stability of deep geological environments on the Korean Peninsula. The information has been built up in the integrated natural barrier database system available on web (www.deepgeodisposal.kr). The database system also includes socially and economically important information, such as land use, mining area, natural conservation area, population density, and industrial complex, because some of this information is used as exclusionary criteria during the site selection process for a deep geological repository for safe and secure containment and isolation of spent nuclear fuel and other long-lived radioactive waste in Korea. Although the official site selection process has not been started yet in Korea, current integrated natural barrier database system and socio-economic database is believed that the database system will be effectively utilized to narrow down the number of sites where future investigation is most promising in the site selection process for a deep geological repository and to enhance public acceptance by providing readily-available relevant scientific information on deep geological environments in Korea. (authors)

  2. Distributed Database Semantic Integration of Wireless Sensor Network to Access the Environmental Monitoring System

    Directory of Open Access Journals (Sweden)

    Ubaidillah Umar

    2018-06-01

    Full Text Available A wireless sensor network (WSN works continuously to gather information from sensors that generate large volumes of data to be handled and processed by applications. Current efforts in sensor networks focus more on networking and development services for a variety of applications and less on processing and integrating data from heterogeneous sensors. There is an increased need for information to become shareable across different sensors, database platforms, and applications that are not easily implemented in traditional database systems. To solve the issue of these large amounts of data from different servers and database platforms (including sensor data, a semantic sensor web service platform is needed to enable a machine to extract meaningful information from the sensor’s raw data. This additionally helps to minimize and simplify data processing and to deduce new information from existing data. This paper implements a semantic web data platform (SWDP to manage the distribution of data sensors based on the semantic database system. SWDP uses sensors for temperature, humidity, carbon monoxide, carbon dioxide, luminosity, and noise. The system uses the Sesame semantic web database for data processing and a WSN to distribute, minimize, and simplify information processing. The sensor nodes are distributed in different places to collect sensor data. The SWDP generates context information in the form of a resource description framework. The experiment results demonstrate that the SWDP is more efficient than the traditional database system in terms of memory usage and processing time.

  3. ChlamyCyc: an integrative systems biology database and web-portal for Chlamydomonas reinhardtii

    OpenAIRE

    May, P.; Christian, J.O.; Kempa, S.; Walther, D.

    2009-01-01

    Abstract Background The unicellular green alga Chlamydomonas reinhardtii is an important eukaryotic model organism for the study of photosynthesis and plant growth. In the era of modern high-throughput technologies there is an imperative need to integrate large-scale data sets from high-throughput experimental techniques using computational methods and database resources to provide comprehensive information about the molecular and cellular organization of a single organism. Results In the fra...

  4. Network and Database Security: Regulatory Compliance, Network, and Database Security - A Unified Process and Goal

    Directory of Open Access Journals (Sweden)

    Errol A. Blake

    2007-12-01

    Full Text Available Database security has evolved; data security professionals have developed numerous techniques and approaches to assure data confidentiality, integrity, and availability. This paper will show that the Traditional Database Security, which has focused primarily on creating user accounts and managing user privileges to database objects are not enough to protect data confidentiality, integrity, and availability. This paper is a compilation of different journals, articles and classroom discussions will focus on unifying the process of securing data or information whether it is in use, in storage or being transmitted. Promoting a change in Database Curriculum Development trends may also play a role in helping secure databases. This paper will take the approach that if one make a conscientious effort to unifying the Database Security process, which includes Database Management System (DBMS selection process, following regulatory compliances, analyzing and learning from the mistakes of others, Implementing Networking Security Technologies, and Securing the Database, may prevent database breach.

  5. The NCBI BioSystems database.

    Science.gov (United States)

    Geer, Lewis Y; Marchler-Bauer, Aron; Geer, Renata C; Han, Lianyi; He, Jane; He, Siqian; Liu, Chunlei; Shi, Wenyao; Bryant, Stephen H

    2010-01-01

    The NCBI BioSystems database, found at http://www.ncbi.nlm.nih.gov/biosystems/, centralizes and cross-links existing biological systems databases, increasing their utility and target audience by integrating their pathways and systems into NCBI resources. This integration allows users of NCBI's Entrez databases to quickly categorize proteins, genes and small molecules by metabolic pathway, disease state or other BioSystem type, without requiring time-consuming inference of biological relationships from the literature or multiple experimental datasets.

  6. A Comprehensive Database and Analysis Framework To Incorporate Multiscale Data Types and Enable Integrated Analysis of Bioactive Polyphenols.

    Science.gov (United States)

    Ho, Lap; Cheng, Haoxiang; Wang, Jun; Simon, James E; Wu, Qingli; Zhao, Danyue; Carry, Eileen; Ferruzzi, Mario G; Faith, Jeremiah; Valcarcel, Breanna; Hao, Ke; Pasinetti, Giulio M

    2018-03-05

    The development of a given botanical preparation for eventual clinical application requires extensive, detailed characterizations of the chemical composition, as well as the biological availability, biological activity, and safety profiles of the botanical. These issues are typically addressed using diverse experimental protocols and model systems. Based on this consideration, in this study we established a comprehensive database and analysis framework for the collection, collation, and integrative analysis of diverse, multiscale data sets. Using this framework, we conducted an integrative analysis of heterogeneous data from in vivo and in vitro investigation of a complex bioactive dietary polyphenol-rich preparation (BDPP) and built an integrated network linking data sets generated from this multitude of diverse experimental paradigms. We established a comprehensive database and analysis framework as well as a systematic and logical means to catalogue and collate the diverse array of information gathered, which is securely stored and added to in a standardized manner to enable fast query. We demonstrated the utility of the database in (1) a statistical ranking scheme to prioritize response to treatments and (2) in depth reconstruction of functionality studies. By examination of these data sets, the system allows analytical querying of heterogeneous data and the access of information related to interactions, mechanism of actions, functions, etc., which ultimately provide a global overview of complex biological responses. Collectively, we present an integrative analysis framework that leads to novel insights on the biological activities of a complex botanical such as BDPP that is based on data-driven characterizations of interactions between BDPP-derived phenolic metabolites and their mechanisms of action, as well as synergism and/or potential cancellation of biological functions. Out integrative analytical approach provides novel means for a systematic integrative

  7. Bone Marrow Stromal Antigen 2 Is a Novel Plasma Biomarker and Prognosticator for Colorectal Carcinoma: A Secretome-Based Verification Study

    Directory of Open Access Journals (Sweden)

    Sum-Fu Chiang

    2015-01-01

    Full Text Available Background. The cancer cell secretome has been recognized as a valuable reservoir for identifying novel serum/plasma biomarkers for different cancers, including colorectal cancer (CRC. This study aimed to verify four CRC cell-secreted proteins (tumor-associated calcium signal transducer 2/trophoblast cell surface antigen 2 (TACSTD2/TROP2, tetraspanin-6 (TSPAN6, bone marrow stromal antigen 2 (BST2, and tumor necrosis factor receptor superfamily member 16 (NGFR as potential plasma CRC biomarkers. Methods. The study population comprises 152 CRC patients and 152 controls. Target protein levels in plasma and tissue samples were assessed by ELISA and immunohistochemistry, respectively. Results. Among the four candidate proteins examined by ELISA in a small sample set, only BST2 showed significantly elevated plasma levels in CRC patients versus controls. Immunohistochemical analysis revealed the overexpression of BST2 in CRC tissues, and higher BST2 expression levels correlated with poorer 5-year survival (46.47% versus 65.57%; p=0.044. Further verification confirmed the elevated plasma BST2 levels in CRC patients (2.35 ± 0.13 ng/mL versus controls (1.04 ± 0.03 ng/mL (p<0.01, with an area under the ROC curve (AUC being 0.858 comparable to that of CEA (0.867. Conclusion. BST2, a membrane protein selectively detected in CRC cell secretome, may be a novel plasma biomarker and prognosticator for CRC.

  8. An Interoperable Cartographic Database

    OpenAIRE

    Slobodanka Ključanin; Zdravko Galić

    2007-01-01

    The concept of producing a prototype of interoperable cartographic database is explored in this paper, including the possibilities of integration of different geospatial data into the database management system and their visualization on the Internet. The implementation includes vectorization of the concept of a single map page, creation of the cartographic database in an object-relation database, spatial analysis, definition and visualization of the database content in the form of a map on t...

  9. An Interoperable Cartographic Database

    Directory of Open Access Journals (Sweden)

    Slobodanka Ključanin

    2007-05-01

    Full Text Available The concept of producing a prototype of interoperable cartographic database is explored in this paper, including the possibilities of integration of different geospatial data into the database management system and their visualization on the Internet. The implementation includes vectorization of the concept of a single map page, creation of the cartographic database in an object-relation database, spatial analysis, definition and visualization of the database content in the form of a map on the Internet. 

  10. Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.

    Science.gov (United States)

    Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming

    2017-01-01

    Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici ( Pst ), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapid evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in seretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76 . We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst . The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst , which will enable us to understand molecular mechanisms

  11. BioWarehouse: a bioinformatics database warehouse toolkit.

    Science.gov (United States)

    Lee, Thomas J; Pouliot, Yannick; Wagner, Valerie; Gupta, Priyanka; Stringer-Calvert, David W J; Tenenbaum, Jessica D; Karp, Peter D

    2006-03-23

    This article addresses the problem of interoperation of heterogeneous bioinformatics databases. We introduce BioWarehouse, an open source toolkit for constructing bioinformatics database warehouses using the MySQL and Oracle relational database managers. BioWarehouse integrates its component databases into a common representational framework within a single database management system, thus enabling multi-database queries using the Structured Query Language (SQL) but also facilitating a variety of database integration tasks such as comparative analysis and data mining. BioWarehouse currently supports the integration of a pathway-centric set of databases including ENZYME, KEGG, and BioCyc, and in addition the UniProt, GenBank, NCBI Taxonomy, and CMR databases, and the Gene Ontology. Loader tools, written in the C and JAVA languages, parse and load these databases into a relational database schema. The loaders also apply a degree of semantic normalization to their respective source data, decreasing semantic heterogeneity. The schema supports the following bioinformatics datatypes: chemical compounds, biochemical reactions, metabolic pathways, proteins, genes, nucleic acid sequences, features on protein and nucleic-acid sequences, organisms, organism taxonomies, and controlled vocabularies. As an application example, we applied BioWarehouse to determine the fraction of biochemically characterized enzyme activities for which no sequences exist in the public sequence databases. The answer is that no sequence exists for 36% of enzyme activities for which EC numbers have been assigned. These gaps in sequence data significantly limit the accuracy of genome annotation and metabolic pathway prediction, and are a barrier for metabolic engineering. Complex queries of this type provide examples of the value of the data warehousing approach to bioinformatics research. BioWarehouse embodies significant progress on the database integration problem for bioinformatics.

  12. Integration of an Evidence Base into a Probabilistic Risk Assessment Model. The Integrated Medical Model Database: An Organized Evidence Base for Assessing In-Flight Crew Health Risk and System Design

    Science.gov (United States)

    Saile, Lynn; Lopez, Vilma; Bickham, Grandin; FreiredeCarvalho, Mary; Kerstman, Eric; Byrne, Vicky; Butler, Douglas; Myers, Jerry; Walton, Marlei

    2011-01-01

    This slide presentation reviews the Integrated Medical Model (IMM) database, which is an organized evidence base for assessing in-flight crew health risk. The database is a relational database accessible to many people. The database quantifies the model inputs by a ranking based on the highest value of the data as Level of Evidence (LOE) and the quality of evidence (QOE) score that provides an assessment of the evidence base for each medical condition. The IMM evidence base has already been able to provide invaluable information for designers, and for other uses.

  13. Type conversion of secretomes in a 3D TAM2 and HCC cell co-culture system and functional importance of CXCL2 in HCC.

    Science.gov (United States)

    Lu, Yu; Li, Shan; Ma, Liping; Li, Yan; Zhang, Xiaolian; Peng, Qiliu; Mo, Cuiju; Huang, Li; Qin, Xue; Liu, Yinkun

    2016-04-27

    Macrophages play important roles in the tumor microenvironment, driving cancer progression and metastasis, particularly in hepatocellular carcinoma (HCC). However, few studies have assessed the exact secretome composition in HCC. In the present study, the impact of different phenotype of macrophages on HCC cells was investigated. Alternatively activated macrophages (M2) were found to significantly increase the proliferation, migration, and invasion abilities of SMMC7721 cells (all P cultured with SMMC7721 cells to reconstruct the tumor microenvironment. Conditioned medium from 3D single cultures of M2, SMMC7721 cells, and their co-culture system were analyzed using quantitative proteomics via iTRAQ labeling combined with mass spectrometric analysis. Secretome analysis revealed a total of 159 differential secreted proteins in the co-culture system compared to the single culture systems, with 63 being up-regulated (>1.3-fold) and 96 down-regulated (culture system and HCC tissues, and was selected for further investigation. Functional effects data suggested that recombinant human CXCL2 significantly enhanced the migration, invasion ability of SMMC7721 cells, and weakened adhesion ability. While CXCL2 neutralization and CXCR2 blockage significantly inhibited the effects of CXCL2 on SMMC7721 cells, indicating that CXCL2 may play pivotal role in HCC metastasis.

  14. Bio-optical data integration based on a 4 D database system approach

    Science.gov (United States)

    Imai, N. N.; Shimabukuro, M. H.; Carmo, A. F. C.; Alcantara, E. H.; Rodrigues, T. W. P.; Watanabe, F. S. Y.

    2015-04-01

    Bio-optical characterization of water bodies requires spatio-temporal data about Inherent Optical Properties and Apparent Optical Properties which allow the comprehension of underwater light field aiming at the development of models for monitoring water quality. Measurements are taken to represent optical properties along a column of water, and then the spectral data must be related to depth. However, the spatial positions of measurement may differ since collecting instruments vary. In addition, the records should not refer to the same wavelengths. Additional difficulty is that distinct instruments store data in different formats. A data integration approach is needed to make these large and multi source data sets suitable for analysis. Thus, it becomes possible, even automatically, semi-empirical models evaluation, preceded by preliminary tasks of quality control. In this work it is presented a solution, in the stated scenario, based on spatial - geographic - database approach with the adoption of an object relational Database Management System - DBMS - due to the possibilities to represent all data collected in the field, in conjunction with data obtained by laboratory analysis and Remote Sensing images that have been taken at the time of field data collection. This data integration approach leads to a 4D representation since that its coordinate system includes 3D spatial coordinates - planimetric and depth - and the time when each data was taken. It was adopted PostgreSQL DBMS extended by PostGIS module to provide abilities to manage spatial/geospatial data. It was developed a prototype which has the mainly tools an analyst needs to prepare the data sets for analysis.

  15. KALIMER design database development and operation manual

    International Nuclear Information System (INIS)

    Jeong, Kwan Seong; Hahn, Do Hee; Lee, Yong Bum; Chang, Won Pyo

    2000-12-01

    KALIMER Design Database is developed to utilize the integration management for Liquid Metal Reactor Design Technology Development using Web Applications. KALIMER Design database consists of Results Database, Inter-Office Communication (IOC), 3D CAD database, Team Cooperation System, and Reserved Documents. Results Database is a research results database for mid-term and long-term nuclear R and D. IOC is a linkage control system inter sub project to share and integrate the research results for KALIMER. 3D CAD Database is a schematic design overview for KALIMER. Team Cooperation System is to inform team member of research cooperation and meetings. Finally, KALIMER Reserved Documents is developed to manage collected data and several documents since project accomplishment

  16. KALIMER design database development and operation manual

    Energy Technology Data Exchange (ETDEWEB)

    Jeong, Kwan Seong; Hahn, Do Hee; Lee, Yong Bum; Chang, Won Pyo

    2000-12-01

    KALIMER Design Database is developed to utilize the integration management for Liquid Metal Reactor Design Technology Development using Web Applications. KALIMER Design database consists of Results Database, Inter-Office Communication (IOC), 3D CAD database, Team Cooperation System, and Reserved Documents. Results Database is a research results database for mid-term and long-term nuclear R and D. IOC is a linkage control system inter sub project to share and integrate the research results for KALIMER. 3D CAD Database is a schematic design overview for KALIMER. Team Cooperation System is to inform team member of research cooperation and meetings. Finally, KALIMER Reserved Documents is developed to manage collected data and several documents since project accomplishment.

  17. Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins

    Directory of Open Access Journals (Sweden)

    Bradley Michael E

    2006-02-01

    Full Text Available Abstract Background When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use. Results The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1 multiple sequence alignments, 2 mapping of alignment sites to crystal structure sites, 3 phylogenetic trees, 4 inferred ancestral sequences at internal tree nodes, and 5 amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures. Conclusion We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural

  18. BioWarehouse: a bioinformatics database warehouse toolkit

    Directory of Open Access Journals (Sweden)

    Stringer-Calvert David WJ

    2006-03-01

    Full Text Available Abstract Background This article addresses the problem of interoperation of heterogeneous bioinformatics databases. Results We introduce BioWarehouse, an open source toolkit for constructing bioinformatics database warehouses using the MySQL and Oracle relational database managers. BioWarehouse integrates its component databases into a common representational framework within a single database management system, thus enabling multi-database queries using the Structured Query Language (SQL but also facilitating a variety of database integration tasks such as comparative analysis and data mining. BioWarehouse currently supports the integration of a pathway-centric set of databases including ENZYME, KEGG, and BioCyc, and in addition the UniProt, GenBank, NCBI Taxonomy, and CMR databases, and the Gene Ontology. Loader tools, written in the C and JAVA languages, parse and load these databases into a relational database schema. The loaders also apply a degree of semantic normalization to their respective source data, decreasing semantic heterogeneity. The schema supports the following bioinformatics datatypes: chemical compounds, biochemical reactions, metabolic pathways, proteins, genes, nucleic acid sequences, features on protein and nucleic-acid sequences, organisms, organism taxonomies, and controlled vocabularies. As an application example, we applied BioWarehouse to determine the fraction of biochemically characterized enzyme activities for which no sequences exist in the public sequence databases. The answer is that no sequence exists for 36% of enzyme activities for which EC numbers have been assigned. These gaps in sequence data significantly limit the accuracy of genome annotation and metabolic pathway prediction, and are a barrier for metabolic engineering. Complex queries of this type provide examples of the value of the data warehousing approach to bioinformatics research. Conclusion BioWarehouse embodies significant progress on the

  19. The Secretome of Bone Marrow and Wharton Jelly Derived Mesenchymal Stem Cells Induces Differentiation and Neurite Outgrowth in SH-SY5Y Cells

    OpenAIRE

    Ana O. Pires; Andreia Neves-Carvalho; Nuno Sousa; António J. Salgado

    2014-01-01

    The goal of this study was to determine and compare the effects of the secretome of mesenchymal stem cells (MSCs) isolated from human bone-marrow (BMSCs) and the Wharton jelly surrounding the vein and arteries of the umbilical cord (human umbilical cord perivascular cells (HUCPVCs)) on the survival and differentiation of a human neuroblastoma cell line (SH-SY5Y). For this purpose, SH-SY5Y cells were differentiated with conditioned media (CM) from the MSCs populations referred above. Retinoic ...

  20. Whistleblowing: An integrative literature review of data-based studies involving nurses.

    Science.gov (United States)

    Jackson, Debra; Hickman, Louise D; Hutchinson, Marie; Andrew, Sharon; Smith, James; Potgieter, Ingrid; Cleary, Michelle; Peters, Kath

    2014-01-01

    Abstract Aim: To summarise and critique the research literature about whistleblowing and nurses. Whistleblowing is identified as a crucial issue in maintenance of healthcare standards and nurses are frequently involved in whistleblowing events. Despite the importance of this issue, to our knowledge an evaluation of this body of the data-based literature has not been undertaken. An integrative literature review approach was used to summarise and critique the research literature. A comprehensive search of five databases including Medline, CINAHL, PubMed and Health Science: Nursing/Academic Edition, and Google, were searched using terms including: 'Whistleblow*,' 'nurs*.' In addition, relevant journals were examined, as well as reference lists of retrieved papers. Papers published during the years 2007-2013 were selected for inclusion. Fifteen papers were identified, capturing data from nurses in seven countries. The findings in this review demonstrate a growing body of research for the nursing profession at large to engage and respond appropriately to issues involving suboptimal patient care or organisational wrongdoing. Nursing plays a key role in maintaining practice standards and in reporting care that is unacceptable although the repercussions to nurses who raise concerns are insupportable. Overall, whistleblowing and how it influences the individual, their family, work colleagues, nursing practice and policy overall, requires further national and international research attention.

  1. Rapid isolation of bone marrow mesenchymal stromal cells using integrated centrifuge-based technology.

    Science.gov (United States)

    Meppelink, Amanda M; Wang, Xing-Hua; Bradica, Gino; Barron, Kathryn; Hiltz, Kathleen; Liu, Xiang-Hong; Goldman, Scott M; Vacanti, Joseph P; Keating, Armand; Hoganson, David M

    2016-06-01

    The use of bone marrow-derived mesenchymal stromal cells (MSCs) in cell-based therapies is currently being developed for a number of diseases. Thus far, the clinical results have been inconclusive and variable, in part because of the variety of cell isolation procedures and culture conditions used in each study. A new isolation technique that streamlines the method of concentration and demands less time and attention could provide clinical and economic advantages compared with current methodologies. In this study, we evaluated the concentrating capability of an integrated centrifuge-based technology compared with standard Ficoll isolation. MSCs were concentrated from bone marrow aspirate using the new device and the Ficoll method. The isolation capabilities of the device and the growth characteristics, secretome production, and differentiation capacity of the derived cells were determined. The new MSC isolation device concentrated the bone marrow in 90 seconds and resulted in a mononuclear cell yield 10-fold higher and with a twofold increase in cell retention compared with Ficoll. The cells isolated using the device were shown to exhibit similar morphology and functional activity as assessed by growth curves and secretome production compared to the Ficoll-isolated cells. The surface marker and trilineage differentiation profile of the device-isolated cells was consistent with the known profile of MSCs. The faster time to isolation and greater cell yield of the integrated centrifuge-based technology may make this an improved approach for MSC isolation from bone marrow aspirates. Copyright © 2016 International Society for Cellular Therapy. Published by Elsevier Inc. All rights reserved.

  2. Integrated Storage and Management of Vector and Raster Data Based on Oracle Database

    Directory of Open Access Journals (Sweden)

    WU Zheng

    2017-05-01

    Full Text Available At present, there are many problems in the storage and management of multi-source heterogeneous spatial data, such as the difficulty of transferring, the lack of unified storage and the low efficiency. By combining relational database and spatial data engine technology, an approach for integrated storage and management of vector and raster data is proposed on the basis of Oracle in this paper. This approach establishes an integrated storage model on vector and raster data and optimizes the retrieval mechanism at first, then designs a framework for the seamless data transfer, finally realizes the unified storage and efficient management of multi-source heterogeneous data. By comparing experimental results with the international leading similar software ArcSDE, it is proved that the proposed approach has higher data transfer performance and better query retrieval efficiency.

  3. Peptidomics and Secretomics of the Mammalian Peripheral Sensory-Motor System

    Science.gov (United States)

    Tillmaand, Emily G.; Yang, Ning; Kindt, Callie A. C.; Romanova, Elena V.; Rubakhin, Stanislav S.; Sweedler, Jonathan V.

    2015-12-01

    The dorsal root ganglion (DRG) and its anatomically and functionally associated spinal nerve and ventral and dorsal roots are important components of the peripheral sensory-motor system in mammals. The cells within these structures use a number of peptides as intercellular signaling molecules. We performed a variety of mass spectrometry (MS)-based characterizations of peptides contained within and secreted from these structures, and from isolated and cultured DRG cells. Liquid chromatography-Fourier transform MS was utilized in DRG and nerve peptidome analysis. In total, 2724 peptides from 296 proteins were identified in tissue extracts. Neuropeptides are among those detected, including calcitonin gene-related peptide I, little SAAS, and known hemoglobin-derived peptides. Solid phase extraction combined with direct matrix-assisted laser desorption/ionization time-of-flight MS was employed to investigate the secretome of these structures. A number of peptides were detected in the releasate from semi-intact preparations of DRGs and associated nerves, including neurofilament- and myelin basic protein-related peptides. A smaller set of analytes was observed in releasates from cultured DRG neurons. The peptide signals observed in the releasates have been mass-matched to those characterized and identified in homogenates of entire DRGs and associated nerves. This data aids our understanding of the chemical composition of the mammalian peripheral sensory-motor system, which is involved in key physiological functions such as nociception, thermoreception, itch sensation, and proprioception.

  4. The Future of Asset Management for Human Space Exploration: Supply Classification and an Integrated Database

    Science.gov (United States)

    Shull, Sarah A.; Gralla, Erica L.; deWeck, Olivier L.; Shishko, Robert

    2006-01-01

    One of the major logistical challenges in human space exploration is asset management. This paper presents observations on the practice of asset management in support of human space flight to date and discusses a functional-based supply classification and a framework for an integrated database that could be used to improve asset management and logistics for human missions to the Moon, Mars and beyond.

  5. TOMATOMICS: A Web Database for Integrated Omics Information in Tomato

    KAUST Repository

    Kudo, Toru; Kobayashi, Masaaki; Terashima, Shin; Katayama, Minami; Ozaki, Soichi; Kanno, Maasa; Saito, Misa; Yokoyama, Koji; Ohyanagi, Hajime; Aoki, Koh; Kubo, Yasutaka; Yano, Kentaro

    2016-01-01

    Solanum lycopersicum (tomato) is an important agronomic crop and a major model fruit-producing plant. To facilitate basic and applied research, comprehensive experimental resources and omics information on tomato are available following their development. Mutant lines and cDNA clones from a dwarf cultivar, Micro-Tom, are two of these genetic resources. Large-scale sequencing data for ESTs and full-length cDNAs from Micro-Tom continue to be gathered. In conjunction with information on the reference genome sequence of another cultivar, Heinz 1706, the Micro-Tom experimental resources have facilitated comprehensive functional analyses. To enhance the efficiency of acquiring omics information for tomato biology, we have integrated the information on the Micro-Tom experimental resources and the Heinz 1706 genome sequence. We have also inferred gene structure by comparison of sequences between the genome of Heinz 1706 and the transcriptome, which are comprised of Micro-Tom full-length cDNAs and Heinz 1706 RNA-seq data stored in the KaFTom and Sequence Read Archive databases. In order to provide large-scale omics information with streamlined connectivity we have developed and maintain a web database TOMATOMICS (http://bioinf.mind.meiji.ac.jp/tomatomics/). In TOMATOMICS, access to the information on the cDNA clone resources, full-length mRNA sequences, gene structures, expression profiles and functional annotations of genes is available through search functions and the genome browser, which has an intuitive graphical interface.

  6. TOMATOMICS: A Web Database for Integrated Omics Information in Tomato

    KAUST Repository

    Kudo, Toru

    2016-11-29

    Solanum lycopersicum (tomato) is an important agronomic crop and a major model fruit-producing plant. To facilitate basic and applied research, comprehensive experimental resources and omics information on tomato are available following their development. Mutant lines and cDNA clones from a dwarf cultivar, Micro-Tom, are two of these genetic resources. Large-scale sequencing data for ESTs and full-length cDNAs from Micro-Tom continue to be gathered. In conjunction with information on the reference genome sequence of another cultivar, Heinz 1706, the Micro-Tom experimental resources have facilitated comprehensive functional analyses. To enhance the efficiency of acquiring omics information for tomato biology, we have integrated the information on the Micro-Tom experimental resources and the Heinz 1706 genome sequence. We have also inferred gene structure by comparison of sequences between the genome of Heinz 1706 and the transcriptome, which are comprised of Micro-Tom full-length cDNAs and Heinz 1706 RNA-seq data stored in the KaFTom and Sequence Read Archive databases. In order to provide large-scale omics information with streamlined connectivity we have developed and maintain a web database TOMATOMICS (http://bioinf.mind.meiji.ac.jp/tomatomics/). In TOMATOMICS, access to the information on the cDNA clone resources, full-length mRNA sequences, gene structures, expression profiles and functional annotations of genes is available through search functions and the genome browser, which has an intuitive graphical interface.

  7. Human Ageing Genomic Resources: Integrated databases and tools for the biology and genetics of ageing

    Science.gov (United States)

    Tacutu, Robi; Craig, Thomas; Budovsky, Arie; Wuttke, Daniel; Lehmann, Gilad; Taranukha, Dmitri; Costa, Joana; Fraifeld, Vadim E.; de Magalhães, João Pedro

    2013-01-01

    The Human Ageing Genomic Resources (HAGR, http://genomics.senescence.info) is a freely available online collection of research databases and tools for the biology and genetics of ageing. HAGR features now several databases with high-quality manually curated data: (i) GenAge, a database of genes associated with ageing in humans and model organisms; (ii) AnAge, an extensive collection of longevity records and complementary traits for >4000 vertebrate species; and (iii) GenDR, a newly incorporated database, containing both gene mutations that interfere with dietary restriction-mediated lifespan extension and consistent gene expression changes induced by dietary restriction. Since its creation about 10 years ago, major efforts have been undertaken to maintain the quality of data in HAGR, while further continuing to develop, improve and extend it. This article briefly describes the content of HAGR and details the major updates since its previous publications, in terms of both structure and content. The completely redesigned interface, more intuitive and more integrative of HAGR resources, is also presented. Altogether, we hope that through its improvements, the current version of HAGR will continue to provide users with the most comprehensive and accessible resources available today in the field of biogerontology. PMID:23193293

  8. A development and integration of the concentration database for relative method, k0 method and absolute method in instrumental neutron activation analysis using Microsoft Access

    International Nuclear Information System (INIS)

    Hoh Siew Sin

    2012-01-01

    Instrumental Neutron Activation Analysis (INAA) is offen used to determine and calculate the concentration of an element in the sample by the National University of Malaysia, especially students of Nuclear Science Program. The lack of a database service leads consumers to take longer time to calculate the concentration of an element in the sample. This is because we are more dependent on software that is developed by foreign researchers which are costly. To overcome this problem, a study has been carried out to build an INAA database software. The objective of this study is to build a database software that help the users of INAA in Relative Method and Absolute Method for calculating the element concentration in the sample using Microsoft Excel 2010 and Microsoft Access 2010. The study also integrates k 0 data, k 0 Concent and k 0 -Westcott to execute and complete the system. After the integration, a study was conducted to test the effectiveness of the database software by comparing the concentrations between the experiments and in the database. Triple Bare Monitor Zr-Au and Cr-Mo-Au were used in Abs-INAA as monitor to determine the thermal to epithermal neutron flux ratio (f). Calculations involved in determining the concentration are the net peak area (N p ), the measurement time (t m ), the irradiation time (t irr ), k-factor (k), thermal to epithermal neutron flux ratio (f), the parameters of the neutron flux distribution epithermal (α) and detection efficiency (ε p ). For Com-INAA databases, reference material IAEA-375 Soil was used to calculate the concentration of elements in the sample. CRM, SRM are also used in this database. After the INAA database integration, a verification process was to examine the effectiveness of the Abs-INAA was carried out by comparing the sample concentration between the in database and the experiment. The result of the experimental concentration value of INAA database software performed with high accuracy and precision. ICC

  9. Secretome analysis defines the major role of SecDF in Staphylococcus aureus virulence.

    Directory of Open Access Journals (Sweden)

    Chantal Quiblier

    Full Text Available The Sec pathway plays a prominent role in protein export and membrane insertion, including the secretion of major bacterial virulence determinants. The accessory Sec constituent SecDF has been proposed to contribute to protein export. Deletion of Staphylococcus aureus secDF has previously been shown to reduce resistance, to alter cell separation, and to change the expression of certain virulence factors. To analyse the impact of the secDF deletion in S. aureus on protein secretion, a quantitative secretome analysis was performed. Numerous Sec signal containing proteins involved in virulence were found to be decreased in the supernatant of the secDF mutant. However, two Sec-dependent hydrolases were increased in comparison to the wild type, suggesting additional indirect, regulatory effects to occur upon deletion of secDF. Adhesion, invasion, and cytotoxicity of the secDF mutant were reduced in human umbilical vein endothelial cells. Virulence was significantly reduced using a Galleria mellonella insect model. Altogether, SecDF is a promising therapeutic target for controlling S. aureus infections.

  10. Database and applications security integrating information security and data management

    CERN Document Server

    Thuraisingham, Bhavani

    2005-01-01

    This is the first book to provide an in-depth coverage of all the developments, issues and challenges in secure databases and applications. It provides directions for data and application security, including securing emerging applications such as bioinformatics, stream information processing and peer-to-peer computing. Divided into eight sections, each of which focuses on a key concept of secure databases and applications, this book deals with all aspects of technology, including secure relational databases, inference problems, secure object databases, secure distributed databases and emerging

  11. LandIT Database

    DEFF Research Database (Denmark)

    Iftikhar, Nadeem; Pedersen, Torben Bach

    2010-01-01

    and reporting purposes. This paper presents the LandIT database; which is result of the LandIT project, which refers to an industrial collaboration project that developed technologies for communication and data integration between farming devices and systems. The LandIT database in principal is based...... on the ISOBUS standard; however the standard is extended with additional requirements, such as gradual data aggregation and flexible exchange of farming data. This paper describes the conceptual and logical schemas of the proposed database based on a real-life farming case study....

  12. Integrating query of relational and textual data in clinical databases: a case study.

    Science.gov (United States)

    Fisk, John M; Mutalik, Pradeep; Levin, Forrest W; Erdos, Joseph; Taylor, Caroline; Nadkarni, Prakash

    2003-01-01

    The authors designed and implemented a clinical data mart composed of an integrated information retrieval (IR) and relational database management system (RDBMS). Using commodity software, which supports interactive, attribute-centric text and relational searches, the mart houses 2.8 million documents that span a five-year period and supports basic IR features such as Boolean searches, stemming, and proximity and fuzzy searching. Results are relevance-ranked using either "total documents per patient" or "report type weighting." Non-curated medical text has a significant degree of malformation with respect to spelling and punctuation, which creates difficulties for text indexing and searching. Presently, the IR facilities of RDBMS packages lack the features necessary to handle such malformed text adequately. A robust IR+RDBMS system can be developed, but it requires integrating RDBMSs with third-party IR software. RDBMS vendors need to make their IR offerings more accessible to non-programmers.

  13. Integrating pattern mining in relational databases

    NARCIS (Netherlands)

    Calders, T.; Goethals, B.; Prado, A.; Fürnkranz, J.; Scheffer, T.; Spiliopoulou, M.

    2006-01-01

    Almost a decade ago, Imielinski and Mannila introduced the notion of Inductive Databases to manage KDD applications just as DBMSs successfully manage business applications. The goal is to follow one of the key DBMS paradigms: building optimizing compilers for ad hoc queries. During the past decade,

  14. DBGC: A Database of Human Gastric Cancer

    Science.gov (United States)

    Wang, Chao; Zhang, Jun; Cai, Mingdeng; Zhu, Zhenggang; Gu, Wenjie; Yu, Yingyan; Zhang, Xiaoyan

    2015-01-01

    The Database of Human Gastric Cancer (DBGC) is a comprehensive database that integrates various human gastric cancer-related data resources. Human gastric cancer-related transcriptomics projects, proteomics projects, mutations, biomarkers and drug-sensitive genes from different sources were collected and unified in this database. Moreover, epidemiological statistics of gastric cancer patients in China and clinicopathological information annotated with gastric cancer cases were also integrated into the DBGC. We believe that this database will greatly facilitate research regarding human gastric cancer in many fields. DBGC is freely available at http://bminfor.tongji.edu.cn/dbgc/index.do PMID:26566288

  15. Brassica ASTRA: an integrated database for Brassica genomic research.

    Science.gov (United States)

    Love, Christopher G; Robinson, Andrew J; Lim, Geraldine A C; Hopkins, Clare J; Batley, Jacqueline; Barker, Gary; Spangenberg, German C; Edwards, David

    2005-01-01

    Brassica ASTRA is a public database for genomic information on Brassica species. The database incorporates expressed sequences with Swiss-Prot and GenBank comparative sequence annotation as well as secondary Gene Ontology (GO) annotation derived from the comparison with Arabidopsis TAIR GO annotations. Simple sequence repeat molecular markers are identified within resident sequences and mapped onto the closely related Arabidopsis genome sequence. Bacterial artificial chromosome (BAC) end sequences derived from the Multinational Brassica Genome Project are also mapped onto the Arabidopsis genome sequence enabling users to identify candidate Brassica BACs corresponding to syntenic regions of Arabidopsis. This information is maintained in a MySQL database with a web interface providing the primary means of interrogation. The database is accessible at http://hornbill.cspp.latrobe.edu.au.

  16. Data integration for European marine biodiversity research: creating a database on benthos and plankton to study large-scale patterns and long-term changes

    NARCIS (Netherlands)

    Vandepitte, L.; Vanhoorne, B.; Kraberg, A.; Anisimova, N.; Antoniadou, C.; Araújo, R.; Bartsch, I.; Beker, B.; Benedetti-Cecchi, L.; Bertocci, I.; Cochrane, S.J.; Cooper, K.; Craeymeersch, J.A.; Christou, E.; Crisp, D.J.; Dahle, S.; de Boissier, M.; De Kluijver, M.; Denisenko, S.; De Vito, D.; Duineveld, G.; Escaravage, V.L.; Fleischer, D.; Fraschetti, S.; Giangrande, A.; Heip, C.H.R.; Hummel, H.; Janas, U.; Karez, R.; Kedra, M.; Kingston, P.; Kuhlenkamp, R.; Libes, M.; Martens, P.; Mees, J.; Mieszkowska, N.; Mudrak, S.; Munda, I.; Orfanidis, S.; Orlando-Bonaca, M.; Palerud, R.; Rachor, E.; Reichert, K.; Rumohr, H.; Schiedek, D.; Schubert, P.; Sistermans, W.C.H.; Sousa Pinto, I.S.; Southward, A.J.; Terlizzi, A.; Tsiaga, E.; Van Beusekom, J.E.E.; Vanden Berghe, E.; Warzocha, J.; Wasmund, N.; Weslawski, J.M.; Widdicombe, C.; Wlodarska-Kowalczuk, M.; Zettler, M.L.

    2010-01-01

    The general aim of setting up a central database on benthos and plankton was to integrate long-, medium- and short-term datasets on marine biodiversity. Such a database makes it possible to analyse species assemblages and their changes on spatial and temporal scales across Europe. Data collation

  17. BIOSPIDA: A Relational Database Translator for NCBI.

    Science.gov (United States)

    Hagen, Matthew S; Lee, Eva K

    2010-11-13

    As the volume and availability of biological databases continue widespread growth, it has become increasingly difficult for research scientists to identify all relevant information for biological entities of interest. Details of nucleotide sequences, gene expression, molecular interactions, and three-dimensional structures are maintained across many different databases. To retrieve all necessary information requires an integrated system that can query multiple databases with minimized overhead. This paper introduces a universal parser and relational schema translator that can be utilized for all NCBI databases in Abstract Syntax Notation (ASN.1). The data models for OMIM, Entrez-Gene, Pubmed, MMDB and GenBank have been successfully converted into relational databases and all are easily linkable helping to answer complex biological questions. These tools facilitate research scientists to locally integrate databases from NCBI without significant workload or development time.

  18. MSblender: A probabilistic approach for integrating peptide identifications from multiple database search engines.

    Science.gov (United States)

    Kwon, Taejoon; Choi, Hyungwon; Vogel, Christine; Nesvizhskii, Alexey I; Marcotte, Edward M

    2011-07-01

    Shotgun proteomics using mass spectrometry is a powerful method for protein identification but suffers limited sensitivity in complex samples. Integrating peptide identifications from multiple database search engines is a promising strategy to increase the number of peptide identifications and reduce the volume of unassigned tandem mass spectra. Existing methods pool statistical significance scores such as p-values or posterior probabilities of peptide-spectrum matches (PSMs) from multiple search engines after high scoring peptides have been assigned to spectra, but these methods lack reliable control of identification error rates as data are integrated from different search engines. We developed a statistically coherent method for integrative analysis, termed MSblender. MSblender converts raw search scores from search engines into a probability score for every possible PSM and properly accounts for the correlation between search scores. The method reliably estimates false discovery rates and identifies more PSMs than any single search engine at the same false discovery rate. Increased identifications increment spectral counts for most proteins and allow quantification of proteins that would not have been quantified by individual search engines. We also demonstrate that enhanced quantification contributes to improve sensitivity in differential expression analyses.

  19. EVpedia: an integrated database of high-throughput data for systemic analyses of extracellular vesicles

    Directory of Open Access Journals (Sweden)

    Dae-Kyum Kim

    2013-03-01

    Full Text Available Secretion of extracellular vesicles is a general cellular activity that spans the range from simple unicellular organisms (e.g. archaea; Gram-positive and Gram-negative bacteria to complex multicellular ones, suggesting that this extracellular vesicle-mediated communication is evolutionarily conserved. Extracellular vesicles are spherical bilayered proteolipids with a mean diameter of 20–1,000 nm, which are known to contain various bioactive molecules including proteins, lipids, and nucleic acids. Here, we present EVpedia, which is an integrated database of high-throughput datasets from prokaryotic and eukaryotic extracellular vesicles. EVpedia provides high-throughput datasets of vesicular components (proteins, mRNAs, miRNAs, and lipids present on prokaryotic, non-mammalian eukaryotic, and mammalian extracellular vesicles. In addition, EVpedia also provides an array of tools, such as the search and browse of vesicular components, Gene Ontology enrichment analysis, network analysis of vesicular proteins and mRNAs, and a comparison of vesicular datasets by ortholog identification. Moreover, publications on extracellular vesicle studies are listed in the database. This free web-based database of EVpedia (http://evpedia.info might serve as a fundamental repository to stimulate the advancement of extracellular vesicle studies and to elucidate the novel functions of these complex extracellular organelles.

  20. Database Security: A Historical Perspective

    OpenAIRE

    Lesov, Paul

    2010-01-01

    The importance of security in database research has greatly increased over the years as most of critical functionality of the business and military enterprises became digitized. Database is an integral part of any information system and they often hold sensitive data. The security of the data depends on physical security, OS security and DBMS security. Database security can be compromised by obtaining sensitive data, changing data or degrading availability of the database. Over the last 30 ye...

  1. Secretome analysis of the thermophilic xylanase hyper-producer Thermomyces lanuginosus SSBP cultivated on corn cobs.

    Science.gov (United States)

    Winger, A M; Heazlewood, J L; Chan, L J G; Petzold, C J; Permaul, K; Singh, S

    2014-11-01

    Thermomyces lanuginosus is a thermophilic fungus known for its ability to produce industrially important enzymes including large amounts of xylanase, the key enzyme in hemicellulose hydrolysis. The secretome of T. lanuginosus SSBP was profiled by shotgun proteomics to elucidate important enzymes involved in hemicellulose saccharification and to characterise the presence of other industrially interesting enzymes. This study reproducibly identified a total of 74 proteins in the supernatant following growth on corn cobs. An analysis of proteins revealed nine glycoside hydrolase (GH) enzymes including xylanase GH11, β-xylosidase GH43, β-glucosidase GH3, α-galactosidase GH36 and trehalose hydrolase GH65. Two commercially produced Thermomyces enzymes, lipase and amylase, were also identified. In addition, other industrially relevant enzymes not currently explored in Thermomyces were identified including glutaminase, fructose-bisphosphate aldolase and cyanate hydratase. Overall, these data provide insight into the novel ability of a cellulase-free fungus to utilise lignocellulosic material, ultimately producing a number of enzymes important to various industrial processes.

  2. Genome and secretome analysis of the hemibiotrophic fungal pathogen, Moniliophthora roreri, which causes frosty pod rot disease of cacao: mechanisms of the biotrophic and necrotrophic phases.

    Science.gov (United States)

    Meinhardt, Lyndel W; Costa, Gustavo Gilson Lacerda; Thomazella, Daniela P T; Teixeira, Paulo José P L; Carazzolle, Marcelo Falsarella; Schuster, Stephan C; Carlson, John E; Guiltinan, Mark J; Mieczkowski, Piotr; Farmer, Andrew; Ramaraj, Thiruvarangan; Crozier, Jayne; Davis, Robert E; Shao, Jonathan; Melnick, Rachel L; Pereira, Gonçalo A G; Bailey, Bryan A

    2014-02-27

    The basidiomycete Moniliophthora roreri is the causal agent of Frosty pod rot (FPR) disease of cacao (Theobroma cacao), the source of chocolate, and FPR is one of the most destructive diseases of this important perennial crop in the Americas. This hemibiotroph infects only cacao pods and has an extended biotrophic phase lasting up to sixty days, culminating in plant necrosis and sporulation of the fungus without the formation of a basidiocarp. We sequenced and assembled 52.3 Mb into 3,298 contigs that represent the M. roreri genome. Of the 17,920 predicted open reading frames (OFRs), 13,760 were validated by RNA-Seq. Using read count data from RNA sequencing of cacao pods at 30 and 60 days post infection, differential gene expression was estimated for the biotrophic and necrotrophic phases of this plant-pathogen interaction. The sequencing data were used to develop a genome based secretome for the infected pods. Of the 1,535 genes encoding putative secreted proteins, 1,355 were expressed in the biotrophic and necrotrophic phases. Analysis of the data revealed secretome gene expression that correlated with infection and intercellular growth in the biotrophic phase and invasive growth and plant cellular death in the necrotrophic phase. Genome sequencing and RNA-Seq was used to determine and validate the Moniliophthora roreri genome and secretome. High sequence identity between Moniliophthora roreri genes and Moniliophthora perniciosa genes supports the taxonomic relationship with Moniliophthora perniciosa and the relatedness of this fungus to other basidiomycetes. Analysis of RNA-Seq data from infected plant tissues revealed differentially expressed genes in the biotrophic and necrotrophic phases. The secreted protein genes that were upregulated in the biotrophic phase are primarily associated with breakdown of the intercellular matrix and modification of the fungal mycelia, possibly to mask the fungus from plant defenses. Based on the transcriptome data, the

  3. JICST Factual Database(2)

    Science.gov (United States)

    Araki, Keisuke

    The computer programme, which builds atom-bond connection tables from nomenclatures, is developed. Chemical substances with their nomenclature and varieties of trivial names or experimental code numbers are inputted. The chemical structures of the database are stereospecifically stored and are able to be searched and displayed according to stereochemistry. Source data are from laws and regulations of Japan, RTECS of US and so on. The database plays a central role within the integrated fact database service of JICST and makes interrelational retrieval possible.

  4. Issues in Big-Data Database Systems

    Science.gov (United States)

    2014-06-01

    that big data will not be manageable using conventional relational database technology, and it is true that alternative paradigms, such as NoSQL systems...conventional relational database technology, and it is true that alternative paradigms, such as NoSQL systems and search engines, have much to offer...scale well, and because integration with external data sources is so difficult. NoSQL systems are more open to this integration, and provide excellent

  5. CyanOmics: an integrated database of omics for the model cyanobacterium Synechococcus sp. PCC 7002

    OpenAIRE

    Yang, Yaohua; Feng, Jie; Li, Tao; Ge, Feng; Zhao, Jindong

    2015-01-01

    Cyanobacteria are an important group of organisms that carry out oxygenic photosynthesis and play vital roles in both the carbon and nitrogen cycles of the Earth. The annotated genome of Synechococcus sp. PCC 7002, as an ideal model cyanobacterium, is available. A series of transcriptomic and proteomic studies of Synechococcus sp. PCC 7002 cells grown under different conditions have been reported. However, no database of such integrated omics studies has been constructed. Here we present Cyan...

  6. Planning the future of JPL's management and administrative support systems around an integrated database

    Science.gov (United States)

    Ebersole, M. M.

    1983-01-01

    JPL's management and administrative support systems have been developed piece meal and without consistency in design approach over the past twenty years. These systems are now proving to be inadequate to support effective management of tasks and administration of the Laboratory. New approaches are needed. Modern database management technology has the potential for providing the foundation for more effective administrative tools for JPL managers and administrators. Plans for upgrading JPL's management and administrative systems over a six year period evolving around the development of an integrated management and administrative data base are discussed.

  7. Quantitative Secretomic Analysis Identifies Extracellular Protein Factors That Modulate the Metastatic Phenotype of Non-Small Cell Lung Cancer.

    Science.gov (United States)

    Hu, Rongkuan; Huffman, Kenneth E; Chu, Michael; Zhang, Yajie; Minna, John D; Yu, Yonghao

    2016-02-05

    Lung cancer is the leading cause of cancer-related deaths for men and women in the United States, with non-small cell lung cancer (NSCLC) representing 85% of all diagnoses. Late stage detection, metastatic disease and lack of actionable biomarkers contribute to the high mortality rate. Proteins in the extracellular space are known to be critically involved in regulating every stage of the pathogenesis of lung cancer. To investigate the mechanism by which secreted proteins contribute to the pathogenesis of NSCLC, we performed quantitative secretomic analysis of two isogenic NSCLC cell lines (NCI-H1993 and NCI-H2073) and an immortalized human bronchial epithelial cell line (HBEC3-KT) as control. H1993 was derived from a chemo-naïve metastatic tumor, while H2073 was derived from the primary tumor after etoposide/cisplatin therapy. From the conditioned media of these three cell lines, we identified and quantified 2713 proteins, including a series of proteins involved in regulating inflammatory response, programmed cell death and cell motion. Gene Ontology (GO) analysis indicates that a number of proteins overexpressed in H1993 media are involved in biological processes related to cancer metastasis, including cell motion, cell-cell adhesion and cell migration. RNA interference (RNAi)-mediated knock down of a number of these proteins, including SULT2B1, CEACAM5, SPRR3, AGR2, S100P, and S100A14, leads to dramatically reduced migration of these cells. In addition, meta-analysis of survival data indicates NSCLC patients whose tumors express higher levels of several of these secreted proteins, including SULT2B1, CEACAM5, SPRR3, S100P, and S100A14, have a worse prognosis. Collectively, our results provide a potential molecular link between deregulated secretome and NSCLC cell migration/metastasis. In addition, the identification of these aberrantly secreted proteins might facilitate the development of biomarkers for early detection of this devastating disease.

  8. Object-oriented modeling and design of database federations

    NARCIS (Netherlands)

    Balsters, H.

    2003-01-01

    We describe a logical architecture and a general semantic framework for precise specification of so-called database federations. A database federation provides for tight coupling of a collection of heterogeneous component databases into a global integrated system. Our approach to database federation

  9. Secretomic Insight into Glucose Metabolism of Aspergillus brasiliensis in Solid-State Fermentation.

    Science.gov (United States)

    Volke-Sepulveda, Tania; Salgado-Bautista, Daniel; Bergmann, Carl; Wells, Lance; Gutierrez-Sanchez, Gerardo; Favela-Torres, Ernesto

    2016-10-07

    The genus Aspergillus is ubiquitous in nature and includes various species extensively exploited industrially due to their ability to produce and secrete a variety of enzymes and metabolites. Most processes are performed in submerged fermentation (SmF); however, solid-state fermentation (SSF) offers several advantages, including lower catabolite repression and substrate inhibition and higher productivity and stability of the enzymes produced. This study aimed to explain the improved metabolic behavior of A. brasiliensis ATCC9642 in SSF at high glucose concentrations through a proteomic approach. Online respirometric analysis provided reproducible samples for secretomic studies when the maximum CO 2 production rate occurred, ensuring consistent physiological states. Extracellular extracts from SSF cultures were treated by SDS-PAGE, digested with trypsin, and analyzed by LC-MS/MS. Of 531 sequences identified, 207 proteins were analyzed. Twenty-five were identified as the most abundant unregulated proteins; 87 were found to be up-regulated and 95 were down-regulated with increasing glucose concentration. Of the regulated proteins, 120 were enzymes, most involved in the metabolism of carbohydrates (51), amino acids (23), and nucleotides (9). This study shows the high protein secretory activity of A. brasiliensis under SSF conditions. High glucose concentration favors catabolic activities, while some stress-related proteins and those involved in proteolysis are down-regulated.

  10. MEGADOCK-Web: an integrated database of high-throughput structure-based protein-protein interaction predictions.

    Science.gov (United States)

    Hayashi, Takanori; Matsuzaki, Yuri; Yanagisawa, Keisuke; Ohue, Masahito; Akiyama, Yutaka

    2018-05-08

    Protein-protein interactions (PPIs) play several roles in living cells, and computational PPI prediction is a major focus of many researchers. The three-dimensional (3D) structure and binding surface are important for the design of PPI inhibitors. Therefore, rigid body protein-protein docking calculations for two protein structures are expected to allow elucidation of PPIs different from known complexes in terms of 3D structures because known PPI information is not explicitly required. We have developed rapid PPI prediction software based on protein-protein docking, called MEGADOCK. In order to fully utilize the benefits of computational PPI predictions, it is necessary to construct a comprehensive database to gather prediction results and their predicted 3D complex structures and to make them easily accessible. Although several databases exist that provide predicted PPIs, the previous databases do not contain a sufficient number of entries for the purpose of discovering novel PPIs. In this study, we constructed an integrated database of MEGADOCK PPI predictions, named MEGADOCK-Web. MEGADOCK-Web provides more than 10 times the number of PPI predictions than previous databases and enables users to conduct PPI predictions that cannot be found in conventional PPI prediction databases. In MEGADOCK-Web, there are 7528 protein chains and 28,331,628 predicted PPIs from all possible combinations of those proteins. Each protein structure is annotated with PDB ID, chain ID, UniProt AC, related KEGG pathway IDs, and known PPI pairs. Additionally, MEGADOCK-Web provides four powerful functions: 1) searching precalculated PPI predictions, 2) providing annotations for each predicted protein pair with an experimentally known PPI, 3) visualizing candidates that may interact with the query protein on biochemical pathways, and 4) visualizing predicted complex structures through a 3D molecular viewer. MEGADOCK-Web provides a huge amount of comprehensive PPI predictions based on

  11. Physical database design using Oracle

    CERN Document Server

    Burleson, Donald K

    2004-01-01

    INTRODUCTION TO ORACLE PHYSICAL DESIGNPrefaceRelational Databases and Physical DesignSystems Analysis and Physical Database DesignIntroduction to Logical Database DesignEntity/Relation ModelingBridging between Logical and Physical ModelsPhysical Design Requirements Validation PHYSICAL ENTITY DESIGN FOR ORACLEData Relationships and Physical DesignMassive De-Normalization: STAR Schema DesignDesigning Class HierarchiesMaterialized Views and De-NormalizationReferential IntegrityConclusionORACLE HARDWARE DESIGNPlanning the Server EnvironmentDesigning the Network Infrastructure for OracleOracle Netw

  12. Facilitating quality control for spectra assignments of small organic molecules: nmrshiftdb2--a free in-house NMR database with integrated LIMS for academic service laboratories.

    Science.gov (United States)

    Kuhn, Stefan; Schlörer, Nils E

    2015-08-01

    nmrshiftdb2 supports with its laboratory information management system the integration of an electronic lab administration and management into academic NMR facilities. Also, it offers the setup of a local database, while full access to nmrshiftdb2's World Wide Web database is granted. This freely available system allows on the one hand the submission of orders for measurement, transfers recorded data automatically or manually, and enables download of spectra via web interface, as well as the integrated access to prediction, search, and assignment tools of the NMR database for lab users. On the other hand, for the staff and lab administration, flow of all orders can be supervised; administrative tools also include user and hardware management, a statistic functionality for accounting purposes, and a 'QuickCheck' function for assignment control, to facilitate quality control of assignments submitted to the (local) database. Laboratory information management system and database are based on a web interface as front end and are therefore independent of the operating system in use. Copyright © 2015 John Wiley & Sons, Ltd.

  13. Integration of published information into a resistance-associated mutation database for Mycobacterium tuberculosis.

    Science.gov (United States)

    Salamon, Hugh; Yamaguchi, Ken D; Cirillo, Daniela M; Miotto, Paolo; Schito, Marco; Posey, James; Starks, Angela M; Niemann, Stefan; Alland, David; Hanna, Debra; Aviles, Enrique; Perkins, Mark D; Dolinger, David L

    2015-04-01

    Tuberculosis remains a major global public health challenge. Although incidence is decreasing, the proportion of drug-resistant cases is increasing. Technical and operational complexities prevent Mycobacterium tuberculosis drug susceptibility phenotyping in the vast majority of new and retreatment cases. The advent of molecular technologies provides an opportunity to obtain results rapidly as compared to phenotypic culture. However, correlations between genetic mutations and resistance to multiple drugs have not been systematically evaluated. Molecular testing of M. tuberculosis sampled from a typical patient continues to provide a partial picture of drug resistance. A database of phenotypic and genotypic testing results, especially where prospectively collected, could document statistically significant associations and may reveal new, predictive molecular patterns. We examine the feasibility of integrating existing molecular and phenotypic drug susceptibility data to identify associations observed across multiple studies and demonstrate potential for well-integrated M. tuberculosis mutation data to reveal actionable findings. © The Author 2014. Published by Oxford University Press on behalf of the Infectious Diseases Society of America. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  14. Integration of gel-based and gel-free proteomic data for functional analysis of proteins through Soybean Proteome Database

    KAUST Repository

    Komatsu, Setsuko

    2017-05-10

    The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max ‘Enrei’). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. Biological significanceThe Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all

  15. Integration of gel-based and gel-free proteomic data for functional analysis of proteins through Soybean Proteome Database.

    Science.gov (United States)

    Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi

    2017-06-23

    The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max 'Enrei'). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all predicted proteins from

  16. Integration of gel-based and gel-free proteomic data for functional analysis of proteins through Soybean Proteome Database

    KAUST Repository

    Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi

    2017-01-01

    The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max ‘Enrei’). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. Biological significanceThe Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all

  17. RODOS database adapter

    International Nuclear Information System (INIS)

    Xie Gang

    1995-11-01

    Integrated data management is an essential aspect of many automatical information systems such as RODOS, a real-time on-line decision support system for nuclear emergency management. In particular, the application software must provide access management to different commercial database systems. This report presents the tools necessary for adapting embedded SQL-applications to both HP-ALLBASE/SQL and CA-Ingres/SQL databases. The design of the database adapter and the concept of RODOS embedded SQL syntax are discussed by considering some of the most important features of SQL-functions and the identification of significant differences between SQL-implementations. Finally fully part of the software developed and the administrator's and installation guides are described. (orig.) [de

  18. Molecular Profiling of the Phytophthora plurivora Secretome: A Step towards Understanding the Cross-Talk between Plant Pathogenic Oomycetes and Their Hosts

    Science.gov (United States)

    Fleischmann, Frank; Dalio, Ronaldo J. D.; Di Maro, Antimo; Scognamiglio, Monica; Fiorentino, Antonio; Parente, Augusto; Osswald, Wolfgang; Chambery, Angela

    2014-01-01

    The understanding of molecular mechanisms underlying host–pathogen interactions in plant diseases is of crucial importance to gain insights on different virulence strategies of pathogens and unravel their role in plant immunity. Among plant pathogens, Phytophthora species are eliciting a growing interest for their considerable economical and environmental impact. Plant infection by Phytophthora phytopathogens is a complex process coordinated by a plethora of extracellular signals secreted by both host plants and pathogens. The characterization of the repertoire of effectors secreted by oomycetes has become an active area of research for deciphering molecular mechanisms responsible for host plants colonization and infection. Putative secreted proteins by Phytophthora species have been catalogued by applying high-throughput genome-based strategies and bioinformatic approaches. However, a comprehensive analysis of the effective secretome profile of Phytophthora is still lacking. Here, we report the first large-scale profiling of P. plurivora secretome using a shotgun LC-MS/MS strategy. To gain insight on the molecular signals underlying the cross-talk between plant pathogenic oomycetes and their host plants, we also investigate the quantitative changes of secreted protein following interaction of P. plurivora with the root exudate of Fagus sylvatica which is highly susceptible to the root pathogen. We show that besides known effectors, the expression and/or secretion levels of cell-wall-degrading enzymes were altered following the interaction with the host plant root exudate. In addition, a characterization of the F. sylvatica root exudate was performed by NMR and amino acid analysis, allowing the identification of the main released low-molecular weight components, including organic acids and free amino acids. This study provides important insights for deciphering the extracellular network involved in the highly susceptible P. plurivora-F. sylvatica interaction

  19. Long-acting beneficial effect of percutaneously intramyocardially delivered secretome of apoptotic peripheral blood cells on porcine chronic ischemic left ventricular dysfunction.

    Science.gov (United States)

    Pavo, Noemi; Zimmermann, Matthias; Pils, Dietmar; Mildner, Michael; Petrási, Zsolt; Petneházy, Örs; Fuzik, János; Jakab, András; Gabriel, Christian; Sipos, Wolfgang; Maurer, Gerald; Gyöngyösi, Mariann; Ankersmit, Hendrik Jan

    2014-04-01

    The quantity of cells with paracrine effects for use in myocardial regeneration therapy is limited. This study investigated the effects of catheter-based endomyocardial delivery of secretome of 2.5 × 10(9) apoptotic peripheral blood mononuclear cells (APOSEC) on porcine chronic post-myocardial infarction (MI) left ventricular (LV) dysfunction and on gene expression. Closed-chest reperfused MI was induced in pigs by 90-min occlusion followed by reperfusion of the mid-LAD (day 0). At day 30, animals were randomized to receive porcine APOSEC (n = 8) or medium solution (control; n = 8) injected intramyocardially into the MI border zone using 3D NOGA guidance. At day 60, cardiac MRI with late enhancement and diagnostic NOGA (myocardial viability) were performed. Gene expression profiling of the infarct core, border zone, and normal myocardium was performed using microarray analysis and confirmed by quantitative real-time PCR. Injection of APOSEC significantly decreased infarct size (p < 0.05) and improved cardiac index and myocardial viability compared to controls. A trend towards higher LV ejection fraction was observed in APOSEC vs. controls (45.4 ± 5.9% vs. 37.4 ± 8.9%, p = 0.052). Transcriptome analysis revealed significant downregulation of caspase-1, tumor necrosis factor and other inflammatory genes in APOSEC-affected areas. rtPCR showed higher expression of myogenic factor Mefc2 (p < 0.05) and downregulated caspase genes (p < 0.05) in APOSEC-treated pigs. In conclusion, overexpression of MEF2c and repression of caspase was related to decreased infarct size and improved cardiac function in secretome-treated animals. Altered gene expression 1-month post-APOSEC treatment proved the long-acting effects of cell-free therapy with paracrine factors. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.

  20. Managing Consistency Anomalies in Distributed Integrated Databases with Relaxed ACID Properties

    DEFF Research Database (Denmark)

    Frank, Lars; Ulslev Pedersen, Rasmus

    2014-01-01

    In central databases the consistency of data is normally implemented by using the ACID (Atomicity, Consistency, Isolation and Durability) properties of a DBMS (Data Base Management System). This is not possible if distributed and/or mobile databases are involved and the availability of data also...... has to be optimized. Therefore, we will in this paper use so called relaxed ACID properties across different locations. The objective of designing relaxed ACID properties across different database locations is that the users can trust the data they use even if the distributed database temporarily...... is inconsistent. It is also important that disconnected locations can operate in a meaningful way in socalled disconnected mode. A database is DBMS consistent if its data complies with the consistency rules of the DBMS's metadata. If the database is DBMS consistent both when a transaction starts and when it has...

  1. A coordination language for databases

    DEFF Research Database (Denmark)

    Li, Ximeng; Wu, Xi; Lluch Lafuente, Alberto

    2017-01-01

    We present a coordination language for the modeling of distributed database applications. The language, baptized Klaim-DB, borrows the concepts of localities and nets of the coordination language Klaim but re-incarnates the tuple spaces of Klaim as databases. It provides high-level abstractions...... and primitives for the access and manipulation of structured data, with integrity and atomicity considerations. We present the formal semantics of Klaim-DB and develop a type system that avoids potential runtime errors such as certain evaluation errors and mismatches of data format in tables, which are monitored...... in the semantics. The use of the language is illustrated in a scenario where the sales from different branches of a chain of department stores are aggregated from their local databases. Raising the abstraction level and encapsulating integrity checks in the language primitives have benefited the modeling task...

  2. Study of developing a database of energy statistics

    Energy Technology Data Exchange (ETDEWEB)

    Park, T.S. [Korea Energy Economics Institute, Euiwang (Korea, Republic of)

    1997-08-01

    An integrated energy database should be prepared in advance for managing energy statistics comprehensively. However, since much manpower and budget is required for developing an integrated energy database, it is difficult to establish a database within a short period of time. Therefore, this study sets the purpose in drawing methods to analyze existing statistical data lists and to consolidate insufficient data as first stage work for the energy database, and at the same time, in analyzing general concepts and the data structure of the database. I also studied the data content and items of energy databases in operation in international energy-related organizations such as IEA, APEC, Japan, and the USA as overseas cases as well as domestic conditions in energy databases, and the hardware operating systems of Japanese databases. I analyzed the making-out system of Korean energy databases, discussed the KEDB system which is representative of total energy databases, and present design concepts for new energy databases. In addition, I present the establishment directions and their contents of future Korean energy databases, data contents that should be collected by supply and demand statistics, and the establishment of data collection organization, etc. by analyzing the Korean energy statistical data and comparing them with the system of OECD/IEA. 26 refs., 15 figs., 11 tabs.

  3. Proteomic Analysis of the Secretome of Cellulomonas fimi ATCC 484 and Cellulomonas flavigena ATCC 482.

    Directory of Open Access Journals (Sweden)

    Warren W Wakarchuk

    Full Text Available The bacteria in the genus Cellulomonas are known for their ability to degrade plant cell wall biomass. Cellulomonas fimi ATCC 484 and C. flavigena ATCC 482 have been the subject of much research into secreted cellulases and hemicellulases. Recently the genome sequences of both C. fimi ATCC 484 and C. flavigena ATCC 482 were published, and a genome comparison has revealed their full spectrum of possible carbohydrate-active enzymes (CAZymes. Using mass spectrometry, we have compared the proteins secreted by C. fimi and C. flavigena during growth on the soluble cellulose substrate, carboxymethylcellulose (CMC, as well as a soluble xylan fraction. Many known C. fimi CAZymes were detected, which validated our analysis, as were a number of new CAZymes and other proteins that, though identified in the genome, have not previously been observed in the secretome of either organism. Our data also shows that many of these are co-expressed on growth of either CMC or xylan. This analysis provides a new perspective on Cellulomonas enzymes and provides many new CAZyme targets for characterization.

  4. A comparative secretome analysis of industrial Aspergillus oryzae and its spontaneous mutant ZJGS-LZ-21.

    Science.gov (United States)

    Zhu, Yuanyuan; Liang, Xinle; Zhang, Hong; Feng, Wei; Liu, Ye; Zhang, Fuming; Linhardt, Robert J

    2017-05-02

    Aspergillus oryzae koji plays a crucial role in fermented food products due to the hydrolytic activities of secreted enzymes. In the present study, we performed a comparative secretome analysis of the industrial strain of Aspergillus oryzae 3.042 and its spontaneous mutantZJGS-LZ-21. One hundred and fifty two (152) differential protein spots were excised (p<0.05), and 25 proteins were identified. Of the identified proteins, 91.3% belonged to hydrolytic enzymes acting on carbohydrates or proteins. Consistent with their enzyme activities, the expression of 14 proteins involved in the degradation of cellulose, hemicellulose, starch and proteins, increased in the ZJGS-LZ-21isolate. In particular, increased levels of acid protease (Pep) may favor the degradation of soy proteins in acidic environments and promote the cleavage of allergenic soybean proteins in fermentation, resulting in improvements of product safety and quality. The ZJGS-LZ-21 isolate showed higher protein secretion and increased hydrolytic activities than did strain 3.042, indicating its promising application in soybean paste fermentation. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Database Translator (DATALATOR) for Integrated Exploitation

    Science.gov (United States)

    2010-10-31

    via the Internet to Fortune 1000 clients including Mercedes Benz , Procter & Gamble, and HP. I look forward to hearing of your successful proposal and working with you to build a successful business. Sincerely, ...testing the DATALATOR experimental prototype (IRL 4) designed to demonstrate its core functions based on Next (icneration Software technology . Die...sources, but is not directly dependent on the platform such as database technology or data formats. In other words, there is a clear air gap between

  6. LSDB Archive - KEGG MEDICUS | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available [ Credits ] English ]; } else if ( url.search(//en//) != -1 ) { url = url.replace(/...switchLanguage; BLAST Search Image Search Home About Archive Update History Data List Contact us KEGG MEDI...CUS Database Description General information of database Database name KEGG MEDICUS...ug design Organism Taxonomy Name: Human Taxonomy ID: 9606 Database description KEGG MEDICUS is an integrated...ge inserts) of all marketed drugs in Japan and the USA are integrated with the KEGG DRUG and KEGG DISEASE databases in KEGG MEDI

  7. An object-oriented framework for managing cooperating legacy databases

    NARCIS (Netherlands)

    Balsters, H; de Brock, EO

    2003-01-01

    We describe a general semantic framework for precise specification of so-called database federations. A database federation provides for tight coupling of a collection of heterogeneous legacy databases into a global integrated system. Our approach to database federation is based on the UML/OCL data

  8. Database citation in full text biomedical articles.

    Science.gov (United States)

    Kafkas, Şenay; Kim, Jee-Hyub; McEntyre, Johanna R

    2013-01-01

    Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA), UniProt and Protein Data Bank, Europe (PDBe), we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services.

  9. European Vegetation Archive (EVA): an integrated database of European vegetation plots

    DEFF Research Database (Denmark)

    Chytrý, M; Hennekens, S M; Jiménez-Alfaro, B

    2015-01-01

    vegetation- plot databases on a single software platform. Data storage in EVA does not affect on-going independent development of the contributing databases, which remain the property of the data contributors. EVA uses a prototype of the database management software TURBOVEG 3 developed for joint management......The European Vegetation Archive (EVA) is a centralized database of European vegetation plots developed by the IAVS Working Group European Vegetation Survey. It has been in development since 2012 and first made available for use in research projects in 2014. It stores copies of national and regional...... data source for large-scale analyses of European vegetation diversity both for fundamental research and nature conservation applications. Updated information on EVA is available online at http://euroveg.org/eva-database....

  10. A database and tool, IM Browser, for exploring and integrating emerging gene and protein interaction data for Drosophila

    Directory of Open Access Journals (Sweden)

    Parrish Jodi R

    2006-04-01

    Full Text Available Abstract Background Biological processes are mediated by networks of interacting genes and proteins. Efforts to map and understand these networks are resulting in the proliferation of interaction data derived from both experimental and computational techniques for a number of organisms. The volume of this data combined with the variety of specific forms it can take has created a need for comprehensive databases that include all of the available data sets, and for exploration tools to facilitate data integration and analysis. One powerful paradigm for the navigation and analysis of interaction data is an interaction graph or map that represents proteins or genes as nodes linked by interactions. Several programs have been developed for graphical representation and analysis of interaction data, yet there remains a need for alternative programs that can provide casual users with rapid easy access to many existing and emerging data sets. Description Here we describe a comprehensive database of Drosophila gene and protein interactions collected from a variety of sources, including low and high throughput screens, genetic interactions, and computational predictions. We also present a program for exploring multiple interaction data sets and for combining data from different sources. The program, referred to as the Interaction Map (IM Browser, is a web-based application for searching and visualizing interaction data stored in a relational database system. Use of the application requires no downloads and minimal user configuration or training, thereby enabling rapid initial access to interaction data. IM Browser was designed to readily accommodate and integrate new types of interaction data as it becomes available. Moreover, all information associated with interaction measurements or predictions and the genes or proteins involved are accessible to the user. This allows combined searches and analyses based on either common or technique-specific attributes

  11. Use of Graph Database for the Integration of Heterogeneous Biological Data.

    Science.gov (United States)

    Yoon, Byoung-Ha; Kim, Seon-Kyu; Kim, Seon-Young

    2017-03-01

    Understanding complex relationships among heterogeneous biological data is one of the fundamental goals in biology. In most cases, diverse biological data are stored in relational databases, such as MySQL and Oracle, which store data in multiple tables and then infer relationships by multiple-join statements. Recently, a new type of database, called the graph-based database, was developed to natively represent various kinds of complex relationships, and it is widely used among computer science communities and IT industries. Here, we demonstrate the feasibility of using a graph-based database for complex biological relationships by comparing the performance between MySQL and Neo4j, one of the most widely used graph databases. We collected various biological data (protein-protein interaction, drug-target, gene-disease, etc.) from several existing sources, removed duplicate and redundant data, and finally constructed a graph database containing 114,550 nodes and 82,674,321 relationships. When we tested the query execution performance of MySQL versus Neo4j, we found that Neo4j outperformed MySQL in all cases. While Neo4j exhibited a very fast response for various queries, MySQL exhibited latent or unfinished responses for complex queries with multiple-join statements. These results show that using graph-based databases, such as Neo4j, is an efficient way to store complex biological relationships. Moreover, querying a graph database in diverse ways has the potential to reveal novel relationships among heterogeneous biological data.

  12. An inductive database system based on virtual mining views

    NARCIS (Netherlands)

    Blockeel, H.; Calders, T.G.K.; Fromont, É.; Goethals, B.; Prado, A.; Robardet, C.

    2012-01-01

    Inductive databases integrate database querying with database mining. In this article, we present an inductive database system that does not rely on a new data mining query language, but on plain SQL. We propose an intuitive and elegant framework based on virtual mining views, which are relational

  13. Neutron metrology file NMF-90. An integrated database for performing neutron spectrum adjustment calculations

    International Nuclear Information System (INIS)

    Kocherov, N.P.

    1996-01-01

    The Neutron Metrology File NMF-90 is an integrated database for performing neutron spectrum adjustment (unfolding) calculations. It contains 4 different adjustment codes, the dosimetry reaction cross-section library IRDF-90/NMF-G with covariances files, 6 input data sets for reactor benchmark neutron fields and a number of utility codes for processing and plotting the input and output data. The package consists of 9 PC HD diskettes and manuals for the codes. It is distributed by the Nuclear Data Section of the IAEA on request free of charge. About 10 MB of diskspace is needed to install and run a typical reactor neutron dosimetry unfolding problem. (author). 8 refs

  14. A framework for organizing cancer-related variations from existing databases, publications and NGS data using a High-performance Integrated Virtual Environment (HIVE).

    Science.gov (United States)

    Wu, Tsung-Jung; Shamsaddini, Amirhossein; Pan, Yang; Smith, Krista; Crichton, Daniel J; Simonyan, Vahan; Mazumder, Raja

    2014-01-01

    Years of sequence feature curation by UniProtKB/Swiss-Prot, PIR-PSD, NCBI-CDD, RefSeq and other database biocurators has led to a rich repository of information on functional sites of genes and proteins. This information along with variation-related annotation can be used to scan human short sequence reads from next-generation sequencing (NGS) pipelines for presence of non-synonymous single-nucleotide variations (nsSNVs) that affect functional sites. This and similar workflows are becoming more important because thousands of NGS data sets are being made available through projects such as The Cancer Genome Atlas (TCGA), and researchers want to evaluate their biomarkers in genomic data. BioMuta, an integrated sequence feature database, provides a framework for automated and manual curation and integration of cancer-related sequence features so that they can be used in NGS analysis pipelines. Sequence feature information in BioMuta is collected from the Catalogue of Somatic Mutations in Cancer (COSMIC), ClinVar, UniProtKB and through biocuration of information available from publications. Additionally, nsSNVs identified through automated analysis of NGS data from TCGA are also included in the database. Because of the petabytes of data and information present in NGS primary repositories, a platform HIVE (High-performance Integrated Virtual Environment) for storing, analyzing, computing and curating NGS data and associated metadata has been developed. Using HIVE, 31 979 nsSNVs were identified in TCGA-derived NGS data from breast cancer patients. All variations identified through this process are stored in a Curated Short Read archive, and the nsSNVs from the tumor samples are included in BioMuta. Currently, BioMuta has 26 cancer types with 13 896 small-scale and 308 986 large-scale study-derived variations. Integration of variation data allows identifications of novel or common nsSNVs that can be prioritized in validation studies. Database URL: BioMuta: http

  15. Pancreatic Expression database: a generic model for the organization, integration and mining of complex cancer datasets

    Directory of Open Access Journals (Sweden)

    Lemoine Nicholas R

    2007-11-01

    Full Text Available Abstract Background Pancreatic cancer is the 5th leading cause of cancer death in both males and females. In recent years, a wealth of gene and protein expression studies have been published broadening our understanding of pancreatic cancer biology. Due to the explosive growth in publicly available data from multiple different sources it is becoming increasingly difficult for individual researchers to integrate these into their current research programmes. The Pancreatic Expression database, a generic web-based system, is aiming to close this gap by providing the research community with an open access tool, not only to mine currently available pancreatic cancer data sets but also to include their own data in the database. Description Currently, the database holds 32 datasets comprising 7636 gene expression measurements extracted from 20 different published gene or protein expression studies from various pancreatic cancer types, pancreatic precursor lesions (PanINs and chronic pancreatitis. The pancreatic data are stored in a data management system based on the BioMart technology alongside the human genome gene and protein annotations, sequence, homologue, SNP and antibody data. Interrogation of the database can be achieved through both a web-based query interface and through web services using combined criteria from pancreatic (disease stages, regulation, differential expression, expression, platform technology, publication and/or public data (antibodies, genomic region, gene-related accessions, ontology, expression patterns, multi-species comparisons, protein data, SNPs. Thus, our database enables connections between otherwise disparate data sources and allows relatively simple navigation between all data types and annotations. Conclusion The database structure and content provides a powerful and high-speed data-mining tool for cancer research. It can be used for target discovery i.e. of biomarkers from body fluids, identification and analysis

  16. InterAction Database (IADB)

    Science.gov (United States)

    The InterAction Database includes demographic and prescription information for more than 500,000 patients in the northern and middle Netherlands and has been integrated with other systems to enhance data collection and analysis.

  17. Database Systems - Present and Future

    Directory of Open Access Journals (Sweden)

    2009-01-01

    Full Text Available The database systems have nowadays an increasingly important role in the knowledge-based society, in which computers have penetrated all fields of activity and the Internet tends to develop worldwide. In the current informatics context, the development of the applications with databases is the work of the specialists. Using databases, reach a database from various applications, and also some of related concepts, have become accessible to all categories of IT users. This paper aims to summarize the curricular area regarding the fundamental database systems issues, which are necessary in order to train specialists in economic informatics higher education. The database systems integrate and interfere with several informatics technologies and therefore are more difficult to understand and use. Thus, students should know already a set of minimum, mandatory concepts and their practical implementation: computer systems, programming techniques, programming languages, data structures. The article also presents the actual trends in the evolution of the database systems, in the context of economic informatics.

  18. The bovine QTL viewer: a web accessible database of bovine Quantitative Trait Loci

    Directory of Open Access Journals (Sweden)

    Xavier Suresh R

    2006-06-01

    Full Text Available Abstract Background Many important agricultural traits such as weight gain, milk fat content and intramuscular fat (marbling in cattle are quantitative traits. Most of the information on these traits has not previously been integrated into a genomic context. Without such integration application of these data to agricultural enterprises will remain slow and inefficient. Our goal was to populate a genomic database with data mined from the bovine quantitative trait literature and to make these data available in a genomic context to researchers via a user friendly query interface. Description The QTL (Quantitative Trait Locus data and related information for bovine QTL are gathered from published work and from existing databases. An integrated database schema was designed and the database (MySQL populated with the gathered data. The bovine QTL Viewer was developed for the integration of QTL data available for cattle. The tool consists of an integrated database of bovine QTL and the QTL viewer to display QTL and their chromosomal position. Conclusion We present a web accessible, integrated database of bovine (dairy and beef cattle QTL for use by animal geneticists. The viewer and database are of general applicability to any livestock species for which there are public QTL data. The viewer can be accessed at http://bovineqtl.tamu.edu.

  19. Design and implementation of typical target image database system

    International Nuclear Information System (INIS)

    Qin Kai; Zhao Yingjun

    2010-01-01

    It is necessary to provide essential background data and thematic data timely in image processing and application. In fact, application is an integrating and analyzing procedure with different kinds of data. In this paper, the authors describe an image database system which classifies, stores, manages and analyzes database of different types, such as image database, vector database, spatial database, spatial target characteristics database, its design and structure. (authors)

  20. The STRING database in 2017

    DEFF Research Database (Denmark)

    Szklarczyk, Damian; Morris, John H; Cook, Helen

    2017-01-01

    A system-wide understanding of cellular function requires knowledge of all functional interactions between the expressed proteins. The STRING database aims to collect and integrate this information, by consolidating known and predicted protein-protein association data for a large number of organi......A system-wide understanding of cellular function requires knowledge of all functional interactions between the expressed proteins. The STRING database aims to collect and integrate this information, by consolidating known and predicted protein-protein association data for a large number...... of organisms. The associations in STRING include direct (physical) interactions, as well as indirect (functional) interactions, as long as both are specific and biologically meaningful. Apart from collecting and reassessing available experimental data on protein-protein interactions, and importing known...... pathways and protein complexes from curated databases, interaction predictions are derived from the following sources: (i) systematic co-expression analysis, (ii) detection of shared selective signals across genomes, (iii) automated text-mining of the scientific literature and (iv) computational transfer...

  1. The CUTLASS database facilities

    International Nuclear Information System (INIS)

    Jervis, P.; Rutter, P.

    1988-09-01

    The enhancement of the CUTLASS database management system to provide improved facilities for data handling is seen as a prerequisite to its effective use for future power station data processing and control applications. This particularly applies to the larger projects such as AGR data processing system refurbishments, and the data processing systems required for the new Coal Fired Reference Design stations. In anticipation of the need for improved data handling facilities in CUTLASS, the CEGB established a User Sub-Group in the early 1980's to define the database facilities required by users. Following the endorsement of the resulting specification and a detailed design study, the database facilities have been implemented as an integral part of the CUTLASS system. This paper provides an introduction to the range of CUTLASS Database facilities, and emphasises the role of Database as the central facility around which future Kit 1 and (particularly) Kit 6 CUTLASS based data processing and control systems will be designed and implemented. (author)

  2. 1.15 - Structural Chemogenomics Databases to Navigate Protein–Ligand Interaction Space

    NARCIS (Netherlands)

    Kanev, G.K.; Kooistra, A.J.; de Esch, I.J.P.; de Graaf, C.

    2017-01-01

    Structural chemogenomics databases allow the integration and exploration of heterogeneous genomic, structural, chemical, and pharmacological data in order to extract useful information that is applicable for the discovery of new protein targets and biologically active molecules. Integrated databases

  3. Databases for INDUS-1 and INDUS-2

    International Nuclear Information System (INIS)

    Merh, Bhavna N.; Fatnani, Pravin

    2003-01-01

    The databases for Indus are relational databases designed to store various categories of data related to the accelerator. The data archiving and retrieving system in Indus is based on a client/sever model. A general purpose commercial database is used to store parameters and equipment data for the whole machine. The database manages configuration, on-line and historical databases. On line and off line applications distributed in several systems can store and retrieve the data from the database over the network. This paper describes the structure of databases for Indus-1 and Indus-2 and their integration within the software architecture. The data analysis, design, resulting data-schema and implementation issues are discussed. (author)

  4. Evaluation of secretome of highly efficient lignocellulolytic Penicillium sp. Dal 5 isolated from rhizosphere of conifers.

    Science.gov (United States)

    Rai, Rohit; Kaur, Baljit; Singh, Surender; Di Falco, Macros; Tsang, Adrian; Chadha, B S

    2016-09-01

    Penicillium sp. (Dal 5) isolated from rhizosphere of conifers from Dalhousie (Himachal Pradesh, India) was found to be an efficient cellulolytic strain. The culture under shake flask on CWR (cellulose, wheat bran and rice straw) medium produced appreciably higher levels of endoglucanase (35.69U/ml), β-glucosidase (4.20U/ml), cellobiohydrolase (2.86U/ml), FPase (1.2U/ml) and xylanase (115U/ml) compared to other Penicillium strains reported in literature. The mass spectroscopy analysis of Penicillium sp. Dal 5 secretome identified 108 proteins constituting an array of CAZymes including glycosyl hydrolases (GH) belonging to 24 different families, polysaccharide lyases (PL), carbohydrate esterases (CE), lytic polysaccharide mono-oxygenases (LPMO) in addition to swollenin and a variety of carbohydrate binding modules (CBM) indicating an elaborate genetic potential of this strain for hydrolysis of lignocellulosics. Further, the culture extract was evaluated for hydrolysis of alkali treated rice straw, wheat straw, bagasse and corn cob at 10% substrate loading rate. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. A new relational database structure and online interface for the HITRAN database

    International Nuclear Information System (INIS)

    Hill, Christian; Gordon, Iouli E.; Rothman, Laurence S.; Tennyson, Jonathan

    2013-01-01

    A new format for the HITRAN database is proposed. By storing the line-transition data in a number of linked tables described by a relational database schema, it is possible to overcome the limitations of the existing format, which have become increasingly apparent over the last few years as new and more varied data are being used by radiative-transfer models. Although the database in the new format can be searched using the well-established Structured Query Language (SQL), a web service, HITRANonline, has been deployed to allow users to make most common queries of the database using a graphical user interface in a web page. The advantages of the relational form of the database to ensuring data integrity and consistency are explored, and the compatibility of the online interface with the emerging standards of the Virtual Atomic and Molecular Data Centre (VAMDC) project is discussed. In particular, the ability to access HITRAN data using a standard query language from other websites, command line tools and from within computer programs is described. -- Highlights: • A new, interactive version of the HITRAN database is presented. • The data is stored in a structured fashion in a relational database. • The new HITRANonline interface offers increased functionality and easier error correction

  6. An XML-Based Networking Method for Connecting Distributed Anthropometric Databases

    Directory of Open Access Journals (Sweden)

    H Cheng

    2007-03-01

    Full Text Available Anthropometric data are used by numerous types of organizations for health evaluation, ergonomics, apparel sizing, fitness training, and many other applications. Data have been collected and stored in electronic databases since at least the 1940s. These databases are owned by many organizations around the world. In addition, the anthropometric studies stored in these databases often employ different standards, terminology, procedures, or measurement sets. To promote the use and sharing of these databases, the World Engineering Anthropometry Resources (WEAR group was formed and tasked with the integration and publishing of member resources. It is easy to see that organizing worldwide anthropometric data into a single database architecture could be a daunting and expensive undertaking. The challenges of WEAR integration reflect mainly in the areas of distributed and disparate data, different standards and formats, independent memberships, and limited development resources. Fortunately, XML schema and web services provide an alternative method for networking databases, referred to as the Loosely Coupled WEAR Integration. A standard XML schema can be defined and used as a type of Rosetta stone to translate the anthropometric data into a universal format, and a web services system can be set up to link the databases to one another. In this way, the originators of the data can keep their data locally along with their own data management system and user interface, but their data can be searched and accessed as part of the larger data network, and even combined with the data of others. This paper will identify requirements for WEAR integration, review XML as the universal format, review different integration approaches, and propose a hybrid web services/data mart solution.

  7. Genome analysis of Excretory/Secretory proteins in Taenia solium reveals their Abundance of Antigenic Regions (AAR).

    Science.gov (United States)

    Gomez, Sandra; Adalid-Peralta, Laura; Palafox-Fonseca, Hector; Cantu-Robles, Vito Adrian; Soberón, Xavier; Sciutto, Edda; Fragoso, Gladis; Bobes, Raúl J; Laclette, Juan P; Yauner, Luis del Pozo; Ochoa-Leyva, Adrián

    2015-05-19

    Excretory/Secretory (ES) proteins play an important role in the host-parasite interactions. Experimental identification of ES proteins is time-consuming and expensive. Alternative bioinformatics approaches are cost-effective and can be used to prioritize the experimental analysis of therapeutic targets for parasitic diseases. Here we predicted and functionally annotated the ES proteins in T. solium genome using an integration of bioinformatics tools. Additionally, we developed a novel measurement to evaluate the potential antigenicity of T. solium secretome using sequence length and number of antigenic regions of ES proteins. This measurement was formalized as the Abundance of Antigenic Regions (AAR) value. AAR value for secretome showed a similar value to that obtained for a set of experimentally determined antigenic proteins and was different to the calculated value for the non-ES proteins of T. solium genome. Furthermore, we calculated the AAR values for known helminth secretomes and they were similar to that obtained for T. solium. The results reveal the utility of AAR value as a novel genomic measurement to evaluate the potential antigenicity of secretomes. This comprehensive analysis of T. solium secretome provides functional information for future experimental studies, including the identification of novel ES proteins of therapeutic, diagnosis and immunological interest.

  8. A new relational database structure and online interface for the HITRAN database

    Science.gov (United States)

    Hill, Christian; Gordon, Iouli E.; Rothman, Laurence S.; Tennyson, Jonathan

    2013-11-01

    A new format for the HITRAN database is proposed. By storing the line-transition data in a number of linked tables described by a relational database schema, it is possible to overcome the limitations of the existing format, which have become increasingly apparent over the last few years as new and more varied data are being used by radiative-transfer models. Although the database in the new format can be searched using the well-established Structured Query Language (SQL), a web service, HITRANonline, has been deployed to allow users to make most common queries of the database using a graphical user interface in a web page. The advantages of the relational form of the database to ensuring data integrity and consistency are explored, and the compatibility of the online interface with the emerging standards of the Virtual Atomic and Molecular Data Centre (VAMDC) project is discussed. In particular, the ability to access HITRAN data using a standard query language from other websites, command line tools and from within computer programs is described.

  9. Klaim-DB: A Modeling Language for Distributed Database Applications

    DEFF Research Database (Denmark)

    Wu, Xi; Li, Ximeng; Lluch Lafuente, Alberto

    2015-01-01

    and manipulation of structured data, with integrity and atomicity considerations. We present the formal semantics of KlaimDB and illustrate the use of the language in a scenario where the sales from different branches of a chain of department stores are aggregated from their local databases. It can be seen......We present the modelling language, Klaim-DB, for distributed database applications. Klaim-DB borrows the distributed nets of the coordination language Klaim but essentially re-incarnates the tuple spaces of Klaim as databases, and provides high-level language abstractions for the access...... that raising the abstraction level and encapsulating integrity checks (concerning the schema of tables, etc.) in the language primitives for database operations benefit the modelling task considerably....

  10. An integrated database on ticks and tick-borne zoonoses in the tropics and subtropics with special reference to developing and emerging countries.

    Science.gov (United States)

    Vesco, Umberto; Knap, Nataša; Labruna, Marcelo B; Avšič-Županc, Tatjana; Estrada-Peña, Agustín; Guglielmone, Alberto A; Bechara, Gervasio H; Gueye, Arona; Lakos, Andras; Grindatto, Anna; Conte, Valeria; De Meneghi, Daniele

    2011-05-01

    Tick-borne zoonoses (TBZ) are emerging diseases worldwide. A large amount of information (e.g. case reports, results of epidemiological surveillance, etc.) is dispersed through various reference sources (ISI and non-ISI journals, conference proceedings, technical reports, etc.). An integrated database-derived from the ICTTD-3 project ( http://www.icttd.nl )-was developed in order to gather TBZ records in the (sub-)tropics, collected both by the authors and collaborators worldwide. A dedicated website ( http://www.tickbornezoonoses.org ) was created to promote collaboration and circulate information. Data collected are made freely available to researchers for analysis by spatial methods, integrating mapped ecological factors for predicting TBZ risk. The authors present the assembly process of the TBZ database: the compilation of an updated list of TBZ relevant for (sub-)tropics, the database design and its structure, the method of bibliographic search, the assessment of spatial precision of geo-referenced records. At the time of writing, 725 records extracted from 337 publications related to 59 countries in the (sub-)tropics, have been entered in the database. TBZ distribution maps were also produced. Imported cases have been also accounted for. The most important datasets with geo-referenced records were those on Spotted Fever Group rickettsiosis in Latin-America and Crimean-Congo Haemorrhagic Fever in Africa. The authors stress the need for international collaboration in data collection to update and improve the database. Supervision of data entered remains always necessary. Means to foster collaboration are discussed. The paper is also intended to describe the challenges encountered to assemble spatial data from various sources and to help develop similar data collections.

  11. Tight-coupling of groundwater flow and transport modelling engines with spatial databases and GIS technology: a new approach integrating Feflow and ArcGIS

    Directory of Open Access Journals (Sweden)

    Ezio Crestaz

    2012-09-01

    Full Text Available Implementation of groundwater flow and transport numerical models is generally a challenge, time-consuming and financially-demanding task, in charge to specialized modelers and consulting firms. At a later stage, within clearly stated limits of applicability, these models are often expected to be made available to less knowledgeable personnel to support/design and running of predictive simulations within more familiar environments than specialized simulation systems. GIS systems coupled with spatial databases appear to be ideal candidates to address problem above, due to their much wider diffusion and expertise availability. Current paper discusses the issue from a tight-coupling architecture perspective, aimed at integration of spatial databases, GIS and numerical simulation engines, addressing both observed and computed data management, retrieval and spatio-temporal analysis issues. Observed data can be migrated to the central database repository and then used to set up transient simulation conditions in the background, at run time, while limiting additional complexity and integrity failure risks as data duplication during data transfer through proprietary file formats. Similarly, simulation scenarios can be set up in a familiar GIS system and stored to spatial database for later reference. As numerical engine is tightly coupled with the GIS, simulations can be run within the environment and results themselves saved to the database. Further tasks, as spatio-temporal analysis (i.e. for postcalibration auditing scopes, cartography production and geovisualization, can then be addressed using traditional GIS tools. Benefits of such an approach include more effective data management practices, integration and availability of modeling facilities in a familiar environment, streamlining spatial analysis processes and geovisualization requirements for the non-modelers community. Major drawbacks include limited 3D and time-dependent support in

  12. The UCSC Genome Browser Database: 2008 update

    DEFF Research Database (Denmark)

    Karolchik, D; Kuhn, R M; Baertsch, R

    2007-01-01

    The University of California, Santa Cruz, Genome Browser Database (GBD) provides integrated sequence and annotation data for a large collection of vertebrate and model organism genomes. Seventeen new assemblies have been added to the database in the past year, for a total coverage of 19 vertebrat...

  13. Mining Views : database views for data mining

    NARCIS (Netherlands)

    Blockeel, H.; Calders, T.; Fromont, É.; Goethals, B.; Prado, A.; Nijssen, S.; De Raedt, L.

    2007-01-01

    We propose a relational database model towards the integration of data mining into relational database systems, based on the so called virtual mining views. We show that several types of patterns and models over the data, such as itemsets, association rules, decision trees and clusterings, can be

  14. Mining Views : database views for data mining

    NARCIS (Netherlands)

    Blockeel, H.; Calders, T.; Fromont, É.; Goethals, B.; Prado, A.

    2008-01-01

    We present a system towards the integration of data mining into relational databases. To this end, a relational database model is proposed, based on the so called virtual mining views. We show that several types of patterns and models over the data, such as itemsets, association rules and decision

  15. A dedicated database system for handling multi-level data in systems biology.

    Science.gov (United States)

    Pornputtapong, Natapol; Wanichthanarak, Kwanjeera; Nilsson, Avlant; Nookaew, Intawat; Nielsen, Jens

    2014-01-01

    Advances in high-throughput technologies have enabled extensive generation of multi-level omics data. These data are crucial for systems biology research, though they are complex, heterogeneous, highly dynamic, incomplete and distributed among public databases. This leads to difficulties in data accessibility and often results in errors when data are merged and integrated from varied resources. Therefore, integration and management of systems biological data remain very challenging. To overcome this, we designed and developed a dedicated database system that can serve and solve the vital issues in data management and hereby facilitate data integration, modeling and analysis in systems biology within a sole database. In addition, a yeast data repository was implemented as an integrated database environment which is operated by the database system. Two applications were implemented to demonstrate extensibility and utilization of the system. Both illustrate how the user can access the database via the web query function and implemented scripts. These scripts are specific for two sample cases: 1) Detecting the pheromone pathway in protein interaction networks; and 2) Finding metabolic reactions regulated by Snf1 kinase. In this study we present the design of database system which offers an extensible environment to efficiently capture the majority of biological entities and relations encountered in systems biology. Critical functions and control processes were designed and implemented to ensure consistent, efficient, secure and reliable transactions. The two sample cases on the yeast integrated data clearly demonstrate the value of a sole database environment for systems biology research.

  16. Integrated data management for RODOS

    International Nuclear Information System (INIS)

    Abramowicz, K.; Koschel, A.; Rafat, M.; Wendelgass, R.

    1995-12-01

    The report presents the results of a feasibility study on an integrated data organisation and management in RODOS, the real-time on-line decision support system for off-site nuclear emergency management. The conceptual design of the functional components of the integrated data management are described taking account of the software components and the operation environment of the RODOS system. In particular, the scheme architecture of a database integration manager for accessing and updating a multi-database system is discussed in detail under a variety of database management aspects. Furthermore, the structural design of both a simple knowledge database and a real-time database are described. Finally, some short comments on the benefits and disadvantages of the proposed concept of data integration in RODOS are given. (orig.) [de

  17. Respiratory cancer database: An open access database of respiratory cancer gene and miRNA.

    Science.gov (United States)

    Choubey, Jyotsna; Choudhari, Jyoti Kant; Patel, Ashish; Verma, Mukesh Kumar

    2017-01-01

    Respiratory cancer database (RespCanDB) is a genomic and proteomic database of cancer of respiratory organ. It also includes the information of medicinal plants used for the treatment of various respiratory cancers with structure of its active constituents as well as pharmacological and chemical information of drug associated with various respiratory cancers. Data in RespCanDB has been manually collected from published research article and from other databases. Data has been integrated using MySQL an object-relational database management system. MySQL manages all data in the back-end and provides commands to retrieve and store the data into the database. The web interface of database has been built in ASP. RespCanDB is expected to contribute to the understanding of scientific community regarding respiratory cancer biology as well as developments of new way of diagnosing and treating respiratory cancer. Currently, the database consist the oncogenomic information of lung cancer, laryngeal cancer, and nasopharyngeal cancer. Data for other cancers, such as oral and tracheal cancers, will be added in the near future. The URL of RespCanDB is http://ridb.subdic-bioinformatics-nitrr.in/.

  18. Databases in Cloud - Solutions for Developing Renewable Energy Informatics Systems

    Directory of Open Access Journals (Sweden)

    Adela BARA

    2017-08-01

    Full Text Available The paper presents the data model of a decision support prototype developed for generation monitoring, forecasting and advanced analysis in the renewable energy filed. The solutions considered for developing this system include databases in cloud, XML integration, spatial data representation and multidimensional modeling. This material shows the advantages of Cloud databases and spatial data representation and their implementation in Oracle Database 12 c. Also, it contains a data integration part and a multidimensional analysis. The presentation of output data is made using dashboards.

  19. A Reaction Database for Small Molecule Pharmaceutical Processes Integrated with Process Information

    Directory of Open Access Journals (Sweden)

    Emmanouil Papadakis

    2017-10-01

    Full Text Available This article describes the development of a reaction database with the objective to collect data for multiphase reactions involved in small molecule pharmaceutical processes with a search engine to retrieve necessary data in investigations of reaction-separation schemes, such as the role of organic solvents in reaction performance improvement. The focus of this reaction database is to provide a data rich environment with process information available to assist during the early stage synthesis of pharmaceutical products. The database is structured in terms of reaction classification of reaction types; compounds participating in the reaction; use of organic solvents and their function; information for single step and multistep reactions; target products; reaction conditions and reaction data. Information for reactor scale-up together with information for the separation and other relevant information for each reaction and reference are also available in the database. Additionally, the retrieved information obtained from the database can be evaluated in terms of sustainability using well-known “green” metrics published in the scientific literature. The application of the database is illustrated through the synthesis of ibuprofen, for which data on different reaction pathways have been retrieved from the database and compared using “green” chemistry metrics.

  20. Professional iOS database application programming

    CERN Document Server

    Alessi, Patrick

    2013-01-01

    Updated and revised coverage that includes the latest versions of iOS and Xcode Whether you're a novice or experienced developer, you will want to dive into this updated resource on database application programming for the iPhone and iPad. Packed with more than 50 percent new and revised material - including completely rebuilt code, screenshots, and full coverage of new features pertaining to database programming and enterprise integration in iOS 6 - this must-have book intends to continue the precedent set by the previous edition by helping thousands of developers master database

  1. Concurrency control in distributed database systems

    CERN Document Server

    Cellary, W; Gelenbe, E

    1989-01-01

    Distributed Database Systems (DDBS) may be defined as integrated database systems composed of autonomous local databases, geographically distributed and interconnected by a computer network.The purpose of this monograph is to present DDBS concurrency control algorithms and their related performance issues. The most recent results have been taken into consideration. A detailed analysis and selection of these results has been made so as to include those which will promote applications and progress in the field. The application of the methods and algorithms presented is not limited to DDBSs but a

  2. The ESID Online Database network.

    Science.gov (United States)

    Guzman, D; Veit, D; Knerr, V; Kindle, G; Gathmann, B; Eades-Perner, A M; Grimbacher, B

    2007-03-01

    Primary immunodeficiencies (PIDs) belong to the group of rare diseases. The European Society for Immunodeficiencies (ESID), is establishing an innovative European patient and research database network for continuous long-term documentation of patients, in order to improve the diagnosis, classification, prognosis and therapy of PIDs. The ESID Online Database is a web-based system aimed at data storage, data entry, reporting and the import of pre-existing data sources in an enterprise business-to-business integration (B2B). The online database is based on Java 2 Enterprise System (J2EE) with high-standard security features, which comply with data protection laws and the demands of a modern research platform. The ESID Online Database is accessible via the official website (http://www.esid.org/). Supplementary data are available at Bioinformatics online.

  3. Transaction management with integrity checking

    DEFF Research Database (Denmark)

    Martinenghi, Davide; Christiansen, Henning

    2005-01-01

    Database integrity constraints, understood as logical conditions that must hold for any database state, are not fully supported by current database technology. It is typically up to the database designer and application programmer to enforce integrity via triggers or tests at the application level....... 2.~In concurrent database systems, besides the traditional correctness criterion, the execution schedule must ensure that the different transactions can overlap in time without destroying the consistency requirements tested by other, concurrent transactions....

  4. Single-cell protein secretomic signatures as potential correlates to tumor cell lineage evolution and cell-cell interaction

    Directory of Open Access Journals (Sweden)

    Minsuk eKwak

    2013-02-01

    Full Text Available Secreted proteins including cytokines, chemokines and growth factors represent important functional regulators mediating a range of cellular behavior and cell-cell paracrine/autocrine signaling, e.g. in the immunological system, tumor microenvironment or stem cell niche. Detection of these proteins is of great value not only in basic cell biology but also for diagnosis and therapeutic monitoring of human diseases such as cancer. However, due to co-production of multiple effector proteins from a single cell, referred to as polyfunctionality, it is biologically informative to measure a panel of secreted proteins, or secretomic signature, at the level of single cells. Recent evidence further indicates that a genetically-identical cell population can give rise to diverse phenotypic differences. It is known that cytokines, for example, in the immune system define the effector functions and lineage differentiation of immune cells. In this Perspective Article, we hypothesize that protein secretion profile may represent a universal measure to identify the definitive correlate in the larger context of cellular functions to dissect cellular heterogeneity and evolutionary lineage relationship in human cancer.

  5. Follicle Online: an integrated database of follicle assembly, development and ovulation.

    Science.gov (United States)

    Hua, Juan; Xu, Bo; Yang, Yifan; Ban, Rongjun; Iqbal, Furhan; Cooke, Howard J; Zhang, Yuanwei; Shi, Qinghua

    2015-01-01

    Folliculogenesis is an important part of ovarian function as it provides the oocytes for female reproductive life. Characterizing genes/proteins involved in folliculogenesis is fundamental for understanding the mechanisms associated with this biological function and to cure the diseases associated with folliculogenesis. A large number of genes/proteins associated with folliculogenesis have been identified from different species. However, no dedicated public resource is currently available for folliculogenesis-related genes/proteins that are validated by experiments. Here, we are reporting a database 'Follicle Online' that provides the experimentally validated gene/protein map of the folliculogenesis in a number of species. Follicle Online is a web-based database system for storing and retrieving folliculogenesis-related experimental data. It provides detailed information for 580 genes/proteins (from 23 model organisms, including Homo sapiens, Mus musculus, Rattus norvegicus, Mesocricetus auratus, Bos Taurus, Drosophila and Xenopus laevis) that have been reported to be involved in folliculogenesis, POF (premature ovarian failure) and PCOS (polycystic ovary syndrome). The literature was manually curated from more than 43,000 published articles (till 1 March 2014). The Follicle Online database is implemented in PHP + MySQL + JavaScript and this user-friendly web application provides access to the stored data. In summary, we have developed a centralized database that provides users with comprehensive information about genes/proteins involved in folliculogenesis. This database can be accessed freely and all the stored data can be viewed without any registration. Database URL: http://mcg.ustc.edu.cn/sdap1/follicle/index.php © The Author(s) 2015. Published by Oxford University Press.

  6. The PEP-II project-wide database

    International Nuclear Information System (INIS)

    Chan, A.; Calish, S.; Crane, G.; MacGregor, I.; Meyer, S.; Wong, J.

    1995-05-01

    The PEP-II Project Database is a tool for monitoring the technical and documentation aspects of this accelerator construction. It holds the PEP-II design specifications, fabrication and installation data in one integrated system. Key pieces of the database include the machine parameter list, magnet and vacuum fabrication data. CAD drawings, publications and documentation, survey and alignment data and property control. The database can be extended to contain information required for the operations phase of the accelerator and detector. Features such as viewing CAD drawing graphics from the database will be implemented in the future. This central Oracle database on a UNIX server is built using ORACLE Case tools. Users at the three collaborating laboratories (SLAC, LBL, LLNL) can access the data remotely, using various desktop computer platforms and graphical interfaces

  7. Coordinate Systems Integration for Craniofacial Database from Multimodal Devices

    Directory of Open Access Journals (Sweden)

    Deni Suwardhi

    2005-05-01

    Full Text Available This study presents a data registration method for craniofacial spatial data of different modalities. The data consists of three dimensional (3D vector and raster data models. The data is stored in object relational database. The data capture devices are Laser scanner, CT (Computed Tomography scan and CR (Close Range Photogrammetry. The objective of the registration is to transform the data from various coordinate systems into a single 3-D Cartesian coordinate system. The standard error of the registration obtained from multimodal imaging devices using 3D affine transformation is in the ranged of 1-2 mm. This study is a step forward for storing the craniofacial spatial data in one reference system in database.

  8. Development and implementation of a custom integrated database with dashboards to assist with hematopathology specimen triage and traffic

    Directory of Open Access Journals (Sweden)

    Elizabeth M Azzato

    2014-01-01

    Full Text Available Background: At some institutions, including ours, bone marrow aspirate specimen triage is complex, with hematopathology triage decisions that need to be communicated to downstream ancillary testing laboratories and many specimen aliquot transfers that are handled outside of the laboratory information system (LIS. We developed a custom integrated database with dashboards to facilitate and streamline this workflow. Methods: We developed user-specific dashboards that allow entry of specimen information by technologists in the hematology laboratory, have custom scripting to present relevant information for the hematopathology service and ancillary laboratories and allow communication of triage decisions from the hematopathology service to other laboratories. These dashboards are web-accessible on the local intranet and accessible from behind the hospital firewall on a computer or tablet. Secure user access and group rights ensure that relevant users can edit or access appropriate records. Results: After database and dashboard design, two-stage beta-testing and user education was performed, with the first focusing on technologist specimen entry and the second on downstream users. Commonly encountered issues and user functionality requests were resolved with database and dashboard redesign. Final implementation occurred within 6 months of initial design; users report improved triage efficiency and reduced need for interlaboratory communications. Conclusions: We successfully developed and implemented a custom database with dashboards that facilitates and streamlines our hematopathology bone marrow aspirate triage. This provides an example of a possible solution to specimen communications and traffic that are outside the purview of a standard LIS.

  9. Electronic database of arterial aneurysms

    Directory of Open Access Journals (Sweden)

    Fabiano Luiz Erzinger

    2014-12-01

    Full Text Available Background:The creation of an electronic database facilitates the storage of information, as well as streamlines the exchange of data, making easier the exchange of knowledge for future research.Objective:To construct an electronic database containing comprehensive and up-to-date clinical and surgical data on the most common arterial aneurysms, to help advance scientific research.Methods:The most important specialist textbooks and articles found in journals and on internet databases were reviewed in order to define the basic structure of the protocol. Data were computerized using the SINPE© system for integrated electronic protocols and tested in a pilot study.Results:The data entered onto the system was first used to create a Master protocol, organized into a structure of top-level directories covering a large proportion of the content on vascular diseases as follows: patient history; physical examination; supplementary tests and examinations; diagnosis; treatment; and clinical course. By selecting items from the Master protocol, Specific protocols were then created for the 22 arterial sites most often involved by aneurysms. The program provides a method for collection of data on patients including clinical characteristics (patient history and physical examination, supplementary tests and examinations, treatments received and follow-up care after treatment. Any information of interest on these patients that is contained in the protocol can then be used to query the database and select data for studies.Conclusions:It proved possible to construct a database of clinical and surgical data on the arterial aneurysms of greatest interest and, by adapting the data to specific software, the database was integrated into the SINPE© system, thereby providing a standardized method for collection of data on these patients and tools for retrieving this information in an organized manner for use in scientific studies.

  10. Customer database for Watrec Oy

    OpenAIRE

    Melnichikhina, Ekaterina

    2016-01-01

    This thesis is a development project for Watrec Oy. Watrec Oy is a Finnish company specializes in “waste-to-energy” issues. Customer Relation Management (CRM) strategies are now being applied within the company. The customer database is the first and trial step towards CRM strategy in Watrec Oy. The reasons for database project lie in lacking of clear customers’ data. The main objectives are: - To integrate the customers’ and project data; - To improve the level of sales and mar...

  11. Development of IAEA nuclear reaction databases and services

    Energy Technology Data Exchange (ETDEWEB)

    Zerkin, V.; Trkov, A. [International Atomic Energy Agency, Dept. of Nuclear Sciences and Applications, Vienna (Austria)

    2008-07-01

    From mid-2004 onwards, the major nuclear reaction databases (EXFOR, CINDA and Endf) and services (Web and CD-Roms retrieval systems and specialized applications) have been functioning within a modern computing environment as multi-platform software, working under several operating systems with relational databases. Subsequent work at the IAEA has focused on three areas of development: revision and extension of the contents of the databases; extension and improvement of the functionality and integrity of the retrieval systems; development of software for database maintenance and system deployment. (authors)

  12. Large scale access tests and online interfaces to ATLAS conditions databases

    International Nuclear Information System (INIS)

    Amorim, A; Lopes, L; Pereira, P; Simoes, J; Soloviev, I; Burckhart, D; Schmitt, J V D; Caprini, M; Kolos, S

    2008-01-01

    The access of the ATLAS Trigger and Data Acquisition (TDAQ) system to the ATLAS Conditions Databases sets strong reliability and performance requirements on the database storage and access infrastructures. Several applications were developed to support the integration of Conditions database access with the online services in TDAQ, including the interface to the Information Services (IS) and to the TDAQ Configuration Databases. The information storage requirements were the motivation for the ONline A Synchronous Interface to COOL (ONASIC) from the Information Service (IS) to LCG/COOL databases. ONASIC avoids the possible backpressure from Online Database servers by managing a local cache. In parallel, OKS2COOL was developed to store Configuration Databases into an Offline Database with history record. The DBStressor application was developed to test and stress the access to the Conditions database using the LCG/COOL interface while operating in an integrated way as a TDAQ application. The performance scaling of simultaneous Conditions database read accesses was studied in the context of the ATLAS High Level Trigger large computing farms. A large set of tests were performed involving up to 1000 computing nodes that simultaneously accessed the LCG central database server infrastructure at CERN

  13. Secure Distributed Databases Using Cryptography

    Directory of Open Access Journals (Sweden)

    Ion IVAN

    2006-01-01

    Full Text Available The computational encryption is used intensively by different databases management systems for ensuring privacy and integrity of information that are physically stored in files. Also, the information is sent over network and is replicated on different distributed systems. It is proved that a satisfying level of security is achieved if the rows and columns of tables are encrypted independently of table or computer that sustains the data. Also, it is very important that the SQL - Structured Query Language query requests and responses to be encrypted over the network connection between the client and databases server. All this techniques and methods must be implemented by the databases administrators, designer and developers in a consistent security policy.

  14. ADANS database specification

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-01-16

    The purpose of the Air Mobility Command (AMC) Deployment Analysis System (ADANS) Database Specification (DS) is to describe the database organization and storage allocation and to provide the detailed data model of the physical design and information necessary for the construction of the parts of the database (e.g., tables, indexes, rules, defaults). The DS includes entity relationship diagrams, table and field definitions, reports on other database objects, and a description of the ADANS data dictionary. ADANS is the automated system used by Headquarters AMC and the Tanker Airlift Control Center (TACC) for airlift planning and scheduling of peacetime and contingency operations as well as for deliberate planning. ADANS also supports planning and scheduling of Air Refueling Events by the TACC and the unit-level tanker schedulers. ADANS receives input in the form of movement requirements and air refueling requests. It provides a suite of tools for planners to manipulate these requirements/requests against mobility assets and to develop, analyze, and distribute schedules. Analysis tools are provided for assessing the products of the scheduling subsystems, and editing capabilities support the refinement of schedules. A reporting capability provides formatted screen, print, and/or file outputs of various standard reports. An interface subsystem handles message traffic to and from external systems. The database is an integral part of the functionality summarized above.

  15. Solutions for medical databases optimal exploitation.

    Science.gov (United States)

    Branescu, I; Purcarea, V L; Dobrescu, R

    2014-03-15

    The paper discusses the methods to apply OLAP techniques for multidimensional databases that leverage the existing, performance-enhancing technique, known as practical pre-aggregation, by making this technique relevant to a much wider range of medical applications, as a logistic support to the data warehousing techniques. The transformations have practically low computational complexity and they may be implemented using standard relational database technology. The paper also describes how to integrate the transformed hierarchies in current OLAP systems, transparently to the user and proposes a flexible, "multimodel" federated system for extending OLAP querying to external object databases.

  16. Simplification of integrity constraints for data integration

    DEFF Research Database (Denmark)

    Christiansen, Henning; Martinenghi, Davide

    2004-01-01

    , because either the global database is known to be consistent or suitable actions have been taken to provide consistent views. The present work generalizes simplification techniques for integrity checking in traditional databases to the combined case. Knowledge of local consistency is employed, perhaps...

  17. SmallSat Database

    Science.gov (United States)

    Petropulos, Dolores; Bittner, David; Murawski, Robert; Golden, Bert

    2015-01-01

    The SmallSat has an unrealized potential in both the private industry and in the federal government. Currently over 70 companies, 50 universities and 17 governmental agencies are involved in SmallSat research and development. In 1994, the U.S. Army Missile and Defense mapped the moon using smallSat imagery. Since then Smart Phones have introduced this imagery to the people of the world as diverse industries watched this trend. The deployment cost of smallSats is also greatly reduced compared to traditional satellites due to the fact that multiple units can be deployed in a single mission. Imaging payloads have become more sophisticated, smaller and lighter. In addition, the growth of small technology obtained from private industries has led to the more widespread use of smallSats. This includes greater revisit rates in imagery, significantly lower costs, the ability to update technology more frequently and the ability to decrease vulnerability of enemy attacks. The popularity of smallSats show a changing mentality in this fast paced world of tomorrow. What impact has this created on the NASA communication networks now and in future years? In this project, we are developing the SmallSat Relational Database which can support a simulation of smallSats within the NASA SCaN Compatability Environment for Networks and Integrated Communications (SCENIC) Modeling and Simulation Lab. The NASA Space Communications and Networks (SCaN) Program can use this modeling to project required network support needs in the next 10 to 15 years. The SmallSat Rational Database could model smallSats just as the other SCaN databases model the more traditional larger satellites, with a few exceptions. One being that the smallSat Database is designed to be built-to-order. The SmallSat database holds various hardware configurations that can be used to model a smallSat. It will require significant effort to develop as the research material can only be populated by hand to obtain the unique data

  18. DEVELOPING FLEXIBLE APPLICATIONS WITH XML AND DATABASE INTEGRATION

    Directory of Open Access Journals (Sweden)

    Hale AS

    2004-04-01

    Full Text Available In recent years the most popular subject in Information System area is Enterprise Application Integration (EAI. It can be defined as a process of forming a standart connection between different systems of an organization?s information system environment. The incorporating, gaining and marriage of corporations are the major reasons of popularity in Enterprise Application Integration. The main purpose is to solve the application integrating problems while similar systems in such corporations continue working together for a more time. With the help of XML technology, it is possible to find solutions to the problems of application integration either within the corporation or between the corporations.

  19. A Generative Approach for Building Database Federations

    Directory of Open Access Journals (Sweden)

    Uwe Hohenstein

    1999-11-01

    Full Text Available A comprehensive, specification-based approach for building database federations is introduced that supports an integrated ODMG2.0 conforming access to heterogeneous data sources seamlessly done in C++. The approach is centered around several generators. A first set of generators produce ODMG adapters for local sources in order to homogenize them. Each adapter represents an ODMG view and supports the ODMG manipulation and querying. The adapters can be plugged into a federation framework. Another generator produces an homogeneous and uniform view by putting an ODMG conforming federation layer on top of the adapters. Input to these generators are schema specifications. Schemata are defined in corresponding specification languages. There are languages to homogenize relational and object-oriented databases, as well as ordinary file systems. Any specification defines an ODMG schema and relates it to an existing data source. An integration language is then used to integrate the schemata and to build system-spanning federated views thereupon. The generative nature provides flexibility with respect to schema modification of component databases. Any time a schema changes, only the specification has to be adopted; new adapters are generated automatically

  20. Legume and Lotus japonicus Databases

    DEFF Research Database (Denmark)

    Hirakawa, Hideki; Mun, Terry; Sato, Shusei

    2014-01-01

    Since the genome sequence of Lotus japonicus, a model plant of family Fabaceae, was determined in 2008 (Sato et al. 2008), the genomes of other members of the Fabaceae family, soybean (Glycine max) (Schmutz et al. 2010) and Medicago truncatula (Young et al. 2011), have been sequenced. In this sec....... In this section, we introduce representative, publicly accessible online resources related to plant materials, integrated databases containing legume genome information, and databases for genome sequence and derived marker information of legume species including L. japonicus...

  1. System/subsystem specifications for the Worldwide Port System (WPS) Regional Integrated Cargo Database (ICDB)

    Energy Technology Data Exchange (ETDEWEB)

    Rollow, J.P.; Shipe, P.C.; Truett, L.F. [Oak Ridge National Lab., TN (United States); Faby, E.Z.; Fluker, J.; Grubb, J.; Hancock, B.R. [Univ. of Tennessee, Knoxville, TN (United States); Ferguson, R.A. [Science Applications International Corp., Oak Ridge, TN (United States)

    1995-11-20

    A system is being developed by the Military Traffic Management Command (MTMC) to provide data integration and worldwide management and tracking of surface cargo movements. The Integrated Cargo Database (ICDB) will be a data repository for the WPS terminal-level system, will be a primary source of queries and cargo traffic reports, will receive data from and provide data to other MTMC and non-MTMC systems, will provide capabilities for processing Advance Transportation Control and Movement Documents (ATCMDs), and will process and distribute manifests. This System/Subsystem Specifications for the Worldwide Port System Regional ICDB documents the system/subsystem functions, provides details of the system/subsystem analysis in order to provide a communication link between developers and operational personnel, and identifies interfaces with other systems and subsystems. It must be noted that this report is being produced near the end of the initial development phase of ICDB, while formal software testing is being done. Following the initial implementation of the ICDB system, maintenance contractors will be in charge of making changes and enhancing software modules. Formal testing and user reviews may indicate the need for additional software units or changes to existing ones. This report describes the software units that are components of this ICDB system as of August 1995.

  2. Nuclear data processing using a database management system

    International Nuclear Information System (INIS)

    Castilla, V.; Gonzalez, L.

    1991-01-01

    A database management system that permits the design of relational models was used to create an integrated database with experimental and evaluated nuclear data.A system that reduces the time and cost of processing was created for computers type EC or compatibles.A set of programs for the conversion from nuclear calculated data output format to EXFOR format was developed.A dictionary to perform a retrospective search in the ENDF database was created too

  3. [Research and development of medical case database: a novel medical case information system integrating with biospecimen management].

    Science.gov (United States)

    Pan, Shiyang; Mu, Yuan; Wang, Hong; Wang, Tong; Huang, Peijun; Ma, Jianfeng; Jiang, Li; Zhang, Jie; Gu, Bing; Yi, Lujiang

    2010-04-01

    To meet the needs of management of medical case information and biospecimen simultaneously, we developed a novel medical case information system integrating with biospecimen management. The database established by MS SQL Server 2000 covered, basic information, clinical diagnosis, imaging diagnosis, pathological diagnosis and clinical treatment of patient; physicochemical property, inventory management and laboratory analysis of biospecimen; users log and data maintenance. The client application developed by Visual C++ 6.0 was used to implement medical case and biospecimen management, which was based on Client/Server model. This system can perform input, browse, inquest, summary of case and related biospecimen information, and can automatically synthesize case-records based on the database. Management of not only a long-term follow-up on individual, but also of grouped cases organized according to the aim of research can be achieved by the system. This system can improve the efficiency and quality of clinical researches while biospecimens are used coordinately. It realizes synthesized and dynamic management of medical case and biospecimen, which may be considered as a new management platform.

  4. Uses and limitations of registry and academic databases.

    Science.gov (United States)

    Williams, William G

    2010-01-01

    A database is simply a structured collection of information. A clinical database may be a Registry (a limited amount of data for every patient undergoing heart surgery) or Academic (an organized and extensive dataset of an inception cohort of carefully selected subset of patients). A registry and an academic database have different purposes and cost. The data to be collected for a database is defined by its purpose and the output reports required for achieving that purpose. A Registry's purpose is to ensure quality care, an Academic Database, to discover new knowledge through research. A database is only as good as the data it contains. Database personnel must be exceptionally committed and supported by clinical faculty. A system to routinely validate and verify data integrity is essential to ensure database utility. Frequent use of the database improves its accuracy. For congenital heart surgeons, routine use of a Registry Database is an essential component of clinical practice. Copyright (c) 2010 Elsevier Inc. All rights reserved.

  5. MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource based on the first complete plant genome

    Science.gov (United States)

    Schoof, Heiko; Zaccaria, Paolo; Gundlach, Heidrun; Lemcke, Kai; Rudd, Stephen; Kolesov, Grigory; Arnold, Roland; Mewes, H. W.; Mayer, Klaus F. X.

    2002-01-01

    Arabidopsis thaliana is the first plant for which the complete genome has been sequenced and published. Annotation of complex eukaryotic genomes requires more than the assignment of genetic elements to the sequence. Besides completing the list of genes, we need to discover their cellular roles, their regulation and their interactions in order to understand the workings of the whole plant. The MIPS Arabidopsis thaliana Database (MAtDB; http://mips.gsf.de/proj/thal/db) started out as a repository for genome sequence data in the European Scientists Sequencing Arabidopsis (ESSA) project and the Arabidopsis Genome Initiative. Our aim is to transform MAtDB into an integrated biological knowledge resource by integrating diverse data, tools, query and visualization capabilities and by creating a comprehensive resource for Arabidopsis as a reference model for other species, including crop plants. PMID:11752263

  6. 77 FR 71089 - Pilot Loading of Aeronautical Database Updates

    Science.gov (United States)

    2012-11-29

    ...) card, rather than in resident memory. The database update was accomplished by removing the SD card with... frequency distance measuring equipment (DME), and any updates that affect system operating software--that... developed with attention to data integrity. Current technology uses databases which are developed in...

  7. Integrative analysis to select cancer candidate biomarkers to targeted validation

    Science.gov (United States)

    Heberle, Henry; Domingues, Romênia R.; Granato, Daniela C.; Yokoo, Sami; Canevarolo, Rafael R.; Winck, Flavia V.; Ribeiro, Ana Carolina P.; Brandão, Thaís Bianca; Filgueiras, Paulo R.; Cruz, Karen S. P.; Barbuto, José Alexandre; Poppi, Ronei J.; Minghim, Rosane; Telles, Guilherme P.; Fonseca, Felipe Paiva; Fox, Jay W.; Santos-Silva, Alan R.; Coletta, Ricardo D.; Sherman, Nicholas E.; Paes Leme, Adriana F.

    2015-01-01

    Targeted proteomics has flourished as the method of choice for prospecting for and validating potential candidate biomarkers in many diseases. However, challenges still remain due to the lack of standardized routines that can prioritize a limited number of proteins to be further validated in human samples. To help researchers identify candidate biomarkers that best characterize their samples under study, a well-designed integrative analysis pipeline, comprising MS-based discovery, feature selection methods, clustering techniques, bioinformatic analyses and targeted approaches was performed using discovery-based proteomic data from the secretomes of three classes of human cell lines (carcinoma, melanoma and non-cancerous). Three feature selection algorithms, namely, Beta-binomial, Nearest Shrunken Centroids (NSC), and Support Vector Machine-Recursive Features Elimination (SVM-RFE), indicated a panel of 137 candidate biomarkers for carcinoma and 271 for melanoma, which were differentially abundant between the tumor classes. We further tested the strength of the pipeline in selecting candidate biomarkers by immunoblotting, human tissue microarrays, label-free targeted MS and functional experiments. In conclusion, the proposed integrative analysis was able to pre-qualify and prioritize candidate biomarkers from discovery-based proteomics to targeted MS. PMID:26540631

  8. The NAGRA/PSI thermochemical database: new developments

    International Nuclear Information System (INIS)

    Hummel, W.; Berner, U.; Thoenen, T.; Pearson, F.J.Jr.

    2000-01-01

    The development of a high quality thermochemical database for performance assessment is a scientifically fascinating and demanding task, and is not simply collecting and recording numbers. The final product can by visualised as a complex building with different storeys representing different levels of complexity. The present status report illustrates the various building blocks which we believe are integral to such a database structure. (authors)

  9. The NAGRA/PSI thermochemical database: new developments

    Energy Technology Data Exchange (ETDEWEB)

    Hummel, W.; Berner, U.; Thoenen, T. [Paul Scherrer Inst. (PSI), Villigen (Switzerland); Pearson, F.J.Jr. [Ground-Water Geochemistry, New Bern, NC (United States)

    2000-07-01

    The development of a high quality thermochemical database for performance assessment is a scientifically fascinating and demanding task, and is not simply collecting and recording numbers. The final product can by visualised as a complex building with different storeys representing different levels of complexity. The present status report illustrates the various building blocks which we believe are integral to such a database structure. (authors)

  10. Database Vs Data Warehouse

    Directory of Open Access Journals (Sweden)

    2007-01-01

    Full Text Available Data warehouse technology includes a set of concepts and methods that offer the users useful information for decision making. The necessity to build a data warehouse arises from the necessity to improve the quality of information in the organization. The date proceeding from different sources, having a variety of forms - both structured and unstructured, are filtered according to business rules and are integrated in a single large data collection. Using informatics solutions, managers have understood that data stored in operational systems - including databases, are an informational gold mine that must be exploited. Data warehouses have been developed to answer the increasing demands for complex analysis, which could not be properly achieved with operational databases. The present paper emphasizes some of the criteria that information application developers can use in order to choose between a database solution or a data warehouse one.

  11. Development of a PSA information database system

    International Nuclear Information System (INIS)

    Kim, Seung Hwan

    2005-01-01

    The need to develop the PSA information database for performing a PSA has been growing rapidly. For example, performing a PSA requires a lot of data to analyze, to evaluate the risk, to trace the process of results and to verify the results. PSA information database is a system that stores all PSA related information into the database and file system with cross links to jump to the physical documents whenever they are needed. Korea Atomic Energy Research Institute is developing a PSA information database system, AIMS (Advanced Information Management System for PSA). The objective is to integrate and computerize all the distributed information of a PSA into a system and to enhance the accessibility to PSA information for all PSA related activities. This paper describes how we implemented such a database centered application in the view of two areas, database design and data (document) service

  12. Database Aspects of Location-Based Services

    DEFF Research Database (Denmark)

    Jensen, Christian Søndergaard

    2004-01-01

    in the databases underlying high-quality services. Several integrated representations - which capture different aspects of the same infrastructure - are needed. Further, all other content that can be related to geographical space must be integrated with the infrastructure representations. The chapter describes...... the general concepts underlying one approach to data modeling for location-based services. The chapter also covers techniques that are needed to keep a database for location-based services up to date with the reality it models. As part of this, caching is touched upon briefly. The notion of linear referencing......Adopting a data management perspective on location-based services, this chapter explores central challenges to data management posed by location-based services. Because service users typically travel in, and are constrained to, transportation infrastructures, such structures must be represented...

  13. Some Considerations about Modern Database Machines

    Directory of Open Access Journals (Sweden)

    Manole VELICANU

    2010-01-01

    Full Text Available Optimizing the two computing resources of any computing system - time and space - has al-ways been one of the priority objectives of any database. A current and effective solution in this respect is the computer database. Optimizing computer applications by means of database machines has been a steady preoccupation of researchers since the late seventies. Several information technologies have revolutionized the present information framework. Out of these, those which have brought a major contribution to the optimization of the databases are: efficient handling of large volumes of data (Data Warehouse, Data Mining, OLAP – On Line Analytical Processing, the improvement of DBMS – Database Management Systems facilities through the integration of the new technologies, the dramatic increase in computing power and the efficient use of it (computer networks, massive parallel computing, Grid Computing and so on. All these information technologies, and others, have favored the resumption of the research on database machines and the obtaining in the last few years of some very good practical results, as far as the optimization of the computing resources is concerned.

  14. PostGIS-Based Heterogeneous Sensor Database Framework for the Sensor Observation Service

    Directory of Open Access Journals (Sweden)

    Ikechukwu Maduako

    2012-10-01

    Full Text Available Environmental monitoring and management systems in most cases deal with models and spatial analytics that involve the integration of in-situ and remote sensor observations. In-situ sensor observations and those gathered by remote sensors are usually provided by different databases and services in real-time dynamic services such as the Geo-Web Services. Thus, data have to be pulled from different databases and transferred over the network before they are fused and processed on the service middleware. This process is very massive and unnecessary communication and work load on the service. Massive work load in large raster downloads from flat-file raster data sources each time a request is made and huge integration and geo-processing work load on the service middleware which could actually be better leveraged at the database level. In this paper, we propose and present a heterogeneous sensor database framework or model for integration, geo-processing and spatial analysis of remote and in-situ sensor observations at the database level.  And how this can be integrated in the Sensor Observation Service, SOS to reduce communication and massive workload on the Geospatial Web Services and as well make query request from the user end a lot more flexible.

  15. International Nuclear Safety Center (INSC) database

    International Nuclear Information System (INIS)

    Sofu, T.; Ley, H.; Turski, R.B.

    1997-01-01

    As an integral part of DOE's International Nuclear Safety Center (INSC) at Argonne National Laboratory, the INSC Database has been established to provide an interactively accessible information resource for the world's nuclear facilities and to promote free and open exchange of nuclear safety information among nations. The INSC Database is a comprehensive resource database aimed at a scope and level of detail suitable for safety analysis and risk evaluation for the world's nuclear power plants and facilities. It also provides an electronic forum for international collaborative safety research for the Department of Energy and its international partners. The database is intended to provide plant design information, material properties, computational tools, and results of safety analysis. Initial emphasis in data gathering is given to Soviet-designed reactors in Russia, the former Soviet Union, and Eastern Europe. The implementation is performed under the Oracle database management system, and the World Wide Web is used to serve as the access path for remote users. An interface between the Oracle database and the Web server is established through a custom designed Web-Oracle gateway which is used mainly to perform queries on the stored data in the database tables

  16. Towards a Component Based Model for Database Systems

    Directory of Open Access Journals (Sweden)

    Octavian Paul ROTARU

    2004-02-01

    Full Text Available Due to their effectiveness in the design and development of software applications and due to their recognized advantages in terms of reusability, Component-Based Software Engineering (CBSE concepts have been arousing a great deal of interest in recent years. This paper presents and extends a component-based approach to object-oriented database systems (OODB introduced by us in [1] and [2]. Components are proposed as a new abstraction level for database system, logical partitions of the schema. In this context, the scope is introduced as an escalated property for transactions. Components are studied from the integrity, consistency, and concurrency control perspective. The main benefits of our proposed component model for OODB are the reusability of the database design, including the access statistics required for a proper query optimization, and a smooth information exchange. The integration of crosscutting concerns into the component database model using aspect-oriented techniques is also discussed. One of the main goals is to define a method for the assessment of component composition capabilities. These capabilities are restricted by the component’s interface and measured in terms of adaptability, degree of compose-ability and acceptability level. The above-mentioned metrics are extended from database components to generic software components. This paper extends and consolidates into one common view the ideas previously presented by us in [1, 2, 3].[1] Octavian Paul Rotaru, Marian Dobre, Component Aspects in Object Oriented Databases, Proceedings of the International Conference on Software Engineering Research and Practice (SERP’04, Volume II, ISBN 1-932415-29-7, pages 719-725, Las Vegas, NV, USA, June 2004.[2] Octavian Paul Rotaru, Marian Dobre, Mircea Petrescu, Integrity and Consistency Aspects in Component-Oriented Databases, Proceedings of the International Symposium on Innovation in Information and Communication Technology (ISIICT

  17. RAACFDb: Rheumatoid arthritis ayurvedic classical formulations database.

    Science.gov (United States)

    Mohamed Thoufic Ali, A M; Agrawal, Aakash; Sajitha Lulu, S; Mohana Priya, A; Vino, S

    2017-02-02

    In the past years, the treatment of rheumatoid arthritis (RA) has undergone remarkable changes in all therapeutic modes. The present newfangled care in clinical research is to determine and to pick a new track for better treatment options for RA. Recent ethnopharmacological investigations revealed that traditional herbal remedies are the most preferred modality of complementary and alternative medicine (CAM). However, several ayurvedic modes of treatments and formulations for RA are not much studied and documented from Indian traditional system of medicine. Therefore, this directed us to develop an integrated database, RAACFDb (acronym: Rheumatoid Arthritis Ayurvedic Classical Formulations Database) by consolidating data from the repository of Vedic Samhita - The Ayurveda to retrieve the available formulations information easily. Literature data was gathered using several search engines and from ayurvedic practitioners for loading information in the database. In order to represent the collected information about classical ayurvedic formulations, an integrated database is constructed and implemented on a MySQL and PHP back-end. The database is supported by describing all the ayurvedic classical formulations for the treatment rheumatoid arthritis. It includes composition, usage, plant parts used, active ingredients present in the composition and their structures. The prime objective is to locate ayurvedic formulations proven to be quite successful and highly effective among the patients with reduced side effects. The database (freely available at www.beta.vit.ac.in/raacfdb/index.html) hopefully enables easy access for clinical researchers and students to discover novel leads with reduced side effects. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  18. Development of database systems for safety of repositories for disposal of radioactive wastes

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Yeong Hun; Han, Jeong Sang; Shin, Hyeon Jun; Ham, Sang Won; Kim, Hye Seong [Yonsei Univ., Seoul (Korea, Republic of)

    1999-03-15

    In the study, GSIS os developed for the maximizing effectiveness of the database system. For this purpose, the spatial relation of data from various fields that are constructed in the database which was developed for the site selection and management of repository for radioactive waste disposal. By constructing the integration system that can link attribute and spatial data, it is possible to evaluate the safety of repository effectively and economically. The suitability of integrating database and GSIS is examined by constructing the database in the test district where the site characteristics are similar to that of repository for radioactive waste disposal.

  19. An integrative clinical database and diagnostics platform for biomarker identification and analysis in ion mobility spectra of human exhaled air

    DEFF Research Database (Denmark)

    Schneider, Till; Hauschild, Anne-Christin; Baumbach, Jörg Ingo

    2013-01-01

    data integration and semi-automated data analysis, in particular with regard to the rapid data accumulation, emerging from the high-throughput nature of the MCC/IMS technology. Here, we present a comprehensive database application and analysis platform, which combines metabolic maps with heterogeneous...... biomedical data in a well-structured manner. The design of the database is based on a hybrid of the entity-attribute-value (EAV) model and the EAV-CR, which incorporates the concepts of classes and relationships. Additionally it offers an intuitive user interface that provides easy and quick access...... to have a clear understanding of the detailed composition of human breath. Therefore, in addition to the clinical studies, there is a need for a flexible and comprehensive centralized data repository, which is capable of gathering all kinds of related information. Moreover, there is a demand for automated...

  20. Linking the Taiwan Fish Database to the Global Database

    Directory of Open Access Journals (Sweden)

    Kwang-Tsao Shao

    2007-03-01

    Full Text Available Under the support of the National Digital Archive Program (NDAP, basic species information about most Taiwanese fishes, including their morphology, ecology, distribution, specimens with photos, and literatures have been compiled into the "Fish Database of Taiwan" (http://fishdb.sinica.edu.tw. We expect that the all Taiwanese fish species databank (RSD, with 2800+ species, and the digital "Fish Fauna of Taiwan" will be completed in 2007. Underwater ecological photos and video images for all 2,800+ fishes are quite difficult to achieve but will be collected continuously in the future. In the last year of NDAP, we have successfully integrated all fish specimen data deposited at 7 different institutes in Taiwan as well as their collection maps on the Google Map and Google Earth. Further, the database also provides the pronunciation of Latin scientific names and transliteration of Chinese common names by referring to the Romanization system for all Taiwanese fishes (2,902 species in 292 families so far. The Taiwanese fish species checklist with Chinese common/vernacular names and specimen data has been updated periodically and provided to the global FishBase as well as the Global Biodiversity Information Facility (GBIF through the national portal of the Taiwan Biodiversity Information Facility (TaiBIF. Thus, Taiwanese fish data can be queried and browsed on the WWW. For contributing to the "Barcode of Life" and "All Fishes" international projects, alcohol-preserved specimens of more than 1,800 species and cryobanking tissues of 800 species have been accumulated at RCBAS in the past two years. Through this close collaboration between local and global databases, "The Fish Database of Taiwan" now attracts more than 250,000 visitors and achieves 5 million hits per month. We believe that this local database is becoming an important resource for education, research, conservation, and sustainable use of fish in Taiwan.

  1. ATLAS database application enhancements using Oracle 11g

    CERN Document Server

    Dimitrov, G; The ATLAS collaboration; Blaszczyk, M; Sorokoletov, R

    2012-01-01

    The ATLAS experiment at LHC relies on databases for detector online data-taking, storage and retrieval of configurations, calibrations and alignments, post data-taking analysis, file management over the grid, job submission and management, condition data replication to remote sites. Oracle Relational Database Management System (RDBMS) has been addressing the ATLAS database requirements to a great extent for many years. Ten database clusters are currently deployed for the needs of the different applications, divided in production, integration and standby databases. The data volume, complexity and demands from the users are increasing steadily with time. Nowadays more than 20 TB of data are stored in the ATLAS production Oracle databases at CERN (not including the index overhead), but the most impressive number is the hosted 260 database schemas (for the most common case each schema is related to a dedicated client application with its own requirements). At the beginning of 2012 all ATLAS databases at CERN have...

  2. Experience using a distributed object oriented database for a DAQ system

    International Nuclear Information System (INIS)

    Bee, C.P.; Eshghi, S.; Jones, R.

    1996-01-01

    To configure the RD13 data acquisition system, we need many parameters which describe the various hardware and software components. Such information has been defined using an entity-relation model and stored in a commercial memory-resident database. during the last year, Itasca, an object oriented database management system (OODB), was chosen as a replacement database system. We have ported the existing databases (hs and sw configurations, run parameters etc.) to Itasca and integrated it with the run control system. We believe that it is possible to use an OODB in real-time environments such as DAQ systems. In this paper, we present our experience and impression: why we wanted to change from an entity-relational approach, some useful features of Itasca, the issues we meet during this project including integration of the database into an existing distributed environment and factors which influence performance. (author)

  3. Scale out databases for CERN use cases

    International Nuclear Information System (INIS)

    Baranowski, Zbigniew; Grzybek, Maciej; Canali, Luca; Garcia, Daniel Lanza; Surdy, Kacper

    2015-01-01

    Data generation rates are expected to grow very fast for some database workloads going into LHC run 2 and beyond. In particular this is expected for data coming from controls, logging and monitoring systems. Storing, administering and accessing big data sets in a relational database system can quickly become a very hard technical challenge, as the size of the active data set and the number of concurrent users increase. Scale-out database technologies are a rapidly developing set of solutions for deploying and managing very large data warehouses on commodity hardware and with open source software. In this paper we will describe the architecture and tests on database systems based on Hadoop and the Cloudera Impala engine. We will discuss the results of our tests, including tests of data loading and integration with existing data sources and in particular with relational databases. We will report on query performance tests done with various data sets of interest at CERN, notably data from the accelerator log database. (paper)

  4. ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding.

    Science.gov (United States)

    Guhlin, Joseph; Silverstein, Kevin A T; Zhou, Peng; Tiffin, Peter; Young, Nevin D

    2017-08-10

    Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab datasets. With so many research groups generating their own data, the ability to relate it to the larger genomic and comparative genomic context is becoming increasingly crucial to make full use of the data. The Omics Database Generator (ODG) allows users to create customized databases that utilize published genomics data integrated with experimental data which can be queried using a flexible graph database. When provided with omics and experimental data, ODG will create a comparative, multi-dimensional graph database. ODG can import definitions and annotations from other sources such as InterProScan, the Gene Ontology, ENZYME, UniPathway, and others. This annotation data can be especially useful for studying new or understudied species for which transcripts have only been predicted, and rapidly give additional layers of annotation to predicted genes. In better studied species, ODG can perform syntenic annotation translations or rapidly identify characteristics of a set of genes or nucleotide locations, such as hits from an association study. ODG provides a web-based user-interface for configuring the data import and for querying the database. Queries can also be run from the command-line and the database can be queried directly through programming language hooks available for most languages. ODG supports most common genomic formats as well as generic, easy to use tab-separated value format for user-provided annotations. ODG is a user-friendly database generation and query tool that adapts to the supplied data to produce a comparative genomic database or multi-layered annotation database. ODG provides rapid comparative genomic annotation and is therefore particularly useful for non-model or

  5. Upgrade of laser and electron beam welding database

    CERN Document Server

    Furman, Magdalena

    2014-01-01

    The main purpose of this project was to fix existing issues and update the existing database holding parameters of laser-beam and electron-beam welding machines. Moreover, the database had to be extended to hold the data for the new machines that arrived recently at the workshop. As a solution - the database had to be migrated to Oracle framework, the new user interface (using APEX) had to be designed and implemented with the integration with the CERN web services (EDMS, Phonebook, JMT, CDD and EDH).

  6. Data Cleaning and Semantic Improvement in Biological Databases

    Directory of Open Access Journals (Sweden)

    Apiletti Daniele

    2006-12-01

    Full Text Available Public genomic and proteomic databases can be affected by a variety of errors. These errors may involve either the description or the meaning of data (namely, syntactic or semantic errors. We focus our analysis on the detection of semantic errors, in order to verify the accuracy of the stored information. In particular, we address the issue of data constraints and functional dependencies among attributes in a given relational database. Constraints and dependencies show semantics among attributes in a database schema and their knowledge may be exploited to improve data quality and integration in database design, and to perform query optimization and dimensional reduction.

  7. Documentation of databases in the Wilmar Planning tool

    International Nuclear Information System (INIS)

    Kiviluioma, J.; Meimbom, P.

    2006-01-01

    The Wilmar Planning tool consists of a number of databases and models as shown in Figure 1. This report documents the design of the following subparts of the Wilmar Planning tool: 1. The Scenario database holding the scenario trees generated from the Scenario Tree Creation model. 2. The Input database holding input data to the Joint Market model and the Long-term model apart from the scenario trees. 3. The output database containing the results of a Joint Market model run. The Wilmar Planning Tool is developed in the project Wind Power Integration in Liberalised Electricity Markets (WILMAR) supported by EU (contract ENK5-CT-2002-00663). (LN)

  8. MIPS PlantsDB: a database framework for comparative plant genome research.

    Science.gov (United States)

    Nussbaumer, Thomas; Martis, Mihaela M; Roessner, Stephan K; Pfeifer, Matthias; Bader, Kai C; Sharma, Sapna; Gundlach, Heidrun; Spannagl, Manuel

    2013-01-01

    The rapidly increasing amount of plant genome (sequence) data enables powerful comparative analyses and integrative approaches and also requires structured and comprehensive information resources. Databases are needed for both model and crop plant organisms and both intuitive search/browse views and comparative genomics tools should communicate the data to researchers and help them interpret it. MIPS PlantsDB (http://mips.helmholtz-muenchen.de/plant/genomes.jsp) was initially described in NAR in 2007 [Spannagl,M., Noubibou,O., Haase,D., Yang,L., Gundlach,H., Hindemitt, T., Klee,K., Haberer,G., Schoof,H. and Mayer,K.F. (2007) MIPSPlantsDB-plant database resource for integrative and comparative plant genome research. Nucleic Acids Res., 35, D834-D840] and was set up from the start to provide data and information resources for individual plant species as well as a framework for integrative and comparative plant genome research. PlantsDB comprises database instances for tomato, Medicago, Arabidopsis, Brachypodium, Sorghum, maize, rice, barley and wheat. Building up on that, state-of-the-art comparative genomics tools such as CrowsNest are integrated to visualize and investigate syntenic relationships between monocot genomes. Results from novel genome analysis strategies targeting the complex and repetitive genomes of triticeae species (wheat and barley) are provided and cross-linked with model species. The MIPS Repeat Element Database (mips-REdat) and Catalog (mips-REcat) as well as tight connections to other databases, e.g. via web services, are further important components of PlantsDB.

  9. Cross: an OWL wrapper for teasoning on relational databases

    NARCIS (Netherlands)

    Champin, P.A.; Houben, G.J.P.M.; Thiran, Ph.; Parent, C.; Schewe, K.D.; Storey, V.C.; Thalheim, B.

    2007-01-01

    One of the challenges of the Semantic Web is to integrate the huge amount of information already available on the standard Web, usually stored in relational databases. In this paper, we propose a formalization of a logic model of relational databases, and a transformation of that model into OWL, a

  10. SSC lattice database and graphical interface

    International Nuclear Information System (INIS)

    Trahern, C.G.; Zhou, J.

    1991-11-01

    When completed the Superconducting Super Collider will be the world's largest accelerator complex. In order to build this system on schedule, the use of database technologies will be essential. In this paper we discuss one of the database efforts underway at the SSC, the lattice database. The SSC lattice database provides a centralized source for the design of each major component of the accelerator complex. This includes the two collider rings, the High Energy Booster, Medium Energy Booster, Low Energy Booster, and the LINAC as well as transfer and test beam lines. These designs have been created using a menagerie of programs such as SYNCH, DIMAD, MAD, TRANSPORT, MAGIC, TRACE3D AND TEAPOT. However, once a design has been completed, it is entered into a uniform database schema in the database system. In this paper we discuss the reasons for creating the lattice database and its implementation via the commercial database system SYBASE. Each lattice in the lattice database is composed of a set of tables whose data structure can describe any of the SSC accelerator lattices. In order to allow the user community access to the databases, a programmatic interface known as dbsf (for database to several formats) has been written. Dbsf creates ascii input files appropriate to the above mentioned accelerator design programs. In addition it has a binary dataset output using the Self Describing Standard data discipline provided with the Integrated Scientific Tool Kit software tools. Finally we discuss the graphical interfaces to the lattice database. The primary interface, known as OZ, is a simulation environment as well as a database browser

  11. Integrated application of the database for airborne geophysical survey achievement information

    International Nuclear Information System (INIS)

    Ji Zengxian; Zhang Junwei

    2006-01-01

    The paper briefly introduces the database of information for airborne geophysical survey achievements. This database was developed on the platform of Microsoft Windows System with the technical methods of Visual C++ 6.0 and MapGIS. It is an information management system concerning airborne geophysical surveying achievements with perfect functions in graphic display, graphic cutting and output, query of data, printing of documents and reports, maintenance of database, etc. All information of airborne geophysical survey achievements in nuclear industry from 1972 to 2003 was embedded in. Based on regional geological map and Meso-Cenozoic basin map, the detailed statistical information of each airborne survey area, each airborne radioactive anomalous point and high field point can be presented visually by combining geological or basin research result. The successful development of this system will provide a fairly good base and platform for management of archives and data of airborne geophysical survey achievements in nuclear industry. (authors)

  12. Integrated remote sensing and visualization (IRSV) system for transportation infrastructure operations and management, phase two, volume 4 : web-based bridge information database--visualization analytics and distributed sensing.

    Science.gov (United States)

    2012-03-01

    This report introduces the design and implementation of a Web-based bridge information visual analytics system. This : project integrates Internet, multiple databases, remote sensing, and other visualization technologies. The result : combines a GIS ...

  13. HCVpro: Hepatitis C virus protein interaction database

    KAUST Repository

    Kwofie, Samuel K.

    2011-12-01

    It is essential to catalog characterized hepatitis C virus (HCV) protein-protein interaction (PPI) data and the associated plethora of vital functional information to augment the search for therapies, vaccines and diagnostic biomarkers. In furtherance of these goals, we have developed the hepatitis C virus protein interaction database (HCVpro) by integrating manually verified hepatitis C virus-virus and virus-human protein interactions curated from literature and databases. HCVpro is a comprehensive and integrated HCV-specific knowledgebase housing consolidated information on PPIs, functional genomics and molecular data obtained from a variety of virus databases (VirHostNet, VirusMint, HCVdb and euHCVdb), and from BIND and other relevant biology repositories. HCVpro is further populated with information on hepatocellular carcinoma (HCC) related genes that are mapped onto their encoded cellular proteins. Incorporated proteins have been mapped onto Gene Ontologies, canonical pathways, Online Mendelian Inheritance in Man (OMIM) and extensively cross-referenced to other essential annotations. The database is enriched with exhaustive reviews on structure and functions of HCV proteins, current state of drug and vaccine development and links to recommended journal articles. Users can query the database using specific protein identifiers (IDs), chromosomal locations of a gene, interaction detection methods, indexed PubMed sources as well as HCVpro, BIND and VirusMint IDs. The use of HCVpro is free and the resource can be accessed via http://apps.sanbi.ac.za/hcvpro/ or http://cbrc.kaust.edu.sa/hcvpro/. © 2011 Elsevier B.V.

  14. Application of material databases for improved reliability of reactor pressure vessels

    International Nuclear Information System (INIS)

    Griesbach, T.J.; Server, W.L.; Beaudoin, B.F.; Burgos, B.N.

    1994-01-01

    A vital part of reactor vessel Life Cycle Management program must begin with an accurate characterization of the vessel material properties. Uncertainties in vessel material properties or use of bounding values may result in unnecessary conservatisms in vessel integrity calculations. These conservatisms may be eliminated through a better understanding of the material properties in reactor vessels, both in the unirradiated and irradiated conditions. Reactor vessel material databases are available for quantifying the chemistry and Charpy shift behavior of individual heats of reactor vessel materials. Application of the databases for vessels with embrittlement concerns has proven to be an effective embrittlement management tool. This paper presents details of database development and applications which demonstrate the value of using material databases for improving material chemistry and for maximizing the data from integrated material surveillance programs

  15. Physics analysis database for the DIII-D tokamak

    International Nuclear Information System (INIS)

    Schissel, D.P.; Bramson, G.; DeBoo, J.C.

    1986-01-01

    The authors report on a centralized database for handling reduced data for physics analysis implemented for the DIII-D tokamak. Each database record corresponds to a specific snapshot in time for a selected discharge. Features of the database environment include automatic updating, data integrity checks, and data traceability. Reduced data from each diagnostic comprises a dedicated data bank (a subset of the database) with quality assurance provided by a physicist. These data banks will be used to create profile banks which will be input to a transport code to create a transport bank. Access to the database is initially through FORTRAN programs. One user interface, PLOTN, is a command driven program to select and display data subsets. Another user interface, PROF, compares and displays profiles. The database is implemented on a Digital Equipment Corporation VAX 8600 running VMS

  16. Exploration of a Vision for Actor Database Systems

    DEFF Research Database (Denmark)

    Shah, Vivek

    of these services. Existing popular approaches to building these services either use an in-memory database system or an actor runtime. We observe that these approaches have complementary strengths and weaknesses. In this dissertation, we propose the integration of actor programming models in database systems....... In doing so, we lay down a vision for a new class of systems called actor database systems. To explore this vision, this dissertation crystallizes the notion of an actor database system by defining its feature set in light of current application and hardware trends. In order to explore the viability...... of the outlined vision, a new programming model named Reactors has been designed to enrich classic relational database programming models with logical actor programming constructs. To support the reactor programming model, a high-performance in-memory multi-core OLTP database system named REACTDB has been built...

  17. High-Performance Secure Database Access Technologies for HEP Grids

    Energy Technology Data Exchange (ETDEWEB)

    Matthew Vranicar; John Weicher

    2006-04-17

    The Large Hadron Collider (LHC) at the CERN Laboratory will become the largest scientific instrument in the world when it starts operations in 2007. Large Scale Analysis Computer Systems (computational grids) are required to extract rare signals of new physics from petabytes of LHC detector data. In addition to file-based event data, LHC data processing applications require access to large amounts of data in relational databases: detector conditions, calibrations, etc. U.S. high energy physicists demand efficient performance of grid computing applications in LHC physics research where world-wide remote participation is vital to their success. To empower physicists with data-intensive analysis capabilities a whole hyperinfrastructure of distributed databases cross-cuts a multi-tier hierarchy of computational grids. The crosscutting allows separation of concerns across both the global environment of a federation of computational grids and the local environment of a physicist’s computer used for analysis. Very few efforts are on-going in the area of database and grid integration research. Most of these are outside of the U.S. and rely on traditional approaches to secure database access via an extraneous security layer separate from the database system core, preventing efficient data transfers. Our findings are shared by the Database Access and Integration Services Working Group of the Global Grid Forum, who states that "Research and development activities relating to the Grid have generally focused on applications where data is stored in files. However, in many scientific and commercial domains, database management systems have a central role in data storage, access, organization, authorization, etc, for numerous applications.” There is a clear opportunity for a technological breakthrough, requiring innovative steps to provide high-performance secure database access technologies for grid computing. We believe that an innovative database architecture where the

  18. High-Performance Secure Database Access Technologies for HEP Grids

    International Nuclear Information System (INIS)

    Vranicar, Matthew; Weicher, John

    2006-01-01

    The Large Hadron Collider (LHC) at the CERN Laboratory will become the largest scientific instrument in the world when it starts operations in 2007. Large Scale Analysis Computer Systems (computational grids) are required to extract rare signals of new physics from petabytes of LHC detector data. In addition to file-based event data, LHC data processing applications require access to large amounts of data in relational databases: detector conditions, calibrations, etc. U.S. high energy physicists demand efficient performance of grid computing applications in LHC physics research where world-wide remote participation is vital to their success. To empower physicists with data-intensive analysis capabilities a whole hyperinfrastructure of distributed databases cross-cuts a multi-tier hierarchy of computational grids. The crosscutting allows separation of concerns across both the global environment of a federation of computational grids and the local environment of a physicist's computer used for analysis. Very few efforts are on-going in the area of database and grid integration research. Most of these are outside of the U.S. and rely on traditional approaches to secure database access via an extraneous security layer separate from the database system core, preventing efficient data transfers. Our findings are shared by the Database Access and Integration Services Working Group of the Global Grid Forum, who states that 'Research and development activities relating to the Grid have generally focused on applications where data is stored in files. However, in many scientific and commercial domains, database management systems have a central role in data storage, access, organization, authorization, etc, for numerous applications'. There is a clear opportunity for a technological breakthrough, requiring innovative steps to provide high-performance secure database access technologies for grid computing. We believe that an innovative database architecture where the secure

  19. Development of Integrated PSA Database and Application Technology

    Energy Technology Data Exchange (ETDEWEB)

    Han, Sang Hoon; Park, Jin Hee; Kim, Seung Hwan; Choi, Sun Yeong; Jung, Woo Sik; Jeong, Kwang Sub; Ha Jae Joo; Yang, Joon Eon; Min Kyung Ran; Kim, Tae Woon

    2005-04-15

    The purpose of this project is to develop 1) the reliability database framework, 2) the methodology for the reactor trip and abnormal event analysis, and 3) the prototype PSA information DB system. We already have a part of the reactor trip and component reliability data. In this study, we extend the collection of data up to 2002. We construct the pilot reliability database for common cause failure and piping failure data. A reactor trip or a component failure may have an impact on the safety of a nuclear power plant. We perform the precursor analysis for such events that occurred in the KSNP, and to develop a procedure for the precursor analysis. A risk monitor provides a mean to trace the changes in the risk following the changes in the plant configurations. We develop a methodology incorporating the model of secondary system related to the reactor trip into the risk monitor model. We develop a prototype PSA information system for the UCN 3 and 4 PSA models where information for the PSA is inputted into the system such as PSA reports, analysis reports, thermal-hydraulic analysis results, system notebooks, and so on. We develop a unique coherent BDD method to quantify a fault tree and the fastest fault tree quantification engine FTREX. We develop quantification software for a full PSA model and a one top model.

  20. Efficient Integrity Checking for Databases with Recursive Views

    DEFF Research Database (Denmark)

    Martinenghi, Davide; Christiansen, Henning

    2005-01-01

    Efficient and incremental maintenance of integrity constraints involving recursive views is a difficult issue that has received some attention in the past years, but for which no widely accepted solution exists yet. In this paper a technique is proposed for compiling such integrity constraints in...... approaches have not achieved comparable optimization with the same level of generality....

  1. The composition of accessory enzymes of Penicillium chrysogenum P33 revealed by secretome and synergistic effects with commercial cellulase on lignocellulose hydrolysis.

    Science.gov (United States)

    Yang, Yi; Yang, Jinshui; Liu, Jiawen; Wang, Ruonan; Liu, Liang; Wang, Fengqin; Yuan, Hongli

    2018-06-01

    Herein, we report the secretome of Penicillium chrysogenum P33 under induction of lignocellulose for the first time. A total of 356 proteins were identified, including complete cellulases and numerous hemicellulases. Supplementing a commercial cellulase with increasing dosage of P33 enzyme cocktail from 1 to 5 mg/g substrate increased the release of reducing sugars from delignified corn stover by 21.4% to 106.8%. When 50% cellulase was replaced by P33 enzyme cocktail, release of reducing sugars was 78.6% higher than with cellulase alone. Meanwhile, glucan and xylan conversion was increased by 37% and 106%, respectively. P33 enzyme cocktail also enhanced commercial cellulase hydrolysis against four different delignified lignocellulosic biomass. These findings demonstrate that mixing appropriate amount of P33 cocktail with cellulase improves polysaccharide hydrolysis, suggesting P33 enzymes have great potential for industrial applications. Copyright © 2018 Elsevier Ltd. All rights reserved.

  2. Database Description - Trypanosomes Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Trypanosomes Database Database Description General information of database Database name Trypanosomes Database...stitute of Genetics Research Organization of Information and Systems Yata 1111, Mishima, Shizuoka 411-8540, JAPAN E mail: Database...y Name: Trypanosoma Taxonomy ID: 5690 Taxonomy Name: Homo sapiens Taxonomy ID: 9606 Database description The... Article title: Author name(s): Journal: External Links: Original website information Database maintenance s...DB (Protein Data Bank) KEGG PATHWAY Database DrugPort Entry list Available Query search Available Web servic

  3. Integrated Space Asset Management Database and Modeling

    Science.gov (United States)

    Gagliano, L.; MacLeod, T.; Mason, S.; Percy, T.; Prescott, J.

    The Space Asset Management Database (SAM-D) was implemented in order to effectively track known objects in space by ingesting information from a variety of databases and performing calculations to determine the expected position of the object at a specified time. While SAM-D performs this task very well, it is limited by technology and is not available outside of the local user base. Modeling and simulation can be powerful tools to exploit the information contained in SAM-D. However, the current system does not allow proper integration options for combining the data with both legacy and new M&S tools. A more capable data management infrastructure would extend SAM-D to support the larger data sets to be generated by the COI. A service-oriented architecture model will allow it to easily expand to incorporate new capabilities, including advanced analytics, M&S tools, fusion techniques and user interface for visualizations. Based on a web-centric approach, the entire COI will be able to access the data and related analytics. In addition, tight control of information sharing policy will increase confidence in the system, which would encourage industry partners to provide commercial data. SIMON is a Government off the Shelf information sharing platform in use throughout DoD and DHS information sharing and situation awareness communities. SIMON providing fine grained control to data owners allowing them to determine exactly how and when their data is shared. SIMON supports a micro-service approach to system development, meaning M&S and analytic services can be easily built or adapted. It is uniquely positioned to fill this need as an information-sharing platform with a proven track record of successful situational awareness system deployments. Combined with the integration of new and legacy M&S tools, a SIMON-based architecture will provide a robust SA environment for the NASA SA COI that can be extended and expanded indefinitely. First Results of Coherent Uplink from a

  4. The Problem with the Delta Cost Project Database

    Science.gov (United States)

    Jaquette, Ozan; Parra, Edna

    2016-01-01

    The Integrated Postsecondary Education System (IPEDS) collects data on Title IV institutions. The Delta Cost Project (DCP) integrated data from multiple IPEDS survey components into a public-use longitudinal dataset. The DCP Database was the basis for dozens of journal articles and a series of influential policy reports. Unfortunately, a flaw in…

  5. Distributed Database Access in the LHC Computing Grid with CORAL

    CERN Document Server

    Molnár, Z; Düllmann, D; Giacomo, G; Kalkhof, A; Valassi, A; CERN. Geneva. IT Department

    2009-01-01

    The CORAL package is the LCG Persistency Framework foundation for accessing relational databases. From the start CORAL has been designed to facilitate the deployment of the LHC experiment database applications in a distributed computing environment. In particular we cover - improvements to database service scalability by client connection management - platform-independent, multi-tier scalable database access by connection multiplexing, caching - a secure authentication and authorisation scheme integrated with existing grid services. We will summarize the deployment experience from several experiment productions using the distributed database infrastructure, which is now available in LCG. Finally, we present perspectives for future developments in this area.

  6. MetaboSearch: tool for mass-based metabolite identification using multiple databases.

    Directory of Open Access Journals (Sweden)

    Bin Zhou

    Full Text Available Searching metabolites against databases according to their masses is often the first step in metabolite identification for a mass spectrometry-based untargeted metabolomics study. Major metabolite databases include Human Metabolome DataBase (HMDB, Madison Metabolomics Consortium Database (MMCD, Metlin, and LIPID MAPS. Since each one of these databases covers only a fraction of the metabolome, integration of the search results from these databases is expected to yield a more comprehensive coverage. However, the manual combination of multiple search results is generally difficult when identification of hundreds of metabolites is desired. We have implemented a web-based software tool that enables simultaneous mass-based search against the four major databases, and the integration of the results. In addition, more complete chemical identifier information for the metabolites is retrieved by cross-referencing multiple databases. The search results are merged based on IUPAC International Chemical Identifier (InChI keys. Besides a simple list of m/z values, the software can accept the ion annotation information as input for enhanced metabolite identification. The performance of the software is demonstrated on mass spectrometry data acquired in both positive and negative ionization modes. Compared with search results from individual databases, MetaboSearch provides better coverage of the metabolome and more complete chemical identifier information.The software tool is available at http://omics.georgetown.edu/MetaboSearch.html.

  7. TRENDS: The aeronautical post-test database management system

    Science.gov (United States)

    Bjorkman, W. S.; Bondi, M. J.

    1990-01-01

    TRENDS, an engineering-test database operating system developed by NASA to support rotorcraft flight tests, is described. Capabilities and characteristics of the system are presented, with examples of its use in recalling and analyzing rotorcraft flight-test data from a TRENDS database. The importance of system user-friendliness in gaining users' acceptance is stressed, as is the importance of integrating supporting narrative data with numerical data in engineering-test databases. Considerations relevant to the creation and maintenance of flight-test database are discussed and TRENDS' solutions to database management problems are described. Requirements, constraints, and other considerations which led to the system's configuration are discussed and some of the lessons learned during TRENDS' development are presented. Potential applications of TRENDS to a wide range of aeronautical and other engineering tests are identified.

  8. Database Description - SKIP Stemcell Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us SKIP Stemcell Database Database Description General information of database Database name SKIP Stemcell Database...rsity Journal Search: Contact address http://www.skip.med.keio.ac.jp/en/contact/ Database classification Human Genes and Diseases Dat...abase classification Stemcell Article Organism Taxonomy Name: Homo sapiens Taxonomy ID: 9606 Database...ks: Original website information Database maintenance site Center for Medical Genetics, School of medicine, ...lable Web services Not available URL of Web services - Need for user registration Not available About This Database Database

  9. Ultra-Structure database design methodology for managing systems biology data and analyses

    Directory of Open Access Journals (Sweden)

    Hemminger Bradley M

    2009-08-01

    Full Text Available Abstract Background Modern, high-throughput biological experiments generate copious, heterogeneous, interconnected data sets. Research is dynamic, with frequently changing protocols, techniques, instruments, and file formats. Because of these factors, systems designed to manage and integrate modern biological data sets often end up as large, unwieldy databases that become difficult to maintain or evolve. The novel rule-based approach of the Ultra-Structure design methodology presents a potential solution to this problem. By representing both data and processes as formal rules within a database, an Ultra-Structure system constitutes a flexible framework that enables users to explicitly store domain knowledge in both a machine- and human-readable form. End users themselves can change the system's capabilities without programmer intervention, simply by altering database contents; no computer code or schemas need be modified. This provides flexibility in adapting to change, and allows integration of disparate, heterogenous data sets within a small core set of database tables, facilitating joint analysis and visualization without becoming unwieldy. Here, we examine the application of Ultra-Structure to our ongoing research program for the integration of large proteomic and genomic data sets (proteogenomic mapping. Results We transitioned our proteogenomic mapping information system from a traditional entity-relationship design to one based on Ultra-Structure. Our system integrates tandem mass spectrum data, genomic annotation sets, and spectrum/peptide mappings, all within a small, general framework implemented within a standard relational database system. General software procedures driven by user-modifiable rules can perform tasks such as logical deduction and location-based computations. The system is not tied specifically to proteogenomic research, but is rather designed to accommodate virtually any kind of biological research. Conclusion We find

  10. Design of a Multi Dimensional Database for the Archimed DataWarehouse.

    Science.gov (United States)

    Bréant, Claudine; Thurler, Gérald; Borst, François; Geissbuhler, Antoine

    2005-01-01

    The Archimed data warehouse project started in 1993 at the Geneva University Hospital. It has progressively integrated seven data marts (or domains of activity) archiving medical data such as Admission/Discharge/Transfer (ADT) data, laboratory results, radiology exams, diagnoses, and procedure codes. The objective of the Archimed data warehouse is to facilitate the access to an integrated and coherent view of patient medical in order to support analytical activities such as medical statistics, clinical studies, retrieval of similar cases and data mining processes. This paper discusses three principal design aspects relative to the conception of the database of the data warehouse: 1) the granularity of the database, which refers to the level of detail or summarization of data, 2) the database model and architecture, describing how data will be presented to end users and how new data is integrated, 3) the life cycle of the database, in order to ensure long term scalability of the environment. Both, the organization of patient medical data using a standardized elementary fact representation and the use of the multi dimensional model have proved to be powerful design tools to integrate data coming from the multiple heterogeneous database systems part of the transactional Hospital Information System (HIS). Concurrently, the building of the data warehouse in an incremental way has helped to control the evolution of the data content. These three design aspects bring clarity and performance regarding data access. They also provide long term scalability to the system and resilience to further changes that may occur in source systems feeding the data warehouse.

  11. Differential secretome analysis of Pseudomonas syringae pv tomato using gel-free MS proteomics

    Directory of Open Access Journals (Sweden)

    Jörg eSchumacher

    2014-07-01

    Full Text Available The plant pathogen Pseudomonas syringae pv. tomato (DC3000 causes virulence by delivering effector proteins into host plant cells through its type three secretion system (T3SS. In response to the plant environment DC3000 expresses hypersensitive response and pathogenicity genes (hrp. Pathogenesis depends on the ability of the pathogen to manipulate the plant metabolism and to inhibit plant immunity, which depends to a large degree on the plant’s capacity to recognise both pathogen and microbial determinants (PAMP/MAMP-triggered immunity. We have developed and employed MS-based shotgun and targeted proteomics to (i elucidate the extracellular and secretome composition of DC3000 and (ii evaluate temporal features of the assembly of the T3SS and the secretion process together with its dependence of pH. The proteomic screen, under hrp inducing in vitro conditions, of extracellular and cytoplasmatic fractions indicated the segregated presence of not only T3SS implicated proteins such as HopK1, HrpK1, HrpA1 and Avrpto1, but also of proteins not usually associated with the T3SS or with pathogenicity. Using multiple reaction monitoring MS (MRM-MS to quantify HrpA1 and Avrpto1, we found that HrpA1 is rapidly expressed, at a strict pH-dependent rate and is post-translationally processed extracellularly. These features appear to not interfere with rapid Avrpto1 expression and secretion but may suggest some temporal post-translational regulatory mechanism of the T3SS assembly. The high specificity and sensitivity of the MRM-MS approach should provide a powerful tool to measure secretion and translocation in infected tissues.

  12. DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis

    Directory of Open Access Journals (Sweden)

    Baseler Michael W

    2007-11-01

    Full Text Available Abstract Background Due to the complex and distributed nature of biological research, our current biological knowledge is spread over many redundant annotation databases maintained by many independent groups. Analysts usually need to visit many of these bioinformatics databases in order to integrate comprehensive annotation information for their genes, which becomes one of the bottlenecks, particularly for the analytic task associated with a large gene list. Thus, a highly centralized and ready-to-use gene-annotation knowledgebase is in demand for high throughput gene functional analysis. Description The DAVID Knowledgebase is built around the DAVID Gene Concept, a single-linkage method to agglomerate tens of millions of gene/protein identifiers from a variety of public genomic resources into DAVID gene clusters. The grouping of such identifiers improves the cross-reference capability, particularly across NCBI and UniProt systems, enabling more than 40 publicly available functional annotation sources to be comprehensively integrated and centralized by the DAVID gene clusters. The simple, pair-wise, text format files which make up the DAVID Knowledgebase are freely downloadable for various data analysis uses. In addition, a well organized web interface allows users to query different types of heterogeneous annotations in a high-throughput manner. Conclusion The DAVID Knowledgebase is designed to facilitate high throughput gene functional analysis. For a given gene list, it not only provides the quick accessibility to a wide range of heterogeneous annotation data in a centralized location, but also enriches the level of biological information for an individual gene. Moreover, the entire DAVID Knowledgebase is freely downloadable or searchable at http://david.abcc.ncifcrf.gov/knowledgebase/.

  13. Scale out databases for CERN use cases

    CERN Document Server

    Baranowski, Zbigniew; Canali, Luca; Garcia, Daniel Lanza; Surdy, Kacper

    2015-01-01

    Data generation rates are expected to grow very fast for some database workloads going into LHC run 2 and beyond. In particular this is expected for data coming from controls, logging and monitoring systems. Storing, administering and accessing big data sets in a relational database system can quickly become a very hard technical challenge, as the size of the active data set and the number of concurrent users increase. Scale-out database technologies are a rapidly developing set of solutions for deploying and managing very large data warehouses on commodity hardware and with open source software. In this paper we will describe the architecture and tests on database systems based on Hadoop and the Cloudera Impala engine. We will discuss the results of our tests, including tests of data loading and integration with existing data sources and in particular with relational databases. We will report on query performance tests done with various data sets of interest at CERN, notably data from the accelerator log dat...

  14. Principles of data integration

    CERN Document Server

    Doan, AnHai; Ives, Zachary

    2012-01-01

    How do you approach answering queries when your data is stored in multiple databases that were designed independently by different people? This is first comprehensive book on data integration and is written by three of the most respected experts in the field. This book provides an extensive introduction to the theory and concepts underlying today's data integration techniques, with detailed, instruction for their application using concrete examples throughout to explain the concepts. Data integration is the problem of answering queries that span multiple data sources (e.g., databases, web

  15. Cost benefit analysis of power plant database integration

    International Nuclear Information System (INIS)

    Wilber, B.E.; Cimento, A.; Stuart, R.

    1988-01-01

    A cost benefit analysis of plant wide data integration allows utility management to evaluate integration and automation benefits from an economic perspective. With this evaluation, the utility can determine both the quantitative and qualitative savings that can be expected from data integration. The cost benefit analysis is then a planning tool which helps the utility to develop a focused long term implementation strategy that will yield significant near term benefits. This paper presents a flexible cost benefit analysis methodology which is both simple to use and yields accurate, verifiable results. Included in this paper is a list of parameters to consider, a procedure for performing the cost savings analysis, and samples of this procedure when applied to a utility. A case study is presented involving a specific utility where this procedure was applied. Their uses of the cost-benefit analysis are also described

  16. An Integrative Clinical Database and Diagnostics Platform for Biomarker Identification and Analysis in Ion Mobility Spectra of Human Exhaled Air

    Directory of Open Access Journals (Sweden)

    Schneider Till

    2013-06-01

    Full Text Available Over the last decade the evaluation of odors and vapors in human breath has gained more and more attention, particularly in the diagnostics of pulmonary diseases. Ion mobility spectrometry coupled with multi-capillary columns (MCC/IMS, is a well known technology for detecting volatile organic compounds (VOCs in air. It is a comparatively inexpensive, non-invasive, high-throughput method, which is able to handle the moisture that comes with human exhaled air, and allows for characterizing of VOCs in very low concentrations. To identify discriminating compounds as biomarkers, it is necessary to have a clear understanding of the detailed composition of human breath. Therefore, in addition to the clinical studies, there is a need for a flexible and comprehensive centralized data repository, which is capable of gathering all kinds of related information. Moreover, there is a demand for automated data integration and semi-automated data analysis, in particular with regard to the rapid data accumulation, emerging from the high-throughput nature of the MCC/IMS technology. Here, we present a comprehensive database application and analysis platform, which combines metabolic maps with heterogeneous biomedical data in a well-structured manner. The design of the database is based on a hybrid of the entity-attribute- value (EAV model and the EAV-CR, which incorporates the concepts of classes and relationships. Additionally it offers an intuitive user interface that provides easy and quick access to the platform’s functionality: automated data integration and integrity validation, versioning and roll-back strategy, data retrieval as well as semi-automatic data mining and machine learning capabilities. The platform will support MCC/IMS-based biomarker identification and validation. The software, schemata, data sets and further information is publicly available at http://imsdb.mpi-inf.mpg.de.

  17. EchoBASE: an integrated post-genomic database for Escherichia coli.

    Science.gov (United States)

    Misra, Raju V; Horler, Richard S P; Reindl, Wolfgang; Goryanin, Igor I; Thomas, Gavin H

    2005-01-01

    EchoBASE (http://www.ecoli-york.org) is a relational database designed to contain and manipulate information from post-genomic experiments using the model bacterium Escherichia coli K-12. Its aim is to collate information from a wide range of sources to provide clues to the functions of the approximately 1500 gene products that have no confirmed cellular function. The database is built on an enhanced annotation of the updated genome sequence of strain MG1655 and the association of experimental data with the E.coli genes and their products. Experiments that can be held within EchoBASE include proteomics studies, microarray data, protein-protein interaction data, structural data and bioinformatics studies. EchoBASE also contains annotated information on 'orphan' enzyme activities from this microbe to aid characterization of the proteins that catalyse these elusive biochemical reactions.

  18. An Intelligent Assistant for Construction of Terrain Databases

    OpenAIRE

    Rowe, Neil C.; Reed, Chris; Jackson, Leroy; Baer, Wolfgang

    1998-01-01

    1998 Command and Control Research and Technology Symposium, Monterey CA, June 1998, 481-486. We describe TELLUSPLAN, an intelligent assistant for the problem of bargaining between user goals and system resources in the integration of terrain databases from separate source databases. TELLUSPLAN uses nondeterministic methods from artificial intelligence and a detailed cost model to infer the most reasonable compromise with the user's needs. Supported by the Army Artificial Int...

  19. Building spatio-temporal database model based on ontological approach using relational database environment

    International Nuclear Information System (INIS)

    Mahmood, N.; Burney, S.M.A.

    2017-01-01

    Everything in this world is encapsulated by space and time fence. Our daily life activities are utterly linked and related with other objects in vicinity. Therefore, a strong relationship exist with our current location, time (including past, present and future) and event through with we are moving as an object also affect our activities in life. Ontology development and its integration with database are vital for the true understanding of the complex systems involving both spatial and temporal dimensions. In this paper we propose a conceptual framework for building spatio-temporal database model based on ontological approach. We have used relational data model for modelling spatio-temporal data content and present our methodology with spatio-temporal ontological accepts and its transformation into spatio-temporal database model. We illustrate the implementation of our conceptual model through a case study related to cultivated land parcel used for agriculture to exhibit the spatio-temporal behaviour of agricultural land and related entities. Moreover, it provides a generic approach for designing spatiotemporal databases based on ontology. The proposed model is capable to understand the ontological and somehow epistemological commitments and to build spatio-temporal ontology and transform it into a spatio-temporal data model. Finally, we highlight the existing and future research challenges. (author)

  20. TrED: the Trichophyton rubrum Expression Database

    Directory of Open Access Journals (Sweden)

    Liu Tao

    2007-07-01

    Full Text Available Abstract Background Trichophyton rubrum is the most common dermatophyte species and the most frequent cause of fungal skin infections in humans worldwide. It's a major concern because feet and nail infections caused by this organism is extremely difficult to cure. A large set of expression data including expressed sequence tags (ESTs and transcriptional profiles of this important fungal pathogen are now available. Careful analysis of these data can give valuable information about potential virulence factors, antigens and novel metabolic pathways. We intend to create an integrated database TrED to facilitate the study of dermatophytes, and enhance the development of effective diagnostic and treatment strategies. Description All publicly available ESTs and expression profiles of T. rubrum during conidial germination in time-course experiments and challenged with antifungal agents are deposited in the database. In addition, comparative genomics hybridization results of 22 dermatophytic fungi strains from three genera, Trichophyton, Microsporum and Epidermophyton, are also included. ESTs are clustered and assembled to elongate the sequence length and abate redundancy. TrED provides functional analysis based on GenBank, Pfam, and KOG databases, along with KEGG pathway and GO vocabulary. It is integrated with a suite of custom web-based tools that facilitate querying and retrieving various EST properties, visualization and comparison of transcriptional profiles, and sequence-similarity searching by BLAST. Conclusion TrED is built upon a relational database, with a web interface offering analytic functions, to provide integrated access to various expression data of T. rubrum and comparative results of dermatophytes. It is devoted to be a comprehensive resource and platform to assist functional genomic studies in dermatophytes. TrED is available from URL: http://www.mgc.ac.cn/TrED/.

  1. METRICS FOR DYNAMIC SCALING OF DATABASE IN CLOUDS

    Directory of Open Access Journals (Sweden)

    Alexander V. Boichenko

    2013-01-01

    Full Text Available This article analyzes the main methods of scaling databases (replication, sharding and their support at the popular relational databases and NoSQL solutions with different data models: a document-oriented, key-value, column-oriented, graph. The article provides an assessment of the capabilities of modern cloud-based solution and gives a model for the organization of dynamic scaling in the cloud infrastructure. In the article are analyzed different types of metrics and are included the basic metrics that characterize the functioning parameters and database technology, as well as sets the goals of the integral metrics, necessary for the implementation of adaptive algorithms for dynamic scaling databases in the cloud infrastructure. This article was prepared with the support of RFBR grant № 13-07-00749.

  2. Human Ageing Genomic Resources: new and updated databases

    Science.gov (United States)

    Tacutu, Robi; Thornton, Daniel; Johnson, Emily; Budovsky, Arie; Barardo, Diogo; Craig, Thomas; Diana, Eugene; Lehmann, Gilad; Toren, Dmitri; Wang, Jingwei; Fraifeld, Vadim E

    2018-01-01

    Abstract In spite of a growing body of research and data, human ageing remains a poorly understood process. Over 10 years ago we developed the Human Ageing Genomic Resources (HAGR), a collection of databases and tools for studying the biology and genetics of ageing. Here, we present HAGR’s main functionalities, highlighting new additions and improvements. HAGR consists of six core databases: (i) the GenAge database of ageing-related genes, in turn composed of a dataset of >300 human ageing-related genes and a dataset with >2000 genes associated with ageing or longevity in model organisms; (ii) the AnAge database of animal ageing and longevity, featuring >4000 species; (iii) the GenDR database with >200 genes associated with the life-extending effects of dietary restriction; (iv) the LongevityMap database of human genetic association studies of longevity with >500 entries; (v) the DrugAge database with >400 ageing or longevity-associated drugs or compounds; (vi) the CellAge database with >200 genes associated with cell senescence. All our databases are manually curated by experts and regularly updated to ensure a high quality data. Cross-links across our databases and to external resources help researchers locate and integrate relevant information. HAGR is freely available online (http://genomics.senescence.info/). PMID:29121237

  3. Simple re-instantiation of small databases using cloud computing.

    Science.gov (United States)

    Tan, Tin Wee; Xie, Chao; De Silva, Mark; Lim, Kuan Siong; Patro, C Pawan K; Lim, Shen Jean; Govindarajan, Kunde Ramamoorthy; Tong, Joo Chuan; Choo, Khar Heng; Ranganathan, Shoba; Khan, Asif M

    2013-01-01

    Small bioinformatics databases, unlike institutionally funded large databases, are vulnerable to discontinuation and many reported in publications are no longer accessible. This leads to irreproducible scientific work and redundant effort, impeding the pace of scientific progress. We describe a Web-accessible system, available online at http://biodb100.apbionet.org, for archival and future on demand re-instantiation of small databases within minutes. Depositors can rebuild their databases by downloading a Linux live operating system (http://www.bioslax.com), preinstalled with bioinformatics and UNIX tools. The database and its dependencies can be compressed into an ".lzm" file for deposition. End-users can search for archived databases and activate them on dynamically re-instantiated BioSlax instances, run as virtual machines over the two popular full virtualization standard cloud-computing platforms, Xen Hypervisor or vSphere. The system is adaptable to increasing demand for disk storage or computational load and allows database developers to use the re-instantiated databases for integration and development of new databases. Herein, we demonstrate that a relatively inexpensive solution can be implemented for archival of bioinformatics databases and their rapid re-instantiation should the live databases disappear.

  4. An Autonomic Framework for Integrating Security and Quality of Service Support in Databases

    Science.gov (United States)

    Alomari, Firas

    2013-01-01

    The back-end databases of multi-tiered applications are a major data security concern for enterprises. The abundance of these systems and the emergence of new and different threats require multiple and overlapping security mechanisms. Therefore, providing multiple and diverse database intrusion detection and prevention systems (IDPS) is a critical…

  5. YPED: an integrated bioinformatics suite and database for mass spectrometry-based proteomics research.

    Science.gov (United States)

    Colangelo, Christopher M; Shifman, Mark; Cheung, Kei-Hoi; Stone, Kathryn L; Carriero, Nicholas J; Gulcicek, Erol E; Lam, TuKiet T; Wu, Terence; Bjornson, Robert D; Bruce, Can; Nairn, Angus C; Rinehart, Jesse; Miller, Perry L; Williams, Kenneth R

    2015-02-01

    We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database (YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a single laboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography-tandem mass spectrometry (LC-MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring (MRM)/selective reaction monitoring (SRM) assay development. We have linked YPED's database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.

  6. Update of the database of photovoltaic installations in the UK

    Energy Technology Data Exchange (ETDEWEB)

    Taylor, D.; Bruhns, H.

    1999-07-01

    The article describes an updated database of photovoltaic (PV) installations in the UK. The database contains more than 300 records representing over 40,000 photovoltaic installations with more than 100 buildings that use photovoltaic arrays. Figures show: (i) a chart of cumulative PV applications to date; (ii) a chart of cumulative installations in the database; (iii) the growth of Building Integrated PV installed to date; (iv) the cumulative growth of peak power of PV for buildings installed every year since 1985; (v) the distribution by application of all PV installations in the database and (vi) the various applications of PV installations.

  7. MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource for plant genomics

    Science.gov (United States)

    Schoof, Heiko; Ernst, Rebecca; Nazarov, Vladimir; Pfeifer, Lukas; Mewes, Hans-Werner; Mayer, Klaus F. X.

    2004-01-01

    Arabidopsis thaliana is the most widely studied model plant. Functional genomics is intensively underway in many laboratories worldwide. Beyond the basic annotation of the primary sequence data, the annotated genetic elements of Arabidopsis must be linked to diverse biological data and higher order information such as metabolic or regulatory pathways. The MIPS Arabidopsis thaliana database MAtDB aims to provide a comprehensive resource for Arabidopsis as a genome model that serves as a primary reference for research in plants and is suitable for transfer of knowledge to other plants, especially crops. The genome sequence as a common backbone serves as a scaffold for the integration of data, while, in a complementary effort, these data are enhanced through the application of state-of-the-art bioinformatics tools. This information is visualized on a genome-wide and a gene-by-gene basis with access both for web users and applications. This report updates the information given in a previous report and provides an outlook on further developments. The MAtDB web interface can be accessed at http://mips.gsf.de/proj/thal/db. PMID:14681437

  8. How the choice of Operating System can affect databases on a Virtual Machine

    OpenAIRE

    Karlsson, Jan; Eriksson, Patrik

    2014-01-01

    As databases grow in size, the need for optimizing databases is becoming a necessity. Choosing the right operating system to support your database becomes paramount to ensure that the database is fully utilized. Furthermore with the virtualization of operating systems becoming more commonplace, we find ourselves with more choices than we ever faced before. This paper demonstrates why the choice of operating system plays an integral part in deciding the right database for your system in a virt...

  9. Integration of functions in logic database systems

    NARCIS (Netherlands)

    Lambrichts, E.; Nees, P.; Paredaens, J.; Peelman, P.; Tanca, L.

    1990-01-01

    We extend Datalog, a logic programming language for rule-based systems, by respectively integrating types, negation and functions. This extention of Datalog is called MilAnt. Furthermore, MilAnt consistency is defined as a stronger form of consistency for functions. It is known that consistency for

  10. Molecule database framework: a framework for creating database applications with chemical structure search capability.

    Science.gov (United States)

    Kiener, Joos

    2013-12-11

    Research in organic chemistry generates samples of novel chemicals together with their properties and other related data. The involved scientists must be able to store this data and search it by chemical structure. There are commercial solutions for common needs like chemical registration systems or electronic lab notebooks. However for specific requirements of in-house databases and processes no such solutions exist. Another issue is that commercial solutions have the risk of vendor lock-in and may require an expensive license of a proprietary relational database management system. To speed up and simplify the development for applications that require chemical structure search capabilities, I have developed Molecule Database Framework. The framework abstracts the storing and searching of chemical structures into method calls. Therefore software developers do not require extensive knowledge about chemistry and the underlying database cartridge. This decreases application development time. Molecule Database Framework is written in Java and I created it by integrating existing free and open-source tools and frameworks. The core functionality includes:•Support for multi-component compounds (mixtures)•Import and export of SD-files•Optional security (authorization)For chemical structure searching Molecule Database Framework leverages the capabilities of the Bingo Cartridge for PostgreSQL and provides type-safe searching, caching, transactions and optional method level security. Molecule Database Framework supports multi-component chemical compounds (mixtures).Furthermore the design of entity classes and the reasoning behind it are explained. By means of a simple web application I describe how the framework could be used. I then benchmarked this example application to create some basic performance expectations for chemical structure searches and import and export of SD-files. By using a simple web application it was shown that Molecule Database Framework

  11. Spatio-temporal databases complex motion pattern queries

    CERN Document Server

    Vieira, Marcos R

    2013-01-01

    This brief presents several new query processing techniques, called complex motion pattern queries, specifically designed for very large spatio-temporal databases of moving objects. The brief begins with the definition of flexible pattern queries, which are powerful because of the integration of variables and motion patterns. This is followed by a summary of the expressive power of patterns and flexibility of pattern queries. The brief then present the Spatio-Temporal Pattern System (STPS) and density-based pattern queries. STPS databases contain millions of records with information about mobi

  12. PROS-1/Prospero Is a Major Regulator of the Glia-Specific Secretome Controlling Sensory-Neuron Shape and Function in C. elegans.

    Science.gov (United States)

    Wallace, Sean W; Singhvi, Aakanksha; Liang, Yupu; Lu, Yun; Shaham, Shai

    2016-04-19

    Sensory neurons are an animal's gateway to the world, and their receptive endings, the sites of sensory signal transduction, are often associated with glia. Although glia are known to promote sensory-neuron functions, the molecular bases of these interactions are poorly explored. Here, we describe a post-developmental glial role for the PROS-1/Prospero/PROX1 homeodomain protein in sensory-neuron function in C. elegans. Using glia expression profiling, we demonstrate that, unlike previously characterized cell fate roles, PROS-1 functions post-embryonically to control sense-organ glia-specific secretome expression. PROS-1 functions cell autonomously to regulate glial secretion and membrane structure, and non-cell autonomously to control the shape and function of the receptive endings of sensory neurons. Known glial genes controlling sensory-neuron function are PROS-1 targets, and we identify additional PROS-1-dependent genes required for neuron attributes. Drosophila Prospero and vertebrate PROX1 are expressed in post-mitotic sense-organ glia and astrocytes, suggesting conserved roles for this class of transcription factors. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Establishment of database system for management of KAERI wastes

    International Nuclear Information System (INIS)

    Shon, J. S.; Kim, K. J.; Ahn, S. J.

    2004-07-01

    Radioactive wastes generated by KAERI has various types, nuclides and characteristics. To manage and control these kinds of radioactive wastes, it comes to need systematic management of their records, efficient research and quick statistics. Getting information about radioactive waste generated and stored by KAERI is the basic factor to construct the rapid information system for national cooperation management of radioactive waste. In this study, Radioactive Waste Management Integration System (RAWMIS) was developed. It is is aimed at management of record of radioactive wastes, uplifting the efficiency of management and support WACID(Waste Comprehensive Integration Database System) which is a national radioactive waste integrated safety management system of Korea. The major information of RAWMIS supported by user's requirements is generation, gathering, transfer, treatment, and storage information for solid waste, liquid waste, gas waste and waste related to spent fuel. RAWMIS is composed of database, software (interface between user and database), and software for a manager and it was designed with Client/Server structure. RAWMIS will be a useful tool to analyze radioactive waste management and radiation safety management. Also, this system is developed to share information with associated companies. Moreover, it can be expected to support the technology of research and development for radioactive waste treatment

  14. E-MSD: an integrated data resource for bioinformatics.

    Science.gov (United States)

    Velankar, S; McNeil, P; Mittard-Runte, V; Suarez, A; Barrell, D; Apweiler, R; Henrick, K

    2005-01-01

    The Macromolecular Structure Database (MSD) group (http://www.ebi.ac.uk/msd/) continues to enhance the quality and consistency of macromolecular structure data in the worldwide Protein Data Bank (wwPDB) and to work towards the integration of various bioinformatics data resources. One of the major obstacles to the improved integration of structural databases such as MSD and sequence databases like UniProt is the absence of up to date and well-maintained mapping between corresponding entries. We have worked closely with the UniProt group at the EBI to clean up the taxonomy and sequence cross-reference information in the MSD and UniProt databases. This information is vital for the reliable integration of the sequence family databases such as Pfam and Interpro with the structure-oriented databases of SCOP and CATH. This information has been made available to the eFamily group (http://www.efamily.org.uk/) and now forms the basis of the regular interchange of information between the member databases (MSD, UniProt, Pfam, Interpro, SCOP and CATH). This exchange of annotation information has enriched the structural information in the MSD database with annotation from wider sequence-oriented resources. This work was carried out under the 'Structure Integration with Function, Taxonomy and Sequences (SIFTS)' initiative (http://www.ebi.ac.uk/msd-srv/docs/sifts) in the MSD group.

  15. Web Exploration Tools for a Fast Federated Optical Survey Database

    Science.gov (United States)

    Humphreys, Roberta M.

    2000-01-01

    We implemented several new web-based tools to improve the efficiency and versatility of access to the APS Catalog of the POSS I (Palomar Observatory-National Geographic Sky Survey) and its associated image database. The most important addition was a federated database system to link the APS Catalog and image database into one Internet-accessible database. With the FDBS, the queries and transactions on the integrated database are performed as if it were a single database. We installed Myriad the FDBS developed by Professor Jaideep Srivastava and members of his group in the University of Minnesota Computer Science Department. It is the first system to provide schema integration, query processing and optimization, and transaction management capabilities in a single framework. The attached figure illustrates the Myriad architecture. The FDBS permits horizontal access to the data, not just vertical. For example, for the APS, queries can be made not only by sky position, but also by any parameter present in either of the databases. APS users will be able to produce an image of all the blue galaxies and stellar sources for comparison with x-ray source error ellipses from AXAF (X Ray Astrophysics Facility) (Chandra) for example. The FDBS is now available as a beta release with the appropriate query forms at our web site. While much of our time was occupied with adapting Myriad to the APS environment, we also made major changes in Star Base, our DBMS for the Catalog, at the web interface to improve its efficiency for issuing and processing queries. Star Base is now three times faster for large queries. Improvements were also made at the web end of the image database for faster access; although work still needs to be done to the image database itself for more efficient return with the FDBS. During the past few years, we made several improvements to the database pipeline that creates the individual plate databases queries by StarBase. The changes include improved positions

  16. PathwayAccess: CellDesigner plugins for pathway databases.

    Science.gov (United States)

    Van Hemert, John L; Dickerson, Julie A

    2010-09-15

    CellDesigner provides a user-friendly interface for graphical biochemical pathway description. Many pathway databases are not directly exportable to CellDesigner models. PathwayAccess is an extensible suite of CellDesigner plugins, which connect CellDesigner directly to pathway databases using respective Java application programming interfaces. The process is streamlined for creating new PathwayAccess plugins for specific pathway databases. Three PathwayAccess plugins, MetNetAccess, BioCycAccess and ReactomeAccess, directly connect CellDesigner to the pathway databases MetNetDB, BioCyc and Reactome. PathwayAccess plugins enable CellDesigner users to expose pathway data to analytical CellDesigner functions, curate their pathway databases and visually integrate pathway data from different databases using standard Systems Biology Markup Language and Systems Biology Graphical Notation. Implemented in Java, PathwayAccess plugins run with CellDesigner version 4.0.1 and were tested on Ubuntu Linux, Windows XP and 7, and MacOSX. Source code, binaries, documentation and video walkthroughs are freely available at http://vrac.iastate.edu/~jlv.

  17. SIMS: addressing the problem of heterogeneity in databases

    Science.gov (United States)

    Arens, Yigal

    1997-02-01

    The heterogeneity of remotely accessible databases -- with respect to contents, query language, semantics, organization, etc. -- presents serious obstacles to convenient querying. The SIMS (single interface to multiple sources) system addresses this global integration problem. It does so by defining a single language for describing the domain about which information is stored in the databases and using this language as the query language. Each database to which SIMS is to provide access is modeled using this language. The model describes a database's contents, organization, and other relevant features. SIMS uses these models, together with a planning system drawing on techniques from artificial intelligence, to decompose a given user's high-level query into a series of queries against the databases and other data manipulation steps. The retrieval plan is constructed so as to minimize data movement over the network and maximize parallelism to increase execution speed. SIMS can recover from network failures during plan execution by obtaining data from alternate sources, when possible. SIMS has been demonstrated in the domains of medical informatics and logistics, using real databases.

  18. ANISEED 2017: extending the integrated ascidian database to the exploration and evolutionary comparison of genome-scale datasets.

    Science.gov (United States)

    Brozovic, Matija; Dantec, Christelle; Dardaillon, Justine; Dauga, Delphine; Faure, Emmanuel; Gineste, Mathieu; Louis, Alexandra; Naville, Magali; Nitta, Kazuhiro R; Piette, Jacques; Reeves, Wendy; Scornavacca, Céline; Simion, Paul; Vincentelli, Renaud; Bellec, Maelle; Aicha, Sameh Ben; Fagotto, Marie; Guéroult-Bellone, Marion; Haeussler, Maximilian; Jacox, Edwin; Lowe, Elijah K; Mendez, Mickael; Roberge, Alexis; Stolfi, Alberto; Yokomori, Rui; Brown, C Titus; Cambillau, Christian; Christiaen, Lionel; Delsuc, Frédéric; Douzery, Emmanuel; Dumollard, Rémi; Kusakabe, Takehiro; Nakai, Kenta; Nishida, Hiroki; Satou, Yutaka; Swalla, Billie; Veeman, Michael; Volff, Jean-Nicolas; Lemaire, Patrick

    2018-01-04

    ANISEED (www.aniseed.cnrs.fr) is the main model organism database for tunicates, the sister-group of vertebrates. This release gives access to annotated genomes, gene expression patterns, and anatomical descriptions for nine ascidian species. It provides increased integration with external molecular and taxonomy databases, better support for epigenomics datasets, in particular RNA-seq, ChIP-seq and SELEX-seq, and features novel interactive interfaces for existing and novel datatypes. In particular, the cross-species navigation and comparison is enhanced through a novel taxonomy section describing each represented species and through the implementation of interactive phylogenetic gene trees for 60% of tunicate genes. The gene expression section displays the results of RNA-seq experiments for the three major model species of solitary ascidians. Gene expression is controlled by the binding of transcription factors to cis-regulatory sequences. A high-resolution description of the DNA-binding specificity for 131 Ciona robusta (formerly C. intestinalis type A) transcription factors by SELEX-seq is provided and used to map candidate binding sites across the Ciona robusta and Phallusia mammillata genomes. Finally, use of a WashU Epigenome browser enhances genome navigation, while a Genomicus server was set up to explore microsynteny relationships within tunicates and with vertebrates, Amphioxus, echinoderms and hemichordates. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Cadastral Database Positional Accuracy Improvement

    Science.gov (United States)

    Hashim, N. M.; Omar, A. H.; Ramli, S. N. M.; Omar, K. M.; Din, N.

    2017-10-01

    Positional Accuracy Improvement (PAI) is the refining process of the geometry feature in a geospatial dataset to improve its actual position. This actual position relates to the absolute position in specific coordinate system and the relation to the neighborhood features. With the growth of spatial based technology especially Geographical Information System (GIS) and Global Navigation Satellite System (GNSS), the PAI campaign is inevitable especially to the legacy cadastral database. Integration of legacy dataset and higher accuracy dataset like GNSS observation is a potential solution for improving the legacy dataset. However, by merely integrating both datasets will lead to a distortion of the relative geometry. The improved dataset should be further treated to minimize inherent errors and fitting to the new accurate dataset. The main focus of this study is to describe a method of angular based Least Square Adjustment (LSA) for PAI process of legacy dataset. The existing high accuracy dataset known as National Digital Cadastral Database (NDCDB) is then used as bench mark to validate the results. It was found that the propose technique is highly possible for positional accuracy improvement of legacy spatial datasets.

  20. An Integrated Database of Unit Training Performance: Description an Lessons Learned

    National Research Council Canada - National Science Library

    Leibrecht, Bruce

    1997-01-01

    The Army Research Institute (ARI) has developed a prototype relational database for processing and archiving unit performance data from home station, training area, simulation based, and Combat Training Center training exercises...

  1. kpath: integration of metabolic pathway linked data.

    Science.gov (United States)

    Navas-Delgado, Ismael; García-Godoy, María Jesús; López-Camacho, Esteban; Rybinski, Maciej; Reyes-Palomares, Armando; Medina, Miguel Ángel; Aldana-Montes, José F

    2015-01-01

    In the last few years, the Life Sciences domain has experienced a rapid growth in the amount of available biological databases. The heterogeneity of these databases makes data integration a challenging issue. Some integration challenges are locating resources, relationships, data formats, synonyms or ambiguity. The Linked Data approach partially solves the heterogeneity problems by introducing a uniform data representation model. Linked Data refers to a set of best practices for publishing and connecting structured data on the Web. This article introduces kpath, a database that integrates information related to metabolic pathways. kpath also provides a navigational interface that enables not only the browsing, but also the deep use of the integrated data to build metabolic networks based on existing disperse knowledge. This user interface has been used to showcase relationships that can be inferred from the information available in several public databases. © The Author(s) 2015. Published by Oxford University Press.

  2. Database Description - Yeast Interacting Proteins Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Yeast Interacting Proteins Database Database Description General information of database Database... name Yeast Interacting Proteins Database Alternative name - DOI 10.18908/lsdba.nbdc00742-000 Creator C...-ken 277-8561 Tel: +81-4-7136-3989 FAX: +81-4-7136-3979 E-mail : Database classif...s cerevisiae Taxonomy ID: 4932 Database description Information on interactions and related information obta...l Acad Sci U S A. 2001 Apr 10;98(8):4569-74. Epub 2001 Mar 13. External Links: Original website information Database

  3. Data integration and knowledge discovery in biomedical databases. Reliable information from unreliable sources

    Directory of Open Access Journals (Sweden)

    A Mitnitski

    2003-01-01

    Full Text Available To better understand information about human health from databases we analyzed three datasets collected for different purposes in Canada: a biomedical database of older adults, a large population survey across all adult ages, and vital statistics. Redundancy in the variables was established, and this led us to derive a generalized (macroscopic state variable, being a fitness/frailty index that reflects both individual and group health status. Evaluation of the relationship between fitness/frailty and the mortality rate revealed that the latter could be expressed in terms of variables generally available from any cross-sectional database. In practical terms, this means that the risk of mortality might readily be assessed from standard biomedical appraisals collected for other purposes.

  4. Update History of This Database - Trypanosomes Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Trypanosomes Database Update History of This Database Date Update contents 2014/05/07 The co...ntact information is corrected. The features and manner of utilization of the database are corrected. 2014/02/04 Trypanosomes Databas...e English archive site is opened. 2011/04/04 Trypanosomes Database ( http://www.tan...paku.org/tdb/ ) is opened. About This Database Database Description Download Lice...nse Update History of This Database Site Policy | Contact Us Update History of This Database - Trypanosomes Database | LSDB Archive ...

  5. Study on Mandatory Access Control in a Secure Database Management System

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    This paper proposes a security policy model for mandatory access control in class B1 database management system whose level of labeling is tuple. The relation-hierarchical data model is extended to multilevel relation-hierarchical data model. Based on the multilevel relation-hierarchical data model, the concept of upper-lower layer relational integrity is presented after we analyze and eliminate the covert channels caused by the database integrity. Two SQL statements are extended to process polyinstantiation in the multilevel secure environment. The system is based on the multilevel relation-hierarchical data model and is capable of integratively storing and manipulating multilevel complicated objects (e. g., multilevel spatial data) and multilevel conventional data ( e. g., integer. real number and character string).

  6. Native Pig and Chicken Breed Database: NPCDB

    Directory of Open Access Journals (Sweden)

    Hyeon-Soo Jeong

    2014-10-01

    Full Text Available Indigenous (native breeds of livestock have higher disease resistance and adaptation to the environment due to high genetic diversity. Even though their extinction rate is accelerated due to the increase of commercial breeds, natural disaster, and civil war, there is a lack of well-established databases for the native breeds. Thus, we constructed the native pig and chicken breed database (NPCDB which integrates available information on the breeds from around the world. It is a nonprofit public database aimed to provide information on the genetic resources of indigenous pig and chicken breeds for their conservation. The NPCDB (http://npcdb.snu.ac.kr/ provides the phenotypic information and population size of each breed as well as its specific habitat. In addition, it provides information on the distribution of genetic resources across the country. The database will contribute to understanding of the breed’s characteristics such as disease resistance and adaptation to environmental changes as well as the conservation of indigenous genetic resources.

  7. The ATLAS Distributed Data Management System & Databases

    CERN Document Server

    Garonne, V; The ATLAS collaboration; Barisits, M; Beermann, T; Vigne, R; Serfon, C

    2013-01-01

    The ATLAS Distributed Data Management (DDM) System is responsible for the global management of petabytes of high energy physics data. The current system, DQ2, has a critical dependency on Relational Database Management Systems (RDBMS), like Oracle. RDBMS are well-suited to enforcing data integrity in online transaction processing applications, however, concerns have been raised about the scalability of its data warehouse-like workload. In particular, analysis of archived data or aggregation of transactional data for summary purposes is problematic. Therefore, we have evaluated new approaches to handle vast amounts of data. We have investigated a class of database technologies commonly referred to as NoSQL databases. This includes distributed filesystems, like HDFS, that support parallel execution of computational tasks on distributed data, as well as schema-less approaches via key-value stores, like HBase. In this talk we will describe our use cases in ATLAS, share our experiences with various databases used ...

  8. Molecular Therapy for Degenerative Disc Disease: Clues from Secretome Analysis of the Notochordal Cell-Rich Nucleus Pulposus

    Science.gov (United States)

    Matta, Ajay; Karim, M. Zia; Isenman, David E.; Erwin, W. Mark

    2017-01-01

    Degenerative disc disease (DDD) is associated with spinal pain often leading to long-term disability. However, the non-chondrodystrophic canine intervertebral disc is protected from the development of DDD, ostensibly due to its retention of notochordal cells (NC) in the nucleus pulposus (NP). In this study, we hypothesized that secretome analysis of the NC-rich NP will lead to the identification of key proteins that delay the onset of DDD. Using mass-spectrometry, we identified 303 proteins including components of TGFβ- and Wnt-signaling, anti-angiogeneic factors and proteins that inhibit axonal ingrowth in the bioactive fractions of serum free, notochordal cell derived conditioned medium (NCCM). Ingenuity Pathway Analysis revealed TGFβ1 and CTGF as major hubs in protein interaction networks. In vitro treatment with TGFβ1 and CTGF promoted the synthesis of healthy extra-cellular matrix proteins, increased cell proliferation and reduced cell death in human degenerative disc NP cells. A single intra-discal injection of recombinant TGFβ1 and CTGF proteins in a pre-clinical rat-tail disc injury model restored the NC and stem cell rich NP. In conclusion, we demonstrate the potential of TGFβ1 and CTGF to mitigate the progression of disc degeneration and the potential use of these molecules in a molecular therapy to treat the degenerative disc. PMID:28358123

  9. TriMEDB: A database to integrate transcribed markers and facilitate genetic studies of the tribe Triticeae

    Directory of Open Access Journals (Sweden)

    Yoshida Takuhiro

    2008-06-01

    Full Text Available Abstract Background The recent rapid accumulation of sequence resources of various crop species ensures an improvement in the genetics approach, including quantitative trait loci (QTL analysis as well as the holistic population analysis and association mapping of natural variations. Because the tribe Triticeae includes important cereals such as wheat and barley, integration of information on the genetic markers in these crops should effectively accelerate map-based genetic studies on Triticeae species and lead to the discovery of key loci involved in plant productivity, which can contribute to sustainable food production. Therefore, informatics applications and a semantic knowledgebase of genome-wide markers are required for the integration of information on and further development of genetic markers in wheat and barley in order to advance conventional marker-assisted genetic analyses and population genomics of Triticeae species. Description The Triticeae mapped expressed sequence tag (EST database (TriMEDB provides information, along with various annotations, regarding mapped cDNA markers that are related to barley and their homologues in wheat. The current version of TriMEDB provides map-location data for barley and wheat ESTs that were retrieved from 3 published barley linkage maps (the barley single nucleotide polymorphism database of the Scottish Crop Research Institute, the barley transcript map of Leibniz Institute of Plant Genetics and Crop Plant Research, and HarvEST barley ver. 1.63 and 1 diploid wheat map. These data were imported to CMap to allow the visualization of the map positions of the ESTs and interrelationships of these ESTs with public gene models and representative cDNA sequences. The retrieved cDNA sequences corresponding to each EST marker were assigned to the rice genome to predict an exon-intron structure. Furthermore, to generate a unique set of EST markers in Triticeae plants among the public domain, 3472 markers were

  10. SPAN: A Network Providing Integrated, End-to-End, Sensor-to-Database Solutions for Environmental Sciences

    Science.gov (United States)

    Benzel, T.; Cho, Y. H.; Deschon, A.; Gullapalli, S.; Silva, F.

    2009-12-01

    In recent years, advances in sensor network technology have shown great promise to revolutionize environmental data collection. Still, wide spread adoption of these systems by domain experts has been lacking, and these have remained the purview of the engineers who design them. While there are many data logging options for basic data collection in the field currently, scientists are often required to visit the deployment sites to retrieve their data and manually import it into spreadsheets. Some advanced commercial software systems do allow scientists to collect data remotely, but most of these systems only allow point-to-point access, and require proprietary hardware. Furthermore, these commercial solutions preclude the use of sensors from other manufacturers or integration with internet based database repositories and compute engines. Therefore, scientists often must download and manually reformat their data before uploading it to the repositories if they wish to share their data. We present an open-source, low-cost, extensible, turnkey solution called Sensor Processing and Acquisition Network (SPAN) which provides a robust and flexible sensor network service. At the deployment site, SPAN leverages low-power generic embedded processors to integrate variety of commercially available sensor hardware to the network of environmental observation systems. By bringing intelligence close to the sensed phenomena, we can remotely control configuration and re-use, establish rules to trigger sensor activity, manage power requirements, and control the two-way flow of sensed data as well as control information to the sensors. Key features of our design include (1) adoption of a hardware agnostic architecture: our solutions are compatible with several programmable platforms, sensor systems, communication devices and protocols. (2) information standardization: our system supports several popular communication protocols and data formats, and (3) extensible data support: our

  11. SPIRE Data-Base Management System

    Science.gov (United States)

    Fuechsel, C. F.

    1984-01-01

    Spacelab Payload Integration and Rocket Experiment (SPIRE) data-base management system (DBMS) based on relational model of data bases. Data bases typically used for engineering and mission analysis tasks and, unlike most commercially available systems, allow data items and data structures stored in forms suitable for direct analytical computation. SPIRE DBMS designed to support data requests from interactive users as well as applications programs.

  12. Integrating Informatics Technologies into Oracle

    Directory of Open Access Journals (Sweden)

    Manole VELICANU

    2006-01-01

    Full Text Available A characteristic of the actual informatics’ context is the interference of the technologies, which assumes that for creating an informatics product, is necessary to use integrate many technologies. This thing is also used for database systems which had integrated, in the past few years, almost everything is new in informatics technology. The idea is that when using database management systems - DBMS the user can benefit all the necessary interfaces and instruments for developing an application with databases from the very beginning to the end, no matter the type of application and the work environment. For example, if the database application needs any Internet facilities these could be appealed from the products that the DBMS is working with offers. The concept of the interference of informatics technologies has many advantages, which all contribute to increasing the efficiency of the activities that develop and maintain complex databases applications.

  13. SQL Server 2012 data integration recipes solutions for integration services and other ETL tools

    CERN Document Server

    Aspin, Adam

    2012-01-01

    SQL Server 2012 Data Integration Recipes provides focused and practical solutions to real world problems of data integration. Need to import data into SQL Server from an outside source? Need to export data and send it to another system? SQL Server 2012 Data Integration Recipes has your back. You'll find solutions for importing from Microsoft Office data stores such as Excel and Access, from text files such as CSV files, from XML, from other database brands such as Oracle and MySQL, and even from other SQL Server databases. You'll learn techniques for managing metadata, transforming data to mee

  14. International integral experiments databases in support of nuclear data and code validation

    International Nuclear Information System (INIS)

    Briggs, J. Blair; Gado, Janos; Hunter, Hamilton; Kodeli, Ivan; Salvatores, Massimo; Sartori, Enrico

    2002-01-01

    The OECD/NEA Nuclear Science Committee (NSC) has identified the need to establish international databases containing all the important experiments that are available for sharing among the specialists. The NSC has set up or sponsored specific activities to achieve this. The aim is to preserve them in an agreed standard format in computer accessible form, to use them for international activities involving validation of current and new calculational schemes including computer codes and nuclear data libraries, for assessing uncertainties, confidence bounds and safety margins, and to record measurement methods and techniques. The databases so far established or in preparation related to nuclear data validation cover the following areas: SINBAD - A Radiation Shielding Experiments database encompassing reactor shielding, fusion blanket neutronics, and accelerator shielding. ICSBEP - International Criticality Safety Benchmark Experiments Project Handbook, with more than 2500 critical configurations with different combination of materials and spectral indices. IRPhEP - International Reactor Physics Experimental Benchmarks Evaluation Project. The different projects are described in the following including results achieved, work in progress and planned. (author)

  15. Database resources for the tuberculosis community.

    Science.gov (United States)

    Lew, Jocelyne M; Mao, Chunhong; Shukla, Maulik; Warren, Andrew; Will, Rebecca; Kuznetsov, Dmitry; Xenarios, Ioannis; Robertson, Brian D; Gordon, Stephen V; Schnappinger, Dirk; Cole, Stewart T; Sobral, Bruno

    2013-01-01

    Access to online repositories for genomic and associated "-omics" datasets is now an essential part of everyday research activity. It is important therefore that the Tuberculosis community is aware of the databases and tools available to them online, as well as for the database hosts to know what the needs of the research community are. One of the goals of the Tuberculosis Annotation Jamboree, held in Washington DC on March 7th-8th 2012, was therefore to provide an overview of the current status of three key Tuberculosis resources, TubercuList (tuberculist.epfl.ch), TB Database (www.tbdb.org), and Pathosystems Resource Integration Center (PATRIC, www.patricbrc.org). Here we summarize some key updates and upcoming features in TubercuList, and provide an overview of the PATRIC site and its online tools for pathogen RNA-Seq analysis. Copyright © 2012 Elsevier Ltd. All rights reserved.

  16. Wireless Sensor Networks Database: Data Management and Implementation

    Directory of Open Access Journals (Sweden)

    Ping Liu

    2014-04-01

    Full Text Available As the core application of wireless sensor network technology, Data management and processing have become the research hotspot in the new database. This article studied mainly data management in wireless sensor networks, in connection with the characteristics of the data in wireless sensor networks, discussed wireless sensor network data query, integrating technology in-depth, proposed a mobile database structure based on wireless sensor network and carried out overall design and implementation for the data management system. In order to achieve the communication rules of above routing trees, network manager uses a simple maintenance algorithm of routing trees. Design ordinary node end, server end in mobile database at gathering nodes and mobile client end that can implement the system, focus on designing query manager, storage modules and synchronous module at server end in mobile database at gathering nodes.

  17. ViralORFeome: an integrated database to generate a versatile collection of viral ORFs.

    Science.gov (United States)

    Pellet, J; Tafforeau, L; Lucas-Hourani, M; Navratil, V; Meyniel, L; Achaz, G; Guironnet-Paquet, A; Aublin-Gex, A; Caignard, G; Cassonnet, P; Chaboud, A; Chantier, T; Deloire, A; Demeret, C; Le Breton, M; Neveu, G; Jacotot, L; Vaglio, P; Delmotte, S; Gautier, C; Combet, C; Deleage, G; Favre, M; Tangy, F; Jacob, Y; Andre, P; Lotteau, V; Rabourdin-Combe, C; Vidalain, P O

    2010-01-01

    Large collections of protein-encoding open reading frames (ORFs) established in a versatile recombination-based cloning system have been instrumental to study protein functions in high-throughput assays. Such 'ORFeome' resources have been developed for several organisms but in virology, plasmid collections covering a significant fraction of the virosphere are still needed. In this perspective, we present ViralORFeome 1.0 (http://www.viralorfeome.com), an open-access database and management system that provides an integrated set of bioinformatic tools to clone viral ORFs in the Gateway(R) system. ViralORFeome provides a convenient interface to navigate through virus genome sequences, to design ORF-specific cloning primers, to validate the sequence of generated constructs and to browse established collections of virus ORFs. Most importantly, ViralORFeome has been designed to manage all possible variants or mutants of a given ORF so that the cloning procedure can be applied to any emerging virus strain. A subset of plasmid constructs generated with ViralORFeome platform has been tested with success for heterologous protein expression in different expression systems at proteome scale. ViralORFeome should provide our community with a framework to establish a large collection of virus ORF clones, an instrumental resource to determine functions, activities and binding partners of viral proteins.

  18. Constructing a Geology Ontology Using a Relational Database

    Science.gov (United States)

    Hou, W.; Yang, L.; Yin, S.; Ye, J.; Clarke, K.

    2013-12-01

    In geology community, the creation of a common geology ontology has become a useful means to solve problems of data integration, knowledge transformation and the interoperation of multi-source, heterogeneous and multiple scale geological data. Currently, human-computer interaction methods and relational database-based methods are the primary ontology construction methods. Some human-computer interaction methods such as the Geo-rule based method, the ontology life cycle method and the module design method have been proposed for applied geological ontologies. Essentially, the relational database-based method is a reverse engineering of abstracted semantic information from an existing database. The key is to construct rules for the transformation of database entities into the ontology. Relative to the human-computer interaction method, relational database-based methods can use existing resources and the stated semantic relationships among geological entities. However, two problems challenge the development and application. One is the transformation of multiple inheritances and nested relationships and their representation in an ontology. The other is that most of these methods do not measure the semantic retention of the transformation process. In this study, we focused on constructing a rule set to convert the semantics in a geological database into a geological ontology. According to the relational schema of a geological database, a conversion approach is presented to convert a geological spatial database to an OWL-based geological ontology, which is based on identifying semantics such as entities, relationships, inheritance relationships, nested relationships and cluster relationships. The semantic integrity of the transformation was verified using an inverse mapping process. In a geological ontology, an inheritance and union operations between superclass and subclass were used to present the nested relationship in a geochronology and the multiple inheritances

  19. TransAtlasDB: an integrated database connecting expression data, metadata and variants

    Science.gov (United States)

    Adetunji, Modupeore O; Lamont, Susan J; Schmidt, Carl J

    2018-01-01

    Abstract High-throughput transcriptome sequencing (RNAseq) is the universally applied method for target-free transcript identification and gene expression quantification, generating huge amounts of data. The constraint of accessing such data and interpreting results can be a major impediment in postulating suitable hypothesis, thus an innovative storage solution that addresses these limitations, such as hard disk storage requirements, efficiency and reproducibility are paramount. By offering a uniform data storage and retrieval mechanism, various data can be compared and easily investigated. We present a sophisticated system, TransAtlasDB, which incorporates a hybrid architecture of both relational and NoSQL databases for fast and efficient data storage, processing and querying of large datasets from transcript expression analysis with corresponding metadata, as well as gene-associated variants (such as SNPs) and their predicted gene effects. TransAtlasDB provides the data model of accurate storage of the large amount of data derived from RNAseq analysis and also methods of interacting with the database, either via the command-line data management workflows, written in Perl, with useful functionalities that simplifies the complexity of data storage and possibly manipulation of the massive amounts of data generated from RNAseq analysis or through the web interface. The database application is currently modeled to handle analyses data from agricultural species, and will be expanded to include more species groups. Overall TransAtlasDB aims to serve as an accessible repository for the large complex results data files derived from RNAseq gene expression profiling and variant analysis. Database URL: https://modupeore.github.io/TransAtlasDB/ PMID:29688361

  20. BDHI: a French national database on historical floods

    Directory of Open Access Journals (Sweden)

    Lang Michel

    2016-01-01

    Full Text Available The paper describes the various features of the BDHI database (objects, functions, content. This document database provides document sheets on historical floods from various sources: technical reports from water authorities, scientific accounts (meteorology, hydrology, hydraulics..., post-disaster reports, newspapers or book extracts... It is complemented by fact sheets on flood events, which provide a summary text on significant past floods: location, date and duration, type of flood, extent, probability, adverse consequences A search engine is provided for information search based on time (specific date or period, on location (district, basin, city or thematic topic (document type, flood type, flood magnitude, flood impact.... We conclude by some future challenges in relation to the next cycle of the Floods Directive (2016-2022, with the inventory of past floods which had significant adverse impacts. What are the flood events that need to be integrated (new ones later than 2011 and/or previous floods that had not yet been selected? How can the process of historical data integration be extended at a local scale, with an adequate process of validation? How to promote the use of BDHI database in relation with the development of the culture of risk?

  1. M4FT-16LL080302052-Update to Thermodynamic Database Development and Sorption Database Integration

    Energy Technology Data Exchange (ETDEWEB)

    Zavarin, Mavrik [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States). Glenn T. Seaborg Inst.. Physical and Life Sciences; Wolery, T. J. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States). Akima Infrastructure Services, LLC; Atkins-Duffin, C. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States). Global Security

    2016-08-16

    This progress report (Level 4 Milestone Number M4FT-16LL080302052) summarizes research conducted at Lawrence Livermore National Laboratory (LLNL) within the Argillite Disposal R&D Work Package Number FT-16LL08030205. The focus of this research is the thermodynamic modeling of Engineered Barrier System (EBS) materials and properties and development of thermodynamic databases and models to evaluate the stability of EBS materials and their interactions with fluids at various physico-chemical conditions relevant to subsurface repository environments. The development and implementation of equilibrium thermodynamic models are intended to describe chemical and physical processes such as solubility, sorption, and diffusion.

  2. Common hyperspectral image database design

    Science.gov (United States)

    Tian, Lixun; Liao, Ningfang; Chai, Ali

    2009-11-01

    This paper is to introduce Common hyperspectral image database with a demand-oriented Database design method (CHIDB), which comprehensively set ground-based spectra, standardized hyperspectral cube, spectral analysis together to meet some applications. The paper presents an integrated approach to retrieving spectral and spatial patterns from remotely sensed imagery using state-of-the-art data mining and advanced database technologies, some data mining ideas and functions were associated into CHIDB to make it more suitable to serve in agriculture, geological and environmental areas. A broad range of data from multiple regions of the electromagnetic spectrum is supported, including ultraviolet, visible, near-infrared, thermal infrared, and fluorescence. CHIDB is based on dotnet framework and designed by MVC architecture including five main functional modules: Data importer/exporter, Image/spectrum Viewer, Data Processor, Parameter Extractor, and On-line Analyzer. The original data were all stored in SQL server2008 for efficient search, query and update, and some advance Spectral image data Processing technology are used such as Parallel processing in C#; Finally an application case is presented in agricultural disease detecting area.

  3. Building a multi-scaled geospatial temporal ecology database from disparate data sources: Fostering open science through data reuse

    Science.gov (United States)

    Soranno, Patricia A.; Bissell, E.G.; Cheruvelil, Kendra S.; Christel, Samuel T.; Collins, Sarah M.; Fergus, C. Emi; Filstrup, Christopher T.; Lapierre, Jean-Francois; Lotting, Noah R.; Oliver, Samantha K.; Scott, Caren E.; Smith, Nicole J.; Stopyak, Scott; Yuan, Shuai; Bremigan, Mary Tate; Downing, John A.; Gries, Corinna; Henry, Emily N.; Skaff, Nick K.; Stanley, Emily H.; Stow, Craig A.; Tan, Pang-Ning; Wagner, Tyler; Webster, Katherine E.

    2015-01-01

    Although there are considerable site-based data for individual or groups of ecosystems, these datasets are widely scattered, have different data formats and conventions, and often have limited accessibility. At the broader scale, national datasets exist for a large number of geospatial features of land, water, and air that are needed to fully understand variation among these ecosystems. However, such datasets originate from different sources and have different spatial and temporal resolutions. By taking an open-science perspective and by combining site-based ecosystem datasets and national geospatial datasets, science gains the ability to ask important research questions related to grand environmental challenges that operate at broad scales. Documentation of such complicated database integration efforts, through peer-reviewed papers, is recommended to foster reproducibility and future use of the integrated database. Here, we describe the major steps, challenges, and considerations in building an integrated database of lake ecosystems, called LAGOS (LAke multi-scaled GeOSpatial and temporal database), that was developed at the sub-continental study extent of 17 US states (1,800,000 km2). LAGOS includes two modules: LAGOSGEO, with geospatial data on every lake with surface area larger than 4 ha in the study extent (~50,000 lakes), including climate, atmospheric deposition, land use/cover, hydrology, geology, and topography measured across a range of spatial and temporal extents; and LAGOSLIMNO, with lake water quality data compiled from ~100 individual datasets for a subset of lakes in the study extent (~10,000 lakes). Procedures for the integration of datasets included: creating a flexible database design; authoring and integrating metadata; documenting data provenance; quantifying spatial measures of geographic data; quality-controlling integrated and derived data; and extensively documenting the database. Our procedures make a large, complex, and integrated

  4. Building a multi-scaled geospatial temporal ecology database from disparate data sources: fostering open science and data reuse.

    Science.gov (United States)

    Soranno, Patricia A; Bissell, Edward G; Cheruvelil, Kendra S; Christel, Samuel T; Collins, Sarah M; Fergus, C Emi; Filstrup, Christopher T; Lapierre, Jean-Francois; Lottig, Noah R; Oliver, Samantha K; Scott, Caren E; Smith, Nicole J; Stopyak, Scott; Yuan, Shuai; Bremigan, Mary Tate; Downing, John A; Gries, Corinna; Henry, Emily N; Skaff, Nick K; Stanley, Emily H; Stow, Craig A; Tan, Pang-Ning; Wagner, Tyler; Webster, Katherine E

    2015-01-01

    Although there are considerable site-based data for individual or groups of ecosystems, these datasets are widely scattered, have different data formats and conventions, and often have limited accessibility. At the broader scale, national datasets exist for a large number of geospatial features of land, water, and air that are needed to fully understand variation among these ecosystems. However, such datasets originate from different sources and have different spatial and temporal resolutions. By taking an open-science perspective and by combining site-based ecosystem datasets and national geospatial datasets, science gains the ability to ask important research questions related to grand environmental challenges that operate at broad scales. Documentation of such complicated database integration efforts, through peer-reviewed papers, is recommended to foster reproducibility and future use of the integrated database. Here, we describe the major steps, challenges, and considerations in building an integrated database of lake ecosystems, called LAGOS (LAke multi-scaled GeOSpatial and temporal database), that was developed at the sub-continental study extent of 17 US states (1,800,000 km(2)). LAGOS includes two modules: LAGOSGEO, with geospatial data on every lake with surface area larger than 4 ha in the study extent (~50,000 lakes), including climate, atmospheric deposition, land use/cover, hydrology, geology, and topography measured across a range of spatial and temporal extents; and LAGOSLIMNO, with lake water quality data compiled from ~100 individual datasets for a subset of lakes in the study extent (~10,000 lakes). Procedures for the integration of datasets included: creating a flexible database design; authoring and integrating metadata; documenting data provenance; quantifying spatial measures of geographic data; quality-controlling integrated and derived data; and extensively documenting the database. Our procedures make a large, complex, and integrated

  5. Design research of uranium mine borehole database

    International Nuclear Information System (INIS)

    Xie Huaming; Hu Guangdao; Zhu Xianglin; Chen Dehua; Chen Miaoshun

    2008-01-01

    With short supply of energy sources, exploration of uranium mine have been enhanced, but data storage, analysis and usage of exploration data of uranium mine are not highly computerized currently in China, the data is poor shared and used that it can not adapt the need of production and research. It will be well done, if the data are stored and managed in a database system. The concept structure design, logic structure design and data integrity checks are discussed according to the demand of applications and the analysis of exploration data of uranium mine. An application of the database is illustrated finally. (authors)

  6. Update History of This Database - Arabidopsis Phenome Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Arabidopsis Phenome Database Update History of This Database Date Update contents 2017/02/27 Arabidopsis Phenome Data...base English archive site is opened. - Arabidopsis Phenome Database (http://jphenom...e.info/?page_id=95) is opened. About This Database Database Description Download License Update History of This Database... Site Policy | Contact Us Update History of This Database - Arabidopsis Phenome Database | LSDB Archive ...

  7. Performance assessment of EMR systems based on post-relational database.

    Science.gov (United States)

    Yu, Hai-Yan; Li, Jing-Song; Zhang, Xiao-Guang; Tian, Yu; Suzuki, Muneou; Araki, Kenji

    2012-08-01

    Post-relational databases provide high performance and are currently widely used in American hospitals. As few hospital information systems (HIS) in either China or Japan are based on post-relational databases, here we introduce a new-generation electronic medical records (EMR) system called Hygeia, which was developed with the post-relational database Caché and the latest platform Ensemble. Utilizing the benefits of a post-relational database, Hygeia is equipped with an "integration" feature that allows all the system users to access data-with a fast response time-anywhere and at anytime. Performance tests of databases in EMR systems were implemented in both China and Japan. First, a comparison test was conducted between a post-relational database, Caché, and a relational database, Oracle, embedded in the EMR systems of a medium-sized first-class hospital in China. Second, a user terminal test was done on the EMR system Izanami, which is based on the identical database Caché and operates efficiently at the Miyazaki University Hospital in Japan. The results proved that the post-relational database Caché works faster than the relational database Oracle and showed perfect performance in the real-time EMR system.

  8. BioCarian: search engine for exploratory searches in heterogeneous biological databases.

    Science.gov (United States)

    Zaki, Nazar; Tennakoon, Chandana

    2017-10-02

    There are a large number of biological databases publicly available for scientists in the web. Also, there are many private databases generated in the course of research projects. These databases are in a wide variety of formats. Web standards have evolved in the recent times and semantic web technologies are now available to interconnect diverse and heterogeneous sources of data. Therefore, integration and querying of biological databases can be facilitated by techniques used in semantic web. Heterogeneous databases can be converted into Resource Description Format (RDF) and queried using SPARQL language. Searching for exact queries in these databases is trivial. However, exploratory searches need customized solutions, especially when multiple databases are involved. This process is cumbersome and time consuming for those without a sufficient background in computer science. In this context, a search engine facilitating exploratory searches of databases would be of great help to the scientific community. We present BioCarian, an efficient and user-friendly search engine for performing exploratory searches on biological databases. The search engine is an interface for SPARQL queries over RDF databases. We note that many of the databases can be converted to tabular form. We first convert the tabular databases to RDF. The search engine provides a graphical interface based on facets to explore the converted databases. The facet interface is more advanced than conventional facets. It allows complex queries to be constructed, and have additional features like ranking of facet values based on several criteria, visually indicating the relevance of a facet value and presenting the most important facet values when a large number of choices are available. For the advanced users, SPARQL queries can be run directly on the databases. Using this feature, users will be able to incorporate federated searches of SPARQL endpoints. We used the search engine to do an exploratory search

  9. Second-Tier Database for Ecosystem Focus, 2000-2001 Annual Report.

    Energy Technology Data Exchange (ETDEWEB)

    Van Holmes, Chris; Muongchanh, Christine; Anderson, James J. (University of Washington, School of Aquatic and Fishery Sciences, Seattle, WA)

    2001-11-01

    The Second-Tier Database for Ecosystem Focus (Contract 00004124) provides direct and timely public access to Columbia Basin environmental, operational, fishery and riverine data resources for federal, state, public and private entities. The Second-Tier Database known as Data Access in Realtime (DART) does not duplicate services provided by other government entities in the region. Rather, it integrates public data for effective access, consideration and application.

  10. Refactoring databases evolutionary database design

    CERN Document Server

    Ambler, Scott W

    2006-01-01

    Refactoring has proven its value in a wide range of development projects–helping software professionals improve system designs, maintainability, extensibility, and performance. Now, for the first time, leading agile methodologist Scott Ambler and renowned consultant Pramodkumar Sadalage introduce powerful refactoring techniques specifically designed for database systems. Ambler and Sadalage demonstrate how small changes to table structures, data, stored procedures, and triggers can significantly enhance virtually any database design–without changing semantics. You’ll learn how to evolve database schemas in step with source code–and become far more effective in projects relying on iterative, agile methodologies. This comprehensive guide and reference helps you overcome the practical obstacles to refactoring real-world databases by covering every fundamental concept underlying database refactoring. Using start-to-finish examples, the authors walk you through refactoring simple standalone databas...

  11. Update History of This Database - SKIP Stemcell Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us SKIP Stemcell Database Update History of This Database Date Update contents 2017/03/13 SKIP Stemcell Database... English archive site is opened. 2013/03/29 SKIP Stemcell Database ( https://www.skip.med.k...eio.ac.jp/SKIPSearch/top?lang=en ) is opened. About This Database Database Description Download License Update History of This Databa...se Site Policy | Contact Us Update History of This Database - SKIP Stemcell Database | LSDB Archive ...

  12. Development of a virtual private database for a multi-institutional internet-based radiation oncology database overcoming differences in protocols

    International Nuclear Information System (INIS)

    Harauchi, Hajime; Kondo, Takashi; Kumasaki, Yu

    2002-01-01

    A multi-institutional Radiation Oncology Greater Area Database (ROGAD) was started in 1991 under the direction of the Japanese Society for Therapeutic Radiology and Oncology (JASTRO). Use of ROGAD was intended to allow reflection of results of data analysis into treatment strategy and treatment planning for individual cases, to provide quality assurance, to maximize the efficacy of radiotherapy, to allow assessment of new technologies or new modalities, and to optimize medical decision making. ROGAD collected 13,448 radiotherapy treatment cases from 325 facilities during the period from 1992 to 2001. In 2000, questionnaires were sent to 725 radiotherapy facilities throughout Japan, to further obtain the situation of the radiation oncology database. Workers at 179 facilities replied that ''the protocol of my facility is different from ROGAD protocol and I must send data according to the ROGAD protocol''. So, we developed the Virtual Private Database System (VPDS) which is operated as if an oncologist had a database solely owned by his own facility, in spite of actually operating ROGAD. VPDS realizes integration of different plural databases, regardless of differences in entry methods, protocols, definitions and interpretations of contents of clinical data elements between facilities. (author)

  13. Influenza research database: an integrated bioinformatics resource for influenza virus research

    Science.gov (United States)

    The Influenza Research Database (IRD) is a U.S. National Institute of Allergy and Infectious Diseases (NIAID)-sponsored Bioinformatics Resource Center dedicated to providing bioinformatics support for influenza virus research. IRD facilitates the research and development of vaccines, diagnostics, an...

  14. Quality assurance database for the CBM silicon tracking system

    Energy Technology Data Exchange (ETDEWEB)

    Lymanets, Anton [Physikalisches Institut, Universitaet Tuebingen (Germany); Collaboration: CBM-Collaboration

    2015-07-01

    The Silicon Tracking System is a main tracking device of the CBM Experiment at FAIR. Its construction includes production, quality assurance and assembly of large number of components, e.g., 106 carbon fiber support structures, 1300 silicon microstrip sensors, 16.6k readout chips, analog microcables, etc. Detector construction is distributed over several production and assembly sites and calls for a database that would be extensible and allow tracing the components, integrating the test data, monitoring the component statuses and data flow. A possible implementation of the above-mentioned requirements is being developed at GSI (Darmstadt) based on the FAIR DB Virtual Database Library that provides connectivity to common SQL-Database engines (PostgreSQL, Oracle, etc.). Data structure, database architecture as well as status of implementation are discussed.

  15. The computational design of Geological Disposal Technology Integration System

    International Nuclear Information System (INIS)

    Ishihara, Yoshinao; Iwamoto, Hiroshi; Kobayashi, Shigeki; Neyama, Atsushi; Endo, Shuji; Shindo, Tomonori

    2002-03-01

    In order to develop 'Geological Disposal Technology Integration System' that is intended to systematize as knowledge base for fundamental study, the computational design of an indispensable database and image processing function to 'Geological Disposal Technology Integration System' was done, the prototype was made for trial purposes, and the function was confirmed. (1) Database of Integration System which systematized necessary information and relating information as an examination of a whole of repository composition and managed were constructed, and the system function was constructed as a system composed of image processing, analytical information management, the repository component management, and the system security function. (2) The range of the data treated with this system and information was examined, the design examination of the database structure was done, and the design examination of the image processing function of the data preserved in an integrated database was done. (3) The prototype of the database concerning a basic function, the system operation interface, and the image processing function was manufactured to verify the feasibility of the 'Geological Disposal Technology Integration System' based on the result of the design examination and the function was confirmed. (author)

  16. NCBI2RDF: Enabling Full RDF-Based Access to NCBI Databases

    Directory of Open Access Journals (Sweden)

    Alberto Anguita

    2013-01-01

    Full Text Available RDF has become the standard technology for enabling interoperability among heterogeneous biomedical databases. The NCBI provides access to a large set of life sciences databases through a common interface called Entrez. However, the latter does not provide RDF-based access to such databases, and, therefore, they cannot be integrated with other RDF-compliant databases and accessed via SPARQL query interfaces. This paper presents the NCBI2RDF system, aimed at providing RDF-based access to the complete NCBI data repository. This API creates a virtual endpoint for servicing SPARQL queries over different NCBI repositories and presenting to users the query results in SPARQL results format, thus enabling this data to be integrated and/or stored with other RDF-compliant repositories. SPARQL queries are dynamically resolved, decomposed, and forwarded to the NCBI-provided E-utilities programmatic interface to access the NCBI data. Furthermore, we show how our approach increases the expressiveness of the native NCBI querying system, allowing several databases to be accessed simultaneously. This feature significantly boosts productivity when working with complex queries and saves time and effort to biomedical researchers. Our approach has been validated with a large number of SPARQL queries, thus proving its reliability and enhanced capabilities in biomedical environments.

  17. NCBI2RDF: Enabling Full RDF-Based Access to NCBI Databases

    Science.gov (United States)

    Anguita, Alberto; García-Remesal, Miguel; de la Iglesia, Diana; Maojo, Victor

    2013-01-01

    RDF has become the standard technology for enabling interoperability among heterogeneous biomedical databases. The NCBI provides access to a large set of life sciences databases through a common interface called Entrez. However, the latter does not provide RDF-based access to such databases, and, therefore, they cannot be integrated with other RDF-compliant databases and accessed via SPARQL query interfaces. This paper presents the NCBI2RDF system, aimed at providing RDF-based access to the complete NCBI data repository. This API creates a virtual endpoint for servicing SPARQL queries over different NCBI repositories and presenting to users the query results in SPARQL results format, thus enabling this data to be integrated and/or stored with other RDF-compliant repositories. SPARQL queries are dynamically resolved, decomposed, and forwarded to the NCBI-provided E-utilities programmatic interface to access the NCBI data. Furthermore, we show how our approach increases the expressiveness of the native NCBI querying system, allowing several databases to be accessed simultaneously. This feature significantly boosts productivity when working with complex queries and saves time and effort to biomedical researchers. Our approach has been validated with a large number of SPARQL queries, thus proving its reliability and enhanced capabilities in biomedical environments. PMID:23984425

  18. NCBI2RDF: enabling full RDF-based access to NCBI databases.

    Science.gov (United States)

    Anguita, Alberto; García-Remesal, Miguel; de la Iglesia, Diana; Maojo, Victor

    2013-01-01

    RDF has become the standard technology for enabling interoperability among heterogeneous biomedical databases. The NCBI provides access to a large set of life sciences databases through a common interface called Entrez. However, the latter does not provide RDF-based access to such databases, and, therefore, they cannot be integrated with other RDF-compliant databases and accessed via SPARQL query interfaces. This paper presents the NCBI2RDF system, aimed at providing RDF-based access to the complete NCBI data repository. This API creates a virtual endpoint for servicing SPARQL queries over different NCBI repositories and presenting to users the query results in SPARQL results format, thus enabling this data to be integrated and/or stored with other RDF-compliant repositories. SPARQL queries are dynamically resolved, decomposed, and forwarded to the NCBI-provided E-utilities programmatic interface to access the NCBI data. Furthermore, we show how our approach increases the expressiveness of the native NCBI querying system, allowing several databases to be accessed simultaneously. This feature significantly boosts productivity when working with complex queries and saves time and effort to biomedical researchers. Our approach has been validated with a large number of SPARQL queries, thus proving its reliability and enhanced capabilities in biomedical environments.

  19. dEMBF: A Comprehensive Database of Enzymes of Microalgal Biofuel Feedstock.

    Science.gov (United States)

    Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar; Mishra, Barada Kanta

    2016-01-01

    Microalgae have attracted wide attention as one of the most versatile renewable feedstocks for production of biofuel. To develop genetically engineered high lipid yielding algal strains, a thorough understanding of the lipid biosynthetic pathway and the underpinning enzymes is essential. In this work, we have systematically mined the genomes of fifteen diverse algal species belonging to Chlorophyta, Heterokontophyta, Rhodophyta, and Haptophyta, to identify and annotate the putative enzymes of lipid metabolic pathway. Consequently, we have also developed a database, dEMBF (Database of Enzymes of Microalgal Biofuel Feedstock), which catalogues the complete list of identified enzymes along with their computed annotation details including length, hydrophobicity, amino acid composition, subcellular location, gene ontology, KEGG pathway, orthologous group, Pfam domain, intron-exon organization, transmembrane topology, and secondary/tertiary structural data. Furthermore, to facilitate functional and evolutionary study of these enzymes, a collection of built-in applications for BLAST search, motif identification, sequence and phylogenetic analysis have been seamlessly integrated into the database. dEMBF is the first database that brings together all enzymes responsible for lipid synthesis from available algal genomes, and provides an integrative platform for enzyme inquiry and analysis. This database will be extremely useful for algal biofuel research. It can be accessed at http://bbprof.immt.res.in/embf.

  20. Comparing human pancreatic cell secretomes by in vitro aptamer selection identifies cyclophilin B as a candidate pancreatic cancer biomarker.

    Science.gov (United States)

    Ray, Partha; Rialon-Guevara, Kristy L; Veras, Emanuela; Sullenger, Bruce A; White, Rebekah R

    2012-05-01

    Most cases of pancreatic cancer are not diagnosed until they are no longer curable with surgery. Therefore, it is critical to develop a sensitive, preferably noninvasive, method for detecting the disease at an earlier stage. In order to identify biomarkers for pancreatic cancer, we devised an in vitro positive/negative selection strategy to identify RNA ligands (aptamers) that could detect structural differences between the secretomes of pancreatic cancer and non-cancerous cells. Using this molecular recognition approach, we identified an aptamer (M9-5) that differentially bound conditioned media from cancerous and non-cancerous human pancreatic cell lines. This aptamer further discriminated between the sera of pancreatic cancer patients and healthy volunteers with high sensitivity and specificity. We utilized biochemical purification methods and mass-spectrometric analysis to identify the M9-5 target as cyclophilin B (CypB). This molecular recognition-based strategy simultaneously identified CypB as a serum biomarker and generated a new reagent to recognize it in body fluids. Moreover, this approach should be generalizable to other diseases and complementary to traditional approaches that focus on differences in expression level between samples. Finally, we suggest that the aptamer we identified has the potential to serve as a tool for the early detection of pancreatic cancer.

  1. Secretome Analysis of Metarhizium anisopliae Under Submerged Conditions Using Bombyx mori Chrysalis to Induce Expression of Virulence-Related Proteins.

    Science.gov (United States)

    Rustiguel, Cynthia Barbosa; Rosa, José Cesar; Jorge, João Atílio; de Oliveira, Arthur Henrique Cavalcanti; Guimarães, Luis Henrique Souza

    2016-02-01

    The entomopathogenic fungus Metarhizium anisopliae is used to control insect pests. This species is specialized for the secretion of an enzymatic complex consisting of proteases, lipases, and chitinases related to pathogenicity and virulence. In this context, the secretomes of strains IBCB 167 and IBCB 384 of M. anisopliae var. anisopliae, grown under submerged fermentation in the presence of chrysalis as an inducer, were analyzed. Analysis of two-dimensional gels showed qualitative and quantitative differences between secreted proteins in both isolates. Around 102 protein spots were analyzed, and 76 % of the corresponding proteins identified by mass spectrometry were grouped into different classes (hydrolases, oxidases, reductases, isomerases, kinases, WSC domains, and hypothetical proteins). Thirty-three per cent of all the proteins analyzed were found to be common in both strains. Several virulence-related proteins were identified as proteases and mannosidases. Endo-N-acetyl-β-D-glucosaminidase expression was observed to be 10.14-fold higher for strain IBCB 384 than for strain IBCB 167, which may be an important contributor to the high virulence of IBCB 384 in Diatraea ssaccharalis. These results are important for elucidation of the host-pathogen relationship and the differences in virulence observed between the two strains.

  2. ATLAS database application enhancements using Oracle 11g

    International Nuclear Information System (INIS)

    Dimitrov, G; Canali, L; Blaszczyk, M; Sorokoletov, R

    2012-01-01

    The ATLAS experiment at LHC relies on databases for detector online data-taking, storage and retrieval of configurations, calibrations and alignments, post data-taking analysis, file management over the grid, job submission and management, condition data replication to remote sites. Oracle Relational Database Management System (RDBMS) has been addressing the ATLAS database requirements to a great extent for many years. Ten database clusters are currently deployed for the needs of the different applications, divided in production, integration and standby databases. The data volume, complexity and demands from the users are increasing steadily with time. Nowadays more than 20 TB of data are stored in the ATLAS production Oracle databases at CERN (not including the index overhead), but the most impressive number is the hosted 260 database schemes (for the most common case each schema is related to a dedicated client application with its own requirements). At the beginning of 2012 all ATLAS databases at CERN have been upgraded to the newest Oracle version at the time: Oracle 11g Release 2. Oracle 11g come with several key improvements compared to previous database engine versions. In this work we present our evaluation of the most relevant new features of Oracle 11g of interest for ATLAS applications and use cases. Notably we report on the performance and scalability enhancements obtained in production since the Oracle 11g deployment during Q1 2012 and we outline plans for future work in this area.

  3. Database design and database administration for a kindergarten

    OpenAIRE

    Vítek, Daniel

    2009-01-01

    The bachelor thesis deals with creation of database design for a standard kindergarten, installation of the designed database into the database system Oracle Database 10g Express Edition and demonstration of the administration tasks in this database system. The verification of the database was proved by a developed access application.

  4. Functional Decomposition of Modeling and Simulation Terrain Database Generation Process

    National Research Council Canada - National Science Library

    Yakich, Valerie R; Lashlee, J. D

    2008-01-01

    .... This report documents the conceptual procedure as implemented by Lockheed Martin Simulation, Training, and Support and decomposes terrain database construction using the Integration Definition for Function Modeling (IDEF...

  5. Skeletal muscle secretome in Duchenne muscular dystrophy: a pivotal anti-inflammatory role of adiponectin.

    Science.gov (United States)

    Lecompte, S; Abou-Samra, M; Boursereau, R; Noel, L; Brichard, S M

    2017-07-01

    Persistent inflammation exacerbates the progression of Duchenne muscular dystrophy (DMD). The hormone, adiponectin (ApN), which is decreased in the metabolic syndrome, exhibits anti-inflammatory properties on skeletal muscle and alleviates the dystrophic phenotype of mdx mice. Here, we investigate whether ApN retains its anti-inflammatory action in myotubes obtained from DMD patients. We unravel the underlying mechanisms by studying the secretome and the early events of ApN. Primary cultures of myotubes from DMD and control patients were treated or not by ApN after an inflammatory challenge. Myokines secreted in medium were identified by cytokine antibody-arrays and ELISAs. The early events of ApN signaling were assessed by abrogating selected genes. ApN retained its anti-inflammatory properties in both dystrophic and control myotubes. Profiling of secretory products revealed that ApN downregulated the secretion of two pro-inflammatory factors (TNFα and IL-17A), one soluble receptor (sTNFRII), and one chemokine (CCL28) in DMD myotubes, while upregulating IL-6 that exerts some anti-inflammatory effects. These changes were explained by pretranslational mechanisms. Earlier events of the ApN cascade involved AdipoR1, the main receptor for muscle, and the AMPK-SIRT1-PGC-1α axis leading, besides alteration of the myokine profile, to the upregulation of utrophin A (a dystrophin analog). ApN retains its beneficial properties in dystrophic muscles by activating the AdipoR1-AMPK-SIRT1-PGC-1α pathway, thereby inducing a shift in the secretion of downstream myokines toward a less inflammatory profile while upregulating utrophin. ApN, the early events of the cascade and downstream myokines may be therapeutic targets for the management of DMD.

  6. Database Description - Open TG-GATEs Pathological Image Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Open TG-GATEs Pathological Image Database Database Description General information of database Database... name Open TG-GATEs Pathological Image Database Alternative name - DOI 10.18908/lsdba.nbdc00954-0...iomedical Innovation 7-6-8, Saito-asagi, Ibaraki-city, Osaka 567-0085, Japan TEL:81-72-641-9826 Email: Database... classification Toxicogenomics Database Organism Taxonomy Name: Rattus norvegi... Article title: Author name(s): Journal: External Links: Original website information Database

  7. Site initialization, recovery, and back-up in a distributed database system

    International Nuclear Information System (INIS)

    Attar, R.; Bernstein, P.A.; Goodman, N.

    1982-01-01

    Site initialization is the problem of integrating a new site into a running distributed database system (DDBS). Site recovery is the problem of integrating an old site into a DDBS when the site recovers from failure. Site backup is the problem of creating a static backup copy of a database for archival or query purposes. We present an algorithm that solves the site initialization problem. By modifying the algorithm slightly, we get solutions to the other two problems as well. Our algorithm exploits the fact that a correct DDBS must run a serializable concurrency control algorithm. Our algorithm relies on the concurrency control algorithm to handle all inter-site synchronization

  8. The French-German initiative for Chernobyl: programme 2: REDAC, the radioecological database after the Chernobyl accident

    International Nuclear Information System (INIS)

    Deville-Cavelin, G.; Biesold, H.; Chabanyuk, V.

    2006-01-01

    Goals: to built a database for integrating the results of programme 'Radioecology' of the French-German Initiative: Ecological portrait, initial contamination, wastes management, soil-plants and animals transfer, transfer by runoff and in the aquatic environment, countermeasures in urban and natural and agricultural environments. Specific methodology: original 'Project Solutions Framework': Information system developed as a soft integrated portal, Geo-information system: all spatial data geo-coded. DB structure: Publications: all classical informations, original data; Products: storage of open publications of the Project; Processes: management of the Project and Sub-projects; Services: information and software objects, help; Basics: information on system and organizational development. - Soft integration: cartography system: Map from 'Ecological portrait' integrated with thematic databases, Loaded in a special category (by IS Geo Internet Map Server); Cartographical functions: navigation, scaling, extracting, layer management, Databases arrangement independent of map system architecture. - Soft integration: portlets and DDB: Portlets = mini-applications for business functions and processes, made of web parts; Digital Dashboards (DDB) Portlets + web parts DDB sites = collections of DDB, adjustable by users. - General conclusions: REDAC, powerful and useful radioecological tool: All elements easily accessible through the original tool, ProSF, developed by IS Geo; Relations constructed between the documents (files, databases, documentation, reports,...); All elements structured by a meta-information; Mechanisms of search; Global radioecological glossary; Spatial data geo-coded; Processes, tools and methodology suitable for similar projects; Data useful for scientific studies, modelling, operational purposes, communication with mass media. - Outlook: Addition of functionality, support and maintenance Strong integration: Thematic integration = merging of all DB in an

  9. The French-German initiative for Chernobyl: programme 2: REDAC, the radioecological database after the Chernobyl accident

    Energy Technology Data Exchange (ETDEWEB)

    Deville-Cavelin, G. [Institut de Radioprotection et de Surete Nucleaire (IRSN), Environment and Emergency Operations Div. - Dept. for the Study of Radionuclide Behaviour in Ecosystems, 13 - Saint-Paul-lez-Durance (France); Biesold, H. [Gesellschaft fuer Anlagen- und Reaktorsicherheit mbH (GRS), Braunschweig (Germany); Chabanyuk, V. [Chornobyl Center (CC), Kiev regoin (Ukraine)

    2006-07-01

    Goals: to built a database for integrating the results of programme 'Radioecology' of the French-German Initiative: Ecological portrait, initial contamination, wastes management, soil-plants and animals transfer, transfer by runoff and in the aquatic environment, countermeasures in urban and natural and agricultural environments. Specific methodology: original 'Project Solutions Framework': Information system developed as a soft integrated portal, Geo-information system: all spatial data geo-coded. DB structure: Publications: all classical informations, original data; Products: storage of open publications of the Project; Processes: management of the Project and Sub-projects; Services: information and software objects, help; Basics: information on system and organizational development. - Soft integration: cartography system: Map from 'Ecological portrait' integrated with thematic databases, Loaded in a special category (by IS Geo Internet Map Server); Cartographical functions: navigation, scaling, extracting, layer management, Databases arrangement independent of map system architecture. - Soft integration: portlets and DDB: Portlets = mini-applications for business functions and processes, made of web parts; Digital Dashboards (DDB) Portlets + web parts DDB sites = collections of DDB, adjustable by users. - General conclusions: REDAC, powerful and useful radioecological tool: All elements easily accessible through the original tool, ProSF, developed by IS Geo; Relations constructed between the documents (files, databases, documentation, reports,...); All elements structured by a meta-information; Mechanisms of search; Global radioecological glossary; Spatial data geo-coded; Processes, tools and methodology suitable for similar projects; Data useful for scientific studies, modelling, operational purposes, communication with mass media. - Outlook: Addition of functionality, support and maintenance Strong integration: Thematic

  10. Zebrafish Database: Customizable, Free, and Open-Source Solution for Facility Management.

    Science.gov (United States)

    Yakulov, Toma Antonov; Walz, Gerd

    2015-12-01

    Zebrafish Database is a web-based customizable database solution, which can be easily adapted to serve both single laboratories and facilities housing thousands of zebrafish lines. The database allows the users to keep track of details regarding the various genomic features, zebrafish lines, zebrafish batches, and their respective locations. Advanced search and reporting options are available. Unique features are the ability to upload files and images that are associated with the respective records and an integrated calendar component that supports multiple calendars and categories. Built on the basis of the Joomla content management system, the Zebrafish Database is easily extendable without the need for advanced programming skills.

  11. Using SQL Databases for Sequence Similarity Searching and Analysis.

    Science.gov (United States)

    Pearson, William R; Mackey, Aaron J

    2017-09-13

    Relational databases can integrate diverse types of information and manage large sets of similarity search results, greatly simplifying genome-scale analyses. By focusing on taxonomic subsets of sequences, relational databases can reduce the size and redundancy of sequence libraries and improve the statistical significance of homologs. In addition, by loading similarity search results into a relational database, it becomes possible to explore and summarize the relationships between all of the proteins in an organism and those in other biological kingdoms. This unit describes how to use relational databases to improve the efficiency of sequence similarity searching and demonstrates various large-scale genomic analyses of homology-related data. It also describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. The unit also introduces search_demo, a database that stores sequence similarity search results. The search_demo database is then used to explore the evolutionary relationships between E. coli proteins and proteins in other organisms in a large-scale comparative genomic analysis. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley & Sons, Inc.

  12. Analysis of the Phlebiopsis gigantea Genome, Transcriptome and Secretome Provides Insight into Its Pioneer Colonization Strategies of Wood

    Science.gov (United States)

    Hori, Chiaki; Ishida, Takuya; Igarashi, Kiyohiko; Samejima, Masahiro; Suzuki, Hitoshi; Master, Emma; Ferreira, Patricia; Ruiz-Dueñas, Francisco J.; Held, Benjamin; Canessa, Paulo; Larrondo, Luis F.; Schmoll, Monika; Druzhinina, Irina S.; Kubicek, Christian P.; Gaskell, Jill A.; Kersten, Phil; St. John, Franz; Glasner, Jeremy; Sabat, Grzegorz; Splinter BonDurant, Sandra; Syed, Khajamohiddin; Yadav, Jagjit; Mgbeahuruike, Anthony C.; Kovalchuk, Andriy; Asiegbu, Fred O.; Lackner, Gerald; Hoffmeister, Dirk; Rencoret, Jorge; Gutiérrez, Ana; Sun, Hui; Lindquist, Erika; Barry, Kerrie; Riley, Robert; Grigoriev, Igor V.; Henrissat, Bernard; Kües, Ursula; Berka, Randy M.; Martínez, Angel T.; Covert, Sarah F.; Blanchette, Robert A.; Cullen, Daniel

    2014-01-01

    Collectively classified as white-rot fungi, certain basidiomycetes efficiently degrade the major structural polymers of wood cell walls. A small subset of these Agaricomycetes, exemplified by Phlebiopsis gigantea, is capable of colonizing freshly exposed conifer sapwood despite its high content of extractives, which retards the establishment of other fungal species. The mechanism(s) by which P. gigantea tolerates and metabolizes resinous compounds have not been explored. Here, we report the annotated P. gigantea genome and compare profiles of its transcriptome and secretome when cultured on fresh-cut versus solvent-extracted loblolly pine wood. The P. gigantea genome contains a conventional repertoire of hydrolase genes involved in cellulose/hemicellulose degradation, whose patterns of expression were relatively unperturbed by the absence of extractives. The expression of genes typically ascribed to lignin degradation was also largely unaffected. In contrast, genes likely involved in the transformation and detoxification of wood extractives were highly induced in its presence. Their products included an ABC transporter, lipases, cytochrome P450s, glutathione S-transferase and aldehyde dehydrogenase. Other regulated genes of unknown function and several constitutively expressed genes are also likely involved in P. gigantea's extractives metabolism. These results contribute to our fundamental understanding of pioneer colonization of conifer wood and provide insight into the diverse chemistries employed by fungi in carbon cycling processes. PMID:25474575

  13. Software for pipeline integrity administration

    Energy Technology Data Exchange (ETDEWEB)

    Soula, Gerardo; Perona, Lucas Fernandez [Gie SA., Buenos Aires (Argentina); Martinich, Carlos [Refinaria do Norte S. A. (REFINOR), Tartagal, Provincia de Salta (Argentina)

    2009-07-01

    A Software for 'pipeline integrity management' was developed. It allows to deal with Geographical Information and a PODS database (Pipeline Open database Standard) simultaneously, in a simple and reliable way. The premises for the design were the following: didactic, geo referenced, multiple reference systems. Program skills: 1.PODS+GIS: the PODS database in which the software is based on is completely integrated with the GIS module. 2 Management of different kinds of information: it allows to manage information on facilities, repairs, interventions, physical inspections, geographical characteristics, compliance with regulations, training, offline events, operation measures, O and M information treatment and importing specific data and studies in a massive way. It also assures the integrity of the loaded information. 3 Right of way survey: it allows to verify the class location, ROW occupation, sensitive areas identification and to manage landowners. 4 Risk analysis: it is done in a qualitative way, depending on the entered data, allowing the user to identify the riskiest stretches of the system. Either results from risk analysis, data and consultations made about the database, can be exported to standard formats. (author)

  14. Co-Option and De Novo Gene Evolution Underlie Molluscan Shell Diversity

    Science.gov (United States)

    Aguilera, Felipe; McDougall, Carmel

    2017-01-01

    Abstract Molluscs fabricate shells of incredible diversity and complexity by localized secretions from the dorsal epithelium of the mantle. Although distantly related molluscs express remarkably different secreted gene products, it remains unclear if the evolution of shell structure and pattern is underpinned by the differential co-option of conserved genes or the integration of lineage-specific genes into the mantle regulatory program. To address this, we compare the mantle transcriptomes of 11 bivalves and gastropods of varying relatedness. We find that each species, including four Pinctada (pearl oyster) species that diverged within the last 20 Ma, expresses a unique mantle secretome. Lineage- or species-specific genes comprise a large proportion of each species’ mantle secretome. A majority of these secreted proteins have unique domain architectures that include repetitive, low complexity domains (RLCDs), which evolve rapidly, and have a proclivity to expand, contract and rearrange in the genome. There are also a large number of secretome genes expressed in the mantle that arose before the origin of gastropods and bivalves. Each species expresses a unique set of these more ancient genes consistent with their independent co-option into these mantle gene regulatory networks. From this analysis, we infer lineage-specific secretomes underlie shell diversity, and include both rapidly evolving RLCD-containing proteins, and the continual recruitment and loss of both ancient and recently evolved genes into the periphery of the regulatory network controlling gene expression in the mantle epithelium. PMID:28053006

  15. Rapid HIS, RIS, PACS Integration Using Graphical CASE Tools

    Science.gov (United States)

    Taira, Ricky K.; Breant, Claudine M.; Stepczyk, Frank M.; Kho, Hwa T.; Valentino, Daniel J.; Tashima, Gregory H.; Materna, Anthony T.

    1994-05-01

    We describe the clinical requirements of the integrated federation of databases and present our client-mediator-server design. The main body of the paper describes five important aspects of integrating information systems: (1) global schema design, (2) establishing sessions with remote database servers, (3) development of schema translators, (4) integration of global system triggers, and (5) development of job workflow scripts.

  16. Advanced techniques for efficient data integrity checking

    DEFF Research Database (Denmark)

    Martinenghi, Davide

    Integrity constraint checking, understood as the verification of data correctness and well-formedness conditions that must be satisfied in any state of a database, is not fully supported by current database technology. In a typical scenario, a database is required to comply with given semantic...... criteria (the integrity constraints) and to maintain the compliance each time data are updated. Since the introduction of the SQL2 standard, the SQL language started supporting assertions, which allow one to define general data consistency requirements expressing arbitrarily complex “business rules...

  17. Active in-database processing to support ambient assisted living systems.

    Science.gov (United States)

    de Morais, Wagner O; Lundström, Jens; Wickström, Nicholas

    2014-08-12

    As an alternative to the existing software architectures that underpin the development of smart homes and ambient assisted living (AAL) systems, this work presents a database-centric architecture that takes advantage of active databases and in-database processing. Current platforms supporting AAL systems use database management systems (DBMSs) exclusively for data storage. Active databases employ database triggers to detect and react to events taking place inside or outside of the database. DBMSs can be extended with stored procedures and functions that enable in-database processing. This means that the data processing is integrated and performed within the DBMS. The feasibility and flexibility of the proposed approach were demonstrated with the implementation of three distinct AAL services. The active database was used to detect bed-exits and to discover common room transitions and deviations during the night. In-database machine learning methods were used to model early night behaviors. Consequently, active in-database processing avoids transferring sensitive data outside the database, and this improves performance, security and privacy. Furthermore, centralizing the computation into the DBMS facilitates code reuse, adaptation and maintenance. These are important system properties that take into account the evolving heterogeneity of users, their needs and the devices that are characteristic of smart homes and AAL systems. Therefore, DBMSs can provide capabilities to address requirements for scalability, security, privacy, dependability and personalization in applications of smart environments in healthcare.

  18. An integrated data-analysis and database system for AMS 14C

    International Nuclear Information System (INIS)

    Kjeldsen, Henrik; Olsen, Jesper; Heinemeier, Jan

    2010-01-01

    AMSdata is the name of a combined database and data-analysis system for AMS 14 C and stable-isotope work that has been developed at Aarhus University. The system (1) contains routines for data analysis of AMS and MS data, (2) allows a flexible and accurate description of sample extraction and pretreatment, also when samples are split into several fractions, and (3) keeps track of all measured, calculated and attributed data. The structure of the database is flexible and allows an unlimited number of measurement and pretreatment procedures. The AMS 14 C data analysis routine is fairly advanced and flexible, and it can be easily optimized for different kinds of measuring processes. Technically, the system is based on a Microsoft SQL server and includes stored SQL procedures for the data analysis. Microsoft Office Access is used for the (graphical) user interface, and in addition Excel, Word and Origin are exploited for input and output of data, e.g. for plotting data during data analysis.

  19. MINDMAP: establishing an integrated database infrastructure for research in ageing, mental well-being, and the urban environment.

    Science.gov (United States)

    Beenackers, Mariëlle A; Doiron, Dany; Fortier, Isabel; Noordzij, J Mark; Reinhard, Erica; Courtin, Emilie; Bobak, Martin; Chaix, Basile; Costa, Giuseppe; Dapp, Ulrike; Diez Roux, Ana V; Huisman, Martijn; Grundy, Emily M; Krokstad, Steinar; Martikainen, Pekka; Raina, Parminder; Avendano, Mauricio; van Lenthe, Frank J

    2018-01-19

    Urbanization and ageing have important implications for public mental health and well-being. Cities pose major challenges for older citizens, but also offer opportunities to develop, test, and implement policies, services, infrastructure, and interventions that promote mental well-being. The MINDMAP project aims to identify the opportunities and challenges posed by urban environmental characteristics for the promotion and management of mental well-being and cognitive function of older individuals. MINDMAP aims to achieve its research objectives by bringing together longitudinal studies from 11 countries covering over 35 cities linked to databases of area-level environmental exposures and social and urban policy indicators. The infrastructure supporting integration of this data will allow multiple MINDMAP investigators to safely and remotely co-analyse individual-level and area-level data. Individual-level data is derived from baseline and follow-up measurements of ten participating cohort studies and provides information on mental well-being outcomes, sociodemographic variables, health behaviour characteristics, social factors, measures of frailty, physical function indicators, and chronic conditions, as well as blood derived clinical biochemistry-based biomarkers and genetic biomarkers. Area-level information on physical environment characteristics (e.g. green spaces, transportation), socioeconomic and sociodemographic characteristics (e.g. neighbourhood income, residential segregation, residential density), and social environment characteristics (e.g. social cohesion, criminality) and national and urban social policies is derived from publically available sources such as geoportals and administrative databases. The linkage, harmonization, and analysis of data from different sources are being carried out using piloted tools to optimize the validity of the research results and transparency of the methodology. MINDMAP is a novel research collaboration that is

  20. Lessons learned while building the Deepwater Horizon Database: Toward improved data sharing in coastal science

    Science.gov (United States)

    Thessen, Anne E.; McGinnis, Sean; North, Elizabeth W.

    2016-02-01

    Process studies and coupled-model validation efforts in geosciences often require integration of multiple data types across time and space. For example, improved prediction of hydrocarbon fate and transport is an important societal need which fundamentally relies upon synthesis of oceanography and hydrocarbon chemistry. Yet, there are no publically accessible databases which integrate these diverse data types in a georeferenced format, nor are there guidelines for developing such a database. The objective of this research was to analyze the process of building one such database to provide baseline information on data sources and data sharing and to document the challenges and solutions that arose during this major undertaking. The resulting Deepwater Horizon Database was approximately 2.4 GB in size and contained over 8 million georeferenced data points collected from industry, government databases, volunteer networks, and individual researchers. The major technical challenges that were overcome were reconciliation of terms, units, and quality flags which were necessary to effectively integrate the disparate data sets. Assembling this database required the development of relationships with individual researchers and data managers which often involved extensive e-mail contacts. The average number of emails exchanged per data set was 7.8. Of the 95 relevant data sets that were discovered, 38 (40%) were obtained, either in whole or in part. Over one third (36%) of the requests for data went unanswered. The majority of responses were received after the first request (64%) and within the first week of the first request (67%). Although fewer than half of the potentially relevant datasets were incorporated into the database, the level of sharing (40%) was high compared to some other disciplines where sharing can be as low as 10%. Our suggestions for building integrated databases include budgeting significant time for e-mail exchanges, being cognizant of the cost versus