WorldWideScience

Sample records for datasets reveals strain

  1. Vibrio cholerae classical biotype strains reveal distinct signatures in Mexico.

    Science.gov (United States)

    Alam, Munirul; Islam, M Tarequl; Rashed, Shah Manzur; Johura, Fatema-tuz; Bhuiyan, Nurul A; Delgado, Gabriela; Morales, Rosario; Mendez, Jose Luis; Navarro, Armando; Watanabe, Haruo; Hasan, Nur-A; Colwell, Rita R; Cravioto, Alejandro

    2012-07-01

    Vibrio cholerae O1 classical (CL) biotype caused the fifth and sixth pandemics, and probably the earlier cholera pandemics, before the El Tor (ET) biotype initiated the seventh pandemic in Asia in the 1970s by completely displacing the CL biotype. Although the CL biotype was thought to be extinct in Asia and although it had never been reported from Latin America, V. cholerae CL and ET biotypes, including a hybrid ET, were found associated with areas of cholera endemicity in Mexico between 1991 and 1997. In this study, CL biotype strains isolated from areas of cholera endemicity in Mexico between 1983 and 1997 were characterized in terms of major phenotypic and genetic traits and compared with CL biotype strains isolated in Bangladesh between 1962 and 1989. According to sero- and biotyping data, all V. cholerae strains tested had the major phenotypic and genotypic characteristics specific for the CL biotype. Antibiograms revealed the majority of the Bangladeshi strains to be resistant to trimethoprim-sulfamethoxazole, furazolidone, ampicillin, and gentamicin, while the Mexican strains were sensitive to all of these drugs, as well as to ciprofloxacin, erythromycin, and tetracycline. Pulsed-field gel electrophoresis (PFGE) of NotI-digested genomic DNA revealed characteristic banding patterns for all of the CL biotype strains although the Mexican strains differed from the Bangladeshi strains in 1 to 2 DNA bands. The difference was subtle but consistent, as confirmed by the subclustering patterns in the PFGE-based dendrogram, and can serve as a regional signature, suggesting the pre-1991 existence and evolution of the CL biotype strains in the Americas, independent from Asia.

  2. Transcriptomic profiling of diverse Aedes aegypti strains reveals increased basal-level immune activation in dengue virus-refractory populations and identifies novel virus-vector molecular interactions.

    Directory of Open Access Journals (Sweden)

    Shuzhen Sim

    Genetic variation among Aedes aegypti populations can greatly influence their vector competence for human pathogens such as the dengue virus (DENV). While intra-species transcriptome differences remain relatively unstudied when compared to coding sequence polymorphisms, they also affect numerous aspects of mosquito biology. Comparative molecular profiling of mosquito strain transcriptomes can therefore provide valuable insight into the regulation of vector competence. We established a panel of A. aegypti strains with varying levels of susceptibility to DENV, comprising both laboratory-maintained strains and field-derived colonies collected from geographically distinct dengue-endemic regions spanning South America, the Caribbean, and Southeast Asia. A comparative genome-wide gene expression microarray-based analysis revealed higher basal levels of numerous immunity-related gene transcripts in DENV-refractory mosquito strains than in susceptible strains, and RNA interference assays further showed different degrees of immune pathway contribution to refractoriness in different strains. By correlating transcript abundance patterns with DENV susceptibility across our panel, we also identified new candidate modulators of DENV infection in the mosquito, and we provide functional evidence for two potential DENV host factors and one potential restriction factor. Our comparative transcriptome dataset thus not only provides valuable information about immune gene regulation and usage in natural refractoriness of mosquito populations to dengue virus but also allows us to identify new molecular interactions between the virus and its mosquito vector.

  3. Simulation of Smart Home Activity Datasets

    Directory of Open Access Journals (Sweden)

    Jonathan Synnott

    2015-06-01

    A globally ageing population is resulting in an increased prevalence of chronic conditions which affect older adults. Such conditions require long-term care and management to maximize quality of life, placing an increasing strain on healthcare resources. Intelligent environments such as smart homes facilitate long-term monitoring of activities in the home through the use of sensor technology. Access to sensor datasets is necessary for the development of novel activity monitoring and recognition approaches. Access to such datasets is limited due to issues such as sensor cost, availability and deployment time. The use of simulated environments and sensors may address these issues and facilitate the generation of comprehensive datasets. This paper provides a review of existing approaches for the generation of simulated smart home activity datasets, including model-based approaches and interactive approaches which implement virtual sensors, environments and avatars. The paper also provides recommendation for future work in intelligent environment simulation.

  4. Simulation of Smart Home Activity Datasets.

    Science.gov (United States)

    Synnott, Jonathan; Nugent, Chris; Jeffers, Paul

    2015-06-16

    A globally ageing population is resulting in an increased prevalence of chronic conditions which affect older adults. Such conditions require long-term care and management to maximize quality of life, placing an increasing strain on healthcare resources. Intelligent environments such as smart homes facilitate long-term monitoring of activities in the home through the use of sensor technology. Access to sensor datasets is necessary for the development of novel activity monitoring and recognition approaches. Access to such datasets is limited due to issues such as sensor cost, availability and deployment time. The use of simulated environments and sensors may address these issues and facilitate the generation of comprehensive datasets. This paper provides a review of existing approaches for the generation of simulated smart home activity datasets, including model-based approaches and interactive approaches which implement virtual sensors, environments and avatars. The paper also provides recommendation for future work in intelligent environment simulation.

  5. Comparative genomics analyses revealed two virulent Listeria monocytogenes strains isolated from ready-to-eat food.

    Science.gov (United States)

    Lim, Shu Yong; Yap, Kien-Pong; Thong, Kwai Lin

    2016-01-01

    Listeria monocytogenes is an important foodborne pathogen that causes considerable morbidity in humans with high mortality rates. In this study, we have sequenced the genomes and performed comparative genomics analyses on two strains, LM115 and LM41, isolated from ready-to-eat food in Malaysia. The genome size of LM115 and LM41 was 2,959,041 and 2,963,111 bp, respectively. These two strains shared approximately 90% homologous genes. Comparative genomics and phylogenomic analyses revealed that LM115 and LM41 were more closely related to the reference strains F2365 and EGD-e, respectively. Our virulence profiling indicated a total of 31 virulence genes shared by both analysed strains. These shared genes included those that encode for internalins and L. monocytogenes pathogenicity island 1 (LIPI-1). Both the Malaysian L. monocytogenes strains also harboured several genes associated with stress tolerance to counter the adverse conditions. Seven antibiotic and efflux pump related genes which may confer resistance against lincomycin, erythromycin, fosfomycin, quinolone, tetracycline, penicillin, and macrolides were identified in the genomes of both strains. Whole genome sequencing and comparative genomics analyses revealed two virulent L. monocytogenes strains isolated from ready-to-eat foods in Malaysia. The identification of strains with pathogenic, persistent, and antibiotic resistant potentials from minimally processed food warrants close attention from both the healthcare and food industries.

  6. Improving Phylogeny Reconstruction at the Strain Level Using Peptidome Datasets.

    Directory of Open Access Journals (Sweden)

    Aitor Blanco-Míguez

    2016-12-01

    Typical bacterial strain differentiation methods are often challenged by high genetic similarity between strains. To address this problem, we introduce a novel in silico peptide fingerprinting method based on conventional wet-lab protocols that enables the identification of potential strain-specific peptides. These can be further investigated using in vitro approaches, laying a foundation for the development of biomarker detection and application-specific methods. This novel method aims at reducing large amounts of comparative peptide data to binary matrices while maintaining a high phylogenetic resolution. The underlying case study concerns the Bacillus cereus group, namely the differentiation of Bacillus thuringiensis, Bacillus anthracis and Bacillus cereus strains. Results show that trees based on cytoplasmic and extracellular peptidomes are only marginally in conflict with those based on whole proteomes, as inferred by the established Genome-BLAST Distance Phylogeny (GBDP) method. Hence, these results indicate that the two approaches can most likely be used complementarily even in other organismal groups. The obtained results confirm previous reports about the misclassification of many strains within the B. cereus group. Moreover, our method was able to separate the B. anthracis strains with high resolution, similarly to the GBDP results as benchmarked via Bayesian inference and both Maximum Likelihood and Maximum Parsimony. In addition to the presented phylogenomic applications, whole-peptide fingerprinting might also become a valuable complementary technique to digital DNA-DNA hybridization, notably for bacterial classification at the species and subspecies level in the future.

  7. Improving Phylogeny Reconstruction at the Strain Level Using Peptidome Datasets.

    Science.gov (United States)

    Blanco-Míguez, Aitor; Meier-Kolthoff, Jan P; Gutiérrez-Jácome, Alberto; Göker, Markus; Fdez-Riverola, Florentino; Sánchez, Borja; Lourenço, Anália

    2016-12-01

    Typical bacterial strain differentiation methods are often challenged by high genetic similarity between strains. To address this problem, we introduce a novel in silico peptide fingerprinting method based on conventional wet-lab protocols that enables the identification of potential strain-specific peptides. These can be further investigated using in vitro approaches, laying a foundation for the development of biomarker detection and application-specific methods. This novel method aims at reducing large amounts of comparative peptide data to binary matrices while maintaining a high phylogenetic resolution. The underlying case study concerns the Bacillus cereus group, namely the differentiation of Bacillus thuringiensis, Bacillus anthracis and Bacillus cereus strains. Results show that trees based on cytoplasmic and extracellular peptidomes are only marginally in conflict with those based on whole proteomes, as inferred by the established Genome-BLAST Distance Phylogeny (GBDP) method. Hence, these results indicate that the two approaches can most likely be used complementarily even in other organismal groups. The obtained results confirm previous reports about the misclassification of many strains within the B. cereus group. Moreover, our method was able to separate the B. anthracis strains with high resolution, similarly to the GBDP results as benchmarked via Bayesian inference and both Maximum Likelihood and Maximum Parsimony. In addition to the presented phylogenomic applications, whole-peptide fingerprinting might also become a valuable complementary technique to digital DNA-DNA hybridization, notably for bacterial classification at the species and subspecies level in the future.
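
    The reduction of comparative peptide data to binary matrices described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the strain names, peptide strings, function names, and the use of Jaccard distance are all assumptions made purely for illustration.

```python
# Hypothetical illustration only: strain names, peptides, and the choice of
# Jaccard distance are assumptions, not taken from the cited paper.

def presence_absence_matrix(peptides_by_strain):
    """Return (sorted peptide list, {strain: 0/1 row}) for a dict of peptide sets."""
    peptides = sorted(set().union(*peptides_by_strain.values()))
    matrix = {
        strain: [1 if p in found else 0 for p in peptides]
        for strain, found in peptides_by_strain.items()
    }
    return peptides, matrix

def strain_specific(peptides_by_strain, strain):
    """Peptides observed in `strain` and in no other strain."""
    others = set().union(
        *(found for name, found in peptides_by_strain.items() if name != strain)
    )
    return peptides_by_strain[strain] - others

def jaccard_distance(row_a, row_b):
    """1 - |shared| / |union| for two binary rows of equal length."""
    shared = sum(a & b for a, b in zip(row_a, row_b))
    union = sum(a | b for a, b in zip(row_a, row_b))
    return 1 - shared / union if union else 0.0

# Made-up in silico digests for two made-up strains.
example = {
    "strain_A": {"PEPK", "LLR"},
    "strain_B": {"PEPK", "GGK"},
}
columns, rows = presence_absence_matrix(example)
distance_ab = jaccard_distance(rows["strain_A"], rows["strain_B"])
```

    A matrix of pairwise distances computed this way could then be passed to any standard tree-building routine; which one is appropriate is outside the scope of this sketch.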

  8. Viral forensic genomics reveals the relatedness of classic herpes simplex virus strains KOS, KOS63, and KOS79.

    Science.gov (United States)

    Bowen, Christopher D; Renner, Daniel W; Shreve, Jacob T; Tafuri, Yolanda; Payne, Kimberly M; Dix, Richard D; Kinchington, Paul R; Gatherer, Derek; Szpara, Moriah L

    2016-05-01

    Herpes simplex virus 1 (HSV-1) is a widespread global pathogen, of which the strain KOS is one of the most extensively studied. Previous sequence studies revealed that KOS does not cluster with other strains of North American geographic origin, but instead clustered with Asian strains. We sequenced a historical isolate of the original KOS strain, called KOS63, along with a separately isolated strain attributed to the same source individual, termed KOS79. Genomic analyses revealed that KOS63 closely resembled other recently sequenced isolates of KOS and was of Asian origin, but that KOS79 was a genetically unrelated strain that clustered in genetic distance analyses with HSV-1 strains of North American/European origin. These data suggest that the human source of KOS63 and KOS79 could have been infected with two genetically unrelated strains of disparate geographic origins. A PCR RFLP test was developed for rapid identification of these strains. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. A Scalable Permutation Approach Reveals Replication and Preservation Patterns of Network Modules in Large Datasets.

    Science.gov (United States)

    Ritchie, Scott C; Watts, Stephen; Fearnley, Liam G; Holt, Kathryn E; Abraham, Gad; Inouye, Michael

    2016-07-01

    Network modules (topologically distinct groups of edges and nodes) that are preserved across datasets can reveal common features of organisms, tissues, cell types, and molecules. Many statistics to identify such modules have been developed, but testing their significance requires heuristics. Here, we demonstrate that current methods for assessing module preservation are systematically biased and produce skewed p values. We introduce NetRep, a rapid and computationally efficient method that uses a permutation approach to score module preservation without assuming data are normally distributed. NetRep produces unbiased p values and can distinguish between true and false positives during multiple hypothesis testing. We use NetRep to quantify preservation of gene coexpression modules across murine brain, liver, adipose, and muscle tissues. Complex patterns of multi-tissue preservation were revealed, including a liver-derived housekeeping module that displayed adipose- and muscle-specific association with body weight. Finally, we demonstrate the broader applicability of NetRep by quantifying preservation of bacterial networks in gut microbiota between men and women. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
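
    The general idea of a permutation test for module preservation, as described in this abstract, can be sketched as follows. This is not NetRep and does not use its API: the statistic (mean edge weight within the module), the function names, and the toy network are assumptions chosen only to show how an empirical p value is obtained without a normality assumption.

```python
# Hypothetical illustration only: not the NetRep package or its statistics.
import random
from itertools import combinations

def mean_edge_weight(weights, nodes):
    """Average weight over all node pairs in `nodes` (weights is a square matrix)."""
    pairs = list(combinations(nodes, 2))
    return sum(weights[i][j] for i, j in pairs) / len(pairs)

def permutation_p_value(weights, module, n_perm=1000, seed=0):
    """Compare the module's statistic with random node sets of the same size."""
    rng = random.Random(seed)
    observed = mean_edge_weight(weights, module)
    all_nodes = range(len(weights))
    at_least_as_large = 0
    for _ in range(n_perm):
        sample = rng.sample(all_nodes, len(module))
        if mean_edge_weight(weights, sample) >= observed:
            at_least_as_large += 1
    # The +1 correction keeps the estimated p value strictly above zero.
    return observed, (at_least_as_large + 1) / (n_perm + 1)

# Toy 6-node network: nodes 0-2 are tightly connected, everything else weakly.
toy = [[0.9 if (i < 3 and j < 3) else 0.1 for j in range(6)] for i in range(6)]
observed_stat, p_value = permutation_p_value(toy, [0, 1, 2])
```

    Because the null distribution is built from the data themselves, no distributional form has to be assumed; the price is that the smallest attainable p value is bounded by the number of permutations.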

  10. Comparative Genomics Revealed Genetic Diversity and Species/Strain-Level Differences in Carbohydrate Metabolism of Three Probiotic Bifidobacterial Species

    Directory of Open Access Journals (Sweden)

    Toshitaka Odamaki

    2015-01-01

    Strains of Bifidobacterium longum, Bifidobacterium breve, and Bifidobacterium animalis are widely used as probiotics in the food industry. Although numerous studies have revealed the properties and functionality of these strains, it is uncertain whether these characteristics are species common or strain specific. To address this issue, we performed a comparative genomic analysis of 49 strains belonging to these three bifidobacterial species to describe their genetic diversity and to evaluate species-level differences. There were 166 common clusters between strains of B. breve and B. longum, whereas there were nine common clusters between strains of B. animalis and B. longum and four common clusters between strains of B. animalis and B. breve. Further analysis focused on carbohydrate metabolism revealed the existence of certain strain-dependent genes, such as those encoding enzymes for host glycan utilisation or certain membrane transporters, and many genes commonly distributed at the species level, as was previously reported in studies with limited strains. As B. longum and B. breve are human-residential bifidobacteria (HRB), whereas B. animalis is a non-HRB species, several of the differences in these species’ gene distributions might be the result of their adaptations to the nutrient environment. This information may aid both in selecting probiotic candidates and in understanding their potential function as probiotics.

  11. Hydrogen embrittlement of austenitic stainless steels revealed by deformation microstructures and strain-induced creation of vacancies

    International Nuclear Information System (INIS)

    Hatano, M.; Fujinami, M.; Arai, K.; Fujii, H.; Nagumo, M.

    2014-01-01

    Hydrogen embrittlement of austenitic stainless steels has been examined with respect to deformation microstructures and lattice defects created during plastic deformation. Two types of austenitic stainless steels, SUS 304 and SUS 316L, uniformly hydrogen-precharged to 30 mass ppm in a high-pressure hydrogen environment, are subjected to tensile straining at room temperature. A substantial reduction of tensile ductility appears in hydrogen-charged SUS 304 and the onset of fracture is likely due to plastic instability. Fractographic features show involvement of plasticity throughout the crack path, implying the degradation of the austenitic phase. Electron backscatter diffraction analyses revealed prominent strain localization enhanced by hydrogen in SUS 304. Deformation microstructures of hydrogen-charged SUS 304 were characterized by the formation of high densities of fine stacking faults and ε-martensite, while tangled dislocations prevailed in SUS 316L. Positron lifetime measurements have revealed for the first time hydrogen-enhanced creation of strain-induced vacancies rather than dislocations in the austenitic phase and more clustering of vacancies in SUS 304 than in SUS 316L. Embrittlement and its mechanism are ascribed to the decrease in stacking fault energies resulting in strain localization and hydrogen-enhanced creation of strain-induced vacancies, leading to premature fracture in a similar way to that proposed for ferritic steels.

  12. Genetic Diversity among Rhizobium leguminosarum bv. Trifolii Strains Revealed by Allozyme and Restriction Fragment Length Polymorphism Analyses

    Science.gov (United States)

    Demezas, David H.; Reardon, Terry B.; Watson, John M.; Gibson, Alan H.

    1991-01-01

    Allozyme electrophoresis and restriction fragment length polymorphism (RFLP) analyses were used to examine the genetic diversity of a collection of 18 Rhizobium leguminosarum bv. trifolii, 1 R. leguminosarum bv. viciae, and 2 R. meliloti strains. Allozyme analysis at 28 loci revealed 16 electrophoretic types. The mean genetic distance between electrophoretic types of R. leguminosarum and R. meliloti was 0.83. Within R. leguminosarum, the single strain of bv. viciae differed at an average of 0.65 from strains of bv. trifolii, while electrophoretic types of bv. trifolii differed at a range of 0.23 to 0.62. Analysis of RFLPs around two chromosomal DNA probes also delineated 16 unique RFLP patterns and yielded genetic diversity similar to that revealed by the allozyme data. Analysis of RFLPs around three Sym (symbiotic) plasmid-derived probes demonstrated that the Sym plasmids reflect genetic divergence similar to that of their bacterial hosts. The large genetic distances between many strains precluded reliable estimates of their genetic relationships. PMID:16348600

  13. Re-inspection of small RNA sequence datasets reveals several novel human miRNA genes.

    Directory of Open Access Journals (Sweden)

    Thomas Birkballe Hansen

    BACKGROUND: miRNAs are key players in gene expression regulation. To fully understand the complex nature of cellular differentiation or initiation and progression of disease, it is important to assess the expression patterns of as many miRNAs as possible. Thereby, identifying novel miRNAs is an essential prerequisite to make possible a comprehensive and coherent understanding of cellular biology. METHODOLOGY/PRINCIPAL FINDINGS: Based on two extensive, but previously published, small RNA sequence datasets from human embryonic stem cells and human embryoid bodies, respectively [1], we identified 112 novel miRNA-like structures and were able to validate miRNA processing in 12 out of 17 investigated cases. Several miRNA candidates were furthermore substantiated by including additional available small RNA datasets, thereby demonstrating the power of combining datasets to identify miRNAs that otherwise may be assigned as experimental noise. CONCLUSIONS/SIGNIFICANCE: Our analysis highlights that existing datasets are not yet exhaustively studied and continuous re-analysis of the available data is important to uncover all features of small RNA sequencing.

  14. Comparative Transcriptome Analysis Reveals Different Silk Yields of Two Silkworm Strains.

    Directory of Open Access Journals (Sweden)

    Juan Li

    Cocoon and silk yields are the most important characteristics of sericulture. However, few studies have examined the genes that modulate these features. Further studies of these genes will be useful for improving the products of sericulture. JingSong (JS) and Lan10 (L10) are two strains having significantly different cocoon and silk yields. In the current study, RNA-Seq and quantitative polymerase chain reaction (qPCR) were performed on both strains in order to determine divergence of the silk gland, which controls silk biosynthesis in silkworms. Compared with L10, JS had 1375 differentially expressed genes (DEGs; 738 up-regulated genes and 673 down-regulated genes). Nine enriched gene ontology (GO) terms were identified by GO enrichment analysis based on these DEGs. KEGG enrichment analysis results showed that the DEGs were enriched in three pathways, which were mainly associated with the processing and biosynthesis of proteins. The representative genes in the enrichment pathways and ten significant DEGs were further verified by qPCR, the results of which were consistent with the RNA-Seq data. Our study has revealed differences in silk glands between the two silkworm strains and provides a perspective for understanding the molecular mechanisms determining silk yield.

  15. A curated database of cyanobacterial strains relevant for modern taxonomy and phylogenetic studies

    OpenAIRE

    Ramos, Vitor; Morais, João; Vasconcelos, Vitor M.

    2017-01-01

    The dataset herein described lays the groundwork for an online database of relevant cyanobacterial strains, named CyanoType (http://lege.ciimar.up.pt/cyanotype). It is a database that includes categorized cyanobacterial strains useful for taxonomic, phylogenetic or genomic purposes, with associated information obtained by means of a literature-based curation. The dataset lists 371 strains and represents the first version of the database (CyanoType v.1). Information for each strain includes st...

  16. Integrated Analysis of Alzheimer's Disease and Schizophrenia Dataset Revealed Different Expression Pattern in Learning and Memory.

    Science.gov (United States)

    Li, Wen-Xing; Dai, Shao-Xing; Liu, Jia-Qian; Wang, Qian; Li, Gong-Hua; Huang, Jing-Fei

    2016-01-01

    Alzheimer's disease (AD) and schizophrenia (SZ) are both accompanied by impaired learning and memory functions. This study aims to explore the expression profiles of learning or memory genes between AD and SZ. We downloaded 10 AD and 10 SZ datasets from GEO-NCBI for integrated analysis. These datasets were processed using the RMA algorithm and a global renormalization for all studies. An empirical Bayes algorithm was then used to find the differentially expressed genes between patients and controls. The results showed that most of the differentially expressed genes were related to AD, whereas the gene expression profile was little affected in SZ. Furthermore, in terms of the number of differentially expressed genes, the fold change and the brain region, there was a great difference in the expression of learning or memory related genes between AD and SZ. In AD, CALB1, GABRA5, and TAC1 were significantly downregulated in whole brain, frontal lobe, temporal lobe, and hippocampus. However, in SZ, only two genes, CRHBP and CX3CR1, were downregulated in hippocampus, and other brain regions were not affected. The effect of these genes on learning or memory impairment has been widely studied. It was suggested that these genes may play a crucial role in AD or SZ pathogenesis. The different gene expression patterns between AD and SZ on learning and memory functions in different brain regions revealed in our study may help to understand the different mechanisms of the two diseases.

  17. Comparative genomic analysis of Brucella abortus vaccine strain 104M reveals a set of candidate genes associated with its virulence attenuation.

    Science.gov (United States)

    Yu, Dong; Hui, Yiming; Zai, Xiaodong; Xu, Junjie; Liang, Long; Wang, Bingxiang; Yue, Junjie; Li, Shanhu

    2015-01-01

    The Brucella abortus strain 104M, a spontaneously attenuated strain, has been used as a vaccine strain in humans against brucellosis for 6 decades in China. Despite many studies, the molecular mechanisms that cause the attenuation are still unclear. Here, we determined the whole-genome sequence of 104M and conducted a comprehensive comparative analysis against the whole genome sequences of the virulent strain, A13334, and other reference strains. This analysis revealed a highly similar genome structure between 104M and A13334. The further comparative genomic analysis between 104M and A13334 revealed a set of genes missing in 104M. Some of these genes were identified to be directly or indirectly associated with virulence. Similarly, a set of mutations in the virulence-related genes was also identified, which may be related to virulence alteration. This study provides a set of candidate genes associated with virulence attenuation in B. abortus vaccine strain 104M.

  18. Differential lysine acetylation profiles of Erwinia amylovora strains revealed by proteomics

    Science.gov (United States)

    Wu, Xia; Vellaichamy, Adaikkalam; Wang, Dongping; Zamdborg, Leonid; Kelleher, Neil L.; Huber, Steven C.; Zhao, Youfu

    2015-01-01

    Protein lysine acetylation (LysAc) has recently been demonstrated to be widespread in E. coli and Salmonella, and to broadly regulate bacterial physiology and metabolism. However, LysAc in plant pathogenic bacteria is largely unknown. Here we first report the lysine acetylome of Erwinia amylovora, an enterobacterium causing serious fire blight disease of apples and pears. Immunoblots using generic anti-lysine acetylation antibodies demonstrated that growth conditions strongly affected the LysAc profiles in E. amylovora. Differential LysAc profiles were also observed for two E. amylovora strains, known to have differential virulence in plants, indicating translational modification of proteins may be important in determining virulence of bacterial strains. Proteomic analysis of LysAc in two E. amylovora strains identified 141 LysAc sites in 96 proteins that function in a wide range of biological pathways. Consistent with previous reports, 44% of the proteins are involved in metabolic processes, including central metabolism, lipopolysaccharide, nucleotide and amino acid metabolism. Interestingly, for the first time, several proteins involved in E. amylovora virulence, including exopolysaccharide amylovoran biosynthesis- and type III secretion-associated proteins, were found to be lysine acetylated, suggesting that LysAc may play a major role in bacterial virulence. Comparative analysis of LysAc sites in E. amylovora and E. coli further revealed the sequence and structural commonality for LysAc in the two organisms. Collectively, these results reinforce the notion that LysAc of proteins is widespread in bacterial metabolism and virulence. PMID:23234799

  19. Genetic variation in the Staphylococcus aureus 8325 strain lineage revealed by whole-genome sequencing.

    Directory of Open Access Journals (Sweden)

    Kristoffer T Bæk

    Staphylococcus aureus strains of the 8325 lineage, especially 8325-4 and derivatives lacking prophage, have been used extensively for decades of research. We report herein the results of our deep sequence analysis of strain 8325-4. Assignment of sequence variants compared with the reference strain 8325 (NRS77/PS47) required correction of errors in the 8325 reference genome, and reassessment of variation previously attributed to chemical mutagenesis of the restriction-defective RN4220. Using an extensive strain pedigree analysis, we discovered that 8325-4 contains 16 single nucleotide polymorphisms (SNPs) arising prior to the construction of RN4220. We identified 5 indels in 8325-4 compared with 8325. Three indels correspond to expected Φ11, 12, 13 excisions, one indel is explained by a sequence assembly artifact, and the final indel (Δ63bp) in the spa-sarS intergenic region is common to only a sub-lineage of 8325-4 strains including SH1000. This deletion was found to significantly decrease (75%) steady state sarS but not spa transcript levels in post-exponential phase. The sub-lineage 8325-4 was also found to harbor 4 additional SNPs. We also found large sequence variation between 8325, 8325-4 and RN4220 in a cluster of repetitive hypothetical proteins (SA0282 homologs) near the Ess secretion cluster. The overall 8325-4 SNP set results in 17 alterations within coding sequences. Remarkably, we discovered that all tested strains of the 8325-4 lineage lack phenol soluble modulin α3 (PSMα3), a virulence determinant implicated in neutrophil chemotaxis, biofilm architecture and surface spreading. Collectively, our results clarify and define the 8325-4 pedigree and reveal clear evidence that mutations existing throughout all branches of this lineage, including the widely used RN6390 and SH1000 strains, could conceivably impact virulence regulation.

  20. Strains and Stressors: An Analysis of Touchscreen Learning in Genetically Diverse Mouse Strains

    Science.gov (United States)

    Graybeal, Carolyn; Bachu, Munisa; Mozhui, Khyobeni; Saksida, Lisa M.; Bussey, Timothy J.; Sagalyn, Erica; Williams, Robert W.; Holmes, Andrew

    2014-01-01

    Touchscreen-based systems are growing in popularity as a tractable, translational approach for studying learning and cognition in rodents. However, while mouse strains are well known to differ in learning across various settings, performance variation between strains in touchscreen learning has not been well described. The selection of appropriate genetic strains and backgrounds is critical to the design of touchscreen-based studies and provides a basis for elucidating genetic factors moderating behavior. Here we provide a quantitative foundation for visual discrimination and reversal learning using touchscreen assays across a total of 35 genotypes. We found significant differences in operant performance and learning, including faster reversal learning in DBA/2J compared to C57BL/6J mice. We then assessed DBA/2J and C57BL/6J for differential sensitivity to an environmental insult by testing for alterations in reversal learning following exposure to repeated swim stress. Stress facilitated reversal learning (selectively during the late stage of reversal) in C57BL/6J, but did not affect learning in DBA/2J. To dissect genetic factors underlying these differences, we phenotyped a family of 27 BXD strains generated by crossing C57BL/6J and DBA/2J. There was marked variation in discrimination, reversal and extinction learning across the BXD strains, suggesting this task may be useful for identifying underlying genetic differences. Moreover, different measures of touchscreen learning were only modestly correlated in the BXD strains, indicating that these processes are comparatively independent at both genetic and phenotypic levels. Finally, we examined the behavioral structure of learning via principal component analysis of the current data, plus an archival dataset, totaling 765 mice. This revealed 5 independent factors suggestive of “reversal learning,” “motivation-related late reversal learning,” “discrimination learning,” “speed to respond,” and

  1. Nomadic lifestyle of Lactobacillus plantarum revealed by comparative genomics of 54 strains isolated from different habitats.

    Science.gov (United States)

    Martino, Maria Elena; Bayjanov, Jumamurat R; Caffrey, Brian E; Wels, Michiel; Joncour, Pauline; Hughes, Sandrine; Gillet, Benjamin; Kleerebezem, Michiel; van Hijum, Sacha A F T; Leulier, François

    2016-12-01

    The ability of bacteria to adapt to diverse environmental conditions is well-known. The process of bacterial adaptation to a niche has been linked to large changes in the genome content, showing that many bacterial genomes reflect the constraints imposed by their habitat. However, some highly versatile bacteria are found in diverse habitats that share almost nothing in common. Lactobacillus plantarum is a lactic acid bacterium that is found in a large variety of habitats. With the aim of unravelling the link between evolution and ecological versatility of L. plantarum, we analysed the genomes of 54 L. plantarum strains isolated from different environments. Comparative genome analysis identified a high level of genomic diversity and plasticity among the strains analysed. Phylogenomic and functional divergence studies coupled with gene-trait matching analyses revealed a mixed distribution of the strains, which was uncoupled from their environmental origin. Our findings revealed the absence of specific genomic signatures marking adaptations of L. plantarum towards the diverse habitats it is associated with. This suggests fundamentally similar trends of genome evolution in L. plantarum, which occur in a manner that is apparently uncoupled from ecological constraint and reflects the nomadic lifestyle of this species. © 2016 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.

  2. A curated database of cyanobacterial strains relevant for modern taxonomy and phylogenetic studies.

    Science.gov (United States)

    Ramos, Vitor; Morais, João; Vasconcelos, Vitor M

    2017-04-25

    The dataset herein described lays the groundwork for an online database of relevant cyanobacterial strains, named CyanoType (http://lege.ciimar.up.pt/cyanotype). It is a database that includes categorized cyanobacterial strains useful for taxonomic, phylogenetic or genomic purposes, with associated information obtained by means of a literature-based curation. The dataset lists 371 strains and represents the first version of the database (CyanoType v.1). Information for each strain includes strain synonymy and/or co-identity, strain categorization, habitat, accession numbers for molecular data, taxonomy and nomenclature notes according to three different classification schemes, hierarchical automatic classification, phylogenetic placement according to a selection of relevant studies (including this), and important bibliographic references. The database will be updated periodically, namely by adding new strains meeting the criteria for inclusion and by revising and adding up-to-date metadata for strains already listed. A global 16S rDNA-based phylogeny is provided in order to assist users when choosing the appropriate strains for their studies.

  3. Comparative transcriptomic analysis reveals similarities and dissimilarities in Saccharomyces cerevisiae wine strains response to nitrogen availability.

    Directory of Open Access Journals (Sweden)

    Catarina Barbosa

    Nitrogen levels in grape-juices are of major importance in winemaking ensuring adequate yeast growth and fermentation performance. Here we used a comparative transcriptome analysis to uncover wine yeasts responses to nitrogen availability during fermentation. Gene expression was assessed in three genetically and phenotypically divergent commercial wine strains (CEG, VL1 and QA23), under low (67 mg/L) and high nitrogen (670 mg/L) regimes, at three time points during fermentation (12 h, 24 h and 96 h). Two-way ANOVA analysis of each fermentation condition led to the identification of genes whose expression was dependent on strain, fermentation stage and on the interaction of both factors. The high fermenter yeast strain QA23 was more clearly distinct from the other two strains, by differential expression of genes involved in flocculation, mitochondrial functions, energy generation and protein folding and stabilization. For all strains, higher transcriptional variability due to fermentation stage was seen in the high nitrogen fermentations. A positive correlation between maximum fermentation rate and the expression of genes involved in stress response was observed. The finding of common genes correlated with both fermentation activity and nitrogen up-take underlies the role of nitrogen on yeast fermentative fitness. The comparative analysis of genes differentially expressed between both fermentation conditions at 12 h, where the main difference was the level of nitrogen available, showed the highest variability amongst strains revealing strain-specific responses. Nevertheless, we were able to identify a small set of genes whose expression profiles can quantitatively assess the common response of the yeast strains to varying nitrogen conditions. 
The use of three contrasting yeast strains in gene expression analysis prompts the identification of more reliable, accurate and reproducible biomarkers that will facilitate the diagnosis of deficiency of this

  4. Comparative Transcriptomic Analysis Reveals Similarities and Dissimilarities in Saccharomyces cerevisiae Wine Strains Response to Nitrogen Availability

    Science.gov (United States)

    Barbosa, Catarina; García-Martínez, José; Pérez-Ortín, José E.; Mendes-Ferreira, Ana

    2015-01-01

    Nitrogen levels in grape-juices are of major importance in winemaking ensuring adequate yeast growth and fermentation performance. Here we used a comparative transcriptome analysis to uncover wine yeasts responses to nitrogen availability during fermentation. Gene expression was assessed in three genetically and phenotypically divergent commercial wine strains (CEG, VL1 and QA23), under low (67 mg/L) and high nitrogen (670 mg/L) regimes, at three time points during fermentation (12h, 24h and 96h). Two-way ANOVA analysis of each fermentation condition led to the identification of genes whose expression was dependent on strain, fermentation stage and on the interaction of both factors. The high fermenter yeast strain QA23 was more clearly distinct from the other two strains, by differential expression of genes involved in flocculation, mitochondrial functions, energy generation and protein folding and stabilization. For all strains, higher transcriptional variability due to fermentation stage was seen in the high nitrogen fermentations. A positive correlation between maximum fermentation rate and the expression of genes involved in stress response was observed. The finding of common genes correlated with both fermentation activity and nitrogen up-take underlies the role of nitrogen on yeast fermentative fitness. The comparative analysis of genes differentially expressed between both fermentation conditions at 12h, where the main difference was the level of nitrogen available, showed the highest variability amongst strains revealing strain-specific responses. Nevertheless, we were able to identify a small set of genes whose expression profiles can quantitatively assess the common response of the yeast strains to varying nitrogen conditions. 
The use of three contrasting yeast strains in gene expression analysis prompts the identification of more reliable, accurate and reproducible biomarkers that will facilitate the diagnosis of deficiency of this nutrient in the grape

  5. Job strain as a risk factor for clinical depression

    DEFF Research Database (Denmark)

    Madsen, I. E. H.; Nyberg, S. T.; Magnusson Hanson, L. L.

    2017-01-01

    BACKGROUND: Adverse psychosocial working environments characterized by job strain (the combination of high demands and low control at work) are associated with an increased risk of depressive symptoms among employees, but evidence on clinically diagnosed depression is scarce. We examined job strain...... as a risk factor for clinical depression. METHOD: We identified published cohort studies from a systematic literature search in PubMed and PsycNET and obtained 14 cohort studies with unpublished individual-level data from the Individual-Participant-Data Meta-analysis in Working Populations (IPD...... unpublished datasets we included 120 221 individuals and 982 first episodes of hospital-treated clinical depression. Job strain was associated with an increased risk of clinical depression in both published [relative risk (RR) = 1.77, 95% confidence interval (CI) 1.47-2.13] and unpublished datasets (RR = 1...

  6. Whole-Genome Analysis of Three Yeast Strains Used for Production of Sherry-Like Wines Revealed Genetic Traits Specific to Flor Yeasts

    Science.gov (United States)

    Eldarov, Mikhail A.; Beletsky, Alexey V.; Tanashchuk, Tatiana N.; Kishkovskaya, Svetlana A.; Ravin, Nikolai V.; Mardanov, Andrey V.

    2018-01-01

    Flor yeast strains represent a specialized group of Saccharomyces cerevisiae yeasts used for biological wine aging. We have sequenced the genomes of three flor strains originating from different geographic regions and used for production of sherry-like wines in Russia. According to the obtained phylogeny of 118 yeast strains, flor strains form a very tight cluster adjacent to the main wine clade. SNP analysis versus available genomes of wine and flor strains revealed 2,270 genetic variants in 1,337 loci specific to flor strains. Gene ontology analysis in combination with gene content evaluation revealed a complex landscape of possibly adaptive genetic changes in flor yeast, related to genes associated with cell morphology, mitotic cell cycle, ion homeostasis, DNA repair, carbohydrate metabolism, lipid metabolism, and cell wall biogenesis. Pangenomic analysis discovered the presence of several well-known “non-reference” loci of potential industrial importance. Events of gene loss included deletions of asparaginase genes, maltose utilization locus, and FRE-FIT locus involved in iron transport. The latter in combination with a flor-yeast-specific mutation in the Aft1 transcription factor gene is likely to be responsible for the discovered phenotype of increased iron sensitivity and improved iron uptake of analyzed strains. Expansion of the coding region of the FLO11 flocculin gene and alteration of the balance between members of the FLO gene family are likely to positively affect the well-known propensity of flor strains for velum formation. Our study provides new insights into the nature of genetic variation in flor yeast strains and demonstrates that different adaptive properties of flor yeast strains could have evolved through different mechanisms of genetic variation. PMID:29867869

  7. Genetic relationships between clinical and non-clinical strains of Yersinia enterocolitica biovar 1A as revealed by multilocus enzyme electrophoresis and multilocus restriction typing

    Directory of Open Access Journals (Sweden)

    Virdi Jugsharan S

    2010-05-01

    Background: Genetic relationships among 81 strains of Y. enterocolitica biovar 1A isolated from clinical and non-clinical sources were discerned by multilocus enzyme electrophoresis (MLEE) and multilocus restriction typing (MLRT) using six loci each. Such studies may reveal associations between the genotypes of the strains and their sources of isolation. Results: All loci were polymorphic and generated 62 electrophoretic types (ETs) and 12 restriction types (RTs). The mean genetic diversity (H) of the strains by MLEE and MLRT was 0.566 and 0.441 respectively. MLEE (DI = 0.98) was more discriminatory and clustered Y. enterocolitica biovar 1A strains into four groups, while MLRT (DI = 0.77) identified two distinct groups. BURST (Based Upon Related Sequence Types) analysis of the MLRT data suggested aquatic serotype O:6,30-6,31 isolates to be the ancestral strains from which clinical O:6,30-6,31 strains might have originated by host adaptation and genetic change. Conclusion: MLEE revealed greater genetic diversity among strains of Y. enterocolitica biovar 1A and clustered strains in four groups, while MLRT grouped the strains into two groups. BURST analysis of MLRT data nevertheless provided newer insights into the probable evolution of clinical strains from aquatic strains.

  8. Molecular typing of canine distemper virus strains reveals the presence of a new genetic variant in South America.

    Science.gov (United States)

    Sarute, Nicolás; Pérez, Ruben; Aldaz, Jaime; Alfieri, Amauri A; Alfieri, Alice F; Name, Daniela; Llanes, Jessika; Hernández, Martín; Francia, Lourdes; Panzera, Yanina

    2014-06-01

    Canine distemper virus (CDV, Paramyxoviridae, Morbillivirus) is the causative agent of a severe infectious disease affecting terrestrial and marine carnivores worldwide. Phylogenetic relationships and the genetic variability of the hemagglutinin (H) protein and the fusion protein signal-peptide (Fsp) allow for the classification of field strains into genetic lineages. Currently, there are nine CDV lineages worldwide, two of them co-circulating in South America. Using the Fsp-coding region, we analyzed the genetic variability of strains from Uruguay, Brazil, and Ecuador, and compared them with those described previously in South America and other geographical areas. The results revealed that the Brazilian and Uruguayan strains belong to the already described South America lineage (EU1/SA1), whereas the Ecuadorian strains cluster in a new clade, here named South America 3, which may represent the third CDV lineage described in South America.

  9. The largest human cognitive performance dataset reveals insights into the effects of lifestyle factors and aging

    Directory of Open Access Journals (Sweden)

    Daniel A Sternberg

    2013-06-01

    Making new breakthroughs in understanding the processes underlying human cognition may depend on the availability of very large datasets that have not historically existed in psychology and neuroscience. Lumosity is a web-based cognitive training platform that has grown to include over 600 million cognitive training task results from over 35 million individuals, comprising the largest existing dataset of human cognitive performance. As part of the Human Cognition Project, Lumosity’s collaborative research program to understand the human mind, Lumos Labs researchers and external research collaborators have begun to explore this dataset in order to uncover novel insights about the correlates of cognitive performance. This paper presents two preliminary demonstrations of some of the kinds of questions that can be examined with the dataset. The first example focuses on replicating known findings relating lifestyle factors to baseline cognitive performance in a demographically diverse, healthy population at a much larger scale than has previously been available. The second example examines a question that would likely be very difficult to study in laboratory-based and existing online experimental research approaches: specifically, how learning ability for different types of cognitive tasks changes with age. We hope that these examples will provoke the imagination of researchers who are interested in collaborating to answer fundamental questions about human cognitive performance.

  10. Revealing differences in metabolic flux distributions between a mutant strain and its parent strain Gluconacetobacter xylinus CGMCC 2955.

    Directory of Open Access Journals (Sweden)

    Cheng Zhong

    A better understanding of metabolic fluxes is important for manipulating microbial metabolism toward desired end products, or away from undesirable by-products. A mutant strain, Gluconacetobacter xylinus AX2-16, was obtained by combined chemical mutation of the parent strain (G. xylinus CGMCC 2955) using DEC (diethyl sulfate) and LiCl. The highest bacterial cellulose production for this mutant was obtained at about 11.75 g/L, which was an increase of 62% compared with that by the parent strain. In contrast, gluconic acid (the main byproduct) concentration was only 5.71 g/L for mutant strain, which was 55.7% lower than that of parent strain. Metabolic flux analysis indicated that 40.1% of the carbon source was transformed to bacterial cellulose in mutant strain, compared with 24.2% for parent strain. Only 32.7% and 4.0% of the carbon source were converted into gluconic acid and acetic acid in mutant strain, compared with 58.5% and 9.5% of that in parent strain. In addition, a higher flux of tricarboxylic acid (TCA) cycle was obtained in mutant strain (57.0%) compared with parent strain (17.0%). It was also indicated from the flux analysis that more ATP was produced in mutant strain from pentose phosphate pathway (PPP) and TCA cycle. The enzymatic activity of succinate dehydrogenase (SDH), which is one of the key enzymes in TCA cycle, was 1.65-fold higher in mutant strain than that in parent strain at the end of culture. It was further validated by the measurement of ATPase that 3.53-6.41 fold higher enzymatic activity was obtained from mutant strain compared with parent strain.

  11. Proteomic analysis reveals contrasting stress response to uranium in two nitrogen-fixing Anabaena strains, differentially tolerant to uranium

    Energy Technology Data Exchange (ETDEWEB)

    Panda, Bandita; Basu, Bhakti; Acharya, Celin; Rajaram, Hema; Apte, Shree Kumar, E-mail: aptesk@barc.gov.in

    2017-01-15

    Highlights: • Response of two native cyanobacterial strains to uranium exposure was studied. • Anabaena L-31 exhibited higher tolerance to uranium as compared to Anabaena 7120. • Uranium exposure differentially affected the proteome profiles of the two strains. • Anabaena L-31 showed better sustenance of photosynthesis and carbon metabolism. • Anabaena L-31 displayed superior oxidative stress defense than Anabaena 7120. - Abstract: Two strains of the nitrogen-fixing cyanobacterium Anabaena, native to Indian paddy fields, displayed differential sensitivity to exposure to uranyl carbonate at neutral pH. Anabaena sp. strain PCC 7120 and Anabaena sp. strain L-31 displayed 50% reduction in survival (LD50 dose), following 3 h exposure to 75 μM and 200 μM uranyl carbonate, respectively. Uranium responsive proteome alterations were visualized by 2D gel electrophoresis, followed by protein identification by MALDI-ToF mass spectrometry. The two strains displayed significant differences in levels of proteins associated with photosynthesis, carbon metabolism, and oxidative stress alleviation, commensurate with their uranium tolerance. Higher uranium tolerance of Anabaena sp. strain L-31 could be attributed to sustained photosynthesis and carbon metabolism and superior oxidative stress defense, as compared to the uranium sensitive Anabaena sp. strain PCC 7120. Significance: Uranium responsive proteome modulations in two nitrogen-fixing strains of Anabaena, native to Indian paddy fields, revealed that rapid adaptation to better oxidative stress management, and maintenance of metabolic and energy homeostasis underlies superior uranium tolerance of Anabaena sp. strain L-31 compared to Anabaena sp. strain PCC 7120.

  12. Maternal mismatches in farmed tilapia strains (Oreochromis spp.) in the Philippines as revealed by mitochondrial COI gene.

    Science.gov (United States)

    Ordoñez, June Feliciano F; Ventolero, Minerva Fatimae H; Santos, Mudjekeewis D

    2017-07-01

    The introduction of genetically enhanced tilapia has significantly boosted the performance of the Philippine aquaculture industry. While enhanced strains contribute to the increase in tilapia production, genetic characterization of present tilapia stocks is critical to maintain their quality and to ensure the genetic gains are sustained. To understand and determine the genetic relationship of the genetically enhanced strains produced in the Philippines, the mitochondrial cytochrome oxidase subunit I (COI) gene was analyzed using a DNA barcoding approach. Specimens representing 10 genetically enhanced strains (GIFT, FaST, GET-EXCEL, GST, SST, COLD, YY-male, GMT, Molobicus, and BEST), three red tilapia (Taiwan red, Florida red, and FAC-red), and two pure lines (initially identified as O. aureus and O. spilurus) were collected, sequenced, and identified using DNA barcoding. Results revealed that farmed tilapias consisted of four different Oreochromis species. As expected, COI could not distinguish individuals at the strain level but, surprisingly, a mismatch between the species of maternal origin and present-day offspring was observed. This particular result may pose a question on the genetic purity and integrity of the strains being distributed to farmers and suggests a re-evaluation of the effectiveness of major tilapia breeding centers in maintaining their stocks.

  13. Experimental single-strain mobilomics reveals events that shape pathogen emergence.

    Science.gov (United States)

    Schoeniger, Joseph S; Hudson, Corey M; Bent, Zachary W; Sinha, Anupama; Williams, Kelly P

    2016-08-19

    Virulence genes on mobile DNAs such as genomic islands (GIs) and plasmids promote bacterial pathogen emergence. Excision is an early step in GI mobilization, producing a circular GI and a deletion site in the chromosome; circular forms are also known for some bacterial insertion sequences (ISs). The recombinant sequence at the junctions of such circles and deletions can be detected sensitively in high-throughput sequencing data, using new computational methods that enable empirical discovery of mobile DNAs. For the rich mobilome of a hospital Klebsiella pneumoniae strain, circularization junctions (CJs) were detected for six GIs and seven IS types. Our methods revealed differential biology of multiple mobile DNAs, imprecision of integrases and transposases, and differential activity among identical IS copies for IS26, ISKpn18 and ISKpn21. Using the resistance of circular dsDNA molecules to exonuclease, internally calibrated with the native plasmids, showed that not all molecules bearing GI CJs were circular. Transpositions were also detected, revealing replicon preference (ISKpn18 prefers a conjugative IncA/C2 plasmid), local action (IS26), regional preferences, selection (against capsule synthesis) and IS polarity inversion. Efficient discovery and global characterization of numerous mobile elements per experiment improves accounting for the new gene combinations that arise in emerging pathogens. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. Sister Dehalobacter Genomes Reveal Specialization in Organohalide Respiration and Recent Strain Differentiation Likely Driven by Chlorinated Substrates

    Directory of Open Access Journals (Sweden)

    Shuiquan Tang

    2016-02-01

    The genomes of two closely related Dehalobacter strains (strain CF and strain DCA) were assembled from the metagenome of an anaerobic enrichment culture that reductively dechlorinates chloroform (CF), 1,1,1-trichloroethane (1,1,1-TCA) and 1,1-dichloroethane (1,1-DCA). The 3.1 Mbp genomes of strain CF (that dechlorinates CF and 1,1,1-TCA) and strain DCA (that dechlorinates 1,1-DCA) each contain 17 putative reductive dehalogenase homologous (rdh) genes. These two genomes were systematically compared to three other available organohalide-respiring Dehalobacter genomes (Dehalobacter restrictus strain PER-K23, Dehalobacter sp. strain E1 and Dehalobacter sp. strain UNSWDHB), and to the genomes of Dehalococcoides mccartyi strain 195 and Desulfitobacterium hafniense strain Y51. This analysis compared 42 different metabolic and physiological categories. The genomes of strains CF and DCA share 90% overall average nucleotide identity and greater than 99.8% identity over a 2.9 Mbp alignment that excludes large insertions, indicating that these genomes differentiated from a close common ancestor. This differentiation was likely driven by selection pressures around two orthologous reductive dehalogenase genes, cfrA and dcrA, that code for the enzymes that reduce CF or 1,1,1-TCA and 1,1-DCA. The many reductive dehalogenase genes found in the five Dehalobacter genomes cluster into two small conserved regions and were often associated with Crp/Fnr transcriptional regulators. Specialization is on-going on a strain-specific basis, as some strains but not others have lost essential genes in the Wood-Ljungdahl (strain E1) and corrinoid biosynthesis pathways (strains E1 and PER-K23). The gene encoding phosphoserine phosphatase, which catalyzes the last step of serine biosynthesis, is missing from all five Dehalobacter genomes, yet D. restrictus can grow without serine, suggesting an alternative or unrecognized biosynthesis route exists. 
In contrast to Dehalococcoides mccartyi

  15. Comparative genome analysis of pathogenic and non-pathogenic Clavibacter strains reveals adaptations to their lifestyle.

    Science.gov (United States)

    Załuga, Joanna; Stragier, Pieter; Baeyen, Steve; Haegeman, Annelies; Van Vaerenbergh, Johan; Maes, Martine; De Vos, Paul

    2014-05-22

    The genus Clavibacter harbors economically important plant pathogens infecting agricultural crops such as potato and tomato. Although the vast majority of Clavibacter strains are pathogenic, there is an increasing number of non-pathogenic isolates reported. Non-pathogenic Clavibacter strains isolated from tomato seeds are particularly problematic because they affect the current detection and identification tests for Clavibacter michiganensis subsp. michiganensis (Cmm), which is regulated with a zero tolerance in tomato seed. Their misidentification as pathogenic Cmm hampers a clear judgment on the seed quality and health. To get more insight in the genetic features linked to the lifestyle of these bacteria, a whole-genome sequence of the tomato seed-borne non-pathogenic Clavibacter LMG 26808 was determined. To gain a better understanding of the molecular determinants of pathogenicity, the genome sequence of LMG 26808 was compared with that of the pathogenic Cmm strain (NCPPB 382). The comparative analysis revealed that LMG 26808 does not contain plasmids pCM1 and pCM2 and also lacks the majority of important virulence factors described so far for pathogenic Cmm. This explains its apparent non-pathogenic nature in tomato plants. Moreover, the genome analysis of LMG 26808 detected sequences from a plasmid originating from a member of Enterobacteriaceae/Klebsiella relative. Genes received that way and coding for antibiotic resistance may provide a competitive advantage for survival of LMG 26808 in its ecological niche. Genetically, LMG 26808 was the most similar to the pathogenic Cmm NCPPB 382 but contained more mobile genetic elements. The genome of this non-pathogenic Clavibacter strain contained also a high number of transporters and regulatory genes. The genome sequence of the non-pathogenic Clavibacter strain LMG 26808 and the comparative analyses with other pathogenic Clavibacter strains provided a better understanding of the genetic bases of virulence and

  16. Private selective sweeps identified from next-generation pool-sequencing reveal convergent pathways under selection in two inbred Schistosoma mansoni strains.

    Directory of Open Access Journals (Sweden)

    Julie A J Clément

    BACKGROUND: The trematode flatworms of the genus Schistosoma, the causative agents of schistosomiasis, are among the most prevalent parasites in humans, affecting more than 200 million people worldwide. In this study, we focused on two well-characterized strains of S. mansoni, to explore signatures of selection. Both strains are highly inbred and exhibit differences in life history traits, in particular in their compatibility with the intermediate host Biomphalaria glabrata. METHODOLOGY/PRINCIPAL FINDINGS: We performed high throughput sequencing of DNA from pools of individuals of each strain using Illumina technology and identified single nucleotide polymorphisms (SNP) and copy number variations (CNV). In total, 708,898 SNPs were identified and roughly 2,000 CNVs. The SNPs revealed low nucleotide diversity (π = 2 × 10⁻⁴) within each strain and a high differentiation level (Fst = 0.73) between them. Based on a recently developed in-silico approach, we further detected 12 and 19 private (i.e. specific non-overlapping) selective sweeps among the 121 and 151 sweeps found in total for each strain. CONCLUSIONS/SIGNIFICANCE: Functional annotation of transcripts lying in the private selective sweeps revealed specific selection for functions related to parasitic interaction (e.g. cell-cell adhesion or redox reactions). Despite high differentiation between strains, we identified evolutionary convergence of genes related to proteolysis, known as a key virulence factor and a potential target of drug and vaccine development. Our data show that pool-sequencing can be used for the detection of selective sweeps in parasite populations and enables one to identify biological functions under selection.

  17. Genotyping of ancient Mycobacterium tuberculosis strains reveals historic genetic diversity.

    Science.gov (United States)

    Müller, Romy; Roberts, Charlotte A; Brown, Terence A

    2014-04-22

    The evolutionary history of the Mycobacterium tuberculosis complex (MTBC) has previously been studied by analysis of sequence diversity in extant strains, but not addressed by direct examination of strain genotypes in archaeological remains. Here, we use ancient DNA sequencing to type 11 single nucleotide polymorphisms and two large sequence polymorphisms in the MTBC strains present in 10 archaeological samples from skeletons from Britain and Europe dating to the second-nineteenth centuries AD. The results enable us to assign the strains to groupings and lineages recognized in the extant MTBC. We show that at least during the eighteenth-nineteenth centuries AD, strains of M. tuberculosis belonging to different genetic groups were present in Britain at the same time, possibly even at a single location, and we present evidence for a mixed infection in at least one individual. Our study shows that ancient DNA typing applied to multiple samples can provide sufficiently detailed information to contribute to both archaeological and evolutionary knowledge of the history of tuberculosis.

  18. Viability of Controlling Prosthetic Hand Utilizing Electroencephalograph (EEG) Dataset Signal

    Science.gov (United States)

    Miskon, Azizi; A/L Thanakodi, Suresh; Raihan Mazlan, Mohd; Mohd Haziq Azhar, Satria; Nooraya Mohd Tawil, Siti

    2016-11-01

    This project presents the development of an artificial hand controlled by Electroencephalograph (EEG) signal datasets for prosthetic applications. The EEG signal datasets were used to improve control of the prosthetic hand compared with the Electromyograph (EMG). EMG has disadvantages for a person who has not used the muscle for a long time and for a person with degenerative issues due to age. Thus, the EEG datasets were found to be an alternative to EMG. The datasets used in this work were taken from the Brain Computer Interface (BCI) Project. The datasets were already classified for open, close and combined movement operations. They served as input to control the prosthetic hand through an interface system between Microsoft Visual Studio and Arduino. The obtained results reveal the prosthetic hand to be more efficient and faster in response to the EEG datasets with an additional LiPo (Lithium Polymer) battery attached to the prosthetic. Some limitations were also identified in terms of the hand movements and the weight of the prosthetic, and suggestions for improvement are given in this paper. Overall, the objective of this paper was achieved, as the prosthetic hand was found to be feasible in operation utilizing the EEG datasets.

  19. Multilocus Sequence Typing Reveals Relevant Genetic Variation and Different Evolutionary Dynamics among Strains of Xanthomonas arboricola pv. juglandis

    Directory of Open Access Journals (Sweden)

    Marco Scortichini

    2010-11-01

    Forty-five Xanthomonas arboricola pv. juglandis (Xaj) strains originating from Juglans regia cultivation in different countries were molecularly typed by means of MultiLocus Sequence Typing (MLST), using acnB, gapA, gyrB and rpoD gene fragments. A total of 2.5 kilobases was used to infer the phylogenetic relationship among the strains and possible recombination events. Haplotype diversity, linkage disequilibrium analysis, selection tests, gene flow estimates and codon adaptation index were also assessed. The dendrograms built by maximum likelihood with concatenated nucleotide and amino acid sequences revealed two major and two minor phylotypes. The same haplotype was found in strains originating from different continents, and different haplotypes were found in strains isolated in the same year from the same location. A recombination breakpoint was detected within the rpoD gene fragment. At the pathovar level, the Xaj populations studied here are clonal and under neutral selection. However, four Xaj strains isolated from walnut fruits with apical necrosis are under diversifying selection, suggesting a possible new adaptation. Gene flow estimates do not support the hypothesis of geographic isolation of the strains, even though the genetic diversity between the strains increases as the geographic distance between them increases. A triplet deletion, causing the absence of valine, was found in the rpoD fragment of all 45 Xaj strains when compared with X. axonopodis pv. citri strain 306. The codon adaptation index was high in all four genes studied, indicating a relevant metabolic activity.
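
    The abstract above lists haplotype diversity among the statistics assessed. As a rough illustration only, the sketch below computes Nei's unbiased haplotype diversity, H = n/(n-1) * (1 - sum of squared haplotype frequencies), from a list of haplotype labels. The function name and the sample labels are hypothetical, and the record does not say which estimator or software the author used.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a sample of haplotype labels.

    Illustrative helper (not from the cited study): each element of
    `haplotypes` is the haplotype label assigned to one strain.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two sampled strains are required")
    counts = Counter(haplotypes)
    # Sum of squared relative frequencies of each distinct haplotype.
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    # The n / (n - 1) factor corrects for small sample size.
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical sample: four strains carrying two haplotypes in equal numbers.
example = ["hap1", "hap1", "hap2", "hap2"]
print(round(haplotype_diversity(example), 4))
```

    A value of 0 means every sampled strain shares one haplotype; a value of 1 means every strain carries a different one.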

  20. Caenorhabditis briggsae recombinant inbred line genotypes reveal inter-strain incompatibility and the evolution of recombination.

    Directory of Open Access Journals (Sweden)

    Joseph A Ross

    2011-07-01

    The nematode Caenorhabditis briggsae is an emerging model organism that allows evolutionary comparisons with C. elegans and exploration of its own unique biological attributes. To produce a high-resolution C. briggsae recombination map, recombinant inbred lines were generated from reciprocal crosses between two strains and genotyped at over 1,000 loci. A second set of recombinant inbred lines involving a third strain was also genotyped at lower resolution. The resulting recombination maps exhibit discrete domains of high and low recombination, as in C. elegans, indicating these are a general feature of Caenorhabditis species. The proportion of a chromosome's physical size occupied by the central, low-recombination domain is highly correlated between species. However, the C. briggsae intra-species comparison reveals striking variation in the distribution of recombination between domains. Hybrid lines made with the more divergent pair of strains also exhibit pervasive marker transmission ratio distortion, evidence of selection acting on hybrid genotypes. The strongest effect, on chromosome III, is explained by a developmental delay phenotype exhibited by some hybrid F2 animals. In addition, on chromosomes IV and V, cross direction-specific biases towards one parental genotype suggest the existence of cytonuclear epistatic interactions. These interactions are discussed in relation to surprising mitochondrial genome polymorphism in C. briggsae, evidence that the two strains diverged in allopatry, the potential for local adaptation, and the evolution of Dobzhansky-Muller incompatibilities. The genetic and genomic resources resulting from this work will support future efforts to understand inter-strain divergence as well as facilitate studies of gene function, natural variation, and the evolution of recombination in Caenorhabditis nematodes.

  1. Meta-Analysis of High-Throughput Datasets Reveals Cellular Responses Following Hemorrhagic Fever Virus Infection

    Directory of Open Access Journals (Sweden)

    Gavin C. Bowick

    2011-05-01

    The continuing use of high-throughput assays to investigate cellular responses to infection is providing a large repository of information. Due to the large number of differentially expressed transcripts, often running into the thousands, the majority of these data have not been thoroughly investigated. Advances in techniques for the downstream analysis of high-throughput datasets are providing additional methods for the generation of additional hypotheses for further investigation. The large number of experimental observations, combined with databases that correlate particular genes and proteins with canonical pathways, functions and diseases, allows for the bioinformatic exploration of functional networks that may be implicated in replication or pathogenesis. Herein, we provide an example of how analysis of published high-throughput datasets of cellular responses to hemorrhagic fever virus infection can generate additional functional data. We describe enrichment of genes involved in metabolism, post-translational modification and cardiac damage; potential roles for specific transcription factors and a conserved involvement of a pathway based around cyclooxygenase-2. We believe that these types of analyses can provide virologists with additional hypotheses for continued investigation.

  2. Correction of elevation offsets in multiple co-located lidar datasets

    Science.gov (United States)

    Thompson, David M.; Dalyander, P. Soupy; Long, Joseph W.; Plant, Nathaniel G.

    2017-04-07

    Introduction: Topographic elevation data collected with airborne light detection and ranging (lidar) can be used to analyze short- and long-term changes to beach and dune systems. Analysis of multiple lidar datasets at Dauphin Island, Alabama, revealed systematic, island-wide elevation differences on the order of tens of centimeters (cm) that were not attributable to real-world change and, therefore, were likely to represent systematic sampling offsets. These offsets vary between the datasets, but appear spatially consistent within a given survey. This report describes a method that was developed to identify and correct offsets between lidar datasets collected over the same site at different times so that true elevation changes over time, associated with sediment accumulation or erosion, can be analyzed.
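
    The abstract above does not give the details of the report's correction method. As a generic illustration of the idea only, and under the assumption that the offset is a single constant per survey, the sketch below estimates it as the median elevation difference over points believed to be stable (for example roads or other hard surfaces) and subtracts it from the survey. All names and values are hypothetical.

```python
from statistics import median


def estimate_offset(reference, survey, stable):
    """Median elevation difference (survey - reference) over stable points.

    Assumption for this sketch: the three sequences are co-located samples,
    and `stable` flags points where no real elevation change is expected.
    """
    diffs = [s - r for r, s, ok in zip(reference, survey, stable) if ok]
    if not diffs:
        raise ValueError("no stable points available to estimate the offset")
    return median(diffs)


def correct_survey(survey, offset):
    """Remove a constant vertical offset from every elevation in a survey."""
    return [z - offset for z in survey]


# Hypothetical elevations in meters; the third point sits on an active dune.
reference = [1.0, 2.0, 3.0, 4.0]
survey = [1.2, 2.2, 3.7, 4.2]
stable = [True, True, False, True]

offset = estimate_offset(reference, survey, stable)
corrected = correct_survey(survey, offset)
print(round(offset, 3), [round(z, 3) for z in corrected])
```

    After correction, the remaining difference at the unstable point can be read as candidate real change rather than survey bias. The median is used here because it is less sensitive than the mean to a few points that did change.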

  3. Antagonistic pleiotropy and fitness trade-offs reveal specialist and generalist traits in strains of canine distemper virus.

    Directory of Open Access Journals (Sweden)

    Veljko M Nikolin

    Theoretically, homogeneous environments favor the evolution of specialists whereas heterogeneous environments favor generalists. Canine distemper is a multi-host carnivore disease caused by canine distemper virus (CDV). The described cell receptor of CDV is SLAM (CD150). Attachment of CDV hemagglutinin protein (CDV-H) to this receptor facilitates fusion and virus entry in cooperation with the fusion protein (CDV-F). We investigated whether CDV strains co-evolved in the large, homogeneous domestic dog population exhibited specialist traits, and strains adapted to the heterogeneous environment of smaller populations of different carnivores exhibited generalist traits. Comparison of amino acid sequences of the SLAM binding region revealed higher similarity between sequences from Canidae species than to sequences from other carnivore families. Using an in vitro assay, we quantified syncytia formation mediated by CDV-H proteins from dog and non-dog CDV strains in cells expressing dog, lion or cat SLAM. CDV-H proteins from dog strains produced significantly higher values with cells expressing dog SLAM than with cells expressing lion or cat SLAM. CDV-H proteins from strains of non-dog species produced similar values in all three cell types, but lower values in cells expressing dog SLAM than the values obtained for CDV-H proteins from dog strains. By experimentally changing one amino acid (Y549H) in the CDV-H protein of one dog strain we decreased expression of specialist traits and increased expression of generalist traits, thereby confirming its functional importance. A virus titer assay demonstrated that dog strains produced higher titers in cells expressing dog SLAM than cells expressing SLAM of non-dog hosts, which suggested possible fitness benefits of specialization post-cell entry. We provide in vitro evidence for the expression of specialist and generalist traits by CDV strains, and fitness trade-offs across carnivore host environments caused by

  4. Antagonistic Pleiotropy and Fitness Trade-Offs Reveal Specialist and Generalist Traits in Strains of Canine Distemper Virus

    Science.gov (United States)

    Nikolin, Veljko M.; Osterrieder, Klaus; von Messling, Veronika; Hofer, Heribert; Anderson, Danielle; Dubovi, Edward; Brunner, Edgar; East, Marion L.

    2012-01-01

    Theoretically, homogeneous environments favor the evolution of specialists whereas heterogeneous environments favor generalists. Canine distemper is a multi-host carnivore disease caused by canine distemper virus (CDV). The described cell receptor of CDV is SLAM (CD150). Attachment of CDV hemagglutinin protein (CDV-H) to this receptor facilitates fusion and virus entry in cooperation with the fusion protein (CDV-F). We investigated whether CDV strains co-evolved in the large, homogeneous domestic dog population exhibited specialist traits, and strains adapted to the heterogeneous environment of smaller populations of different carnivores exhibited generalist traits. Comparison of amino acid sequences of the SLAM binding region revealed higher similarity between sequences from Canidae species than to sequences from other carnivore families. Using an in vitro assay, we quantified syncytia formation mediated by CDV-H proteins from dog and non-dog CDV strains in cells expressing dog, lion or cat SLAM. CDV-H proteins from dog strains produced significantly higher values with cells expressing dog SLAM than with cells expressing lion or cat SLAM. CDV-H proteins from strains of non-dog species produced similar values in all three cell types, but lower values in cells expressing dog SLAM than the values obtained for CDV-H proteins from dog strains. By experimentally changing one amino acid (Y549H) in the CDV-H protein of one dog strain we decreased expression of specialist traits and increased expression of generalist traits, thereby confirming its functional importance. A virus titer assay demonstrated that dog strains produced higher titers in cells expressing dog SLAM than cells expressing SLAM of non-dog hosts, which suggested possible fitness benefits of specialization post-cell entry. We provide in vitro evidence for the expression of specialist and generalist traits by CDV strains, and fitness trade-offs across carnivore host environments caused by antagonistic

  5. Congruent strain specific intestinal persistence of Lactobacillus plantarum in an intestine-mimicking in vitro system and in human volunteers.

    Directory of Open Access Journals (Sweden)

    Hermien van Bokhorst-van de Veen

    BACKGROUND: An important trait of probiotics is their capability to reach their intestinal target sites alive to optimally exert their beneficial effects. Assessment of this trait in intestine-mimicking in vitro model systems has revealed differential survival of individual strains of a species. However, data on the in situ persistence characteristics of individual or mixtures of strains of the same species in the gastrointestinal tract of healthy human volunteers have not been reported to date. METHODOLOGY/PRINCIPAL FINDINGS: The GI-tract survival of individual L. plantarum strains was determined using an intestine-mimicking model system, revealing substantial inter-strain differences. The obtained data were correlated to genomic diversity of the strains using comparative genome hybridization (CGH) datasets, but this approach failed to discover specific genetic loci that explain the observed differences between the strains. Moreover, we developed a next-generation sequencing-based method that targets a variable intergenic region, and employed this method to assess the in vivo GI-tract persistence of different L. plantarum strains when administered in mixtures to healthy human volunteers. Remarkable consistency of the strain-specific persistence curves was observed between individual volunteers, which also correlated significantly with the GI-tract survival predicted on the basis of the in vitro assay. CONCLUSION: The survival of individual L. plantarum strains in the GI-tract could not be correlated to the absence or presence of specific genes compared to the reference strain L. plantarum WCFS1. Nevertheless, in vivo persistence analysis in the human GI-tract confirmed the strain-specific persistence, which appeared to be remarkably similar in different healthy volunteers. Moreover, the relative strain-specific persistence in vivo appeared to be accurately and significantly predicted by their relative survival in the intestine-mimicking in vitro

  6. RNA-Seq Analyses for Two Silkworm Strains Reveals Insight into Their Susceptibility and Resistance to Beauveria bassiana Infection.

    Science.gov (United States)

    Xing, Dongxu; Yang, Qiong; Jiang, Liang; Li, Qingrong; Xiao, Yang; Ye, Mingqiang; Xia, Qingyou

    2017-02-10

    The silkworm Bombyx mori is an economically important species. White muscardine caused by Beauveria bassiana is the main fungal disease in sericulture, and understanding the silkworm responses to B. bassiana infection is of particular interest. Herein, we investigated the molecular mechanisms underlying these responses in two silkworm strains Haoyue (HY, sensitive to B. bassiana) and Kang 8 (K8, resistant to B. bassiana) using an RNA-seq approach. For each strain, three biological replicates for immersion treatment, two replicates for injection treatment and three untreated controls were collected to generate 16 libraries for sequencing. Differentially expressed genes (DEGs) between treated samples and untreated controls, and between the two silkworm strains, were identified. DEGs and the enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of the two strains exhibited an obvious difference. Several genes encoding cuticle proteins, serine proteinase inhibitors (SPI) and antimicrobial peptides (AMP) and the drug metabolism pathway involved in toxin detoxification were considered to be related to the resistance of K8 to B. bassiana. These results revealed insight into the resistance and susceptibility of two silkworm strains against B. bassiana infection and provided a roadmap for silkworm molecular breeding to enhance its resistance to B. bassiana.

  7. RNA-Seq Analyses for Two Silkworm Strains Reveals Insight into Their Susceptibility and Resistance to Beauveria bassiana Infection

    Directory of Open Access Journals (Sweden)

    Dongxu Xing

    2017-02-01

    The silkworm Bombyx mori is an economically important species. White muscardine caused by Beauveria bassiana is the main fungal disease in sericulture, and understanding the silkworm responses to B. bassiana infection is of particular interest. Herein, we investigated the molecular mechanisms underlying these responses in two silkworm strains Haoyue (HY, sensitive to B. bassiana) and Kang 8 (K8, resistant to B. bassiana) using an RNA-seq approach. For each strain, three biological replicates for immersion treatment, two replicates for injection treatment and three untreated controls were collected to generate 16 libraries for sequencing. Differentially expressed genes (DEGs) between treated samples and untreated controls, and between the two silkworm strains, were identified. DEGs and the enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of the two strains exhibited an obvious difference. Several genes encoding cuticle proteins, serine proteinase inhibitors (SPI) and antimicrobial peptides (AMP) and the drug metabolism pathway involved in toxin detoxification were considered to be related to the resistance of K8 to B. bassiana. These results revealed insight into the resistance and susceptibility of two silkworm strains against B. bassiana infection and provided a roadmap for silkworm molecular breeding to enhance its resistance to B. bassiana.

  8. Multilocus microsatellite typing (MLMT) of strains from Turkey and Cyprus reveals a novel monophyletic L. donovani sensu lato group.

    Directory of Open Access Journals (Sweden)

    Evi Gouzelou

    BACKGROUND: New foci of human CL caused by strains of the Leishmania donovani (L. donovani) complex have been recently described in Cyprus and the Çukurova region in Turkey (L. infantum) situated 150 km north of Cyprus. Cypriot strains were typed by Multilocus Enzyme Electrophoresis (MLEE) using the Montpellier (MON) system as L. donovani zymodeme MON-37. However, multilocus microsatellite typing (MLMT) has shown that this zymodeme is paraphyletic; composed of distantly related genetic subgroups of different geographical origin. Consequently the origin of the Cypriot strains remained enigmatic. METHODOLOGY/PRINCIPAL FINDINGS: The Cypriot strains were compared with a set of Turkish isolates obtained from a CL patient and sand fly vectors in south-east Turkey (Çukurova region; CUK strains) and from a VL patient in the south-west (Kuşadasi; EP59 strain). These Turkish strains were initially analyzed using the K26-PCR assay that discriminates MON-1 strains by their amplicon size. In line with previous DNA-based data, the strains were inferred to the L. donovani complex and characterized as non MON-1. For these strains MLEE typing revealed two novel zymodemes; L. donovani MON-309 (CUK strains) and MON-308 (EP59). A population genetic analysis of the Turkish isolates was performed using 14 hyper-variable microsatellite loci. The genotypic profiles of 68 previously analyzed L. donovani complex strains from major endemic regions were included for comparison. Population structures were inferred by combination of Bayesian model-based and distance-based approaches. MLMT placed the Turkish and Cypriot strains in a subclade of a newly discovered, genetically distinct L. infantum monophyletic group, suggesting that the Cypriot strains may originate from Turkey. CONCLUSION: The discovery of a genetically distinct L. infantum monophyletic group in the south-eastern Mediterranean stresses the importance of species genetic characterization towards better understanding

  9. Measurement of Strain in the Left Ventricle during Diastole with cine-MRI and Deformable Image Registration

    Energy Technology Data Exchange (ETDEWEB)

    Veress, Alexander I.; Gullberg, Grant T.; Weiss, Jeffrey A.

    2005-07-20

    The assessment of regional heart wall motion (local strain) can localize ischemic myocardial disease, evaluate myocardial viability and identify impaired cardiac function due to hypertrophic or dilated cardiomyopathies. The objectives of this research were to develop and validate a technique known as Hyperelastic Warping for the measurement of local strains in the left ventricle from clinical cine-MRI image datasets. The technique uses differences in image intensities between template (reference) and target (loaded) image datasets to generate a body force that deforms a finite element (FE) representation of the template so that it registers with the target image. To validate the technique, MRI image datasets representing two deformation states of a left ventricle were created such that the deformation map between the states represented in the images was known. A beginning diastolic cine-MRI image dataset from a normal human subject was defined as the template. A second image dataset (target) was created by mapping the template image using the deformation results obtained from a forward FE model of diastolic filling. Fiber stretch and strain predictions from Hyperelastic Warping showed good agreement with those of the forward solution. The technique had low sensitivity to changes in material parameters, with the exception of changes in bulk modulus of the material. The use of an isotropic hyperelastic constitutive model in the Warping analyses degraded the predictions of fiber stretch. Results were unaffected by simulated noise down to an SNR of 4.0. This study demonstrates that Warping in conjunction with cine-MRI imaging can be used to determine local ventricular strains during diastole.

  10. EPA Nanorelease Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — EPA Nanorelease Dataset. This dataset is associated with the following publication: Wohlleben, W., C. Kingston, J. Carter, E. Sahle-Demessie, S. Vazquez-Campos, B....

  11. High-resolution spatiotemporal strain mapping reveals non-uniform deformation in micropatterned elastomers

    Science.gov (United States)

    Aksoy, B.; Rehman, A.; Bayraktar, H.; Alaca, B. E.

    2017-04-01

    Micropatterns are generated on a vast selection of polymeric substrates for various applications ranging from stretchable electronics to cellular mechanobiological systems. When these patterned substrates are exposed to external loading, strain field is primarily affected by the presence of microfabricated structures and similarly by fabrication-related defects. The capturing of such nonhomogeneous strain fields is of utmost importance in cases where study of the mechanical behavior with a high spatial resolution is necessary. Image-based non-contact strain measurement techniques are favorable and have recently been extended to scanning tunneling microscope and scanning electron microscope images for the characterization of mechanical properties of metallic materials, e.g. steel and aluminum, at the microscale. A similar real-time analysis of strain heterogeneity in elastomers is yet to be achieved during the entire loading sequence. The available measurement methods for polymeric materials mostly depend on cross-head displacement or precalibrated strain values. Thus, they suffer either from the lack of any real-time analysis, spatiotemporal distribution or high resolution in addition to a combination of these factors. In this work, these challenges are addressed by integrating a tensile stretcher with an inverted optical microscope and developing a subpixel particle tracking algorithm. As a proof of concept, the patterns with a critical dimension of 200 µm are generated on polydimethylsiloxane substrates and strain distribution in the vicinity of the patterns is captured with a high spatiotemporal resolution. In the field of strain measurement, there is always a tradeoff between minimum measurable strain value and spatial resolution. Current noncontact techniques on elastomers can deliver a strain resolution of 0.001% over a minimum length of 5 cm. More importantly, inhomogeneities within this quite large region cannot be captured. The proposed technique can

  12. High-resolution spatiotemporal strain mapping reveals non-uniform deformation in micropatterned elastomers

    International Nuclear Information System (INIS)

    Aksoy, B; Alaca, B E; Rehman, A; Bayraktar, H

    2017-01-01

    Micropatterns are generated on a vast selection of polymeric substrates for various applications ranging from stretchable electronics to cellular mechanobiological systems. When these patterned substrates are exposed to external loading, strain field is primarily affected by the presence of microfabricated structures and similarly by fabrication-related defects. The capturing of such nonhomogeneous strain fields is of utmost importance in cases where study of the mechanical behavior with a high spatial resolution is necessary. Image-based non-contact strain measurement techniques are favorable and have recently been extended to scanning tunneling microscope and scanning electron microscope images for the characterization of mechanical properties of metallic materials, e.g. steel and aluminum, at the microscale. A similar real-time analysis of strain heterogeneity in elastomers is yet to be achieved during the entire loading sequence. The available measurement methods for polymeric materials mostly depend on cross-head displacement or precalibrated strain values. Thus, they suffer either from the lack of any real-time analysis, spatiotemporal distribution or high resolution in addition to a combination of these factors. In this work, these challenges are addressed by integrating a tensile stretcher with an inverted optical microscope and developing a subpixel particle tracking algorithm. As a proof of concept, the patterns with a critical dimension of 200 µm are generated on polydimethylsiloxane substrates and strain distribution in the vicinity of the patterns is captured with a high spatiotemporal resolution. In the field of strain measurement, there is always a tradeoff between minimum measurable strain value and spatial resolution. Current noncontact techniques on elastomers can deliver a strain resolution of 0.001% over a minimum length of 5 cm. More importantly, inhomogeneities within this quite large region cannot be captured. The proposed technique can

  13. Multi-gene phylogenetic analysis reveals that shochu-fermenting Saccharomyces cerevisiae strains form a distinct sub-clade of the Japanese sake cluster.

    Science.gov (United States)

    Futagami, Taiki; Kadooka, Chihiro; Ando, Yoshinori; Okutsu, Kayu; Yoshizaki, Yumiko; Setoguchi, Shinji; Takamine, Kazunori; Kawai, Mikihiko; Tamaki, Hisanori

    2017-10-01

    Shochu is a traditional Japanese distilled spirit. The formation of the distinguishing flavour of shochu produced in individual distilleries is attributed to putative indigenous yeast strains. In this study, we performed the first (to our knowledge) phylogenetic classification of shochu strains based on nucleotide gene sequences. We performed phylogenetic classification of 21 putative indigenous shochu yeast strains isolated from 11 distilleries. All of these strains were shown or confirmed to be Saccharomyces cerevisiae, sharing species identification with 34 known S. cerevisiae strains (including commonly used shochu, sake, ale, whisky, bakery, bioethanol and laboratory yeast strains and clinical isolate) that were tested in parallel. Our analysis used five genes that reflect genome-level phylogeny for the strain-level classification. In a first step, we demonstrated that partial regions of the ZAP1, THI7, PXL1, YRR1 and GLG1 genes were sufficient to reproduce previous sub-species classifications. In a second step, these five analysed regions from each of 25 strains (four commonly used shochu strains and the 21 putative indigenous shochu strains) were concatenated and used to generate a phylogenetic tree. Further analysis revealed that the putative indigenous shochu yeast strains form a monophyletic group that includes both the shochu yeasts and a subset of the sake group strains; this cluster is a sister group to other sake yeast strains, together comprising a sake-shochu group. Differences among shochu strains were small, suggesting that it may be possible to correlate subtle phenotypic differences among shochu flavours with specific differences in genome sequences. Copyright © 2017 John Wiley & Sons, Ltd.

  14. Whole genome PCR scanning reveals the syntenic genome structure of toxigenic Vibrio cholerae strains in the O1/O139 population.

    Directory of Open Access Journals (Sweden)

    Bo Pang

    Vibrio cholerae is commonly found in estuarine water systems. Toxigenic O1 and O139 V. cholerae strains have caused cholera epidemics and pandemics, whereas the nontoxigenic strains within these serogroups only occasionally lead to disease. To understand the differences in the genome and clonality between the toxigenic and nontoxigenic strains of V. cholerae serogroups O1 and O139, we employed a whole genome PCR scanning (WGPScanning) method, an rrn operon-mediated fragment rearrangement analysis and comparative genomic hybridization (CGH) to analyze the genome structure of different strains. WGPScanning in conjunction with CGH revealed that the genomic contents of the toxigenic strains were conservative, except for a few indels located mainly in mobile elements. Minor nucleotide variation in orthologous genes appeared to be the major difference between the toxigenic strains. rrn operon-mediated rearrangements were infrequent in El Tor toxigenic strains tested using I-CeuI digested pulsed-field gel electrophoresis (PFGE) analysis and PCR analysis based on flanking sequence of rrn operons. Using these methods, we found that the genomic structures of toxigenic El Tor and O139 strains were syntenic. The nontoxigenic strains exhibited more extensive sequence variations, but toxin coregulated pilus positive (TCP+) strains had a similar structure. TCP+ nontoxigenic strains could be subdivided into multiple lineages according to the TCP type, suggesting the existence of complex intermediates in the evolution of toxigenic strains. The data indicate that toxigenic O1 El Tor and O139 strains were derived from a single lineage of intermediates from complex clones in the environment. The nontoxigenic strains with non-El Tor type TCP may yet evolve into new epidemic clones after attaining toxigenic attributes.

  15. Unified Scaling Law for flux pinning in practical superconductors: III. Minimum datasets, core parameters, and application of the Extrapolative Scaling Expression

    Science.gov (United States)

    Ekin, Jack W.; Cheggour, Najib; Goodrich, Loren; Splett, Jolene

    2017-03-01

    In Part 2 of these articles, an extensive analysis of pinning-force curves and raw scaling data was used to derive the Extrapolative Scaling Expression (ESE). This is a parameterization of the Unified Scaling Law (USL) that has the extrapolation capability of fundamental unified scaling, coupled with the application ease of a simple fitting equation. Here in Part 3, the accuracy of the ESE relation to interpolate and extrapolate limited critical-current data to obtain complete I_c(B,T,ɛ) datasets is evaluated and compared with present fitting equations. Accuracy is analyzed in terms of root mean square (RMS) error and fractional deviation statistics. Highlights from 92 test cases are condensed and summarized, covering most fitting protocols and proposed parameterizations of the USL. The results show that ESE reliably extrapolates critical currents at fields B, temperatures T, and strains ɛ that are remarkably different from the fitted minimum dataset. Depending on whether the conductor is moderate-J_c or high-J_c, effective RMS extrapolation errors for ESE are in the range 2-5 A at 12 T, which approaches the I_c measurement error (1-2%). The minimum dataset for extrapolating full I_c(B,T,ɛ) characteristics is also determined from raw scaling data. It consists of one set of I_c(B,ɛ) data at a fixed temperature (e.g., liquid helium temperature), and one set of I_c(B,T) data at a fixed strain (e.g., zero applied strain). Error analysis of extrapolations from the minimum dataset with different fitting equations shows that ESE reduces the percentage extrapolation errors at individual data points at high fields, temperatures, and compressive strains down to 1/10th to 1/40th the size of those for extrapolations with present fitting equations. Depending on the conductor, percentage fitting errors for interpolations are also reduced to as little as 1/15th the size. The extrapolation accuracy of the ESE relation offers the prospect of straightforward implementation of

  16. ClimateNet: A Machine Learning dataset for Climate Science Research

    Science.gov (United States)

    Prabhat, M.; Biard, J.; Ganguly, S.; Ames, S.; Kashinath, K.; Kim, S. K.; Kahou, S.; Maharaj, T.; Beckham, C.; O'Brien, T. A.; Wehner, M. F.; Williams, D. N.; Kunkel, K.; Collins, W. D.

    2017-12-01

    Deep Learning techniques have revolutionized commercial applications in computer vision, speech recognition and control systems. The key to all of these developments was the creation of a curated, labeled dataset, ImageNet, enabling multiple research groups around the world to develop methods, benchmark performance and compete with each other. The success of Deep Learning can be largely attributed to the broad availability of this dataset. Our empirical investigations have revealed that Deep Learning is similarly poised to benefit the task of pattern detection in climate science. Unfortunately, labeled datasets, a key prerequisite for training, are hard to find. Individual research groups are typically interested in specialized weather patterns, making it hard to unify and share datasets across groups and institutions. In this work, we are proposing ClimateNet: a labeled dataset that provides labeled instances of extreme weather patterns, as well as associated raw fields in model and observational output. We develop a schema in NetCDF to enumerate weather pattern classes/types, store bounding boxes, and pixel-masks. We are also working on a TensorFlow implementation to natively import such NetCDF datasets, and are providing a reference convolutional architecture for binary classification tasks. Our hope is that researchers in Climate Science, as well as ML/DL, will be able to use (and extend) ClimateNet to make rapid progress in the application of Deep Learning for Climate Science research.

  17. Whole-genome characterization of Uruguayan strains of avian infectious bronchitis virus reveals extensive recombination between the two major South American lineages.

    Science.gov (United States)

    Marandino, Ana; Tomás, Gonzalo; Panzera, Yanina; Greif, Gonzalo; Parodi-Talice, Adriana; Hernández, Martín; Techera, Claudia; Hernández, Diego; Pérez, Ruben

    2017-10-01

    Infectious bronchitis virus (Gammacoronavirus, Coronaviridae) is a genetically variable RNA virus that causes one of the most persistent respiratory diseases in poultry. The virus is classified in genotypes and lineages with different epidemiological relevance. Two lineages of the GI genotype (11 and 16) have been widely circulating for decades in South America. GI-11 is an exclusive South American lineage while the GI-16 lineage is distributed in Asia, Europe and South America. Here, we obtained the whole genome of two Uruguayan strains of the GI-11 and GI-16 lineages using Illumina high-throughput sequencing. The strains sequenced here are the first obtained in South America for the infectious bronchitis virus and provide new insights into the origin, spreading and evolution of viral variants. The complete genomes of the GI-11 and GI-16 strains have 27,621 and 27,638 nucleotides, respectively, and possess the same genomic organization. Phylogenetic incongruence analysis reveals that both strains have a mosaic genome that arose by recombination between Euro Asiatic strains of the GI-16 lineage and ancestral South American GI-11 viruses. The recombination occurred in South America and produced two viral variants that have retained the full-length S1 sequences of the parental lineages but are extremely similar in the rest of their genomes. These recombinant viruses have been extraordinarily successful, persisting in the continent for several years with a notably wide geographic distribution. Our findings reveal singular viral dynamics and emphasize the importance of complete genomic characterization to understand the emergence and evolutionary history of viral variants.

  18. Proteomics dataset

    DEFF Research Database (Denmark)

    Bennike, Tue Bjerg; Carlsen, Thomas Gelsing; Ellingsen, Torkell

    2017-01-01

    The datasets presented in this article are related to the research articles entitled “Neutrophil Extracellular Traps in Ulcerative Colitis: A Proteome Analysis of Intestinal Biopsies” (Bennike et al., 2015 [1]), and “Proteome Analysis of Rheumatoid Arthritis Gut Mucosa” (Bennike et al., 2017 [2])...... been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifiers PXD001608 for ulcerative colitis and control samples, and PXD003082 for rheumatoid arthritis samples....

  19. A high-resolution European dataset for hydrologic modeling

    Science.gov (United States)

    Ntegeka, Victor; Salamon, Peter; Gomes, Goncalo; Sint, Hadewij; Lorini, Valerio; Thielen, Jutta

    2013-04-01

    inputs to the hydrological calibration and validation of EFAS as well as for establishing long-term discharge "proxy" climatologies which can then in turn be used for statistical analysis to derive return periods or other time series derivatives. In addition, this dataset will be used to assess climatological trends in Europe. Unfortunately, to date no baseline dataset at the European scale exists to test the quality of the herein presented data. Hence, a comparison against other existing datasets can therefore only be an indication of data quality. Due to availability, a comparison was made for precipitation and temperature only, arguably the most important meteorological drivers for hydrologic models. A variety of analyses was undertaken at country scale against data reported to EUROSTAT and E-OBS datasets. The comparison revealed that while the datasets showed overall similar temporal and spatial patterns, there were some differences in magnitudes especially for precipitation. It is not straightforward to define the specific cause for these differences. However, in most cases the comparatively low observation station density appears to be the principal reason for the differences in magnitude.

  20. Molecular typing of Brucella melitensis endemic strains and differentiation from the vaccine strain Rev-1.

    Science.gov (United States)

    Noutsios, Georgios T; Papi, Rigini M; Ekateriniadou, Loukia V; Minas, Anastasios; Kyriakidis, Dimitrios A

    2012-03-01

    In the present study forty-four Greek endemic strains of Br. melitensis and three reference strains were genotyped by Multi locus Variable Number Tandem Repeat (ML-VNTR) analysis based on an eight-base pair tandem repeat sequence that was revealed in eight loci of the Br. melitensis genome. The forty-four strains were discriminated from the vaccine strain Rev-1 by Restriction Fragment Length Polymorphism (RFLP) and Denaturant Gradient Gel Electrophoresis (DGGE). The ML-VNTR analysis revealed that endemic, reference and vaccine strains are genetically closely related, while most of the loci tested (1, 2, 4, 5 and 7) are highly polymorphic with Hunter-Gaston Genetic Diversity Index (HGDI) values in the range of 0.939 to 0.775. Analysis of ML-VNTR loci stability through in vitro passages proved that loci 1 and 5 are not stable. Therefore, the vaccine strain can be discriminated from endemic strains by the allele clusters of loci 2, 4, 6 and 7. RFLP and DGGE were also employed to analyse the omp2 gene and revealed different patterns among Rev-1 and endemic strains. In RFLP, Rev-1 revealed three fragments (282, 238 and 44 bp), while endemic strains revealed two fragments (238 and 44 bp). As for DGGE, the electrophoretic mobility of Rev-1 is different from that of the endemic strains due to heterologous binding of DNA chains of the omp2a and omp2b genes. Overall, our data show clearly that it is feasible to genotype endemic strains of Br. melitensis and differentiate them from vaccine strain Rev-1 with ML-VNTR, RFLP and DGGE techniques. These tools can be used for conventional investigations in brucellosis outbreaks.
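    The Hunter-Gaston index quoted above is commonly defined as 1 - [1 / (N(N - 1))] * sum over types of n_j(n_j - 1), where N is the number of strains and n_j the number of strains with the j-th allele at a locus. The sketch below implements that standard form; the function name and the allele labels are hypothetical and are not data from the study.

```python
from collections import Counter


def hunter_gaston_index(alleles):
    """Hunter-Gaston diversity index for one locus.

    `alleles` holds one allele label per strain. The result is the
    probability that two strains drawn without replacement differ.
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("at least two strains are required")
    counts = Counter(alleles)
    same_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_pairs / (n * (n - 1))


# Hypothetical repeat-copy numbers at one locus, for illustration only.
print(hunter_gaston_index([3, 3, 5, 7]))
```

    A value near 1 means almost every strain carries a different allele at the locus, which is why loci with HGDI values in the reported range are described as highly polymorphic.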

  1. An Inducible Operon Is Involved in Inulin Utilization in Lactobacillus plantarum Strains, as Revealed by Comparative Proteogenomics and Metabolic Profiling.

    Science.gov (United States)

    Buntin, Nirunya; Hongpattarakere, Tipparat; Ritari, Jarmo; Douillard, François P; Paulin, Lars; Boeren, Sjef; Shetty, Sudarshan A; de Vos, Willem M

    2017-01-15

    The draft genomes of Lactobacillus plantarum strains isolated from Asian fermented foods, infant feces, and shrimp intestines were sequenced and compared to those of well-studied strains. Among 28 strains of L. plantarum, variations in the genomic features involved in ecological adaptation were elucidated. The genome sizes ranged from approximately 3.1 to 3.5 Mb, of which about 2,932 to 3,345 protein-coding sequences (CDS) were predicted. The food-derived isolates contained a higher number of carbohydrate metabolism-associated genes than those from infant feces. This observation correlated to their phenotypic carbohydrate metabolic profile, indicating their ability to metabolize the largest range of sugars. Surprisingly, two strains (P14 and P76) isolated from fermented fish utilized inulin. β-Fructosidase, the inulin-degrading enzyme, was detected in the supernatants and cell wall extracts of both strains. No activity was observed in the cytoplasmic fraction, indicating that this key enzyme was either membrane-bound or extracellularly secreted. From genomic mining analysis, a predicted inulin operon of fosRABCDXE, which encodes β-fructosidase and many fructose transporting proteins, was found within the genomes of strains P14 and P76. Moreover, pts1BCA genes, encoding sucrose-specific IIBCA components involved in sucrose transport, were also identified. The proteomic analysis revealed the mechanism and functional characteristic of the fosRABCDXE operon involved in the inulin utilization of L. plantarum. The expression levels of the fos operon and pst genes were upregulated at mid-log phase. FosE and the LPXTG-motif cell wall anchored β-fructosidase were induced to a high abundance when inulin was present as a carbon source. Inulin is a long-chain carbohydrate that may act as a prebiotic, which provides many health benefits to the host by selectively stimulating the growth and activity of beneficial bacteria in the colon. While certain lactobacilli can catabolize

  2. Factors affecting finite strain estimation in low-grade, low-strain clastic rocks

    Science.gov (United States)

    Pastor-Galán, Daniel; Gutiérrez-Alonso, Gabriel; Meere, Patrick A.; Mulchrone, Kieran F.

    2009-12-01

    The computer strain analysis methods SAPE, MRL and DTNNM have permitted the characterization of finite strain in two different regions with contrasting geodynamic scenarios: (1) the Talas Ala Tau (Tien Shan, Kyrgyz Republic) and (2) the Somiedo Nappe and Narcea Antiform (Cantabrian to West Asturian-Leonese Zone boundary, Variscan Belt, NW of Iberia). The analyses performed have revealed low strain values and the regional strain trend in both studied areas. This study also investigates the relationship between lithology (grain size and percentage of matrix) and the strain estimates obtained with the two methodologies used. The results show that these methods are comparable and that there is no significant lithological control on finite strain in rocks deformed under low metamorphic and low-strain conditions.

  3. Atomic-scale Ge diffusion in strained Si revealed by quantitative scanning transmission electron microscopy

    Science.gov (United States)

    Radtke, G.; Favre, L.; Couillard, M.; Amiard, G.; Berbezier, I.; Botton, G. A.

    2013-05-01

    Aberration-corrected scanning transmission electron microscopy is employed to investigate the local chemistry in the vicinity of a Si0.8Ge0.2/Si interface grown by molecular-beam epitaxy. Atomic-resolution high-angle annular dark field contrast reveals the presence of a nonuniform diffusion of Ge from the substrate into the strained Si thin film. On the basis of multislice calculations, a model is proposed to quantify the experimental contrast, showing that the Ge concentration in the thin film reaches about 4% at the interface and decreases monotonically on a typical length scale of 10 nm. Diffusion occurring during the growth process itself therefore appears as a major factor limiting the abruptness of interfaces in the Si-Ge system.

  4. IMPACT OF GENETIC STRAIN ON BODY FAT LOSS, FOOD CONSUMPTION, METABOLISM, VENTILATION, AND MOTOR ACTIVITY IN FREE RUNNING FEMALE RATS

    Data.gov (United States)

    U.S. Environmental Protection Agency — Physiologic data associated with different strains of common laboratory rat strains. This dataset is associated with the following publication: Gordon , C., P....

  5. StrainSeeker: fast identification of bacterial strains from raw sequencing reads using user-provided guide trees.

    Science.gov (United States)

    Roosaare, Märt; Vaher, Mihkel; Kaplinski, Lauris; Möls, Märt; Andreson, Reidar; Lepamets, Maarja; Kõressaar, Triinu; Naaber, Paul; Kõljalg, Siiri; Remm, Maido

    2017-01-01

    Fast, accurate and high-throughput identification of bacterial isolates is in great demand. The present work was conducted to investigate the possibility of identifying isolates from unassembled next-generation sequencing reads using custom-made guide trees. A tool named StrainSeeker was developed that constructs a list of specific k-mers for each node of any given Newick-format tree and enables the identification of bacterial isolates in 1-2 min. It uses a novel algorithm, which analyses the observed and expected fractions of node-specific k-mers to test the presence of each node in the sample. This allows StrainSeeker to determine where the isolate branches off the guide tree and assign it to a clade whereas other tools assign each read to a reference genome. Using a dataset of 100 Escherichia coli isolates, we demonstrate that StrainSeeker can predict the clades of E. coli with 92% accuracy and correct tree branch assignment with 98% accuracy. Twenty-five thousand Illumina HiSeq reads are sufficient for identification of the strain. StrainSeeker is a software program that identifies bacterial isolates by assigning them to nodes or leaves of a custom-made guide tree. StrainSeeker's web interface and pre-computed guide trees are available at http://bioinfo.ut.ee/strainseeker. Source code is stored at GitHub: https://github.com/bioinfo-ut/StrainSeeker.
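    The idea of measuring how many node-specific k-mers are observed in a sample can be sketched in a few lines. This is not StrainSeeker's code or its statistical test, which this record does not describe; it is a simplified illustration that only computes an observed fraction, ignores reverse complements and sequencing errors, and uses hypothetical function names and toy sequences.

```python
def kmers(sequence, k):
    """Return the set of all k-length substrings of a sequence."""
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def observed_fraction(reads, node_kmers, k):
    """Fraction of a node's specific k-mers found in at least one read."""
    if not node_kmers:
        return 0.0
    seen = set()
    for read in reads:
        seen |= kmers(read, k) & node_kmers
    return len(seen) / len(node_kmers)


# Toy data: k-mers taken as "specific" to one hypothetical tree node.
node_specific = kmers("ACGTAC", 3)  # {"ACG", "CGT", "GTA", "TAC"}
sample_reads = ["ACGT", "TTTT"]
print(observed_fraction(sample_reads, node_specific, 3))
```

    In a real tool the observed fraction would be compared with the fraction expected at the given sequencing depth before deciding whether a node is present; that comparison is left out here.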

  6. Characterization of the biocontrol activity of pseudomonas fluorescens strain X reveals novel genes regulated by glucose.

    Directory of Open Access Journals (Sweden)

    Gerasimos F Kremmydas

    Pseudomonas fluorescens strain X, a bacterial isolate from the rhizosphere of bean seedlings, has the ability to suppress damping-off caused by the oomycete Pythium ultimum. To determine the genes controlling the biocontrol activity of strain X, transposon mutagenesis, sequencing and complementation were performed. Results indicate that the biocontrol ability of this isolate is attributed to the gcd gene encoding glucose dehydrogenase, genes encoding its co-enzyme pyrroloquinoline quinone (PQQ), and two genes (sup5 and sup6) which seem to be organized in a putative operon. This operon (named supX) consists of five genes, one of which encodes a non-ribosomal peptide synthase. A unique binding site for a GntR-type transcriptional factor is localized upstream of the supX putative operon. Synteny comparison of the genes in supX revealed that they are common in the genus Pseudomonas, but with a low degree of similarity. supX shows high similarity only to the mangotoxin operon of Ps. syringae pv. syringae UMAF0158. Quantitative real-time PCR analysis indicated that transcription of supX is strongly reduced in the gcd and PQQ-minus mutants of Ps. fluorescens strain X. In contrast, transcription of supX in the wild type is enhanced by glucose, and transcription levels appear to be higher during the stationary phase. Gcd, which uses PQQ as a cofactor, catalyses the oxidation of glucose to gluconic acid, which controls the activity of the GntR family of transcriptional factors. The genes in the supX putative operon have not been implicated before in the biocontrol of plant pathogens by pseudomonads. They are involved in the biosynthesis of an antimicrobial compound by Ps. fluorescens strain X and their transcription is controlled by glucose, possibly through the activity of a GntR-type transcriptional factor binding upstream of this putative operon.

  7. RARD: The Related-Article Recommendation Dataset

    OpenAIRE

    Beel, Joeran; Carevic, Zeljko; Schaible, Johann; Neusch, Gabor

    2017-01-01

    Recommender-system datasets are used for recommender-system evaluations, training machine-learning algorithms, and exploring user behavior. While there are many datasets for recommender systems in the domains of movies, books, and music, there are rather few datasets from research-paper recommender systems. In this paper, we introduce RARD, the Related-Article Recommendation Dataset, from the digital library Sowiport and the recommendation-as-a-service provider Mr. DLib. The dataset contains ...

  8. Isfahan MISP Dataset.

    Science.gov (United States)

    Kashefpur, Masoud; Kafieh, Rahele; Jorjandi, Sahar; Golmohammadi, Hadis; Khodabande, Zahra; Abbasi, Mohammadreza; Teifuri, Nilufar; Fakharzadeh, Ali Akbar; Kashefpoor, Maryam; Rabbani, Hossein

    2017-01-01

    An online depository was introduced to share clinical ground truth with the public and provide open access for researchers to evaluate their computer-aided algorithms. PHP was used for web programming and MySQL for database management. The website was entitled "biosigdata.com." It was a fast, secure, and easy-to-use online database for medical signals and images. Freely registered users could download the datasets and could also share their own supplementary materials while maintaining their privacy (citation and fee). Commenting was also available for all datasets, and automatic sitemap and semi-automatic SEO indexing have been set for the site. A comprehensive list of available websites for medical datasets is also presented as supplementary material (http://journalonweb.com/tempaccess/4800.584.JMSS_55_16I3253.pdf).

  9. Phylogenetic and genome-wide deep-sequencing analyses of canine parvovirus reveal co-infection with field variants and emergence of a recent recombinant strain.

    Directory of Open Access Journals (Sweden)

    Ruben Pérez

    Canine parvovirus (CPV), a fast-evolving single-stranded DNA virus, comprises three antigenic variants (2a, 2b, and 2c) with different frequencies and genetic variability among countries. The contribution of co-infection and recombination to the genetic variability of CPV is far from being fully elucidated. Here we took advantage of a natural CPV population, recently formed by the convergence of divergent CPV-2c and CPV-2a strains, to study co-infection and recombination. Complete sequences of the viral coding region of CPV-2a and CPV-2c strains from 40 samples were generated and analyzed using phylogenetic tools. Two samples showed co-infection and were further analyzed by deep sequencing. The sequence profile of one of the samples revealed the presence of CPV-2c and CPV-2a strains that differed at 29 nucleotides. The other sample included a minor CPV-2a strain (13.3% of the viral population) and a major recombinant strain (86.7%). The recombinant strain arose from inter-genotypic recombination between CPV-2c and CPV-2a strains within the VP1/VP2 gene boundary. Our findings highlight the importance of deep-sequencing analysis to provide a better understanding of CPV molecular diversity.

  10. Phylogenetic and Genome-Wide Deep-Sequencing Analyses of Canine Parvovirus Reveal Co-Infection with Field Variants and Emergence of a Recent Recombinant Strain

    Science.gov (United States)

    Pérez, Ruben; Calleros, Lucía; Marandino, Ana; Sarute, Nicolás; Iraola, Gregorio; Grecco, Sofia; Blanc, Hervé; Vignuzzi, Marco; Isakov, Ofer; Shomron, Noam; Carrau, Lucía; Hernández, Martín; Francia, Lourdes; Sosa, Katia; Tomás, Gonzalo; Panzera, Yanina

    2014-01-01

    Canine parvovirus (CPV), a fast-evolving single-stranded DNA virus, comprises three antigenic variants (2a, 2b, and 2c) with different frequencies and genetic variability among countries. The contribution of co-infection and recombination to the genetic variability of CPV is far from being fully elucidated. Here we took advantage of a natural CPV population, recently formed by the convergence of divergent CPV-2c and CPV-2a strains, to study co-infection and recombination. Complete sequences of the viral coding region of CPV-2a and CPV-2c strains from 40 samples were generated and analyzed using phylogenetic tools. Two samples showed co-infection and were further analyzed by deep sequencing. The sequence profile of one of the samples revealed the presence of CPV-2c and CPV-2a strains that differed at 29 nucleotides. The other sample included a minor CPV-2a strain (13.3% of the viral population) and a major recombinant strain (86.7%). The recombinant strain arose from inter-genotypic recombination between CPV-2c and CPV-2a strains within the VP1/VP2 gene boundary. Our findings highlight the importance of deep-sequencing analysis to provide a better understanding of CPV molecular diversity. PMID:25365348

  11. Open University Learning Analytics dataset.

    Science.gov (United States)

    Kuzilek, Jakub; Hlosta, Martin; Zdrahal, Zdenek

    2017-11-28

    Learning Analytics focuses on the collection and analysis of learners' data to improve their learning experience by providing informed guidance and to optimise learning materials. To support the research in this area we have developed a dataset, containing data from courses presented at the Open University (OU). What makes the dataset unique is the fact that it contains demographic data together with aggregated clickstream data of students' interactions in the Virtual Learning Environment (VLE). This enables the analysis of student behaviour, represented by their actions. The dataset contains the information about 22 courses, 32,593 students, their assessment results, and logs of their interactions with the VLE represented by daily summaries of student clicks (10,655,280 entries). The dataset is freely available at https://analyse.kmi.open.ac.uk/open_dataset under a CC-BY 4.0 license.

  12. Knowledge discovery with classification rules in a cardiovascular dataset.

    Science.gov (United States)

    Podgorelec, Vili; Kokol, Peter; Stiglic, Milojka Molan; Hericko, Marjan; Rozman, Ivan

    2005-12-01

    In this paper we study an evolutionary machine learning approach to data mining and knowledge discovery based on the induction of classification rules. A method for automatic rule induction called AREX, using evolutionary induction of decision trees and automatic programming, is introduced. The proposed algorithm is applied to a cardiovascular dataset consisting of different groups of attributes which should possibly reveal the presence of some specific cardiovascular problems in young patients. A case study is presented that shows the use of AREX for the classification of patients and for discovering possible new medical knowledge from the dataset. The defined knowledge discovery loop comprises a medical expert's assessment of induced rules to drive the evolution of rule sets towards more appropriate solutions. The final result is the discovery of possible new medical knowledge in the field of pediatric cardiology.

  13. Laboratory-Cultured Strains of the Sea Anemone Exaiptasia Reveal Distinct Bacterial Communities

    KAUST Repository

    Herrera Sarrias, Marcela; Ziegler, Maren; Voolstra, Christian R.; Aranda, Manuel

    2017-01-01

    Exaiptasia is a laboratory sea anemone model system for stony corals. Two clonal strains are commonly used, referred to as H2 and CC7, that originate from two genetically distinct lineages and that differ in their Symbiodinium specificity. However, little is known about their other microbial associations. Here, we examined and compared the taxonomic composition of the bacterial assemblages of these two symbiotic Exaiptasia strains, both of which have been cultured in the laboratory long-term under identical conditions. We found distinct bacterial microbiota for each strain, indicating the presence of host-specific microbial consortia. Putative differences in the bacterial functional profiles (i.e., enrichment and depletion of various metabolic processes) based on taxonomic inference were also detected, further suggesting functional differences of the microbiomes associated with these lineages. Our study contributes to the current knowledge of the Exaiptasia holobiont by comparing the bacterial diversity of two commonly used strains as models for coral research.

  14. Laboratory-Cultured Strains of the Sea Anemone Exaiptasia Reveal Distinct Bacterial Communities

    KAUST Repository

    Herrera Sarrias, Marcela

    2017-05-02

    Exaiptasia is a laboratory sea anemone model system for stony corals. Two clonal strains are commonly used, referred to as H2 and CC7, that originate from two genetically distinct lineages and that differ in their Symbiodinium specificity. However, little is known about their other microbial associations. Here, we examined and compared the taxonomic composition of the bacterial assemblages of these two symbiotic Exaiptasia strains, both of which have been cultured in the laboratory long-term under identical conditions. We found distinct bacterial microbiota for each strain, indicating the presence of host-specific microbial consortia. Putative differences in the bacterial functional profiles (i.e., enrichment and depletion of various metabolic processes) based on taxonomic inference were also detected, further suggesting functional differences of the microbiomes associated with these lineages. Our study contributes to the current knowledge of the Exaiptasia holobiont by comparing the bacterial diversity of two commonly used strains as models for coral research.

  15. Analysis of cagA in Helicobacter pylori strains from Colombian populations with contrasting gastric cancer risk reveals a biomarker for disease severity

    Science.gov (United States)

    Loh, John T.; Shaffer, Carrie L.; Piazuelo, M. Blanca; Bravo, Luis E.; McClain, Mark S.; Correa, Pelayo; Cover, Timothy L.

    2011-01-01

    BACKGROUND Helicobacter pylori infection is a risk factor for the development of gastric cancer, and the bacterial oncoprotein CagA contributes to gastric carcinogenesis. METHODS We analyzed H. pylori isolates from persons in Colombia and observed that there was marked variation among strains in levels of CagA expression. To elucidate the basis for this variation, we analyzed sequences upstream from the CagA translational initiation site in each strain. RESULTS A DNA motif (AATAAGATA) upstream of the translational initiation site of CagA was associated with high levels of CagA expression. Experimental studies showed that this motif was necessary but not sufficient for high-level CagA expression. H. pylori strains from a region of Colombia with high gastric cancer rates expressed higher levels of CagA than did strains from a region with lower gastric cancer rates, and Colombian strains of European phylogeographic origin expressed higher levels of CagA than did strains of African origin. Histopathological analysis of gastric biopsy specimens revealed that strains expressing high levels of CagA or containing the AATAAGATA motif were associated with more advanced precancerous lesions than those found in persons infected with strains expressing low levels of CagA or lacking the AATAAGATA motif. CONCLUSIONS CagA expression varies greatly among H. pylori strains. The DNA motif identified in this study is associated with high levels of CagA expression, and may be a useful biomarker to predict gastric cancer risk. IMPACT These findings help to explain why some persons infected with cagA-positive H. pylori develop gastric cancer and others do not. PMID:21859954

  16. Mridangam stroke dataset

    OpenAIRE

    CompMusic

    2014-01-01

    The audio examples were recorded from a professional Carnatic percussionist in semi-anechoic studio conditions by Akshay Anantapadmanabhan using SM-58 microphones and an H4n ZOOM recorder. The audio was sampled at 44.1 kHz and stored as 16 bit wav files. The dataset can be used for training models for each Mridangam stroke. A detailed description of the Mridangam and its strokes can be found in the paper below. A part of the dataset was used in the following paper. Akshay Anantapadman...
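    A consumer of such a dataset may want to confirm that each file matches the stated format (44.1 kHz, 16 bit) before training. The sketch below uses only Python's standard `wave` module; the function names are hypothetical, and the in-memory silent clip stands in for a real recording. Its mono channel count is an assumption, since the record does not state the number of channels.

```python
import io
import wave


def wav_format(source):
    """Return (sample_rate_hz, bits_per_sample, channels) of a WAV source."""
    with wave.open(source, "rb") as wav:
        return wav.getframerate(), wav.getsampwidth() * 8, wav.getnchannels()


def make_silent_wav(sample_rate=44100, sample_width_bytes=2, channels=1,
                    frames=100):
    """Build a short silent WAV in memory, used here as stand-in data."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width_bytes)
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00" * (frames * sample_width_bytes * channels))
    buffer.seek(0)
    return buffer


rate, bits, channels = wav_format(make_silent_wav())
print(rate, bits, channels)
```

    Passing a file path instead of the in-memory buffer to `wav_format` works the same way, so the check can be run over a directory of downloaded files.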

  17. 2008 TIGER/Line Nationwide Dataset

    Data.gov (United States)

    California Natural Resource Agency — This dataset contains a nationwide build of the 2008 TIGER/Line datasets from the US Census Bureau downloaded in April 2009. The TIGER/Line Shapefiles are an extract...

  18. Plant root transcriptome profiling reveals a strain-dependent response during Azospirillum-rice cooperation

    Directory of Open Access Journals (Sweden)

    Benoît Drogue

    2014-11-01

    Cooperation involving Plant Growth-Promoting Rhizobacteria results in improvements of plant growth and health. While pathogenic and symbiotic interactions are known to induce transcriptional changes for genes related to plant defence and development, little is known about the impact of phytostimulating rhizobacteria on plant gene expression. This study aims at identifying genes significantly regulated in rice roots upon Azospirillum inoculation, considering possible favored interaction between a strain and its original host cultivar. Genome-wide analyses of Oryza sativa japonica cultivars Cigalon and Nipponbare were performed, by using microarrays, seven days post inoculation with A. lipoferum 4B (isolated from Cigalon) or Azospirillum sp. B510 (isolated from Nipponbare) and compared to the respective non-inoculated condition. A total of 7,384 genes were significantly regulated, which represent about 16 % of total rice genes. A set of 34 genes is regulated by both Azospirillum strains in both cultivars, including a gene orthologous to PR10 of Brachypodium, and these could represent plant markers of Azospirillum-rice interactions. The results highlight a strain-dependent response of rice, with 83 % of the differentially expressed genes being classified as combination-specific. Whatever the combination, most of the differentially expressed genes are involved in primary metabolism, transport, regulation of transcription and protein fate. When considering genes involved in response to stress and plant defence, it appears that strain B510, a strain displaying endophytic properties, leads to the repression of a wider set of genes than strain 4B. Individual genotypic variations could be the most important driving force of rice root gene expression upon Azospirillum inoculation. 
Strain-dependent transcriptional changes observed for genes related to auxin and ethylene signalling highlight the complexity of hormone signalling networks in the Azospirillum

  19. Design of an audio advertisement dataset

    Science.gov (United States)

    Fu, Yutao; Liu, Jihong; Zhang, Qi; Geng, Yuting

    2015-12-01

    As more and more advertisements are broadcast on radio, it is necessary to establish an audio advertising dataset that can be used to analyze and classify advertisements. A method for establishing a complete audio advertising dataset is presented in this paper. The dataset is divided into four different kinds of advertisements. Each advertisement sample is given in *.wav file format and annotated with a txt file which contains its file name, sampling frequency, channel number, broadcasting time and class. The soundness of the classification of the advertisements in this dataset is supported by clustering the different advertisements based on Principal Component Analysis (PCA). The experimental results show that this audio advertisement dataset offers a reliable set of samples for related audio advertisement experimental studies.

  20. Background qualitative analysis of the European reference life cycle database (ELCD) energy datasets - part II: electricity datasets.

    Science.gov (United States)

    Garraín, Daniel; Fazio, Simone; de la Rúa, Cristina; Recchioni, Marco; Lechón, Yolanda; Mathieux, Fabrice

    2015-01-01

    The aim of this paper is to identify areas of potential improvement of the European Reference Life Cycle Database (ELCD) electricity datasets. The revision is based on the data quality indicators described by the International Life Cycle Data system (ILCD) Handbook, applied on a sectorial basis. These indicators evaluate the technological, geographical and time-related representativeness of the dataset and its appropriateness in terms of completeness, precision and methodology. Results show that ELCD electricity datasets are of very good quality in general terms; nevertheless, some findings and recommendations for improving the quality of Life-Cycle Inventories have been derived. Moreover, these results assure any LCA practitioner of the quality of the electricity-related datasets, and provide insights into the limitations and assumptions underlying the dataset modelling. Given this information, the LCA practitioner will be able to decide whether the use of the ELCD electricity datasets is appropriate based on the goal and scope of the analysis to be conducted. The methodological approach would also be useful for dataset developers and reviewers, in order to improve the overall Data Quality Requirements of databases.

  1. Data Recommender: An Alternative Way to Discover Open Scientific Datasets

    Science.gov (United States)

    Klump, J. F.; Devaraju, A.; Williams, G.; Hogan, D.; Davy, R.; Page, J.; Singh, D.; Peterson, N.

    2017-12-01

    similar and serendipitous data recommendations. It measures the relevance between datasets based on their properties, and search and download patterns. We evaluated the recommendation approach in a user study, and the obtained user judgments revealed the ability of the approach to accurately quantify the relevance of the datasets.

  2. Comparative genome analysis of VSP-II and SNPs reveals heterogenic variation in contemporary strains of Vibrio cholerae O1 isolated from cholera patients in Kolkata, India.

    Science.gov (United States)

    Imamura, Daisuke; Morita, Masatomo; Sekizuka, Tsuyoshi; Mizuno, Tamaki; Takemura, Taichiro; Yamashiro, Tetsu; Chowdhury, Goutam; Pazhani, Gururaja P; Mukhopadhyay, Asish K; Ramamurthy, Thandavarayan; Miyoshi, Shin-Ichi; Kuroda, Makoto; Shinoda, Sumio; Ohnishi, Makoto

    2017-02-01

    Cholera is an acute diarrheal disease and a major public health problem in many developing countries in Asia, Africa, and Latin America. Since the Bay of Bengal is considered the epicenter for the seventh cholera pandemic, it is important to understand the genetic dynamism of Vibrio cholerae from Kolkata, as a representative of the Bengal region. We analyzed whole genome sequence data of V. cholerae O1 isolated from cholera patients in Kolkata, India, from 2007 to 2014 and identified the heterogeneous genomic region in these strains. In addition, we carried out a phylogenetic analysis based on the whole genome single nucleotide polymorphisms to determine the genetic lineage of strains in Kolkata. This analysis revealed the heterogeneity of the Vibrio seventh pandemic island (VSP)-II in Kolkata strains. The ctxB genotype was also heterogeneous and was highly related to VSP-II types. In addition, phylogenetic analysis revealed the shifts in predominant strains in Kolkata. Two distinct lineages, 1 and 2, were found between 2007 and 2010. However, the proportion changed markedly in 2010 and lineage 2 strains were predominant thereafter. Lineage 2 can be divided into four sublineages, I, II, III and IV. The results of this study indicate that lineages 1 and 2-I were concurrently prevalent between 2007 and 2009, and that lineage 2-III was observed in 2010, followed by the predominance of lineage 2-IV from 2011, which continued until 2014. Our findings demonstrate that the epidemic of cholera in Kolkata was caused by several distinct strains that have been constantly changing within the genetic lineages of V. cholerae O1 in recent years.

  3. Pyrosequencing Analysis Reveals Changes in Intestinal Microbiota of Healthy Adults Who Received a Daily Dose of Immunomodulatory Probiotic Strains

    Directory of Open Access Journals (Sweden)

    Julio Plaza-Díaz

    2015-05-01

    The colon microbiota plays a crucial role in human gastrointestinal health. Current attempts to manipulate the colon microbiota composition are aimed at finding remedies for various diseases. We have recently described the immunomodulatory effects of three probiotic strains (Lactobacillus rhamnosus CNCM I-4036, Lactobacillus paracasei CNCM I-4034, and Bifidobacterium breve CNCM I-4035). The goal of the present study was to analyze the compositions of the fecal microbiota of healthy adults who received one of these strains using high-throughput 16S ribosomal RNA gene sequencing. Bacteroides was the most abundant genus in the groups that received L. rhamnosus CNCM I-4036 or L. paracasei CNCM I-4034. The Shannon indices were significantly increased in these two groups. Our results also revealed a significant increase in the Lactobacillus genus after the intervention with L. rhamnosus CNCM I-4036. The initially different colon microbiota became homogeneous in the subjects who received L. rhamnosus CNCM I-4036. While some orders that were initially present disappeared after the administration of L. rhamnosus CNCM I-4036, other orders, such as Sphingobacteriales, Nitrospirales, Desulfobacterales, Thiotrichales, and Synergistetes, were detected after the intervention. In summary, our results show that the intake of these three bacterial strains induced changes in the colon microbiota.
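
    The Shannon index mentioned in this abstract is a standard diversity measure, H' = -Σ p_i ln p_i, where p_i is the relative abundance of taxon i. The following is a minimal sketch of that formula only; the function name and the read counts are hypothetical and are not taken from the study, which does not describe its own computation here.

```python
import math
from typing import Iterable


def shannon_index(counts: Iterable[float]) -> float:
    """Shannon diversity H' = -sum(p_i * ln(p_i)) over taxa with nonzero counts."""
    nonzero = [c for c in counts if c > 0]
    total = sum(nonzero)
    if total == 0:
        return 0.0  # an empty sample has no diversity to measure
    return -sum((c / total) * math.log(c / total) for c in nonzero)


# Hypothetical genus-level read counts for one subject (illustrative only).
before = [90, 5, 5]    # one dominant genus
after = [40, 30, 30]   # more even community

print(shannon_index(before), shannon_index(after))
```

    A more even community yields a higher index, which is the sense in which an increased Shannon index is usually read as increased diversity.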

  4. A rapid NMR-based method for discrimination of strain-specific cell wall teichoic acid structures reveals a third backbone type in Lactobacillus plantarum.

    Science.gov (United States)

    Tomita, Satoru; Tanaka, Naoto; Okada, Sanae

    2017-03-01

    The lactic acid bacterium Lactobacillus plantarum is capable of producing strain-specific structures of cell wall teichoic acid (WTA), an anionic polysaccharide found in the Gram-positive bacterial cell wall. In this study, we established a rapid, NMR-based procedure to discriminate WTA structures in this species, and applied it to 94 strains of L. plantarum. Six previously reported glycerol- and ribitol-containing WTA subtypes were successfully identified from 78 strains, suggesting that these were the dominant structures. However, the level of structural variety differed markedly among bacterial sources, possibly reflecting differences in strain-level microbial diversity. WTAs from eight strains were not identified based on NMR spectra and were classified into three groups. Structural analysis of a partial degradation product of an unidentified WTA produced by strain TUA 1496L revealed that the WTA was 1-O-β-d-glucosylglycerol. Two-dimensional NMR analysis of the polymer structure showed phosphodiester bonds between C-3 and C-6 of the glycerol and glucose residues, suggesting a polymer structure of 3,6′-linked poly(1-O-β-d-glucosyl-sn-glycerol phosphate). This is the third WTA backbone structure in L. plantarum, following 3,6′-linked poly(1-O-α-d-glucosyl-sn-glycerol phosphate) and 1,5-linked poly(ribitol phosphate).

  5. Error characterisation of global active and passive microwave soil moisture datasets

    Directory of Open Access Journals (Sweden)

    W. A. Dorigo

    2010-12-01

    Understanding the error structures of remotely sensed soil moisture observations is essential for correctly interpreting observed variations and trends in the data or assimilating them in hydrological or numerical weather prediction models. Nevertheless, a spatially coherent assessment of the quality of the various globally available datasets is often hampered by the limited availability over space and time of reliable in-situ measurements. As an alternative, this study explores the triple collocation error estimation technique for assessing the relative quality of several globally available soil moisture products from active (ASCAT) and passive (AMSR-E and SSM/I) microwave sensors. The triple collocation is a powerful statistical tool to estimate the root mean square error while simultaneously solving for systematic differences in the climatologies of a set of three linearly related data sources with independent error structures. Prerequisite for this technique is the availability of a sufficiently large number of timely corresponding observations. In addition to the active and passive satellite-based datasets, we used the ERA-Interim and GLDAS-NOAH reanalysis soil moisture datasets as a third, independent reference. The prime objective is to reveal trends in uncertainty related to different observation principles (passive versus active), the use of different frequencies (C-, X-, and Ku-band) for passive microwave observations, and the choice of the independent reference dataset (ERA-Interim versus GLDAS-NOAH). The results suggest that the triple collocation method provides realistic error estimates. Observed spatial trends agree well with the existing theory and studies on the performance of different observation principles and frequencies with respect to land cover and vegetation density. In addition, if all theoretical prerequisites are fulfilled (e.g. a sufficiently large number of common observations is available and errors of the different

  6. The GTZAN dataset

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2013-01-01

    The GTZAN dataset appears in at least 100 published works, and is the most-used public dataset for evaluation in machine listening research for music genre recognition (MGR). Our recent work, however, shows GTZAN has several faults (repetitions, mislabelings, and distortions), which challenge...... of GTZAN, and provide a catalog of its faults. We review how GTZAN has been used in MGR research, and find few indications that its faults have been known and considered. Finally, we rigorously study the effects of its faults on evaluating five different MGR systems. The lesson is not to banish GTZAN...

  7. Global mRNA expression analysis in myosin II deficient strains of Saccharomyces cerevisiae reveals an impairment of cell integrity functions

    Directory of Open Access Journals (Sweden)

    Rivera-Molina Félix E

    2008-01-01

    Background: The Saccharomyces cerevisiae MYO1 gene encodes the myosin II heavy chain (Myo1p), a protein required for normal cytokinesis in budding yeast. Myo1p deficiency in yeast (myo1Δ) causes a cell separation defect characterized by the formation of attached cells, yet it also causes abnormal budding patterns, formation of enlarged and elongated cells, increased osmotic sensitivity, delocalized chitin deposition, increased chitin synthesis, and hypersensitivity to the chitin synthase III inhibitor Nikkomycin Z. To determine how differential expression of genes is related to these diverse cell wall phenotypes, we analyzed the global mRNA expression profile of myo1Δ strains. Results: Global mRNA expression profiles of myo1Δ strains and their corresponding wild type controls were obtained by hybridization to yeast oligonucleotide microarrays. Results for selected genes were confirmed by real time RT-PCR. A total of 547 differentially expressed genes (p ≤ 0.01) were identified with 263 up regulated and 284 down regulated genes in the myo1Δ strains. Gene set enrichment analysis revealed the significant over-representation of genes in the protein biosynthesis and stress response categories. The SLT2/MPK1 gene was up regulated in the microarray, and a myo1Δslt2Δ double mutant was non-viable. Overexpression of ribosomal protein genes RPL30 and RPS31 suppressed the hypersensitivity to Nikkomycin Z and increased the levels of phosphorylated Slt2p in myo1Δ strains. Increased levels of phosphorylated Slt2p were also observed in wild type strains under these conditions. Conclusion: Following this analysis of global mRNA expression in yeast myo1Δ strains, we conclude that 547 genes were differentially regulated in myo1Δ strains and that the stress response and protein biosynthesis gene categories were coordinately regulated in this mutant. The SLT2/MPK1 gene was confirmed to be essential for myo1Δ strain viability, supporting that the up

  8. Anonymising the Sparse Dataset: A New Privacy Preservation Approach while Predicting Diseases

    Directory of Open Access Journals (Sweden)

    V. Shyamala Susan

    2016-09-01

    Data mining techniques analyze medical datasets with the intention of improving patients' health while preserving their privacy. Most existing techniques are suited only to low-dimensional medical datasets. The proposed methodology designs a model for representing a sparse, high-dimensional medical dataset, with the aim of protecting patients' privacy from an adversary and, additionally, predicting the threat degree of the disease. In a sparse dataset, many non-zero values are randomly spread across the entire data space. Hence, the challenge is to cluster correlated patient records so as to predict the risk degree of a disease before it occurs in patients while maintaining privacy. The first phase converts the sparse dataset into a band matrix through the Genetic algorithm along with Cuckoo Search (GCS). This groups correlated patient records together and arranges them close to the diagonal. The next phase dissociates the patient's disease, which is a sensitive value (SA), from the parameters that determine the disease, normally the Quasi Identifier (QI). Finally, a density-based clustering technique is applied to the underlying data to create anonymized groups that maintain privacy and to predict the risk level of the disease. Empirical assessments on actual health care data, the V.A. Medical Centre heart disease dataset, reveal the efficiency of this model with respect to information loss, utility and privacy.

  9. Enzyme markers in inbred rat strains: genetics of new markers and strain profiles.

    Science.gov (United States)

    Adams, M; Baverstock, P R; Watts, C H; Gutman, G A

    1984-08-01

    Twenty-six inbred strains of the laboratory rat (Rattus norvegicus) were examined for electrophoretic variation at an estimated 97 genetic loci. In addition to previously documented markers, variation was observed for the enzymes aconitase, aldehyde dehydrogenase, and alkaline phosphatase. The genetic basis of these markers (Acon-1, Ahd-2, and Akp-1) was confirmed. Linkage analysis between 35 pairwise comparisons revealed that the markers Fh-1 and Pep-3 are linked. The strain profiles of the 25 inbred strains at 11 electrophoretic markers are given.

  10. Genetic analysis of Saccharomyces cerevisiae strains isolated from palm wine in eastern Nigeria. Comparison with other African strains.

    Science.gov (United States)

    Ezeronye, O U; Legras, J-L

    2009-05-01

    To study the yeast diversity of Nigerian palm wines by comparison with other African strains. Twenty-three Saccharomyces cerevisiae strains were obtained from palm wine samples collected at four locations in eastern Nigeria, and characterized using different molecular techniques: internal transcribed spacer restriction fragment length polymorphism and sequence analysis, pulsed field gel electrophoresis, inter delta typing and microsatellite multilocus analysis. These techniques revealed that palm wine yeasts represent a group of closely related strains that includes other West African isolates (CBS400, NCYC110, DVPG6044). Population analysis revealed an excess of homozygote strains and an allelic richness similar to wine suggestive of local domestication. Several other African yeast strains were not connected to this group. Ghana sorghum beer strains and other African strains (DBVPG1853 and MUCL28071) displayed strikingly high relatedness with European bread, beer or wine strains, and the genome of strain MUCL30909 contained African and wine-type alleles, indicating its hybrid origin. Nigerian palm wine yeast represents a local specific yeast flora, whereas a European or hybrid origin was suspected for several other African isolates. This study presents the first genetic characterization of an autochthonous African palm wine yeast population and confirms the idea that human intervention has favoured yeast migration.

  11. Longitudinal genotyping of Candida dubliniensis isolates reveals strain maintenance, microevolution, and the emergence of itraconazole resistance.

    LENUS (Irish Health Repository)

    Fleischhacker, M

    2010-05-01

    We investigated the population structure of 208 Candida dubliniensis isolates obtained from 29 patients (25 human immunodeficiency virus [HIV] positive and 4 HIV negative) as part of a longitudinal study. The isolates were identified as C. dubliniensis by arbitrarily primed PCR (AP-PCR) and then genotyped using the Cd25 probe specific for C. dubliniensis. The majority of the isolates (55 of 58) were unique to individual patients, and more than one genotype was recovered from 15 of 29 patients. A total of 21 HIV-positive patients were sampled on more than one occasion (2 to 36 times). Sequential isolates recovered from these patients were all closely related, as demonstrated by hybridization with Cd25 and genotyping by PCR. Six patients were colonized by the same genotype of C. dubliniensis on repeated sampling, while strains exhibiting altered genotypes were recovered from 15 of 21 patients. The majority of these isolates demonstrated minor genetic alterations, i.e., microevolution, while one patient acquired an unrelated strain. The C. dubliniensis strains could not be separated into genetically distinct groups based on patient viral load, CD4 cell count, or oropharyngeal candidosis. However, C. dubliniensis isolates obtained from HIV-positive patients were more closely related than those recovered from HIV-negative patients. Approximately 8% (16 of 194) of isolates exhibited itraconazole resistance. Cross-resistance to fluconazole was only observed in one of these patients. Two patients harboring itraconazole-resistant isolates had not received any previous azole therapy. In conclusion, longitudinal genotyping of C. dubliniensis isolates from HIV-infected patients reveals that isolates from the same patient are generally closely related and may undergo microevolution. In addition, isolates may acquire itraconazole resistance, even in the absence of prior azole therapy.

  12. Ancient genomes reveal a high diversity of Mycobacterium leprae in medieval Europe.

    Directory of Open Access Journals (Sweden)

    Verena J Schuenemann

    2018-05-01

    Studying ancient DNA allows us to retrace the evolutionary history of human pathogens, such as Mycobacterium leprae, the main causative agent of leprosy. Leprosy is one of the oldest recorded and most stigmatizing diseases in human history. The disease was prevalent in Europe until the 16th century and is still endemic in many countries with over 200,000 new cases reported annually. Previous worldwide studies on modern and European medieval M. leprae genomes revealed that they cluster into several distinct branches of which two were present in medieval Northwestern Europe. In this study, we analyzed 10 new medieval M. leprae genomes including the so far oldest M. leprae genome from one of the earliest known cases of leprosy in the United Kingdom, a skeleton from the Great Chesterford cemetery with a calibrated age of 415-545 C.E. This dataset provides a genetic time transect of M. leprae diversity in Europe over the past 1500 years. We find M. leprae strains from four distinct branches to be present in the Early Medieval Period, and strains from three different branches were detected within a single cemetery from the High Medieval Period. Altogether these findings suggest a higher genetic diversity of M. leprae strains in medieval Europe at various time points than previously assumed. The resulting more complex picture of the past phylogeography of leprosy in Europe impacts current phylogeographical models of M. leprae dissemination. It suggests alternative models for the past spread of leprosy such as a widespread prevalence of strains from different branches in Eurasia already in Antiquity or maybe even an origin in Western Eurasia. Furthermore, these results highlight how studying ancient M. leprae strains improves understanding of the history of leprosy worldwide.

  13. Ancient genomes reveal a high diversity of Mycobacterium leprae in medieval Europe.

    Science.gov (United States)

    Schuenemann, Verena J; Avanzi, Charlotte; Krause-Kyora, Ben; Seitz, Alexander; Herbig, Alexander; Inskip, Sarah; Bonazzi, Marion; Reiter, Ella; Urban, Christian; Dangvard Pedersen, Dorthe; Taylor, G Michael; Singh, Pushpendra; Stewart, Graham R; Velemínský, Petr; Likovsky, Jakub; Marcsik, Antónia; Molnár, Erika; Pálfi, György; Mariotti, Valentina; Riga, Alessandro; Belcastro, M Giovanna; Boldsen, Jesper L; Nebel, Almut; Mays, Simon; Donoghue, Helen D; Zakrzewski, Sonia; Benjak, Andrej; Nieselt, Kay; Cole, Stewart T; Krause, Johannes

    2018-05-01

    Studying ancient DNA allows us to retrace the evolutionary history of human pathogens, such as Mycobacterium leprae, the main causative agent of leprosy. Leprosy is one of the oldest recorded and most stigmatizing diseases in human history. The disease was prevalent in Europe until the 16th century and is still endemic in many countries with over 200,000 new cases reported annually. Previous worldwide studies on modern and European medieval M. leprae genomes revealed that they cluster into several distinct branches of which two were present in medieval Northwestern Europe. In this study, we analyzed 10 new medieval M. leprae genomes including the so far oldest M. leprae genome from one of the earliest known cases of leprosy in the United Kingdom, a skeleton from the Great Chesterford cemetery with a calibrated age of 415-545 C.E. This dataset provides a genetic time transect of M. leprae diversity in Europe over the past 1500 years. We find M. leprae strains from four distinct branches to be present in the Early Medieval Period, and strains from three different branches were detected within a single cemetery from the High Medieval Period. Altogether these findings suggest a higher genetic diversity of M. leprae strains in medieval Europe at various time points than previously assumed. The resulting more complex picture of the past phylogeography of leprosy in Europe impacts current phylogeographical models of M. leprae dissemination. It suggests alternative models for the past spread of leprosy such as a widespread prevalence of strains from different branches in Eurasia already in Antiquity or maybe even an origin in Western Eurasia. Furthermore, these results highlight how studying ancient M. leprae strains improves understanding of the history of leprosy worldwide.

  14. Sequence analysis of measles virus strains collected during the pre- and early-vaccination era in Denmark reveals a considerable diversity of ancient strains

    DEFF Research Database (Denmark)

    Christensen, Laurids Siig; Schöller, S.; Schierup, M. H.

    2002-01-01

    A total of 199 serum samples from patients with measles collected in Denmark, Greenland and the Faroe Islands from 1964 to 1983 were analysed by PCR. Measles virus (MV) RNA could be detected in 38 (19%) of the samples and a total of 18 strains were subjected to partial sequence analysis...... of the hemagglutinin gene. The strains exhibited a considerable genomic diversity, which is at odds with the assumption that one genome type prevailed among globally circulating MV strains prior to the advent of live-attenuated vaccines. Our data indicate that the similarity of the various vaccine strains...... is attributed to their having originated from the same primary isolate. Consequently, it is implied that a small number of clinical manifestations of MV worldwide from which strains similar to the vaccine strain were identified were vaccine related rather than being caused by members of a persistently...

  15. Strain Pattern in Supercooled Liquids

    Science.gov (United States)

    Illing, Bernd; Fritschi, Sebastian; Hajnal, David; Klix, Christian; Keim, Peter; Fuchs, Matthias

    2016-11-01

    Investigations of strain correlations at the glass transition reveal unexpected phenomena. The shear strain fluctuations show an Eshelby-strain pattern [∼cos(4θ)/r²], characteristic of elastic response, even in liquids, at long times. We address this using a mode-coupling theory for the strain fluctuations in supercooled liquids and data from both video microscopy of a two-dimensional colloidal glass former and simulations of Brownian hard disks. We show that the long-ranged and long-lived strain signatures follow a scaling law valid close to the glass transition. For large enough viscosities, the Eshelby-strain pattern is visible even on time scales longer than the structural relaxation time τ and after the shear modulus has relaxed to zero.

  16. Polymorphism of Paramecium pentaurelia (Ciliophora, Oligohymenophorea) strains revealed by rDNA and mtDNA sequences.

    Science.gov (United States)

    Przyboś, Ewa; Tarcz, Sebastian; Greczek-Stachura, Magdalena; Surmacz, Marta

    2011-05-01

    Paramecium pentaurelia is one of 15 known sibling species of the Paramecium aurelia complex. It is recognized as a species showing no intra-specific differentiation on the basis of molecular fingerprint analyses, whereas the majority of other species are polymorphic. This study aimed at assessing genetic polymorphism within P. pentaurelia including new strains recently found in Poland (originating from two water bodies, different years, seasons, and clones of one strain) as well as strains collected from distant habitats (USA, Europe, Asia), and strains representing other species of the complex. We compared two DNA fragments: partial sequences (349 bp) of the LSU rDNA and partial sequences (618 bp) of cytochrome B gene. A correlation between the geographical origin of the strains and the genetic characteristics of their genotypes was not observed. Different genotypes were found in Kraków in two types of water bodies (Opatkowice, a natural pond; Jordan's Park, an artificial pond). Haplotype diversity within a single water body was not recorded. Likewise, seasonal haplotype differences between the strains within the artificial water body, as well as differences between clones originating from one strain, were not detected. The clustering of some strains belonging to different species was observed in the phylogenies.

  17. 2-D DIGE proteomic profiles of three strains of Fusarium graminearum grown in agmatine or glutamic acid medium

    Directory of Open Access Journals (Sweden)

    Tommaso Serchi

    2016-03-01

    2D DIGE proteomics data obtained from three strains belonging to Fusarium graminearum s.s. species growing in a glutamic acid or agmatine containing medium are provided. A total of 381 protein species have been identified that differ in abundance between the two treatments and among the strains (ANOVA±1.3). Data on the diversity of protein species profiles between the two media for each strain are made available. Shared profiles among strains are discussed in Pasquali et al. [1]. Here, proteins with diverse profiles that can be used to differentiate strains are highlighted. The full dataset allows single-strain proteomic profiles to be obtained. Keywords: Comparative strain proteomics, Toxigenic fungi, Polyamines, Trichothecenes, Strain variability

  18. High-Resolution Typing Reveals Distinct Chlamydia trachomatis Strains in an At-Risk Population in Nanjing, China

    NARCIS (Netherlands)

    Bom, Reinier J. M.; van den Hoek, Anneke; Wang, Qianqiu; Long, Fuquan; de Vries, Henry J. C.; Bruisten, Sylvia M.

    2013-01-01

    We investigated Chlamydia trachomatis strains from Nanjing, China, and whether these strains differed from Amsterdam, the Netherlands. C. trachomatis type was determined with multilocus sequence typing. Most strains were specific to Nanjing, but some clustered with strains from Amsterdam. This

  19. Editorial: Datasets for Learning Analytics

    NARCIS (Netherlands)

    Dietze, Stefan; George, Siemens; Davide, Taibi; Drachsler, Hendrik

    2018-01-01

    The European LinkedUp and LACE (Learning Analytics Community Exchange) projects have been responsible for setting up a series of data challenges at the LAK conferences 2013 and 2014 around the LAK dataset. The LAK dataset consists of a rich collection of full text publications in the domain of

  20. A Novel Technique for Time-Centric Analysis of Massive Remotely-Sensed Datasets

    Directory of Open Access Journals (Sweden)

    Glenn E. Grant

    2015-04-01

    Analyzing massive remotely-sensed datasets presents formidable challenges. The volume of satellite imagery collected often outpaces analytical capabilities; however, thorough analyses of complete datasets may provide new insights into processes that would otherwise be unseen. In this study we present a novel, object-oriented approach to storing, retrieving, and analyzing large remotely-sensed datasets. The objective is to provide a new structure for scalable storage and rapid, Internet-based analysis of climatology data. The concept of a “data rod” is introduced, a conceptual data object that organizes time-series information into a temporally-oriented vertical column at any given location. To demonstrate one possible use, we ingest 25 years of Greenland imagery into a series of pure-object databases, then retrieve and analyze the data. The results provide a basis for evaluating the database performance and scientific analysis capabilities. The project succeeds in demonstrating the effectiveness of the prototype database architecture and analysis approach, not because new scientific information is discovered, but because quality control issues are revealed in the source data that had gone undetected for years.
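
    The “data rod” idea described above, a time series stored per location rather than an image stored per time step, can be illustrated with a small sketch. This is only an in-memory approximation under stated assumptions: the function name, the dictionary layout and the sample values are hypothetical, and the study itself used pure-object databases whose design is not detailed in the abstract.

```python
from typing import Dict, List, Tuple

Grid = List[List[float]]                 # one image: rows of pixel values
Rod = List[Tuple[str, float]]            # one location: (timestamp, value) pairs


def build_data_rods(frames: List[Tuple[str, Grid]]) -> Dict[Tuple[int, int], Rod]:
    """Reorganize a time-ordered stack of images into per-location time series."""
    rods: Dict[Tuple[int, int], Rod] = {}
    for timestamp, grid in frames:
        for r, row in enumerate(grid):
            for c, value in enumerate(row):
                # Append this time step's value to the column at (row, col).
                rods.setdefault((r, c), []).append((timestamp, value))
    return rods


# Hypothetical 2x2 images at two time steps (illustrative values only).
frames = [
    ("2000-01", [[1.0, 2.0], [3.0, 4.0]]),
    ("2000-02", [[1.5, 2.5], [3.5, 4.5]]),
]
rods = build_data_rods(frames)
print(rods[(0, 1)])  # full time series at row 0, column 1
```

    With this layout, a time-centric query at one location reads a single rod instead of touching every image in the archive.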

  1. The Geometry of Finite Equilibrium Datasets

    DEFF Research Database (Denmark)

    Balasko, Yves; Tvede, Mich

    We investigate the geometry of finite datasets defined by equilibrium prices, income distributions, and total resources. We show that the equilibrium condition imposes no restrictions if total resources are collinear, a property that is robust to small perturbations. We also show that the set...... of equilibrium datasets is pathconnected when the equilibrium condition does impose restrictions on datasets, as for example when total resources are widely non collinear....

  2. Sequencing of bovine herpesvirus 4 v.test strain reveals important genome features

    Directory of Open Access Journals (Sweden)

    Gillet Laurent

    2011-08-01

    Background: Bovine herpesvirus 4 (BoHV-4) is a useful model for the human pathogenic gammaherpesviruses Epstein-Barr virus and Kaposi's Sarcoma-associated Herpesvirus. Although genome manipulations of this virus have been greatly facilitated by the cloning of the BoHV-4 V.test strain as a Bacterial Artificial Chromosome (BAC), the lack of a complete genome sequence for this strain limits its experimental use. Methods: In this study, we have determined the complete sequence of BoHV-4 V.test strain by a pyrosequencing approach. Results: The long unique coding region (LUR) consists of 108,241 bp encoding at least 79 open reading frames and is flanked by several polyrepetitive DNA units (prDNA). As previously suggested, we showed that the prDNA unit located at the left prDNA-LUR junction (prDNA-G) differs from the other prDNA units (prDNA-inner). Namely, the prDNA-G unit lacks the conserved pac-2 cleavage and packaging signal in its right terminal region. Based on the mechanisms of cleavage and packaging of herpesvirus genomes, this feature implies that only genomes bearing left and right end prDNA units are encapsulated into virions. Conclusions: In this study, we have determined the complete genome sequence of the BAC-cloned BoHV-4 V.test strain and identified genome organization features that could be important in other herpesviruses.

  3. Life Stress, Strain, and Deviance Across Schools: Testing the Contextual Version of General Strain Theory in China.

    Science.gov (United States)

    Zhang, Jinwu; Liu, Jianhong; Wang, Xin; Zou, Anquan

    2017-08-01

    General Strain Theory delineates different types of strain and intervening processes from strain to deviance and crime. In addition to explaining individual strain-crime relationship, a contextualized version of general strain theory, which is called the Macro General Strain Theory, has been used to analyze how aggregate variables influence aggregate and individual deviance and crime. Using a sample of 1,852 students (Level 1) nested in 52 schools (Level 2), the current study tests the Macro General Strain Theory using Chinese data. The results revealed that aggregate life stress and strain have influences on aggregate and individual deviance, and reinforce the individual stress-deviance association. The current study contributes by providing the first Macro General Strain Theory test based on Chinese data and offering empirical evidence for the multilevel intervening processes from strain to deviance. Limitations and future research directions are discussed.

  4. Sequencing of emerging canine distemper virus strain reveals new distinct genetic lineage in the United States associated with disease in wildlife and domestic canine populations.

    Science.gov (United States)

    Riley, Matthew C; Wilkes, Rebecca P

    2015-12-18

    Recent outbreaks of canine distemper have prompted examination of strains from clinical samples submitted to the University of Tennessee College of Veterinary Medicine (UTCVM) Clinical Virology Lab. We previously described a new strain of CDV that significantly diverged from all genotypes reported to date including America 2, the genotype proposed to be the main lineage currently circulating in the US. The aim of this study was to determine when this new strain appeared and how widespread it is in animal populations, given that it has also been detected in fully vaccinated adult dogs. Additionally, we sequenced complete viral genomes to characterize the strain and determine if variation is confined to known variable regions of the genome or if the changes are also present in more conserved regions. Archived clinical samples were genotyped using real-time RT-PCR amplification and sequencing. The genomes of two unrelated viruses from a dog and fox each from a different state were sequenced and aligned with previously published genomes. Phylogenetic analysis was performed using coding, non-coding and genome-length sequences. Virus neutralization assays were used to evaluate potential antigenic differences between this strain and a vaccine strain and mixed ANOVA test was used to compare the titers. Genotyping revealed this strain first appeared in 2011 and was detected in dogs from multiple states in the Southeast region of the United States. It was the main strain detected among the clinical samples that were typed from 2011-2013, including wildlife submissions. Genome sequencing demonstrated that it is highly conserved within a new lineage and preliminary serologic testing showed significant differences in neutralizing antibody titers between this strain and the strain commonly used in vaccines. This new strain represents an emerging CDV in domestic dogs in the US, may be associated with a stable reservoir in the wildlife population, and could facilitate vaccine

  5. Comparative genome analysis of Prevotella intermedia strain isolated from infected root canal reveals features related to pathogenicity and adaptation.

    Science.gov (United States)

    Ruan, Yunfeng; Shen, Lu; Zou, Yan; Qi, Zhengnan; Yin, Jun; Jiang, Jie; Guo, Liang; He, Lin; Chen, Zijiang; Tang, Zisheng; Qin, Shengying

    2015-02-25

    Many species of the genus Prevotella are pathogens that cause oral diseases. Prevotella intermedia is known to cause various oral disorders, e.g., periodontal disease, periapical periodontitis and noma, as well as to colonize the respiratory tract and be associated with cystic fibrosis and chronic bronchitis. It is of clinical significance to identify the main drivers of its adaptation and pathogenicity. In order to explore the intra-species genetic differences among strains of Prevotella intermedia from different niches, we isolated a strain, Prevotella intermedia ZT, from the infected root canal of a Chinese patient with periapical periodontitis and obtained a draft genome sequence. We annotated the genome and compared it with the genomes of other taxa in the genus Prevotella. The raw data set, consisting of approximately 65X-coverage reads, was trimmed and assembled into contigs from which 2165 ORFs were predicted. The comparison of the Prevotella intermedia ZT genome sequence with the published genome sequences of Prevotella intermedia 17 and Prevotella intermedia ATCC25611 revealed that ~14% of the genes were strain-specific. The Prevotella intermedia strains share a set of conserved genes contributing to their adaptation and pathogenicity, and possess strain-specific genes, especially those involved in adhesion and bacteriocin secretion. Prevotella intermedia ZT shares similar gene content with other taxa of the genus Prevotella. The genomes of the genus Prevotella are highly dynamic with relatively conserved parts: on average, about half of the genes in one Prevotella genome were not included in another genome of a different Prevotella species. The degree of conservation varied with different pathways: the ability to biosynthesize amino acids varied greatly among species, but the pathway for biosynthesis of cell wall components was nearly constant. The phylogenetic tree shows that the taxa from different niches are scarcely distributed among clades. Prevotella intermedia ZT

  6. The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification.

    Directory of Open Access Journals (Sweden)

    Nicolás Sarute

    Full Text Available Canine distemper virus (CDV; Paramyxoviridae, Morbillivirus) is the etiologic agent of a multisystemic infectious disease affecting all terrestrial carnivore families with high incidence and mortality in domestic dogs. Sequence analysis of the hemagglutinin (H) gene has been widely employed to characterize field strains, permitting the identification of nine CDV lineages worldwide. Recently, it has been established that the sequences of the fusion protein signal-peptide (Fsp) coding region are extremely variable, suggesting that analysis of its sequence might be useful for strain characterization studies. However, the divergence of Fsp sequences among worldwide strains and its phylogenetic resolution has not yet been evaluated. We constructed datasets containing the Fsp-coding region and H gene sequences of the same strains belonging to eight CDV lineages. Both datasets were used to evaluate their phylogenetic resolution. The phylogenetic analysis revealed that both datasets clustered the same strains into eight different branches, corresponding to CDV lineages. The inter-lineage amino acid divergence was fourfold greater for the Fsp peptide than for the H protein. The likelihood mapping revealed that both datasets display strong phylogenetic signals in the region of well-resolved topologies. These features indicate that Fsp-coding region sequence analysis is suitable for evolutionary studies as it allows for straightforward identification of CDV lineages.

  7. The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification.

    Science.gov (United States)

    Sarute, Nicolás; Calderón, Marina Gallo; Pérez, Ruben; La Torre, José; Hernández, Martín; Francia, Lourdes; Panzera, Yanina

    2013-01-01

    Canine distemper virus (CDV; Paramyxoviridae, Morbillivirus) is the etiologic agent of a multisystemic infectious disease affecting all terrestrial carnivore families with high incidence and mortality in domestic dogs. Sequence analysis of the hemagglutinin (H) gene has been widely employed to characterize field strains, permitting the identification of nine CDV lineages worldwide. Recently, it has been established that the sequences of the fusion protein signal-peptide (Fsp) coding region are extremely variable, suggesting that analysis of its sequence might be useful for strain characterization studies. However, the divergence of Fsp sequences among worldwide strains and its phylogenetic resolution has not yet been evaluated. We constructed datasets containing the Fsp-coding region and H gene sequences of the same strains belonging to eight CDV lineages. Both datasets were used to evaluate their phylogenetic resolution. The phylogenetic analysis revealed that both datasets clustered the same strains into eight different branches, corresponding to CDV lineages. The inter-lineage amino acid divergence was fourfold greater for the Fsp peptide than for the H protein. The likelihood mapping revealed that both datasets display strong phylogenetic signals in the region of well-resolved topologies. These features indicate that Fsp-coding region sequence analysis is suitable for evolutionary studies as it allows for straightforward identification of CDV lineages.
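    The abstract does not say how the inter-lineage amino acid divergence was computed. As a rough illustration only, one common measure is the p-distance: the fraction of aligned positions at which two sequences differ. The sketch below assumes the sequences are already aligned to equal length and that gaps are marked with "-"; the function name and the toy sequences are hypothetical and are not taken from the study.

```python
def p_distance(seq_a, seq_b, gap="-"):
    """Fraction of differing residues between two aligned sequences.

    Positions where either sequence has a gap are skipped.
    This is an illustrative measure, not necessarily the one used in the paper.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for x, y in zip(seq_a, seq_b):
        if x == gap or y == gap:
            continue  # ignore alignment gaps
        compared += 1
        if x != y:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return differing / compared


# Toy aligned peptides (hypothetical, for illustration only)
print(p_distance("MKTA", "MKSA"))
```

    Averaging such pairwise values over pairs of strains drawn from different lineages would give one possible inter-lineage divergence figure for each region.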

  8. fCCAC: functional canonical correlation analysis to evaluate covariance between nucleic acid sequencing datasets.

    Science.gov (United States)

    Madrigal, Pedro

    2017-03-01

    Computational evaluation of variability across DNA or RNA sequencing datasets is a crucial step in genomic science, as it makes it possible both to evaluate the reproducibility of biological or technical replicates and to compare different datasets to identify their potential correlations. Here we present fCCAC, an application of functional canonical correlation analysis to assess covariance of nucleic acid sequencing datasets such as chromatin immunoprecipitation followed by deep sequencing (ChIP-seq). We show how this method differs from other measures of correlation, and exemplify how it can reveal shared covariance between histone modifications and DNA binding proteins, such as the relationship between the H3K4me3 chromatin mark and its epigenetic writers and readers. An R/Bioconductor package is available at http://bioconductor.org/packages/fCCAC/. Contact: pmb59@cam.ac.uk. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  9. Compilation and analysis of multiple groundwater-quality datasets for Idaho

    Science.gov (United States)

    Hundt, Stephen A.; Hopkins, Candice B.

    2018-05-09

    Groundwater is an important source of drinking and irrigation water throughout Idaho, and groundwater quality is monitored by various Federal, State, and local agencies. The historical, multi-agency records of groundwater quality constitute a valuable dataset that has yet to be compiled or analyzed on a statewide level. The purpose of this study is to combine groundwater-quality data from multiple sources into a single database, to summarize this dataset, and to perform bulk analyses to reveal spatial and temporal patterns of water quality throughout Idaho. Data were retrieved from the Water Quality Portal (https://www.waterqualitydata.us/), the Idaho Department of Environmental Quality, and the Idaho Department of Water Resources. Analyses included counting the number of times a sample location had concentrations above Maximum Contaminant Levels (MCL), performing trend tests, and calculating correlations between water-quality analytes. The water-quality database and the analysis results are available through USGS ScienceBase (https://doi.org/10.5066/F72V2FBG).
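    The exceedance count described above can be sketched in a few lines. This is a minimal illustration, not the study's code: the record layout (location, analyte, concentration), the well names, the concentrations, and the threshold table are all assumptions made for the example.

```python
from collections import Counter


def count_mcl_exceedances(samples, mcl):
    """Count, per sample location, how many results exceed the analyte's MCL.

    samples: iterable of (location, analyte, concentration) tuples
    mcl:     dict mapping analyte name to its threshold, in the same units
    Analytes without a threshold in `mcl` are ignored.
    """
    counts = Counter()
    for location, analyte, concentration in samples:
        limit = mcl.get(analyte)
        if limit is not None and concentration > limit:
            counts[location] += 1
    return dict(counts)


# Hypothetical records, concentrations in mg/L (illustrative values only)
samples = [
    ("well-A", "nitrate", 12.0),
    ("well-A", "nitrate", 8.0),
    ("well-A", "arsenic", 0.02),
    ("well-B", "nitrate", 9.5),
]
# Illustrative thresholds in mg/L; check the applicable regulation before reuse
mcl = {"nitrate": 10.0, "arsenic": 0.010}

print(count_mcl_exceedances(samples, mcl))
```

    Locations with no exceedance simply do not appear in the result, which keeps the output small when most wells are below the thresholds.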

  10. Benchmarking undedicated cloud computing providers for analysis of genomic datasets.

    Science.gov (United States)

    Yazar, Seyhan; Gooden, George E C; Mackey, David A; Hewitt, Alex W

    2014-01-01

    A major bottleneck in biological discovery is now emerging at the computational level. Cloud computing offers a dynamic means whereby small and medium-sized laboratories can rapidly adjust their computational capacity. We benchmarked two established cloud computing services, Amazon Web Services Elastic MapReduce (EMR) on Amazon EC2 instances and Google Compute Engine (GCE), using publicly available genomic datasets (E. coli CC102 strain and a Han Chinese male genome) and a standard bioinformatic pipeline on a Hadoop-based platform. Wall-clock time for complete assembly differed by 52.9% (95% CI: 27.5-78.2) for E. coli and 53.5% (95% CI: 34.4-72.6) for human genome, with GCE being more efficient than EMR. The cost of running this experiment on EMR and GCE differed significantly, with the costs on EMR being 257.3% (95% CI: 211.5-303.1) and 173.9% (95% CI: 134.6-213.1) more expensive for E. coli and human assemblies respectively. Thus, GCE was found to outperform EMR both in terms of cost and wall-clock time. Our findings confirm that cloud computing is an efficient and potentially cost-effective alternative for analysis of large genomic datasets. In addition to releasing our cost-effectiveness comparison, we present available ready-to-use scripts for establishing Hadoop instances with Ganglia monitoring on EC2 or GCE.

  11. Benchmarking undedicated cloud computing providers for analysis of genomic datasets.

    Directory of Open Access Journals (Sweden)

    Seyhan Yazar

    Full Text Available A major bottleneck in biological discovery is now emerging at the computational level. Cloud computing offers a dynamic means whereby small and medium-sized laboratories can rapidly adjust their computational capacity. We benchmarked two established cloud computing services, Amazon Web Services Elastic MapReduce (EMR) on Amazon EC2 instances and Google Compute Engine (GCE), using publicly available genomic datasets (E. coli CC102 strain and a Han Chinese male genome) and a standard bioinformatic pipeline on a Hadoop-based platform. Wall-clock time for complete assembly differed by 52.9% (95% CI: 27.5-78.2) for E. coli and 53.5% (95% CI: 34.4-72.6) for human genome, with GCE being more efficient than EMR. The cost of running this experiment on EMR and GCE differed significantly, with the costs on EMR being 257.3% (95% CI: 211.5-303.1) and 173.9% (95% CI: 134.6-213.1) more expensive for E. coli and human assemblies respectively. Thus, GCE was found to outperform EMR both in terms of cost and wall-clock time. Our findings confirm that cloud computing is an efficient and potentially cost-effective alternative for analysis of large genomic datasets. In addition to releasing our cost-effectiveness comparison, we present available ready-to-use scripts for establishing Hadoop instances with Ganglia monitoring on EC2 or GCE.

  12. An Annotated Dataset of 14 Meat Images

    DEFF Research Database (Denmark)

    Stegmann, Mikkel Bille

    2002-01-01

    This note describes a dataset consisting of 14 annotated images of meat. Points of correspondence are placed on each image. As such, the dataset can be readily used for building statistical models of shape. Further, format specifications and terms of use are given.

  13. Comparison of recent SnIa datasets

    International Nuclear Information System (INIS)

    Sanchez, J.C. Bueno; Perivolaropoulos, L.; Nesseris, S.

    2009-01-01

    We rank the six latest Type Ia supernova (SnIa) datasets (Constitution (C), Union (U), ESSENCE (Davis) (E), Gold06 (G), SNLS 1yr (S) and SDSS-II (D)) in the context of the Chevalier-Polarski-Linder (CPL) parametrization $w(a) = w_0 + w_1 (1 - a)$, according to their Figure of Merit (FoM), their consistency with the cosmological constant (ΛCDM), their consistency with standard rulers (Cosmic Microwave Background (CMB) and Baryon Acoustic Oscillations (BAO)) and their mutual consistency. We find a significant improvement of the FoM (defined as the inverse area of the 95.4% parameter contour) with the number of SnIa of these datasets ((C) highest FoM, (U), (G), (D), (E), (S) lowest FoM). Standard rulers (CMB+BAO) have a better FoM by about a factor of 3, compared to the highest FoM SnIa dataset (C). We also find that the ranking sequence based on consistency with ΛCDM is identical with the corresponding ranking based on consistency with standard rulers ((S) most consistent, (D), (C), (E), (U), (G) least consistent). The ranking sequence of the datasets however changes when we consider the consistency with an expansion history corresponding to evolving dark energy $(w_0, w_1) = (-1.4, 2)$ crossing the phantom divide line $w = -1$ (it is practically reversed to (G), (U), (E), (S), (D), (C)). The SALT2 and MLCS2k2 fitters are also compared and some peculiar features of the SDSS-II dataset when standardized with the MLCS2k2 fitter are pointed out. Finally, we construct a statistic to estimate the internal consistency of a collection of SnIa datasets. We find that even though there is good consistency among most samples taken from the above datasets, this consistency decreases significantly when the Gold06 (G) dataset is included in the sample.
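    To make the CPL parametrization concrete, the short sketch below evaluates w(a) = w_0 + w_1 (1 - a) and solves for the scale factor at which it crosses the phantom divide w = -1. The function names are illustrative assumptions; only the formula and the parameter pair (-1.4, 2) come from the abstract.

```python
def w_cpl(a, w0, w1):
    """CPL equation of state: w(a) = w0 + w1 * (1 - a), with a the scale factor."""
    return w0 + w1 * (1.0 - a)


def phantom_crossing(w0, w1):
    """Scale factor a at which w(a) = -1, or None when w1 == 0.

    Solving w0 + w1 * (1 - a) = -1 for a gives a = 1 + (w0 + 1) / w1.
    The result is only physically relevant when it lies in (0, 1].
    """
    if w1 == 0:
        return None
    return 1.0 + (w0 + 1.0) / w1


w0, w1 = -1.4, 2.0                    # evolving dark energy case quoted above
a_cross = phantom_crossing(w0, w1)    # approximately 0.8
z_cross = 1.0 / a_cross - 1.0         # corresponding redshift, approximately 0.25

print(w_cpl(1.0, w0, w1))             # present-day value, w(a=1) = w0
print(a_cross, z_cross)
```

    With these parameters the equation of state is below -1 today (a = 1) and rises above -1 at earlier times, so the crossing falls at a scale factor near 0.8.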

  14. SIMADL: Simulated Activities of Daily Living Dataset

    Directory of Open Access Journals (Sweden)

    Talal Alshammari

    2018-04-01

    Full Text Available With the realisation of the Internet of Things (IoT) paradigm, the analysis of the Activities of Daily Living (ADLs) in a smart home environment is becoming an active research domain. The existence of representative datasets is a key requirement to advance the research in smart home design. Such datasets are an integral part of the visualisation of new smart home concepts as well as the validation and evaluation of emerging machine learning models. Machine learning techniques that can learn ADLs from sensor readings are used to classify, predict and detect anomalous patterns. Such techniques require data that represent relevant smart home scenarios, for training, testing and validation. However, the development of such machine learning techniques is limited by the lack of real smart home datasets, due to the excessive cost of building real smart homes. This paper provides two datasets for classification and anomaly detection. The datasets are generated using OpenSHS (Open Smart Home Simulator), which is a simulation software for dataset generation. OpenSHS records the daily activities of a participant within a virtual environment. Seven participants simulated their ADLs for different contexts, e.g., weekdays, weekends, mornings and evenings. Eighty-four files in total were generated, representing approximately 63 days' worth of activities. Forty-two files of classification of ADLs were simulated in the classification dataset and the other forty-two files are for anomaly detection problems in which anomalous patterns were simulated and injected into the anomaly detection dataset.

  15. The NOAA Dataset Identifier Project

    Science.gov (United States)

    de la Beaujardiere, J.; Mccullough, H.; Casey, K. S.

    2013-12-01

    The US National Oceanic and Atmospheric Administration (NOAA) initiated a project in 2013 to assign persistent identifiers to datasets archived at NOAA and to create informational landing pages about those datasets. The goals of this project are to enable the citation of datasets used in products and results in order to help provide credit to data producers, to support traceability and reproducibility, and to enable tracking of data usage and impact. A secondary goal is to encourage the submission of datasets for long-term preservation, because only archived datasets will be eligible for a NOAA-issued identifier. A team was formed with representatives from the National Geophysical, Oceanographic, and Climatic Data Centers (NGDC, NODC, NCDC) to resolve questions including which identifier scheme to use (answer: Digital Object Identifier - DOI), whether or not to embed semantics in identifiers (no), the level of granularity at which to assign identifiers (as coarsely as reasonable), how to handle ongoing time-series data (do not break into chunks), creation mechanism for the landing page (stylesheet from formal metadata record preferred), and others. Decisions made and implementation experience gained will inform the writing of a Data Citation Procedural Directive to be issued by the Environmental Data Management Committee in 2014. Several identifiers have been issued as of July 2013, with more on the way. NOAA is now reporting the number as a metric to federal Open Government initiatives. This paper will provide further details and status of the project.

  16. Control Measure Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — The EPA Control Measure Dataset is a collection of documents describing air pollution control available to regulated facilities for the control and abatement of air...

  17. Effect of genetic strain and gender on age-related changes in body composition of the laboratory rat.

    Data.gov (United States)

    U.S. Environmental Protection Agency — Body composition data for common laboratory strains of rat as a function of age. This dataset is associated with the following publication: Gordon , C., K. Jarema ,...

  18. Isolation and genetic characterization of Aurantimonas and Methylobacterium strains from stems of hypernodulated soybeans.

    Science.gov (United States)

    Anda, Mizue; Ikeda, Seishi; Eda, Shima; Okubo, Takashi; Sato, Shusei; Tabata, Satoshi; Mitsui, Hisayuki; Minamisawa, Kiwamu

    2011-01-01

    The aims of this study were to isolate Aurantimonas and Methylobacterium strains that responded to soybean nodulation phenotypes and nitrogen fertilization rates in a previous culture-independent analysis (Ikeda et al. ISME J. 4:315-326, 2010). Two strategies were adopted for isolation from enriched bacterial cells prepared from stems of field-grown, hypernodulated soybeans: PCR-assisted isolation for Aurantimonas and selective cultivation for Methylobacterium. Thirteen of 768 isolates cultivated on Nutrient Agar medium were identified as Aurantimonas by colony PCR specific for Aurantimonas and 16S rRNA gene sequencing. Meanwhile, among 187 isolates on methanol-containing agar media, 126 were identified by 16S rRNA gene sequences as Methylobacterium. A clustering analysis (>99% identity) of the 16S rRNA gene sequences for the combined datasets of the present and previous studies revealed 4 and 8 operational taxonomic units (OTUs) for Aurantimonas and Methylobacterium, respectively, and showed the successful isolation of target bacteria for these two groups. ERIC- and BOX-PCR showed the genomic uniformity of the target isolates. In addition, phylogenetic analyses of Aurantimonas revealed a phyllosphere-specific cluster in the genus. The isolates obtained in the present study will be useful for revealing unknown legume-microbe interactions in relation to the autoregulation of nodulation.

  19. The Kinetics Human Action Video Dataset

    OpenAIRE

    Kay, Will; Carreira, Joao; Simonyan, Karen; Zhang, Brian; Hillier, Chloe; Vijayanarasimhan, Sudheendra; Viola, Fabio; Green, Tim; Back, Trevor; Natsev, Paul; Suleyman, Mustafa; Zisserman, Andrew

    2017-01-01

    We describe the DeepMind Kinetics human action video dataset. The dataset contains 400 human action classes, with at least 400 video clips for each action. Each clip lasts around 10s and is taken from a different YouTube video. The actions are human focussed and cover a broad range of classes including human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands. We describe the statistics of the dataset, how it was collected, and give some ...

  20. The complete genome sequence of Bacillus velezensis strain GH1-13 reveals agriculturally beneficial properties and a unique plasmid.

    Science.gov (United States)

    Kim, Sang Yoon; Song, Hajin; Sang, Mee Kyung; Weon, Hang-Yeon; Song, Jaekyeong

    2017-10-10

    The bacterial strain Bacillus velezensis GH1-13, isolated from rice paddy soil in Korea, has been shown to promote plant growth and have strong antagonistic activities against pathogens. Here, we report the complete genome sequence of GH1-13, revealing that it possesses a single 4,071,980-bp circular chromosome with 46.2% GC-content. The chromosome encodes 3,930 genes, and we have also identified a unique plasmid in the strain that encodes a further 104 genes (71,628bp and 31.7% GC-content). The genome was found to contain various enzyme-encoding operons, including indole-3-acetic acid (IAA) biosynthesis proteins, 2,3-butanediol dehydrogenase, various non-ribosomal peptide synthetases, and several polyketide synthases. These properties are responsible for the promotion of plant growth and the biosynthesis of secondary metabolites. They therefore have multiple beneficial effects that could be applied to agriculture. Through curing, we found that the unique plasmid of GH1-13 has important roles in the production of phytohormones, such as IAA, and in shaping phenotypic and physiological characteristics. The plasmid therefore likely influences the biological activities of GH1-13. The complete genome sequence of B. velezensis GH1-13 contributes to our understanding of this beneficial strain and will encourage research into its development for agricultural or biotechnological applications, enhancing productivity and crop quality. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. Comparison of CORA and EN4 in-situ datasets validation methods, toward a better quality merged dataset.

    Science.gov (United States)

    Szekely, Tanguy; Killick, Rachel; Gourrion, Jerome; Reverdin, Gilles

    2017-04-01

    CORA and EN4 are both global delayed time mode validated in-situ ocean temperature and salinity datasets distributed by the Met Office (http://www.metoffice.gov.uk/) and Copernicus (www.marine.copernicus.eu). A large part of the profiles distributed by CORA and EN4 in recent years are Argo profiles from the ARGO DAC, but profiles are also extracted from the World Ocean Database and TESAC profiles from GTSPP. In the case of CORA, data coming from the EUROGOOS Regional Operational Observing System (ROOS) operated by European institutes not managed by National Data Centres, and other datasets of profiles provided by scientific sources, can also be found (sea mammal profiles from MEOP, XBT datasets from cruises ...). (EN4 also takes data from the ASBO dataset to supplement observations in the Arctic.) The first advantage of this new merged product is to enhance the space and time coverage at global and European scales for the period from 1950 until a year before the current year. This product is updated once a year, and T&S gridded fields are also generated for the period 1990 to year n-1. The enhancement compared to the previous CORA product will be presented. Despite the fact that the profiles distributed by both datasets are mostly the same, the quality control procedures developed by the Met Office and Copernicus teams differ, sometimes leading to different quality control flags for the same profile. A new study, started in 2016, aims to compare both validation procedures to move towards a Copernicus Marine Service dataset with the best features of CORA and EN4 validation. A reference dataset composed of the full set of in-situ temperature and salinity measurements collected by Coriolis during 2015 is used. These measurements have been made thanks to a wide range of instruments (XBTs, CTDs, Argo floats, instrumented sea mammals, ...), covering the global ocean. The reference dataset has been validated simultaneously by both teams. An exhaustive comparison of the

  2. Whole genome sequencing reveals complex evolution patterns of multidrug-resistant Mycobacterium tuberculosis Beijing strains in patients.

    Directory of Open Access Journals (Sweden)

    Matthias Merker

    Full Text Available Multidrug-resistant (MDR) Mycobacterium tuberculosis complex (MTBC) strains represent a major threat for tuberculosis (TB) control. Treatment of MDR-TB patients is long and less effective, resulting in a significant number of treatment failures. The development of further resistances leads to extensively drug-resistant (XDR) variants. However, data on the individual reasons for treatment failure, e.g. an induced mutational burst, and on the evolution of bacteria in the patient are only sparsely available. To address this question, we investigated the intra-patient evolution of serial MTBC isolates obtained from three MDR-TB patients undergoing longitudinal treatment, finally leading to XDR-TB. Sequential isolates displayed identical IS6110 fingerprint patterns, suggesting the absence of exogenous re-infection. We utilized whole genome sequencing (WGS) to screen for variations in three isolates from Patient A and four isolates from Patient B and C, respectively. Acquired polymorphisms were subsequently validated in up to 15 serial isolates by Sanger sequencing. We determined eight (Patient A) and nine (Patient B) polymorphisms, which occurred in a stepwise manner during the course of the therapy and were linked to resistance or a potential compensatory mechanism. For both patients, our analysis revealed the long-term co-existence of clonal subpopulations that displayed different drug resistance allele combinations. Out of these, the most resistant clone was fixed in the population. In contrast, baseline and follow-up isolates of Patient C were distinguished each by eleven unique polymorphisms, indicating an exogenous re-infection with an XDR strain not detected by IS6110 RFLP typing. Our study demonstrates that intra-patient microevolution of MDR-MTBC strains under longitudinal treatment is more complex than previously anticipated. However, a mutator phenotype was not detected. The presence of different subpopulations might confound phenotypic and

  3. Characterization of Foodborne Strains of Staphylococcus aureus by Shotgun Proteomics: Functional Networks, Virulence Factors and Species-Specific Peptide Biomarkers

    Science.gov (United States)

    Carrera, Mónica; Böhme, Karola; Gallardo, José M.; Barros-Velázquez, Jorge; Cañas, Benito; Calo-Mata, Pilar

    2017-01-01

    In the present work, we applied a shotgun proteomics approach for the fast and easy characterization of 20 different foodborne strains of Staphylococcus aureus (S. aureus), one of the most recognized foodborne pathogenic bacteria. A total of 644 non-redundant proteins were identified and analyzed via an easy and rapid protein sample preparation procedure. The results allowed the differentiation of several proteome datasets from the different strains (common, accessory, and unique datasets), which were used to determine relevant functional pathways and differentiate the strains into different Euclidean hierarchical clusters. Moreover, a predicted protein-protein interaction network of the foodborne S. aureus strains was created. The whole confidence network contains 77 nodes and 769 interactions. Most of the identified proteins were surface-associated proteins that were related to pathways and networks of energy, lipid metabolism and virulence. Twenty-seven virulence factors were identified, and most of them corresponded to autolysins, N-acetylmuramoyl-L-alanine amidases, phenol-soluble modulins, extracellular fibrinogen-binding proteins and virulence factor EsxA. Potential species-specific peptide biomarkers were screened. Twenty-one species-specific peptide biomarkers, belonging to eight different proteins (nickel-ABC transporter, N-acetylmuramoyl-L-alanine amidase, autolysin, clumping factor A, gram-positive signal peptide YSIRK, cysteine protease/staphopain, transcriptional regulator MarR, and transcriptional regulator Sar-A), were proposed to identify S. aureus. These results constitute the first major dataset of peptides and proteins of foodborne S. aureus strains. This repository may be useful for further studies, for the development of new therapeutic treatments for S. aureus food intoxications and for microbial source-tracking in foodstuffs. PMID:29312172

  4. Significant strain accumulation between the deformation front and landward out-of-sequence thrusts in accretionary wedge of SW Taiwan revealed by cGPS and SAR interferometry

    Science.gov (United States)

    Tsai, M. C.

    2017-12-01

    High strain accumulation across the fold-and-thrust belt in southwestern Taiwan is revealed by continuous GPS (cGPS) and SAR interferometry. This high strain is generally accommodated by the major active structures in the fold-and-thrust belt of the Western Foothills in SW Taiwan, which is connected to the accretionary wedge in the incipient arc-continent collision zone. The active structures across the zone of high strain accumulation include the deformation front around the Tainan Tableland and the Hochiali, Hsiaokangshan, Fangshan and Chishan faults. Among these active structures, the deformation pattern revealed by cGPS and SAR interferometry suggests that the Fangshan transfer fault may be a left-lateral fault zone with a thrust component, accommodating the westward differential motion of the thrust sheets on both sides of the fault. In addition, the Chishan fault is connected to the splay fault bordering the lower slope and upper slope of the accretionary wedge, and could be the major seismogenic fault and an out-of-sequence thrust fault in SW Taiwan. Large earthquakes resulting from the reactivation of out-of-sequence thrusts have been observed along the Nankai accretionary wedge; the assessment of the major seismogenic structures by strain accumulation between the frontal décollement and out-of-sequence thrusts is therefore a crucial topic. According to the background seismicity, low seismicity and mid-crust to mantle events are observed inland and in the lower- and upper-slope domains offshore SW Taiwan, which rheologically implies that the upper crust of the accretionary wedge is more or less aseismic. This result may suggest that the excess fluid pressure from the accretionary wedge has not only significantly weakened the prism materials and the major fault zones but also causes landward extension of the accretionary wedge, which would explain the low seismicity observed in the SW Taiwan area. Key words: Continuous GPS, SAR interferometry, strain rate, out-of-sequence thrust.

  5. The impact of the resolution of meteorological datasets on catchment-scale drought studies

    Science.gov (United States)

    Hellwig, Jost; Stahl, Kerstin

    2017-04-01

    Gridded meteorological datasets provide the basis for studying drought at a range of scales, including catchment-scale drought studies in hydrology. They are readily available for studying past weather conditions and often serve real-time monitoring as well. As these datasets differ in spatial/temporal coverage and spatial/temporal resolution, most studies face a tradeoff between these features. Our investigation examines whether biases occur when drought is studied at catchment scale with low-resolution input data. To this end, the datasets HYRAS (covering Central Europe, 1x1 km grid, daily data, 1951-2005), E-OBS (Europe, 0.25° grid, daily data, 1950-2015) and GPCC (whole world, 0.5° grid, monthly data, 1901-2013) are compared. Generally, biases in precipitation increase with decreasing resolution. The most important variations are found during summer. In the low mountain ranges of Central Europe, the coarse-resolution datasets (E-OBS, GPCC) overestimate dry days and underestimate total precipitation because they are not able to describe high spatial variability. However, relative measures such as the correlation coefficient reveal good consistency of dry and wet periods, both for absolute precipitation values and for standardized indices such as the Standardized Precipitation Index (SPI) or the Standardized Precipitation Evaporation Index (SPEI). In particular, the most severe droughts derived from the different datasets match very well. These results indicate that absolute values from coarse-resolution datasets may be critical to use for an assessment of hydrological drought at catchment scale, whereas relative measures for determining periods of drought are more trustworthy. Therefore, drought studies that downscale meteorological data should carefully consider their data needs and focus on relative measures for dry periods if these are sufficient for the task.
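
    The contrast between absolute and relative measures can be illustrated with a minimal sketch. The monthly precipitation values below are invented purely for illustration (they are not taken from HYRAS, E-OBS or GPCC), and the helper names `pearson` and `relative_bias` are hypothetical. The sketch assumes a coarse dataset that underestimates every month by a constant 20 %, which is a simplification of the resolution effect described above.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long series."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def relative_bias(reference, candidate):
    """Relative difference in total precipitation (candidate vs. reference)."""
    return (sum(candidate) - sum(reference)) / sum(reference)


# Invented monthly precipitation totals in mm (illustration only).
fine = [80.0, 20.0, 55.0, 5.0, 110.0, 40.0]
# Assumption: the coarse grid smooths out peaks and loses 20 % everywhere.
coarse = [0.8 * p for p in fine]

bias = relative_bias(fine, coarse)   # absolute measure: totals disagree
r = pearson(fine, coarse)            # relative measure: timing agrees
print(f"relative bias = {bias:.2f}, correlation = {r:.2f}")
```

    Under this assumption the totals differ by a fifth while the sequence of dry and wet months is reproduced perfectly, which is the situation in which relative measures remain usable even though absolute values are biased.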

  6. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets

    Energy Technology Data Exchange (ETDEWEB)

    Wu, Yu-Wei [Joint BioEnergy Inst. (JBEI), Emeryville, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Simmons, Blake A. [Joint BioEnergy Inst. (JBEI), Emeryville, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Singer, Steven W. [Joint BioEnergy Inst. (JBEI), Emeryville, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

    2015-10-29

    The recovery of genomes from metagenomic datasets is a critical step to defining the functional roles of the underlying uncultivated populations. We previously developed MaxBin, an automated binning approach for high-throughput recovery of microbial genomes from metagenomes. Here, we present an expanded binning algorithm, MaxBin 2.0, which recovers genomes from co-assembly of a collection of metagenomic datasets. Tests on simulated datasets revealed that MaxBin 2.0 is highly accurate in recovering individual genomes, and the application of MaxBin 2.0 to several metagenomes from environmental samples demonstrated that it could achieve two complementary goals: recovering more bacterial genomes compared to binning a single sample as well as comparing the microbial community composition between different sampling environments. Availability and implementation: MaxBin 2.0 is freely available at http://sourceforge.net/projects/maxbin/ under BSD license. Supplementary information: Supplementary data are available at Bioinformatics online.

  7. Fluxnet Synthesis Dataset Collaboration Infrastructure

    Energy Technology Data Exchange (ETDEWEB)

    Agarwal, Deborah A. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Humphrey, Marty [Univ. of Virginia, Charlottesville, VA (United States); van Ingen, Catharine [Microsoft. San Francisco, CA (United States); Beekwilder, Norm [Univ. of Virginia, Charlottesville, VA (United States); Goode, Monte [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Jackson, Keith [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Rodriguez, Matt [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Weber, Robin [Univ. of California, Berkeley, CA (United States)

    2008-02-06

    The Fluxnet synthesis dataset originally compiled for the La Thuile workshop contained approximately 600 site years. Since the workshop, several additional site years have been added and the dataset now contains over 920 site years from over 240 sites. A data refresh update is expected to increase those numbers in the next few months. The ancillary data describing the sites continues to evolve as well. There are on the order of 120 site contacts, and 60 proposals have been approved to use the data. These proposals involve around 120 researchers. The size and complexity of the dataset and collaboration have led to a new approach to providing access to the data and collaboration support. The support team attended the workshop and worked closely with the attendees and the Fluxnet project office to define the requirements for the support infrastructure. As a result of this effort, a new website (http://www.fluxdata.org) has been created to provide access to the Fluxnet synthesis dataset. This new web site is based on a scientific data server which enables browsing of the data on-line, data download, and version tracking. We leverage database and data analysis tools such as OLAP data cubes and web reports to enable browser and Excel pivot table access to the data.

  8. Exoproteome analysis reveals higher abundance of proteins linked to alkaline stress in persistent Listeria monocytogenes strains.

    Science.gov (United States)

    Rychli, Kathrin; Grunert, Tom; Ciolacu, Luminita; Zaiser, Andreas; Razzazi-Fazeli, Ebrahim; Schmitz-Esser, Stephan; Ehling-Schulz, Monika; Wagner, Martin

    2016-02-02

    The foodborne pathogen Listeria monocytogenes, responsible for listeriosis, a rare but severe infectious disease, can survive in the food processing environment for months or even years. So-called persistent L. monocytogenes strains greatly increase the risk of (re)contamination of food products, and are therefore a great challenge for food safety. However, our understanding of the mechanism underlying persistence is still fragmented. In this study we compared the exoproteome of three persistent strains with the reference strain EGDe under mild stress conditions using 2D differential gel electrophoresis. Principal component analysis including all differentially abundant protein spots showed that the exoproteome of strain EGDe (sequence type (ST) 35) is distinct from that of the persistent strain R479a (ST8) and the two closely related ST121 strains 4423 and 6179. Phylogenetic analyses based on multilocus ST genes showed similar grouping of the strains. Comparing the exoproteome of strain EGDe and the three persistent strains resulted in identification of 22 differentially expressed protein spots corresponding to 16 proteins. Six proteins were significantly increased in the persistent L. monocytogenes exoproteomes, among them proteins involved in alkaline stress response (e.g. the membrane anchored lipoprotein Lmo2637 and the NADPH dehydrogenase NamA). In parallel, the persistent strains showed increased survival under alkaline stress, which often occurs during cleaning and disinfection in food processing environments. In addition, gene expression of the proteins linked to stress response (Lmo2637, NamA, Fhs and QoxA) was higher in the persistent strains not only at 37 °C but also at 10 °C. Invasion efficiency of EGDe was higher in intestinal epithelial Caco2 and macrophage-like THP1 cells compared to the persistent strains. Concurrently we found higher expression of proteins involved in virulence in EGDe e.g. the actin-assembly-inducing protein ActA and the

  9. Feedback control in deep drawing based on experimental datasets

    Science.gov (United States)

    Fischer, P.; Heingärtner, J.; Aichholzer, W.; Hortig, D.; Hora, P.

    2017-09-01

    In large-scale production of deep drawing parts, as in the automotive industry, scatter in material properties and warming of the tools have a significant impact on the drawing result. In this work, an approach is presented to minimize the influence of these effects on part quality by optically measuring the draw-in of each part and adjusting the settings of the press to keep the strain distribution, which is represented by the draw-in, within a certain limit. For the design of the control algorithm, a design of experiments for in-line tests is used to quantify the influence of the blank holder force as well as the force distribution on the draw-in. The results of this experimental dataset are used to model the process behavior. Based on this model, a feedback control loop is designed. Finally, the performance of the control algorithm is validated in the production line.
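
    The general idea of a model-based feedback loop can be sketched as follows. This is a minimal illustration and not the authors' controller: it assumes a single scalar draw-in measurement, a linear process model fitted by least squares to invented design-of-experiments data, and a simple proportional update of the blank holder force. All names (`fit_line`, `next_force`, `run_loop`), the numbers, the units and the gain are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def next_force(force, measured, target, slope, gain=0.5):
    """Move the force by a fraction of the model-predicted correction."""
    return force + gain * (target - measured) / slope


def run_loop(n_parts, target, start_force, slope, process, gain=0.5):
    """Measure each part, then adjust the force for the following part."""
    force = start_force
    history = []
    for _ in range(n_parts):
        measured = process(force)
        history.append((force, measured))
        force = next_force(force, measured, target, slope, gain)
    return history


# Invented experimental dataset: blank holder force (kN) vs. draw-in (mm).
doe_forces = [400.0, 500.0, 600.0]
doe_draw_in = [42.0, 40.0, 38.0]
slope, intercept = fit_line(doe_forces, doe_draw_in)


def drifted_process(force):
    """Assumed plant: the fitted model plus a constant 1 mm drift."""
    return intercept + slope * force + 1.0


history = run_loop(20, target=40.0, start_force=500.0,
                   slope=slope, process=drifted_process)
print(history[0], history[-1])
```

    With these assumptions the first part deviates from the target by the full drift, and each subsequent correction halves the remaining error. A gain below 1 is chosen here so that measurement noise on a single part is not fully fed back into the press settings.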

  10. Intramyocardial strain estimation from cardiac cine MRI.

    Science.gov (United States)

    Elnakib, Ahmed; Beache, Garth M; Gimel'farb, Georgy; El-Baz, Ayman

    2015-08-01

    Functional strain is one of the important clinical indicators for the quantification of heart performance and the early detection of cardiovascular diseases, and functional strain parameters are used to aid therapeutic decisions and follow-up evaluations after cardiac surgery. A comprehensive framework for deriving functional strain parameters at the endocardium, epicardium, and mid-wall of the left ventricle (LV) from conventional cine MRI data was developed and tested. Cine data were collected using short TR-/TE-balanced steady-state free precession acquisitions on a 1.5T Siemens Espree scanner. The LV wall borders are segmented using a level set-based deformable model guided by a stochastic force derived from a second-order Markov-Gibbs random field model that accounts for the object shape and appearance features. Then, the mid-wall of the segmented LV is determined based on estimating the centerline between the endocardium and epicardium of the LV. Finally, a geometrical Laplace-based method is proposed to track corresponding points on successive myocardial contours throughout the cardiac cycle in order to characterize the strain evolutions. The method was tested using simulated phantom images with predefined point locations of the LV wall throughout the cardiac cycle. The method was tested on 30 in vivo datasets to evaluate the feasibility of the proposed framework to index functional strain parameters. The cine MRI-based model agreed with the ground truth for functional metrics to within 0.30 % for indexing the peak systolic strain change and 0.29 % (per unit time) for indexing systolic and diastolic strain rates. The method was feasible for in vivo extraction of functional strain parameters. Strain indexes of the endocardium, mid-wall, and epicardium can be derived from routine cine images using automated techniques, thereby improving the utility of cine MRI data for characterization of myocardial function. Unlike traditional texture-based tracking, the

  11. Solar Integration National Dataset Toolkit | Grid Modernization | NREL

    Science.gov (United States)

    Solar Integration National Dataset Toolkit. NREL is working on a Solar Integration National Dataset (SIND) Toolkit to enable researchers to perform U.S. regional solar generation integration studies. It will provide modeled, coherent subhourly solar power data

  12. PROVIDING GEOGRAPHIC DATASETS AS LINKED DATA IN SDI

    Directory of Open Access Journals (Sweden)

    E. Hietanen

    2016-06-01

    In this study, a prototype service to provide data from a Web Feature Service (WFS) as linked data is implemented. First, persistent and unique Uniform Resource Identifiers (URIs) are created for all spatial objects in the dataset. The objects are available from those URIs in Resource Description Framework (RDF) data format. Next, a Web Ontology Language (OWL) ontology is created to describe the dataset information content using the Open Geospatial Consortium’s (OGC) GeoSPARQL vocabulary. The existing data model is modified in order to take into account the linked data principles. The implemented service produces an HTTP response dynamically. The data for the response is first fetched from the existing WFS. Then the Geography Markup Language (GML) format output of the WFS is transformed on the fly to the RDF format. Content negotiation is used to serve the data in different RDF serialization formats. This solution facilitates the use of a dataset in different applications without replicating the whole dataset. In addition, individual spatial objects in the dataset can be referred to with URIs. Furthermore, the needed information content of the objects can be easily extracted from the RDF serializations available from those URIs. A solution for linking data objects to the dataset URI is also introduced by using the Vocabulary of Interlinked Datasets (VoID). The dataset is divided into subsets and each subset is given its own persistent and unique URI. This enables the whole dataset to be explored with a web browser and all individual objects to be indexed by search engines.
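
    The content negotiation step can be sketched as follows. This is an illustrative simplification and not the service described in the record: it only inspects the HTTP Accept header, handles `q` values and wildcards, and ignores other media-type parameters. The list of supported serializations and the function names are assumptions made for the example.

```python
# Assumed set of RDF serializations offered by a hypothetical service,
# in the server's order of preference.
SUPPORTED = [
    "text/turtle",
    "application/rdf+xml",
    "application/ld+json",
    "application/n-triples",
]


def parse_accept(header):
    """Return a list of (media_range, q) pairs from an Accept header."""
    prefs = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_range = fields[0].lower()
        if not media_range:
            continue
        q = 1.0
        for field in fields[1:]:
            if field.startswith("q="):
                try:
                    q = float(field[2:])
                except ValueError:
                    q = 0.0
        prefs.append((media_range, q))
    return prefs


def quality_for(media_type, prefs):
    """q value of the most specific media range matching media_type."""
    main_type = media_type.split("/")[0]
    for pattern in (media_type, main_type + "/*", "*/*"):
        for media_range, q in prefs:
            if media_range == pattern:
                return q
    return 0.0


def choose_serialization(accept_header, supported=SUPPORTED):
    """Pick the best supported type, or None if nothing is acceptable."""
    if not accept_header or not accept_header.strip():
        return supported[0]
    prefs = parse_accept(accept_header)
    best, best_q = None, 0.0
    for media_type in supported:
        q = quality_for(media_type, prefs)
        if q > best_q:  # ties keep the server's preferred order
            best, best_q = media_type, q
    return best


print(choose_serialization("application/rdf+xml;q=0.9, text/turtle;q=0.5"))
```

    A return value of `None` would correspond to answering with HTTP status 406 (Not Acceptable); a real service might instead fall back to a default serialization.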

  13. Wind Integration National Dataset Toolkit | Grid Modernization | NREL

    Science.gov (United States)

    Wind Integration National Dataset Toolkit. The Wind Integration National Dataset (WIND) Toolkit is an update and expansion of the Eastern Wind Integration Data Set and Western Wind Integration Data Set. It supports the next generation of wind integration studies. WIND

  14. Optimizing disk registration algorithms for nanobeam electron diffraction strain mapping

    Energy Technology Data Exchange (ETDEWEB)

    Pekin, Thomas C. [Department of Materials Science and Engineering, University of California, Berkeley, Berkeley, USA 94720 (United States); National Center for Electron Microscopy, Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, USA 94720 (United States); Gammer, Christoph [Erich Schmid Institute of Materials Science, Jahnstrasse 12, Leoben, Austria 8700 (Austria); Ciston, Jim [National Center for Electron Microscopy, Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, USA 94720 (United States); Minor, Andrew M. [Department of Materials Science and Engineering, University of California, Berkeley, Berkeley, USA 94720 (United States); National Center for Electron Microscopy, Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, USA 94720 (United States); Ophus, Colin, E-mail: cophus@gmail.com [National Center for Electron Microscopy, Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, USA 94720 (United States)

    2017-05-15

    Scanning nanobeam electron diffraction strain mapping is a technique by which the positions of diffracted disks sampled at the nanoscale over a crystalline sample can be used to reconstruct a strain map over a large area. However, it is important that the disk positions are measured accurately, as their positions relative to a reference are directly used to calculate strain. In this study, we compare several correlation methods using both simulated and experimental data in order to directly probe susceptibility to measurement error due to non-uniform diffracted disk illumination structure. We found that prefiltering the diffraction patterns with a Sobel filter before performing cross correlation or performing a square-root magnitude weighted phase correlation returned the best results when inner disk structure was present. We have tested these methods both on simulated datasets, and experimental data from unstrained silicon as well as a twin grain boundary in 304 stainless steel.
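
    Why accurate disk positions matter can be made concrete with a short sketch. It assumes that each disk position is expressed as a reciprocal-space vector g measured from the undiffracted beam (here in detector pixels), and it uses the standard relation d = 1/|g| between lattice spacing and reciprocal vector length, so that the strain along g is |g_ref|/|g| - 1. The function names and numbers are hypothetical, and the sketch covers only the strain component along one vector rather than a full strain map.

```python
from math import hypot


def strain_along_g(g_ref, g_meas):
    """Strain along g: (d - d_ref) / d_ref with d = 1 / |g|."""
    return hypot(*g_ref) / hypot(*g_meas) - 1.0


def strain_error_from_position_error(g_ref, position_error):
    """First-order strain uncertainty caused by a disk position error."""
    return position_error / hypot(*g_ref)


# Hypothetical disk positions in pixels relative to the central beam.
g_reference = (100.0, 0.0)
g_measured = (99.0, 0.0)   # disk moved inward: lattice spacing increased

strain = strain_along_g(g_reference, g_measured)
error = strain_error_from_position_error(g_reference, 0.1)
print(f"strain = {strain:.4%}, uncertainty for 0.1 px error = {error:.2%}")
```

    In this example a registration error of a tenth of a pixel already corresponds to a strain uncertainty of 0.1 %, which is why the choice of correlation method for locating the disks affects the resulting strain values directly.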

  15. (Project 14-6770) An Investigation to Establish Multiphysical Property Dataset of Nuclear Materials Based on in-situ Observations and Measurements

    Energy Technology Data Exchange (ETDEWEB)

    Tomar, Vikas [Purdue Univ., West Lafayette, IN (United States); Haque, Aman [Pennsylvania State Univ., University Park, PA (United States). Dept of Physics; Hattar, Khalid [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

    2017-11-10

    In-core nuclear materials including fuel pins and cladding materials fail due to issues including corrosion, mechanical wear, and pellet cladding interaction. In most such scenarios, microstructure-dependent and corrosion-induced chemistry-dependent property changes significantly affect the performance of cladding, pellet, and housing. The emphasis of this work was on replacing conventional pellet-cladding material models with a new strain-gradient viscoplasticity model that is informed by transmission electron microscopy (TEM) based measurements and by nanomechanical Raman spectroscopy (NMRS) based measurements. The TEM measurements are quantitative in nature and therefore reveal stress-strain relations with simultaneous insights into mechanisms of deformation at the nanoscale. The NMRS measurements reveal similar information at the mesoscale, along with additional information relating local microstructural stresses to applied stresses. The resulting information is used to fit constants in the strain-gradient viscoplasticity model as well as to validate it. For the TEM measurements, a micro-electro-mechanical system based setup was developed with mechanical actuation, sensing, heating, and electrical loading. In contrast to post-mortem analysis or qualitative visualization, this setup combines direct visualization of the mechanisms behind deformation with measurement of stress, strain, thermal and electrical properties. The unique research philosophy of visualizing the microstructure at high resolution while measuring the properties led to a fundamental understanding of grain size and temperature effects on measured mechanical properties such as fracture toughness. A key contribution is the role of mechanical loading boundary conditions to deconvolute the in-situ TEM based nanoscale and NMRS based mesoscale data to bulk behavior. First the literature based pellet cladding mechanical interaction model based on the work of Retel’s and Williamson’s in literature work to predict

  16. PhenoLink - a web-tool for linking phenotype to ~omics data for bacteria: application to gene-trait matching for Lactobacillus plantarum strains

    Directory of Open Access Journals (Sweden)

    Bayjanov Jumamurat R

    2012-05-01

    Background: Linking phenotypes to high-throughput molecular biology information generated by ~omics technologies allows revealing cellular mechanisms underlying an organism's phenotype. ~Omics datasets are often very large and noisy with many features (e.g., genes, metabolite abundances). Thus, associating phenotypes to ~omics data requires an approach that is robust to noise and can handle large and diverse data sets. Results: We developed a web-tool PhenoLink (http://bamics2.cmbi.ru.nl/websoftware/phenolink/) that links phenotype to ~omics data sets using well-established as well as new techniques. PhenoLink imputes missing values and preprocesses input data (i) to decrease inherent noise in the data and (ii) to counterbalance pitfalls of the Random Forest algorithm, on which feature (e.g., gene) selection is based. Preprocessed data is used in feature (e.g., gene) selection to identify relations to phenotypes. We applied PhenoLink to identify gene-phenotype relations based on the presence/absence of 2847 genes in 42 Lactobacillus plantarum strains and phenotypic measurements of these strains in several experimental conditions, including growth on sugars and nitrogen-dioxide production. Genes were ranked based on their importance (predictive value) to correctly predict the phenotype of a given strain. In addition to known gene to phenotype relations we also found novel relations. Conclusions: PhenoLink is an easily accessible web-tool to facilitate identifying relations from large and often noisy phenotype and ~omics datasets. Visualization of links to phenotypes offered in PhenoLink allows prioritizing links, finding relations between features, finding relations between phenotypes, and identifying outliers in phenotype data. PhenoLink can be used to uncover phenotype links to a multitude of ~omics data, e.g., gene presence/absence (determined by e.g.: CGH or next-generation sequencing), gene expression (determined by e.g.: microarrays or RNA

  17. Multiple Genome Sequences of Lactobacillus plantarum Strains

    OpenAIRE

    Kafka, Thomas A.; Geissler, Andreas J.; Vogel, Rudi F.

    2017-01-01

    We report here the genome sequences of four Lactobacillus plantarum strains which vary in surface hydrophobicity. Bioinformatic analysis, using additional genomes of Lactobacillus plantarum strains, revealed a possible correlation between the cell wall teichoic acid type and cell surface hydrophobicity and provides the basis for subsequent analyses.

  18. Comparative Genomic Analyses of Multiple Pseudomonas Strains Infecting Corylus avellana Trees Reveal the Occurrence of Two Genetic Clusters with Both Common and Distinctive Virulence and Fitness Traits

    Science.gov (United States)

    Marcelletti, Simone; Scortichini, Marco

    2015-01-01

    The European hazelnut (Corylus avellana) is threatened in Europe by several pseudomonads which cause symptoms ranging from twig dieback to tree death. A comparison of the draft genomes of nine Pseudomonas strains isolated from symptomatic C. avellana trees was performed to identify common and distinctive genomic traits. The thorough assessment of genetic relationships among the strains revealed two clearly distinct clusters: P. avellanae and P. syringae, the latter including the pathovars avellanae, coryli and syringae. Between these two clusters, no recombination event was found. A genomic island of approximately 20 kb, containing the hrp/hrc type III secretion system gene cluster, was found to be present without any genomic difference in all nine pseudomonads. The type III secretion system effector repertoires were remarkably different in the two groups, with P. avellanae showing a higher number of effectors. Homologue genes of the antimetabolite mangotoxin and ice nucleation activity clusters were found only in the P. syringae pathovar strains, and in all of them, whereas the siderophore yersiniabactin was only present in P. avellanae. All nine strains have genes coding for pectic enzymes and sucrose metabolism. By contrast, they do not have genes coding for indoleacetic acid and anti-insect toxin. Collectively, this study reveals that genomically different Pseudomonas can converge on the same host plant by suppressing the host defence mechanisms with the use of different virulence weapons. The integration into their genomes of a horizontally acquired genomic island could play a fundamental role in their evolution, perhaps giving them the ability to exploit new ecological niches. PMID:26147218

  19. A New Outlier Detection Method for Multidimensional Datasets

    KAUST Repository

    Abdel Messih, Mario A.

    2012-07-01

    This study develops a novel hybrid method for outlier detection (HMOD) that combines the idea of distance based and density based methods. The proposed method has two main advantages over most of the other outlier detection methods. The first advantage is that it works well on both dense and sparse datasets. The second advantage is that, unlike most other outlier detection methods that require careful parameter setting and prior knowledge of the data, HMOD is not very sensitive to small changes in parameter values within certain parameter ranges. The only required parameter to set is the number of nearest neighbors. In addition, we made a fully parallelized implementation of HMOD that made it very efficient in applications. Moreover, we proposed a new way of using the outlier detection for redundancy reduction in datasets where the confidence level that evaluates how accurate the less redundant dataset can be used to represent the original dataset can be specified by users. HMOD is evaluated on synthetic datasets (dense and mixed “dense and sparse”) and a bioinformatics problem of redundancy reduction of dataset of position weight matrices (PWMs) of transcription factor binding sites. In addition, in the process of assessing the performance of our redundancy reduction method, we developed a simple tool that can be used to evaluate the confidence level of reduced dataset representing the original dataset. The evaluation of the results shows that our method can be used in a wide range of problems.
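
    The general idea of combining a distance-based and a density-based criterion with the number of nearest neighbors as the only parameter can be sketched as follows. This is explicitly not the HMOD algorithm, whose details are not given in the abstract; it is a generic illustration in which the mean distance to the k nearest neighbors serves as the distance-based part and its ratio to the same quantity among those neighbors serves as the density-based part. The function names and the sample points are hypothetical.

```python
from math import dist


def nearest_neighbours(points, i, k):
    """Return the k nearest (distance, index) pairs for point i."""
    others = sorted(
        (dist(points[i], points[j]), j)
        for j in range(len(points)) if j != i
    )
    return others[:k]


def mean_knn_distance(points, i, k):
    """Distance-based part: mean distance to the k nearest neighbours."""
    return sum(d for d, _ in nearest_neighbours(points, i, k)) / k


def outlier_scores(points, k):
    """Density-based part: own kNN distance relative to the neighbours'."""
    avg = [mean_knn_distance(points, i, k) for i in range(len(points))]
    scores = []
    for i in range(len(points)):
        neighbours = [j for _, j in nearest_neighbours(points, i, k)]
        local = sum(avg[j] for j in neighbours) / k
        if local > 0:
            scores.append(avg[i] / local)
        else:
            # Neighbours are exact duplicates of each other.
            scores.append(1.0 if avg[i] == 0 else float("inf"))
    return scores


# Hypothetical 2-D data: a tight cluster and one distant point.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (10.0, 10.0)]
scores = outlier_scores(points, k=2)
print(scores)
```

    Scores near 1 indicate a point whose neighborhood is as dense as that of its neighbors, while larger scores flag candidates for outliers. Because the score is a ratio, it behaves similarly in dense and sparse regions, which is one common motivation for mixing the two families of methods.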

  20. Temporal dynamics of the developing lung transcriptome in three common inbred strains of laboratory mice reveals multiple stages of postnatal alveolar development

    Directory of Open Access Journals (Sweden)

    Kyle J. Beauchemin

    2016-08-01

    To characterize temporal patterns of transcriptional activity during normal lung development, we generated genome wide gene expression data for 26 pre- and post-natal time points in three common inbred strains of laboratory mice (C57BL/6J, A/J, and C3H/HeJ). Using Principal Component Analysis and least squares regression modeling, we identified both strain-independent and strain-dependent patterns of gene expression. The 4,683 genes contributing to the strain-independent expression patterns were used to define a murine Developing Lung Characteristic Subtranscriptome (mDLCS). Regression modeling of the Principal Components supported the four canonical stages of mammalian embryonic lung development (embryonic, pseudoglandular, canalicular, saccular) defined previously by morphology and histology. For postnatal alveolar development, the regression model was consistent with four stages of alveolarization characterized by episodic transcriptional activity of genes related to pulmonary vascularization. Genes expressed in a strain-dependent manner were enriched for annotations related to neurogenesis, extracellular matrix organization, and Wnt signaling. Finally, a comparison of mouse and human transcriptomics from pre-natal stages of lung development revealed conservation of pathways associated with cell cycle, axon guidance, immune function, and metabolism as well as organism-specific expression of genes associated with extracellular matrix organization and protein modification. The mouse lung development transcriptome data generated for this study serves as a unique reference set to identify genes and pathways essential for normal mammalian lung development and for investigations into the developmental origins of respiratory disease and cancer. The gene expression data are available from the Gene Expression Omnibus (GEO) archive (GSE74243).
Temporal expression patterns of mouse genes can be investigated using a study specific web resource (http://lungdevelopment.jax.org).

  1. Temporal dynamics of the developing lung transcriptome in three common inbred strains of laboratory mice reveals multiple stages of postnatal alveolar development.

    Science.gov (United States)

    Beauchemin, Kyle J; Wells, Julie M; Kho, Alvin T; Philip, Vivek M; Kamir, Daniela; Kohane, Isaac S; Graber, Joel H; Bult, Carol J

    2016-01-01

    To characterize temporal patterns of transcriptional activity during normal lung development, we generated genome wide gene expression data for 26 pre- and post-natal time points in three common inbred strains of laboratory mice (C57BL/6J, A/J, and C3H/HeJ). Using Principal Component Analysis and least squares regression modeling, we identified both strain-independent and strain-dependent patterns of gene expression. The 4,683 genes contributing to the strain-independent expression patterns were used to define a murine Developing Lung Characteristic Subtranscriptome (mDLCS). Regression modeling of the Principal Components supported the four canonical stages of mammalian embryonic lung development (embryonic, pseudoglandular, canalicular, saccular) defined previously by morphology and histology. For postnatal alveolar development, the regression model was consistent with four stages of alveolarization characterized by episodic transcriptional activity of genes related to pulmonary vascularization. Genes expressed in a strain-dependent manner were enriched for annotations related to neurogenesis, extracellular matrix organization, and Wnt signaling. Finally, a comparison of mouse and human transcriptomics from pre-natal stages of lung development revealed conservation of pathways associated with cell cycle, axon guidance, immune function, and metabolism as well as organism-specific expression of genes associated with extracellular matrix organization and protein modification. The mouse lung development transcriptome data generated for this study serves as a unique reference set to identify genes and pathways essential for normal mammalian lung development and for investigations into the developmental origins of respiratory disease and cancer. The gene expression data are available from the Gene Expression Omnibus (GEO) archive (GSE74243). 
Temporal expression patterns of mouse genes can be investigated using a study specific web resource (http://lungdevelopment.jax.org).

  2. A novel system for tracking social preference dynamics in mice reveals sex- and strain-specific characteristics.

    Science.gov (United States)

    Netser, Shai; Haskal, Shani; Magalnik, Hen; Wagner, Shlomo

    2017-01-01

    Deciphering the biological mechanisms underlying social behavior in animal models requires standard behavioral paradigms that can be unbiasedly employed in an observer- and laboratory-independent manner. During the past decade, the three-chamber test has become such a standard paradigm used to evaluate social preference (sociability) and social novelty preference in mice. This test suffers from several caveats, including its reliance on spatial navigation skills and neglect of behavioral dynamics. Here, we present a novel experimental apparatus and an automated analysis system which offer an alternative to the three-chamber test while solving the aforementioned caveats. The custom-made apparatus is simple to produce, and the analysis system is publicly available as open-source software, enabling its free use. We used this system to compare the dynamics of social behavior during the social preference and social novelty preference tests between male and female C57BL/6J mice. We found that in both tests, male mice keep their preference towards one of the stimuli for longer periods than females. We then employed our system to define several new parameters of social behavioral dynamics in mice and revealed that social preference behavior is segregated in time into two distinct phases. An early exploration phase, characterized by a high rate of transitions between stimuli and short bouts of stimulus investigation, is followed by an interaction phase with a low transition rate and prolonged interactions, mainly with the preferred stimulus. Finally, we compared the dynamics of social behavior between C57BL/6J and BTBR male mice, the latter of which are considered an asocial strain serving as a model for autism spectrum disorder. We found that BTBR mice (n = 8) showed a specific deficit in transition from the exploration phase to the interaction phase in the social preference test, suggesting a reduced tendency towards social interaction. We successfully

  3. NP-PAH Interaction Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — Dataset presents concentrations of organic pollutants, such as polyaromatic hydrocarbon compounds, in water samples. Water samples of known volume and concentration...

  4. A dataset on tail risk of commodities markets.

    Science.gov (United States)

    Powell, Robert J; Vo, Duc H; Pham, Thach N; Singh, Abhay K

    2017-12-01

    This article contains the datasets related to the research article "The long and short of commodity tails and their relationship to Asian equity markets" (Powell et al., 2017) [1]. The datasets contain the daily prices (and price movements) of 24 different commodities decomposed from the S&P GSCI index and the daily prices (and price movements) of three share market indices (World, Asia, and South East Asia) for the period 2004-2015. The dataset is then divided into annual periods, showing the worst 5% of price movements for each year. The datasets are convenient for examining the tail risk of different commodities, as measured by Conditional Value at Risk (CVaR), as well as changes in that risk over time. The datasets can also be used to investigate the association between commodity markets and share markets.
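
    As an illustration of the tail measure named in this record, the following minimal sketch computes a historical CVaR as the mean of the worst 5% of daily price movements. It is not taken from the article: the function names, the use of simple (not log) returns, and the rounding rule for the tail size are all assumptions made for the example.

```python
from typing import List, Sequence


def daily_returns(prices: Sequence[float]) -> List[float]:
    """Simple daily returns p[t] / p[t-1] - 1 (assumed definition of 'price movement')."""
    return [curr / prev - 1.0 for prev, curr in zip(prices, prices[1:])]


def historical_cvar(returns: Sequence[float], tail: float = 0.05) -> float:
    """Mean of the worst `tail` fraction of returns (historical CVaR estimate)."""
    if not returns:
        raise ValueError("returns must be non-empty")
    if not 0.0 < tail <= 1.0:
        raise ValueError("tail must be in (0, 1]")
    ordered = sorted(returns)                      # most negative returns first
    k = max(1, int(round(len(ordered) * tail)))    # number of observations in the tail
    worst = ordered[:k]
    return sum(worst) / k


if __name__ == "__main__":
    # Hypothetical returns, evenly spaced from -0.50 to 0.49.
    sample = [(i - 50) / 100 for i in range(100)]
    print(historical_cvar(sample, tail=0.05))
```

    Applied to one calendar year of returns at a time, the same function would give an annual tail figure per commodity; a more negative value indicates heavier losses in the worst days of that year.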

  5. Research on the Phenotypic Characterization of MRSA Strains Isolated from Animals

    Directory of Open Access Journals (Sweden)

    Iulia Maria BUCUR

    2017-05-01

    Full Text Available Keywords: chromogen, methicillin, MRSA, resistance Introduction: Currently, in staphylococci isolated both from animals with different diseases and from humans, MRSA (methicillin-resistant S. aureus) strains are monitored, as methicillin resistance is associated with resistance to other antibiotic groups. Methicillin resistance is encoded by staphylococcal chromosomal cassettes mec (SCCmec), which are islands of resistance. These strains can be identified by molecular biology tests and by tests that reveal several phenotypic characteristics. The research was carried out to characterize and identify phenotypically the MRSA strains isolated from animals. Materials and Methods: The research covered 240 coagulase-positive and coagulase-negative strains of staphylococci. Mannitol fermentation was tested on Chapman medium, free coagulase was revealed on Baird-Parker medium, and the chromogenic medium Chromatic Staph was used to identify S. aureus subsp. aureus. Methicillin-resistant strains were detected by the disc diffusion method, using biodiscs with methicillin, oxacillin and cefoxitin. The chromogenic medium Chromatic MRSA was also used to identify the MRSA strains. Results: The isolates were positive for mannitol and produced complete haemolysis or were non-haemolytic. A total of 44 strains produced free coagulase on Baird-Parker medium and were considered coagulase-positive strains, while 196 were coagulase-negative strains. The isolates responded differently to methicillin: 22.08% of strains were resistant, 51.25% were susceptible and 26.66% had intermediate resistance, while 42.91% of strains were resistant to oxacillin. The increased frequency of methicillin-resistant strains of staphylococci and, particularly, MRSA strains, was determined using the cefoxitin disk diffusion test, which is more reliable than the methicillin and oxacillin tests. On the MRSA chromogenic medium, the methicillin-resistant strains of staphylococci

  6. Proteomics dataset

    DEFF Research Database (Denmark)

    Bennike, Tue Bjerg; Carlsen, Thomas Gelsing; Ellingsen, Torkell

    2017-01-01

    patients (Morgan et al., 2012; Abraham and Medzhitov, 2011; Bennike, 2014) [8–10]. Therefore, we characterized the proteome of colon mucosa biopsies from 10 inflammatory bowel disease ulcerative colitis (UC) patients, 11 gastrointestinal healthy rheumatoid arthritis (RA) patients, and 10 controls. We...... been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifiers PXD001608 for ulcerative colitis and control samples, and PXD003082 for rheumatoid arthritis samples....

  7. Genome-wide comparison and taxonomic relatedness of multiple Xylella fastidiosa strains reveal the occurrence of three subspecies and a new Xylella species.

    Science.gov (United States)

    Marcelletti, Simone; Scortichini, Marco

    2016-10-01

    A total of 21 Xylella fastidiosa strains were assessed by comparing their genomes to infer their taxonomic relationships. Whole-genome-based average nucleotide identity and tetranucleotide frequency correlation coefficient analyses were performed. In addition, a consensus tree based on comparisons of 956 core gene families, and a genome-wide phylogenetic tree and a Neighbor-net network were constructed with 820,088 nucleotides (i.e., approximately 30-33% of the entire X. fastidiosa genome). All approaches revealed the occurrence of three well-demarcated genetic clusters that represent X. fastidiosa subspecies fastidiosa, multiplex and pauca, with the latter appearing to diverge. We suggest that the proposed but never formally described subspecies 'sandyi' and 'morus' are instead members of the subspecies fastidiosa. These analyses support the view that the Xylella strain isolated from Pyrus pyrifolia in Taiwan is likely to be a new species. A widely used multilocus sequence typing analysis yielded conflicting results.
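
    To make the idea of a tetranucleotide frequency correlation concrete, here is a simplified sketch. It counts overlapping 4-mers on a single strand and takes the Pearson correlation of the raw frequency vectors of two sequences. This is an assumption-laden simplification and not the analysis used in the study: established tools typically correlate z-scores of observed versus expected counts and also consider the reverse strand. All names and the toy sequences are hypothetical.

```python
from itertools import product
from math import sqrt
from typing import List

# All 256 possible tetranucleotides over the DNA alphabet, in a fixed order.
KMERS = ["".join(p) for p in product("ACGT", repeat=4)]


def tetra_freqs(seq: str) -> List[float]:
    """Relative frequencies of overlapping 4-mers; windows with non-ACGT symbols are skipped."""
    seq = seq.upper()
    counts = dict.fromkeys(KMERS, 0)
    for i in range(len(seq) - 3):
        kmer = seq[i:i + 4]
        if kmer in counts:
            counts[kmer] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no valid tetranucleotides")
    return [counts[k] / total for k in KMERS]


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient of two equal-length vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant vector")
    return cov / sqrt(var_x * var_y)


def tetra_correlation(seq_a: str, seq_b: str) -> float:
    """Correlation of raw tetranucleotide frequencies (simplified, single strand)."""
    return pearson(tetra_freqs(seq_a), tetra_freqs(seq_b))


if __name__ == "__main__":
    a = "ACGTTGCAAGGCTTAACCGGATAT" * 3   # toy sequences, not real genomes
    b = "AAAAAAAAACCCCCCCCC"
    print(tetra_correlation(a, a), tetra_correlation(a, b))
```

    Values close to 1 indicate similar oligonucleotide composition; on real genomes the comparison would be run on whole assemblies, not on short toy strings like these.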

  8. MODERNIZATION OF GENOTYPING OF STRAINS B. PERTUSSIS

    Directory of Open Access Journals (Sweden)

    G. A. Ivashinnikova

    2013-01-01

    Full Text Available A new rapid molecular genotyping method was developed for studying the structure of the ptxP promoter of pertussis toxin. The method is based on PCR-RFLP analysis, which allows the specific restriction profiles of B. pertussis strains to be studied and strains with ptxP structural particularities to be differentiated. The developed method for genotyping B. pertussis strains can be helpful for monitoring strains of the causative agent of whooping cough within an epidemiological surveillance system for pertussis infections, allowing observation of the circulating B. pertussis population, detection of strains with high production of pertussis toxin, and tracking of their distribution.

  9. Investigating country-specific music preferences and music recommendation algorithms with the LFM-1b dataset.

    Science.gov (United States)

    Schedl, Markus

    2017-01-01

    Recently, the LFM-1b dataset has been proposed to foster research and evaluation in music retrieval and music recommender systems, Schedl (Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR). New York, 2016). It contains more than one billion music listening events created by more than 120,000 users of Last.fm. Each listening event is characterized by artist, album, and track name, and further includes a timestamp. Basic demographic information and a selection of more elaborate listener-specific descriptors are included as well, for anonymized users. In this article, we reveal information about LFM-1b's acquisition and content and we compare it to existing datasets. We furthermore provide an extensive statistical analysis of the dataset, including basic properties of the item sets, demographic coverage, distribution of listening events (e.g., over artists and users), and aspects related to music preference and consumption behavior (e.g., temporal features and mainstreaminess of listeners). Exploiting country information of users and genre tags of artists, we also create taste profiles for populations and determine similar and dissimilar countries in terms of their populations' music preferences. Finally, we illustrate the dataset's usage in a simple artist recommendation task, whose results are intended to serve as baseline against which more elaborate techniques can be assessed.

  10. Comparison of Shallow Survey 2012 Multibeam Datasets

    Science.gov (United States)

    Ramirez, T. M.

    2012-12-01

    The purpose of the Shallow Survey common dataset is a comparison of the different technologies utilized for data acquisition in the shallow survey marine environment. The common dataset consists of a series of surveys conducted over a common area of seabed using a variety of systems. It provides equipment manufacturers the opportunity to showcase their latest systems while giving hydrographic researchers and scientists a chance to test their latest algorithms on the dataset so that rigorous comparisons can be made. Five companies (Kongsberg, Reson, R2Sonic, GeoAcoustics, and Applied Acoustics) collected data for the Common Dataset in the Wellington Harbor area in New Zealand between May 2010 and May 2011. Wellington Harbor and the surrounding coastal area were selected because they have a number of well-defined features, including the HMNZS South Seas and HMNZS Wellington wrecks, an armored seawall constructed of Tetrapods and Akmons, aquifers, wharves and marinas. The seabed inside the harbor basin is largely fine-grained sediment, with gravel and reefs around the coast. The area outside the harbor on the southern coast is an active environment, with moving sand and exposed reefs. A marine reserve is also in this area. For consistency between datasets, the coastal research vessel R/V Ikatere and crew were used for all surveys conducted for the common dataset. Using Triton's Perspective processing software, multibeam datasets collected for the Shallow Survey were processed for detailed analysis. Datasets from each sonar manufacturer were processed using the CUBE algorithm developed by the Center for Coastal and Ocean Mapping/Joint Hydrographic Center (CCOM/JHC). Each dataset was gridded at 0.5 and 1.0 meter resolutions for cross comparison and compliance with International Hydrographic Organization (IHO) requirements. Detailed comparisons were made of equipment specifications (transmit frequency, number of beams, beam width), data density, total uncertainty, and

  11. National Hydrography Dataset (NHD)

    Data.gov (United States)

    Kansas Data Access and Support Center — The National Hydrography Dataset (NHD) is a feature-based database that interconnects and uniquely identifies the stream segments or reaches that comprise the...

  12. Resistance of Permafrost and Modern Acinetobacter lwoffii Strains to Heavy Metals and Arsenic Revealed by Genome Analysis.

    Science.gov (United States)

    Mindlin, Sofia; Petrenko, Anatolii; Kurakov, Anton; Beletsky, Alexey; Mardanov, Andrey; Petrova, Mayya

    2016-01-01

    We performed whole-genome sequencing of five permafrost strains of Acinetobacter lwoffii (frozen for 15-3000 thousand years) and analyzed their resistance genes found in plasmids and chromosomes. Four strains contained multiple plasmids (8-12), which varied significantly in size (from 4,135 to 287,630 bp) and genetic structure; the fifth strain contained only two plasmids. All large plasmids and some medium-size and small plasmids contained genes encoding resistance to various heavy metals, including mercury, cobalt, zinc, cadmium, copper, chromium, and arsenic compounds. Most resistance genes found in the ancient strains of A. lwoffii had their closely related counterparts in modern clinical A. lwoffii strains that were also located on plasmids. The vast majority of the chromosomal resistance determinants did not possess complete sets of the resistance genes or contained truncated genes. Comparative analysis of various A. lwoffii and of A. baumannii strains discovered a number of differences between them: (i) chromosome sizes in A. baumannii exceeded those in A. lwoffii by about 20%; (ii) on the contrary, the number of plasmids in A. lwoffii and their total size were much higher than those in A. baumannii; (iii) heavy metal resistance genes in the environmental A. lwoffii strains surpassed those in A. baumannii strains in the number and diversity and were predominantly located on plasmids. Possible reasons for these differences are discussed.

  13. The Harvard organic photovoltaic dataset.

    Science.gov (United States)

    Lopez, Steven A; Pyzer-Knapp, Edward O; Simm, Gregor N; Lutzow, Trevor; Li, Kewei; Seress, Laszlo R; Hachmann, Johannes; Aspuru-Guzik, Alán

    2016-09-27

    The Harvard Organic Photovoltaic Dataset (HOPV15) presented in this work is a collation of experimental photovoltaic data from the literature, and corresponding quantum-chemical calculations performed over a range of conformers, each with quantum chemical results using a variety of density functionals and basis sets. It is anticipated that this dataset will be of use in both relating electronic structure calculations to experimental observations through the generation of calibration schemes, as well as for the creation of new semi-empirical methods and the benchmarking of current and future model chemistries for organic electronic applications.

  14. Molecular characterization of the probiotic strain Bacillus cereus var. toyoi NCIMB 40112 and differentiation from food poisoning strains.

    Science.gov (United States)

    Klein, Günter

    2011-07-01

    Bacillus cereus var. toyoi strain NCIMB 40112 (Toyocerin), a probiotic authorized in the European Union as feed additive for swine, bovines, poultry, and rabbits, was characterized by DNA fingerprinting applying pulsed-field gel electrophoresis and multilocus sequence typing and was compared with reference strains (of clinical and environmental origins). The probiotic strain was clearly characterized by pulsed-field gel electrophoresis using the restriction enzymes Apa I and Sma I resulting in unique DNA patterns. The comparison to the clinical reference strain B. cereus DSM 4312 was done with the same restriction enzymes, and again a clear differentiation of the two strains was possible by the resulting DNA patterns. The use of the restriction enzymes Apa I and Sma I is recommended for further studies. Furthermore, multilocus sequence typing analysis revealed a sequence type (ST 111) that was different from all known STs of B. cereus strains from food poisoning incidents. Thus, a strain characterization and differentiation from food poisoning strains for the probiotic strain was possible. Copyright ©, International Association for Food Protection

  15. Tables and figure datasets

    Data.gov (United States)

    U.S. Environmental Protection Agency — Soil and air concentrations of asbestos in Sumas study. This dataset is associated with the following publication: Wroble, J., T. Frederick, A. Frame, and D....

  16. Genome-Wide Transcription Study of Cryptococcus neoformans H99 Clinical Strain versus Environmental Strains.

    Directory of Open Access Journals (Sweden)

    Elaheh Movahed

    Full Text Available Infection with Cryptococcus neoformans is acquired through the inhalation of desiccated yeast cells and basidiospores originating from the environment, particularly from bird droppings and decaying wood. Three environmental strains of C. neoformans originating from bird droppings (H4, S48B and S68B) and the C. neoformans reference clinical strain (H99) were used for intranasal infection in C57BL/6 mice. We showed that the H99 strain demonstrated higher virulence compared to the H4, S48B and S68B strains. To examine whether gene expression contributed to the different degrees of virulence among these strains, a genome-wide microarray study was performed to inspect the transcriptomic profiles of all four strains. Our results revealed that out of 7,419 genes (22,257 probes) examined, 65 genes were significantly up- or down-regulated in H99 versus the H4, S48B and S68B strains. The up-regulated genes in the H99 strain include Hydroxymethylglutaryl-CoA synthase (MVA1), Mitochondrial matrix factor 1 (MMF1), Bud-site-selection protein 8 (BUD8), High affinity glucose transporter 3 (SNF3) and Rho GTPase-activating protein 2 (RGA2). Pathway annotation using the DAVID bioinformatics resource showed that metal ion binding and sugar transmembrane transporter activity pathways were highly expressed in the H99 strain. We suggest that the genes and pathways identified may play crucial roles in fungal pathogenesis.

  17. Genome sequencing and analysis of BCG vaccine strains.

    Directory of Open Access Journals (Sweden)

    Wen Zhang

    Full Text Available BACKGROUND: Although the Bacillus Calmette-Guérin (BCG) vaccine against tuberculosis (TB) has been available for more than 75 years, one third of the world's population is still infected with Mycobacterium tuberculosis and approximately 2 million people die of TB every year. To reduce this immense TB burden, a clearer understanding of the functional genes underlying the action of BCG and the development of new vaccines are urgently needed. METHODS AND FINDINGS: Comparative genomic analysis of 19 M. tuberculosis complex strains showed that BCG strains underwent repeated human manipulation, had higher region of deletion rates than those of natural M. tuberculosis strains, and lost several essential components such as T-cell epitopes. A total of 188 BCG strain T-cell epitopes were lost to various degrees. The non-virulent BCG Tokyo strain, which has the largest number of T-cell epitopes (359), lost 124. Here we propose that BCG strain protection variability results from different epitopes. This study is the first to present BCG as a model organism for genetics research. BCG strains have a very well-documented history and now detailed genome information. Genome comparison revealed the selection process of BCG strains under human manipulation (1908-1966). CONCLUSIONS: Our results revealed the cause of BCG vaccine strain protection variability at the genome level and supported the hypothesis that the restoration of lost BCG Tokyo epitopes is a useful future vaccine development strategy. Furthermore, these detailed BCG vaccine genome investigation results will be useful in microbial genetics, microbial engineering and other research fields.

  18. PHYSICS PERFORMANCE AND DATASET (PPD)

    CERN Multimedia

    L. Silvestris

    2013-01-01

    The first part of the Long Shutdown period has been dedicated to the preparation of the samples for the analysis targeting the summer conferences. In particular, the 8 TeV data acquired in 2012, including most of the “parked datasets”, have been reconstructed, profiting from improved alignment and calibration conditions for all the sub-detectors. Careful planning of the resources was essential in order to deliver the datasets to the analysts well in time, and to schedule the update of all the conditions and calibrations needed at the analysis level. The newly reprocessed data have undergone detailed scrutiny by the Dataset Certification team, making it possible to recover some of the data for analysis usage and further improving the certification efficiency, which is now at 91% of the recorded luminosity. With the aim of delivering a consistent dataset for 2011 and 2012, both in terms of conditions and release (53X), the PPD team is now working to set up a data re-reconstruction and a new MC pro...

  19. Integrated Surface Dataset (Global)

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Integrated Surface Dataset (ISD) is composed of worldwide surface weather observations from over 35,000 stations, though the best spatial coverage is...

  20. Aaron Journal article datasets

    Data.gov (United States)

    U.S. Environmental Protection Agency — All figures used in the journal article are in netCDF format. This dataset is associated with the following publication: Sims, A., K. Alapaty, and S. Raman....

  1. Market Squid Ecology Dataset

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — This dataset contains ecological information collected on the major adult spawning and juvenile habitats of market squid off California and the US Pacific Northwest....

  2. Solitary waves in morphogenesis: Determination fronts as strain-cued strain transformations among automatous cells

    Science.gov (United States)

    Cox, Brian N.; Landis, Chad M.

    2018-02-01

    We present a simple theory of a strain pulse propagating as a solitary wave through a continuous two-dimensional population of cells. A critical strain is assumed to trigger a strain transformation, while, simultaneously, cells move as automata to tend to restore a preferred cell density. We consider systems in which the strain transformation is a shape change, a burst of proliferation, or the commencement of growth (which changes the shape of the population sheet), and demonstrate isomorphism among these cases. Numerical and analytical solutions describe a strain pulse whose height does not depend on how the strain disturbance was first launched, or the rate at which the strain transformation is achieved, or the rate constant in the rule for the restorative cell motion. The strain pulse is therefore very stable, surviving the imposition of strong perturbations: it would serve well as a timing signal in development. The automatous wave formulation is simple, with few model parameters. A strong case exists for the presence of a strain pulse during amelogenesis. Quantitative analysis reveals a simple relationship between the velocity of the leading edge of the pulse in amelogenesis and the known speed of migration of ameloblast cells. This result and energy arguments support the depiction of wave motion as an automatous cell response to strain, rather than as a response to an elastic energy gradient. The theory may also contribute to understanding the determination front in somitogenesis, moving fronts of convergent-extension transformation, and mitotic wavefronts in the syncytial drosophila embryo.

  3. Genome analysis coupled with physiological studies reveals a diverse nitrogen metabolism in Methylocystis sp. strain SC2.

    Directory of Open Access Journals (Sweden)

    Bomba Dam

    Full Text Available BACKGROUND: Methylocystis sp. strain SC2 can adapt to a wide range of methane concentrations. This is due to the presence of two isozymes of particulate methane monooxygenase exhibiting different methane oxidation kinetics. To gain insight into the underlying genetic information, its genome was sequenced and found to comprise a 3.77 Mb chromosome and two large plasmids. PRINCIPAL FINDINGS: We report important features of the strain SC2 genome. Its sequence is compared with those of seven other methanotroph genomes, comprising members of the Alphaproteobacteria, Gammaproteobacteria, and Verrucomicrobia. While the pan-genome of all eight methanotroph genomes totals 19,358 CDS, only 154 CDS are shared. The number of core genes increased with phylogenetic relatedness: 328 CDS for proteobacterial methanotrophs and 1,853 CDS for the three alphaproteobacterial Methylocystaceae members, Methylocystis sp. strain SC2 and strain Rockwell, and Methylosinus trichosporium OB3b. The comparative study was coupled with physiological experiments to verify that strain SC2 has diverse nitrogen metabolism capabilities. In correspondence to a full complement of 34 genes involved in N2 fixation, strain SC2 was found to grow with atmospheric N2 as the sole nitrogen source, preferably at low oxygen concentrations. Denitrification-mediated accumulation of 0.7 nmol 30N2/hr/mg dry weight of cells under anoxic conditions was detected by tracer analysis. N2 production is related to the activities of plasmid-borne nitric oxide and nitrous oxide reductases. CONCLUSIONS/PERSPECTIVES: Presence of a complete denitrification pathway in strain SC2, including the plasmid-encoded nosRZDFYX operon, is unique among known methanotrophs. However, the exact ecophysiological role of this pathway still needs to be elucidated. Detoxification of toxic nitrogen compounds and energy conservation under oxygen-limiting conditions are among the possible roles. Relevant features that may stimulate

  4. Strain distributions and their influence on electronic structures of WSe2-MoS2 laterally strained heterojunctions

    Science.gov (United States)

    Zhang, Chendong; Li, Ming-Yang; Tersoff, Jerry; Han, Yimo; Su, Yushan; Li, Lain-Jong; Muller, David A.; Shih, Chih-Kang

    2018-02-01

    Monolayer transition metal dichalcogenide heterojunctions, including vertical and lateral p-n junctions, have attracted considerable attention due to their potential applications in electronics and optoelectronics. Lattice-misfit strain in atomically abrupt lateral heterojunctions, such as WSe2-MoS2, offers a new band-engineering strategy for tailoring their electronic properties. However, this approach requires an understanding of the strain distribution and its effect on band alignment. Here, we study a WSe2-MoS2 lateral heterojunction using scanning tunnelling microscopy and image its moiré pattern to map the full two-dimensional strain tensor with high spatial resolution. Using scanning tunnelling spectroscopy, we measure both the strain and the band alignment of the WSe2-MoS2 lateral heterojunction. We find that the misfit strain induces type II to type I band alignment transformation. Scanning transmission electron microscopy reveals the dislocations at the interface that partially relieve the strain. Finally, we observe a distinctive electronic structure at the interface due to hetero-bonding.

  5. Antifungal susceptibility profiles of 1698 yeast reference strains revealing potential emerging human pathogens.

    Directory of Open Access Journals (Sweden)

    Marie Desnos-Ollivier

    Full Text Available New molecular identification techniques and the increased number of patients with various immune defects or underlying conditions lead to the emergence and/or the description of novel species of human and animal fungal opportunistic pathogens. Antifungal susceptibility provides important information for ecological, epidemiological and therapeutic issues. The aim of this study was to assess the potential risk of the various species based on their antifungal drug resistance, keeping in mind the methodological limitations. Antifungal susceptibility profiles to the five classes of antifungal drugs (polyenes, azoles, echinocandins, allylamines and antimetabolites) were determined for 1698 yeast reference strains belonging to 992 species (634 Ascomycetes and 358 Basidiomycetes). Interestingly, geometric mean minimum inhibitory concentrations (MICs) of all antifungal drugs tested were significantly higher for Basidiomycetes compared to Ascomycetes (p<0.001). Twenty-four strains belonging to 23 species, of which 19 were Basidiomycetes, seem to be intrinsically "resistant" to all drugs. Comparison of the antifungal susceptibility profiles of the 4240 clinical isolates and the 315 reference strains belonging to 53 shared species showed similar results. Even in the absence of demonstrated in vitro/in vivo correlation, knowing the in vitro susceptibility to systemic antifungal agents and the putative intrinsic resistance of yeast species present in the environment is important because they could become opportunistic pathogens.
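
    The geometric mean MIC mentioned in this record can be illustrated with a short sketch. MICs are commonly read on a two-fold dilution scale, so averaging on the log scale is the usual summary. The function name and the example values below are hypothetical and are not data from the study.

```python
from math import exp, log
from typing import Sequence


def geometric_mean_mic(mics: Sequence[float]) -> float:
    """Geometric mean of MIC values (e.g. in mg/L), computed as exp(mean(log(x)))."""
    if not mics:
        raise ValueError("at least one MIC value is required")
    if any(m <= 0 for m in mics):
        raise ValueError("MIC values must be strictly positive")
    return exp(sum(log(m) for m in mics) / len(mics))


if __name__ == "__main__":
    # Hypothetical MICs for two groups of strains (illustrative numbers only).
    group_a = [0.25, 0.5, 1.0]
    group_b = [2.0, 4.0, 8.0]
    print(geometric_mean_mic(group_a), geometric_mean_mic(group_b))
```

    Comparing the two group means only describes the data; a significance statement such as the p-value quoted in the abstract would need a separate statistical test.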

  6. ATLAS File and Dataset Metadata Collection and Use

    CERN Document Server

    Albrand, S; The ATLAS collaboration; Lambert, F; Gallas, E J

    2012-01-01

    The ATLAS Metadata Interface (“AMI”) was designed as a generic cataloguing system, and as such it has found many uses in the experiment including software release management, tracking of reconstructed event sizes and control of dataset nomenclature. The primary use of AMI is to provide a catalogue of datasets (file collections) which is searchable using physics criteria. In this paper we discuss the various mechanisms used for filling the AMI dataset and file catalogues. By correlating information from different sources we can derive aggregate information which is important for physics analysis; for example the total number of events contained in a dataset, and possible reasons for missing events such as a lost file. Finally we will describe some specialized interfaces which were developed for the Data Preparation and reprocessing coordinators. These interfaces manipulate information from both the dataset domain held in AMI, and the run-indexed information held in the ATLAS COMA application (Conditions and ...

  7. Influence of strain on dislocation core in silicon

    Science.gov (United States)

    Pizzagalli, L.; Godet, J.; Brochard, S.

    2018-05-01

    First principles, density functional-based tight binding and semi-empirical interatomic potentials calculations are performed to analyse the influence of large strains on the structure and stability of a 60° dislocation in silicon. Such strains typically arise during the mechanical testing of nanostructures like nanopillars or nanoparticles. We focus on bi-axial strains in the plane normal to the dislocation line. Our calculations surprisingly reveal that the dislocation core structure largely depends on the applied strain, for strain levels of about 5%. In the particular case of bi-axial compression, the transformation of the dislocation to a locally disordered configuration occurs for similar strain magnitudes. The formation of an opening, however, requires larger strains, of about 7.5%. Furthermore, our results suggest that electronic structure methods should be favoured to model dislocation cores in case of large strains whenever possible.

  8. Norwegian Hydrological Reference Dataset for Climate Change Studies

    Energy Technology Data Exchange (ETDEWEB)

    Magnussen, Inger Helene; Killingland, Magnus; Spilde, Dag

    2012-07-01

    Based on the Norwegian hydrological measurement network, NVE has selected a Hydrological Reference Dataset for studies of hydrological change. The dataset meets international standards with high data quality. It is suitable for monitoring and studying the effects of climate change on the hydrosphere and cryosphere in Norway. The dataset includes streamflow, groundwater, snow, glacier mass balance and length change, lake ice and water temperature in rivers and lakes. (Author)

  9. The Harvard organic photovoltaic dataset

    Science.gov (United States)

    Lopez, Steven A.; Pyzer-Knapp, Edward O.; Simm, Gregor N.; Lutzow, Trevor; Li, Kewei; Seress, Laszlo R.; Hachmann, Johannes; Aspuru-Guzik, Alán

    2016-01-01

    The Harvard Organic Photovoltaic Dataset (HOPV15) presented in this work is a collation of experimental photovoltaic data from the literature, and corresponding quantum-chemical calculations performed over a range of conformers, each with quantum chemical results using a variety of density functionals and basis sets. It is anticipated that this dataset will be of use in both relating electronic structure calculations to experimental observations through the generation of calibration schemes, as well as for the creation of new semi-empirical methods and the benchmarking of current and future model chemistries for organic electronic applications. PMID:27676312

  10. Myocardial strain assessment by cine cardiac magnetic resonance imaging using non-rigid registration.

    Science.gov (United States)

    Tsadok, Yossi; Friedman, Zvi; Haluska, Brian A; Hoffmann, Rainer; Adam, Dan

    2016-05-01

    To evaluate a novel post-processing method for assessment of longitudinal mid-myocardial strain in standard cine cardiac magnetic resonance (CMR) imaging sequences. Cine CMR imaging and tagged cardiac magnetic resonance imaging (TMRI) were performed in 15 patients with acute myocardial infarction (AMI) and 15 healthy volunteers served as control group. A second group of 37 post-AMI patients underwent both cine CMR and late gadolinium enhancement (LGE) CMR exams. Speckle tracking echocardiography (STE) was performed in 36 of these patients. Cine CMR, TMRI and STE were analyzed to obtain longitudinal strain. LGE-CMR datasets were analyzed to evaluate scar extent. Comparison of peak systolic strain (PSS) measured from CMR and TMRI yielded a strong correlation (r=0.86, pcine CMR data. The method was found to be highly correlated with strain measurements obtained by TMRI and STE. This tool allows accurate discrimination between different transmurality states of myocardial infarction. Copyright © 2015 Elsevier Inc. All rights reserved.

  11. Synthetic and Empirical Capsicum Annuum Image Dataset

    NARCIS (Netherlands)

    Barth, R.

    2016-01-01

    This dataset consists of per-pixel annotated synthetic (10500) and empirical images (50) of Capsicum annuum, also known as sweet or bell pepper, situated in a commercial greenhouse. Furthermore, the source models to generate the synthetic images are included. The aim of the datasets are to

  12. Differential stress transcriptome landscape of historic and recently emerged hypervirulent strains of Clostridium difficile strains determined using RNA-seq.

    Directory of Open Access Journals (Sweden)

    Joy Scaria

    C. difficile is the most common cause of nosocomial diarrhea in North America and Europe. Genomes of individual strains of C. difficile are highly divergent. To determine how divergent strains respond to environmental changes, the transcriptomes of two historic and two recently isolated hypervirulent strains were analyzed following nutrient shift and osmotic shock. Illumina based RNA-seq was used to sequence these transcriptomes. Our results reveal that although C. difficile strains contain a large number of shared and strain specific genes, the majority of the differentially expressed genes were core genes. We also detected a number of transcriptionally active regions that were not part of the primary genome annotation. Some of these are likely to be small regulatory RNAs.

  13. High strain and strain-rate behaviour of PTFE/aluminium/tungsten mixtures

    International Nuclear Information System (INIS)

    Addiss, John; Walley, Stephen; Proud, William; Cai Jing; Nesterenko, Vitali

    2007-01-01

    Conventional drop-weight techniques were modified to accommodate low-amplitude force transducer signals from low-strength, cold isostatically pressed 'heavy' composites of polytetrafluoroethylene, aluminum and tungsten (W). The failure strength, strain and the post-critical behavior of failed samples were measured for samples of different porosity and tungsten grain size. An unusual phenomenon of significantly higher strength (55 MPa) of porous composites (density 5.9 g/cm³) with small W particles ( 3 ) with larger W particles (44 μm) at the same volume content of components was observed. This is attributed to force chains created by a network of small W particles. Interrupted tests at different levels of strain revealed the mechanisms of fracture under dynamic compression.

  14. Draft genome sequence of two Sphingopyxis sp. strains H107 and H115 isolated from a chloraminated drinking water distribution system simulator

    Data.gov (United States)

    U.S. Environmental Protection Agency — Draft genome sequence of two Sphingopyxis sp. strains H107 and H115 isolated from a chloraminated drinking water distribution system simulator. This dataset is...

  15. EEG datasets for motor imagery brain-computer interface.

    Science.gov (United States)

    Cho, Hohyun; Ahn, Minkyu; Ahn, Sangtae; Kwon, Moonyoung; Jun, Sung Chan

    2017-07-01

    Most investigators of brain-computer interface (BCI) research believe that BCI can be achieved through induced neuronal activity from the cortex, but not by evoked neuronal activity. Motor imagery (MI)-based BCI is one of the standard concepts of BCI, in that the user can generate induced activity by imagining motor movements. However, variations in performance over sessions and subjects are too severe to overcome easily; therefore, a basic understanding and investigation of BCI performance variation is necessary to find critical evidence of performance variation. Here we present not only EEG datasets for MI BCI from 52 subjects, but also the results of a psychological and physiological questionnaire, EMG datasets, the locations of 3D EEG electrodes, and EEGs for non-task-related states. We validated our EEG datasets by using the percentage of bad trials, event-related desynchronization/synchronization (ERD/ERS) analysis, and classification analysis. After conventional rejection of bad trials, we showed contralateral ERD and ipsilateral ERS in the somatosensory area, which are well-known patterns of MI. Finally, we showed that 73.08% of datasets (38 subjects) included reasonably discriminative information. Our EEG datasets included the information necessary to determine statistical significance; they consisted of well-discriminated datasets (38 subjects) and less-discriminative datasets. These may provide researchers with opportunities to investigate human factors related to MI BCI performance variation, and may also achieve subject-to-subject transfer by using metadata, including a questionnaire, EEG coordinates, and EEGs for non-task-related states. © The Authors 2017. Published by Oxford University Press.

  16. Comparative genome analysis of pathogenic and non-pathogenic Clavibacter strains reveals adaptations to their lifestyle

    OpenAIRE

    Załuga, Joanna; Stragier, Pieter; Baeyen, Steve; Haegeman, Annelies; Van Vaerenbergh, Johan; Maes, Martine; De Vos, Paul

    2014-01-01

    Background The genus Clavibacter harbors economically important plant pathogens infecting agricultural crops such as potato and tomato. Although the vast majority of Clavibacter strains are pathogenic, there is an increasing number of non-pathogenic isolates reported. Non-pathogenic Clavibacter strains isolated from tomato seeds are particularly problematic because they affect the current detection and identification tests for Clavibacter michiganensis subsp. michiganensis (Cmm), which is reg...

  17. ASSISTments Dataset from Multiple Randomized Controlled Experiments

    Science.gov (United States)

    Selent, Douglas; Patikorn, Thanaporn; Heffernan, Neil

    2016-01-01

    In this paper, we present a dataset consisting of data generated from 22 previously and currently running randomized controlled experiments inside the ASSISTments online learning platform. This dataset provides data mining opportunities for researchers to analyze ASSISTments data in a convenient format across multiple experiments at the same time.…

  18. Would the ‘real’ observed dataset stand up? A critical examination of eight observed gridded climate datasets for China

    International Nuclear Information System (INIS)

    Sun, Qiaohong; Miao, Chiyuan; Duan, Qingyun; Kong, Dongxian; Ye, Aizhong; Di, Zhenhua; Gong, Wei

    2014-01-01

    This research compared and evaluated the spatio-temporal similarities and differences of eight widely used gridded datasets. The datasets include daily precipitation over East Asia (EA), the Climate Research Unit (CRU) product, the Global Precipitation Climatology Centre (GPCC) product, the University of Delaware (UDEL) product, Precipitation Reconstruction over Land (PREC/L), the Asian Precipitation Highly Resolved Observational (APHRO) product, the Institute of Atmospheric Physics (IAP) dataset from the Chinese Academy of Sciences, and the National Meteorological Information Center dataset from the China Meteorological Administration (CN05). The meteorological variables focus on surface air temperature (SAT) or precipitation (PR) in China. All datasets presented general agreement on the whole spatio-temporal scale, but some differences appeared for specific periods and regions. On a temporal scale, EA shows the highest amount of PR, while APHRO shows the lowest. CRU and UDEL show higher SAT than IAP or CN05. On a spatial scale, the most significant differences occur in western China for PR and SAT. For PR, the difference between EA and CRU is the largest. When compared with CN05, CRU shows higher SAT in the central and southern Northwest river drainage basin, UDEL exhibits higher SAT over the Southwest river drainage system, and IAP has lower SAT in the Tibetan Plateau. The differences in annual mean PR and SAT primarily come from summer and winter, respectively. Finally, potential factors impacting agreement among gridded climate datasets are discussed, including raw data sources, quality control (QC) schemes, orographic correction, and interpolation techniques. The implications and challenges of these results for climate research are also briefly addressed. (paper)

  19. Estimating parameters for probabilistic linkage of privacy-preserved datasets.

    Science.gov (United States)

    Brown, Adrian P; Randall, Sean M; Ferrante, Anna M; Semmens, James B; Boyd, James H

    2017-07-10

    Probabilistic record linkage is a process used to bring together person-based records from within the same dataset (de-duplication) or from disparate datasets using pairwise comparisons and matching probabilities. The linkage strategy and associated match probabilities are often estimated through investigations into data quality and manual inspection. However, as privacy-preserved datasets comprise encrypted data, such methods are not possible. In this paper, we present a method for estimating the probabilities and threshold values for probabilistic privacy-preserved record linkage using Bloom filters. Our method was tested through a simulation study using synthetic data, followed by an application using real-world administrative data. Synthetic datasets were generated with error rates ranging from zero to 20%. Our method was used to estimate parameters (probabilities and thresholds) for de-duplication linkages. Linkage quality was determined by F-measure. Each dataset was privacy-preserved using separate Bloom filters for each field. Match probabilities were estimated using the expectation-maximisation (EM) algorithm on the privacy-preserved data. Threshold cut-off values were determined by an extension to the EM algorithm allowing linkage quality to be estimated for each possible threshold. De-duplication linkages of each privacy-preserved dataset were performed using both estimated and calculated probabilities. Linkage quality using the F-measure at the estimated threshold values was also compared to the highest F-measure. Three large administrative datasets were used to demonstrate the applicability of the probability and threshold estimation technique on real-world data. Linkage of the synthetic datasets using the estimated probabilities produced an F-measure that was comparable to the F-measure using calculated probabilities, even with up to 20% error. Linkage of the administrative datasets using estimated probabilities produced an F-measure that was higher
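
    The field-level Bloom filter encoding and the F-measure mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not give the encoding details, so everything specific here is an assumption for illustration: the character-bigram tokenisation, the SHA-256 based hashing, the filter size, the Dice coefficient as the comparison score, and all function names. It is not the authors' implementation, and it omits the EM estimation step entirely.

```python
import hashlib


def bigrams(value):
    """Split a padded, lower-cased string into its set of character bigrams."""
    padded = f"_{value.lower()}_"
    return {padded[i:i + 2] for i in range(len(padded) - 1)}


def bloom_encode(value, size=1000, num_hashes=2):
    """Encode one field value as the set of bit positions set in a Bloom filter.

    size and num_hashes are illustrative choices, not values from the paper.
    """
    bits = set()
    for gram in bigrams(value):
        for seed in range(num_hashes):
            digest = hashlib.sha256(f"{seed}:{gram}".encode("utf-8")).hexdigest()
            bits.add(int(digest, 16) % size)
    return bits


def dice(bits_a, bits_b):
    """Dice coefficient between two Bloom filters given as bit-position sets."""
    if not bits_a and not bits_b:
        return 1.0
    return 2 * len(bits_a & bits_b) / (len(bits_a) + len(bits_b))


def f_measure(true_positives, false_positives, false_negatives):
    """Harmonic mean of precision and recall, written in its count form."""
    denominator = 2 * true_positives + false_positives + false_negatives
    return 2 * true_positives / denominator if denominator else 0.0


# Similar surnames share most bigrams, so their encoded filters overlap heavily.
similar = dice(bloom_encode("smith"), bloom_encode("smyth"))
different = dice(bloom_encode("smith"), bloom_encode("jones"))
```

    In a linkage run, a pair of records would be declared a match when its combined field scores exceed a threshold, and the F-measure would then be computed from the resulting true and false matches.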

  20. Viking Seismometer PDS Archive Dataset

    Science.gov (United States)

    Lorenz, R. D.

    2016-12-01

    The Viking Lander 2 seismometer operated successfully for over 500 Sols on the Martian surface, recording at least one likely candidate Marsquake. The Viking mission, in an era when data handling hardware (both on board and on the ground) was limited in capability, predated modern planetary data archiving, and ad-hoc repositories of the data, and the very low-level record at NSSDC, were neither convenient to process nor well-known. In an effort supported by the NASA Mars Data Analysis Program, we have converted the bulk of the Viking dataset (namely the 49,000 and 270,000 records made in High- and Event- modes at 20 and 1 Hz respectively) into a simple ASCII table format. Additionally, since wind-generated lander motion is a major component of the signal, contemporaneous meteorological data are included in summary records to facilitate correlation. These datasets are being archived at the PDS Geosciences Node. In addition to brief instrument and dataset descriptions, the archive includes code snippets in the freely-available language 'R' to demonstrate plotting and analysis. Further, we present examples of lander-generated noise, associated with the sampler arm, instrument dumps and other mechanical operations.

  1. Origin of the Strain Sensitivity for an Organic Heptazole Thin-Film and Its Strain Gauge Application

    Science.gov (United States)

    Bae, Heesun; Jeon, Pyo Jin; Park, Ji Hoon; Lee, Kimoon

    2018-04-01

    The authors report on the origin of the strain sensitivity for an organic C26H16N2 (heptazole) thin-film and its application for the detection of tensile strain. From the electrical characterization on the thin-film transistor adopting a heptazole channel, heptazole film exhibits p-channel conduction with a relatively low value of field-effect mobility (0.05 cm²/Vs), suggesting a hopping conduction behavior via hole carriers. By analyzing the strain and temperature dependences of the electrical conductivity, we reveal that the electrical conduction for a heptazole thin-film is dominated by the variable range hopping process with quite a large energy separation (224.9 meV) between the localized states under a relatively long attenuation length (10.46 Å). This indicates that a change in the inter-grain spacing that is much larger than the attenuation length is responsible for the reversible modification of electrical conductivity depending on strain for the heptazole film. By utilizing our heptazole thin-film both as a strain sensitive passive resistor and an active semiconducting channel layer, we can achieve a strain gauge device exhibiting reversible endurance for tensile strains up to 2.12%. Consequently, this study advances the understanding of the fundamental strain sensing mechanism in a heptazole thin-film toward finding a promising strain gauge material for potential applications in flexible devices and/or wearable electronics.

  2. Mobilomics in Saccharomyces cerevisiae strains.

    Science.gov (United States)

    Menconi, Giulia; Battaglia, Giovanni; Grossi, Roberto; Pisanti, Nadia; Marangoni, Roberto

    2013-03-20

    Mobile Genetic Elements (MGEs) are selfish DNA integrated in the genomes. Their detection is mainly based on consensus-like searches by scanning the investigated genome against the sequence of an already identified MGE. Mobilomics aims at discovering all the MGEs in a genome and understanding their dynamic behavior; the data for this kind of investigation can be provided by comparative genomics of closely related organisms. The amount of data thus involved requires a strong computational effort, which should be alleviated. Our approach proposes to exploit the high similarity among homologous chromosomes of different strains of the same species, following a progressive comparative genomics philosophy. We introduce a software tool based on our new fast algorithm, called regender, which is able to identify the conserved regions between chromosomes. Our case study is represented by a unique recently available dataset of 39 different strains of S. cerevisiae, which regender is able to compare in a few minutes. By exploring the non-conserved regions, where MGEs are mainly retrotransposons called Tys, and marking the candidate Tys based on their length, we are able to locate a priori and automatically all the already known Tys and map all the putative Tys in all the strains. The remaining putative mobile elements (PMEs) emerging from this intra-specific comparison are sharp markers of inter-specific evolution: indeed, many events of non-conservation among different yeast strains correspond to PMEs. A clustering based on the presence/absence of the candidate Tys in the strains suggests an evolutionary interconnection that is very similar to classic phylogenetic trees based on SNP analysis, even though it is computed without using phylogenetic information. The case study indicates that the proposed methodology brings two major advantages: (a) it does not require any template sequence for the wanted MGEs and (b) it can be applied to infer MGEs also for low coverage genomes

  3. Mobilomics in Saccharomyces cerevisiae strains

    Science.gov (United States)

    2013-01-01

    Background Mobile Genetic Elements (MGEs) are selfish DNA integrated in the genomes. Their detection is mainly based on consensus-like searches by scanning the investigated genome against the sequence of an already identified MGE. Mobilomics aims at discovering all the MGEs in a genome and understanding their dynamic behavior; the data for this kind of investigation can be provided by comparative genomics of closely related organisms. The amount of data thus involved requires a strong computational effort, which should be alleviated. Results Our approach proposes to exploit the high similarity among homologous chromosomes of different strains of the same species, following a progressive comparative genomics philosophy. We introduce a software tool based on our new fast algorithm, called regender, which is able to identify the conserved regions between chromosomes. Our case study is represented by a unique recently available dataset of 39 different strains of S. cerevisiae, which regender is able to compare in a few minutes. By exploring the non-conserved regions, where MGEs are mainly retrotransposons called Tys, and marking the candidate Tys based on their length, we are able to locate a priori and automatically all the already known Tys and map all the putative Tys in all the strains. The remaining putative mobile elements (PMEs) emerging from this intra-specific comparison are sharp markers of inter-specific evolution: indeed, many events of non-conservation among different yeast strains correspond to PMEs. A clustering based on the presence/absence of the candidate Tys in the strains suggests an evolutionary interconnection that is very similar to classic phylogenetic trees based on SNP analysis, even though it is computed without using phylogenetic information. Conclusions The case study indicates that the proposed methodology brings two major advantages: (a) it does not require any template sequence for the wanted MGEs and (b) it can be applied to

  4. Genomic comparison of invasive and rare non-invasive strains reveals Porphyromonas gingivalis genetic polymorphisms

    Directory of Open Access Journals (Sweden)

    Svetlana Dolgilevich

    2011-03-01

    Porphyromonas gingivalis strains are shown to invade human cells in vitro with different invasion efficiencies, varying by up to three orders of magnitude. We tested the hypothesis that invasion-associated interstrain genomic polymorphisms are present in P. gingivalis and that putative invasion-associated genes can contribute to P. gingivalis invasion. Using an invasive (W83) and the only available non-invasive P. gingivalis strain (AJW4) and whole genome microarrays followed by two separate software tools, we carried out comparative genomic hybridization (CGH) analysis. We identified 68 annotated and 51 hypothetical open reading frames (ORFs) that are polymorphic between these strains. Among these are surface proteins, lipoproteins, capsular polysaccharide biosynthesis enzymes, regulatory and immunoreactive proteins, integrases, and transposases often with abnormal GC content and clustered on the chromosome. Amplification of selected ORFs was used to validate the approach and the selection. Eleven clinical strains were investigated for the presence of selected ORFs. The putative invasion-associated ORFs were present in 10 of the isolates. The invasion ability of three isogenic mutants, carrying deletions in PG0185, PG0186, and PG0982, was tested. The PG0185 (ragA) and PG0186 (ragB) mutants had 5.1×10³-fold and 3.6×10³-fold decreased in vitro invasion ability, respectively. The annotation of divergent ORFs suggests deficiency in multiple genes as a basis for the P. gingivalis non-invasive phenotype.

  5. Homogenised Australian climate datasets used for climate change monitoring

    International Nuclear Information System (INIS)

    Trewin, Blair; Jones, David; Collins, Dean; Jovanovic, Branislava; Braganza, Karl

    2007-01-01

    The Australian Bureau of Meteorology has developed a number of datasets for use in climate change monitoring. These datasets typically cover 50-200 stations distributed as evenly as possible over the Australian continent, and have been subject to detailed quality control and homogenisation. The time period over which data are available for each element is largely determined by the availability of data in digital form. Whilst nearly all Australian monthly and daily precipitation data have been digitised, a significant quantity of pre-1957 data (for temperature and evaporation) or pre-1987 data (for some other elements) remains to be digitised, and is not currently available for use in the climate change monitoring datasets. In the case of temperature and evaporation, the start date of the datasets is also determined by major changes in instruments or observing practices for which no adjustment is feasible at the present time. The datasets currently available cover: Monthly and daily precipitation (most stations commence 1915 or earlier, with many extending back to the late 19th century, and a few to the mid-19th century); Annual temperature (commences 1910); Daily temperature (commences 1910, with limited station coverage pre-1957); Twice-daily dewpoint/relative humidity (commences 1957); Monthly pan evaporation (commences 1970); Cloud amount (commences 1957) (Jovanovic et al. 2007). As well as the station-based datasets listed above, an additional dataset being developed for use in climate change monitoring (and other applications) covers tropical cyclones in the Australian region. This is described in more detail in Trewin (2007). The datasets already developed are used in analyses of observed climate change, which are available through the Australian Bureau of Meteorology website (http://www.bom.gov.au/silo/products/cli_chg/). They are also used as a basis for routine climate monitoring, and in the datasets used for the development of seasonal

  6. Segmentation of teeth in CT volumetric dataset by panoramic projection and variational level set

    Energy Technology Data Exchange (ETDEWEB)

    Hosntalab, Mohammad [Islamic Azad University, Faculty of Engineering, Science and Research Branch, Tehran (Iran); Aghaeizadeh Zoroofi, Reza [University of Tehran, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, College of Engineering, Tehran (Iran); Abbaspour Tehrani-Fard, Ali [Islamic Azad University, Faculty of Engineering, Science and Research Branch, Tehran (Iran); Sharif University of Technology, Department of Electrical Engineering, Tehran (Iran); Shirani, Gholamreza [Faculty of Dentistry Medical Science of Tehran University, Oral and Maxillofacial Surgery Department, Tehran (Iran)

    2008-09-15

    Quantification of teeth is of clinical importance for various computer assisted procedures such as dental implant, orthodontic planning, face, jaw and cosmetic surgeries. In this regard, segmentation is a major step. In this paper, we propose a method for segmentation of teeth in volumetric computed tomography (CT) data using panoramic re-sampling of the dataset in the coronal view and variational level set. The proposed method consists of five steps as follows: first, we extract a mask from the CT images using Otsu thresholding. Second, the teeth are segmented from other bony tissues by utilizing anatomical knowledge of teeth in the jaws. Third, the proposed method is followed by estimating the arc of the upper and lower jaws and panoramic re-sampling of the dataset. Separation of upper and lower jaws and initial segmentation of teeth are performed by employing the horizontal and vertical projections of the panoramic dataset, respectively. Based on the above-mentioned procedures, an initial mask for each tooth is obtained. Finally, we utilize the initial mask of teeth and apply a variational level set to refine initial teeth boundaries to final contours. The proposed algorithm was evaluated in the presence of 30 multi-slice CT datasets including 3,600 images. Experimental results reveal the effectiveness of the proposed method. In the proposed algorithm, the variational level set technique was utilized to trace the contour of the teeth. Because this technique is based on the characteristic of the overall region of the teeth image, it is possible to extract a very smooth and accurate tooth contour using this technique. In the presence of the available datasets, the proposed technique was successful in teeth segmentation compared to previous techniques. (orig.)
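
    The first step named in the abstract above, extracting a mask by Otsu thresholding, can be sketched in a few lines. This is a generic, dependency-free version of Otsu's method operating on a flat list of integer intensities; the function name, the 256-level assumption and the toy data are illustrative only and are not taken from the paper, which works on CT volumes.

```python
def otsu_threshold(pixels, levels=256):
    """Return the threshold t that maximises the between-class variance.

    Pixels with intensity <= t form the background class, the rest the
    foreground class. Intensities are assumed to be integers in [0, levels).
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(i * count for i, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0      # number of background pixels so far
    sum_bg = 0    # sum of background intensities so far
    for t in range(levels):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue          # no background pixels yet
        w_fg = total - w_bg
        if w_fg == 0:
            break             # every pixel is already background
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


# Toy example: dark soft tissue around 10, bright bone-like values around 200.
pixels = [10] * 50 + [200] * 50
threshold = otsu_threshold(pixels)
mask = [p > threshold for p in pixels]
```

    The resulting boolean mask keeps only the bright voxels, which is the kind of coarse bone mask that later steps could then refine.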

  7. Segmentation of teeth in CT volumetric dataset by panoramic projection and variational level set

    International Nuclear Information System (INIS)

    Hosntalab, Mohammad; Aghaeizadeh Zoroofi, Reza; Abbaspour Tehrani-Fard, Ali; Shirani, Gholamreza

    2008-01-01

    Quantification of teeth is of clinical importance for various computer assisted procedures such as dental implant, orthodontic planning, face, jaw and cosmetic surgeries. In this regard, segmentation is a major step. In this paper, we propose a method for segmentation of teeth in volumetric computed tomography (CT) data using panoramic re-sampling of the dataset in the coronal view and variational level set. The proposed method consists of five steps as follows: first, we extract a mask from the CT images using Otsu thresholding. Second, the teeth are segmented from other bony tissues by utilizing anatomical knowledge of teeth in the jaws. Third, the proposed method is followed by estimating the arc of the upper and lower jaws and panoramic re-sampling of the dataset. Separation of upper and lower jaws and initial segmentation of teeth are performed by employing the horizontal and vertical projections of the panoramic dataset, respectively. Based on the above-mentioned procedures, an initial mask for each tooth is obtained. Finally, we utilize the initial mask of teeth and apply a variational level set to refine initial teeth boundaries to final contours. The proposed algorithm was evaluated in the presence of 30 multi-slice CT datasets including 3,600 images. Experimental results reveal the effectiveness of the proposed method. In the proposed algorithm, the variational level set technique was utilized to trace the contour of the teeth. Because this technique is based on the characteristic of the overall region of the teeth image, it is possible to extract a very smooth and accurate tooth contour using this technique. In the presence of the available datasets, the proposed technique was successful in teeth segmentation compared to previous techniques. (orig.)

  8. Multilocus dataset reveals demographic histories of two peat mosses in Europe

    Directory of Open Access Journals (Sweden)

    Hock Zsófia

    2007-08-01

    Background Revealing the past and present demographic history of populations is of high importance to evaluate the conservation status of species. Demographic data can be obtained by direct monitoring or by analysing data of historical and recent collections. Although these methods provide the most detailed information they are very time consuming. An alternative is to make use of the information accumulated in the species' DNA over its history. Recent development of the coalescent theory makes it possible to reconstruct the demographic history of species using nucleotide polymorphism data. To separate the effect of natural selection and demography, multilocus analysis is needed because these two forces can produce similar patterns of polymorphisms. In this study we investigated the amount and pattern of sequence variability of a Europe-wide sample set of two peat moss species (Sphagnum fimbriatum and S. squarrosum) with similar distributions and mating systems but presumably contrasting historical demographies using 3 regions of the nuclear genome (approx. 3000 bp). We aimed to draw inferences concerning demographic and phylogeographic histories of the species. Results All three nuclear regions supported the presence of an Atlantic and Non-Atlantic clade of S. fimbriatum suggesting glacial survival of the species along the Atlantic coast of Europe. In contrast, S. squarrosum haplotypes showed three clades but no geographic structure at all. Maximum likelihood, mismatch and Bayesian analyses supported a severe historical bottleneck and a relatively recent demographic expansion of the Non-Atlantic clade of S. fimbriatum, whereas size of S. squarrosum populations has probably decreased in the past. Species-wide molecular diversity of the two species was nearly the same with an excess of replacement mutations in S. fimbriatum. Similar levels of molecular diversity, contrasting phylogeographic patterns and excess of replacement

  9. Introduction of a simple-model-based land surface dataset for Europe

    Science.gov (United States)

    Orth, Rene; Seneviratne, Sonia I.

    2015-04-01

    Land surface hydrology can play a crucial role during extreme events such as droughts, floods and even heat waves. We introduce in this study a new hydrological dataset for Europe that consists of soil moisture, runoff and evapotranspiration (ET). It is derived with a simple water balance model (SWBM) forced with precipitation, temperature and net radiation. The SWBM dataset extends over the period 1984-2013 with a daily time step and 0.5° × 0.5° resolution. We employ a novel calibration approach, in which we consider 300 random parameter sets chosen from an observation-based range. Using several independent validation datasets representing soil moisture (or terrestrial water content), ET and streamflow, we identify the best performing parameter set and hence the new dataset. To illustrate its usefulness, the SWBM dataset is compared against several state-of-the-art datasets (ERA-Interim/Land, MERRA-Land, GLDAS-2-Noah, simulations of the Community Land Model Version 4), using all validation datasets as reference. For soil moisture dynamics it outperforms the benchmarks. Therefore the SWBM soil moisture dataset constitutes a reasonable alternative to sparse measurements, little validated model results, or proxy data such as precipitation indices. Also in terms of runoff the SWBM dataset performs well, whereas the evaluation of the SWBM ET dataset is overall satisfactory, but the dynamics are less well captured for this variable. This highlights the limitations of the dataset, as it is based on a simple model that uses uniform parameter values. Hence some processes impacting ET dynamics may not be captured, and quality issues may occur in regions with complex terrain. Even though the SWBM is well calibrated, it cannot replace more sophisticated models; but as their calibration is a complex task the present dataset may serve as a benchmark in future. In addition we investigate the sources of skill of the SWBM dataset and find that the parameter set has a similar

  10. Data Mining for Imbalanced Datasets: An Overview

    Science.gov (United States)

    Chawla, Nitesh V.

    A dataset is imbalanced if the classification categories are not approximately equally represented. Recent years have brought increased interest in applying machine learning techniques to difficult "real-world" problems, many of which are characterized by imbalanced data. Additionally, the distribution of the testing data may differ from that of the training data, and the true misclassification costs may be unknown at learning time. Predictive accuracy, a popular choice for evaluating the performance of a classifier, might not be appropriate when the data are imbalanced and/or the costs of different errors vary markedly. In this chapter, we discuss some of the sampling techniques used for balancing datasets, and the performance measures more appropriate for mining imbalanced datasets.
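
    As an illustration of why predictive accuracy can mislead on imbalanced data, the following minimal sketch compares accuracy with balanced accuracy for a classifier that always predicts the majority class, and shows random oversampling as one simple balancing technique. The function names and the 95/5 toy labels are assumptions made for this example; they are not taken from the chapter.

```python
import random
from collections import Counter


def confusion(y_true, y_pred, positive=1):
    """Return (tp, fn, fp, tn) for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    return tp, fn, fp, tn


def accuracy(y_true, y_pred):
    """Fraction of all predictions that are correct."""
    tp, fn, fp, tn = confusion(y_true, y_pred)
    return (tp + tn) / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of the true-positive rate and the true-negative rate."""
    tp, fn, fp, tn = confusion(y_true, y_pred)
    tpr = tp / (tp + fn) if tp + fn else 0.0
    tnr = tn / (tn + fp) if tn + fp else 0.0
    return (tpr + tnr) / 2


def random_oversample(X, y, seed=0):
    """Duplicate randomly chosen minority examples until all classes are equal in size."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        pool = [x for x, lab in zip(X, y) if lab == label]
        for _ in range(target - n):
            X_out.append(rng.choice(pool))
            y_out.append(label)
    return X_out, y_out


# Toy data (hypothetical): 95 negatives, 5 positives, and a classifier
# that always predicts the majority class.
y_true = [0] * 95 + [1] * 5
y_pred = [0] * 100
print(accuracy(y_true, y_pred), balanced_accuracy(y_true, y_pred))
```

    On this toy example the majority-class classifier looks strong by accuracy while its balanced accuracy shows that it never detects the minority class. Random oversampling is only one of several sampling strategies and is used here because it is the simplest to state.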

  11. Combination of Metabolomic and Proteomic Analysis Revealed Different Features among Lactobacillus delbrueckii Subspecies bulgaricus and lactis Strains While In Vivo Testing in the Model Organism Caenorhabditis elegans Highlighted Probiotic Properties

    Directory of Open Access Journals (Sweden)

    Elena Zanni

    2017-06-01

    Lactobacillus delbrueckii represents a technologically relevant member of lactic acid bacteria, since the two subspecies bulgaricus and lactis are widely associated with fermented dairy products. In the present work, we report the characterization of two commercial strains belonging to L. delbrueckii subspecies bulgaricus and lactis, and a novel strain previously isolated from a traditional fermented fresh cheese. A phenomic approach was performed by combining metabolomic and proteomic analysis of the three strains, which were subsequently supplemented as food source to the model organism Caenorhabditis elegans, with the final aim to evaluate their possible probiotic effects. Restriction analysis of 16S ribosomal DNA revealed that the novel foodborne strain belonged to L. delbrueckii subspecies lactis. Proteomic and metabolomic approaches showed differences in folate, amino acid and sugar metabolic pathways among the three strains. Moreover, evaluation of C. elegans lifespan, larval development, brood size, and bacterial colonization capacity demonstrated that L. delbrueckii subsp. bulgaricus diet exerted beneficial effects on nematodes. On the other hand, both L. delbrueckii subsp. lactis strains affected lifespan and larval development. We have characterized three strains belonging to L. delbrueckii subspecies bulgaricus and lactis highlighting their divergent origin. In particular, the two closely related isolates L. delbrueckii subspecies lactis display different galactose metabolic capabilities. Moreover, the L. delbrueckii subspecies bulgaricus strain demonstrated potential probiotic features. Combination of omic platforms coupled with in vivo screening in the simple model organism C. elegans is a powerful tool to characterize industrially relevant bacterial isolates.

  12. Combination of Metabolomic and Proteomic Analysis Revealed Different Features among Lactobacillus delbrueckii Subspecies bulgaricus and lactis Strains While In Vivo Testing in the Model Organism Caenorhabditis elegans Highlighted Probiotic Properties.

    Science.gov (United States)

    Zanni, Elena; Schifano, Emily; Motta, Sara; Sciubba, Fabio; Palleschi, Claudio; Mauri, Pierluigi; Perozzi, Giuditta; Uccelletti, Daniela; Devirgiliis, Chiara; Miccheli, Alfredo

    2017-01-01

    Lactobacillus delbrueckii represents a technologically relevant member of lactic acid bacteria, since the two subspecies bulgaricus and lactis are widely associated with fermented dairy products. In the present work, we report the characterization of two commercial strains belonging to L. delbrueckii subspecies bulgaricus and lactis, and a novel strain previously isolated from a traditional fermented fresh cheese. A phenomic approach was performed by combining metabolomic and proteomic analysis of the three strains, which were subsequently supplemented as food source to the model organism Caenorhabditis elegans, with the final aim to evaluate their possible probiotic effects. Restriction analysis of 16S ribosomal DNA revealed that the novel foodborne strain belonged to L. delbrueckii subspecies lactis. Proteomic and metabolomic approaches showed differences in folate, amino acid and sugar metabolic pathways among the three strains. Moreover, evaluation of C. elegans lifespan, larval development, brood size, and bacterial colonization capacity demonstrated that L. delbrueckii subsp. bulgaricus diet exerted beneficial effects on nematodes. On the other hand, both L. delbrueckii subsp. lactis strains affected lifespan and larval development. We have characterized three strains belonging to L. delbrueckii subspecies bulgaricus and lactis highlighting their divergent origin. In particular, the two closely related isolates L. delbrueckii subspecies lactis display different galactose metabolic capabilities. Moreover, the L. delbrueckii subspecies bulgaricus strain demonstrated potential probiotic features. Combination of omic platforms coupled with in vivo screening in the simple model organism C. elegans is a powerful tool to characterize industrially relevant bacterial isolates.

  13. Exploring the Saccharomyces cerevisiae Volatile Metabolome: Indigenous versus Commercial Strains

    Science.gov (United States)

    Alves, Zélia; Melo, André; Figueiredo, Ana Raquel; Coimbra, Manuel A.; Gomes, Ana C.; Rocha, Sílvia M.

    2015-01-01

    Winemaking is a highly industrialized process, and a number of commercial Saccharomyces cerevisiae strains are used around the world, neglecting the diversity of native yeast strains that are responsible for the peculiar flavours of wines. The aim of this study was to establish in depth the S. cerevisiae volatile metabolome and to assess inter-strain variability. To fulfill this objective, two indigenous S. cerevisiae strains (BT2652 and BT2453, isolated from spontaneous fermentation of grapes collected in the Bairrada Appellation, Portugal) and two commercial strains (CSc1 and CSc2) were analysed using a methodology based on advanced multidimensional gas chromatography (HS-SPME/GC×GC-ToFMS) in tandem with multivariate analysis. A total of 257 volatile metabolites were identified, distributed over the chemical families of acetals, acids, alcohols, aldehydes, ketones, terpenic compounds, esters, ethers, furan-type compounds, hydrocarbons, pyrans, pyrazines and S-compounds. Some of these families are related to metabolic pathways of amino acid, carbohydrate and fatty acid metabolism as well as mono- and sesquiterpenic biosynthesis. Principal Component Analysis (PCA) was used with a dataset comprising all variables (257 volatile components), and a distinction was observed between commercial and indigenous strains, which suggests inter-strain variability. In a second step, a subset containing esters and terpenic compounds (C10 and C15), metabolites of particular relevance to wine aroma, was also analysed using PCA. The terpenic and ester profiles express the variability of the strains and their potential contribution to wine aromas, especially BT2453, which produced the higher terpenic content. This research contributes to understanding the metabolic diversity of indigenous wine microflora versus commercial strains and yields knowledge that may be further exploited to produce wines with peculiar aroma properties. PMID:26600152

  14. Genetic diversity and population structure of Iranian wild Pleurotus eryngii species-complex strains revealed by URP-PCR markers

    NARCIS (Netherlands)

    Behnamian, Mahdi; Mohammadi, Seyed A.; Sonnenberg, A.S.M.; Goltapeh, Ebrahim M.; Hendrickx, P.M.

    2010-01-01

    In the present study, a set of 68 P. eryngii wild strains collected from nine locations in northwest and west of Iran along with six commercial strains were studied using universal rice primers (URP). The wild strains were isolated from Ferula ovina, F. haussknechtii, Cachrys ferulacea, Kellusia

  15. Strain path and work-hardening behavior of brass

    International Nuclear Information System (INIS)

    Sakharova, N.A.; Fernandes, J.V.; Vieira, M.F.

    2009-01-01

    Plastic straining in metal forming usually includes changes of strain path, which are frequently not taken into account in the analysis of forming processes. Moreover, strain path change can significantly affect the mechanical behavior and microstructural evolution of the material. For this reason, a combination of several simple loading test sequences is an effective way to investigate the dislocation microstructure of sheet metals under such forming conditions. Pure tension and rolling strain paths and rolling-tension strain path sequences were performed on brass sheets. A study of mechanical behavior and microstructural evolution during the simple and the complex strain paths was carried out, within a wide range of strain values. The appearance and development of deformation twinning was evident. It was shown that strain path change promotes the onset of premature twinning. The work-hardening behavior is discussed in terms of the twinning and dislocation microstructure evolution, as revealed by transmission electron microscopy

  16. A hybrid organic-inorganic perovskite dataset

    Science.gov (United States)

    Kim, Chiho; Huan, Tran Doan; Krishnan, Sridevi; Ramprasad, Rampi

    2017-05-01

    Hybrid organic-inorganic perovskites (HOIPs) have been attracting a great deal of attention due to their versatility of electronic properties and fabrication methods. We prepare a dataset of 1,346 HOIPs, which features 16 organic cations, 3 group-IV cations and 4 halide anions. Using a combination of an atomic structure search method and density functional theory calculations, the optimized structures, the bandgap, the dielectric constant, and the relative energies of the HOIPs are uniformly prepared and validated by comparing with relevant experimental and/or theoretical data. We make the dataset available at Dryad Digital Repository, NoMaD Repository, and Khazana Repository (http://khazana.uconn.edu/), hoping that it could be useful for future data-mining efforts that can explore possible structure-property relationships and phenomenological models. Progressive extension of the dataset is expected as new organic cations become appropriate within the HOIP framework, and as additional properties are calculated for the new compounds found.

  17. Genomics dataset of unidentified disclosed isolates

    Directory of Open Access Journals (Sweden)

    Bhagwan N. Rekadwad

    2016-09-01

    Analysis of DNA sequences is necessary for higher hierarchical classification of organisms. It gives clues about the characteristics of organisms and their taxonomic position. This dataset was chosen to find complexities in the unidentified DNA in the disclosed patents. A total of 17 unidentified DNA sequences were thoroughly analyzed. Quick response (QR) codes were generated, and the AT/GC content of the DNA sequences was analyzed. The QR codes are helpful for quick identification of isolates. AT/GC content is helpful for studying their stability at different temperatures. Additionally, a dataset on cleavage codes and enzyme codes from the restriction digestion study, which is helpful for performing studies using short DNA sequences, is reported. The dataset disclosed here provides new revelatory data for exploration of unique DNA sequences for evaluation, identification, comparison and analysis. Keywords: BioLABs, Blunt ends, Genomics, NEB cutter, Restriction digestion, Short DNA sequences, Sticky ends
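
    The AT/GC content analysis mentioned above can be sketched in a few lines. This is a minimal illustration and not the authors' pipeline: the function names are hypothetical, ambiguous bases such as N are simply ignored, and the melting-temperature estimate uses the Wallace rule of thumb (2 °C per A/T plus 4 °C per G/C), which is only a rough guide for short oligonucleotides.

```python
def at_gc_content(seq):
    """Return the AT and GC fractions of a DNA sequence.

    Only A, C, G and T are counted; any other symbol (for example N)
    is ignored, which is an assumption of this sketch.
    """
    seq = seq.upper()
    counts = {base: seq.count(base) for base in "ACGT"}
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return {
        "AT": (counts["A"] + counts["T"]) / total,
        "GC": (counts["G"] + counts["C"]) / total,
    }


def wallace_tm(seq):
    """Rough melting temperature in degrees Celsius for a short oligonucleotide."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Hypothetical short sequence used only for demonstration.
example = "ATGCGCGTTA"
print(at_gc_content(example), wallace_tm(example))
```

    A higher GC fraction generally corresponds to a more thermally stable duplex, which is why the two quantities are reported together here.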

  18. Integrated analysis of ischemic stroke datasets revealed sex and age difference in anti-stroke targets

    Directory of Open Access Journals (Sweden)

    Wen-Xing Li

    2016-09-01

    Ischemic stroke is a common neurological disorder and its global burden is growing. This study aims to explore the effect of sex and age differences on ischemic stroke using integrated microarray datasets. The results showed a dramatic difference in whole gene expression profiles and influenced pathways between males and females, and also between old and young individuals. Furthermore, compared with old males, old female patients showed more serious biological function damage. However, among young subjects, females showed fewer affected pathways than males. Functional interaction networks showed that these differentially expressed genes were mostly related to immune and inflammation-related functions. In addition, we found that ARG1 and MMP9 were up-regulated in the total group and in all subgroups. Importantly, IL1A, ILAB, IL6 and TNF and other anti-stroke target genes were up-regulated in males. However, these anti-stroke target genes showed low expression in females. This study found large sex and age differences in ischemic stroke, especially the opposite expression of anti-stroke target genes. Future studies are needed to uncover these pathological mechanisms, and to take appropriate pre-prevention, treatment and rehabilitation measures.

  19. Omics strategies for revealing Yersinia pestis virulence

    Science.gov (United States)

    Yang, Ruifu; Du, Zongmin; Han, Yanping; Zhou, Lei; Song, Yajun; Zhou, Dongsheng; Cui, Yujun

    2012-01-01

    Omics has remarkably changed the way we investigate and understand life. Omics differs from traditional hypothesis-driven research because it is a discovery-driven approach. Mass datasets produced from omics-based studies require experts from different fields to reveal the salient features behind these data. In this review, we summarize omics-driven studies to reveal the virulence features of Yersinia pestis through genomics, transcriptomics, proteomics, interactomics, etc. These studies serve as foundations for further hypothesis-driven research and help us gain insight into Y. pestis pathogenesis. PMID:23248778

  20. IPCC Socio-Economic Baseline Dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — The Intergovernmental Panel on Climate Change (IPCC) Socio-Economic Baseline Dataset consists of population, human development, economic, water resources, land...

  1. Clinical Trichophyton rubrum Strain Exhibiting Primary Resistance to Terbinafine

    Science.gov (United States)

    Mukherjee, Pranab K.; Leidich, Steven D.; Isham, Nancy; Leitner, Ingrid; Ryder, Neil S.; Ghannoum, Mahmoud A.

    2003-01-01

    The in vitro antifungal susceptibilities of six clinical Trichophyton rubrum isolates obtained sequentially from a single onychomycosis patient who failed oral terbinafine therapy (250 mg/day for 24 weeks) were determined by broth microdilution and macrodilution methodologies. Strain relatedness was examined by random amplified polymorphic DNA (RAPD) analyses. Data obtained from both broth micro- and macrodilution assays were in agreement and revealed that the six clinical isolates had greatly reduced susceptibilities to terbinafine. The MICs of terbinafine for these strains were >4 μg/ml, whereas they were <0.0002 μg/ml for the susceptible reference strains. Consistent with these findings, the minimum fungicidal concentrations (MFCs) of terbinafine for all six strains were >128 μg/ml, whereas they were 0.0002 μg/ml for the reference strain. The MIC of terbinafine for the baseline strain (cultured at the initial screening visit and before therapy was started) was already 4,000-fold higher than normal, suggesting that this is a case of primary resistance to terbinafine. The results obtained by the broth macrodilution procedure revealed that the terbinafine MICs and MFCs for sequential isolates apparently increased during the course of therapy. RAPD analyses did not reveal any differences between the isolates. The terbinafine-resistant isolates exhibited normal susceptibilities to clinically available antimycotics including itraconazole, fluconazole, and griseofulvin. However, these isolates were fully cross resistant to several other known squalene epoxidase inhibitors, including naftifine, butenafine, tolnaftate, and tolciclate, suggesting a target-specific mechanism of resistance. This is the first confirmed report of terbinafine resistance in dermatophytes. PMID:12499173

  2. The LANDFIRE Refresh strategy: updating the national dataset

    Science.gov (United States)

    Nelson, Kurtis J.; Connot, Joel A.; Peterson, Birgit E.; Martin, Charley

    2013-01-01

    The LANDFIRE Program provides comprehensive vegetation and fuel datasets for the entire United States. As with many large-scale ecological datasets, vegetation and landscape conditions must be updated periodically to account for disturbances, growth, and natural succession. The LANDFIRE Refresh effort was the first attempt to consistently update these products nationwide. It incorporated a combination of specific systematic improvements to the original LANDFIRE National data, remote sensing based disturbance detection methods, field collected disturbance information, vegetation growth and succession modeling, and vegetation transition processes. This resulted in the creation of two complete datasets for all 50 states: LANDFIRE Refresh 2001, which includes the systematic improvements, and LANDFIRE Refresh 2008, which includes the disturbance and succession updates to the vegetation and fuel data. The new datasets are comparable for studying landscape changes in vegetation type and structure over a decadal period, and provide the most recent characterization of fuel conditions across the country. The applicability of the new layers is discussed and the effects of using the new fuel datasets are demonstrated through a fire behavior modeling exercise using the 2011 Wallow Fire in eastern Arizona as an example.

  3. Direct determination of elastic strains and dislocation densities in individual subgrains in deformation structures

    DEFF Research Database (Denmark)

    Jakobsen, Bo; Poulsen, Henning Friis; Lienert, U.

    2007-01-01

    A novel synchrotron-based technique "high angular resolution 3DXRD" is presented in detail, and applied to the characterization of oxygen-free, high-conductivity copper at a tensile deformation of 2%. The position and shape in reciprocal space of 14 peaks originating from deeply embedded individual...... subgrains is reported. From this dataset the density of redundant dislocations in the individual subgrains is inferred to be below 12 × 10¹² m⁻² on average. It is found that the subgrains on average experience a reduction in strain of 0.9 × 10⁻⁴ with respect to the mean elastic strain of the full grain...

  4. Information contained within the large scale gas injection test (Lasgit) dataset exposed using a bespoke data analysis tool-kit

    International Nuclear Information System (INIS)

    Bennett, D.P.; Thomas, H.R.; Cuss, R.J.; Harrington, J.F.; Vardon, P.J.

    2012-01-01

    with time, for use as a small scale event indicator; a non-parametric time-series component analysis technique (Singular Spectrum Analysis - SSA) for trend identification; and a unique Non-uniform Discrete Fourier Transformation (NDFT) technique that is suited to a non-uniformly sampled time-series input. Specific details of the implementations of these techniques are outlined. As a result of the application of the developed tool-kit a number of easily observable and quantified phenomena are revealed, for example: - The location of a number of small scale anomalous behaviours of potential interest are highlighted; - Frequency, amplitude and phase of highly cyclic sensors are deterministically established; - Long term trends in each sensor series are identified, revealing the residual forms of the sensor records without the long term behaviour superimposed. Re-application of the tool-kit when applied to the residual time series as determined by the initial application further reveals information of potential interest from the dataset. For example, small scale events as indicated by the noise parameterization process and frequency information determined by the NDFT process are less likely to be masked by the long-term variation in the original sensor record. Results of these improvements are also presented within the manuscript. Initial interpretation of the information exposed by the EDA performed through application of the developed tool-kit is presented. The qualitative results, i.e. the event indicators are tentatively associated with experimental procedure or response e.g. changes in noise floor correlated with hydraulic over pressurisation down-hole. The quantitative results, i.e. the frequency information, are used to estimate the effect of environmental conditions on the experimental set-up. 
    While manipulation of a dataset to this extent can expose valuable information useful in further analysis, care must be taken to ensure the phenomena revealed are not a

  5. MiSTIC, an integrated platform for the analysis of heterogeneity in large tumour transcriptome datasets.

    Science.gov (United States)

    Lemieux, Sebastien; Sargeant, Tobias; Laperrière, David; Ismail, Houssam; Boucher, Geneviève; Rozendaal, Marieke; Lavallée, Vincent-Philippe; Ashton-Beaucage, Dariel; Wilhelm, Brian; Hébert, Josée; Hilton, Douglas J; Mader, Sylvie; Sauvageau, Guy

    2017-07-27

    Genome-wide transcriptome profiling has enabled non-supervised classification of tumours, revealing different sub-groups characterized by specific gene expression features. However, the biological significance of these subtypes remains for the most part unclear. We describe herein an interactive platform, Minimum Spanning Trees Inferred Clustering (MiSTIC), that integrates the direct visualization and comparison of the gene correlation structure between datasets, the analysis of the molecular causes underlying co-variations in gene expression in cancer samples, and the clinical annotation of tumour sets defined by the combined expression of selected biomarkers. We have used MiSTIC to highlight the roles of specific transcription factors in breast cancer subtype specification, to compare the aspects of tumour heterogeneity targeted by different prognostic signatures, and to highlight biomarker interactions in AML. A version of MiSTIC preloaded with datasets described herein can be accessed through a public web server (http://mistic.iric.ca); in addition, the MiSTIC software package can be obtained (github.com/iric-soft/MiSTIC) for local use with personalized datasets. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Morphological characterization and molecular fingerprinting of Nostoc strains by multiplex RAPD.

    Science.gov (United States)

    Hillol, Chakdar; Pabbi, Sunil

    2012-01-01

    Morphological parameters studied for the twenty selected Nostoc strains were mostly found to be consistent with earlier reports. However, the shape of the akinetes observed in this study deviated slightly from existing descriptions, and heterocyst frequency was also found to differ between strains despite growth in the same nitrogen-free media. Multiplex RAPD produced reproducible and completely polymorphic amplification profiles for all the strains, including some strain-specific unique bands which are intended to be useful for identification of those strains. Between one and two unique bands were produced by different dual primer combinations. Strain-specific bands were generated for ten of the twenty strains. Cluster analysis revealed a vast heterogeneity among these Nostoc strains, and no specific clustering based on geographical origin was found except for a few strains. It was also observed that morphological data may not necessarily correspond to the genetic data in most cases. CCC92 (Nostoc muscorum) and CCC48 (Nostoc punctiforme) showed a high degree of similarity, which was well supported by a high bootstrap value. The level of similarity of the strains ranged from 0.15 to 0.94. Cluster analysis based on multiplex RAPD showed a good fit, revealing the discriminatory power of this technique.

  7. Omicseq: a web-based search engine for exploring omics datasets

    Science.gov (United States)

    Sun, Xiaobo; Pittard, William S.; Xu, Tianlei; Chen, Li; Zwick, Michael E.; Jiang, Xiaoqian; Wang, Fusheng

    2017-01-01

    The development and application of high-throughput genomics technologies has resulted in massive quantities of diverse omics data that continue to accumulate rapidly. These rich datasets offer unprecedented and exciting opportunities to address long-standing questions in biomedical research. However, our ability to explore and query the content of diverse omics data is very limited. Existing dataset search tools rely almost exclusively on the metadata. A text-based query for gene name(s) does not work well on datasets whose content is overwhelmingly numeric. To overcome this barrier, we have developed Omicseq, a novel web-based platform that facilitates the easy interrogation of omics datasets holistically to improve ‘findability’ of relevant data. The core component of Omicseq is trackRank, a novel algorithm for ranking omics datasets that fully uses the numerical content of the dataset to determine relevance to the query entity. The Omicseq system is supported by a scalable and elastic NoSQL database that hosts a large collection of processed omics datasets. In the front end, a simple, web-based interface allows users to enter queries and instantly receive search results as a list of ranked datasets deemed to be the most relevant. Omicseq is freely available at http://www.omicseq.org. PMID:28402462

  8. Intraspecies diversity of Lactobacillus sakei response to oxidative stress and variability of strain performance in mixed strains challenges.

    Science.gov (United States)

    Guilbaud, Morgan; Zagorec, Monique; Chaillou, Stéphane; Champomier-Vergès, Marie-Christine

    2012-04-01

    Lactobacillus sakei is a meat-borne lactic acid bacterium species exhibiting a wide genomic diversity. We have investigated the diversity of response to various oxidative compounds among L. sakei strains in a collection representing the genomic diversity. We observed various responses to the different compounds as well as a diversity of response depending on the aeration conditions used for cell growth. A principal component analysis revealed two main phenotypic groups, partially correlating with previously described genomic clusters. We designed strain mixes composed of three different strains, in order to examine the behavior of each strain when cultured alone or in the presence of other strains. The strains composing the mixtures were chosen to be as diverse as possible, i.e. exhibiting diverse responses to oxidative stress and belonging to different genomic clusters. Growth and survival rates of each strain were monitored under various aeration conditions, with or without heme supplementation. The results obtained suggest that some strains may act as "helper" or "burden" strains depending on the oxidative conditions encountered during incubation. This study confirms that resistance to oxidative stress is extremely variable within the L. sakei species and that this property should be considered when investigating starter performance in the complex meat bacterial ecosystems. Copyright © 2011 Elsevier Ltd. All rights reserved.

  9. Nanoparticle-organic pollutant interaction dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — Dataset presents concentrations of organic pollutants, such as polyaromatic hydrocarbon compounds, in water samples. Water samples of known volume and concentration...

  10. Yersinia enterocolitica YopH-Deficient Strain Activates Neutrophil Recruitment to Peyer's Patches and Promotes Clearance of the Virulent Strain.

    Science.gov (United States)

    Dave, Mabel N; Silva, Juan E; Eliçabe, Ricardo J; Jeréz, María B; Filippa, Verónica P; Gorlino, Carolina V; Autenrieth, Stella; Autenrieth, Ingo B; Di Genaro, María S

    2016-11-01

    Yersinia enterocolitica evades the immune response by injecting Yersinia outer proteins (Yops) into the cytosol of host cells. YopH is a tyrosine phosphatase critical for Yersinia virulence. However, the mucosal immune mechanisms subverted by YopH during in vivo orogastric infection with Y. enterocolitica remain elusive. The results of this study revealed neutrophil recruitment to Peyer's patches (PP) after infection with a YopH-deficient mutant strain (Y. enterocolitica ΔyopH). While the Y. enterocolitica wild-type (WT) strain in PP induced the major neutrophil chemoattractant CXCL1 mRNA and protein levels, infection with the Y. enterocolitica ΔyopH mutant strain exhibited a higher expression of the CXCL1 receptor, CXCR2, in blood neutrophils, leading to efficient neutrophil recruitment to the PP. In contrast, migration of neutrophils into PP was impaired upon infection with Y. enterocolitica WT strain. In vitro infection of blood neutrophils revealed the involvement of YopH in CXCR2 expression. Depletion of neutrophils during Y. enterocolitica ΔyopH infection raised the bacterial load in PP. Moreover, the clearance of WT Y. enterocolitica was improved when an equal mixture of Y. enterocolitica WT and Y. enterocolitica ΔyopH strains was used in infecting the mice. This study indicates that Y. enterocolitica prevents early neutrophil recruitment in the intestine and that the effector protein YopH plays an important role in the immune evasion mechanism. The findings highlight the potential use of the Y. enterocolitica YopH-deficient strain as an oral vaccine carrier. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  11. Strain-accelerated dynamics of soft colloidal glasses

    KAUST Repository

    Agarwal, Praveen

    2011-04-11

    We have investigated strain-accelerated dynamics of soft glasses theoretically and experimentally. Mechanical rheology measurements performed on a variety of systems reveal evidence for the speeding-up of relaxation at modest shear strains in both step and oscillatory shear flows. Using the soft glassy rheology (SGR) model framework, we show that the observed behavior is a fundamental, but heretofore unexplored attribute of soft glasses. © 2011 American Physical Society.

  12. Framework for Interactive Parallel Dataset Analysis on the Grid

    Energy Technology Data Exchange (ETDEWEB)

    Alexander, David A.; Ananthan, Balamurali; /Tech-X Corp.; Johnson, Tony; Serbo, Victor; /SLAC

    2007-01-10

    We present a framework for use at a typical Grid site to facilitate custom interactive parallel dataset analysis targeting terabyte-scale datasets of the type typically produced by large multi-institutional science experiments. We summarize the needs for interactive analysis and show a prototype solution that satisfies those needs. The solution consists of a desktop client tool and a set of Web Services that allow scientists to sign onto a Grid site, compose analysis script code to carry out physics analysis on datasets, distribute the code and datasets to worker nodes, collect the results back to the client, and construct professional-quality visualizations of the results.

  13. Different distribution patterns of ten virulence genes in Legionella reference strains and strains isolated from environmental water and patients.

    Science.gov (United States)

    Zhan, Xiao-Yong; Hu, Chao-Hui; Zhu, Qing-Yi

    2016-04-01

    Virulence genes are distinct regions of DNA which are present in the genome of pathogenic bacteria and absent in nonpathogenic strains of the same or related species. Virulence genes are frequently associated with bacterial pathogenicity in the genus Legionella. In the present study, an assay was performed to detect ten virulence genes, including iraA, iraB, lvrA, lvrB, lvhD, cpxR, cpxA, dotA, icmC and icmD in different pathogenicity islands of 47 Legionella reference strains, 235 environmental strains isolated from water, and 4 clinical strains isolated from the lung tissue of pneumonia patients. The distribution frequencies of these genes in reference and/or environmental L. pneumophila strains were much higher than those in reference non-L. pneumophila and/or environmental non-L. pneumophila strains, respectively. L. pneumophila clinical strains also maintained higher frequencies of these genes compared to four other types of Legionella strains. Distribution frequencies of these genes in reference L. pneumophila strains were similar to those in environmental L. pneumophila strains. In contrast, environmental non-L. pneumophila maintained higher frequencies of these genes compared to those found in reference non-L. pneumophila strains. This study illustrates the association of virulence genes with Legionella pathogenicity and reveals the possible virulence evolution of non-L. pneumophila strains isolated from environmental water.

  14. Large-scale Labeled Datasets to Fuel Earth Science Deep Learning Applications

    Science.gov (United States)

    Maskey, M.; Ramachandran, R.; Miller, J.

    2017-12-01

    Deep learning has revolutionized computer vision and natural language processing with various algorithms scaled using high-performance computing. However, generic large-scale labeled datasets such as ImageNet are the fuel that drives the impressive accuracy of deep learning results. Large-scale labeled datasets already exist in domains such as medical science, but creating them in the Earth science domain is a challenge. While there are ways to apply deep learning using limited labeled datasets, there is a need in the Earth sciences for creating large-scale labeled datasets for benchmarking and scaling deep learning applications. At the NASA Marshall Space Flight Center, we are using deep learning for a variety of Earth science applications where we have encountered the need for large-scale labeled datasets. We will discuss our approaches for creating such datasets and why these datasets are just as valuable as deep learning algorithms. We will also describe successful usage of these large-scale labeled datasets with our deep learning based applications.

  15. An Affinity Propagation Clustering Algorithm for Mixed Numeric and Categorical Datasets

    Directory of Open Access Journals (Sweden)

    Kang Zhang

    2014-01-01

    Clustering has been widely used in different fields of science, technology, social science, and so forth. In the real world, numeric as well as categorical features are usually used to describe the data objects. Accordingly, many clustering methods can process datasets that are either numeric or categorical. Recently, algorithms that can handle the mixed data clustering problems have been developed. The affinity propagation (AP) algorithm is an exemplar-based clustering method which has demonstrated good performance on a wide variety of datasets. However, it has limitations on processing mixed datasets. In this paper, we propose a novel similarity measure for mixed-type datasets and an adaptive AP clustering algorithm to cluster the mixed datasets. Several real-world datasets are studied to evaluate the performance of the proposed algorithm. Comparisons with other clustering algorithms demonstrate that the proposed method works well not only on mixed datasets but also on pure numeric and categorical datasets.
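
    The abstract does not give the proposed similarity measure, so the following is only a minimal sketch of the general idea, assuming a Gower-style average: numeric features contribute a range-normalised closeness and categorical features contribute an exact-match score. The function names and the `numeric_ranges` parameter are hypothetical; the resulting matrix is the kind of pairwise input that an AP implementation typically consumes.

```python
def mixed_similarity(a, b, numeric_ranges):
    """Gower-style similarity in [0, 1] between two mixed-type records.

    ASSUMPTION: this is a generic illustration, not the measure proposed
    in the paper. `numeric_ranges` maps the index of each numeric feature
    to its observed range (max - min); all other features are treated as
    categorical.
    """
    total = 0.0
    for i, (x, y) in enumerate(zip(a, b)):
        if i in numeric_ranges:
            rng = numeric_ranges[i]
            # Numeric feature: 1 when equal, 0 when a full range apart.
            total += 1.0 - (abs(x - y) / rng if rng else 0.0)
        else:
            # Categorical feature: simple match / mismatch.
            total += 1.0 if x == y else 0.0
    return total / len(a)


def similarity_matrix(records, numeric_ranges):
    """Pairwise similarities for all records (symmetric, diagonal = 1)."""
    return [[mixed_similarity(a, b, numeric_ranges) for b in records]
            for a in records]
```

    With records such as (1.0, "red") and (3.0, "red") and a numeric range of 4.0 for the first feature, the numeric part contributes 0.5 and the categorical part 1.0, giving an overall similarity of 0.75.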

  16. Genomic and gene variation in Mycoplasma hominis strains

    DEFF Research Database (Denmark)

    Christiansen, Gunna; Andersen, H; Birkelund, Svend

    1987-01-01

    DNAs from 14 strains of Mycoplasma hominis isolated from various habitats, including strain PG21, were analyzed for genomic heterogeneity. DNA-DNA filter hybridization values were from 51 to 91%. Restriction endonuclease digestion patterns, analyzed by agarose gel electrophoresis, revealed...... no identity or cluster formation between strains. Variation within M. hominis rRNA genes was analyzed by Southern hybridization of EcoRI-cleaved DNA hybridized with a cloned fragment of the rRNA gene from the mycoplasma strain PG50. Five of the M. hominis strains showed identical hybridization patterns....... These hybridization patterns were compared with those of 12 other mycoplasma species, which showed a much more complex band pattern. Cloned nonribosomal RNA gene fragments of M. hominis PG21 DNA were analyzed, and the fragments were used to demonstrate heterogeneity among the strains. A monoclonal antibody against...

  17. Using Multiple Big Datasets and Machine Learning to Produce a New Global Particulate Dataset: A Technology Challenge Case Study

    Science.gov (United States)

    Lary, D. J.

    2013-12-01

    A BigData case study is described where multiple datasets from several satellites, high-resolution global meteorological data, social media and in-situ observations are combined using machine learning on a distributed cluster using an automated workflow. The global particulate dataset is relevant to global public health studies and would not be possible to produce without the use of the multiple big datasets, in-situ data and machine learning. To greatly reduce development time and enhance functionality, a high-level language capable of parallel processing (Matlab) has been used. Key considerations for the system are high-speed access to the large data volumes, persistence of those volumes, and a precise process-time scheduling capability.

  18. Chemical product and function dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — Merged product weight fraction and chemical function data. This dataset is associated with the following publication: Isaacs, K., M. Goldsmith, P. Egeghy, K....

  19. Bordetella pertussis pertactin knock-out strains reveal immunomodulatory properties of this virulence factor.

    NARCIS (Netherlands)

    Hovingh, Elise Sofie; Mariman, Rob; Solans, Luis; Hijdra, Daniëlle; Hamstra, Hendrik-Jan; Jongerius, Ilse; van Gent, Marjolein; Mooi, Frits; Locht, Camille; Pinelli, Elena

    2018-01-01

    Whooping cough, caused by Bordetella pertussis, has resurged and presents a global health burden. B. pertussis strains unable to produce the acellular pertussis vaccine component pertactin (Prn) have been emerging and in some countries represent up to 95% of recent clinical isolates.

  20. General Purpose Multimedia Dataset - GarageBand 2008

    DEFF Research Database (Denmark)

    Meng, Anders

    This document describes a general-purpose multimedia dataset to be used in cross-media machine learning problems. In more detail, we describe the genre taxonomy applied at http://www.garageband.com, from where the dataset was collected, and how the taxonomy has been fused into a more human...... understandable taxonomy. Finally, a description of various features extracted from both the audio and text is presented....

  1. Omicseq: a web-based search engine for exploring omics datasets.

    Science.gov (United States)

    Sun, Xiaobo; Pittard, William S; Xu, Tianlei; Chen, Li; Zwick, Michael E; Jiang, Xiaoqian; Wang, Fusheng; Qin, Zhaohui S

    2017-07-03

    The development and application of high-throughput genomics technologies has resulted in massive quantities of diverse omics data that continue to accumulate rapidly. These rich datasets offer unprecedented and exciting opportunities to address long standing questions in biomedical research. However, our ability to explore and query the content of diverse omics data is very limited. Existing dataset search tools rely almost exclusively on the metadata. A text-based query for gene name(s) does not work well on datasets wherein the vast majority of their content is numeric. To overcome this barrier, we have developed Omicseq, a novel web-based platform that facilitates the easy interrogation of omics datasets holistically to improve 'findability' of relevant data. The core component of Omicseq is trackRank, a novel algorithm for ranking omics datasets that fully uses the numerical content of the dataset to determine relevance to the query entity. The Omicseq system is supported by a scalable and elastic, NoSQL database that hosts a large collection of processed omics datasets. In the front end, a simple, web-based interface allows users to enter queries and instantly receive search results as a list of ranked datasets deemed to be the most relevant. Omicseq is freely available at http://www.omicseq.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Quantifying uncertainty in observational rainfall datasets

    Science.gov (United States)

    Lennard, Chris; Dosio, Alessandro; Nikulin, Grigory; Pinto, Izidine; Seid, Hussen

    2015-04-01

    The CO-ordinated Regional Downscaling Experiment (CORDEX) has to date seen the publication of at least ten journal papers that examine the African domain during 2012 and 2013. Five of these papers consider Africa generally (Nikulin et al. 2012, Kim et al. 2013, Hernandes-Dias et al. 2013, Laprise et al. 2013, Panitz et al. 2013) and five have regional foci: Tramblay et al. (2013) on Northern Africa, Mariotti et al. (2014) and Gbobaniyi et al. (2013) on West Africa, Endris et al. (2013) on East Africa and Kalagnoumou et al. (2013) on southern Africa. There are also a further three papers that the authors know about under review. These papers all use observed rainfall and/or temperature data to evaluate/validate the regional model output and often proceed to assess projected changes in these variables due to climate change in the context of these observations. The most popular reference rainfall data used are the CRU, GPCP, GPCC, TRMM and UDEL datasets. However, as Kalagnoumou et al. (2013) point out, there are many other rainfall datasets available for consideration, for example, CMORPH, FEWS, TAMSAT & RIANNAA, TAMORA and the WATCH & WATCH-DEI data. They, with others (Nikulin et al. 2012, Sylla et al. 2012), show that the observed datasets can have a very wide spread at a particular space-time coordinate. As more ground, space and reanalysis-based rainfall products become available, all of which use different methods to produce precipitation data, the selection of reference data is becoming an important factor in model evaluation. A number of factors can contribute to uncertainty in the reliability and validity of the datasets, such as radiance conversion algorithms, the quantity and quality of available station data, interpolation techniques and blending methods used to combine satellite and gauge-based products. However, to date no comprehensive study has been performed to evaluate the uncertainty in these observational datasets. We assess 18 gridded

  3. Mechanical control over valley magnetotransport in strained graphene

    Energy Technology Data Exchange (ETDEWEB)

    Ma, Ning, E-mail: maning@stu.xjtu.edu.cn [Department of Physics, MOE Key Laboratory of Advanced Transducers and Intelligent Control System, Taiyuan University of Technology, Taiyuan 030024 (China); Department of Applied Physics, MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, Xi'an Jiaotong University, Xi'an 710049 (China); Zhang, Shengli, E-mail: zhangsl@mail.xjtu.edu.cn [Department of Applied Physics, MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, Xi'an Jiaotong University, Xi'an 710049 (China); Liu, Daqing, E-mail: liudq@cczu.edu.cn [School of Mathematics and Physics, Changzhou University, Changzhou 213164 (China)

    2016-05-06

    Recent experiments report that the graphene exhibits Landau levels (LLs) that form in the presence of a uniform strain pseudomagnetic field with magnitudes up to hundreds of tesla. We further reveal that the strain removes the valley degeneracy in LLs, and leads to a significant valley polarization with inversion symmetry broken. This accordingly gives rise to the well separated valley Hall plateaus and Shubnikov–de Haas oscillations. These effects are absent in strainless graphene, and can be used to generate and detect valley polarization by mechanical means, forming the basis for the new paradigm “valleytronics” applications. - Highlights: • We explore the mechanical strain effects on the valley magnetotransport in graphene. • We analytically derive the dc collisional and Hall conductivities under strain. • The strain removes the valley degeneracy in Landau levels. • The strain causes a significant valley polarization with inversion symmetry broken. • The strain leads to the well separated valley Hall and Shubnikov–de Haas effects.

  4. Strain distributions and their influence on electronic structures of WSe2–MoS2 laterally strained heterojunctions

    KAUST Repository

    Zhang, Chendong

    2018-01-12

    Monolayer transition metal dichalcogenide heterojunctions, including vertical and lateral p–n junctions, have attracted considerable attention due to their potential applications in electronics and optoelectronics. Lattice-misfit strain in atomically abrupt lateral heterojunctions, such as WSe2–MoS2, offers a new band-engineering strategy for tailoring their electronic properties. However, this approach requires an understanding of the strain distribution and its effect on band alignment. Here, we study a WSe2–MoS2 lateral heterojunction using scanning tunnelling microscopy and image its moiré pattern to map the full two-dimensional strain tensor with high spatial resolution. Using scanning tunnelling spectroscopy, we measure both the strain and the band alignment of the WSe2–MoS2 lateral heterojunction. We find that the misfit strain induces type II to type I band alignment transformation. Scanning transmission electron microscopy reveals the dislocations at the interface that partially relieve the strain. Finally, we observe a distinctive electronic structure at the interface due to hetero-bonding.

  5. Strain distributions and their influence on electronic structures of WSe2–MoS2 laterally strained heterojunctions

    KAUST Repository

    Zhang, Chendong; Li, Ming-yang; Tersoff, Jerry; Han, Yimo; Su, Yushan; Li, Lain-Jong; Muller, David A.; Shih, Chih-Kang

    2018-01-01

    Monolayer transition metal dichalcogenide heterojunctions, including vertical and lateral p–n junctions, have attracted considerable attention due to their potential applications in electronics and optoelectronics. Lattice-misfit strain in atomically abrupt lateral heterojunctions, such as WSe2–MoS2, offers a new band-engineering strategy for tailoring their electronic properties. However, this approach requires an understanding of the strain distribution and its effect on band alignment. Here, we study a WSe2–MoS2 lateral heterojunction using scanning tunnelling microscopy and image its moiré pattern to map the full two-dimensional strain tensor with high spatial resolution. Using scanning tunnelling spectroscopy, we measure both the strain and the band alignment of the WSe2–MoS2 lateral heterojunction. We find that the misfit strain induces type II to type I band alignment transformation. Scanning transmission electron microscopy reveals the dislocations at the interface that partially relieve the strain. Finally, we observe a distinctive electronic structure at the interface due to hetero-bonding.

  6. Chemical Profile of Monascus ruber Strains

    Directory of Open Access Journals (Sweden)

    Ahamed M. Moharram

    2012-01-01

    The chemical profile of Monascus ruber strains has been studied using gas chromatography-mass spectrometry (GC/MS) analysis. The colour intensity of the red pigment and secondary metabolic products of two M. ruber strains (AUMC 4066 and AUMC 5705) cultivated on ten different media were also studied. Metabolic products can be classified into four categories: anticholesterol, anticancer, food colouring, and essential fatty acids necessary for human health. Using GC/MS, the following 88 metabolic products were detected: butyric acid and its derivatives (25 products), other fatty acids and their derivatives (19 products), pyran and its derivatives (22 products) and other metabolites (22 products). Among these, 32 metabolites were specific for the AUMC 4066 strain and 34 for the AUMC 5705 strain, whereas 22 metabolites were produced by both strains on different tested substrates. Production of some metabolites depended on the substrate used. A high number of metabolites was recorded in the red pigment extract obtained from both strains grown on malt broth and malt agar. Also, 42 aroma compounds were recorded (4 alcohols, 2 benzaldehydes, 27 esters, 3 lactones, 1 phenol, 1 terpenoid, 3 thiol compounds and acetate-3-mercapto butyric acid). Thin layer chromatography and GC/MS analyses revealed no mycotoxin citrinin in any media used for the growth of the two M. ruber strains.

  7. Atlantic small-mammal: a dataset of communities of rodents and marsupials of the Atlantic forests of South America.

    Science.gov (United States)

    Bovendorp, Ricardo S; Villar, Nacho; de Abreu-Junior, Edson F; Bello, Carolina; Regolin, André L; Percequillo, Alexandre R; Galetti, Mauro

    2017-08-01

    The contribution of small mammal ecology to the understanding of macroecological patterns of biodiversity, population dynamics, and community assembly has been hindered by the absence of large datasets of small mammal communities from tropical regions. Here we compile the largest dataset of inventories of small mammal communities for the Neotropical region. The dataset reviews small mammal communities from the Atlantic forest of South America, one of the regions with the highest diversity of small mammals and a global biodiversity hotspot, though currently covering less than 12% of its original area due to anthropogenic pressures. The dataset comprises 136 references from 300 locations covering seven vegetation types of tropical and subtropical Atlantic forests of South America, and presents data on species composition, richness, and relative abundance (captures/trap-nights). One paper was published more than 70 yr ago, but 80% of them were published after 2000. The dataset comprises 53,518 individuals of 124 species of small mammals, including 30 species of marsupials and 94 species of rodents. Species richness averaged 8.2 species (1-21) per site. Only two species occurred in more than 50% of the sites (the common opossum, Didelphis aurita, and the black-footed pygmy rice rat, Oligoryzomys nigripes). Mean species abundance varied 430-fold, from 4.3 to 0.01 individuals/trap-night. The dataset also revealed a hyper-dominance of 22 species that comprised 78.29% of all individuals captured, with only seven species representing 44% of all captures. The information contained in this dataset can be applied in the study of macroecological patterns of biodiversity, communities, and populations, but also to evaluate the ecological consequences of fragmentation and defaunation, and predict disease outbreaks, trophic interactions and community dynamics in this biodiversity hotspot. © 2017 by the Ecological Society of America.

  8. Turkey Run Landfill Emissions Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — Landfill emissions measurements for the Turkey Run landfill in Georgia. This dataset is associated with the following publication: De la Cruz, F., R. Green, G....

  9. Trend and variability in a new, reconstructed streamflow dataset for West and Central Africa, and climatic interactions, 1950-2005

    Science.gov (United States)

    Sidibe, Moussa; Dieppois, Bastien; Mahé, Gil; Paturel, Jean-Emmanuel; Amoussou, Ernest; Anifowose, Babatunde; Lawler, Damian

    2018-06-01

    Over recent decades, regions of West and Central Africa have experienced different and significant changes in climatic patterns, which have significantly impacted hydrological regimes. Such impacts, however, are not fully understood at the regional scale, largely because of scarce hydroclimatic data. Therefore, the aim of this study is to (a) assemble a new, robust, reconstructed streamflow dataset of 152 gauging stations; (b) quantify changes in streamflow over the 1950-2005 period, using these newly reconstructed datasets; (c) significantly reveal trends and variability in streamflow over West and Central Africa based on new reconstructions; and (d) assess the robustness of this dataset by comparing the results with those identified in key climatic drivers (e.g. precipitation and temperature) over the region. Gap filling methods applied to monthly time series (1950-2005) yielded robust results (median Kling-Gupta Efficiency >0.75). The study underlines a good agreement between precipitation and streamflow trends and reveals contrasts between western Africa (negative trends) and Central Africa (positive trends) in the 1950s and 1960s. Homogeneous dry conditions of the 1970s and 1980s, characterized by reduced significant negative trends resulting from quasi-decadal modulations of the trend, are replaced by wetter conditions in the recent period (1993-2005). The effect of this rainfall recovery (which extends to West and Central Africa) on increased river flows is further amplified by land use change in some Sahelian basins. This is partially offset, however, by higher potential evapotranspiration rates over parts of Niger and Nigeria. Crucially, the new reconstructed streamflow datasets presented here will be available for both the scientific community and water resource managers.
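
    The Kling-Gupta Efficiency quoted above can be illustrated with a short sketch. The abstract does not say which KGE variant was used, so this assumes the original 2009 formulation, KGE = 1 - sqrt((r - 1)^2 + (alpha - 1)^2 + (beta - 1)^2), where r is the linear correlation, alpha the ratio of standard deviations and beta the ratio of means (simulated over observed). The function name is hypothetical.

```python
import math


def kling_gupta_efficiency(simulated, observed):
    """KGE in its original 2009 form (an ASSUMPTION; the study may use a variant).

    Returns 1.0 for a perfect match. Requires non-constant series and a
    non-zero observed mean, otherwise a division by zero occurs.
    """
    n = len(observed)
    if n != len(simulated) or n < 2:
        raise ValueError("series must have equal length of at least 2")

    mean_s = sum(simulated) / n
    mean_o = sum(observed) / n
    # Population standard deviations and covariance.
    sd_s = math.sqrt(sum((s - mean_s) ** 2 for s in simulated) / n)
    sd_o = math.sqrt(sum((o - mean_o) ** 2 for o in observed) / n)
    cov = sum((s - mean_s) * (o - mean_o)
              for s, o in zip(simulated, observed)) / n

    r = cov / (sd_s * sd_o)   # correlation component
    alpha = sd_s / sd_o       # variability ratio
    beta = mean_s / mean_o    # bias ratio
    return 1.0 - math.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)
```

    Under this definition, a gap-filled series that is perfectly correlated with the observations but twice as large has r = 1, alpha = 2 and beta = 2, so its KGE is 1 - sqrt(2), well below the 0.75 median reported.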

  10. Topic modeling for cluster analysis of large biological and medical datasets.

    Science.gov (United States)

    Zhao, Weizhong; Zou, Wen; Chen, James J

    2014-01-01

    The big data moniker is nowhere better deserved than to describe the ever-increasing prodigiousness and complexity of biological and medical datasets. New methods are needed to generate and test hypotheses, foster biological interpretation, and build validated predictors. Although multivariate techniques such as cluster analysis may allow researchers to identify groups, or clusters, of related variables, the accuracy and effectiveness of traditional clustering methods diminish for large and hyper-dimensional datasets. Topic modeling is an active research field in machine learning and has been mainly used as an analytical tool to structure large textual corpora for data mining. Its ability to reduce high dimensionality to a small number of latent variables makes it suitable as a means for clustering or overcoming clustering difficulties in large biological and medical datasets. In this study, three topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, are proposed and tested on the cluster analysis of three large datasets: Salmonella pulsed-field gel electrophoresis (PFGE) dataset, lung cancer dataset, and breast cancer dataset, which represent various types of large biological or medical datasets. All three methods are shown to improve the efficacy/effectiveness of clustering results on the three datasets in comparison to traditional methods. A preferable cluster analysis method emerged for each of the three datasets on the basis of replicating known biological truths. Topic modeling could be advantageously applied to the large datasets of biological or medical research. The three proposed topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, yield clustering improvements for the three different data types. Clusters more efficaciously represent truthful groupings and subgroupings in the data than traditional methods, suggesting
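
    The abstract names "highest probable topic assignment" without defining it. One plausible reading, sketched here as an assumption rather than as the authors' procedure, is that each sample is placed in the cluster of the topic with the largest probability in its fitted sample-topic distribution. The function name and the input layout (one list of topic probabilities per sample) are hypothetical, and the topic model itself is assumed to have been fitted elsewhere.

```python
def assign_by_highest_topic(sample_topic_probs):
    """Map each sample to the index of its most probable topic.

    ASSUMPTION: `sample_topic_probs` is a list of rows, one per sample,
    each row holding that sample's topic probabilities from an already
    fitted topic model. Ties resolve to the lowest topic index.
    """
    return [max(range(len(row)), key=row.__getitem__)
            for row in sample_topic_probs]


def group_by_cluster(labels):
    """Collect sample indices per cluster label for inspection."""
    clusters = {}
    for sample_index, label in enumerate(labels):
        clusters.setdefault(label, []).append(sample_index)
    return clusters
```

    The appeal of this reading is that the number of clusters equals the number of topics, so the dimensionality reduction and the clustering happen in a single step.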

  11. An Analysis of the GTZAN Music Genre Dataset

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2012-01-01

    Most research in automatic music genre recognition has used the dataset assembled by Tzanetakis et al. in 2001. The composition and integrity of this dataset, however, has never been formally analyzed. For the first time, we provide an analysis of its composition, and create a machine...

  12. Dataset definition for CMS operations and physics analyses

    Science.gov (United States)

    Franzoni, Giovanni; Compact Muon Solenoid Collaboration

    2016-04-01

    Data recorded at the CMS experiment are funnelled into streams, integrated in the HLT menu, and further organised in a hierarchical structure of primary datasets and secondary datasets/dedicated skims. Datasets are defined according to the final-state particles reconstructed by the high level trigger, the data format and the use case (physics analysis, alignment and calibration, performance studies). During the first LHC run, new workflows have been added to this canonical scheme, to best exploit the flexibility of the CMS trigger and data acquisition systems. The concepts of data parking and data scouting have been introduced to extend the physics reach of CMS, offering the opportunity of defining physics triggers with extremely loose selections (e.g. dijet resonance trigger collecting data at 1 kHz). In this presentation, we review the evolution of the dataset definition during the LHC run I, and we discuss the plans for the run II.

  13. Dataset definition for CMS operations and physics analyses

    CERN Document Server

    AUTHOR|(CDS)2051291

    2016-01-01

    Data recorded at the CMS experiment are funnelled into streams, integrated in the HLT menu, and further organised in a hierarchical structure of primary datasets, secondary datasets, and dedicated skims. Datasets are defined according to the final-state particles reconstructed by the high level trigger, the data format and the use case (physics analysis, alignment and calibration, performance studies). During the first LHC run, new workflows have been added to this canonical scheme, to best exploit the flexibility of the CMS trigger and data acquisition systems. The concepts of data parking and data scouting have been introduced to extend the physics reach of CMS, offering the opportunity of defining physics triggers with extremely loose selections (e.g. dijet resonance trigger collecting data at 1 kHz). In this presentation, we review the evolution of the dataset definition during the first run, and we discuss the plans for the second LHC run.

  14. Dataset of NRDA emission data

    Data.gov (United States)

    U.S. Environmental Protection Agency — Emissions data from open air oil burns. This dataset is associated with the following publication: Gullett, B., J. Aurell, A. Holder, B. Mitchell, D. Greenwell, M....

  15. Medical Image Data and Datasets in the Era of Machine Learning-Whitepaper from the 2016 C-MIMI Meeting Dataset Session.

    Science.gov (United States)

    Kohli, Marc D; Summers, Ronald M; Geis, J Raymond

    2017-08-01

    At the first annual Conference on Machine Intelligence in Medical Imaging (C-MIMI), held in September 2016, a conference session on medical image data and datasets for machine learning identified multiple issues. The common theme from attendees was that everyone participating in medical image evaluation with machine learning is data starved. There is an urgent need to find better ways to collect, annotate, and reuse medical imaging data. Unique domain issues with medical image datasets require further study, development, and dissemination of best practices and standards, and a coordinated effort among medical imaging domain experts, medical imaging informaticists, government and industry data scientists, and interested commercial, academic, and government entities. High-level attributes of reusable medical image datasets suitable to train, test, validate, verify, and regulate ML products should be better described. NIH and other government agencies should promote and, where applicable, enforce, access to medical image datasets. We should improve communication among medical imaging domain experts, medical imaging informaticists, academic clinical and basic science researchers, government and industry data scientists, and interested commercial entities.

  16. Discovery and Reuse of Open Datasets: An Exploratory Study

    Directory of Open Access Journals (Sweden)

    Sara

    2016-07-01

    Objective: This article analyzes twenty cited or downloaded datasets and the repositories that house them, in order to produce insights that can be used by academic libraries to encourage discovery and reuse of research data in institutional repositories. Methods: Using Thomson Reuters’ Data Citation Index and repository download statistics, we identified twenty cited/downloaded datasets. We documented the characteristics of the cited/downloaded datasets and their corresponding repositories in a self-designed rubric. The rubric includes six major categories: basic information; funding agency and journal information; linking and sharing; factors to encourage reuse; repository characteristics; and data description. Results: Our small-scale study suggests that cited/downloaded datasets generally comply with basic recommendations for facilitating reuse: data are documented well; formatted for use with a variety of software; and shared in established, open access repositories. Three significant factors also appear to contribute to dataset discovery: publishing in discipline-specific repositories; indexing in more than one location on the web; and using persistent identifiers. The cited/downloaded datasets in our analysis came from a few specific disciplines, and tended to be funded by agencies with data publication mandates. Conclusions: The results of this exploratory research provide insights that can inform academic librarians as they work to encourage discovery and reuse of institutional datasets. Our analysis also suggests areas in which academic librarians can target open data advocacy in their communities in order to begin to build open data success stories that will fuel future advocacy efforts.

  17. Visualization of conserved structures by fusing highly variable datasets.

    Science.gov (United States)

    Silverstein, Jonathan C; Chhadia, Ankur; Dech, Fred

    2002-01-01

    Skill, effort, and time are required to identify and visualize anatomic structures in three dimensions from radiological data. Fundamentally, automating these processes requires a technique that uses symbolic information not in the dynamic range of the voxel data. We were developing such a technique based on mutual information for automatic multi-modality image fusion (MIAMI Fuse, University of Michigan). This system previously demonstrated facility at fusing one voxel dataset with integrated symbolic structure information to a CT dataset (different scale and resolution) from the same person. The next step of development of our technique was aimed at accommodating the variability of anatomy from patient to patient by using warping to fuse our standard dataset to arbitrary patient CT datasets. A standard symbolic information dataset was created from the full color Visible Human Female by segmenting the liver parenchyma, portal veins, and hepatic veins and overwriting each set of voxels with a fixed color. Two arbitrarily selected patient CT scans of the abdomen were used for reference datasets. We used the warping functions in MIAMI Fuse to align the standard structure data to each patient scan. The key to successful fusion was the focused use of multiple warping control points that place themselves around the structure of interest automatically. The user assigns only a few initial control points to align the scans. Fusions 1 and 2 transformed the atlas with 27 points around the liver to CT1 and CT2, respectively. Fusion 3 transformed the atlas with 45 control points around the liver to CT1, and Fusion 4 transformed the atlas with 5 control points around the portal vein. The CT dataset is augmented with the transformed standard structure dataset, such that the warped structure masks are visualized in combination with the original patient dataset. This combined volume visualization is then rendered interactively in stereo on the ImmersaDesk in an immersive Virtual
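
    The abstract does not describe how MIAMI Fuse computes its similarity score, so the following is only a textbook sketch of mutual information between two already aligned, already binned voxel intensity lists. The function name is hypothetical; a registration method of this kind would search for the warp that maximises such a score.

```python
import math
from collections import Counter


def mutual_information(voxels_a, voxels_b):
    """Mutual information (in bits) between two equal-length intensity lists.

    ASSUMPTION: intensities are discrete (pre-binned) and position i in
    both lists refers to the same spatial location after warping. This is
    the standard definition, not necessarily MIAMI Fuse's implementation.
    """
    n = len(voxels_a)
    if n == 0 or n != len(voxels_b):
        raise ValueError("inputs must be non-empty and of equal length")

    joint = Counter(zip(voxels_a, voxels_b))   # joint histogram
    marg_a = Counter(voxels_a)                 # marginal histograms
    marg_b = Counter(voxels_b)

    mi = 0.0
    for (x, y), count in joint.items():
        p_xy = count / n
        # p_xy / (p_x * p_y) simplifies to count * n / (count_x * count_y).
        mi += p_xy * math.log2(count * n / (marg_a[x] * marg_b[y]))
    return mi
```

    Because the score depends only on how predictable one image's intensities are from the other's, it does not require the two datasets to share an intensity scale, which is why it suits fusing a colour-coded structure atlas with CT data.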

  18. An Annotated Dataset of 14 Cardiac MR Images

    DEFF Research Database (Denmark)

    Stegmann, Mikkel Bille

    2002-01-01

    This note describes a dataset consisting of 14 annotated cardiac MR images. Points of correspondence are placed on each image at the left ventricle (LV). As such, the dataset can be readily used for building statistical models of shape. Further, format specifications and terms of use are given....

  19. Axial strain in GaAs/InAs core-shell nanowires

    Energy Technology Data Exchange (ETDEWEB)

    Biermanns, Andreas; Pietsch, Ullrich [Universitaet Siegen, Festkoerperphysik, 57068 Siegen (Germany); Rieger, Torsten; Gruetzmacher, Detlev; Ion Lepsa, Mihail [Peter Gruenberg Institute (PGI-9), Forschungszentrum, 52425 Juelich (Germany); JARA-Fundamentals of Future Information Technology, 52425 Juelich (Germany); Bussone, Genziana [Universitaet Siegen, Festkoerperphysik, 57068 Siegen (Germany); ESRF, 6 rue Jules Horowitz, BP220, F-38043 Grenoble Cedex (France)

    2013-01-28

    We study the axial strain relaxation in GaAs/InAs core-shell nanowire heterostructures grown by molecular beam epitaxy. Besides a gradual strain relaxation of the shell material, we find a significant strain in the GaAs core, increasing with shell thickness. This strain is explained by a saturation of the dislocation density at the core-shell interface. Independent measurements of core and shell lattice parameters by x-ray diffraction reveal a relaxation of 93% in a 35 nm thick InAs shell surrounding cores of 80 nm diameter. The compressive strain of -0.5% compared to bulk InAs is accompanied by a tensile strain up to 0.9% in the GaAs core.
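
    As a rough plausibility check on the figures quoted above, the residual shell strain can be estimated from the degree of relaxation. The sketch below is not the authors' analysis. It assumes textbook room-temperature cubic (zinc-blende) bulk lattice constants for GaAs and InAs, treats the core as unstrained, and uses hypothetical function names.

```python
# Assumed textbook bulk lattice constants in angstroms (not given in the abstract).
A_GAAS = 5.6533
A_INAS = 6.0583


def lattice_mismatch(a_core: float, a_shell_bulk: float) -> float:
    """Relative lattice mismatch of the shell material with respect to the core."""
    return (a_shell_bulk - a_core) / a_core


def residual_shell_strain(a_core: float, a_shell_bulk: float, relaxation: float) -> float:
    """Strain of a partially relaxed shell relative to its own bulk lattice.

    relaxation = 0 means the shell is fully strained to the core lattice,
    relaxation = 1 means the shell has reached its bulk lattice constant.
    """
    a_shell = a_core + relaxation * (a_shell_bulk - a_core)
    return (a_shell - a_shell_bulk) / a_shell_bulk


if __name__ == "__main__":
    mismatch = lattice_mismatch(A_GAAS, A_INAS)
    strain = residual_shell_strain(A_GAAS, A_INAS, relaxation=0.93)
    print(f"mismatch: {mismatch:.2%}, residual shell strain: {strain:.2%}")
```

    Under these assumptions a 93% relaxed shell retains a compressive strain of roughly half a percent, the same order as the value reported in the abstract. Accounting for the tensile strain in the core would shift the estimate slightly.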

  20. Dataset - Adviesregel PPL 2010

    NARCIS (Netherlands)

    Evert, van F.K.; Schans, van der D.A.; Geel, van W.C.A.; Slabbekoorn, J.J.; Booij, R.; Jukema, J.N.; Meurs, E.J.J.; Uenk, D.

    2011-01-01

    This dataset contains experimental data from a number of field experiments with potato in The Netherlands (Van Evert et al., 2011). The data are presented as an SQL dump of a PostgreSQL database (version 8.4.4). An outline of the entity-relationship diagram of the database is given in an

  1. Tension in the recent Type Ia supernovae datasets

    International Nuclear Information System (INIS)

    Wei, Hao

    2010-01-01

    In the present work, we investigate the tension in the recent Type Ia supernovae (SNIa) datasets Constitution and Union. We show that they are in tension not only with the observations of the cosmic microwave background (CMB) anisotropy and the baryon acoustic oscillations (BAO), but also with other SNIa datasets such as Davis and SNLS. Then, we find the main sources responsible for the tension. Further, we make this more robust by employing the method of random truncation. Based on the results of this work, we suggest two truncated versions of the Union and Constitution datasets, namely the UnionT and ConstitutionT SNIa samples, whose behaviors are more regular.

  2. Technical note: An inorganic water chemistry dataset (1972–2011 ...

    African Journals Online (AJOL)

    A national dataset of inorganic chemical data of surface waters (rivers, lakes, and dams) in South Africa is presented and made freely available. The dataset comprises more than 500 000 complete water analyses from 1972 up to 2011, collected from more than 2 000 sample monitoring stations in South Africa. The dataset ...

  3. Minimum datasets to establish a CAR-mediated mode of action for rodent liver tumors.

    Science.gov (United States)

    Peffer, Richard C; LeBaron, Matthew J; Battalora, Michael; Bomann, Werner H; Werner, Christoph; Aggarwal, Manoj; Rowe, Rocky R; Tinwell, Helen

    2018-07-01

    Methods for investigating the Mode of Action (MoA) for rodent liver tumors via constitutive androstane receptor (CAR) activation are outlined here, based on current scientific knowledge about CAR and feedback from regulatory agencies globally. The key events (i.e., CAR activation, altered gene expression, cell proliferation, altered foci and increased adenomas/carcinomas) can be demonstrated by measuring a combination of key events and associative events that are markers for the key events. For crop protection products, a primary dataset typically should include a short-term study in the species/strain that showed the tumor response at dose levels that bracket the tumorigenic and non-tumorigenic dose levels. The dataset may vary depending on the species and the test compound. As examples, Case Studies with nitrapyrin (in mice) and metofluthrin (in rats) are described. Based on qualitative differences between the species, the key events leading to tumors in mice or rats by this MoA are not operative in humans. In the future, newer approaches such as a CAR biomarker signature approach and/or in vitro CAR3 reporter assays for mouse, rat and human CAR may eventually be used to demonstrate a CAR MoA is operative, without the need for extensive additional studies in laboratory animals. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  4. Wind and wave dataset for Matara, Sri Lanka

    Science.gov (United States)

    Luo, Yao; Wang, Dongxiao; Priyadarshana Gamage, Tilak; Zhou, Fenghua; Madusanka Widanage, Charith; Liu, Taiwei

    2018-01-01

    We present a continuous in situ hydro-meteorology observational dataset from a set of instruments first deployed in December 2012 in the south of Sri Lanka, facing toward the north Indian Ocean. In these waters, simultaneous records of wind and wave data are sparse due to difficulties in deploying measurement instruments, although the area hosts one of the busiest shipping lanes in the world. This study describes the survey, deployment, and measurements of wind and waves, with the aim of offering future users of the dataset the most comprehensive and as much information as possible. This dataset advances our understanding of the nearshore hydrodynamic processes and wave climate, including sea waves and swells, in the north Indian Ocean. Moreover, it is a valuable resource for ocean model parameterization and validation. The archived dataset (Table 1) is examined in detail, including wave data at two locations with water depths of 20 and 10 m comprising synchronous time series of wind, ocean astronomical tide, air pressure, etc. In addition, we use these wave observations to evaluate the ERA-Interim reanalysis product. Based on Buoy 2 data, the swells are the main component of waves year-round, although monsoons can markedly alter the proportion between swell and wind sea. The dataset (Luo et al., 2017) is publicly available from Science Data Bank (https://doi.org/10.11922/sciencedb.447).

  5. Wind and wave dataset for Matara, Sri Lanka

    Directory of Open Access Journals (Sweden)

    Y. Luo

    2018-01-01

    We present a continuous in situ hydro-meteorology observational dataset from a set of instruments first deployed in December 2012 in the south of Sri Lanka, facing toward the north Indian Ocean. In these waters, simultaneous records of wind and wave data are sparse due to difficulties in deploying measurement instruments, although the area hosts one of the busiest shipping lanes in the world. This study describes the survey, deployment, and measurements of wind and waves, with the aim of offering future users of the dataset the most comprehensive and as much information as possible. This dataset advances our understanding of the nearshore hydrodynamic processes and wave climate, including sea waves and swells, in the north Indian Ocean. Moreover, it is a valuable resource for ocean model parameterization and validation. The archived dataset (Table 1) is examined in detail, including wave data at two locations with water depths of 20 and 10 m comprising synchronous time series of wind, ocean astronomical tide, air pressure, etc. In addition, we use these wave observations to evaluate the ERA-Interim reanalysis product. Based on Buoy 2 data, the swells are the main component of waves year-round, although monsoons can markedly alter the proportion between swell and wind sea. The dataset (Luo et al., 2017) is publicly available from Science Data Bank (https://doi.org/10.11922/sciencedb.447).

  6. Nuclear magnetic resonance probe head design for precision strain control

    International Nuclear Information System (INIS)

    Kissikov, T.; Sarkar, R.; Bush, B. T.; Lawson, M.; Canfield, P. C.; Curro, N. J.

    2017-01-01

    Here, we present the design and construction of an NMR probe to investigate single crystals under strain at cryogenic temperatures. The probe head incorporates a piezoelectric-based apparatus from Razorbill Instruments that enables both compressive and tensile strain tuning up to strain values on the order of 0.3% with a precision of 0.001%. 75As NMR in BaFe2As2 reveals large changes to the electric field gradient and indicates that the strain is homogeneous to within 16% over the volume of the NMR coil.

  7. Cross-Comparison of Leaching Strains Isolated from Two Different Regions: Chambishi and Dexing Copper Mines

    Directory of Open Access Journals (Sweden)

    Baba Ngom

    2014-01-01

    A cross-comparison of six strains isolated from two different regions, Chambishi copper mine (Zambia, Africa) and Dexing copper mine (China, Asia), was conducted to study the leaching efficiency of low grade copper ores. The strains belong to the three major species often encountered in bioleaching of copper sulfide ores under mesophilic conditions: Acidithiobacillus ferrooxidans, Acidithiobacillus thiooxidans, and Leptospirillum ferriphilum. Prior to their study in bioleaching, the different strains were characterized and compared at physiological level. The results revealed that, except for copper tolerance, strains within species presented almost similar physiological traits with slight advantages of Chambishi strains. However, in terms of leaching efficiency, native strains always achieved higher cell density and greater iron and copper extraction rates than the foreign microorganisms. In addition, microbial community analysis revealed that the different mixed cultures shared almost the same profile, and At. ferrooxidans strains always outcompeted the other strains.

  8. Heuristics for Relevancy Ranking of Earth Dataset Search Results

    Science.gov (United States)

    Lynnes, Christopher; Quinn, Patrick; Norton, James

    2016-01-01

    As the variety of Earth science datasets increases, science researchers find it more challenging to discover and select the datasets that best fit their needs. The most common way for search providers to address this problem is to rank the datasets returned for a query by their likely relevance to the user. Large web page search engines typically use text matching supplemented with reverse link counts, semantic annotations and user intent modeling. However, this produces uneven results when applied to dataset metadata records simply externalized as a web page. Fortunately, data and search providers have decades of experience in serving data user communities, allowing them to form heuristics that leverage the structure in the metadata together with knowledge about the user community. Some of these heuristics include specific ways of matching the user input to the essential measurements in the dataset and determining overlaps of time range and spatial areas. Heuristics based on the novelty of the datasets can prioritize later, better versions of data over similar predecessors. And knowledge of how different user types and communities use data can be brought to bear in cases where characteristics of the user (discipline, expertise) or their intent (applications, research) can be divined. The Earth Observing System Data and Information System has begun implementing some of these heuristics in the relevancy algorithm of its Common Metadata Repository search engine.
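
    One of the heuristics mentioned above, scoring datasets by how much of the requested time range they cover, might be sketched as follows. This is an illustrative sketch only and not the Common Metadata Repository algorithm. The function names, the dictionary keys, and the choice of "fraction of the query range covered" as the score are all assumptions.

```python
from datetime import date


def temporal_overlap_fraction(query_start: date, query_end: date,
                              data_start: date, data_end: date) -> float:
    """Fraction of the query time range that the dataset's time range covers."""
    query_days = (query_end - query_start).days
    if query_days <= 0:
        raise ValueError("query range must have positive length")
    overlap_start = max(query_start, data_start)
    overlap_end = min(query_end, data_end)
    # A negative difference means the ranges do not intersect at all.
    overlap_days = max(0, (overlap_end - overlap_start).days)
    return overlap_days / query_days


def rank_by_temporal_overlap(query_range, datasets):
    """Sort hypothetical dataset records (dicts with 'start' and 'end') by coverage."""
    q_start, q_end = query_range
    return sorted(
        datasets,
        key=lambda d: temporal_overlap_fraction(q_start, q_end, d["start"], d["end"]),
        reverse=True,
    )


if __name__ == "__main__":
    query = (date(2010, 1, 1), date(2010, 1, 11))
    records = [
        {"name": "none", "start": date(2011, 1, 1), "end": date(2011, 12, 31)},
        {"name": "half", "start": date(2010, 1, 6), "end": date(2010, 2, 1)},
        {"name": "full", "start": date(2009, 1, 1), "end": date(2012, 1, 1)},
    ]
    for record in rank_by_temporal_overlap(query, records):
        print(record["name"])
```

    A production ranker would presumably combine such a score with spatial overlap, measurement matching and the other signals the abstract lists; the weighting between them is not specified in the source.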

  9. Twenty-eight divergent polysaccharide loci specifying within and amongst strain capsule diversity in three strains of Bacteroides fragilis

    DEFF Research Database (Denmark)

    Patrick, S.; Blakely, G.W.; Houston, S.

    2010-01-01

    including a putative Wzx flippase and Wzy polymerase, was confirmed in all three strains, despite a lack of cross-reactivity between NCTC 9343 and 638R surface polysaccharide-specific antibodies by immunolabelling and microscopy. Genomic comparisons revealed an exceptional level of polysaccharide...... biosynthesis locus diversity. Of the 10 divergent polysaccharide associated loci apparent in each strain, none are similar between NCTC9343 and 638R. YCH46 shares one locus with NCTC9343, confirmed by MAb labelling, and a second different locus with 638R, making a total of 28 divergent polysaccharide...... restriction and modification systems that act to prevent acquisition of foreign DNA. The level of amongst strain diversity in polysaccharide biosynthesis loci is unprecedented....

  10. QSAR ligand dataset for modelling mutagenicity, genotoxicity, and rodent carcinogenicity

    Directory of Open Access Journals (Sweden)

    Davy Guan

    2018-04-01

    Five datasets were constructed from ligand and bioassay result data from the literature. These datasets include bioassay results from the Ames mutagenicity assay, Greenscreen GADD-45a-GFP assay, Syrian Hamster Embryo (SHE) assay, and 2 year rat carcinogenicity assay results. These datasets provide information about chemical mutagenicity, genotoxicity and carcinogenicity.

  11. Biochemical and genetical analysis reveal a new clade of biovar 3 Dickeya spp. strains isolated from potato in Europe

    NARCIS (Netherlands)

    Slawiak, M.; Beckhoven, van J.R.C.M.; Speksnijder, A.G.C.L.; Czajkowski, R.L.; Grabe, G.; Wolf, van der J.M.

    2009-01-01

    Sixty-five potato strains of the soft rot-causing plant pathogenic bacterium Dickeya spp., and two strains from hyacinth, were characterised using biochemical assays, REP-PCR genomic finger printing, 16S rDNA and dnaX sequence analysis. These methods were compared with nineteen strains representing

  12. Leuconostoc strains isolated from dairy products: Response against food stress conditions.

    Science.gov (United States)

    D'Angelo, Luisa; Cicotello, Joaquín; Zago, Miriam; Guglielmotti, Daniela; Quiberoni, Andrea; Suárez, Viviana

    2017-09-01

    A systematic study of the intrinsic resistance of 29 strains (26 autochthonous and 3 commercial ones) belonging to the Leuconostoc genus against diverse stress factors (thermal, acidic, alkaline, osmotic and oxidative) commonly present in industrial or conservation processes was carried out. Exhaustive result processing was made by applying one-way ANOVA, Student's t-test, multivariate analysis by Principal Component Analysis (PCA) and Matrix Hierarchical Cluster Analysis. In addition, heat adaptation of 4 strains carefully selected based on previous data analysis was assayed. The strains revealed wide diversity of resistance to stress factors and, in general, a clear relationship between resistance and Leuconostoc species was established. In this sense, the highest resistance was shown by Leuconostoc lactis followed by Leuconostoc mesenteroides strains, while Leuconostoc pseudomesenteroides and Leuconostoc citreum strains revealed the lowest resistance to the stress factors applied. Heat adaptation improved thermal cell survival and resulted in a cross-resistance against the acidic factor. However, all adapted cells showed diminished oxidative resistance. To our knowledge, this is the first study regarding response of Leuconostoc strains against technological stress factors and could establish the basis for the selection of "more robust" strains and propose the possibility of improving their performance during industrial processes. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. The Dataset of Countries at Risk of Electoral Violence

    OpenAIRE

    Birch, Sarah; Muchlinski, David

    2017-01-01

    Electoral violence is increasingly affecting elections around the world, yet researchers have been limited by a paucity of granular data on this phenomenon. This paper introduces and describes a new dataset of electoral violence – the Dataset of Countries at Risk of Electoral Violence (CREV) – that provides measures of 10 different types of electoral violence across 642 elections held around the globe between 1995 and 2013. The paper provides a detailed account of how and why the dataset was ...

  14. Towards interoperable and reproducible QSAR analyses: Exchange of datasets.

    Science.gov (United States)

    Spjuth, Ola; Willighagen, Egon L; Guha, Rajarshi; Eklund, Martin; Wikberg, Jarl Es

    2010-06-30

    QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML) which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. 
This makes it easy to join, extend, combine datasets and hence work collectively, but

  15. Towards interoperable and reproducible QSAR analyses: Exchange of datasets

    Directory of Open Access Journals (Sweden)

    Spjuth Ola

    2010-06-01

    Background: QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. Results: We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML) which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Conclusions: Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. This makes it easy to join

  16. Use of combined microscopic and spectroscopic techniques to reveal interactions between uranium and Microbacterium sp. A9, a strain isolated from the Chernobyl exclusion zone

    Energy Technology Data Exchange (ETDEWEB)

    Theodorakopoulos, Nicolas [CEA, DSV, IBEB, SBVME, LIPM, F-13108 Saint-Paul-lez-Durance (France); CNRS, UMR 7265, F-13108 Saint-Paul-lez-Durance (France); Université d' Aix-Marseille, F-13108 Saint-Paul-lez-Durance (France); IRSN/PRP-ENV/SERIS/L2BT, bat 183, B.P. 3, F-13115 Saint Paul-lez-Durance (France); Chapon, Virginie [CEA, DSV, IBEB, SBVME, LIPM, F-13108 Saint-Paul-lez-Durance (France); CNRS, UMR 7265, F-13108 Saint-Paul-lez-Durance (France); Université d' Aix-Marseille, F-13108 Saint-Paul-lez-Durance (France); Coppin, Fréderic; Floriani, Magali [IRSN/PRP-ENV/SERIS/L2BT, bat 183, B.P. 3, F-13115 Saint Paul-lez-Durance (France); Vercouter, Thomas [CEA, DEN, DANS, DPC SEARS, LANIE, F-91191 Gif-Sur-Yvette Cedex (France); Sergeant, Claire [Univ Bordeaux, CENBG, UMR5797, F-33170 Gradignan (France); CNRS, IN2P3, CENBG, UMR5797, F-33170 Gradignan (France); Camilleri, Virginie [IRSN/PRP-ENV/SERIS/L2BT, bat 183, B.P. 3, F-13115 Saint Paul-lez-Durance (France); Berthomieu, Catherine [CEA, DSV, IBEB, SBVME, LIPM, F-13108 Saint-Paul-lez-Durance (France); CNRS, UMR 7265, F-13108 Saint-Paul-lez-Durance (France); Université d' Aix-Marseille, F-13108 Saint-Paul-lez-Durance (France); Février, Laureline, E-mail: laureline.fevrier@irsn.fr [IRSN/PRP-ENV/SERIS/L2BT, bat 183, B.P. 3, F-13115 Saint Paul-lez-Durance (France)

    2015-03-21

    Highlights: • Microbacterium sp. A9 develops various detoxification mechanisms. • Microbacterium sp. A9 promotes metal efflux from the cells. • Microbacterium sp. A9 releases phosphate to prevent uranium entrance in the cells. • Microbacterium sp. A9 stores U intracellularly as autunite. - Abstract: Although uranium (U) is naturally found in the environment, soil remediation programs will become increasingly important in light of certain human activities. This work aimed to identify U(VI) detoxification mechanisms employed by a bacteria strain isolated from a Chernobyl soil sample, and to distinguish its active from passive mechanisms of interaction. The ability of the Microbacterium sp. A9 strain to remove U(VI) from aqueous solutions at 4 °C and 25 °C was evaluated, as well as its survival capacity upon U(VI) exposure. The subcellular localisation of U was determined by TEM/EDX microscopy, while functional groups involved in the interaction with U were further evaluated by FTIR; finally, the speciation of U was analysed by TRLFS. We have revealed, for the first time, an active mechanism promoting metal efflux from the cells, during the early steps following U(VI) exposure at 25 °C. The Microbacterium sp. A9 strain also stores U intracellularly, as needle-like structures that have been identified as an autunite group mineral. Taken together, our results demonstrate that this strain exhibits a high U(VI) tolerance based on multiple detoxification mechanisms. These findings support the potential role of the genus Microbacterium in the remediation of aqueous environments contaminated with U(VI) under aerobic conditions.

  17. VideoWeb Dataset for Multi-camera Activities and Non-verbal Communication

    Science.gov (United States)

    Denina, Giovanni; Bhanu, Bir; Nguyen, Hoang Thanh; Ding, Chong; Kamal, Ahmed; Ravishankar, Chinya; Roy-Chowdhury, Amit; Ivers, Allen; Varda, Brenda

    Human-activity recognition is one of the most challenging problems in computer vision. Researchers from around the world have tried to solve this problem and have come a long way in recognizing simple motions and atomic activities. As the computer vision community heads toward fully recognizing human activities, a challenging and labeled dataset is needed. To respond to that need, we collected a dataset of realistic scenarios in a multi-camera network environment (VideoWeb) involving multiple persons performing dozens of different repetitive and non-repetitive activities. This chapter describes the details of the dataset. We believe that this VideoWeb Activities dataset is unique and it is one of the most challenging datasets available today. The dataset is publicly available online at http://vwdata.ee.ucr.edu/ along with the data annotation.

  18. Toward computational cumulative biology by combining models of biological datasets.

    Science.gov (United States)

    Faisal, Ali; Peltonen, Jaakko; Georgii, Elisabeth; Rung, Johan; Kaski, Samuel

    2014-01-01

    A main challenge of data-driven sciences is how to make maximal use of the progressively expanding databases of experimental datasets in order to keep research cumulative. We introduce the idea of a modeling-based dataset retrieval engine designed for relating a researcher's experimental dataset to earlier work in the field. The search is (i) data-driven to enable new findings, going beyond the state of the art of keyword searches in annotations, (ii) modeling-driven, to include both biological knowledge and insights learned from data, and (iii) scalable, as it is accomplished without building one unified grand model of all data. Assuming each dataset has been modeled beforehand, by the researchers or automatically by database managers, we apply a rapidly computable and optimizable combination model to decompose a new dataset into contributions from earlier relevant models. By using the data-driven decomposition, we identify a network of interrelated datasets from a large annotated human gene expression atlas. While tissue type and disease were major driving forces for determining relevant datasets, the found relationships were richer, and the model-based search was more accurate than the keyword search; moreover, it recovered biologically meaningful relationships that are not straightforwardly visible from annotations-for instance, between cells in different developmental stages such as thymocytes and T-cells. Data-driven links and citations matched to a large extent; the data-driven links even uncovered corrections to the publication data, as two of the most linked datasets were not highly cited and turned out to have wrong publication entries in the database.

  19. Strained Si engineering for nanoscale MOSFETs

    International Nuclear Information System (INIS)

    Park, Jea-Gun; Lee, Gon-Sub; Kim, Tae-Hyun; Hong, Seuck-Hoon; Kim, Seong-Je; Song, Jin-Hwan; Shim, Tae-Hun

    2006-01-01

    We have revealed a strain relaxation mechanism for strained Si grown on a relaxed SiGe-on-insulator structure fabricated by the bonding, dislocation sink, or condensation method. Strain relaxation for both the bonding and dislocation sink methods was achieved by grading the Ge concentration; in contrast, the relaxation for the condensation method was achieved through Ge atom condensation during oxidation. In addition, we estimated the surface roughness and threading-dislocation pit density for relaxed SiGe layers fabricated by the bonding, dislocation sink, or condensation method. The surface roughness and threading-dislocation pit density for the bonding, dislocation sink, and condensation methods were 2.45, 0.46, and 0.40 nm and 5.0 × 10^3, 9 × 10^3, and 0, respectively. In terms of quality and cost-effectiveness, the condensation method was superior to the bonding and dislocation sink methods for forming strained Si on a relaxed SiGe-on-insulator structure.

  20. A curated transcriptome dataset collection to investigate the functional programming of human hematopoietic cells in early life.

    Science.gov (United States)

    Rahman, Mahbuba; Boughorbel, Sabri; Presnell, Scott; Quinn, Charlie; Cugno, Chiara; Chaussabel, Damien; Marr, Nico

    2016-01-01

    Compendia of large-scale datasets made available in public repositories provide an opportunity to identify and fill gaps in biomedical knowledge. But first, these data need to be made readily accessible to research investigators for interpretation. Here we make available a collection of transcriptome datasets to investigate the functional programming of human hematopoietic cells in early life. Thirty two datasets were retrieved from the NCBI Gene Expression Omnibus (GEO) and loaded in a custom web application called the Gene Expression Browser (GXB), which was designed for interactive query and visualization of integrated large-scale data. Quality control checks were performed. Multiple sample groupings and gene rank lists were created allowing users to reveal age-related differences in transcriptome profiles, changes in the gene expression of neonatal hematopoietic cells to a variety of immune stimulators and modulators, as well as during cell differentiation. Available demographic, clinical, and cell phenotypic information can be overlaid with the gene expression data and used to sort samples. Web links to customized graphical views can be generated and subsequently inserted in manuscripts to report novel findings. GXB also enables browsing of a single gene across projects, thereby providing new perspectives on age- and developmental stage-specific expression of a given gene across the human hematopoietic system. This dataset collection is available at: http://developmentalimmunology.gxbsidra.org/dm3/geneBrowser/list.

  1. 3DSEM: A 3D microscopy dataset

    Directory of Open Access Journals (Sweden)

    Ahmad P. Tafti

    2016-03-01

    The Scanning Electron Microscope (SEM), as a 2D imaging instrument, has been widely used in many scientific disciplines including biological, mechanical, and materials sciences to determine the surface attributes of microscopic objects. However, the SEM micrographs still remain 2D images. To effectively measure and visualize the surface properties, we need to truly restore the 3D shape model from 2D SEM images. Having 3D surfaces would provide anatomic shape of micro-samples which allows for quantitative measurements and informative visualization of the specimens being investigated. The 3DSEM is a dataset for 3D microscopy vision which is freely available at [1] for any academic, educational, and research purposes. The dataset includes both 2D images and 3D reconstructed surfaces of several real microscopic samples. Keywords: 3D microscopy dataset, 3D microscopy vision, 3D SEM surface reconstruction, Scanning Electron Microscope (SEM)

  2. Active Semisupervised Clustering Algorithm with Label Propagation for Imbalanced and Multidensity Datasets

    Directory of Open Access Journals (Sweden)

    Mingwei Leng

    2013-01-01

    The accuracy of most of the existing semisupervised clustering algorithms based on small size of labeled dataset is low when dealing with multidensity and imbalanced datasets, and labeling data is quite expensive and time consuming in many real-world applications. This paper focuses on active data selection and semisupervised clustering algorithm in multidensity and imbalanced datasets and proposes an active semisupervised clustering algorithm. The proposed algorithm uses an active mechanism for data selection to minimize the amount of labeled data, and it utilizes multithreshold to expand labeled datasets on multidensity and imbalanced datasets. Three standard datasets and one synthetic dataset are used to demonstrate the proposed algorithm, and the experimental results show that the proposed semisupervised clustering algorithm has a higher accuracy and a more stable performance in comparison to other clustering and semisupervised clustering algorithms, especially when the datasets are multidensity and imbalanced.

  3. A reanalysis dataset of the South China Sea

    Science.gov (United States)

    Zeng, Xuezhi; Peng, Shiqiu; Li, Zhijin; Qi, Yiquan; Chen, Rongyu

    2014-01-01

    Ocean reanalysis provides a temporally continuous and spatially gridded four-dimensional estimate of the ocean state for a better understanding of the ocean dynamics and its spatial/temporal variability. Here we present a 19-year (1992–2010) high-resolution ocean reanalysis dataset of the upper ocean in the South China Sea (SCS) produced from an ocean data assimilation system. A wide variety of observations, including in-situ temperature/salinity profiles, ship-measured and satellite-derived sea surface temperatures, and sea surface height anomalies from satellite altimetry, are assimilated into the outputs of an ocean general circulation model using a multi-scale incremental three-dimensional variational data assimilation scheme, yielding a daily high-resolution reanalysis dataset of the SCS. Comparisons between the reanalysis and independent observations support the reliability of the dataset. The presented dataset provides the research community of the SCS an important data source for studying the thermodynamic processes of the ocean circulation and meso-scale features in the SCS, including their spatial and temporal variability. PMID:25977803

  4. A dataset of forest biomass structure for Eurasia.

    Science.gov (United States)

    Schepaschenko, Dmitry; Shvidenko, Anatoly; Usoltsev, Vladimir; Lakyda, Petro; Luo, Yunjian; Vasylyshyn, Roman; Lakyda, Ivan; Myklush, Yuriy; See, Linda; McCallum, Ian; Fritz, Steffen; Kraxner, Florian; Obersteiner, Michael

    2017-05-16

    The most comprehensive dataset of in situ destructive sampling measurements of forest biomass in Eurasia has been compiled from a combination of experiments undertaken by the authors and from scientific publications. Biomass is reported as four components: live trees (stem, bark, branches, foliage, roots); understory (above- and below ground); green forest floor (above- and below ground); and coarse woody debris (snags, logs, dead branches of living trees and dead roots), consisting of 10,351 unique records of sample plots and 9,613 sample trees from ca 1,200 experiments for the period 1930-2014 where there is overlap between these two datasets. The dataset also contains other forest stand parameters such as tree species composition, average age, tree height, growing stock volume, etc., when available. Such a dataset can be used for the development of models of biomass structure, biomass extension factors, change detection in biomass structure, investigations into biodiversity and species distribution and the biodiversity-productivity relationship, as well as the assessment of the carbon pool and its dynamics, among many others.

  5. A Dataset for Visual Navigation with Neuromorphic Methods

    Directory of Open Access Journals (Sweden)

    Francisco Barranco

    2016-02-01

    Standardized benchmarks in Computer Vision have greatly contributed to the advance of approaches to many problems in the field. If we want to enhance the visibility of event-driven vision and increase its impact, we will need benchmarks that allow comparison among different neuromorphic methods as well as comparison to conventional Computer Vision approaches. We present datasets to evaluate the accuracy of frame-free and frame-based approaches for tasks of visual navigation. Similar to conventional Computer Vision datasets, we provide synthetic and real scenes, with the synthetic data created with graphics packages, and the real data recorded using a mobile robotic platform carrying a dynamic and active pixel vision sensor (DAVIS) and an RGB+Depth sensor. For both datasets the cameras move with a rigid motion in a static scene, and the data includes the images, events, optic flow, 3D camera motion, and the depth of the scene, along with calibration procedures. Finally, we also provide simulated event data generated synthetically from well-known frame-based optical flow datasets.

  6. Sparse Group Penalized Integrative Analysis of Multiple Cancer Prognosis Datasets

    Science.gov (United States)

    Liu, Jin; Huang, Jian; Xie, Yang; Ma, Shuangge

    2014-01-01

    SUMMARY In cancer research, high-throughput profiling studies have been extensively conducted, searching for markers associated with prognosis. Because of the “large d, small n” characteristic, results generated from the analysis of a single dataset can be unsatisfactory. Recent studies have shown that integrative analysis, which simultaneously analyzes multiple datasets, can be more effective than single-dataset analysis and classic meta-analysis. In most existing integrative analyses, the homogeneity model has been assumed, which postulates that different datasets share the same set of markers. Several approaches have been designed to reinforce this assumption. In practice, different datasets may differ in terms of patient selection criteria, profiling techniques, and many other aspects. Such differences may make the homogeneity model too restrictive. In this study, we assume the heterogeneity model, under which different datasets are allowed to have different sets of markers. With multiple cancer prognosis datasets, we adopt the AFT (accelerated failure time) model to describe survival. This model may have the lowest computational cost among popular semiparametric survival models. For marker selection, we adopt a sparse group MCP (minimax concave penalty) approach. This approach has an intuitive formulation and can be computed using an effective group coordinate descent algorithm. A simulation study shows that it outperforms the existing approaches under both the homogeneity and heterogeneity models. Data analysis further demonstrates the merit of the heterogeneity model and the proposed approach. PMID:23938111

  7. Genetic Diversity of Tick-Borne Rickettsial Pathogens; Insights Gained from Distant Strains

    Directory of Open Access Journals (Sweden)

    Sebastián Aguilar Pierlé

    2014-01-01

    The ability to capture genetic variation with unprecedented resolution improves our understanding of bacterial populations and their ability to cause disease. The goal of the pathogenomics era is to define genetic diversity that results in disease. Despite the economic losses caused by vector-borne bacteria in the Order Rickettsiales, little is known about the genetic variants responsible for observed phenotypes. The tick-transmitted rickettsial pathogen Anaplasma marginale infects cattle in tropical and subtropical regions worldwide, including Australia. Genomic analysis of North American A. marginale strains reveals a closed core genome defined by high levels of Single Nucleotide Polymorphisms (SNPs). Here we report the first genome sequences and comparative analysis for Australian strains that differ in virulence and transmissibility. A list of genetic differences that segregate with phenotype was evaluated for the ability to distinguish the attenuated strain from virulent field strains. Phylogenetic analyses of the Australian strains revealed a marked evolutionary distance from all previously sequenced strains. SNP analysis showed a strikingly reduced genetic diversity between these strains, with the smallest number of SNPs detected between any two A. marginale strains. The low diversity between these phenotypically distinct bacteria presents a unique opportunity to identify the genetic determinants of virulence and transmission.

  8. Multilocus Microsatellite Typing reveals intra-focal genetic diversity among strains of Leishmania tropica in Chichaoua Province, Morocco.

    Science.gov (United States)

    Krayter, Lena; Alam, Mohammad Zahangir; Rhajaoui, Mohamed; Schnur, Lionel F; Schönian, Gabriele

    2014-12-01

    In Morocco, cutaneous leishmaniasis (CL) caused by Leishmania (L.) tropica is a major public health threat. Strains of this species have been shown to display considerable serological, biochemical, molecular biological and genetic heterogeneity, and Multilocus Enzyme Electrophoresis (MLEE) has shown that in many countries, including Morocco, heterogenic variants of L. tropica can co-exist in single geographical foci. Here, the microsatellite profiles discerned by MLMT of nine Moroccan strains of L. tropica isolated in 2000 from human cases of CL from Chichaoua Province were compared to those of nine Moroccan strains of L. tropica isolated between 1988 and 1990 from human cases of CL from Marrakech Province, and also to those of 147 strains of L. tropica isolated at different times from different worldwide geographical locations within the range of distribution of the species. Several programs, each employing a different algorithm, were used for population genetic analysis. The strains from each of the two Moroccan foci separated into two phylogenetic clusters independent of their geographical origin. Genetic diversity and heterogeneity existed in both foci, which are geographically close to each other. This intra-focal distribution of genetic variants of L. tropica is not considered to be due to in situ mutation. Rather, it is proposed to be explained by the importation of pre-existing variants of L. tropica into Morocco. Copyright © 2014 The Authors. Published by Elsevier B.V. All rights reserved.

  9. Draft genome sequence of Thermincola potens strain JR

    Energy Technology Data Exchange (ETDEWEB)

    Byrne-Bailey, K.G.; Wrighton, K.C.; Melnyk, R.A.; Agbo, P.; Hazen, T.C.; Coates, J.D.

    2010-07-01

    'Thermincola potens' strain JR is one of the first Gram-positive dissimilatory metal-reducing bacteria (DMRB) for which there is a complete genome sequence. Consistent with the physiology of this organism, preliminary annotation revealed an abundance of multiheme c-type cytochromes that are putatively associated with the periplasm and cell surface in a Gram-positive bacterium. Here we report the complete genome sequence of strain JR.

  10. Description of a taxonomically undefined Sclerotiniaceae strain from withered rotten-grapes.

    Science.gov (United States)

    Lorenzini, Marilinda; Zapparoli, Giacomo

    2016-02-01

    A necrotrophic member of the Sclerotiniaceae family (herewith named strain C10) isolated from withered rotten-grapes, is described. Interestingly, the fungus has no defined taxonomic position since it has been impossible to attribute it to an existing genus. Phylogenetic analysis of partial sequences of glyceraldehyde 3-phosphate dehydrogenase (G3PDH), heat shock protein 60 (HSP60) and DNA-directed RNA polymerase II subunit (RPB2), revealed that strain C10 is distantly related to Amphobotrys and Botrytis. This evidence clearly distinguishes this new Sclerotiniaceae member from other taxa of the family. Moreover, its morphological characteristics did not match those of Amphobotrys and Botrytis. Infectivity assays demonstrated that strain C10 could be a potential postharvest pathogen of withered grapes. This study revealed the taxonomic importance of this strain suggesting the existence of a possible new genus, a theory that requires further investigation.

  11. Estimation of lattice strain in nanocrystalline RuO2 by Williamson-Hall and size-strain plot methods

    Science.gov (United States)

    Sivakami, R.; Dhanuskodi, S.; Karvembu, R.

    2016-01-01

    RuO2 nanoparticles (RuO2 NPs) have been successfully synthesized by the hydrothermal method. Structure and the particle size have been determined by X-ray diffraction (XRD), scanning electron microscopy (SEM), atomic force microscopy (AFM) and transmission electron microscopy (TEM). UV-Vis spectra reveal that the optical band gap of RuO2 nanoparticles is red shifted from 3.95 to 3.55 eV. BET measurements show a high specific surface area (SSA) of 118-133 m2/g and pore diameter (10-25 nm) has been estimated by Barret-Joyner-Halenda (BJH) method. The crystallite size and lattice strain in the samples have been investigated by Williamson-Hall (W-H) analysis assuming uniform deformation, deformation stress and deformation energy density, and the size-strain plot method. All other relevant physical parameters including stress, strain and energy density have been calculated. The average crystallite size and the lattice strain evaluated from XRD measurements are in good agreement with the results of TEM.
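
    The Williamson-Hall analysis mentioned in this record can be illustrated with a minimal sketch of its uniform deformation model, in which the peak breadth β satisfies β·cos θ = Kλ/D + 4ε·sin θ, so a straight-line fit of β·cos θ against 4·sin θ gives the lattice strain ε as the slope and the crystallite size D from the intercept. This is not the authors' code. The function names, the Cu Kα wavelength (0.15406 nm), the shape factor K = 0.9 and the peak positions are all assumptions chosen for illustration, and the breadths are synthetic rather than measured RuO2 data.

```python
import math

def williamson_hall(two_theta_deg, beta_rad, wavelength_nm=0.15406, k=0.9):
    """Fit beta*cos(theta) = k*lambda/D + 4*eps*sin(theta) by least squares.

    two_theta_deg: peak positions (2-theta, degrees).
    beta_rad: instrument-corrected peak breadths (radians).
    Returns (crystallite size D in nm, lattice strain eps).
    """
    thetas = [math.radians(t / 2.0) for t in two_theta_deg]
    xs = [4.0 * math.sin(th) for th in thetas]
    ys = [b * math.cos(th) for b, th in zip(beta_rad, thetas)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    strain = sxy / sxx                      # slope of the W-H line
    intercept = mean_y - strain * mean_x    # equals k*lambda/D
    size_nm = k * wavelength_nm / intercept
    return size_nm, strain

def synthetic_breadths(two_theta_deg, size_nm, strain,
                       wavelength_nm=0.15406, k=0.9):
    """Generate noise-free breadths (radians) from the same model, for testing."""
    out = []
    for t in two_theta_deg:
        th = math.radians(t / 2.0)
        out.append((k * wavelength_nm / size_nm
                    + 4.0 * strain * math.sin(th)) / math.cos(th))
    return out

# Hypothetical peak positions (degrees 2-theta); not taken from the record.
peaks = [28.0, 35.1, 40.0, 54.3, 58.0, 65.5]
breadths = synthetic_breadths(peaks, size_nm=20.0, strain=0.002)
size, eps = williamson_hall(peaks, breadths)
```

    With real diffraction data the points scatter around the line, and the quality of the fit indicates how well the uniform deformation assumption holds.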

  12. High-strain-induced deformation mechanisms in block-graft and multigraft copolymers

    KAUST Repository

    Schlegel, Ralf

    2011-12-13

    The molecular orientation behavior and structural changes of morphology at high strains for multigraft and block-graft copolymers based on polystyrene (PS) and polyisoprene (PI) were investigated during uniaxial monotonic loading via FT-IR and synchrotron SAXS. Results from FT-IR revealed specific orientations of PS and PI segments depending on molecular architecture and on the morphology, while structural investigations revealed a typical decrease in long-range order with increasing strain. This decrease was interpreted as strain-induced dissolution of the glassy blocks in the soft matrix, which is assumed to affect an additional enthalpic contribution (strain-induced mixing of polymer chains) and stronger retracting forces of the network chains during elongation. Our interpretation is supported by FT-IR measurements showing similar orientation of rubbery and glassy segments up to high strains. It also points to highly deformable PS domains. By synchrotron SAXS, we observed in the neo-Hookean region an approach of glassy domains, while at higher elongations the intensity of the primary reflection peak was significantly decreasing. The latter clearly verifies the assumption that the glassy chains are pulled out from the domains and are partly mixed in the PI matrix. Results obtained by applying models of rubber elasticity to stress-strain and hysteresis data revealed similar correlations between the softening behavior and molecular and morphological parameters. Further, an influence of the network modality was observed (random grafted branches). For sphere forming multigraft copolymers the domain functionality was found to be less important to achieve improved mechanical properties but rather size and distribution of the domains. © 2011 American Chemical Society.

  13. Biodiversity among Lactobacillus helveticus Strains Isolated from Different Natural Whey Starter Cultures as Revealed by Classification Trees

    Science.gov (United States)

    Gatti, Monica; Trivisano, Carlo; Fabrizi, Enrico; Neviani, Erasmo; Gardini, Fausto

    2004-01-01

    Lactobacillus helveticus is a homofermentative thermophilic lactic acid bacterium used extensively for manufacturing Swiss type and aged Italian cheese. In this study, the phenotypic and genotypic diversity of strains isolated from different natural dairy starter cultures used for Grana Padano, Parmigiano Reggiano, and Provolone cheeses was investigated by a classification tree technique. A data set was used that consists of 119 L. helveticus strains, each of which was studied for its physiological characters, as well as surface protein profiles and hybridization with a species-specific DNA probe. The methodology employed in this work allowed the strains to be grouped into terminal nodes without difficult and subjective interpretation. In particular, good discrimination was obtained between L. helveticus strains isolated, respectively, from Grana Padano and from Provolone natural whey starter cultures. The method used in this work allowed identification of the main characteristics that permit discrimination of biotypes. In order to understand what kind of genes could code for phenotypes of technological relevance, evidence that specific DNA sequences are present only in particular biotypes may be of great interest. PMID:14711641

  14. Genes but not genomes reveal bacterial domestication of Lactococcus lactis.

    Directory of Open Access Journals (Sweden)

    Delphine Passerini

    BACKGROUND: The population structure and diversity of Lactococcus lactis subsp. lactis, a major industrial bacterium involved in milk fermentation, were determined at both gene and genome level. Seventy-six lactococcal isolates of various origins were studied by different genotyping methods and thirty-six strains displaying unique macrorestriction fingerprints were analyzed by a new multilocus sequence typing (MLST) scheme. This gene-based analysis was compared to genomic characteristics determined by pulsed-field gel electrophoresis (PFGE). METHODOLOGY/PRINCIPAL FINDINGS: The MLST analysis revealed that L. lactis subsp. lactis is essentially clonal with infrequent intra- and intergenic recombination; also, despite its taxonomical classification as a subspecies, it displays a genetic diversity as substantial as that within several other bacterial species. Genome-based analysis revealed a genome size variability of 20%, a value typical of bacteria inhabiting different ecological niches, and that suggests a large pan-genome for this subspecies. However, the genomic characteristics (macrorestriction pattern, genome or chromosome size, plasmid content) did not correlate to the MLST-based phylogeny, with strains from the same sequence type (ST) differing by up to 230 kb in genome size. CONCLUSION/SIGNIFICANCE: The gene-based phylogeny was not fully consistent with the traditional classification into dairy and non-dairy strains but supported a new classification based on ecological separation between "environmental" strains, the main contributors to the genetic diversity within the subspecies, and "domesticated" strains, subject to recent genetic bottlenecks. Comparison between gene- and genome-based analyses revealed little relationship between core and dispensable genome phylogenies, indicating that clonal diversification and phenotypic variability of the "domesticated" strains essentially arose through substantial genomic flux within the dispensable

  15. Feature tracking CMR reveals abnormal strain in preclinical arrhythmogenic right ventricular dysplasia/ cardiomyopathy: a multisoftware feasibility and clinical implementation study.

    Science.gov (United States)

    Bourfiss, Mimount; Vigneault, Davis M; Aliyari Ghasebeh, Mounes; Murray, Brittney; James, Cynthia A; Tichnell, Crystal; Mohamed Hoesein, Firdaus A; Zimmerman, Stefan L; Kamel, Ihab R; Calkins, Hugh; Tandri, Harikrishna; Velthuis, Birgitta K; Bluemke, David A; Te Riele, Anneline S J M

    2017-09-01

    Regional right ventricular (RV) dysfunction is the hallmark of Arrhythmogenic Right Ventricular Dysplasia/Cardiomyopathy (ARVD/C), but is currently only qualitatively evaluated in the clinical setting. Feature Tracking Cardiovascular Magnetic Resonance (FT-CMR) is a novel quantitative method that uses cine CMR to calculate strain values. However, most prior FT-CMR studies in ARVD/C have focused on global RV strain using different software methods, complicating implementation of FT-CMR in clinical practice. We aimed to assess the clinical value of global and regional strain using FT-CMR in ARVD/C and to determine differences between commercially available FT-CMR software packages. We analyzed cine CMR images of 110 subjects (39 overt ARVD/C [mutation+/phenotype+], 40 preclinical ARVD/C [mutation+/phenotype-] and 31 control) for global and regional (subtricuspid, anterior, apical) RV strain in the horizontal longitudinal axis using four FT-CMR software methods (Multimodality Tissue Tracking, TomTec, Medis and Circle Cardiovascular Imaging). Intersoftware agreement was assessed using Bland Altman plots. For global strain, all methods showed reduced strain in overt ARVD/C patients compared to control subjects (p  0.275). For regional strain, overt ARVD/C patients showed reduced strain compared to control subjects in all segments which reached statistical significance in the subtricuspid region for all software methods (p < 0.037), in the anterior wall for two methods (p < 0.005) and in the apex for one method (p = 0.012). Preclinical subjects showed abnormal subtricuspid strain compared to control subjects using one of the software methods (p = 0.009). Agreement between software methods for absolute strain values was low (Intraclass Correlation Coefficient = 0.373). Despite large intersoftware variability of FT-CMR derived strain values, all four software methods distinguished overt ARVD/C patients from control subjects by both global and subtricuspid
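
    The Bland-Altman agreement analysis named in this record can be sketched as follows: for paired measurements from two methods, compute the mean difference (bias) and the 95% limits of agreement, bias ± 1.96 × the standard deviation of the differences. This is a generic illustration, not the study's analysis. The function name and the strain values are hypothetical and do not come from the paper.

```python
import statistics

def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of the pairwise differences a - b; the limits of
    agreement are bias -/+ 1.96 sample standard deviations of the differences.
    """
    if len(method_a) != len(method_b):
        raise ValueError("measurements must be paired")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd

# Hypothetical global RV strain values (%) from two software packages.
software_a = [-20.0, -18.0, -25.0, -22.0]
software_b = [-21.0, -20.0, -24.0, -25.0]
bias, lower, upper = bland_altman(software_a, software_b)
```

    Wide limits of agreement relative to the clinically relevant range would be consistent with the low intersoftware agreement the abstract reports.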

  16. Whole exome sequencing of wild-derived inbred strains of mice improves power to link phenotype and genotype.

    Science.gov (United States)

    Chang, Peter L; Kopania, Emily; Keeble, Sara; Sarver, Brice A J; Larson, Erica; Orth, Annie; Belkhir, Khalid; Boursot, Pierre; Bonhomme, François; Good, Jeffrey M; Dean, Matthew D

    2017-10-01

    The house mouse is a powerful model to dissect the genetic basis of phenotypic variation, and serves as a model to study human diseases. Despite a wealth of discoveries, most classical laboratory strains have captured only a small fraction of genetic variation known to segregate in their wild progenitors, and existing strains are often related to each other in complex ways. Inbred strains of mice independently derived from natural populations have the potential to increase power in genetic studies with the addition of novel genetic variation. Here, we perform exome-enrichment and high-throughput sequencing (~8× coverage) of 26 wild-derived strains known in the mouse research community as the "Montpellier strains." We identified 1.46 million SNPs in our dataset, approximately 19% of which have not been detected from other inbred strains. This novel genetic variation is expected to contribute to phenotypic variation, as they include 18,496 nonsynonymous variants and 262 early stop codons. Simulations demonstrate that the higher density of genetic variation in the Montpellier strains provides increased power for quantitative genetic studies. Inasmuch as the power to connect genotype to phenotype depends on genetic variation, it is important to incorporate these additional genetic strains into future research programs.

  17. Molecular Characterization of Salmonella Typhimurium Highly Successful Outbreak Strains

    DEFF Research Database (Denmark)

    Petersen, Randi Føns; Litrup, Eva; Larsson, Jonas T.

    2011-01-01

    we detected changes in three of five MLVA loci in a small fraction of isolates. These changes were mainly due to the gain or loss of single repeats. Optical Mapping of the large cluster strain indicated no increased content of virulence genes; however, Optical Mapping did reveal a large insert......, a probable prophage, in the main cluster. This probable prophage may give the cluster strain a competitive advantage. The molecular methods employed suggested that the four clusters represented four distinct strains, although they seemed to be epidemiologically linked and shared genotypic characteristics....

  18. DNA type analysis to differentiate strains of Xylophilus ampelinus from Europe and Hokkaido, Japan

    OpenAIRE

    Komatsu, Tsutomu; Shinmura, Akinori; Kondo, Norio

    2016-01-01

    Strains of the bacterium Xylophilus ampelinus were collected from Europe and Hokkaido, Japan. Genomic fingerprints generated from 43 strains revealed four DNA types (A-D) using the combined results of Rep-, ERIC-, and Box-PCR. Genetic variation was found among the strains examined; strains collected from Europe belonged to DNA types A or B, and strains collected from Hokkaido belonged to DNA types C or D. However, strains belonging to each DNA type showed the same pathogenicity to grapevines ...

  19. An Analysis on Better Testing than Training Performances on the Iris Dataset

    NARCIS (Netherlands)

    Schutten, Marten; Wiering, Marco

    2016-01-01

    The Iris dataset is a well known dataset containing information on three different types of Iris flowers. A typical and popular method for solving classification problems on datasets such as the Iris set is the support vector machine (SVM). In order to do so the dataset is separated in a set used

  20. A novel Zika virus mouse model reveals strain specific differences in virus pathogenesis and host inflammatory immune responses.

    Directory of Open Access Journals (Sweden)

    Shashank Tripathi

    2017-03-01

    Zika virus (ZIKV) is a mosquito-borne flavivirus, which was a neglected tropical pathogen until it emerged and spread across the Pacific Area and the Americas, causing large human outbreaks associated with fetal abnormalities and neurological disease in adults. The factors that contributed to the emergence, spread and change in pathogenesis of ZIKV are not understood. We previously reported that ZIKV evades cellular antiviral responses by targeting STAT2 for degradation in human cells. In this study, we demonstrate that Stat2-/- mice are highly susceptible to ZIKV infection, recapitulate virus spread to the central nervous system (CNS), gonads and other visceral organs, and display neurological symptoms. Further, we exploit this model to compare ZIKV pathogenesis caused by a panel of ZIKV strains of a range of spatiotemporal history of isolation and representing African and Asian lineages. We observed that African ZIKV strains induce short episodes of severe neurological symptoms followed by lethality. In comparison, Asian strains manifest prolonged signs of neuronal malfunctions, occasionally causing death of the Stat2-/- mice. African ZIKV strains induced higher levels of inflammatory cytokines and markers associated with cellular infiltration in the infected brain in mice, which may explain exacerbated pathogenesis in comparison to those of the Asian lineage. Interestingly, viral RNA levels in different organs did not correlate with the pathogenicity of the different strains. Taken together, we have established a new murine model that supports ZIKV infection and demonstrate its utility in highlighting intrinsic differences in the inflammatory response induced by different ZIKV strains leading to severity of disease. This study paves the way for the future interrogation of strain-specific changes in the ZIKV genome and their contribution to viral pathogenesis.

  1. Proglacial river stage, discharge, and temperature datasets from the Akuliarusiarsuup Kuua River northern tributary, Southwest Greenland, 2008–2011

    Directory of Open Access Journals (Sweden)

    A. K. Rennermalm

    2012-05-01

    Pressing scientific questions concerning the Greenland ice sheet's climatic sensitivity, hydrology, and contributions to current and future sea level rise require hydrological datasets to resolve. While direct observations of ice sheet meltwater losses can be obtained in terrestrial rivers draining the ice sheet and from lake levels, few such datasets exist. We present a new hydrologic dataset from previously unmonitored sites in the vicinity of Kangerlussuaq, Southwest Greenland. This dataset contains measurements of river stage and discharge for three sites along the Akuliarusiarsuup Kuua (Watson) River's northern tributary, with 30 min temporal resolution between June 2008 and July 2011. Additional data of water temperature, air pressure, and lake stage are also provided. Flow velocity and depth measurements were collected at sites with incised bedrock or structurally reinforced channels to maximize data quality. However, like most proglacial rivers, high turbulence and bedload transport introduce considerable uncertainty to the derived discharge estimates. Eleven propagating error sources were quantified, and reveal that the largest uncertainties are associated with flow depth observations. Mean discharge uncertainties (approximately the 68% confidence interval) are two to four times larger (±19% to ±43%) than previously published estimates for Greenland rivers. Despite these uncertainties, this dataset offers a rare collection of direct measurements of ice sheet runoff to the global ocean and is freely available for scientific use at http://dx.doi.org/10.1594/PANGAEA.762818.

  2. Biodegradation of furfural by Bacillus subtilis strain DS3.

    Science.gov (United States)

    Zheng, Dan; Bao, Jianguo; Lu, Jueming; Lv, Quanxi

    2015-07-01

    An aerobic bacterial strain DS3, capable of growing on furfural as sole carbon source, was isolated after enrichment from the activated sludge of a wastewater treatment plant in a diosgenin factory. Based on morphological and physiological tests as well as 16S rDNA sequence and Biolog analyses, it was identified as Bacillus subtilis. The study revealed that strain DS3 utilized furfural, as analyzed by high-performance liquid chromatography (HPLC). Under the following conditions: pH 8.0, temperature 35 °C, 150 rpm and 10% inoculum, strain DS3 showed 31.2% furfural degradation. Furthermore, strain DS3 was found to tolerate furfural concentrations as high as 6000 mg(-1). The ability of Bacillus subtilis strain DS3 to degrade furfural has been demonstrated for the first time in the present study.

  3. Analysis of Public Datasets for Wearable Fall Detection Systems.

    Science.gov (United States)

    Casilari, Eduardo; Santoyo-Ramón, José-Antonio; Cano-García, José-Manuel

    2017-06-27

    Due to the boom of wireless handheld devices such as smartwatches and smartphones, wearable Fall Detection Systems (FDSs) have become a major focus of attention among the research community during the last years. The effectiveness of a wearable FDS must be contrasted against a wide variety of measurements obtained from inertial sensors during the occurrence of falls and Activities of Daily Living (ADLs). In this regard, the access to public databases constitutes the basis for an open and systematic assessment of fall detection techniques. This paper reviews and appraises twelve existing available data repositories containing measurements of ADLs and emulated falls envisaged for the evaluation of fall detection algorithms in wearable FDSs. The analysis of the found datasets is performed in a comprehensive way, taking into account the multiple factors involved in the definition of the testbeds deployed for the generation of the mobility samples. The study of the traces brings to light the lack of a common experimental benchmarking procedure and, consequently, the large heterogeneity of the datasets from a number of perspectives (length and number of samples, typology of the emulated falls and ADLs, characteristics of the test subjects, features and positions of the sensors, etc.). Concerning this, the statistical analysis of the samples reveals the impact of the sensor range on the reliability of the traces. In addition, the study evidences the importance of the selection of the ADLs and the need of categorizing the ADLs depending on the intensity of the movements in order to evaluate the capability of a certain detection algorithm to discriminate falls from ADLs.

  4. Interactive visualization and analysis of multimodal datasets for surgical applications.

    Science.gov (United States)

    Kirmizibayrak, Can; Yim, Yeny; Wakid, Mike; Hahn, James

    2012-12-01

    Surgeons use information from multiple sources when making surgical decisions. These include volumetric datasets (such as CT, PET, MRI, and their variants), 2D datasets (such as endoscopic videos), and vector-valued datasets (such as computer simulations). Presenting all the information to the user in an effective manner is a challenging problem. In this paper, we present a visualization approach that displays the information from various sources in a single coherent view. The system allows the user to explore and manipulate volumetric datasets, display analysis of dataset values in local regions, combine 2D and 3D imaging modalities and display results of vector-based computer simulations. Several interaction methods are discussed: in addition to traditional interfaces including mouse and trackers, gesture-based natural interaction methods are shown to control these visualizations with real-time performance. An example of a medical application (medialization laryngoplasty) is presented to demonstrate how the combination of different modalities can be used in a surgical setting with our approach.

  5. Assessment of mechanical strain in the intact plantar fascia.

    Science.gov (United States)

    Clark, Ross A; Franklyn-Miller, Andrew; Falvey, Eanna; Bryant, Adam L; Bartold, Simon; McCrory, Paul

    2009-09-01

    A method of measuring tri-axial plantar fascia strain that is minimally affected by external compressive force has not previously been reported. The purpose of this study was to assess the use of micro-strain gauges to examine strain in the different axes of the plantar fascia. Two intact limbs from a thawed, fresh-frozen cadaver were dissected, and a combination of five linear and one three-way rosette gauges were attached to the fascia of the foot and ankle. Strain was assessed during two trials, both consisting of an identical controlled, loaded dorsiflexion. An ICC analysis of the results revealed that the majority of gauge placement sites produced reliable measures (ICC>0.75). Strain mapping of the plantar fascia indicates that the majority of the strain is centrally longitudinal, which provides supportive evidence for finite element model analysis. Although micro-strain gauges do possess the limitation of calibration difficulty, they provide a repeatable measure of fascial strain and may provide benefits in situations that require tri-axial assessment or external compression.

  6. Something From Nothing (There): Collecting Global IPv6 Datasets from DNS

    NARCIS (Netherlands)

    Fiebig, T.; Borgolte, Kevin; Hao, Shuang; Kruegel, Christopher; Vigna, Giovanny; Spring, Neil; Riley, George F.

    2017-01-01

    Current large-scale IPv6 studies mostly rely on non-public datasets, as most public datasets are domain specific. For instance, traceroute-based datasets are biased toward network equipment. In this paper, we present a new methodology to collect IPv6 address datasets that does not require access to

  7. Automatic processing of multimodal tomography datasets.

    Science.gov (United States)

    Parsons, Aaron D; Price, Stephen W T; Wadeson, Nicola; Basham, Mark; Beale, Andrew M; Ashton, Alun W; Mosselmans, J Frederick W; Quinn, Paul D

    2017-01-01

    With the development of fourth-generation high-brightness synchrotrons on the horizon, the already large volume of data that will be collected on imaging and mapping beamlines is set to increase by orders of magnitude. As such, an easy and accessible way of dealing with such large datasets as quickly as possible is required in order to be able to address the core scientific problems during the experimental data collection. Savu is an accessible and flexible big data processing framework that is able to deal with both the variety and the volume of data of multimodal and multidimensional scientific datasets output such as those from chemical tomography experiments on the I18 microfocus scanning beamline at Diamond Light Source.

  8. GUDM: Automatic Generation of Unified Datasets for Learning and Reasoning in Healthcare.

    Science.gov (United States)

    Ali, Rahman; Siddiqi, Muhammad Hameed; Idris, Muhammad; Ali, Taqdir; Hussain, Shujaat; Huh, Eui-Nam; Kang, Byeong Ho; Lee, Sungyoung

    2015-07-02

    A wide array of biomedical data are generated and made available to healthcare experts. However, due to the diverse nature of data, it is difficult to predict outcomes from it. It is therefore necessary to combine these diverse data sources into a single unified dataset. This paper proposes a global unified data model (GUDM) to provide a global unified data structure for all data sources and generate a unified dataset by a "data modeler" tool. The proposed tool implements a user-centric, priority-based approach that can easily resolve the problems of unified data modeling and overlapping attributes across multiple datasets. The tool is illustrated using sample diabetes mellitus data. The diverse data sources used to generate the unified dataset for diabetes mellitus include clinical trial information, a social media interaction dataset and physical activity data collected using different sensors. To realize the significance of the unified dataset, we adopted a well-known rough set theory based rule creation process to create rules from the unified dataset. The evaluation of the tool on six different sets of locally created diverse datasets shows that the tool reduces the time and effort that experts and knowledge engineers spend creating unified datasets by 94.1% on average.

  9. A Research Graph dataset for connecting research data repositories using RD-Switchboard.

    Science.gov (United States)

    Aryani, Amir; Poblet, Marta; Unsworth, Kathryn; Wang, Jingbo; Evans, Ben; Devaraju, Anusuriya; Hausstein, Brigitte; Klas, Claus-Peter; Zapilko, Benjamin; Kaplun, Samuele

    2018-05-29

    This paper describes the open access graph dataset that shows the connections between Dryad, CERN, ANDS and other international data repositories to publications and grants across multiple research data infrastructures. The graph dataset was created using the Research Graph data model and the Research Data Switchboard (RD-Switchboard), a collaborative project by the Research Data Alliance DDRI Working Group (DDRI WG) with the aim to discover and connect the related research datasets based on publication co-authorship or jointly funded grants. The graph dataset allows researchers to trace and follow the paths to understanding a body of work. By mapping the links between research datasets and related resources, the graph dataset improves both their discovery and visibility, while avoiding duplicate efforts in data creation. Ultimately, the linked datasets may spur novel ideas, facilitate reproducibility and re-use in new applications, stimulate combinatorial creativity, and foster collaborations across institutions.

  10. Process mining in oncology using the MIMIC-III dataset

    Science.gov (United States)

    Prima Kurniati, Angelina; Hall, Geoff; Hogg, David; Johnson, Owen

    2018-03-01

    Process mining is a data analytics approach to discover and analyse process models based on the real activities captured in information systems. There is a growing body of literature on process mining in healthcare, including oncology, the study of cancer. In earlier work we found 37 peer-reviewed papers describing process mining research in oncology, with a regular complaint being the limited availability and accessibility of datasets with suitable information for process mining. Publicly available datasets are one option and this paper describes the potential to use MIMIC-III for process mining in oncology. MIMIC-III is a large open access dataset of de-identified patient records. There are 134 publications listed as using the MIMIC dataset, but none of them have used process mining. The MIMIC-III dataset has 16 event tables which are potentially useful for process mining and this paper demonstrates the opportunities to use MIMIC-III for process mining in oncology. Our research applied the L* lifecycle method to provide a worked example showing how process mining can be used to analyse cancer pathways. The results and data quality limitations are discussed along with opportunities for further work and reflection on the value of MIMIC-III for reproducible process mining research.
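
    As a rough illustration of discovering a process model from recorded activities, the sketch below counts directly-follows relations in an event log. This is a generic first step in process discovery, not the L* lifecycle method named in the abstract; the event tuples, activity names and timestamps are hypothetical and are not taken from MIMIC-III.

```python
from collections import Counter, defaultdict


def directly_follows(events):
    """Count how often one activity directly follows another within a case.

    events: iterable of (case_id, activity, timestamp) tuples. Timestamps
    are assumed to be ISO-8601 strings, which sort chronologically.
    """
    traces = defaultdict(list)
    # Order events by case and then by time to rebuild each trace.
    for case_id, activity, _timestamp in sorted(events, key=lambda e: (e[0], e[2])):
        traces[case_id].append(activity)

    counts = Counter()
    for activities in traces.values():
        # Pair each activity with its immediate successor.
        counts.update(zip(activities, activities[1:]))
    return counts


# Hypothetical event log with two patient cases.
log = [
    ("p1", "Lab test", "2020-01-01T09:00"),
    ("p1", "Admission", "2020-01-01T08:00"),
    ("p1", "Discharge", "2020-01-02T10:00"),
    ("p2", "Admission", "2020-01-03T08:00"),
    ("p2", "Discharge", "2020-01-03T18:00"),
]
dfg = directly_follows(log)
```

    The resulting counts can be read as weighted edges of a directly-follows graph, which many discovery algorithms take as input.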

  11. Cognitive assessment of mice strains heterozygous for cell-adhesion genes reveals strain-specific alterations in timing.

    Science.gov (United States)

    Gallistel, C R; Tucci, Valter; Nolan, Patrick M; Schachner, Melitta; Jakovcevski, Igor; Kheifets, Aaron; Barboza, Luendro

    2014-03-05

    We used a fully automated system for the behavioural measurement of physiologically meaningful properties of basic mechanisms of cognition to test two strains of heterozygous mutant mice, Bfc (batface) and L1, and their wild-type littermate controls. Both of the target genes are involved in the establishment and maintenance of synapses. We find that the Bfc heterozygotes show reduced precision in their representation of interval duration, whereas the L1 heterozygotes show increased precision. These effects are functionally specific, because many other measures made on the same mice are unaffected, namely: the accuracy of matching temporal investment ratios to income ratios in a matching protocol, the rate of instrumental and classical conditioning, the latency to initiate a cued instrumental response, the trials on task and the impulsivity in a switch paradigm, the accuracy with which mice adjust timed switches to changes in the temporal constraints, the days to acquisition, and mean onset time and onset variability in the circadian anticipation of food availability.

  12. Veterans Affairs Suicide Prevention Synthetic Dataset

    Data.gov (United States)

    Department of Veterans Affairs — The VA's Veteran Health Administration, in support of the Open Data Initiative, is providing the Veterans Affairs Suicide Prevention Synthetic Dataset (VASPSD). The...

  13. Characterization of CRISPR-Cas system in clinical Staphylococcus epidermidis strains revealed its potential association with bacterial infection sites

    DEFF Research Database (Denmark)

    Li, Qiuchun; Xie, Xiaolei; Yin, Kequan

    2016-01-01

    Staphylococcus epidermidis is considered a major cause of nosocomial infections, bringing an immense burden to healthcare systems. Virulent phages have been confirmed to be efficient in combating the pathogen, but the presence of CRISPR-Cas system, which is a bacterial immune system eliminating...... phages was reported in few S. epidermidis strains. In this study, the CRISPR-Cas system was detected in 12 from almost 300 published genomes in GenBank and by PCR of cas6 gene in 18 strains out of 130 clinical isolates obtained in Copenhagen. Four strains isolated in 1965-1966 harboured CRISPR elements...... spacers located in the CRISPR1 locus with homology to virulent phage 6ec DNA sequences, and 19 strains each carrying 2 or 3 different spacers recognizing this phage, implied that the CRISPR-Cas immunity could be abrogated by nucleotide mismatch between the spacer and its target phage sequence, while new...

  14. Revealing strategies of quorum sensing in Azospirillum brasilense strains Ab-V5 and Ab-V6.

    Science.gov (United States)

    Fukami, Josiane; Abrantes, Julia Laura Fernandes; Del Cerro, Pablo; Nogueira, Marco Antonio; Ollero, Francisco Javier; Megías, Manuel; Hungria, Mariangela

    2018-01-01

    Azospirillum brasilense is an important plant-growth promoting bacterium (PGPB) that requires several critical steps for root colonization, including biofilm and exopolysaccharide (EPS) synthesis and cell motility. In several bacteria these mechanisms are mediated by quorum sensing (QS) systems that regulate the expression of specific genes mediated by the autoinducers N-acyl-homoserine lactones (AHLs). We investigated QS mechanisms in strains Ab-V5 and Ab-V6 of A. brasilense, which are broadly used in commercial inoculants in Brazil. Neither of these strains carries a luxI gene, but there are several luxR solos that might perceive AHL molecules. By adding external AHLs we verified that biofilm and EPS production and cell motility (swimming and swarming) were regulated via QS in Ab-V5, but not in Ab-V6. Differences were observed not only between strains, but also in the specificity of LuxR-type receptors to AHL molecules. However, Ab-V6 was outstanding in indole acetic acid (IAA) synthesis and this molecule might mimic AHL signals. We also applied the quorum quenching (QQ) strategy, obtaining transconjugants of Ab-V5 and Ab-V6 carrying a plasmid with acyl-homoserine lactonase. When maize (Zea mays L.) was inoculated with the wild-type and transconjugant strains, plant growth was decreased with the transconjugant of Ab-V5, confirming the importance of an AHL-mediated QS system, whereas plant growth promotion by Ab-V6 was not affected.

  15. Clostridium botulinum strains producing BoNT/F4 or BoNT/F5.

    Science.gov (United States)

    Raphael, Brian H; Bradshaw, Marite; Kalb, Suzanne R; Joseph, Lavin A; Lúquez, Carolina; Barr, John R; Johnson, Eric A; Maslanka, Susan E

    2014-05-01

    Botulinum neurotoxin type F (BoNT/F) may be produced by Clostridium botulinum alone or in combination with another toxin type such as BoNT/A or BoNT/B. Type F neurotoxin gene sequences have been further classified into seven toxin subtypes. Recently, the genome sequence of one strain of C. botulinum (Af84) was shown to contain three neurotoxin genes (bont/F4, bont/F5, and bont/A2). In this study, eight strains containing bont/F4 and seven strains containing bont/F5 were examined. Culture supernatants produced by these strains were incubated with BoNT/F-specific peptide substrates. Cleavage products of these peptides were subjected to mass spectral analysis, allowing detection of the BoNT/F subtypes present in the culture supernatants. PCR analysis demonstrated that a plasmid-specific marker (PL-6) was observed only among strains containing bont/F5. Among these strains, Southern hybridization revealed the presence of an approximately 242-kb plasmid harboring bont/F5. Genome sequencing of four of these strains revealed that the genomic backgrounds of strains harboring either bont/F4 or bont/F5 are diverse. None of the strains analyzed in this study were shown to produce BoNT/F4 and BoNT/F5 simultaneously, suggesting that strain Af84 is unusual. Finally, these data support a role for the mobility of a bont/F5-carrying plasmid among strains of diverse genomic backgrounds.

  16. SAR image classification based on CNN in real and simulation datasets

    Science.gov (United States)

    Peng, Lijiang; Liu, Ming; Liu, Xiaohua; Dong, Liquan; Hui, Mei; Zhao, Yuejin

    2018-04-01

    Convolutional neural networks (CNNs) have achieved great success in image classification tasks. Even in the field of synthetic aperture radar automatic target recognition (SAR-ATR), state-of-the-art results have been obtained by learning deep feature representations on the MSTAR benchmark. However, the raw MSTAR data have shortcomings for training a SAR-ATR model because the backgrounds of the SAR images within each class are highly similar. This indicates that a CNN would learn feature hierarchies of the backgrounds as well as of the targets. To validate the influence of the background, additional SAR image datasets were built that contain simulated SAR images of 10 manufactured targets, such as tanks and fighter aircraft, with backgrounds sampled from the whole original MSTAR data. The simulation datasets comprise one dataset in which the backgrounds of each class of images correspond to one kind of background of MSTAR targets or clutter, and one dataset in which each image has a random background drawn from all MSTAR targets or clutter. In addition, mixed datasets combining MSTAR and the simulation datasets were built for use in the experiments. The CNN architecture proposed in this paper is trained on all of the datasets mentioned above. The experimental results show that the architecture achieves high performance on all datasets even when the image backgrounds are miscellaneous, which indicates that the architecture can learn a good representation of the targets despite drastic changes in the background.

  17. On sample size and different interpretations of snow stability datasets

    Science.gov (United States)

    Schirmer, M.; Mitterer, C.; Schweizer, J.

    2009-04-01

    Interpretations of snow stability variations need an assessment of the stability itself, independent of the scale investigated in the study. Studies on stability variations at a regional scale have often chosen stability tests such as the Rutschblock test or combinations of various tests in order to detect differences in aspect and elevation. The question arose: 'how capable are such stability interpretations in drawing conclusions?' There are at least three possible error sources: (i) the variance of the stability test itself; (ii) the stability variance at an underlying slope scale, and (iii) that the stability interpretation might not be directly related to the probability of skier triggering. Various stability interpretations have been proposed in the past that provide partly different results. We compared a subjective one based on expert knowledge with a more objective one based on a measure derived from comparing skier-triggered slopes vs. slopes that have been skied but not triggered. In this study, the uncertainties are discussed and their effects on regional scale stability variations will be quantified in a pragmatic way. An existing dataset with very large sample sizes was revisited. This dataset contained the variance of stability at a regional scale for several situations. The stability in this dataset was determined using the subjective interpretation scheme based on expert knowledge. The question to be answered was how many measurements were needed to obtain similar results (mainly stability differences in aspect or elevation) as with the complete dataset. The optimal sample size was obtained in several ways: (i) assuming a nominal data scale the sample size was determined with a given test, significance level and power, and by calculating the mean and standard deviation of the complete dataset. With this method it can also be determined if the complete dataset consists of an appropriate sample size. (ii) Smaller subsets were created with similar

  18. Influence of dynamic strain ageing on tensile strain energy of type 316L(N) austenitic stainless steel

    International Nuclear Information System (INIS)

    Isaac Samuel, B.; Choudhary, B.K.; Bhanu Sankara Rao, K.

    2010-01-01

    Tensile tests were conducted on type 316L(N) stainless steel over a wide temperature range of 300-1123 K employing strain rates ranging from 3.16 × 10⁻⁵ to 3.16 × 10⁻³ s⁻¹. The variation of strain energy in terms of modulus of resilience and modulus of toughness over the wide range of temperatures and strain rates was examined. The variation in modulus of resilience with temperature and strain rate did not show the signatures of dynamic strain ageing (DSA). However, the modulus of toughness exhibited a plateau at the intermediate temperatures of 523-1023 K. Further, the distribution of energy absorbed till necking and energy absorbed from necking till fracture were found to characterise the deformation and damage processes, respectively, and exhibited anomalous variations in the temperature range 523-823 K and 823-1023 K, respectively. In addition to the observed manifestations of DSA such as serrated load-elongation curve, peaks/plateaus in flow stress, ultimate tensile strength and work hardening rate, negative strain rate sensitivity and ductility minima, the observed anomalous variations in modulus of toughness at intermediate temperatures (523-1023 K) can be regarded as yet another key manifestation of DSA. At temperatures above 1023 K, a sharp decrease in the modulus of toughness and also in the strain energies up to necking and from necking to fracture observed, with increasing temperature and decreasing strain rate, reveal the onset of dynamic recovery leading to early cross slip and climb processes. (author)
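
    The strain energy measures discussed above are areas under the tensile stress-strain curve. The sketch below estimates them by trapezoidal integration, assuming the textbook definitions: modulus of resilience as the area up to yield and modulus of toughness as the area up to fracture, with necking taken at the peak engineering stress. The function names and data points are hypothetical and not from the study.

```python
def trapezoid_area(strain, stress):
    """Area under a piecewise-linear stress-strain curve (trapezoidal rule)."""
    return sum(
        0.5 * (stress[i] + stress[i + 1]) * (strain[i + 1] - strain[i])
        for i in range(len(strain) - 1)
    )


def strain_energy_measures(strain, stress, yield_index):
    """Return (resilience, toughness, energy_to_necking, energy_after_necking).

    With stress in MPa and dimensionless strain, energies are in MJ/m^3.
    Necking is assumed to start at the maximum engineering stress.
    """
    resilience = trapezoid_area(strain[: yield_index + 1], stress[: yield_index + 1])
    toughness = trapezoid_area(strain, stress)
    neck = stress.index(max(stress))
    to_necking = trapezoid_area(strain[: neck + 1], stress[: neck + 1])
    return resilience, toughness, to_necking, toughness - to_necking


# Hypothetical engineering stress-strain points: origin, yield, UTS, fracture.
strain = [0.0, 0.002, 0.10, 0.15]
stress = [0.0, 400.0, 500.0, 450.0]
resilience, toughness, to_neck, after_neck = strain_energy_measures(strain, stress, 1)
```

    A real test record would have many more points, and the yield point would normally come from an offset criterion rather than a fixed index.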

  19. Really big data: Processing and analysis of large datasets

    Science.gov (United States)

    Modern animal breeding datasets are large and getting larger, due in part to the recent availability of DNA data for many animals. Computational methods for efficiently storing and analyzing those data are under development. The amount of storage space required for such datasets is increasing rapidl...

  20. Effects of mean strain on the random cyclic stress-strain relations of 0Cr18Ni10Ti pipe steel

    International Nuclear Information System (INIS)

    Zhao Yongxiang; Yang Bing

    2005-01-01

    An experimental study is performed on the effects of mean strain on the random cyclic stress-strain relations of the new nuclear material, 0Cr18Ni10Ti pipe steel. To reduce the number of specimens required, an improved maximum likelihood fatigue test method is proposed for the present strain-controlled fatigue tests. Six straining ratios, -1, -0.52, -0.22, 0.029, 0.18, and 0.48, are applied to study the effects. Fatigue tests were carried out on a total of 104 specimens. The test results reveal that the material exhibits Masing behaviour and that the saturation hysteresis loops under the six ratios show complete relaxation of the mean stress. There is no effective method for describing the mean straining effects in this case. Zhao's previous random stress-strain relations are therefore applied to characterize the scattered test data under the six ratios on the basis of the Ramberg-Osgood equation. The effects of the ratios are then analyzed on the average stress amplitudes, the standard deviations of the stress amplitudes, and the stress amplitudes under different survival probabilities and confidences. The results reveal that the ratios have a relatively decreasing effect on the stress amplitudes under higher survival probabilities and confidences. The strongest effect appears at the ratio of 0.029, and the effect weakens as the ratio moves away from zero. In addition, it is indicated that assessing the effects in terms of average fatigue lives might lead to a wrong conclusion. The effects can be appropriately assessed in a probabilistic sense that takes into account the scatter of the test data and the sample size. (author)
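
    For orientation, the deterministic Ramberg-Osgood relation underlying the analysis above can be sketched as follows. The cyclic form used is strain amplitude = stress amplitude / E + (stress amplitude / K')^(1/n'), and the straining ratio is assumed to mean minimum strain divided by maximum strain. The material constants are hypothetical placeholders, not values for 0Cr18Ni10Ti, and the probabilistic treatment described in the abstract is not reproduced.

```python
def ramberg_osgood_strain(stress_amp, E, K, n):
    """Total strain amplitude: elastic part plus plastic part."""
    return stress_amp / E + (stress_amp / K) ** (1.0 / n)


def stress_from_strain(strain_amp, E, K, n, iterations=100):
    """Invert the relation by bisection (strain rises monotonically with stress)."""
    lo, hi = 0.0, 1.0
    while ramberg_osgood_strain(hi, E, K, n) < strain_amp:
        hi *= 2.0                      # expand the bracket until it contains the root
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if ramberg_osgood_strain(mid, E, K, n) < strain_amp:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def mean_strain(strain_amp, ratio):
    """Mean strain for a given amplitude and ratio R = min strain / max strain (R < 1)."""
    return strain_amp * (1.0 + ratio) / (1.0 - ratio)


# Hypothetical constants: E and K' in MPa, n' dimensionless.
E, K, n = 200000.0, 1000.0, 0.2
strain_at_200 = ramberg_osgood_strain(200.0, E, K, n)
stress_back = stress_from_strain(strain_at_200, E, K, n)
```

    At a ratio of -1 the mean strain is zero, and it grows as the ratio approaches 1 for a fixed amplitude.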

  1. Nannochloropsis genomes reveal evolution of microalgal oleaginous traits.

    Directory of Open Access Journals (Sweden)

    Dongmei Wang

    2014-01-01

    Oleaginous microalgae are promising feedstock for biofuels, yet the genetic diversity, origin and evolution of oleaginous traits remain largely unknown. Here we present a detailed phylogenomic analysis of five oleaginous Nannochloropsis species (a total of six strains) and one time-series transcriptome dataset for triacylglycerol (TAG) synthesis on one representative strain. Despite small genome sizes, high coding potential and relative paucity of mobile elements, the genomes feature small cores of ca. 2,700 protein-coding genes and a large pan-genome of >38,000 genes. The six genomes share key oleaginous traits, such as the enrichment of selected lipid biosynthesis genes and certain glycoside hydrolase genes that potentially shift carbon flux from chrysolaminaran to TAG synthesis. The eleven type II diacylglycerol acyltransferase genes (DGAT-2) in every strain, each expressed during TAG synthesis, likely originated from three ancient genomes, including the secondary endosymbiosis host and the engulfed green and red algae. Horizontal gene transfers were inferred in most lipid synthesis nodes with expanded gene doses and many glycoside hydrolase genes. Thus multiple genome pooling and horizontal genetic exchange, together with selective inheritance of lipid synthesis genes and species-specific gene loss, have led to the enormous genetic apparatus for oleaginousness and the wide genomic divergence among present-day Nannochloropsis. These findings have important implications in the screening and genetic engineering of microalgae for biofuels.

  2. A robust dataset-agnostic heart disease classifier from Phonocardiogram.

    Science.gov (United States)

    Banerjee, Rohan; Dutta Choudhury, Anirban; Deshpande, Parijat; Bhattacharya, Sakyajit; Pal, Arpan; Mandana, K M

    2017-07-01

    Automatic classification of normal and abnormal heart sounds is a popular area of research. However, building a robust algorithm unaffected by signal quality and patient demography is a challenge. In this paper we have analysed a wide list of Phonocardiogram (PCG) features in time and frequency domain along with morphological and statistical features to construct a robust and discriminative feature set for dataset-agnostic classification of normal and cardiac patients. The large and open access database, made available in Physionet 2016 challenge was used for feature selection, internal validation and creation of training models. A second dataset of 41 PCG segments, collected using our in-house smart phone based digital stethoscope from an Indian hospital was used for performance evaluation. Our proposed methodology yielded sensitivity and specificity scores of 0.76 and 0.75 respectively on the test dataset in classifying cardiovascular diseases. The methodology also outperformed three popular prior art approaches, when applied on the same dataset.
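
    The sensitivity and specificity scores reported above follow from a binary confusion matrix. A minimal sketch, with hypothetical labels in which 1 marks an abnormal (cardiac) recording and 0 a normal one:

```python
def sensitivity_specificity(y_true, y_pred, positive=1):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    # Guard against a class being absent from the ground truth.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity


# Hypothetical ground truth and predictions for eight recordings.
truth = [1, 1, 1, 1, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(truth, predicted)
```

    Reporting both values matters when classes are imbalanced, since overall accuracy alone can hide poor detection of the minority class.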

  3. A Comparative Analysis of Classification Algorithms on Diverse Datasets

    Directory of Open Access Journals (Sweden)

    M. Alghobiri

    2018-04-01

    Data mining involves the computational process to find patterns from large data sets. Classification, one of the main domains of data mining, involves known structure generalizing to apply to a new dataset and predict its class. There are various classification algorithms being used to classify various data sets. They are based on different methods such as probability, decision tree, neural network, nearest neighbor, boolean and fuzzy logic, kernel-based etc. In this paper, we apply three diverse classification algorithms on ten datasets. The datasets have been selected based on their size and/or number and nature of attributes. Results have been discussed using some performance evaluation measures like precision, accuracy, F-measure, Kappa statistics, mean absolute error, relative absolute error, ROC Area etc. Comparative analysis has been carried out using the performance evaluation measures of accuracy, precision, and F-measure. We specify features and limitations of the classification algorithms for the diverse nature datasets.
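
    Several of the evaluation measures listed above can be computed directly from true and predicted labels. The sketch below covers accuracy, precision, F-measure (taken here as F1) and Cohen's kappa for a binary problem; the labels are hypothetical, and the paper's own tooling and exact metric variants are not known from the abstract.

```python
from collections import Counter


def binary_metrics(y_true, y_pred, positive=1):
    """Return (accuracy, precision, recall, f1) for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return accuracy, precision, recall, f1


def cohen_kappa(y_true, y_pred):
    """Agreement beyond chance: (p_o - p_e) / (1 - p_e)."""
    n = len(y_true)
    p_o = sum(1 for t, p in zip(y_true, y_pred) if t == p) / n
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    # Chance agreement from the marginal label frequencies.
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    return (p_o - p_e) / (1 - p_e)


# Hypothetical labels for ten instances.
truth = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
accuracy, precision, recall, f1 = binary_metrics(truth, predicted)
kappa = cohen_kappa(truth, predicted)
```

    Kappa is lower than accuracy here because part of the observed agreement would be expected by chance given the label frequencies.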

  4. An assessment of differences in gridded precipitation datasets in complex terrain

    Science.gov (United States)

    Henn, Brian; Newman, Andrew J.; Livneh, Ben; Daly, Christopher; Lundquist, Jessica D.

    2018-01-01

    Hydrologic modeling and other geophysical applications are sensitive to precipitation forcing data quality, and there are known challenges in spatially distributing gauge-based precipitation over complex terrain. We conduct a comparison of six high-resolution, daily and monthly gridded precipitation datasets over the Western United States. We compare the long-term average spatial patterns, and interannual variability of water-year total precipitation, as well as multi-year trends in precipitation across the datasets. We find that the greatest absolute differences among datasets occur in high-elevation areas and in the maritime mountain ranges of the Western United States, while the greatest percent differences among datasets relative to annual total precipitation occur in arid and rain-shadowed areas. Differences between datasets in some high-elevation areas exceed 200 mm yr-1 on average, and relative differences range from 5 to 60% across the Western United States. In areas of high topographic relief, true uncertainties and biases are likely higher than the differences among the datasets; we present evidence of this based on streamflow observations. Precipitation trends in the datasets differ in magnitude and sign at smaller scales, and are sensitive to how temporal inhomogeneities in the underlying precipitation gauge data are handled.
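
    The contrast above between absolute and percent differences can be made concrete with a small sketch. It assumes the spread at a grid cell is the range across datasets, and that the percent difference is that range relative to the cross-dataset mean; the paper's exact definitions are not given in the abstract, and the precipitation values are hypothetical.

```python
def dataset_spread(values_mm):
    """Return (absolute range in mm/yr, range as a percent of the mean).

    values_mm: annual precipitation at one grid cell, one value per dataset.
    """
    abs_diff = max(values_mm) - min(values_mm)
    mean = sum(values_mm) / len(values_mm)
    return abs_diff, 100.0 * abs_diff / mean


# Hypothetical annual totals (mm/yr) from three gridded datasets.
wet_mountain_cell = [2000.0, 2200.0, 2100.0]
arid_cell = [100.0, 150.0, 125.0]

wet_abs, wet_pct = dataset_spread(wet_mountain_cell)
arid_abs, arid_pct = dataset_spread(arid_cell)
```

    In this made-up case the wet cell has the larger absolute spread while the arid cell has the larger percent spread, which is the pattern the abstract describes.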

  5. Strontium removal jar test dataset for all figures and tables.

    Data.gov (United States)

    U.S. Environmental Protection Agency — The datasets were used to generate data to demonstrate strontium removal under various water quality and treatment conditions. This dataset is associated with the...

  6. Development of a SPARK Training Dataset

    Energy Technology Data Exchange (ETDEWEB)

    Sayre, Amanda M. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Olson, Jarrod R. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

    2015-03-01

    In its first five years, the National Nuclear Security Administration’s (NNSA) Next Generation Safeguards Initiative (NGSI) sponsored more than 400 undergraduate, graduate, and post-doctoral students in internships and research positions (Wyse 2012). In the past seven years, the NGSI program has produced, and continues to produce, a large body of scientific, technical, and policy work in targeted core safeguards capabilities and human capital development activities. Not only does the NGSI program carry out activities across multiple disciplines, but also across all U.S. Department of Energy (DOE)/NNSA locations in the United States. However, products are not readily shared among disciplines and across locations, nor are they archived in a comprehensive library. Rather, knowledge of NGSI-produced literature is localized to the researchers, clients, and internal laboratory/facility publication systems such as the Electronic Records and Information Capture Architecture (ERICA) at the Pacific Northwest National Laboratory (PNNL). There is also no incorporated way of analyzing existing NGSI literature to determine whether the larger NGSI program is achieving its core safeguards capabilities and activities. A complete library of NGSI literature could prove beneficial to a cohesive, sustainable, and more economical NGSI program. The Safeguards Platform for Automated Retrieval of Knowledge (SPARK) has been developed to be a knowledge storage, retrieval, and analysis capability to capture safeguards knowledge to exist beyond the lifespan of NGSI. During the development process, it was necessary to build a SPARK training dataset (a corpus of documents) for initial entry into the system and for demonstration purposes. We manipulated these data to gain new information about the breadth of NGSI publications, and evaluated the science-policy interface at PNNL as a practical demonstration of SPARK’s intended analysis capability. The analysis demonstration sought to answer the

  7. Benchmarking of Typical Meteorological Year datasets dedicated to Concentrated-PV systems

    Science.gov (United States)

    Realpe, Ana Maria; Vernay, Christophe; Pitaval, Sébastien; Blanc, Philippe; Wald, Lucien; Lenoir, Camille

    2016-04-01

    Accurate analysis of meteorological and pyranometric data for long-term analysis is the basis of decision-making for banks and investors, regarding solar energy conversion systems. This has led to the development of methodologies for the generation of Typical Meteorological Years (TMY) datasets. The most used method for solar energy conversion systems was proposed in 1978 by the Sandia Laboratory (Hall et al., 1978) considering a specific weighted combination of different meteorological variables with notably global, diffuse horizontal and direct normal irradiances, air temperature, wind speed, relative humidity. In 2012, a new approach was proposed in the framework of the European project FP7 ENDORSE. It introduced the concept of "driver" that is defined by the user as an explicit function of the pyranometric and meteorological relevant variables to improve the representativeness of the TMY datasets with respect to the specific solar energy conversion system of interest. The present study aims at comparing and benchmarking different TMY datasets considering a specific Concentrated-PV (CPV) system as the solar energy conversion system of interest. Using long-term (15+ years) time-series of high quality meteorological and pyranometric ground measurements, three types of TMY datasets were generated by the following methods: the Sandia method, a simplified driver with DNI as the only representative variable and a more sophisticated driver. The latter takes into account the sensitivities of the CPV system with respect to the spectral distribution of the solar irradiance and wind speed. Different TMY datasets from the three methods have been generated considering different numbers of years in the historical dataset, ranging from 5 to 15 years. The comparisons and benchmarking of these TMY datasets are conducted considering the long-term time series of simulated CPV electric production as a reference. The results of this benchmarking clearly show that the Sandia method is not

  8. Strain hardening of aluminium alloy 3004 in the deep drawing and ironing processes

    International Nuclear Information System (INIS)

    Courbon, J.; Duval, J.L.

    1993-01-01

    The evolution of material hardening resulting from the canmaking operations on aluminium beverage cans has been investigated. Tensile tests in cup walls revealed that deep drawing induced softening in the hoop direction and hardening in the meridian direction. This anisotropy is retained in the ironing operation. Changes in strain path on a heavily cold-rolled material probably cause such a complex behaviour. To determine hardening laws for deep drawing, simple shear tests were thus performed because of the strain path similarity. They allowed hardening laws to be determined over larger strains than tension tests could reach and revealed a saturation of stress. Altogether they proved well suited to the understanding of deep drawing. (orig.)

  9. Evolution and Strain Variation in BCG

    KAUST Repository

    Abdallah, Abdallah

    2017-11-07

    BCG vaccines were derived by in vitro passage, during the years 1908–1921, at the Pasteur Institute of Lille. Following the distribution of stocks of BCG to vaccine production laboratories around the world, it was only a few decades before different BCG producers recognized that there were variants of BCG, likely due to different passaging conditions in the different laboratories. This ultimately led to the lyophilization of stable BCG products in the 1950s and 1960s, but not before considerable evolution of the different BCG strains had taken place. The application of contemporary research methodologies has now revealed genomic, transcriptomic and proteomic differences between BCG strains. These molecular differences in part account for phenotypic differences in vitro between BCG strains, such as their variable secretion of antigenic proteins. Yet, the relevance of BCG variability for immunization policy remains elusive. In this chapter we present an overview of what is known about BCG evolution and its resulting strain variability, and provide some speculation as to the potential relevance for a vaccine given to over 100 million newborns each year.

  10. Dynamic strain measurements in a sliding microstructured contact

    International Nuclear Information System (INIS)

    Bennewitz, Roland; David, Jonathan; Lannoy, Charles-Francois de; Drevniok, Benedict; Hubbard-Davis, Paris; Miura, Takashi; Trichtchenko, Olga

    2008-01-01

    A novel experiment is described which measures the tangential strain development across the contact between a PDMS (polydimethylsiloxane) block and a glass surface during the initial stages of sliding. The surface of the PDMS block has been microfabricated to take the form of a regular array of pyramidal tips at 20 μm separation. Tangential strain is measured by means of light scattering from the interface between the block and surface. Three phases are observed in all experiments: initial shear deformation of the whole PDMS block, a pre-sliding tangential compression of the tip array with stepwise increase of the compressive strain, and sliding in stick-slip movements as revealed by periodic variation of the strain. The stick-slip sliding between the regular tip array and the randomly rough counter surface always takes on the periodicity of the tip array. The fast slip can cause either a sudden increase or a sudden decrease in compressive strain

  11. An exponential scaling law for the strain dependence of the Nb3Sn critical current density

    International Nuclear Information System (INIS)

    Bordini, B; Alknes, P; Bottura, L; Rossi, L; Valentinis, D

    2013-01-01

    The critical current density of the Nb3Sn superconductor is strongly dependent on the strain applied to the material. In order to investigate this dependence, it is a common practice to measure the critical current of Nb3Sn strands for different values of applied axial strain. In the literature, several models have been proposed to describe these experimental data in the reversible strain region. All these models are capable of fitting the measurement results in the strain region where data are collected, but tend to predict unphysical trends outside the range of data, and especially for large strain values. In this paper we present a model of a new strain function, together with the results obtained by applying the new scaling law on relevant datasets. The data analyzed consisted of the critical current measurements at 4.2 K that were carried out under applied axial strain at Durham University and the University of Geneva on different strand types. With respect to the previous models proposed, the new scaling function does not present problems at large strain values, has a lower number of fitting parameters (only two instead of three or four), and is very stable, so that, starting from few experimental points, it can estimate quite accurately the strand behavior in a strain region where there are no data. A relationship is shown between the proposed strain function and the elastic strain energy, and an analogy is drawn with the exponential form of the McMillan equation for the critical temperature. (paper)

  12. Critical strain region evaluation of self-assembled semiconductor quantum dots

    Energy Technology Data Exchange (ETDEWEB)

    Sales, D L [Departamento de Ciencia de los Materiales e I. M. y Q. I., Universidad de Cadiz, Puerto Real, Cadiz (Spain); Pizarro, J [Departamento de Lenguajes y Sistemas Informaticos, Universidad de Cadiz, Puerto Real, Cadiz (Spain); Galindo, P L [Departamento de Lenguajes y Sistemas Informaticos, Universidad de Cadiz, Puerto Real, Cadiz (Spain); Garcia, R [Departamento de Ciencia de los Materiales e I. M. y Q. I., Universidad de Cadiz, Puerto Real, Cadiz (Spain); Trevisi, G [CNR-IMEM Institute, Parco delle Scienze 37a, 43100, Parma (Italy); Frigeri, P [CNR-IMEM Institute, Parco delle Scienze 37a, 43100, Parma (Italy); Nasi, L [CNR-IMEM Institute, Parco delle Scienze 37a, 43100, Parma (Italy); Franchi, S [CNR-IMEM Institute, Parco delle Scienze 37a, 43100, Parma (Italy); Molina, S I [Departamento de Ciencia de los Materiales e I. M. y Q. I., Universidad de Cadiz, Puerto Real, Cadiz (Spain)

    2007-11-28

    A novel peak finding method to map the strain from high resolution transmission electron micrographs, known as the Peak Pairs method, has been applied to In(Ga)As/AlGaAs quantum dot (QD) samples, which present stacking faults emerging from the QD edges. Moreover, strain distribution has been simulated by the finite element method applying the elastic theory on a 3D QD model. The agreement existing between determined and simulated strain values reveals that these techniques are consistent enough to qualitatively characterize the strain distribution of nanostructured materials. The correct application of both methods allows the localization of critical strain zones in semiconductor QDs, predicting the nucleation of defects, and being a very useful tool for the design of semiconductor devices.

  13. SIAM 2007 Text Mining Competition dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — Subject Area: Text Mining Description: This is the dataset used for the SIAM 2007 Text Mining competition. This competition focused on developing text mining...

  14. Environmental Dataset Gateway (EDG) REST Interface

    Data.gov (United States)

    U.S. Environmental Protection Agency — Use the Environmental Dataset Gateway (EDG) to find and access EPA's environmental resources. Many options are available for easily reusing EDG content in other...

  15. Genomic Comparison among Lethal Invasive Strains of Streptococcus pyogenes Serotype M1

    Directory of Open Access Journals (Sweden)

    Gabriel R. Fernandes

    2017-10-01

    Streptococcus pyogenes, also known as group A Streptococcus (GAS), is a human pathogen that causes diverse human diseases including streptococcal toxic shock syndrome (STSS). A GAS outbreak occurred in Brasilia, Brazil, during the second half of the year 2011, causing 26 deaths. Whole genome sequencing was performed using the Illumina platform. The sequences were assembled and genes were predicted for comparative analysis with emm type 1 strains: MGAS5005 and M1 GAS. Genomic comparison revealed that one of the invasive strains differs from the other isolates and from the emm 1 reference genomes. Also, the new invasive strain showed differences in the content of virulence factors compared to the others isolated in the same outbreak. The evolution of contemporary GAS strains is strongly associated with horizontal gene transfer. This is the first genomic study of a streptococcal emm 1 outbreak in Brazil, and it revealed the rapid bacterial evolution leading to new clones. The emergence of new invasive strains can be a consequence of the injudicious use of antibiotics in Brazil during the past decades.

  16. Generation of lycopene-overproducing strains of the fungus Mucor circinelloides reveals important aspects of lycopene formation and accumulation.

    Science.gov (United States)

    Zhang, Yingtong; Chen, Haiqin; Navarro, Eusebio; López-García, Sergio; Chen, Yong Q; Zhang, Hao; Chen, Wei; Garre, Victoriano

    2017-03-01

    To generate lycopene-overproducing strains of the fungus Mucor circinelloides with interest for industrial production and to gain insight into the catalytic mechanism of lycopene cyclase and regulatory process during lycopene overaccumulation. Three lycopene-overproducing mutants were generated by classic mutagenesis techniques from a β-carotene-overproducing strain. They carried distinct mutations in the carRP gene encoding lycopene cyclase that produced loss of enzymatic activity to different extents. In one mutant (MU616), the lycopene cyclase was completely destroyed, and a 43.8% (1.1 mg/g dry mass) increase in lycopene production was observed in comparison to that by the previously existing lycopene overproducer. In addition, feedback regulation of the end product was suggested in lycopene-overproducing strains. A lycopene-overaccumulating strain of the fungus M. circinelloides was generated that could be an alternative for the industrial production of lycopene. Vital catalytic residues for lycopene cyclase activity and the potential mechanism of lycopene formation and accumulation were identified.

  17. Anchored enrichment dataset for true flies (order Diptera) reveals insights into the phylogeny of flower flies (family Syrphidae).

    Science.gov (United States)

    Young, Andrew Donovan; Lemmon, Alan R; Skevington, Jeffrey H; Mengual, Ximo; Ståhls, Gunilla; Reemer, Menno; Jordaens, Kurt; Kelso, Scott; Lemmon, Emily Moriarty; Hauser, Martin; De Meyer, Marc; Misof, Bernhard; Wiegmann, Brian M

    2016-06-29

    Anchored hybrid enrichment is a form of next-generation sequencing that uses oligonucleotide probes to target conserved regions of the genome flanked by less conserved regions in order to acquire data useful for phylogenetic inference from a broad range of taxa. Once a probe kit is developed, anchored hybrid enrichment is superior to traditional PCR-based Sanger sequencing in terms of both the amount of genomic data that can be recovered and effective cost. Due to their incredibly diverse nature, importance as pollinators, and historical instability with regard to subfamilial and tribal classification, Syrphidae (flower flies or hoverflies) are an ideal candidate for anchored hybrid enrichment-based phylogenetics, especially since recent molecular phylogenies of the syrphids using only a few markers have resulted in highly unresolved topologies. Over 6200 syrphids are currently known and uncovering their phylogeny will help us to understand how these species have diversified, providing insight into an array of ecological processes, from the development of adult mimicry, the origin of adult migration, to pollination patterns and the evolution of larval resource utilization. We present the first use of anchored hybrid enrichment in insect phylogenetics on a dataset containing 30 flower fly species from across all four subfamilies and 11 tribes out of 15. To produce a phylogenetic hypothesis, 559 loci were sampled to produce a final dataset containing 217,702 sites. We recovered a well resolved topology with bootstrap support values that were almost universally >95 %. The subfamily Eristalinae is recovered as paraphyletic, with the strongest support for this hypothesis to date. The ant predators in the Microdontinae are sister to all other syrphids. Syrphinae and Pipizinae are monophyletic and sister to each other. Larval predation on soft-bodied hemipterans evolved only once in this family. Anchored hybrid enrichment was successful in producing a robustly supported

  18. A New Heuristic Anonymization Technique for Privacy Preserved Datasets Publication on Cloud Computing

    Science.gov (United States)

    Aldeen Yousra, S.; Mazleena, Salleh

    2018-05-01

    Recent advances in Information and Communication Technologies (ICT) have increased the demand for cloud services that share users’ private data. Data from various organizations are a vital information source for analysis and research. Generally, this sensitive or private information involves medical, census, voter registration, social network, and customer service data. The primary concern of cloud service providers in data publishing is to hide the sensitive information of individuals. One of the cloud services that addresses confidentiality concerns is Privacy Preserving Data Mining (PPDM). The PPDM service in Cloud Computing (CC) enables data publishing with minimized distortion and absolute privacy. In this method, datasets are anonymized via generalization to accomplish the privacy requirements. However, the well-known privacy preserving data mining technique called K-anonymity suffers from several limitations. To surmount those shortcomings, I propose a new heuristic anonymization framework for preserving the privacy of sensitive datasets when publishing on cloud. The advantages of K-anonymity, L-diversity and (α, k)-anonymity methods for efficient information utilization and privacy protection are emphasized. Experimental results revealed that the developed technique outperforms the K-anonymity, L-diversity, and (α, k)-anonymity measures.
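
    The generalization step and the K-anonymity requirement mentioned in this abstract can be illustrated with a minimal sketch. It shows plain K-anonymity only, not the heuristic framework the authors propose; the record layout (age, ZIP code, diagnosis), the generalization rules and the function names are all assumptions made for illustration.

```python
from collections import Counter

def generalize(record):
    """Coarsen the quasi-identifiers of one (age, zip_code, diagnosis) record.

    Hypothetical rules: age becomes a ten-year band, ZIP keeps its first 3 digits.
    """
    age, zip_code, diagnosis = record
    low = (age // 10) * 10
    return (f"{low}-{low + 9}", zip_code[:3] + "**", diagnosis)

def is_k_anonymous(records, k, quasi_identifier_indexes=(0, 1)):
    """True if every combination of quasi-identifier values occurs at least k times."""
    groups = Counter(
        tuple(record[i] for i in quasi_identifier_indexes) for record in records
    )
    return all(count >= k for count in groups.values())

# Toy data, invented for this example.
raw = [
    (34, "47906", "flu"),
    (36, "47902", "asthma"),
    (52, "47301", "flu"),
    (58, "47305", "diabetes"),
]
generalized = [generalize(r) for r in raw]
```

    On this toy table the raw records are not 2-anonymous because every (age, ZIP) pair is unique, while the generalized records fall into two groups of two and therefore are.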

  19. Geoseq: a tool for dissecting deep-sequencing datasets

    Directory of Open Access Journals (Sweden)

    Homann Robert

    2010-10-01

    Background: Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), the Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (DDBJ). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Results: Geoseq (http://geoseq.mssm.edu) provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Conclusions: Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to (a) identify differential isoform expression in mRNA-seq datasets, (b) identify miRNAs (microRNAs) in libraries and identify mature and star sequences in miRNAs, and (c) identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.
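
    The tiling-and-coverage idea described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not Geoseq's implementation: the function names, the tile size and step, the "$" read separator and the naive suffix-array construction are all hypothetical choices suitable only for toy inputs.

```python
from bisect import bisect_left, bisect_right

def build_suffix_array(text):
    """Naive suffix array: suffix start positions sorted by suffix (toy inputs only)."""
    return sorted(range(len(text)), key=lambda i: text[i:])

def count_occurrences(text, suffix_array, pattern):
    """Count occurrences of pattern in text by binary search over the suffix array."""
    m = len(pattern)
    # Sorted suffixes stay sorted when truncated to the pattern length,
    # so all suffixes starting with the pattern form one contiguous run.
    prefixes = [text[i:i + m] for i in suffix_array]
    return bisect_right(prefixes, pattern) - bisect_left(prefixes, pattern)

def tile_coverage(reference, reads, tile_size=4, step=2):
    """Cut the reference into tiles and count how often each tile occurs in the reads."""
    # "$" is assumed absent from the sequences, so no tile matches across two reads.
    library = "$".join(reads)
    suffix_array = build_suffix_array(library)
    coverage = []
    for start in range(0, len(reference) - tile_size + 1, step):
        tile = reference[start:start + tile_size]
        coverage.append((start, tile, count_occurrences(library, suffix_array, tile)))
    return coverage

# Toy reference and read library, invented for this example.
result = tile_coverage("ACGTACGT", ["ACGTA", "CGTAC", "TTTT"])
```

    The per-tile counts are what a coverage plot along the reference would display; a production tool would build the suffix array once per library with a linear-time algorithm rather than sorting whole suffixes.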

  20. Estimation of lattice strain in nanocrystalline RuO2 by Williamson-Hall and size-strain plot methods.

    Science.gov (United States)

    Sivakami, R; Dhanuskodi, S; Karvembu, R

    2016-01-05

    RuO2 nanoparticles (RuO2 NPs) have been successfully synthesized by the hydrothermal method. Structure and the particle size have been determined by X-ray diffraction (XRD), scanning electron microscopy (SEM), atomic force microscopy (AFM) and transmission electron microscopy (TEM). UV-Vis spectra reveal that the optical band gap of RuO2 nanoparticles is red shifted from 3.95 to 3.55 eV. BET measurements show a high specific surface area (SSA) of 118-133 m²/g, and the pore diameter (10-25 nm) has been estimated by the Barrett-Joyner-Halenda (BJH) method. The crystallite size and lattice strain in the samples have been investigated by Williamson-Hall (W-H) analysis assuming uniform deformation, deformation stress and deformation energy density, and the size-strain plot method. All other relevant physical parameters including stress, strain and energy density have been calculated. The average crystallite size and the lattice strain evaluated from XRD measurements are in good agreement with the results of TEM. Copyright © 2015 Elsevier B.V. All rights reserved.
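
    The Williamson-Hall analysis named in this abstract can be sketched for the uniform deformation case, where the peak breadth β (in radians) is assumed to follow β·cos θ = Kλ/D + 4ε·sin θ. A straight-line fit of β·cos θ against 4·sin θ then gives the strain ε as the slope and the crystallite size D from the intercept. The default wavelength (Cu Kα, 0.15406 nm), the shape factor K = 0.9 and the function name are assumptions; the abstract does not state the values the authors used.

```python
import math

def williamson_hall(two_theta_deg, fwhm_deg, wavelength_nm=0.15406, shape_factor=0.9):
    """Fit beta*cos(theta) = K*lambda/D + 4*eps*sin(theta) by least squares.

    Returns (crystallite_size_nm, lattice_strain). Peak widths are assumed to be
    already corrected for instrumental broadening.
    """
    xs, ys = [], []
    for two_theta, fwhm in zip(two_theta_deg, fwhm_deg):
        theta = math.radians(two_theta / 2.0)
        beta = math.radians(fwhm)          # breadth must be in radians
        xs.append(4.0 * math.sin(theta))
        ys.append(beta * math.cos(theta))

    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

    slope = sxy / sxx                      # lattice strain (dimensionless)
    intercept = mean_y - slope * mean_x    # equals K*lambda/D
    size_nm = shape_factor * wavelength_nm / intercept
    return size_nm, slope
```

    The function takes peak positions (2θ, degrees) and full widths at half maximum (degrees) for several reflections. The deformation stress and energy density variants mentioned in the abstract change the abscissa of the fit and are not covered by this sketch.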

  1. A multi-dataset time-reversal approach to clinical trial placebo response and the relationship to natural variability in epilepsy.

    Science.gov (United States)

    Goldenholz, Daniel M; Strashny, Alex; Cook, Mark; Moss, Robert; Theodore, William H

    2017-12-01

    Clinical epilepsy drug trials have been measuring increasingly high placebo response rates, up to 40%. This study was designed to examine the relationship between the natural variability in epilepsy, and the placebo response seen in trials. We tested the hypothesis that 'reversing' trial direction, with the baseline period as the treatment observation phase, would reveal effects of natural variability. Clinical trial simulations were run with time running forward and in reverse. Data sources were: SeizureTracker.com (patient reported diaries), a randomized sham-controlled TMS trial, and chronically implanted intracranial EEG electrodes. Outcomes were 50%-responder rates (RR50) and median percentage change (MPC). The RR50 results showed evidence that temporal reversal does not prevent large responder rates across datasets. The MPC results were negative in the TMS dataset, and positive in the other two. Typical RR50s of clinical trials can be reproduced using the natural variability of epilepsy as a substrate across multiple datasets. Therefore, the placebo response in epilepsy clinical trials may be attributable almost entirely to this variability, rather than the "placebo effect". Published by Elsevier Ltd.
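
    The two outcome measures and the time-reversal idea in this abstract can be illustrated with a short sketch. The definitions used here are assumptions: percentage change is taken as the percentage reduction in seizure rate from the baseline period to the test period (positive means fewer seizures), and a 50% responder is a patient with a reduction of at least 50%. The function names and the toy rates are hypothetical.

```python
from statistics import median

def percent_reduction(baseline_rate, test_rate):
    """Percentage reduction in seizure rate relative to baseline (positive = improvement)."""
    return 100.0 * (baseline_rate - test_rate) / baseline_rate

def rr50_and_mpc(baseline_rates, test_rates):
    """Return (50%-responder rate in percent, median percentage change)."""
    reductions = [
        percent_reduction(b, t)
        for b, t in zip(baseline_rates, test_rates)
        if b > 0  # patients without baseline seizures cannot be scored
    ]
    responders = sum(r >= 50.0 for r in reductions)
    return 100.0 * responders / len(reductions), median(reductions)

# Toy monthly seizure rates for four patients, invented for this example.
first_period = [10, 8, 4, 6]
second_period = [4, 8, 5, 2]

forward = rr50_and_mpc(first_period, second_period)    # time running forward
reverse = rr50_and_mpc(second_period, first_period)    # periods swapped
```

    Swapping the two arguments is the sketch's stand-in for running the trial in reverse: the later period is treated as the baseline and the earlier one as the observation phase.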

  2. Integrative functional analyses using rainbow trout selected for tolerance to plant diets reveal nutrigenomic signatures for soy utilization without the concurrence of enteritis.

    Directory of Open Access Journals (Sweden)

    Jason Abernathy

    Finding suitable alternative protein sources for diets of carnivorous fish species remains a major concern for sustainable aquaculture. Through genetic selection, we created a strain of rainbow trout that outperforms parental lines in utilizing an all-plant protein diet and does not develop enteritis in the distal intestine, as is typical with salmonids on long-term plant protein-based feeds. By incorporating this strain into functional analyses, we set out to determine which genes are critical to plant protein utilization in the absence of gut inflammation. After a 12-week feeding trial with our selected strain and a control trout strain fed either a fishmeal-based diet or an all-plant protein diet, high-throughput RNA sequencing was completed on both liver and muscle tissues. Differential gene expression analyses, weighted correlation network analyses and further functional characterization were performed. A strain-by-diet design revealed differential expression ranging from a few dozen to over one thousand genes among the various comparisons and tissues. Major gene ontology groups identified between comparisons included those encompassing central, intermediary and foreign molecule metabolism, associated biosynthetic pathways as well as immunity. A systems approach indicated that genes involved in purine metabolism were highly perturbed. Systems analysis among the tissues tested further suggests the interplay between selection for growth, dietary utilization and protein tolerance may also have implications for nonspecific immunity. By combining data from differential gene expression and co-expression networks using selected trout, along with ontology and pathway analyses, a set of 63 candidate genes for plant diet tolerance was found. Risk loci in human inflammatory bowel diseases were also found in our datasets, indicating rainbow trout selected for plant-diet tolerance may have added utility as a potential biomedical model.

  3. Cloning and sequencing of wsp encoding gene fragments reveals a diversity of co-infecting Wolbachia strains in Acromyrmex leafcutter ants

    DEFF Research Database (Denmark)

    van Borm, S.; Wenseleers, T.; Billen, J.

    2003-01-01

    By sequencing part of the wsp gene of a series of clones, we detected an unusually high diversity of nine Wolbachia strains in queens of three species of leafcutter ants. Up to four strains co-occurred in a single ant. Most strains occurred in two clusters (InvA and InvB), but the social parasite Acromyrmex insinuator hosted two additional infections. The multiple Wolbachia strains may influence the expression of reproductive conflicts in leafcutter ants, but the expected turnover of infections may make the cumulative effects on host ant reproduction complex. The additional Wolbachia infections...

  4. Harvard Aging Brain Study: Dataset and accessibility.

    Science.gov (United States)

    Dagley, Alexander; LaPoint, Molly; Huijbers, Willem; Hedden, Trey; McLaren, Donald G; Chatwal, Jasmeer P; Papp, Kathryn V; Amariglio, Rebecca E; Blacker, Deborah; Rentz, Dorene M; Johnson, Keith A; Sperling, Reisa A; Schultz, Aaron P

    2017-01-01

    The Harvard Aging Brain Study is sharing its data with the global research community. The longitudinal dataset consists of a 284-subject cohort with the following modalities acquired: demographics, clinical assessment, comprehensive neuropsychological testing, clinical biomarkers, and neuroimaging. To promote more extensive analyses, imaging data was designed to be compatible with other publicly available datasets. A cloud-based system enables access to interested researchers with blinded data available contingent upon completion of a data usage agreement and administrative approval. Data collection is ongoing and currently in its fifth year. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. Dislocation Interactions in Olivine Revealed by HR-EBSD

    Science.gov (United States)

    Wallis, David; Hansen, Lars N.; Britton, T. Ben; Wilkinson, Angus J.

    2017-10-01

    Interactions between dislocations potentially provide a control on strain rates produced by dislocation motion during creep of rocks at high temperatures. However, it has been difficult to establish the dominant types of interactions and their influence on the rheological properties of creeping rocks due to a lack of suitable observational techniques. We apply high-angular resolution electron backscatter diffraction to map geometrically necessary dislocation (GND) density, elastic strain, and residual stress in experimentally deformed single crystals of olivine. Short-range interactions are revealed by cross correlation of GND density maps. Spatial correlations between dislocation types indicate that noncollinear interactions may impede motion of proximal dislocations at temperatures of 1000°C and 1200°C. Long-range interactions are revealed by autocorrelation of GND density maps. These analyses reveal periodic variations in GND density and sign, with characteristic length scales on the order of 1-10 μm. These structures are spatially associated with variations in elastic strain and residual stress on the order of 10⁻³ and 100 MPa, respectively. Therefore, short-range interactions generate local accumulations of dislocations, leading to heterogeneous internal stress fields that influence dislocation motion over longer length scales. The impacts of these short- and/or long-range interactions on dislocation velocities may therefore influence the strain rate of the bulk material and are an important consideration for future models of dislocation-mediated deformation mechanisms in olivine. Establishing the types and impacts of dislocation interactions that occur across a range of laboratory and natural deformation conditions will help to establish the reliability of extrapolating laboratory-derived flow laws to real Earth conditions.
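
    How autocorrelation can expose a periodic length scale, as described in this abstract, can be sketched on a one-dimensional profile. This is a simplified illustration, not the authors' workflow: real GND density maps are two-dimensional, and the function names, the synthetic sinusoidal profile and the 0.5 μm step size are assumptions.

```python
import math

def autocorrelation(signal):
    """Normalized autocorrelation of a 1D profile for lags 0 .. len(signal) - 1."""
    n = len(signal)
    mean = sum(signal) / n
    centred = [s - mean for s in signal]
    variance_sum = sum(c * c for c in centred)
    return [
        sum(centred[i] * centred[i + lag] for i in range(n - lag)) / variance_sum
        for lag in range(n)
    ]

def characteristic_length(signal, step_um):
    """Lag of the first positive local maximum after lag 0, converted to micrometres."""
    acf = autocorrelation(signal)
    for lag in range(1, len(acf) - 1):
        if acf[lag] > 0 and acf[lag] > acf[lag - 1] and acf[lag] >= acf[lag + 1]:
            return lag * step_um
    return None  # no periodicity detected

# Synthetic profile with a period of 8 measurement points (hypothetical data).
profile = [math.sin(2.0 * math.pi * i / 8.0) for i in range(64)]
length_um = characteristic_length(profile, step_um=0.5)
```

    With a period of 8 points and a step of 0.5 μm, the first repeat of the pattern sits at a lag of 8 points, that is 4 μm, which lies within the 1-10 μm range quoted in the abstract only because the synthetic profile was chosen that way.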

  6. Discovering New Global Climate Patterns: Curating a 21-Year High Temporal (Hourly) and Spatial (40km) Resolution Reanalysis Dataset

    Science.gov (United States)

    Hou, C. Y.; Dattore, R.; Peng, G. S.

    2014-12-01

    The National Center for Atmospheric Research's Global Climate Four-Dimensional Data Assimilation (CFDDA) Hourly 40km Reanalysis dataset is a dynamically downscaled dataset with high temporal and spatial resolution. The dataset contains three-dimensional hourly analyses in netCDF format for the global atmospheric state from 1985 to 2005 on a 40km horizontal grid (0.4° grid increment) with 28 vertical levels, providing good representation of local forcing and diurnal variation of processes in the planetary boundary layer. This project aimed to make the dataset publicly available, accessible, and usable in order to provide a unique resource to allow and promote studies of new climate characteristics. When the curation project started, it had been five years since the data files were generated. Also, although the Principal Investigator (PI) had generated a user document at the end of the project in 2009, the document had not been maintained. Furthermore, the PI had moved to a new institution, and the remaining team members were reassigned to other projects. These factors made data curation especially challenging in the areas of verifying data quality, harvesting metadata descriptions, and documenting provenance information. As a result, the project's curation process found that: data curators' skill and knowledge helped make decisions, such as file format and structure and workflow documentation, that had a significant, positive impact on the ease of the dataset's management and long-term preservation; use of data curation tools, such as the Data Curation Profiles Toolkit's guidelines, revealed important information for promoting the data's usability and enhancing preservation planning; and involving data curators during each stage of the data curation life cycle instead of at the end could improve the curation process's efficiency. Overall, the project showed that proper resources invested in the curation process would give datasets the best chance to fulfill their potential to

  7. Sensitivity of a numerical wave model on wind re-analysis datasets

    Science.gov (United States)

    Lavidas, George; Venugopal, Vengatesan; Friedrich, Daniel

    2017-03-01

    Wind is the dominant process for wave generation. Detailed evaluation of metocean conditions strengthens our understanding of issues concerning potential offshore applications. However, the scarcity of buoys and high cost of monitoring systems pose a barrier to properly defining offshore conditions. Through use of numerical wave models, metocean conditions can be hindcasted and forecasted, providing reliable characterisations. This study reports the sensitivity of a numerical wave model to wind inputs for the Scottish region. Two re-analysis wind datasets with different spatio-temporal characteristics are used, the ERA-Interim Re-Analysis and the CFSR-NCEP Re-Analysis dataset. Different wind products alter results, affecting the accuracy obtained. The scope of this study is to assess different available wind databases and provide information concerning the most appropriate wind dataset for the specific region, based on temporal, spatial and geographic terms for wave modelling and offshore applications. Both wind input datasets delivered results from the numerical wave model with good correlation. Wave results by the 1-h dataset have higher peaks and lower biases, at the expense of a high scatter index. On the other hand, the 6-h dataset has lower scatter but higher biases. The study shows how the wind dataset affects the numerical wave modelling performance, and that depending on location and study needs, different wind inputs should be considered.
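
    The bias, scatter index and correlation used in this abstract to compare model output with measurements can be sketched as below. The definitions are assumptions, since the abstract does not give them: bias is taken as the mean of modelled minus observed values, and the scatter index as the root-mean-square error divided by the observed mean (some studies remove the bias first). The function name and the toy wave heights are hypothetical.

```python
import math

def wave_model_skill(modelled, observed):
    """Return bias, RMSE, scatter index and Pearson correlation for paired series."""
    n = len(observed)
    mean_m = sum(modelled) / n
    mean_o = sum(observed) / n

    bias = mean_m - mean_o
    rmse = math.sqrt(sum((m - o) ** 2 for m, o in zip(modelled, observed)) / n)
    scatter_index = rmse / mean_o  # assumed definition: RMSE over observed mean

    covariance = sum((m - mean_m) * (o - mean_o) for m, o in zip(modelled, observed))
    var_m = sum((m - mean_m) ** 2 for m in modelled)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    correlation = covariance / math.sqrt(var_m * var_o)

    return {
        "bias": bias,
        "rmse": rmse,
        "scatter_index": scatter_index,
        "correlation": correlation,
    }

# Toy significant wave heights in metres, invented for this example.
buoy = [1.0, 2.0, 3.0, 4.0]
model = [1.5, 2.5, 3.5, 4.5]
skill = wave_model_skill(model, buoy)
```

    The toy series shows why the metrics are reported together: a model that is offset by a constant 0.5 m correlates perfectly with the buoy yet still carries a bias and a non-zero scatter index.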

  8. Displacement sensing based on modal interference in polymer optical fibers with partially applied strain

    Science.gov (United States)

    Mizuno, Yosuke; Hagiwara, Sonoko; Kawa, Tomohito; Lee, Heeyoung; Nakamura, Kentaro

    2018-05-01

    Strain sensing based on modal interference in multimode fibers (MMFs) has been extensively studied, but no experimental or theoretical reports have been given as to how the system works when strain is applied not to the whole MMF but only to part of the MMF. Here, using a perfluorinated graded-index polymer optical fiber as the MMF, we investigate the strain sensing characteristics of this type of sensor when strain is partially applied to fiber sections with different lengths. The strain sensitivity dependence on the length of the strained section reveals that this strain sensor actually behaves as a displacement sensor.

  9. Querying Large Biological Network Datasets

    Science.gov (United States)

    Gulsoy, Gunhan

    2013-01-01

    New experimental methods have resulted in an increasing amount of genetic interaction data being generated every day. Biological networks are used to store the genetic interaction data gathered. The increasing amount of data available requires fast, large-scale analysis methods. Therefore, we address the problem of querying large biological network datasets.…

  10. BanglaLekha-Isolated: A multi-purpose comprehensive dataset of Handwritten Bangla Isolated characters

    Directory of Open Access Journals (Sweden)

    Mithun Biswas

    2017-06-01

    BanglaLekha-Isolated, a Bangla handwritten isolated character dataset, is presented in this article. This dataset contains 84 different characters comprising 50 Bangla basic characters, 10 Bangla numerals and 24 selected compound characters. 2000 handwriting samples for each of the 84 characters were collected, digitized and pre-processed. After discarding mistakes and scribbles, 1,66,105 handwritten character images were included in the final dataset. The dataset also includes labels indicating the age and the gender of the subjects from whom the samples were collected. This dataset could be used not only for optical handwriting recognition research but also to explore the influence of gender and age on handwriting. The dataset is publicly available at https://data.mendeley.com/datasets/hf6sf8zrkc/2.

  11. Actin and microtubule networks contribute differently to cell response for small and large strains

    Science.gov (United States)

    Kubitschke, H.; Schnauss, J.; Nnetu, K. D.; Warmt, E.; Stange, R.; Kaes, J.

    2017-09-01

    Cytoskeletal filaments provide cells with mechanical stability and organization. The key players are actin filaments and microtubules, which govern a cell’s response to mechanical stimuli. We investigated the specific influences of these crucial components by deforming MCF-7 epithelial cells at small (≤5% deformation) and large strains (>5% deformation). To understand specific contributions of actin filaments and microtubules, we systematically studied cellular responses after treatment with cytoskeleton influencing drugs. Quantification with the microfluidic optical stretcher allowed capturing the relative deformation and relaxation of cells under different conditions. We separated distinctive deformational and relaxational contributions to cell mechanics for actin and microtubule networks for two orders of magnitude of drug dosages. Disrupting actin filaments via latrunculin A, for instance, revealed a strain-independent softening. Stabilizing these filaments by treatment with jasplakinolide yielded cell softening for small strains but showed no significant change at large strains. In contrast, cells treated with nocodazole to disrupt microtubules displayed a softening at large strains but remained unchanged at small strains. Stabilizing microtubules within the cells via paclitaxel revealed no significant changes for deformations at small strains, but a concentration-dependent impact at large strains. This suggests that for suspended cells, the actin cortex is probed at small strains, while at larger strains the whole cell is probed, with a significant contribution from the microtubules.

  12. Linking strain anisotropy and plasticity in copper metallization

    International Nuclear Information System (INIS)

    Murray, Conal E.; Jordan-Sweet, Jean; Priyadarshini, Deepika; Nguyen, Son

    2015-01-01

    The elastic anisotropy of copper leads to significant variation in the x-ray elastic constants (XEC), which link diffraction-based strain measurements to stress. An accurate depiction of the mechanical response in copper thin films requires a determination of an appropriate grain interaction model that lies between the Voigt and Reuss limits. It is shown that the associated XEC weighting fraction, x*, between these limits provides a metric by which strain anisotropy can be quantified. Experimental values of x*, as determined by a linear regression scheme of diffraction data collected from multiple reflections, reveal the degree of strain anisotropy and its dependence on plastic deformation induced during in-situ and ex-situ thermal treatments.
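
    The role of the weighting fraction described above can be written out as a short sketch. The linear mixing form, the convention that x* = 1 corresponds to the Reuss limit, and the equibiaxial in-plane stress state in the second relation are all assumptions made here for illustration; the abstract itself gives no formulas.

```latex
% Assumed linear mixing of the x-ray elastic constants between the two limits
S_i^{hkl}(x^*) = x^*\, S_i^{\mathrm{Reuss},\,hkl} + (1 - x^*)\, S_i^{\mathrm{Voigt}},
\qquad 0 \le x^* \le 1, \quad i \in \{1, 2\}

% Standard sin^2(psi) relation for an equibiaxial in-plane stress sigma (assumed stress state)
\varepsilon_{\psi}^{hkl} = \tfrac{1}{2}\, S_2^{hkl}(x^*)\,\sigma\,\sin^2\psi + 2\, S_1^{hkl}(x^*)\,\sigma
```

    Under these assumptions, strain measured against sin²ψ for several reflections hkl would constrain a single value of x*, which is the sense in which x* can act as a scalar measure of strain anisotropy.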

  13. A dataset of human decision-making in teamwork management

    Science.gov (United States)

    Yu, Han; Shen, Zhiqi; Miao, Chunyan; Leung, Cyril; Chen, Yiqiang; Fauvel, Simon; Lin, Jun; Cui, Lizhen; Pan, Zhengxiang; Yang, Qiang

    2017-01-01

    Today, most endeavours require teamwork by people with diverse skills and characteristics. In managing teamwork, decisions are often made under uncertainty and resource constraints. The strategies and the effectiveness of the strategies different people adopt to manage teamwork under different situations have not yet been fully explored, partially due to a lack of detailed large-scale data. In this paper, we describe a multi-faceted large-scale dataset to bridge this gap. It is derived from a game simulating complex project management processes. It presents the participants with different conditions in terms of team members' capabilities and task characteristics for them to exhibit their decision-making strategies. The dataset contains detailed data reflecting the decision situations, decision strategies, decision outcomes, and the emotional responses of 1,144 participants from diverse backgrounds. To our knowledge, this is the first dataset simultaneously covering these four facets of decision-making. With repeated measurements, the dataset may help establish baseline variability of decision-making in teamwork management, leading to more realistic decision theoretic models and more effective decision support approaches.

  14. Analysis of Public Datasets for Wearable Fall Detection Systems

    Directory of Open Access Journals (Sweden)

    Eduardo Casilari

    2017-06-01

    Due to the boom of wireless handheld devices such as smartwatches and smartphones, wearable Fall Detection Systems (FDSs) have become a major focus of attention among the research community in recent years. The effectiveness of a wearable FDS must be contrasted against a wide variety of measurements obtained from inertial sensors during the occurrence of falls and Activities of Daily Living (ADLs). In this regard, the access to public databases constitutes the basis for an open and systematic assessment of fall detection techniques. This paper reviews and appraises twelve existing available data repositories containing measurements of ADLs and emulated falls envisaged for the evaluation of fall detection algorithms in wearable FDSs. The analysis of the found datasets is performed in a comprehensive way, taking into account the multiple factors involved in the definition of the testbeds deployed for the generation of the mobility samples. The study of the traces brings to light the lack of a common experimental benchmarking procedure and, consequently, the large heterogeneity of the datasets from a number of perspectives (length and number of samples, typology of the emulated falls and ADLs, characteristics of the test subjects, features and positions of the sensors, etc.). Concerning this, the statistical analysis of the samples reveals the impact of the sensor range on the reliability of the traces. In addition, the study evidences the importance of the selection of the ADLs and the need to categorize the ADLs depending on the intensity of the movements in order to evaluate the capability of a certain detection algorithm to discriminate falls from ADLs.

  15. EVALUATION OF LAND USE/LAND COVER DATASETS FOR URBAN WATERSHED MODELING

    International Nuclear Information System (INIS)

    S.J. BURIAN; M.J. BROWN; T.N. MCPHERSON

    2001-01-01

    Land use/land cover (LULC) data are a vital component for nonpoint source pollution modeling. Most watershed hydrology and pollutant loading models use, in some capacity, LULC information to generate runoff and pollutant loading estimates. Simple equation methods predict runoff and pollutant loads using runoff coefficients or pollutant export coefficients that are often correlated to LULC type. Complex models use input variables and parameters to represent watershed characteristics and pollutant buildup and washoff rates as a function of LULC type. Whether using simple or complex models, an accurate LULC dataset with an appropriate spatial resolution and level of detail is paramount for reliable predictions. The study presented in this paper compared and evaluated several LULC dataset sources for application in urban environmental modeling. The commonly used USGS LULC datasets have coarser spatial resolution and lower levels of classification than other LULC datasets. In addition, the USGS datasets do not accurately represent the land use in areas that have undergone significant land use change during the past two decades. We performed a watershed modeling analysis of three urban catchments in Los Angeles, California, USA to investigate the relative difference in average annual runoff volumes and total suspended solids (TSS) loads when using the USGS LULC dataset versus using a more detailed and current LULC dataset. When the two LULC datasets were aggregated to the same land use categories, the relative differences in predicted average annual runoff volumes and TSS loads from the three catchments were 8 to 14% and 13 to 40%, respectively. The relative differences did not have a predictable relationship with catchment size.
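
    The "simple equation" approach mentioned above can be illustrated with a minimal sketch: annual runoff volume is taken as rainfall depth times the sum of runoff coefficient times area over the LULC classes. The function names, class names, coefficient values and the definition of relative difference are hypothetical and are not taken from the study.

```python
def annual_runoff_volume(areas_ha, runoff_coeffs, rainfall_mm):
    """Runoff volume in cubic metres: V = P * sum(C_i * A_i) over LULC classes.

    areas_ha      -- {lulc_class: area in hectares} (hypothetical input layout)
    runoff_coeffs -- {lulc_class: dimensionless runoff coefficient, 0..1}
    rainfall_mm   -- annual rainfall depth in millimetres
    """
    rainfall_m = rainfall_mm / 1000.0
    volume = 0.0
    for lulc_class, area_ha in areas_ha.items():
        area_m2 = area_ha * 10_000.0  # 1 hectare = 10,000 square metres
        volume += runoff_coeffs[lulc_class] * area_m2 * rainfall_m
    return volume


def relative_difference_percent(value, reference):
    """Absolute difference relative to a reference value, in percent (assumed definition)."""
    return abs(value - reference) / reference * 100.0


if __name__ == "__main__":
    # Illustrative coefficients only; real values depend on the catchment.
    coeffs = {"residential": 0.5, "open_space": 0.1}
    coarse = annual_runoff_volume({"residential": 100, "open_space": 50}, coeffs, 400)
    detailed = annual_runoff_volume({"residential": 110, "open_space": 40}, coeffs, 400)
    print(coarse, detailed, relative_difference_percent(coarse, detailed))
```

    Because the result is linear in the class areas, any reclassification of land between two LULC datasets shifts the estimate in proportion to the difference in the runoff coefficients of the classes involved.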

  16. The Geographic Distribution of Saccharomyces cerevisiae Isolates within three Italian Neighboring Winemaking Regions Reveals Strong Differences in Yeast Abundance, Genetic Diversity and Industrial Strain Dissemination

    Directory of Open Access Journals (Sweden)

    Alessia Viel

    2017-08-01

    In recent years the interest for natural fermentations has been re-evaluated in terms of increasing the wine terroir and managing more sustainable winemaking practices. Therefore, the level of yeast genetic variability and the abundance of Saccharomyces cerevisiae native populations in vineyards are becoming more and more crucial at both the ecological and technological level. Among the factors that can influence the strain diversity, the accidental release of commercial starters into the environment around the winery has to be considered. In this study we conducted a wide-scale investigation of S. cerevisiae genetic diversity and population structure in the vineyards of three neighboring winemaking regions of Protected Appellation of Origin, in North-East of Italy. Combining mtDNA RFLP and microsatellite marker analyses, we evaluated 634 grape samples collected over 3 years. We could detect major differences in the presence of S. cerevisiae yeasts, according to the winemaking region. The population structures revealed specificities of yeast microbiota at vineyard scale, with a relative Appellation of Origin area homogeneity, and transition zones suggesting a geographic differentiation. Surprisingly, we found a widespread industrial yeast dissemination that was very high in the areas where the native yeast abundance was low. Although geographical distance is a key element involved in strain distribution, the high presence of industrial strains in vineyards reduced the differences between populations. This finding indicates that industrial yeast diffusion is a real emergency and that the presence of industrial yeasts strongly interferes with the natural yeast microbiota.

  17. Sharing Video Datasets in Design Research

    DEFF Research Database (Denmark)

    Christensen, Bo; Abildgaard, Sille Julie Jøhnk

    2017-01-01

    This paper examines how design researchers, design practitioners and design education can benefit from sharing a dataset. We present the Design Thinking Research Symposium 11 (DTRS11) as an exemplary project that implied sharing video data of design processes and design activity in natural settings...... with a large group of fellow academics from the international community of Design Thinking Research, for the purpose of facilitating research collaboration and communication within the field of Design and Design Thinking. This approach emphasizes the social and collaborative aspects of design research, where...... a multitude of appropriate perspectives and methods may be utilized in analyzing and discussing the singular dataset. The shared data is, from this perspective, understood as a design object in itself, which facilitates new ways of working, collaborating, studying, learning and educating within the expanding...

  18. Interpolation of diffusion weighted imaging datasets

    DEFF Research Database (Denmark)

    Dyrby, Tim B; Lundell, Henrik; Burke, Mark W

    2014-01-01

    Diffusion weighted imaging (DWI) is used to study white-matter fibre organisation, orientation and structural connectivity by means of fibre reconstruction algorithms and tractography. For clinical settings, limited scan time compromises the possibilities to achieve high image resolution for finer anatomical details and signal-to-noise-ratio for reliable fibre reconstruction. We assessed the potential benefits of interpolating DWI datasets to a higher image resolution before fibre reconstruction using a diffusion tensor model. Simulations of straight and curved crossing tracts smaller than or equal...... interpolation methods fail to disentangle fine anatomical details if PVE is too pronounced in the original data. As for validation we used ex-vivo DWI datasets acquired at various image resolutions as well as Nissl-stained sections. Increasing the image resolution by a factor of eight yielded finer geometrical...

  19. Evidence for differences between B. bruxellensis strains originating from an enological environment

    Directory of Open Access Journals (Sweden)

    Vincent Renouf

    2009-03-01

    Vincent Renouf (1,2), Cécile Miot-Sertier (2), Marie-Claire Perello (2), Gilles de Revel (2), Aline Lonvaud-Funel (2). (1) Laffort, Bordeaux, France; (2) UMR Œnologie, INRA-Université Bordeaux, France. Abstract: The aim of this paper is to study and compare the physiological diversity of different strains of a wine spoilage yeast species: Brettanomyces bruxellensis. The minimum inhibitory concentrations of several drugs on different B. bruxellensis strains were scored on solid nutrient media. This revealed variations in resistance among the B. bruxellensis strains. Their capacity to develop in different wine and must environments (pH, ethanol, and SO2 concentrations) was evaluated by measuring the direct incubation survival rate. The results, compared with those obtained for other wine yeast species, confirmed the remarkable resistance of B. bruxellensis strains to various conditions which inhibit the growth of other species. Nevertheless some differences were observed among the B. bruxellensis strains, thus confirming their physiological diversity. A comparison of their volatile phenol production revealed intraspecific heterogeneity among B. bruxellensis strains. B. bruxellensis is one of the microbial species most resistant to environmental constraints in wine. It is the best adapted to growing in wine and spoiling it by volatile phenol production. However, different B. bruxellensis strains exhibit varying characteristics, particularly their capacity to produce volatile phenols. This implies that certain strains are more prejudicial than others. Further studies are required to determine the molecular causes of this intraspecific diversity. Keywords: Brettanomyces bruxellensis, strain diversity, physiology, volatile phenols

  20. Cardiac biplane strain imaging: initial in vivo experience

    Science.gov (United States)

    Lopata, R. G. P.; Nillesen, M. M.; Verrijp, C. N.; Singh, S. K.; Lammens, M. M. Y.; van der Laak, J. A. W. M.; van Wetten, H. B.; Thijssen, J. M.; Kapusta, L.; de Korte, C. L.

    2010-02-01

    In this study, first we propose a biplane strain imaging method using a commercial ultrasound system, yielding estimation of the strain in three orthogonal directions. Secondly, an animal model of a child's heart was introduced that is suitable to simulate congenital heart disease and was used to test the method in vivo. The proposed approach can serve as a framework to monitor the development of cardiac hypertrophy and fibrosis. A 2D strain estimation technique using radio frequency (RF) ultrasound data was applied. Biplane image acquisition was performed at a relatively low frame rate (dogs with an aortic stenosis. Initial results reveal the feasibility of measuring large radial, circumferential and longitudinal cumulative strain (up to 70%) at a frame rate of 100 Hz. Mean radial strain curves of a manually segmented region-of-interest in the infero-lateral wall show excellent correlation between the measured strain curves acquired in two perpendicular planes. Furthermore, the results show the feasibility and reproducibility of assessing radial, circumferential and longitudinal strains simultaneously. In this preliminary study, three beagles developed an elevated pressure gradient over the aortic valve (Δp: 100-200 mmHg) and myocardial hypertrophy. One dog did not develop any sign of hypertrophy (Δp = 20 mmHg). Initial strain (rate) results showed that the maximum strain (rate) decreased with increasing valvular stenosis (-50%), which is in accordance with previous studies. Histological findings corroborated these results and showed an increase in fibrotic tissue for the hearts with larger pressure gradients (100, 200 mmHg), as well as lower strain and strain rate values.

  1. Rapid molecular evolution of human bocavirus revealed by Bayesian coalescent inference.

    Science.gov (United States)

    Zehender, Gianguglielmo; De Maddalena, Chiara; Canuti, Marta; Zappa, Alessandra; Amendola, Antonella; Lai, Alessia; Galli, Massimo; Tanzi, Elisabetta

    2010-03-01

    Human bocavirus (HBoV) is a linear single-stranded DNA virus belonging to the Parvoviridae family that has recently been isolated from the upper respiratory tract of children with acute respiratory infection. All of the strains observed so far segregate into two genotypes (1 and 2) with a low level of polymorphism. Given the recent description of the infection and the lack of epidemiological and molecular data, we estimated the virus's rates of molecular evolution and population dynamics. A dataset of forty-nine dated VP2 sequences, including also eight new isolates obtained from pharyngeal swabs of Italian patients with acute respiratory tract infections, was submitted to phylogenetic analysis. The model parameters, evolutionary rates and population dynamics were co-estimated using a Bayesian Markov Chain Monte Carlo approach, and site-specific positive and negative selection was also investigated. Recombination was investigated by seven different methods and one suspected recombinant strain was excluded from further analysis. The estimated mean evolutionary rate of HBoV was 8.6 × 10^-4 subs/site/year, and that of the 1st+2nd codon positions was more than 15 times less than that of the 3rd codon position. Viral population dynamics analysis revealed that the two known genotypes diverged recently (mean tMRCA: 24 years), and that the epidemic due to HBoV genotype 2 grew exponentially at a rate of 1.01 year^-1. Selection analysis of the partial VP2 showed that 8.5% of sites were under significant negative pressure and the absence of positive selection. Our results show that, like other parvoviruses, HBoV is characterised by a rapid evolution. The low level of polymorphism is probably due to a relatively recent divergence between the circulating genotypes and strong purifying selection acting on viral antigens.

  2. Rapid discrimination of strain-dependent fermentation characteristics among Lactobacillus strains by NMR-based metabolomics of fermented vegetable juice.

    Directory of Open Access Journals (Sweden)

    Satoru Tomita

    In this study, we investigated the applicability of NMR-based metabolomics to discriminate strain-dependent fermentation characteristics of lactic acid bacteria (LAB), which are important microorganisms for fermented food production. To evaluate the discrimination capability, six type strains of Lactobacillus species and six additional L. brevis strains were used, focusing on (i) the difference between homo- and hetero-lactic fermentative species and (ii) strain-dependent characteristics within L. brevis. Based on the differences in the metabolite profiles of fermented vegetable juices, non-targeted principal component analysis (PCA) clearly separated the samples into those inoculated with homo- and hetero-lactic fermentative species. The separation was primarily explained by the different levels of dominant metabolites (lactic acid, acetic acid, ethanol, and mannitol). Orthogonal partial least squares discrimination analysis, based on a regions-of-interest (ROIs) approach, revealed the contribution of low-abundance metabolites: acetoin, phenyllactic acid, p-hydroxyphenyllactic acid, glycerophosphocholine, and succinic acid for homolactic fermentation; and ornithine, tyramine, and γ-aminobutyric acid (GABA) for heterolactic fermentation. Furthermore, ROIs-based PCA of seven L. brevis strains separated their strain-dependent fermentation characteristics primarily based on their ability to utilize sucrose and citric acid, and convert glutamic acid and tyrosine into GABA and tyramine, respectively. In conclusion, NMR metabolomics successfully discriminated the fermentation characteristics of the tested strains and provided further information on metabolites responsible for these characteristics, which may impact the taste, aroma, and functional properties of fermented foods.

  3. Development of a SPARK Training Dataset

    International Nuclear Information System (INIS)

    Sayre, Amanda M.; Olson, Jarrod R.

    2015-01-01

    In its first five years, the National Nuclear Security Administration's (NNSA) Next Generation Safeguards Initiative (NGSI) sponsored more than 400 undergraduate, graduate, and post-doctoral students in internships and research positions (Wyse 2012). In the past seven years, the NGSI program has produced, and continues to produce, a large body of scientific, technical, and policy work in targeted core safeguards capabilities and human capital development activities. Not only does the NGSI program carry out activities across multiple disciplines, but also across all U.S. Department of Energy (DOE)/NNSA locations in the United States. However, products are not readily shared among disciplines and across locations, nor are they archived in a comprehensive library. Rather, knowledge of NGSI-produced literature is localized to the researchers, clients, and internal laboratory/facility publication systems such as the Electronic Records and Information Capture Architecture (ERICA) at the Pacific Northwest National Laboratory (PNNL). There is also no incorporated way of analyzing existing NGSI literature to determine whether the larger NGSI program is achieving its core safeguards capabilities and activities. A complete library of NGSI literature could prove beneficial to a cohesive, sustainable, and more economical NGSI program. The Safeguards Platform for Automated Retrieval of Knowledge (SPARK) has been developed to be a knowledge storage, retrieval, and analysis capability to capture safeguards knowledge to exist beyond the lifespan of NGSI. During the development process, it was necessary to build a SPARK training dataset (a corpus of documents) for initial entry into the system and for demonstration purposes. We manipulated these data to gain new information about the breadth of NGSI publications, and we evaluated the science-policy interface at PNNL as a practical demonstration of SPARK's intended analysis capability. The analysis demonstration sought to answer

  4. Resampling Methods Improve the Predictive Power of Modeling in Class-Imbalanced Datasets

    Directory of Open Access Journals (Sweden)

    Paul H. Lee

    2014-09-01

    In the medical field, many outcome variables are dichotomized, and the two possible values of a dichotomized variable are referred to as classes. A dichotomized dataset is class-imbalanced if it consists mostly of one class, and performance of common classification models on this type of dataset tends to be suboptimal. To tackle such a problem, resampling methods, including oversampling and undersampling, can be used. This paper aims at illustrating the effect of resampling methods using the National Health and Nutrition Examination Survey (NHANES) wave 2009–2010 dataset. A total of 4677 participants aged ≥20 without self-reported diabetes and with valid blood test results were analyzed. The Classification and Regression Tree (CART) procedure was used to build a classification model on undiagnosed diabetes. A participant demonstrated evidence of diabetes according to WHO diabetes criteria. Exposure variables included demographics and socio-economic status. CART models were fitted using a randomly selected 70% of the data (training dataset), and area under the receiver operating characteristic curve (AUC) was computed using the remaining 30% of the sample for evaluation (testing dataset). CART models were fitted using the training dataset, the oversampled training dataset, the weighted training dataset, and the undersampled training dataset. In addition, resampling case-to-control ratios of 1:1, 1:2, and 1:4 were examined. The effects of resampling methods on the performance of other extensions of CART (random forests and generalized boosted trees) were also examined. CARTs fitted on the oversampled (AUC = 0.70) and undersampled training data (AUC = 0.74) yielded a better classification power than that on the training data (AUC = 0.65). Resampling could also improve the classification power of random forests and generalized boosted trees. To conclude, applying resampling methods in a class-imbalanced dataset improved the classification power of CART, random forests
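
    The two resampling schemes named above can be sketched with the standard library alone. This is a minimal illustration, assuming random oversampling of cases with replacement and random undersampling of controls without replacement; the function names and the `controls_per_case` parameter are hypothetical, and the study's actual procedure may differ.

```python
import random


def oversample_cases(cases, controls, controls_per_case=1, seed=0):
    """Randomly duplicate cases (with replacement) until the case-to-control
    ratio is roughly 1:controls_per_case. Controls are returned unchanged."""
    rng = random.Random(seed)
    target = max(len(cases), round(len(controls) / controls_per_case))
    extra = [rng.choice(cases) for _ in range(target - len(cases))]
    return list(cases) + extra, list(controls)


def undersample_controls(cases, controls, controls_per_case=1, seed=0):
    """Randomly keep a subset of controls (without replacement) so the
    case-to-control ratio is 1:controls_per_case. Cases are returned unchanged."""
    rng = random.Random(seed)
    target = min(len(controls), len(cases) * controls_per_case)
    return list(cases), rng.sample(list(controls), target)


if __name__ == "__main__":
    cases = list(range(10))            # minority class, e.g. undiagnosed diabetes
    controls = list(range(100, 190))   # majority class
    for ratio in (1, 2, 4):
        over = oversample_cases(cases, controls, ratio)
        under = undersample_controls(cases, controls, ratio)
        print(ratio, len(over[0]), len(over[1]), len(under[0]), len(under[1]))
```

    In keeping with the design described in the abstract, resampling of this kind would be applied to the training split only, with the held-out testing split left untouched so that the AUC is computed on data with the original class balance.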

  5. Influence of Cyclic Straining on Fatigue, Deformation, and Fracture Behavior of High-Strength Alloy Steel

    Science.gov (United States)

    Manigandan, K.; Srivatsan, T. S.; Vasudevan, V. K.; Tammana, D.; Poorganji, B.

    2016-01-01

    In this paper, the results of a study on microstructural influences on mechanical behavior of the high-strength alloy steel Tenax™ 310 are presented and discussed. Under the influence of fully reversed strain cycling, the stress response of this alloy steel revealed softening from the onset of deformation. Cyclic strain resistance exhibited a linear trend for the variation of both elastic strain amplitude with reversals-to-failure, and plastic strain amplitude with reversals-to-failure. Fracture morphology was essentially the same at the macroscopic level over the entire range of cyclic strain amplitudes examined. However, at the fine microscopic level, this high-strength alloy steel revealed fracture to be mixed-mode with features reminiscent of "locally" ductile and brittle mechanisms. The macroscopic mechanisms governing stress response at the fine microscopic level, resultant fatigue life, and final fracture behavior are presented and discussed in light of the mutually interactive influences of intrinsic microstructural effects, deformation characteristics of the microstructural constituents during fully reversed strain cycling, cyclic strain amplitude, and resultant response stress.

  6. BASE MAP DATASET, INYO COUNTY, OKLAHOMA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  7. BASE MAP DATASET, JACKSON COUNTY, OKLAHOMA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  8. BASE MAP DATASET, KINGFISHER COUNTY, OKLAHOMA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  9. Prevalence of job strain among Indian foundry shop floor workers.

    Science.gov (United States)

    Mohan, G Madhan; Elangovan, S; Prasad, P S S; Krishna, P Rama; Mokkapati, Anil Kumar

    2008-01-01

    Global competition in the manufacturing sector demands higher productivity levels. In this context, workers in this sector are set high output targets, leading to job strain. In addition to the strain, hazardous conditions also prevail in some manufacturing processes, such as foundry activities. This paper attempts to appraise the prevalence of job strain among foundry shop floor workers in India with the help of the Demands-Control model [8]. In this study, data were collected through a survey using the 49-item Job Content Questionnaire (JCQ) [9], a widely used and well-validated test for job strain. The data were then subjected to statistical analysis after ascertaining their reliability. This survey has revealed that 25% of foundry workers were experiencing high job strain. Hazardous working conditions, limited decision-making authority, etc. appear to be the main contributing factors for the higher levels of strain.

  10. In-vivo expression profiling of Pseudomonas aeruginosa infections reveals niche-specific and strain-independent transcriptional programs.

    Directory of Open Access Journals (Sweden)

    Piotr Bielecki

    Pseudomonas aeruginosa is a threatening, opportunistic pathogen causing disease in immunocompromised individuals. The hallmark of P. aeruginosa virulence is its multi-factorial and combinatorial nature. It renders these bacteria infectious for many organisms, and they are often resistant to antibiotics. To gain insights into the physiology of P. aeruginosa during infection, we assessed the transcriptional programs of three different P. aeruginosa strains directly after isolation from burn wounds of humans. We compared the programs to those of the same strains using two infection models: a plant model, which consisted of the infection of the midrib of lettuce leaves, and a murine tumor model, which was obtained by infection of mice with an induced tumor in the abdomen. All control conditions of P. aeruginosa cells growing in suspension and as a biofilm were added to the analysis. We found that these different P. aeruginosa strains express a pool of distinct genetic traits that are activated under particular infection conditions regardless of their genetic variability. The knowledge herein generated will advance our understanding of P. aeruginosa virulence and provide valuable cues for the definition of prospective targets to develop novel intervention strategies.

  11. Novel distributed strain sensing in polymeric materials

    International Nuclear Information System (INIS)

    Abot, Jandro L; Song, Yi; Medikonda, Sandeep; Rooy, Nathan; Schulz, Mark J

    2010-01-01

    Monitoring the state of strain throughout an entire structure is essential to determine its state of stress, detect potential residual stresses after fabrication, and also to help establish its integrity. Several sensing technologies are presently available to determine the strain at the surface or inside a structure. Large sensor dimensions, complex signal conditioning equipment, and difficulty in achieving a widely distributed system have however hindered their development into robust structural health monitoring techniques. Recently, carbon nanotube forests were spun into a microscale thread that is electrically conductive, tough, and easily tailorable. The thread was integrated into polymeric materials and used for the first time as a piezoresistive sensor to monitor strain and also to detect damage in the material. It is revealed that the created self-sensing polymeric materials are sensitive to normal strains above 0.07% and that the sensor thread exhibits a perfectly linear delta resistance–strain response above 0.3%. The longitudinal gauge factors were determined to be in the 2–5 range. This low cost and simple built-in sensor thread may provide a new integrated and distributed sensor technology that enables robust real-time health monitoring of structures.
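
    The gauge factor quoted above is conventionally the slope of relative resistance change against strain, GF = (ΔR/R0)/ε. The sketch below estimates it by an ordinary least-squares fit; the function name and the sample readings are hypothetical and are not data from the study.

```python
def gauge_factor(strains, resistances, r0):
    """Least-squares slope of (R - R0)/R0 versus strain.

    strains     -- strain values as fractions (0.003 means 0.3%)
    resistances -- measured resistance at each strain, same units as r0
    r0          -- unstrained reference resistance
    """
    rel_change = [(r - r0) / r0 for r in resistances]
    n = len(strains)
    mean_x = sum(strains) / n
    mean_y = sum(rel_change) / n
    sxx = sum((x - mean_x) ** 2 for x in strains)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(strains, rel_change))
    return sxy / sxx


if __name__ == "__main__":
    # Made-up readings in the linear regime (strain above 0.3%), r0 = 100 ohm.
    strains = [0.003, 0.004, 0.005, 0.006]
    resistances = [100.9, 101.2, 101.5, 101.8]
    print(gauge_factor(strains, resistances, r0=100.0))
```

    Restricting the fit to strains above the reported 0.3% threshold matches the range over which the abstract describes the response as linear; below that range a single slope would not be expected to describe the data.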

  12. Structural dataset for the PPARγ V290M mutant

    Directory of Open Access Journals (Sweden)

    Ana C. Puhl

    2016-06-01

    Loss-of-function mutation V290M in the ligand-binding domain of the peroxisome proliferator activated receptor γ (PPARγ) is associated with a ligand resistance syndrome (PLRS), characterized by partial lipodystrophy and severe insulin resistance. In this data article we discuss an X-ray diffraction dataset that yielded the structure of the PPARγ LBD V290M mutant refined at 2.3 Å resolution, which allowed building of a 3D model of the receptor mutant with high confidence and revealed continuous well-defined electron density for the partial agonist diclofenac bound to the hydrophobic pocket of PPARγ. These structural data provide significant insights into the molecular basis of PLRS caused by the V290M mutation and are correlated with the receptor's inability to bind rosiglitazone and its increased affinity for corepressors. Furthermore, our structural evidence helps to explain clinical observations which point to a failure to restore receptor function by treatment with a full agonist of PPARγ, rosiglitazone.

  13. Image segmentation evaluation for very-large datasets

    Science.gov (United States)

    Reeves, Anthony P.; Liu, Shuang; Xie, Yiting

    2016-03-01

    With the advent of modern machine learning methods and fully automated image analysis there is a need for very large image datasets having documented segmentations for both computer algorithm training and evaluation. Current approaches of visual inspection and manual markings do not scale well to big data. We present a new approach that depends on fully automated algorithm outcomes for segmentation documentation, requires no manual marking, and provides quantitative evaluation for computer algorithms. The documentation of new image segmentations and new algorithm outcomes are achieved by visual inspection. The burden of visual inspection on large datasets is minimized by (a) customized visualizations for rapid review and (b) reducing the number of cases to be reviewed through analysis of quantitative segmentation evaluation. This method has been applied to a dataset of 7,440 whole-lung CT images for 6 different segmentation algorithms designed to fully automatically facilitate the measurement of a number of very important quantitative image biomarkers. The results indicate that we could achieve 93% to 99% successful segmentation for these algorithms on this relatively large image database. The presented evaluation method may be scaled to much larger image databases.

  14. Strain, Stress and Seismicity pattern in Switzerland

    Science.gov (United States)

    Houlié, Nicolas; Woessner, Jochen; Villiger, Arturo; Deichmann, Nicholas; Rothacher, Markus; Giardini, Domenico; Geiger, Alain

    2013-04-01

    Switzerland lies across one of the most complex plate boundaries in the world. With 100 Ma of deformation history and a wide diversity of deformation mechanisms, it is an ideal place to study the link(s) between small strain rates measured at the surface and stress dissipated at depth. The link is of genuine interest for seismic hazard assessment as it provides an independent estimate for moment release within the seismogenic volume. We use geodetic (GPS velocities, shortening axes, strain maps) and seismic (anisotropy, P-axes, focal mechanisms) datasets to assess whether the stress accumulated at depth due to the continental collision reflects the deformation rates measured at the surface and correlates with the seismic activity and with the stress directions deduced from earthquake focal mechanisms throughout the area. While the deformation amplitudes are small (less than 10⁻⁷ yr⁻¹) in some areas of Switzerland, we can relate long- and short-term features of the tectonic processes occurring over the last 10+ Ma. Preliminary results suggest that while deformation rates measured by GPS are large in the Ticino compared to the Valais region, its seismic activity rate is lower. This implies that other processes might play important roles in the generation of seismicity.

  15. A global water resources ensemble of hydrological models: the eartH2Observe Tier-1 dataset

    Science.gov (United States)

    Schellekens, Jaap; Dutra, Emanuel; Martínez-de la Torre, Alberto; Balsamo, Gianpaolo; van Dijk, Albert; Sperna Weiland, Frederiek; Minvielle, Marie; Calvet, Jean-Christophe; Decharme, Bertrand; Eisner, Stephanie; Fink, Gabriel; Flörke, Martina; Peßenteiner, Stefanie; van Beek, Rens; Polcher, Jan; Beck, Hylke; Orth, René; Calton, Ben; Burke, Sophia; Dorigo, Wouter; Weedon, Graham P.

    2017-07-01

    The dataset presented here consists of an ensemble of 10 global hydrological and land surface models for the period 1979-2012 using a reanalysis-based meteorological forcing dataset (0.5° resolution). The current dataset serves as a state of the art in current global hydrological modelling and as a benchmark for further improvements in the coming years. A signal-to-noise ratio analysis revealed low inter-model agreement over (i) snow-dominated regions and (ii) tropical rainforest and monsoon areas. The large uncertainty of precipitation in the tropics is not reflected in the ensemble runoff. Verification of the results against benchmark datasets for evapotranspiration, snow cover, snow water equivalent, soil moisture anomaly and total water storage anomaly using the tools from The International Land Model Benchmarking Project (ILAMB) showed overall useful model performance, while the ensemble mean generally outperformed the single model estimates. The results also show that there is currently no single best model for all variables and that model performance is spatially variable. In our unconstrained model runs the ensemble mean of total runoff into the ocean was 46 268 km³ yr⁻¹ (334 kg m⁻² yr⁻¹), while the ensemble mean of total evaporation was 537 kg m⁻² yr⁻¹. All data are made available openly through a Water Cycle Integrator portal (WCI, wci.earth2observe.eu), and via a direct http and ftp download. The portal follows the protocols of the open geospatial consortium such as OPeNDAP, WCS and WMS. The DOI for the data is https://doi.org/10.1016/10.5281/zenodo.167070.
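
    The ensemble mean and the signal-to-noise ratio mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not define its signal-to-noise ratio, so the definition used here (absolute ensemble mean divided by the inter-model sample standard deviation) is an assumption, and the function name and array layout are hypothetical.

```python
import numpy as np

def ensemble_mean_and_snr(runs):
    """Return the ensemble mean and an assumed signal-to-noise ratio.

    runs: array-like of shape (n_models, ...) holding one field per model.
    The SNR here is |ensemble mean| / inter-model standard deviation,
    which is an illustrative definition, not the one used in the paper.
    """
    runs = np.asarray(runs, dtype=float)
    mean = runs.mean(axis=0)
    spread = runs.std(axis=0, ddof=1)  # sample standard deviation across models
    with np.errstate(divide="ignore", invalid="ignore"):
        # Where all models agree exactly, the spread is zero and the SNR is infinite.
        snr = np.where(spread > 0, np.abs(mean) / spread, np.inf)
    return mean, snr

# Two hypothetical models, two grid cells: the models disagree in the first
# cell and agree in the second.
mean, snr = ensemble_mean_and_snr([[1.0, 10.0], [3.0, 10.0]])
```

    Under this definition, a low ratio marks cells where the models disagree relative to the size of the mean signal, which is the sense in which the abstract reports low inter-model agreement.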

  16. A global water resources ensemble of hydrological models: the eartH2Observe Tier-1 dataset

    Directory of Open Access Journals (Sweden)

    J. Schellekens

    2017-07-01

    The dataset presented here consists of an ensemble of 10 global hydrological and land surface models for the period 1979–2012 using a reanalysis-based meteorological forcing dataset (0.5° resolution). The current dataset serves as a state of the art in current global hydrological modelling and as a benchmark for further improvements in the coming years. A signal-to-noise ratio analysis revealed low inter-model agreement over (i) snow-dominated regions and (ii) tropical rainforest and monsoon areas. The large uncertainty of precipitation in the tropics is not reflected in the ensemble runoff. Verification of the results against benchmark datasets for evapotranspiration, snow cover, snow water equivalent, soil moisture anomaly and total water storage anomaly using the tools from The International Land Model Benchmarking Project (ILAMB) showed overall useful model performance, while the ensemble mean generally outperformed the single model estimates. The results also show that there is currently no single best model for all variables and that model performance is spatially variable. In our unconstrained model runs the ensemble mean of total runoff into the ocean was 46 268 km³ yr⁻¹ (334 kg m⁻² yr⁻¹), while the ensemble mean of total evaporation was 537 kg m⁻² yr⁻¹. All data are made available openly through a Water Cycle Integrator portal (WCI, wci.earth2observe.eu), and via a direct http and ftp download. The portal follows the protocols of the open geospatial consortium such as OPeNDAP, WCS and WMS. The DOI for the data is https://doi.org/10.1016/10.5281/zenodo.167070.

  17. Biotransformation of Tributyltin chloride by Pseudomonas stutzeri strain DN2

    Directory of Open Access Journals (Sweden)

    Dnyanada S. Khanolkar

    2014-12-01

    A bacterial isolate capable of utilizing tributyltin chloride (TBTCl) as sole carbon source was isolated from estuarine sediments of the west coast of India and identified as Pseudomonas stutzeri based on biochemical tests and fatty acid methyl ester (FAME) analysis. This isolate was designated as strain DN2. Although this bacterial isolate could resist up to 3 mM TBTCl, it showed maximum growth at 2 mM TBTCl in mineral salt medium (MSM). Pseudomonas stutzeri DN2 exposed to 2 mM TBTCl revealed significant alteration in cell morphology as elongation and shrinkage in cell size along with roughness of cell surface. FTIR and NMR analysis of the TBTCl degradation product extracted using chloroform and purified using column chromatography clearly revealed biotransformation of TBTCl into dibutyltin dichloride (DBTCl2) through a debutylation process. Therefore, Pseudomonas stutzeri strain DN2 may be used as a potential bacterial strain for bioremediation of TBTCl-contaminated aquatic environmental sites.

  18. Biotransformation of Tributyltin chloride by Pseudomonas stutzeri strain DN2

    Science.gov (United States)

    Khanolkar, Dnyanada S.; Naik, Milind Mohan; Dubey, Santosh Kumar

    2014-01-01

    A bacterial isolate capable of utilizing tributyltin chloride (TBTCl) as sole carbon source was isolated from estuarine sediments of west coast of India and identified as Pseudomonas stutzeri based on biochemical tests and Fatty acid methyl ester (FAME) analysis. This isolate was designated as strain DN2. Although this bacterial isolate could resist up to 3 mM TBTCl level, it showed maximum growth at 2 mM TBTCl in mineral salt medium (MSM). Pseudomonas stutzeri DN2 exposed to 2 mM TBTCl revealed significant alteration in cell morphology as elongation and shrinkage in cell size along with roughness of cell surface. FTIR and NMR analysis of TBTCl degradation product extracted using chloroform and purified using column chromatography clearly revealed biotransformation of TBTCl into Dibutyltin dichloride (DBTCl2) through debutylation process. Therefore, Pseudomonas stutzeri strain DN2 may be used as a potential bacterial strain for bioremediation of TBTCl contaminated aquatic environmental sites. PMID:25763027

  19. Impact of exopolysaccharide production on functional properties of some Lactobacillus salivarius strains.

    Science.gov (United States)

    Mercan, Emin; İspirli, Hümeyra; Sert, Durmuş; Yılmaz, Mustafa Tahsin; Dertli, Enes

    2015-11-01

    The aim of this work was to characterize functional properties of Lactobacillus salivarius strains isolated from chicken feces. Detection of genes responsible for exopolysaccharide (EPS) production revealed that all strains harbored a dextransucrase gene, but the p-gtf gene was only detected in strain E4. Analysis of EPS production levels showed significant differences among the strains tested. Biofilm formation was found to be medium composition dependent, and there was a negative correlation between biofilm formation and EPS production. Autoaggregation properties and coaggregation of L. salivarius strains with chicken pathogens appeared to be strain-specific. An increase in bacterial adhesion to chicken gut explants was observed in L. salivarius strains with reduced EPS production levels. This study showed that strain-specific properties can determine the functional properties of L. salivarius strains, and the interference of these properties might be crucial for final selection of these strains for technological purposes.

  20. Probiotic attributes of autochthonous Lactobacillus rhamnosus strains of human origin.

    Science.gov (United States)

    Pithva, Sheetal; Shekh, Satyamitra; Dave, Jayantilal; Vyas, Bharatkumar Rajiv Manuel

    2014-05-01

    The study was aimed at evaluating the probiotic potential of autochthonous Lactobacillus rhamnosus strains isolated from infant feces and the vaginal mucosa of healthy females. The survival of the selected strains and the two reference strains (L. rhamnosus GG and L. casei Actimel) was 67-81 % at pH 2 and 70-80 % after passage through the simulated gastrointestinal fluid. These strains are able to grow in the presence of 4 % bile salt, 10 % NaCl, and 0.6 % phenol. The cell surface of L. rhamnosus strains is hydrophilic in nature as revealed by the bacterial adhesion to hydrocarbons (BATH) assay. Despite this, L. rhamnosus strains showed mucin adherence, autoaggregation and coaggregation properties that are strain-specific. In addition, they produce bile salt hydrolase (BSH) and β-galactosidase activities. L. rhamnosus strains exhibit antimicrobial activity against food spoilage organisms and gastrointestinal pathogens, as well as Candida and Aspergillus spp. L. rhamnosus strains have similar antibiotic susceptibility patterns, and resistance to certain antibiotics is intrinsic or innate. The strains are neither haemolytic nor producers of biogenic amines such as histamine, putrescine, cadaverine and tyramine. Lyophilized cells of L. rhamnosus Fb exhibited probiotic properties, demonstrating the potential of the strain for technological suitability and in the preparation of diverse probiotic food formulations.

  1. A New Dataset Size Reduction Approach for PCA-Based Classification in OCR Application

    Directory of Open Access Journals (Sweden)

    Mohammad Amin Shayegan

    2014-01-01

    A major problem of pattern recognition systems is the large volume of training datasets, which include duplicate and similar training samples. In order to overcome this problem, some dataset size reduction and dimensionality reduction techniques have been introduced. The algorithms presently used for dataset size reduction usually remove samples near the centers of classes or support vector samples between different classes. However, the samples near a class center include valuable information about the class characteristics, and the support vector is important for evaluating system efficiency. This paper reports on the use of the Modified Frequency Diagram technique for dataset size reduction. In this newly proposed technique, a training dataset is rearranged and then sieved. The sieved training dataset, along with an automatic feature extraction/selection operation using Principal Component Analysis, is used in an OCR application. The experimental results obtained when using the proposed system on one of the biggest handwritten Farsi/Arabic numeral standard OCR datasets, Hoda, show about 97% accuracy in the recognition rate. The recognition speed increased by 2.28 times, while the accuracy decreased by only 0.7%, when a sieved version of the dataset, which is only half the size of the initial training dataset, was used.
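
    The Principal Component Analysis step named in this abstract can be sketched as follows. This is a generic PCA via singular value decomposition, not the authors' pipeline: the Modified Frequency Diagram sieving is not described in enough detail here to reproduce, and the function names and toy data are hypothetical.

```python
import numpy as np

def pca_fit(X, n_components):
    """Fit PCA on the rows of X and return (mean, components).

    components has shape (n_components, n_features); each row is a unit
    vector along a direction of maximal variance.
    """
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]

def pca_transform(X, mean, components):
    """Project samples onto the fitted principal directions."""
    return (np.asarray(X, dtype=float) - mean) @ components.T

# Toy samples lying on the line y = 2x, so one component captures all variance.
X = np.array([[0.0, 0.0], [1.0, 2.0], [2.0, 4.0]])
mean, comps = pca_fit(X, n_components=1)
Z = pca_transform(X, mean, comps)
```

    In a setting like the one described, the reduced features Z, computed from the sieved training set, would be what the classifier receives.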

  2. The CMS dataset bookkeeping service

    Science.gov (United States)

    Afaq, A.; Dolgert, A.; Guo, Y.; Jones, C.; Kosyakov, S.; Kuznetsov, V.; Lueking, L.; Riley, D.; Sekhri, V.

    2008-07-01

    The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS event data from Monte Carlo and Detector sources. It provides the ability to identify MC or trigger source, track data provenance, construct datasets for analysis, and discover interesting data. CMS requires processing and analysis activities at various service levels and the DBS system provides support for localized processing or private analysis, as well as global access for CMS users at large. Catalog entries can be moved among the various service levels with a simple set of migration tools, thus forming a loose federation of databases. DBS is available to CMS users via a Python API, Command Line, and a Discovery web page interfaces. The system is built as a multi-tier web application with Java servlets running under Tomcat, with connections via JDBC to Oracle or MySQL database backends. Clients connect to the service through HTTP or HTTPS with authentication provided by GRID certificates and authorization through VOMS. DBS is an integral part of the overall CMS Data Management and Workflow Management systems.

  3. The CMS dataset bookkeeping service

    Energy Technology Data Exchange (ETDEWEB)

    Afaq, A; Guo, Y; Kosyakov, S; Lueking, L; Sekhri, V [Fermilab, Batavia, Illinois 60510 (United States); Dolgert, A; Jones, C; Kuznetsov, V; Riley, D [Cornell University, Ithaca, New York 14850 (United States)

    2008-07-15

    The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS event data from Monte Carlo and Detector sources. It provides the ability to identify MC or trigger source, track data provenance, construct datasets for analysis, and discover interesting data. CMS requires processing and analysis activities at various service levels and the DBS system provides support for localized processing or private analysis, as well as global access for CMS users at large. Catalog entries can be moved among the various service levels with a simple set of migration tools, thus forming a loose federation of databases. DBS is available to CMS users via a Python API, Command Line, and a Discovery web page interfaces. The system is built as a multi-tier web application with Java servlets running under Tomcat, with connections via JDBC to Oracle or MySQL database backends. Clients connect to the service through HTTP or HTTPS with authentication provided by GRID certificates and authorization through VOMS. DBS is an integral part of the overall CMS Data Management and Workflow Management systems.

  4. The CMS dataset bookkeeping service

    International Nuclear Information System (INIS)

    Afaq, A; Guo, Y; Kosyakov, S; Lueking, L; Sekhri, V; Dolgert, A; Jones, C; Kuznetsov, V; Riley, D

    2008-01-01

    The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS event data from Monte Carlo and Detector sources. It provides the ability to identify MC or trigger source, track data provenance, construct datasets for analysis, and discover interesting data. CMS requires processing and analysis activities at various service levels and the DBS system provides support for localized processing or private analysis, as well as global access for CMS users at large. Catalog entries can be moved among the various service levels with a simple set of migration tools, thus forming a loose federation of databases. DBS is available to CMS users via a Python API, Command Line, and a Discovery web page interfaces. The system is built as a multi-tier web application with Java servlets running under Tomcat, with connections via JDBC to Oracle or MySQL database backends. Clients connect to the service through HTTP or HTTPS with authentication provided by GRID certificates and authorization through VOMS. DBS is an integral part of the overall CMS Data Management and Workflow Management systems

  5. The CMS dataset bookkeeping service

    International Nuclear Information System (INIS)

    Afaq, Anzar; Dolgert, Andrew; Guo, Yuyi; Jones, Chris; Kosyakov, Sergey; Kuznetsov, Valentin; Lueking, Lee; Riley, Dan; Sekhri, Vijay

    2007-01-01

    The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS event data from Monte Carlo and Detector sources. It provides the ability to identify MC or trigger source, track data provenance, construct datasets for analysis, and discover interesting data. CMS requires processing and analysis activities at various service levels and the DBS system provides support for localized processing or private analysis, as well as global access for CMS users at large. Catalog entries can be moved among the various service levels with a simple set of migration tools, thus forming a loose federation of databases. DBS is available to CMS users via a Python API, Command Line, and a Discovery web page interfaces. The system is built as a multi-tier web application with Java servlets running under Tomcat, with connections via JDBC to Oracle or MySQL database backends. Clients connect to the service through HTTP or HTTPS with authentication provided by GRID certificates and authorization through VOMS. DBS is an integral part of the overall CMS Data Management and Workflow Management systems

  6. The interaction of H2O with strained uranium metal surfaces

    International Nuclear Information System (INIS)

    Tiferet, E.; Mintz, M. H.; Zalkind, S.; Jacob, I.; Shamir, N.

    2014-01-01

    The interaction of water vapor with uranium metal surfaces under various degrees of strain (relieved by different degrees of heating) was studied. The main features of dissociation, adsorption and initial oxidation for the studied surfaces will be presented. Common to all strained surfaces, full dissociation occurs on the metal surface, while after oxidation the water dissociation is full on most of them and only partial on one. The oxygen dissociation product adsorbs (with a sticking coefficient that decreases with strain relief), forming clusters, for all strains, while the hydrogen product clusters only on the strain-relieved and recrystallized surface. The most interesting phenomenon revealed for these surfaces is the inhibition of hydrogen adsorption by traces of water vapor, changing from 10% for the most strained (defected) surface down to 1% for the strain-relieved one. The suggested mechanism for this inhibition will be discussed.

  7. Characterization of oriented cracks with differential strain analysis

    International Nuclear Information System (INIS)

    Siegfried, R.; Simmons, G.

    1978-01-01

    Linear strain of a rock sample as a function of hydrostatic pressure can be measured with a precision of 2 × 10⁻⁶. Such high-precision data for three orthogonal directions allow calculation of the distribution function for the porosity due to cracks closing completely at a given pressure. Such data for at least six independent directions yield the zero-pressure strain tensor due to cracks closing completely at a given pressure. The principal values and axes of this tensor distribution function provide information about the orientation of cracks as a function of closure pressure. In this manuscript we first develop the mathematical basis for the technique and then illustrate it with differential strain data for two samples, the Westerly (Rhode Island) granite and the Twin Sisters (Washington) dunite. Strain tensor calculations reveal that each of these samples has a different type of anisotropic crack distribution.
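
    The reason six independent directions suffice can be shown with a short sketch. A symmetric strain tensor E has six independent components, and the linear strain along a unit direction n is n·E·n, so six measurements give six linear equations. The code below is an illustration of that relation only, not the authors' procedure; the function names, the choice of directions and the numerical values are hypothetical.

```python
import numpy as np

def strain_tensor_from_linear_strains(directions, strains):
    """Least-squares fit of a symmetric 3x3 strain tensor E from linear
    strains e_k = n_k . E . n_k measured along directions n_k."""
    n = np.asarray(directions, dtype=float)
    n = n / np.linalg.norm(n, axis=1, keepdims=True)  # normalize to unit vectors
    x, y, z = n[:, 0], n[:, 1], n[:, 2]
    # Unknowns in order: Exx, Eyy, Ezz, Exy, Exz, Eyz
    A = np.column_stack([x * x, y * y, z * z, 2 * x * y, 2 * x * z, 2 * y * z])
    sol, *_ = np.linalg.lstsq(A, np.asarray(strains, dtype=float), rcond=None)
    exx, eyy, ezz, exy, exz, eyz = sol
    return np.array([[exx, exy, exz],
                     [exy, eyy, eyz],
                     [exz, eyz, ezz]])

def principal_strains(E):
    """Principal values (ascending) and axes (columns) of a symmetric tensor."""
    return np.linalg.eigh(E)

# Hypothetical gauge directions: three orthogonal axes plus three face diagonals.
dirs = [(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 0), (1, 0, 1), (0, 1, 1)]
```

    The trace of E is the sum of the linear strains along any three orthogonal directions, which is why three orthogonal gauges are enough for the porosity (volumetric) part while the full orientation information needs all six.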

  8. Residual Strain Characteristics of Nickel-coated FBG Sensors

    International Nuclear Information System (INIS)

    Cho, Won-Jae; Hwang, A-Reum; Kim, Sang-Woo

    2017-01-01

    A metal-coated FBG (fiber Bragg grating) sensor has a memory effect, which can recall the maximum strains experienced by the structure. In this study, a nickel-coated FBG sensor was fabricated through electroless (i.e., chemical plating) and electroplating. A thickness of approximately 43 μm of a nickel layer was achieved. Then, we conducted cyclic loading tests for the fabricated nickel-coated FBG sensors to verify their capability to produce residual strains. The results revealed that the residual strain induced by the nickel coating linearly increased with an increase in the maximum strain experienced by the sensor. Therefore, we verified that a nickel-coated FBG sensor has a memory effect. The fabrication methods and the results of the cycle loading test will provide basic information and guidelines in the design of a nickel-coated FBG sensor when it is applied in the development of structural health monitoring techniques.

  9. Residual Strain Characteristics of Nickel-coated FBG Sensors

    Energy Technology Data Exchange (ETDEWEB)

    Cho, Won-Jae; Hwang, A-Reum; Kim, Sang-Woo [Hankyong National Univ., Ansung (Korea, Republic of)

    2017-07-15

    A metal-coated FBG (fiber Bragg grating) sensor has a memory effect, which can recall the maximum strains experienced by the structure. In this study, a nickel-coated FBG sensor was fabricated through electroless (i.e., chemical plating) and electroplating. A thickness of approximately 43 μm of a nickel layer was achieved. Then, we conducted cyclic loading tests for the fabricated nickel-coated FBG sensors to verify their capability to produce residual strains. The results revealed that the residual strain induced by the nickel coating linearly increased with an increase in the maximum strain experienced by the sensor. Therefore, we verified that a nickel-coated FBG sensor has a memory effect. The fabrication methods and the results of the cycle loading test will provide basic information and guidelines in the design of a nickel-coated FBG sensor when it is applied in the development of structural health monitoring techniques.

  10. A cross-country Exchange Market Pressure (EMP) dataset

    Directory of Open Access Journals (Sweden)

    Mohit Desai

    2017-06-01

    The data presented in this article are related to the research article titled “An exchange market pressure measure for cross country analysis” (Patnaik et al. [1]). In this article, we present the dataset for Exchange Market Pressure values (EMP) for 139 countries along with their conversion factors, ρ (rho). Exchange Market Pressure, expressed in percentage change in exchange rate, measures the change in exchange rate that would have taken place had the central bank not intervened. The conversion factor ρ can be interpreted as the change in exchange rate associated with $1 billion of intervention. Estimates of conversion factor ρ allow us to calculate a monthly time series of EMP for 139 countries. Additionally, the dataset contains the 68% confidence interval (high and low values) for the point estimates of ρ’s. Using the standard errors of estimates of ρ’s, we obtain one sigma intervals around mean estimates of EMP values. These values are also reported in the dataset.

  11. A cross-country Exchange Market Pressure (EMP) dataset.

    Science.gov (United States)

    Desai, Mohit; Patnaik, Ila; Felman, Joshua; Shah, Ajay

    2017-06-01

    The data presented in this article are related to the research article titled "An exchange market pressure measure for cross country analysis" (Patnaik et al. [1]). In this article, we present the dataset for Exchange Market Pressure values (EMP) for 139 countries along with their conversion factors, ρ (rho). Exchange Market Pressure, expressed in percentage change in exchange rate, measures the change in exchange rate that would have taken place had the central bank not intervened. The conversion factor ρ can be interpreted as the change in exchange rate associated with $1 billion of intervention. Estimates of conversion factor ρ allow us to calculate a monthly time series of EMP for 139 countries. Additionally, the dataset contains the 68% confidence interval (high and low values) for the point estimates of ρ's. Using the standard errors of estimates of ρ's, we obtain one sigma intervals around mean estimates of EMP values. These values are also reported in the dataset.
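
    The description above suggests a simple way to read the two quantities together: the counterfactual change in the exchange rate is the observed change plus ρ times the intervention. The sketch below assumes exactly that additive form, with intervention measured in billions of US dollars and signed so that a positive value adds to pressure. The sign convention, the function names and the way the one-sigma band is formed from the standard error of ρ are assumptions for illustration, not taken from the dataset documentation.

```python
def exchange_market_pressure(pct_change_exchange_rate, intervention_usd_bn, rho):
    """Assumed form: EMP = observed % change + rho * intervention ($ billion)."""
    return pct_change_exchange_rate + rho * intervention_usd_bn

def emp_one_sigma_band(pct_change_exchange_rate, intervention_usd_bn, rho, rho_se):
    """One-sigma EMP band obtained by shifting rho by its standard error."""
    a = exchange_market_pressure(pct_change_exchange_rate, intervention_usd_bn, rho - rho_se)
    b = exchange_market_pressure(pct_change_exchange_rate, intervention_usd_bn, rho + rho_se)
    # Order the bounds explicitly because the intervention may be negative.
    return min(a, b), max(a, b)

# Hypothetical month: 1% observed change, $2 billion intervention,
# rho = 0.5 percentage points per $1 billion with standard error 0.1.
emp = exchange_market_pressure(1.0, 2.0, 0.5)
low, high = emp_one_sigma_band(1.0, 2.0, 0.5, 0.1)
```

    With no intervention the measure reduces to the observed exchange rate change, which matches the abstract's definition of EMP as the change that would have occurred had the central bank not intervened.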

  12. Serological characterization of Actinobacillus pleuropneumoniae biotype 1 strains antigenically related to both serotypes 2 and 7

    DEFF Research Database (Denmark)

    Nielsen, R.; Andresen, Lars Ole; Plambeck, Tamara

    1996-01-01

    Nine Danish Actinobacillus pleuropneumoniae biotype 1 isolates were shown by latex agglutination and indirect haemagglutination to possess capsular polysaccharide epitopes identical to those of serotype 2 strain 1536 (reference strain of serotype 2) and strain 4226 (Danish serotype 2 strain). Imm...... in the LPS of strains 1536 and 7317 were revealed. Since an antigenic determinant specific for the 9 isolates could not be demonstrated with the methods used, the strains are proposed to be designated K2:O7....

  13. Determination of strain fields in porous shape memory alloys using micro-computed tomography

    Science.gov (United States)

    Bormann, Therese; Friess, Sebastian; de Wild, Michael; Schumacher, Ralf; Schulz, Georg; Müller, Bert

    2010-09-01

    Shape memory alloys (SMAs) belong to 'intelligent' materials since the metal alloy can change its macroscopic shape as the result of the temperature-induced, reversible martensite-austenite phase transition. SMAs are often applied for medical applications such as stents, hinge-less instruments, artificial muscles, and dental braces. Rapid prototyping techniques, including selective laser melting (SLM), allow fabricating complex porous SMA microstructures. In the present study, the macroscopic shape changes of the SMA test structures fabricated by SLM have been investigated by means of micro computed tomography (μCT). For this purpose, the SMA structures are placed into the heating stage of the μCT system SkyScan 1172™ (SkyScan, Kontich, Belgium) to acquire three-dimensional datasets above and below the transition temperature, i.e. at room temperature and at about 80°C, respectively. The two datasets were registered on the basis of an affine registration algorithm with nine independent parameters - three for the translation, three for the rotation and three for the scaling in orthogonal directions. Essentially, the scaling parameters characterize the macroscopic deformation of the SMA structure of interest. Furthermore, applying the non-rigid registration algorithm, the three-dimensional strain field of the SMA structure on the micrometer scale comes to light. The strain fields obtained will serve for the optimization of the SLM-process and, more important, of the design of the complex shaped SMA structures for tissue engineering and medical implants.
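
    The nine-parameter affine registration described above can be made concrete with a small sketch that assembles the transform from three translations, three rotations and three scalings, and reads the macroscopic strain off the scaling factors. The composition order (scale, then rotate about x, y, z, then translate), the function names and the sample values are assumptions for illustration; the registration software used in the study may parameterize the transform differently.

```python
import numpy as np

def affine_from_nine_params(translation, rotation_rad, scaling):
    """Build a 4x4 homogeneous affine matrix from 3 translations,
    3 rotation angles (radians, about x, y, z) and 3 axis scalings."""
    rx, ry, rz = rotation_rad
    Rx = np.array([[1, 0, 0],
                   [0, np.cos(rx), -np.sin(rx)],
                   [0, np.sin(rx), np.cos(rx)]])
    Ry = np.array([[np.cos(ry), 0, np.sin(ry)],
                   [0, 1, 0],
                   [-np.sin(ry), 0, np.cos(ry)]])
    Rz = np.array([[np.cos(rz), -np.sin(rz), 0],
                   [np.sin(rz), np.cos(rz), 0],
                   [0, 0, 1]])
    M = np.eye(4)
    # Assumed order: scale first, then rotate, then translate.
    M[:3, :3] = Rz @ Ry @ Rx @ np.diag(np.asarray(scaling, dtype=float))
    M[:3, 3] = translation
    return M

def macroscopic_strain(scaling):
    """Engineering strain along each axis implied by the scaling factors."""
    return np.asarray(scaling, dtype=float) - 1.0

# Hypothetical registration result between the two temperature states.
M = affine_from_nine_params((1.0, 2.0, 3.0), (0.1, -0.2, 0.3), (1.01, 1.0, 0.99))
strain = macroscopic_strain((1.01, 1.0, 0.99))
```

    Because the rotation part is orthogonal, the column norms of the upper-left 3x3 block recover the three scaling factors under this composition order, so the macroscopic deformation can be separated from rigid motion.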

  14. Privacy-Preserving Matching of Spatial Datasets with Protection against Background Knowledge

    DEFF Research Database (Denmark)

    Ghinita, Gabriel; Vicente, Carmen Ruiz; Shang, Ning

    2010-01-01

    should be disclosed. Previous research efforts focused on private matching for relational data, and rely either on space-embedding or on SMC techniques. Space-embedding transforms data points to hide their exact attribute values before matching is performed, whereas SMC protocols simulate complex digital...... circuits that evaluate the matching condition without revealing anything else other than the matching outcome. However, existing solutions have at least one of the following drawbacks: (i) they fail to protect against adversaries with background knowledge on data distribution, (ii) they compromise privacy...... by returning large amounts of false positives and (iii) they rely on complex and expensive SMC protocols. In this paper, we introduce a novel geometric transformation to perform private matching on spatial datasets. Our method is efficient and it is not vulnerable to background knowledge attacks. We consider...

  15. The NASA Subsonic Jet Particle Image Velocimetry (PIV) Dataset

    Science.gov (United States)

    Bridges, James; Wernet, Mark P.

    2011-01-01

    Many tasks in fluids engineering require prediction of turbulence of jet flows. The present document records the single-point statistics of velocity, mean and variance, of cold and hot jet flows. The jet velocities ranged from 0.5 to 1.4 times the ambient speed of sound, and temperatures ranged from unheated to static temperature ratio 2.7. Further, the report assesses the accuracies of the data, e.g., establishes uncertainties for the data. This paper covers the following five tasks: (1) Document acquisition and processing procedures used to create the particle image velocimetry (PIV) datasets. (2) Compare PIV data with hotwire and laser Doppler velocimetry (LDV) data published in the open literature. (3) Compare different datasets acquired at the same flow conditions in multiple tests to establish uncertainties. (4) Create a consensus dataset for a range of hot jet flows, including uncertainty bands. (5) Analyze this consensus dataset for self-consistency and compare jet characteristics to those of the open literature. The final objective was fulfilled by using the potential core length and the spread rate of the half-velocity radius to collapse the mean and turbulent velocity fields over the first 20 jet diameters.

  16. Proteins involved in difference of sorbitol fermentation rates of the toxigenic and nontoxigenic Vibrio cholerae El Tor strains revealed by comparative proteome analysis

    Science.gov (United States)

    2009-01-01

    Background: The nontoxigenic V. cholerae El Tor strains ferment sorbitol faster than the toxigenic strains, hence fast-fermenting and slow-fermenting strains are defined by the sorbitol fermentation test. This test has been used for more than 40 years in cholera surveillance and strain analysis in China. Understanding the mechanisms of sorbitol metabolism in the toxigenic and nontoxigenic strains may help to explore the genome and metabolism divergence in these strains. Here we used comparative proteomic analysis to find the proteins which may be involved in this metabolic difference. Results: We found that formate and lactic acid were produced earlier in the sorbitol fermentation medium of the nontoxigenic strain than in that of the toxigenic strain. We compared the protein expression profiles of the toxigenic strain N16961 and the nontoxigenic strain JS32 cultured in sorbitol fermentation medium, using fructose fermentation medium as the control. Seventy-three differential protein spots were found and further identified by MALDI-MS. Differences in the product of the fructose-specific IIA/FPR component gene and in mannitol-1-P dehydrogenase may be involved in the difference in sorbitol transport and dehydrogenation between the sorbitol fast- and slow-fermenting strains. The difference in the relative transcription levels of pyruvate formate-lyase to pyruvate dehydrogenase between the toxigenic and nontoxigenic strains may also be responsible for the difference in the timing and capacity of formate production between these strains. Conclusion: Multiple factors involved in different metabolism steps may affect sorbitol fermentation in the toxigenic and nontoxigenic strains of V. cholerae El Tor. PMID:19589152

  17. Strain-resolved microbial community proteomics reveals simultaneous aerobic and anaerobic function during gastrointestinal tract colonization of a preterm infant

    Directory of Open Access Journals (Sweden)

    Brandon Brooks

    2015-07-01

    While there has been growing interest in the gut microbiome in recent years, it remains unclear whether closely related species and strains have similar or distinct functional roles and if organisms capable of both aerobic and anaerobic growth do so simultaneously. To investigate these questions, we implemented a high-throughput mass spectrometry-based proteomics approach to identify proteins in fecal samples collected on days of life 13-21 from an infant born at 28 weeks gestation. No prior studies have coupled strain-resolved community metagenomics to proteomics for such a purpose. Sequences were manually curated to resolve the genomes of two strains of Citrobacter that were present during the later stage of colonization. Proteome extracts from fecal samples were processed via a nano-2D-LC-MS/MS and peptides were identified based on information predicted from the genome sequences for the dominant organisms, Serratia and the two Citrobacter strains. These organisms are facultative anaerobes, and proteomic information indicates the utilization of both aerobic and anaerobic metabolisms throughout the time series. This may indicate growth in distinct niches within the gastrointestinal tract. We uncovered differences in the physiology of coexisting Citrobacter strains, including differences in motility and chemotaxis functions. Additionally, for both Citrobacter strains we resolved a community-essential role in vitamin metabolism and a predominant role in propionate production. Finally, in this case study we detected differences between genome abundance and activity levels for the dominant populations. This underlines the value in layering proteomic information over genetic potential.

  18. Knowledge Mining from Clinical Datasets Using Rough Sets and Backpropagation Neural Network

    Directory of Open Access Journals (Sweden)

    Kindie Biredagn Nahato

    2015-01-01

    The availability of clinical datasets and knowledge mining methodologies encourages researchers to pursue research in extracting knowledge from clinical datasets. Different data mining techniques have been used for mining rules, and mathematical models have been developed to assist the clinician in decision making. The objective of this research is to build a classifier that will predict the presence or absence of a disease by learning from the minimal set of attributes that has been extracted from the clinical dataset. In this work, the rough set indiscernibility relation method with a backpropagation neural network (RS-BPNN) is used. This work has two stages. The first stage is handling of missing values to obtain a smooth data set and selection of appropriate attributes from the clinical dataset by the indiscernibility relation method. The second stage is classification using a backpropagation neural network on the selected reducts of the dataset. The classifier has been tested with hepatitis, Wisconsin breast cancer, and Statlog heart disease datasets obtained from the University of California at Irvine (UCI) machine learning repository. The accuracy obtained from the proposed method is 97.3%, 98.6%, and 90.4% for hepatitis, breast cancer, and heart disease, respectively. The proposed system provides an effective classification model for clinical datasets.
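
To illustrate the indiscernibility relation named in the abstract above: two records are indiscernible with respect to a set of attributes when they agree on every one of them, so the relation partitions the table into equivalence classes. The following is a minimal sketch of that standard rough-set idea only. The function name `indiscernibility_classes` and the toy `patients` table are hypothetical and are not taken from the study, whose attribute-selection procedure is not described in this record.

```python
from collections import defaultdict


def indiscernibility_classes(records, attributes):
    """Group record indices that share the same values on all given attributes."""
    classes = defaultdict(list)
    for index, record in enumerate(records):
        # Records with an identical key cannot be told apart by these attributes.
        key = tuple(record[name] for name in attributes)
        classes[key].append(index)
    return sorted(classes.values())


# Hypothetical toy table (illustrative values, not data from the study).
patients = [
    {"fever": "yes", "fatigue": "yes", "disease": 1},
    {"fever": "yes", "fatigue": "yes", "disease": 1},
    {"fever": "no", "fatigue": "yes", "disease": 0},
    {"fever": "no", "fatigue": "no", "disease": 0},
]

by_fever = indiscernibility_classes(patients, ["fever"])
by_both = indiscernibility_classes(patients, ["fever", "fatigue"])
print(by_fever)
print(by_both)
```

In this toy table the single attribute `fever` already separates the two disease outcomes, which is the kind of redundancy an attribute-reduction step looks for.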

  19. Electrocardiogram of Clinically Healthy Mithun (Bos frontalis): Variation among Strains

    Science.gov (United States)

    Sanyal, Sagar; Das, Pradip Kumar; Ghosh, Probal Ranjan; Das, Kinsuk; Vupru, Kezha V.; Rajkhowa, Chandan; Mondal, Mohan

    2010-01-01

    A study was conducted to establish the normal electrocardiogram in four different genetic strains of mithun (Bos frontalis). Electrocardiography, cardiac electrical axis, heart rate, rectal temperature and respiration rate were recorded in a total of 32 adult male mithun of four strains (n = 8 each). It was found that the respiration and heart rates were higher (P electrocardiogram of mithun revealed that the amplitude and duration of P wave, QRS complex and T wave were different among four different genetic strains of mithun and the electrical axis of QRS complex for Nagamese and Mizoram mithuns are dissimilar to bovine species. PMID:20886013

  20. Inhomogeneous strain induced by fast neutron irradiation in NaKSO4 crystals

    International Nuclear Information System (INIS)

    Kandil, S.H.; Kassem, M.E.; El-Khatib, A.; El-Gamal, M.A.; El-Wahidy, E.F.

    1987-01-01

    The paper reports the effect of fast neutron irradiation on the thermal properties of NaKSO4 crystals in the temperature range 400-475 K. Results are presented for the thermal expansion, tensile strain and specific heat of NaKSO4, as a function of neutron irradiation dose. All these results revealed an inhomogeneous strain induced by the radiation. It is suggested that this induced inhomogeneous strain could be used to detect neutron exposure doses. (UK)

  1. Role of scaffold network in controlling strain and functionalities of nanocomposite films.

    Science.gov (United States)

    Chen, Aiping; Hu, Jia-Mian; Lu, Ping; Yang, Tiannan; Zhang, Wenrui; Li, Leigang; Ahmed, Towfiq; Enriquez, Erik; Weigand, Marcus; Su, Qing; Wang, Haiyan; Zhu, Jian-Xin; MacManus-Driscoll, Judith L; Chen, Long-Qing; Yarotski, Dmitry; Jia, Quanxi

    2016-06-01

    Strain is a novel approach to manipulating functionalities in correlated complex oxides. However, significant epitaxial strain can only be achieved in ultrathin layers. We show that, under direct lattice matching framework, large and uniform vertical strain up to 2% can be achieved to significantly modify the magnetic anisotropy, magnetism, and magnetotransport properties in heteroepitaxial nanoscaffold films, over a few hundred nanometers in thickness. Comprehensive designing principles of large vertical strain have been proposed. Phase-field simulations not only reveal the strain distribution but also suggest that the ultimate strain is related to the vertical interfacial area and interfacial dislocation density. By changing the nanoscaffold density and dimension, the strain and the magnetic properties can be tuned. The established correlation among the vertical interface-strain-properties in nanoscaffold films can consequently be used to tune other functionalities in a broad range of complex oxide films far beyond critical thickness.

  2. Spatially-explicit estimation of geographical representation in large-scale species distribution datasets.

    Science.gov (United States)

    Kalwij, Jesse M; Robertson, Mark P; Ronk, Argo; Zobel, Martin; Pärtel, Meelis

    2014-01-01

    Much ecological research relies on existing multispecies distribution datasets. Such datasets, however, can vary considerably in quality, extent, resolution or taxonomic coverage. We provide a framework for a spatially-explicit evaluation of geographical representation within large-scale species distribution datasets, using the comparison of an occurrence atlas with a range atlas dataset as a working example. Specifically, we compared occurrence maps for 3773 taxa from the widely-used Atlas Florae Europaeae (AFE) with digitised range maps for 2049 taxa of the lesser-known Atlas of North European Vascular Plants. We calculated the level of agreement at a 50-km spatial resolution using average latitudinal and longitudinal species range, and area of occupancy. Agreement in species distribution was calculated and mapped using Jaccard similarity index and a reduced major axis (RMA) regression analysis of species richness between the entire atlases (5221 taxa in total) and between co-occurring species (601 taxa). We found no difference in distribution ranges or in the area of occupancy frequency distribution, indicating that atlases were sufficiently overlapping for a valid comparison. The similarity index map showed high levels of agreement for central, western, and northern Europe. The RMA regression confirmed that geographical representation of AFE was low in areas with a sparse data recording history (e.g., Russia, Belarus and the Ukraine). For co-occurring species in south-eastern Europe, however, the Atlas of North European Vascular Plants showed remarkably higher richness estimations. Geographical representation of atlas data can be much more heterogeneous than often assumed. Level of agreement between datasets can be used to evaluate geographical representation within datasets. Merging atlases into a single dataset is worthwhile in spite of methodological differences, and helps to fill gaps in our knowledge of species distribution ranges. Species distribution
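
The Jaccard similarity index mentioned in the abstract above is the size of the intersection of two sets divided by the size of their union. The sketch below applies it to the species lists of two atlases for a single grid cell. That per-cell setup, the species names, and the function name `jaccard_index` are assumptions made for illustration; the record does not state exactly how the study organised the comparison.

```python
def jaccard_index(species_a, species_b):
    """Return |A ∩ B| / |A ∪ B| for two collections of species names."""
    a, b = set(species_a), set(species_b)
    union = a | b
    if not union:
        # Convention assumed here: two empty lists count as identical.
        return 1.0
    return len(a & b) / len(union)


# Hypothetical species lists for one 50-km grid cell in each atlas.
cell_atlas_1 = {"Betula nana", "Salix herbacea", "Silene acaulis"}
cell_atlas_2 = {"Betula nana", "Salix herbacea", "Dryas octopetala"}

similarity = jaccard_index(cell_atlas_1, cell_atlas_2)
print(similarity)
```

Two shared species out of four distinct species give an index of 0.5; mapping this value per cell yields a similarity map of the kind the abstract describes.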

  3. Health-promoting properties exhibited by Lactobacillus helveticus strains.

    Science.gov (United States)

    Skrzypczak, Katarzyna; Gustaw, Waldemar; Waśko, Adam

    2015-01-01

    Many strains belonging to lactobacilli exert a variety of beneficial health effects in humans and some of the bacteria are regarded as probiotic microorganisms. Adherence and capabilities of colonization by Lactobacillus strains of the intestinal tract is a prerequisite for probiotic strains to exhibit desired functional properties. The analysis conducted here aimed at screening strains of Lactobacillus helveticus possessing a health-promoting potential. The molecular analysis performed revealed the presence of a slpA gene encoding the surface S-layer protein SlpA (contributing to the immunostimulatory activity of L. helveticus M 92 probiotic strain) in all B734, DSM, T80, and T105 strains. The product of gene amplification was also identified in a Bifidobacterium animalis ssp. lactis BB12 probiotic strain. SDS-PAGE of a surface protein extract demonstrated the presence of a protein with a mass of about 50 kDa in all strains, which corresponds to the mass of the S-layer proteins. These results are confirmed by observations carried out with transmission electron microscopy, where a clearly visible S-layer was registered in all the strains analyzed. The in vitro study results obtained indicate that the strongest adhesion capacity to epithelial cells (HT-29) was demonstrated by L. helveticus B734, while coaggregation with pathogens was highly diverse among the tested strains. The percentage degree of coaggregation increased with the incubation time. After 5 h of incubation, the strongest ability to coaggregate with Escherichia coli was expressed by T104. The T80 strain demonstrated a significant ability to co-aggregate with Staphylococcus aureus, while DSM with Bacillus subtilis. For B734, the highest values of the co-aggregation coefficient were noted in samples with Salmonella. The capability of autoaggregation, antibiotic susceptibility, resistance to increasing salt concentrations, and strain survival in simulated small intestinal juice were also analyzed.

  4. The Global Precipitation Climatology Project (GPCP) Combined Precipitation Dataset

    Science.gov (United States)

    Huffman, George J.; Adler, Robert F.; Arkin, Philip; Chang, Alfred; Ferraro, Ralph; Gruber, Arnold; Janowiak, John; McNab, Alan; Rudolf, Bruno; Schneider, Udo

    1997-01-01

    The Global Precipitation Climatology Project (GPCP) has released the GPCP Version 1 Combined Precipitation Data Set, a global, monthly precipitation dataset covering the period July 1987 through December 1995. The primary product in the dataset is a merged analysis incorporating precipitation estimates from low-orbit-satellite microwave data, geosynchronous-orbit-satellite infrared data, and rain gauge observations. The dataset also contains the individual input fields, a combination of the microwave and infrared satellite estimates, and error estimates for each field. The data are provided on 2.5 deg x 2.5 deg latitude-longitude global grids. Preliminary analyses show general agreement with prior studies of global precipitation and extend prior studies of El Nino-Southern Oscillation precipitation patterns. At the regional scale there are systematic differences with standard climatologies.
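
As a quick check on the dimensions implied by the abstract above, the grid spacing and the period covered can be turned into array sizes. This is only arithmetic on the figures quoted in the record (2.5 deg cells, July 1987 through December 1995). The variable names are illustrative, and the sketch assumes a full global grid with one value per cell per month for a single field.

```python
GRID_DEG = 2.5  # cell size in degrees, as quoted in the abstract

# Number of cells along each axis of a full global latitude-longitude grid.
n_lon = int(360 / GRID_DEG)
n_lat = int(180 / GRID_DEG)
cells_per_field = n_lon * n_lat

# Months from July 1987 through December 1995, inclusive.
n_months = (1995 - 1987) * 12 + (12 - 7) + 1

values_per_variable = cells_per_field * n_months
print(n_lon, n_lat, cells_per_field, n_months, values_per_variable)
```

Under these assumptions each monthly field is a 144 x 72 grid, and a single variable spans 102 monthly fields.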

  5. A new dataset and algorithm evaluation for mood estimation in music

    OpenAIRE

    Godec, Primož

    2014-01-01

    This thesis presents a new dataset of perceived and induced emotions for 200 audio clips. The gathered dataset provides users' perceived and induced emotions for each clip, the association of color, along with demographic and personal data, such as user's emotion state and emotion ratings, genre preference, music experience, among others. With an online survey we collected more than 7000 responses for a dataset of 200 audio excerpts, thus providing about 37 user responses per clip. The foc...

  6. A Large-Scale 3D Object Recognition dataset

    DEFF Research Database (Denmark)

    Sølund, Thomas; Glent Buch, Anders; Krüger, Norbert

    2016-01-01

    geometric groups; concave, convex, cylindrical and flat 3D object models. The object models have varying amount of local geometric features to challenge existing local shape feature descriptors in terms of descriptiveness and robustness. The dataset is validated in a benchmark which evaluates the matching...... performance of 7 different state-of-the-art local shape descriptors. Further, we validate the dataset in a 3D object recognition pipeline. Our benchmark shows as expected that local shape feature descriptors without any global point relation across the surface have a poor matching performance with flat...

  7. The Wind Integration National Dataset (WIND) toolkit (Presentation)

    Energy Technology Data Exchange (ETDEWEB)

    Caroline Draxl: NREL

    2014-01-01

    Regional wind integration studies require detailed wind power output data at many locations to perform simulations of how the power system will operate under high penetration scenarios. The wind datasets that serve as inputs into the study must realistically reflect the ramping characteristics, spatial and temporal correlations, and capacity factors of the simulated wind plants, as well as being time synchronized with available load profiles. As described in this presentation, the WIND Toolkit fulfills these requirements by providing a state-of-the-art national (US) wind resource, power production and forecast dataset.

  8. An integrated pan-tropical biomass map using multiple reference datasets

    NARCIS (Netherlands)

    Avitabile, V.; Herold, M.; Heuvelink, G.B.M.; Lewis, S.L.; Phillips, O.L.; Asner, G.P.; Armston, J.; Asthon, P.; Banin, L.F.; Bayol, N.; Berry, N.; Boeckx, P.; Jong, De B.; Devries, B.; Girardin, C.; Kearsley, E.; Lindsell, J.A.; Lopez-gonzalez, G.; Lucas, R.; Malhi, Y.; Morel, A.; Mitchard, E.; Nagy, L.; Qie, L.; Quinones, M.; Ryan, C.M.; Slik, F.; Sunderland, T.; Vaglio Laurin, G.; Valentini, R.; Verbeeck, H.; Wijaya, A.; Willcock, S.

    2016-01-01

    We combined two existing datasets of vegetation aboveground biomass (AGB) (Proceedings of the National Academy of Sciences of the United States of America, 108, 2011, 9899; Nature Climate Change, 2, 2012, 182) into a pan-tropical AGB map at 1-km resolution using an independent reference dataset of

  9. Global Existence Results for Viscoplasticity at Finite Strain

    Science.gov (United States)

    Mielke, Alexander; Rossi, Riccarda; Savaré, Giuseppe

    2018-01-01

    We study a model for rate-dependent gradient plasticity at finite strain based on the multiplicative decomposition of the strain tensor, and investigate the existence of global-in-time solutions to the related PDE system. We reveal its underlying structure as a generalized gradient system, where the driving energy functional is highly nonconvex and features the geometric nonlinearities related to finite-strain elasticity as well as the multiplicative decomposition of finite-strain plasticity. Moreover, the dissipation potential depends on the left-invariant plastic rate, and thus depends on the plastic state variable. The existence theory is developed for a class of abstract, nonsmooth, and nonconvex gradient systems, for which we introduce suitable notions of solutions, namely energy-dissipation-balance and energy-dissipation-inequality solutions. Hence, we resort to the toolbox of the direct method of the calculus of variations to check that the specific energy and dissipation functionals for our viscoplastic models comply with the conditions of the general theory.

  10. Strain-induced fermi contour anisotropy of GaAs 2D holes.

    Science.gov (United States)

    Shabani, J; Shayegan, M; Winkler, R

    2008-03-07

    We report measurements of magnetoresistance commensurability peaks, induced by a square array of antidots, in GaAs (311)A two-dimensional holes as a function of applied in-plane strain. The data directly probe the shapes of the Fermi contours of the two spin subbands that are split thanks to the spin-orbit interaction and strain. The experimental results are in quantitative agreement with the predictions of accurate energy band calculations, and reveal that the majority spin subband has a severely distorted Fermi contour whose anisotropy can be tuned with strain.

  11. Comparison of global 3-D aviation emissions datasets

    Directory of Open Access Journals (Sweden)

    S. C. Olsen

    2013-01-01

    Aviation emissions are unique from other transportation emissions, e.g., from road transportation and shipping, in that they occur at higher altitudes as well as at the surface. Aviation emissions of carbon dioxide, soot, and water vapor have direct radiative impacts on the Earth's climate system while emissions of nitrogen oxides (NOx), sulfur oxides, carbon monoxide (CO), and hydrocarbons (HC) impact air quality and climate through their effects on ozone, methane, and clouds. The most accurate estimates of the impact of aviation on air quality and climate utilize three-dimensional chemistry-climate models and gridded four-dimensional (space and time) aviation emissions datasets. We compare five available aviation emissions datasets currently and historically used to evaluate the impact of aviation on climate and air quality: NASA-Boeing 1992, NASA-Boeing 1999, QUANTIFY 2000, Aero2k 2002, and AEDT 2006 and aviation fuel usage estimates from the International Energy Agency. Roughly 90% of all aviation emissions are in the Northern Hemisphere and nearly 60% of all fuelburn and NOx emissions occur at cruise altitudes in the Northern Hemisphere. While these datasets were created by independent methods and are thus not strictly suitable for analyzing trends they suggest that commercial aviation fuelburn and NOx emissions increased over the last two decades while HC emissions likely decreased and CO emissions did not change significantly. The bottom-up estimates compared here are consistently lower than International Energy Agency fuelburn statistics although the gap is significantly smaller in the more recent datasets. Overall the emissions distributions are quite similar for fuelburn and NOx with regional peaks over the populated land masses of North America, Europe, and East Asia. For CO and HC there are relatively larger differences. There are however some distinct differences in the altitude distribution

  12. Benchmarking two commonly used Saccharomyces cerevisiae strains for heterologous vanillin-β-glucoside production

    DEFF Research Database (Denmark)

    Strucko, Tomas; Magdenoska, Olivera; Mortensen, Uffe Hasbro

    2015-01-01

    factories for production of specific compounds. To examine this possibility, we have reconstructed a de novo vanillin-β-glucoside pathway in an identical manner in S288c and CEN.PK strains. Characterization of the two resulting strains in two standard conditions revealed that the S288c background strain...... produced up to 10-fold higher amounts of vanillin-β-glucoside compared to CEN.PK. This study demonstrates that yeast strain background may play a major role in the outcome of newly developed cell factories for production of a given product....

  13. Phylogenetic analysis of canine distemper virus in South America clade 1 reveals unique molecular signatures of the local epidemic.

    Science.gov (United States)

    Fischer, Cristine D B; Gräf, Tiago; Ikuta, Nilo; Lehmann, Fernanda K M; Passos, Daniel T; Makiejczuk, Aline; Silveira, Marcos A T; Fonseca, André S K; Canal, Cláudio W; Lunge, Vagner R

    2016-07-01

    Canine distemper virus (CDV) is a highly contagious pathogen for domestic dogs and several wild carnivore species. In Brazil, natural infection of CDV in dogs is very high due to the large non-vaccinated dog population, a scenario that calls for new studies on the molecular epidemiology. This study investigates the phylodynamics and amino-acid signatures of CDV epidemic in South America by analyzing a large dataset compiled from publicly available sequences and also by collecting new samples from Brazil. A population of 175 dogs with canine distemper (CD) signs was sampled, from which 89 were positive for CDV, generating 42 new CDV sequences. Phylogenetic analysis of the new and publicly available sequences revealed that Brazilian sequences mainly clustered in South America 1 (SA1) clade, which has its origin estimated to the late 1980's. The reconstruction of the demographic history in SA1 clade showed an epidemic expanding until the recent years, doubling in size every nine years. SA1 clade epidemic distinguished from the world CDV epidemic by the emergence of the R580Q strain, a very rare and potentially detrimental substitution in the viral genome. The R580Q substitution was estimated to have happened in one single evolutionary step in the epidemic history in SA1 clade, emerging shortly after introduction to the continent. Moreover, a high prevalence (11.9%) of the Y549H mutation was observed among the domestic dogs sampled here. This finding was associated (p<0.05) with outcome-death and higher frequency in mixed-breed dogs, the latter being an indicator of a continuous exchange of CDV strains circulating among wild carnivores and domestic dogs. The results reported here highlight the diversity of the worldwide CDV epidemic and reveal local features that can be valuable for combating the disease. Copyright © 2016 Elsevier B.V. All rights reserved.
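
The doubling time quoted in the abstract above can be converted into a growth rate. The short derivation below assumes simple exponential growth, which is an illustrative simplification; the demographic model actually used in the study is not described in this record. Here $N(t)$ is the epidemic size at time $t$, $N_0$ its initial size, $T_d = 9$ years the doubling time, and $r$ the implied growth rate.

```latex
\begin{align}
N(t) &= N_0 \, 2^{t/T_d} = N_0 \, e^{r t}, \\
r &= \frac{\ln 2}{T_d} = \frac{0.693}{9\ \text{yr}} \approx 0.077\ \text{yr}^{-1}.
\end{align}
```

Under this assumption, doubling every nine years corresponds to growth of roughly 8% per year.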

  14. The emergence of the Activity Reduces Conflict Associated Strain (ARCAS) model: a test of a conditional mediation model of workplace conflict and employee strain.

    Science.gov (United States)

    Dijkstra, Maria T M; Beersma, Bianca; Cornelissen, Roosmarijn A W M

    2012-07-01

    To test and extend the emerging Activity Reduces Conflict-Associated Strain (ARCAS) model, we predicted that the relationship between task conflict and employee strain would be weakened to the extent that people experience high organization-based self-esteem (OBSE). A survey among Dutch employees demonstrated that, consistent with the model, the conflict-employee strain relationship was weaker the higher employees' OBSE and the more they engaged in active problem-solving conflict management. Our data also revealed that higher levels of OBSE were related to more problem-solving conflict management. Moreover, consistent with the ARCAS model, we could confirm a conditional mediation model in which organization-based self-esteem through its relationship with problem-solving conflict management weakened the relationship between task conflict and employee strain. Potential applications of the results are discussed.

  15. Proteogenomic Investigation of Strain Variation in Clinical Mycobacterium tuberculosis Isolates

    KAUST Repository

    Heunis, Tiaan

    2017-08-18

    Mycobacterium tuberculosis consists of a large number of different strains that display unique virulence characteristics. Whole-genome sequencing has revealed substantial genetic diversity among clinical M. tuberculosis isolates, and elucidating the phenotypic variation encoded by this genetic diversity will be of utmost importance to fully understand M. tuberculosis biology and pathogenicity. In this study we integrated whole-genome sequencing and mass spectrometry (GeLC-MS/MS) to reveal strain-specific characteristics in the proteomes of two clinical M. tuberculosis Latin American-Mediterranean isolates. Using this approach we identified 59 peptides containing single amino acid variants, which covered ~9% of all coding nonsynonymous single nucleotide variants detected by whole-genome sequencing. Furthermore, we identified 29 distinct peptides that mapped to a hypothetical protein not present in the M. tuberculosis H37Rv reference proteome. Here we provide evidence for the expression of this protein in the clinical M. tuberculosis SAWC3651 isolate. The strain-specific databases enabled confirmation of genomic differences (i.e. large genomic regions of difference and nonsynonymous single nucleotide variants) in these two clinical M. tuberculosis isolates and allowed strain differentiation at the proteome level. Our results contribute to the growing field of clinical microbial proteogenomics and can improve our understanding of phenotypic variation in clinical M. tuberculosis isolates.

  16. Proteogenomic Investigation of Strain Variation in Clinical Mycobacterium tuberculosis Isolates

    KAUST Repository

    Heunis, Tiaan; Dippenaar, Anzaan; Warren, Robin M.; van Helden, Paul D.; van der Merwe, Ruben G.; Gey van Pittius, Nicolaas C.; Pain, Arnab; Sampson, Samantha L.; Tabb, David L.

    2017-01-01

    Mycobacterium tuberculosis consists of a large number of different strains that display unique virulence characteristics. Whole-genome sequencing has revealed substantial genetic diversity among clinical M. tuberculosis isolates, and elucidating the phenotypic variation encoded by this genetic diversity will be of utmost importance to fully understand M. tuberculosis biology and pathogenicity. In this study we integrated whole-genome sequencing and mass spectrometry (GeLC-MS/MS) to reveal strain-specific characteristics in the proteomes of two clinical M. tuberculosis Latin American-Mediterranean isolates. Using this approach we identified 59 peptides containing single amino acid variants, which covered ~9% of all coding nonsynonymous single nucleotide variants detected by whole-genome sequencing. Furthermore, we identified 29 distinct peptides that mapped to a hypothetical protein not present in the M. tuberculosis H37Rv reference proteome. Here we provide evidence for the expression of this protein in the clinical M. tuberculosis SAWC3651 isolate. The strain-specific databases enabled confirmation of genomic differences (i.e. large genomic regions of difference and nonsynonymous single nucleotide variants) in these two clinical M. tuberculosis isolates and allowed strain differentiation at the proteome level. Our results contribute to the growing field of clinical microbial proteogenomics and can improve our understanding of phenotypic variation in clinical M. tuberculosis isolates.

  17. Proteogenomic Investigation of Strain Variation in Clinical Mycobacterium tuberculosis Isolates.

    Science.gov (United States)

    Heunis, Tiaan; Dippenaar, Anzaan; Warren, Robin M; van Helden, Paul D; van der Merwe, Ruben G; Gey van Pittius, Nicolaas C; Pain, Arnab; Sampson, Samantha L; Tabb, David L

    2017-10-06

    Mycobacterium tuberculosis consists of a large number of different strains that display unique virulence characteristics. Whole-genome sequencing has revealed substantial genetic diversity among clinical M. tuberculosis isolates, and elucidating the phenotypic variation encoded by this genetic diversity will be of the utmost importance to fully understand M. tuberculosis biology and pathogenicity. In this study, we integrated whole-genome sequencing and mass spectrometry (GeLC-MS/MS) to reveal strain-specific characteristics in the proteomes of two clinical M. tuberculosis Latin American-Mediterranean isolates. Using this approach, we identified 59 peptides containing single amino acid variants, which covered ∼9% of all coding nonsynonymous single nucleotide variants detected by whole-genome sequencing. Furthermore, we identified 29 distinct peptides that mapped to a hypothetical protein not present in the M. tuberculosis H37Rv reference proteome. Here, we provide evidence for the expression of this protein in the clinical M. tuberculosis SAWC3651 isolate. The strain-specific databases enabled confirmation of genomic differences (i.e., large genomic regions of difference and nonsynonymous single nucleotide variants) in these two clinical M. tuberculosis isolates and allowed strain differentiation at the proteome level. Our results contribute to the growing field of clinical microbial proteogenomics and can improve our understanding of phenotypic variation in clinical M. tuberculosis isolates.

  18. Full genome sequence of a Danish isolate of Mycobacterium avium subspecies paratuberculosis, strain Ejlskov2007

    DEFF Research Database (Denmark)

    Afzal, Mamuna; Abidi, Soad; Mikkelsen, Heidi

    We have sequenced a Danish isolate of Mycobacterium avium subspecies paratuberculosis, strain Ejlskov2007. The strain was isolated from faecal material of a 48 month old second parity Danish Holstein cow, with clinical symptoms of chronic diarrhoea and emaciation. The cultures were grown on Löwen......, consisting of 4317 unique gene families. Comparison with M. avium paratuberculosis strain K10 revealed only 3436 genes in common (~70%). We have used GenomeAtlases to show conserved (and unique) regions along the Ejlskov2007 chromosome, compared to 2 other Mycobacterium avium sequenced genomes. Pan......-genome analyses of the sequenced Mycobacterium genomes reveal a surprisingly open and diverse set of genes for this bacterial genera....

  19. Global Human Built-up And Settlement Extent (HBASE) Dataset From Landsat

    Data.gov (United States)

    National Aeronautics and Space Administration — The Global Human Built-up And Settlement Extent (HBASE) Dataset from Landsat is a global map of HBASE derived from the Global Land Survey (GLS) Landsat dataset for...

  20. Passive Containment DataSet

    Science.gov (United States)

    This data is for Figures 6 and 7 in the journal article. The data also includes the two EPANET input files used for the analysis described in the paper, one for the looped system and one for the block system.This dataset is associated with the following publication:Grayman, W., R. Murray , and D. Savic. Redesign of Water Distribution Systems for Passive Containment of Contamination. JOURNAL OF THE AMERICAN WATER WORKS ASSOCIATION. American Water Works Association, Denver, CO, USA, 108(7): 381-391, (2016).

  1. Integrated genomics of Mucorales reveals novel therapeutic targets

    Science.gov (United States)

    Mucormycosis is a life-threatening infection caused by Mucorales fungi. We sequenced 30 fungal genomes and performed transcriptomics with three representative Rhizopus and Mucor strains with human airway epithelial cells during fungal invasion to reveal key host and fungal determinants contributing ...

  2. The Lunar Source Disk: Old Lunar Datasets on a New CD-ROM

    Science.gov (United States)

    Hiesinger, H.

    1998-01-01

    A compilation of previously published datasets on CD-ROM is presented. This Lunar Source Disk is intended to be a first step in the improvement/expansion of the Lunar Consortium Disk, in order to create an "image-cube"-like data pool that can be easily accessed and might be useful for a variety of future lunar investigations. All datasets were transformed to a standard map projection that allows direct comparison of different types of information on a pixel-by-pixel basis. Lunar observations have a long history and have been important to mankind for centuries, notably since the work of Plutarch and Galileo. As a consequence of centuries of lunar investigations, knowledge of the characteristics and properties of the Moon has accumulated over time. However, a side effect of this accumulation is that it has become more and more complicated for scientists to review all the datasets obtained through different techniques, to interpret them properly, to recognize their weaknesses and strengths in detail, and to combine them synoptically in geologic interpretations. Such synoptic geologic interpretations are crucial for the study of planetary bodies through remote-sensing data in order to avoid misinterpretation. In addition, many of the modern datasets, derived from Earth-based telescopes as well as from spacecraft missions, are acquired at different geometric and radiometric conditions. These differences make it challenging to compare or combine datasets directly or to extract information from different datasets on a pixel-by-pixel basis. Also, as there is no convention for the presentation of lunar datasets, different authors choose different map projections, depending on the location of the investigated areas and their personal interests. Insufficient or incomplete information on the map parameters used by different authors further complicates the reprojection of these datasets to a standard geometry. The goal of our efforts was to transfer previously published lunar

  3. Gridded 5km GHCN-Daily Temperature and Precipitation Dataset, Version 1

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Gridded 5km GHCN-Daily Temperature and Precipitation Dataset (nClimGrid) consists of four climate variables derived from the GHCN-D dataset: maximum temperature,...

  4. ENHANCED DATA DISCOVERABILITY FOR IN SITU HYPERSPECTRAL DATASETS

    Directory of Open Access Journals (Sweden)

    B. Rasaiah

    2016-06-01

    Full Text Available Field spectroscopic metadata is a central component in the quality assurance, reliability, and discoverability of hyperspectral data and the products derived from it. Cataloguing, mining, and interoperability of these datasets rely upon the robustness of metadata protocols for field spectroscopy, and on the software architecture to support the exchange of these datasets. Currently no standard for in situ spectroscopy data or metadata protocols exists. This inhibits the effective sharing of growing volumes of in situ spectroscopy datasets, to exploit the benefits of integrating with the evolving range of data sharing platforms. A core metadataset for field spectroscopy was introduced by Rasaiah et al. (2011-2015) with extended support for specific applications. This paper presents a prototype model for an OGC and ISO compliant platform-independent metadata discovery service aligned to the specific requirements of field spectroscopy. In this study, a proof-of-concept metadata catalogue has been described and deployed in a cloud-based architecture as a demonstration of an operationalized field spectroscopy metadata standard and web-based discovery service.

  5. Cardiac biplane strain imaging: initial in vivo experience

    Energy Technology Data Exchange (ETDEWEB)

    Lopata, R G P; Nillesen, M M; Thijssen, J M; De Korte, C L [Clinical Physics Laboratory, Radboud University Nijmegen Medical Centre, Nijmegen (Netherlands); Verrijp, C N; Lammens, M M Y; Van der Laak, J A W M [Department of Pathology, Radboud University Nijmegen Medical Centre, Nijmegen (Netherlands); Singh, S K; Van Wetten, H B [Department of Cardiothoracic Surgery, Radboud University Nijmegen Medical Centre, Nijmegen (Netherlands); Kapusta, L [Pediatric Cardiology, Department of Pediatrics, Radboud University Nijmegen Medical Centre, Nijmegen (Netherlands)], E-mail: R.Lopata@cukz.umcn.nl

    2010-02-21

    In this study, first we propose a biplane strain imaging method using a commercial ultrasound system, yielding estimation of the strain in three orthogonal directions. Secondly, an animal model of a child's heart was introduced that is suitable to simulate congenital heart disease and was used to test the method in vivo. The proposed approach can serve as a framework to monitor the development of cardiac hypertrophy and fibrosis. A 2D strain estimation technique using radio frequency (RF) ultrasound data was applied. Biplane image acquisition was performed at a relatively low frame rate (<100 Hz) using a commercial platform with an RF interface. For testing the method in vivo, biplane image sequences of the heart were recorded during the cardiac cycle in four dogs with an aortic stenosis. Initial results reveal the feasibility of measuring large radial, circumferential and longitudinal cumulative strain (up to 70%) at a frame rate of 100 Hz. Mean radial strain curves of a manually segmented region-of-interest in the infero-lateral wall show excellent correlation between the measured strain curves acquired in two perpendicular planes. Furthermore, the results show the feasibility and reproducibility of assessing radial, circumferential and longitudinal strains simultaneously. In this preliminary study, three beagles developed an elevated pressure gradient over the aortic valve (Δp: 100-200 mmHg) and myocardial hypertrophy. One dog did not develop any sign of hypertrophy (Δp = 20 mmHg). Initial strain (rate) results showed that the maximum strain (rate) decreased with increasing valvular stenosis (-50%), which is in accordance with previous studies. Histological findings corroborated these results and showed an increase in fibrotic tissue for the hearts with larger pressure gradients (100, 200 mmHg), as well as lower strain and strain rate values.

  6. Cardiac biplane strain imaging: initial in vivo experience

    International Nuclear Information System (INIS)

    Lopata, R G P; Nillesen, M M; Thijssen, J M; De Korte, C L; Verrijp, C N; Lammens, M M Y; Van der Laak, J A W M; Singh, S K; Van Wetten, H B; Kapusta, L

    2010-01-01

    In this study, first we propose a biplane strain imaging method using a commercial ultrasound system, yielding estimation of the strain in three orthogonal directions. Secondly, an animal model of a child's heart was introduced that is suitable to simulate congenital heart disease and was used to test the method in vivo. The proposed approach can serve as a framework to monitor the development of cardiac hypertrophy and fibrosis. A 2D strain estimation technique using radio frequency (RF) ultrasound data was applied. Biplane image acquisition was performed at a relatively low frame rate (<100 Hz) using a commercial platform with an RF interface. For testing the method in vivo, biplane image sequences of the heart were recorded during the cardiac cycle in four dogs with an aortic stenosis. Initial results reveal the feasibility of measuring large radial, circumferential and longitudinal cumulative strain (up to 70%) at a frame rate of 100 Hz. Mean radial strain curves of a manually segmented region-of-interest in the infero-lateral wall show excellent correlation between the measured strain curves acquired in two perpendicular planes. Furthermore, the results show the feasibility and reproducibility of assessing radial, circumferential and longitudinal strains simultaneously. In this preliminary study, three beagles developed an elevated pressure gradient over the aortic valve (Δp: 100-200 mmHg) and myocardial hypertrophy. One dog did not develop any sign of hypertrophy (Δp = 20 mmHg). Initial strain (rate) results showed that the maximum strain (rate) decreased with increasing valvular stenosis (-50%), which is in accordance with previous studies. Histological findings corroborated these results and showed an increase in fibrotic tissue for the hearts with larger pressure gradients (100, 200 mmHg), as well as lower strain and strain rate values.

  7. Distributional records of Antarctic fungi based on strains preserved in the Culture Collection of Fungi from Extreme Environments (CCFEE) Mycological Section associated with the Italian National Antarctic Museum (MNA)

    Directory of Open Access Journals (Sweden)

    Laura Selbmann

    2015-07-01

    Full Text Available This dataset includes information regarding fungal strains collected during several Antarctic expeditions: the Italian National Antarctic Research program (PNRA) expeditions “X” (1994/1995), “XII” (1996/1997), “XVII” (2001/2002), “XIX” (2003/2004), “XXVI” (2010/2011), the Czech “IPY Expedition” (2007–2009) and a number of strains donated by E. Imre Friedmann (Florida State University) in 2001, isolated from samples collected during the U.S.A. Antarctic Expeditions of 1980-1982. Samples, consisting of colonized rocks, mosses, lichens, sediments and soils, were collected in Southern and Northern Victoria Land of the continental Antarctica and in the Antarctic Peninsula. A total of 259 different strains were isolated, belonging to 32 genera and 38 species, out of which 12 represented new taxa. These strains are preserved in the Antarctic section of the Culture Collection of Fungi from Extreme Environments (CCFEE), which represents one of the collections associated with the Italian National Antarctic Museum (MNA), Section of Genoa, Italy, located at the Laboratory of Systematic Botany and Mycology, Department of Ecological and Biological Sciences (DEB), Tuscia University (Viterbo, Italy). The CCFEE hosts a total of 486 Antarctic fungal strains from worldwide extreme environments. Distributional records are reported here for 259 of these strains. The holotypes of the 12 new species included in this dataset are maintained at CCFEE and in other international collections: CBS-KNAW Fungal Biodiversity Centre (Utrecht, Netherlands); DBVPG, Industrial Yeasts Collection (University of Perugia, Italy); DSMZ, German Collection of Microorganisms and Cell Cultures (Brunswick, Germany); IMI, International Mycological Institute (London, U.K.).

  8. Synergism between hydrogen peroxide and seventeen acids against six bacterial strains.

    Science.gov (United States)

    Martin, H; Maris, P

    2012-09-01

    The objective of this study was to evaluate the bactericidal efficacy of hydrogen peroxide administered in combination with 17 mineral and organic acids authorized for use in the food industry. The assays were performed on a 96-well microplate using a microdilution technique based on the checkerboard titration method. The six selected strains were reference strains and strains representative of contaminating bacteria in the food industry. Each synergistic hydrogen peroxide/acid combination found after 5-min contact time at 20°C in distilled water was then tested in conditions simulating four different use conditions. Thirty-two combinations were synergistic in distilled water; twenty-five of these remained synergistic with one or more of the four mineral and organic interfering substances selected. Hydrogen peroxide/formic acid combination was synergistic for all six bacterial strains in distilled water and remained synergistic with interfering substances. Six other combinations maintained their synergistic effect in the presence of an organic load but only for one or two bacterial strains. Synergistic combinations of disinfectants were revealed, among them the promising hydrogen peroxide/formic acid combination. A rapid screening method was proposed and used to reveal the synergistic potential of disinfectant and/or sanitizer combinations. © 2012 ANSES Fougères Laboratory Journal of Applied Microbiology © 2012 The Society for Applied Microbiology.

  9. Environmental Dataset Gateway (EDG) CS-W Interface

    Data.gov (United States)

    U.S. Environmental Protection Agency — Use the Environmental Dataset Gateway (EDG) to find and access EPA's environmental resources. Many options are available for easily reusing EDG content in other...

  10. Three-dimensional ultrasound strain imaging of skeletal muscles

    NARCIS (Netherlands)

    Gijsbertse, Kaj; Sprengers, Andre M.; Nillesen, Maartje; Hansen, Hendrik H.G.; Verdonschot, Nico; De Korte, Chris L.

    2015-01-01

    Muscle contraction is characterized by large deformation and translation, which requires a multi-dimensional imaging modality to reveal its behavior. Previous work on ultrasound strain imaging of the muscle contraction was limited to 2D and bi-plane techniques. In this study, a three-dimensional

  11. Prior Inoculation with Type B Strains of Francisella tularensis Provides Partial Protection against Virulent Type A Strains in Cottontail Rabbits.

    Directory of Open Access Journals (Sweden)

    Vienna R Brown

    Full Text Available Francisella tularensis is a highly virulent bacterium that is capable of causing severe disease (tularemia) in a wide range of species. This organism is characterized into two distinct subspecies: tularensis (type A) and holarctica (type B), which vary in several crucial ways, with some type A strains having been found to be considerably more virulent in humans and laboratory animals. Cottontail rabbits have been widely implicated as a reservoir species for this subspecies; however, experimental inoculation in our laboratory revealed type A organisms to be highly virulent, resulting in 100% mortality following challenge with 50-100 organisms. Inoculation of cottontail rabbits with the same number of organisms from type B strains of bacteria was found to be rarely lethal and to result in a robust humoral immune response. The objective of this study was to characterize the protection afforded by a prior challenge with type B strains against a later inoculation with a type A strain in North American cottontail rabbits (Sylvilagus spp.). Previous infection with a type B strain of organism was found to lengthen survival time and in some cases prevent death following inoculation with a type A2 strain of F. tularensis. In contrast, inoculation of a type A1b strain was uniformly lethal in cottontail rabbits irrespective of a prior type B inoculation. These findings provide important insight about the role cottontail rabbits may play in environmental maintenance and transmission of this organism.

  12. Relationship between strain stored by compressive deformation and crystallographic orientation in a pure aluminum

    International Nuclear Information System (INIS)

    Takayama, Y; Watanabe, H; Yoshimura, T

    2015-01-01

    In order to investigate the relationship between stored strain and crystallographic orientation, 99.99% purity aluminum cubes were compressed under uniaxial or plane-strain conditions up to a nominal strain of 30%. The aluminum cubes were examined on the same surface before and after compression by the SEM/EBSD technique. Stored strain was estimated from the Kernel Average Misorientation (KAM) derived from the EBSD analysis, and the Taylor factor (TF) was measured before the compressive deformation. The analysis revealed that the KAM value, i.e. the stored strain, decreases with increasing TF up to a certain TF value and then increases with further increase in TF. (paper)

  13. Annotating spatio-temporal datasets for meaningful analysis in the Web

    Science.gov (United States)

    Stasch, Christoph; Pebesma, Edzer; Scheider, Simon

    2014-05-01

    More and more environmental datasets that vary in space and time are available in the Web. This comes along with an advantage of using the data for other purposes than originally foreseen, but also with the danger that users may apply inappropriate analysis procedures because they lack knowledge of important assumptions made during the data collection process. In order to guide towards a meaningful (statistical) analysis of spatio-temporal datasets available in the Web, we have developed a Higher-Order-Logic formalism that captures some relevant assumptions in our previous work [1]. It allows the meaningfulness of spatial prediction and aggregation to be proven in a semi-automated fashion. In this poster presentation, we will present a concept for annotating spatio-temporal datasets available in the Web with concepts defined in our formalism. Therefore, we have defined a subset of the formalism as a Web Ontology Language (OWL) pattern. It allows capturing the distinction between the different spatio-temporal variable types, i.e. point patterns, fields, lattices and trajectories, that in turn determine whether a particular dataset can be interpolated or aggregated in a meaningful way using a certain procedure. The actual annotations that link spatio-temporal datasets with the concepts in the ontology pattern are provided as Linked Data. In order to allow data producers to add the annotations to their datasets, we have implemented a Web portal that uses a triple store at the backend to store the annotations and to make them available in the Linked Data cloud. Furthermore, we have implemented functions in the statistical environment R to retrieve the RDF annotations and, based on these annotations, to support a stronger typing of spatio-temporal datatypes guiding towards a meaningful analysis in R. [1] Stasch, C., Scheider, S., Pebesma, E., Kuhn, W. (2014): "Meaningful spatial prediction and aggregation", Environmental Modelling & Software, 51, 149-165.

  14. Evolving hard problems: Generating human genetics datasets with a complex etiology

    Directory of Open Access Journals (Sweden)

    Himmelstein Daniel S

    2011-07-01

    Full Text Available Abstract Background A goal of human genetics is to discover genetic factors that influence individuals' susceptibility to common diseases. Most common diseases are thought to result from the joint failure of two or more interacting components instead of single component failures. This greatly complicates both the task of selecting informative genetic variants and the task of modeling interactions between them. We and others have previously developed algorithms to detect and model the relationships between these genetic factors and disease. Previously these methods have been evaluated with datasets simulated according to pre-defined genetic models. Results Here we develop and evaluate a model free evolution strategy to generate datasets which display a complex relationship between individual genotype and disease susceptibility. We show that this model free approach is capable of generating a diverse array of datasets with distinct gene-disease relationships for an arbitrary interaction order and sample size. We specifically generate eight hundred Pareto fronts, one for each independent run of our algorithm. In each run the predictiveness of single genetic variants and pairs of genetic variants has been minimized, while the predictiveness of third-, fourth-, or fifth-order combinations is maximized. Two hundred runs of the algorithm are further dedicated to creating datasets with predictive fourth- or fifth-order interactions and minimized lower-level effects. Conclusions This method and the resulting datasets will allow the capabilities of novel methods to be tested without pre-specified genetic models. This allows researchers to evaluate which methods will succeed on human genetics problems where the model is not known in advance. We further make freely available to the community the entire Pareto-optimal front of datasets from each run so that novel methods may be rigorously evaluated. These 76,600 datasets are available from http://discovery.dartmouth.edu/model_free_data/.

  15. A Dataset from TIMSS to Examine the Relationship between Computer Use and Mathematics Achievement

    Science.gov (United States)

    Kadijevich, Djordje M.

    2015-01-01

    Because the relationship between computer use and achievement is still puzzling, there is a need to prepare and analyze good quality datasets on computer use and achievement. Such a dataset can be derived from TIMSS data. This paper describes how this dataset can be prepared. It also gives an example of how the dataset may be analyzed. The…

  16. Genetic Structuration, Demography and Evolutionary History of Mycobacterium tuberculosis LAM9 Sublineage in the Americas as Two Distinct Subpopulations Revealed by Bayesian Analyses

    Science.gov (United States)

    Reynaud, Yann; Millet, Julie; Rastogi, Nalin

    2015-01-01

    Tuberculosis (TB) remains broadly present in the Americas despite intense global efforts for its control and elimination. Starting from a large dataset comprising spoligotyping (n = 21183 isolates) and 12-loci MIRU-VNTRs data (n = 4022 isolates) from a total of 31 countries of the Americas (data extracted from the SITVIT2 database), this study aimed to get an overview of lineages circulating in the Americas. A total of 17119 (80.8%) strains belonged to the Euro-American lineage 4, among which the most predominant genotypic family belonged to the Latin American and Mediterranean (LAM) lineage (n = 6386, 30.1% of strains). By combining classical phylogenetic analyses and Bayesian approaches, this study revealed for the first time a clear genetic structuration of LAM9 sublineage into two subpopulations named LAM9C1 and LAM9C2, with distinct genetic characteristics. LAM9C1 was predominant in Chile, Colombia and USA, while LAM9C2 was predominant in Brazil, Dominican Republic, Guadeloupe and French Guiana. Globally, LAM9C2 was characterized by higher allelic richness as compared to LAM9C1 isolates. Moreover, LAM9C2 sublineage appeared to expand close to twenty times more than LAM9C1 and showed older traces of expansion. Interestingly, a significant proportion of LAM9C2 isolates presented typical signature of ancestral LAM-RDRio MIRU-VNTR type (224226153321). Further studies based on Whole Genome Sequencing of LAM strains will provide the needed resolution to decipher the biogeographical structure and evolutionary history of this successful family. PMID:26517715
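
    The percentages quoted in this abstract can be cross-checked from the stated counts. The sketch below is only an arithmetic check; it assumes that both percentages are taken relative to the 21183 spoligotyped isolates, which the abstract implies but does not state explicitly. The function name `share` is illustrative.

```python
# Counts quoted in the abstract (data extracted from the SITVIT2 database).
# Assumption: both shares use the 21183 spoligotyped isolates as denominator.
TOTAL_SPOLIGOTYPED = 21183  # isolates with spoligotyping data
LINEAGE4 = 17119            # Euro-American lineage 4 strains
LAM = 6386                  # Latin American and Mediterranean (LAM) strains


def share(count, total):
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


lineage4_pct = share(LINEAGE4, TOTAL_SPOLIGOTYPED)
lam_pct = share(LAM, TOTAL_SPOLIGOTYPED)
print(f"Lineage 4: {lineage4_pct}%  LAM: {lam_pct}%")
```

    Under that assumption the computed shares agree with the 80.8% and 30.1% reported in the abstract.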

  17. Novel nonsense mutation in the katA gene of a catalase-negative Staphylococcus aureus strain.

    Science.gov (United States)

    Lagos, Jaime; Alarcón, Pedro; Benadof, Dona; Ulloa, Soledad; Fasce, Rodrigo; Tognarelli, Javier; Aguayo, Carolina; Araya, Pamela; Parra, Bárbara; Olivares, Berta; Hormazábal, Juan Carlos; Fernández, Jorge

    2016-01-01

    We report the first description of a rare catalase-negative strain of Staphylococcus aureus in Chile. This new variant was isolated from blood and synovial tissue samples of a pediatric patient. Sequencing analysis revealed that this catalase-negative strain is related to ST10 strain, which has earlier been described in relation to S. aureus carriers. Interestingly, sequence analysis of the catalase gene katA revealed presence of a novel nonsense mutation that causes premature translational truncation of the C-terminus of the enzyme leading to a loss of 222 amino acids. Our study suggests that loss of catalase activity in this rare catalase-negative Chilean strain is due to this novel nonsense mutation in the katA gene, which truncates the enzyme to just 283 amino acids. Copyright © 2015 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.

  18. Occurrence of Killer Yeast Strains in Fruit and Berry Wine Yeast Populations

    Directory of Open Access Journals (Sweden)

    Gintare Gulbiniene

    2004-01-01

    Full Text Available Apple, cranberry, chokeberry and Lithuanian red grape wine yeast populations were used for the determination of killer yeast occurrence. According to the tests of the killer characteristics and immunity the isolated strains were divided into seven groups. In this work the activity of killer toxins purified from some typical strains was evaluated. The analysed strains produced different amounts of active killer toxin and some of them possessed new industrially significant killer properties. Total dsRNA extractions in 11 killer strains of yeast isolated from spontaneous fermentations revealed that the molecular basis of the killer phenomenon was not only dsRNAs, but also unidentified genetic determinants.

  19. Characterization of Local Strain around Through-Silicon Via Interconnects by Using X-ray Microdiffraction

    Science.gov (United States)

    Nakatsuka, Osamu; Kitada, Hideki; Kim, Youngsuk; Mizushima, Yoriko; Nakamura, Tomoji; Ohba, Takayuki; Zaima, Shigeaki

    2011-05-01

    We have demonstrated the characterization of the local strain structure in thinned Si layers for wafer-on-a-wafer (WOW) applications by using X-ray microdiffraction with a synchrotron radiation source. The microdiffraction reveals the fluctuation of strains in the thin Si layer around through-silicon via (TSV) interconnects on a sub-micrometer scale. We separately estimated the in-plane and out-of-plane strain structures in the Si layer, and found that anisotropic strain is induced in the Si layer between the TSV interconnects.

  20. A new dataset validation system for the Planetary Science Archive

    Science.gov (United States)

    Manaud, N.; Zender, J.; Heather, D.; Martinez, S.

    2007-08-01

    The Planetary Science Archive is the official archive for the Mars Express mission. It received its first data by the end of 2004. These data are delivered by the PI teams to the PSA team as datasets, which are formatted to conform to the Planetary Data System (PDS). The PI teams are responsible for analyzing and calibrating the instrument data as well as for the production of reduced and calibrated data. They are also responsible for the scientific validation of these data. ESA is responsible for the long-term data archiving and distribution to the scientific community and must ensure, in this regard, that all archived products meet quality requirements. To do so, an archive peer-review is used to control the quality of the Mars Express science data archiving process. However, a full validation of its content is missing. An independent review board recently recommended that the completeness of the archive as well as the consistency of the delivered data should be validated following well-defined procedures. A new validation software tool is being developed to complete the overall data quality control system functionality. This new tool aims to improve the quality of data and services provided to the scientific community through the PSA, and shall make it possible to track anomalies in datasets and to control their completeness. It shall ensure that the PSA end-users: (1) can rely on the result of their queries, (2) will get data products that are suitable for scientific analysis, (3) can find all science data acquired during a mission. We defined dataset validation as the verification and assessment process to check the dataset content against pre-defined top-level criteria, which represent the general characteristics of good quality datasets. The dataset content that is checked includes the data and all types of information that are essential in the process of deriving scientific results and those interfacing with the PSA database. The validation software tool is a multi-mission tool that

  1. Linezolid-Dependent Function and Structure Adaptation of Ribosomes in a Staphylococcus epidermidis Strain Exhibiting Linezolid Dependence

    OpenAIRE

    Kokkori, Sofia; Apostolidi, Maria; Tsakris, Athanassios; Pournaras, Spyros; Stathopoulos, Constantinos; Dinos, George

    2014-01-01

    Linezolid-dependent growth was recently reported in Staphylococcus epidermidis clinical strains carrying mutations associated with linezolid resistance. To investigate this unexpected behavior at the molecular level, we isolated active ribosomes from one of the linezolid-dependent strains and we compared them with ribosomes isolated from a wild-type strain. Both strains were grown in the absence and presence of linezolid. Detailed biochemical and structural analyses revealed essential differe...

  2. Data assimilation and model evaluation experiment datasets

    Science.gov (United States)

    Lai, Chung-Cheng A.; Qian, Wen; Glenn, Scott M.

    1994-01-01

    The Institute for Naval Oceanography, in cooperation with Naval Research Laboratories and universities, executed the Data Assimilation and Model Evaluation Experiment (DAMEE) for the Gulf Stream region during fiscal years 1991-1993. Enormous effort has gone into the preparation of several high-quality and consistent datasets for model initialization and verification. This paper describes the preparation process, the temporal and spatial scopes, the contents, the structure, etc., of these datasets. The goal of DAMEE and the need of data for the four phases of experiment are briefly stated. The preparation of DAMEE datasets consisted of a series of processes: (1) collection of observational data; (2) analysis and interpretation; (3) interpolation using the Optimum Thermal Interpolation System package; (4) quality control and re-analysis; and (5) data archiving and software documentation. The data products from these processes included a time series of 3D fields of temperature and salinity, 2D fields of surface dynamic height and mixed-layer depth, analysis of the Gulf Stream and rings system, and bathythermograph profiles. To date, these are the most detailed and high-quality data for mesoscale ocean modeling, data assimilation, and forecasting research. Feedback from ocean modeling groups who tested this data was incorporated into its refinement. Suggestions for DAMEE data usages include (1) ocean modeling and data assimilation studies, (2) diagnosis and theoretical studies, and (3) comparisons with locally detailed observations.

  3. Artificial intelligence (AI) systems for interpreting complex medical datasets.

    Science.gov (United States)

    Altman, R B

    2017-05-01

    Advances in machine intelligence have created powerful capabilities in algorithms that find hidden patterns in data, classify objects based on their measured characteristics, and associate similar patients/diseases/drugs based on common features. However, artificial intelligence (AI) applications in medical data have several technical challenges: complex and heterogeneous datasets, noisy medical datasets, and explaining their output to users. There are also social challenges related to intellectual property, data provenance, regulatory issues, economics, and liability. © 2017 ASCPT.

  4. Full-Scale Approximations of Spatio-Temporal Covariance Models for Large Datasets

    KAUST Repository

    Zhang, Bohai; Sang, Huiyan; Huang, Jianhua Z.

    2014-01-01

    of dataset and application of such models is not feasible for large datasets. This article extends the full-scale approximation (FSA) approach by Sang and Huang (2012) to the spatio-temporal context to reduce computational complexity. A reversible jump Markov

  5. PERFORMANCE COMPARISON FOR INTRUSION DETECTION SYSTEM USING NEURAL NETWORK WITH KDD DATASET

    Directory of Open Access Journals (Sweden)

    S. Devaraju

    2014-04-01

    Full Text Available Intrusion detection, i.e. classifying a user as a normal user or an attacker, is a challenging task in any organizational information system or IT industry. The Intrusion Detection System is an effective method to deal with this kind of problem in networks. Different classifiers are used to detect the different kinds of attacks in networks. In this paper, the performance of intrusion detection is compared across various neural network classifiers. In the proposed research the four types of classifiers used are Feed Forward Neural Network (FFNN), Generalized Regression Neural Network (GRNN), Probabilistic Neural Network (PNN) and Radial Basis Neural Network (RBNN). The performance on the full-featured KDD Cup 1999 dataset is compared with that on the reduced-feature KDD Cup 1999 dataset. The MATLAB software is used to train and test on the dataset, and the efficiency and False Alarm Rate are measured. It is shown that the reduced dataset performs better than the full-featured dataset.
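
    The abstract does not define how the False Alarm Rate or the "efficiency" is computed. As a minimal sketch, assuming the common definitions (false alarm rate as the fraction of normal records flagged as attacks, and accuracy as a stand-in for efficiency), the two measures could be computed from binary labels as follows. The function names and the toy labels are hypothetical and are not taken from the paper or from the KDD Cup 1999 data.

```python
def confusion_counts(y_true, y_pred):
    """Count (tp, fp, tn, fn) for binary labels: 1 = attack, 0 = normal."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1
        elif truth == 0 and pred == 1:
            fp += 1
        elif truth == 0 and pred == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def false_alarm_rate(y_true, y_pred):
    """Fraction of normal records wrongly flagged as attacks: FP / (FP + TN)."""
    _, fp, tn, _ = confusion_counts(y_true, y_pred)
    return fp / (fp + tn) if (fp + tn) else 0.0


def accuracy(y_true, y_pred):
    """Fraction of records classified correctly (assumed proxy for efficiency)."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    total = tp + fp + tn + fn
    return (tp + tn) / total if total else 0.0


# Toy example with made-up labels (not KDD Cup 1999 records).
y_true = [0, 0, 0, 0, 1, 1, 1, 1]
y_pred = [0, 0, 0, 1, 1, 1, 1, 0]
print(false_alarm_rate(y_true, y_pred), accuracy(y_true, y_pred))
```

    Reporting both numbers matters because a classifier can reach high accuracy on an attack-heavy dataset while still raising many false alarms on normal traffic.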

  6. Review of ATLAS Open Data 8 TeV datasets, tools and activities

    CERN Document Server

    The ATLAS collaboration

    2018-01-01

    The ATLAS Collaboration has released two 8 TeV datasets and relevant simulated samples to the public for educational use. A number of groups within ATLAS have used these ATLAS Open Data 8 TeV datasets, developing tools and educational material to promote particle physics. The general aim of these activities is to provide simple and user-friendly interactive interfaces to simulate the procedures used by high-energy physics researchers. International Masterclasses introduce particle physics to high school students and have been studying 8 TeV ATLAS Open Data since 2015. Inspired by this success, a new ATLAS Open Data initiative was launched in 2016 for university students. A comprehensive educational platform was thus developed featuring a second 8 TeV dataset and a new set of educational tools. The 8 TeV datasets and associated tools are presented and discussed here, as well as a selection of activities studying the ATLAS Open Data 8 TeV datasets.

  7. Recent Development on the NOAA's Global Surface Temperature Dataset

    Science.gov (United States)

    Zhang, H. M.; Huang, B.; Boyer, T.; Lawrimore, J. H.; Menne, M. J.; Rennie, J.

    2016-12-01

    Global Surface Temperature (GST) is one of the most widely used indicators for climate trend and extreme analyses. A widely used GST dataset is the NOAA merged land-ocean surface temperature dataset known as NOAAGlobalTemp (formerly MLOST). The NOAAGlobalTemp had recently been updated from version 3.5.4 to version 4. The update includes a significant improvement in the ocean surface component (Extended Reconstructed Sea Surface Temperature or ERSST, from version 3b to version 4) which resulted in increased temperature trends in recent decades. Since then, advancements in both the ocean component (ERSST) and land component (GHCN-Monthly) have been made, including the inclusion of Argo float SSTs and expanded EOT modes in ERSST, and the use of ISTI databank in GHCN-Monthly. In this presentation, we describe the impact of those improvements on the merged global temperature dataset, in terms of global trends and other aspects.

  8. The OXL format for the exchange of integrated datasets

    Directory of Open Access Journals (Sweden)

    Taubert Jan

    2007-12-01

    Full Text Available A prerequisite for systems biology is the integration and analysis of heterogeneous experimental data stored in hundreds of life-science databases and millions of scientific publications. Several standardised formats for the exchange of specific kinds of biological information exist. Such exchange languages facilitate the integration process; however, they are not designed to transport integrated datasets. A format for exchanging integrated datasets needs to (i) cover data from a broad range of application domains, (ii) be flexible and extensible to combine many different complex data structures, (iii) include metadata and semantic definitions, (iv) include inferred information, (v) identify the original data source for integrated entities and (vi) transport large integrated datasets. Unfortunately, none of the exchange formats from the biological domain (e.g. BioPAX, MAGE-ML, PSI-MI, SBML) or the generic approaches (RDF, OWL) fulfil these requirements in a systematic way.

  9. Typing of Canine Parvovirus Strains Circulating in North-East China.

    Science.gov (United States)

    Zhao, H; Wang, J; Jiang, Y; Cheng, Y; Lin, P; Zhu, H; Han, G; Yi, L; Zhang, S; Guo, L; Cheng, S

    2017-04-01

    Canine parvovirus (CPV) is highly contagious and is a major cause of haemorrhagic enteritis and myocarditis in dogs. We investigated the genetic variation of emerging CPV strains by sequencing 64 CPV VP2 genes from 216 clinical samples of dogs from Heilongjiang, Jilin, Liaoning, Shandong and Hebei in 2014. Genetic analysis revealed that CPV-2b was predominant in Hebei and CPV-2a was predominant in the other four provinces. In addition, a CPV-2c strain has emerged in Shandong province. All samples had a Ser-Ala substitution at residue 297 and an Ile-Arg substitution at residue 324. Interestingly, in five separate canine samples, we found a mutation of Gln370 to Arg, until now detected only in isolates from pandas. The phylogenetic analysis showed clear distinctions between epidemic isolates and vaccine strains and between Chinese CPV-2c strains and CPV-2c strains found in other countries. Monitoring recent incidence of CPV strains enables evaluation and implementation of disease control strategies. © 2015 Blackwell Verlag GmbH.

  10. Two highly divergent lineages of exfoliative toxin B-encoding plasmids revealed in impetigo strains of Staphylococcus aureus.

    Science.gov (United States)

    Botka, Tibor; Růžičková, Vladislava; Svobodová, Karla; Pantůček, Roman; Petráš, Petr; Čejková, Darina; Doškař, Jiří

    2017-09-01

    Exfoliative toxin B (ETB) encoded by some large plasmids plays a crucial role in epidermolytic diseases caused by Staphylococcus aureus. We have found as yet unknown types of etb gene-positive plasmids isolated from a set of impetigo strains implicated in outbreaks of pemphigus neonatorum in Czech maternity hospitals. Plasmids from the strains of clonal complex CC121 were related to archetypal plasmid pETB TY4. Sharing a 33-kb core sequence including virulence genes for ETB, EDIN C, and lantibiotics, they were assigned to a stand-alone lineage, named pETB TY4-based plasmids. Differing from each other in the content of variable DNA regions, they formed four sequence types. In addition to them, a novel unique plasmid pETB608 isolated from a strain of ST130 was described. Carrying conjugative cluster genes, as well as new variants of etb and edinA genes, pETB608 could be regarded as a source of a new lineage of ETB plasmids. We have designed a helpful detection assay, which facilitates the precise identification of all the described types of ETB plasmids. Copyright © 2017 Elsevier GmbH. All rights reserved.

  11. Inhomogeneous strain induced by fast neutron irradiation in NaKSO4 crystals

    Energy Technology Data Exchange (ETDEWEB)

    Kandil, S.H.; Kassem, M.E.; El-Khatib, A.; El-Gamal, M.A.; El-Wahidy, E.F.

    1987-11-01

    The paper reports the effect of fast neutron irradiation on the thermal properties of NaKSO4 crystals in the temperature range 400-475 K. Results are presented for the thermal expansion, tensile strain and specific heat of NaKSO4, as a function of neutron irradiation dose. All these results revealed an inhomogeneous strain induced by the radiation. It is suggested that this induced inhomogeneous strain could be used to detect neutron exposure doses.

  12. Incorrect strain information for mouse cell lines: sequential influence of misidentification on sublines

    OpenAIRE

    Uchio-Yamada, Kozue; Kasai, Fumio; Ozawa, Midori; Kohara, Arihiro

    2016-01-01

    Misidentification or cross-contamination of cell lines can cause serious issues. Human cell lines have been authenticated by short tandem repeat profiling; however, mouse cell lines have not been adequately assessed. In this study, mouse cell lines registered with the JCRB cell bank were examined by simple sequence length polymorphism (SSLP) analysis to identify their strains. Based on comparisons with 7 major inbred strains, our results revealed their strains in 80 of 90 cell lines. However,...

  13. Developing a Data-Set for Stereopsis

    Directory of Open Access Journals (Sweden)

    D.W Hunter

    2014-08-01

    Full Text Available Current research on binocular stereopsis in humans and non-human primates has been limited by a lack of available data-sets. Current data-sets fall into two categories: stereo-image sets with vergence but no ranging information (Hibbard, 2008, Vision Research, 48(12), 1427-1439) or combinations of depth information with binocular images and video taken from cameras in fixed fronto-parallel configurations exhibiting neither vergence nor focus effects (Hirschmuller & Scharstein, 2007, IEEE Conf. Computer Vision and Pattern Recognition). The techniques for generating depth information are also imperfect. Depth information is normally inaccurate or simply missing near edges and on partially occluded surfaces. For many areas of vision research these are the most interesting parts of the image (Goutcher, Hunter, Hibbard, 2013, i-Perception, 4(7), 484; Scarfe & Hibbard, 2013, Vision Research). Using state-of-the-art open-source ray-tracing software (PBRT) as a back-end, our intention is to release a set of tools that will allow researchers in this field to generate artificial binocular stereoscopic data-sets. Although not as realistic as photographs, computer-generated images have significant advantages in terms of control over the final output, and ground-truth information about scene depth is easily calculated at all points in the scene, even in partially occluded areas. While individual researchers have been developing similar stimuli by hand for many decades, we hope that our software will greatly reduce the time and difficulty of creating naturalistic binocular stimuli. Our intention in making this presentation is to elicit feedback from the vision community about what sort of features would be desirable in such software.

  14. BASE MAP DATASET, MAYES COUNTY, OKLAHOMA, USA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications: cadastral, geodetic control,...

  15. PENERAPAN TEKNIK BAGGING PADA ALGORITMA KLASIFIKASI UNTUK MENGATASI KETIDAKSEIMBANGAN KELAS DATASET MEDIS

    Directory of Open Access Journals (Sweden)

    Rizki Tri Prasetio

    2016-03-01

    Full Text Available ABSTRACT – Class imbalance problems have been reported to severely hinder the classification performance of many standard learning algorithms, and have attracted a great deal of attention from researchers in different fields. Therefore, a number of methods, such as sampling methods, cost-sensitive learning methods, and bagging- and boosting-based ensemble methods, have been proposed to solve these problems. Some medical datasets with two classes (binominal) suffer from an imbalance that reduces classification accuracy. This research proposes combining the bagging technique with classification algorithms to improve accuracy on medical datasets. The bagging technique is used to solve the problem of imbalanced classes. The proposed method is applied to three classification algorithms, i.e., naïve Bayes, decision tree and k-nearest neighbor. This research uses five medical datasets obtained from the UCI Machine Learning repository, i.e., breast-cancer, liver-disorder, heart-disease, pima-diabetes and vertebral-column. The results indicate that the proposed method yields a significant improvement for two classification algorithms, i.e., decision tree (t-test p value of 0.0184) and k-nearest neighbor (t-test p value of 0.0292), but not for naïve Bayes (t-test p value of 0.9236). After the bagging technique was applied to the five medical datasets, naïve Bayes had the highest accuracy for the breast-cancer dataset (96.14%, AUC 0.984), heart-disease (84.44%, AUC 0.911) and pima-diabetes (74.73%, AUC 0.806), while k-nearest neighbor had the best accuracy for liver-disorder (62.03%, AUC 0.632) and vertebral-column (82.26%, AUC 0.867). Keywords: ensemble technique, bagging, imbalanced class, medical dataset. ABSTRAKSI (Indonesian abstract) – The class imbalance problem has been reported to severely hinder the classification performance of many classification algorithms and has attracted a great deal of attention from
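
    As a rough illustration of the bagging idea described in this abstract, the sketch below trains several 1-nearest-neighbour classifiers on bootstrap resamples and combines them by majority vote. The toy data, the helper names (`bootstrap`, `nearest_label`, `bagged_predict`), the 1-NN base learner on a single feature, and the number of estimators are all assumptions made for illustration; this is not the paper's implementation and it does not reproduce the reported accuracies.

```python
import random
from collections import Counter

def bootstrap(data, rng):
    """Draw len(data) samples with replacement (one bootstrap replicate)."""
    return [rng.choice(data) for _ in data]

def nearest_label(train, x):
    """1-nearest-neighbour prediction on a 1-D feature (hypothetical base learner)."""
    return min(train, key=lambda pair: abs(pair[0] - x))[1]

def bagged_predict(data, x, n_estimators=25, seed=0):
    """Majority vote over base learners, each trained on its own bootstrap replicate."""
    rng = random.Random(seed)
    votes = Counter(
        nearest_label(bootstrap(data, rng), x) for _ in range(n_estimators)
    )
    return votes.most_common(1)[0][0]

# Toy imbalanced data: eight "neg" samples near 0, two "pos" samples near 10.
toy = [(0.1 * i, "neg") for i in range(8)] + [(10.0, "pos"), (10.5, "pos")]
```

    An odd number of estimators avoids tied votes in the two-class case; calling `bagged_predict(toy, 10.2)` queries a point next to the minority class.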

  16. Interstrain polymorphisms of isoenzyme profiles and mitochondrial DNA fingerprints among seven strains assigned to Acanthamoeba polyphaga.

    Science.gov (United States)

    Kong, H H; Park, J H; Chung, D I

    1995-12-01

    Interstrain polymorphisms of isoenzyme profiles and mitochondrial (Mt) DNA fingerprints were observed among seven strains of Acanthamoeba isolated from different sources and morphologically assigned to A. polyphaga. Mt DNA fingerprints by eight restriction endonucleases (Bgl II, Sca I, Cla I, EcoR I, Xba I, Kpn I, Sal I, and Sst I) revealed considerable interstrain polymorphisms. Isoenzyme profiles revealed considerable interstrain polymorphisms for acid phosphatase, lactate dehydrogenase, and glucose-6-phosphate dehydrogenase, while those for glucose phosphate isomerase, leucine aminopeptidase, and malate dehydrogenase showed similarity. Despite the interstrain polymorphisms, the isoenzyme profiles and Mt DNA fingerprints of the strain Ap were found to be identical to those of the strain Jones. Mt DNA fingerprinting was found to be highly applicable for strain identification, characterization, and differentiation.

  17. Thermal, Thermophysical, and Compositional Properties of the Moon Revealed by the Diviner Lunar Radiometer

    Science.gov (United States)

    Greenhagen, B. T.; Paige, D. A.

    2012-01-01

    The Diviner Lunar Radiometer is the first multispectral thermal instrument to globally map the surface of the Moon. After over three years in operation, this unprecedented dataset has revealed the extreme nature of the Moon's thermal environment, thermophysical properties, and surface composition.

  18. CERC Dataset (Full Hadza Data)

    DEFF Research Database (Denmark)

    2016-01-01

    The dataset includes demographic, behavioral, and religiosity data from eight different populations from around the world. The samples were drawn from: (1) Coastal and (2) Inland Tanna, Vanuatu; (3) Hadzaland, Tanzania; (4) Lovu, Fiji; (5) Pointe aux Piment, Mauritius; (6) Pesqueiro, Brazil; (7......) Kyzyl, Tyva Republic; and (8) Yasawa, Fiji. Related publication: Purzycki, et al. (2016). Moralistic Gods, Supernatural Punishment and the Expansion of Human Sociality. Nature, 530(7590): 327-330....

  19. Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semantic Technologies

    Science.gov (United States)

    Ma, X.

    2014-12-01

    Knowledge evolves in geoscience, and the evolution is reflected in datasets. In a context with distributed data sources, the evolution of knowledge may cause considerable challenges to data management and re-use. For example, a short news item published in 2009 (Mascarelli, 2009) revealed the geoscience community's concern that the International Commission on Stratigraphy's change to the definition of Quaternary may require heavy reworking of geologic maps. Now we are in the era of the World Wide Web, and geoscience knowledge is increasingly modeled and encoded in the form of ontologies and vocabularies by using semantic technologies. Accordingly, knowledge evolution leads to a consequence called ontology dynamics. Flouris et al. (2008) summarized 10 topics of general ontology changes/dynamics, such as ontology mapping, morphism, evolution, debugging and versioning. Ontology dynamics has impacts at several stages of a data life cycle and causes challenges, such as the request for reworking of the extant data in a data center, semantic mismatch among data sources, differing understanding of the same dataset between data providers and data users, and error propagation in cross-discipline data discovery and re-use (Ma et al., 2014). This presentation will analyze the best practices in the geoscience community so far and summarize a few recommendations to reduce the negative impacts of ontology dynamics in a data life cycle, including communities of practice and collaboration on ontology and vocabulary building, linking data records to standardized terms, and methods for (semi-)automatic reworking of datasets using semantic technologies. References: Flouris, G., Manakanatas, D., Kondylakis, H., Plexousakis, D., Antoniou, G., 2008. Ontology change: classification and survey. The Knowledge Engineering Review 23 (2), 117-152. Ma, X., Fox, P., Rozell, E., West, P., Zednik, S., 2014. Ontology dynamics in a data life cycle: Challenges and recommendations

  20. Glyco-centric lectin magnetic bead array (LeMBA) − proteomics dataset of human serum samples from healthy, Barrett's esophagus and esophageal adenocarcinoma individuals

    Directory of Open Access Journals (Sweden)

    Alok K. Shah

    2016-06-01

    Full Text Available This data article describes serum glycoprotein biomarker discovery and qualification datasets generated using lectin magnetic bead array (LeMBA) – mass spectrometry techniques, described in “Serum glycoprotein biomarker discovery and qualification pipeline reveals novel diagnostic biomarker candidates for esophageal adenocarcinoma” [1]. Serum samples collected from healthy, metaplastic Barrett's esophagus (BE) and esophageal adenocarcinoma (EAC) individuals were profiled for glycoprotein subsets via differential lectin binding. The biomarker discovery proteomics dataset, consisting of 20 individual lectin pull-downs for 29 serum samples with a spiked-in internal standard chicken ovalbumin protein, has been deposited in the PRIDE partner repository of the ProteomeXchange Consortium with the data set identifier PRIDE: http://www.ebi.ac.uk/pride/archive/projects/PXD002442. Annotated MS/MS spectra for the peptide identifications can be viewed using MS-Viewer (http://prospector2.ucsf.edu/prospector/cgi-bin/msform.cgi?form=msviewer) using search key “jn7qafftux”. The qualification dataset contained 6-lectin pulldown-coupled multiple reaction monitoring-mass spectrometry (MRM-MS) data for 41 protein candidates, from 60 serum samples. This dataset is available as supplemental files with the original publication [1].

  1. Strain ratio effects on low-cycle fatigue behavior and deformation microstructure of 2124-T851 aluminum alloy

    Energy Technology Data Exchange (ETDEWEB)

    Hao, Hong, E-mail: 10928008@zju.edu.cn [Institute for Process Equipment, Zhejiang University, Hangzhou 310027 (China); School of Environment and Safety, Taiyuan University of Science and Technology, Taiyuan 030024 (China); Ye, Duyi, E-mail: duyi_ye@zju.edu.cn [Institute for Process Equipment, Zhejiang University, Hangzhou 310027 (China); Chen, Chuanyong [Institute for Process Equipment, Zhejiang University, Hangzhou 310027 (China)

    2014-05-01

    The low-cycle fatigue tests of 2124-T851 aluminum alloy with strain ratios of −1, −0.06, 0.06 and 0.5 were conducted under constant amplitude at room temperature. Microstructural and fractographic examinations of the material after fatigue tests were performed by optical microscopy (OM) and scanning electron microscopy (SEM), respectively. Firstly, the results showed that the material exhibited cyclic softening characteristic as a whole. The degree of softening decreased linearly with the increasing strain amplitude and the decreasing strain ratio. The lower fatigue life and ductility of the material corresponded to the larger strain ratios. Secondly, microstructure observations revealed that the density and length of slip bands increased with the increasing strain ratio at the given strain amplitude, and so did the volume fraction and size of coarse constituents, which were responsible for the reduction of fatigue life and ductility of the material. Finally, the SEM micrographs revealed that multiple crack initiation sites took place on the fracture surfaces at different strain ratios. The reduction of stable crack growth area with the increasing strain ratio was observed. Unstable crack growth region was only observed under R≠−1.

  2. Cyclic Strain Resistance, Stress Response, Fatigue Life, and Fracture Behavior of High Strength Low Alloy Steel 300 M

    Science.gov (United States)

    Manigandan, K.; Srivatsan, T. S.; Tammana, Deepthi; Poorgangi, Behrang; Vasudevan, Vijay K.

    2014-05-01

    The focus of this technical manuscript is a record of the specific role of microstructure and test specimen orientation on cyclic stress response, cyclic strain resistance, and cyclic stress versus strain response, deformation and fracture behavior of alloy steel 300 M. The cyclic strain amplitude-controlled fatigue properties of this ultra-high strength alloy steel revealed a linear trend for the variation of log elastic strain amplitude with log reversals-to-failure, and log plastic strain amplitude with log reversals-to-failure for both longitudinal and transverse orientations. Test specimens of the longitudinal orientation showed only a marginal improvement over the transverse orientation at equivalent values of plastic strain amplitude. Cyclic stress response revealed a combination of initial hardening for the first few cycles followed by gradual softening for a large portion of fatigue life before culminating in rapid softening prior to catastrophic failure by fracture. Fracture characteristics of test specimens of this alloy steel were different at both the macroscopic and fine microscopic levels over the entire range of cyclic strain amplitudes examined. Both macroscopic and fine microscopic observations revealed fracture to be a combination of both brittle and ductile mechanisms. The underlying mechanisms governing stress response, deformation characteristics, fatigue life, and final fracture behavior are presented and discussed in light of the competing and mutually interactive influences of test specimen orientation, intrinsic microstructural effects, deformation characteristics of the microstructural constituents, cyclic strain amplitude, and response stress.
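
    The linear trend of log strain amplitude against log reversals-to-failure mentioned in this abstract corresponds to a straight-line fit in log-log space. The sketch below shows such a fit by ordinary least squares; the function name `loglog_fit` and the data points are made up for illustration (generated from an assumed power law) and are not measurements from the study.

```python
import math

def loglog_fit(reversals, amplitudes):
    """Least-squares line: log10(amplitude) = intercept + slope * log10(reversals)."""
    xs = [math.log10(n) for n in reversals]
    ys = [math.log10(a) for a in amplitudes]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    intercept = my - slope * mx
    return slope, intercept

# Made-up points generated from amplitude = 0.01 * reversals ** -0.1.
reversals = [1e2, 1e3, 1e4, 1e5]
amplitudes = [0.01 * n ** -0.1 for n in reversals]
slope, intercept = loglog_fit(reversals, amplitudes)
```

    Because the illustrative points lie exactly on a power law, the fit recovers the assumed exponent as the slope and the log of the assumed coefficient as the intercept.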

  3. Comparative Phosphoproteomics Reveals the Role of AmpC β-lactamase Phosphorylation in the Clinical Imipenem-resistant Strain Acinetobacter baumannii SK17

    Science.gov (United States)

    Lai, Juo-Hsin; Yang, Jhih-Tian; Chern, Jeffy; Chen, Te-Li; Wu, Wan-Ling; Liao, Jiahn-Haur; Tsai, Shih-Feng; Liang, Suh-Yuen; Chou, Chi-Chi

    2016-01-01

    Nosocomial infectious outbreaks caused by multidrug-resistant Acinetobacter baumannii have emerged as a serious threat to human health. Phosphoproteomics of pathogenic bacteria has been used to identify the mechanisms of bacterial virulence and antimicrobial resistance. In this study, we used a shotgun strategy combined with high-accuracy mass spectrometry to analyze the phosphoproteomics of the imipenem-susceptible strain SK17-S and -resistant strain SK17-R. We identified 410 phosphosites on 248 unique phosphoproteins in SK17-S and 285 phosphosites on 211 unique phosphoproteins in SK17-R. The distributions of the Ser/Thr/Tyr/Asp/His phosphosites in SK17-S and SK17-R were 47.0%/27.6%/12.4%/8.0%/4.9% versus 41.4%/29.5%/17.5%/6.7%/4.9%, respectively. The Ser-90 phosphosite, located on the catalytic motif S88VS90K of the AmpC β-lactamase, was first identified in SK17-S. Based on site-directed mutagenesis, the nonphosphorylatable mutant S90A was found to be more resistant to imipenem, whereas the phosphorylation-simulated mutant S90D was sensitive to imipenem. Additionally, the S90A mutant protein exhibited higher β-lactamase activity and conferred greater bacterial protection against imipenem in SK17-S compared with the wild-type. In sum, our results revealed that in A. baumannii, Ser-90 phosphorylation of AmpC negatively regulates both β-lactamase activity and the ability to counteract the antibiotic effects of imipenem. These findings highlight the impact of phosphorylation-mediated regulation in antibiotic-resistant bacteria on future drug design and new therapies. PMID:26499836

  4. The substrate strain mediated magnetotransport properties of surface states in topological insulators

    Energy Technology Data Exchange (ETDEWEB)

    Ma, Ning, E-mail: maning@stu.xjtu.edu.cn [Department of Physics, MOE Key Laboratory of Advanced Transducers and Intelligent Control System, Taiyuan University of Technology, Taiyuan 030024 (China); Department of Applied Physics, MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, Xi'an Jiaotong University, Xi'an 710049 (China); Zhang, Shengli, E-mail: zhangsl@mail.xjtu.edu.cn [Department of Applied Physics, MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, Xi'an Jiaotong University, Xi'an 710049 (China); Liu, Daqing, E-mail: liudq@cczu.edu.cn [School of Mathematics and Physics, Changzhou University, Changzhou 213164 (China)

    2016-10-14

    Recent experiments reveal that strained bulk HgTe can be regarded as a three-dimensional topological insulator (TI). We further explore the strain effects on magnetotransport in HgTe in a magnetic field. We find that the substrate strain, associated with the surface index of carriers, can remove the surface degeneracy in Landau levels. This accordingly induces well-separated surface quantum Hall plateaus and Shubnikov–de Haas oscillations. These results can be used to generate and detect surface polarization, not only in HgTe but also in a broad class of TIs, which would be great news for electronic applications of TIs. - Highlights: • We explore the strain mediated magnetotransport in topological insulators. • We analytically derive the zero frequency magnetoconductivity. • The strain removes the surface degeneracy in Landau levels. • The strain gives rise to the splitting and mixture of Landau levels. • The strain leads to the surface asymmetric spectrum of conductivity.

  5. Phenotypic and molecular characterization of Rhizobium vitis strains from vineyards in Turkey

    Directory of Open Access Journals (Sweden)

    Didem CANIK OREL

    2016-05-01

    Full Text Available Crown gall-affected grapevine samples were collected during 2009–2010 from major vineyards, located in different Turkish provinces. One hundred and three bacterial strains were obtained from 88 vineyards and 18 grapevine varieties; they were tumorigenic when inoculated in tobacco, sunflower and Datura stramonium plants and were identified as Rhizobium vitis using biochemical and physiological tests as well as PCR and specific primers. Nineteen R. vitis strains presented a number of anomalous biochemical and physiological characters. PCR and opine-specific primers revealed the presence of octopine/cucumopine-type plasmid in 82 R. vitis strains, nopaline-type plasmids in 18 strains and vitopine-type plasmids in three strains. Clonal relationship of strains was determined using Pulsed Field Gel Electrophoresis following digestion of genomic DNA with the restriction endonuclease PmeI. The greatest genetic diversity was found for the strains from Denizli, Ankara and Nevşehir provinces. Nopaline and vitopine-types of Rhizobium vitis were detected for the first time in Turkey.

  6. Phenotypic and genetic characterization of Paecilomyces lilacinus strains with biocontrol activity against root-knot nematodes.

    Science.gov (United States)

    Gunasekera, T S; Holland, R J; Gillings, M R; Briscoe, D A; Neethling, D C; Williams, K L; Nevalainen, K M

    2000-09-01

    Efficient selection of fungi for biological control of nematodes requires a series of screening assays. Assessment of genetic diversity in the candidate species maximizes the variety of the isolates tested and permits the assignment of a particular genotype with high nematophagous potential using a rapid novel assay. Molecular analyses also facilitate separation between isolates, allowing the identification of proprietary strains and trace biocontrol strains in the environment. The resistance of propagules to UV radiation is an important factor in the survival of a biocontrol agent. We have analyzed 15 strains of the nematophagous fungus Paecilomyces lilacinus using these principles. Arbitrarily primed DNA and allozyme assays were applied to place the isolates into genetic clusters, and demonstrated that some genetically related P. lilacinus strains exhibit widespread geographic distributions. When exposed to UV radiation, some weakly nematophagous strains were generally more susceptible than effective isolates. A microtitre tray-based assay used to screen the pathogenic activity of each isolate to Meloidogyne javanica egg masses revealed that the nematophagous ability varied between 37%-100%. However, there was no clear relationship between nematophagous ability and genetic clusters. Molecular characterizations revealed sufficient diversity to allow tracking of strains released into the environment.

  7. Histophilus somni IbpA Fic cytotoxin is conserved in disease strains and most carrier strains from cattle, sheep and bison.

    Science.gov (United States)

    Zekarias, B; O'Toole, D; Lehmann, J; Corbeil, L B

    2011-04-21

    Histophilus somni causes bovine pneumonia, septicemia, myocarditis, thrombotic meningoencephalitis and arthritis, as well as a genital or upper respiratory carrier state in normal animals. However, differences in virulence factors among strains are not well studied. The surface and secreted immunoglobulin binding protein A (IbpA) Fic motif of H. somni causes bovine alveolar type 2 (BAT2) cells to retract, allowing virulent bacteria to cross the alveolar monolayer. Because H. somni IbpA is an important virulence factor, its presence was evaluated in different strains from cattle, sheep and bison to define whether there are syndrome specific markers and whether antigenic/molecular/functional conservation occurs. A few preputial carrier strains lacked IbpA by Western blotting but all other tested disease or carrier strains were IbpA positive. These positive strains had either both IbpA DR1/Fic and IbpA DR2/Fic or only IbpA DR2/Fic by PCR. IbpA Fic mediated cytotoxicity for BAT2 cells and sequence analysis of IbpA DR2/Fic from selected strains revealed conservation of sequence and function in disease and IbpA positive carrier strains. Passive protection of mice against H. somni septicemia with antibody to IbpA DR2/Fic, along with previous data, indicates that the IbpA DR1/Fic and/or DR2/Fic domains are candidate vaccine antigens for protection against many strains of H. somni. Since IbpA DR2/Fic is conserved in most carrier strains, they may be virulent if introduced to susceptible animals at susceptible sites. Conservation of the protective IbpA antigen in all disease isolates tested is encouraging for development of protective vaccines and diagnostic assays. Copyright © 2010 Elsevier B.V. All rights reserved.

  8. Synthetic ALSPAC longitudinal datasets for the Big Data VR project.

    Science.gov (United States)

    Avraam, Demetris; Wilson, Rebecca C; Burton, Paul

    2017-01-01

    Three synthetic datasets - of observation size 15,000, 155,000 and 1,555,000 participants, respectively - were created by simulating eleven cardiac and anthropometric variables from nine collection ages of the ALSPAC birth cohort study. The synthetic datasets retain similar data properties to the ALSPAC study data they are simulated from (co-variance matrices, as well as the mean and variance values of the variables) without including the original data itself or disclosing participant information. In this instance, the three synthetic datasets have been utilised in an academia-industry collaboration to build a prototype virtual reality data analysis software, but they could have a broader use in method and software development projects where sensitive data cannot be freely shared.
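
    One common way to generate synthetic records that keep a target mean vector and covariance matrix is to draw from a multivariate normal distribution using a Cholesky factor of the covariance. The sketch below does this for two variables. The abstract does not state which simulation method was used, so the normality assumption, the function names and the numeric moments here are all hypothetical.

```python
import math
import random

def cholesky_2x2(cov):
    """Lower-triangular factor L with L @ L.T == cov for a 2x2 covariance matrix."""
    (a, b), (_, d) = cov
    l11 = math.sqrt(a)
    l21 = b / l11
    l22 = math.sqrt(d - l21 * l21)
    return l11, l21, l22

def simulate(mean, cov, n, seed=0):
    """Draw n synthetic (x, y) rows from a bivariate normal with the given moments."""
    rng = random.Random(seed)
    l11, l21, l22 = cholesky_2x2(cov)
    rows = []
    for _ in range(n):
        z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
        rows.append((mean[0] + l11 * z1, mean[1] + l21 * z1 + l22 * z2))
    return rows

def sample_moments(rows):
    """Return the sample mean vector and the sample covariance of x and y."""
    n = len(rows)
    mx = sum(x for x, _ in rows) / n
    my = sum(y for _, y in rows) / n
    cxy = sum((x - mx) * (y - my) for x, y in rows) / (n - 1)
    return (mx, my), cxy

# Hypothetical moments standing in for two anthropometric variables.
mean = (140.0, 35.0)
cov = ((49.0, 28.0), (28.0, 36.0))
synthetic = simulate(mean, cov, 15000)
```

    Comparing `sample_moments(synthetic)` with the target `mean` and `cov` gives a quick check that the simulated rows reproduce the intended summary statistics without containing any original record.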

  9. BASE MAP DATASET, HONOLULU COUNTY, HAWAII, USA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  10. BASE MAP DATASET, LOS ANGELES COUNTY, CALIFORNIA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  11. BASE MAP DATASET, CHEROKEE COUNTY, SOUTH CAROLINA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  12. BASE MAP DATASET, EDGEFIELD COUNTY, SOUTH CAROLINA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  13. BASE MAP DATASET, SANTA CRUZ COUNTY, CALIFORNIA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  14. Phenotypic and Genotypic Characterization of Virulent Yersinia enterocolitica Strains Unable To Ferment Sucrose

    Science.gov (United States)

    Guiyoule, Annie; Guinet, Françoise; Martin, Liliane; Benoit, Catherine; Desplaces, Nicole; Carniel, Elisabeth

    1998-01-01

    Several atypical sucrose-negative Yersinia strains, isolated from clinical samples and sometimes associated with symptoms, proved to have full virulence potential in in vitro and in vivo testings. DNA-relatedness studies revealed that they were authentic Yersinia enterocolitica strains. Therefore, atypical sucrose-negative Yersinia isolates should be analyzed for their virulence potential. PMID:9705424

  15. The Genetic Relationship between Leishmania aethiopica and Leishmania tropica Revealed by Comparing Microsatellite Profiles.

    Science.gov (United States)

    Krayter, Lena; Schnur, Lionel F; Schönian, Gabriele

    2015-01-01

    Leishmania (Leishmania) aethiopica and L. (L.) tropica cause cutaneous leishmaniases and appear to be related. L. aethiopica is geographically restricted to Ethiopia and Kenya; L. tropica is widely dispersed from the Eastern Mediterranean, through the Middle East into eastern India and in north, east and south Africa. Their phylogenetic inter-relationship is only partially revealed. Some studies indicate a close relationship. Here, eight strains of L. aethiopica were characterized genetically and compared with 156 strains of L. tropica from most of the latter species' geographical range to discern the closeness. Twelve unlinked microsatellite markers previously used to genotype strains of L. tropica were successfully applied to the eight strains of L. aethiopica and their microsatellite profiles were compared to those of 156 strains of L. tropica from various geographical locations that were isolated from human cases of cutaneous and visceral leishmaniasis, hyraxes and sand fly vectors. All the microsatellite profiles were subjected to various analytical algorithms: Bayesian statistics, distance-based and factorial correspondence analysis, revealing: (i) the species L. aethiopica, though geographically restricted, is genetically very heterogeneous; (ii) the strains of L. aethiopica formed a distinct genetic cluster; and (iii) strains of L. aethiopica are closely related to strains of L. tropica and more so to the African ones, although, by factorial correspondence analysis, clearly separate from them. The successful application of the 12 microsatellite markers, originally considered species-specific for the species L. tropica, to strains of L. aethiopica confirmed the close relationship between these two species. The Bayesian and distance-based methods clustered the strains of L. aethiopica among African strains of L. tropica, while the factorial correspondence analysis indicated a clear separation between the two species. There was no correlation between

  16. Satellite-Based Precipitation Datasets

    Science.gov (United States)

    Munchak, S. J.; Huffman, G. J.

    2017-12-01

    Of the possible sources of precipitation data, those based on satellites provide the greatest spatial coverage. There is a wide selection of datasets, algorithms, and versions from which to choose, which can be confusing to non-specialists wishing to use the data. The International Precipitation Working Group (IPWG) maintains tables of the major publicly available, long-term, quasi-global precipitation data sets (http://www.isac.cnr.it/ ipwg/data/datasets.html), and this talk briefly reviews the various categories. As examples, NASA provides two sets of quasi-global precipitation data sets: the older Tropical Rainfall Measuring Mission (TRMM) Multi-satellite Precipitation Analysis (TMPA) and current Integrated Multi-satellitE Retrievals for Global Precipitation Measurement (GPM) mission (IMERG). Both provide near-real-time and post-real-time products that are uniformly gridded in space and time. The TMPA products are 3-hourly 0.25°x0.25° on the latitude band 50°N-S for about 16 years, while the IMERG products are half-hourly 0.1°x0.1° on 60°N-S for over 3 years (with plans to go to 16+ years in Spring 2018). In addition to the precipitation estimates, each data set provides fields of other variables, such as the satellite sensor providing estimates and estimated random error. The discussion concludes with advice about determining suitability for use, the necessity of being clear about product names and versions, and the need for continued support for satellite- and surface-based observation.
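
    To make the difference between the two NASA products concrete, the short sketch below works out the grid dimensions and the number of fields per day implied by the resolutions quoted in the abstract. The function names are hypothetical and chosen only for illustration; the sketch assumes a full 360° longitude span and a regular latitude-longitude grid, which the abstract implies ("uniformly gridded") but does not spell out.

```python
def grid_shape(resolution_deg, lat_limit_deg):
    """Return (n_lat, n_lon) for a regular grid spanning
    lat_limit_deg N to lat_limit_deg S and all longitudes (assumed)."""
    n_lat = round(2 * lat_limit_deg / resolution_deg)
    n_lon = round(360 / resolution_deg)
    return n_lat, n_lon


def fields_per_day(minutes_per_field):
    """Number of time steps in one day for a given sampling interval."""
    return (24 * 60) // minutes_per_field


# Values taken from the abstract: TMPA is 3-hourly, 0.25 degrees, 50N-S;
# IMERG is half-hourly, 0.1 degrees, 60N-S.
tmpa_shape = grid_shape(0.25, 50)
imerg_shape = grid_shape(0.1, 60)

tmpa_cells_per_day = tmpa_shape[0] * tmpa_shape[1] * fields_per_day(180)
imerg_cells_per_day = imerg_shape[0] * imerg_shape[1] * fields_per_day(30)

print("TMPA grid:", tmpa_shape, "values per day:", tmpa_cells_per_day)
print("IMERG grid:", imerg_shape, "values per day:", imerg_cells_per_day)
print("IMERG / TMPA volume ratio:", imerg_cells_per_day / tmpa_cells_per_day)
```

    Under these assumptions IMERG delivers roughly 45 times as many gridded values per day as TMPA, which is one practical reason to be explicit about product names and versions when planning storage and processing.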

  17. FASTQSim: platform-independent data characterization and in silico read generation for NGS datasets.

    Science.gov (United States)

    Shcherbina, Anna

    2014-08-15

    High-throughput next generation sequencing technologies have enabled rapid characterization of clinical and environmental samples. Consequently, the largest bottleneck to actionable data has become sample processing and bioinformatics analysis, creating a need for accurate and rapid algorithms to process genetic data. Perfectly characterized in silico datasets are a useful tool for evaluating the performance of such algorithms. Background contaminating organisms are observed in sequenced mixtures of organisms. In silico samples provide exact truth. To create the best value for evaluating algorithms, in silico data should mimic actual sequencer data as closely as possible. FASTQSim is a tool that provides the dual functionality of NGS dataset characterization and metagenomic data generation. FASTQSim is sequencing platform-independent, and computes distributions of read length, quality scores, indel rates, single point mutation rates, indel size, and similar statistics for any sequencing platform. To create training or testing datasets, FASTQSim has the ability to convert target sequences into in silico reads with specific error profiles obtained in the characterization step. FASTQSim enables users to assess the quality of NGS datasets. The tool provides information about read length, read quality, repetitive and non-repetitive indel profiles, and single base pair substitutions. FASTQSim allows the user to simulate individual read datasets that can be used as standardized test scenarios for planning sequencing projects or for benchmarking metagenomic software. In this regard, in silico datasets generated with the FASTQSim tool hold several advantages over natural datasets: they are sequencing platform independent, extremely well characterized, and less expensive to generate. Such datasets are valuable in a number of applications, including the training of assemblers for multiple platforms, benchmarking bioinformatics algorithm performance, and creating challenge

  18. Se-SAD serial femtosecond crystallography datasets from selenobiotinyl-streptavidin

    Science.gov (United States)

    Yoon, Chun Hong; Demirci, Hasan; Sierra, Raymond G.; Dao, E. Han; Ahmadi, Radman; Aksit, Fulya; Aquila, Andrew L.; Batyuk, Alexander; Ciftci, Halilibrahim; Guillet, Serge; Hayes, Matt J.; Hayes, Brandon; Lane, Thomas J.; Liang, Meng; Lundström, Ulf; Koglin, Jason E.; Mgbam, Paul; Rao, Yashas; Rendahl, Theodore; Rodriguez, Evan; Zhang, Lindsey; Wakatsuki, Soichi; Boutet, Sébastien; Holton, James M.; Hunter, Mark S.

    2017-04-01

    We provide a detailed description of selenobiotinyl-streptavidin (Se-B SA) co-crystal datasets recorded using the Coherent X-ray Imaging (CXI) instrument at the Linac Coherent Light Source (LCLS) for selenium single-wavelength anomalous diffraction (Se-SAD) structure determination. Se-B SA was chosen as the model system for its high affinity between biotin and streptavidin where the sulfur atom in the biotin molecule (C10H16N2O3S) is substituted with selenium. The dataset was collected at three different transmissions (100, 50, and 10%) using a serial sample chamber setup which allows for two sample chambers, a front chamber and a back chamber, to operate simultaneously. Diffraction patterns from Se-B SA were recorded to a resolution of 1.9 Å. The dataset is publicly available through the Coherent X-ray Imaging Data Bank (CXIDB) and also on LCLS compute nodes as a resource for research and algorithm development.

  19. Dataset of transcriptional landscape of B cell early activation

    Directory of Open Access Journals (Sweden)

    Alexander S. Garruss

    2015-09-01

    Signaling via B cell receptors (BCR) and Toll-like receptors (TLRs) results in activation of B cells with distinct physiological outcomes, but transcriptional regulatory mechanisms that drive activation and distinguish these pathways remain unknown. At early time points after BCR and TLR ligand exposure, 0.5 and 2 h, RNA-seq was performed allowing observations on rapid transcriptional changes. At 2 h, ChIP-seq was performed to allow observations on important regulatory mechanisms potentially driving transcriptional change. The dataset includes RNA-seq, ChIP-seq of control (Input), RNA Pol II, H3K4me3, H3K27me3, and a separate RNA-seq for miRNA expression, which can be found at Gene Expression Omnibus Dataset GSE61608. Here, we provide details on the experimental and analysis methods used to obtain and analyze this dataset and to examine the transcriptional landscape of B cell early activation.

  20. U.S. Climate Divisional Dataset (Version Superseded)

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — This data has been superseded by a newer version of the dataset. Please refer to NOAA's Climate Divisional Database for more information. The U.S. Climate Divisional...

  1. Effect of strain on bond-specific reaction kinetics during the oxidation of H-terminated (111) Si

    International Nuclear Information System (INIS)

    Gokce, Bilal; Aspnes, David E.; Gundogdu, Kenan

    2011-01-01

    Although strain is used in semiconductor technology for manipulating optical, electronic, and chemical properties of semiconductors, the understanding of the microscopic phenomena that are affected or influenced by strain is still incomplete. Second-harmonic generation data obtained during the air oxidation of H-terminated (111) Si reveal the effect of compressive strain on this chemical reaction. Even small amounts of strain manipulate the reaction kinetics of surface bonds significantly, with tensile strain enhancing oxidation and compressive strain retarding it. This dramatic change suggests a strain-driven charge transfer mechanism between Si-H up bonds and Si-Si back bonds in the outer layer of Si atoms.

  2. Selection and evaluation of Malaysian Bacillus spp. strains as potential probiotics in cultured tiger grouper (Epinephelus fuscoguttatus).

    Science.gov (United States)

    Yasin, Ina-salwany Md; Razak, Nabilah Fatin; Natrah, F M I; Harmin, Sharr Azni

    2016-07-01

    A total of 58 Gram-positive bacterial strains were isolated from the marine environment and screened as potential probiotics for disease prevention and improving the productivity of tiger grouper Epinephelus fuscoguttatus larvae and juveniles. The bacteria were identified as Bacillus licheniformis, B. subtilis, B. circulans, B. sphaericus, B. cereus, Brevibacillus brevis, Corynebacterium propinquum, Leifsonia aquatica and Paenibacillus macerans. Only 24 strains showed antagonistic activities against four pathogenic strains: Vibrio alginolyticus, V. harveyi, V. parahaemolyticus and Aeromonas hydrophila, where two of the Bacillus strains, B12 and B45, demonstrated intermediate to highest levels of inhibitory activity against these pathogenic strains, respectively. Further assessment by co-culture assay showed that Bacillus strain B12 exhibited a total inhibition of V. alginolyticus, while B45 strain displayed no inhibitory activity. A mixed culture of Bacillus strains B12 and B45 was observed to outcompete V. alginolyticus at a cell density of 10(7) CFU ml(-1). Molecular identification and phylogenetic tree analysis grouped Bacillus strain B12 with the reference strains GQ340480 and JX290193 of B. amyloliquefaciens, and Bacillus strain B45 with the reference strain JF496522 of B. subtilis. Safety tests of the probionts by intraperitoneal administration of B12 and B45 strains at cell densities of 10(3), 10(5) and 10(7) CFU ml(-1) revealed no abnormalities and 100% survival of healthy Epinephelus fuscoguttatus juveniles within the 15-day experimental period. Overall, the study revealed that Bacillus B12 strain possesses tremendous probiotic potential that could be used as a feed supplement in tiger grouper diets.

  3. Global Genome Comparative Analysis Reveals Insights of Resistome and Life-Style Adaptation of Pseudomonas putida Strain T2-2 in Oral Cavity

    Directory of Open Access Journals (Sweden)

    Xin Yue Chan

    2014-01-01

    Most Pseudomonas putida strains are environmental microorganisms exhibiting a wide range of metabolic capability, but certain strains have been reported as rare opportunistic pathogens and some have emerged as multidrug-resistant P. putida. This study aimed to assess, via whole genome analysis, the drug resistance profile of P. putida strain T2-2 isolated from the oral cavity. At the same time, we also compared this nonenvironmental strain with environmentally isolated P. putida. In silico comparative genome analysis with available reference strains of P. putida shows that T2-2 has fewer genes for carbohydrate and aromatic compound metabolism, which suggests limited metabolic versatility. The detection of its edd gene also suggested that T2-2 catabolizes glucose via the ED pathway instead of the EMP pathway. On the other hand, its drug resistance profile was observed via in silico gene prediction, and most of the genes found were in agreement with drug-susceptibility testing in the laboratory by automated VITEK 2. In addition, the finding of putative genes of multidrug resistance efflux pumps and ATP-binding cassette transporters in this strain suggests a multidrug-resistant phenotype. In summary, it is believed that multiple metabolic characteristics and drug resistance in P. putida strain T2-2 helped in its survival in the human oral cavity.

  4. In-planta Sporulation Capacity Enhances Infectivity and Rhizospheric Competitiveness of Frankia Strains.

    Science.gov (United States)

    Cotin-Galvan, Laetitia; Pozzi, Adrien C; Schwob, Guillaume; Fournier, Pascale; Fernandez, Maria P; Herrera-Belaroussi, Aude

    2016-01-01

    Frankia Sp+ strains maintain their ability to sporulate in symbiosis with actinorhizal plants, producing abundant sporangia inside host plant cells, in contrast to Sp- strains, which are unable to perform in-planta sporulation. We herein examined the role of in-planta sporulation in Frankia infectivity and competitiveness for root infection. Fifteen strains belonging to different Sp+ and Sp- phylogenetic lineages were inoculated on seedlings of Alnus glutinosa (Ag) and A. incana (Ai). Strain competitiveness was investigated by performing Sp-/Sp+ co-inoculations. Plant inoculations were standardized using crushed nodules obtained under laboratory-controlled conditions (same plant species, age, and environmental factors). Specific oligonucleotide primers were developed to identify Frankia Sp+ and/or Sp- strains in the resulting nodules. Single inoculation experiments showed that (i) infectivity by Sp+ strains was significantly greater than that by Sp- strains, (ii) genetically divergent Sp+ strains exhibited different infective abilities, and (iii) Sp+ and Sp- strains showed different host preferences according to the origin (host species) of the inocula. Co-inoculations of Sp+ and Sp- strains revealed the greater competitiveness of Sp+ strains (98.3 to 100% of Sp+ nodules, with up to 15.6% nodules containing both Sp+ and Sp- strains). The results of the present study highlight differences in Sp+/Sp- strain ecological behaviors and provide new insights to strengthen the obligate symbiont hypothesis for Sp+ strains.

  5. UK surveillance: provision of quality assured information from combined datasets.

    Science.gov (United States)

    Paiba, G A; Roberts, S R; Houston, C W; Williams, E C; Smith, L H; Gibbens, J C; Holdship, S; Lysons, R

    2007-09-14

    Surveillance information is most useful when provided within a risk framework, which is achieved by presenting results against an appropriate denominator. Often the datasets are captured separately and for different purposes, and will have inherent errors and biases that can be further confounded by the act of merging. The United Kingdom Rapid Analysis and Detection of Animal-related Risks (RADAR) system contains data from several sources and provides both data extracts for research purposes and reports for wider stakeholders. Considerable efforts are made to optimise the data in RADAR during the Extraction, Transformation and Loading (ETL) process. Despite efforts to ensure data quality, the final dataset inevitably contains some data errors and biases, most of which cannot be rectified during subsequent analysis. So, in order for users to establish the 'fitness for purpose' of data merged from more than one data source, Quality Statements are produced as defined within the overarching surveillance Quality Framework. These documents detail identified data errors and biases following ETL and report construction as well as relevant aspects of the datasets from which the data originated. This paper illustrates these issues using RADAR datasets, and describes how they can be minimised.

  6. Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets.

    Science.gov (United States)

    Vishnevsky, Oleg V; Bocharnikov, Andrey V; Kolchanov, Nikolay A

    2018-02-01

    The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and led to the accumulation of information about a huge number of DNA sequences. Many web services are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. The enormous diversity of motifs makes them challenging to find. To avoid these difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of motif discovery programs declines dramatically as the query set size increases. As a result, only a fraction of the top "peak" ChIP-Seq segments can be analyzed, or the area of analysis has to be narrowed. Thus, motif discovery in massive datasets remains a challenging issue. The Argo_CUDA (Compute Unified Device Architecture) web service is designed to process massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in the 15-letter IUPAC code. Argo_CUDA is a fully exhaustive approach based on high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed motifs that correspond to known transcription factor binding sites.
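
    As an illustration of what a degenerate motif "written in 15-letter IUPAC code" means in practice, the sketch below checks a fixed-length motif against every window of a set of sequences. It is a plain CPU illustration only and is not the Argo_CUDA algorithm; the function names are hypothetical, only the forward strand is scanned, and the toy sequences are made up for the example.

```python
# The 15 IUPAC nucleotide codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def motif_matches(motif, window):
    """True if every base in the window is allowed by the motif code
    at the same position."""
    return len(motif) == len(window) and all(
        base in IUPAC[code] for code, base in zip(motif, window)
    )


def count_sequences_with_motif(motif, sequences):
    """Count sequences containing at least one window that matches
    the motif (forward strand only, exhaustive scan of all windows)."""
    k = len(motif)
    hits = 0
    for seq in sequences:
        if any(motif_matches(motif, seq[i:i + k])
               for i in range(len(seq) - k + 1)):
            hits += 1
    return hits


# Toy data, invented for illustration.
sequences = ["ACGTGA", "TTTTTT", "GGCATG"]
n_hits = count_sequences_with_motif("CRTG", sequences)
print(n_hits)
```

    An exhaustive search would repeat this test for every candidate motif of the chosen length, which is why the number of comparisons grows quickly and why a GPU implementation is attractive for large query sets.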

  7. Climate Prediction Center IR 4km Dataset

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — CPC IR 4km dataset was created from all available individual geostationary satellite data which have been merged to form nearly seamless global (60N-60S) IR...

  8. High genetic diversity of equine infectious anaemia virus strains from Slovenia revealed upon phylogenetic analysis of the p15 gag gene region.

    Science.gov (United States)

    Kuhar, U; Malovrh, T

    2016-03-01

    The equine infectious anaemia virus (EIAV), which belongs to the Retroviridae family, infects equids almost worldwide. Every year, sporadic EIAV cases are detected in Slovenia. The objective was to characterise the Slovenian EIAV strains phylogenetically in the p15 gag gene region in order to compare them with EIAV strains from abroad, especially with the recently published European strains. This was a cross-sectional study using material derived from post mortem examination. In total, 29 EIAV serologically positive horses from 18 different farms were examined in this study. Primers were designed to amplify the p15 gag gene region. Amplicons of 28 PCRs were subjected to direct DNA sequencing and phylogenetic analysis. Altogether, 28 EIAV sequences were obtained from 17 different farms and were distributed between 4 separate monophyletic groups and 9 branches upon phylogenetic analysis. Among EIAV strains from abroad, the closest relatives to Slovenian EIAV strains were European EIAV strains from Italy. Phylogenetic analysis also showed that some animals from distantly located farms were most probably infected with the same EIAV strains, as well as animals from the same farm and animals from farms located in the same geographical region. This is the first report of such high genetic diversity of EIAV strains from one country. This led to speculation that there is a potential virus reservoir among the populations of riding horses, horses kept for pleasure and horses for meat production, with some farmers or horse-owners not following legislation, thus enabling the spread of infection with EIAV. The low sensitivity of the agar gel immunodiffusion test may also contribute to the spread of infection with EIAV, because some infected horses might have escaped detection. The results of the phylogenetic analysis also provide additional knowledge about the highly heterogeneous nature of the EIAV genome. © 2015 EVJ Ltd.

  9. Multivariate Analysis of Multiple Datasets: a Practical Guide for Chemical Ecology.

    Science.gov (United States)

    Hervé, Maxime R; Nicolè, Florence; Lê Cao, Kim-Anh

    2018-03-01

    Chemical ecology has strong links with metabolomics, the large-scale study of all metabolites detectable in a biological sample. Consequently, chemical ecologists are often challenged by the statistical analyses of such large datasets. This holds especially true when the purpose is to integrate multiple datasets to obtain a holistic view and a better understanding of a biological system under study. The present article provides a comprehensive resource to analyze such complex datasets using multivariate methods. It starts from the necessary pre-treatment of data including data transformations and distance calculations, to the application of both gold standard and novel multivariate methods for the integration of different omics data. We illustrate the process of analysis along with detailed results interpretations for six issues representative of the different types of biological questions encountered by chemical ecologists. We provide the necessary knowledge and tools with reproducible R codes and chemical-ecological datasets to practice and teach multivariate methods.

  10. A high performance Trichoderma reesei strain that reveals the importance of xylanase III in cellulosic biomass conversion.

    Science.gov (United States)

    Nakazawa, Hikaru; Kawai, Tetsushi; Ida, Noriko; Shida, Yosuke; Shioya, Kouki; Kobayashi, Yoshinori; Okada, Hirofumi; Tani, Shuji; Sumitani, Jun-Ichi; Kawaguchi, Takashi; Morikawa, Yasushi; Ogasawara, Wataru

    2016-01-01

    The ability of the Trichoderma reesei X3AB1 strain enzyme preparations to convert cellulosic biomass into fermentable sugars is enhanced by the replacement of xyn3 by Aspergillus aculeatus β-glucosidase 1 gene (aabg1), as shown in our previous study. However, subsequent experiments using T. reesei extracts supplemented with the glycoside hydrolase (GH) family 10 xylanase III (XYN III) and GH family 11 XYN II showed increased conversion of alkaline-treated cellulosic biomass, which is rich in xylan, underscoring the importance of XYN III. To attain optimal saccharifying potential in T. reesei, we constructed two new strains, C1AB1 and E1AB1, in which aabg1 was expressed heterologously by means of the cbh1 or egl1 promoters, respectively, so that the endogenous XYN III synthesis remained intact. Due to the presence of wild-type xyn3 in T. reesei E1AB1, enzymes prepared from this strain were 20-30% more effective in the saccharification of alkaline-pretreated rice straw than enzyme extracts from X3AB1, and also outperformed recent commercial cellulase preparations. Our results demonstrate the importance of XYN III in the conversion of alkaline-pretreated cellulosic biomass by T. reesei. Copyright © 2015 Elsevier Inc. All rights reserved.

  11. Comparative Genomics of Early-Diverging Brucella Strains Reveals a Novel Lipopolysaccharide Biosynthesis Pathway

    Science.gov (United States)

    Wattam, Alice R.; Inzana, Thomas J.; Williams, Kelly P.; Mane, Shrinivasrao P.; Shukla, Maulik; Almeida, Nalvo F.; Dickerman, Allan W.; Mason, Steven; Moriyón, Ignacio; O’Callaghan, David; Whatmore, Adrian M.; Sobral, Bruno W.; Tiller, Rebekah V.; Hoffmaster, Alex R.; Frace, Michael A.; De Castro, Cristina; Molinaro, Antonio; Boyle, Stephen M.; De, Barun K.; Setubal, João C.

    2012-01-01

    Brucella species are Gram-negative bacteria that infect mammals. Recently, two unusual strains (Brucella inopinata BO1T and B. inopinata-like BO2) have been isolated from human patients, and their similarity to some atypical brucellae isolated from Australian native rodent species was noted. Here we present a phylogenomic analysis of the draft genome sequences of BO1T and BO2 and of the Australian rodent strains 83-13 and NF2653 that shows that they form two groups well separated from the other sequenced Brucella spp. Several important differences were noted. Both BO1T and BO2 did not agglutinate significantly when live or inactivated cells were exposed to monospecific A and M antisera against O-side chain sugars composed of N-formyl-perosamine. While BO1T maintained the genes required to synthesize a typical Brucella O-antigen, BO2 lacked many of these genes but still produced a smooth LPS (lipopolysaccharide). Most missing genes were found in the wbk region involved in O-antigen synthesis in classic smooth Brucella spp. In their place, BO2 carries four genes that other bacteria use for making a rhamnose-based O-antigen. Electrophoretic, immunoblot, and chemical analyses showed that BO2 carries an antigenically different O-antigen made of repeating hexose-rich oligosaccharide units that made the LPS water-soluble, which contrasts with the homopolymeric O-antigen of other smooth brucellae that have a phenol-soluble LPS. The results demonstrate the existence of a group of early-diverging brucellae with traits that depart significantly from those of the Brucella species described thus far. PMID:22930339

  12. Harvard Aging Brain Study : Dataset and accessibility

    NARCIS (Netherlands)

    Dagley, Alexander; LaPoint, Molly; Huijbers, Willem; Hedden, Trey; McLaren, Donald G.; Chatwal, Jasmeer P.; Papp, Kathryn V.; Amariglio, Rebecca E.; Blacker, Deborah; Rentz, Dorene M.; Johnson, Keith A.; Sperling, Reisa A.; Schultz, Aaron P.

    2017-01-01

    The Harvard Aging Brain Study is sharing its data with the global research community. The longitudinal dataset consists of a 284-subject cohort with the following modalities acquired: demographics, clinical assessment, comprehensive neuropsychological testing, clinical biomarkers, and neuroimaging.

  13. Large Scale Flood Risk Analysis using a New Hyper-resolution Population Dataset

    Science.gov (United States)

    Smith, A.; Neal, J. C.; Bates, P. D.; Quinn, N.; Wing, O.

    2017-12-01

    Here we present the first national scale flood risk analyses, using high resolution Facebook Connectivity Lab population data and data from a hyper resolution flood hazard model. In recent years the field of large scale hydraulic modelling has been transformed by new remotely sensed datasets, improved process representation, highly efficient flow algorithms and increases in computational power. These developments have allowed flood risk analysis to be undertaken in previously unmodeled territories and from continental to global scales. Flood risk analyses are typically conducted via the integration of modelled water depths with an exposure dataset. Over large scales and in data poor areas, these exposure data typically take the form of a gridded population dataset, estimating population density using remotely sensed data and/or locally available census data. The local nature of flooding dictates that for robust flood risk analysis to be undertaken both hazard and exposure data should sufficiently resolve local scale features. Global flood frameworks are enabling flood hazard data to be produced at 90 m resolution, resulting in a mismatch with available population datasets, which are typically more coarsely resolved. Moreover, these exposure data are typically focused on urban areas and struggle to represent rural populations. In this study we integrate a new population dataset with a global flood hazard model. The population dataset was produced by the Connectivity Lab at Facebook, providing gridded population data at 5 m resolution, representing a resolution increase over previous countrywide data sets of multiple orders of magnitude. Flood risk analyses undertaken for a number of developing countries are presented, along with a comparison of flood risk analyses undertaken using pre-existing population datasets.
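
    The phrase "integration of modelled water depths with an exposure dataset" can be sketched as a cell-by-cell overlay. The example below is a minimal illustration, not the authors' workflow: it assumes both grids have already been resampled to the same cells (the abstract notes that hazard and population data usually differ in resolution), and the function name, depth threshold and toy values are all hypothetical.

```python
def exposed_population(depth_m, population, threshold_m=0.0):
    """Sum the population in cells whose modelled water depth exceeds
    the threshold. Both inputs are 2-D lists on the same grid (assumed)."""
    if len(depth_m) != len(population):
        raise ValueError("grids must have the same number of rows")
    total = 0.0
    for depth_row, pop_row in zip(depth_m, population):
        if len(depth_row) != len(pop_row):
            raise ValueError("grids must have the same number of columns")
        for depth, people in zip(depth_row, pop_row):
            if depth > threshold_m:
                total += people
    return total


# Toy 2 x 2 grids, invented for illustration.
depth_m = [[0.0, 0.3],
           [1.2, 0.0]]
population = [[10, 20],
              [5, 40]]

print(exposed_population(depth_m, population))        # any flooding
print(exposed_population(depth_m, population, 0.5))   # depth above 0.5 m
```

    Because the result depends on which cells the population is assigned to, a coarse population grid can place people inside or outside the flooded cells incorrectly, which is the resolution mismatch the abstract describes.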

  14. Comparing the accuracy of food outlet datasets in an urban environment

    Directory of Open Access Journals (Sweden)

    Michelle S. Wong

    2017-05-01

    Studies that investigate the relationship between the retail food environment and health outcomes often use geospatial datasets. Prior studies have identified challenges of using the most common data sources. Retail food environment datasets created through academic-government partnership present an alternative, but their validity (retail existence, type, location) has not been assessed yet. In our study, we used ground-truth data to compare the validity of two datasets, a 2015 commercial dataset (InfoUSA) and data collected from 2012 to 2014 through the Maryland Food Systems Mapping Project (MFSMP), an academic-government partnership, on the retail food environment in two low-income, inner city neighbourhoods in Baltimore City. We compared sensitivity and positive predictive value (PPV) of the commercial and academic-government partnership data to ground-truth data for two broad categories of unhealthy food retailers: small food retailers and quick-service restaurants. Ground-truth data was collected in 2015 and analysed in 2016. Compared to the ground-truth data, MFSMP and InfoUSA generally had similar sensitivity that was greater than 85%. MFSMP had higher PPV compared to InfoUSA for both small food retailers (MFSMP: 56.3% vs InfoUSA: 40.7%) and quick-service restaurants (MFSMP: 58.6% vs InfoUSA: 36.4%). We conclude that data from academic-government partnerships like MFSMP might be an attractive alternative option and improvement to relying only on commercial data. Other research institutes or cities might consider efforts to create and maintain such an environmental dataset. Even if these datasets cannot be updated on an annual basis, they are likely more accurate than commercial data.

  15. Comparing the accuracy of food outlet datasets in an urban environment.

    Science.gov (United States)

    Wong, Michelle S; Peyton, Jennifer M; Shields, Timothy M; Curriero, Frank C; Gudzune, Kimberly A

    2017-05-11

    Studies that investigate the relationship between the retail food environment and health outcomes often use geospatial datasets. Prior studies have identified challenges of using the most common data sources. Retail food environment datasets created through academic-government partnership present an alternative, but their validity (retail existence, type, location) has not been assessed yet. In our study, we used ground-truth data to compare the validity of two datasets, a 2015 commercial dataset (InfoUSA) and data collected from 2012 to 2014 through the Maryland Food Systems Mapping Project (MFSMP), an academic-government partnership, on the retail food environment in two low-income, inner city neighbourhoods in Baltimore City. We compared sensitivity and positive predictive value (PPV) of the commercial and academic-government partnership data to ground-truth data for two broad categories of unhealthy food retailers: small food retailers and quick-service restaurants. Ground-truth data was collected in 2015 and analysed in 2016. Compared to the ground-truth data, MFSMP and InfoUSA generally had similar sensitivity that was greater than 85%. MFSMP had higher PPV compared to InfoUSA for both small food retailers (MFSMP: 56.3% vs InfoUSA: 40.7%) and quick-service restaurants (MFSMP: 58.6% vs InfoUSA: 36.4%). We conclude that data from academic-government partnerships like MFSMP might be an attractive alternative option and improvement to relying only on commercial data. Other research institutes or cities might consider efforts to create and maintain such an environmental dataset. Even if these datasets cannot be updated on an annual basis, they are likely more accurate than commercial data.
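
    The two validity measures used in this study have standard definitions: sensitivity is the share of ground-truth outlets that the dataset lists, and PPV is the share of listed outlets that are confirmed on the ground. The sketch below computes both from sets of outlet identifiers. The function name and the toy identifiers are hypothetical, and the sketch assumes outlets have already been matched between the dataset and the ground-truth survey, a step the abstract does not detail.

```python
def validity_metrics(listed, ground_truth):
    """Return (sensitivity, ppv) for a dataset's outlet listing
    compared with ground-truth observations."""
    listed, truth = set(listed), set(ground_truth)
    true_pos = len(listed & truth)    # listed and actually present
    false_pos = len(listed - truth)   # listed but not found on the ground
    false_neg = len(truth - listed)   # present but missing from the dataset
    sensitivity = (true_pos / (true_pos + false_neg)
                   if true_pos + false_neg else float("nan"))
    ppv = (true_pos / (true_pos + false_pos)
           if true_pos + false_pos else float("nan"))
    return sensitivity, ppv


# Toy identifiers, invented for illustration.
listed = {"store_a", "store_b", "store_c", "store_d"}
ground_truth = {"store_a", "store_b", "store_e"}

sensitivity, ppv = validity_metrics(listed, ground_truth)
print(f"sensitivity = {sensitivity:.3f}, PPV = {ppv:.3f}")
```

    A dataset can therefore score well on sensitivity while scoring poorly on PPV if it contains many outlets that have closed or were never there, which matches the pattern reported for both datasets above.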

  16. Informal eldercare and work-related strain.

    Science.gov (United States)

    Trukeschitz, Birgit; Schneider, Ulrike; Mühlmann, Richard; Ponocny, Ivo

    2013-03-01

    In light of an aging workforce, reconciling informal eldercare and paid work becomes increasingly pertinent. This article investigates the association between informal eldercare and work-related strain and tests for both the "competing demands" and "expansion" hypotheses. The sample of 938 Austrian employees consisted of employees caring for older relatives and a control group of employees without eldercare obligations. We ran a Tobit regression model on work-related strain with different measures of informal eldercare as explanatory variables and controls for both personal and workplace characteristics. Accounting for different characteristics of eldercare within one estimation model revealed that informal eldercare was associated with work-related strain in 2 ways, that is, it increased with both care hours and subjective care burden. However, after controlling for these burdensome attributes of eldercare, the carer status as such was found to be negatively associated with work-related strain. In addition and independently of care commitments, work-related factors, such as advanced skills and job motivation, reduced work-related strain. This article lends support to both the "competing demands" and the "expansion" hypotheses. Commitment to eldercare can enhance work-related outcomes but entails work-related problems if care burden and time demands of eldercare are substantial. Thus, workers with eldercare responsibilities cannot be considered less productive from the outset. An individual assessment of their situation, considering the care and work setting, is required. Findings from this study support the design of workplace initiatives to uphold workers' productivity in general and bring specific attention to policies alleviating workers' eldercare burden.

  17. Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling

    Directory of Open Access Journals (Sweden)

    H. E. Beck

    2017-12-01

    We undertook a comprehensive evaluation of 22 gridded (quasi-)global (sub-)daily precipitation (P) datasets for the period 2000–2016. Thirteen non-gauge-corrected P datasets were evaluated using daily P gauge observations from 76 086 gauges worldwide. Another nine gauge-corrected datasets were evaluated using hydrological modeling, by calibrating the HBV conceptual model against streamflow records for each of 9053 small to medium-sized (< 50 000 km2) catchments worldwide, and comparing the resulting performance. Marked differences in spatio-temporal patterns and accuracy were found among the datasets. Among the uncorrected P datasets, the satellite- and reanalysis-based MSWEP-ng V1.2 and V2.0 datasets generally showed the best temporal correlations with the gauge observations, followed by the reanalyses (ERA-Interim, JRA-55, and NCEP-CFSR) and the satellite- and reanalysis-based CHIRP V2.0 dataset, the estimates based primarily on passive microwave remote sensing of rainfall (CMORPH V1.0, GSMaP V5/6, and TMPA 3B42RT V7) or near-surface soil moisture (SM2RAIN-ASCAT), and finally, estimates based primarily on thermal infrared imagery (GridSat V1.0, PERSIANN, and PERSIANN-CCS). Two of the three reanalyses (ERA-Interim and JRA-55) unexpectedly obtained lower trend errors than the satellite datasets. Among the corrected P datasets, the ones directly incorporating daily gauge data (CPC Unified, and MSWEP V1.2 and V2.0) generally provided the best calibration scores, although the good performance of the fully gauge-based CPC Unified is unlikely to translate to sparsely or ungauged regions. Next best results were obtained with P estimates directly incorporating temporally coarser gauge data (CHIRPS V2.0, GPCP-1DD V1.2, TMPA 3B42 V7, and WFDEI-CRU), which in turn outperformed the one indirectly incorporating gauge data through another multi-source dataset (PERSIANN-CDR V1R1). Our results highlight large differences in estimation accuracy

  18. Creation of the Naturalistic Engagement in Secondary Tasks (NEST) distracted driving dataset.

    Science.gov (United States)

    Owens, Justin M; Angell, Linda; Hankey, Jonathan M; Foley, James; Ebe, Kazutoshi

    2015-09-01

    Distracted driving has become a topic of critical importance to driving safety research over the past several decades. Naturalistic driving data offer a unique opportunity to study how drivers engage with secondary tasks in real-world driving; however, the complexities involved with identifying and coding relevant epochs of naturalistic data have limited its accessibility to the general research community. This project was developed to help address this problem by creating an accessible dataset of driver behavior and situational factors observed during distraction-related safety-critical events and baseline driving epochs, using the Strategic Highway Research Program 2 (SHRP2) naturalistic dataset. The new NEST (Naturalistic Engagement in Secondary Tasks) dataset was created using crashes and near-crashes from the SHRP2 dataset that were identified as including secondary task engagement as a potential contributing factor. Data coding included frame-by-frame video analysis of secondary task and hands-on-wheel activity, as well as summary event information. In addition, information about each secondary task engagement within the trip prior to the crash/near-crash was coded at a higher level. Data were also coded for four baseline epochs and trips per safety-critical event. 1,180 events and baseline epochs were coded, and a dataset was constructed. The project team is currently working to determine the most useful way to allow broad public access to the dataset. We anticipate that the NEST dataset will be extraordinarily useful in allowing qualified researchers access to timely, real-world data concerning how drivers interact with secondary tasks during safety-critical events and baseline driving. The coded dataset developed for this project will allow future researchers to have access to detailed data on driver secondary task engagement in the real world. It will be useful for standalone research, as well as for integration with additional SHRP2 data to enable the

  19. XRD and spectral dataset of the UV-A stable nanotubes of 3,5-bis(trifluoromethyl)benzylamine derivative of tyrosine.

    Science.gov (United States)

    Govindhan, R; Karthikeyan, B

    2017-10-01

    The data presented in this article are related to research on UV-A stable nanotubes. The nanotubes have been prepared from a 3,5-bis(trifluoromethyl)benzylamine derivative of tyrosine (BTTP). XRD data reveal the size of the nanotubes. As-synthesized nanotubes (BTTPNTs) are characterized by UV-vis optical absorption studies [1] and photophysical degradation kinetics. The resulting dataset is made available to enable critical or extended analyses of the BTTPNTs as excellent light-resistant materials.

  20. Digital tissue and what it may reveal about the brain.

    Science.gov (United States)

    Morgan, Josh L; Lichtman, Jeff W

    2017-10-30

    Imaging as a means of scientific data storage has evolved rapidly over the past century from hand drawings, to photography, to digital images. Only recently can sufficiently large datasets be acquired, stored, and processed such that tissue digitization can actually reveal more than direct observation of tissue. One field where this transformation is occurring is connectomics: the mapping of neural connections in large volumes of digitized brain tissue.

  1. Effect of cerulenin on fatty acid composition and gene expression pattern of DHA-producing strain Colwellia psychrerythraea strain 34H.

    Science.gov (United States)

    Wan, Xia; Peng, Yun-Feng; Zhou, Xue-Rong; Gong, Yang-Min; Huang, Feng-Hong; Moncalián, Gabriel

    2016-02-06

    Colwellia psychrerythraea 34H is a psychrophilic bacterium able to produce docosahexaenoic acid (DHA). The polyketide synthase pathway is assumed to be responsible for DHA production in marine bacteria. Five pfa genes from strain 34H were confirmed to be responsible for DHA formation by heterologous expression in Escherichia coli. The complexity of the fatty acid profile of this strain was revealed by GC and GC-MS. Treatment of cells with cerulenin resulted in significantly reduced levels of C16 monounsaturated fatty acids (C16:1(Δ9t), C16:1(Δ7)). In contrast, the amounts of saturated fatty acids (C10:0, C12:0, C14:0), hydroxyl fatty acids (3-OH C10:0 and 3-OH C12:0), as well as C20:4ω3, C20:5ω3 and C22:6ω3 were increased. RNA sequencing (RNA-Seq) revealed an altered gene expression pattern when C. psychrerythraea cells were treated with cerulenin. Genes involved in the polyketide synthase pathway and the fatty acid biosynthesis pathway were not obviously affected by cerulenin treatment. In contrast, the transcript levels of several genes involved in the fatty acid degradation or β-oxidation pathway were dramatically reduced. The genes responsible for DHA formation in C. psychrerythraea were cloned and characterized for the first time. We revealed the complexity of the fatty acid profile in this DHA-producing strain. Cerulenin could substantially change the fatty acid composition by affecting fatty acid degradation at the transcriptional level. The acyl-CoA dehydrogenase gene family involved in the first step of the β-oxidation pathway may be important to the selectivity of degraded fatty acids. In addition, inhibition of FabB protein by cerulenin may lead to the accumulation of malonyl-CoA, which is the substrate for DHA formation.

  2. A multimodal dataset for authoring and editing multimedia content: The MAMEM project

    Directory of Open Access Journals (Sweden)

    Spiros Nikolopoulos

    2017-12-01

    We present a dataset that combines multimodal biosignals and eye tracking information gathered under a human-computer interaction framework. The dataset was developed in the vein of the MAMEM project that aims to endow people with motor disabilities with the ability to edit and author multimedia content through mental commands and gaze activity. The dataset includes EEG, eye-tracking, and physiological (GSR and heart rate) signals collected from 34 individuals (18 able-bodied and 16 motor-impaired). Data were collected during interaction with a specifically designed interface for web browsing and multimedia content manipulation and during imaginary movement tasks. The presented dataset will contribute towards the development and evaluation of modern human-computer interaction systems that would foster the integration of people with severe motor impairments back into society.

  3. Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Metadata, Usage Metrics, and User Feedback to Improve Data Discovery and Access

    Data.gov (United States)

    National Aeronautics and Space Administration — We propose to mine and utilize the combination of Earth Science dataset, metadata with usage metrics and user feedback to objectively extract relevance for improved...

  4. Strain-specific diversity of mucus-binding proteins in the adhesion and aggregation properties of Lactobacillus reuteri.

    Science.gov (United States)

    Mackenzie, Donald A; Jeffers, Faye; Parker, Mary L; Vibert-Vallet, Amandine; Bongaerts, Roy J; Roos, Stefan; Walter, Jens; Juge, Nathalie

    2010-11-01

    Mucus-binding proteins (MUBs) have been revealed as one of the effector molecules involved in mechanisms of the adherence of lactobacilli to the host; mub, or mub-like, genes are found in all of the six genomes of Lactobacillus reuteri that are available. We recently reported the crystal structure of a Mub repeat from L. reuteri ATCC 53608 (also designated strain 1063), revealing an unexpected recognition of immunoglobulins. In the current study, we explored the diversity of the ATCC 53608 mub gene, and MUB expression levels in a large collection of L. reuteri strains isolated from a range of vertebrate hosts. This analysis revealed that the MUB was only detectable on the cell surface of two highly related isolates when using antibodies that were raised against the protein. There was considerable variation in quantitative mucus adhesion in vitro among L. reuteri strains, and mucus binding showed excellent correlation with the presence of cell-surface ATCC 53608 MUB. ATCC 53608 MUB presence was further highly associated with the autoaggregation of L. reuteri strains in washed cell suspensions, suggesting a novel role of this surface protein in cell aggregation. We also characterized MUB expression in representative L. reuteri strains. This analysis revealed that one derivative of strain 1063 was a spontaneous mutant that expressed a C-terminally truncated version of MUB. This frameshift mutation was caused by the insertion of a duplicated 13 nt sequence at position 4867 nt in the mub gene, producing a truncated MUB also lacking the C-terminal LPxTG region, and thus unable to anchor to the cell wall. This mutant, designated 1063N (mub-4867(i)), displayed low mucus-binding and aggregation capacities, further providing evidence for the contribution of cell-wall-anchored MUB to such phenotypes. In conclusion, this study provided novel information on the functional attributes of MUB in L. reuteri, and further demonstrated that MUB and MUB-like proteins

  5. An integrated dataset for in silico drug discovery

    Directory of Open Access Journals (Sweden)

    Cockell Simon J

    2010-12-01

    Drug development is expensive and prone to failure. It is potentially much less risky and expensive to reuse a drug developed for one condition for treating a second disease, than it is to develop an entirely new compound. Systematic approaches to drug repositioning are needed to increase throughput and find candidates more reliably. Here we address this need with an integrated systems biology dataset, developed using the Ondex data integration platform, for the in silico discovery of new drug repositioning candidates. We demonstrate that the information in this dataset allows known repositioning examples to be discovered. We also propose a means of automating the search for new treatment indications of existing compounds.

  6. Probabilistic and machine learning-based retrieval approaches for biomedical dataset retrieval

    Science.gov (United States)

    Karisani, Payam; Qin, Zhaohui S; Agichtein, Eugene

    2018-01-01

    The bioCADDIE dataset retrieval challenge brought together different approaches to retrieval of biomedical datasets relevant to a user’s query, expressed as a text description of a needed dataset. We describe experiments in applying a data-driven, machine learning-based approach to biomedical dataset retrieval as part of this challenge. We report on a series of experiments carried out to evaluate the performance of both probabilistic and machine learning-driven techniques from information retrieval, as applied to this challenge. Our experiments with probabilistic information retrieval methods, such as query term weight optimization, automatic query expansion and simulated user relevance feedback, demonstrate that automatically boosting the weights of important keywords in a verbose query is more effective than other methods. We also show that although there is a rich space of potential representations and features available in this domain, machine learning-based re-ranking models are not able to improve on probabilistic information retrieval techniques with the currently available training data. The models and algorithms presented in this paper can serve as a viable implementation of a search engine to provide access to biomedical datasets. The retrieval performance is expected to be further improved by using additional training data that is created by expert annotation, or gathered through usage logs, clicks and other processes during natural operation of the system. Database URL: https://github.com/emory-irlab/biocaddie

  7. An innovative privacy preserving technique for incremental datasets on cloud computing.

    Science.gov (United States)

    Aldeen, Yousra Abdul Alsahib S; Salleh, Mazleena; Aljeroudi, Yazan

    2016-08-01

    Cloud computing (CC) is a service-based delivery model offering large-scale computer processing power and data storage across connected communications channels. It has given strong technological impetus to the internet-mediated IT industry, where users can easily share private data for further analysis and mining. Furthermore, user-friendly CC services enable the economical deployment of sundry applications. Meanwhile, simple data sharing has invited various phishing attacks and malware-assisted security threats. Some privacy-sensitive applications, such as cloud-based health services that offer several economic and operational benefits, necessitate enhanced security. Thus, strong cyberspace security and mitigation against phishing attacks have become mandatory to protect overall data privacy. Typically, the datasets of diverse applications are anonymized to give owners better privacy, without meeting all secrecy requirements for newly added records. Some proposed techniques address this issue by re-anonymizing the datasets from scratch. The utmost privacy protection over incremental datasets on CC is far from being achieved. Certainly, the distribution of huge dataset volumes across multiple storage nodes limits privacy preservation. In view of this, we propose a new anonymization technique to attain better privacy protection with high data utility over distributed and incremental datasets on CC. The proficiency of data privacy preservation and improved confidentiality requirements is demonstrated through performance evaluation. Copyright © 2016 Elsevier Inc. All rights reserved.

  8. TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild

    KAUST Repository

    Müller, Matthias; Bibi, Adel Aamer; Giancola, Silvio; Al-Subaihi, Salman; Ghanem, Bernard

    2018-01-01

    Despite the numerous developments in object tracking, further development of current tracking algorithms is limited by small and mostly saturated datasets. As a matter of fact, data-hungry trackers based on deep-learning currently rely on object detection datasets due to the scarcity of dedicated large-scale tracking datasets. In this work, we present TrackingNet, the first large-scale dataset and benchmark for object tracking in the wild. We provide more than 30K videos with more than 14 million dense bounding box annotations. Our dataset covers a wide selection of object classes in broad and diverse context. By releasing such a large-scale dataset, we expect deep trackers to further improve and generalize. In addition, we introduce a new benchmark composed of 500 novel videos, modeled with a distribution similar to our training dataset. By sequestering the annotation of the test set and providing an online evaluation server, we provide a fair benchmark for future development of object trackers. Deep trackers fine-tuned on a fraction of our dataset improve their performance by up to 1.6% on OTB100 and up to 1.7% on TrackingNet Test. We provide an extensive benchmark on TrackingNet by evaluating more than 20 trackers. Our results suggest that object tracking in the wild is far from being solved.

  9. TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild

    KAUST Repository

    Müller, Matthias

    2018-03-28

    Despite the numerous developments in object tracking, further development of current tracking algorithms is limited by small and mostly saturated datasets. As a matter of fact, data-hungry trackers based on deep-learning currently rely on object detection datasets due to the scarcity of dedicated large-scale tracking datasets. In this work, we present TrackingNet, the first large-scale dataset and benchmark for object tracking in the wild. We provide more than 30K videos with more than 14 million dense bounding box annotations. Our dataset covers a wide selection of object classes in broad and diverse context. By releasing such a large-scale dataset, we expect deep trackers to further improve and generalize. In addition, we introduce a new benchmark composed of 500 novel videos, modeled with a distribution similar to our training dataset. By sequestering the annotation of the test set and providing an online evaluation server, we provide a fair benchmark for future development of object trackers. Deep trackers fine-tuned on a fraction of our dataset improve their performance by up to 1.6% on OTB100 and up to 1.7% on TrackingNet Test. We provide an extensive benchmark on TrackingNet by evaluating more than 20 trackers. Our results suggest that object tracking in the wild is far from being solved.

  10. A microdot multilayer oxide device: let us tune the strain-ionic transport interaction.

    Science.gov (United States)

    Schweiger, Sebastian; Kubicek, Markus; Messerschmitt, Felix; Murer, Christoph; Rupp, Jennifer L M

    2014-05-27

    In this paper, we present a strategy to use interfacial strain in multilayer heterostructures to tune their resistive response and ionic transport as active component in an oxide-based multilayer microdot device on chip. For this, fabrication of strained multilayer microdot devices with sideways attached electrodes is reported with the material system Gd0.1Ce0.9O(2-δ)/Er2O3. The fast ionic conducting Gd0.1Ce0.9O(2-δ) single layers are altered in lattice strain by the electrically insulating erbia phases of a microdot. The strain activated volume of the Gd0.1Ce0.9O(2-δ) is investigated by changing the number of individual layers from 1 to 60 while keeping the microdot at a constant thickness; i.e., the proportion of strained volume was systematically varied. Electrical measurements showed that the activation energy of the devices could be altered by Δ0.31 eV by changing the compressive strain of a microdot ceria-based phase by more than 1.16%. The electrical conductivity data is analyzed and interpreted with a strain volume model and defect thermodynamics. Additionally, an equivalent circuit model is presented for sideways contacted multilayer microdots. We give a proof-of-concept for microdot contacting to capture real strain-ionic transport effects and reveal that for classic top-electrode contacting the effect is nil, highlighting the need for sideways electric contacting on a nanoscopic scale. The near order ionic transport interaction is supported by Raman spectroscopy measurements. These were conducted and analyzed together with fully relaxed single thin film samples. Strain states are described relative to the strain activated volumes of Gd0.1Ce0.9O(2-δ) in the microdot multilayer. These findings reveal that strain engineering in microfabricated devices allows altering the ionic conduction over a wide range beyond classic doping strategies for single films. The reported fabrication route and concept of strained multilayer microdots is a promising path

  11. Development of new strains and related SCAR markers for an edible mushroom, Hypsizygus marmoreus.

    Science.gov (United States)

    Lee, Chang Y; Park, Jeong-Eun; Lee, Jia; Kim, Jong-Kuk; Ro, Hyeon-Su

    2012-02-01

    New fast-growing and less bitter varieties of Hypsizygus marmoreus were developed by crossing monokaryotic mycelia from a commercial strain (Hm1-1) and a wild strain (Hm3-10). Six of the better tasting new strains with a shorter cultivation period were selected from 400 crosses in a large-scale cultivation experiment. We attempted to develop sequence characterized amplified region (SCAR) markers to identify the new strain from other commercial strains. For the SCAR markers, we conducted molecular genetic analysis on a wild strain and the eight most cultivated H. marmoreus strains collected from various areas in East Asia by randomly amplified polymorphic DNA. Ten unique DNA bands for a commercial Hm1-1 strain and the Hm3-10 strain were extracted and their sequences were determined. Primer sets were designed based on the determined sequences. PCR reactions with the primer sets revealed that four primer sets successfully discriminated the new strains from other commercial strains and are thus suitable for commercial purposes. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  12. Comparative Genomics of Mycoplasma bovis Strains Reveals That Decreased Virulence with Increasing Passages Might Correlate with Potential Virulence-Related Factors

    Directory of Open Access Journals (Sweden)

    Muhammad A. Rasheed

    2017-05-01

    Mycoplasma bovis is an important cause of bovine respiratory disease worldwide. To understand its virulence mechanisms, we sequenced three attenuated M. bovis strains, P115, P150, and P180, which were passaged in vitro 115, 150, and 180 times, respectively, and exhibited progressively decreasing virulence. Comparative genomics was performed among the wild-type M. bovis HB0801 (P1) strain and the P115, P150, and P180 strains, and one 14.2-kb deleted region covering 14 genes was detected in the passaged strains. Additionally, 46 non-sense single-nucleotide polymorphisms and indels were detected, which confirmed that more passages result in more mutations. A subsequent collective bioinformatics analysis of paralogs, metabolic pathways, protein-protein interactions, secretory proteins, functionally conserved domains, and virulence-related factors identified 11 genes that likely contributed to the increased attenuation in the passaged strains. These genes encode ascorbate-specific phosphotransferase system enzyme IIB and IIA components, enolase, L-lactate dehydrogenase, pyruvate kinase, glycerol, and multiple sugar ATP-binding cassette transporters, ATP binding proteins, NADH dehydrogenase, phosphate acetyltransferase, transketolase, and a variable surface protein. Fifteen genes were shown to be enriched in 15 metabolic pathways, and they included the aforementioned genes encoding pyruvate kinase, transketolase, enolase, and L-lactate dehydrogenase. Hydrogen peroxide (H2O2) production in M. bovis strains representing seven passages from P1 to P180 decreased progressively with increasing numbers of passages and increased attenuation. However, eight mutants specific to eight individual genes within the 14.2-kb deleted region did not exhibit altered H2O2 production. These results enrich the M. bovis genomics database, and they increase our understanding of the mechanisms underlying M. bovis virulence.

  13. Parton Distributions based on a Maximally Consistent Dataset

    Science.gov (United States)

    Rojo, Juan

    2016-04-01

    The choice of data that enters a global QCD analysis can have a substantial impact on the resulting parton distributions and their predictions for collider observables. One of the main reasons for this has to do with the possible presence of inconsistencies, either internal within an experiment or external between different experiments. In order to assess the robustness of the global fit, different definitions of a conservative PDF set, that is, a PDF set based on a maximally consistent dataset, have been introduced. However, these approaches are typically affected by theory biases in the selection of the dataset. In this contribution, after a brief overview of recent NNPDF developments, we propose a new, fully objective, definition of a conservative PDF set, based on the Bayesian reweighting approach. Using the new NNPDF3.0 framework, we produce various conservative sets, which turn out to be mutually in agreement within the respective PDF uncertainties, as well as with the global fit. We explore some of their implications for LHC phenomenology, finding also good consistency with the global fit result. These results provide a non-trivial validation test of the new NNPDF3.0 fitting methodology, and indicate that possible inconsistencies in the fitted dataset do not affect substantially the global fit PDFs.

  14. Decoys Selection in Benchmarking Datasets: Overview and Perspectives

    Science.gov (United States)

    Réau, Manon; Langenfeld, Florent; Zagury, Jean-François; Lagarde, Nathalie; Montes, Matthieu

    2018-01-01

    Virtual Screening (VS) is designed to prospectively help identify potential hits, i.e., compounds capable of interacting with a given target and potentially modulating its activity, out of large compound collections. Among the variety of methodologies, it is crucial to select the protocol that is best adapted to the query/target system under study and that yields the most reliable output. To this end, the performance of VS methods is commonly evaluated and compared by computing their ability to retrieve active compounds in benchmarking datasets. The benchmarking datasets contain a subset of known active compounds together with a subset of decoys, i.e., assumed non-active molecules. The composition of both the active and the decoy compound subsets is critical to limit the biases in the evaluation of the VS methods. In this review, we focus on the selection of decoy compounds, which has changed considerably over the years, from randomly selected compounds to highly customized or experimentally validated negative compounds. We first outline the evolution of decoy selection in benchmarking databases as well as current benchmarking databases that tend to minimize the introduction of biases, and secondly, we propose recommendations for the selection and the design of benchmarking datasets. PMID:29416509

  15. Multiresolution persistent homology for excessively large biomolecular datasets

    Energy Technology Data Exchange (ETDEWEB)

    Xia, Kelin; Zhao, Zhixiong [Department of Mathematics, Michigan State University, East Lansing, Michigan 48824 (United States); Wei, Guo-Wei, E-mail: wei@math.msu.edu [Department of Mathematics, Michigan State University, East Lansing, Michigan 48824 (United States); Department of Electrical and Computer Engineering, Michigan State University, East Lansing, Michigan 48824 (United States); Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824 (United States)

    2015-10-07

    Although persistent homology has emerged as a promising tool for the topological simplification of complex data, it is computationally intractable for large datasets. We introduce multiresolution persistent homology to handle excessively large datasets. We match the resolution with the scale of interest so as to represent large scale datasets with appropriate resolution. We utilize flexibility-rigidity index to access the topological connectivity of the data set and define a rigidity density for the filtration analysis. By appropriately tuning the resolution of the rigidity density, we are able to focus the topological lens on the scale of interest. The proposed multiresolution topological analysis is validated by a hexagonal fractal image which has three distinct scales. We further demonstrate the proposed method for extracting topological fingerprints from DNA molecules. In particular, the topological persistence of a virus capsid with 273 780 atoms is successfully analyzed which would otherwise be inaccessible to the normal point cloud method and unreliable by using coarse-grained multiscale persistent homology. The proposed method has also been successfully applied to the protein domain classification, which is the first time that persistent homology is used for practical protein domain analysis, to our knowledge. The proposed multiresolution topological method has potential applications in arbitrary data sets, such as social networks, biological networks, and graphs.

  16. Occidiofungin is an important component responsible for the antifungal activity of Burkholderia pyrrocinia strain Lyc2.

    Science.gov (United States)

    Wang, X Q; Liu, A X; Guerrero, A; Liu, J; Yu, X Q; Deng, P; Ma, L; Baird, S M; Smith, L; Li, X D; Lu, S E

    2016-03-01

    To identify the taxonomy of the tobacco rhizosphere-isolated strain Lyc2 and investigate the mechanisms of its antifungal activities, focusing on the identification and functional analysis of antimicrobial gene clusters. Multilocus sequence typing and 16S rRNA analyses indicated that strain Lyc2 belongs to Burkholderia pyrrocinia. Bioassay results indicated that strain Lyc2 showed significant antifungal activities against a broad range of plant and animal fungal pathogens and control efficacy against seedling damping-off disease of cotton. A 55.2-kb gene cluster homologous to the ocf gene cluster in Burkholderia contaminans MS14 was confirmed by random mutagenesis to be responsible for the antifungal activities; HPLC was used to verify the production of antifungal compounds. Multiple antibiotic and secondary metabolite biosynthesis gene clusters predicted by antiSMASH revealed the broad spectrum of antimicrobial activities of the strain. Our results revealed the mechanisms of the antifungal activities of strain Lyc2 and expand our knowledge of the production of occidiofungin in Burkholderia. Understanding the mechanisms of the antifungal activities of strain Lyc2 has contributed to the discovery of new antibiotics and expands our knowledge of the production of occidiofungin in Burkholderia. © 2015 The Society for Applied Microbiology.

  17. Technical and economical incentives behind strain limitation in piping

    International Nuclear Information System (INIS)

    Koch, E.

    1986-01-01

    An inspection of conventional industrial plants subsequent to severe earthquakes showed that little or nothing had been damaged in these plants, although, assessed against the codes and standards for nuclear power plants, they were not designed to withstand them. Besides a great deal of conservatism, an analysis revealed that structures subject to plasticization exhibit a much more favourable behaviour than anticipated on the basis of the design calculations. Thus, the introduction of the strain limitation approach promised service and cost advantages, above all for the expensive nuclear power plant piping. However, at the current state of the art it is possible to design piping sufficiently flexible to obtain satisfactory operational stresses. A cost analysis showed that, on the basis of today's dimensioning regulations, strain limitation is only economic in special cases. Strain limitation is nevertheless the adequate procedure in terms of engineering in those cases where today safeguarding against accidents is based on stresses of Stress Category D. It is therefore recommended to develop rules for admissible strains and economic methods for strain assessment. The efforts and expense should, however, be in line with the economic benefits. (orig.)

  18. The core proteome and pan proteome of Salmonella Paratyphi A epidemic strains.

    Directory of Open Access Journals (Sweden)

    Li Zhang

    Comparative proteomics of multiple strains within the same species can reveal the genetic variation and relationships among strains without the need to assess the genomic data. As in comparative genomics, a core proteome and a pan proteome can be obtained for multiple strains grown under the same culture conditions. In this study we present the core proteome and pan proteome of four epidemic Salmonella Paratyphi A strains cultured under laboratory conditions. The proteomic information was obtained using a two-dimensional gel electrophoresis (2-DE) technique. The expression profiles of these strains were conserved, consistent with the monomorphic genome of S. Paratyphi A. Few strain-specific proteins were found in these strains. Interestingly, non-core proteins were found in similar categories as core proteins. However, significant fluctuations in the abundance of some core proteins were also observed, suggesting that there is elaborate regulation of core proteins in the different strains even when they are cultured in the same environment. Therefore, core proteome and pan proteome analysis of multiple strains can demonstrate the core metabolic pathways of the species under specific culture conditions, as well as the specific responses and adaptations of the strains to the growth environment.
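
    As a rough illustration of the terms used in this abstract, the core proteome can be read as the set of proteins detected in every strain and the pan proteome as the set detected in at least one. The sketch below uses plain set operations in Python. The strain names and protein identifiers are hypothetical placeholders, not data from the study, and the study's own 2-DE spot-matching procedure is not reproduced here.

```python
# Hypothetical detected-protein sets per strain (illustrative only,
# not taken from the S. Paratyphi A study).
strain_proteins = {
    "strain_A": {"P1", "P2", "P3", "P4"},
    "strain_B": {"P1", "P2", "P3", "P5"},
    "strain_C": {"P1", "P2", "P4", "P5", "P6"},
    "strain_D": {"P1", "P2", "P3"},
}


def core_and_pan(proteomes):
    """Return (core, pan): proteins found in all strains / in any strain."""
    sets = list(proteomes.values())
    if not sets:
        return set(), set()
    core = set.intersection(*sets)
    pan = set().union(*sets)
    return core, pan


def strain_specific(proteomes):
    """Map each strain to the proteins detected in that strain only."""
    result = {}
    for name, proteins in proteomes.items():
        others = [p for other, p in proteomes.items() if other != name]
        result[name] = proteins - set().union(*others)
    return result


core, pan = core_and_pan(strain_proteins)
non_core = pan - core  # proteins present in some but not all strains
specific = strain_specific(strain_proteins)

print("core:", sorted(core))
print("pan:", sorted(pan))
print("non-core:", sorted(non_core))
print("strain-specific:", {k: sorted(v) for k, v in specific.items()})
```

    Presence or absence is only part of the picture the abstract describes: abundance differences among core proteins would need quantitative values per strain rather than sets.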

  19. Comparative Phosphoproteomics Reveals the Role of AmpC β-lactamase Phosphorylation in the Clinical Imipenem-resistant Strain Acinetobacter baumannii SK17.

    Science.gov (United States)

    Lai, Juo-Hsin; Yang, Jhih-Tian; Chern, Jeffy; Chen, Te-Li; Wu, Wan-Ling; Liao, Jiahn-Haur; Tsai, Shih-Feng; Liang, Suh-Yuen; Chou, Chi-Chi; Wu, Shih-Hsiung

    2016-01-01

    Nosocomial infectious outbreaks caused by multidrug-resistant Acinetobacter baumannii have emerged as a serious threat to human health. Phosphoproteomics of pathogenic bacteria has been used to identify the mechanisms of bacterial virulence and antimicrobial resistance. In this study, we used a shotgun strategy combined with high-accuracy mass spectrometry to analyze the phosphoproteomics of the imipenem-susceptible strain SK17-S and -resistant strain SK17-R. We identified 410 phosphosites on 248 unique phosphoproteins in SK17-S and 285 phosphosites on 211 unique phosphoproteins in SK17-R. The distributions of the Ser/Thr/Tyr/Asp/His phosphosites in SK17-S and SK17-R were 47.0%/27.6%/12.4%/8.0%/4.9% versus 41.4%/29.5%/17.5%/6.7%/4.9%, respectively. The Ser-90 phosphosite, located on the catalytic motif S(88)VS(90)K of the AmpC β-lactamase, was first identified in SK17-S. Based on site-directed mutagenesis, the nonphosphorylatable mutant S90A was found to be more resistant to imipenem, whereas the phosphorylation-simulated mutant S90D was sensitive to imipenem. Additionally, the S90A mutant protein exhibited higher β-lactamase activity and conferred greater bacterial protection against imipenem in SK17-S compared with the wild-type. In sum, our results revealed that in A. baumannii, Ser-90 phosphorylation of AmpC negatively regulates both β-lactamase activity and the ability to counteract the antibiotic effects of imipenem. These findings highlight the impact of phosphorylation-mediated regulation in antibiotic-resistant bacteria on future drug design and new therapies. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  20. Cross-Cultural Concept Mapping of Standardized Datasets

    DEFF Research Database (Denmark)

    Kano Glückstad, Fumiko

    2012-01-01

    This work compares four feature-based similarity measures derived from cognitive sciences. The purpose of the comparative analysis is to verify the potentially most effective model that can be applied for mapping independent ontologies in a culturally influenced domain [1]. Here, datasets based...