WorldWideScience

Sample records for ukidss las datasets

  1. New ultracool subdwarfs identified in large-scale surveys using Virtual Observatory tools. I. UKIDSS LAS DR5 vs. SDSS DR7

    Science.gov (United States)

    Lodieu, N.; Espinoza Contreras, M.; Zapatero Osorio, M. R.; Solano, E.; Aberasturi, M.; Martín, E. L.

    2012-06-01

    Aims: The aim of the project is to improve our knowledge of the low-mass and low-metallicity population to investigate the influence of metallicity on the stellar (and substellar) mass function. Methods: We present the results of a photometric and proper motion search aimed at discovering ultracool subdwarfs in large-scale surveys. We employed and combined the Fifth Data Release (DR5) of the UKIRT Infrared Deep Sky Survey (UKIDSS) Large Area Survey (LAS) and the Sloan Digital Sky Survey (SDSS) Data Release 7 complemented with ancillary data from the Two Micron All-Sky Survey (2MASS), the DEep Near-Infrared Survey (DENIS) and the SuperCOSMOS Sky Surveys (SSS). Results: The SDSS DR7 vs. UKIDSS LAS DR5 search returned a total of 32 ultracool subdwarf candidates, only two of which are recognised as a subdwarf in the literature. Twenty-seven candidates, including the two known ones, were followed-up spectroscopically in the optical between 600 and 1000 nm, thus covering strong spectral features indicative of low metallicity (e.g., CaH), 21 with the Very Large Telescope, one with the Nordic Optical Telescope, and five were extracted from the Sloan spectroscopic database to assess (or refute) their low-metal content. We confirm 20 candidates as subdwarfs, extreme subdwarfs, or ultra-subdwarfs with spectral types later than M5; this represents a success rate of ≥ 60%. Among those 20 new subdwarfs, we identify two early-L subdwarfs that are very likely located within 100 pc, which we propose as templates for future searches because they are the first examples of their subclass. Another seven sources are solar-metallicity M dwarfs with spectral types between M4 and M7 without Hα emission, suggesting that they are old M dwarfs. The remaining five candidates do not have spectroscopic follow-up yet; only one remains as a bona-fide ultracool subdwarf after revision of their proper motions. We assigned spectral types based on the current classification schemes and, when

  2. NEAR-INFRARED PHOTOMETRIC PROPERTIES OF 130,000 QUASARS: AN SDSS-UKIDSS-MATCHED CATALOG

    International Nuclear Information System (INIS)

    Peth, Michael A.; Ross, Nicholas P.; Schneider, Donald P.

    2011-01-01

    We present a catalog of over 130,000 quasar candidates with near-infrared (NIR) photometric properties, with an areal coverage of approximately 1200 deg 2 . This is achieved by matching the Sloan Digital Sky Survey (SDSS) in the optical ugriz bands to the UKIRT Infrared Digital Sky Survey (UKIDSS) Large Area Survey (LAS) in the NIR YJHK bands. We match the ∼1 million SDSS DR6 Photometric Quasar catalog to Data Release 3 of the UKIDSS LAS (ULAS) and produce a catalog with 130,827 objects with detections in one or more NIR bands, of which 74,351 objects have optical and K-band detections and 42,133 objects have the full nine-band photometry. The majority (∼85%) of the SDSS objects were not matched simply because these were not covered by the ULAS. The positional standard deviation of the SDSS Quasar to ULAS matches is δ R.A. = 0.''1370 and δ decl. = 0.''1314. We find an absolute systematic astrometric offset between the SDSS Quasar catalog and the UKIDSS LAS, of |R.A. offset | = 0.''025 and |decl. offset | = 0.''040; we suggest the nature of this offset to be due to the matching of catalog, rather than image, level data. Our matched catalog has a surface density of ∼53 deg -2 for K ≤ 18.27 objects; tests using our matched catalog, along with data from the UKIDSS Deep Extragalactic Survey, imply that our limiting magnitude is i ∼ 20.6. Color-redshift diagrams, for the optical and NIR, show a close agreement between our matched catalog and recent quasar color models at redshift z ∼ 4.6, and very high, z > 5.7, redshift previously discovered quasars.

  3. Four faint T dwarfs from the UKIRT Infrared Deep Sky Survey (UKIDSS) Southern Stripe

    Science.gov (United States)

    Chiu, Kuenley; Liu, Michael C.; Jiang, Linhua; Allers, Katelyn N.; Stark, Daniel P.; Bunker, Andrew; Fan, Xiaohui; Glazebrook, Karl; Dupuy, Trent J.

    2008-03-01

    We present the optical and near-infrared photometry and spectroscopy of four faint T dwarfs newly discovered from the UKIDSS first data release. The sample, drawn from an imaged area of ~136 deg2 to a depth of Y = 19.9 (5σ, Vega), is located in the Sloan Digital Sky Survey (SDSS) Southern Equatorial Stripe, a region of significant future deep imaging potential. We detail the selection and followup of these objects, three of which are spectroscopically confirmed brown dwarfs ranging from type T2.5 to T7.5, and one is photometrically identified as early T. Their magnitudes range from Y = 19.01 to 19.88 with derived distances from 34 to 98 pc, making these among the coldest and faintest brown dwarfs known. The T7.5 dwarf appears to be single based on 0.05-arcsec images from Keck laser guide star adaptive optics. The sample brings the total number of T dwarfs found or confirmed by UKIDSS data in this region to nine, and we discuss the projected numbers of dwarfs in the future survey data. We estimate that ~240 early and late T dwarfs are discoverable in the UKIDSS Large Area Survey (LAS) data, falling significantly short of published model projections and suggesting that initial mass functions and/or birth rates may be at the low end of possible models. Thus, deeper optical data have good potential to exploit the UKIDSS survey depth more fully, but may still find the potential Y dwarf sample to be extremely rare.

  4. Dust reddened quasars in first and UKIDSS: Beyond the tip of the iceberg

    Energy Technology Data Exchange (ETDEWEB)

    Glikman, Eilat [Department of Physics, Middlebury College, Middlebury, VT 05753 (United States); Urrutia, Tanya [Leibniz Institut fr Astrophysik, An der Sternwarte 16, D-14482 Potsdam (Germany); Lacy, Mark [National Radio Astronomy Observatory, Charlottesville, VA (United States); Djorgovski, S. G.; Mahabal, Ashish; Graham, Matthew [California Institute of Technology, Pasadena, CA 91125 (United States); Urry, Meg [Department of Physics and Yale Center for Astronomy and Astrophysics, Yale University, P.O. Box 208121, New Haven, CT 06520-8121 (United States); Croom, Scott [Sydney Institute for Astronomy (SIfA), School of Physics, University of Sydney, NSW 2006 (Australia); Schneider, Donald P. [Department of Astronomy and Astrophysics, The Pennsylvania State University, University Park, PA 16802 (United States); Ge, Jian, E-mail: eglikman@middlebury.edu [Astronomy Department, University of Florida, 211 Bryant Space Science Center, P.O. Box 112055, Gainesville, FL 32611 (United States)

    2013-12-01

    We present the results of a pilot survey to find dust-reddened quasars by matching the Faint Images of the Radio Sky at Twenty-Centimeters (FIRST) radio catalog to the UKIDSS near-infrared survey and using optical data from Sloan Digital Sky Survey to select objects with very red colors. The deep K-band limit provided by UKIDSS allows for finding more heavily reddened quasars at higher redshifts as compared with previous work using FIRST and Two Micron All Sky Survey (2MASS). We selected 87 candidates with K ≤ 17.0 from the UKIDSS Large Area Survey (LAS) First Data Release (DR1), which covers 190 deg{sup 2}. These candidates reach up to ∼1.5 mag below the 2MASS limit and obey the color criteria developed to identify dust-reddened quasars. We have obtained 61 spectroscopic observations in the optical and/or near-infrared, as well as classifications in the literature, and have identified 14 reddened quasars with E(B – V) > 0.1, including 3 at z > 2. We study the infrared properties of the sample using photometry from the Wide-Field Infrared Survey Explorer and find that infrared colors improve the efficiency of red quasar selection, removing many contaminants in an infrared-to-optical color-selected sample alone. The highest-redshift quasars (z ≳ 2) are only moderately reddened, with E(B – V) ∼ 0.2-0.3. We find that the surface density of red quasars rises sharply with faintness, comprising up to 17% of blue quasars at the same apparent K-band flux limit. We estimate that to reach more heavily reddened quasars (i.e., E(B – V) ≳ 0.5) at z > 2 and a depth of K = 17, we would need to survey at least ∼2.5 times more area.

  5. Proteomics dataset

    DEFF Research Database (Denmark)

    Bennike, Tue Bjerg; Carlsen, Thomas Gelsing; Ellingsen, Torkell

    2017-01-01

    The datasets presented in this article are related to the research articles entitled “Neutrophil Extracellular Traps in Ulcerative Colitis: A Proteome Analysis of Intestinal Biopsies” (Bennike et al., 2015 [1]), and “Proteome Analysis of Rheumatoid Arthritis Gut Mucosa” (Bennike et al., 2017 [2])...... been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifiers PXD001608 for ulcerative colitis and control samples, and PXD003082 for rheumatoid arthritis samples....

  6. Proteomics dataset

    DEFF Research Database (Denmark)

    Bennike, Tue Bjerg; Carlsen, Thomas Gelsing; Ellingsen, Torkell

    2017-01-01

    patients (Morgan et al., 2012; Abraham and Medzhitov, 2011; Bennike, 2014) [8–10. Therefore, we characterized the proteome of colon mucosa biopsies from 10 inflammatory bowel disease ulcerative colitis (UC) patients, 11 gastrointestinal healthy rheumatoid arthritis (RA) patients, and 10 controls. We...... been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifiers PXD001608 for ulcerative colitis and control samples, and PXD003082 for rheumatoid arthritis samples....

  7. A wide deep infrared look at the Pleiades with UKIDSS: new constraints on the substellar binary fraction and the low-mass initial mass function

    NARCIS (Netherlands)

    Lodieu, N.; Dobbie, P.D.; Deacon, N.R.; Hodgkin, S.T.; Hambly, N.C.; Jameson, R.F.

    2007-01-01

    We present the results of a deep wide-field near-infrared survey of 12 deg2 of the Pleiades conducted as part of the United Kingdom Infrared Telescope (UKIRT) Infrared Deep Sky Survey (UKIDSS) Galactic Cluster Survey (GCS). We have extracted over 340 high-probability proper motion (PM)

  8. EPA Nanorelease Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — EPA Nanorelease Dataset. This dataset is associated with the following publication: Wohlleben, W., C. Kingston, J. Carter, E. Sahle-Demessie, S. Vazquez-Campos, B....

  9. Aaron Journal article datasets

    Data.gov (United States)

    U.S. Environmental Protection Agency — All figures used in the journal article are in netCDF format. This dataset is associated with the following publication: Sims, A., K. Alapaty , and S. Raman....

  10. Integrated Surface Dataset (Global)

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Integrated Surface (ISD) Dataset (ISD) is composed of worldwide surface weather observations from over 35,000 stations, though the best spatial coverage is...

  11. Control Measure Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — The EPA Control Measure Dataset is a collection of documents describing air pollution control available to regulated facilities for the control and abatement of air...

  12. National Hydrography Dataset (NHD)

    Data.gov (United States)

    Kansas Data Access and Support Center — The National Hydrography Dataset (NHD) is a feature-based database that interconnects and uniquely identifies the stream segments or reaches that comprise the...

  13. Market Squid Ecology Dataset

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — This dataset contains ecological information collected on the major adult spawning and juvenile habitats of market squid off California and the US Pacific Northwest....

  14. Tables and figure datasets

    Data.gov (United States)

    U.S. Environmental Protection Agency — Soil and air concentrations of asbestos in Sumas study. This dataset is associated with the following publication: Wroble, J., T. Frederick, A. Frame, and D....

  15. Isfahan MISP Dataset.

    Science.gov (United States)

    Kashefpur, Masoud; Kafieh, Rahele; Jorjandi, Sahar; Golmohammadi, Hadis; Khodabande, Zahra; Abbasi, Mohammadreza; Teifuri, Nilufar; Fakharzadeh, Ali Akbar; Kashefpoor, Maryam; Rabbani, Hossein

    2017-01-01

    An online depository was introduced to share clinical ground truth with the public and provide open access for researchers to evaluate their computer-aided algorithms. PHP was used for web programming and MySQL for database managing. The website was entitled "biosigdata.com." It was a fast, secure, and easy-to-use online database for medical signals and images. Freely registered users could download the datasets and could also share their own supplementary materials while maintaining their privacies (citation and fee). Commenting was also available for all datasets, and automatic sitemap and semi-automatic SEO indexing have been set for the site. A comprehensive list of available websites for medical datasets is also presented as a Supplementary (http://journalonweb.com/tempaccess/4800.584.JMSS_55_16I3253.pdf).

  16. Mridangam stroke dataset

    OpenAIRE

    CompMusic

    2014-01-01

    The audio examples were recorded from a professional Carnatic percussionist in a semi-anechoic studio conditions by Akshay Anantapadmanabhan using SM-58 microphones and an H4n ZOOM recorder. The audio was sampled at 44.1 kHz and stored as 16 bit wav files. The dataset can be used for training models for each Mridangam stroke. /n/nA detailed description of the Mridangam and its strokes can be found in the paper below. A part of the dataset was used in the following paper. /nAkshay Anantapadman...

  17. The GTZAN dataset

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2013-01-01

    The GTZAN dataset appears in at least 100 published works, and is the most-used public dataset for evaluation in machine listening research for music genre recognition (MGR). Our recent work, however, shows GTZAN has several faults (repetitions, mislabelings, and distortions), which challenge...... of GTZAN, and provide a catalog of its faults. We review how GTZAN has been used in MGR research, and find few indications that its faults have been known and considered. Finally, we rigorously study the effects of its faults on evaluating five different MGR systems. The lesson is not to banish GTZAN...

  18. Dataset - Adviesregel PPL 2010

    NARCIS (Netherlands)

    Evert, van F.K.; Schans, van der D.A.; Geel, van W.C.A.; Slabbekoorn, J.J.; Booij, R.; Jukema, J.N.; Meurs, E.J.J.; Uenk, D.

    2011-01-01

    This dataset contains experimental data from a number of field experiments with potato in The Netherlands (Van Evert et al., 2011). The data are presented as an SQL dump of a PostgreSQL database (version 8.4.4). An outline of the entity-relationship diagram of the database is given in an

  19. A UKIDSS-based search for low-mass stars and small stellar clumps in off-cloud parts of young star-forming regions* **

    Directory of Open Access Journals (Sweden)

    Barrado y Navascués D.

    2011-07-01

    Full Text Available The form and universality of the mass function of young and nearby star-forming regions is still under debate. Its relation to the stellar density, its mass peak and the dependency on most recent models shows significant differencies for the various regions and remains unclear up to date. We aim to get a more complete census of two of such regions. We investigate yet unexplored areas of Orion and Taurus-Auriga, observed by the UKIDSS survey. In the latter, we search for low-mass stars via photometric and proper motion criteria and signs for variability. In Orion, we search for small stellar clumps via nearest-neighbor methods. Highlights in Taurus would be the finding of the missing low-mass stars and the detection of a young cluster T dwarf. In Orion, we discovered small stellar associations of its OB1b and OB1c populations. Combined with what is known in literature, we will provide by this investigations a general picture of the results of the star-forming processes in large areas of Taurus and Orion and probe the most recent models.

  20. Growing up in a megalopolis: environmental effects on galaxy evolution in a supercluster at z ˜ 0.65 in UKIDSS UDS

    Science.gov (United States)

    Galametz, Audrey; Pentericci, Laura; Castellano, Marco; Mendel, Trevor; Hartley, Will G.; Fossati, Matteo; Finoguenov, Alexis; Almaini, Omar; Beifiori, Alessandra; Fontana, Adriano; Grazian, Andrea; Scodeggio, Marco; Kocevski, Dale D.

    2018-04-01

    We present a large-scale galaxy structure Cl J021734-0513 at z ˜ 0.65 discovered in the UKIDSS UDS field, made of ˜20 galaxy groups and clusters, spreading over 10 Mpc. We report on a VLT/VIMOS spectroscopic follow-up program that, combined with past spectroscopy, allowed us to confirm four galaxy clusters (M200 ˜ 1014 M⊙) and a dozen associated groups and star-forming galaxy overdensities. Two additional filamentary structures at z ˜ 0.62 and 0.69 and foreground and background clusters at 0.6 groups. The presence of quiescent galaxies in the core of the latter shows that `pre-processing' has already happened before the groups fall into their more massive neighbours. Our spectroscopy allows us to derive spectral index measurements e.g. emission/absorption line equivalent widths, strength of the 4000 Å break, valuable to investigate the star formation history of structure members. Based on these line measurements, we select a population of `post-starburst' galaxies. These galaxies are preferentially found within the virial radius of clusters, supporting a scenario in which their recent quenching could be prompted by gas stripping by the dense intracluster medium. We derive stellar age estimates using Markov Chain Monte Carlo-based spectral fitting for quiescent galaxies and find a correlation between ages and colours/stellar masses which favours a top-down formation scenario of the red sequence. A catalogue of ˜650 redshifts in UDS is released alongside the paper (via MNRAS online data).

  1. National Elevation Dataset

    Science.gov (United States)

    ,

    2002-01-01

    The National Elevation Dataset (NED) is a new raster product assembled by the U.S. Geological Survey. NED is designed to provide National elevation data in a seamless form with a consistent datum, elevation unit, and projection. Data corrections were made in the NED assembly process to minimize artifacts, perform edge matching, and fill sliver areas of missing data. NED has a resolution of one arc-second (approximately 30 meters) for the conterminous United States, Hawaii, Puerto Rico and the island territories and a resolution of two arc-seconds for Alaska. NED data sources have a variety of elevation units, horizontal datums, and map projections. In the NED assembly process the elevation values are converted to decimal meters as a consistent unit of measure, NAD83 is consistently used as horizontal datum, and all the data are recast in a geographic projection. Older DEM's produced by methods that are now obsolete have been filtered during the NED assembly process to minimize artifacts that are commonly found in data produced by these methods. Artifact removal greatly improves the quality of the slope, shaded-relief, and synthetic drainage information that can be derived from the elevation data. Figure 2 illustrates the results of this artifact removal filtering. NED processing also includes steps to adjust values where adjacent DEM's do not match well, and to fill sliver areas of missing data between DEM's. These processing steps ensure that NED has no void areas and artificial discontinuities have been minimized. The artifact removal filtering process does not eliminate all of the artifacts. In areas where the only available DEM is produced by older methods, then "striping" may still occur.

  2. NP-PAH Interaction Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — Dataset presents concentrations of organic pollutants, such as polyaromatic hydrocarbon compounds, in water samples. Water samples of known volume and concentration...

  3. Editorial: Datasets for Learning Analytics

    NARCIS (Netherlands)

    Dietze, Stefan; George, Siemens; Davide, Taibi; Drachsler, Hendrik

    2018-01-01

    The European LinkedUp and LACE (Learning Analytics Community Exchange) project have been responsible for setting up a series of data challenges at the LAK conferences 2013 and 2014 around the LAK dataset. The LAK datasets consists of a rich collection of full text publications in the domain of

  4. Open University Learning Analytics dataset.

    Science.gov (United States)

    Kuzilek, Jakub; Hlosta, Martin; Zdrahal, Zdenek

    2017-11-28

    Learning Analytics focuses on the collection and analysis of learners' data to improve their learning experience by providing informed guidance and to optimise learning materials. To support the research in this area we have developed a dataset, containing data from courses presented at the Open University (OU). What makes the dataset unique is the fact that it contains demographic data together with aggregated clickstream data of students' interactions in the Virtual Learning Environment (VLE). This enables the analysis of student behaviour, represented by their actions. The dataset contains the information about 22 courses, 32,593 students, their assessment results, and logs of their interactions with the VLE represented by daily summaries of student clicks (10,655,280 entries). The dataset is freely available at https://analyse.kmi.open.ac.uk/open_dataset under a CC-BY 4.0 license.

  5. Turkey Run Landfill Emissions Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — landfill emissions measurements for the Turkey run landfill in Georgia. This dataset is associated with the following publication: De la Cruz, F., R. Green, G....

  6. Dataset of NRDA emission data

    Data.gov (United States)

    U.S. Environmental Protection Agency — Emissions data from open air oil burns. This dataset is associated with the following publication: Gullett, B., J. Aurell, A. Holder, B. Mitchell, D. Greenwell, M....

  7. Chemical product and function dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — Merged product weight fraction and chemical function data. This dataset is associated with the following publication: Isaacs , K., M. Goldsmith, P. Egeghy , K....

  8. The NOAA Dataset Identifier Project

    Science.gov (United States)

    de la Beaujardiere, J.; Mccullough, H.; Casey, K. S.

    2013-12-01

    The US National Oceanic and Atmospheric Administration (NOAA) initiated a project in 2013 to assign persistent identifiers to datasets archived at NOAA and to create informational landing pages about those datasets. The goals of this project are to enable the citation of datasets used in products and results in order to help provide credit to data producers, to support traceability and reproducibility, and to enable tracking of data usage and impact. A secondary goal is to encourage the submission of datasets for long-term preservation, because only archived datasets will be eligible for a NOAA-issued identifier. A team was formed with representatives from the National Geophysical, Oceanographic, and Climatic Data Centers (NGDC, NODC, NCDC) to resolve questions including which identifier scheme to use (answer: Digital Object Identifier - DOI), whether or not to embed semantics in identifiers (no), the level of granularity at which to assign identifiers (as coarsely as reasonable), how to handle ongoing time-series data (do not break into chunks), creation mechanism for the landing page (stylesheet from formal metadata record preferred), and others. Decisions made and implementation experience gained will inform the writing of a Data Citation Procedural Directive to be issued by the Environmental Data Management Committee in 2014. Several identifiers have been issued as of July 2013, with more on the way. NOAA is now reporting the number as a metric to federal Open Government initiatives. This paper will provide further details and status of the project.

  9. The Harvard organic photovoltaic dataset.

    Science.gov (United States)

    Lopez, Steven A; Pyzer-Knapp, Edward O; Simm, Gregor N; Lutzow, Trevor; Li, Kewei; Seress, Laszlo R; Hachmann, Johannes; Aspuru-Guzik, Alán

    2016-09-27

    The Harvard Organic Photovoltaic Dataset (HOPV15) presented in this work is a collation of experimental photovoltaic data from the literature, and corresponding quantum-chemical calculations performed over a range of conformers, each with quantum chemical results using a variety of density functionals and basis sets. It is anticipated that this dataset will be of use in both relating electronic structure calculations to experimental observations through the generation of calibration schemes, as well as for the creation of new semi-empirical methods and the benchmarking of current and future model chemistries for organic electronic applications.

  10. The Harvard organic photovoltaic dataset

    Science.gov (United States)

    Lopez, Steven A.; Pyzer-Knapp, Edward O.; Simm, Gregor N.; Lutzow, Trevor; Li, Kewei; Seress, Laszlo R.; Hachmann, Johannes; Aspuru-Guzik, Alán

    2016-01-01

    The Harvard Organic Photovoltaic Dataset (HOPV15) presented in this work is a collation of experimental photovoltaic data from the literature, and corresponding quantum-chemical calculations performed over a range of conformers, each with quantum chemical results using a variety of density functionals and basis sets. It is anticipated that this dataset will be of use in both relating electronic structure calculations to experimental observations through the generation of calibration schemes, as well as for the creation of new semi-empirical methods and the benchmarking of current and future model chemistries for organic electronic applications. PMID:27676312

  11. Querying Large Biological Network Datasets

    Science.gov (United States)

    Gulsoy, Gunhan

    2013-01-01

    New experimental methods has resulted in increasing amount of genetic interaction data to be generated every day. Biological networks are used to store genetic interaction data gathered. Increasing amount of data available requires fast large scale analysis methods. Therefore, we address the problem of querying large biological network datasets.…

  12. Fluxnet Synthesis Dataset Collaboration Infrastructure

    Energy Technology Data Exchange (ETDEWEB)

    Agarwal, Deborah A. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Humphrey, Marty [Univ. of Virginia, Charlottesville, VA (United States); van Ingen, Catharine [Microsoft. San Francisco, CA (United States); Beekwilder, Norm [Univ. of Virginia, Charlottesville, VA (United States); Goode, Monte [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Jackson, Keith [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Rodriguez, Matt [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Weber, Robin [Univ. of California, Berkeley, CA (United States)

    2008-02-06

    The Fluxnet synthesis dataset originally compiled for the La Thuile workshop contained approximately 600 site years. Since the workshop, several additional site years have been added and the dataset now contains over 920 site years from over 240 sites. A data refresh update is expected to increase those numbers in the next few months. The ancillary data describing the sites continues to evolve as well. There are on the order of 120 site contacts and 60proposals have been approved to use thedata. These proposals involve around 120 researchers. The size and complexity of the dataset and collaboration has led to a new approach to providing access to the data and collaboration support and the support team attended the workshop and worked closely with the attendees and the Fluxnet project office to define the requirements for the support infrastructure. As a result of this effort, a new website (http://www.fluxdata.org) has been created to provide access to the Fluxnet synthesis dataset. This new web site is based on a scientific data server which enables browsing of the data on-line, data download, and version tracking. We leverage database and data analysis tools such as OLAP data cubes and web reports to enable browser and Excel pivot table access to the data.

  13. CERC Dataset (Full Hadza Data)

    DEFF Research Database (Denmark)

    2016-01-01

    The dataset includes demographic, behavioral, and religiosity data from eight different populations from around the world. The samples were drawn from: (1) Coastal and (2) Inland Tanna, Vanuatu; (3) Hadzaland, Tanzania; (4) Lovu, Fiji; (5) Pointe aux Piment, Mauritius; (6) Pesqueiro, Brazil; (7......) Kyzyl, Tyva Republic; and (8) Yasawa, Fiji. Related publication: Purzycki, et al. (2016). Moralistic Gods, Supernatural Punishment and the Expansion of Human Sociality. Nature, 530(7590): 327-330....

  14. Viking Seismometer PDS Archive Dataset

    Science.gov (United States)

    Lorenz, R. D.

    2016-12-01

    The Viking Lander 2 seismometer operated successfully for over 500 Sols on the Martian surface, recording at least one likely candidate Marsquake. The Viking mission, in an era when data handling hardware (both on board and on the ground) was limited in capability, predated modern planetary data archiving, and ad-hoc repositories of the data, and the very low-level record at NSSDC, were neither convenient to process nor well-known. In an effort supported by the NASA Mars Data Analysis Program, we have converted the bulk of the Viking dataset (namely the 49,000 and 270,000 records made in High- and Event- modes at 20 and 1 Hz respectively) into a simple ASCII table format. Additionally, since wind-generated lander motion is a major component of the signal, contemporaneous meteorological data are included in summary records to facilitate correlation. These datasets are being archived at the PDS Geosciences Node. In addition to brief instrument and dataset descriptions, the archive includes code snippets in the freely-available language 'R' to demonstrate plotting and analysis. Further, we present examples of lander-generated noise, associated with the sampler arm, instrument dumps and other mechanical operations.

  15. PHYSICS PERFORMANCE AND DATASET (PPD)

    CERN Multimedia

    L. Silvestris

    2013-01-01

    The first part of the Long Shutdown period has been dedicated to the preparation of the samples for the analysis targeting the summer conferences. In particular, the 8 TeV data acquired in 2012, including most of the “parked datasets”, have been reconstructed profiting from improved alignment and calibration conditions for all the sub-detectors. A careful planning of the resources was essential in order to deliver the datasets well in time to the analysts, and to schedule the update of all the conditions and calibrations needed at the analysis level. The newly reprocessed data have undergone detailed scrutiny by the Dataset Certification team allowing to recover some of the data for analysis usage and further improving the certification efficiency, which is now at 91% of the recorded luminosity. With the aim of delivering a consistent dataset for 2011 and 2012, both in terms of conditions and release (53X), the PPD team is now working to set up a data re-reconstruction and a new MC pro...

  16. RARD: The Related-Article Recommendation Dataset

    OpenAIRE

    Beel, Joeran; Carevic, Zeljko; Schaible, Johann; Neusch, Gabor

    2017-01-01

    Recommender-system datasets are used for recommender-system evaluations, training machine-learning algorithms, and exploring user behavior. While there are many datasets for recommender systems in the domains of movies, books, and music, there are rather few datasets from research-paper recommender systems. In this paper, we introduce RARD, the Related-Article Recommendation Dataset, from the digital library Sowiport and the recommendation-as-a-service provider Mr. DLib. The dataset contains ...

  17. Las Sinapsis

    OpenAIRE

    Jorge Eduardo Duque Parra; Genaro Morales Parra; Carlos Alberto Duque Parra

    1997-01-01

    Introducción

    El estudio del sistema nervioso considera entre múltiples conexiones, aquéllas de carácter bioquímico que se median a través de sustancias elaboradas por las neuronas y que reciben la denominación de neurotransmisores’, dichas sustancias se vierten en las zonas de resquicio neuromuscular, neuroneuronal o neuroglandular, para modificar las condiciones de membrana y permitir la continuidad de los p...

  18. Passive Containment DataSet

    Science.gov (United States)

    This data is for Figures 6 and 7 in the journal article. The data also includes the two EPANET input files used for the analysis described in the paper, one for the looped system and one for the block system.This dataset is associated with the following publication:Grayman, W., R. Murray , and D. Savic. Redesign of Water Distribution Systems for Passive Containment of Contamination. JOURNAL OF THE AMERICAN WATER WORKS ASSOCIATION. American Water Works Association, Denver, CO, USA, 108(7): 381-391, (2016).

  19. The CMS dataset bookkeeping service

    Science.gov (United States)

    Afaq, A.; Dolgert, A.; Guo, Y.; Jones, C.; Kosyakov, S.; Kuznetsov, V.; Lueking, L.; Riley, D.; Sekhri, V.

    2008-07-01

    The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS event data from Monte Carlo and Detector sources. It provides the ability to identify MC or trigger source, track data provenance, construct datasets for analysis, and discover interesting data. CMS requires processing and analysis activities at various service levels and the DBS system provides support for localized processing or private analysis, as well as global access for CMS users at large. Catalog entries can be moved among the various service levels with a simple set of migration tools, thus forming a loose federation of databases. DBS is available to CMS users via a Python API, Command Line, and a Discovery web page interfaces. The system is built as a multi-tier web application with Java servlets running under Tomcat, with connections via JDBC to Oracle or MySQL database backends. Clients connect to the service through HTTP or HTTPS with authentication provided by GRID certificates and authorization through VOMS. DBS is an integral part of the overall CMS Data Management and Workflow Management systems.

  20. The CMS dataset bookkeeping service

    Energy Technology Data Exchange (ETDEWEB)

    Afaq, A; Guo, Y; Kosyakov, S; Lueking, L; Sekhri, V [Fermilab, Batavia, Illinois 60510 (United States); Dolgert, A; Jones, C; Kuznetsov, V; Riley, D [Cornell University, Ithaca, New York 14850 (United States)

    2008-07-15

    The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS event data from Monte Carlo and Detector sources. It provides the ability to identify MC or trigger source, track data provenance, construct datasets for analysis, and discover interesting data. CMS requires processing and analysis activities at various service levels and the DBS system provides support for localized processing or private analysis, as well as global access for CMS users at large. Catalog entries can be moved among the various service levels with a simple set of migration tools, thus forming a loose federation of databases. DBS is available to CMS users via a Python API, Command Line, and a Discovery web page interfaces. The system is built as a multi-tier web application with Java servlets running under Tomcat, with connections via JDBC to Oracle or MySQL database backends. Clients connect to the service through HTTP or HTTPS with authentication provided by GRID certificates and authorization through VOMS. DBS is an integral part of the overall CMS Data Management and Workflow Management systems.

  1. The CMS dataset bookkeeping service

    International Nuclear Information System (INIS)

    Afaq, A; Guo, Y; Kosyakov, S; Lueking, L; Sekhri, V; Dolgert, A; Jones, C; Kuznetsov, V; Riley, D

    2008-01-01

    The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS event data from Monte Carlo and Detector sources. It provides the ability to identify MC or trigger source, track data provenance, construct datasets for analysis, and discover interesting data. CMS requires processing and analysis activities at various service levels and the DBS system provides support for localized processing or private analysis, as well as global access for CMS users at large. Catalog entries can be moved among the various service levels with a simple set of migration tools, thus forming a loose federation of databases. DBS is available to CMS users via a Python API, Command Line, and a Discovery web page interfaces. The system is built as a multi-tier web application with Java servlets running under Tomcat, with connections via JDBC to Oracle or MySQL database backends. Clients connect to the service through HTTP or HTTPS with authentication provided by GRID certificates and authorization through VOMS. DBS is an integral part of the overall CMS Data Management and Workflow Management systems

  2. The CMS dataset bookkeeping service

    International Nuclear Information System (INIS)

    Afaq, Anzar; Dolgert, Andrew; Guo, Yuyi; Jones, Chris; Kosyakov, Sergey; Kuznetsov, Valentin; Lueking, Lee; Riley, Dan; Sekhri, Vijay

    2007-01-01

    The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS event data from Monte Carlo and Detector sources. It provides the ability to identify MC or trigger source, track data provenance, construct datasets for analysis, and discover interesting data. CMS requires processing and analysis activities at various service levels and the DBS system provides support for localized processing or private analysis, as well as global access for CMS users at large. Catalog entries can be moved among the various service levels with a simple set of migration tools, thus forming a loose federation of databases. DBS is available to CMS users via a Python API, Command Line, and a Discovery web page interfaces. The system is built as a multi-tier web application with Java servlets running under Tomcat, with connections via JDBC to Oracle or MySQL database backends. Clients connect to the service through HTTP or HTTPS with authentication provided by GRID certificates and authorization through VOMS. DBS is an integral part of the overall CMS Data Management and Workflow Management systems

  3. 2008 TIGER/Line Nationwide Dataset

    Data.gov (United States)

    California Natural Resource Agency — This dataset contains a nationwide build of the 2008 TIGER/Line datasets from the US Census Bureau downloaded in April 2009. The TIGER/Line Shapefiles are an extract...

  4. Satellite-Based Precipitation Datasets

    Science.gov (United States)

    Munchak, S. J.; Huffman, G. J.

    2017-12-01

    Of the possible sources of precipitation data, those based on satellites provide the greatest spatial coverage. There is a wide selection of datasets, algorithms, and versions from which to choose, which can be confusing to non-specialists wishing to use the data. The International Precipitation Working Group (IPWG) maintains tables of the major publicly available, long-term, quasi-global precipitation data sets (http://www.isac.cnr.it/ ipwg/data/datasets.html), and this talk briefly reviews the various categories. As examples, NASA provides two sets of quasi-global precipitation data sets: the older Tropical Rainfall Measuring Mission (TRMM) Multi-satellite Precipitation Analysis (TMPA) and current Integrated Multi-satellitE Retrievals for Global Precipitation Measurement (GPM) mission (IMERG). Both provide near-real-time and post-real-time products that are uniformly gridded in space and time. The TMPA products are 3-hourly 0.25°x0.25° on the latitude band 50°N-S for about 16 years, while the IMERG products are half-hourly 0.1°x0.1° on 60°N-S for over 3 years (with plans to go to 16+ years in Spring 2018). In addition to the precipitation estimates, each data set provides fields of other variables, such as the satellite sensor providing estimates and estimated random error. The discussion concludes with advice about determining suitability for use, the necessity of being clear about product names and versions, and the need for continued support for satellite- and surface-based observation.

  5. Las Sinapsis

    Directory of Open Access Journals (Sweden)

    Jorge Eduardo Duque Parra

    1997-12-01

    Full Text Available

    Introducción

    El estudio del sistema nervioso considera entre múltiples conexiones, aquéllas de carácter bioquímico que se median a través de sustancias elaboradas por las neuronas y que reciben la denominación de neurotransmisores’, dichas sustancias se vierten en las zonas de resquicio neuromuscular, neuroneuronal o neuroglandular, para modificar las condiciones de membrana y permitir la continuidad de los potenciales de acción (por creación de nuevos potenciales en las células subsiguientes, de la neurona hacia el órgano blanco.

    La integridad de los diversos elementos de la “zona de unión funcional” o sinapsis, asegura la adecuada comunicación entre el sistema nervioso y la mayoría de los elementos restantes del organismo humano.

    Las alteraciones de los elementos de las sinapsis, conllevan a la génesis de estados variables fisiológicos y patológicos somáticos, psicosomáticos o psíquicos, inconsecuentes con el estado de homeostasis.

    Las moléculas que se vierten en las hendiduras sinápticas, sirven, por tanto, de transductoras para efectos elementales (como los reflejos y en las estrategias complejas (como los de la actividad intelectual.

    Antecedentes

    Ya desde los tiempos de Galeno, se sabía que los nervios eran los responsables de la rápida comunicación entre el cuerpo y el cerebro; el estudio de las sinapsis nos remonta de manera indirecta a
    1791, fecha en la que Luigi Galvani, descubrió en sus experimentos con ancas de ranas, que entre los eventos eléctricos y los ocurridos en los nervios, existía una relación evidente (28,37,45’, los experimentos de Galvani se han refinado con el paso del tiempo, en nuestro siglo por ejemplo, el desarrollo del osciloscopio de rayos catódicos combinado con un amplificador potente, ha permitido medir las débiles y variables corrientes bioeléctricas de los

  6. PHYSICS PERFORMANCE AND DATASET (PPD)

    CERN Multimedia

    L. Silvestris

    2012-01-01

      Introduction The first part of the year presented an important test for the new Physics Performance and Dataset (PPD) group (cf. its mandate: http://cern.ch/go/8f77). The activity was focused on the validation of the new releases meant for the Monte Carlo (MC) production and the data-processing in 2012 (CMSSW 50X and 52X), and on the preparation of the 2012 operations. In view of the Chamonix meeting, the PPD and physics groups worked to understand the impact of the higher pile-up scenario on some of the flagship Higgs analyses to better quantify the impact of the high luminosity on the CMS physics potential. A task force is working on the optimisation of the reconstruction algorithms and on the code to cope with the performance requirements imposed by the higher event occupancy as foreseen for 2012. Concerning the preparation for the analysis of the new data, a new MC production has been prepared. The new samples, simulated at 8 TeV, are already being produced and the digitisation and recons...

  7. Pattern Analysis On Banking Dataset

    Directory of Open Access Journals (Sweden)

    Amritpal Singh

    2015-06-01

    Full Text Available Abstract Everyday refinement and development of technology has led to an increase in the competition between the Tech companies and their going out of way to crack the system andbreak down. Thus providing Data mining a strategically and security-wise important area for many business organizations including banking sector. It allows the analyzes of important information in the data warehouse and assists the banks to look for obscure patterns in a group and discover unknown relationship in the data.Banking systems needs to process ample amount of data on daily basis related to customer information their credit card details limit and collateral details transaction details risk profiles Anti Money Laundering related information trade finance data. Thousands of decisionsbased on the related data are taken in a bank daily. This paper analyzes the banking dataset in the weka environment for the detection of interesting patterns based on its applications ofcustomer acquisition customer retention management and marketing and management of risk fraudulence detections.

  8. PHYSICS PERFORMANCE AND DATASET (PPD)

    CERN Multimedia

    L. Silvestris

    2013-01-01

    The PPD activities, in the first part of 2013, have been focused mostly on the final physics validation and preparation for the data reprocessing of the full 8 TeV datasets with the latest calibrations. These samples will be the basis for the preliminary results for summer 2013 but most importantly for the final publications on the 8 TeV Run 1 data. The reprocessing involves also the reconstruction of a significant fraction of “parked data” that will allow CMS to perform a whole new set of precision analyses and searches. In this way the CMSSW release 53X is becoming the legacy release for the 8 TeV Run 1 data. The regular operation activities have included taking care of the prolonged proton-proton data taking and the run with proton-lead collisions that ended in February. The DQM and Data Certification team has deployed a continuous effort to promptly certify the quality of the data. The luminosity-weighted certification efficiency (requiring all sub-detectors to be certified as usab...

  9. Las denominaciones de las ocupaciones

    Directory of Open Access Journals (Sweden)

    Jesus Emilio Castañeda

    1990-04-01

    Full Text Available RESUMEN El propósito de este artículo, es mostrar al lector la forma como en nuestro ámbito existen diversos factores que han permitido la proliferación de maneras de llamar las ocupaciones. Aspectos regionales, la tradición, clases de organización, situaciones románticas, adjetivización de los cargos, peyorización  son entre otras razones que han generado  la creación de serie de sinónimos, de las ocupaciones, objeto de estudio de la sociologia del trabajo y de la administración de los recursos humanos.

  10. The Geometry of Finite Equilibrium Datasets

    DEFF Research Database (Denmark)

    Balasko, Yves; Tvede, Mich

    We investigate the geometry of finite datasets defined by equilibrium prices, income distributions, and total resources. We show that the equilibrium condition imposes no restrictions if total resources are collinear, a property that is robust to small perturbations. We also show that the set...... of equilibrium datasets is pathconnected when the equilibrium condition does impose restrictions on datasets, as for example when total resources are widely non collinear....

  11. IPCC Socio-Economic Baseline Dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — The Intergovernmental Panel on Climate Change (IPCC) Socio-Economic Baseline Dataset consists of population, human development, economic, water resources, land...

  12. Veterans Affairs Suicide Prevention Synthetic Dataset

    Data.gov (United States)

    Department of Veterans Affairs — The VA's Veteran Health Administration, in support of the Open Data Initiative, is providing the Veterans Affairs Suicide Prevention Synthetic Dataset (VASPSD). The...

  13. Nanoparticle-organic pollutant interaction dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — Dataset presents concentrations of organic pollutants, such as polyaromatic hydrocarbon compounds, in water samples. Water samples of known volume and concentration...

  14. An Annotated Dataset of 14 Meat Images

    DEFF Research Database (Denmark)

    Stegmann, Mikkel Bille

    2002-01-01

    This note describes a dataset consisting of 14 annotated images of meat. Points of correspondence are placed on each image. As such, the dataset can be readily used for building statistical models of shape. Further, format specifications and terms of use are given.......This note describes a dataset consisting of 14 annotated images of meat. Points of correspondence are placed on each image. As such, the dataset can be readily used for building statistical models of shape. Further, format specifications and terms of use are given....

  15. Las Farmacodependencias

    Directory of Open Access Journals (Sweden)

    José Francisco Socarrás

    1994-09-01

    Full Text Available

    Ahora, cuando la Corte Constitucional ha despenalizado al farmacodependiente por el consumo de drogas, conviene informar sobre las consecuencias perjudiciales de estas. A quienes deseen un conocimiento detallado al respecto, les recomiendo documentarse en “Enciclopedia de Psiquiatría”, publicada por “El Ateneo” en Buenos Aires.

    El Capítulo respectivo es del doctor Daniel L. Murguia. El denominador corriente de su abuso es sencillamente la dependencia, nombre propuesto por la Organización Mundial de la Salud, definido como el “estado psíquico y a veces físico causado por la interacción entre un organismo vivo y un fármaco”. Su carácter esencial es el impulso psicofísico a consumir el producto en forma continua o periódica, con el fin de experimentar los efectos que acarrea y evitar los malestares de su privación. Todas las drogas del respectivo género, consumidas a dosis altas, tienen efectos psicotóxicos que alteran la conducta y en algunos casos conducen a la muerte.

    Los expertos de la Organización Mundial califican las mencionadas sustancias en seis grupos, a saber:

    1. Morfina y opiáceas, los cuales crean dependencia física y psicológica, con tolerancia precoz, necesidad de aumentar las dosis y síndrome de abstinencia bastante peligroso. Entre los derivados se cuentan los siguientes: Hidromorfina (Dilaudid; oximorfina (Nurmophan; heroína,
    éter diacético de la morfina (Diacetilmorfina; codeína (Metilmorfina; hidrocodeína (Hicadan’ oxicodeína (Perdocan. Los efectos de los opiáceo~ son los siguientes: analgesia, depresión respiratoria y espasmo gastrointestinal. Las dosis tóxicas pueden provocar convulsiones.

    2. Barbitúricos, alcohol y sedantes como el fenobarbital, que provocan así mismo dependencia física.

    3. Anfetaminas, en particular la Bencedrina inhalante, que acarrea dependencia psicológica.

    4. Cocaína, cuya dependencia psíquica es tal que muchos

  16. SIMADL: Simulated Activities of Daily Living Dataset

    Directory of Open Access Journals (Sweden)

    Talal Alshammari

    2018-04-01

    Full Text Available With the realisation of the Internet of Things (IoT paradigm, the analysis of the Activities of Daily Living (ADLs, in a smart home environment, is becoming an active research domain. The existence of representative datasets is a key requirement to advance the research in smart home design. Such datasets are an integral part of the visualisation of new smart home concepts as well as the validation and evaluation of emerging machine learning models. Machine learning techniques that can learn ADLs from sensor readings are used to classify, predict and detect anomalous patterns. Such techniques require data that represent relevant smart home scenarios, for training, testing and validation. However, the development of such machine learning techniques is limited by the lack of real smart home datasets, due to the excessive cost of building real smart homes. This paper provides two datasets for classification and anomaly detection. The datasets are generated using OpenSHS, (Open Smart Home Simulator, which is a simulation software for dataset generation. OpenSHS records the daily activities of a participant within a virtual environment. Seven participants simulated their ADLs for different contexts, e.g., weekdays, weekends, mornings and evenings. Eighty-four files in total were generated, representing approximately 63 days worth of activities. Forty-two files of classification of ADLs were simulated in the classification dataset and the other forty-two files are for anomaly detection problems in which anomalous patterns were simulated and injected into the anomaly detection dataset.

  17. ASSISTments Dataset from Multiple Randomized Controlled Experiments

    Science.gov (United States)

    Selent, Douglas; Patikorn, Thanaporn; Heffernan, Neil

    2016-01-01

    In this paper, we present a dataset consisting of data generated from 22 previously and currently running randomized controlled experiments inside the ASSISTments online learning platform. This dataset provides data mining opportunities for researchers to analyze ASSISTments data in a convenient format across multiple experiments at the same time.…

  18. Synthetic and Empirical Capsicum Annuum Image Dataset

    NARCIS (Netherlands)

    Barth, R.

    2016-01-01

    This dataset consists of per-pixel annotated synthetic (10500) and empirical images (50) of Capsicum annuum, also known as sweet or bell pepper, situated in a commercial greenhouse. Furthermore, the source models to generate the synthetic images are included. The aim of the datasets are to

  19. Design of an audio advertisement dataset

    Science.gov (United States)

    Fu, Yutao; Liu, Jihong; Zhang, Qi; Geng, Yuting

    2015-12-01

    Since more and more advertisements swarm into radios, it is necessary to establish an audio advertising dataset which could be used to analyze and classify the advertisement. A method of how to establish a complete audio advertising dataset is presented in this paper. The dataset is divided into four different kinds of advertisements. Each advertisement's sample is given in *.wav file format, and annotated with a txt file which contains its file name, sampling frequency, channel number, broadcasting time and its class. The classifying rationality of the advertisements in this dataset is proved by clustering the different advertisements based on Principal Component Analysis (PCA). The experimental results show that this audio advertisement dataset offers a reliable set of samples for correlative audio advertisement experimental studies.

  20. The Kinetics Human Action Video Dataset

    OpenAIRE

    Kay, Will; Carreira, Joao; Simonyan, Karen; Zhang, Brian; Hillier, Chloe; Vijayanarasimhan, Sudheendra; Viola, Fabio; Green, Tim; Back, Trevor; Natsev, Paul; Suleyman, Mustafa; Zisserman, Andrew

    2017-01-01

    We describe the DeepMind Kinetics human action video dataset. The dataset contains 400 human action classes, with at least 400 video clips for each action. Each clip lasts around 10s and is taken from a different YouTube video. The actions are human focussed and cover a broad range of classes including human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands. We describe the statistics of the dataset, how it was collected, and give some ...

  1. BASE MAP DATASET, LOS ANGELES COUNTY, CALIFORNIA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  2. BASE MAP DATASET, CHEROKEE COUNTY, SOUTH CAROLINA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  3. SIAM 2007 Text Mining Competition dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — Subject Area: Text Mining Description: This is the dataset used for the SIAM 2007 Text Mining competition. This competition focused on developing text mining...

  4. Harvard Aging Brain Study : Dataset and accessibility

    NARCIS (Netherlands)

    Dagley, Alexander; LaPoint, Molly; Huijbers, Willem; Hedden, Trey; McLaren, Donald G.; Chatwal, Jasmeer P.; Papp, Kathryn V.; Amariglio, Rebecca E.; Blacker, Deborah; Rentz, Dorene M.; Johnson, Keith A.; Sperling, Reisa A.; Schultz, Aaron P.

    2017-01-01

    The Harvard Aging Brain Study is sharing its data with the global research community. The longitudinal dataset consists of a 284-subject cohort with the following modalities acquired: demographics, clinical assessment, comprehensive neuropsychological testing, clinical biomarkers, and neuroimaging.

  5. BASE MAP DATASET, HONOLULU COUNTY, HAWAII, USA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  6. BASE MAP DATASET, EDGEFIELD COUNTY, SOUTH CAROLINA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  7. Simulation of Smart Home Activity Datasets

    Directory of Open Access Journals (Sweden)

    Jonathan Synnott

    2015-06-01

    Full Text Available A globally ageing population is resulting in an increased prevalence of chronic conditions which affect older adults. Such conditions require long-term care and management to maximize quality of life, placing an increasing strain on healthcare resources. Intelligent environments such as smart homes facilitate long-term monitoring of activities in the home through the use of sensor technology. Access to sensor datasets is necessary for the development of novel activity monitoring and recognition approaches. Access to such datasets is limited due to issues such as sensor cost, availability and deployment time. The use of simulated environments and sensors may address these issues and facilitate the generation of comprehensive datasets. This paper provides a review of existing approaches for the generation of simulated smart home activity datasets, including model-based approaches and interactive approaches which implement virtual sensors, environments and avatars. The paper also provides recommendation for future work in intelligent environment simulation.

  8. Simulation of Smart Home Activity Datasets.

    Science.gov (United States)

    Synnott, Jonathan; Nugent, Chris; Jeffers, Paul

    2015-06-16

    A globally ageing population is resulting in an increased prevalence of chronic conditions which affect older adults. Such conditions require long-term care and management to maximize quality of life, placing an increasing strain on healthcare resources. Intelligent environments such as smart homes facilitate long-term monitoring of activities in the home through the use of sensor technology. Access to sensor datasets is necessary for the development of novel activity monitoring and recognition approaches. Access to such datasets is limited due to issues such as sensor cost, availability and deployment time. The use of simulated environments and sensors may address these issues and facilitate the generation of comprehensive datasets. This paper provides a review of existing approaches for the generation of simulated smart home activity datasets, including model-based approaches and interactive approaches which implement virtual sensors, environments and avatars. The paper also provides recommendation for future work in intelligent environment simulation.

  9. Environmental Dataset Gateway (EDG) REST Interface

    Data.gov (United States)

    U.S. Environmental Protection Agency — Use the Environmental Dataset Gateway (EDG) to find and access EPA's environmental resources. Many options are available for easily reusing EDG content in other...

  10. BASE MAP DATASET, INYO COUNTY, OKLAHOMA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  11. BASE MAP DATASET, JACKSON COUNTY, OKLAHOMA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  12. BASE MAP DATASET, SANTA CRIZ COUNTY, CALIFORNIA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  13. Climate Prediction Center IR 4km Dataset

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — CPC IR 4km dataset was created from all available individual geostationary satellite data which have been merged to form nearly seamless global (60N-60S) IR...

  14. BASE MAP DATASET, MAYES COUNTY, OKLAHOMA, USA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications: cadastral, geodetic control,...

  15. BASE MAP DATASET, KINGFISHER COUNTY, OKLAHOMA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — FEMA Framework Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme,...

  16. Comparison of recent SnIa datasets

    International Nuclear Information System (INIS)

    Sanchez, J.C. Bueno; Perivolaropoulos, L.; Nesseris, S.

    2009-01-01

    We rank the six latest Type Ia supernova (SnIa) datasets (Constitution (C), Union (U), ESSENCE (Davis) (E), Gold06 (G), SNLS 1yr (S) and SDSS-II (D)) in the context of the Chevalier-Polarski-Linder (CPL) parametrization w(a) = w 0 +w 1 (1−a), according to their Figure of Merit (FoM), their consistency with the cosmological constant (ΛCDM), their consistency with standard rulers (Cosmic Microwave Background (CMB) and Baryon Acoustic Oscillations (BAO)) and their mutual consistency. We find a significant improvement of the FoM (defined as the inverse area of the 95.4% parameter contour) with the number of SnIa of these datasets ((C) highest FoM, (U), (G), (D), (E), (S) lowest FoM). Standard rulers (CMB+BAO) have a better FoM by about a factor of 3, compared to the highest FoM SnIa dataset (C). We also find that the ranking sequence based on consistency with ΛCDM is identical with the corresponding ranking based on consistency with standard rulers ((S) most consistent, (D), (C), (E), (U), (G) least consistent). The ranking sequence of the datasets however changes when we consider the consistency with an expansion history corresponding to evolving dark energy (w 0 ,w 1 ) = (−1.4,2) crossing the phantom divide line w = −1 (it is practically reversed to (G), (U), (E), (S), (D), (C)). The SALT2 and MLCS2k2 fitters are also compared and some peculiar features of the SDSS-II dataset when standardized with the MLCS2k2 fitter are pointed out. Finally, we construct a statistic to estimate the internal consistency of a collection of SnIa datasets. We find that even though there is good consistency among most samples taken from the above datasets, this consistency decreases significantly when the Gold06 (G) dataset is included in the sample

  17. Las redes sociales presentes en las bibliotecas

    Directory of Open Access Journals (Sweden)

    Magda Cecilia Sandí S.

    2012-01-01

    Full Text Available El presente artículo pretende evidenciar la importancia del uso de las redes sociales en las bibliotecas como una herramienta y un canal de comunicación entre el bibliotecólogo y la comunidad de usuarios. Las redes sociales son una nueva forma de comunicarnos entre las y los usuarios del Internet, su uso es irrestricto y cada vez aumenta la comunidad de usuarios de estas herramientas en la red.

  18. Las redes sociales presentes en las bibliotecas

    Directory of Open Access Journals (Sweden)

    Magda Cecilia Sandí Sandí

    2012-07-01

    Full Text Available El presente artículo pretende evidenciar la importancia del uso de las redes sociales en las bibliotecas como una herramienta y un canal de comunicación entre el bibliotecólogo y la comunidad de usuarios. Las redes sociales son una nueva forma de comunicarnos entre las y los usuarios del Internet, su uso es irrestricto y cada vez aumenta la comunidad de usuarios de estas herramientas en la red.

  19. Las redes sociales presentes en las bibliotecas

    OpenAIRE

    Magda Cecilia Sandí S.

    2012-01-01

    El presente artículo pretende evidenciar la importancia del uso de las redes sociales en las bibliotecas como una herramienta y un canal de comunicación entre el bibliotecólogo y la comunidad de usuarios. Las redes sociales son una nueva forma de comunicarnos entre las y los usuarios del Internet, su uso es irrestricto y cada vez aumenta la comunidad de usuarios de estas herramientas en la red.

  20. Comparison of Shallow Survey 2012 Multibeam Datasets

    Science.gov (United States)

    Ramirez, T. M.

    2012-12-01

    The purpose of the Shallow Survey common dataset is a comparison of the different technologies utilized for data acquisition in the shallow survey marine environment. The common dataset consists of a series of surveys conducted over a common area of seabed using a variety of systems. It provides equipment manufacturers the opportunity to showcase their latest systems while giving hydrographic researchers and scientists a chance to test their latest algorithms on the dataset so that rigorous comparisons can be made. Five companies collected data for the Common Dataset in the Wellington Harbor area in New Zealand between May 2010 and May 2011; including Kongsberg, Reson, R2Sonic, GeoAcoustics, and Applied Acoustics. The Wellington harbor and surrounding coastal area was selected since it has a number of well-defined features, including the HMNZS South Seas and HMNZS Wellington wrecks, an armored seawall constructed of Tetrapods and Akmons, aquifers, wharves and marinas. The seabed inside the harbor basin is largely fine-grained sediment, with gravel and reefs around the coast. The area outside the harbor on the southern coast is an active environment, with moving sand and exposed reefs. A marine reserve is also in this area. For consistency between datasets, the coastal research vessel R/V Ikatere and crew were used for all surveys conducted for the common dataset. Using Triton's Perspective processing software multibeam datasets collected for the Shallow Survey were processed for detail analysis. Datasets from each sonar manufacturer were processed using the CUBE algorithm developed by the Center for Coastal and Ocean Mapping/Joint Hydrographic Center (CCOM/JHC). Each dataset was gridded at 0.5 and 1.0 meter resolutions for cross comparison and compliance with International Hydrographic Organization (IHO) requirements. Detailed comparisons were made of equipment specifications (transmit frequency, number of beams, beam width), data density, total uncertainty, and

  1. 3DSEM: A 3D microscopy dataset

    Directory of Open Access Journals (Sweden)

    Ahmad P. Tafti

    2016-03-01

    Full Text Available The Scanning Electron Microscope (SEM as a 2D imaging instrument has been widely used in many scientific disciplines including biological, mechanical, and materials sciences to determine the surface attributes of microscopic objects. However the SEM micrographs still remain 2D images. To effectively measure and visualize the surface properties, we need to truly restore the 3D shape model from 2D SEM images. Having 3D surfaces would provide anatomic shape of micro-samples which allows for quantitative measurements and informative visualization of the specimens being investigated. The 3DSEM is a dataset for 3D microscopy vision which is freely available at [1] for any academic, educational, and research purposes. The dataset includes both 2D images and 3D reconstructed surfaces of several real microscopic samples. Keywords: 3D microscopy dataset, 3D microscopy vision, 3D SEM surface reconstruction, Scanning Electron Microscope (SEM

  2. Data Mining for Imbalanced Datasets: An Overview

    Science.gov (United States)

    Chawla, Nitesh V.

    A dataset is imbalanced if the classification categories are not approximately equally represented. Recent years brought increased interest in applying machine learning techniques to difficult "real-world" problems, many of which are characterized by imbalanced data. Additionally the distribution of the testing data may differ from that of the training data, and the true misclassification costs may be unknown at learning time. Predictive accuracy, a popular choice for evaluating performance of a classifier, might not be appropriate when the data is imbalanced and/or the costs of different errors vary markedly. In this Chapter, we discuss some of the sampling techniques used for balancing the datasets, and the performance measures more appropriate for mining imbalanced datasets.

  3. Genomics dataset of unidentified disclosed isolates

    Directory of Open Access Journals (Sweden)

    Bhagwan N. Rekadwad

    2016-09-01

    Full Text Available Analysis of DNA sequences is necessary for higher hierarchical classification of the organisms. It gives clues about the characteristics of organisms and their taxonomic position. This dataset is chosen to find complexities in the unidentified DNA in the disclosed patents. A total of 17 unidentified DNA sequences were thoroughly analyzed. The quick response codes were generated. AT/GC content of the DNA sequences analysis was carried out. The QR is helpful for quick identification of isolates. AT/GC content is helpful for studying their stability at different temperatures. Additionally, a dataset on cleavage code and enzyme code studied under the restriction digestion study, which helpful for performing studies using short DNA sequences was reported. The dataset disclosed here is the new revelatory data for exploration of unique DNA sequences for evaluation, identification, comparison and analysis. Keywords: BioLABs, Blunt ends, Genomics, NEB cutter, Restriction digestion, Short DNA sequences, Sticky ends

  4. Harvard Aging Brain Study: Dataset and accessibility.

    Science.gov (United States)

    Dagley, Alexander; LaPoint, Molly; Huijbers, Willem; Hedden, Trey; McLaren, Donald G; Chatwal, Jasmeer P; Papp, Kathryn V; Amariglio, Rebecca E; Blacker, Deborah; Rentz, Dorene M; Johnson, Keith A; Sperling, Reisa A; Schultz, Aaron P

    2017-01-01

    The Harvard Aging Brain Study is sharing its data with the global research community. The longitudinal dataset consists of a 284-subject cohort with the following modalities acquired: demographics, clinical assessment, comprehensive neuropsychological testing, clinical biomarkers, and neuroimaging. To promote more extensive analyses, imaging data was designed to be compatible with other publicly available datasets. A cloud-based system enables access to interested researchers with blinded data available contingent upon completion of a data usage agreement and administrative approval. Data collection is ongoing and currently in its fifth year. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. Las creencias y las concepciones. Perspectivas complementarias

    Directory of Open Access Journals (Sweden)

    Fuensanta HERNÁNDEZ PINA

    2011-01-01

    Full Text Available Las creencias y las concepciones sobre la enseñanza y el aprendizaje que los profesores sostienen como docentes es una línea de investigación que está suponiendo un avance en el conocimiento sobre factores relevantes para la mejor de la educación. Desde hace más de dos décadas han sido numerosos los investigadores que han venido proporcionando resultados de investigación en torno a las creencias y las concepciones de la enseñanza y el aprendizaje lo que ha supuesto establecer nuevas e interesantes interpretaciones en dicha relación. En el trabajo que presentamos se abordan algunas de las aportaciones sobre dichas creencias y sobre las concepciones de la enseñanza y el aprendizaje.

  6. Random Coefficient Logit Model for Large Datasets

    NARCIS (Netherlands)

    C. Hernández-Mireles (Carlos); D. Fok (Dennis)

    2010-01-01

    textabstractWe present an approach for analyzing market shares and products price elasticities based on large datasets containing aggregate sales data for many products, several markets and for relatively long time periods. We consider the recently proposed Bayesian approach of Jiang et al [Jiang,

  7. Thesaurus Dataset of Educational Technology in Chinese

    Science.gov (United States)

    Wu, Linjing; Liu, Qingtang; Zhao, Gang; Huang, Huan; Huang, Tao

    2015-01-01

    The thesaurus dataset of educational technology is a knowledge description of educational technology in Chinese. The aims of this thesaurus were to collect the subject terms in the domain of educational technology, facilitate the standardization of terminology and promote the communication between Chinese researchers and scholars from various…

  8. Sharing Video Datasets in Design Research

    DEFF Research Database (Denmark)

    Christensen, Bo; Abildgaard, Sille Julie Jøhnk

    2017-01-01

    This paper examines how design researchers, design practitioners and design education can benefit from sharing a dataset. We present the Design Thinking Research Symposium 11 (DTRS11) as an exemplary project that implied sharing video data of design processes and design activity in natural settings...... with a large group of fellow academics from the international community of Design Thinking Research, for the purpose of facilitating research collaboration and communication within the field of Design and Design Thinking. This approach emphasizes the social and collaborative aspects of design research, where...... a multitude of appropriate perspectives and methods may be utilized in analyzing and discussing the singular dataset. The shared data is, from this perspective, understood as a design object in itself, which facilitates new ways of working, collaborating, studying, learning and educating within the expanding...

  9. Automatic processing of multimodal tomography datasets.

    Science.gov (United States)

    Parsons, Aaron D; Price, Stephen W T; Wadeson, Nicola; Basham, Mark; Beale, Andrew M; Ashton, Alun W; Mosselmans, J Frederick W; Quinn, Paul D

    2017-01-01

    With the development of fourth-generation high-brightness synchrotrons on the horizon, the already large volume of data that will be collected on imaging and mapping beamlines is set to increase by orders of magnitude. As such, an easy and accessible way of dealing with such large datasets as quickly as possible is required in order to be able to address the core scientific problems during the experimental data collection. Savu is an accessible and flexible big data processing framework that is able to deal with both the variety and the volume of data of multimodal and multidimensional scientific datasets output such as those from chemical tomography experiments on the I18 microfocus scanning beamline at Diamond Light Source.

  10. Interpolation of diffusion weighted imaging datasets

    DEFF Research Database (Denmark)

    Dyrby, Tim B; Lundell, Henrik; Burke, Mark W

    2014-01-01

    anatomical details and signal-to-noise-ratio for reliable fibre reconstruction. We assessed the potential benefits of interpolating DWI datasets to a higher image resolution before fibre reconstruction using a diffusion tensor model. Simulations of straight and curved crossing tracts smaller than or equal......Diffusion weighted imaging (DWI) is used to study white-matter fibre organisation, orientation and structural connectivity by means of fibre reconstruction algorithms and tractography. For clinical settings, limited scan time compromises the possibilities to achieve high image resolution for finer...... interpolation methods fail to disentangle fine anatomical details if PVE is too pronounced in the original data. As for validation we used ex-vivo DWI datasets acquired at various image resolutions as well as Nissl-stained sections. Increasing the image resolution by a factor of eight yielded finer geometrical...

  11. Data assimilation and model evaluation experiment datasets

    Science.gov (United States)

    Lai, Chung-Cheng A.; Qian, Wen; Glenn, Scott M.

    1994-01-01

    The Institute for Naval Oceanography, in cooperation with Naval Research Laboratories and universities, executed the Data Assimilation and Model Evaluation Experiment (DAMEE) for the Gulf Stream region during fiscal years 1991-1993. Enormous effort has gone into the preparation of several high-quality and consistent datasets for model initialization and verification. This paper describes the preparation process, the temporal and spatial scopes, the contents, the structure, etc., of these datasets. The goal of DAMEE and the need of data for the four phases of experiment are briefly stated. The preparation of DAMEE datasets consisted of a series of processes: (1) collection of observational data; (2) analysis and interpretation; (3) interpolation using the Optimum Thermal Interpolation System package; (4) quality control and re-analysis; and (5) data archiving and software documentation. The data products from these processes included a time series of 3D fields of temperature and salinity, 2D fields of surface dynamic height and mixed-layer depth, analysis of the Gulf Stream and rings system, and bathythermograph profiles. To date, these are the most detailed and high-quality data for mesoscale ocean modeling, data assimilation, and forecasting research. Feedback from ocean modeling groups who tested this data was incorporated into its refinement. Suggestions for DAMEE data usages include (1) ocean modeling and data assimilation studies, (2) diagnosis and theoretical studies, and (3) comparisons with locally detailed observations.

  12. A hybrid organic-inorganic perovskite dataset

    Science.gov (United States)

    Kim, Chiho; Huan, Tran Doan; Krishnan, Sridevi; Ramprasad, Rampi

    2017-05-01

    Hybrid organic-inorganic perovskites (HOIPs) have been attracting a great deal of attention due to their versatility of electronic properties and fabrication methods. We prepare a dataset of 1,346 HOIPs, which features 16 organic cations, 3 group-IV cations and 4 halide anions. Using a combination of an atomic structure search method and density functional theory calculations, the optimized structures, the bandgap, the dielectric constant, and the relative energies of the HOIPs are uniformly prepared and validated by comparing with relevant experimental and/or theoretical data. We make the dataset available at Dryad Digital Repository, NoMaD Repository, and Khazana Repository (http://khazana.uconn.edu/), hoping that it could be useful for future data-mining efforts that can explore possible structure-property relationships and phenomenological models. Progressive extension of the dataset is expected as new organic cations become appropriate within the HOIP framework, and as additional properties are calculated for the new compounds found.

  13. Quantifying uncertainty in observational rainfall datasets

    Science.gov (United States)

    Lennard, Chris; Dosio, Alessandro; Nikulin, Grigory; Pinto, Izidine; Seid, Hussen

    2015-04-01

    The CO-ordinated Regional Downscaling Experiment (CORDEX) has to date seen the publication of at least ten journal papers that examine the African domain during 2012 and 2013. Five of these papers consider Africa generally (Nikulin et al. 2012, Kim et al. 2013, Hernandes-Dias et al. 2013, Laprise et al. 2013, Panitz et al. 2013) and five have regional foci: Tramblay et al. (2013) on Northern Africa, Mariotti et al. (2014) and Gbobaniyi el al. (2013) on West Africa, Endris et al. (2013) on East Africa and Kalagnoumou et al. (2013) on southern Africa. There also are a further three papers that the authors know about under review. These papers all use an observed rainfall and/or temperature data to evaluate/validate the regional model output and often proceed to assess projected changes in these variables due to climate change in the context of these observations. The most popular reference rainfall data used are the CRU, GPCP, GPCC, TRMM and UDEL datasets. However, as Kalagnoumou et al. (2013) point out there are many other rainfall datasets available for consideration, for example, CMORPH, FEWS, TAMSAT & RIANNAA, TAMORA and the WATCH & WATCH-DEI data. They, with others (Nikulin et al. 2012, Sylla et al. 2012) show that the observed datasets can have a very wide spread at a particular space-time coordinate. As more ground, space and reanalysis-based rainfall products become available, all which use different methods to produce precipitation data, the selection of reference data is becoming an important factor in model evaluation. A number of factors can contribute to a uncertainty in terms of the reliability and validity of the datasets such as radiance conversion algorithims, the quantity and quality of available station data, interpolation techniques and blending methods used to combine satellite and guage based products. However, to date no comprehensive study has been performed to evaluate the uncertainty in these observational datasets. We assess 18 gridded

  14. Development of a SPARK Training Dataset

    Energy Technology Data Exchange (ETDEWEB)

    Sayre, Amanda M. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Olson, Jarrod R. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

    2015-03-01

    In its first five years, the National Nuclear Security Administration’s (NNSA) Next Generation Safeguards Initiative (NGSI) sponsored more than 400 undergraduate, graduate, and post-doctoral students in internships and research positions (Wyse 2012). In the past seven years, the NGSI program has, and continues to produce a large body of scientific, technical, and policy work in targeted core safeguards capabilities and human capital development activities. Not only does the NGSI program carry out activities across multiple disciplines, but also across all U.S. Department of Energy (DOE)/NNSA locations in the United States. However, products are not readily shared among disciplines and across locations, nor are they archived in a comprehensive library. Rather, knowledge of NGSI-produced literature is localized to the researchers, clients, and internal laboratory/facility publication systems such as the Electronic Records and Information Capture Architecture (ERICA) at the Pacific Northwest National Laboratory (PNNL). There is also no incorporated way of analyzing existing NGSI literature to determine whether the larger NGSI program is achieving its core safeguards capabilities and activities. A complete library of NGSI literature could prove beneficial to a cohesive, sustainable, and more economical NGSI program. The Safeguards Platform for Automated Retrieval of Knowledge (SPARK) has been developed to be a knowledge storage, retrieval, and analysis capability to capture safeguards knowledge to exist beyond the lifespan of NGSI. During the development process, it was necessary to build a SPARK training dataset (a corpus of documents) for initial entry into the system and for demonstration purposes. We manipulated these data to gain new information about the breadth of NGSI publications, and they evaluated the science-policy interface at PNNL as a practical demonstration of SPARK’s intended analysis capability. The analysis demonstration sought to answer the

  15. Development of a SPARK Training Dataset

    International Nuclear Information System (INIS)

    Sayre, Amanda M.; Olson, Jarrod R.

    2015-01-01

    In its first five years, the National Nuclear Security Administration's (NNSA) Next Generation Safeguards Initiative (NGSI) sponsored more than 400 undergraduate, graduate, and post-doctoral students in internships and research positions (Wyse 2012). In the past seven years, the NGSI program has, and continues to produce a large body of scientific, technical, and policy work in targeted core safeguards capabilities and human capital development activities. Not only does the NGSI program carry out activities across multiple disciplines, but also across all U.S. Department of Energy (DOE)/NNSA locations in the United States. However, products are not readily shared among disciplines and across locations, nor are they archived in a comprehensive library. Rather, knowledge of NGSI-produced literature is localized to the researchers, clients, and internal laboratory/facility publication systems such as the Electronic Records and Information Capture Architecture (ERICA) at the Pacific Northwest National Laboratory (PNNL). There is also no incorporated way of analyzing existing NGSI literature to determine whether the larger NGSI program is achieving its core safeguards capabilities and activities. A complete library of NGSI literature could prove beneficial to a cohesive, sustainable, and more economical NGSI program. The Safeguards Platform for Automated Retrieval of Knowledge (SPARK) has been developed to be a knowledge storage, retrieval, and analysis capability to capture safeguards knowledge to exist beyond the lifespan of NGSI. During the development process, it was necessary to build a SPARK training dataset (a corpus of documents) for initial entry into the system and for demonstration purposes. We manipulated these data to gain new information about the breadth of NGSI publications, and they evaluated the science-policy interface at PNNL as a practical demonstration of SPARK's intended analysis capability. The analysis demonstration sought to answer

  16. Estudio de las Startups

    OpenAIRE

    Rodríguez Reina, Cristina

    2016-01-01

    Este trabajo trata de analizar las pautas a la hora de poner en marcha una startup. Se explica el método que hay que seguir (Lean Startup), se expone una breve biografía de su autor, se muestran las distintas fases por las que atraviesa, las formas de financiación, así como una serie de puntos que servirán de ayuda para adentrarse en este mundo tan novedoso y hasta entonces tan poco conocido, como es el de la startups. Dentro del ámbito de las startups, se explicará el tema ...

  17. Aborto en las adolescentes

    OpenAIRE

    Cedeño Donet, Marisel; García García, María T; Jímenez Mendeliú, Irene M

    2000-01-01

    Se realizó un estudio descriptivo y transversal sobre el comportamiento del aborto en la población adolescente del territorio occidental de la provincia de Camagüey, desde enero de 1997 hasta agosto de 1998. Se encontró que el 26, 2 % de las jóvenes se realizó un aborto, las regulaciones menstruales fueron el 47, 6 %, el 25 % de las sepsis posinterrupción correspondió a las adolescentes. Más de la cuarta parte de los abortos realizados en la Provincia corresponde a las menores de 20 años. Cas...

  18. Las versiones homericas Las versiones homericas

    Directory of Open Access Journals (Sweden)

    Jorge Luis Borges

    2008-04-01

    Full Text Available Ningún problema tan consustancial con las letras con su modesto misterio como el que propone una traducción. Un olvido animal por la vanidad, el tenor de confesar procesos mentales que adivinamos peligrosamente comunes, el conato de mantener intacta y central una reserve incalculable de sombre, velan las tales escrituras directas. La traducciOn, en cambio, parece destinada a ilustrar la discusión estitica. El modelo propuesto a su imitaciOn es un texto visible, no un labirint° inestimable de proyectos preteritos o la acatada tentaciOn momentanea de una facilidad. Bertrand Russell define un objeto extern( como un sistema circular, irradiante, de impresiones posibles; lo mismo puede aseverarse de un texto, dadas las repercusiones incalculables de lo verbal. Un parcial y precioso documento de las vicisitudes que sufre queda en sus traducciones.

  19. Developing a Data-Set for Stereopsis

    Directory of Open Access Journals (Sweden)

    D.W Hunter

    2014-08-01

    Full Text Available Current research on binocular stereopsis in humans and non-human primates has been limited by a lack of available data-sets. Current data-sets fall into two categories; stereo-image sets with vergence but no ranging information (Hibbard, 2008, Vision Research, 48(12, 1427-1439 or combinations of depth information with binocular images and video taken from cameras in fixed fronto-parallel configurations exhibiting neither vergence or focus effects (Hirschmuller & Scharstein, 2007, IEEE Conf. Computer Vision and Pattern Recognition. The techniques for generating depth information are also imperfect. Depth information is normally inaccurate or simply missing near edges and on partially occluded surfaces. For many areas of vision research these are the most interesting parts of the image (Goutcher, Hunter, Hibbard, 2013, i-Perception, 4(7, 484; Scarfe & Hibbard, 2013, Vision Research. Using state-of-the-art open-source ray-tracing software (PBRT as a back-end, our intention is to release a set of tools that will allow researchers in this field to generate artificial binocular stereoscopic data-sets. Although not as realistic as photographs, computer generated images have significant advantages in terms of control over the final output and ground-truth information about scene depth is easily calculated at all points in the scene, even partially occluded areas. While individual researchers have been developing similar stimuli by hand for many decades, we hope that our software will greatly reduce the time and difficulty of creating naturalistic binocular stimuli. Our intension in making this presentation is to elicit feedback from the vision community about what sort of features would be desirable in such software.

  20. Las lenguas en las sociedades del conocimiento

    Directory of Open Access Journals (Sweden)

    Álvarez, J. Francisco

    2008-12-01

    Full Text Available Languages become a strategic resource for information and knowledge societies. By expressing and sharing knowledge by means of languages, every culture generates deposits of knowledge, which can be transferred and exchanged among different epistemic communities. The contemporary technologies of information and communication have changed the structure of knowledge flows. Globalization of knowledge poses a great challenge to every language, including the Spanish one. In order to survive in the digital world, they should develop technolanguages. The lemma “Thinking in Spanish” implies a new model of governance for the Ibero-American knowledge communities.En las sociedades de la información y el conocimiento las lenguas se convierten en un recurso estratégico. Al expresar y compartir conocimiento por medio de los idiomas, las culturas generan yacimientos de conocimiento, que pueden ser transferidos e intercambiados entre comunidades epistémicas diferentes. Las actuales tecnologías de la información y la comunicación han cambiado la estructura de los flujos de conocimiento. La globalización del conocimiento plantea un gran desafío a todas las lenguas, incluyendo la española. Para sobrevivir en el mundo digital, los lenguajes han de convertirse en tecnolenguajes. El lema “Pensar en español” aporta un nuevo modelo de gobernanza para las comunidades iberoamericanas del conocimiento.

  1. Quality Controlling CMIP datasets at GFDL

    Science.gov (United States)

    Horowitz, L. W.; Radhakrishnan, A.; Balaji, V.; Adcroft, A.; Krasting, J. P.; Nikonov, S.; Mason, E. E.; Schweitzer, R.; Nadeau, D.

    2017-12-01

    As GFDL makes the switch from model development to production in light of the Climate Model Intercomparison Project (CMIP), GFDL's efforts are shifted to testing and more importantly establishing guidelines and protocols for Quality Controlling and semi-automated data publishing. Every CMIP cycle introduces key challenges and the upcoming CMIP6 is no exception. The new CMIP experimental design comprises of multiple MIPs facilitating research in different focus areas. This paradigm has implications not only for the groups that develop the models and conduct the runs, but also for the groups that monitor, analyze and quality control the datasets before data publishing, before their knowledge makes its way into reports like the IPCC (Intergovernmental Panel on Climate Change) Assessment Reports. In this talk, we discuss some of the paths taken at GFDL to quality control the CMIP-ready datasets including: Jupyter notebooks, PrePARE, LAMP (Linux, Apache, MySQL, PHP/Python/Perl): technology-driven tracker system to monitor the status of experiments qualitatively and quantitatively, provide additional metadata and analysis services along with some in-built controlled-vocabulary validations in the workflow. In addition to this, we also discuss the integration of community-based model evaluation software (ESMValTool, PCMDI Metrics Package, and ILAMB) as part of our CMIP6 workflow.

  2. Integrated remotely sensed datasets for disaster management

    Science.gov (United States)

    McCarthy, Timothy; Farrell, Ronan; Curtis, Andrew; Fotheringham, A. Stewart

    2008-10-01

    Video imagery can be acquired from aerial, terrestrial and marine based platforms and has been exploited for a range of remote sensing applications over the past two decades. Examples include coastal surveys using aerial video, routecorridor infrastructures surveys using vehicle mounted video cameras, aerial surveys over forestry and agriculture, underwater habitat mapping and disaster management. Many of these video systems are based on interlaced, television standards such as North America's NTSC and European SECAM and PAL television systems that are then recorded using various video formats. This technology has recently being employed as a front-line, remote sensing technology for damage assessment post-disaster. This paper traces the development of spatial video as a remote sensing tool from the early 1980s to the present day. The background to a new spatial-video research initiative based at National University of Ireland, Maynooth, (NUIM) is described. New improvements are proposed and include; low-cost encoders, easy to use software decoders, timing issues and interoperability. These developments will enable specialists and non-specialists collect, process and integrate these datasets within minimal support. This integrated approach will enable decision makers to access relevant remotely sensed datasets quickly and so, carry out rapid damage assessment during and post-disaster.

  3. Strontium removal jar test dataset for all figures and tables.

    Data.gov (United States)

    U.S. Environmental Protection Agency — The datasets where used to generate data to demonstrate strontium removal under various water quality and treatment conditions. This dataset is associated with the...

  4. Las ondas en las universidades o las universidades en las ondas

    Directory of Open Access Journals (Sweden)

    Verónica Marín Díaz

    2014-01-01

    Full Text Available En las aulas y pasillos de las universidades vuelven a sonar las ondas hercianas que raen una variedad de programas a la anodina vida de la comunidad universitaria, inmersa en la implantación de títulos de Grado, nuevos programas de doctorado, legislación universitaria variable y un largo etcétera que afecta al devenir de la vida en el ámbito de la educación superior.

  5. Predicting dataset popularity for the CMS experiment

    CERN Document Server

    INSPIRE-00005122; Li, Ting; Giommi, Luca; Bonacorsi, Daniele; Wildish, Tony

    2016-01-01

    The CMS experiment at the LHC accelerator at CERN relies on its computing infrastructure to stay at the frontier of High Energy Physics, searching for new phenomena and making discoveries. Even though computing plays a significant role in physics analysis we rarely use its data to predict the system behavior itself. A basic information about computing resources, user activities and site utilization can be really useful for improving the throughput of the system and its management. In this paper, we discuss a first CMS analysis of dataset popularity based on CMS meta-data which can be used as a model for dynamic data placement and provide the foundation of data-driven approach for the CMS computing infrastructure.

  6. Predicting dataset popularity for the CMS experiment

    International Nuclear Information System (INIS)

    Kuznetsov, V.; Li, T.; Giommi, L.; Bonacorsi, D.; Wildish, T.

    2016-01-01

    The CMS experiment at the LHC accelerator at CERN relies on its computing infrastructure to stay at the frontier of High Energy Physics, searching for new phenomena and making discoveries. Even though computing plays a significant role in physics analysis we rarely use its data to predict the system behavior itself. A basic information about computing resources, user activities and site utilization can be really useful for improving the throughput of the system and its management. In this paper, we discuss a first CMS analysis of dataset popularity based on CMS meta-data which can be used as a model for dynamic data placement and provide the foundation of data-driven approach for the CMS computing infrastructure. (paper)

  7. Internationally coordinated glacier monitoring: strategy and datasets

    Science.gov (United States)

    Hoelzle, Martin; Armstrong, Richard; Fetterer, Florence; Gärtner-Roer, Isabelle; Haeberli, Wilfried; Kääb, Andreas; Kargel, Jeff; Nussbaumer, Samuel; Paul, Frank; Raup, Bruce; Zemp, Michael

    2014-05-01

    (c) the Randolph Glacier Inventory (RGI), a new and globally complete digital dataset of outlines from about 180,000 glaciers with some meta-information, which has been used for many applications relating to the IPCC AR5 report. Concerning glacier changes, a database (Fluctuations of Glaciers) exists containing information about mass balance, front variations including past reconstructed time series, geodetic changes and special events. Annual mass balance reporting contains information for about 125 glaciers with a subset of 37 glaciers with continuous observational series since 1980 or earlier. Front variation observations of around 1800 glaciers are available from most of the mountain ranges world-wide. This database was recently updated with 26 glaciers having an unprecedented dataset of length changes from from reconstructions of well-dated historical evidence going back as far as the 16th century. Geodetic observations of about 430 glaciers are available. The database is completed by a dataset containing information on special events including glacier surges, glacier lake outbursts, ice avalanches, eruptions of ice-clad volcanoes, etc. related to about 200 glaciers. A special database of glacier photographs contains 13,000 pictures from around 500 glaciers, some of them dating back to the 19th century. A key challenge is to combine and extend the traditional observations with fast evolving datasets from new technologies.

  8. MIPS bacterial genomes functional annotation benchmark dataset.

    Science.gov (United States)

    Tetko, Igor V; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Fobo, Gisela; Ruepp, Andreas; Antonov, Alexey V; Surmeli, Dimitrij; Mewes, Hans-Wernen

    2005-05-15

    Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. BFAB is available at http://mips.gsf.de/proj/bfab

  9. 2006 Fynmeet sea clutter measurement trial: Datasets

    CSIR Research Space (South Africa)

    Herselman, PLR

    2007-09-06

    Full Text Available -011............................................................................................................................................................................................. 25 iii Dataset CAD14-001 0 5 10 15 20 25 30 35 10 20 30 40 50 60 70 80 90 R an ge G at e # Time [s] A bs ol ut e R an ge [m ] RCS [dBm2] vs. time and range for f1 = 9.000 GHz - CAD14-001 2400 2600 2800... 40 10 20 30 40 50 60 70 80 90 R an ge G at e # Time [s] A bs ol ut e R an ge [m ] RCS [dBm2] vs. time and range for f1 = 9.000 GHz - CAD14-002 2400 2600 2800 3000 3200 3400 3600 -30 -25 -20 -15 -10 -5 0 5 10...

  10. Las incubadoras de las empresas de Quebec

    Directory of Open Access Journals (Sweden)

    Bernardo Parra Restrepo

    1993-01-01

    Full Text Available Este articulo contiene información de las incubadoras de empresas especialmente aquellas encontradas en Quebec. A través de los 80's, el concepto de incubadoras de empresas fue introducido en América por expertos en el desarrollo económico. Sus metas fueron el estudio de los efectos del desempleo resultante de la quiebra de plantas industriales o del uso del proceso de producción automático por parte de las grandes compañías. Además se esperaba crear empleo para los inmigrantes. El campo más prometedor de estas experiencias se encuentra en el desarrollo de alta tecnología industrial, especialmente en electrónica, microprocesadores y software para sistemas de computación. Esta área también cubre las industrias innovadoras.

  11. A new bed elevation dataset for Greenland

    Directory of Open Access Journals (Sweden)

    J. L. Bamber

    2013-03-01

    Full Text Available We present a new bed elevation dataset for Greenland derived from a combination of multiple airborne ice thickness surveys undertaken between the 1970s and 2012. Around 420 000 line kilometres of airborne data were used, with roughly 70% of this having been collected since the year 2000, when the last comprehensive compilation was undertaken. The airborne data were combined with satellite-derived elevations for non-glaciated terrain to produce a consistent bed digital elevation model (DEM over the entire island including across the glaciated–ice free boundary. The DEM was extended to the continental margin with the aid of bathymetric data, primarily from a compilation for the Arctic. Ice thickness was determined where an ice shelf exists from a combination of surface elevation and radar soundings. The across-track spacing between flight lines warranted interpolation at 1 km postings for significant sectors of the ice sheet. Grids of ice surface elevation, error estimates for the DEM, ice thickness and data sampling density were also produced alongside a mask of land/ocean/grounded ice/floating ice. Errors in bed elevation range from a minimum of ±10 m to about ±300 m, as a function of distance from an observation and local topographic variability. A comparison with the compilation published in 2001 highlights the improvement in resolution afforded by the new datasets, particularly along the ice sheet margin, where ice velocity is highest and changes in ice dynamics most marked. We estimate that the volume of ice included in our land-ice mask would raise mean sea level by 7.36 m, excluding any solid earth effects that would take place during ice sheet decay.

  12. Las serpientes en colombia

    OpenAIRE

    Daniel, H.

    2012-01-01

    El estudio de las serpientes en Colombia no deja de tener su particular interés; existe una inmensa variedad de especies que se escalonan desde los valles ardientes hasta alturas relativamente considerables; en estos últimos puntos disminuye notablemente la variedad de formas y de modo especial las especies provistas de veneno. Hacer una distinción entre las especies venenosas y las inofensivas, resulta un tanto difícil; en algunos textos se dan unas cuántas reglas, pero la mayor parte d...

  13. Wind Integration National Dataset Toolkit | Grid Modernization | NREL

    Science.gov (United States)

    Integration National Dataset Toolkit Wind Integration National Dataset Toolkit The Wind Integration National Dataset (WIND) Toolkit is an update and expansion of the Eastern Wind Integration Data Set and Western Wind Integration Data Set. It supports the next generation of wind integration studies. WIND

  14. Solar Integration National Dataset Toolkit | Grid Modernization | NREL

    Science.gov (United States)

    Solar Integration National Dataset Toolkit Solar Integration National Dataset Toolkit NREL is working on a Solar Integration National Dataset (SIND) Toolkit to enable researchers to perform U.S . regional solar generation integration studies. It will provide modeled, coherent subhourly solar power data

  15. Technical note: An inorganic water chemistry dataset (1972–2011 ...

    African Journals Online (AJOL)

    A national dataset of inorganic chemical data of surface waters (rivers, lakes, and dams) in South Africa is presented and made freely available. The dataset comprises more than 500 000 complete water analyses from 1972 up to 2011, collected from more than 2 000 sample monitoring stations in South Africa. The dataset ...

  16. QSAR ligand dataset for modelling mutagenicity, genotoxicity, and rodent carcinogenicity

    Directory of Open Access Journals (Sweden)

    Davy Guan

    2018-04-01

    Full Text Available Five datasets were constructed from ligand and bioassay result data from the literature. These datasets include bioassay results from the Ames mutagenicity assay, Greenscreen GADD-45a-GFP assay, Syrian Hamster Embryo (SHE assay, and 2 year rat carcinogenicity assay results. These datasets provide information about chemical mutagenicity, genotoxicity and carcinogenicity.

  17. Las ruinas y las sombras de Manderley

    Directory of Open Access Journals (Sweden)

    Donapetry Camacho, María

    2010-02-01

    Full Text Available The ruins and the shadows of Manderley explores possible reciprocal interpretative relations between Daphne du Maurier’s novel Rebecca and Alfred Hitchcock’s cinematic version. Some of the relevant aspects considered here are: the treatment of the Derridean “ruins of memory” within the novel and within Hitchcock himself, the differentiation and the confrontation of the protagonist as outsider vis á vis the world of Manderley’s insiders, the iconic and symbolic values of the literary text and its filmic visualization, and the ethical positions implied by the choice or rejection of certain narrative devices in both works.

    Las ruinas y las sombras de Manderley aborda posibles relaciones interpretativas recíprocas entre la novela de Daphne du Maurier Rebecca y la película homónima de Alfred Hitchcock. Algunos de los aspectos que se consideran relevantes en este estudio son: el tratamiento de las “ruinas de la memoria” derridianas dentro de la novela y del propio Hitchcock, la diferenciación y el enfrentamiento de la protagonista “externa” con el mundo de los “internos” de Manderley, los valores icónicos y simbólicos del texto literario y de la visualización fílmica, y las posiciones éticas que implican la elección o el rechazo de ciertos recursos narrativos en ambas obras.

  18. Haciendo resonar las voces

    DEFF Research Database (Denmark)

    Tufte, Thomas; Corrigan, Arran; Ekstrøm, Ylva

    2011-01-01

    sexualidad protegida y el VIH/SIDA”. FEMINA persigue sus objetivos por medio de la producción de dos de las revistas de mayor circulación en Tanzania: SiMchezo! y Fema, apuntando a la juventud rural y urbana mediante una estrategia de intervención comunicacional multimedia y participativa. Las dos revistas...

  19. Statistical segmentation of multidimensional brain datasets

    Science.gov (United States)

    Desco, Manuel; Gispert, Juan D.; Reig, Santiago; Santos, Andres; Pascau, Javier; Malpica, Norberto; Garcia-Barreno, Pedro

    2001-07-01

    This paper presents an automatic segmentation procedure for MRI neuroimages that overcomes part of the problems involved in multidimensional clustering techniques like partial volume effects (PVE), processing speed and difficulty of incorporating a priori knowledge. The method is a three-stage procedure: 1) Exclusion of background and skull voxels using threshold-based region growing techniques with fully automated seed selection. 2) Expectation Maximization algorithms are used to estimate the probability density function (PDF) of the remaining pixels, which are assumed to be mixtures of gaussians. These pixels can then be classified into cerebrospinal fluid (CSF), white matter and grey matter. Using this procedure, our method takes advantage of using the full covariance matrix (instead of the diagonal) for the joint PDF estimation. On the other hand, logistic discrimination techniques are more robust against violation of multi-gaussian assumptions. 3) A priori knowledge is added using Markov Random Field techniques. The algorithm has been tested with a dataset of 30 brain MRI studies (co-registered T1 and T2 MRI). Our method was compared with clustering techniques and with template-based statistical segmentation, using manual segmentation as a gold-standard. Our results were more robust and closer to the gold-standard.

  20. ASSESSING SMALL SAMPLE WAR-GAMING DATASETS

    Directory of Open Access Journals (Sweden)

    W. J. HURLEY

    2013-10-01

    Full Text Available One of the fundamental problems faced by military planners is the assessment of changes to force structure. An example is whether to replace an existing capability with an enhanced system. This can be done directly with a comparison of measures such as accuracy, lethality, survivability, etc. However this approach does not allow an assessment of the force multiplier effects of the proposed change. To gauge these effects, planners often turn to war-gaming. For many war-gaming experiments, it is expensive, both in terms of time and dollars, to generate a large number of sample observations. This puts a premium on the statistical methodology used to examine these small datasets. In this paper we compare the power of three tests to assess population differences: the Wald-Wolfowitz test, the Mann-Whitney U test, and re-sampling. We employ a series of Monte Carlo simulation experiments. Not unexpectedly, we find that the Mann-Whitney test performs better than the Wald-Wolfowitz test. Resampling is judged to perform slightly better than the Mann-Whitney test.

  1. en las relaciones entrenadas

    Directory of Open Access Journals (Sweden)

    Camilo Hurtado P.

    2005-01-01

    Full Text Available El presente estudio exploró los efectos en la adquisición y transferencia de discriminaciones condicionales debidos a las diferentes combinaciones de relaciones entrenadas (identidad-diferencia-semejanza en una tarea de igualación a la muestra de segundo orden. De manera complementaria, fueron además dispuestas en la tarea las características que en estudios relacionados habían sido reportadas como facilitadoras de ejecuciones efectivas, a saber: el uso de instrucciones inespecíficas, arreglos de estímulo de tres formas y dos colores y el entrenamiento concurrente de las relaciones. Ocho sujetos fueron distribuidos en cuatro grupos que variaron en la combinación de relaciones a entrenar. Los resultados mostraron que gran parte de los sujetos cumplieron con el criterio de efectividad en entrenamiento y en transferencia extramodal y extrarrelacional, siendo la relación de identidad la relación en la cual se presentaron ejecuciones perfectas, al margen de si ésta había sido entrenada o no. Se discuten los resultados en términos de velocidad de ajuste a las relaciones y de las configuraciones de la tarea que facilitaron mejores desempeños. Se proponen, además, posibles investigaciones que superen las limitaciones metodológicas encontradas en este estudio y que exploren posibles interacciones de las variables manipuladas con otras variables de interés.

  2. Las neurosis actuales y las psicosis ordinarias

    OpenAIRE

    Aguirre, Javier

    2013-01-01

    El trabajo tiene por finalidad poner en tensión las categorías de psicosis ordinaria y neurosis actuales. En primer lugar, se procede a examinar ambas categorías para luego establecer sus puntos de coincidencia y disidencia. Se concluye que la categoría de neurosis actual propuesta por Freud, es una posible expresión de lo que en la actualidad se llama psicosis ordinaria.

  3. Historia de las series

    OpenAIRE

    Cascajosa Virino, Concepción

    2017-01-01

    Reseña: Historia de las series de Toni de la Torre: la gran impostura Durante mucho tiempo los profesores de cine en España se han quejado (amargamente y en privado) de la recurrencia en la bibliografía de los trabajos de sus estudiantes de los libros de un autor conocido por la escasa calidad de sus textos, cuyos vistosos títulos garantizan que acaben en las estanterías de las bibliotecas universitarias. Es muy propio de la cultura española hacer duras aseveraciones en foros irrelevantes y, ...

  4. Las Serpientes en Colombia

    Directory of Open Access Journals (Sweden)

    Daniel H.

    1949-12-01

    Full Text Available El estudio de las serpientes en Colombia no deja de tener su particular interés; existe una inmensa variedad de especies que se escalonan desde los valles ardientes hasta alturas relativamente considerables; en estos últimos puntos disminuye notablemente la variedad de formas y de modo especial las especies provistas de veneno. Hacer una distinción entre las especies venenosas y las inofensivas, resulta un tanto difícil; en algunos textos se dan unas cuántas reglas, pero la mayor parte de ellas sólo tienen aplicación con especies exóticas ya que la mayor parte de esas distinciones se han tomado de obras que hacen referencia a la fauna europea.

  5. The Dataset of Countries at Risk of Electoral Violence

    OpenAIRE

    Birch, Sarah; Muchlinski, David

    2017-01-01

    Electoral violence is increasingly affecting elections around the world, yet researchers have been limited by a paucity of granular data on this phenomenon. This paper introduces and describes a new dataset of electoral violence – the Dataset of Countries at Risk of Electoral Violence (CREV) – that provides measures of 10 different types of electoral violence across 642 elections held around the globe between 1995 and 2013. The paper provides a detailed account of how and why the dataset was ...

  6. Norwegian Hydrological Reference Dataset for Climate Change Studies

    Energy Technology Data Exchange (ETDEWEB)

    Magnussen, Inger Helene; Killingland, Magnus; Spilde, Dag

    2012-07-01

    Based on the Norwegian hydrological measurement network, NVE has selected a Hydrological Reference Dataset for studies of hydrological change. The dataset meets international standards with high data quality. It is suitable for monitoring and studying the effects of climate change on the hydrosphere and cryosphere in Norway. The dataset includes streamflow, groundwater, snow, glacier mass balance and length change, lake ice and water temperature in rivers and lakes.(Author)

  7. Public Availability to ECS Collected Datasets

    Science.gov (United States)

    Henderson, J. F.; Warnken, R.; McLean, S. J.; Lim, E.; Varner, J. D.

    2013-12-01

    Coastal nations have spent considerable resources exploring the limits of their extended continental shelf (ECS) beyond 200 nm. Although these studies are funded to fulfill requirements of the UN Convention on the Law of the Sea, the investments are producing new data sets in frontier areas of Earth's oceans that will be used to understand, explore, and manage the seafloor and sub-seafloor for decades to come. Although many of these datasets are considered proprietary until a nation's potential ECS has become 'final and binding' an increasing amount of data are being released and utilized by the public. Data sets include multibeam, seismic reflection/refraction, bottom sampling, and geophysical data. The U.S. ECS Project, a multi-agency collaboration whose mission is to establish the full extent of the continental shelf of the United States consistent with international law, relies heavily on data and accurate, standard metadata. The United States has made it a priority to make available to the public all data collected with ECS-funding as quickly as possible. The National Oceanic and Atmospheric Administration's (NOAA) National Geophysical Data Center (NGDC) supports this objective by partnering with academia and other federal government mapping agencies to archive, inventory, and deliver marine mapping data in a coordinated, consistent manner. This includes ensuring quality, standard metadata and developing and maintaining data delivery capabilities built on modern digital data archives. Other countries, such as Ireland, have submitted their ECS data for public availability and many others have made pledges to participate in the future. The data services provided by NGDC support the U.S. ECS effort as well as many developing nation's ECS effort through the U.N. Environmental Program. Modern discovery, visualization, and delivery of scientific data and derived products that span national and international sources of data ensure the greatest re-use of data and

  8. BIA Indian Lands Dataset (Indian Lands of the United States)

    Data.gov (United States)

    Federal Geographic Data Committee — The American Indian Reservations / Federally Recognized Tribal Entities dataset depicts feature location, selected demographics and other associated data for the 561...

  9. Framework for Interactive Parallel Dataset Analysis on the Grid

    Energy Technology Data Exchange (ETDEWEB)

    Alexander, David A.; Ananthan, Balamurali; /Tech-X Corp.; Johnson, Tony; Serbo, Victor; /SLAC

    2007-01-10

    We present a framework for use at a typical Grid site to facilitate custom interactive parallel dataset analysis targeting terabyte-scale datasets of the type typically produced by large multi-institutional science experiments. We summarize the needs for interactive analysis and show a prototype solution that satisfies those needs. The solution consists of desktop client tool and a set of Web Services that allow scientists to sign onto a Grid site, compose analysis script code to carry out physics analysis on datasets, distribute the code and datasets to worker nodes, collect the results back to the client, and to construct professional-quality visualizations of the results.

  10. Socioeconomic Data and Applications Center (SEDAC) Treaty Status Dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — The Socioeconomic Data and Application Center (SEDAC) Treaty Status Dataset contains comprehensive treaty information for multilateral environmental agreements,...

  11. An Analysis of the GTZAN Music Genre Dataset

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2012-01-01

    Most research in automatic music genre recognition has used the dataset assembled by Tzanetakis et al. in 2001. The composition and integrity of this dataset, however, has never been formally analyzed. For the first time, we provide an analysis of its composition, and create a machine...

  12. Really big data: Processing and analysis of large datasets

    Science.gov (United States)

    Modern animal breeding datasets are large and getting larger, due in part to the recent availability of DNA data for many animals. Computational methods for efficiently storing and analyzing those data are under development. The amount of storage space required for such datasets is increasing rapidl...

  13. An Annotated Dataset of 14 Cardiac MR Images

    DEFF Research Database (Denmark)

    Stegmann, Mikkel Bille

    2002-01-01

    This note describes a dataset consisting of 14 annotated cardiac MR images. Points of correspondence are placed on each image at the left ventricle (LV). As such, the dataset can be readily used for building statistical models of shape. Further, format specifications and terms of use are given....

  14. A New Outlier Detection Method for Multidimensional Datasets

    KAUST Repository

    Abdel Messih, Mario A.

    2012-07-01

    This study develops a novel hybrid method for outlier detection (HMOD) that combines the idea of distance based and density based methods. The proposed method has two main advantages over most of the other outlier detection methods. The first advantage is that it works well on both dense and sparse datasets. The second advantage is that, unlike most other outlier detection methods that require careful parameter setting and prior knowledge of the data, HMOD is not very sensitive to small changes in parameter values within certain parameter ranges. The only required parameter to set is the number of nearest neighbors. In addition, we made a fully parallelized implementation of HMOD that made it very efficient in applications. Moreover, we proposed a new way of using the outlier detection for redundancy reduction in datasets where the confidence level that evaluates how accurate the less redundant dataset can be used to represent the original dataset can be specified by users. HMOD is evaluated on synthetic datasets (dense and mixed “dense and sparse”) and a bioinformatics problem of redundancy reduction of dataset of position weight matrices (PWMs) of transcription factor binding sites. In addition, in the process of assessing the performance of our redundancy reduction method, we developed a simple tool that can be used to evaluate the confidence level of reduced dataset representing the original dataset. The evaluation of the results shows that our method can be used in a wide range of problems.

  15. ATLAS File and Dataset Metadata Collection and Use

    CERN Document Server

    Albrand, S; The ATLAS collaboration; Lambert, F; Gallas, E J

    2012-01-01

    The ATLAS Metadata Interface (“AMI”) was designed as a generic cataloguing system, and as such it has found many uses in the experiment including software release management, tracking of reconstructed event sizes and control of dataset nomenclature. The primary use of AMI is to provide a catalogue of datasets (file collections) which is searchable using physics criteria. In this paper we discuss the various mechanisms used for filling the AMI dataset and file catalogues. By correlating information from different sources we can derive aggregate information which is important for physics analysis; for example the total number of events contained in dataset, and possible reasons for missing events such as a lost file. Finally we will describe some specialized interfaces which were developed for the Data Preparation and reprocessing coordinators. These interfaces manipulate information from both the dataset domain held in AMI, and the run-indexed information held in the ATLAS COMA application (Conditions and ...

  16. A dataset on tail risk of commodities markets.

    Science.gov (United States)

    Powell, Robert J; Vo, Duc H; Pham, Thach N; Singh, Abhay K

    2017-12-01

    This article contains the datasets related to the research article "The long and short of commodity tails and their relationship to Asian equity markets"(Powell et al., 2017) [1]. The datasets contain the daily prices (and price movements) of 24 different commodities decomposed from the S&P GSCI index and the daily prices (and price movements) of three share market indices including World, Asia, and South East Asia for the period 2004-2015. Then, the dataset is divided into annual periods, showing the worst 5% of price movements for each year. The datasets are convenient to examine the tail risk of different commodities as measured by Conditional Value at Risk (CVaR) as well as their changes over periods. The datasets can also be used to investigate the association between commodity markets and share markets.

  17. Internet de las cosas

    OpenAIRE

    Salazar Soler, Jorge; Silvestre Bergés, Santiago

    2016-01-01

    Este es un curso de introducción a la IoT (Internet de las cosas). En los capítulos primeros capítulos se introducen los conceptos básicos sobre la IoT. Seguidamente se presentan nociones básicas sobre el protocolo de internet IPv6 que es el más utilizado en el entorno de la IoT y se describen las principales aplicaciones, el estado actual del mercado y las tecnologías que permiten la existencia de la IoT. Finalmente se analizan los retos de futuro que se consideran más importa...

  18. Las sirenas de Sarhua

    Directory of Open Access Journals (Sweden)

    Luis Millones

    2011-05-01

    Full Text Available El presente artículo es producto de una investigación sobre uno de los motivos más llamativos y recurrentes en las Tablas de Sarhua: la sirena.Realizamos una aproximación a este motivo, que originalmente aparece en la literatura clásica occidental, con la finalidad de explicar las razones de su presencia en el arte andino. Para tal efecto transcribimos algunos testimoniosque, desde una cosmovisión netamente andina, dan cuenta de una serie de creencias vinculadas con el ámbito musical, en el que las sirenas juegan un papel central.Palabras claves: Tablas de Sarhua, sirena, arte andino, música andina.

  19. Tratamiento de las ictericias

    Directory of Open Access Journals (Sweden)

    Roberto de Zubiría

    1955-05-01

    Full Text Available En el tratamiento de los síndromes ictéricos no existe sistematización de criterios y en más de una ocasión se cometen serios errores terapéuticos. Tal sucede frecuentemente con las hepatitis a virus, por ejemplo las que tratadas con un régimen alimenticio inadecuado, constituído casi siempre por dietas de hambre, se va a complicar más la situación. Otras veces se utilizan sistemáticamente los antibióticos de amplio espectro, los cuales son inútiles e inclusive desaconsejados por la posibilidad de producir fibrosis hepática. Hemos querido hacer un resumen analizando someramente el pro y el contra de las principales medidas terapéuticas.

  20. Learning from las Cuencas

    DEFF Research Database (Denmark)

    2016-01-01

    Aprendiendo de las Cuencas (Learning from las Cuencas) research project, awarded with the EU Prize for Cultural Heritage / Europa Nostra Awards 2015, provides a new perspective of industrial cultural landscapes. Despite being focused on a local environment, the coal mining area of the Cuencas...... economic and social structures came to host busy urban agglomerations of unexpected density in their very heart. This heretical urban configuration is equally the result of the impact of economic interests in a specific space and in a relatively short lapse of time. Consequently, the co...

  1. Analizar las interacciones virtuales

    Directory of Open Access Journals (Sweden)

    Luis Gómez Encinas

    2003-11-01

    Full Text Available El propósito de este artículo es abordar una serie de cuestiones relacionadas con las interacciones virtuales. En concreto, nos centraremos en el IRC, los comúnmente llamados chats, para a partir de una crítica a ciertos posicionamientos analíticos, expresar la necesidad de indagar en esa faceta del ciberespacio desde el punto de vista de las relaciones sociales. Nuevas relaciones sociales, que acontecen en un espacio nuevo, y que por tanto se desarrollan bajo reglas y motivaciones que desbordan los viejos esquemas de observación.

  2. Las causas del desempleo

    OpenAIRE

    García Balbás, Salomé

    2014-01-01

    En este documento vamos a tratar de dar respuesta a la siguiente pregunta: ¿cuáles son las causas del desempleo? Para ello realizaremos un repaso de las principales teorías económicas del desempleo llegando a la conclusión de que existe una causa fundamental que lo genera: la rigidez de los salarios a la baja. Consideramos el desempleo como la existencia de un exceso de oferta en el mercado de trabajo y haremos alusión, únicamente, a la existencia de desempleo involuntario. La exposición de l...

  3. Aprendiendo de las Cuencas

    DEFF Research Database (Denmark)

    2016-01-01

    Aprendiendo de las Cuencas (Learning from las Cuencas) research project, awarded with the EU Prize for Cultural Heritage / Europa Nostra Awards 2015, provides a new perspective of industrial cultural landscapes. Despite being focused on a local environment, the coal mining area of the Cuencas....... This has given rise to incredibly heterodox building patterns that contain the conflict on which they are erected embedded in their genetic code. These are hybrid architectures, mutating artefacts which, despite the invisibility of their inevitable marginality, can offer really interesting lessons...

  4. Discovery and Reuse of Open Datasets: An Exploratory Study

    Directory of Open Access Journals (Sweden)

    Sara

    2016-07-01

    Full Text Available Objective: This article analyzes twenty cited or downloaded datasets and the repositories that house them, in order to produce insights that can be used by academic libraries to encourage discovery and reuse of research data in institutional repositories. Methods: Using Thomson Reuters’ Data Citation Index and repository download statistics, we identified twenty cited/downloaded datasets. We documented the characteristics of the cited/downloaded datasets and their corresponding repositories in a self-designed rubric. The rubric includes six major categories: basic information; funding agency and journal information; linking and sharing; factors to encourage reuse; repository characteristics; and data description. Results: Our small-scale study suggests that cited/downloaded datasets generally comply with basic recommendations for facilitating reuse: data are documented well; formatted for use with a variety of software; and shared in established, open access repositories. Three significant factors also appear to contribute to dataset discovery: publishing in discipline-specific repositories; indexing in more than one location on the web; and using persistent identifiers. The cited/downloaded datasets in our analysis came from a few specific disciplines, and tended to be funded by agencies with data publication mandates. Conclusions: The results of this exploratory research provide insights that can inform academic librarians as they work to encourage discovery and reuse of institutional datasets. Our analysis also suggests areas in which academic librarians can target open data advocacy in their communities in order to begin to build open data success stories that will fuel future advocacy efforts.

  5. Viability of Controlling Prosthetic Hand Utilizing Electroencephalograph (EEG) Dataset Signal

    Science.gov (United States)

    Miskon, Azizi; A/L Thanakodi, Suresh; Raihan Mazlan, Mohd; Mohd Haziq Azhar, Satria; Nooraya Mohd Tawil, Siti

    2016-11-01

    This project presents the development of an artificial hand controlled by Electroencephalograph (EEG) signal datasets for the prosthetic application. The EEG signal datasets were used as to improvise the way to control the prosthetic hand compared to the Electromyograph (EMG). The EMG has disadvantages to a person, who has not used the muscle for a long time and also to person with degenerative issues due to age factor. Thus, the EEG datasets found to be an alternative for EMG. The datasets used in this work were taken from Brain Computer Interface (BCI) Project. The datasets were already classified for open, close and combined movement operations. It served the purpose as an input to control the prosthetic hand by using an Interface system between Microsoft Visual Studio and Arduino. The obtained results reveal the prosthetic hand to be more efficient and faster in response to the EEG datasets with an additional LiPo (Lithium Polymer) battery attached to the prosthetic. Some limitations were also identified in terms of the hand movements, weight of the prosthetic, and the suggestions to improve were concluded in this paper. Overall, the objective of this paper were achieved when the prosthetic hand found to be feasible in operation utilizing the EEG datasets.

  6. Sparse Group Penalized Integrative Analysis of Multiple Cancer Prognosis Datasets

    Science.gov (United States)

    Liu, Jin; Huang, Jian; Xie, Yang; Ma, Shuangge

    2014-01-01

    SUMMARY In cancer research, high-throughput profiling studies have been extensively conducted, searching for markers associated with prognosis. Because of the “large d, small n” characteristic, results generated from the analysis of a single dataset can be unsatisfactory. Recent studies have shown that integrative analysis, which simultaneously analyzes multiple datasets, can be more effective than single-dataset analysis and classic meta-analysis. In most of existing integrative analysis, the homogeneity model has been assumed, which postulates that different datasets share the same set of markers. Several approaches have been designed to reinforce this assumption. In practice, different datasets may differ in terms of patient selection criteria, profiling techniques, and many other aspects. Such differences may make the homogeneity model too restricted. In this study, we assume the heterogeneity model, under which different datasets are allowed to have different sets of markers. With multiple cancer prognosis datasets, we adopt the AFT (accelerated failure time) model to describe survival. This model may have the lowest computational cost among popular semiparametric survival models. For marker selection, we adopt a sparse group MCP (minimax concave penalty) approach. This approach has an intuitive formulation and can be computed using an effective group coordinate descent algorithm. Simulation study shows that it outperforms the existing approaches under both the homogeneity and heterogeneity models. Data analysis further demonstrates the merit of heterogeneity model and proposed approach. PMID:23938111

  7. PROVIDING GEOGRAPHIC DATASETS AS LINKED DATA IN SDI

    Directory of Open Access Journals (Sweden)

    E. Hietanen

    2016-06-01

    Full Text Available In this study, a prototype service to provide data from Web Feature Service (WFS as linked data is implemented. At first, persistent and unique Uniform Resource Identifiers (URI are created to all spatial objects in the dataset. The objects are available from those URIs in Resource Description Framework (RDF data format. Next, a Web Ontology Language (OWL ontology is created to describe the dataset information content using the Open Geospatial Consortium’s (OGC GeoSPARQL vocabulary. The existing data model is modified in order to take into account the linked data principles. The implemented service produces an HTTP response dynamically. The data for the response is first fetched from existing WFS. Then the Geographic Markup Language (GML format output of the WFS is transformed on-the-fly to the RDF format. Content Negotiation is used to serve the data in different RDF serialization formats. This solution facilitates the use of a dataset in different applications without replicating the whole dataset. In addition, individual spatial objects in the dataset can be referred with URIs. Furthermore, the needed information content of the objects can be easily extracted from the RDF serializations available from those URIs. A solution for linking data objects to the dataset URI is also introduced by using the Vocabulary of Interlinked Datasets (VoID. The dataset is divided to the subsets and each subset is given its persistent and unique URI. This enables the whole dataset to be explored with a web browser and all individual objects to be indexed by search engines.

  8. Homogenised Australian climate datasets used for climate change monitoring

    International Nuclear Information System (INIS)

    Trewin, Blair; Jones, David; Collins; Dean; Jovanovic, Branislava; Braganza, Karl

    2007-01-01

    Full text: The Australian Bureau of Meteorology has developed a number of datasets for use in climate change monitoring. These datasets typically cover 50-200 stations distributed as evenly as possible over the Australian continent, and have been subject to detailed quality control and homogenisation.The time period over which data are available for each element is largely determined by the availability of data in digital form. Whilst nearly all Australian monthly and daily precipitation data have been digitised, a significant quantity of pre-1957 data (for temperature and evaporation) or pre-1987 data (for some other elements) remains to be digitised, and is not currently available for use in the climate change monitoring datasets. In the case of temperature and evaporation, the start date of the datasets is also determined by major changes in instruments or observing practices for which no adjustment is feasible at the present time. The datasets currently available cover: Monthly and daily precipitation (most stations commence 1915 or earlier, with many extending back to the late 19th century, and a few to the mid-19th century); Annual temperature (commences 1910); Daily temperature (commences 1910, with limited station coverage pre-1957); Twice-daily dewpoint/relative humidity (commences 1957); Monthly pan evaporation (commences 1970); Cloud amount (commences 1957) (Jovanovic etal. 2007). As well as the station-based datasets listed above, an additional dataset being developed for use in climate change monitoring (and other applications) covers tropical cyclones in the Australian region. This is described in more detail in Trewin (2007). The datasets already developed are used in analyses of observed climate change, which are available through the Australian Bureau of Meteorology website (http://www.bom.gov.au/silo/products/cli_chg/). They are also used as a basis for routine climate monitoring, and in the datasets used for the development of seasonal

  9. de las mujeres

    Directory of Open Access Journals (Sweden)

    Elena Margarita Cacheux Pulido

    2003-01-01

    Full Text Available El feminismo chicano tiene múltiples venas que nutrieron el pensamiento ideológico, político y estratégico durante todo el siglo pasado. Una de ellas se forma por los encuentros locales, nacionales e internacionales que sirvieron de plataforma para comunicar, entre mujeres, las propias necesidades para el desarrollo y la liberación de la injusticia, intolerancia y desdén. Las mujeres han luchado por medio de sindicatos y movimientos sociales para plantear justas demandas, entre otras, educación, igual salario por igual trabajo, bienestar, permiso de maternidad, cuidado infantil, autodeterminación, información sexual, igualdad en la participación política y liderazgo. Con el avance teórico en las cuestiones de raza, clase, minorías étnicas y feminismo lésbico, se desarrolló la identidad de la nueva chicana a la luz de la emancipación de las mujeres.

  10. Las marcas no tradicionales

    Directory of Open Access Journals (Sweden)

    Juan David Castro

    2012-11-01

    Full Text Available Las transformaciones en el comercio tanto nacional como internacionalmente han hecho necesario el aparecimiento de nuevas formas de manifestación de los signos destinados a diferenciar los productos y servicios en el mercado. El artículo reseña algunos de ellos.

  11. Tension in the recent Type Ia supernovae datasets

    International Nuclear Information System (INIS)

    Wei, Hao

    2010-01-01

    In the present work, we investigate the tension in the recent Type Ia supernovae (SNIa) datasets Constitution and Union. We show that they are in tension not only with the observations of the cosmic microwave background (CMB) anisotropy and the baryon acoustic oscillations (BAO), but also with other SNIa datasets such as Davis and SNLS. Then, we find the main sources responsible for the tension. Further, we make this more robust by employing the method of random truncation. Based on the results of this work, we suggest two truncated versions of the Union and Constitution datasets, namely the UnionT and ConstitutionT SNIa samples, whose behaviors are more regular.

  12. Background qualitative analysis of the European reference life cycle database (ELCD) energy datasets - part II: electricity datasets.

    Science.gov (United States)

    Garraín, Daniel; Fazio, Simone; de la Rúa, Cristina; Recchioni, Marco; Lechón, Yolanda; Mathieux, Fabrice

    2015-01-01

    The aim of this paper is to identify areas of potential improvement of the European Reference Life Cycle Database (ELCD) electricity datasets. The revision is based on the data quality indicators described by the International Life Cycle Data system (ILCD) Handbook, applied on sectorial basis. These indicators evaluate the technological, geographical and time-related representativeness of the dataset and the appropriateness in terms of completeness, precision and methodology. Results show that ELCD electricity datasets have a very good quality in general terms, nevertheless some findings and recommendations in order to improve the quality of Life-Cycle Inventories have been derived. Moreover, these results ensure the quality of the electricity-related datasets to any LCA practitioner, and provide insights related to the limitations and assumptions underlying in the datasets modelling. Giving this information, the LCA practitioner will be able to decide whether the use of the ELCD electricity datasets is appropriate based on the goal and scope of the analysis to be conducted. The methodological approach would be also useful for dataset developers and reviewers, in order to improve the overall Data Quality Requirements of databases.

  13. Dataset definition for CMS operations and physics analyses

    Science.gov (United States)

    Franzoni, Giovanni; Compact Muon Solenoid Collaboration

    2016-04-01

    Data recorded at the CMS experiment are funnelled into streams, integrated in the HLT menu, and further organised in a hierarchical structure of primary datasets and secondary datasets/dedicated skims. Datasets are defined according to the final-state particles reconstructed by the high level trigger, the data format and the use case (physics analysis, alignment and calibration, performance studies). During the first LHC run, new workflows have been added to this canonical scheme, to exploit at best the flexibility of the CMS trigger and data acquisition systems. The concepts of data parking and data scouting have been introduced to extend the physics reach of CMS, offering the opportunity of defining physics triggers with extremely loose selections (e.g. dijet resonance trigger collecting data at a 1 kHz). In this presentation, we review the evolution of the dataset definition during the LHC run I, and we discuss the plans for the run II.

  14. U.S. Climate Divisional Dataset (Version Superseded)

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — This data has been superseded by a newer version of the dataset. Please refer to NOAA's Climate Divisional Database for more information. The U.S. Climate Divisional...

  15. Karna Particle Size Dataset for Tables and Figures

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset contains 1) table of bulk Pb-XAS LCF results, 2) table of bulk As-XAS LCF results, 3) figure data of particle size distribution, and 4) figure data for...

  16. NOAA Global Surface Temperature Dataset, Version 4.0

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The NOAA Global Surface Temperature Dataset (NOAAGlobalTemp) is derived from two independent analyses: the Extended Reconstructed Sea Surface Temperature (ERSST)...

  17. National Hydrography Dataset (NHD) - USGS National Map Downloadable Data Collection

    Data.gov (United States)

    U.S. Geological Survey, Department of the Interior — The USGS National Hydrography Dataset (NHD) Downloadable Data Collection from The National Map (TNM) is a comprehensive set of digital spatial data that encodes...

  18. Watershed Boundary Dataset (WBD) - USGS National Map Downloadable Data Collection

    Data.gov (United States)

    U.S. Geological Survey, Department of the Interior — The Watershed Boundary Dataset (WBD) from The National Map (TNM) defines the perimeter of drainage areas formed by the terrain and other landscape characteristics....

  19. BASE MAP DATASET, LE FLORE COUNTY, OKLAHOMA, USA

    Data.gov (United States)

    Federal Emergency Management Agency, Department of Homeland Security — Basemap datasets comprise six of the seven FGDC themes of geospatial data that are used by most GIS applications (Note: the seventh framework theme, orthographic...

  20. USGS National Hydrography Dataset from The National Map

    Data.gov (United States)

    U.S. Geological Survey, Department of the Interior — USGS The National Map - National Hydrography Dataset (NHD) is a comprehensive set of digital spatial data that encodes information about naturally occurring and...

  1. A robust dataset-agnostic heart disease classifier from Phonocardiogram.

    Science.gov (United States)

    Banerjee, Rohan; Dutta Choudhury, Anirban; Deshpande, Parijat; Bhattacharya, Sakyajit; Pal, Arpan; Mandana, K M

    2017-07-01

    Automatic classification of normal and abnormal heart sounds is a popular area of research. However, building a robust algorithm unaffected by signal quality and patient demography is a challenge. In this paper we have analysed a wide list of Phonocardiogram (PCG) features in time and frequency domain along with morphological and statistical features to construct a robust and discriminative feature set for dataset-agnostic classification of normal and cardiac patients. The large and open access database, made available in Physionet 2016 challenge was used for feature selection, internal validation and creation of training models. A second dataset of 41 PCG segments, collected using our in-house smart phone based digital stethoscope from an Indian hospital was used for performance evaluation. Our proposed methodology yielded sensitivity and specificity scores of 0.76 and 0.75 respectively on the test dataset in classifying cardiovascular diseases. The methodology also outperformed three popular prior art approaches, when applied on the same dataset.

  2. AFSC/REFM: Seabird Necropsy dataset of North Pacific

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The seabird necropsy dataset contains information on seabird specimens that were collected under salvage and scientific collection permits primarily by...

  3. Dataset definition for CMS operations and physics analyses

    CERN Document Server

    AUTHOR|(CDS)2051291

    2016-01-01

    Data recorded at the CMS experiment are funnelled into streams, integrated in the HLT menu, and further organised in a hierarchical structure of primary datasets, secondary datasets, and dedicated skims. Datasets are defined according to the final-state particles reconstructed by the high level trigger, the data format and the use case (physics analysis, alignment and calibration, performance studies). During the first LHC run, new workflows have been added to this canonical scheme, to exploit at best the flexibility of the CMS trigger and data acquisition systems. The concept of data parking and data scouting have been introduced to extend the physics reach of CMS, offering the opportunity of defining physics triggers with extremely loose selections (e.g. dijet resonance trigger collecting data at a 1 kHz). In this presentation, we review the evolution of the dataset definition during the first run, and we discuss the plans for the second LHC run.

  4. USGS National Boundary Dataset (NBD) Downloadable Data Collection

    Data.gov (United States)

    U.S. Geological Survey, Department of the Interior — The USGS Governmental Unit Boundaries dataset from The National Map (TNM) represents major civil areas for the Nation, including States or Territories, counties (or...

  5. Environmental Dataset Gateway (EDG) CS-W Interface

    Data.gov (United States)

    U.S. Environmental Protection Agency — Use the Environmental Dataset Gateway (EDG) to find and access EPA's environmental resources. Many options are available for easily reusing EDG content in other...

  6. Global Man-made Impervious Surface (GMIS) Dataset From Landsat

    Data.gov (United States)

    National Aeronautics and Space Administration — The Global Man-made Impervious Surface (GMIS) Dataset From Landsat consists of global estimates of fractional impervious cover derived from the Global Land Survey...

  7. A Comparative Analysis of Classification Algorithms on Diverse Datasets

    Directory of Open Access Journals (Sweden)

    M. Alghobiri

    2018-04-01

    Full Text Available Data mining involves the computational process to find patterns from large data sets. Classification, one of the main domains of data mining, involves known structure generalizing to apply to a new dataset and predict its class. There are various classification algorithms being used to classify various data sets. They are based on different methods such as probability, decision tree, neural network, nearest neighbor, boolean and fuzzy logic, kernel-based etc. In this paper, we apply three diverse classification algorithms on ten datasets. The datasets have been selected based on their size and/or number and nature of attributes. Results have been discussed using some performance evaluation measures like precision, accuracy, F-measure, Kappa statistics, mean absolute error, relative absolute error, ROC Area etc. Comparative analysis has been carried out using the performance evaluation measures of accuracy, precision, and F-measure. We specify features and limitations of the classification algorithms for the diverse nature datasets.

  8. Newton SSANTA Dr Water using POU filters dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset contains information about all the features extracted from the raw data files, the formulas that were assigned to some of these features, and the...

  9. Estimating parameters for probabilistic linkage of privacy-preserved datasets.

    Science.gov (United States)

    Brown, Adrian P; Randall, Sean M; Ferrante, Anna M; Semmens, James B; Boyd, James H

    2017-07-10

    Probabilistic record linkage is a process used to bring together person-based records from within the same dataset (de-duplication) or from disparate datasets using pairwise comparisons and matching probabilities. The linkage strategy and associated match probabilities are often estimated through investigations into data quality and manual inspection. However, as privacy-preserved datasets comprise encrypted data, such methods are not possible. In this paper, we present a method for estimating the probabilities and threshold values for probabilistic privacy-preserved record linkage using Bloom filters. Our method was tested through a simulation study using synthetic data, followed by an application using real-world administrative data. Synthetic datasets were generated with error rates from zero to 20% error. Our method was used to estimate parameters (probabilities and thresholds) for de-duplication linkages. Linkage quality was determined by F-measure. Each dataset was privacy-preserved using separate Bloom filters for each field. Match probabilities were estimated using the expectation-maximisation (EM) algorithm on the privacy-preserved data. Threshold cut-off values were determined by an extension to the EM algorithm allowing linkage quality to be estimated for each possible threshold. De-duplication linkages of each privacy-preserved dataset were performed using both estimated and calculated probabilities. Linkage quality using the F-measure at the estimated threshold values was also compared to the highest F-measure. Three large administrative datasets were used to demonstrate the applicability of the probability and threshold estimation technique on real-world data. Linkage of the synthetic datasets using the estimated probabilities produced an F-measure that was comparable to the F-measure using calculated probabilities, even with up to 20% error. Linkage of the administrative datasets using estimated probabilities produced an F-measure that was higher

  10. Toward computational cumulative biology by combining models of biological datasets.

    Science.gov (United States)

    Faisal, Ali; Peltonen, Jaakko; Georgii, Elisabeth; Rung, Johan; Kaski, Samuel

    2014-01-01

    A main challenge of data-driven sciences is how to make maximal use of the progressively expanding databases of experimental datasets in order to keep research cumulative. We introduce the idea of a modeling-based dataset retrieval engine designed for relating a researcher's experimental dataset to earlier work in the field. The search is (i) data-driven to enable new findings, going beyond the state of the art of keyword searches in annotations, (ii) modeling-driven, to include both biological knowledge and insights learned from data, and (iii) scalable, as it is accomplished without building one unified grand model of all data. Assuming each dataset has been modeled beforehand, by the researchers or automatically by database managers, we apply a rapidly computable and optimizable combination model to decompose a new dataset into contributions from earlier relevant models. By using the data-driven decomposition, we identify a network of interrelated datasets from a large annotated human gene expression atlas. While tissue type and disease were major driving forces for determining relevant datasets, the found relationships were richer, and the model-based search was more accurate than the keyword search; moreover, it recovered biologically meaningful relationships that are not straightforwardly visible from annotations-for instance, between cells in different developmental stages such as thymocytes and T-cells. Data-driven links and citations matched to a large extent; the data-driven links even uncovered corrections to the publication data, as two of the most linked datasets were not highly cited and turned out to have wrong publication entries in the database.

  11. Testing the Neutral Theory of Biodiversity with Human Microbiome Datasets

    OpenAIRE

    Li, Lianwei; Ma, Zhanshan (Sam)

    2016-01-01

    The human microbiome project (HMP) has made it possible to test important ecological theories for arguably the most important ecosystem to human health?the human microbiome. Existing limited number of studies have reported conflicting evidence in the case of the neutral theory; the present study aims to comprehensively test the neutral theory with extensive HMP datasets covering all five major body sites inhabited by the human microbiome. Utilizing 7437 datasets of bacterial community samples...

  12. General Purpose Multimedia Dataset - GarageBand 2008

    DEFF Research Database (Denmark)

    Meng, Anders

    This document describes a general purpose multimedia data-set to be used in cross-media machine learning problems. In more detail we describe the genre taxonomy applied at http://www.garageband.com, from where the data-set was collected, and how the taxonomy have been fused into a more human...... understandable taxonomy. Finally, a description of various features extracted from both the audio and text are presented....

  13. Artificial intelligence (AI) systems for interpreting complex medical datasets.

    Science.gov (United States)

    Altman, R B

    2017-05-01

    Advances in machine intelligence have created powerful capabilities in algorithms that find hidden patterns in data, classify objects based on their measured characteristics, and associate similar patients/diseases/drugs based on common features. However, artificial intelligence (AI) applications in medical data have several technical challenges: complex and heterogeneous datasets, noisy medical datasets, and explaining their output to users. There are also social challenges related to intellectual property, data provenance, regulatory issues, economics, and liability. © 2017 ASCPT.

  14. Heuristics for Relevancy Ranking of Earth Dataset Search Results

    Science.gov (United States)

    Lynnes, Christopher; Quinn, Patrick; Norton, James

    2016-01-01

    As the Variety of Earth science datasets increases, science researchers find it more challenging to discover and select the datasets that best fit their needs. The most common way of search providers to address this problem is to rank the datasets returned for a query by their likely relevance to the user. Large web page search engines typically use text matching supplemented with reverse link counts, semantic annotations and user intent modeling. However, this produces uneven results when applied to dataset metadata records simply externalized as a web page. Fortunately, data and search provides have decades of experience in serving data user communities, allowing them to form heuristics that leverage the structure in the metadata together with knowledge about the user community. Some of these heuristics include specific ways of matching the user input to the essential measurements in the dataset and determining overlaps of time range and spatial areas. Heuristics based on the novelty of the datasets can prioritize later, better versions of data over similar predecessors. And knowledge of how different user types and communities use data can be brought to bear in cases where characteristics of the user (discipline, expertise) or their intent (applications, research) can be divined. The Earth Observing System Data and Information System has begun implementing some of these heuristics in the relevancy algorithm of its Common Metadata Repository search engine.

  15. Historia de las Universidades.

    Directory of Open Access Journals (Sweden)

    Jaime Escobar Triana

    1999-04-01

    ínicos”

    La civilización medieval produjo los arquetipos y el impulso dinámico para las fuerzas principales del mundo moderno. Representa el período de gestación de la cultura moderna. La ciencia occidental se desarrolló en las redes de la erudición medieval que en el siglo XII eran las escuelas catedralicias. La catedral de Chartres, simboliza, históricamente los comienzos de nuestra era científica y tecnológica.

    En la escuela de Chartres se establecieron las bases filosóficas para el surgimiento de la ciencia medieval y la moderna. El estudio de la naturaleza se constituyó en disciplina por derecho propio. En Chartres, durante el siglo XII, el estudio de la ciencia obtuvo por primera vez una prioridad definitiva sobre la enseñanza de las artes liberales y los maestros lucharon por la instauración de osadas reformas en la educación superior general centrando el programa de estudios de ciencias naturales en el cuadrivio: aritmética, música (como matemáticas geometría y astronomía y no en las humanidades tradicionales del trivio: gramática, retórica y lógica.

    La escuela de Chartres desafiaba así a los 7 siglos de enseñanzas cristianas a cerca del lugar de la naturaleza en el esquema divino, contra todas las resistencias de las grandes escuelas catedralicias de Orleans, Saint Victor en París y Laon. El ser humano y el mundo eran una unidad en el pensamiento de Chartres; trivio y cuadrivio eran parte de un universo único. Pedro Abelardo En sus escritos (siglo XII se hallan las raíces históricas de la técnica, del método con el que las grandes escuelas universitarias del S XIII construyeron, organizaron y expresaron sus doctrinas, las síntesis teológicas más complejas y más completas de la edad media. Abelardo en la Escuela de Santa Genoveva en París y Cátedra de Notre-Dame, Busca huir de los condicionamientos de las estructuras culturales cerradas e inmóviles y de las rígidas concepciones tradicionales para abrirse a una vía de investigaci

  16. Historia de las Hormonas

    Directory of Open Access Journals (Sweden)

    Alfredo Jácome-Roca

    2009-01-01

    Full Text Available

    El historiador Garrison afirma en su libro Introducción a la Historia de la Medicina que… “la nueva ciencia de la endocrinología, aunque arraigada en el pasado prehistórico, es virtualmente una creación del siglo XX…”

    Eso es precisamente lo que se quiere mostrar en este libro: las raíces hundidas en el pasado y los sorprendentes hallazgos de la modernidad de la centuria precedente, con algunos hechos importantes acaecidos en el siglo XIX.

    Los primeros datos conocidos comenzaron por las observaciones sobre animales y humanos castrados, sobre gigantes y enanos, diabéticos, cotudos y cretinos, y luego sobre las descripciones incipientes de los órganos endocrinos más grandes como el cuerpo tiroides, las cápsulas suprarrenales, testículos, ovarios y la glándula pituitaria. Personajes que podrían perfectamente provenir del país de Lilliput o de un circo del vecindario, tal era su extraña apariencia.

    Una eventual relación entre los casos clínicos mencionados y las glándulas se ignoraba casi totalmente, casi porque sí se sabía lo que ocurría a un animal o a un hombre castrado, antes y después de la pubertad. Por eso nos detendremos en algunas anécdotas (imposible la historia completa de los eunucos (y por espacio de dos siglos, particularmente en Italia, de los hombres-soprano denominados los castrati.

    Persistía la incógnita sobre la función de ciertos órganos o tejidos –llamados por siglos glándulas sin conducto (incluídos órganos hematopoyéticos como el timo, el bazo o los ganglios linfáticos: una pituitaria productora de moco, que drenaba en las fosas nasales el moco excesivo que se producía en los catarros; o una tiroides que lubricaba la laringe, unas cápsulas suprarrenales que sostenían los riñones, unos ovarios que eran testículos femeninos, o unos testículos que –esos sí- regaban semilla en el útero de las mujeres. Y tambi

  17. Las organizaciones complejas

    Directory of Open Access Journals (Sweden)

    Julio Mario Rodríguez Devis

    2002-05-01

    Full Text Available El aproximarse a las organizaciones con la visión de la complejidad permite observar sus múltiples interrelaciones, la relación orden-desorden-relaciones-retroacciones que se desarrollan en su interior, la íntima influencia bi-direccional entre el entorno y la organización, la emergencia y la autoorganizacion, la importancia de la información-ruido; en fin, observar a la organización como un sistema dinámico, no lineal, abierto y complejo. En este artículo se presentan los fundamentos de la complejidad en las organizaciones, y se describen los principales elementos que la constituyen.

  18. Humanismo en las universidades

    Directory of Open Access Journals (Sweden)

    Carlos Arturo Caparroso

    1963-08-01

    Full Text Available Algunas universidades del país, con encomiable acierto, han establecido, en los programas de sus ciclos de enseñanza profesional, el estudio de asignaturas de cultura general. Y para prestigiarlas, las han rotulado con una denominación ciertamente esclarecida y grata, término de rancia prosapia, de insigne lastre histórico: humanidades.

  19. Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Metadata, Usage Metrics, and User Feedback to Improve Data Discovery and Access

    Data.gov (United States)

    National Aeronautics and Space Administration — We propose to mine and utilize the combination of Earth Science dataset, metadata with usage metrics and user feedback to objectively extract relevance for improved...

  20. Derrida en las pampas

    Directory of Open Access Journals (Sweden)

    Analía Gerbaudo

    2011-07-01

    Full Text Available http://dx.doi.org/10.5007/1984-784X.2011v11n16p16 En el año 2003 Mariano Ben Plotkin publica Freud en las pampas. Orígenes y desarrollo de una cultura psicoanalítica en la Argentina (1910-1983: un estudio minucioso de la recepción y difusión del psicoanálisis que recupero tanto en sus aspectos metodológicos como en sus conjeturas sobre la relación que los intelectuales argentinos suelen entablar con el pensamiento europeo (tesis que se refuerzan si se las lee en conjunción con las desarrolladas respecto de lo que ha acontecido en el campo de los estudios literarios en nuestro país entre la segunda mitad del siglo XX hasta la fecha –cf. Gerbaudo, 2006a, 2007a, 2007b-. Plotkin pone la descripción de la masiva difusión del psicoanálisis en Argentina en el corte temporal seleccionado al servicio del estudio de los factores culturales, sociales y políticos que hicieron lugar a esa acogida.

  1. Las juventudes falangistas

    Directory of Open Access Journals (Sweden)

    Ricardo L. CHUECA RODRÍGUEZ

    2010-03-01

    Full Text Available RESUMEN: Una de las servidumbres de cualquier intervención como la que me propongo realizar aquí es la de los prolegómenos, siempre inevitables. Cuando inicié la preparación de estas líneas pronto caí en la cuenta de que poco se podía entender sin clarificar una serie de cuestiones que yo creía previas. Al final no resultaron ser previas sino más bien contextúales. En todo caso necesarias. Por eso en primer lugar nos detendremos en intentar desentrañar las relaciones entre juventud y fascismo. Veremos después algunos datos contextúales de Falange como partido único en el régimen de Franco, y terminaremos analizando las causas del fracaso de la organización juvenil falangista como aparato del Partido, dirigido a obtener tanto una reprodución política interna como su implantación en la sociedad civil.

  2. Las telenovelas juveniles mexicanas y las adolescentas obesas

    Directory of Open Access Journals (Sweden)

    Tania Meza

    2006-01-01

    Full Text Available Esta investigación analiza la opresión por cuerpo (obesidad a la que las mujeres son sometidas dentro del sistema patriarcal durante la adolescencia, específicamente a través de la representación televisiva que de las jóvenes gordas se hace en la telenovela juvenil mexicana. Los enormes niveles de audiencia que poseen las telenovelas en nuestro país hace indispensable, para los estudios de género desde las ciencias dela comunicación, estudiar el papel de las mujeres en dichas teleseries. En este análisis se pretende mostrar la triple marginación a la que son sometidas las adolescentes obesas en las telenovelas juveniles mexicanas: por ser mujeres, por ser jóvenes y por ser gordas.

  3. EEG datasets for motor imagery brain-computer interface.

    Science.gov (United States)

    Cho, Hohyun; Ahn, Minkyu; Ahn, Sangtae; Kwon, Moonyoung; Jun, Sung Chan

    2017-07-01

    Most investigators of brain-computer interface (BCI) research believe that BCI can be achieved through induced neuronal activity from the cortex, but not by evoked neuronal activity. Motor imagery (MI)-based BCI is one of the standard concepts of BCI, in that the user can generate induced activity by imagining motor movements. However, variations in performance over sessions and subjects are too severe to overcome easily; therefore, a basic understanding and investigation of BCI performance variation is necessary to find critical evidence of performance variation. Here we present not only EEG datasets for MI BCI from 52 subjects, but also the results of a psychological and physiological questionnaire, EMG datasets, the locations of 3D EEG electrodes, and EEGs for non-task-related states. We validated our EEG datasets by using the percentage of bad trials, event-related desynchronization/synchronization (ERD/ERS) analysis, and classification analysis. After conventional rejection of bad trials, we showed contralateral ERD and ipsilateral ERS in the somatosensory area, which are well-known patterns of MI. Finally, we showed that 73.08% of datasets (38 subjects) included reasonably discriminative information. Our EEG datasets included the information necessary to determine statistical significance; they consisted of well-discriminated datasets (38 subjects) and less-discriminative datasets. These may provide researchers with opportunities to investigate human factors related to MI BCI performance variation, and may also achieve subject-to-subject transfer by using metadata, including a questionnaire, EEG coordinates, and EEGs for non-task-related states. © The Authors 2017. Published by Oxford University Press.

  4. Comparison of CORA and EN4 in-situ datasets validation methods, toward a better quality merged dataset.

    Science.gov (United States)

    Szekely, Tanguy; Killick, Rachel; Gourrion, Jerome; Reverdin, Gilles

    2017-04-01

    CORA and EN4 are both global delayed time mode validated in-situ ocean temperature and salinity datasets distributed by the Met Office (http://www.metoffice.gov.uk/) and Copernicus (www.marine.copernicus.eu). A large part of the profiles distributed by CORA and EN4 in recent years are Argo profiles from the ARGO DAC, but profiles are also extracted from the World Ocean Database and TESAC profiles from GTSPP. In the case of CORA, data coming from the EUROGOOS Regional operationnal oserving system( ROOS) operated by European institutes no managed by National Data Centres and other datasets of profiles povided by scientific sources can also be found (Sea mammals profiles from MEOP, XBT datasets from cruises ...). (EN4 also takes data from the ASBO dataset to supplement observations in the Arctic). First advantage of this new merge product is to enhance the space and time coverage at global and european scales for the period covering 1950 till a year before the current year. This product is updated once a year and T&S gridded fields are alos generated for the period 1990-year n-1. The enhancement compared to the revious CORA product will be presented Despite the fact that the profiles distributed by both datasets are mostly the same, the quality control procedures developed by the Met Office and Copernicus teams differ, sometimes leading to different quality control flags for the same profile. Started in 2016 a new study started that aims to compare both validation procedures to move towards a Copernicus Marine Service dataset with the best features of CORA and EN4 validation.A reference data set composed of the full set of in-situ temperature and salinity measurements collected by Coriolis during 2015 is used. These measurements have been made thanks to wide range of instruments (XBTs, CTDs, Argo floats, Instrumented sea mammals,...), covering the global ocean. The reference dataset has been validated simultaneously by both teams.An exhaustive comparison of the

  5. Wind and wave dataset for Matara, Sri Lanka

    Science.gov (United States)

    Luo, Yao; Wang, Dongxiao; Priyadarshana Gamage, Tilak; Zhou, Fenghua; Madusanka Widanage, Charith; Liu, Taiwei

    2018-01-01

    We present a continuous in situ hydro-meteorology observational dataset from a set of instruments first deployed in December 2012 in the south of Sri Lanka, facing toward the north Indian Ocean. In these waters, simultaneous records of wind and wave data are sparse due to difficulties in deploying measurement instruments, although the area hosts one of the busiest shipping lanes in the world. This study describes the survey, deployment, and measurements of wind and waves, with the aim of offering future users of the dataset the most comprehensive and as much information as possible. This dataset advances our understanding of the nearshore hydrodynamic processes and wave climate, including sea waves and swells, in the north Indian Ocean. Moreover, it is a valuable resource for ocean model parameterization and validation. The archived dataset (Table 1) is examined in detail, including wave data at two locations with water depths of 20 and 10 m comprising synchronous time series of wind, ocean astronomical tide, air pressure, etc. In addition, we use these wave observations to evaluate the ERA-Interim reanalysis product. Based on Buoy 2 data, the swells are the main component of waves year-round, although monsoons can markedly alter the proportion between swell and wind sea. The dataset (Luo et al., 2017) is publicly available from Science Data Bank (https://doi.org/10.11922/sciencedb.447).

  6. The LANDFIRE Refresh strategy: updating the national dataset

    Science.gov (United States)

    Nelson, Kurtis J.; Connot, Joel A.; Peterson, Birgit E.; Martin, Charley

    2013-01-01

    The LANDFIRE Program provides comprehensive vegetation and fuel datasets for the entire United States. As with many large-scale ecological datasets, vegetation and landscape conditions must be updated periodically to account for disturbances, growth, and natural succession. The LANDFIRE Refresh effort was the first attempt to consistently update these products nationwide. It incorporated a combination of specific systematic improvements to the original LANDFIRE National data, remote sensing based disturbance detection methods, field collected disturbance information, vegetation growth and succession modeling, and vegetation transition processes. This resulted in the creation of two complete datasets for all 50 states: LANDFIRE Refresh 2001, which includes the systematic improvements, and LANDFIRE Refresh 2008, which includes the disturbance and succession updates to the vegetation and fuel data. The new datasets are comparable for studying landscape changes in vegetation type and structure over a decadal period, and provide the most recent characterization of fuel conditions across the country. The applicability of the new layers is discussed and the effects of using the new fuel datasets are demonstrated through a fire behavior modeling exercise using the 2011 Wallow Fire in eastern Arizona as an example.

  7. Interactive visualization and analysis of multimodal datasets for surgical applications.

    Science.gov (United States)

    Kirmizibayrak, Can; Yim, Yeny; Wakid, Mike; Hahn, James

    2012-12-01

    Surgeons use information from multiple sources when making surgical decisions. These include volumetric datasets (such as CT, PET, MRI, and their variants), 2D datasets (such as endoscopic videos), and vector-valued datasets (such as computer simulations). Presenting all the information to the user in an effective manner is a challenging problem. In this paper, we present a visualization approach that displays the information from various sources in a single coherent view. The system allows the user to explore and manipulate volumetric datasets, display analysis of dataset values in local regions, combine 2D and 3D imaging modalities and display results of vector-based computer simulations. Several interaction methods are discussed: in addition to traditional interfaces including mouse and trackers, gesture-based natural interaction methods are shown to control these visualizations with real-time performance. An example of a medical application (medialization laryngoplasty) is presented to demonstrate how the combination of different modalities can be used in a surgical setting with our approach.

  8. Wind and wave dataset for Matara, Sri Lanka

    Directory of Open Access Journals (Sweden)

    Y. Luo

    2018-01-01

    Full Text Available We present a continuous in situ hydro-meteorology observational dataset from a set of instruments first deployed in December 2012 in the south of Sri Lanka, facing toward the north Indian Ocean. In these waters, simultaneous records of wind and wave data are sparse due to difficulties in deploying measurement instruments, although the area hosts one of the busiest shipping lanes in the world. This study describes the survey, deployment, and measurements of wind and waves, with the aim of offering future users of the dataset the most comprehensive and as much information as possible. This dataset advances our understanding of the nearshore hydrodynamic processes and wave climate, including sea waves and swells, in the north Indian Ocean. Moreover, it is a valuable resource for ocean model parameterization and validation. The archived dataset (Table 1 is examined in detail, including wave data at two locations with water depths of 20 and 10 m comprising synchronous time series of wind, ocean astronomical tide, air pressure, etc. In addition, we use these wave observations to evaluate the ERA-Interim reanalysis product. Based on Buoy 2 data, the swells are the main component of waves year-round, although monsoons can markedly alter the proportion between swell and wind sea. The dataset (Luo et al., 2017 is publicly available from Science Data Bank (https://doi.org/10.11922/sciencedb.447.

  9. Process mining in oncology using the MIMIC-III dataset

    Science.gov (United States)

    Prima Kurniati, Angelina; Hall, Geoff; Hogg, David; Johnson, Owen

    2018-03-01

    Process mining is a data analytics approach to discover and analyse process models based on the real activities captured in information systems. There is a growing body of literature on process mining in healthcare, including oncology, the study of cancer. In earlier work we found 37 peer-reviewed papers describing process mining research in oncology with a regular complaint being the limited availability and accessibility of datasets with suitable information for process mining. Publicly available datasets are one option and this paper describes the potential to use MIMIC-III, for process mining in oncology. MIMIC-III is a large open access dataset of de-identified patient records. There are 134 publications listed as using the MIMIC dataset, but none of them have used process mining. The MIMIC-III dataset has 16 event tables which are potentially useful for process mining and this paper demonstrates the opportunities to use MIMIC-III for process mining in oncology. Our research applied the L* lifecycle method to provide a worked example showing how process mining can be used to analyse cancer pathways. The results and data quality limitations are discussed along with opportunities for further work and reflection on the value of MIMIC-III for reproducible process mining research.

  10. Las marcas propias en Colombia

    Directory of Open Access Journals (Sweden)

    Carlos Felipe Payán Rodríguez

    2013-11-01

    Full Text Available El artículo analiza la situación actual de las marcas propias en Colombia, teniendo en cuenta su creciente auge al interior de las grandes superficies. Mediante este tipo de marcas el mismo distribuidor vende productos y servicios con su nombre o recurriendo a una marca de su propiedad, compitiendo con las marcas ya posicionadas en el mercado de los fabricantes. El texto se articula, primero, en una contextualización con base en doctrina comparada respecto de las marcas propias. Luego, se aplican las reglas de dichos conceptos doctrinales a la situación actual colombiana. Por último, se plantean dos problemas jurídicos respecto de las marcas propias, uno, analizado desde la propiedad industrial, y el otro, desde las reglas del Estatuto del Consumidor.

  11. Las mujeres lacandonas: cambios recientes

    Directory of Open Access Journals (Sweden)

    Lucie Nečasová

    2010-08-01

    Full Text Available El artículo se enfoca en los cambios en la vida de las mujeres lacandonas contemporáneas. Analiza cómo se han modificado el modo de la vida, las costumbres y las relaciones de la mujer dentro de las propias familias pero también dentro de la sociedad y la comunidad. El objetivo es mostrar los cambios reflejados en la vida de las mujeres de tres generaciones. El estudio está basado en las etnografías disponibles combinadas con el propio trabajo de campo realizado en las comunidades Lacanjá y Nahá en los años 2008 y 2009.

  12. Las narcodemocracias andinas

    Directory of Open Access Journals (Sweden)

    Olivier DABÈNE

    2009-11-01

    Full Text Available RESUMEN: Los países de América Latina están afectados en diferentes aspectos por el tráfico de drogas. En especial, en los países andinos, esta actividad ha condicionado de manera decisiva el tipo de régimen democrático iniciado en algún caso en la década de 1980, como Perú o Bolivia, o el régimen colombiano. El autor califica a estos regímenes como narcodemocracias por los efectos del tráfico de drogas no sólo en el orden social y económico, que analiza de forma profunda, sino por la politización del narcotráfico. Las respuestas de las democracias han sido bien distintas en los tres casos analizados, afectando, sin embargo, en todos ellos, a la gobernabilidad de las mismas y a su legitimidad.ABSTRACT: Latin American countries are affected by narcotrafico in different aspects. Particularly, in Andean countries, this activity has deeply conditioned the type of democratic regime installed in some cases in the eighties, as Perou and Bolivia, or the colombian regimen. The author characterizes these regimes as narcodemocracias by effects of narcotrafico in them, not only in social and economic arenas, but also by the politization of narcotrafico. Responses of democracies have been very different in these cases analyzed in the article. Nevertheless, in all of them, governability and legitmity of democracy are affected.

  13. Las actividades de lucha

    OpenAIRE

    Villamón Herrera, Miguel; Gutiérrez García, Carlos; Espartero Casado, Julián

    2003-01-01

    La lucha, como práctica lúdica y agonística está vinculada a la historia de todos los pueblos y civilizaciones. Su origen se remonta a los albores de la humanidad, por la necesidad de defender la propia vida y la integridad física frente a situaciones de peligro para la supervivencia. Así, para someter violentamente al adversario, a lo largo de la historia y en las diversas civilizaciones, se desarrollaron en cada región del mundo distintas técnicas de combate que, en unos casos, utilizaba...

  14. Recent Development on the NOAA's Global Surface Temperature Dataset

    Science.gov (United States)

    Zhang, H. M.; Huang, B.; Boyer, T.; Lawrimore, J. H.; Menne, M. J.; Rennie, J.

    2016-12-01

    Global Surface Temperature (GST) is one of the most widely used indicators for climate trend and extreme analyses. A widely used GST dataset is the NOAA merged land-ocean surface temperature dataset known as NOAAGlobalTemp (formerly MLOST). The NOAAGlobalTemp had recently been updated from version 3.5.4 to version 4. The update includes a significant improvement in the ocean surface component (Extended Reconstructed Sea Surface Temperature or ERSST, from version 3b to version 4) which resulted in an increased temperature trends in recent decades. Since then, advancements in both the ocean component (ERSST) and land component (GHCN-Monthly) have been made, including the inclusion of Argo float SSTs and expanded EOT modes in ERSST, and the use of ISTI databank in GHCN-Monthly. In this presentation, we describe the impact of those improvements on the merged global temperature dataset, in terms of global trends and other aspects.

  15. Synthetic ALSPAC longitudinal datasets for the Big Data VR project.

    Science.gov (United States)

    Avraam, Demetris; Wilson, Rebecca C; Burton, Paul

    2017-01-01

    Three synthetic datasets - of observation size 15,000, 155,000 and 1,555,000 participants, respectively - were created by simulating eleven cardiac and anthropometric variables from nine collection ages of the ALSAPC birth cohort study. The synthetic datasets retain similar data properties to the ALSPAC study data they are simulated from (co-variance matrices, as well as the mean and variance values of the variables) without including the original data itself or disclosing participant information.  In this instance, the three synthetic datasets have been utilised in an academia-industry collaboration to build a prototype virtual reality data analysis software, but they could have a broader use in method and software development projects where sensitive data cannot be freely shared.

  16. The OXL format for the exchange of integrated datasets

    Directory of Open Access Journals (Sweden)

    Taubert Jan

    2007-12-01

    Full Text Available A prerequisite for systems biology is the integration and analysis of heterogeneous experimental data stored in hundreds of life-science databases and millions of scientific publications. Several standardised formats for the exchange of specific kinds of biological information exist. Such exchange languages facilitate the integration process; however they are not designed to transport integrated datasets. A format for exchanging integrated datasets needs to i cover data from a broad range of application domains, ii be flexible and extensible to combine many different complex data structures, iii include metadata and semantic definitions, iv include inferred information, v identify the original data source for integrated entities and vi transport large integrated datasets. Unfortunately, none of the exchange formats from the biological domain (e.g. BioPAX, MAGE-ML, PSI-MI, SBML or the generic approaches (RDF, OWL fulfil these requirements in a systematic way.

  17. Dataset of transcriptional landscape of B cell early activation

    Directory of Open Access Journals (Sweden)

    Alexander S. Garruss

    2015-09-01

    Full Text Available Signaling via B cell receptors (BCR and Toll-like receptors (TLRs result in activation of B cells with distinct physiological outcomes, but transcriptional regulatory mechanisms that drive activation and distinguish these pathways remain unknown. At early time points after BCR and TLR ligand exposure, 0.5 and 2 h, RNA-seq was performed allowing observations on rapid transcriptional changes. At 2 h, ChIP-seq was performed to allow observations on important regulatory mechanisms potentially driving transcriptional change. The dataset includes RNA-seq, ChIP-seq of control (Input, RNA Pol II, H3K4me3, H3K27me3, and a separate RNA-seq for miRNA expression, which can be found at Gene Expression Omnibus Dataset GSE61608. Here, we provide details on the experimental and analysis methods used to obtain and analyze this dataset and to examine the transcriptional landscape of B cell early activation.

  18. The Global Precipitation Climatology Project (GPCP) Combined Precipitation Dataset

    Science.gov (United States)

    Huffman, George J.; Adler, Robert F.; Arkin, Philip; Chang, Alfred; Ferraro, Ralph; Gruber, Arnold; Janowiak, John; McNab, Alan; Rudolf, Bruno; Schneider, Udo

    1997-01-01

    The Global Precipitation Climatology Project (GPCP) has released the GPCP Version 1 Combined Precipitation Data Set, a global, monthly precipitation dataset covering the period July 1987 through December 1995. The primary product in the dataset is a merged analysis incorporating precipitation estimates from low-orbit-satellite microwave data, geosynchronous-orbit -satellite infrared data, and rain gauge observations. The dataset also contains the individual input fields, a combination of the microwave and infrared satellite estimates, and error estimates for each field. The data are provided on 2.5 deg x 2.5 deg latitude-longitude global grids. Preliminary analyses show general agreement with prior studies of global precipitation and extends prior studies of El Nino-Southern Oscillation precipitation patterns. At the regional scale there are systematic differences with standard climatologies.

  19. A high-resolution European dataset for hydrologic modeling

    Science.gov (United States)

    Ntegeka, Victor; Salamon, Peter; Gomes, Goncalo; Sint, Hadewij; Lorini, Valerio; Thielen, Jutta

    2013-04-01

    There is an increasing demand for large scale hydrological models not only in the field of modeling the impact of climate change on water resources but also for disaster risk assessments and flood or drought early warning systems. These large scale models need to be calibrated and verified against large amounts of observations in order to judge their capabilities to predict the future. However, the creation of large scale datasets is challenging for it requires collection, harmonization, and quality checking of large amounts of observations. For this reason, only a limited number of such datasets exist. In this work, we present a pan European, high-resolution gridded dataset of meteorological observations (EFAS-Meteo) which was designed with the aim to drive a large scale hydrological model. Similar European and global gridded datasets already exist, such as the HadGHCND (Caesar et al., 2006), the JRC MARS-STAT database (van der Goot and Orlandi, 2003) and the E-OBS gridded dataset (Haylock et al., 2008). However, none of those provide similarly high spatial resolution and/or a complete set of variables to force a hydrologic model. EFAS-Meteo contains daily maps of precipitation, surface temperature (mean, minimum and maximum), wind speed and vapour pressure at a spatial grid resolution of 5 x 5 km for the time period 1 January 1990 - 31 December 2011. It furthermore contains calculated radiation, which is calculated by using a staggered approach depending on the availability of sunshine duration, cloud cover and minimum and maximum temperature, and evapotranspiration (potential evapotranspiration, bare soil and open water evapotranspiration). The potential evapotranspiration was calculated using the Penman-Monteith equation with the above-mentioned meteorological variables. The dataset was created as part of the development of the European Flood Awareness System (EFAS) and has been continuously updated throughout the last years. The dataset variables are used as

  20. Visualization of conserved structures by fusing highly variable datasets.

    Science.gov (United States)

    Silverstein, Jonathan C; Chhadia, Ankur; Dech, Fred

    2002-01-01

    Skill, effort, and time are required to identify and visualize anatomic structures in three-dimensions from radiological data. Fundamentally, automating these processes requires a technique that uses symbolic information not in the dynamic range of the voxel data. We were developing such a technique based on mutual information for automatic multi-modality image fusion (MIAMI Fuse, University of Michigan). This system previously demonstrated facility at fusing one voxel dataset with integrated symbolic structure information to a CT dataset (different scale and resolution) from the same person. The next step of development of our technique was aimed at accommodating the variability of anatomy from patient to patient by using warping to fuse our standard dataset to arbitrary patient CT datasets. A standard symbolic information dataset was created from the full color Visible Human Female by segmenting the liver parenchyma, portal veins, and hepatic veins and overwriting each set of voxels with a fixed color. Two arbitrarily selected patient CT scans of the abdomen were used for reference datasets. We used the warping functions in MIAMI Fuse to align the standard structure data to each patient scan. The key to successful fusion was the focused use of multiple warping control points that place themselves around the structure of interest automatically. The user assigns only a few initial control points to align the scans. Fusion 1 and 2 transformed the atlas with 27 points around the liver to CT1 and CT2 respectively. Fusion 3 transformed the atlas with 45 control points around the liver to CT1 and Fusion 4 transformed the atlas with 5 control points around the portal vein. The CT dataset is augmented with the transformed standard structure dataset, such that the warped structure masks are visualized in combination with the original patient dataset. This combined volume visualization is then rendered interactively in stereo on the ImmersaDesk in an immersive Virtual

  1. A cross-country Exchange Market Pressure (EMP) dataset.

    Science.gov (United States)

    Desai, Mohit; Patnaik, Ila; Felman, Joshua; Shah, Ajay

    2017-06-01

    The data presented in this article are related to the research article titled - "An exchange market pressure measure for cross country analysis" (Patnaik et al. [1]). In this article, we present the dataset for Exchange Market Pressure values (EMP) for 139 countries along with their conversion factors, ρ (rho). Exchange Market Pressure, expressed in percentage change in exchange rate, measures the change in exchange rate that would have taken place had the central bank not intervened. The conversion factor ρ can interpreted as the change in exchange rate associated with $1 billion of intervention. Estimates of conversion factor ρ allow us to calculate a monthly time series of EMP for 139 countries. Additionally, the dataset contains the 68% confidence interval (high and low values) for the point estimates of ρ 's. Using the standard errors of estimates of ρ 's, we obtain one sigma intervals around mean estimates of EMP values. These values are also reported in the dataset.

  2. Dataset of herbarium specimens of threatened vascular plants in Catalonia.

    Science.gov (United States)

    Nualart, Neus; Ibáñez, Neus; Luque, Pere; Pedrol, Joan; Vilar, Lluís; Guàrdia, Roser

    2017-01-01

    This data paper describes a specimens' dataset of the Catalonian threatened vascular plants conserved in five public Catalonian herbaria (BC, BCN, HGI, HBIL and MTTE). Catalonia is an administrative region of Spain that includes large autochthon plants diversity and 199 taxa with IUCN threatened categories (EX, EW, RE, CR, EN and VU). This dataset includes 1,618 records collected from 17 th century to nowadays. For each specimen, the species name, locality indication, collection date, collector, ecology and revision label are recorded. More than 94% of the taxa are represented in the herbaria, which evidence the paper of the botanical collections as an essential source of occurrence data.

  3. A Large-Scale 3D Object Recognition dataset

    DEFF Research Database (Denmark)

    Sølund, Thomas; Glent Buch, Anders; Krüger, Norbert

    2016-01-01

    geometric groups; concave, convex, cylindrical and flat 3D object models. The object models have varying amount of local geometric features to challenge existing local shape feature descriptors in terms of descriptiveness and robustness. The dataset is validated in a benchmark which evaluates the matching...... performance of 7 different state-of-the-art local shape descriptors. Further, we validate the dataset in a 3D object recognition pipeline. Our benchmark shows as expected that local shape feature descriptors without any global point relation across the surface have a poor matching performance with flat...

  4. Traffic sign classification with dataset augmentation and convolutional neural network

    Science.gov (United States)

    Tang, Qing; Kurnianggoro, Laksono; Jo, Kang-Hyun

    2018-04-01

    This paper presents a method for traffic sign classification using a convolutional neural network (CNN). In this method, firstly we transfer a color image into grayscale, and then normalize it in the range (-1,1) as the preprocessing step. To increase robustness of classification model, we apply a dataset augmentation algorithm and create new images to train the model. To avoid overfitting, we utilize a dropout module before the last fully connection layer. To assess the performance of the proposed method, the German traffic sign recognition benchmark (GTSRB) dataset is utilized. Experimental results show that the method is effective in classifying traffic signs.

  5. Towards interoperable and reproducible QSAR analyses: Exchange of datasets.

    Science.gov (United States)

    Spjuth, Ola; Willighagen, Egon L; Guha, Rajarshi; Eklund, Martin; Wikberg, Jarl Es

    2010-06-30

    QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML) which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. This makes is easy to join, extend, combine datasets and hence work collectively, but

  6. Towards interoperable and reproducible QSAR analyses: Exchange of datasets

    Directory of Open Access Journals (Sweden)

    Spjuth Ola

    2010-06-01

    Full Text Available Abstract Background QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. Results We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Conclusions Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. This makes is easy to join

  7. The Wind Integration National Dataset (WIND) toolkit (Presentation)

    Energy Technology Data Exchange (ETDEWEB)

    Caroline Draxl: NREL

    2014-01-01

    Regional wind integration studies require detailed wind power output data at many locations to perform simulations of how the power system will operate under high penetration scenarios. The wind datasets that serve as inputs into the study must realistically reflect the ramping characteristics, spatial and temporal correlations, and capacity factors of the simulated wind plants, as well as being time synchronized with available load profiles.As described in this presentation, the WIND Toolkit fulfills these requirements by providing a state-of-the-art national (US) wind resource, power production and forecast dataset.

  8. las distintas organizaciones

    Directory of Open Access Journals (Sweden)

    A. Maldonado Ibáñez

    2006-01-01

    Full Text Available El presente artículo trata sobre la definición de un repositorio-catálogo de estilos aplicables a la representación cartográfica de los distintos fenómenos geográficos, basándose en la simbología normalizada de las distintas Organizaciones Cartográficas, tales como el Instituto Geográfico Nacional, el Ministerio de Medio Ambiente, el Instituto Nacional de Estadística, etc. El catálogo permitirá una elección rápida y sencilla de los diversos estilos utilizados por dichos organismos oficiales. Se trata de definir un repositorio en el que almacenar los estilos individualizados aplicables a los fenómenos geográficos, en el que se disponga de la capacidad de insertar, borrar y actualizar nuevos estilos. El repositorio debe responder como catálogo sobre el que buscar y descargar estilos. Una de las aplicaciones del repositorio-catálogo es la generación de estilos en el formato Style Layer Descriptor (SLD para ser visualizados mediante Servidores de Mapas en Red conformes con OGC.

  9. Las carreras del futuro

    Directory of Open Access Journals (Sweden)

    Luis Piscoya Hermoza

    2011-07-01

    Full Text Available La primera década del siglo XXI ha estado signada por un énfasis en los estudiosde prospectiva, los mismos que, a partir de procesos de complejizacióny articulación de la sociedad planetaria como son la digitalización, el progresivodeterioro del medio ambiente, la necesidad del cambio de la matriz energéticay la globalización del mercado, han ensayado hipótesis para identificar lastendencias que se fortalecerán y profundizarán durante las próximas décadas.Desde el punto de vista metodológico, existen muchas maneras de clasificarlaspara conceptualizarlas y entenderlas mejor. Sin embargo, considerando lanaturaleza de esta edición nos proponemos enfocar directamente aquellas queestán estrechamente ligadas a la formación universitaria.Naturalmente, no pensamos que estamos ante tendencias inevitables en lamedida que epistemológicamente la creencia en leyes históricas deterministases insostenible y la práctica social nos muestra que la construcción del futuroes nuestra responsabilidad y siempre rebasa las mejores previsiones.

  10. Las crisis familiares

    Directory of Open Access Journals (Sweden)

    Idarmis González Benítez

    2000-06-01

    Full Text Available Con este trabajo nos propusimos actualizar el tema de las crisis familiares. Se pone de manifiesto que tanto los eventos propios del desarrollo, como los accidentales, dan lugar a la aparición de crisis en la familia. Queda esclarecido, que no necesariamente han de tener implicaciones negativas para la familia. Se hace énfasis en la valoración del evento por la familia y su significación. Se destaca el papel de los recursos familiares como protectores y moduladores de las crisis. Por último se señalan algunos pasos a seguir en la intervención familiarWith this paper we intend to update the topic of the family crises. It is shown that the own events of development as well as the accidental ones bring about the appearance of crises in the family. It is made clear that they not always have negative implications for the family. Emphasis is made on the assessment of the event for the family and its significance. The role played by the family resources as protective and modulators of the crisis is stressed. Finally, some of the steps to be taken in family intervention are mentioned

  11. Las cruces del olvido

    Directory of Open Access Journals (Sweden)

    Carla Fernandes

    2012-08-01

    Full Text Available Les tragiques événements du « mars paraguayen » ont signifié la disparition de l’espoir en la transition démocratique dans laquelle le pays s’était engagé depuis presque une dizaine d’années. Renée Ferrer, poète, dramaturge, auteur de plusieurs recueils de contes et de deux romans, consigne alors le témoignage poétique de ce drame national dans le recueil Las cruces del olvido (2001. Elle a auparavant publié d’autres textes qui peuvent rentrer dans la catégorie de la poésie sociale : c’est le cas en particulier de Viaje a destiempo (1989 dédié aux victimes de la dictature. Las cruces del olvido, dès son paratexte, insiste sur la fonction de témoignage que l’auteur souhaite assigner à ses vers et sur le rôle que ceux-ci peuvent jouer dans la transmission d’une mémoire collective. La voix d’un jeune disparu de ce « mars paraguayen », le dédoublement qu’il subit et travers lequel il évoque son entrée dans la mort et son absence du monde des vivants, rendent possible l’écriture de cette expérience de l’indicible qu’est la disparition.Los trágicos acontecimientos del «marzo paraguayo» significaron la desaparición de la esperanza en la transición democrática que el país está viviendo desde unos diez años. Renée Ferrer, poeta, dramaturga, autora de varios libros de cuentos y de dos novelas, escribe entonces el testimonio poético de ese drama nacional en su poemario Las cruces del olvido (2001. Ya había publicado antes textos que se pueden considerar como poesía social: es el caso de Viaje a destiempo (1989 dedicado a las víctimas de la dictadura. Desde el texto apertural, Las cruces del olvido potencia la función de testimonio que la autora desea conceder a sus versos y el papel que éstos pueden desempeñar en la transmisión de una memoria colectiva. La voz de un joven desaparecido del «marzo paraguayo», el desdoblamiento que sufre y a través del cual evoca su entrada en la

  12. Las histiocitosis Histiocytosis

    Directory of Open Access Journals (Sweden)

    Eva Svarch

    2001-12-01

    Full Text Available El término histiocitosis identifica un grupo de alteraciones que tienen en común la proliferación de células dendríticas (CD y los macrófagos, y se diagnostican más frecuentemente en niños. Dentro de las relacionadas con las CD las fundamentales son las histiocitosis a células de Langerhans (HCL. Las HCL tienen un comportamiento clínico muy variable, que puede ir desde una lesión que involucra un solo sitio o sistema hasta una enfermedad multisistémica. El tratamiento depende de la extensión del proceso. Una lesión única tiende a desaparecer espontáneamente. También la biopsia diagnóstica con o sin inyección de un esteroide puede iniciar la curación. Los pacientes con enfermedad multisistémica pueden beneficiarse con el tratamiento esteroideo y citostático o inclusive con el trasplante de células progenitoras hematopoyéticas. La histiocitosis sinusal con linfoadenopatías masivas o enfermedad de Rosai Dorfman se debe a la proliferación de los macrófagos, es de naturaleza benigna y usualmente autolimitada. Afecta sobre todo a niños y adultos jóvenes. La linfohistiocitosis hemofagocítica también se produce por la proliferación de los macrófagos y es una enfermedad rara con una alta mortalidad. Puede ser familiar (autosómica recesiva o secundaria a infecciones virales. Esta última forma se presenta más frecuentemente en el lactante pequeño. En la actualidad, sobre todo en la variedad familiar, el trasplante alogénico de células progenitoras hematopoyéticas puede ser la única medida curativaThe term histiocytosis identifies a group of disorders that have in common the proliferation of dentritic cells (DC and macrophages and is frequently diagnosed in children. Among the fundamental variants of histiocytosis related with DC, we find Langerhans cell histiocytosis (LCH. Langerhans cell histiocytosis has very variable clinical behavior that ranges from a lesion involving only one site or system to a multisystem

  13. Using Multiple Big Datasets and Machine Learning to Produce a New Global Particulate Dataset: A Technology Challenge Case Study

    Science.gov (United States)

    Lary, D. J.

    2013-12-01

    A BigData case study is described where multiple datasets from several satellites, high-resolution global meteorological data, social media and in-situ observations are combined using machine learning on a distributed cluster using an automated workflow. The global particulate dataset is relevant to global public health studies and would not be possible to produce without the use of the multiple big datasets, in-situ data and machine learning.To greatly reduce the development time and enhance the functionality a high level language capable of parallel processing has been used (Matlab). A key consideration for the system is high speed access due to the large data volume, persistence of the large data volumes and a precise process time scheduling capability.

  14. Would the ‘real’ observed dataset stand up? A critical examination of eight observed gridded climate datasets for China

    International Nuclear Information System (INIS)

    Sun, Qiaohong; Miao, Chiyuan; Duan, Qingyun; Kong, Dongxian; Ye, Aizhong; Di, Zhenhua; Gong, Wei

    2014-01-01

    This research compared and evaluated the spatio-temporal similarities and differences of eight widely used gridded datasets. The datasets include daily precipitation over East Asia (EA), the Climate Research Unit (CRU) product, the Global Precipitation Climatology Centre (GPCC) product, the University of Delaware (UDEL) product, Precipitation Reconstruction over Land (PREC/L), the Asian Precipitation Highly Resolved Observational (APHRO) product, the Institute of Atmospheric Physics (IAP) dataset from the Chinese Academy of Sciences, and the National Meteorological Information Center dataset from the China Meteorological Administration (CN05). The meteorological variables focus on surface air temperature (SAT) or precipitation (PR) in China. All datasets presented general agreement on the whole spatio-temporal scale, but some differences appeared for specific periods and regions. On a temporal scale, EA shows the highest amount of PR, while APHRO shows the lowest. CRU and UDEL show higher SAT than IAP or CN05. On a spatial scale, the most significant differences occur in western China for PR and SAT. For PR, the difference between EA and CRU is the largest. When compared with CN05, CRU shows higher SAT in the central and southern Northwest river drainage basin, UDEL exhibits higher SAT over the Southwest river drainage system, and IAP has lower SAT in the Tibetan Plateau. The differences in annual mean PR and SAT primarily come from summer and winter, respectively. Finally, potential factors impacting agreement among gridded climate datasets are discussed, including raw data sources, quality control (QC) schemes, orographic correction, and interpolation techniques. The implications and challenges of these results for climate research are also briefly addressed. (paper)

  15. Using Real Datasets for Interdisciplinary Business/Economics Projects

    Science.gov (United States)

    Goel, Rajni; Straight, Ronald L.

    2005-01-01

    The workplace's global and dynamic nature allows and requires improved approaches for providing business and economics education. In this article, the authors explore ways of enhancing students' understanding of course material by using nontraditional, real-world datasets of particular interest to them. Teaching at a historically Black university,…

  16. Dataset-driven research for improving recommender systems for learning

    NARCIS (Netherlands)

    Verbert, Katrien; Drachsler, Hendrik; Manouselis, Nikos; Wolpers, Martin; Vuorikari, Riina; Duval, Erik

    2011-01-01

    Verbert, K., Drachsler, H., Manouselis, N., Wolpers, M., Vuorikari, R., & Duval, E. (2011). Dataset-driven research for improving recommender systems for learning. In Ph. Long, & G. Siemens (Eds.), Proceedings of 1st International Conference Learning Analytics & Knowledge (pp. 44-53). February,

  17. dataTEL - Datasets for Technology Enhanced Learning

    NARCIS (Netherlands)

    Drachsler, Hendrik; Verbert, Katrien; Sicilia, Miguel-Angel; Wolpers, Martin; Manouselis, Nikos; Vuorikari, Riina; Lindstaedt, Stefanie; Fischer, Frank

    2011-01-01

    Drachsler, H., Verbert, K., Sicilia, M. A., Wolpers, M., Manouselis, N., Vuorikari, R., Lindstaedt, S., & Fischer, F. (2011). dataTEL - Datasets for Technology Enhanced Learning. STELLAR Alpine Rendez-Vous White Paper. Alpine Rendez-Vous 2011 White paper collection, Nr. 13., France (2011)

  18. A dataset of forest biomass structure for Eurasia.

    Science.gov (United States)

    Schepaschenko, Dmitry; Shvidenko, Anatoly; Usoltsev, Vladimir; Lakyda, Petro; Luo, Yunjian; Vasylyshyn, Roman; Lakyda, Ivan; Myklush, Yuriy; See, Linda; McCallum, Ian; Fritz, Steffen; Kraxner, Florian; Obersteiner, Michael

    2017-05-16

    The most comprehensive dataset of in situ destructive sampling measurements of forest biomass in Eurasia have been compiled from a combination of experiments undertaken by the authors and from scientific publications. Biomass is reported as four components: live trees (stem, bark, branches, foliage, roots); understory (above- and below ground); green forest floor (above- and below ground); and coarse woody debris (snags, logs, dead branches of living trees and dead roots), consisting of 10,351 unique records of sample plots and 9,613 sample trees from ca 1,200 experiments for the period 1930-2014 where there is overlap between these two datasets. The dataset also contains other forest stand parameters such as tree species composition, average age, tree height, growing stock volume, etc., when available. Such a dataset can be used for the development of models of biomass structure, biomass extension factors, change detection in biomass structure, investigations into biodiversity and species distribution and the biodiversity-productivity relationship, as well as the assessment of the carbon pool and its dynamics, among many others.

  19. A reanalysis dataset of the South China Sea

    Science.gov (United States)

    Zeng, Xuezhi; Peng, Shiqiu; Li, Zhijin; Qi, Yiquan; Chen, Rongyu

    2014-01-01

    Ocean reanalysis provides a temporally continuous and spatially gridded four-dimensional estimate of the ocean state for a better understanding of the ocean dynamics and its spatial/temporal variability. Here we present a 19-year (1992–2010) high-resolution ocean reanalysis dataset of the upper ocean in the South China Sea (SCS) produced from an ocean data assimilation system. A wide variety of observations, including in-situ temperature/salinity profiles, ship-measured and satellite-derived sea surface temperatures, and sea surface height anomalies from satellite altimetry, are assimilated into the outputs of an ocean general circulation model using a multi-scale incremental three-dimensional variational data assimilation scheme, yielding a daily high-resolution reanalysis dataset of the SCS. Comparisons between the reanalysis and independent observations support the reliability of the dataset. The presented dataset provides the research community of the SCS an important data source for studying the thermodynamic processes of the ocean circulation and meso-scale features in the SCS, including their spatial and temporal variability. PMID:25977803

  20. Comparision of analysis of the QTLMAS XII common dataset

    DEFF Research Database (Denmark)

    Crooks, Lucy; Sahana, Goutam; de Koning, Dirk-Jan

    2009-01-01

    As part of the QTLMAS XII workshop, a simulated dataset was distributed and participants were invited to submit analyses of the data based on genome-wide association, fine mapping and genomic selection. We have evaluated the findings from the groups that reported fine mapping and genome-wide asso...

  1. The LAMBADA dataset: Word prediction requiring a broad discourse context

    NARCIS (Netherlands)

    Paperno, D.; Kruszewski, G.; Lazaridou, A.; Pham, Q.N.; Bernardi, R.; Pezzelle, S.; Baroni, M.; Boleda, G.; Fernández, R.; Erk, K.; Smith, N.A.

    2016-01-01

    We introduce LAMBADA, a dataset to evaluate the capabilities of computational models for text understanding by means of a word prediction task. LAMBADA is a collection of narrative passages sharing the characteristic that human subjects are able to guess their last word if they are exposed to the

  2. NEW WEB-BASED ACCESS TO NUCLEAR STRUCTURE DATASETS.

    Energy Technology Data Exchange (ETDEWEB)

    WINCHELL,D.F.

    2004-09-26

    As part of an effort to migrate the National Nuclear Data Center (NNDC) databases to a relational platform, a new web interface has been developed for the dissemination of the nuclear structure datasets stored in the Evaluated Nuclear Structure Data File and Experimental Unevaluated Nuclear Data List.

  3. Cross-Cultural Concept Mapping of Standardized Datasets

    DEFF Research Database (Denmark)

    Kano Glückstad, Fumiko

    2012-01-01

    This work compares four feature-based similarity measures derived from cognitive sciences. The purpose of the comparative analysis is to verify the potentially most effective model that can be applied for mapping independent ontologies in a culturally influenced domain [1]. Here, datasets based...

  4. Level-1 muon trigger performance with the full 2017 dataset

    CERN Document Server

    CMS Collaboration

    2018-01-01

    This document describes the performance of the CMS Level-1 Muon Trigger with the full dataset of 2017. Efficiency plots are included for each track finder (TF) individually and for the system as a whole. The efficiency is measured to be greater than 90% for all track finders.

  5. A Dataset for Visual Navigation with Neuromorphic Methods

    Directory of Open Access Journals (Sweden)

    Francisco eBarranco

    2016-02-01

    Full Text Available Standardized benchmarks in Computer Vision have greatly contributed to the advance of approaches to many problems in the field. If we want to enhance the visibility of event-driven vision and increase its impact, we will need benchmarks that allow comparison among different neuromorphic methods as well as comparison to Computer Vision conventional approaches. We present datasets to evaluate the accuracy of frame-free and frame-based approaches for tasks of visual navigation. Similar to conventional Computer Vision datasets, we provide synthetic and real scenes, with the synthetic data created with graphics packages, and the real data recorded using a mobile robotic platform carrying a dynamic and active pixel vision sensor (DAVIS and an RGB+Depth sensor. For both datasets the cameras move with a rigid motion in a static scene, and the data includes the images, events, optic flow, 3D camera motion, and the depth of the scene, along with calibration procedures. Finally, we also provide simulated event data generated synthetically from well-known frame-based optical flow datasets.

  6. Evaluation of Uncertainty in Precipitation Datasets for New Mexico, USA

    Science.gov (United States)

    Besha, A. A.; Steele, C. M.; Fernald, A.

    2014-12-01

    Climate change, population growth and other factors are endangering water availability and sustainability in semiarid/arid areas particularly in the southwestern United States. Wide coverage of spatial and temporal measurements of precipitation are key for regional water budget analysis and hydrological operations which themselves are valuable tool for water resource planning and management. Rain gauge measurements are usually reliable and accurate at a point. They measure rainfall continuously, but spatial sampling is limited. Ground based radar and satellite remotely sensed precipitation have wide spatial and temporal coverage. However, these measurements are indirect and subject to errors because of equipment, meteorological variability, the heterogeneity of the land surface itself and lack of regular recording. This study seeks to understand precipitation uncertainty and in doing so, lessen uncertainty propagation into hydrological applications and operations. We reviewed, compared and evaluated the TRMM (Tropical Rainfall Measuring Mission) precipitation products, NOAA's (National Oceanic and Atmospheric Administration) Global Precipitation Climatology Centre (GPCC) monthly precipitation dataset, PRISM (Parameter elevation Regression on Independent Slopes Model) data and data from individual climate stations including Cooperative Observer Program (COOP), Remote Automated Weather Stations (RAWS), Soil Climate Analysis Network (SCAN) and Snowpack Telemetry (SNOTEL) stations. Though not yet finalized, this study finds that the uncertainty within precipitation estimates datasets is influenced by regional topography, season, climate and precipitation rate. Ongoing work aims to further evaluate precipitation datasets based on the relative influence of these phenomena so that we can identify the optimum datasets for input to statewide water budget analysis.

  7. Dataset: Multi Sensor-Orientation Movement Data of Goats

    NARCIS (Netherlands)

    Kamminga, Jacob Wilhelm

    2018-01-01

    This is a labeled dataset. Motion data were collected from six sensor nodes that were fixed with different orientations to a collar around the neck of goats. These six sensor nodes simultaneously, with different orientations, recorded various activities performed by the goat. We recorded the

  8. A dataset of human decision-making in teamwork management

    Science.gov (United States)

    Yu, Han; Shen, Zhiqi; Miao, Chunyan; Leung, Cyril; Chen, Yiqiang; Fauvel, Simon; Lin, Jun; Cui, Lizhen; Pan, Zhengxiang; Yang, Qiang

    2017-01-01

    Today, most endeavours require teamwork by people with diverse skills and characteristics. In managing teamwork, decisions are often made under uncertainty and resource constraints. The strategies and the effectiveness of the strategies different people adopt to manage teamwork under different situations have not yet been fully explored, partially due to a lack of detailed large-scale data. In this paper, we describe a multi-faceted large-scale dataset to bridge this gap. It is derived from a game simulating complex project management processes. It presents the participants with different conditions in terms of team members' capabilities and task characteristics for them to exhibit their decision-making strategies. The dataset contains detailed data reflecting the decision situations, decision strategies, decision outcomes, and the emotional responses of 1,144 participants from diverse backgrounds. To our knowledge, this is the first dataset simultaneously covering these four facets of decision-making. With repeated measurements, the dataset may help establish baseline variability of decision-making in teamwork management, leading to more realistic decision theoretic models and more effective decision support approaches.

  9. UK surveillance: provision of quality assured information from combined datasets.

    Science.gov (United States)

    Paiba, G A; Roberts, S R; Houston, C W; Williams, E C; Smith, L H; Gibbens, J C; Holdship, S; Lysons, R

    2007-09-14

    Surveillance information is most useful when provided within a risk framework, which is achieved by presenting results against an appropriate denominator. Often the datasets are captured separately and for different purposes, and will have inherent errors and biases that can be further confounded by the act of merging. The United Kingdom Rapid Analysis and Detection of Animal-related Risks (RADAR) system contains data from several sources and provides both data extracts for research purposes and reports for wider stakeholders. Considerable efforts are made to optimise the data in RADAR during the Extraction, Transformation and Loading (ETL) process. Despite efforts to ensure data quality, the final dataset inevitably contains some data errors and biases, most of which cannot be rectified during subsequent analysis. So, in order for users to establish the 'fitness for purpose' of data merged from more than one data source, Quality Statements are produced as defined within the overarching surveillance Quality Framework. These documents detail identified data errors and biases following ETL and report construction as well as relevant aspects of the datasets from which the data originated. This paper illustrates these issues using RADAR datasets, and describes how they can be minimised.

  10. participatory development of a minimum dataset for the khayelitsha ...

    African Journals Online (AJOL)

    This dataset was integrated with data requirements at ... model for defining health information needs at district level. This participatory process has enabled health workers to appraise their .... of reproductive health, mental health, disability and community ... each chose a facilitator and met in between the forum meetings.

  11. Comparision of analysis of the QTLMAS XII common dataset

    DEFF Research Database (Denmark)

    Lund, Mogens Sandø; Sahana, Goutam; de Koning, Dirk-Jan

    2009-01-01

    A dataset was simulated and distributed to participants of the QTLMAS XII workshop who were invited to develop genomic selection models. Each contributing group was asked to describe the model development and validation as well as to submit genomic predictions for three generations of individuals...

  12. The NASA Subsonic Jet Particle Image Velocimetry (PIV) Dataset

    Science.gov (United States)

    Bridges, James; Wernet, Mark P.

    2011-01-01

    Many tasks in fluids engineering require prediction of turbulence of jet flows. The present document documents the single-point statistics of velocity, mean and variance, of cold and hot jet flows. The jet velocities ranged from 0.5 to 1.4 times the ambient speed of sound, and temperatures ranged from unheated to static temperature ratio 2.7. Further, the report assesses the accuracies of the data, e.g., establish uncertainties for the data. This paper covers the following five tasks: (1) Document acquisition and processing procedures used to create the particle image velocimetry (PIV) datasets. (2) Compare PIV data with hotwire and laser Doppler velocimetry (LDV) data published in the open literature. (3) Compare different datasets acquired at the same flow conditions in multiple tests to establish uncertainties. (4) Create a consensus dataset for a range of hot jet flows, including uncertainty bands. (5) Analyze this consensus dataset for self-consistency and compare jet characteristics to those of the open literature. The final objective was fulfilled by using the potential core length and the spread rate of the half-velocity radius to collapse of the mean and turbulent velocity fields over the first 20 jet diameters.

  13. Ambiente psicologico en las organizaciones

    Directory of Open Access Journals (Sweden)

    Damarcy Cortés Baracaldo

    2002-01-01

    Full Text Available El talento humano en las organizaciones se ha convertido en las ultimas decadas en un recurso que se administra de acuerdo al estilo de liderazgo del jefe, lo que implica una marcada relación hacia la tarea, hacia las relaciones con el personal o una combinación de estas dos, que desencadenan en un ambiente psicológico exclusive en cada organización.

  14. A new dataset validation system for the Planetary Science Archive

    Science.gov (United States)

    Manaud, N.; Zender, J.; Heather, D.; Martinez, S.

    2007-08-01

    The Planetary Science Archive is the official archive for the Mars Express mission. It has received its first data by the end of 2004. These data are delivered by the PI teams to the PSA team as datasets, which are formatted conform to the Planetary Data System (PDS). The PI teams are responsible for analyzing and calibrating the instrument data as well as the production of reduced and calibrated data. They are also responsible of the scientific validation of these data. ESA is responsible of the long-term data archiving and distribution to the scientific community and must ensure, in this regard, that all archived products meet quality. To do so, an archive peer-review is used to control the quality of the Mars Express science data archiving process. However a full validation of its content is missing. An independent review board recently recommended that the completeness of the archive as well as the consistency of the delivered data should be validated following well-defined procedures. A new validation software tool is being developed to complete the overall data quality control system functionality. This new tool aims to improve the quality of data and services provided to the scientific community through the PSA, and shall allow to track anomalies in and to control the completeness of datasets. It shall ensure that the PSA end-users: (1) can rely on the result of their queries, (2) will get data products that are suitable for scientific analysis, (3) can find all science data acquired during a mission. We defined dataset validation as the verification and assessment process to check the dataset content against pre-defined top-level criteria, which represent the general characteristics of good quality datasets. The dataset content that is checked includes the data and all types of information that are essential in the process of deriving scientific results and those interfacing with the PSA database. The validation software tool is a multi-mission tool that

  15. Data Recommender: An Alternative Way to Discover Open Scientific Datasets

    Science.gov (United States)

    Klump, J. F.; Devaraju, A.; Williams, G.; Hogan, D.; Davy, R.; Page, J.; Singh, D.; Peterson, N.

    2017-12-01

    Over the past few years, institutions and government agencies have adopted policies to openly release their data, which has resulted in huge amounts of open data becoming available on the web. When trying to discover the data, users face two challenges: an overload of choice and the limitations of the existing data search tools. On the one hand, there are too many datasets to choose from, and therefore, users need to spend considerable effort to find the datasets most relevant to their research. On the other hand, data portals commonly offer keyword and faceted search, which depend fully on the user queries to search and rank relevant datasets. Consequently, keyword and faceted search may return loosely related or irrelevant results, although the results may contain the same query. They may also return highly specific results that depend more on how well metadata was authored. They do not account well for variance in metadata due to variance in author styles and preferences. The top-ranked results may also come from the same data collection, and users are unlikely to discover new and interesting datasets. These search modes mainly suits users who can express their information needs in terms of the structure and terminology of the data portals, but may pose a challenge otherwise. The above challenges reflect that we need a solution that delivers the most relevant (i.e., similar and serendipitous) datasets to users, beyond the existing search functionalities on the portals. A recommender system is an information filtering system that presents users with relevant and interesting contents based on users' context and preferences. Delivering data recommendations to users can make data discovery easier, and as a result may enhance user engagement with the portal. We developed a hybrid data recommendation approach for the CSIRO Data Access Portal. The approach leverages existing recommendation techniques (e.g., content-based filtering and item co-occurrence) to produce

  16. Las industrias en hueso

    Directory of Open Access Journals (Sweden)

    Jorge Martinez-Moreno

    2005-01-01

    Full Text Available La elaboración de instrumentos en hueso con anterioridad a la aparición de Homo sapiens moderno ha sido descrita en numerosos yacimientos del Paleolítico Medio, entre ellos en Lezetxiki y Axlor. En este artículo se pretende evaluar si estos artefactos pueden considerarse el resultado de comportamientios tecnicos dirigidos a elaborar y/o utilizar esos soportes. Estas observaciones son extensibles a otros conjuntos en los que se mensionan estos óseos con atributos similares. Es esta revisión se describen los procesos y contextos que explican la presencia de esas modificaciones y a las que se les ha atribuido un significado tecnológico

  17. Las culturas no existen

    Directory of Open Access Journals (Sweden)

    Joel Feliu i Samuel-Lajeunesse

    2004-05-01

    Full Text Available Tras un somero recordatorio del hecho, a menudo olvidado, de que la cultura no es una realidad social sino tan solo un concepto, el artículo pretende argumentar en pro de la inexistencia de tal entidad (a pesar de las apariencias. La pregunta principal a la que se intenta responder es cual es el papel ideológico que tiene la idea de cultura, es decir ¿qué efectos de construcción de realidad genera? Para ello se realiza una crítica del concepto y de sus usos argumentando que tanto el concepto como su uso cotidiano entrañan prácticas de clasificación y segregación. Finalmente se aboga por la caída en desuso del concepto y por nuevas formas de pensar la diversidad humana más acordes con la posibilidad de una ética situada.

  18. OMC y las telecomunicaciones

    OpenAIRE

    Orbe Astudillo, Marcos

    2010-01-01

    El mercado mundial de las telecomunicaciones crece rápidamente convirtiéndose en un sector de mayor crecimiento en la economía mundial y en uno de los componentes más importantes de la actividad social, cultural y política del mundo, por ello, la tendencia mundial es la liberalización de los mercados de bienes y servicios de telecomunicaciones y tecnologías de la información y comunicación (TIC´s), volviéndose muy importante para éste propósito el actuar de la Organización Mundial del Comerci...

  19. Comparison of global 3-D aviation emissions datasets

    Directory of Open Access Journals (Sweden)

    S. C. Olsen

    2013-01-01

    Full Text Available Aviation emissions are unique from other transportation emissions, e.g., from road transportation and shipping, in that they occur at higher altitudes as well as at the surface. Aviation emissions of carbon dioxide, soot, and water vapor have direct radiative impacts on the Earth's climate system while emissions of nitrogen oxides (NOx, sulfur oxides, carbon monoxide (CO, and hydrocarbons (HC impact air quality and climate through their effects on ozone, methane, and clouds. The most accurate estimates of the impact of aviation on air quality and climate utilize three-dimensional chemistry-climate models and gridded four dimensional (space and time aviation emissions datasets. We compare five available aviation emissions datasets currently and historically used to evaluate the impact of aviation on climate and air quality: NASA-Boeing 1992, NASA-Boeing 1999, QUANTIFY 2000, Aero2k 2002, and AEDT 2006 and aviation fuel usage estimates from the International Energy Agency. Roughly 90% of all aviation emissions are in the Northern Hemisphere and nearly 60% of all fuelburn and NOx emissions occur at cruise altitudes in the Northern Hemisphere. While these datasets were created by independent methods and are thus not strictly suitable for analyzing trends they suggest that commercial aviation fuelburn and NOx emissions increased over the last two decades while HC emissions likely decreased and CO emissions did not change significantly. The bottom-up estimates compared here are consistently lower than International Energy Agency fuelburn statistics although the gap is significantly smaller in the more recent datasets. Overall the emissions distributions are quite similar for fuelburn and NOx with regional peaks over the populated land masses of North America, Europe, and East Asia. For CO and HC there are relatively larger differences. There are however some distinct differences in the altitude distribution

  20. Geoseq: a tool for dissecting deep-sequencing datasets

    Directory of Open Access Journals (Sweden)

    Homann Robert

    2010-10-01

    Full Text Available Abstract Background Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO, Sequence Read Archive (SRA hosted by the NCBI, or the DNA Data Bank of Japan (ddbj. Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Results Geoseq http://geoseq.mssm.edu provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Conclusions Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to, a identify differential isoform expression in mRNA-seq datasets, b identify miRNAs (microRNAs in libraries, and identify mature and star sequences in miRNAS and c to identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.

  1. On sample size and different interpretations of snow stability datasets

    Science.gov (United States)

    Schirmer, M.; Mitterer, C.; Schweizer, J.

    2009-04-01

    Interpretations of snow stability variations need an assessment of the stability itself, independent of the scale investigated in the study. Studies on stability variations at a regional scale have often chosen stability tests such as the Rutschblock test or combinations of various tests in order to detect differences in aspect and elevation. The question arose: ‘how capable are such stability interpretations in drawing conclusions'. There are at least three possible errors sources: (i) the variance of the stability test itself; (ii) the stability variance at an underlying slope scale, and (iii) that the stability interpretation might not be directly related to the probability of skier triggering. Various stability interpretations have been proposed in the past that provide partly different results. We compared a subjective one based on expert knowledge with a more objective one based on a measure derived from comparing skier-triggered slopes vs. slopes that have been skied but not triggered. In this study, the uncertainties are discussed and their effects on regional scale stability variations will be quantified in a pragmatic way. An existing dataset with very large sample sizes was revisited. This dataset contained the variance of stability at a regional scale for several situations. The stability in this dataset was determined using the subjective interpretation scheme based on expert knowledge. The question to be answered was how many measurements were needed to obtain similar results (mainly stability differences in aspect or elevation) as with the complete dataset. The optimal sample size was obtained in several ways: (i) assuming a nominal data scale the sample size was determined with a given test, significance level and power, and by calculating the mean and standard deviation of the complete dataset. With this method it can also be determined if the complete dataset consists of an appropriate sample size. (ii) Smaller subsets were created with similar

  2. Dejar las lagrimas e ir hacia las acciones

    DEFF Research Database (Denmark)

    Jeppesen, Anne Marie Ejdesgaard

    2015-01-01

    Este artículo enfoca en la frontera entre México y los Estados Unidos, las condiciones especiales de la sociedad fronteriza, las economías interconectadas y la violencia, en especial la violencia ejercida contra mujeres y los feminicidios. El artículo discute diferentes perspectivas y maneras de...

  3. Violencia contra las mujeres: las dimensiones de la desigualdad

    OpenAIRE

    Ana Isabel Blanco García

    2008-01-01

    En este número entraremos directamente en el debate acerca de la "Violencia contra las mujeres: las dimensiones de a desigualdad", que desde hace unos años recorre todo el sustrato del pensamiento feminista y del que en buena medida es deudor el propio título de nuestra revista.

  4. Violencia contra las mujeres: las dimensiones de la desigualdad

    Directory of Open Access Journals (Sweden)

    Ana Isabel Blanco García

    2008-12-01

    Full Text Available En este número entraremos directamente en el debate acerca de la "Violencia contra las mujeres: las dimensiones de a desigualdad", que desde hace unos años recorre todo el sustrato del pensamiento feminista y del que en buena medida es deudor el propio título de nuestra revista.

  5. Las TIC como fuente de ventaja competitiva en las PYMES

    Directory of Open Access Journals (Sweden)

    Álvaro Fernando Moncada Niño

    2013-06-01

    Full Text Available Desde su aparición, las Tecnologías de la Información y las Comunicaciones (TIC se han convertido en un recurso fundamental de las empresas para competir en la mayoría de las industrias, generalizando el planteamiento de que son fuente de ventaja competitiva. Pero realmente, ¿cuándo las TIC son fuente potencial de ventaja competitiva para las pymes? ¿Bajo qué condiciones se logra que contribuyan a generar valor y mejorar su posición competitiva? ¿Qué características deben cumplir para alcanzar y sustentar la Ventaja Competitiva? Este documento basado en la Teoría de los Recursos y Capacidades (TRC responde estas preguntas y presenta al análisis de recurso Valioso, Raro, Inimitable y Organización (VRIO como herramienta para la determinación del potencial y valor que las TIC pueden alcanzar en las pymes y cómo pueden contribuir a que la empresa alcance ventajas competitivas sostenibles, en complemento de sus recursos y capacidades organizacionales.

  6. Las TIC como fuente de ventaja competitiva en las PYMES

    Directory of Open Access Journals (Sweden)

    Álvaro Fernando Moncada Niño

    2013-07-01

    Full Text Available Desde su aparición, las Tecnologías de la Información y las Comunicaciones (TIC se han convertido en un recurso fundamental de las empresas para competir en la mayoría de las industrias, generalizando el planteamiento de que son fuente de ventaja competitiva. Pero realmente, ¿cuándo las TIC son fuente potencial de ventaja competitiva para las pymes? ¿Bajo qué condiciones se logra que contribuyan a generar valor y mejorar su posición competitiva? ¿Qué características deben cumplir para alcanzar y sustentar la Ventaja Competitiva? Este documento basado en la Teoría de los Recursos y Capacidades (TRC responde estas preguntas y presenta al análisis de recurso Valioso, Raro, Inimitable y Organización (VRIO como herramienta para la determinación del potencial y valor que las TIC pueden alcanzar en las pymes y cómo pueden contribuir a que la empresa alcance ventajas competitivas sostenibles, en complemento de sus recursos y capacidades organizacionales.

  7. Seguridad de las aplicaciones web

    OpenAIRE

    Luján Mora, Sergio

    2016-01-01

    Conferencia "Seguridad de las aplicaciones web" impartida en las III Jornadas Informáticas - UTE 2016 en la Facultad de Ciencias de la Ingeniería e Industrias de la Universidad Tecnológica Equinoccial (Quito, Ecuador) el 5 de julio de 2016.

  8. Las aventuras con el autoscopio

    Directory of Open Access Journals (Sweden)

    Víctor Florencio Ramírez Hernández

    2006-01-01

    Full Text Available El trabajo se inspira en el famoso personaje Harry Potter para hacer un análisis entre la filosofía y las neurociencias, específicamente sobre las condiciones funcionales que hacen posible filosofar.

  9. La convergencia de las telecomunicaciones

    OpenAIRE

    Galarza, D.

    2000-01-01

    Presenta la evolución de la electrónica y como esta ha influenciado en el desarrollo de las telecomunicaciones, en la creación de redes fijas y móviles. Además visualiza el futuro convergente de las telecomunicaciones.

  10. Revestimientos corrosibles de las obras

    Directory of Open Access Journals (Sweden)

    Jiménez Montoya, P.

    1958-02-01

    Full Text Available Not availableEstudio y descripción de los distintos tipos de revestimiento que requieren las obras industriales para protegerlas contra las acciones químicas desarrolladas por las substancias puestas en contacto con ellas. El amplio estado evolutivo actual de los procedimientos industriales exige, como es natural, que las construcciones modernas se proyecten con un conocimiento claro, no ya de su estabilidad estructural, sino de la de sus propios materiales, que, por fenómenos corrosivos, pueden causar la ruina de la construcción. En este trabajo se dan los procedimientos de revestir, las características más importantes de estas protecciones y materiales empleados y, finalmente, su comportamiento en contacto con los agentes agresivos normalmente manipulados en la industria.

  11. A multimodal MRI dataset of professional chess players.

    Science.gov (United States)

    Li, Kaiming; Jiang, Jing; Qiu, Lihua; Yang, Xun; Huang, Xiaoqi; Lui, Su; Gong, Qiyong

    2015-01-01

    Chess is a good model to study high-level human brain functions such as spatial cognition, memory, planning, learning and problem solving. Recent studies have demonstrated that non-invasive MRI techniques are valuable for researchers to investigate the underlying neural mechanism of playing chess. For professional chess players (e.g., chess grand masters and masters or GM/Ms), what are the structural and functional alterations due to long-term professional practice, and how these alterations relate to behavior, are largely veiled. Here, we report a multimodal MRI dataset from 29 professional Chinese chess players (most of whom are GM/Ms), and 29 age matched novices. We hope that this dataset will provide researchers with new materials to further explore high-level human brain functions.

  12. Knowledge discovery with classification rules in a cardiovascular dataset.

    Science.gov (United States)

    Podgorelec, Vili; Kokol, Peter; Stiglic, Milojka Molan; Hericko, Marjan; Rozman, Ivan

    2005-12-01

    In this paper we study an evolutionary machine learning approach to data mining and knowledge discovery based on the induction of classification rules. A method for automatic rules induction called AREX using evolutionary induction of decision trees and automatic programming is introduced. The proposed algorithm is applied to a cardiovascular dataset consisting of different groups of attributes which should possibly reveal the presence of some specific cardiovascular problems in young patients. A case study is presented that shows the use of AREX for the classification of patients and for discovering possible new medical knowledge from the dataset. The defined knowledge discovery loop comprises a medical expert's assessment of induced rules to drive the evolution of rule sets towards more appropriate solutions. The final result is the discovery of a possible new medical knowledge in the field of pediatric cardiology.

  13. Augmented Reality Prototype for Visualizing Large Sensors’ Datasets

    Directory of Open Access Journals (Sweden)

    Folorunso Olufemi A.

    2011-04-01

    Full Text Available This paper addressed the development of an augmented reality (AR based scientific visualization system prototype that supports identification, localisation, and 3D visualisation of oil leakages sensors datasets. Sensors generates significant amount of multivariate datasets during normal and leak situations which made data exploration and visualisation daunting tasks. Therefore a model to manage such data and enhance computational support needed for effective explorations are developed in this paper. A challenge of this approach is to reduce the data inefficiency. This paper presented a model for computing information gain for each data attributes and determine a lead attribute.The computed lead attribute is then used for the development of an AR-based scientific visualization interface which automatically identifies, localises and visualizes all necessary data relevant to a particularly selected region of interest (ROI on the network. Necessary architectural system supports and the interface requirements for such visualizations are also presented.

  14. An integrated dataset for in silico drug discovery

    Directory of Open Access Journals (Sweden)

    Cockell Simon J

    2010-12-01

    Full Text Available Drug development is expensive and prone to failure. It is potentially much less risky and expensive to reuse a drug developed for one condition for treating a second disease, than it is to develop an entirely new compound. Systematic approaches to drug repositioning are needed to increase throughput and find candidates more reliably. Here we address this need with an integrated systems biology dataset, developed using the Ondex data integration platform, for the in silico discovery of new drug repositioning candidates. We demonstrate that the information in this dataset allows known repositioning examples to be discovered. We also propose a means of automating the search for new treatment indications of existing compounds.

  15. Application of Density Estimation Methods to Datasets from a Glider

    Science.gov (United States)

    2014-09-30

    humpback and sperm whales as well as different dolphin species. OBJECTIVES The objective of this research is to extend existing methods for cetacean...collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources...estimation from single sensor datasets. Required steps for a cue counting approach, where a cue has been defined as a clicking event (Küsel et al., 2011), to

  16. A review of continent scale hydrological datasets available for Africa

    OpenAIRE

    Bonsor, H.C.

    2010-01-01

    As rainfall becomes less reliable with predicted climate change the ability to assess the spatial and seasonal variations in groundwater availability on a large-scale (catchment and continent) is becoming increasingly important (Bates, et al. 2007; MacDonald et al. 2009). The scarcity of observed hydrological data, or difficulty in obtaining such data, within Africa means remotely sensed (RS) datasets must often be used to drive large-scale hydrological models. The different ap...

  17. Dataset of mitochondrial genome variants in oncocytic tumors

    Directory of Open Access Journals (Sweden)

    Lihua Lyu

    2018-04-01

    Full Text Available This dataset presents the mitochondrial genome variants associated with oncocytic tumors. These data were obtained by Sanger sequencing of the whole mitochondrial genomes of oncocytic tumors and the adjacent normal tissues from 32 patients. The mtDNA variants are identified after compared with the revised Cambridge sequence, excluding those defining haplogroups of our patients. The pathogenic prediction for the novel missense variants found in this study was performed with the Mitimpact 2 program.

  18. GLEAM version 3: Global Land Evaporation Datasets and Model

    Science.gov (United States)

    Martens, B.; Miralles, D. G.; Lievens, H.; van der Schalie, R.; de Jeu, R.; Fernandez-Prieto, D.; Verhoest, N.

    2015-12-01

    Terrestrial evaporation links energy, water and carbon cycles over land and is therefore a key variable of the climate system. However, the global-scale magnitude and variability of the flux, and the sensitivity of the underlying physical process to changes in environmental factors, are still poorly understood due to limitations in in situ measurements. As a result, several methods have risen to estimate global patterns of land evaporation from satellite observations. However, these algorithms generally differ in their approach to model evaporation, resulting in large differences in their estimates. One of these methods is GLEAM, the Global Land Evaporation: the Amsterdam Methodology. GLEAM estimates terrestrial evaporation based on daily satellite observations of meteorological variables, vegetation characteristics and soil moisture. Since the publication of the first version of the algorithm (2011), the model has been widely applied to analyse trends in the water cycle and land-atmospheric feedbacks during extreme hydrometeorological events. A third version of the GLEAM global datasets is foreseen by the end of 2015. Given the relevance of having a continuous and reliable record of global-scale evaporation estimates for climate and hydrological research, the establishment of an online data portal to host these data to the public is also foreseen. In this new release of the GLEAM datasets, different components of the model have been updated, with the most significant change being the revision of the data assimilation algorithm. In this presentation, we will highlight the most important changes of the methodology and present three new GLEAM datasets and their validation against in situ observations and an alternative dataset of terrestrial evaporation (ERA-Land). Results of the validation exercise indicate that the magnitude and the spatiotemporal variability of the modelled evaporation agree reasonably well with the estimates of ERA-Land and the in situ

  19. Soil chemistry in lithologically diverse datasets: the quartz dilution effect

    Science.gov (United States)

    Bern, Carleton R.

    2009-01-01

    National- and continental-scale soil geochemical datasets are likely to move our understanding of broad soil geochemistry patterns forward significantly. Patterns of chemistry and mineralogy delineated from these datasets are strongly influenced by the composition of the soil parent material, which itself is largely a function of lithology and particle size sorting. Such controls present a challenge by obscuring subtler patterns arising from subsequent pedogenic processes. Here the effect of quartz concentration is examined in moist-climate soils from a pilot dataset of the North American Soil Geochemical Landscapes Project. Due to variable and high quartz contents (6.2–81.7 wt.%), and its residual and inert nature in soil, quartz is demonstrated to influence broad patterns in soil chemistry. A dilution effect is observed whereby concentrations of various elements are significantly and strongly negatively correlated with quartz. Quartz content drives artificial positive correlations between concentrations of some elements and obscures negative correlations between others. Unadjusted soil data show the highly mobile base cations Ca, Mg, and Na to be often strongly positively correlated with intermediately mobile Al or Fe, and generally uncorrelated with the relatively immobile high-field-strength elements (HFS) Ti and Nb. Both patterns are contrary to broad expectations for soils being weathered and leached. After transforming bulk soil chemistry to a quartz-free basis, the base cations are generally uncorrelated with Al and Fe, and negative correlations generally emerge with the HFS elements. Quartz-free element data may be a useful tool for elucidating patterns of weathering or parent-material chemistry in large soil datasets.

  20. Dataset on records of Hericium erinaceus in Slovakia

    OpenAIRE

    Vladimír Kunca; Marek Čiliak

    2017-01-01

    The data presented in this article are related to the research article entitled ?Habitat preferences of Hericium erinaceus in Slovakia? (Kunca and ?iliak, 2016) [FUNECO607] [2]. The dataset include all available and unpublished data from Slovakia, besides the records from the same tree or stem. We compiled a database of records of collections by processing data from herbaria, personal records and communication with mycological activists. Data on altitude, tree species, host tree vital status,...

  1. Diffeomorphic Iterative Centroid Methods for Template Estimation on Large Datasets

    OpenAIRE

    Cury , Claire; Glaunès , Joan Alexis; Colliot , Olivier

    2014-01-01

    International audience; A common approach for analysis of anatomical variability relies on the stimation of a template representative of the population. The Large Deformation Diffeomorphic Metric Mapping is an attractive framework for that purpose. However, template estimation using LDDMM is computationally expensive, which is a limitation for the study of large datasets. This paper presents an iterative method which quickly provides a centroid of the population in the shape space. This centr...

  2. A Dataset from TIMSS to Examine the Relationship between Computer Use and Mathematics Achievement

    Science.gov (United States)

    Kadijevich, Djordje M.

    2015-01-01

    Because the relationship between computer use and achievement is still puzzling, there is a need to prepare and analyze good quality datasets on computer use and achievement. Such a dataset can be derived from TIMSS data. This paper describes how this dataset can be prepared. It also gives an example of how the dataset may be analyzed. The…

  3. An Analysis on Better Testing than Training Performances on the Iris Dataset

    NARCIS (Netherlands)

    Schutten, Marten; Wiering, Marco

    2016-01-01

    The Iris dataset is a well known dataset containing information on three different types of Iris flowers. A typical and popular method for solving classification problems on datasets such as the Iris set is the support vector machine (SVM). In order to do so the dataset is separated in a set used

  4. Parton Distributions based on a Maximally Consistent Dataset

    Science.gov (United States)

    Rojo, Juan

    2016-04-01

    The choice of data that enters a global QCD analysis can have a substantial impact on the resulting parton distributions and their predictions for collider observables. One of the main reasons for this has to do with the possible presence of inconsistencies, either internal within an experiment or external between different experiments. In order to assess the robustness of the global fit, different definitions of a conservative PDF set, that is, a PDF set based on a maximally consistent dataset, have been introduced. However, these approaches are typically affected by theory biases in the selection of the dataset. In this contribution, after a brief overview of recent NNPDF developments, we propose a new, fully objective, definition of a conservative PDF set, based on the Bayesian reweighting approach. Using the new NNPDF3.0 framework, we produce various conservative sets, which turn out to be mutually in agreement within the respective PDF uncertainties, as well as with the global fit. We explore some of their implications for LHC phenomenology, finding also good consistency with the global fit result. These results provide a non-trivial validation test of the new NNPDF3.0 fitting methodology, and indicate that possible inconsistencies in the fitted dataset do not affect substantially the global fit PDFs.

  5. New public dataset for spotting patterns in medieval document images

    Science.gov (United States)

    En, Sovann; Nicolas, Stéphane; Petitjean, Caroline; Jurie, Frédéric; Heutte, Laurent

    2017-01-01

    With advances in technology, a large part of our cultural heritage is becoming digitally available. In particular, in the field of historical document image analysis, there is now a growing need for indexing and data mining tools, thus allowing us to spot and retrieve the occurrences of an object of interest, called a pattern, in a large database of document images. Patterns may present some variability in terms of color, shape, or context, making the spotting of patterns a challenging task. Pattern spotting is a relatively new field of research, still hampered by the lack of available annotated resources. We present a new publicly available dataset named DocExplore dedicated to spotting patterns in historical document images. The dataset contains 1500 images and 1464 queries, and allows the evaluation of two tasks: image retrieval and pattern localization. A standardized benchmark protocol along with ad hoc metrics is provided for a fair comparison of the submitted approaches. We also provide some first results obtained with our baseline system on this new dataset, which show that there is room for improvement and that should encourage researchers of the document image analysis community to design new systems and submit improved results.

  6. Kernel-based discriminant feature extraction using a representative dataset

    Science.gov (United States)

    Li, Honglin; Sancho Gomez, Jose-Luis; Ahalt, Stanley C.

    2002-07-01

    Discriminant Feature Extraction (DFE) is widely recognized as an important pre-processing step in classification applications. Most DFE algorithms are linear and thus can only explore the linear discriminant information among the different classes. Recently, there has been several promising attempts to develop nonlinear DFE algorithms, among which is Kernel-based Feature Extraction (KFE). The efficacy of KFE has been experimentally verified by both synthetic data and real problems. However, KFE has some known limitations. First, KFE does not work well for strongly overlapped data. Second, KFE employs all of the training set samples during the feature extraction phase, which can result in significant computation when applied to very large datasets. Finally, KFE can result in overfitting. In this paper, we propose a substantial improvement to KFE that overcomes the above limitations by using a representative dataset, which consists of critical points that are generated from data-editing techniques and centroid points that are determined by using the Frequency Sensitive Competitive Learning (FSCL) algorithm. Experiments show that this new KFE algorithm performs well on significantly overlapped datasets, and it also reduces computational complexity. Further, by controlling the number of centroids, the overfitting problem can be effectively alleviated.

  7. Decoys Selection in Benchmarking Datasets: Overview and Perspectives

    Science.gov (United States)

    Réau, Manon; Langenfeld, Florent; Zagury, Jean-François; Lagarde, Nathalie; Montes, Matthieu

    2018-01-01

    Virtual Screening (VS) is designed to prospectively help identifying potential hits, i.e., compounds capable of interacting with a given target and potentially modulate its activity, out of large compound collections. Among the variety of methodologies, it is crucial to select the protocol that is the most adapted to the query/target system under study and that yields the most reliable output. To this aim, the performance of VS methods is commonly evaluated and compared by computing their ability to retrieve active compounds in benchmarking datasets. The benchmarking datasets contain a subset of known active compounds together with a subset of decoys, i.e., assumed non-active molecules. The composition of both the active and the decoy compounds subsets is critical to limit the biases in the evaluation of the VS methods. In this review, we focus on the selection of decoy compounds that has considerably changed over the years, from randomly selected compounds to highly customized or experimentally validated negative compounds. We first outline the evolution of decoys selection in benchmarking databases as well as current benchmarking databases that tend to minimize the introduction of biases, and secondly, we propose recommendations for the selection and the design of benchmarking datasets. PMID:29416509

  8. ENHANCED DATA DISCOVERABILITY FOR IN SITU HYPERSPECTRAL DATASETS

    Directory of Open Access Journals (Sweden)

    B. Rasaiah

    2016-06-01

    Full Text Available Field spectroscopic metadata is a central component in the quality assurance, reliability, and discoverability of hyperspectral data and the products derived from it. Cataloguing, mining, and interoperability of these datasets rely upon the robustness of metadata protocols for field spectroscopy, and on the software architecture to support the exchange of these datasets. Currently no standard for in situ spectroscopy data or metadata protocols exist. This inhibits the effective sharing of growing volumes of in situ spectroscopy datasets, to exploit the benefits of integrating with the evolving range of data sharing platforms. A core metadataset for field spectroscopy was introduced by Rasaiah et al., (2011-2015 with extended support for specific applications. This paper presents a prototype model for an OGC and ISO compliant platform-independent metadata discovery service aligned to the specific requirements of field spectroscopy. In this study, a proof-of-concept metadata catalogue has been described and deployed in a cloud-based architecture as a demonstration of an operationalized field spectroscopy metadata standard and web-based discovery service.

  9. Multiresolution persistent homology for excessively large biomolecular datasets

    Energy Technology Data Exchange (ETDEWEB)

    Xia, Kelin; Zhao, Zhixiong [Department of Mathematics, Michigan State University, East Lansing, Michigan 48824 (United States); Wei, Guo-Wei, E-mail: wei@math.msu.edu [Department of Mathematics, Michigan State University, East Lansing, Michigan 48824 (United States); Department of Electrical and Computer Engineering, Michigan State University, East Lansing, Michigan 48824 (United States); Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824 (United States)

    2015-10-07

    Although persistent homology has emerged as a promising tool for the topological simplification of complex data, it is computationally intractable for large datasets. We introduce multiresolution persistent homology to handle excessively large datasets. We match the resolution with the scale of interest so as to represent large scale datasets with appropriate resolution. We utilize flexibility-rigidity index to access the topological connectivity of the data set and define a rigidity density for the filtration analysis. By appropriately tuning the resolution of the rigidity density, we are able to focus the topological lens on the scale of interest. The proposed multiresolution topological analysis is validated by a hexagonal fractal image which has three distinct scales. We further demonstrate the proposed method for extracting topological fingerprints from DNA molecules. In particular, the topological persistence of a virus capsid with 273 780 atoms is successfully analyzed which would otherwise be inaccessible to the normal point cloud method and unreliable by using coarse-grained multiscale persistent homology. The proposed method has also been successfully applied to the protein domain classification, which is the first time that persistent homology is used for practical protein domain analysis, to our knowledge. The proposed multiresolution topological method has potential applications in arbitrary data sets, such as social networks, biological networks, and graphs.

  10. Tissue-Based MRI Intensity Standardization: Application to Multicentric Datasets

    Directory of Open Access Journals (Sweden)

    Nicolas Robitaille

    2012-01-01

    Full Text Available Intensity standardization in MRI aims at correcting scanner-dependent intensity variations. Existing simple and robust techniques aim at matching the input image histogram onto a standard, while we think that standardization should aim at matching spatially corresponding tissue intensities. In this study, we present a novel automatic technique, called STI for STandardization of Intensities, which not only shares the simplicity and robustness of histogram-matching techniques, but also incorporates tissue spatial intensity information. STI uses joint intensity histograms to determine intensity correspondence in each tissue between the input and standard images. We compared STI to an existing histogram-matching technique on two multicentric datasets, Pilot E-ADNI and ADNI, by measuring the intensity error with respect to the standard image after performing nonlinear registration. The Pilot E-ADNI dataset consisted in 3 subjects each scanned in 7 different sites. The ADNI dataset consisted in 795 subjects scanned in more than 50 different sites. STI was superior to the histogram-matching technique, showing significantly better intensity matching for the brain white matter with respect to the standard image.

  11. Exploring massive, genome scale datasets with the genometricorr package

    KAUST Repository

    Favorov, Alexander; Mularoni, Loris; Cope, Leslie M.; Medvedeva, Yulia; Mironov, Andrey A.; Makeev, Vsevolod J.; Wheelan, Sarah J.

    2012-01-01

    We have created a statistically grounded tool for determining the correlation of genomewide data with other datasets or known biological features, intended to guide biological exploration of high-dimensional datasets, rather than providing immediate answers. The software enables several biologically motivated approaches to these data and here we describe the rationale and implementation for each approach. Our models and statistics are implemented in an R package that efficiently calculates the spatial correlation between two sets of genomic intervals (data and/or annotated features), for use as a metric of functional interaction. The software handles any type of pointwise or interval data and instead of running analyses with predefined metrics, it computes the significance and direction of several types of spatial association; this is intended to suggest potentially relevant relationships between the datasets. Availability and implementation: The package, GenometriCorr, can be freely downloaded at http://genometricorr.sourceforge.net/. Installation guidelines and examples are available from the sourceforge repository. The package is pending submission to Bioconductor. © 2012 Favorov et al.

  12. Image segmentation evaluation for very-large datasets

    Science.gov (United States)

    Reeves, Anthony P.; Liu, Shuang; Xie, Yiting

    2016-03-01

    With the advent of modern machine learning methods and fully automated image analysis there is a need for very large image datasets having documented segmentations for both computer algorithm training and evaluation. Current approaches of visual inspection and manual markings do not scale well to big data. We present a new approach that depends on fully automated algorithm outcomes for segmentation documentation, requires no manual marking, and provides quantitative evaluation for computer algorithms. The documentation of new image segmentations and new algorithm outcomes are achieved by visual inspection. The burden of visual inspection on large datasets is minimized by (a) customized visualizations for rapid review and (b) reducing the number of cases to be reviewed through analysis of quantitative segmentation evaluation. This method has been applied to a dataset of 7,440 whole-lung CT images for 6 different segmentation algorithms designed to fully automatically facilitate the measurement of a number of very important quantitative image biomarkers. The results indicate that we could achieve 93% to 99% successful segmentation for these algorithms on this relatively large image database. The presented evaluation method may be scaled to much larger image databases.

  13. Exploring massive, genome scale datasets with the genometricorr package

    KAUST Repository

    Favorov, Alexander

    2012-05-31

    We have created a statistically grounded tool for determining the correlation of genomewide data with other datasets or known biological features, intended to guide biological exploration of high-dimensional datasets, rather than providing immediate answers. The software enables several biologically motivated approaches to these data and here we describe the rationale and implementation for each approach. Our models and statistics are implemented in an R package that efficiently calculates the spatial correlation between two sets of genomic intervals (data and/or annotated features), for use as a metric of functional interaction. The software handles any type of pointwise or interval data and instead of running analyses with predefined metrics, it computes the significance and direction of several types of spatial association; this is intended to suggest potentially relevant relationships between the datasets. Availability and implementation: The package, GenometriCorr, can be freely downloaded at http://genometricorr.sourceforge.net/. Installation guidelines and examples are available from the sourceforge repository. The package is pending submission to Bioconductor. © 2012 Favorov et al.

  14. Principal Component Analysis of Process Datasets with Missing Values

    Directory of Open Access Journals (Sweden)

    Kristen A. Severson

    2017-07-01

    Full Text Available Datasets with missing values arising from causes such as sensor failure, inconsistent sampling rates, and merging data from different systems are common in the process industry. Methods for handling missing data typically operate during data pre-processing, but can also occur during model building. This article considers missing data within the context of principal component analysis (PCA, which is a method originally developed for complete data that has widespread industrial application in multivariate statistical process control. Due to the prevalence of missing data and the success of PCA for handling complete data, several PCA algorithms that can act on incomplete data have been proposed. Here, algorithms for applying PCA to datasets with missing values are reviewed. A case study is presented to demonstrate the performance of the algorithms and suggestions are made with respect to choosing which algorithm is most appropriate for particular settings. An alternating algorithm based on the singular value decomposition achieved the best results in the majority of test cases involving process datasets.

  15. A cross-country Exchange Market Pressure (EMP dataset

    Directory of Open Access Journals (Sweden)

    Mohit Desai

    2017-06-01

    Full Text Available The data presented in this article are related to the research article titled - “An exchange market pressure measure for cross country analysis” (Patnaik et al. [1]. In this article, we present the dataset for Exchange Market Pressure values (EMP for 139 countries along with their conversion factors, ρ (rho. Exchange Market Pressure, expressed in percentage change in exchange rate, measures the change in exchange rate that would have taken place had the central bank not intervened. The conversion factor ρ can interpreted as the change in exchange rate associated with $1 billion of intervention. Estimates of conversion factor ρ allow us to calculate a monthly time series of EMP for 139 countries. Additionally, the dataset contains the 68% confidence interval (high and low values for the point estimates of ρ’s. Using the standard errors of estimates of ρ’s, we obtain one sigma intervals around mean estimates of EMP values. These values are also reported in the dataset.

  16. Una escuela de todas (las personas) para todas (las personas)

    OpenAIRE

    Montolio Pastor, Rosa; Cervellera Martínez, Luiso

    2008-01-01

    Escuela 2 representa una particular manera de entender la educación. Un proyecto educativo nacido de la cooperación entre las personas, concebido para albergar todo un universo de realidades y gestionado de manera colaborativa bajo los principios de la economía social y el movimiento cooperativo. Una escuela de todas, las personas, para todas, las personas. Una escuela decidida a hacer realidad los principios de la inclusión educativa desde el día a día, esforzada por resituar a cada paso su ...

  17. The Role of Datasets on Scientific Influence within Conflict Research

    Science.gov (United States)

    Van Holt, Tracy; Johnson, Jeffery C.; Moates, Shiloh; Carley, Kathleen M.

    2016-01-01

    We inductively tested if a coherent field of inquiry in human conflict research emerged in an analysis of published research involving “conflict” in the Web of Science (WoS) over a 66-year period (1945–2011). We created a citation network that linked the 62,504 WoS records and their cited literature. We performed a critical path analysis (CPA), a specialized social network analysis on this citation network (~1.5 million works), to highlight the main contributions in conflict research and to test if research on conflict has in fact evolved to represent a coherent field of inquiry. Out of this vast dataset, 49 academic works were highlighted by the CPA suggesting a coherent field of inquiry; which means that researchers in the field acknowledge seminal contributions and share a common knowledge base. Other conflict concepts that were also analyzed—such as interpersonal conflict or conflict among pharmaceuticals, for example, did not form their own CP. A single path formed, meaning that there was a cohesive set of ideas that built upon previous research. This is in contrast to a main path analysis of conflict from 1957–1971 where ideas didn’t persist in that multiple paths existed and died or emerged reflecting lack of scientific coherence (Carley, Hummon, and Harty, 1993). The critical path consisted of a number of key features: 1) Concepts that built throughout include the notion that resource availability drives conflict, which emerged in the 1960s-1990s and continued on until 2011. More recent intrastate studies that focused on inequalities emerged from interstate studies on the democracy of peace earlier on the path. 2) Recent research on the path focused on forecasting conflict, which depends on well-developed metrics and theories to model. 3) We used keyword analysis to independently show how the CP was topically linked (i.e., through democracy, modeling, resources, and geography). Publically available conflict datasets developed early on helped

  18. The Role of Datasets on Scientific Influence within Conflict Research.

    Directory of Open Access Journals (Sweden)

    Tracy Van Holt

    Full Text Available We inductively tested if a coherent field of inquiry in human conflict research emerged in an analysis of published research involving "conflict" in the Web of Science (WoS over a 66-year period (1945-2011. We created a citation network that linked the 62,504 WoS records and their cited literature. We performed a critical path analysis (CPA, a specialized social network analysis on this citation network (~1.5 million works, to highlight the main contributions in conflict research and to test if research on conflict has in fact evolved to represent a coherent field of inquiry. Out of this vast dataset, 49 academic works were highlighted by the CPA suggesting a coherent field of inquiry; which means that researchers in the field acknowledge seminal contributions and share a common knowledge base. Other conflict concepts that were also analyzed-such as interpersonal conflict or conflict among pharmaceuticals, for example, did not form their own CP. A single path formed, meaning that there was a cohesive set of ideas that built upon previous research. This is in contrast to a main path analysis of conflict from 1957-1971 where ideas didn't persist in that multiple paths existed and died or emerged reflecting lack of scientific coherence (Carley, Hummon, and Harty, 1993. The critical path consisted of a number of key features: 1 Concepts that built throughout include the notion that resource availability drives conflict, which emerged in the 1960s-1990s and continued on until 2011. More recent intrastate studies that focused on inequalities emerged from interstate studies on the democracy of peace earlier on the path. 2 Recent research on the path focused on forecasting conflict, which depends on well-developed metrics and theories to model. 3 We used keyword analysis to independently show how the CP was topically linked (i.e., through democracy, modeling, resources, and geography. Publically available conflict datasets developed early on helped

  19. The Role of Datasets on Scientific Influence within Conflict Research.

    Science.gov (United States)

    Van Holt, Tracy; Johnson, Jeffery C; Moates, Shiloh; Carley, Kathleen M

    2016-01-01

    We inductively tested if a coherent field of inquiry in human conflict research emerged in an analysis of published research involving "conflict" in the Web of Science (WoS) over a 66-year period (1945-2011). We created a citation network that linked the 62,504 WoS records and their cited literature. We performed a critical path analysis (CPA), a specialized social network analysis on this citation network (~1.5 million works), to highlight the main contributions in conflict research and to test if research on conflict has in fact evolved to represent a coherent field of inquiry. Out of this vast dataset, 49 academic works were highlighted by the CPA suggesting a coherent field of inquiry; which means that researchers in the field acknowledge seminal contributions and share a common knowledge base. Other conflict concepts that were also analyzed-such as interpersonal conflict or conflict among pharmaceuticals, for example, did not form their own CP. A single path formed, meaning that there was a cohesive set of ideas that built upon previous research. This is in contrast to a main path analysis of conflict from 1957-1971 where ideas didn't persist in that multiple paths existed and died or emerged reflecting lack of scientific coherence (Carley, Hummon, and Harty, 1993). The critical path consisted of a number of key features: 1) Concepts that built throughout include the notion that resource availability drives conflict, which emerged in the 1960s-1990s and continued on until 2011. More recent intrastate studies that focused on inequalities emerged from interstate studies on the democracy of peace earlier on the path. 2) Recent research on the path focused on forecasting conflict, which depends on well-developed metrics and theories to model. 3) We used keyword analysis to independently show how the CP was topically linked (i.e., through democracy, modeling, resources, and geography). Publically available conflict datasets developed early on helped shape the

  20. Las aristas del racismo

    Directory of Open Access Journals (Sweden)

    Fredy Rivera Vélez

    2000-01-01

    Full Text Available En el presente trabajo se estudia un ámbito que normalmente se esquiva y se enmascara: Las prácticas racistas. Esta evasión del tema se da en diferentes planos discursivos, y es una práctica, cada vez más frecuente en América Latina, pues aquí se hace de la integración étnica uno de los ejes centrales de la construcción de sus relaciones identitarias y políticas. Se propone, entonces, este trabajo develar la naturaleza y el movimiento de prácticas racistas presentes no sólo discursivamente, en la práctica política, en la práctica laboral, sino que también están presente en nuestra vida cotidiana, bajo nuevas formas y contenidos que se distancian del racismo tradicional e incorpora una nueva gramática racista, una suerte de racismo moderno, no basado en el fenotipo o características físicas como elemento central, sino en un racismo basado en la diferencia cultural, lo cual plantea un plano más excluyente, pues en este planteamiento el “objeto de racismo” no es un objeto de racismo limitado, como en el racismo tradicional, en el cual siempre es utilizado por quien practica el racismo; en el racismo contemporáneo se pretende una ruptura completa y una exclusión total del otro.

  1. Las momias de las pirámides

    Directory of Open Access Journals (Sweden)

    José Miguel Parra

    2011-01-01

    Full Text Available El público general cree, como si fuera un dogma religioso, que las pirámides egipcias no fueron las tumbas de los faraones de los Reino Antiguo y Medio porque en ninguna de las cámaras funerarias se ha encontrado nunca una momia. Este estudio compila todos los datos relevantes al respecto y describe brevemente todos los restos humanos encontrados dentro de las pirámides, demostrando la falta de base de esa extendida creencia.As a religious dogma the general public believe that the Egyptian pyramids were not the tombs for the Old and Middle Kingdom pharaohs, because never was a mummy found inside the burial chamber of any of them. This study just compile all the relevant data on the subject and describe briefly all the human remains found in the pyramids, showing the nonsense of the general belief.

  2. Las multitudes y las revoluciones de nuestro tiempo

    Directory of Open Access Journals (Sweden)

    John Harold Biervliet

    2015-08-01

    Full Text Available Este artículo examina el motivo de las multitudes y las revoluciones de nuestro tiempo. Primeramente, se discuten las diferencias entre los conceptos de multitud y masa. Así, podemos considerar las revoluciones como acontecimientos de cambios sociales, económicos y políticos, provenientes de las clases baja y media de la sociedad. El meollo de este artículo refiere a la dignidad humana articulada a la contracción de los Estados de bienestar. Los ciudadanos están reflejando un desencanto hacia la clase política y una frustración con respecto al empeoramiento de las condiciones económicas y sociales. Podemos observar los casos de indignados en Grecia, España y Portugal pero también en los países árabes. De esta misma manera, las revoluciones de las multitudes siguen avanzando por medio de las demostraciones públicas y protestas sobre los espacios geográficos. Finalmente, las dinámicas de cambio a través de las revoluciones árabes son una cuestión compleja debido al círculo vicioso entre la tendencia autoritaria y la islámica. This article examines the reason of multitudes and revolutions in our time. First of all, it discusses the difference between the concept of multitude and mass. Consequently, revolutions can be considered as social, economic, and political events of changes, which come from the low and middle classes of the society. This article refers to the human dignity articulated with the contraction of Welfare states. Citizens are reflecting disenchantment towards the political class and frustration regarding the deterioration of social and economic conditions. We can observe angry people in Greece, Spain, and Portugal but also in Arabian countries. In this same way, revolutions of multitudes continue by means of public demonstrations and protests on geographical spaces. Finally, the dynamics of change through Arabian Revolutions are a complex matter due to the vicious circle between the authoritarian or Islamic

  3. Las "auctoritates" escolásticas en las Siete Partidas

    OpenAIRE

    Vázquez Janeiro, Isaac

    1992-01-01

    Examen de las fuentes utilizadas en los pasajes de las Siete Partidas relativos a la doctrina general de los sacramentos y en particular de los sacramentos del bautismo y del matrimonio. A base de los resultados obtenidos se sacan conclusiones sobre el carácter de la obra alfonsina. Examination of the sources used in the passages of the Siete Partidas relating to the general doctrine of the Sacraments and specially of the sacraments of Baptism and Marriage.

  4. Las desterradas hijas de Eva

    OpenAIRE

    Quiñonero Hernández, Llum

    2015-01-01

    Ocurrió en España, durante década, en tiempos de Franco y después de Franco. El estado franquista creo una tupida red de instituciones para el control social de las mujeres: para las rebeldes y descarriadas, el encierro; para las madres solteras también el estigma. Carentes de derechos, de garantías, miles, decenas de miles de mujeres, aún sin cuantificar, vieron desaparecer a sus recién nacidos en instituciones privadas y públicas de muy diversa índole: cárceles, reformatorios, maternidades ...

  5. Patagonia: tocando las fibras internas

    OpenAIRE

    Velasco Tafur, Ximena; Borrero Caldas, Silvio

    2013-01-01

    El presente caso, está enfocado en Patagonia, una compañía que desde sus inicios ha desarrollado una forma poco convencional en su estilo de hacer negocios. La gerencia de Patagonia considera que muchos errores administrativos obedecen a que las empresas formulan las mismas soluciones para todos los problemas que enfrentan. Además, desde la revolución industrial, tales soluciones han privilegiado las ganancias económicas sobre la sostenibilidad ambiental, y el planeta está pasando factura por...

  6. Unificar las ciencias del deporte

    Directory of Open Access Journals (Sweden)

    Natàlia Balagué Serre

    2013-12-01

    Full Text Available El deporte no es solo un fenómeno social de nuestro mundo, sino que también es un campo privilegiado para el estudio del comportamiento social y humano. Durante las últimas décadas, se ha producido un enorme crecimiento y especialización de las ciencias del deporte y el lema del 18º Congreso del European College of Sport Sciences (ECSS “Unificar las ciencias del deporte” representa desafiar este proceso de fragmentación. El lema conlleva un cambio de la especialización a la integración, de una concepción de los sistemas vivos basada en la teoría de la información y la ingeniería a una de base biológica, de la investigación multidisciplinaria a la transdisciplinaria. Pero aparece una pregunta: ¿es posible integrar las áreas fragmentadas y facilitar la transferencia de los principios explicativos teóricos, técnicas y perspectivas metodológicas entre disciplinas? En el marco de las contribuciones hechas en el congreso, este artículo tiene como objetivo introducir enfoques científicos ya extendidos en los ámbitos de la física, la química, la biología (incluyendo las ómicas y las ciencias sociales, y que centradas en las interacciones dinámicas complejas de los componentes sistémicos (proteínas, células, organismos, grupos, sociedades, revelan principios explicativos generales que contribuyen a la unificación del conocimiento. Intentamos animar a las personas interesadas en las ciencias del deporte a percibir nuevas formas de investigación y a complementar, sin sustituirlos, los enfoques dominantes, con la esperanza de que ir de las partes al todo y del todo a las partes ayudará a los científicos a reconocer los caminos más adecuados.

  7. Las nuevas formas de racismo

    OpenAIRE

    Gutiérrez López-Dóriga, Cristina

    2012-01-01

    El racismo tal y como lo conocíamos ha disminuido considerablemente ya que ha debido adaptarse a las nuevas exigencias sociales. Esta adaptación ha creado una nueva forma de expresión de este fenómeno, más larvada pero igual de dañina que las anteriores formas manifiestas. Y aunque las bases que sustentan esta nueva forma de racismo dependen de la historia cultural del lugar, siendo así diferentes en Europa y Estados Unidos, ambas aluden a los mismos procesos que crean y mantienen la desigual...

  8. Las escalas de la luz

    OpenAIRE

    Beckers, Benoit

    2009-01-01

    Variando con el paso del día y del año, la luz del Sol y del cielo modula, visual y energéticamente, los territorios, las ciudades y los edificios. ¿Cómo sintetizar y manejar en el proyecto estas informaciones donde se mezcla el azar de las nubes pasajeras con la regularidad astronómica de los trayectos solares? En cuanto a las herramientas de simulación, el mayor avance de estos últimos años se ha producido en los programas de renderización, con los cuales nos vemos forzados a construi...

  9. Efectividad en las redes empresariales

    OpenAIRE

    Álvarez Rey, Natalia; Correal López, Ana María; García Algarra, Laura Ximena

    2014-01-01

    Este estudio tiene como objetivo identificar cuáles son las variables que repercuten en la efectividad de las redes empresariales. Esto, con base en la búsqueda de literatura existente de la efectividad en equipos, en organizaciones y en las redes interorganizacionales, así como el análisis de modelos y estudios empíricos que permitieron el análisis. De acuerdo con la búsqueda, se encontró que variables como la estructura de la red, la estabilidad del sistema, el compromiso de los empleados e...

  10. Characterization of pottery from Cerro de Las Ventanas, Zacatecas, Mexico

    Energy Technology Data Exchange (ETDEWEB)

    Lopez-del-Rio, H.; Mireles-Garcia, F. [Unidad Academica de Estudios Nucleares, UAZ, Zacatecas (Mexico); Mendez-Cardona, R.Y. [Unidad Academica de Antropologia, UAZ, Zacatecas (Mexico); Nicolas-Caretta, M. [INAH Delegacion Zacatecas (Mexico); Coordinacion de Ciencias Sociales y Humanidades, UASLP, Fracc. Talleres, SLP (Mexico); Speakman, R.J. [Museum Conservation Inst., Smithsonian Institution, Suitland, MD (United States); Glascock, M.D. [Research Reactor Center, Univ. of Missouri, Columbia, MO (United States)

    2009-09-15

    With the aim of classifying prehispanic pottery from Cerro de Las Ventanas site, Juchipila, Zacatecas, Mexico, instrumental neutron activation analysis (INAA) was used to analyze ceramic samples at the University of Missouri Research Reactor Center. Thirty-two chemical elements were measured: Al, As, Ba, Ca, Ce, Co, Cr, Cs, Dy, Eu, Fe, Hf, K, La, Lu, Mn, Na, Nd, Rb, Sb, Sc, Sm, Sr, Ta, Tb, Ti, Th, U, V, Yb, Zn, and Zr. Two multivariate statistical methods, cluster analysis and principal component analysis, were performed on the dataset to examine similarities between samples and to establish compositional groups. The statistical analyses of the dataset suggest that the pottery samples form a unique chemically homogeneous group, with the exception of one pottery sample. The compositional data were compared to an existing Mesoamerican ceramic database. It was found that the newly generated data fit best with data from a previous chemical analysis of pottery from the Malpaso Valley. However, despite the apparent similarity, pottery samples from the site of Cerro de Las Ventanas represent a new and unique chemical fingerprint in the region. (orig.)

  11. Characterization of pottery from Cerro de Las Ventanas, Zacatecas, Mexico

    International Nuclear Information System (INIS)

    Lopez-del-Rio, H.; Mireles-Garcia, F.; Mendez-Cardona, R.Y.; Nicolas-Caretta, M.; Speakman, R.J.; Glascock, M.D.

    2009-01-01

    With the aim of classifying prehispanic pottery from Cerro de Las Ventanas site, Juchipila, Zacatecas, Mexico, instrumental neutron activation analysis (INAA) was used to analyze ceramic samples at the University of Missouri Research Reactor Center. Thirty-two chemical elements were measured: Al, As, Ba, Ca, Ce, Co, Cr, Cs, Dy, Eu, Fe, Hf, K, La, Lu, Mn, Na, Nd, Rb, Sb, Sc, Sm, Sr, Ta, Tb, Ti, Th, U, V, Yb, Zn, and Zr. Two multivariate statistical methods, cluster analysis and principal component analysis, were performed on the dataset to examine similarities between samples and to establish compositional groups. The statistical analyses of the dataset suggest that the pottery samples form a unique chemically homogeneous group, with the exception of one pottery sample. The compositional data were compared to an existing Mesoamerican ceramic database. It was found that the newly generated data fit best with data from a previous chemical analysis of pottery from the Malpaso Valley. However, despite the apparent similarity, pottery samples from the site of Cerro de Las Ventanas represent a new and unique chemical fingerprint in the region. (orig.)

  12. La intangibilidad de las acciones privadas de las personas

    Directory of Open Access Journals (Sweden)

    Mauricio Maldonado Muñoz

    2014-03-01

    Full Text Available En este artículo pretendemos acercarnos a una garantía que —siguiendo cierta doctrina— hemos llamado: intangibilidad de las acciones privadas de las personas. Desde una visión que busca ser omnicomprensiva, se analizan las fuentes de las que mana la privacidad y, posteriormente, su contenido y alcances. Sobre todo, se analiza el problema de los límites de la injerencia y regulación estatal, partiendo de una posición que niega las visiones comunitaristas. En general, se trata de conceptualizar a la garantía planteada desde el punto de vista de la libertad, el derecho y las virtudes humanas, los derechos de terceros y la moral pública; concretando su vinculación con otros derechos relacionados con la privacidad. La idea central del presente trabajo consiste en demostrar la transversalidad de la garantía señalada, implicando —en ese proceso— cuestiones trascendentes para la filosofía del derecho, la teoría del derecho y, por supuesto, para el estudio del derecho de los derechos humanos.  

  13. Las cinco grandes dimensiones de la personalidad

    Directory of Open Access Journals (Sweden)

    Jan ter Laak

    1996-12-01

    Full Text Available Este artículo revisa las distintas posiciones teóricas sobre las cinco grandes dimensiones de la personalidad, mostrando las semejanzas y diferencias entre las posturas teóricas. Esta contribución presenta lo siguiente: (a la génesis del contenido y la estructura de las cinco dimensiones; (b la fortaleza de las cinco dimensiones; (e la relación de las cinco grandes dimensiones con otros constructos de personalidad; (d discute el valor predictivo de las puntuaciones del perfil de las cinco dimensiones para criterios pertinentes; (e analiza el estatus teórico de las cinco dimensiones; (f discute críticas históricas sobre las cinco grandes dimensiones y se formulan respuestas a estas críticas; (g hace conjeturas para el futuro de las cinco grandes dimensiones; y (h concluye con algunas conclusiones y comentarios.

  14. El estado actual de las vacunas contra las drogas

    Directory of Open Access Journals (Sweden)

    Maura Epifanía Matus Ortega

    2017-12-01

    Full Text Available Introducción: por lo común, la adicción a las drogas se trata con psicoterapia y farmacología que evita la unión de las sustancias psicoactivas a receptores específicos en el cerebro. El resultado de estos tratamientos no ha sido del todo satisfactorio, por lo que el desarrollo de terapias más eficaces representa un reto constante para tratar las adicciones. Una alternativa a la farmacología antiadictiva es la vacunación activa dirigida contra las sustancias de abuso. Objetivo: esta revisión reúne la información disponible sobre los fundamentos y avances científicos en la generación de una terapia inmunológica, que coadyuve al tratamiento de la adicción a sustancias como la heroína-morfina, la cocaína, la nicotina y la anfetamina. Método: se consideraron los reportes científicos disponibles en PubMed –de 2005 a abril de 2017–, sobre los fundamentos, la metodología empleada, los estudios preclínicos y clínicos, y los resultados obtenidos en dichas investigaciones para generar vacunas contra las drogas. Resultados: las vacunas lograron mitigar los efectos producidos por las sustancias en los estudios preclínicos en modelos de estudio en animales; sin embargo, con pacientes humanos los resultados no han sido del todo satisfactorios. Discusión y conclusiones: a pesar de los esfuerzos realizados por diferentes grupos de investigación y compañías farmacéuticas para generar vacunas terapéuticas contra el uso de diferentes drogas, ninguna ha alcanzado la fase III de estudios clínicos. En la actualidad, se continúa con los esfuerzos para lograr que las vacunas contra las adicciones alcancen su máxima eficiencia y eficacia, y contribuyan al tratamiento de la adicción a las drogas.

  15. Animated analysis of geoscientific datasets: An interactive graphical application

    Science.gov (United States)

    Morse, Peter; Reading, Anya; Lueg, Christopher

    2017-12-01

    Geoscientists are required to analyze and draw conclusions from increasingly large volumes of data. There is a need to recognise and characterise features and changing patterns of Earth observables within such large datasets. It is also necessary to identify significant subsets of the data for more detailed analysis. We present an innovative, interactive software tool and workflow to visualise, characterise, sample and tag large geoscientific datasets from both local and cloud-based repositories. It uses an animated interface and human-computer interaction to utilise the capacity of human expert observers to identify features via enhanced visual analytics. 'Tagger' enables users to analyze datasets that are too large in volume to be drawn legibly on a reasonable number of single static plots. Users interact with the moving graphical display, tagging data ranges of interest for subsequent attention. The tool provides a rapid pre-pass process using fast GPU-based OpenGL graphics and data-handling and is coded in the Quartz Composer visual programing language (VPL) on Mac OSX. It makes use of interoperable data formats, and cloud-based (or local) data storage and compute. In a case study, Tagger was used to characterise a decade (2000-2009) of data recorded by the Cape Sorell Waverider Buoy, located approximately 10 km off the west coast of Tasmania, Australia. These data serve as a proxy for the understanding of Southern Ocean storminess, which has both local and global implications. This example shows use of the tool to identify and characterise 4 different types of storm and non-storm events during this time. Events characterised in this way are compared with conventional analysis, noting advantages and limitations of data analysis using animation and human interaction. Tagger provides a new ability to make use of humans as feature detectors in computer-based analysis of large-volume geosciences and other data.

  16. Designing the colorectal cancer core dataset in Iran

    Directory of Open Access Journals (Sweden)

    Sara Dorri

    2017-01-01

    Full Text Available Background: There is no need to explain the importance of collection, recording and analyzing the information of disease in any health organization. In this regard, systematic design of standard data sets can be helpful to record uniform and consistent information. It can create interoperability between health care systems. The main purpose of this study was design the core dataset to record colorectal cancer information in Iran. Methods: For the design of the colorectal cancer core data set, a combination of literature review and expert consensus were used. In the first phase, the draft of the data set was designed based on colorectal cancer literature review and comparative studies. Then, in the second phase, this data set was evaluated by experts from different discipline such as medical informatics, oncology and surgery. Their comments and opinion were taken. In the third phase refined data set, was evaluated again by experts and eventually data set was proposed. Results: In first phase, based on the literature review, a draft set of 85 data elements was designed. In the second phase this data set was evaluated by experts and supplementary information was offered by professionals in subgroups especially in treatment part. In this phase the number of elements totally were arrived to 93 numbers. In the third phase, evaluation was conducted by experts and finally this dataset was designed in five main parts including: demographic information, diagnostic information, treatment information, clinical status assessment information, and clinical trial information. Conclusion: In this study the comprehensive core data set of colorectal cancer was designed. This dataset in the field of collecting colorectal cancer information can be useful through facilitating exchange of health information. Designing such data set for similar disease can help providers to collect standard data from patients and can accelerate retrieval from storage systems.

  17. FTSPlot: fast time series visualization for large datasets.

    Directory of Open Access Journals (Sweden)

    Michael Riss

    Full Text Available The analysis of electrophysiological recordings often involves visual inspection of time series data to locate specific experiment epochs, mask artifacts, and verify the results of signal processing steps, such as filtering or spike detection. Long-term experiments with continuous data acquisition generate large amounts of data. Rapid browsing through these massive datasets poses a challenge to conventional data plotting software because the plotting time increases proportionately to the increase in the volume of data. This paper presents FTSPlot, which is a visualization concept for large-scale time series datasets using techniques from the field of high performance computer graphics, such as hierarchic level of detail and out-of-core data handling. In a preprocessing step, time series data, event, and interval annotations are converted into an optimized data format, which then permits fast, interactive visualization. The preprocessing step has a computational complexity of O(n x log(N; the visualization itself can be done with a complexity of O(1 and is therefore independent of the amount of data. A demonstration prototype has been implemented and benchmarks show that the technology is capable of displaying large amounts of time series data, event, and interval annotations lag-free with < 20 ms ms. The current 64-bit implementation theoretically supports datasets with up to 2(64 bytes, on the x86_64 architecture currently up to 2(48 bytes are supported, and benchmarks have been conducted with 2(40 bytes/1 TiB or 1.3 x 10(11 double precision samples. The presented software is freely available and can be included as a Qt GUI component in future software projects, providing a standard visualization method for long-term electrophysiological experiments.

  18. A synthetic dataset for evaluating soft and hard fusion algorithms

    Science.gov (United States)

    Graham, Jacob L.; Hall, David L.; Rimland, Jeffrey

    2011-06-01

    There is an emerging demand for the development of data fusion techniques and algorithms that are capable of combining conventional "hard" sensor inputs such as video, radar, and multispectral sensor data with "soft" data including textual situation reports, open-source web information, and "hard/soft" data such as image or video data that includes human-generated annotations. New techniques that assist in sense-making over a wide range of vastly heterogeneous sources are critical to improving tactical situational awareness in counterinsurgency (COIN) and other asymmetric warfare situations. A major challenge in this area is the lack of realistic datasets available for test and evaluation of such algorithms. While "soft" message sets exist, they tend to be of limited use for data fusion applications due to the lack of critical message pedigree and other metadata. They also lack corresponding hard sensor data that presents reasonable "fusion opportunities" to evaluate the ability to make connections and inferences that span the soft and hard data sets. This paper outlines the design methodologies, content, and some potential use cases of a COIN-based synthetic soft and hard dataset created under a United States Multi-disciplinary University Research Initiative (MURI) program funded by the U.S. Army Research Office (ARO). The dataset includes realistic synthetic reports from a variety of sources, corresponding synthetic hard data, and an extensive supporting database that maintains "ground truth" through logical grouping of related data into "vignettes." The supporting database also maintains the pedigree of messages and other critical metadata.

  19. Los que van a morir te saludan: las huellas, las cargas, las historias del cuerpo

    OpenAIRE

    Toro, Marisel; Serna, Yamid; Suárez, Rútber; Patiño, Ana; Moreno, Andrés

    2017-01-01

    Tesis (Maestría en Educación y Desarrollo Humano). Universidad de Manizales. Facultad de Ciencias Sociales y Humanas. CINDE, 2017 Las inquietudes desplegadas en el Programa de Investigación de la Línea en torno a las formas contemporáneas del Juvenicidio y la Necropolítica en Colombia, nos fueron arrojando al enunciado que soporta el título de esta serie: Los que van a morir te saludan. Amparados en el fondo doloroso que emana de esta antigua expresión latina, nosotros encontramos las v...

  20. Identifying frauds and anomalies in Medicare-B dataset.

    Science.gov (United States)

    Jiwon Seo; Mendelevitch, Ofer

    2017-07-01

    Healthcare industry is growing at a rapid rate to reach a market value of $7 trillion dollars world wide. At the same time, fraud in healthcare is becoming a serious problem, amounting to 5% of the total healthcare spending, or $100 billion dollars each year in US. Manually detecting healthcare fraud requires much effort. Recently, machine learning and data mining techniques are applied to automatically detect healthcare frauds. This paper proposes a novel PageRank-based algorithm to detect healthcare frauds and anomalies. We apply the algorithm to Medicare-B dataset, a real-life data with 10 million healthcare insurance claims. The algorithm successfully identifies tens of previously unreported anomalies.

  1. Power analysis dataset for QCA based multiplexer circuits

    Directory of Open Access Journals (Sweden)

    Md. Abdullah-Al-Shafi

    2017-04-01

    Full Text Available Power consumption in irreversible QCA logic circuits is a vital and a major issue; however in the practical cases, this focus is mostly omitted.The complete power depletion dataset of different QCA multiplexers have been worked out in this paper. At −271.15 °C temperature, the depletion is evaluated under three separate tunneling energy levels. All the circuits are designed with QCADesigner, a broadly used simulation engine and QCAPro tool has been applied for estimating the power dissipation.

  2. Equalizing imbalanced imprecise datasets for genetic fuzzy classifiers

    Directory of Open Access Journals (Sweden)

    AnaM. Palacios

    2012-04-01

    Full Text Available Determining whether an imprecise dataset is imbalanced is not immediate. The vagueness in the data causes that the prior probabilities of the classes are not precisely known, and therefore the degree of imbalance can also be uncertain. In this paper we propose suitable extensions of different resampling algorithms that can be applied to interval valued, multi-labelled data. By means of these extended preprocessing algorithms, certain classification systems designed for minimizing the fraction of misclassifications are able to produce knowledge bases that are also adequate under common metrics for imbalanced classification.

  3. Scientific Datasets: Discovery and Aggregation for Semantic Interpretation.

    Science.gov (United States)

    Lopez, L. A.; Scott, S.; Khalsa, S. J. S.; Duerr, R.

    2015-12-01

    One of the biggest challenges that interdisciplinary researchers face is finding suitable datasets in order to advance their science; this problem remains consistent across multiple disciplines. A surprising number of scientists, when asked what tool they use for data discovery, reply "Google", which is an acceptable solution in some cases but not even Google can find -or cares to compile- all the data that's relevant for science and particularly geo sciences. If a dataset is not discoverable through a well known search provider it will remain dark data to the scientific world.For the past year, BCube, an EarthCube Building Block project, has been developing, testing and deploying a technology stack capable of data discovery at web-scale using the ultimate dataset: The Internet. This stack has 2 principal components, a web-scale crawling infrastructure and a semantic aggregator. The web-crawler is a modified version of Apache Nutch (the originator of Hadoop and other big data technologies) that has been improved and tailored for data and data service discovery. The second component is semantic aggregation, carried out by a python-based workflow that extracts valuable metadata and stores it in the form of triples through the use semantic technologies.While implementing the BCube stack we have run into several challenges such as a) scaling the project to cover big portions of the Internet at a reasonable cost, b) making sense of very diverse and non-homogeneous data, and lastly, c) extracting facts about these datasets using semantic technologies in order to make them usable for the geosciences community. Despite all these challenges we have proven that we can discover and characterize data that otherwise would have remained in the dark corners of the Internet. Having all this data indexed and 'triplelized' will enable scientists to access a trove of information relevant to their work in a more natural way. An important characteristic of the BCube stack is that all

  4. Dataset concerning the analytical approximation of the Ae3 temperature

    Directory of Open Access Journals (Sweden)

    B.L. Ennis

    2017-02-01

    The dataset includes the terms of the function and the values for the polynomial coefficients for major alloying elements in steel. A short description of the approximation method used to derive and validate the coefficients has also been included. For discussion and application of this model, please refer to the full length article entitled “The role of aluminium in chemical and phase segregation in a TRIP-assisted dual phase steel” 10.1016/j.actamat.2016.05.046 (Ennis et al., 2016 [1].

  5. Gene set analysis of the EADGENE chicken data-set

    DEFF Research Database (Denmark)

    Skarman, Axel; Jiang, Li; Hornshøj, Henrik

    2009-01-01

     Abstract Background: Gene set analysis is considered to be a way of improving our biological interpretation of the observed expression patterns. This paper describes different methods applied to analyse expression data from a chicken DNA microarray dataset. Results: Applying different gene set...... analyses to the chicken expression data led to different ranking of the Gene Ontology terms tested. A method for prediction of possible annotations was applied. Conclusion: Biological interpretation based on gene set analyses dependent on the statistical method used. Methods for predicting the possible...

  6. A Validation Dataset for CryoSat Sea Ice Investigators

    DEFF Research Database (Denmark)

    Julia, Gaudelli,; Baker, Steve; Haas, Christian

    Since its launch in April 2010 Cryosat has been collecting valuable sea ice data over the Arctic region. Over the same period ESA’s CryoVEx and NASA IceBridge validation campaigns have been collecting a unique set of coincident airborne measurements in the Arctic. The CryoVal-SI project has...... community. In this talk we will describe the composition of the validation dataset, summarising how it was processed and how to understand the content and format of the data. We will also explain how to access the data and the supporting documentation....

  7. Dataset of statements on policy integration of selected intergovernmental organizations

    Directory of Open Access Journals (Sweden)

    Jale Tosun

    2018-04-01

    Full Text Available This article describes data for 78 intergovernmental organizations (IGOs working on topics related to energy governance, environmental protection, and the economy. The number of IGOs covered also includes organizations active in other sectors. The point of departure for data construction was the Correlates of War dataset, from which we selected this sample of IGOs. We updated and expanded the empirical information on the IGOs selected by manual coding. Most importantly, we collected the primary law texts of the individual IGOs in order to code whether they commit themselves to environmental policy integration (EPI, climate policy integration (CPI and/or energy policy integration (EnPI.

  8. Dataset on the energy performance of atrium type hotel buildings.

    Science.gov (United States)

    Vujosevic, Milica; Krstic-Furundzic, Aleksandra

    2018-04-01

    The data presented in this article are related to the research article entitled "The Influence of Atrium on Energy Performance of Hotel Building" (Vujosevic and Krstic-Furundzic, 2017) [1], which describes the annual energy performance of atrium type hotel building in Belgrade climate conditions, with the objective to present the impact of the atrium on the hotel building's energy demands for space heating and cooling. This dataset is made publicly available to show energy performance of selected hotel design alternatives, in order to enable extended analyzes of these data for other researchers.

  9. Dataset on records of Hericium erinaceus in Slovakia.

    Science.gov (United States)

    Kunca, Vladimír; Čiliak, Marek

    2017-06-01

    The data presented in this article are related to the research article entitled "Habitat preferences of Hericium erinaceus in Slovakia" (Kunca and Čiliak, 2016) [FUNECO607] [2]. The dataset include all available and unpublished data from Slovakia, besides the records from the same tree or stem. We compiled a database of records of collections by processing data from herbaria, personal records and communication with mycological activists. Data on altitude, tree species, host tree vital status, host tree position and intensity of management of forest stands were evaluated in this study. All surveys were based on basidioma occurrence and some result from targeted searches.

  10. Dataset on records of Hericium erinaceus in Slovakia

    Directory of Open Access Journals (Sweden)

    Vladimír Kunca

    2017-06-01

    Full Text Available The data presented in this article are related to the research article entitled “Habitat preferences of Hericium erinaceus in Slovakia” (Kunca and Čiliak, 2016 [FUNECO607] [2]. The dataset include all available and unpublished data from Slovakia, besides the records from the same tree or stem. We compiled a database of records of collections by processing data from herbaria, personal records and communication with mycological activists. Data on altitude, tree species, host tree vital status, host tree position and intensity of management of forest stands were evaluated in this study. All surveys were based on basidioma occurrence and some result from targeted searches.

  11. Nuevos discursos sobre las ciudades, los municipios y las mujeres

    Directory of Open Access Journals (Sweden)

    Ana María Goetschel

    2009-01-01

    Full Text Available Escritora y activista feminista. Estudió Literatura y Periodismo en la Pontificia Universidad Católica del Perú y se gradúo de Magíster en Política Social por la Universidad Nacional Mayor de San Marcos. Por espacio de más de dos décadas se desempeña como profesora en el Diploma de Estudios de Género de la Pontificia Universidad Católica del Perú y como consultora de proyectos de desarrollo en las áreas de planificación, evaluación institucional y enfoque de género para diversas agencias internacionales de cooperación en Perú y otros países de América Latina. Es Investigadora asociada y miembro del Consejo Directivo del Centro Peruano de Estudios Sociales (CEPES. Maruja Barrig acredita un vasto trabajo de investigación sobre empleo femenino, desarrollo local y organizaciones de mujeres. En una reciente conferencia “Nuevos discursos sobre las ciudades, los municipios y las mujeres” , esta investigadora peruana hizo un balance sobre las formas en que el género ha sido incorporado en las prácticas de desarrollo y las consecuencias que conlleva este tipo de intervención para el avance de la agenda del feminismo crítico. Sobre este y otros temas relacionados con su trayectoria intelectual y la acción del movimiento de mujeres en el contexto político actual de las sociedades latinoamericanas, dialogamos con Maruja Barrig.

  12. Parallel Framework for Dimensionality Reduction of Large-Scale Datasets

    Directory of Open Access Journals (Sweden)

    Sai Kiranmayee Samudrala

    2015-01-01

    Full Text Available Dimensionality reduction refers to a set of mathematical techniques used to reduce complexity of the original high-dimensional data, while preserving its selected properties. Improvements in simulation strategies and experimental data collection methods are resulting in a deluge of heterogeneous and high-dimensional data, which often makes dimensionality reduction the only viable way to gain qualitative and quantitative understanding of the data. However, existing dimensionality reduction software often does not scale to datasets arising in real-life applications, which may consist of thousands of points with millions of dimensions. In this paper, we propose a parallel framework for dimensionality reduction of large-scale data. We identify key components underlying the spectral dimensionality reduction techniques, and propose their efficient parallel implementation. We show that the resulting framework can be used to process datasets consisting of millions of points when executed on a 16,000-core cluster, which is beyond the reach of currently available methods. To further demonstrate applicability of our framework we perform dimensionality reduction of 75,000 images representing morphology evolution during manufacturing of organic solar cells in order to identify how processing parameters affect morphology evolution.

  13. The Path from Large Earth Science Datasets to Information

    Science.gov (United States)

    Vicente, G. A.

    2013-12-01

    The NASA Goddard Earth Sciences Data (GES) and Information Services Center (DISC) is one of the major Science Mission Directorate (SMD) for archiving and distribution of Earth Science remote sensing data, products and services. This virtual portal provides convenient access to Atmospheric Composition and Dynamics, Hydrology, Precipitation, Ozone, and model derived datasets (generated by GSFC's Global Modeling and Assimilation Office), the North American Land Data Assimilation System (NLDAS) and the Global Land Data Assimilation System (GLDAS) data products (both generated by GSFC's Hydrological Sciences Branch). This presentation demonstrates various tools and computational technologies developed in the GES DISC to manage the huge volume of data and products acquired from various missions and programs over the years. It explores approaches to archive, document, distribute, access and analyze Earth Science data and information as well as addresses the technical and scientific issues, governance and user support problem faced by scientists in need of multi-disciplinary datasets. It also discusses data and product metrics, user distribution profiles and lessons learned through interactions with the science communities around the world. Finally it demonstrates some of the most used data and product visualization and analyses tools developed and maintained by the GES DISC.

  14. Benchmarking undedicated cloud computing providers for analysis of genomic datasets.

    Science.gov (United States)

    Yazar, Seyhan; Gooden, George E C; Mackey, David A; Hewitt, Alex W

    2014-01-01

    A major bottleneck in biological discovery is now emerging at the computational level. Cloud computing offers a dynamic means whereby small and medium-sized laboratories can rapidly adjust their computational capacity. We benchmarked two established cloud computing services, Amazon Web Services Elastic MapReduce (EMR) on Amazon EC2 instances and Google Compute Engine (GCE), using publicly available genomic datasets (E.coli CC102 strain and a Han Chinese male genome) and a standard bioinformatic pipeline on a Hadoop-based platform. Wall-clock time for complete assembly differed by 52.9% (95% CI: 27.5-78.2) for E.coli and 53.5% (95% CI: 34.4-72.6) for human genome, with GCE being more efficient than EMR. The cost of running this experiment on EMR and GCE differed significantly, with the costs on EMR being 257.3% (95% CI: 211.5-303.1) and 173.9% (95% CI: 134.6-213.1) more expensive for E.coli and human assemblies respectively. Thus, GCE was found to outperform EMR both in terms of cost and wall-clock time. Our findings confirm that cloud computing is an efficient and potentially cost-effective alternative for analysis of large genomic datasets. In addition to releasing our cost-effectiveness comparison, we present available ready-to-use scripts for establishing Hadoop instances with Ganglia monitoring on EC2 or GCE.

  15. Robust computational analysis of rRNA hypervariable tag datasets.

    Directory of Open Access Journals (Sweden)

    Maksim Sipos

    Full Text Available Next-generation DNA sequencing is increasingly being utilized to probe microbial communities, such as gastrointestinal microbiomes, where it is important to be able to quantify measures of abundance and diversity. The fragmented nature of the 16S rRNA datasets obtained, coupled with their unprecedented size, has led to the recognition that the results of such analyses are potentially contaminated by a variety of artifacts, both experimental and computational. Here we quantify how multiple alignment and clustering errors contribute to overestimates of abundance and diversity, reflected by incorrect OTU assignment, corrupted phylogenies, inaccurate species diversity estimators, and rank abundance distribution functions. We show that straightforward procedural optimizations, combining preexisting tools, are effective in handling large (10(5-10(6 16S rRNA datasets, and we describe metrics to measure the effectiveness and quality of the estimators obtained. We introduce two metrics to ascertain the quality of clustering of pyrosequenced rRNA data, and show that complete linkage clustering greatly outperforms other widely used methods.

  16. BLAST-EXPLORER helps you building datasets for phylogenetic analysis

    Directory of Open Access Journals (Sweden)

    Claverie Jean-Michel

    2010-01-01

    Full Text Available Abstract Background The right sampling of homologous sequences for phylogenetic or molecular evolution analyses is a crucial step, the quality of which can have a significant impact on the final interpretation of the study. There is no single way for constructing datasets suitable for phylogenetic analysis, because this task intimately depends on the scientific question we want to address, Moreover, database mining softwares such as BLAST which are routinely used for searching homologous sequences are not specifically optimized for this task. Results To fill this gap, we designed BLAST-Explorer, an original and friendly web-based application that combines a BLAST search with a suite of tools that allows interactive, phylogenetic-oriented exploration of the BLAST results and flexible selection of homologous sequences among the BLAST hits. Once the selection of the BLAST hits is done using BLAST-Explorer, the corresponding sequence can be imported locally for external analysis or passed to the phylogenetic tree reconstruction pipelines available on the Phylogeny.fr platform. Conclusions BLAST-Explorer provides a simple, intuitive and interactive graphical representation of the BLAST results and allows selection and retrieving of the BLAST hit sequences based a wide range of criterions. Although BLAST-Explorer primarily aims at helping the construction of sequence datasets for further phylogenetic study, it can also be used as a standard BLAST server with enriched output. BLAST-Explorer is available at http://www.phylogeny.fr

  17. Multiresolution comparison of precipitation datasets for large-scale models

    Science.gov (United States)

    Chun, K. P.; Sapriza Azuri, G.; Davison, B.; DeBeer, C. M.; Wheater, H. S.

    2014-12-01

    Gridded precipitation datasets are crucial for driving large-scale models which are related to weather forecast and climate research. However, the quality of precipitation products is usually validated individually. Comparisons between gridded precipitation products along with ground observations provide another avenue for investigating how the precipitation uncertainty would affect the performance of large-scale models. In this study, using data from a set of precipitation gauges over British Columbia and Alberta, we evaluate several widely used North America gridded products including the Canadian Gridded Precipitation Anomalies (CANGRD), the National Center for Environmental Prediction (NCEP) reanalysis, the Water and Global Change (WATCH) project, the thin plate spline smoothing algorithms (ANUSPLIN) and Canadian Precipitation Analysis (CaPA). Based on verification criteria for various temporal and spatial scales, results provide an assessment of possible applications for various precipitation datasets. For long-term climate variation studies (~100 years), CANGRD, NCEP, WATCH and ANUSPLIN have different comparative advantages in terms of their resolution and accuracy. For synoptic and mesoscale precipitation patterns, CaPA provides appealing performance of spatial coherence. In addition to the products comparison, various downscaling methods are also surveyed to explore new verification and bias-reduction methods for improving gridded precipitation outputs for large-scale models.

  18. Benchmarking Deep Learning Models on Large Healthcare Datasets.

    Science.gov (United States)

    Purushotham, Sanjay; Meng, Chuizheng; Che, Zhengping; Liu, Yan

    2018-06-04

    Deep learning models (aka Deep Neural Networks) have revolutionized many fields including computer vision, natural language processing, speech recognition, and is being increasingly used in clinical healthcare applications. However, few works exist which have benchmarked the performance of the deep learning models with respect to the state-of-the-art machine learning models and prognostic scoring systems on publicly available healthcare datasets. In this paper, we present the benchmarking results for several clinical prediction tasks such as mortality prediction, length of stay prediction, and ICD-9 code group prediction using Deep Learning models, ensemble of machine learning models (Super Learner algorithm), SAPS II and SOFA scores. We used the Medical Information Mart for Intensive Care III (MIMIC-III) (v1.4) publicly available dataset, which includes all patients admitted to an ICU at the Beth Israel Deaconess Medical Center from 2001 to 2012, for the benchmarking tasks. Our results show that deep learning models consistently outperform all the other approaches especially when the 'raw' clinical time series data is used as input features to the models. Copyright © 2018 Elsevier Inc. All rights reserved.

  19. Testing the Neutral Theory of Biodiversity with Human Microbiome Datasets.

    Science.gov (United States)

    Li, Lianwei; Ma, Zhanshan Sam

    2016-08-16

    The human microbiome project (HMP) has made it possible to test important ecological theories for arguably the most important ecosystem to human health-the human microbiome. Existing limited number of studies have reported conflicting evidence in the case of the neutral theory; the present study aims to comprehensively test the neutral theory with extensive HMP datasets covering all five major body sites inhabited by the human microbiome. Utilizing 7437 datasets of bacterial community samples, we discovered that only 49 communities (less than 1%) satisfied the neutral theory, and concluded that human microbial communities are not neutral in general. The 49 positive cases, although only a tiny minority, do demonstrate the existence of neutral processes. We realize that the traditional doctrine of microbial biogeography "Everything is everywhere, but the environment selects" first proposed by Baas-Becking resolves the apparent contradiction. The first part of Baas-Becking doctrine states that microbes are not dispersal-limited and therefore are neutral prone, and the second part reiterates that the freely dispersed microbes must endure selection by the environment. Therefore, in most cases, it is the host environment that ultimately shapes the community assembly and tip the human microbiome to niche regime.

  20. Overview of the CERES Edition-4 Multilayer Cloud Property Datasets

    Science.gov (United States)

    Chang, F. L.; Minnis, P.; Sun-Mack, S.; Chen, Y.; Smith, R. A.; Brown, R. R.

    2014-12-01

    Knowledge of the cloud vertical distribution is important for understanding the role of clouds on earth's radiation budget and climate change. Since high-level cirrus clouds with low emission temperatures and small optical depths can provide a positive feedback to a climate system and low-level stratus clouds with high emission temperatures and large optical depths can provide a negative feedback effect, the retrieval of multilayer cloud properties using satellite observations, like Terra and Aqua MODIS, is critically important for a variety of cloud and climate applications. For the objective of the Clouds and the Earth's Radiant Energy System (CERES), new algorithms have been developed using Terra and Aqua MODIS data to allow separate retrievals of cirrus and stratus cloud properties when the two dominant cloud types are simultaneously present in a multilayer system. In this paper, we will present an overview of the new CERES Edition-4 multilayer cloud property datasets derived from Terra as well as Aqua. Assessment of the new CERES multilayer cloud datasets will include high-level cirrus and low-level stratus cloud heights, pressures, and temperatures as well as their optical depths, emissivities, and microphysical properties.

  1. Predicting weather regime transitions in Northern Hemisphere datasets

    Energy Technology Data Exchange (ETDEWEB)

    Kondrashov, D. [University of California, Department of Atmospheric and Oceanic Sciences and Institute of Geophysics and Planetary Physics, Los Angeles, CA (United States); Shen, J. [UCLA, Department of Statistics, Los Angeles, CA (United States); Berk, R. [UCLA, Department of Statistics, Los Angeles, CA (United States); University of Pennsylvania, Department of Criminology, Philadelphia, PA (United States); D' Andrea, F.; Ghil, M. [Ecole Normale Superieure, Departement Terre-Atmosphere-Ocean and Laboratoire de Meteorologie Dynamique (CNRS and IPSL), Paris Cedex 05 (France)

    2007-10-15

    A statistical learning method called random forests is applied to the prediction of transitions between weather regimes of wintertime Northern Hemisphere (NH) atmospheric low-frequency variability. A dataset composed of 55 winters of NH 700-mb geopotential height anomalies is used in the present study. A mixture model finds that the three Gaussian components that were statistically significant in earlier work are robust; they are the Pacific-North American (PNA) regime, its approximate reverse (the reverse PNA, or RNA), and the blocked phase of the North Atlantic Oscillation (BNAO). The most significant and robust transitions in the Markov chain generated by these regimes are PNA {yields} BNAO, PNA {yields} RNA and BNAO {yields} PNA. The break of a regime and subsequent onset of another one is forecast for these three transitions. Taking the relative costs of false positives and false negatives into account, the random-forests method shows useful forecasting skill. The calculations are carried out in the phase space spanned by a few leading empirical orthogonal functions of dataset variability. Plots of estimated response functions to a given predictor confirm the crucial influence of the exit angle on a preferred transition path. This result points to the dynamic origin of the transitions. (orig.)

  2. Digital Astronaut Photography: A Discovery Dataset for Archaeology

    Science.gov (United States)

    Stefanov, William L.

    2010-01-01

    Astronaut photography acquired from the International Space Station (ISS) using commercial off-the-shelf cameras offers a freely-accessible source for high to very high resolution (4-20 m/pixel) visible-wavelength digital data of Earth. Since ISS Expedition 1 in 2000, over 373,000 images of the Earth-Moon system (including land surface, ocean, atmospheric, and lunar images) have been added to the Gateway to Astronaut Photography of Earth online database (http://eol.jsc.nasa.gov ). Handheld astronaut photographs vary in look angle, time of acquisition, solar illumination, and spatial resolution. These attributes of digital astronaut photography result from a unique combination of ISS orbital dynamics, mission operations, camera systems, and the individual skills of the astronaut. The variable nature of astronaut photography makes the dataset uniquely useful for archaeological applications in comparison with more traditional nadir-viewing multispectral datasets acquired from unmanned orbital platforms. For example, surface features such as trenches, walls, ruins, urban patterns, and vegetation clearing and regrowth patterns may be accentuated by low sun angles and oblique viewing conditions (Fig. 1). High spatial resolution digital astronaut photographs can also be used with sophisticated land cover classification and spatial analysis approaches like Object Based Image Analysis, increasing the potential for use in archaeological characterization of landscapes and specific sites.

  3. ISC-EHB: Reconstruction of a robust earthquake dataset

    Science.gov (United States)

    Weston, J.; Engdahl, E. R.; Harris, J.; Di Giacomo, D.; Storchak, D. A.

    2018-04-01

    The EHB Bulletin of hypocentres and associated travel-time residuals was originally developed with procedures described by Engdahl, Van der Hilst and Buland (1998) and currently ends in 2008. It is a widely used seismological dataset, which is now expanded and reconstructed, partly by exploiting updated procedures at the International Seismological Centre (ISC), to produce the ISC-EHB. The reconstruction begins in the modern period (2000-2013) to which new and more rigorous procedures for event selection, data preparation, processing, and relocation are applied. The selection criteria minimise the location bias produced by unmodelled 3D Earth structure, resulting in events that are relatively well located in any given region. Depths of the selected events are significantly improved by a more comprehensive review of near station and secondary phase travel-time residuals based on ISC data, especially for the depth phases pP, pwP and sP, as well as by a rigorous review of the event depths in subduction zone cross sections. The resulting cross sections and associated maps are shown to provide details of seismicity in subduction zones in much greater detail than previously achievable. The new ISC-EHB dataset will be especially useful for global seismicity studies and high-frequency regional and global tomographic inversions.

  4. Benchmarking undedicated cloud computing providers for analysis of genomic datasets.

    Directory of Open Access Journals (Sweden)

    Seyhan Yazar

    Full Text Available A major bottleneck in biological discovery is now emerging at the computational level. Cloud computing offers a dynamic means whereby small and medium-sized laboratories can rapidly adjust their computational capacity. We benchmarked two established cloud computing services, Amazon Web Services Elastic MapReduce (EMR on Amazon EC2 instances and Google Compute Engine (GCE, using publicly available genomic datasets (E.coli CC102 strain and a Han Chinese male genome and a standard bioinformatic pipeline on a Hadoop-based platform. Wall-clock time for complete assembly differed by 52.9% (95% CI: 27.5-78.2 for E.coli and 53.5% (95% CI: 34.4-72.6 for human genome, with GCE being more efficient than EMR. The cost of running this experiment on EMR and GCE differed significantly, with the costs on EMR being 257.3% (95% CI: 211.5-303.1 and 173.9% (95% CI: 134.6-213.1 more expensive for E.coli and human assemblies respectively. Thus, GCE was found to outperform EMR both in terms of cost and wall-clock time. Our findings confirm that cloud computing is an efficient and potentially cost-effective alternative for analysis of large genomic datasets. In addition to releasing our cost-effectiveness comparison, we present available ready-to-use scripts for establishing Hadoop instances with Ganglia monitoring on EC2 or GCE.

  5. Condensing Massive Satellite Datasets For Rapid Interactive Analysis

    Science.gov (United States)

    Grant, G.; Gallaher, D. W.; Lv, Q.; Campbell, G. G.; Fowler, C.; LIU, Q.; Chen, C.; Klucik, R.; McAllister, R. A.

    2015-12-01

    Our goal is to enable users to interactively analyze massive satellite datasets, identifying anomalous data or values that fall outside of thresholds. To achieve this, the project seeks to create a derived database containing only the most relevant information, accelerating the analysis process. The database is designed to be an ancillary tool for the researcher, not an archival database to replace the original data. This approach is aimed at improving performance by reducing the overall size by way of condensing the data. The primary challenges of the project include: - The nature of the research question(s) may not be known ahead of time. - The thresholds for determining anomalies may be uncertain. - Problems associated with processing cloudy, missing, or noisy satellite imagery. - The contents and method of creation of the condensed dataset must be easily explainable to users. The architecture of the database will reorganize spatially-oriented satellite imagery into temporally-oriented columns of data (a.k.a., "data rods") to facilitate time-series analysis. The database itself is an open-source parallel database, designed to make full use of clustered server technologies. A demonstration of the system capabilities will be shown. Applications for this technology include quick-look views of the data, as well as the potential for on-board satellite processing of essential information, with the goal of reducing data latency.

  6. Las artes de leer e interpretar las hojas de coca

    Directory of Open Access Journals (Sweden)

    Eugenia Flores

    2017-03-01

    Full Text Available En este artículo quisiera hablar sobre un oficio secreto, subterráneo, clandestino, que es el arte de leer e interpretar las hojas de coca en espacios marginales del valle de Salta. Muestro en estas páginas las diferentes formas y modos en que la coca es utilizada por la población local, diversa y heterogénea, donde tradiciones indígenas ancestrales se manifiestan en prácticas concretas que rozan los límites entre la magia y la terapia. Presto especial interés, a partir del trabajo de campo realizado, a aquellas personas que manejan estas artes de leer las hojas de coca, las cuales en los casos analizados fueron adquiridas a partir de un ritual de paso, que fue el haber sido “tocado por el rayo”. Me interesa particularmente trabajar con estos hombres y mujeres, cuyo arte de hacer con coca es equiparable a las “maneras de hacer”. Estas maneras de hacer con la coca ritual no están visibilizadas y son subterráneas, marginales y silenciosas, en este artículo se pretende dar cuenta de estas prácticas con coca y sus dinámicas relaciones inter e intra comunales. Palabras-claves: Artes; maneras de hacer; coca; ritual.

  7. Las relaciones de Podemos con las organizaciones sociales

    Directory of Open Access Journals (Sweden)

    Ignacio Mariñas

    2015-09-01

    Full Text Available El 15M fue el principio del proceso de cambio en España y tiene una trascendencia en el mundo porque, por primera vez desde 1989, un movimiento contestatario al sistema alcanza una repercusión mundial. En consecuencia, todos partidos políticos en España atienden las demandas de este movimiento, bien para impulsar cambios en sus programas que atiendan a sus demandas o para intentar asumir su representación política como hace Podemos. En esta exposición se considera la tradición política de este país y el surgir del 15M, para comentar, después, las críticas a Podemos como heredero del 15M, y terminar con el análisis de su relación con las organizaciones sociales. La exposición se organiza en cinco tiempos que en su conjunto dan una visión poliédrica del problema. El objetivo es tratar de contradecir el escepticismo del Quijote sobre la posibilidad de comprender las razones y las dinámicas de los cambios sociales.

  8. Utilizing the Antarctic Master Directory to find orphan datasets

    Science.gov (United States)

    Bonczkowski, J.; Carbotte, S. M.; Arko, R. A.; Grebas, S. K.

    2011-12-01

    While most Antarctic data are housed at an established disciplinary-specific data repository, there are data types for which no suitable repository exists. In some cases, these "orphan" data, without an appropriate national archive, are served from local servers by the principal investigators who produced the data. There are many pitfalls with data served privately, including the frequent lack of adequate documentation to ensure the data can be understood by others for re-use and the impermanence of personal web sites. For example, if an investigator leaves an institution and the data moves, the link published is no longer accessible. To ensure continued availability of data, submission to long-term national data repositories is needed. As stated in the National Science Foundation Office of Polar Programs (NSF/OPP) Guidelines and Award Conditions for Scientific Data, investigators are obligated to submit their data for curation and long-term preservation; this includes the registration of a dataset description into the Antarctic Master Directory (AMD), http://gcmd.nasa.gov/Data/portals/amd/. The AMD is a Web-based, searchable directory of thousands of dataset descriptions, known as DIF records, submitted by scientists from over 20 countries. It serves as a node of the International Directory Network/Global Change Master Directory (IDN/GCMD). The US Antarctic Program Data Coordination Center (USAP-DCC), http://www.usap-data.org/, funded through NSF/OPP, was established in 2007 to help streamline the process of data submission and DIF record creation. When data does not quite fit within any existing disciplinary repository, it can be registered within the USAP-DCC as the fallback data repository. Within the scope of the USAP-DCC we undertook the challenge of discovering and "rescuing" orphan datasets currently registered within the AMD. In order to find which DIF records led to data served privately, all records relating to US data within the AMD were parsed. After

  9. Relaciones amorosas de pareja en las trayectorias vitales de las mujeres encarceladas

    OpenAIRE

    De Miguel Calvo, Estibaliz

    2012-01-01

    [ES]La tesis doctoral analiza las experiencias amorosas de pareja de mujeres encarceladas, con el doble objetivo de visibilizar a las mujeres presas en el ámbito de las ciencias sociales y de introducir las especificidades de las mujeres encarceladas en los debates sociológicos y feministas acerca del amor. Las escasas aproximaciones al amor entre las mujeres presas han tendido a explicar sus relaciones de pareja desde el concepto de “depende...

  10. ¿Las mujeres prefieren las cesáreas?

    Directory of Open Access Journals (Sweden)

    A. Vallejos Parás

    2016-07-01

    Full Text Available Las tasas de cesárea han aumentado de manera constante en la mayoría de los países de medianos y altos ingresos en los últimos decenios sin justificación médica. La solicitud materna es uno de los factores no médicos citados con frecuencia que contribuyen a esta tendencia. El objetivo de este documento es realizar una revisión de la bibliografía actual sobre las preferencias maternas por la cesárea.

  11. Las TIC, el proceso del conocimiento y las competencias docentes

    Directory of Open Access Journals (Sweden)

    Augusto Perez Lindo

    2014-11-01

    Full Text Available En el marco del Seminario Internacional "Formación y conocimiento" organizado por la Universidad de Sorocaba presentamos un análisis de los efectos de las TICs en los procesos del conocimiento. Tratamos de demostrar que esos efectos se producen en varias direcciones: cambios en la visión de la realidad, de la subjetividad, del lenguaje, de los paradigmas, de las relaciones sociales. Propone fortalecer competencias adecuadas al cambio de paradigma, como la capacidad para seleccionar e interpretar informaciones, la capacidad para pensar científicamente, la capacidad para comunicar, la capacidad para convivir, la capacidad para dominar varios lenguajes.

  12. Ecologia de las lombrices de tierra

    Science.gov (United States)

    Grizelle Gonzalez

    2014-01-01

    De los organismos de suelo, las lombrices de tierra son las mas conocidas y a menudo son consideradas las mas importantes por su influencia en el funcionamiento de ecosistemas de suelo (Hendriz y Bohlen, 2002). Tienen un efecto significativo en la estructura del suelo, el ciclo de nutrimentos y ls productividad de las cosechas. En terminos de biomasa, generalmente...

  13. NERIES: Seismic Data Gateways and User Composed Datasets Metadata Management

    Science.gov (United States)

    Spinuso, Alessandro; Trani, Luca; Kamb, Linus; Frobert, Laurent

    2010-05-01

    One of the NERIES EC project main objectives is to establish and improve the networking of seismic waveform data exchange and access among four main data centers in Europe: INGV, GFZ, ORFEUS and IPGP. Besides the implementation of the data backbone, several investigations and developments have been conducted in order to offer to the users the data available from this network, either programmatically or interactively. One of the challenges is to understand how to enable users` activities such as discovering, aggregating, describing and sharing datasets to obtain a decrease in the replication of similar data queries towards the network, exempting the data centers to guess and create useful pre-packed products. We`ve started to transfer this task more and more towards the users community, where the users` composed data products could be extensively re-used. The main link to the data is represented by a centralized webservice (SeismoLink) acting like a single access point to the whole data network. Users can download either waveform data or seismic station inventories directly from their own software routines by connecting to this webservice, which routes the request to the data centers. The provenance of the data is maintained and transferred to the users in the form of URIs, that identify the dataset and implicitly refer to the data provider. SeismoLink, combined with other webservices (eg EMSC-QuakeML earthquakes catalog service), is used from a community gateway such as the NERIES web portal (http://www.seismicportal.eu). Here the user interacts with a map based portlet which allows the dynamic composition of a data product, binding seismic event`s parameters with a set of seismic stations. The requested data is collected by the back-end processes of the portal, preserved and offered to the user in a personal data cart, where metadata can be generated interactively on-demand. The metadata, expressed in RDF, can also be remotely ingested. They offer rating

  14. Accuracy assessment of seven global land cover datasets over China

    Science.gov (United States)

    Yang, Yongke; Xiao, Pengfeng; Feng, Xuezhi; Li, Haixing

    2017-03-01

    Land cover (LC) is the vital foundation to Earth science. Up to now, several global LC datasets have arisen with efforts of many scientific communities. To provide guidelines for data usage over China, nine LC maps from seven global LC datasets (IGBP DISCover, UMD, GLC, MCD12Q1, GLCNMO, CCI-LC, and GlobeLand30) were evaluated in this study. First, we compared their similarities and discrepancies in both area and spatial patterns, and analysed their inherent relations to data sources and classification schemes and methods. Next, five sets of validation sample units (VSUs) were collected to calculate their accuracy quantitatively. Further, we built a spatial analysis model and depicted their spatial variation in accuracy based on the five sets of VSUs. The results show that, there are evident discrepancies among these LC maps in both area and spatial patterns. For LC maps produced by different institutes, GLC 2000 and CCI-LC 2000 have the highest overall spatial agreement (53.8%). For LC maps produced by same institutes, overall spatial agreement of CCI-LC 2000 and 2010, and MCD12Q1 2001 and 2010 reach up to 99.8% and 73.2%, respectively; while more efforts are still needed if we hope to use these LC maps as time series data for model inputting, since both CCI-LC and MCD12Q1 fail to represent the rapid changing trend of several key LC classes in the early 21st century, in particular urban and built-up, snow and ice, water bodies, and permanent wetlands. With the highest spatial resolution, the overall accuracy of GlobeLand30 2010 is 82.39%. For the other six LC datasets with coarse resolution, CCI-LC 2010/2000 has the highest overall accuracy, and following are MCD12Q1 2010/2001, GLC 2000, GLCNMO 2008, IGBP DISCover, and UMD in turn. Beside that all maps exhibit high accuracy in homogeneous regions; local accuracies in other regions are quite different, particularly in Farming-Pastoral Zone of North China, mountains in Northeast China, and Southeast Hills. Special

  15. LAS MIGRACIONES INTERNACIONALES EN COLOMBIA

    Directory of Open Access Journals (Sweden)

    Maguemati Wabgou

    2012-01-01

    Full Text Available El artículo expone los resultados de una investigación sobre los flujos migratorios internacionales que han estado llegando a Colombia desde el siglo XVI hasta la actualidad. La metodología investigativa se fundamenta en la recolección y el análisis de datos derivados de investigaciones anteriores. Los resultados son significativos en lamedida que presentan el estado de las migraciones en Colombia entre los siglos XVI y XIX y desde la primera mitad del siglo XX hasta la actualidad. Por un lado, engloban a la inmigración británica, jamaiquina y africana en las islas de San Andrés y Providencia; junto con la inmigración árabe, judía, alemana, francesa, italiana y gitana en Colombia continental. Por otro, aluden a las migraciones japonesas, suramericanas, junto con los crecientes asentamientos de migrantes norteamericanos, europeos y las nuevas oleadas de migraciones asiáticas y africanas en Colombia. Todo ello lleva a aprehender la envergadura del papel jugado por Colombia como un país de destino de las migraciones internacionales, lo que contrasta con la imagen más conocida de Colombia como un país expulsor de emigrantes.

  16. Las canciones mueven tus inteligencias

    Directory of Open Access Journals (Sweden)

    Dayane Mónica Cordeiro

    2014-09-01

    Full Text Available La canción es uno de los recursos más estimulantes y provechosos para enseñar lengua española, pues responde a los objetivos lingüísticos y comunicativos que ideamos fomentar en al aula de E/LE. Llevar las aportaciones de la teoría de las inteligencias múltiples al campo de la didáctica de la enseñanza de lenguas extranjeras ha sido el objetivo de esta experiencia práctica. La unidad didáctica Primavera Trompetera parte de la premisa de que fomentar la diversidad cognitiva propuesta en laTeoría de las Inteligencias Múltiples del psicólogo Howard Gardner, utilizando comorecurso didáctico la canción Primavera Trompetera del grupo «Los Delinquentes», puede facilitar el proceso de enseñanza/aprendizaje de lengua española, trabajando las destrezas integradas a las inteligencias múltiples.

  17. Gridded 5km GHCN-Daily Temperature and Precipitation Dataset, Version 1

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Gridded 5km GHCN-Daily Temperature and Precipitation Dataset (nClimGrid) consists of four climate variables derived from the GHCN-D dataset: maximum temperature,...

  18. Active Semisupervised Clustering Algorithm with Label Propagation for Imbalanced and Multidensity Datasets

    Directory of Open Access Journals (Sweden)

    Mingwei Leng

    2013-01-01

    Full Text Available The accuracy of most of the existing semisupervised clustering algorithms based on small size of labeled dataset is low when dealing with multidensity and imbalanced datasets, and labeling data is quite expensive and time consuming in many real-world applications. This paper focuses on active data selection and semisupervised clustering algorithm in multidensity and imbalanced datasets and proposes an active semisupervised clustering algorithm. The proposed algorithm uses an active mechanism for data selection to minimize the amount of labeled data, and it utilizes multithreshold to expand labeled datasets on multidensity and imbalanced datasets. Three standard datasets and one synthetic dataset are used to demonstrate the proposed algorithm, and the experimental results show that the proposed semisupervised clustering algorithm has a higher accuracy and a more stable performance in comparison to other clustering and semisupervised clustering algorithms, especially when the datasets are multidensity and imbalanced.

  19. Dataset for Probabilistic estimation of residential air exchange rates for population-based exposure modeling

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset provides the city-specific air exchange rate measurements, modeled, literature-based as well as housing characteristics. This dataset is associated with...

  20. An Affinity Propagation Clustering Algorithm for Mixed Numeric and Categorical Datasets

    Directory of Open Access Journals (Sweden)

    Kang Zhang

    2014-01-01

    Full Text Available Clustering has been widely used in different fields of science, technology, social science, and so forth. In real world, numeric as well as categorical features are usually used to describe the data objects. Accordingly, many clustering methods can process datasets that are either numeric or categorical. Recently, algorithms that can handle the mixed data clustering problems have been developed. Affinity propagation (AP algorithm is an exemplar-based clustering method which has demonstrated good performance on a wide variety of datasets. However, it has limitations on processing mixed datasets. In this paper, we propose a novel similarity measure for mixed type datasets and an adaptive AP clustering algorithm is proposed to cluster the mixed datasets. Several real world datasets are studied to evaluate the performance of the proposed algorithm. Comparisons with other clustering algorithms demonstrate that the proposed method works well not only on mixed datasets but also on pure numeric and categorical datasets.

  1. Ecohydrological Index, Native Fish, and Climate Trends and Relationships in the Kansas River Basin_dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — The dataset is an excel file that contain data for the figures in the manuscript. This dataset is associated with the following publication: Sinnathamby, S., K....

  2. Global Human Built-up And Settlement Extent (HBASE) Dataset From Landsat

    Data.gov (United States)

    National Aeronautics and Space Administration — The Global Human Built-up And Settlement Extent (HBASE) Dataset from Landsat is a global map of HBASE derived from the Global Land Survey (GLS) Landsat dataset for...

  3. An Automatic Matcher and Linker for Transportation Datasets

    Directory of Open Access Journals (Sweden)

    Ali Masri

    2017-01-01

    Full Text Available Multimodality requires the integration of heterogeneous transportation data to construct a broad view of the transportation network. Many new transportation services are emerging while being isolated from previously-existing networks. This leads them to publish their data sources to the web, according to linked data principles, in order to gain visibility. Our interest is to use these data to construct an extended transportation network that links these new services to existing ones. The main problems we tackle in this article fall in the categories of automatic schema matching and data interlinking. We propose an approach that uses web services as mediators to help in automatically detecting geospatial properties and mapping them between two different schemas. On the other hand, we propose a new interlinking approach that enables the user to define rich semantic links between datasets in a flexible and customizable way.

  4. [Parallel virtual reality visualization of extreme large medical datasets].

    Science.gov (United States)

    Tang, Min

    2010-04-01

    On the basis of a brief description of grid computing, the essence and critical techniques of parallel visualization of extreme large medical datasets are discussed in connection with Intranet and common-configuration computers of hospitals. In this paper are introduced several kernel techniques, including the hardware structure, software framework, load balance and virtual reality visualization. The Maximum Intensity Projection algorithm is realized in parallel using common PC cluster. In virtual reality world, three-dimensional models can be rotated, zoomed, translated and cut interactively and conveniently through the control panel built on virtual reality modeling language (VRML). Experimental results demonstrate that this method provides promising and real-time results for playing the role in of a good assistant in making clinical diagnosis.

  5. The wildland-urban interface raster dataset of Catalonia.

    Science.gov (United States)

    Alcasena, Fermín J; Evers, Cody R; Vega-Garcia, Cristina

    2018-04-01

    We provide the wildland urban interface (WUI) map of the autonomous community of Catalonia (Northeastern Spain). The map encompasses an area of some 3.21 million ha and is presented as a 150-m resolution raster dataset. Individual housing location, structure density and vegetation cover data were used to spatially assess in detail the interface, intermix and dispersed rural WUI communities with a geographical information system. Most WUI areas concentrate in the coastal belt where suburban sprawl has occurred nearby or within unmanaged forests. This geospatial information data provides an approximation of residential housing potential for loss given a wildfire, and represents a valuable contribution to assist landscape and urban planning in the region.

  6. xarray: N-D labeled Arrays and Datasets in Python

    Directory of Open Access Journals (Sweden)

    Stephan Hoyer

    2017-04-01

    Full Text Available xarray is an open source project and Python package that provides a toolkit and data structures for N-dimensional labeled arrays. Our approach combines an application programing interface (API inspired by pandas with the Common Data Model for self-described scientific data. Key features of the xarray package include label-based indexing and arithmetic, interoperability with the core scientific Python packages (e.g., pandas, NumPy, Matplotlib, out-of-core computation on datasets that don’t fit into memory, a wide range of serialization and input/output (I/O options, and advanced multi-dimensional data manipulation tools such as group-by and resampling. xarray, as a data model and analytics toolkit, has been widely adopted in the geoscience community but is also used more broadly for multi-dimensional data analysis in physics, machine learning and finance.

  7. The wildland-urban interface raster dataset of Catalonia

    Directory of Open Access Journals (Sweden)

    Fermín J. Alcasena

    2018-04-01

    Full Text Available We provide the wildland urban interface (WUI map of the autonomous community of Catalonia (Northeastern Spain. The map encompasses an area of some 3.21 million ha and is presented as a 150-m resolution raster dataset. Individual housing location, structure density and vegetation cover data were used to spatially assess in detail the interface, intermix and dispersed rural WUI communities with a geographical information system. Most WUI areas concentrate in the coastal belt where suburban sprawl has occurred nearby or within unmanaged forests. This geospatial information data provides an approximation of residential housing potential for loss given a wildfire, and represents a valuable contribution to assist landscape and urban planning in the region. Keywords: Wildland-urban interface, Wildfire risk, Urban planning, Human communities, Catalonia

  8. Reconstructing flaw image using dataset of full matrix capture technique

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Tae Hun; Kim, Yong Sik; Lee, Jeong Seok [KHNP Central Research Institute, Daejeon (Korea, Republic of)

    2017-02-15

    A conventional phased array ultrasonic system offers the ability to steer an ultrasonic beam by applying independent time delays of individual elements in the array and produce an ultrasonic image. In contrast, full matrix capture (FMC) is a data acquisition process that collects a complete matrix of A-scans from every possible independent transmit-receive combination in a phased array transducer and makes it possible to reconstruct various images that cannot be produced by conventional phased array with the post processing as well as images equivalent to a conventional phased array image. In this paper, a basic algorithm based on the LLL mode total focusing method (TFM) that can image crack type flaws is described. And this technique was applied to reconstruct flaw images from the FMC dataset obtained from the experiments and ultrasonic simulation.

  9. Survey dataset on occupational hazards on construction sites

    Directory of Open Access Journals (Sweden)

    Patience F. Tunji-Olayeni

    2018-06-01

    Full Text Available The construction site provides an unfriendly working conditions, exposing workers to one of the harshest environments at a workplace. In this dataset, a structured questionnaire was design directed to thirty-five (35 craftsmen selected through a purposive sampling technique on various construction sites in one of the most populous cities in sub-Saharan Africa. The set of descriptive statistics is presented with tables, stacked bar chats and pie charts. Common occupational health conditions affecting the cardiovascular, respiratory and musculoskeletal systems of craftsmen on construction sites were identified. The effects of occupational health hazards on craftsmen and on construction project performance can be determined when the data is analyzed. Moreover, contractors’ commitment to occupational health and safety (OHS can be obtained from the analysis of the survey data. Keywords: Accidents, Construction industry, Craftsmen, Health, Occupational hazards

  10. Feedback control in deep drawing based on experimental datasets

    Science.gov (United States)

    Fischer, P.; Heingärtner, J.; Aichholzer, W.; Hortig, D.; Hora, P.

    2017-09-01

    In large-scale production of deep drawing parts, like in automotive industry, the effects of scattering material properties as well as warming of the tools have a significant impact on the drawing result. In the scope of the work, an approach is presented to minimize the influence of these effects on part quality by optically measuring the draw-in of each part and adjusting the settings of the press to keep the strain distribution, which is represented by the draw-in, inside a certain limit. For the design of the control algorithm, a design of experiments for in-line tests is used to quantify the influence of the blank holder force as well as the force distribution on the draw-in. The results of this experimental dataset are used to model the process behavior. Based on this model, a feedback control loop is designed. Finally, the performance of the control algorithm is validated in the production line.

  11. Orthology detection combining clustering and synteny for very large datasets.

    Science.gov (United States)

    Lechner, Marcus; Hernandez-Rosales, Maribel; Doerr, Daniel; Wieseke, Nicolas; Thévenin, Annelyse; Stoye, Jens; Hartmann, Roland K; Prohaska, Sonja J; Stadler, Peter F

    2014-01-01

    The elucidation of orthology relationships is an important step both in gene function prediction as well as towards understanding patterns of sequence evolution. Orthology assignments are usually derived directly from sequence similarities for large data because more exact approaches exhibit too high computational costs. Here we present PoFF, an extension for the standalone tool Proteinortho, which enhances orthology detection by combining clustering, sequence similarity, and synteny. In the course of this work, FFAdj-MCS, a heuristic that assesses pairwise gene order using adjacencies (a similarity measure related to the breakpoint distance) was adapted to support multiple linear chromosomes and extended to detect duplicated regions. PoFF largely reduces the number of false positives and enables more fine-grained predictions than purely similarity-based approaches. The extension maintains the low memory requirements and the efficient concurrency options of its basis Proteinortho, making the software applicable to very large datasets.

  12. Orthology detection combining clustering and synteny for very large datasets.

    Directory of Open Access Journals (Sweden)

    Marcus Lechner

    Full Text Available The elucidation of orthology relationships is an important step both in gene function prediction as well as towards understanding patterns of sequence evolution. Orthology assignments are usually derived directly from sequence similarities for large data because more exact approaches exhibit too high computational costs. Here we present PoFF, an extension for the standalone tool Proteinortho, which enhances orthology detection by combining clustering, sequence similarity, and synteny. In the course of this work, FFAdj-MCS, a heuristic that assesses pairwise gene order using adjacencies (a similarity measure related to the breakpoint distance was adapted to support multiple linear chromosomes and extended to detect duplicated regions. PoFF largely reduces the number of false positives and enables more fine-grained predictions than purely similarity-based approaches. The extension maintains the low memory requirements and the efficient concurrency options of its basis Proteinortho, making the software applicable to very large datasets.

  13. Comprehensive comparison of large-scale tissue expression datasets

    DEFF Research Database (Denmark)

    Santos Delgado, Alberto; Tsafou, Kalliopi; Stolte, Christian

    2015-01-01

    a comprehensive evaluation of tissue expression data from a variety of experimental techniques and show that these agree surprisingly well with each other and with results from literature curation and text mining. We further found that most datasets support the assumed but not demonstrated distinction between......For tissues to carry out their functions, they rely on the right proteins to be present. Several high-throughput technologies have been used to map out which proteins are expressed in which tissues; however, the data have not previously been systematically compared and integrated. We present......://tissues.jensenlab.org), which makes all the scored and integrated data available through a single user-friendly web interface....

  14. The SAIL databank: linking multiple health and social care datasets

    Directory of Open Access Journals (Sweden)

    Ford David V

    2009-01-01

    Full Text Available Abstract Background Vast amounts of data are collected about patients and service users in the course of health and social care service delivery. Electronic data systems for patient records have the potential to revolutionise service delivery and research. But in order to achieve this, it is essential that the ability to link the data at the individual record level be retained whilst adhering to the principles of information governance. The SAIL (Secure Anonymised Information Linkage databank has been established using disparate datasets, and over 500 million records from multiple health and social care service providers have been loaded to date, with further growth in progress. Methods Having established the infrastructure of the databank, the aim of this work was to develop and implement an accurate matching process to enable the assignment of a unique Anonymous Linking Field (ALF to person-based records to make the databank ready for record-linkage research studies. An SQL-based matching algorithm (MACRAL, Matching Algorithm for Consistent Results in Anonymised Linkage was developed for this purpose. Firstly the suitability of using a valid NHS number as the basis of a unique identifier was assessed using MACRAL. Secondly, MACRAL was applied in turn to match primary care, secondary care and social services datasets to the NHS Administrative Register (NHSAR, to assess the efficacy of this process, and the optimum matching technique. Results The validation of using the NHS number yielded specificity values > 99.8% and sensitivity values > 94.6% using probabilistic record linkage (PRL at the 50% threshold, and error rates were Conclusion With the infrastructure that has been put in place, the reliable matching process that has been developed enables an ALF to be consistently allocated to records in the databank. The SAIL databank represents a research-ready platform for record-linkage studies.

  15. Analysis of Public Datasets for Wearable Fall Detection Systems.

    Science.gov (United States)

    Casilari, Eduardo; Santoyo-Ramón, José-Antonio; Cano-García, José-Manuel

    2017-06-27

    Due to the boom of wireless handheld devices such as smartwatches and smartphones, wearable Fall Detection Systems (FDSs) have become a major focus of attention among the research community during the last years. The effectiveness of a wearable FDS must be contrasted against a wide variety of measurements obtained from inertial sensors during the occurrence of falls and Activities of Daily Living (ADLs). In this regard, the access to public databases constitutes the basis for an open and systematic assessment of fall detection techniques. This paper reviews and appraises twelve existing available data repositories containing measurements of ADLs and emulated falls envisaged for the evaluation of fall detection algorithms in wearable FDSs. The analysis of the found datasets is performed in a comprehensive way, taking into account the multiple factors involved in the definition of the testbeds deployed for the generation of the mobility samples. The study of the traces brings to light the lack of a common experimental benchmarking procedure and, consequently, the large heterogeneity of the datasets from a number of perspectives (length and number of samples, typology of the emulated falls and ADLs, characteristics of the test subjects, features and positions of the sensors, etc.). Concerning this, the statistical analysis of the samples reveals the impact of the sensor range on the reliability of the traces. In addition, the study evidences the importance of the selection of the ADLs and the need of categorizing the ADLs depending on the intensity of the movements in order to evaluate the capability of a certain detection algorithm to discriminate falls from ADLs.

  16. Clusterflock: a flocking algorithm for isolating congruent phylogenomic datasets.

    Science.gov (United States)

    Narechania, Apurva; Baker, Richard; DeSalle, Rob; Mathema, Barun; Kolokotronis, Sergios-Orestis; Kreiswirth, Barry; Planet, Paul J

    2016-10-24

    Collective animal behavior, such as the flocking of birds or the shoaling of fish, has inspired a class of algorithms designed to optimize distance-based clusters in various applications, including document analysis and DNA microarrays. In a flocking model, individual agents respond only to their immediate environment and move according to a few simple rules. After several iterations the agents self-organize, and clusters emerge without the need for partitional seeds. In addition to its unsupervised nature, flocking offers several computational advantages, including the potential to reduce the number of required comparisons. In the tool presented here, Clusterflock, we have implemented a flocking algorithm designed to locate groups (flocks) of orthologous gene families (OGFs) that share an evolutionary history. Pairwise distances that measure phylogenetic incongruence between OGFs guide flock formation. We tested this approach on several simulated datasets by varying the number of underlying topologies, the proportion of missing data, and evolutionary rates, and show that in datasets containing high levels of missing data and rate heterogeneity, Clusterflock outperforms other well-established clustering techniques. We also verified its utility on a known, large-scale recombination event in Staphylococcus aureus. By isolating sets of OGFs with divergent phylogenetic signals, we were able to pinpoint the recombined region without forcing a pre-determined number of groupings or defining a pre-determined incongruence threshold. Clusterflock is an open-source tool that can be used to discover horizontally transferred genes, recombined areas of chromosomes, and the phylogenetic 'core' of a genome. Although we used it here in an evolutionary context, it is generalizable to any clustering problem. Users can write extensions to calculate any distance metric on the unit interval, and can use these distances to 'flock' any type of data.

  17. The SAIL databank: linking multiple health and social care datasets.

    Science.gov (United States)

    Lyons, Ronan A; Jones, Kerina H; John, Gareth; Brooks, Caroline J; Verplancke, Jean-Philippe; Ford, David V; Brown, Ginevra; Leake, Ken

    2009-01-16

    Vast amounts of data are collected about patients and service users in the course of health and social care service delivery. Electronic data systems for patient records have the potential to revolutionise service delivery and research. But in order to achieve this, it is essential that the ability to link the data at the individual record level be retained whilst adhering to the principles of information governance. The SAIL (Secure Anonymised Information Linkage) databank has been established using disparate datasets, and over 500 million records from multiple health and social care service providers have been loaded to date, with further growth in progress. Having established the infrastructure of the databank, the aim of this work was to develop and implement an accurate matching process to enable the assignment of a unique Anonymous Linking Field (ALF) to person-based records to make the databank ready for record-linkage research studies. An SQL-based matching algorithm (MACRAL, Matching Algorithm for Consistent Results in Anonymised Linkage) was developed for this purpose. Firstly the suitability of using a valid NHS number as the basis of a unique identifier was assessed using MACRAL. Secondly, MACRAL was applied in turn to match primary care, secondary care and social services datasets to the NHS Administrative Register (NHSAR), to assess the efficacy of this process, and the optimum matching technique. The validation of using the NHS number yielded specificity values > 99.8% and sensitivity values > 94.6% using probabilistic record linkage (PRL) at the 50% threshold, and error rates were SAIL databank represents a research-ready platform for record-linkage studies.

  18. Analysis of Public Datasets for Wearable Fall Detection Systems

    Directory of Open Access Journals (Sweden)

    Eduardo Casilari

    2017-06-01

    Full Text Available Due to the boom of wireless handheld devices such as smartwatches and smartphones, wearable Fall Detection Systems (FDSs have become a major focus of attention among the research community during the last years. The effectiveness of a wearable FDS must be contrasted against a wide variety of measurements obtained from inertial sensors during the occurrence of falls and Activities of Daily Living (ADLs. In this regard, the access to public databases constitutes the basis for an open and systematic assessment of fall detection techniques. This paper reviews and appraises twelve existing available data repositories containing measurements of ADLs and emulated falls envisaged for the evaluation of fall detection algorithms in wearable FDSs. The analysis of the found datasets is performed in a comprehensive way, taking into account the multiple factors involved in the definition of the testbeds deployed for the generation of the mobility samples. The study of the traces brings to light the lack of a common experimental benchmarking procedure and, consequently, the large heterogeneity of the datasets from a number of perspectives (length and number of samples, typology of the emulated falls and ADLs, characteristics of the test subjects, features and positions of the sensors, etc.. Concerning this, the statistical analysis of the samples reveals the impact of the sensor range on the reliability of the traces. In addition, the study evidences the importance of the selection of the ADLs and the need of categorizing the ADLs depending on the intensity of the movements in order to evaluate the capability of a certain detection algorithm to discriminate falls from ADLs.

  19. Las mujeres en el espejo

    Directory of Open Access Journals (Sweden)

    Graciela Estrada

    2007-12-01

    Full Text Available Las mujeres establecen con el espejo una relación particular en la constitución de su imagen dada la semejanza de un mismo cuerpo con la madre y la trasladan al espejo de otras mujeres en la búsqueda de respuestas al enigma de su sexualidad. En la construcción del cuerpo y sexualidad femeninos tienen un lugar privilegiado el primer vínculo materno y un tercero como instancia simbólica de cuya mirada de aceptación dependerá la relación futura con su cuerpo y con el deseo. Esta estructura se pone a prueba en la pubertad: junto con las transformaciones reales del cuerpo femenino, cobra un primer plano la imagen, irrumpen las pulsiones y aparece la mirada deseante del otro sexo.

  20. El (desorden de las ciudades

    Directory of Open Access Journals (Sweden)

    Carlos Eduardo Maldonado

    2014-01-01

    Full Text Available Las ciudades son sistemas físicos, y estos tienen un orden. En la física clásica –esto es, en la mecánica clásica–, el orden era estable, rígido, jerárquico o regular y periódico. Los principios que explican dicho orden son, en verdad, elementales. Sin embargo, con la nueva física, dicho orden se revela como de mayor complejidad. Asimismo, las ciudades son sistemas vivos exactamente en la medida en que metabolizan información, materia y energía. Ahora bien, la complejidad del mundo actual está relacionada profundamente con la digitalización del mismo. Ello implica nuevos modos de escritura, nuevos modos de lectura, nuevos modos de comunicación y, manifiestamente, nuevos lenguajes. De allí que se advierta la emergencia de una nueva clase social: la de los migrantes y los nativos en las tecnologías convergentes. Son ellos los que están redefiniendo el (desorden de las ciudades. Pues no sin ambages, la ciudad es ahora y cada vez más, el lugar de la comunicación y el lenguaje, de las vivencias y el convivio. Y para escándalo de las mentalidades tradicionales, el convivio y el habitar, la comunicación y el lenguaje son cada vez más digitales. Una auténtica revolución, en toda la línea de la palabra.

  1. El (desorden de las ciudades

    Directory of Open Access Journals (Sweden)

    Carlos Eduardo Maldonado

    2014-07-01

    Full Text Available Las ciudades son sistemas físicos, y estos tienen un orden. En la física clásica –esto es, en la mecánica clásica–, el orden era estable, rígido, jerárquico o regular y periódico. Los principios que explican dicho orden son, en verdad, elementales. Sin embargo, con la nueva física, dicho orden se revela como de mayor complejidad. Asimismo, las ciudades son sistemas vivos exactamente en la medida en que metabolizan información, materia y energía. Ahora bien, la complejidad del mundo actual está relacionada profundamente con la digitalización del mismo. Ello implica nuevos modos de escritura, nuevos modos de lectura, nuevos modos de comunicación y, manifiestamente, nuevos lenguajes. De allí que se advierta la emergencia de una nueva clase social: la de los migrantes y los nativos en las tecnologías convergentes. Son ellos los que están redefiniendo el (desorden de las ciudades. Pues no sin ambages, la ciudad es ahora y cada vez más, el lugar de la comunicación y el lenguaje, de las vivencias y el convivio. Y para escándalo de las mentalidades tradicionales, el convivio y el habitar, la comunicación y el lenguaje son cada vez más digitales. Una auténtica revolución, en toda la línea de la palabra.

  2. A high quality finger vascular pattern dataset collected using a custom designed capturing device

    NARCIS (Netherlands)

    Ton, B.T.; Veldhuis, Raymond N.J.

    2013-01-01

    The number of finger vascular pattern datasets available for the research community is scarce, therefore a new finger vascular pattern dataset containing 1440 images is prsented. This dataset is unique in its kind as the images are of high resolution and have a known pixel density. Furthermore this

  3. Something From Nothing (There): Collecting Global IPv6 Datasets from DNS

    NARCIS (Netherlands)

    Fiebig, T.; Borgolte, Kevin; Hao, Shuang; Kruegel, Christopher; Vigna, Giovanny; Spring, Neil; Riley, George F.

    2017-01-01

    Current large-scale IPv6 studies mostly rely on non-public datasets, asmost public datasets are domain specific. For instance, traceroute-based datasetsare biased toward network equipment. In this paper, we present a new methodologyto collect IPv6 address datasets that does not require access to

  4. Las cosas por su nombre

    OpenAIRE

    Storani, Emilia

    2017-01-01

    El artículo propone indagar sobre los modos y diferentes formatos que se utilizan tanto en la escritura como en la lectura, para articular con las luchas por la identidad de género. La Ley de Identidad de Género ha sido un puntapié clave para pensarnos a nosotros mismos culturalmente y para pensar a los demás. Pero, ¿cómo mencionamos, escribimos y leemos las diferentes identidades? La escritura, también es un mundo transformador para quienes bregan por una sociedad más libre y sin prejuicios....

  5. Las normas internacionales de contabilidad

    OpenAIRE

    Monge, Pedro

    2006-01-01

    SUMARIO Editorial Peña, Aura Elena Artículos La calidad en la microempresa merideña y su impacto en el ambiente. Bustos F, Carlos E. Mecanismos de transmisión de la política monetaria. Chuecos, Alicia Contabilidad de Costos: una evaluación a la cátedra. Molina, Olga Rosa Las normas internacionales de contabilidad. Monge, Pedro Análisis de la cadena de valor industrial y de la cadena de valor agregado para las pequeñas y medianas indust...

  6. Gobierno corporativo en las pymes

    OpenAIRE

    Vásquez Vélez, Stefany; Dorado Paz, María Isabel

    2016-01-01

    Las pequeñas y medianas empresas - PYMES representan el más alto porcentaje de compañías existentes a nivel mundial 1 y en consecuencia, son el motor que mueve la economía. Pese a ello, experimentan varios inconvenientes que en muchos casos les impide crecer. El gobierno corporativo hoy en día se impone como un tema inquietante en el ámbito empresarial, porque es visto como un mecanismo que les ayuda a las compañías a alcanzar sus objetivos y los de sus grupos de inter...

  7. Las manifestaciones sociales en Brasil

    OpenAIRE

    Gonçalves Chaves, Mariluci

    2013-01-01

    El país del fútbol se ha convertido en el país de las protestas y manifestaciones. Manifestaciones que han sido pacíficas algunas veces y violenta otras, surgió inicialmente como respuesta a los aumentos en las tarifas de transporte público, con convocatorias a través de redes sociales, obtuvo un fuerte apoyo popular después de la violenta y desproporcionada represión de la policía , convirtiéndose en uno de los mayores movimientos sociales en la historia de Brasil, comparable sólo con los mo...

  8. ¿Quién habla de las mujeres en las noticias donde ellas son las protagonistas?

    Directory of Open Access Journals (Sweden)

    Ana Tamarit

    2011-01-01

    Full Text Available Este artículo presenta un análisis de las fuentes que los periodistas utilizan en las informaciones donde las mujeres son protagonistas de las noticias. La investigación -cuantitativa y cualitativa- se ha realizado en los periódicos de ámbito nacional y local de Castilla y León. Con los resultados obtenidos observamos cómo en los periódicos las noticias que hablan de las mujeres en la mayoría de las ocasiones no se firman. Comprobamos cuáles son las fuentes más utilizadas para la elaboración de esas noticias y las diferentes formas de citar a la fuente cuando se trata de un hombre o de una mujer.

  9. Usos de las herramientas digitales entre las personas mayores

    Directory of Open Access Journals (Sweden)

    Javier Fombona Cadavieco

    2012-10-01

    Full Text Available Una sociedad «multiedades» supone diseñar y crear nuevos espacios de aprendizaje y comunicación, capaces de gestionar la demanda existente por parte de las personas mayores. En este artículo, se aborda la relación de las personas mayores con las tecnologías de la información y la comunicación (TIC y para ello se plantean dos objetivos: el primero va dirigido a conocer los recursos tecnológicos que utilizan y el segundo, a describir objetivamente los tipos de uso que estas personas hacen de las TIC. Para ello, se utiliza la técnica de encuesta, cuyos resultados son contrastados mediante grupos de discusión. En el estudio participaron 215 personas mayores usuarias de las TIC y siete grupos de discusión de cinco personas cada uno. Los resultados encontrados indican que los recursos que más utilizan los mayores son, los ordenadores e Internet y el uso que hacen de los mismos se ha agrupado en cuatro grandes categorías: formación, información, comunicación y entretenimiento, no encontrándose diferencias significativas en función del género o de la edad y sí se encontraron diferencias en cuanto a la disponibilidad de dichos recursos para uso particular en función del nivel de estudios.

  10. Las rondas de las mujeres por las ventanillas del Estado: Etnografia de un trabajo invisible

    Directory of Open Access Journals (Sweden)

    Emilia Schijman

    2011-06-01

    Full Text Available A partir de una investigación de campo en un barrio de vivienda social de la periferia parisina, el artículo explora los circuitos diarios de las mujeres por las oficinas del Estado. El análisis muestra el trabajo en la ventanilla como una movilización cotidiana, individual y colectiva, que liga sin cesar la esfera doméstica y la esfera pública, la práctica burocrática y la micro-política. Familiarizarse con las categorías administrativas y jurídicas, manipular diferentes regímenes de solicitación, juntar testigos y acumular pruebas escritas, todas estas actividades forman un trabajo invisible pero imprescindible para reclamar derechos y activar la economía de la familia.

  11. Las herramientas del marketing y las tics: su uso en las Pymes para el desarrollo empresarial

    Directory of Open Access Journals (Sweden)

    Dianexy Carreño Villavicencio

    2015-12-01

    Full Text Available El mundo del marketing ha cambiado significativamente en los últimos años debido, en gran parte, al surgimiento del Internet como herramienta de investigación y búsqueda. Este se ha constituido en uno de los elementos  tecnológicos  más significantes  dentro  del ambiente empresarial. La facilidad que el Internet ofrece a las empresas para acceder a información y proveer datos relacionados  a las transacciones  de negocio,  habilita a dichas empresas  a lograr mayores resultados en su estrategia de marketing. Independientemente del tamaño de la empresa, el Internet ha permitido que estas hayan cambiado la forma de acceder y proveer información.  Este, además,  ha sentado nuevas pautas en la manera de comprar, buscar, recopilar  y aprender.  En este  estudio,  realizado  en  la  ciudad  de  Manta  -  Ecuador,  los resultados demuestran que todas las pymes están conectadas al Internet y solo 3 de cada 10 tienen departamento de marketing  y no emplean adecuadamente  las herramientas que el marketing y las TIC les proporcionan   para alcanzar un mejor desarrollo empresarial. La investigación   realizada   fue cualitativa,   usándose   la   técnica   de   la   encuesta    para  el levantamiento de la información. Por tanto, puede afirmarse que no existe una cultura de marketing en las pymes de la ciudad de Manta - Ecuador. Palabras clave: Generalidades, marketing, publicidad, otros.

  12. Las fotocomposiciones de Enric Miralles

    Directory of Open Access Journals (Sweden)

    Dra. Arq. Laura Lopes Cezar

    2013-04-01

    Full Text Available Las fotocomposiciones del arquitecto catalán Enric Miralles fueron investigadas en mi tesis doctoral realizada en la Universidad Politécnica da Cataluña como integrantes de su proceso de diseño. Este trabajo inicia con una breve introducción al tema de la fotocomposición a respecto de sus orígenes y evolución en el arte dadaísta y surrealista, presentando una continuidad con el cubismo de Picasso y posteriormente las fotocomposiciones de David Hockney que fueron sin duda una referencia al trabajo de Enric Miralles. Las fotocomposiciones de Enric Miralles se confguran como un medio investigativo y representativo dentro del universo de la arquitectura y de su proceso de diseño. El análisis de sus fotocomposiciones es de fundamental relevancia para el acercamiento de las asignaturas de representación gráfca y de proyectación con la puesta en valor en los procesos creativos

  13. Los Problemas de las Gimnasias

    Directory of Open Access Journals (Sweden)

    Viviana Bulus Rossini

    2011-11-01

    Full Text Available Este artículo se propone sintetizar las principales ideas de las ponencias presentadas en la Mesa de Trabajo "Los problemas de las Gimnasias" que se desarrolló en el marco del 9º Congreso Argentino y 4º Latinoamericano de Educación Física y Ciencias realizado en la Universidad Nacional de La Plata en junio de 2011. Todas ellas coinciden en pensar a la Gimnasia como parte fundamental de una Educación Corporal que se proponga transmitir aquellos saberes corporales culturalmente significativos que sirvan de herramientas para el conocimiento, el cuidado y la autogestión del propio cuerpo. A partir de estos puntos de coincidencia cada ponencia se explaya en su especificidad: ya sea desde la investigación y el análisis de los textos de divulgación y académicos, de los discursos de profesores, alumnos y practicantes; desde el análisis crítico del campo gímnico actual; desde propuestas bien concretas de trabajo de las gimnasias, en este caso la gimnasia aeróbica, la gimnasia funcional, el Método Pilates, o la gimnasia en el Profesorado en Educación Física con proyección en la gimnasia escolar

  14. Del aislacionismo a las alianzas

    Directory of Open Access Journals (Sweden)

    Michavila, Francisco

    2011-12-01

    Full Text Available The universities have passed to look within to seek opportunities for collaboration, either between universities or with public and private entities in other areas. The construction of the European Higher Education Area and the internationalization of universities have made cooperation and partnerships an emerging value for the future. In a scenario of competition between universities, partnerships are essential for the multiplication of opportunities in attracting top students, teachers and university researchers, and to improve the results of university activity.

    Las universidades han pasado de mirar al interior a la búsqueda de oportunidades para la colaboración, sea entre universidades o con entidades públicas y privadas de otros ámbitos. La construcción del Espacio Europeo de Educación Superior y la internacionalización de las universidades han convertido a la cooperación y las alianzas en un valor emergente para el futuro. En un escenario de competencia entre universidades, las alianzas son indispensables para la multiplicación de oportunidades en la captación de los mejores estudiantes, profesores e investigadores universitarios, así como para favorecer los resultados de la actividad universitarias.

  15. Violencia intrafamiliar contra las mujeres

    Directory of Open Access Journals (Sweden)

    Omar Huertas Díaz

    2012-07-01

    Full Text Available El presente texto pretende abordar el tema de violencia en contra de las mujeres desde la perspectiva familiar, es decir la violencia intrafamiliar o también llamada doméstica, entendida y afirmada desde la Corte Constitucional, como aquella violencia que causa daño o maltrato físico, psíquico o sexual, trato cruel, intimidatorio o degradante, amenaza, agravio, ofensa o cualquier otra forma de agresión, producida entre miembros de una familia, llámese cónyuge o compañero permanente, padre o madre, aunque no convivan bajo el mismo techo, ascendientes o descendientes de estos incluyendo hijos adoptivos, y en general todas las personas que de manera permanente se hallaren integrados a la unidad doméstica. Junto a ello el documento explicará de qué forma afecta esta particular clase de violencia a las mujeres, los abusos y malos tratos que la materializan, y las medidas adoptadas por el Estado Colombiano para enfrentar sus consecuencias, y por último la presentación de los conceptos que maneja la Corte Constitucional en la interpretación de dicho fenómeno.

  16. Impacto ambiental de las incineradoras

    OpenAIRE

    Saiz de Omeñaca, José Antonio; Saiz de Omeñaca, Jesús

    1996-01-01

    The pollution caused by incineration plants may be considered quite substantial. It is important to know the risks in order to be able to reduce the impacts.

    La contaminación causada por las incineradoras es, potencialmente, muy importante. Conocer los riesgos ayuda a tomar decisiones para minimizar los impactos.

  17. Privacy preserving data anonymization of spontaneous ADE reporting system dataset.

    Science.gov (United States)

    Lin, Wen-Yang; Yang, Duen-Chuan; Wang, Jie-Teng

    2016-07-18

    To facilitate long-term safety surveillance of marketing drugs, many spontaneously reporting systems (SRSs) of ADR events have been established world-wide. Since the data collected by SRSs contain sensitive personal health information that should be protected to prevent the identification of individuals, it procures the issue of privacy preserving data publishing (PPDP), that is, how to sanitize (anonymize) raw data before publishing. Although much work has been done on PPDP, very few studies have focused on protecting privacy of SRS data and none of the anonymization methods is favorable for SRS datasets, due to which contain some characteristics such as rare events, multiple individual records, and multi-valued sensitive attributes. We propose a new privacy model called MS(k, θ (*) )-bounding for protecting published spontaneous ADE reporting data from privacy attacks. Our model has the flexibility of varying privacy thresholds, i.e., θ (*) , for different sensitive values and takes the characteristics of SRS data into consideration. We also propose an anonymization algorithm for sanitizing the raw data to meet the requirements specified through the proposed model. Our algorithm adopts a greedy-based clustering strategy to group the records into clusters, conforming to an innovative anonymization metric aiming to minimize the privacy risk as well as maintain the data utility for ADR detection. Empirical study was conducted using FAERS dataset from 2004Q1 to 2011Q4. We compared our model with four prevailing methods, including k-anonymity, (X, Y)-anonymity, Multi-sensitive l-diversity, and (α, k)-anonymity, evaluated via two measures, Danger Ratio (DR) and Information Loss (IL), and considered three different scenarios of threshold setting for θ (*) , including uniform setting, level-wise setting and frequency-based setting. We also conducted experiments to inspect the impact of anonymized data on the strengths of discovered ADR signals. With all three

  18. Colombia: Escenario de las desigualdades

    Directory of Open Access Journals (Sweden)

    Amylkar Acosta Medina

    2013-01-01

    Full Text Available La distribución del ingreso de la sociedad colombiana dista de ser equitativa. La desigualdad se manifiesta en todos los órdenes de la vida nacional, como consecuencia de un enfoque histórico de las políticas públicas, hacia la regionalización del desarrollo. Las costas Caribe y Pacífica colombianas, presentan las cifras más críticas en materia de desempeño económico y social, situación que se opone al contexto internacional, en donde las zonas costeras son justamente las regiones más prósperas. Los bajo niveles de integración y crecimiento económico regionales, inherentes al atraso en infraestructura de transporte, permiten identificar la recurrente centralización del poder público, que privilegia el desarrollo de unas regiones, mientras otras permanecen rezagadas. Esta estrategia refuerza el desequilibrio económico, social y político, y redunda en la ineficiencia económica nacional. En consecuencia, si bien los índices de producción van en aumento, las tasas de desempleo y la informalidad laboral también muestran una tendencia creciente, teniendo en cuenta que los sectores productivos de mayor crecimiento son aquellos intensivos en capital y no tan intensivos en recurso humano. Lo anterior, sumado al creciente desplazamiento forzado de la población rural que tienen origen en la desigualdad, el desempleo y la pobreza. Así pues, el país enfrenta un gran reto en el sentido de reorientar las políticas de desarrollo, que corrijan los desequilibrios interregionales e intraregionales. Esto, acompañado de la transparencia en el ejercicio de la Política y la integridad en la administración pública, elementos indispensables en la estrategia de desarrollo que requiere el país.

  19. Nota de las editoras invitadas

    Directory of Open Access Journals (Sweden)

    María Eugenia Ibarra Melo

    2016-01-01

    Full Text Available Los estudios de género cada vez adquieren mayor importancia en las Ciencias Sociales, en general y en la sociología, en particular, sobre todo porque en ellos se tratan problemas contemporáneos ligados a la sexualidad, la diversidad, la estructuración de las relaciones de poder, los cuales han sido expresados en público a través de las resistencias que despliegan sus principales afectados. Como demuestran los artículos publicados en revistas dedicadas a estas reflexiones[1], estos estudios se ocupan de las desigualdades sociales entre hombres y mujeres, de la producción de exclusiones y de las identidades diversas que no se ajustan al modelo de género. [1] Entre otras revistas consultadas están: la Revista Interdisciplinaria de Estudios de Género, de El Colegio de México; la Revista de Estudios de Género. La Ventana, de la Universidad de Guadalajara; la Revista Punto G(énero del Núcleo género y sociedad Julieta Kirkwood, de la Universidad de Chile; Nomadías. Revista del Centro de Estudios de Género y Cultura de América Latina, de la Universidad de Chile; Íconos. Revista de Ciencias Sociales, de la Facultad Latinoamericana de Ciencias Sociales. Ecuador, que dedicó tres números a temas relacionados: n.° 45: Nuevas voces feministas en América Latina: ¿continuidades, rupturas, resistencias?, n.° 39: ¿Cómo se piensa lo Queer en América Latina, n.° 35: Ciudadanías y sexualidades en América Latina; La Aljaba Segunda época: Revista de Estudios de la Mujer, de las universidades del Comahue, de Luján y de La Pampa (Argentina, y Cadernos Pagu, Nucleo de estudos de gênero Pagu, de la Universidad de Campinas, de Brasil. Además de las revistas colombianas La Manzana de la Discordia, Sociedad y Economía, CS y otras que recientemente dedicaron monográficos a estos estudios.

  20. Standardization of GIS datasets for emergency preparedness of NPPs

    International Nuclear Information System (INIS)

    Saindane, Shashank S.; Suri, M.M.K.; Otari, Anil; Pradeepkumar, K.S.

    2012-01-01

    Probability of a major nuclear accident which can lead to large scale release of radioactivity into environment is extremely small by the incorporation of safety systems and defence-in-depth philosophy. Nevertheless emergency preparedness for implementation of counter measures to reduce the consequences are required for all major nuclear facilities. Iodine prophylaxis, Sheltering, evacuation etc. are protective measures to be implemented for members of public in the unlikely event of any significant releases from nuclear facilities. Bhabha Atomic Research Centre has developed a GIS supported Nuclear Emergency Preparedness Program. Preparedness for Response to Nuclear emergencies needs geographical details of the affected locations specially Nuclear Power Plant Sites and nearby public domain. Geographical information system data sets which the planners are looking for will have appropriate details in order to take decision and mobilize the resources in time and follow the Standard Operating Procedures. Maps are 2-dimensional representations of our real world and GIS makes it possible to manipulate large amounts of geo-spatially referenced data and convert it into information. This has become an integral part of the nuclear emergency preparedness and response planning. This GIS datasets consisting of layers such as village settlements, roads, hospitals, police stations, shelters etc. is standardized and effectively used during the emergency. The paper focuses on the need of standardization of GIS datasets which in turn can be used as a tool to display and evaluate the impact of standoff distances and selected zones in community planning. It will also highlight the database specifications which will help in fast processing of data and analysis to derive useful and helpful information. GIS has the capability to store, manipulate, analyze and display the large amount of required spatial and tabular data. This study intends to carry out a proper response and preparedness

  1. Forest restoration: a global dataset for biodiversity and vegetation structure.

    Science.gov (United States)

    Crouzeilles, Renato; Ferreira, Mariana S; Curran, Michael

    2016-08-01

    Restoration initiatives are becoming increasingly applied around the world. Billions of dollars have been spent on ecological restoration research and initiatives, but restoration outcomes differ widely among these initiatives in part due to variable socioeconomic and ecological contexts. Here, we present the most comprehensive dataset gathered to date on forest restoration. It encompasses 269 primary studies across 221 study landscapes in 53 countries and contains 4,645 quantitative comparisons between reference ecosystems (e.g., old-growth forest) and degraded or restored ecosystems for five taxonomic groups (mammals, birds, invertebrates, herpetofauna, and plants) and five measures of vegetation structure reflecting different ecological processes (cover, density, height, biomass, and litter). We selected studies that (1) were conducted in forest ecosystems; (2) had multiple replicate sampling sites to measure indicators of biodiversity and/or vegetation structure in reference and restored and/or degraded ecosystems; and (3) used less-disturbed forests as a reference to the ecosystem under study. We recorded (1) latitude and longitude; (2) study year; (3) country; (4) biogeographic realm; (5) past disturbance type; (6) current disturbance type; (7) forest conversion class; (8) restoration activity; (9) time that a system has been disturbed; (10) time elapsed since restoration started; (11) ecological metric used to assess biodiversity; and (12) quantitative value of the ecological metric of biodiversity and/or vegetation structure for reference and restored and/or degraded ecosystems. These were the most common data available in the selected studies. We also estimated forest cover and configuration in each study landscape using a recently developed 1 km consensus land cover dataset. We measured forest configuration as the (1) mean size of all forest patches; (2) size of the largest forest patch; and (3) edge:area ratio of forest patches. Global analyses of the

  2. BanglaLekha-Isolated: A multi-purpose comprehensive dataset of Handwritten Bangla Isolated characters

    Directory of Open Access Journals (Sweden)

    Mithun Biswas

    2017-06-01

    Full Text Available BanglaLekha-Isolated, a Bangla handwritten isolated character dataset is presented in this article. This dataset contains 84 different characters comprising of 50 Bangla basic characters, 10 Bangla numerals and 24 selected compound characters. 2000 handwriting samples for each of the 84 characters were collected, digitized and pre-processed. After discarding mistakes and scribbles, 1,66,105 handwritten character images were included in the final dataset. The dataset also includes labels indicating the age and the gender of the subjects from whom the samples were collected. This dataset could be used not only for optical handwriting recognition research but also to explore the influence of gender and age on handwriting. The dataset is publicly available at https://data.mendeley.com/datasets/hf6sf8zrkc/2.

  3. A Research Graph dataset for connecting research data repositories using RD-Switchboard.

    Science.gov (United States)

    Aryani, Amir; Poblet, Marta; Unsworth, Kathryn; Wang, Jingbo; Evans, Ben; Devaraju, Anusuriya; Hausstein, Brigitte; Klas, Claus-Peter; Zapilko, Benjamin; Kaplun, Samuele

    2018-05-29

    This paper describes the open access graph dataset that shows the connections between Dryad, CERN, ANDS and other international data repositories to publications and grants across multiple research data infrastructures. The graph dataset was created using the Research Graph data model and the Research Data Switchboard (RD-Switchboard), a collaborative project by the Research Data Alliance DDRI Working Group (DDRI WG) with the aim to discover and connect the related research datasets based on publication co-authorship or jointly funded grants. The graph dataset allows researchers to trace and follow the paths to understanding a body of work. By mapping the links between research datasets and related resources, the graph dataset improves both their discovery and visibility, while avoiding duplicate efforts in data creation. Ultimately, the linked datasets may spur novel ideas, facilitate reproducibility and re-use in new applications, stimulate combinatorial creativity, and foster collaborations across institutions.

  4. El lavado y cuidado de las manos

    OpenAIRE

    Troconis Ganimez, J.E.

    2003-01-01

    Resumen En el presente trabajo hacemos notar la importancia del lavado y cuidado de las manos en el personal del equipo de salud odontológico, aquí incluimos las técnicas correctas del lavado de las manos, la importancia del jabón y los antisépticos que han sido incluidos en los mismos, la técnica como deben ser secadas las manos una vez lavadas, por último hacemos algunas consideraciones en el cuidado general de las manos, mantenimiento de las uñas y el uso de cremas hidratantes.

  5. Provenance of Earth Science Datasets - How Deep Should One Go?

    Science.gov (United States)

    Ramapriyan, H.; Manipon, G. J. M.; Aulenbach, S.; Duggan, B.; Goldstein, J.; Hua, H.; Tan, D.; Tilmes, C.; Wilson, B. D.; Wolfe, R.; Zednik, S.

    2015-12-01

    For credibility of scientific research, transparency and reproducibility are essential. This fundamental tenet has been emphasized for centuries, and has been receiving increased attention in recent years. The Office of Management and Budget (2002) addressed reproducibility and other aspects of quality and utility of information from federal agencies. Specific guidelines from NASA (2002) are derived from the above. According to these guidelines, "NASA requires a higher standard of quality for information that is considered influential. Influential scientific, financial, or statistical information is defined as NASA information that, when disseminated, will have or does have clear and substantial impact on important public policies or important private sector decisions." For information to be compliant, "the information must be transparent and reproducible to the greatest possible extent." We present how the principles of transparency and reproducibility have been applied to NASA data supporting the Third National Climate Assessment (NCA3). The depth of trace needed of provenance of data used to derive conclusions in NCA3 depends on how the data were used (e.g., qualitatively or quantitatively). Given that the information is diligently maintained in the agency archives, it is possible to trace from a figure in the publication through the datasets, specific files, algorithm versions, instruments used for data collection, and satellites, as well as the individuals and organizations involved in each step. Such trace back permits transparency and reproducibility.

  6. A dataset from bottom trawl survey around Taiwan

    Directory of Open Access Journals (Sweden)

    Kwang-tsao Shao

    2012-05-01

    Full Text Available Bottom trawl fishery is one of the most important coastal fisheries in Taiwan both in production and economic values. However, its annual production started to decline due to overfishing since the 1980s. Its bycatch problem also damages the fishery resource seriously. Thus, the government banned the bottom fishery within 3 nautical miles along the shoreline in 1989. To evaluate the effectiveness of this policy, a four year survey was conducted from 2000–2003, in the waters around Taiwan and Penghu (Pescadore Islands, one region each year respectively. All fish specimens collected from trawling were brought back to lab for identification, individual number count and body weight measurement. These raw data have been integrated and established in Taiwan Fish Database (http://fishdb.sinica.edu.tw. They have also been published through TaiBIF (http://taibif.tw, FishBase and GBIF (website see below. This dataset contains 631 fish species and 3,529 records, making it the most complete demersal fish fauna and their temporal and spatial distributional data on the soft marine habitat in Taiwan.

  7. Integrated interpretation of overlapping AEM datasets achieved through standardisation

    Science.gov (United States)

    Sørensen, Camilla C.; Munday, Tim; Heinson, Graham

    2015-12-01

    Numerous airborne electromagnetic surveys have been acquired in Australia using a variety of systems. It is not uncommon to find two or more surveys covering the same ground, but acquired using different systems and at different times. Being able to combine overlapping datasets and get a spatially coherent resistivity-depth image of the ground can assist geological interpretation, particularly when more subtle geophysical responses are important. Combining resistivity-depth models obtained from the inversion of airborne electromagnetic (AEM) data can be challenging, given differences in system configuration, geometry, flying height and preservation or monitoring of system acquisition parameters such as waveform. In this study, we define and apply an approach to overlapping AEM surveys, acquired by fixed wing and helicopter time domain electromagnetic (EM) systems flown in the vicinity of the Goulds Dam uranium deposit in the Frome Embayment, South Australia, with the aim of mapping the basement geometry and the extent of the Billeroo palaeovalley. Ground EM soundings were used to standardise the AEM data, although results indicated that only data from the REPTEM system needed to be corrected to bring the two surveys into agreement and to achieve coherent spatial resistivity-depth intervals.

  8. A global dataset of sub-daily rainfall indices

    Science.gov (United States)

    Fowler, H. J.; Lewis, E.; Blenkinsop, S.; Guerreiro, S.; Li, X.; Barbero, R.; Chan, S.; Lenderink, G.; Westra, S.

    2017-12-01

    It is still uncertain how hydrological extremes will change with global warming as we do not fully understand the processes that cause extreme precipitation under current climate variability. The INTENSE project is using a novel and fully-integrated data-modelling approach to provide a step-change in our understanding of the nature and drivers of global precipitation extremes and change on societally relevant timescales, leading to improved high-resolution climate model representation of extreme rainfall processes. The INTENSE project is in conjunction with the World Climate Research Programme (WCRP)'s Grand Challenge on 'Understanding and Predicting Weather and Climate Extremes' and the Global Water and Energy Exchanges Project (GEWEX) Science questions. A new global sub-daily precipitation dataset has been constructed (data collection is ongoing). Metadata for each station has been calculated, detailing record lengths, missing data, station locations. A set of global hydroclimatic indices have been produced based upon stakeholder recommendations including indices that describe maximum rainfall totals and timing, the intensity, duration and frequency of storms, frequency of storms above specific thresholds and information about the diurnal cycle. This will provide a unique global data resource on sub-daily precipitation whose derived indices will be freely available to the wider scientific community.

  9. The Centennial Trends Greater Horn of Africa precipitation dataset

    Science.gov (United States)

    Funk, Chris; Nicholson, Sharon E.; Landsfeld, Martin F.; Klotter, Douglas; Peterson, Pete J.; Harrison, Laura

    2015-01-01

    East Africa is a drought prone, food and water insecure region with a highly variable climate. This complexity makes rainfall estimation challenging, and this challenge is compounded by low rain gauge densities and inhomogeneous monitoring networks. The dearth of observations is particularly problematic over the past decade, since the number of records in globally accessible archives has fallen precipitously. This lack of data coincides with an increasing scientific and humanitarian need to place recent seasonal and multi-annual East African precipitation extremes in a deep historic context. To serve this need, scientists from the UC Santa Barbara Climate Hazards Group and Florida State University have pooled their station archives and expertise to produce a high quality gridded ‘Centennial Trends’ precipitation dataset. Additional observations have been acquired from the national meteorological agencies and augmented with data provided by other universities. Extensive quality control of the data was carried out and seasonal anomalies interpolated using kriging. This paper documents the CenTrends methodology and data.

  10. Dataset on daytime outdoor thermal comfort for Belo Horizonte, Brazil.

    Science.gov (United States)

    Hirashima, Simone Queiroz da Silveira; Assis, Eleonora Sad de; Nikolopoulou, Marialena

    2016-12-01

    This dataset describe microclimatic parameters of two urban open public spaces in the city of Belo Horizonte, Brazil; physiological equivalent temperature (PET) index values and the related subjective responses of interviewees regarding thermal sensation perception and preference and thermal comfort evaluation. Individuals and behavioral characteristics of respondents were also presented. Data were collected at daytime, in summer and winter, 2013. Statistical treatment of this data was firstly presented in a PhD Thesis ("Percepção sonora e térmica e avaliação de conforto em espaços urbanos abertos do município de Belo Horizonte - MG, Brasil" (Hirashima, 2014) [1]), providing relevant information on thermal conditions in these locations and on thermal comfort assessment. Up to now, this data was also explored in the article "Daytime Thermal Comfort in Urban Spaces: A Field Study in Brazil" (Hirashima et al., in press) [2]. These references are recommended for further interpretation and discussion.

  11. Statistically and Computationally Efficient Estimating Equations for Large Spatial Datasets

    KAUST Repository

    Sun, Ying

    2014-11-07

    For Gaussian process models, likelihood based methods are often difficult to use with large irregularly spaced spatial datasets, because exact calculations of the likelihood for n observations require O(n3) operations and O(n2) memory. Various approximation methods have been developed to address the computational difficulties. In this paper, we propose new unbiased estimating equations based on score equation approximations that are both computationally and statistically efficient. We replace the inverse covariance matrix that appears in the score equations by a sparse matrix to approximate the quadratic forms, then set the resulting quadratic forms equal to their expected values to obtain unbiased estimating equations. The sparse matrix is constructed by a sparse inverse Cholesky approach to approximate the inverse covariance matrix. The statistical efficiency of the resulting unbiased estimating equations are evaluated both in theory and by numerical studies. Our methods are applied to nearly 90,000 satellite-based measurements of water vapor levels over a region in the Southeast Pacific Ocean.

  12. Statistically and Computationally Efficient Estimating Equations for Large Spatial Datasets

    KAUST Repository

    Sun, Ying; Stein, Michael L.

    2014-01-01

    For Gaussian process models, likelihood based methods are often difficult to use with large irregularly spaced spatial datasets, because exact calculations of the likelihood for n observations require O(n3) operations and O(n2) memory. Various approximation methods have been developed to address the computational difficulties. In this paper, we propose new unbiased estimating equations based on score equation approximations that are both computationally and statistically efficient. We replace the inverse covariance matrix that appears in the score equations by a sparse matrix to approximate the quadratic forms, then set the resulting quadratic forms equal to their expected values to obtain unbiased estimating equations. The sparse matrix is constructed by a sparse inverse Cholesky approach to approximate the inverse covariance matrix. The statistical efficiency of the resulting unbiased estimating equations are evaluated both in theory and by numerical studies. Our methods are applied to nearly 90,000 satellite-based measurements of water vapor levels over a region in the Southeast Pacific Ocean.

  13. Challenges and Experiences of Building Multidisciplinary Datasets across Cultures

    Science.gov (United States)

    Jamiyansharav, K.; Laituri, M.; Fernandez-Gimenez, M.; Fassnacht, S. R.; Venable, N. B. H.; Allegretti, A. M.; Reid, R.; Baival, B.; Jamsranjav, C.; Ulambayar, T.; Linn, S.; Angerer, J.

    2017-12-01

    Efficient data sharing and management are key challenges to multidisciplinary scientific research. These challenges are further complicated by adding a multicultural component. We address the construction of a complex database for social-ecological analysis in Mongolia. Funded by the National Science Foundation (NSF) Dynamics of Coupled Natural and Human (CNH) Systems, the Mongolian Rangelands and Resilience (MOR2) project focuses on the vulnerability of Mongolian pastoral systems to climate change and adaptive capacity. The MOR2 study spans over three years of fieldwork in 36 paired districts (Soum) from 18 provinces (Aimag) of Mongolia that covers steppe, mountain forest steppe, desert steppe and eastern steppe ecological zones. Our project team is composed of hydrologists, social scientists, geographers, and ecologists. The MOR2 database includes multiple ecological, social, meteorological, geospatial and hydrological datasets, as well as archives of original data and survey in multiple formats. Managing this complex database requires significant organizational skills, attention to detail and ability to communicate within collective team members from diverse disciplines and across multiple institutions in the US and Mongolia. We describe the database's rich content, organization, structure and complexity. We discuss lessons learned, best practices and recommendations for complex database management, sharing, and archiving in creating a cross-cultural and multi-disciplinary database.

  14. Automated Fault Interpretation and Extraction using Improved Supplementary Seismic Datasets

    Science.gov (United States)

    Bollmann, T. A.; Shank, R.

    2017-12-01

    During the interpretation of seismic volumes, it is necessary to interpret faults along with horizons of interest. With the improvement of technology, the interpretation of faults can be expedited with the aid of different algorithms that create supplementary seismic attributes, such as semblance and coherency. These products highlight discontinuities, but still need a large amount of human interaction to interpret faults and are plagued by noise and stratigraphic discontinuities. Hale (2013) presents a method to improve on these datasets by creating what is referred to as a Fault Likelihood volume. In general, these volumes contain less noise and do not emphasize stratigraphic features. Instead, planar features within a specified strike and dip range are highlighted. Once a satisfactory Fault Likelihood Volume is created, extraction of fault surfaces is much easier. The extracted fault surfaces are then exported to interpretation software for QC. Numerous software packages have implemented this methodology with varying results. After investigating these platforms, we developed a preferred Automated Fault Interpretation workflow.

  15. Privacy-preserving record linkage on large real world datasets.

    Science.gov (United States)

    Randall, Sean M; Ferrante, Anna M; Boyd, James H; Bauer, Jacqueline K; Semmens, James B

    2014-08-01

    Record linkage typically involves the use of dedicated linkage units who are supplied with personally identifying information to determine individuals from within and across datasets. The personally identifying information supplied to linkage units is separated from clinical information prior to release by data custodians. While this substantially reduces the risk of disclosure of sensitive information, some residual risks still exist and remain a concern for some custodians. In this paper we trial a method of record linkage which reduces privacy risk still further on large real world administrative data. The method uses encrypted personal identifying information (bloom filters) in a probability-based linkage framework. The privacy preserving linkage method was tested on ten years of New South Wales (NSW) and Western Australian (WA) hospital admissions data, comprising in total over 26 million records. No difference in linkage quality was found when the results were compared to traditional probabilistic methods using full unencrypted personal identifiers. This presents as a possible means of reducing privacy risks related to record linkage in population level research studies. It is hoped that through adaptations of this method or similar privacy preserving methods, risks related to information disclosure can be reduced so that the benefits of linked research taking place can be fully realised. Copyright © 2013 Elsevier Inc. All rights reserved.

  16. Genomics dataset on unclassified published organism (patent US 7547531

    Directory of Open Access Journals (Sweden)

    Mohammad Mahfuz Ali Khan Shawan

    2016-12-01

    Full Text Available Nucleotide (DNA sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. With the intention that, DNA sequence analysis is very crucial to learn about hierarchical classification of that particular organism. This dataset (patent US 7547531 is chosen to simplify all the complex raw data buried in undisclosed DNA sequences which help to open doors for new collaborations. In this data, a total of 48 unidentified DNA sequences from patent US 7547531 were selected and their complete sequences were retrieved from NCBI BioSample database. Quick response (QR code of those DNA sequences was constructed by DNA BarID tool. QR code is useful for the identification and comparison of isolates with other organisms. AT/GC content of the DNA sequences was determined using ENDMEMO GC Content Calculator, which indicates their stability at different temperature. The highest GC content was observed in GP445188 (62.5% which was followed by GP445198 (61.8% and GP445189 (59.44%, while lowest was in GP445178 (24.39%. In addition, New England BioLabs (NEB database was used to identify cleavage code indicating the 5, 3 and blunt end and enzyme code indicating the methylation site of the DNA sequences was also shown. These data will be helpful for the construction of the organisms’ hierarchical classification, determination of their phylogenetic and taxonomic position and revelation of their molecular characteristics.

  17. Structural dataset for the PPARγ V290M mutant

    Directory of Open Access Journals (Sweden)

    Ana C. Puhl

    2016-06-01

    Full Text Available Loss-of-function mutation V290M in the ligand-binding domain of the peroxisome proliferator activated receptor γ (PPARγ is associated with a ligand resistance syndrome (PLRS, characterized by partial lipodystrophy and severe insulin resistance. In this data article we discuss an X-ray diffraction dataset that yielded the structure of PPARγ LBD V290M mutant refined at 2.3 Å resolution, that allowed building of 3D model of the receptor mutant with high confidence and revealed continuous well-defined electron density for the partial agonist diclofenac bound to hydrophobic pocket of the PPARγ. These structural data provide significant insights into molecular basis of PLRS caused by V290M mutation and are correlated with the receptor disability of rosiglitazone binding and increased affinity for corepressors. Furthermore, our structural evidence helps to explain clinical observations which point out to a failure to restore receptor function by the treatment with a full agonist of PPARγ, rosiglitazone.

  18. Las mujeres, las guerras y el derecho internacional humanitario

    Directory of Open Access Journals (Sweden)

    Marta Postigo Asenjo

    2011-12-01

    Full Text Available La violencia sexual se emplea como instrumento de intimidación, castigo y terror, e incluso coadyuvante de limpieza étnica, en los conflictos armados. En las últimas décadas, se han producido importantes avances en la inclusión de la violación entre los crímenes contra el derecho internacional humanitario. Sin embargo, aún son necesarias acciones más eficaces para evitar que se produzcan violaciones sistemáticas en las zonas en conflictos y post-conflicto. Este trabajo destaca la importancia que tiene la lucha por la igualdad y la incorporación de las mujeres en los procesos de toma de decisiones para combatir esta lacra y asegurar la paz y la estabilidad.  Sexual violence has long been used as a weapon of war, with the purpose of intimidating, injuring and punishing civilians, and even as an ethnic cleansing adjuvant, in armed conflicts. In the last decades, though, there have been important advances towards the definition and prosecution of rape as a crime against international humanitarian law. Notwithstanding, more effective measures are needed to protect women from this heinous crime in the conflict zones and post-conflict. This article stresses the need to keep struggling for gender equality and improving women’s participation in decision making processes to achieve peace and stability

  19. REPASO DE LAS DROGAS ANTIAMIBIANAS

    Directory of Open Access Journals (Sweden)

    Alberto Albornoz-Plata

    1978-10-01

    Full Text Available

    La amibiasis tiene importancia grande en nuestro medio, ya que puede considerarse epidémica o endémica en muchas zonas. No es raro observar que hay numerosos casos
    graves, incluso de diagnóstico difícil, que solamente un tratamiento de prueba permite aclarar.
    Se han ensayado no pocos remedios para el tratamiento de esta enfermedad. Pero. hasta hace poco, con el advenimiento de los derivados del IMIDAZOL, se han abierto nuevas perspectivas terapéuticas. Es lo que se está comprobando a diario, hasta el punto de que la mayoría de las drogas que se utilizaban antes, han quedado desplazadas por las recientemente descubiertas. En general, las drogas comprenden ahora tres tipos diferentes:
    1 - Amebicidas Intestinales o Luminales:
    Han sido las más empleadasen el curso de los años. Unicamente actúan en la luz del intestino. De donde, el nombre de luminales. Tienen como característica que no se absorben y que obran localmente, en forma directa contra la amiba alojada en la mucosa intestinal. Por ese motivo, no pueden utilizarse en las formas extraintestinales de la
    amibiasis, por ejemplo, en el absceso hepático.
    2 . Amebicidas Tisulares:
    Son drogas que sí se absorben y que van a obrar en los tejidos, especialmente a nivel hepático. Aún se utilizan y conservan bastante importancia terapéutica.
    3 - Amebicidas Tisulares y Luminales:
    Son, realmente, las principales drogas en la actualidad, porque obran a todo nivel. A nivel de la mucosa intestinal, cuando el trofozoito ha formado úlcera y donde pueden
    encontrarse quistes, y a distancia del intestino, por ejemplo, en la amibiasis hepática, en la amibiasis cutánea etc.
    De manera que este tercer tipo de drogas tiene doble acción. Su efecto local les concede importancia para el tratamiento de los portadores asintomáticos de quistes...

  20. La positividad de las lenguas

    Directory of Open Access Journals (Sweden)

    José Antonio Jódar Sánchez

    2015-03-01

    Full Text Available Como seres humanos, todos experimentamos sentimientos placenteros, pero también desgradables, que quedan reflejados en la lengua. Investigaciones recientes confirman que el lenguaje es naturalmente más positivo que negativo. De diez lenguas evaluadas en Dodds y col. (2015, el español es la más positiva, seguido del portugués y el inglés, de acuerdo con las evaluaciones sobre el grado de positividad de palabras en esas lenguas. También obras literarias pueden ser evaluadas de forma similar por el llamado “hedonímetro”. Todo ello apunta a que este positivismo natural en las lenguas se podría explicar a partir de nuestro desarrollo social actual.

  1. Civilidad en las relaciones laborales

    Directory of Open Access Journals (Sweden)

    Álvaro Enrique Figueroa Bolaño

    2013-07-01

    Full Text Available ResumenA pesar que actualmente no resultan pocas las institucionesque por fuera del ámbito propio del DerechoLaboral vienen siendo aplicadas a las Relaciones Laboralesen Colombia, más que la ley y sí mucho la doctrina yla jurisprudencia no solamente nacional sino extranjera,han venido revaluando el concepto tradicional de Empresa,sacándolo de concepciones eminentemente comercialistasy economicistas para concebir que en la actividadempresarial moderna, en especial, en los asuntos laborales,debe primar el lado humano del trabajo, dicho enotras palabras, una Civilidad en las Relaciones Laborales.El artículo busca contribuir en el rescate o adaptaciónde la legislación laboral frente a las nuevas formas decontratación que eventualmente pueden ser utilizadas endetrimento del Derecho Laboral y de la Seguridad Socialcolombianos.Palabras clave: Relaciones laborales, Trabajo realidad,Autónomo, Independiente, Unidad de explotacióneconómica. AbstractAlthough not currently few institutions are outsidethe proper scope of labor law are being applied to the LabourRelations in Colombia, more than the law and a lotof doctrine and not only domestic but foreign jurisprudence,have been reassessing the traditional concept ofEnterprise, removing eminently commercialist and economisticconceptions conceiving that in modern business,especially in labor matters, should prioritize the humanside of the work, in other words, a Civility in Labor Relations.The article seeks to contribute to the rescue or adaptationof labor law against new forms of procurementthat can eventually be used to the detriment of the LaborLaw and Social Security Colombians.Keywords: Industrial relations, Actually work, Autonomous,Independent, Economic exploitation unit.

  2. Las aguas termominerales de Tabio

    Directory of Open Access Journals (Sweden)

    Antonio María Barriga Villalba

    1933-01-01

    Full Text Available La pintoresca población de Tabio, situada en el vallecito que forman dos repliegues de la gran cordillera montañosa que rodea la sabana de Bogotá, es un pueblecito apacible, famoso desde tiempo inmemorial por las aguas termales que brotan en la falda de los cerros situados al occidente, a unos mil metros de la iglesia de la población, en terrenos de propiedad del doctor Pompilio Martínez. Fue fundada Tabio en el año 1761 (*. Se halla a 4°50'50" de latitud norte y 0° 04'00" de longitud del meridiano de Bogotá. Clima seco en la mayor parte del año, temperatura media 14°.5 C. Presión atmosférica 560 mm.; altura sobre el nivel del mar 2.650 metros; temperatura de. ebullición del agua 91°.62 C.; humedad en vapor de agua, por metro cúbico, en los días secos 127,5 gramos por metro cúbico; en los días húmedos, 251,8 gramos por metro cúbico. En el sitio de las fuentes la humedad relativa media es de 38.38 por 100 y la temperatura de la atmósfera varía de 10° C. hasta 23°C.; en los días de sol, hacia las doce del día, la temperatura de la atmósfera cerca de las fuentes es de 20°-22°C.

  3. Los trotacalles de las ciudades

    Directory of Open Access Journals (Sweden)

    Michel Dorais

    2005-01-01

    Full Text Available Se presentan los resultados de una investigación que tuvo como objetivo comprender las conductas, en particular amorosas y sexuales, de jóvenes de la calle de la ciudad de Quebec, Canadá, en relación con los factores de riesgo y los factores de protección concernientes a la transmisión de Infecciones de Transmisión Sexual o Sanguínea y del Virus de inmunodeficiencia Humana. Se utilizó un método cualitativo basado en la recopilación de relatos de experiencias de vida. Tal aproximación ha permitido cernir las experiencias de estos jóvenes a partir de los significados que tenían para ellos y ellas. Las cincuenta entrevistas individuales realizadas para esta investigación involucraron a treinta varones y veinte mujeres con una edad media de poco más de veinte años. Hemos identificado rápidamente dos tipos de jóvenes. El primer tipo es el del "sobreviviente" y el segundo el del "aventurero". El "sobreviviente" de la calle es aquel que sobrevive a la adversidad a través de una historia de vida dolorosa (en particular con su familia. 2El aventurero" ha escogido romper, provisoriamente, con su medio familiar, con el cual no vive, pero con quien mantiene vínculos. En sus prácticas amorosas y sexuales hemos descubierto en los jóvenes seis diferentes estrategias para evitar el uso del condón. Concluimos en que es importante replantear la prevención del SIDA y de las ITSS adaptando el discurso a la realidad vivida por los jóvenes de la calle; es decir, la búsqueda, ante todo, del placer.

  4. Las neuronas de la conciencia

    Directory of Open Access Journals (Sweden)

    Rodrigo Quian Quiroga

    2008-05-01

    Full Text Available El estudio de la conciencia ha sido descrito como uno de los grandes desafíos de la humanidad. Es por ello que los neurocientíficos se han dedicado a estudiar la percepción visual consciente de objetos. Un reciente estudio en humanos –implantados con electrodos intracraneales por motivos clínicos- mostró la presencia de neuronas que disparan exclusivamente cuando las imágenes son percibidas conscientemente.

  5. Oil palm mapping for Malaysia using PALSAR-2 dataset

    Science.gov (United States)

    Gong, P.; Qi, C. Y.; Yu, L.; Cracknell, A.

    2016-12-01

    Oil palm is one of the most productive vegetable oil crops in the world. The main oil palm producing areas are distributed in humid tropical areas such as Malaysia, Indonesia, Thailand, western and central Africa, northern South America, and central America. Increasing market demands, high yields and low production costs of palm oil are the primary factors driving large-scale commercial cultivation of oil palm, especially in Malaysia and Indonesia. Global demand for palm oil has grown exponentially during the last 50 years, and the expansion of oil palm plantations is linked directly to the deforestation of natural forests. Satellite remote sensing plays an important role in monitoring expansion of oil palm. However, optical remote sensing images are difficult to acquire in the Tropics because of the frequent occurrence of thick cloud cover. This problem has led to the use of data obtained by synthetic aperture radar (SAR), which is a sensor capable of all-day/all-weather observation for studies in the Tropics. In this study, the ALOS-2 (Advanced Land Observing Satellite) PALSAR-2 (Phased Array type L-band SAR) datasets for year 2015 were used as an input to a support vector machine (SVM) based machine learning algorithm. Oil palm/non-oil palm samples were collected using a hexagonal equal-area sampling design. High-resolution images in Google Earth and PALSAR-2 imagery were used in human photo-interpretation to separate oil palm from others (i.e. cropland, forest, grassland, shrubland, water, hard surface and bareland). The characteristics of oil palms from various aspects, including PALSAR-2 backscattering coefficients (HH, HV), terrain and climate by using this sample set were further explored to post-process the SVM output. The average accuracy of oil palm type is better than 80% in the final oil palm map for Malaysia.

  6. Automatic aortic root segmentation in CTA whole-body dataset

    Science.gov (United States)

    Gao, Xinpei; Kitslaar, Pieter H.; Scholte, Arthur J. H. A.; Lelieveldt, Boudewijn P. F.; Dijkstra, Jouke; Reiber, Johan H. C.

    2016-03-01

    Trans-catheter aortic valve replacement (TAVR) is an evolving technique for patients with serious aortic stenosis disease. Typically, in this application a CTA data set is obtained of the patient's arterial system from the subclavian artery to the femoral arteries, to evaluate the quality of the vascular access route and analyze the aortic root to determine if and which prosthesis should be used. In this paper, we concentrate on the automated segmentation of the aortic root. The purpose of this study was to automatically segment the aortic root in computed tomography angiography (CTA) datasets to support TAVR procedures. The method in this study includes 4 major steps. First, the patient's cardiac CTA image was resampled to reduce the computation time. Next, the cardiac CTA image was segmented using an atlas-based approach. The most similar atlas was selected from a total of 8 atlases based on its image similarity to the input CTA image. Third, the aortic root segmentation from the previous step was transferred to the patient's whole-body CTA image by affine registration and refined in the fourth step using a deformable subdivision surface model fitting procedure based on image intensity. The pipeline was applied to 20 patients. The ground truth was created by an analyst who semi-automatically corrected the contours of the automatic method, where necessary. The average Dice similarity index between the segmentations of the automatic method and the ground truth was found to be 0.965±0.024. In conclusion, the current results are very promising.

  7. Las Plantas Cultivadas y Sus Plagas.

    Directory of Open Access Journals (Sweden)

    Universidad Nacional de Colombia Facultad de Ciencias Agropecuarias

    1942-12-01

    Full Text Available El artículo contiene en orden alfabético las plantas cultivadas con las diferentes plagas por las cuales se ven afectadas y para las que se han venido buscando mecanismos de control y erradicación. Algunas de estas plantas son: Aguacate, Acacia, Almendro, Algodón, Banana, Cacahuate, Cacao, Cafeto, Caña de azúcar, Caoba, Carambola, Cedro, Granadilla, Limón, Maíz, Tabaco, entre otras.

  8. Bases moleculares de las leucemias agudas

    Directory of Open Access Journals (Sweden)

    G. Martínez Antuña

    2006-04-01

    Full Text Available El gran desarrollo de la biología molecular en los últimos años ha contribuido a un importante avance en los conocimientos relacionados con las bases moleculares de las leucemias agudas (LA. Ademas de profundizar en la biología de estas enfermedades y conocer las bases moleculares, ha renido también gran impacto en mejorar el resultado de los tratamientos y disminuir la toxicidad de las terapias.

  9. GERENCIA DE LAS ORGANIZACIONES EDUCATIVAS

    Directory of Open Access Journals (Sweden)

    Evaristo Méndez Quintero

    2012-11-01

    Full Text Available Este artículo tiene por objetivos: 1 analizar el concepto de gerencia académica o educativa, 2 analizar la evolución histórica y coyuntural de esta disciplina en las instituciones educativas, en particular de la Universidad del Zulia. 3 proponer un modelo de gestión del cambio educativo basado en la innovación. La metodología utilizada fue cualitativa, con un diseño documental, analítico, histórico y de prospectiva. Se concluye que las organizaciones educativas, especialmente las universidades presentan una gerencia ejecutiva, normativa, empirista y voluntarista que si bien ha dado resultados en el tiempo, impide asimilar los cambios tecnológicos, científicos, epistemológicos y organizacionales necesarios para que puedan estar a la vanguardia del desarrollo. Se propone la constitución de un sistema gerencial estratégico que permita lograr la eficacia, la eficiencia, la efectividad y la calidad de la educación.

  10. Las organizaciones redefinen su futuro

    Directory of Open Access Journals (Sweden)

    Marcelo Manucci

    2015-01-01

    Full Text Available La propuesta considera la comunicación estratégica como un espacio tan rico como incierto y donde las subjetividades en interacción van trazando una red especial de conceptos. En este trabajo se explora una metodología para el desarrollo de intervenciones en comunicación. El modelo está basado en una plataforma de tecnología que permite, a través de un software, analizar alternativas y trazar nuevos cursos de acción para las acciones de la organización. El modelo consta de cuatro estructuras (matrices. Tres son de aplicación de procesos internos y la cuarta es una estructura de control de los resultados. Se concluye en que el encuentro entre la cultura y la tecnología en las organizaciones permite una plataforma para trazar caminos alternativos y construir el futuro a partir de subjetividades compartidas.

  11. Calidad del aire interior en las escuelas

    Science.gov (United States)

    EPA ha desarrollado el Programa de Herramientas de Calidad del Aire Interior para las Escuelas para reducir la exposición a los contaminantes ambientales en las mismas a través de la adopción voluntaria de las prácticas para manejar la calidad del aire int

  12. Las elecciones de gobernador en Sonora, 1997*

    Directory of Open Access Journals (Sweden)

    Juan Poom Medina

    2003-01-01

    Full Text Available El objetivo del presente artículo es analizar el desarrollo y resultados de las elecciones de gobernador re alizadas en Sonora en 1997. Se exponen y analizan las distintas fases que confo rm a ron dicho proceso elector al, tomando como punto de partida la caracterización de las elecciones en México.

  13. Introduction of a simple-model-based land surface dataset for Europe

    Science.gov (United States)

    Orth, Rene; Seneviratne, Sonia I.

    2015-04-01

    Land surface hydrology can play a crucial role during extreme events such as droughts, floods and even heat waves. We introduce in this study a new hydrological dataset for Europe that consists of soil moisture, runoff and evapotranspiration (ET). It is derived with a simple water balance model (SWBM) forced with precipitation, temperature and net radiation. The SWBM dataset extends over the period 1984-2013 with a daily time step and 0.5° × 0.5° resolution. We employ a novel calibration approach, in which we consider 300 random parameter sets chosen from an observation-based range. Using several independent validation datasets representing soil moisture (or terrestrial water content), ET and streamflow, we identify the best performing parameter set and hence the new dataset. To illustrate its usefulness, the SWBM dataset is compared against several state-of-the-art datasets (ERA-Interim/Land, MERRA-Land, GLDAS-2-Noah, simulations of the Community Land Model Version 4), using all validation datasets as reference. For soil moisture dynamics it outperforms the benchmarks. Therefore the SWBM soil moisture dataset constitutes a reasonable alternative to sparse measurements, little validated model results, or proxy data such as precipitation indices. Also in terms of runoff the SWBM dataset performs well, whereas the evaluation of the SWBM ET dataset is overall satisfactory, but the dynamics are less well captured for this variable. This highlights the limitations of the dataset, as it is based on a simple model that uses uniform parameter values. Hence some processes impacting ET dynamics may not be captured, and quality issues may occur in regions with complex terrain. Even though the SWBM is well calibrated, it cannot replace more sophisticated models; but as their calibration is a complex task the present dataset may serve as a benchmark in future. In addition we investigate the sources of skill of the SWBM dataset and find that the parameter set has a similar

  14. VideoWeb Dataset for Multi-camera Activities and Non-verbal Communication

    Science.gov (United States)

    Denina, Giovanni; Bhanu, Bir; Nguyen, Hoang Thanh; Ding, Chong; Kamal, Ahmed; Ravishankar, Chinya; Roy-Chowdhury, Amit; Ivers, Allen; Varda, Brenda

    Human-activity recognition is one of the most challenging problems in computer vision. Researchers from around the world have tried to solve this problem and have come a long way in recognizing simple motions and atomic activities. As the computer vision community heads toward fully recognizing human activities, a challenging and labeled dataset is needed. To respond to that need, we collected a dataset of realistic scenarios in a multi-camera network environment (VideoWeb) involving multiple persons performing dozens of different repetitive and non-repetitive activities. This chapter describes the details of the dataset. We believe that this VideoWeb Activities dataset is unique and it is one of the most challenging datasets available today. The dataset is publicly available online at http://vwdata.ee.ucr.edu/ along with the data annotation.

  15. Las buenas practicas espanolas 1998

    Directory of Open Access Journals (Sweden)

    Nasarre y de Goicoechea, Fernando

    2000-02-01

    Full Text Available The II HABITAT Conference held at Istanbul on June 1996, adopted the Habitat Program -Global Action Plan, in within which two performance lines are especially reinforced to reach the two main goals of suitable e housing for all and sustainable human settlements: the collaboration and participation among all the levels of the government and the civil society and, the recognition of the importance of the urban policies. In this framework is addressed the International Award Best Practices for the Improvement of the Life Conditions that, sponsored by the municipality of Dubai, is celebrated every two years and is awarded to the 10 selected by an International Independent Jury. The goal of the Competition is to promote policies and strategies more effective for the sustainable development of the humanity and their settlements, through the transmission of information and knowledge on experiences and solutions proved in the practice. The obtained results in the last international competition, with 32 of the presented Practices included in the Data Base of the United Nations Good Practices, 18 classified among the 100 best (3 of them included among the 40 finalists and finally, one, among the 10 awarded (Programs for the improvement of the urban environment of Malaga, allow us to look forward with satisfaction the panorama of the urban policy in our country.La Conferencia HABITAT II celebrada en Estambul en junio de 1996 adopto el Programa Hábitat-Plan Global de Acción-, dentro del cual se refuerzan especialmente dos líneas de actuación: la colaboración y participación entre todos los niveles de gobierno y la sociedad civil y el reconocimiento de la importancia de las políticas urbanas para alcanzar los dos objetivos principales: vivienda adecuada para todos y asentamientos humanos sostenibles. En este marco se inscribe el Concurso Internacional de Buenas Practicas para la Mejora de las Condiciones de Vida que, patrocinado par la municipalidad

  16. USGS Watershed Boundary Dataset (WBD) Overlay Map Service from The National Map - National Geospatial Data Asset (NGDA) Watershed Boundary Dataset (WBD)

    Data.gov (United States)

    U.S. Geological Survey, Department of the Interior — The Watershed Boundary Dataset (WBD) from The National Map (TNM) defines the perimeter of drainage areas formed by the terrain and other landscape characteristics....

  17. The StreamCat Dataset: Accumulated Attributes for NHDPlusV2 Catchments (Version 2.1) for the Conterminous United States: National Coal Resource Dataset System

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset represents the coal mine density and storage volumes within individual, local NHDPlusV2 catchments and upstream, contributing watersheds based on the...

  18. The StreamCat Dataset: Accumulated Attributes for NHDPlusV2 (Version 2.1) Catchments for the Conterminous United States: National Anthropogenic Barrier Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset represents the dam density and storage volumes within individual, local NHDPlusV2 catchments and upstream, contributing watersheds based on the National...

  19. The StreamCat Dataset: Accumulated Attributes for NHDPlusV2 (Version 2.1) Catchments for the Conterminous United States: National Elevation Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset represents the elevation values within individual local NHDPlusV2 catchments and upstream, contributing watersheds based on the National Elevation...

  20. Topic modeling for cluster analysis of large biological and medical datasets.

    Science.gov (United States)

    Zhao, Weizhong; Zou, Wen; Chen, James J

    2014-01-01

    The big data moniker is nowhere better deserved than to describe the ever-increasing prodigiousness and complexity of biological and medical datasets. New methods are needed to generate and test hypotheses, foster biological interpretation, and build validated predictors. Although multivariate techniques such as cluster analysis may allow researchers to identify groups, or clusters, of related variables, the accuracies and effectiveness of traditional clustering methods diminish for large and hyper dimensional datasets. Topic modeling is an active research field in machine learning and has been mainly used as an analytical tool to structure large textual corpora for data mining. Its ability to reduce high dimensionality to a small number of latent variables makes it suitable as a means for clustering or overcoming clustering difficulties in large biological and medical datasets. In this study, three topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, are proposed and tested on the cluster analysis of three large datasets: Salmonella pulsed-field gel electrophoresis (PFGE) dataset, lung cancer dataset, and breast cancer dataset, which represent various types of large biological or medical datasets. All three various methods are shown to improve the efficacy/effectiveness of clustering results on the three datasets in comparison to traditional methods. A preferable cluster analysis method emerged for each of the three datasets on the basis of replicating known biological truths. Topic modeling could be advantageously applied to the large datasets of biological or medical research. The three proposed topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, yield clustering improvements for the three different data types. Clusters more efficaciously represent truthful groupings and subgroupings in the data than traditional methods, suggesting

  1. A new dataset and algorithm evaluation for mood estimation in music

    OpenAIRE

    Godec, Primož

    2014-01-01

    This thesis presents a new dataset of perceived and induced emotions for 200 audio clips. The gathered dataset provides users' perceived and induced emotions for each clip, the association of color, along with demographic and personal data, such as user's emotion state and emotion ratings, genre preference, music experience, among others. With an online survey we collected more than 7000 responses for a dataset of 200 audio excerpts, thus providing about 37 user responses per clip. The foc...

  2. Hydrological simulation of the Brahmaputra basin using global datasets

    Science.gov (United States)

    Bhattacharya, Biswa; Conway, Crystal; Craven, Joanne; Masih, Ilyas; Mazzolini, Maurizio; Shrestha, Shreedeepy; Ugay, Reyne; van Andel, Schalk Jan

    2017-04-01

    Brahmaputra River flows through China, India and Bangladesh to the Bay of Bengal and is one of the largest rivers of the world with a catchment size of 580K km2. The catchment is largely hilly and/or forested with sparse population and with limited urbanisation and economic activities. The catchment experiences heavy monsoon rainfall leading to very high flood discharges. Large inter-annual variation of discharge leading to flooding, erosion and morphological changes are among the major challenges. The catchment is largely ungauged; moreover, limited availability of hydro-meteorological data limits the possibility of carrying out evidence based research, which could provide trustworthy information for managing and when needed, controlling, the basin processes by the riparian countries for overall basin development. The paper presents initial results of a current research project on Brahmaputra basin. A set of hydrological and hydraulic models (SWAT, HMS, RAS) are developed by employing publicly available datasets of DEM, land use and soil and simulated using satellite based rainfall products, evapotranspiration and temperature estimates. Remotely sensed data are compared with sporadically available ground data. The set of models are able to produce catchment wide hydrological information that potentially can be used in the future in managing the basin's water resources. The model predications should be used with caution due to high level of uncertainty because the semi-calibrated models are developed with uncertain physical representation (e.g. cross-section) and simulated with global meteorological forcing (e.g. TRMM) with limited validation. Major scientific challenges are seen in producing robust information that can be reliably used in managing the basin. The information generated by the models are uncertain and as a result, instead of using them per se, they are used in improving the understanding of the catchment, and by running several scenarios with varying

  3. Investigating automated depth modelling of archaeo-magnetic datasets

    Science.gov (United States)

    Cheyney, Samuel; Hill, Ian; Linford, Neil; Leech, Christopher

    2010-05-01

    Magnetic surveying is a commonly used tool for first-pass non-invasive archaeological surveying, and is often used to target areas for more detailed geophysical investigation, or excavation. Quick and routine processing of magnetic datasets mean survey results are typically viewed as 2D greyscale maps and the shapes of anomalies are interpreted in terms of likely archaeological structures. This technique is simple, but ignores some of the information content of the data. The data collected using dense spatial sampling with modern precise instrumentation are capable of yielding numerical estimates of the depths to buried structures, and their physical properties. The magnetic field measured at the surface is a superposition of the responses to all anomalous magnetic susceptibilities in the subsurface, and is therefore capable of revealing a 3D model of the magnetic properties. The application of mathematical modelling techniques to very-near-surface surveys such as for archaeology is quite rare, however similar methods are routinely used in regional scale mineral exploration surveys. Inverse modelling techniques have inherent ambiguity due to the nature of the mathematical "inverse problem". Often, although a good fit to the recorded values can be obtained, the final model will be non-unique and may be heavily biased by the starting model provided. Also the run time and computer resources required can be restrictive. Our approach is to derive as much information as possible from the data directly, and use this to define a starting model for inversion. This addresses both the ambiguity of the inverse problem and reduces the task for the inversion computation. A number of alternative methods exist that can be used to obtain parameters for source bodies in potential field data. Here, methods involving the derivatives of the total magnetic field are used in association with advanced image processing techniques to outline the edges of anomalous bodies more accurately

  4. Reliability of Source Mechanisms for a Hydraulic Fracturing Dataset

    Science.gov (United States)

    Eyre, T.; Van der Baan, M.

    2016-12-01

    Non-double-couple components have been inferred for induced seismicity due to fluid injection, yet these components are often poorly constrained due to the acquisition geometry. Likewise non-double-couple components in microseismic recordings are not uncommon. Microseismic source mechanisms provide an insight into the fracturing behaviour of a hydraulically stimulated reservoir. However, source inversion in a hydraulic fracturing environment is complicated by the likelihood of volumetric contributions to the source due to the presence of high pressure fluids, which greatly increases the possible solution space and therefore the non-uniqueness of the solutions. Microseismic data is usually recorded on either 2D surface or borehole arrays of sensors. In many cases, surface arrays appear to constrain source mechanisms with high shear components, whereas borehole arrays tend to constrain more variable mechanisms including those with high tensile components. The abilities of each geometry to constrain the true source mechanisms are therefore called into question.The ability to distinguish between shear and tensile source mechanisms with different acquisition geometries is investigated using synthetic data. For both inversions, both P- and S- wave amplitudes recorded on three component sensors need to be included to obtain reliable solutions. Surface arrays appear to give more reliable solutions due to a greater sampling of the focal sphere, but in reality tend to record signals with a low signal to noise ratio. Borehole arrays can produce acceptable results, however the reliability is much more affected by relative source-receiver locations and source orientation, with biases produced in many of the solutions. Therefore more care must be taken when interpreting results.These findings are taken into account when interpreting a microseismic dataset of 470 events recorded by two vertical borehole arrays monitoring a horizontal treatment well. Source locations and

  5. Estimated Perennial Streams of Idaho and Related Geospatial Datasets

    Science.gov (United States)

    Rea, Alan; Skinner, Kenneth D.

    2009-01-01

    record, generally would be considered to represent flow conditions better at a given site than flow estimates based on regionalized regression models. The geospatial datasets of modeled perennial streams are considered a first-cut estimate, and should not be construed to override site-specific flow data.

  6. Scalable and portable visualization of large atomistic datasets

    Science.gov (United States)

    Sharma, Ashish; Kalia, Rajiv K.; Nakano, Aiichiro; Vashishta, Priya

    2004-10-01

    A scalable and portable code named Atomsviewer has been developed to interactively visualize a large atomistic dataset consisting of up to a billion atoms. The code uses a hierarchical view frustum-culling algorithm based on the octree data structure to efficiently remove atoms outside of the user's field-of-view. Probabilistic and depth-based occlusion-culling algorithms then select atoms, which have a high probability of being visible. Finally a multiresolution algorithm is used to render the selected subset of visible atoms at varying levels of detail. Atomsviewer is written in C++ and OpenGL, and it has been tested on a number of architectures including Windows, Macintosh, and SGI. Atomsviewer has been used to visualize tens of millions of atoms on a standard desktop computer and, in its parallel version, up to a billion atoms. Program summaryTitle of program: Atomsviewer Catalogue identifier: ADUM Program summary URL:http://cpc.cs.qub.ac.uk/summaries/ADUM Program obtainable from: CPC Program Library, Queen's University of Belfast, N. Ireland Computer for which the program is designed and others on which it has been tested: 2.4 GHz Pentium 4/Xeon processor, professional graphics card; Apple G4 (867 MHz)/G5, professional graphics card Operating systems under which the program has been tested: Windows 2000/XP, Mac OS 10.2/10.3, SGI IRIX 6.5 Programming languages used: C++, C and OpenGL Memory required to execute with typical data: 1 gigabyte of RAM High speed storage required: 60 gigabytes No. of lines in the distributed program including test data, etc.: 550 241 No. of bytes in the distributed program including test data, etc.: 6 258 245 Number of bits in a word: Arbitrary Number of processors used: 1 Has the code been vectorized or parallelized: No Distribution format: tar gzip file Nature of physical problem: Scientific visualization of atomic systems Method of solution: Rendering of atoms using computer graphic techniques, culling algorithms for data

  7. GUDM: Automatic Generation of Unified Datasets for Learning and Reasoning in Healthcare.

    Science.gov (United States)

    Ali, Rahman; Siddiqi, Muhammad Hameed; Idris, Muhammad; Ali, Taqdir; Hussain, Shujaat; Huh, Eui-Nam; Kang, Byeong Ho; Lee, Sungyoung

    2015-07-02

    A wide array of biomedical data are generated and made available to healthcare experts. However, due to the diverse nature of data, it is difficult to predict outcomes from it. It is therefore necessary to combine these diverse data sources into a single unified dataset. This paper proposes a global unified data model (GUDM) to provide a global unified data structure for all data sources and generate a unified dataset by a "data modeler" tool. The proposed tool implements user-centric priority based approach which can easily resolve the problems of unified data modeling and overlapping attributes across multiple datasets. The tool is illustrated using sample diabetes mellitus data. The diverse data sources to generate the unified dataset for diabetes mellitus include clinical trial information, a social media interaction dataset and physical activity data collected using different sensors. To realize the significance of the unified dataset, we adopted a well-known rough set theory based rules creation process to create rules from the unified dataset. The evaluation of the tool on six different sets of locally created diverse datasets shows that the tool, on average, reduces 94.1% time efforts of the experts and knowledge engineer while creating unified datasets.

  8. Wehmas et al. 94-04 Toxicol Sci: Datasets for manuscript

    Data.gov (United States)

    U.S. Environmental Protection Agency — Dataset includes overview text document (accepted version of manuscript) and tables, figures, and supplementary materials. Supplementary tables provide summary data...

  9. A New Dataset Size Reduction Approach for PCA-Based Classification in OCR Application

    Directory of Open Access Journals (Sweden)

    Mohammad Amin Shayegan

    2014-01-01

    Full Text Available A major problem of pattern recognition systems is due to the large volume of training datasets including duplicate and similar training samples. In order to overcome this problem, some dataset size reduction and also dimensionality reduction techniques have been introduced. The algorithms presently used for dataset size reduction usually remove samples near to the centers of classes or support vector samples between different classes. However, the samples near to a class center include valuable information about the class characteristics and the support vector is important for evaluating system efficiency. This paper reports on the use of Modified Frequency Diagram technique for dataset size reduction. In this new proposed technique, a training dataset is rearranged and then sieved. The sieved training dataset along with automatic feature extraction/selection operation using Principal Component Analysis is used in an OCR application. The experimental results obtained when using the proposed system on one of the biggest handwritten Farsi/Arabic numeral standard OCR datasets, Hoda, show about 97% accuracy in the recognition rate. The recognition speed increased by 2.28 times, while the accuracy decreased only by 0.7%, when a sieved version of the dataset, which is only as half as the size of the initial training dataset, was used.

  10. Topographical effects of climate dataset and their impacts on the estimation of regional net primary productivity

    Science.gov (United States)

    Sun, L. Qing; Feng, Feng X.

    2014-11-01

    In this study, we first built and compared two different climate datasets for Wuling mountainous area in 2010, one of which considered topographical effects during the ANUSPLIN interpolation was referred as terrain-based climate dataset, while the other one did not was called ordinary climate dataset. Then, we quantified the topographical effects of climatic inputs on NPP estimation by inputting two different climate datasets to the same ecosystem model, the Boreal Ecosystem Productivity Simulator (BEPS), to evaluate the importance of considering relief when estimating NPP. Finally, we found the primary contributing variables to the topographical effects through a series of experiments given an overall accuracy of the model output for NPP. The results showed that: (1) The terrain-based climate dataset presented more reliable topographic information and had closer agreements with the station dataset than the ordinary climate dataset at successive time series of 365 days in terms of the daily mean values. (2) On average, ordinary climate dataset underestimated NPP by 12.5% compared with terrain-based climate dataset over the whole study area. (3) The primary climate variables contributing to the topographical effects of climatic inputs for Wuling mountainous area were temperatures, which suggest that it is necessary to correct temperature differences for estimating NPP accurately in such a complex terrain.

  11. USO DE LAS TICS EN APLICACIONES MATEMATICAS

    OpenAIRE

    ROA, NESTOR; MENDEZ, ADOLFO; TARAZONA, JORGE

    2011-01-01

    En este objeto de aprendizaje se desarrollaran los siguientes temas (1) ¿Qué significa la competencia Usar las TIC´s? (2) ¿Por qué es relevante que adquiera la competencia Usar las TIC´s en mi formación? (3) ¿Cómo aprendo la competencia Usar las TIC´s? (4) ¿Cómo aplico la competencia Usar las TIC´s? (5) ¿Cómo puedo autoevaluar mi competencia Usar las TIC´s (con miras a un constante crecimiento)?

  12. Las tragedias latinas de tesis

    Directory of Open Access Journals (Sweden)

    Andrés Pociña

    1978-06-01

    Full Text Available An analysis of Latin praetextae as a mean of diffusion of political theories. This paper (with the same groundwork and methods proposed in «Finalidad político-didáctica de las tragedias de Séneca», Emerita 44, 1976, pp. 279-301, reviews exclusively those plays in which tragic authors consciously and intentionally proposed a political thesis. This happens in the republican praetextae, in the Octauia and in the work of Curiatius Maternus. All this tend to confirm the politico-didactical interpretation of Seneca’s drama proposed by the author in the above mentioned work.

  13. Cuando son muchas las voces

    Directory of Open Access Journals (Sweden)

    Cruz, Manuel

    2003-12-01

    Full Text Available The work departs from an approach to the genesis of the concept of responsibility. Some features of the concept that have today become obvious can already come into sight in this first approach. The work devotes itself then to point out the implications (and, most of all, the consequences involved in openly ascribing responsibility to the Modern Age and to the rise of industrial civilization. One of these main consequences is perhaps to have to deal with a number of theoretical topics in a necessary connexion with another axial topic of modern thought, namely, identity. Besides, the essay points also to a number of topics, such as the problem of the alleged collective responsibilities and also that of evil, just to quote two of them, whose magnitude may serve as an indicator to the relevance of the concept of responsibility in contemporary thought.

    Tras una inicial aproximación a la génesis del concepto de responsabilidad, génesis en la que se empiezan a percibir algunos de los rasgos del mismo que terminarán por hacerse explícitos en nuestros días, el trabajo se centra en señalar las implicaciones (y, sobre todo, las consecuencias que tiene adscribir decididamente la responsabilidad a la época moderna y, en concreto, al surgimiento de la civilización industrial. Acaso una de las más importantes sea la de obligarnos a plantear toda una serie de asuntos en conexión necesaria con otro tema-eje del pensamiento moderno, el de la identidad. Junto a dicho tema, en el texto quedan apuntados otros, como el de la justa escala en la que abordar tales asuntos (o, lo que viene a ser lo mismo, el problema de las presuntas responsabilidades colectivas o el del mal, por citar sólo dos, cuya envergadura puede servir como indicador de la importancia que presenta el concepto de responsabilidad en el pensamiento contemporáneo.

  14. Las Ermitas de la Albufera

    OpenAIRE

    ZÁRATE OLASO, ROCÍO DE

    2015-01-01

    [es] El Parque Natural de La Albufera es uno de los espacios naturales más importantes de la Comunidad Valenciana. Esto se aprecia en la cantidad de asentamientos de población que dan como resultado una gran cantidad de edificaciones, entre ellas las ermitas estudiadas en este trabajo. En este Trabajo Fin de Grado se ha realizado el estudio tipológico de tres ermitas situadas en el entorno del Parque Natural de la Albufera: la Ermita dels Sants de la Pedra de Sueca, la Ermita dels Sants de la...

  15. Cultura emprendedora en las pymes

    Directory of Open Access Journals (Sweden)

    Fernando Bojórquez Gutiérrez

    2016-03-01

    Full Text Available El propósito de este trabajo es identificar la cultura corporativa imperante en las pymes localizadas en Sinaloa y particularmente determinar el nivel de cultura emprendedora. La gerencia de la pymes no es capaz de identificar con precisión la modalidad de cultura organizacional imperante en su empresa, ni el nivel de cultura emprendedora en la misma. Esta situación contribuye a no seleccionar la estrategia competitiva adecuada, lo que resulta en no maximizar los resultados o alcanzar logros potenciales. Por otra parte, se busca conocer la relación estadística significativa entre el estilo directivo observable, el nivel de cultura emprendedora y la estrategia competitiva de las empresas. Por el lado del ámbito académico, se observa una imprecisión y desconocimiento de factores o variables para explicar la cultura corporativa y sus modalidades, así como los elementos de la cultura emprendedora

  16. EL LENGUAJE DE LAS CAMPANAS

    Directory of Open Access Journals (Sweden)

    MARCELA DÁVALOS

    2011-10-01

    Full Text Available A los políticos liberales del siglo diecinueve les molestaba escuchar las campanas, pues su sonido les recordaba el dominio que curas y religiosos tenían entre la población de la Ciudad de México. Pese a que la historia de los campanarios ha sido parte del pasado estético, religioso y político, hasta hoy poco sabemos del papel que ellas han cumplido como transmisoras de información, de su influencia en la vida cotidiana o de su pertenencia a una cultura auditiva ya olvidada.[1] Décadas antes de la Independencia, comenzó una beligerante polémica entre los letrados en torno del uso de las campanadas. En tanto algunos sostenían que el horario de los tañidos debía reglamentarse, otros pensaron en fundirlas y emplear su metal para cañones o monedas. A pesar de los reiterados reglamentos o de los deseos anticlericales de terminar con el significado sagrado que habían tenido durante la Colonia, los campanarios siguieron marcando el horario ciudadano más allá de la segunda mitad del siglo diecinueve.

  17. A novel dataset for real-life evaluation of facial expression recognition methodologies

    NARCIS (Netherlands)

    Siddiqi, Muhammad Hameed; Ali, Maqbool; Idris, Muhammad; Banos Legran, Oresti; Lee, Sungyoung; Choo, Hyunseung

    2016-01-01

    One limitation seen among most of the previous methods is that they were evaluated under settings that are far from real-life scenarios. The reason is that the existing facial expression recognition (FER) datasets are mostly pose-based and assume a predefined setup. The expressions in these datasets

  18. Full-Scale Approximations of Spatio-Temporal Covariance Models for Large Datasets

    KAUST Repository

    Zhang, Bohai; Sang, Huiyan; Huang, Jianhua Z.

    2014-01-01

    of dataset and application of such models is not feasible for large datasets. This article extends the full-scale approximation (FSA) approach by Sang and Huang (2012) to the spatio-temporal context to reduce computational complexity. A reversible jump Markov

  19. Gridded precipitation dataset for the Rhine basin made with the genRE interpolation method

    NARCIS (Netherlands)

    Osnabrugge, van B.; Uijlenhoet, R.

    2017-01-01

    A high resolution (1.2x1.2km) gridded precipitation dataset with hourly time step that covers the whole Rhine basin for the period 1997-2015. Made from gauge data with the genRE interpolation scheme. See "genRE: A method to extend gridded precipitation climatology datasets in near real-time for

  20. TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild

    KAUST Repository

    Mü ller, Matthias; Bibi, Adel Aamer; Giancola, Silvio; Al-Subaihi, Salman; Ghanem, Bernard

    2018-01-01

    Despite the numerous developments in object tracking, further development of current tracking algorithms is limited by small and mostly saturated datasets. As a matter of fact, data-hungry trackers based on deep-learning currently rely on object detection datasets due to the scarcity of dedicated large-scale tracking datasets. In this work, we present TrackingNet, the first large-scale dataset and benchmark for object tracking in the wild. We provide more than 30K videos with more than 14 million dense bounding box annotations. Our dataset covers a wide selection of object classes in broad and diverse context. By releasing such a large-scale dataset, we expect deep trackers to further improve and generalize. In addition, we introduce a new benchmark composed of 500 novel videos, modeled with a distribution similar to our training dataset. By sequestering the annotation of the test set and providing an online evaluation server, we provide a fair benchmark for future development of object trackers. Deep trackers fine-tuned on a fraction of our dataset improve their performance by up to 1.6% on OTB100 and up to 1.7% on TrackingNet Test. We provide an extensive benchmark on TrackingNet by evaluating more than 20 trackers. Our results suggest that object tracking in the wild is far from being solved.

  1. Omicseq: a web-based search engine for exploring omics datasets

    Science.gov (United States)

    Sun, Xiaobo; Pittard, William S.; Xu, Tianlei; Chen, Li; Zwick, Michael E.; Jiang, Xiaoqian; Wang, Fusheng

    2017-01-01

    Abstract The development and application of high-throughput genomics technologies has resulted in massive quantities of diverse omics data that continue to accumulate rapidly. These rich datasets offer unprecedented and exciting opportunities to address long standing questions in biomedical research. However, our ability to explore and query the content of diverse omics data is very limited. Existing dataset search tools rely almost exclusively on the metadata. A text-based query for gene name(s) does not work well on datasets wherein the vast majority of their content is numeric. To overcome this barrier, we have developed Omicseq, a novel web-based platform that facilitates the easy interrogation of omics datasets holistically to improve ‘findability’ of relevant data. The core component of Omicseq is trackRank, a novel algorithm for ranking omics datasets that fully uses the numerical content of the dataset to determine relevance to the query entity. The Omicseq system is supported by a scalable and elastic, NoSQL database that hosts a large collection of processed omics datasets. In the front end, a simple, web-based interface allows users to enter queries and instantly receive search results as a list of ranked datasets deemed to be the most relevant. Omicseq is freely available at http://www.omicseq.org. PMID:28402462

  2. Document Questionnaires and Datasets with DDI: A Hands-On Introduction with Colectica

    OpenAIRE

    Iverson, Jeremy; Smith, Dan

    2018-01-01

    This workshop offers a hands-on, practical approach to creating and documenting both surveys and datasets with DDI and Colectica. Participants will build and field a DDI-driven survey using their own questions or samples provided in the workshop. They will then ingest, annotate, and publish DDI dataset descriptions using the collected survey data.

  3. An integrated pan-tropical biomass map using multiple reference datasets

    NARCIS (Netherlands)

    Avitabile, V.; Herold, M.; Heuvelink, G.B.M.; Lewis, S.L.; Phillips, O.L.; Asner, G.P.; Armston, J.; Asthon, P.; Banin, L.F.; Bayol, N.; Berry, N.; Boeckx, P.; Jong, De B.; Devries, B.; Girardin, C.; Kearsley, E.; Lindsell, J.A.; Lopez-gonzalez, G.; Lucas, R.; Malhi, Y.; Morel, A.; Mitchard, E.; Nagy, L.; Qie, L.; Quinones, M.; Ryan, C.M.; Slik, F.; Sunderland, T.; Vaglio Laurin, G.; Valentini, R.; Verbeeck, H.; Wijaya, A.; Willcock, S.

    2016-01-01

    We combined two existing datasets of vegetation aboveground biomass (AGB) (Proceedings of the National Academy of Sciences of the United States of America, 108, 2011, 9899; Nature Climate Change, 2, 2012, 182) into a pan-tropical AGB map at 1-km resolution using an independent reference dataset of

  4. Large-scale Labeled Datasets to Fuel Earth Science Deep Learning Applications

    Science.gov (United States)

    Maskey, M.; Ramachandran, R.; Miller, J.

    2017-12-01

    Deep learning has revolutionized computer vision and natural language processing with various algorithms scaled using high-performance computing. However, generic large-scale labeled datasets such as the ImageNet are the fuel that drives the impressive accuracy of deep learning results. Large-scale labeled datasets already exist in domains such as medical science, but creating them in the Earth science domain is a challenge. While there are ways to apply deep learning using limited labeled datasets, there is a need in the Earth sciences for creating large-scale labeled datasets for benchmarking and scaling deep learning applications. At the NASA Marshall Space Flight Center, we are using deep learning for a variety of Earth science applications where we have encountered the need for large-scale labeled datasets. We will discuss our approaches for creating such datasets and why these datasets are just as valuable as deep learning algorithms. We will also describe successful usage of these large-scale labeled datasets with our deep learning based applications.

  5. SAR image classification based on CNN in real and simulation datasets

    Science.gov (United States)

    Peng, Lijiang; Liu, Ming; Liu, Xiaohua; Dong, Liquan; Hui, Mei; Zhao, Yuejin

    2018-04-01

    Convolution neural network (CNN) has made great success in image classification tasks. Even in the field of synthetic aperture radar automatic target recognition (SAR-ATR), state-of-art results has been obtained by learning deep representation of features on the MSTAR benchmark. However, the raw data of MSTAR have shortcomings in training a SAR-ATR model because of high similarity in background among the SAR images of each kind. This indicates that the CNN would learn the hierarchies of features of backgrounds as well as the targets. To validate the influence of the background, some other SAR images datasets have been made which contains the simulation SAR images of 10 manufactured targets such as tank and fighter aircraft, and the backgrounds of simulation SAR images are sampled from the whole original MSTAR data. The simulation datasets contain the dataset that the backgrounds of each kind images correspond to the one kind of backgrounds of MSTAR targets or clutters and the dataset that each image shares the random background of whole MSTAR targets or clutters. In addition, mixed datasets of MSTAR and simulation datasets had been made to use in the experiments. The CNN architecture proposed in this paper are trained on all datasets mentioned above. The experimental results shows that the architecture can get high performances on all datasets even the backgrounds of the images are miscellaneous, which indicates the architecture can learn a good representation of the targets even though the drastic changes on background.

  6. Resolution testing and limitations of geodetic and tsunami datasets for finite fault inversions along subduction zones

    Science.gov (United States)

    Williamson, A.; Newman, A. V.

    2017-12-01

    Finite fault inversions utilizing multiple datasets have become commonplace for large earthquakes pending data availability. The mixture of geodetic datasets such as Global Navigational Satellite Systems (GNSS) and InSAR, seismic waveforms, and when applicable, tsunami waveforms from Deep-Ocean Assessment and Reporting of Tsunami (DART) gauges, provide slightly different observations that when incorporated together lead to a more robust model of fault slip distribution. The merging of different datasets is of particular importance along subduction zones where direct observations of seafloor deformation over the rupture area are extremely limited. Instead, instrumentation measures related ground motion from tens to hundreds of kilometers away. The distance from the event and dataset type can lead to a variable degree of resolution, affecting the ability to accurately model the spatial distribution of slip. This study analyzes the spatial resolution attained individually from geodetic and tsunami datasets as well as in a combined dataset. We constrain the importance of distance between estimated parameters and observed data and how that varies between land-based and open ocean datasets. Analysis focuses on accurately scaled subduction zone synthetic models as well as analysis of the relationship between slip and data in recent large subduction zone earthquakes. This study shows that seafloor deformation sensitive datasets, like open-ocean tsunami waveforms or seafloor geodetic instrumentation, can provide unique offshore resolution for understanding most large and particularly tsunamigenic megathrust earthquake activity. In most environments, we simply lack the capability to resolve static displacements using land-based geodetic observations.

  7. TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild

    KAUST Repository

    Müller, Matthias

    2018-03-28

    Despite the numerous developments in object tracking, further development of current tracking algorithms is limited by small and mostly saturated datasets. As a matter of fact, data-hungry trackers based on deep-learning currently rely on object detection datasets due to the scarcity of dedicated large-scale tracking datasets. In this work, we present TrackingNet, the first large-scale dataset and benchmark for object tracking in the wild. We provide more than 30K videos with more than 14 million dense bounding box annotations. Our dataset covers a wide selection of object classes in broad and diverse context. By releasing such a large-scale dataset, we expect deep trackers to further improve and generalize. In addition, we introduce a new benchmark composed of 500 novel videos, modeled with a distribution similar to our training dataset. By sequestering the annotation of the test set and providing an online evaluation server, we provide a fair benchmark for future development of object trackers. Deep trackers fine-tuned on a fraction of our dataset improve their performance by up to 1.6% on OTB100 and up to 1.7% on TrackingNet Test. We provide an extensive benchmark on TrackingNet by evaluating more than 20 trackers. Our results suggest that object tracking in the wild is far from being solved.

  8. A Dataset of Three Educational Technology Experiments on Differentiation, Formative Testing and Feedback

    Science.gov (United States)

    Haelermans, Carla; Ghysels, Joris; Prince, Fernao

    2015-01-01

    This paper describes a dataset with data from three individually randomized educational technology experiments on differentiation, formative testing and feedback during one school year for a group of 8th grade students in the Netherlands, using administrative data and the online motivation questionnaire of Boekaerts. The dataset consists of pre-…

  9. Developing predictive imaging biomarkers using whole-brain classifiers: Application to the ABIDE I dataset

    Directory of Open Access Journals (Sweden)

    Swati Rane

    2017-03-01

    Full Text Available We designed a modular machine learning program that uses functional magnetic resonance imaging (fMRI data in order to distinguish individuals with autism spectrum disorders from neurodevelopmentally normal individuals. Data was selected from the Autism Brain Imaging Dataset Exchange (ABIDE I Preprocessed Dataset.

  10. Omicseq: a web-based search engine for exploring omics datasets.

    Science.gov (United States)

    Sun, Xiaobo; Pittard, William S; Xu, Tianlei; Chen, Li; Zwick, Michael E; Jiang, Xiaoqian; Wang, Fusheng; Qin, Zhaohui S

    2017-07-03

    The development and application of high-throughput genomics technologies has resulted in massive quantities of diverse omics data that continue to accumulate rapidly. These rich datasets offer unprecedented and exciting opportunities to address long standing questions in biomedical research. However, our ability to explore and query the content of diverse omics data is very limited. Existing dataset search tools rely almost exclusively on the metadata. A text-based query for gene name(s) does not work well on datasets wherein the vast majority of their content is numeric. To overcome this barrier, we have developed Omicseq, a novel web-based platform that facilitates the easy interrogation of omics datasets holistically to improve 'findability' of relevant data. The core component of Omicseq is trackRank, a novel algorithm for ranking omics datasets that fully uses the numerical content of the dataset to determine relevance to the query entity. The Omicseq system is supported by a scalable and elastic, NoSQL database that hosts a large collection of processed omics datasets. In the front end, a simple, web-based interface allows users to enter queries and instantly receive search results as a list of ranked datasets deemed to be the most relevant. Omicseq is freely available at http://www.omicseq.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Spatially continuous dataset at local scale of Taita Hills in Kenya and Mount Kilimanjaro in Tanzania

    Directory of Open Access Journals (Sweden)

    Sizah Mwalusepo

    2016-09-01

    Full Text Available Climate change is a global concern, requiring local scale spatially continuous dataset and modeling of meteorological variables. This dataset article provided the interpolated temperature, rainfall and relative humidity dataset at local scale along Taita Hills and Mount Kilimanjaro altitudinal gradients in Kenya and Tanzania, respectively. The temperature and relative humidity were recorded hourly using automatic onset THHOBO data loggers and rainfall was recorded daily using GENERALR wireless rain gauges. Thin plate spline (TPS was used to interpolate, with the degree of data smoothing determined by minimizing the generalized cross validation. The dataset provide information on the status of the current climatic conditions along the two mountainous altitudinal gradients in Kenya and Tanzania. The dataset will, thus, enhance future research. Keywords: Spatial climate data, Climate change, Modeling, Local scale

  12. PERFORMANCE COMPARISON FOR INTRUSION DETECTION SYSTEM USING NEURAL NETWORK WITH KDD DATASET

    Directory of Open Access Journals (Sweden)

    S. Devaraju

    2014-04-01

    Full Text Available Intrusion Detection Systems are challenging task for finding the user as normal user or attack user in any organizational information systems or IT Industry. The Intrusion Detection System is an effective method to deal with the kinds of problem in networks. Different classifiers are used to detect the different kinds of attacks in networks. In this paper, the performance of intrusion detection is compared with various neural network classifiers. In the proposed research the four types of classifiers used are Feed Forward Neural Network (FFNN, Generalized Regression Neural Network (GRNN, Probabilistic Neural Network (PNN and Radial Basis Neural Network (RBNN. The performance of the full featured KDD Cup 1999 dataset is compared with that of the reduced featured KDD Cup 1999 dataset. The MATLAB software is used to train and test the dataset and the efficiency and False Alarm Rate is measured. It is proved that the reduced dataset is performing better than the full featured dataset.

  13. Bibliotecas accesibles para todos: pautas para acercar las bibliotecas a las personas con discapacidad y a las personas mayores

    OpenAIRE

    Centro Estatal de Autonomía Personal y Ayudas Técnicas

    2008-01-01

    El fuerte componente social, educativo y cultural de las bibliotecas las convierte en instituciones clave para conseguir la plena integración de las personas en situación de vulnerabilidad. Sin embargo, para poder considerar que las bibliotecas son accesibles a toda la sociedad, la persona ha de tener a su alcance todos los servicios y productos culturales que en ellas se ofrecen. Con este objetivo, el Ministerio de Educación, Política Social y Deporte, a través del CEAPAT-IMSERSO, ha present...

  14. A global gridded dataset of daily precipitation going back to 1950, ideal for analysing precipitation extremes

    Science.gov (United States)

    Contractor, S.; Donat, M.; Alexander, L. V.

    2017-12-01

    Reliable observations of precipitation are necessary to determine past changes in precipitation and validate models, allowing for reliable future projections. Existing gauge based gridded datasets of daily precipitation and satellite based observations contain artefacts and have a short length of record, making them unsuitable to analyse precipitation extremes. The largest limiting factor for the gauge based datasets is a dense and reliable station network. Currently, there are two major data archives of global in situ daily rainfall data, first is Global Historical Station Network (GHCN-Daily) hosted by National Oceanic and Atmospheric Administration (NOAA) and the other by Global Precipitation Climatology Centre (GPCC) part of the Deutsche Wetterdienst (DWD). We combine the two data archives and use automated quality control techniques to create a reliable long term network of raw station data, which we then interpolate using block kriging to create a global gridded dataset of daily precipitation going back to 1950. We compare our interpolated dataset with existing global gridded data of daily precipitation: NOAA Climate Prediction Centre (CPC) Global V1.0 and GPCC Full Data Daily Version 1.0, as well as various regional datasets. We find that our raw station density is much higher than other datasets. To avoid artefacts due to station network variability, we provide multiple versions of our dataset based on various completeness criteria, as well as provide the standard deviation, kriging error and number of stations for each grid cell and timestep to encourage responsible use of our dataset. Despite our efforts to increase the raw data density, the in situ station network remains sparse in India after the 1960s and in Africa throughout the timespan of the dataset. Our dataset would allow for more reliable global analyses of rainfall including its extremes and pave the way for better global precipitation observations with lower and more transparent uncertainties.

  15. Resampling Methods Improve the Predictive Power of Modeling in Class-Imbalanced Datasets

    Directory of Open Access Journals (Sweden)

    Paul H. Lee

    2014-09-01

    Full Text Available In the medical field, many outcome variables are dichotomized, and the two possible values of a dichotomized variable are referred to as classes. A dichotomized dataset is class-imbalanced if it consists mostly of one class, and performance of common classification models on this type of dataset tends to be suboptimal. To tackle such a problem, resampling methods, including oversampling and undersampling can be used. This paper aims at illustrating the effect of resampling methods using the National Health and Nutrition Examination Survey (NHANES wave 2009–2010 dataset. A total of 4677 participants aged ≥20 without self-reported diabetes and with valid blood test results were analyzed. The Classification and Regression Tree (CART procedure was used to build a classification model on undiagnosed diabetes. A participant demonstrated evidence of diabetes according to WHO diabetes criteria. Exposure variables included demographics and socio-economic status. CART models were fitted using a randomly selected 70% of the data (training dataset, and area under the receiver operating characteristic curve (AUC was computed using the remaining 30% of the sample for evaluation (testing dataset. CART models were fitted using the training dataset, the oversampled training dataset, the weighted training dataset, and the undersampled training dataset. In addition, resampling case-to-control ratio of 1:1, 1:2, and 1:4 were examined. Resampling methods on the performance of other extensions of CART (random forests and generalized boosted trees were also examined. CARTs fitted on the oversampled (AUC = 0.70 and undersampled training data (AUC = 0.74 yielded a better classification power than that on the training data (AUC = 0.65. Resampling could also improve the classification power of random forests and generalized boosted trees. To conclude, applying resampling methods in a class-imbalanced dataset improved the classification power of CART, random forests

  16. LAS ALFABETIZACIONES POSMODERNAS, LAS PUGNAS CULTURALES Y LOS NUEVOS SIGNIFICADOS DE LA CIUDADANIA

    Directory of Open Access Journals (Sweden)

    JORGE A. HUERGO

    1998-01-01

    Full Text Available En el trabajo se pasa revista a las vinculaciones entre las alfabetizaciones moderna y posmoderna con las culturales que ellas producen y por las que son producidas. La noción de Alfabetizaciones Posmodernas se describe como correlativa de conflictos culturales que se juegan en los ámbitos educativos. Luego se presentan las narrativas político-culturales dominantes, en relación con el problema de la alfabetización y los modelos de ciudadanía, para finalmente mostrar algunos aspectos de una construcción narrativa poscolonial que enmarque las relaciones entre alfabetizaciones posmodernas y formación ciudadana.

  17. LAS PROTEINAS SEMINALES DEL MANI (ARACHIS HYPOGAEA, LEGUMINOSAE y SU RELACION CON LAS CATEGORIAS INFRAESPECIFICAS

    Directory of Open Access Journals (Sweden)

    N R Grosso

    1994-01-01

    Full Text Available Las proteínas seminales de 122 muestras diferentes de Arachis hypogaea L. originarios de Bolivia, Perú y Ecuador fueron estudiadas por electroforesis en gel de poliacrilamida.Se detectaron siete bandas constantes y 27 bandas inconstantes. Los resultados de las últimas se utilizaron para analizar las similitudes entre las muestras empleando el coeficiente de Jaccard y el método de ligamiento promedio de la media aritmética no ponderada(UPGMA.Las proteínas seminales permitieron separar totalmente la subespecies de A.hypogaea y las variedades en menor medida.

  18. Redes sociales en las bibliotecas escolares

    Directory of Open Access Journals (Sweden)

    Vicent Giménez Chornet

    2015-04-01

    Full Text Available Las bibliotecas escolares deben ser un medio para acceder al conocimiento, y las Tecnologías de la Información y la Comunicación pueden facilitar que los estudiantes adolescentes se inicien en el uso de estas tecnologías para desarrollar sus capacidades y habilidades en la búsqueda de información. Las redes sociales e Internet preocupan por las situaciones problemáticas que pueden provocar en los niños y adolescentes que no sean conscientes de los peligros de la red, pero ello no debe impedir que conozcan las ventajas que las TIC ofrecen como forma y medio de aprender. En el artículo se analizan diferentes propuestas innovadoras que se han implantado en distintas bibliotecas escolares del mundo. Se concluye que es importante que los estudiantes preuniversitarios conozcan y dominen estas herramientas antes de incorporarse al mundo laboral o a la universidad.

  19. El liderazgo integral en las organizaciones

    Directory of Open Access Journals (Sweden)

    Laura Reyes-Jácome

    2011-01-01

    Full Text Available El liderazgo integral es una concepción surgida desde el modelo integral de Wilber, el cual tiene en cuenta las dimensiones interior, exterior, individual y grupal que se encuentran presentes en todas las circunstancias de la vida y que configuran la manera de concebir, interpretar y llevar a cabo las acciones del líder. El presente artículo expone cómo es entendido el líder integral a partir de la observación de las diferentes dimensiones y sus interrelaciones, teniendo en cuenta los cuadrantes, niveles, estados, líneas de desarrollo y tipos, características que debe tener el líder integral en su rol dentro de la organización y las prácticas de transformación de las que puede hacer uso para convertirse en líder integral

  20. La sexualidad y las hormonas extragonadales

    Directory of Open Access Journals (Sweden)

    Francisco Gnecco Mozo

    1940-12-01

    Full Text Available A pesar de que en la clínica es corriente obtener comprobación  de la influencia de las glándulas aisladas sobre algunos trastornos sexuales, la influencia de las glándulas endocrinas distintas de las sexuales, sobre la sexualidad en general, es apenas posible entreverla por medio de la relación de esas glándulas con los ovarios o con los testículos

  1. Comunicándonos con las cosas

    Directory of Open Access Journals (Sweden)

    Miguel Delgado

    2009-09-01

    Full Text Available Tras la Web 2.0 y gracias a la colaboración entre las personas, Internet sigue creciendo y evolucionando, pero esta vez a través de los objetos. La siguiente evolución pretende que las personas puedan interactuar con las cosas de una forma inteligente mediante tecnologías ampliamente utilizadas como el teléfono móvil e Internet, y tecnologías que permitan etiquetar objetos.

  2. Importancia de las emociones en el TDAH

    OpenAIRE

    Hernández Cascón, Cristina

    2015-01-01

    El TDAH es un trastorno de origen neurobiológico caracterizado por los síntomas de inatención, hiperactividad e impulsividad y que a su vez interfiere y dificulta en el manejo de las distintas emociones. A través de una larga revisión bibliográfica, se ha intentado proporcionar un conocimiento mayor sobre la importancia que tienen las emociones y adquirir una autoregulación de las mismas

  3. El sujeto posmoderno en las redes sociales

    OpenAIRE

    Elizondo, Mauricio; Picot, Carla

    2011-01-01

    Nuestro recorrido comienza con las características contemporáneas de los conceptos de vida pública y vida privada, y bajo el marco de las redes sociales, cómo estos ámbitos se reflejan en las subjetividades de los usuarios. En suma, intentaremos desarrollar qué son y cómo son utilizadas estas redes sociales en la actualidad por este sujeto posmoderno.

  4. Las franjas electorales en la experiencia chilena

    OpenAIRE

    Juan Ignacio García Rodríguez

    2013-01-01

    Expone la experiencia chilena sobre el establecimiento y regulación de las franjas electorales, para lo cual analiza la constitucionalidad de la franja, los principios involucrados, las disposiciones regulatorias, el papel del Consejo Nacional de Televisión, la jurisprudencia que sobre el tema ha sido emitida por el Tribunal Calificador de Elecciones y los retos que el sistema de franjas electorales enfrenta a futuro como lo es su aplicación a las elecciones primarias de los pa...

  5. Una mirada a las competencias

    Directory of Open Access Journals (Sweden)

    Germán Albeiro Castaño Duque

    2013-07-01

    Full Text Available El concepto de competencias cobra cada vez más fuerza y se extiende cada vez más en diferentes ámbitos y niveles. El presente artículo efectúa un breve recorrido por algunos autores de renombre que han hablado sobre el tema, hace referencia a la necesidad que tiene la universidad de flexibilizar sus currículos para responder a las nuevas demandas de la sociedad del conocimiento y la globalización, de manera que favorezcan el perfil ocupacional de sus egresados mediante la formación en competencias y, además, da una muestra clara de cómo se está llevando a cabo este proceso en la Universidad Nacional de Colombia e indica, finalmente, un breve esbozo de cómo será la educación del futuro.

  6. Leyendas mercantiles y sabotaje a las corporaciones

    Directory of Open Access Journals (Sweden)

    Miguel Ángel Santagada

    2007-12-01

    Full Text Available Las leyendas mercantiles son analizadas como reacciones frente a los símbolos de poder y riqueza que exhiben las marcas y las empresas corporativas. Desde el punto de vista del consumidor, las mismas narraciones parecen ratificar cierto sentimiento de inferioridad o de insignificancia frente a los atributos más exaltados por la propia actividad publicitaria (Brodin, 1995: además de leyendas anticorporativas, también circulan historias de recompensas recibidas por clientes en retribución a su lealtad para con ciertas empresas o marcas.

  7. Liderazgo y poder en las organizaciones

    OpenAIRE

    Valcarce Fernández, Antonio

    2015-01-01

    En este trabajo, presentamos una revisión sobre el liderazgo y el poder como fenómenos presentes dentro de la organización y empleando para ello la perspectiva de la Psicología de las Organizaciones. En cuanto al liderazgo señalaremos una contextualizacion, asi como las definiciones que se han aceptado y las distintas teorias que han ido formando el estudio del liderazgo así como las habilidades directivas que se enmarcan dentro del liderazgo efectivo,a su vez explicaremos los nue...

  8. Las elecciones mexicanas de 1995

    Directory of Open Access Journals (Sweden)

    Alonso LUJAMBIO

    2009-11-01

    Full Text Available RESUMEN: El artículo analiza las elecciones estatales y locales celebradas en México en 1995. El autor destaca cuatro factores presentes en estos comiciones. En primer lugar, se celebraron en un contexto de crisis económica. En segundo lugar, el PRI cosechó los peores resultados electorales de su historia. En tercer lugar, los procesos electorales (salvo quizá el de Yucatán fueron limpios y transparentes. Por último, la competitividad del mercado electoral aumentó al concentrarse el voto opositor en una de las opciones, de la centro-derecha representada por el Partido Acción Nacional (PAN, y al perder presencia el partido opositor de centro-iexquierda Partido de la Revolución Democrática (PRD. Así, pues, tendió a fortalecerse el esquema bipartito PRI-PAN y a debilitarse el tripartidismo PRD-PRI-PAN. El autor concluye que 1995 fue un año de aceleración para el proceso de transición a la democracia en México.ABSTRACT: This article analyzes the local and federal state elections held in México in 1995. The author mark four principal factors in this elections. First, this elections took place in a context of economical crisis. Second, the PRI had got the worst results in his history. Third, the elections (xcepts maybe in Yucatán were clean. At last, the election competitiveness increased because of the concentration of the opposing vote in the Partido de Acción Nacional (PAN, centre-right. The centre-leftist Partido de la Revolución Democrática (PRD losed influence. The author conclusion is that 1995 was a year in wich the transition to democracy go faster.

  9. Los sentidos y las ruinas

    Directory of Open Access Journals (Sweden)

    Francine Masiello

    2014-06-01

    Full Text Available Camino a Cafayate, al norte de Tucumán, la ciudadela de los indios quilmes me permite el descanso y la reflexión. Escalando lo que en otra época fue la fortaleza de una cultura poderosa, estudio el horizonte, admiro las alturas, intento reconstruir la vida de los indígenas tal como hubiera sido en una época muy lejana. Sin embargo, estos pensamientos no son libres, ni son míos; más bien, el libro de turismo me dice cómo tengo que mirar, me cuenta la historia de la cultura quilmes para que no me quede ninguna sorpresa. Los quilmes, la civilización más grande de la Argentina antes de la llegada de los españoles, primero resistían los avances del imperio incaico y después, durante 130 años, se oponían a los conquistadores. Sabemos por los guías de turismo que los españoles arrancaron a los últimos sobrevivientes a pie a Buenos Aires. La gran mayoría pereció en el camino. También sabemos que las ruinas de los quilmes fueron rehabilitadas durante la última dictadura militar. Imposible, entonces, no captar la ironía del gesto de los militares al recordar a los indígenas desaparecidos mientras seguían desapareciendo al pueblo argentino durante la década de los años setenta.

  10. Lo invisible de las emergencias

    Directory of Open Access Journals (Sweden)

    Salua Osorio Mrad

    2011-04-01

    • en el caso de los estudios de impactos a mediano y a largo plazo, el seguimiento a las cohortes de población afectada se convierte en un reto, dadas sus condiciones de desplazamiento. En conclusión, ante todo debemos reconocer nuestra falta de conocimientos en el tema y aprovechar la situación de emergencia recientemente ocurrida para obtener la mayor cantidad posible de información y generar proyectos de investigación que den respuestas a los vacíos de conocimiento sobre los efectos de las inundaciones en la salud, teniendo en cuenta el objetivo final de proporcionar herramientas para la toma de decisiones y para actuar ante una situación similar en el futuro. Además, debemos reconocer la importancia de la relación entre el medio ambiente y la salud, y aumentar nuestra comprensión de esta relación como línea base, que nos permita en los casos de emergencias, inferir o extrapolar con mayor grado de certidumbre y definir la línea de acción. Este es el momento para iniciar el proceso y quisiera extender una invitación a los grupos de investigación para que amplíen sus intereses, objetivos y preguntas de investigación, e incluyan temas como éstos, tan importantes para la salud pública del país.

  11. Sensitivity of a numerical wave model on wind re-analysis datasets

    Science.gov (United States)

    Lavidas, George; Venugopal, Vengatesan; Friedrich, Daniel

    2017-03-01

    Wind is the dominant process for wave generation. Detailed evaluation of metocean conditions strengthens our understanding of issues concerning potential offshore applications. However, the scarcity of buoys and high cost of monitoring systems pose a barrier to properly defining offshore conditions. Through use of numerical wave models, metocean conditions can be hindcasted and forecasted providing reliable characterisations. This study reports the sensitivity of wind inputs on a numerical wave model for the Scottish region. Two re-analysis wind datasets with different spatio-temporal characteristics are used, the ERA-Interim Re-Analysis and the CFSR-NCEP Re-Analysis dataset. Different wind products alter results, affecting the accuracy obtained. The scope of this study is to assess different available wind databases and provide information concerning the most appropriate wind dataset for the specific region, based on temporal, spatial and geographic terms for wave modelling and offshore applications. Both wind input datasets delivered results from the numerical wave model with good correlation. Wave results by the 1-h dataset have higher peaks and lower biases, in expense of a high scatter index. On the other hand, the 6-h dataset has lower scatter but higher biases. The study shows how wind dataset affects the numerical wave modelling performance, and that depending on location and study needs, different wind inputs should be considered.

  12. Knowledge Mining from Clinical Datasets Using Rough Sets and Backpropagation Neural Network

    Directory of Open Access Journals (Sweden)

    Kindie Biredagn Nahato

    2015-01-01

    Full Text Available The availability of clinical datasets and knowledge mining methodologies encourages the researchers to pursue research in extracting knowledge from clinical datasets. Different data mining techniques have been used for mining rules, and mathematical models have been developed to assist the clinician in decision making. The objective of this research is to build a classifier that will predict the presence or absence of a disease by learning from the minimal set of attributes that has been extracted from the clinical dataset. In this work rough set indiscernibility relation method with backpropagation neural network (RS-BPNN is used. This work has two stages. The first stage is handling of missing values to obtain a smooth data set and selection of appropriate attributes from the clinical dataset by indiscernibility relation method. The second stage is classification using backpropagation neural network on the selected reducts of the dataset. The classifier has been tested with hepatitis, Wisconsin breast cancer, and Statlog heart disease datasets obtained from the University of California at Irvine (UCI machine learning repository. The accuracy obtained from the proposed method is 97.3%, 98.6%, and 90.4% for hepatitis, breast cancer, and heart disease, respectively. The proposed system provides an effective classification model for clinical datasets.

  13. An assessment of differences in gridded precipitation datasets in complex terrain

    Science.gov (United States)

    Henn, Brian; Newman, Andrew J.; Livneh, Ben; Daly, Christopher; Lundquist, Jessica D.

    2018-01-01

    Hydrologic modeling and other geophysical applications are sensitive to precipitation forcing data quality, and there are known challenges in spatially distributing gauge-based precipitation over complex terrain. We conduct a comparison of six high-resolution, daily and monthly gridded precipitation datasets over the Western United States. We compare the long-term average spatial patterns, and interannual variability of water-year total precipitation, as well as multi-year trends in precipitation across the datasets. We find that the greatest absolute differences among datasets occur in high-elevation areas and in the maritime mountain ranges of the Western United States, while the greatest percent differences among datasets relative to annual total precipitation occur in arid and rain-shadowed areas. Differences between datasets in some high-elevation areas exceed 200 mm yr-1 on average, and relative differences range from 5 to 60% across the Western United States. In areas of high topographic relief, true uncertainties and biases are likely higher than the differences among the datasets; we present evidence of this based on streamflow observations. Precipitation trends in the datasets differ in magnitude and sign at smaller scales, and are sensitive to how temporal inhomogeneities in the underlying precipitation gauge data are handled.

  14. LAS DIMENSIONES DEL OCIO EN LAS SOCIEDADES RURALES

    Directory of Open Access Journals (Sweden)

    S. Rebollo

    2010-09-01

    Full Text Available

     

     

    RESUMEN

    El presente artículo es un análisis sobre el Ocio en la sociedad rural. El medio rural se ha estudiado desde el punto de vista económico, político y social y sin duda ha despertado un gran interés. Numerosos son los problemas que arrastran estas sociedades a lo largo de la historia. Problemas que han dado lugar a una situación de subdesarrollo y marginalidad. Hoy día indicadores de desarrollo y calidad de vida de una población pueden ser las actividades de ocio realizadas. El nivel de práctica deportiva y el empleo del tiempo libre como actividades de ocio son fenómenos que despiertan gran interés desde el mundo académico. En esta investigación se evalúa estos parámetros y se da información sobre las necesidades de sus habitantes en materia de tiempo libre, actividades de ocio y de forma concreta actividades deportivas.
    PALABRAS CLAVE: Ocio, Tiempo libre, Deporte, Rural.

     

    ABSTRACT

    The present article is an analysis of leisure in rural society. Rural areas have been studied from economic, political and social points of view and without a doubt there is heightened interest in these areas. Numerous problems have plagued these societies throughout history, and consequently led to underdevelopment and marginality. Nowadays leisure activities indicate development and quality of life in given populations. The level of sport practice and the use of the free time to do leisure activities are phenomena that interest the academic world. This investigation evaluated these parameters and information on the necessities of its inhabitants in the matter of free time, leisure activities and specifically sport activities.
    KEY WORDS: Leisure, free Time, Sport, Rural.

  15. Assessment of NASA's Physiographic and Meteorological Datasets as Input to HSPF and SWAT Hydrological Models

    Science.gov (United States)

    Alacron, Vladimir J.; Nigro, Joseph D.; McAnally, William H.; OHara, Charles G.; Engman, Edwin Ted; Toll, David

    2011-01-01

    This paper documents the use of simulated Moderate Resolution Imaging Spectroradiometer land use/land cover (MODIS-LULC), NASA-LIS generated precipitation and evapo-transpiration (ET), and Shuttle Radar Topography Mission (SRTM) datasets (in conjunction with standard land use, topographical and meteorological datasets) as input to hydrological models routinely used by the watershed hydrology modeling community. The study is focused in coastal watersheds in the Mississippi Gulf Coast although one of the test cases focuses in an inland watershed located in northeastern State of Mississippi, USA. The decision support tools (DSTs) into which the NASA datasets were assimilated were the Soil Water & Assessment Tool (SWAT) and the Hydrological Simulation Program FORTRAN (HSPF). These DSTs are endorsed by several US government agencies (EPA, FEMA, USGS) for water resources management strategies. These models use physiographic and meteorological data extensively. Precipitation gages and USGS gage stations in the region were used to calibrate several HSPF and SWAT model applications. Land use and topographical datasets were swapped to assess model output sensitivities. NASA-LIS meteorological data were introduced in the calibrated model applications for simulation of watershed hydrology for a time period in which no weather data were available (1997-2006). The performance of the NASA datasets in the context of hydrological modeling was assessed through comparison of measured and model-simulated hydrographs. Overall, NASA datasets were as useful as standard land use, topographical , and meteorological datasets. Moreover, NASA datasets were used for performing analyses that the standard datasets could not made possible, e.g., introduction of land use dynamics into hydrological simulations

  16. FASTQSim: platform-independent data characterization and in silico read generation for NGS datasets.

    Science.gov (United States)

    Shcherbina, Anna

    2014-08-15

    High-throughput next generation sequencing technologies have enabled rapid characterization of clinical and environmental samples. Consequently, the largest bottleneck to actionable data has become sample processing and bioinformatics analysis, creating a need for accurate and rapid algorithms to process genetic data. Perfectly characterized in silico datasets are a useful tool for evaluating the performance of such algorithms. Background contaminating organisms are observed in sequenced mixtures of organisms. In silico samples provide exact truth. To create the best value for evaluating algorithms, in silico data should mimic actual sequencer data as closely as possible. FASTQSim is a tool that provides the dual functionality of NGS dataset characterization and metagenomic data generation. FASTQSim is sequencing platform-independent, and computes distributions of read length, quality scores, indel rates, single point mutation rates, indel size, and similar statistics for any sequencing platform. To create training or testing datasets, FASTQSim has the ability to convert target sequences into in silico reads with specific error profiles obtained in the characterization step. FASTQSim enables users to assess the quality of NGS datasets. The tool provides information about read length, read quality, repetitive and non-repetitive indel profiles, and single base pair substitutions. FASTQSim allows the user to simulate individual read datasets that can be used as standardized test scenarios for planning sequencing projects or for benchmarking metagenomic software. In this regard, in silico datasets generated with the FASTQsim tool hold several advantages over natural datasets: they are sequencing platform independent, extremely well characterized, and less expensive to generate. Such datasets are valuable in a number of applications, including the training of assemblers for multiple platforms, benchmarking bioinformatics algorithm performance, and creating challenge

  17. Spatially-explicit estimation of geographical representation in large-scale species distribution datasets.

    Science.gov (United States)

    Kalwij, Jesse M; Robertson, Mark P; Ronk, Argo; Zobel, Martin; Pärtel, Meelis

    2014-01-01

    Much ecological research relies on existing multispecies distribution datasets. Such datasets, however, can vary considerably in quality, extent, resolution or taxonomic coverage. We provide a framework for a spatially-explicit evaluation of geographical representation within large-scale species distribution datasets, using the comparison of an occurrence atlas with a range atlas dataset as a working example. Specifically, we compared occurrence maps for 3773 taxa from the widely-used Atlas Florae Europaeae (AFE) with digitised range maps for 2049 taxa of the lesser-known Atlas of North European Vascular Plants. We calculated the level of agreement at a 50-km spatial resolution using average latitudinal and longitudinal species range, and area of occupancy. Agreement in species distribution was calculated and mapped using Jaccard similarity index and a reduced major axis (RMA) regression analysis of species richness between the entire atlases (5221 taxa in total) and between co-occurring species (601 taxa). We found no difference in distribution ranges or in the area of occupancy frequency distribution, indicating that atlases were sufficiently overlapping for a valid comparison. The similarity index map showed high levels of agreement for central, western, and northern Europe. The RMA regression confirmed that geographical representation of AFE was low in areas with a sparse data recording history (e.g., Russia, Belarus and the Ukraine). For co-occurring species in south-eastern Europe, however, the Atlas of North European Vascular Plants showed remarkably higher richness estimations. Geographical representation of atlas data can be much more heterogeneous than often assumed. Level of agreement between datasets can be used to evaluate geographical representation within datasets. Merging atlases into a single dataset is worthwhile in spite of methodological differences, and helps to fill gaps in our knowledge of species distribution ranges. Species distribution

  18. EL MST Y LAS DISPUTAS POR LAS ALTERNATIVAS EN BRASIL

    Directory of Open Access Journals (Sweden)

    João Pedro Stédile

    2012-04-01

    Full Text Available La victoria del presidente Lulaen las últimas elecciones cambió la correlaciónde fuerzas de la lucha por la reforma agraria enBrasil. Por los compromisos históricos del PT,su liderazgo y como partido de izquierda,tenemos ahora un gobierno federal que apoyala reforma agraria, al contrario de lo que fue elgobierno de Fernando Henrique Cardoso. Porlo tanto, la disputa se sitúa en otro nivel. Sinembargo, hay otras fuerzas poderosas que seoponen a la reforma agraria como lo son ellatifundio, el modelo neoliberal del agro-negocio, la clase dominante como un todo, elEstado burgués brasileño y los medios decomunicación que actúan como un espacio delucha ideológica que disputa la hegemonía en lasociedad contra nosotros. En este nuevocontexto, el MST evalúa que ahora sí se puedeavanzar en la reforma agraria, pero que es unmomento de acumulación de fuerzas y no degrandes definiciones que consoliden la reformaagraria de nuevo tipo que nosotrosdefendemos. Es decir, estamos acumulandopara el futuro.

  19. Medical Image Data and Datasets in the Era of Machine Learning-Whitepaper from the 2016 C-MIMI Meeting Dataset Session.

    Science.gov (United States)

    Kohli, Marc D; Summers, Ronald M; Geis, J Raymond

    2017-08-01

    At the first annual Conference on Machine Intelligence in Medical Imaging (C-MIMI), held in September 2016, a conference session on medical image data and datasets for machine learning identified multiple issues. The common theme from attendees was that everyone participating in medical image evaluation with machine learning is data starved. There is an urgent need to find better ways to collect, annotate, and reuse medical imaging data. Unique domain issues with medical image datasets require further study, development, and dissemination of best practices and standards, and a coordinated effort among medical imaging domain experts, medical imaging informaticists, government and industry data scientists, and interested commercial, academic, and government entities. High-level attributes of reusable medical image datasets suitable to train, test, validate, verify, and regulate ML products should be better described. NIH and other government agencies should promote and, where applicable, enforce, access to medical image datasets. We should improve communication among medical imaging domain experts, medical imaging informaticists, academic clinical and basic science researchers, government and industry data scientists, and interested commercial entities.

  20. Las decisiones de los hogares en Venezuela

    Directory of Open Access Journals (Sweden)

    Marakah Mancini

    2008-06-01

    Full Text Available Este artículo analiza quién toma las decisiones en los hogares y los factores que afectan la decisión. Esta investigación se condujo en Venezuela, donde se preguntó a las mujeres sobre las decisiones de los hogares en cuatro áreas. Los resultados indican que la mayoría de los hogares toman decisiones conjuntamente. Excluyendo esta categoría, la mayoría de las mujeres toman decisiones concernientes a la compra de bienes y la educación de los hijos, mientras que los hombres dominan las decisiones acerca de las finanzas del hogar y el cambio de residencia. También se encontró que la mayoría de las parejas que trabajan comparten los gastos del hogar. Además, se identifican factores que afectan el poder de decisión de las mujeres como la participación femenina en el mercado laboral y la edad, no obstante, el nivel de educación no afecta su poder decisión.

  1. jóvenes en las videosalas

    Directory of Open Access Journals (Sweden)

    Fernando Huerta Rojas

    2005-01-01

    Full Text Available En este trabajo planteo una serie de primeras reflexiones con relación a una de las formas como los jóvenes aprenden, introyectan y practican la violencia de género: el juego. Éste, como institución política, relación social y práctica cultural, es uno de los escenarios pedagógicos donde los hombres expresan y significan el desideratum cultural y la asunción genérica de su condición masculina. Para el caso que nos ocupa, los jóvenes tienen en los juegos virtuales la práctica, las tecnologías y los espacios de socialización, aculturación e interacción contemporánea y globalizada en los que crean, recrean y simbolizan las identidades, las subjetividades, las sexualidades, las estéticas y las culturas juveniles como parte de la experiencia cyborg. En este sentido, la competencia es una de las características del juego, lo cual, como práctica cultural histórica, ha desarrollado un sentido de rivalidad con altos contenidos y significaciones de violencia.

  2. Kunstimeka Las Vegases / Maria-Kristiina Soomre

    Index Scriptorium Estoniae

    Soomre, Maria-Kristiina, 1978-

    2001-01-01

    Las Vegase kasiino-hotellis The Venetian avatud Guggenheim-Hermitage Museum'ist, selle avanäitusest "Meistriteosed ja meisterkogujad" ja Guggenheim Las Vegas'est, mis avati Frank Gehry kujundatud rändnäitusega "Mootorrataste kunst". Muuseumiruumide arhitekt Rem Koolhaas

  3. Las construcciones con esqueleto en madera

    Directory of Open Access Journals (Sweden)

    Heinz Leser S.

    1984-06-01

    Estos sistemas se emplean preferentemente en viviendas prefabricadas y edificios de pequeña envergadura. Finalmente, consideramos las construcciones con "esqueleto en madera", cuyo campo de aplicación principal está en las edificaciones de mediana extensión: escuelas, centros comunitarios, pequeñas industrias o bloques habitacionales.

  4. CAMBIO DE LAS INSTITUCIONES PARA EL DESARROLLO

    Directory of Open Access Journals (Sweden)

    Antonio Vázquez Barquero

    2006-06-01

    Full Text Available Existe un amplio acuerdo sobre que el funcionamiento de las instituciones determina la evolución de las economías y la senda específica de crecimiento de las ciudades y países; es decir, que las normas y reglas, formales (como los contratos y los acuerdos entre empresas y actores pero también informales (como los códigos de conducta y las convenciones, existentes en cada sociedad, juegan un papel estratégico en los procesos de desarrollo. Las empresas y las organizaciones toman sus decisiones de inversión en un entorno institucional y las realizan a través de un sistema de relaciones e interacciones con otras empresas y organizaciones, que forman el sistema institucional, lo que afecta a los resultados de la inversión, y, por lo tanto, al comportamiento de la productividad y al propio proceso de desarrollo económico (Vázquez Barquero, 2005.  

  5. Las radiaciones ionizantes: una realidad cotidiana

    Directory of Open Access Journals (Sweden)

    Eduardo Gallego Díaz

    2010-12-01

    Full Text Available Este trabajo introduce la naturaleza de las sustancias radiactivas y de la radiación ionizante, los efectos que causa sobre la materia y los medios disponibles para su detección y medida, así como las fuentes de radiación naturales a las que los seres humanos estamos expuestos. Seguidamente, en el apartado más amplio del trabajo, se describen las múltiples aplicaciones de las radiaciones ionizantes en la medicina, la agricultura, la industria, las ciencias de la tierra, la biología y otras ramas, lo que permite poder poner su impacto en perspectiva frente al de las fuentes naturales. La tesis final del artículo es que para evitar sufrir daños resulta necesario protegerse adecuadamente de los efectos nocivos de la radiación y las sustancias radiactivas, pero sin limitar innecesariamente su utilización beneficiosa en los numerosos ámbitos descritos. Ese es el objetivo fundamental de la protección radiológica, cuyos principios básicos se presentan para terminar.

  6. de la historia de las ciencias

    Directory of Open Access Journals (Sweden)

    José Antonio Acevedo-Díaz

    2004-01-01

    Full Text Available La metodología científica incluye aspectos como la capacidad de invención de hipótesis y modelos, la creatividad y el uso del razonamiento analógico, entre otros muchos más. El presente artículo se ocupa del papel de las analogías en el pensamiento creativo de los científicos, aplicado a un caso paradigmático de la historia de las ciencias del siglo XIX, como fue el desarrollo de la teoría del campo electromagnético de Maxwell, que daría lugar a una de las grandes síntesis de la física clásica: la de los fenómenos ópticos, eléctricos y magnéticos. Ilustrado con las palabras del principal protagonista, las de otros físicos de la época y las de historiadores de las ciencias que se han ocupado de este tema, el artículo muestra el exhaustivo uso que Maxwell hizo de las analogías y el razonamiento analógico en su intento de conseguir sus más importantes propósitos científicos

  7. Nosotras las obreras. Huelga en Vanytex, 1976

    Directory of Open Access Journals (Sweden)

    Ricardo Sánchez Ángel

    2008-01-01

    Full Text Available Este artículo constituye una microhistoria sobre la huelga de las trabajadoras de Vanytex, mostrando las dinámicas de género, económico-sociales y las contradicciones que llevaron a la declaratoria del cese de actividades. Se trata de un análisis de los sucesos y del contexto nacional en que se va a desarrollar el movimiento huelguístico, privilegiando la acción de las protagonistas, sus formas de lucha y organización y los enfoques políticos en las consignas utilizadas en su acción colectiva. El conflicto se enmarca en un momento particular de auge huelguístico nacional, que permitió realizar alianzas y confluencias solidarias con otros movimientos de protesta social.

  8. Map Coordinate Referencing and the use of GPS Datasets in Ghana ...

    African Journals Online (AJOL)

    Map Coordinate Referencing and the use of GPS Datasets in Ghana. ... Journal of Science and Technology (Ghana) ... systems used in Ghana (the Ghana war office system and also the Clarke1880 system) using the Bursa-Wolf model.

  9. Climate Prediction Center(CPC)Infra-Red (IR) 0.5 degree Dataset

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Climate Prediction Center 0.5 degree IR dataset was created from all available individual geostationary satellite data which have been merged to form nearly seamless...

  10. USGS HYDRoacoustic dataset in support of the Surface Water Oceanographic Topography satellite mission (HYDRoSWOT)

    Data.gov (United States)

    Department of the Interior — HYDRoSWOT – HYDRoacoustic dataset in support of Surface Water Oceanographic Topography – is a data set that aggregates channel and flow data collected from the USGS...

  11. Vector Nonlinear Time-Series Analysis of Gamma-Ray Burst Datasets on Heterogeneous Clusters

    Directory of Open Access Journals (Sweden)

    Ioana Banicescu

    2005-01-01

    Full Text Available The simultaneous analysis of a number of related datasets using a single statistical model is an important problem in statistical computing. A parameterized statistical model is to be fitted on multiple datasets and tested for goodness of fit within a fixed analytical framework. Definitive conclusions are hopefully achieved by analyzing the datasets together. This paper proposes a strategy for the efficient execution of this type of analysis on heterogeneous clusters. Based on partitioning processors into groups for efficient communications and a dynamic loop scheduling approach for load balancing, the strategy addresses the variability of the computational loads of the datasets, as well as the unpredictable irregularities of the cluster environment. Results from preliminary tests of using this strategy to fit gamma-ray burst time profiles with vector functional coefficient autoregressive models on 64 processors of a general purpose Linux cluster demonstrate the effectiveness of the strategy.

  12. Dataset of Passerine bird communities in a Mediterranean high mountain (Sierra Nevada, Spain)

    Science.gov (United States)

    Pérez-Luque, Antonio Jesús; Barea-Azcón, José Miguel; Álvarez-Ruiz, Lola; Bonet-García, Francisco Javier; Zamora, Regino

    2016-01-01

    Abstract In this data paper, a dataset of passerine bird communities is described in Sierra Nevada, a Mediterranean high mountain located in southern Spain. The dataset includes occurrence data from bird surveys conducted in four representative ecosystem types of Sierra Nevada from 2008 to 2015. For each visit, bird species numbers as well as distance to the transect line were recorded. A total of 27847 occurrence records were compiled with accompanying measurements on distance to the transect and animal counts. All records are of species in the order Passeriformes. Records of 16 different families and 44 genera were collected. Some of the taxa in the dataset are included in the European Red List. This dataset belongs to the Sierra Nevada Global-Change Observatory (OBSNEV), a long-term research project designed to compile socio-ecological information on the major ecosystem types in order to identify the impacts of global change in this area. PMID:26865820

  13. Telephone Interpreter Services (TIS)-Asian and Pacific Islander (API) Language Yearly Dataset

    Data.gov (United States)

    Social Security Administration — This dataset displays our national TIS call volume for over 45 API languages for fiscal year 2011 onward. A fiscal year runs from October through September. We will...

  14. Telephone Interpreter Services (TIS) - Asian and Pacific Islander (API) Language Fiscal Year Quarterly Dataset

    Data.gov (United States)

    Social Security Administration — This dataset displays our quarterly national TIS call volume for over 45 API languages for fiscal year 2013 onward. A fiscal year runs from October through September...

  15. Dataset of Passerine bird communities in a Mediterranean high mountain (Sierra Nevada, Spain).

    Science.gov (United States)

    Pérez-Luque, Antonio Jesús; Barea-Azcón, José Miguel; Álvarez-Ruiz, Lola; Bonet-García, Francisco Javier; Zamora, Regino

    2016-01-01

    In this data paper, a dataset of passerine bird communities is described in Sierra Nevada, a Mediterranean high mountain located in southern Spain. The dataset includes occurrence data from bird surveys conducted in four representative ecosystem types of Sierra Nevada from 2008 to 2015. For each visit, bird species numbers as well as distance to the transect line were recorded. A total of 27847 occurrence records were compiled with accompanying measurements on distance to the transect and animal counts. All records are of species in the order Passeriformes. Records of 16 different families and 44 genera were collected. Some of the taxa in the dataset are included in the European Red List. This dataset belongs to the Sierra Nevada Global-Change Observatory (OBSNEV), a long-term research project designed to compile socio-ecological information on the major ecosystem types in order to identify the impacts of global change in this area.

  16. A multimodal dataset for authoring and editing multimedia content: The MAMEM project

    Directory of Open Access Journals (Sweden)

    Spiros Nikolopoulos

    2017-12-01

    Full Text Available We present a dataset that combines multimodal biosignals and eye tracking information gathered under a human-computer interaction framework. The dataset was developed in the vein of the MAMEM project that aims to endow people with motor disabilities with the ability to edit and author multimedia content through mental commands and gaze activity. The dataset includes EEG, eye-tracking, and physiological (GSR and Heart rate signals collected from 34 individuals (18 able-bodied and 16 motor-impaired. Data were collected during the interaction with specifically designed interface for web browsing and multimedia content manipulation and during imaginary movement tasks. The presented dataset will contribute towards the development and evaluation of modern human-computer interaction systems that would foster the integration of people with severe motor impairments back into society.

  17. Toxics Release Inventory Chemical Hazard Information Profiles (TRI-CHIP) Dataset

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Toxics Release Inventory (TRI) Chemical Hazard Information Profiles (TRI-CHIP) dataset contains hazard information about the chemicals reported in TRI. Users can...

  18. Integration of geophysical datasets by a conjoint probability tomography approach: application to Italian active volcanic areas

    Directory of Open Access Journals (Sweden)

    D. Patella

    2008-06-01

    Full Text Available We expand the theory of probability tomography to the integration of different geophysical datasets. The aim of the new method is to improve the information quality using a conjoint occurrence probability function addressed to highlight the existence of common sources of anomalies. The new method is tested on gravity, magnetic and self-potential datasets collected in the volcanic area of Mt. Vesuvius (Naples, and on gravity and dipole geoelectrical datasets collected in the volcanic area of Mt. Etna (Sicily. The application demonstrates that, from a probabilistic point of view, the integrated analysis can delineate the signature of some important volcanic targets better than the analysis of the tomographic image of each dataset considered separately.

  19. Potential Impacts of Climate Change on World Food Supply: Datasets from a Major Crop Modeling Study

    Data.gov (United States)

    National Aeronautics and Space Administration — Datasets from a Major Crop Modeling Study contain projected country and regional changes in grain crop yields due to global climate change. Equilibrium and transient...

  20. CoVennTree: A new method for the comparative analysis of large datasets

    Directory of Open Access Journals (Sweden)

    Steffen C. Lott

    2015-02-01

    Full Text Available The visualization of massive datasets, such as those resulting from comparative metatranscriptome analyses or the analysis of microbial population structures using ribosomal RNA sequences, is a challenging task. We developed a new method called CoVennTree (Comparative weighted Venn Tree that simultaneously compares up to three multifarious datasets by aggregating and propagating information from the bottom to the top level and produces a graphical output in Cytoscape. With the introduction of weighted Venn structures, the contents and relationships of various datasets can be correlated and simultaneously aggregated without losing information. We demonstrate the suitability of this approach using a dataset of 16S rDNA sequences obtained from microbial populations at three different depths of the Gulf of Aqaba in the Red Sea. CoVennTree has been integrated into the Galaxy ToolShed and can be directly downloaded and integrated into the user instance.