Recommending Multidimensional Queries
Giacometti, Arnaud; Marcel, Patrick; Negre, Elsa
Interactive analysis of datacube, in which a user navigates a cube by launching a sequence of queries is often tedious since the user may have no idea of what the forthcoming query should be in his current analysis. To better support this process we propose in this paper to apply a Collaborative Work approach that leverages former explorations of the cube to recommend OLAP queries. The system that we have developed adapts Approximate String Matching, a technique popular in Information Retrieval, to match the current analysis with the former explorations and help suggesting a query to the user. Our approach has been implemented with the open source Mondrian OLAP server to recommend MDX queries and we have carried out some preliminary experiments that show its efficiency for generating effective query recommendations.
Recommendation Sets and Choice Queries
DEFF Research Database (Denmark)
Viappiani, Paolo Renato; Boutilier, Craig
2011-01-01
Utility elicitation is an important component of many applications, such as decision support systems and recommender systems. Such systems query users about their preferences and offer recommendations based on the system's belief about the user's utility function. We analyze the connection between...... the problem of generating optimal recommendation sets and the problem of generating optimal choice queries, considering both Bayesian and regret-based elicitation. Our results show that, somewhat surprisingly, under very general circumstances, the optimal recommendation set coincides with the optimal query....
Keyword Query Expansion Paradigm Based on Recommendation and Interpretation in Relational Databases
Directory of Open Access Journals (Sweden)
Yingqi Wang
2017-01-01
Full Text Available Due to the ambiguity and impreciseness of keyword query in relational databases, the research on keyword query expansion has attracted wide attention. Existing query expansion methods expose users’ query intention to a certain extent, but most of them cannot balance the precision and recall. To address this problem, a novel two-step query expansion approach is proposed based on query recommendation and query interpretation. First, a probabilistic recommendation algorithm is put forward by constructing a term similarity matrix and Viterbi model. Second, by using the translation algorithm of triples and construction algorithm of query subgraphs, query keywords are translated to query subgraphs with structural and semantic information. Finally, experimental results on a real-world dataset demonstrate the effectiveness and rationality of the proposed method.
Query recommendation for children
Duarte Torres, Sergio; Hiemstra, Djoerd; Weber, Ingmar; Serdyukov, Pavel
2012-01-01
One of the biggest problems that children experience while searching the web occurs during the query formulation process. Children have been found to struggle formulating queries based on keywords given their limited vocabulary and their difficulty to choose the right keywords. In this work we
Estudio comparativo entre SIG propietario y SIG libre
Mesa Díaz, Juan Ramón
2008-01-01
Estudio comparativo entre SIG propietario y SIG libre, focalizado en los casos particulares de Geomedia Pro (SIG Propietario) y gvSIG (SIG Libre). En el estudio se procede a determinar cuáles son los aspectos destacables de un SIG, para poder evaluarlos, posteriormente, en los dos SIG objeto del estudio y obtener una ponderación definitoria de cada SIG. A continuación, algunos de los aspectos evaluados en cada SIG: interoperabilidad, conexión a bases de datos espaciales, aspectos económ...
Hanauer, David A; Wu, Danny T Y; Yang, Lei; Mei, Qiaozhu; Murkowski-Steffy, Katherine B; Vydiswaran, V G Vinod; Zheng, Kai
2017-03-01
The utility of biomedical information retrieval environments can be severely limited when users lack expertise in constructing effective search queries. To address this issue, we developed a computer-based query recommendation algorithm that suggests semantically interchangeable terms based on an initial user-entered query. In this study, we assessed the value of this approach, which has broad applicability in biomedical information retrieval, by demonstrating its application as part of a search engine that facilitates retrieval of information from electronic health records (EHRs). The query recommendation algorithm utilizes MetaMap to identify medical concepts from search queries and indexed EHR documents. Synonym variants from UMLS are used to expand the concepts along with a synonym set curated from historical EHR search logs. The empirical study involved 33 clinicians and staff who evaluated the system through a set of simulated EHR search tasks. User acceptance was assessed using the widely used technology acceptance model. The search engine's performance was rated consistently higher with the query recommendation feature turned on vs. off. The relevance of computer-recommended search terms was also rated high, and in most cases the participants had not thought of these terms on their own. The questions on perceived usefulness and perceived ease of use received overwhelmingly positive responses. A vast majority of the participants wanted the query recommendation feature to be available to assist in their day-to-day EHR search tasks. Challenges persist for users to construct effective search queries when retrieving information from biomedical documents including those from EHRs. This study demonstrates that semantically-based query recommendation is a viable solution to addressing this challenge. Published by Elsevier Inc.
Kirk, David G.; Zhang, Zhen; Korkeala, Hannu; Lindström, Miia
2014-01-01
Clostridium botulinum produces heat-resistant endospores that may germinate and outgrow into neurotoxic cultures in foods. Sporulation is regulated by the transcription factor Spo0A and the alternative sigma factors SigF, SigE, SigG, and SigK in most spore formers studied to date. We constructed mutants of sigF, sigE, and sigG in C. botulinum ATCC 3502 and used quantitative reverse transcriptase PCR and electron microscopy to assess their expression of the sporulation pathway on transcription...
Baseline Analyses of SIG Applications and SIG-Eligible and SIG-Awarded Schools. NCEE 2011-4019
Hurlburt, Steven; Le Floch, Kerstin Carlson; Therriault, Susan Bowles; Cole, Susan
2011-01-01
The Study of School Turnaround is an examination of the implementation of School Improvement Grants (SIG) authorized under Title I section 1003(g) of the "Elementary and Secondary Education Act" and supplemented by the "American Recovery and Reinvestment Act of 2009." "Baseline Analyses of SIG Applications and SIG-Eligible…
SigE Is a Chaperone for the Salmonella enterica Serovar Typhimurium Invasion Protein SigD
Darwin, K. Heran; Robinson, Lloyd S.; Miller, Virginia L.
2001-01-01
SigD is translocated into eucaryotic cells by a type III secretion system. In this work, evidence that the putative chaperone SigE directly interacts with SigD is presented. A bacterial two-hybrid system demonstrated that SigE can interact with itself and SigD. In addition, SigD was specifically copurified with SigE-His6 on a nickel column.
Roles of SigB and SigF in the Mycobacterium tuberculosis Sigma Factor Network▿ †
Lee, Jong-Hee; Karakousis, Petros C.; Bishai, William R.
2007-01-01
To characterize the roles of SigB and SigF in sigma factor regulation in Mycobacterium tuberculosis, we used chemically inducible recombinant strains to conditionally overexpress sigB and sigF. Using whole genomic microarray analysis and quantitative reverse transcription-PCR, we investigated the resulting global transcriptional changes after sigB induction, and we specifically tested the relative expression of other sigma factor genes after knock-in expression of sigB and sigF. Overexpressio...
DEFF Research Database (Denmark)
Horn, Heiko; Lawrence, Michael S; Chouinard, Candace R
2018-01-01
Methods that integrate molecular network information and tumor genome data could complement gene-based statistical tests to identify likely new cancer genes; but such approaches are challenging to validate at scale, and their predictive value remains unclear. We developed a robust statistic (Net......Sig) that integrates protein interaction networks with data from 4,742 tumor exomes. NetSig can accurately classify known driver genes in 60% of tested tumor types and predicts 62 new driver candidates. Using a quantitative experimental framework to determine in vivo tumorigenic potential in mice, we found that Net......Sig candidates induce tumors at rates that are comparable to those of known oncogenes and are ten-fold higher than those of random genes. By reanalyzing nine tumor-inducing NetSig candidates in 242 patients with oncogene-negative lung adenocarcinomas, we find that two (AKT2 and TFDP2) are significantly amplified...
SigWinR; the SigWin-detector updated and ported to R.
de Leeuw, Wim C; Rauwerda, Han; Inda, Márcia A; Bruning, Oskar; Breit, Timo M
2009-10-06
Our SigWin-detector discovers significantly enriched windows of (genomic) elements in any sequence of values (genes or other genomic elements in a DNA sequence) in a fast and reproducible way. However, since it is grid based, only (life) scientists with access to the grid can use this tool. Therefore and on request, we have developed the SigWinR package which makes the SigWin-detector available to a much wider audience. At the same time, we have introduced several improvements to its algorithm as well as its functionality, based on the feedback of SigWin-detector end users. To allow usage of the SigWin-detector on a desktop computer, we have rewritten it as a package for R: SigWinR. R is a free and widely used multi platform software environment for statistical computing and graphics. The package can be installed and used on all platforms for which R is available. The improvements involve: a visualization of the input-sequence values supporting the interpretation of Ridgeograms; a visualization allowing for an easy interpretation of enriched or depleted regions in the sequence using windows of pre-defined size; an option that allows the analysis of circular sequences, which results in rectangular Ridgeograms; an application to identify regions of co-altered gene expression (ROCAGEs) with a real-life biological use-case; adaptation of the algorithm to allow analysis of non-regularly sampled data using a constant window size in physical space without resampling the data. To achieve this, support for analysis of windows with an even number of elements was added. By porting the SigWin-detector as an R package, SigWinR, improving its algorithm and functionality combined with adequate performance, we have made SigWin-detector more useful as well as more easily accessible to scientists without a grid infrastructure.
SigWinR; the SigWin-detector updated and ported to R
Directory of Open Access Journals (Sweden)
Breit Timo M
2009-10-01
Full Text Available Abstract Background Our SigWin-detector discovers significantly enriched windows of (genomic elements in any sequence of values (genes or other genomic elements in a DNA sequence in a fast and reproducible way. However, since it is grid based, only (life scientists with access to the grid can use this tool. Therefore and on request, we have developed the SigWinR package which makes the SigWin-detector available to a much wider audience. At the same time, we have introduced several improvements to its algorithm as well as its functionality, based on the feedback of SigWin-detector end users. Findings To allow usage of the SigWin-detector on a desktop computer, we have rewritten it as a package for R: SigWinR. R is a free and widely used multi platform software environment for statistical computing and graphics. The package can be installed and used on all platforms for which R is available. The improvements involve: a visualization of the input-sequence values supporting the interpretation of Ridgeograms; a visualization allowing for an easy interpretation of enriched or depleted regions in the sequence using windows of pre-defined size; an option that allows the analysis of circular sequences, which results in rectangular Ridgeograms; an application to identify regions of co-altered gene expression (ROCAGEs with a real-life biological use-case; adaptation of the algorithm to allow analysis of non-regularly sampled data using a constant window size in physical space without resampling the data. To achieve this, support for analysis of windows with an even number of elements was added. Conclusion By porting the SigWin-detector as an R package, SigWinR, improving its algorithm and functionality combined with adequate performance, we have made SigWin-detector more useful as well as more easily accessible to scientists without a grid infrastructure.
Desarrollo de un SIG para dispositivos móviles utilizando gvSIG Mobile
Pérez Álvarez, Francisco
2012-01-01
Este proyecto se inicia con la introducción al mundo de los SIG basados en software de código abierto, como es el caso del gvSIG Mobile y gvSIG Desktop. Basándonos en estos dos programas, hemos creado una aplicación SIG para dispositivos móviles (PDA’s y smartphones) gracias a la cual, será posible actualizar cartografía en tiempo real directamente en campo. Para introducirse de pleno en el tema del proyecto, se realizó un análisis detallado sobre las necesidades que podrían existir a l...
Smart Query Answering for Marine Sensor Data
Directory of Open Access Journals (Sweden)
Paulo de Souza
2011-03-01
Full Text Available We review existing query answering systems for sensor data. We then propose an extended query answering approach termed smart query, specifically for marine sensor data. The smart query answering system integrates pattern queries and continuous queries. The proposed smart query system considers both streaming data and historical data from marine sensor networks. The smart query also uses query relaxation technique and semantics from domain knowledge as a recommender system. The proposed smart query benefits in building data and information systems for marine sensor networks.
Smart query answering for marine sensor data.
Shahriar, Md Sumon; de Souza, Paulo; Timms, Greg
2011-01-01
We review existing query answering systems for sensor data. We then propose an extended query answering approach termed smart query, specifically for marine sensor data. The smart query answering system integrates pattern queries and continuous queries. The proposed smart query system considers both streaming data and historical data from marine sensor networks. The smart query also uses query relaxation technique and semantics from domain knowledge as a recommender system. The proposed smart query benefits in building data and information systems for marine sensor networks.
DEFF Research Database (Denmark)
Nielsen, Mette Lykke; Katznelson, Noemi
Når fremtiden tegner sig er en rapport om unge i et yderkantsområder i Danmark. Den giver indblik i nogle af de subjektive, kulturelle og samfundsmæssige mekanismer der kan få betydning for, hvordan fremtiden tegner sig for unge.......Når fremtiden tegner sig er en rapport om unge i et yderkantsområder i Danmark. Den giver indblik i nogle af de subjektive, kulturelle og samfundsmæssige mekanismer der kan få betydning for, hvordan fremtiden tegner sig for unge....
Sig, sig selv> og KorpusDK - hvorfor det er svært både at skælde sig ud og at skille sig selv ud
DEFF Research Database (Denmark)
Ehlers, Katrine Rosendal; Vikner, Sten
2017-01-01
Denne artikel vil undersøge det danske refleksivsystem, baseret på den antagelse at der er to forskellige typer betingelser angående koreference som hver af de fire refleksive/non-refleksive pronominaltyper i dansk, dvs. sig/sig selv/hende/hende selv skal opfylde (jf. Vikner 1985). En type beting...
EduSIG: gvSIG aplicado a la enseñanza de la geografía
Bermejo Domínguez, Juan A.; Anguix Alfaro, Álvaro; Juncos, Raúl
2009-01-01
EduSIG parte de la idea de disponer de un SIG como herramienta educativa para el aprendizaje de la geografía. Por un lado EduSIG consiste en un gvSIG más simple, sin herramientas complejas o muy técnicas, que permite navegar, consultar, construir y entender los mapas sin necesitar ninguna formación en Sistemas de Información Geográfica. Por otro lado pretende dar una visión didáctica de la enseñanza de la geografía, incluyendo tanto vistas temáticas predefinidas y diverso...
Song, Taeksun; Song, Seung-Eun; Raman, Sahadevan; Anaya, Mauricio; Husson, Robert N.
2008-01-01
Mycobacterial SigE and SigH both initiate transcription from the sigB promoter, suggesting that they recognize similar sequences. Through mutational and primer extension analyses, we determined that SigE and SigH recognize nearly identical promoters, with differences at the 3′ end of the −35 element distinguishing between SigE- and SigH-dependent promoters.
Federated query processing for the semantic web
Buil-Aranda, C
2014-01-01
During the last years, the amount of RDF data has increased exponentially over the Web, exposed via SPARQL endpoints. These SPARQL endpoints allow users to direct SPARQL queries to the RDF data. Federated SPARQL query processing allows to query several of these RDF databases as if they were a single one, integrating the results from all of them. This is a key concept in the Web of Data and it is also a hot topic in the community. Besides of that, the W3C SPARQL-WG has standardized it in the new Recommendation SPARQL 1.1.This book provides a formalisation of the W3C proposed recommendation. Thi
Ray, Beverly; Faure, Caroline; Kelle, Fay
2013-01-01
This paper examines how Social Impact Games (SIGs) can provide important instructional support in secondary social studies classrooms. When used within the framework of the constructivist teaching philosophy and teaching methods, as recommended by the NCSS (2010), SIGs have the potential to hone critical thinking, collaboration, and problem…
BioSig - An application of Octave
Schlögl, Alois
2006-01-01
BioSig is an open source software library for biomedical signal processing. Most users in the field are using Matlab; however, significant effort was undertaken to provide compatibility to Octave, too. This effort has been widely successful, only some non-critical components relying on a graphical user interface are missing. Now, installing BioSig on Octave is as easy as on Matlab. Moreover, a benchmark test based on BioSig has been developed and the benchmark results of several platforms are...
Ungdomslitteratur former(er) sig
DEFF Research Database (Denmark)
Henkel, Ayoe Qvist
2016-01-01
Igennem en 'mediesensitiv' analyse af romanen "Akavet" af Ronnie Andersen (2014) og perspektivering til andre aktuelle romaner for og med unge undersøger artiklen, hvordan ungdomslitteratur udvikler sig i dialog med digitale og mediebaserede impulser, og hvilke konsekvenser for udsigelserne om...... ungdomsliv og ungdomslitteraturens æstetik og mulige egenart, denne udvikling har. Artiklen baserer sig på en materialitetstilgang særligt inspireret af N. Katherine Hayles, som ikke tidligere har fået opmærksomhed i læsninger af ungdomslitteratur eller i diskussioner af ungdomslitteraturens mulige egenart......, at ungdomslitteratur drejer sig om unges udviklingsproces fra barndom og til voksenhed og dermed skildrer en overgang præget af linearitet, modning og vækst. Artiklen konkluderer, at der er ungdomslitteratur, der realiseres på andre præmisser, og Akavet kan ses som eksponent for en bevægelse fra ungdomslitteratur som...
Flexible Query Answering Systems
DEFF Research Database (Denmark)
This book constitutes the refereed proceedings of the 10th International Conference on Flexible Query Answering Systems, FQAS 2013, held in Granada, Spain, in September 2013. The 59 full papers included in this volume were carefully reviewed and selected from numerous submissions. The papers...... are organized in a general session train and a parallel special session track. The general session train covers the following topics: querying-answering systems; semantic technology; patterns and classification; personalization and recommender systems; searching and ranking; and Web and human...
[Genome similarity of Baikal omul and sig].
Bychenko, O S; Sukhanova, L V; Ukolova, S S; Skvortsov, T A; Potapov, V K; Azhikina, T L; Sverdlov, E D
2009-01-01
Two members of the Baikal sig family, a lake sig (Coregonus lavaretus baicalensis Dybovsky) and omul (C. autumnalis migratorius Georgi), are close relatives that diverged from the same ancestor 10-20 thousand years ago. In this work, we studied genomic polymorphism of these two fish species. The method of subtraction hybridization (SH) did not reveal the presence of extended sequences in the sig genome and their absence in the omul genome. All the fragments found by SH corresponded to polymorphous noncoding genome regions varying in mononucleotide substitutions and short deletions. Many of them are mapped close to genes of the immune system and have regions identical to the Tc-1-like transposons abundant among fish, whose transcription activity may affect the expression of adjacent genes. Thus, we showed for the first time that genetic differences between Baikal sig family members are extremely small and cannot be revealed by the SH method. This is another endorsement of the hypothesis on the close relationship between Baikal sig and omul and their evolutionarily recent divergence from a common ancestor.
Rediscovering Sig Socransky, the genius and his legacy.
Teles, R P; Teles, F R F; Loesche, W J; Listgarten, M; Fine, D; Lindhe, J; Malament, K; Haffajee, A D
2012-05-01
Some individuals make contributions so vital to their field of knowledge that their names become almost synonymous with that field. This is the case of Sig Socransky and the field of periodontal microbiology. Sig Socransky, or simply Sig, was born in Toronto, Canada and received his DDS degree from the University of Toronto in 1957. He studied microbiology and periodontology at Harvard, receiving a certificate in 1961. That same year he was recruited to work as a Research Associate at the Forsyth Dental Center. In 1968, he was nominated Senior Member of the Staff and Head of the Department of Periodontology. During his 50-year career at Forsyth, Sig published over 300 manuscripts, keeping an average of 7 publications per year. His work had an indelible impact in the fields of periodontology and oral microbiology. All these accomplishments pale in comparison with the impact that Sig had on a personal level. We have collected testimonials from some of his former students, closest collaborators, and friends in an attempt to give readers an insight into Sig's personality. We hope we can offer those who knew him through his work a glimpse of how it felt to interact with this remarkable individual.
SM4MQ: A Semantic Model for Multidimensional Queries
DEFF Research Database (Denmark)
Varga, Jovan; Dobrokhotova, Ekaterina; Romero, Oscar
2017-01-01
metadata artifacts (e.g., queries) to assist users with the analysis. However, modeling and sharing of most of these artifacts are typically overlooked. Thus, in this paper we focus on the query metadata artifact in the Exploratory OLAP context and propose an RDF-based vocabulary for its representation......, sharing, and reuse on the SW. As OLAP is based on the underlying multidimensional (MD) data model we denote such queries as MD queries and define SM4MQ: A Semantic Model for Multidimensional Queries. Furthermore, we propose a method to automate the exploitation of queries by means of SPARQL. We apply...... the method to a use case of transforming queries from SM4MQ to a vector representation. For the use case, we developed the prototype and performed an evaluation that shows how our approach can significantly ease and support user assistance such as query recommendation....
Antibiotic resistance of canine Staphylococcus intermedius group (SIG)--practical implications.
Chrobak, D; Kizerwetter-Swida, M; Rzewuska, M; Binek, M
2011-01-01
A total of 221 SIG strains were isolated from clinical samples of canine origin submitted to the Diagnostic Laboratory of the Division of Bacteriology and Molecular Biology at the Warsaw University of Life Sciences in Warsaw during the period 2006-2010. The aim of the study was to investigate the frequency of prevalence of methicillin-resistant SIG strains and to determine the MIC values of cephalotin, amoxicillin/clavulanic acid, ciprofloxacin, clindamycin, gentamicin, chloramphenicol, mupirocin for a collection of randomly selected 79 strains belonging to Staphylococcus intermedius group (SIG), including 23 mecA-positive and 56 mecA-negative strains. All isolates were identified as belonging to SIG based on their phenotypic properties and PCR amplification of S. intermedius-specific fragment of the 16S rRNA gene. The mecA gene was detected in 26 (12%) of 221 SIG strains. All tested mecA-negative SIG strains were susceptible to amoxicillin/clavulanic acid and cephalotin. One of the 56 mecA-negative SIG strains was resistant to ciprofloxacin, six (11%) to gentamicin. It was found that sixteen (29%) of 56 mecA-negative SIG strains were resistant to clindamycin. Most of the mecA-positive SIG strains were resistant to ciprofloxacin (96%), clindamycin (96%), and gentamicin (96%). Only one MRSIG strain was resistant to chloramphenicol. All examined mecA-positive SIG strains were found to be susceptible to mupirocin. Our results imply that staphylococcal multidrug resistance has become more prevalent, which could lead to difficulties in effective treatment. With some resistant strains the only therapeutic possibility are antimicrobial agents important in human medicine. New regulations for veterinary medicine concerning appropriate therapy of infections caused by multidrug-resistat staphylococci are needed.
KoralQuery -- A General Corpus Query Protocol
DEFF Research Database (Denmark)
Bingel, Joachim; Diewald, Nils
2015-01-01
. In this paper, we present KoralQuery, a JSON-LD based general corpus query protocol, aiming to be independent of particular QLs, tasks and corpus formats. In addition to describing the system of types and operations that KoralQuery is built on, we exemplify the representation of corpus queries in the serialized...
Proceedings of the ASIS Annual Meeting, 1994
1994-01-01
Includes abstracts of 18 special interest group (SIG) sessions. Highlights include natural language processing, information science and terminology science, classification, knowledge-intensive information systems, information value and ownership issues, economics and theories of information science, information retrieval interfaces, fuzzy thinking…
A Network SIG is born, DECUS (Switzerland) Newsletter, May 1990
Heagerty, Denise
1990-01-01
This article announces the formation of a Swiss DECUS (DEC Users Group) Network SIG in May 1990. The goal of this SIG is to help Swiss DECnet managers to plan transition from their proprietary DECnet Phase IV networks (e.g. the HEP/SPAN DECnet) to open networks based on DECnet Phase V/OSI. The SIG also proposes to address integration with UNIX based workstations using the Internet's TCP/IP protocols.
Approche SIG pour une analyse spatiale des infrastructures ...
African Journals Online (AJOL)
SARAH
31 janv. 2014 ... SIG jouent un rôle primordial dans l'implantation, le suivi et la gestion des infrastructures hydrauliques. L'utilisation de ces outils peut atténuer les difficultés d'approvisionnement en eau. Mots clés : SIG, distribution spatiale, infrastructure hydraulique, Zè, Bénin. ... accès à un dispositif d'assainissement de.
Hvorfor sætter folk ild til sig selv?
DEFF Research Database (Denmark)
Harrebye, Silas
2011-01-01
Mellemøsten brød i brand efter en mand satte ild til sig selv. Et bevis på revolutioner skal selvantændes?......Mellemøsten brød i brand efter en mand satte ild til sig selv. Et bevis på revolutioner skal selvantændes?...
Query Log Analysis of an Electronic Health Record Search Engine
Yang, Lei; Mei, Qiaozhu; Zheng, Kai; Hanauer, David A.
2011-01-01
We analyzed a longitudinal collection of query logs of a full-text search engine designed to facilitate information retrieval in electronic health records (EHR). The collection, 202,905 queries and 35,928 user sessions recorded over a course of 4 years, represents the information-seeking behavior of 533 medical professionals, including frontline practitioners, coding personnel, patient safety officers, and biomedical researchers for patient data stored in EHR systems. In this paper, we present descriptive statistics of the queries, a categorization of information needs manifested through the queries, as well as temporal patterns of the users’ information-seeking behavior. The results suggest that information needs in medical domain are substantially more sophisticated than those that general-purpose web search engines need to accommodate. Therefore, we envision there exists a significant challenge, along with significant opportunities, to provide intelligent query recommendations to facilitate information retrieval in EHR. PMID:22195150
User Oriented Trajectory Search for Trip Recommendation
Ding, Ruogu
2012-07-08
Trajectory sharing and searching have received significant attention in recent years. In this thesis, we propose and investigate the methods to find and recommend the best trajectory to the traveler, and mainly focus on a novel technique named User Oriented Trajectory Search (UOTS) query processing. In contrast to conventional trajectory search by locations (spatial domain only), we consider both spatial and textual domains in the new UOTS query. Given a trajectory data set, the query input contains a set of intended places given by the traveler and a set of textual attributes describing the traveler’s preference. If a trajectory is connecting/close to the specified query locations, and the textual attributes of the trajectory are similar to the traveler’s preference, it will be recommended to the traveler. This type of queries can enable many popular applications such as trip planning and recommendation. There are two challenges in UOTS query processing, (i) how to constrain the searching range in two domains and (ii) how to schedule multiple query sources effectively. To overcome the challenges and answer the UOTS query efficiently, a novel collaborative searching approach is developed. Conceptually, the UOTS query processing is conducted in the spatial and textual domains alternately. A pair of upper and lower bounds are devised to constrain the searching range in two domains. In the meantime, a heuristic searching strategy based on priority ranking is adopted for scheduling the multiple query sources, which can further reduce the searching range and enhance the query efficiency notably. Furthermore, the devised collaborative searching approach can be extended to situations where the query locations are ordered. Extensive experiments are conducted on both real and synthetic trajectory data in road networks. Our approach is verified to be effective in reducing both CPU time and disk I/O time.
Child-Computer Interaction SIG
DEFF Research Database (Denmark)
Hourcade, Juan Pablo; Revelle, Glenda; Zeising, Anja
2016-01-01
This SIG will provide child-computer interaction researchers and practitioners an opportunity to discuss four topics that represent new challenges and opportunities for the community. The four areas are: interactive technologies for children under the age of five, technology for inclusion, privacy...... and information security in the age of the quantified self, and the maker movement....
Sigüenza y el alma del paisaje
Directory of Open Access Journals (Sweden)
Juan Navarro de San Pío
2017-09-01
Full Text Available Este artículo propone un análisis de la estética del paisaje en la obra del escritor Gabriel Miró. Para ello se investiga el significado y valor atribuido al paisaje en la trilogía novelesca (Del vivir, Libro de Sigüenza, Años y Leguas de Sigüenza, héroe modernista de su paisaje literario, así como en el ensayo Sigüenza y el mirador azul. La visión del paisaje en Miró se reconstruye a través del diálogo con diferentes fuentes culturales y filosóficas (especialmente, Giner de los Ríos y Ortega y Gasset. El paisaje mironiano muestra así la evolución desde una inicial actitud panteísta hacia una posterior mirada fenomenológica y hermenéutica.
Grust, Torsten; Scholl, Marc H.
1998-01-01
The construction of a declarative query engine for a DBMS includes the challenge of compiling algebraic queries into efficient execution plans that can be run on top of the persistent storage. This work pursues the goal of employing foldr-build deforestation for the derivation of efficient streaming programs - programs that do not allocate intermediate data structures to perform their task - from algebraic (combinator) query plans. The query engine is based on the insertion representation of ...
Radioimmunoassay and evaluation for serum SIgA and CG of the hepatopathy patients
International Nuclear Information System (INIS)
Liu Junchi; Du Shujun; Lu Jun; Tang Xiaolin; Mao Shaorong
1995-01-01
Serum SIgA and CG of 247 hepatopathy (male 140, female 99) are radioimmunoassayed. The Clinical Significance of SIgA, CG for hepatisms is evaluated. The results show that the assay of SIgA, CG has an important significance for the doctor to choose the treatment plans and determine the prognosis
User oriented trajectory search for trip recommendation
Shang, Shuo
2012-01-01
Trajectory sharing and searching have received significant attentions in recent years. In this paper, we propose and investigate a novel problem called User Oriented Trajectory Search (UOTS) for trip recommendation. In contrast to conventional trajectory search by locations (spatial domain only), we consider both spatial and textual domains in the new UOTS query. Given a trajectory data set, the query input contains a set of intended places given by the traveler and a set of textual attributes describing the traveler\\'s preference. If a trajectory is connecting/close to the specified query locations, and the textual attributes of the trajectory are similar to the traveler\\'e preference, it will be recommended to the traveler for reference. This type of queries can bring significant benefits to travelers in many popular applications such as trip planning and recommendation. There are two challenges in the UOTS problem, (i) how to constrain the searching range in two domains and (ii) how to schedule multiple query sources effectively. To overcome the challenges and answer the UOTS query efficiently, a novel collaborative searching approach is developed. Conceptually, the UOTS query processing is conducted in the spatial and textual domains alternately. A pair of upper and lower bounds are devised to constrain the searching range in two domains. In the meantime, a heuristic searching strategy based on priority ranking is adopted for scheduling the multiple query sources, which can further reduce the searching range and enhance the query efficiency notably. Furthermore, the devised collaborative searching approach can be extended to situations where the query locations are ordered. The performance of the proposed UOTS query is verified by extensive experiments based on real and synthetic trajectory data in road networks. © 2012 ACM.
… vandt sig Danmark al … - hvad mente Harald?
DEFF Research Database (Denmark)
Roesdahl, Else
2013-01-01
Diskussion af meningen med sætningen 'vandt sig Danmark al' på den store Jellingsten. Ordet 'vandt' tolkes i retning af nudansk 'vandt' og ikke som 'samlede', hvilket tit fremføres.......Diskussion af meningen med sætningen 'vandt sig Danmark al' på den store Jellingsten. Ordet 'vandt' tolkes i retning af nudansk 'vandt' og ikke som 'samlede', hvilket tit fremføres....
Processing SPARQL queries with regular expressions in RDF databases
2011-01-01
Background As the Resource Description Framework (RDF) data model is widely used for modeling and sharing a lot of online bioinformatics resources such as Uniprot (dev.isb-sib.ch/projects/uniprot-rdf) or Bio2RDF (bio2rdf.org), SPARQL - a W3C recommendation query for RDF databases - has become an important query language for querying the bioinformatics knowledge bases. Moreover, due to the diversity of users’ requests for extracting information from the RDF data as well as the lack of users’ knowledge about the exact value of each fact in the RDF databases, it is desirable to use the SPARQL query with regular expression patterns for querying the RDF data. To the best of our knowledge, there is currently no work that efficiently supports regular expression processing in SPARQL over RDF databases. Most of the existing techniques for processing regular expressions are designed for querying a text corpus, or only for supporting the matching over the paths in an RDF graph. Results In this paper, we propose a novel framework for supporting regular expression processing in SPARQL query. Our contributions can be summarized as follows. 1) We propose an efficient framework for processing SPARQL queries with regular expression patterns in RDF databases. 2) We propose a cost model in order to adapt the proposed framework in the existing query optimizers. 3) We build a prototype for the proposed framework in C++ and conduct extensive experiments demonstrating the efficiency and effectiveness of our technique. Conclusions Experiments with a full-blown RDF engine show that our framework outperforms the existing ones by up to two orders of magnitude in processing SPARQL queries with regular expression patterns. PMID:21489225
Processing SPARQL queries with regular expressions in RDF databases.
Lee, Jinsoo; Pham, Minh-Duc; Lee, Jihwan; Han, Wook-Shin; Cho, Hune; Yu, Hwanjo; Lee, Jeong-Hoon
2011-03-29
As the Resource Description Framework (RDF) data model is widely used for modeling and sharing a lot of online bioinformatics resources such as Uniprot (dev.isb-sib.ch/projects/uniprot-rdf) or Bio2RDF (bio2rdf.org), SPARQL - a W3C recommendation query for RDF databases - has become an important query language for querying the bioinformatics knowledge bases. Moreover, due to the diversity of users' requests for extracting information from the RDF data as well as the lack of users' knowledge about the exact value of each fact in the RDF databases, it is desirable to use the SPARQL query with regular expression patterns for querying the RDF data. To the best of our knowledge, there is currently no work that efficiently supports regular expression processing in SPARQL over RDF databases. Most of the existing techniques for processing regular expressions are designed for querying a text corpus, or only for supporting the matching over the paths in an RDF graph. In this paper, we propose a novel framework for supporting regular expression processing in SPARQL query. Our contributions can be summarized as follows. 1) We propose an efficient framework for processing SPARQL queries with regular expression patterns in RDF databases. 2) We propose a cost model in order to adapt the proposed framework in the existing query optimizers. 3) We build a prototype for the proposed framework in C++ and conduct extensive experiments demonstrating the efficiency and effectiveness of our technique. Experiments with a full-blown RDF engine show that our framework outperforms the existing ones by up to two orders of magnitude in processing SPARQL queries with regular expression patterns.
Child Computer Interaction SIG
DEFF Research Database (Denmark)
Read, Janet; Hourcade, Juan Pablo; Markopoulos, Panos
a mixture of facilitated creative thinking and a world café approach to bring the community together to tackle these two key challenges. The CCI SIG will be the natural meeting place for members of this community at CHI and will disseminate its discussion to the CCI and CHI communities through...... the production of visual and interactive materials at the CHI conference....
BioSig: the free and open source software library for biomedical signal processing.
Vidaurre, Carmen; Sander, Tilmann H; Schlögl, Alois
2011-01-01
BioSig is an open source software library for biomedical signal processing. The aim of the BioSig project is to foster research in biomedical signal processing by providing free and open source software tools for many different application areas. Some of the areas where BioSig can be employed are neuroinformatics, brain-computer interfaces, neurophysiology, psychology, cardiovascular systems, and sleep research. Moreover, the analysis of biosignals such as the electroencephalogram (EEG), electrocorticogram (ECoG), electrocardiogram (ECG), electrooculogram (EOG), electromyogram (EMG), or respiration signals is a very relevant element of the BioSig project. Specifically, BioSig provides solutions for data acquisition, artifact processing, quality control, feature extraction, classification, modeling, and data visualization, to name a few. In this paper, we highlight several methods to help students and researchers to work more efficiently with biomedical signals.
DEFF Research Database (Denmark)
Beedholm, Kirsten; Frederiksen, Kirsten
1999-01-01
Hvad ville der ske, hvis alle eskimoer begyndte at studere eskimologi på bekostning af at leve som en 'rigtig' eskimo? Det spørger forfatterne om i denne artikel. De underviser begge på sygeplejeskoler og finder det en tankevækkende tendens, at det i sygeplejerskernes grunduddannelse er faget sel......, der studeres frem for det, som faget retter sig imod nemlig patienterne....
Directory of Open Access Journals (Sweden)
Jared D Sharp
Full Text Available Expression of SigH, one of 12 Mycobacterium tuberculosis alternative sigma factors, is induced by heat, oxidative and nitric oxide stresses. SigH activation has been shown to increase expression of several genes, including genes involved in maintaining redox equilibrium and in protein degradation. However, few of these are known to be directly regulated by SigH. The goal of this project is to comprehensively define the Mycobacterium tuberculosis genes and operons that are directly controlled by SigH in order to gain insight into the role of SigH in regulating M. tuberculosis physiology. We used ChIP-Seq to identify in vivo SigH binding sites throughout the M. tuberculosis genome, followed by quantification of SigH-dependent expression of genes linked to these sites and identification of SigH-regulated promoters. We identified 69 SigH binding sites, which are located both in intergenic regions and within annotated coding sequences in the annotated M. tuberculosis genome. 41 binding sites were linked to genes that showed greater expression following heat stress in a SigH-dependent manner. We identified several genes not previously known to be regulated by SigH, including genes involved in DNA repair, cysteine biosynthesis, translation, and genes of unknown function. Experimental and computational analysis of SigH-regulated promoter sequences within these binding sites identified strong consensus -35 and -10 promoter sequences, but with tolerance for non-consensus bases at specific positions. This comprehensive identification and validation of SigH-regulated genes demonstrates an extended SigH regulon that controls an unexpectedly broad range of stress response functions.
Sharp, Jared D; Singh, Atul K; Park, Sang Tae; Lyubetskaya, Anna; Peterson, Matthew W; Gomes, Antonio L C; Potluri, Lakshmi-Prasad; Raman, Sahadevan; Galagan, James E; Husson, Robert N
2016-01-01
Expression of SigH, one of 12 Mycobacterium tuberculosis alternative sigma factors, is induced by heat, oxidative and nitric oxide stresses. SigH activation has been shown to increase expression of several genes, including genes involved in maintaining redox equilibrium and in protein degradation. However, few of these are known to be directly regulated by SigH. The goal of this project is to comprehensively define the Mycobacterium tuberculosis genes and operons that are directly controlled by SigH in order to gain insight into the role of SigH in regulating M. tuberculosis physiology. We used ChIP-Seq to identify in vivo SigH binding sites throughout the M. tuberculosis genome, followed by quantification of SigH-dependent expression of genes linked to these sites and identification of SigH-regulated promoters. We identified 69 SigH binding sites, which are located both in intergenic regions and within annotated coding sequences in the annotated M. tuberculosis genome. 41 binding sites were linked to genes that showed greater expression following heat stress in a SigH-dependent manner. We identified several genes not previously known to be regulated by SigH, including genes involved in DNA repair, cysteine biosynthesis, translation, and genes of unknown function. Experimental and computational analysis of SigH-regulated promoter sequences within these binding sites identified strong consensus -35 and -10 promoter sequences, but with tolerance for non-consensus bases at specific positions. This comprehensive identification and validation of SigH-regulated genes demonstrates an extended SigH regulon that controls an unexpectedly broad range of stress response functions.
Processing SPARQL queries with regular expressions in RDF databases
Directory of Open Access Journals (Sweden)
Cho Hune
2011-03-01
Full Text Available Abstract Background As the Resource Description Framework (RDF data model is widely used for modeling and sharing a lot of online bioinformatics resources such as Uniprot (dev.isb-sib.ch/projects/uniprot-rdf or Bio2RDF (bio2rdf.org, SPARQL - a W3C recommendation query for RDF databases - has become an important query language for querying the bioinformatics knowledge bases. Moreover, due to the diversity of users’ requests for extracting information from the RDF data as well as the lack of users’ knowledge about the exact value of each fact in the RDF databases, it is desirable to use the SPARQL query with regular expression patterns for querying the RDF data. To the best of our knowledge, there is currently no work that efficiently supports regular expression processing in SPARQL over RDF databases. Most of the existing techniques for processing regular expressions are designed for querying a text corpus, or only for supporting the matching over the paths in an RDF graph. Results In this paper, we propose a novel framework for supporting regular expression processing in SPARQL query. Our contributions can be summarized as follows. 1 We propose an efficient framework for processing SPARQL queries with regular expression patterns in RDF databases. 2 We propose a cost model in order to adapt the proposed framework in the existing query optimizers. 3 We build a prototype for the proposed framework in C++ and conduct extensive experiments demonstrating the efficiency and effectiveness of our technique. Conclusions Experiments with a full-blown RDF engine show that our framework outperforms the existing ones by up to two orders of magnitude in processing SPARQL queries with regular expression patterns.
Approximate dictionary queries
DEFF Research Database (Denmark)
Brodal, Gerth Stølting; Gasieniec, Leszek
1996-01-01
Given a set of n binary strings of length m each. We consider the problem of answering d-queries. Given a binary query string of length m, a d-query is to report if there exists a string in the set within Hamming distance d of . We present a data structure of size O(nm) supporting 1-queries in ti...
A Query Cache Tool for Optimizing Repeatable and Parallel OLAP Queries
Santos, Ricardo Jorge; Bernardino, Jorge
On-line analytical processing against data warehouse databases is a common form of getting decision making information for almost every business field. Decision support information oftenly concerns periodic values based on regular attributes, such as sales amounts, percentages, most transactioned items, etc. This means that many similar OLAP instructions are periodically repeated, and simultaneously, between the several decision makers. Our Query Cache Tool takes advantage of previously executed queries, storing their results and the current state of the data which was accessed. Future queries only need to execute against the new data, inserted since the queries were last executed, and join these results with the previous ones. This makes query execution much faster, because we only need to process the most recent data. Our tool also minimizes the execution time and resource consumption for similar queries simultaneously executed by different users, putting the most recent ones on hold until the first finish and returns the results for all of them. The stored query results are held until they are considered outdated, then automatically erased. We present an experimental evaluation of our tool using a data warehouse based on a real-world business dataset and use a set of typical decision support queries to discuss the results, showing a very high gain in query execution time.
Structural insights into the regulation of Bacillus subtilis SigW activity by anti-sigma RsiW.
Directory of Open Access Journals (Sweden)
Shankar Raj Devkota
Full Text Available Bacillus subtilis SigW is localized to the cell membrane and is inactivated by the tight interaction with anti-sigma RsiW under normal growth conditions. Whereas SigW is discharged from RsiW binding and thus initiates the transcription of its regulon under diverse stress conditions such as antibiotics and alkaline shock. The release and activation of SigW in response to extracytoplasmic signals is induced by the regulated intramembrane proteolysis of RsiW. As a ZAS (Zinc-containing anti-sigma family protein, RsiW has a CHCC zinc binding motif, which implies that its anti-sigma activity may be regulated by the state of zinc coordination in addition to the proteolytic cleavage of RsiW. To understand the regulation mode of SigW activity by RsiW, we determined the crystal structures of SigW in complex with the cytoplasmic domain of RsiW, and compared the conformation of the CHCC motif in the reduced/zinc binding and the oxidized states. The structures revealed that RsiW inhibits the promoter binding of SigW by interacting with the surface groove of SigW. The interaction between SigW and RsiW is not disrupted by the oxidation of the CHCC motif in RsiW, suggesting that SigW activity might not be regulated by the zinc coordination states of the CHCC motif.
Directory of Open Access Journals (Sweden)
Paweł Łupkowski
2017-05-01
Full Text Available In this article we consider the phenomenon of answering a query with a query. Although such answers are common, no large scale, corpus-based characterization exists, with the exception of clarification requests. After briefly reviewing different theoretical approaches on this subject, we present a corpus study of query responses in the British National Corpus and develop a taxonomy for query responses. We point at a variety of response categories that have not been formalized in previous dialogue work, particularly those relevant to adversarial interaction. We show that different response categories have significantly different rates of subsequent answer provision. We provide a formal analysis of the response categories in the framework of KoS.
Kimura, Akio; Tanaka, Noriko
2018-04-11
The shock index (SI), defined as heart rate (HR) divided by systolic blood pressure (SBP), is reported to be a more sensitive marker of shock than traditional vital signs alone. In previous literature, use of the reverse shock index (rSI), taken as SBP divided by HR, is recommended instead of SI for hospital triage. Among traumatized patients aged > 55 years, SI multiplied by age (SIA) might provide better prediction of early post-injury mortality. Separately, the Glasgow Coma Scale (GCS) score has been shown to be a very strong predictor. When considering these points together, rSI multiplied by GCS score (rSIG) or rSIG divided by age (rSIG/A) could provide even better prediction of in-hospital mortality. This retrospective, multicenter study used data from 168,517 patients registered in the Japan Trauma Data Bank for the period 2006-2015. We calculated areas under receiver operating characteristic curves (AUROCs) to measure the discriminant ability by comparing those of SI (or rSI), SIA, rSIG, and rSIG/A for in-hospital mortality and for 24-h blood transfusion. The highest ROC AUC (AUROC), 0.901(0.894-0.908) for in-hospital mortality in younger patients (aged < 55 years), was seen for rSIG. In older patients (aged ≥ 55 years), the AUROC of rSIG/A, 0.845(0.840-0.850), was highest for in-hospital mortality. However, the difference between rSIG and rSIG/A was slight and did not seem to be clinically important. rSIG also had the highest AUROC of 0.745 (0.741-749) for 24-h blood transfusion. rSIG ((SBP/HR) × GCS score) is easy to calculate without the need for additional information, charts or equipment, and can be a more reliable triage tool for identifying risk levels in trauma patients.
Salivary levels of SIgA and perceived stress among dental students
Directory of Open Access Journals (Sweden)
João Paulo Menck Sangiorgio
2017-12-01
Full Text Available Background: Academic stress may impair mucosal immunity and expose dental students to an increased risk of infections. Objective: to assess stress scores in dental students and their relationship with variation in SIgA levels. Methods: All students (n = 289 were invited to take part of the study, and 207 (71.63% effectively participated, being 152 (73.4% females. At the day of data collection, the students answered The Dental Environmental Stress Questionnaire (DES and unstimulated saliva samples were collected for determination of salivary flow rate and SIgA concentration and secretion rate. Results: Mean DES scores were higher in females (78.97 ± 16.42, but no correlations between the sum of DES scores and salivary parameters were observed (P=0.08. A moderate inverse relationship was observed between SIgA secretion rates and the subscales Academic Performance (P=0.01, Interpersonal relationships (P=0.02 and Difficulties and Insecurities about Professional Future (P=0.05. A weak correlation was found between SIgA concentration and the items Amount of assigned classwork (P=0.02, Lack of confidence in self to be a successful dentist (P=0.01, Lack of time for relaxation (P=0.01, Financial responsibilities (P=0.02 and Personal physical health (P=0.005. Weak correlations between SIgA secretion rates and DES items were also found for Lack of cooperation by patient in their home care (P=0.003, Patients being late or not showing up for their appointments (P=0.02, Lack of self confidence to be a successful dentist (P=0.008, Personal physical health (P=0.019, and others. Conclusion: Different sources of stress were observed among first to fifth year students and some of these stressors may negatively impact on salivary SIgA secretion.
The role of economics in the QUERI program: QUERI Series.
Smith, Mark W; Barnett, Paul G
2008-04-22
The United States (U.S.) Department of Veterans Affairs (VA) Quality Enhancement Research Initiative (QUERI) has implemented economic analyses in single-site and multi-site clinical trials. To date, no one has reviewed whether the QUERI Centers are taking an optimal approach to doing so. Consistent with the continuous learning culture of the QUERI Program, this paper provides such a reflection. We present a case study of QUERI as an example of how economic considerations can and should be integrated into implementation research within both single and multi-site studies. We review theoretical and applied cost research in implementation studies outside and within VA. We also present a critique of the use of economic research within the QUERI program. Economic evaluation is a key element of implementation research. QUERI has contributed many developments in the field of implementation but has only recently begun multi-site implementation trials across multiple regions within the national VA healthcare system. These trials are unusual in their emphasis on developing detailed costs of implementation, as well as in the use of business case analyses (budget impact analyses). Economics appears to play an important role in QUERI implementation studies, only after implementation has reached the stage of multi-site trials. Economic analysis could better inform the choice of which clinical best practices to implement and the choice of implementation interventions to employ. QUERI economics also would benefit from research on costing methods and development of widely accepted international standards for implementation economics.
Kan migration føre noget godt med sig?
DEFF Research Database (Denmark)
Pærregaard, Karsten
2008-01-01
Som regel har den globale migration rod i økonomisk ulighed, social uretfærdighed eller politisk forfølgelse i de lande, hvor migranterne kommer fra og for mange migranter. Mange år efter, at disse har fundet sig til rette og tilpasset sig de samfund, som har migreret til, fortsætter deres families...... hjemlandsbyer i Peru. Artiklen er baseret på et antropologisk forskningsprojekt, som jeg gennemførte fra 2003-2006 blandt peruanere i USA, Spanien, Japan og Argentina. Udgivelsesdato: September...
The role of economics in the QUERI program: QUERI Series
Directory of Open Access Journals (Sweden)
Smith Mark W
2008-04-01
Full Text Available Abstract Background The United States (U.S. Department of Veterans Affairs (VA Quality Enhancement Research Initiative (QUERI has implemented economic analyses in single-site and multi-site clinical trials. To date, no one has reviewed whether the QUERI Centers are taking an optimal approach to doing so. Consistent with the continuous learning culture of the QUERI Program, this paper provides such a reflection. Methods We present a case study of QUERI as an example of how economic considerations can and should be integrated into implementation research within both single and multi-site studies. We review theoretical and applied cost research in implementation studies outside and within VA. We also present a critique of the use of economic research within the QUERI program. Results Economic evaluation is a key element of implementation research. QUERI has contributed many developments in the field of implementation but has only recently begun multi-site implementation trials across multiple regions within the national VA healthcare system. These trials are unusual in their emphasis on developing detailed costs of implementation, as well as in the use of business case analyses (budget impact analyses. Conclusion Economics appears to play an important role in QUERI implementation studies, only after implementation has reached the stage of multi-site trials. Economic analysis could better inform the choice of which clinical best practices to implement and the choice of implementation interventions to employ. QUERI economics also would benefit from research on costing methods and development of widely accepted international standards for implementation economics.
In-context query reformulation for failing SPARQL queries
Viswanathan, Amar; Michaelis, James R.; Cassidy, Taylor; de Mel, Geeth; Hendler, James
2017-05-01
Knowledge bases for decision support systems are growing increasingly complex, through continued advances in data ingest and management approaches. However, humans do not possess the cognitive capabilities to retain a bird's-eyeview of such knowledge bases, and may end up issuing unsatisfiable queries to such systems. This work focuses on the implementation of a query reformulation approach for graph-based knowledge bases, specifically designed to support the Resource Description Framework (RDF). The reformulation approach presented is instance-and schema-aware. Thus, in contrast to relaxation techniques found in the state-of-the-art, the presented approach produces in-context query reformulation.
Tigani, Jordan
2014-01-01
How to effectively use BigQuery, avoid common mistakes, and execute sophisticated queries against large datasets Google BigQuery Analytics is the perfect guide for business and data analysts who want the latest tips on running complex queries and writing code to communicate with the BigQuery API. The book uses real-world examples to demonstrate current best practices and techniques, and also explains and demonstrates streaming ingestion, transformation via Hadoop in Google Compute engine, AppEngine datastore integration, and using GViz with Tableau to generate charts of query results. In addit
SIG aplicado ao Ensino de Geografia
Dornelles, Liane Maria Azevedo; Universidade do Estado do Rio de Janeiro - UERJ
2009-01-01
Este trabalho descreve as atividades desenvolvidas junto à disciplina eletiva SIG aplicado ao ensino de Geografia. Foram elaboradas aplicações ambientais para os Ensinos Fundamental e Médio, com auxílio dos programas VistaSAGA/UFRJ, SISPLAMTE 5as com GIS, SPRING/INPE e MapServer.
Query optimization over crowdsourced data
Park, Hyunjung
2013-08-26
Deco is a comprehensive system for answering declarative queries posed over stored relational data together with data obtained on-demand from the crowd. In this paper we describe Deco\\'s cost-based query optimizer, building on Deco\\'s data model, query language, and query execution engine presented earlier. Deco\\'s objective in query optimization is to find the best query plan to answer a query, in terms of estimated monetary cost. Deco\\'s query semantics and plan execution strategies require several fundamental changes to traditional query optimization. Novel techniques incorporated into Deco\\'s query optimizer include a cost model distinguishing between "free" existing data versus paid new data, a cardinality estimation algorithm coping with changes to the database state during query execution, and a plan enumeration algorithm maximizing reuse of common subplans in a setting that makes reuse challenging. We experimentally evaluate Deco\\'s query optimizer, focusing on the accuracy of cost estimation and the efficiency of plan enumeration.
SigB is a dominant regulator of virulence in Staphylococcus aureus small-colony variants.
Mitchell, Gabriel; Fugère, Alexandre; Pépin Gaudreau, Karine; Brouillette, Eric; Frost, Eric H; Cantin, André M; Malouin, François
2013-01-01
Staphylococcus aureus small-colony variants (SCVs) are persistent pathogenic bacteria characterized by slow growth and, for many of these strains, an increased ability to form biofilms and to persist within host cells. The virulence-associated gene expression profile of SCVs clearly differs from that of prototypical strains and is often influenced by SigB rather than by the agr system. One objective of this work was to confirm the role of SigB in the control of the expression of virulence factors involved in biofilm formation and intracellular persistence of SCVs. This study shows that extracellular proteins are involved in the formation of biofilm by three SCV strains, which, additionally, have a low biofilm-dispersing activity. It was determined that SigB activity modulates biofilm formation by strain SCV CF07-S and is dominant over that of the agr system without being solely responsible for the repression of proteolytic activity. On the other hand, the expression of fnbA and the control of nuclease activity contributed to the SigB-dependent formation of biofilm of this SCV strain. SigB was also required for the replication of CF07-S within epithelial cells and may be involved in the colonization of lungs by SCVs in a mouse infection model. This study methodically investigated SigB activity and associated mechanisms in the various aspects of SCV pathogenesis. Results confirm that SigB activity importantly influences the production of virulence factors, biofilm formation and intracellular persistence for some clinical SCV strains.
Snapshot of SIG: A Look at Four States' Approaches to School Turnaround
Quillin, Jessica
2012-01-01
Thousands of schools across the country are chronically low performing, and they operate within districts and states that are struggling to help them improve. The School Improvement Grants (SIG) program is designed to channel federal funds to states and districts facing the task of turning around struggling schools. SIG, a part of the Elementary…
Querying on Federated Sensor Networks
Directory of Open Access Journals (Sweden)
Zuhal Can
2016-09-01
Full Text Available A Federated Sensor Network (FSN is a network of geographically distributed Wireless Sensor Networks (WSNs called islands. For querying on an FSN, we introduce the Layered Federated Sensor Network (L-FSN Protocol. For layered management, L-FSN provides communication among islands by its inter-island querying protocol by which a query packet routing path is determined according to some path selection policies. L-FSN allows autonomous management of each island by island-specific intra-island querying protocols that can be selected according to island properties. We evaluate the applicability of L-FSN and compare the L-FSN protocol with various querying protocols running on the flat federation model. Flat federation is a method to federate islands by running a single querying protocol on an entire FSN without distinguishing communication among and within islands. For flat federation, we select a querying protocol from geometrical, hierarchical cluster-based, hash-based, and tree-based WSN querying protocol categories. We found that a layered federation of islands by L-FSN increases the querying performance with respect to energy-efficiency, query resolving distance, and query resolving latency. Moreover, L-FSN’s flexibility of choosing intra-island querying protocols regarding the island size brings advantages on energy-efficiency and query resolving latency.
Evaluation of the NCPDP Structured and Codified Sig Format for e-prescriptions.
Liu, Hangsheng; Burkhart, Q; Bell, Douglas S
2011-01-01
To evaluate the ability of the structure and code sets specified in the National Council for Prescription Drug Programs Structured and Codified Sig Format to represent ambulatory electronic prescriptions. We parsed the Sig strings from a sample of 20,161 de-identified ambulatory e-prescriptions into variables representing the fields of the Structured and Codified Sig Format. A stratified random sample of these representations was then reviewed by a group of experts. For codified Sig fields, we attempted to map the actual words used by prescribers to the equivalent terms in the designated terminology. Proportion of prescriptions that the Format could fully represent; proportion of terms used that could be mapped to the designated terminology. The fields defined in the Format could fully represent 95% of Sigs (95% CI 93% to 97%), but ambiguities were identified, particularly in representing multiple-step instructions. The terms used by prescribers could be codified for only 60% of dose delivery methods, 84% of dose forms, 82% of vehicles, 95% of routes, 70% of sites, 33% of administration timings, and 93% of indications. The findings are based on a retrospective sample of ambulatory prescriptions derived mostly from primary care physicians. The fields defined in the Format could represent most of the patient instructions in a large prescription sample, but prior to its mandatory adoption, further work is needed to ensure that potential ambiguities are addressed and that a complete set of terms is available for the codified fields.
Regressão linear geograficamente ponderada em ambiente SIG
Directory of Open Access Journals (Sweden)
Luís Eduardo Ximenes Carvalho
2009-10-01
Full Text Available Este artigo aborda considerações teóricas e resultados da implementação em ambiente SIG de um modelo confirmatório de estatística espacial — regressão linear geograficamente ponderada (RGP — não disponível em ambiente livre. Os aspectos teóricos deste modelo local de regressão espacial foram amplamente discutidos em virtude da escassa bibliografia existente. O modelo RGP foi implementado na linguagem de programação GISDK do SIG-T TransCAD, utilizando compreensivamente as ferramentas de manipulação, tratamento georreferenciado dos dados e rotinas de análise espacial disponibilizadas em plataformas SIG. Ao final, espera-se ter desenvolvido, ainda que de maneira parcial, uma importante ferramenta que contribuirá para a compreensão e refinamento da modelagem de fenômenos geográficos tão amplamente analisados em estudos de Planejamento de Transportes.
Collective spatial keyword querying
DEFF Research Database (Denmark)
Cao, Xin; Cong, Gao; Jensen, Christian S.
2011-01-01
With the proliferation of geo-positioning and geo-tagging, spatial web objects that possess both a geographical location and a textual description are gaining in prevalence, and spatial keyword queries that exploit both location and textual description are gaining in prominence. However, the quer......With the proliferation of geo-positioning and geo-tagging, spatial web objects that possess both a geographical location and a textual description are gaining in prevalence, and spatial keyword queries that exploit both location and textual description are gaining in prominence. However......, the queries studied so far generally focus on finding individual objects that each satisfy a query rather than finding groups of objects where the objects in a group collectively satisfy a query. We define the problem of retrieving a group of spatial web objects such that the group's keywords cover the query......'s keywords and such that objects are nearest to the query location and have the lowest inter-object distances. Specifically, we study two variants of this problem, both of which are NP-complete. We devise exact solutions as well as approximate solutions with provable approximation bounds to the problems. We...
Directory of Open Access Journals (Sweden)
Yan Tang
2018-01-01
Full Text Available A business process or workflow is an assembly of tasks that accomplishes a business goal. Business process management is the study of the design, configuration/implementation, enactment and monitoring, analysis, and re-design of workflows. The traditional methodology for the re-design and improvement of workflows relies on the well-known sequence of extract, transform, and load (ETL, data/process warehousing, and online analytical processing (OLAP tools. In this paper, we study the ad hoc queryiny of process enactments for (data-centric business processes, bypassing the traditional methodology for more flexibility in querying. We develop an algebraic query language based on “incident patterns” with four operators inspired from Business Process Model and Notation (BPMN representation, allowing the user to formulate ad hoc queries directly over workflow logs. A formal semantics of this query language, a preliminary query evaluation algorithm, and a group of elementary properties of the operators are provided.
Flanagan, David
2010-01-01
"As someone who uses jQuery on a regular basis, it was surprising to discover how much of the library I'm not using. This book is indispensable for anyone who is serious about using jQuery for non-trivial applications."-- Raffaele Cecco, longtime developer of video games, including Cybernoid, Exolon, and Stormlord jQuery is the "write less, do more" JavaScript library. Its powerful features and ease of use have made it the most popular client-side JavaScript framework for the Web. This book is jQuery's trusty companion: the definitive "read less, learn more" guide to the library. jQuery P
Boduch, Adam
2013-01-01
Filled with a practical collection of recipes, jQuery UI Cookbook is full of clear, step-by-step instructions that will help you harness the powerful UI framework in jQuery. Depending on your needs, you can dip in and out of the Cookbook and its recipes, or follow the book from start to finish.If you are a jQuery UI developer looking to improve your existing applications, extract ideas for your new application, or to better understand the overall widget architecture, then jQuery UI Cookbook is a must-have for you. The reader should at least have a rudimentary understanding of what jQuery UI is
De Rosa, Aurelio
2013-01-01
Filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks. Instant jQuery Selectors follows a simple how-to format with recipes aimed at making you well versed with the wide range of selectors that jQuery has to offer through a myriad of examples.Instant jQuery Selectors is for web developers who want to delve into jQuery from its very starting point: selectors. Even if you're already familiar with the framework and its selectors, you could find several tips and tricks that you aren't aware of, especially about performance and how jQuery ac
SkyQuery - A Prototype Distributed Query and Cross-Matching Web Service for the Virtual Observatory
Thakar, A. R.; Budavari, T.; Malik, T.; Szalay, A. S.; Fekete, G.; Nieto-Santisteban, M.; Haridas, V.; Gray, J.
2002-12-01
We have developed a prototype distributed query and cross-matching service for the VO community, called SkyQuery, which is implemented with hierarchichal Web Services. SkyQuery enables astronomers to run combined queries on existing distributed heterogeneous astronomy archives. SkyQuery provides a simple, user-friendly interface to run distributed queries over the federation of registered astronomical archives in the VO. The SkyQuery client connects to the portal Web Service, which farms the query out to the individual archives, which are also Web Services called SkyNodes. The cross-matching algorithm is run recursively on each SkyNode. Each archive is a relational DBMS with a HTM index for fast spatial lookups. The results of the distributed query are returned as an XML DataSet that is automatically rendered by the client. SkyQuery also returns the image cutout corresponding to the query result. SkyQuery finds not only matches between the various catalogs, but also dropouts - objects that exist in some of the catalogs but not in others. This is often as important as finding matches. We demonstrate the utility of SkyQuery with a brown-dwarf search between SDSS and 2MASS, and a search for radio-quiet quasars in SDSS, 2MASS and FIRST. The importance of a service like SkyQuery for the worldwide astronomical community cannot be overstated: data on the same objects in various archives is mapped in different wavelength ranges and looks very different due to different errors, instrument sensitivities and other peculiarities of each archive. Our cross-matching algorithm preforms a fuzzy spatial join across multiple catalogs. This type of cross-matching is currently often done by eye, one object at a time. A static cross-identification table for a set of archives would become obsolete by the time it was built - the exponential growth of astronomical data means that a dynamic cross-identification mechanism like SkyQuery is the only viable option. SkyQuery was funded by a
A Query Evaluation Approach using Opinions of Turkish Financial Market Professionals
Directory of Open Access Journals (Sweden)
Bora Uğurlu
2015-08-01
Full Text Available People who do not have expertise in the financial area may not see the relationship between the numerical and linguistic data. In our study, a knowledge discovery approach using Turkish natural language processing is recommended in order to respond to meaningful queries and classify them with high accuracy. Query corpus consists of randomly selected unique keywords. Quantitative evaluation is done in order to measure the classification performance. Experimental results indicate that our proposed approach is sufficiently consistent with and able to make categorical classifications correctly. The approach highlights the relationship between numerical and linguistic data obtained from Turkish financial market.
Chlorella intake attenuates reduced salivary SIgA secretion in kendo training camp participants
Directory of Open Access Journals (Sweden)
Otsuki Takeshi
2012-12-01
Full Text Available Abstract Background The green alga Chlorella contains high levels of proteins, vitamins, and minerals. We previously reported that a chlorella-derived multicomponent supplement increased the secretion rate of salivary secretory immunoglobulin A (SIgA in humans. Here, we investigated whether intake of this chlorella-derived supplement attenuated the reduced salivary SIgA secretion rate during a kendo training camp. Methods Ten female kendo athletes participated in inter-university 6-day spring and 4-day summer camps. They were randomized into two groups; one took placebo tablets during the spring camp and chlorella tablets during the summer camp, while the other took chlorella tablets during the spring camp and placebo tablets during the summer camp. Subjects took these tablets starting 4 weeks before the camp until post-camp saliva sampling. Salivary SIgA concentrations were measured by ELISA. Results All subjects participated in nearly all training programs, and body-mass changes and subjective physical well-being scores during the camps were comparable between the groups. However, salivary SIgA secretion rate changes were different between these groups. Salivary SIgA secretion rates decreased during the camp in the placebo group (before vs. second, middle, and final day of camp, and after the camp: 146 ± 89 vs. 87 ± 56, 70 ± 45, 94 ± 58, and 116 ± 71 μg/min, whereas no such decreases were observed in the chlorella group (121 ± 53 vs. 113 ± 68, 98 ± 69,115 ± 80, and 128 ± 59 μg/min. Conclusion Our results suggest that a use of a chlorella-derived dietary supplement attenuates reduced salivary SIgA secretion during a training camp for a competitive sport.
Gicquel, Gwendoline; Bouffartigues, Emeline; Bains, Manjeet; Oxaran, Virginie; Rosay, Thibaut; Lesouhaitier, Olivier; Connil, Nathalie; Bazire, Alexis; Maillot, Olivier; Bénard, Magalie; Cornelis, Pierre; Hancock, Robert E. W.; Dufour, Alain; Feuilloley, Marc G. J.; Orange, Nicole; Déziel, Eric; Chevalier, Sylvie
2013-01-01
SigX, one of the 19 extra-cytoplasmic function sigma factors of P. aeruginosa, was only known to be involved in transcription of the gene encoding the major outer membrane protein OprF. We conducted a comparative transcriptomic study between the wildtype H103 strain and its sigX mutant PAOSX, which revealed a total of 307 differentially expressed genes that differed by more than 2 fold. Most dysregulated genes belonged to six functional classes, including the “chaperones and heat shock proteins”, “antibiotic resistance and susceptibility”, “energy metabolism”, “protein secretion/export apparatus”, and “secreted factors”, and “motility and attachment” classes. In this latter class, the large majority of the affected genes were down-regulated in the sigX mutant. In agreement with the array data, the sigX mutant was shown to demonstrate substantially reduced motility, attachment to biotic and abiotic surfaces, and biofilm formation. In addition, virulence towards the nematode Caenorhabditis elegans was reduced in the sigX mutant, suggesting that SigX is involved in virulence-related phenotypes. PMID:24260387
Learning semantic query suggestions
Meij, E.; Bron, M.; Hollink, L.; Huurnink, B.; de Rijke, M.
2009-01-01
An important application of semantic web technology is recognizing human-defined concepts in text. Query transformation is a strategy often used in search engines to derive queries that are able to return more useful search results than the original query and most popular search engines provide
DEFF Research Database (Denmark)
Yi, Ke; Wang, Lu; Wei, Zhewei
2014-01-01
), of a particular attribute of these records. Aggregation queries are especially useful in business intelligence and data analysis applications where users are interested not in the actual records, but some statistics of them. They can also be executed much more efficiently than reporting queries, by embedding...... returned by reporting queries. In this article, we design indexing techniques that allow for extracting a statistical summary of all the records in the query. The summaries we support include frequent items, quantiles, and various sketches, all of which are of central importance in massive data analysis....... Our indexes require linear space and extract a summary with the optimal or near-optimal query cost. We illustrate the efficiency and usefulness of our designs through extensive experiments and a system demonstration....
Mand falder - og rejser sig igen
DEFF Research Database (Denmark)
Evron, Lotte Orr
2016-01-01
Mand falder - og rejser sig igen Anmeldelse af Lotte Evron 6 stjerner Gennem et år følger Anne Wivel med kamera og samtaler sin mangeårige ven, Per Kirkebys, vej mod en ny hverdag efter et fald. Med venskabet som ramme opbygges et usædvanligt hudløst og respektfuldt portræt af Per Kirkeby – både...
International Nuclear Information System (INIS)
Kuznetsov, Valentin; Riley, Daniel; Afaq, Anzar; Sekhri, Vijay; Guo Yuyi; Lueking, Lee
2010-01-01
The CMS experiment has implemented a flexible and powerful system enabling users to find data within the CMS physics data catalog. The Dataset Bookkeeping Service (DBS) comprises a database and the services used to store and access metadata related to CMS physics data. To this, we have added a generalized query system in addition to the existing web and programmatic interfaces to the DBS. This query system is based on a query language that hides the complexity of the underlying database structure by discovering the join conditions between database tables. This provides a way of querying the system that is simple and straightforward for CMS data managers and physicists to use without requiring knowledge of the database tables or keys. The DBS Query Language uses the ANTLR tool to build the input query parser and tokenizer, followed by a query builder that uses a graph representation of the DBS schema to construct the SQL query sent to underlying database. We will describe the design of the query system, provide details of the language components and overview of how this component fits into the overall data discovery system architecture.
Lambert, Chip
2015-01-01
You've started down the path of jQuery Mobile, now begin mastering some of jQuery Mobile's higher level topics. Go beyond jQuery Mobile's documentation and master one of the hottest mobile technologies out there. Previous JavaScript and PHP experience can help you get the most out of this book.
CUFID-query: accurate network querying through random walk based network flow estimation.
Jeong, Hyundoo; Qian, Xiaoning; Yoon, Byung-Jun
2017-12-28
Functional modules in biological networks consist of numerous biomolecules and their complicated interactions. Recent studies have shown that biomolecules in a functional module tend to have similar interaction patterns and that such modules are often conserved across biological networks of different species. As a result, such conserved functional modules can be identified through comparative analysis of biological networks. In this work, we propose a novel network querying algorithm based on the CUFID (Comparative network analysis Using the steady-state network Flow to IDentify orthologous proteins) framework combined with an efficient seed-and-extension approach. The proposed algorithm, CUFID-query, can accurately detect conserved functional modules as small subnetworks in the target network that are expected to perform similar functions to the given query functional module. The CUFID framework was recently developed for probabilistic pairwise global comparison of biological networks, and it has been applied to pairwise global network alignment, where the framework was shown to yield accurate network alignment results. In the proposed CUFID-query algorithm, we adopt the CUFID framework and extend it for local network alignment, specifically to solve network querying problems. First, in the seed selection phase, the proposed method utilizes the CUFID framework to compare the query and the target networks and to predict the probabilistic node-to-node correspondence between the networks. Next, the algorithm selects and greedily extends the seed in the target network by iteratively adding nodes that have frequent interactions with other nodes in the seed network, in a way that the conductance of the extended network is maximally reduced. Finally, CUFID-query removes irrelevant nodes from the querying results based on the personalized PageRank vector for the induced network that includes the fully extended network and its neighboring nodes. Through extensive
Betaler det sig? Fleksibilitet og løn i den danske flexicurity-model
DEFF Research Database (Denmark)
Ibsen, Flemming
2007-01-01
Indkomstsikkerhed er at bevæge sig fra et job til at andet og få en højere løn, men er det tilfældet i dan danske flexicurity-model, betaler det sig at være fleksibel? Belønnes numerisk ekstern fleksibiblitet højrere end intern funktionel fleksibilbitet? Artiklen viser, at det bedst kan betale si...
Directory of Open Access Journals (Sweden)
Alfredo Ramón Morte
1998-01-01
Full Text Available SIG-UA es un proyecto de investigación de Geografía aplicada destinado a la elabo- ración y puesta en explotación de un sistema de información geográfica (SIG para la gestión de espacios e infraestructuras de la Universidad de Alicante. El objetivo es demostrar cómo la contemplación gráfica de la información contenida en bases de datos, es decir, su representación cartográfica, es el medio más eficaz para comprender y utilizar estrategias de actuación territorial, ventajas que se ven potenciadas al ser u sadas junto a otra forma muy sugerente de consulta de datos, los protocolos de transferencia de hipertextos en una red local de tipo corporativo. En apretada síntesis, los servidores WEB de aplicaciones SIG pueden estar llamados a convertirse en una extraordinaria opción para el control adecuado de los activos inmuebles, tanto en empresas privadas como en organismos públicos.
2010-01-01
jQuery simplifies building rich, interactive web frontends. Getting started with this JavaScript library is easy, but it can take years to fully realize its breadth and depth; this cookbook shortens the learning curve considerably. With these recipes, you'll learn patterns and practices from 19 leading developers who use jQuery for everything from integrating simple components into websites and applications to developing complex, high-performance user interfaces. Ideal for newcomers and JavaScript veterans alike, jQuery Cookbook starts with the basics and then moves to practical use cases w
libLocation: acceso a dispositivos de localización para gvSIG Desktop y Mobile
Jordán Aldasorro, Juan G.; Planells Jiménez, Manuel
2009-01-01
Inicialmente integrada en el piloto de gvSIG Mobile, la librería libLocation tiene como objetivo dotar a los proyectos gvSIG Desktop y gvSIG Mobile un acceso transparente a fuentes de localización. La librería se fundamenta en las especificaciones JSR-179 -API de localización para J2ME- y JSR-293 -API de localización para J2ME v2.0-, proporcionando una interfaz uniforme a diferentes fuentes de localización, mediante funciones de alto nivel. Asimismo, se extiende la funcionalida...
User perspectives on query difficulty
DEFF Research Database (Denmark)
Lioma, Christina; Larsen, Birger; Schütze, Hinrich
2011-01-01
be difficult for the system to address? (2) Are users aware of specific features in their query (e.g., domain-specificity, vagueness) that may render their query difficult for an IR system to address? A study of 420 queries from a Web search engine query log that are pre-categorised as easy, medium, hard...
SIG y análisis espacial de datos: perspectivas convergentes
Directory of Open Access Journals (Sweden)
Michael F. Goodchild
2005-01-01
Full Text Available En este artículo se identifican algunos de los desarrollos más importantes experimentados por los SIG y el análisis espacial de datos desde los inicios de los 50. Aunque tanto los SIG como el análisis espacial de datos comenzaron como dos áreas de investigación y aplicación más o menos separadas, han crecido unidos estrechamente a lo largo del tiempo. En el trabajo se mantiene que estas dos disciplinas se unen en el terreno de la Ciencia de la Información Geográfica, proporcionando cada una de ellas apoyo o añadiendo valor a la otra. El artículo comienza proporcionando una visión crítica retrospectiva de los desarrollos que han tenido lugar en los últimos cincuenta años. A continuación, se reflexiona acerca de los desafíos actuales y se especula sobre el futuro. Por último se comenta el potencial de convergencia del desarrollo de los SIG y del análisis espacial de datos bajo la rubrica de la Ciencia de la Información Geográfica (o SIGciencia.
SIG y análisis espacial de datos: perspectivas convergentes
Directory of Open Access Journals (Sweden)
Michael F. Goodchild
2005-06-01
Full Text Available En este artículo se identifican algunos de los desarrollos más importantes experimentados por los SIG y el análisis espacial de datos desde los inicios de los 50. Aunque tanto los SIG como el análisis espacial de datos comenzaron como dos áreas de investigación y aplicación más o menos separadas, han crecido unidos estrechamente a lo largo del tiempo. En el trabajo se mantiene que estas dos disciplinas se unen en el terreno de la Ciencia de la Información Geográfica, proporcionando cada una de ellas apoyo o añadiendo valor a la otra. El artículo comienza proporcionando una visión crítica retrospectiva de los desarrollos que han tenido lugar en los últimos cincuenta años. A continuación, se reflexiona acerca de los desafíos actuales y se especula sobre el futuro. Por último se comenta el potencial de convergencia del desarrollo de los SIG y del análisis espacial de datos bajo la rubrica de la Ciencia de la Información Geográfica (o SIGciencia.
Identification and Analysis of Multi-tasking Product Information Search Sessions with Query Logs
Directory of Open Access Journals (Sweden)
Xiang Zhou
2016-09-01
Full Text Available Purpose: This research aims to identify product search tasks in online shopping and analyze the characteristics of consumer multi-tasking search sessions. Design/methodology/approach: The experimental dataset contains 8,949 queries of 582 users from 3,483 search sessions. A sequential comparison of the Jaccard similarity coefficient between two adjacent search queries and hierarchical clustering of queries is used to identify search tasks. Findings: (1 Users issued a similar number of queries (1.43 to 1.47 with similar lengths (7.3-7.6 characters per task in mono-tasking and multi-tasking sessions, and (2 Users spent more time on average in sessions with more tasks, but spent less time for each task when the number of tasks increased in a session. Research limitations: The task identification method that relies only on query terms does not completely reflect the complex nature of consumer shopping behavior. Practical implications: These results provide an exploratory understanding of the relationships among multiple shopping tasks, and can be useful for product recommendation and shopping task prediction. Originality/value: The originality of this research is its use of query clustering with online shopping task identification and analysis, and the analysis of product search session characteristics.
Interface 3D de aplicações SIG como espaços de comunicação
Juliano Schimiguel
2002-01-01
Resumo: Um Sistema de Informação Geográfica (SIG) é um sistema voltado para manipulação, gerenciamento e visualização de dados geo-referenciados. O termo geo-referenciado denota dados que possuem representação em um sistema de coordenadas geográficas. Os SIG permitem a criação de aplicações para domínios específicos, como é o caso de planejamento urbano e ambiental. Uma aplicação envolve dados, algoritmos, funções e visualização (interface de aplicação). Existem duas categorias de SIG: SIG 2D...
Directory of Open Access Journals (Sweden)
Imane El Meouche
Full Text Available Clostridium difficile intestinal disease is mediated largely by the actions of toxins A (TcdA and B (TcdB, whose production occurs after the initial steps of colonization involving different surface or flagellar proteins. In B. subtilis, the sigma factor SigD controls flagellar synthesis, motility, and vegetative autolysins. A homolog of SigD encoding gene is present in the C.difficile 630 genome. We constructed a sigD mutant in C. difficile 630 ∆erm to analyze the regulon of SigD using a global transcriptomic approach. A total of 103 genes were differentially expressed between the wild-type and the sigD mutant, including genes involved in motility, metabolism and regulation. In addition, the sigD mutant displayed decreased expression of genes involved in flagellar biosynthesis, and also of genes encoding TcdA and TcdB as well as TcdR, the positive regulator of the toxins. Genomic analysis and RACE-PCR experiments allowed us to characterize promoter sequences of direct target genes of SigD including tcdR and to identify the SigD consensus. We then established that SigD positively regulates toxin expression via direct control of tcdR transcription. Interestingly, the overexpression of FlgM, a putative anti-SigD factor, inhibited the positive regulation of motility and toxin synthesis by SigD. Thus, SigD appears to be the first positive regulator of the toxin synthesis in C. difficile.
Indrayana, I. N. E.; P, N. M. Wirasyanti D.; Sudiartha, I. KG
2018-01-01
Mobile application allow many users to access data from the application without being limited to space, space and time. Over time the data population of this application will increase. Data access time will cause problems if the data record has reached tens of thousands to millions of records.The objective of this research is to maintain the performance of data execution for large data records. One effort to maintain data access time performance is to apply query optimization method. The optimization used in this research is query heuristic optimization method. The built application is a mobile-based financial application using MySQL database with stored procedure therein. This application is used by more than one business entity in one database, thus enabling rapid data growth. In this stored procedure there is an optimized query using heuristic method. Query optimization is performed on a “Select” query that involves more than one table with multiple clausa. Evaluation is done by calculating the average access time using optimized and unoptimized queries. Access time calculation is also performed on the increase of population data in the database. The evaluation results shown the time of data execution with query heuristic optimization relatively faster than data execution time without using query optimization.
Libby, Alex
2015-01-01
If you are a developer who is already familiar with using jQuery and wants to push your skill set further, then this book is for you. The book assumes an intermediate knowledge level of jQuery, JavaScript, HTML5, and CSS.
Directory of Open Access Journals (Sweden)
Nicolas Rochereau
2013-09-01
Full Text Available Intestinal microfold (M cells possess a high transcytosis capacity and are able to transport a broad range of materials including particulate antigens, soluble macromolecules, and pathogens from the intestinal lumen to inductive sites of the mucosal immune system. M cells are also the primary pathway for delivery of secretory IgA (SIgA to the gut-associated lymphoid tissue. However, although the consequences of SIgA uptake by M cells are now well known and described, the mechanisms whereby SIgA is selectively bound and taken up remain poorly understood. Here we first demonstrate that both the Cα1 region and glycosylation, more particularly sialic acid residues, are involved in M cell-mediated reverse transcytosis. Second, we found that SIgA is taken up by M cells via the Dectin-1 receptor, with the possible involvement of Siglec-5 acting as a co-receptor. Third, we establish that transcytosed SIgA is taken up by mucosal CX3CR1⁺ dendritic cells (DCs via the DC-SIGN receptor. Fourth, we show that mucosal and systemic antibody responses against the HIV p24-SIgA complexes administered orally is strictly dependent on the expression of Dectin-1. Having deciphered the mechanisms leading to specific targeting of SIgA-based Ag complexes paves the way to the use of such a vehicle for mucosal vaccination against various infectious diseases.
2012-04-23
... DEPARTMENT OF ENERGY Federal Energy Regulatory Commission [Docket No. EL12-55-000] SIG Energy, LLLP v. California Independent System Operator Corporation; Notice of Complaint Take notice that on.... 824(e) and 825(e), SIG Energy, LLLP (Complainant) filed a formal complaint against the California...
TRPV1 channels and the progesterone receptor Sig-1R interact to regulate pain.
Ortíz-Rentería, Miguel; Juárez-Contreras, Rebeca; González-Ramírez, Ricardo; Islas, León D; Sierra-Ramírez, Félix; Llorente, Itzel; Simon, Sidney A; Hiriart, Marcia; Rosenbaum, Tamara; Morales-Lázaro, Sara L
2018-02-13
The Transient Receptor Potential Vanilloid 1 (TRPV1) ion channel is expressed in nociceptors where, when activated by chemical or thermal stimuli, it functions as an important transducer of painful and itch-related stimuli. Although the interaction of TRPV1 with proteins that regulate its function has been previously explored, their modulation by chaperones has not been elucidated, as is the case for other mammalian TRP channels. Here we show that TRPV1 physically interacts with the Sigma 1 Receptor (Sig-1R), a chaperone that binds progesterone, an antagonist of Sig-1R and an important neurosteroid associated to the modulation of pain. Antagonism of Sig-1R by progesterone results in the down-regulation of TRPV1 expression in the plasma membrane of sensory neurons and, consequently, a decrease in capsaicin-induced nociceptive responses. This is observed both in males treated with a synthetic antagonist of Sig-1R and in pregnant females where progesterone levels are elevated. This constitutes a previously undescribed mechanism by which TRPV1-dependent nociception and pain can be regulated.
Beighley, Lynn
2010-01-01
Learn how jQuery can make your Web page or blog stand out from the crowd!. jQuery is free, open source software that allows you to extend and customize Joomla!, Drupal, AJAX, and WordPress via plug-ins. Assuming no previous programming experience, Lynn Beighley takes you through the basics of jQuery from the very start. You'll discover how the jQuery library separates itself from other JavaScript libraries through its ease of use, compactness, and friendliness if you're a beginner programmer. Written in the easy-to-understand style of the For Dummies brand, this book demonstrates how you can a
York, Richard
2015-01-01
Newly revised and updated resource on jQuery's many features and advantages Web Development with jQuery offers a major update to the popular Beginning JavaScript and CSS Development with jQuery from 2009. More than half of the content is new or updated, and reflects recent innovations with regard to mobile applications, jQuery mobile, and the spectrum of associated plugins. Readers can expect thorough revisions with expanded coverage of events, CSS, AJAX, animation, and drag and drop. New chapters bring developers up to date on popular features like jQuery UI, navigation, tables, interacti
DEFF Research Database (Denmark)
Toman, David; Bowman, Ivan Thomas
2003-01-01
Recent research in the area of temporal databases has proposed a number of query languages that vary in their expressive power and the semantics they provide to users. These query languages represent a spectrum of solutions to the tension between clean semantics and efficient evaluation. Often, t...
An Efficient and Privacy-Preserving Multiuser Cloud-Based LBS Query Scheme
Directory of Open Access Journals (Sweden)
Lu Ou
2018-01-01
Full Text Available Location-based services (LBSs are increasingly popular in today’s society. People reveal their location information to LBS providers to obtain personalized services such as map directions, restaurant recommendations, and taxi reservations. Usually, LBS providers offer user privacy protection statement to assure users that their private location information would not be given away. However, many LBSs run on third-party cloud infrastructures. It is challenging to guarantee user location privacy against curious cloud operators while still permitting users to query their own location information data. In this paper, we propose an efficient privacy-preserving cloud-based LBS query scheme for the multiuser setting. We encrypt LBS data and LBS queries with a hybrid encryption mechanism, which can efficiently implement privacy-preserving search over encrypted LBS data and is very suitable for the multiuser setting with secure and effective user enrollment and user revocation. This paper contains security analysis and performance experiments to demonstrate the privacy-preserving properties and efficiency of our proposed scheme.
The response of Bacillus licheniformis to heat and ethanol stress and the role of the SigB regulon.
Voigt, Birgit; Schroeter, Rebecca; Jürgen, Britta; Albrecht, Dirk; Evers, Stefan; Bongaerts, Johannes; Maurer, Karl-Heinz; Schweder, Thomas; Hecker, Michael
2013-07-01
The heat and ethanol stress response of Bacillus licheniformis DSM13 was analyzed at the transcriptional and/or translational level. During heat shock, regulons known to be heat-induced in Bacillus subtilis 168 are upregulated in B. licheniformis, such as the HrcA, SigB, CtsR, and CssRS regulon. Upregulation of the SigY regulon and of genes controlled by other extracytoplasmic function (ECF) sigma factors indicates a cell-wall stress triggered by the heat shock. Furthermore, tryptophan synthesis enzymes were upregulated in heat stressed cells as well as regulons involved in usage of alternative carbon and nitrogen sources. Ethanol stress led to an induction of the SigB, HrcA, and CtsR regulons. As indicated by the upregulation of a SigM-dependent protein, ethanol also triggered a cell wall stress. To characterize the SigB regulon of B. licheniformis, we analyzed the heat stress response of a sigB mutant. It is shown that the B. licheniformis SigB regulon comprises additional genes, some of which do not exist in B. subtilis, such as BLi03885, encoding a hypothetical protein, the Na/solute symporter gene BLi02212, the arginase homolog-encoding gene BLi00198 and mcrA, encoding a protein with endonuclease activity. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Educational Researcher, 2011
2011-01-01
A recent moratorium has temporarily halted the creation of new Special Interest Groups (SIGs) in the American Educational Research Association (AERA). The AERA SIG Executive Committee, the official governance body that oversees approximately 160 SIGs, requested this moratorium, which was subsequently approved by AERA Council. The purpose of the…
213 SIG et distribution spatiale des infrastructures hydrauliques ...
African Journals Online (AJOL)
CARMELLE
Une frange importante de la population de cette commune continue de souffrir de cette .... présente sous quatre ensembles à savoir : les forêts claires et formations boisées, les formations ..... barrage Hachef (Maroc) par télédétection et SIG".
Reid, Jon
2011-01-01
Native apps have distinct advantages, but the future belongs to mobile web apps that function on a broad range of smartphones and tablets. Get started with jQuery Mobile, the touch-optimized framework for creating apps that look and behave consistently across many devices. This concise book provides HTML5, CSS3, and JavaScript code examples, screen shots, and step-by-step guidance to help you build a complete working app with jQuery Mobile. If you're already familiar with the jQuery JavaScript library, you can use your existing skills to build cross-platform mobile web apps right now. This b
« Éléments pour un état des lieux sur les SIG et SIG-P »
International Development Research Centre (IDRC) Digital Library (Canada)
L'objectif général de ce programme panafricain est de contribuer à rendre disponibles des systèmes d'information de bonne qualité, fiables et accessibles grâce à l'usage de SIG-P en vue d'améliorer la gestion des ressources naturelles (eau, terres, forêts, etc.) et de promouvoir la sécurité alimentaire. Le programme ...
Woo, Hyekyung; Cho, Youngtae; Shim, Eunyoung; Lee, Jong-Koo; Lee, Chang-Gun; Kim, Seong Hwan
2016-07-04
As suggested as early as in 2006, logs of queries submitted to search engines seeking information could be a source for detection of emerging influenza epidemics if changes in the volume of search queries are monitored (infodemiology). However, selecting queries that are most likely to be associated with influenza epidemics is a particular challenge when it comes to generating better predictions. In this study, we describe a methodological extension for detecting influenza outbreaks using search query data; we provide a new approach for query selection through the exploration of contextual information gleaned from social media data. Additionally, we evaluate whether it is possible to use these queries for monitoring and predicting influenza epidemics in South Korea. Our study was based on freely available weekly influenza incidence data and query data originating from the search engine on the Korean website Daum between April 3, 2011 and April 5, 2014. To select queries related to influenza epidemics, several approaches were applied: (1) exploring influenza-related words in social media data, (2) identifying the chief concerns related to influenza, and (3) using Web query recommendations. Optimal feature selection by least absolute shrinkage and selection operator (Lasso) and support vector machine for regression (SVR) were used to construct a model predicting influenza epidemics. In total, 146 queries related to influenza were generated through our initial query selection approach. A considerable proportion of optimal features for final models were derived from queries with reference to the social media data. The SVR model performed well: the prediction values were highly correlated with the recent observed influenza-like illness (r=.956; Psearch queries to enhance influenza surveillance in South Korea. In addition, an approach for query selection using social media data seems ideal for supporting influenza surveillance based on search query data.
Directory of Open Access Journals (Sweden)
Barchinger Sarah E
2012-08-01
Full Text Available Abstract Background The cell envelope of a bacterial pathogen can be damaged by harsh conditions in the environment outside a host and by immune factors during infection. Cell envelope stress responses preserve the integrity of this essential compartment and are often required for virulence. Bordetella species are important respiratory pathogens that possess a large number of putative transcription factors. However, no cell envelope stress responses have been described in these species. Among the putative Bordetella transcription factors are a number of genes belonging to the extracytoplasmic function (ECF group of alternative sigma factors, some of which are known to mediate cell envelope stress responses in other bacteria. Here we investigate the role of one such gene, sigE, in stress survival and pathogenesis of Bordetella bronchiseptica. Results We demonstrate that sigE encodes a functional sigma factor that mediates a cell envelope stress response. Mutants of B. bronchiseptica strain RB50 lacking sigE are more sensitive to high temperature, ethanol, and perturbation of the envelope by SDS-EDTA and certain β-lactam antibiotics. Using a series of immunocompromised mice deficient in different components of the innate and adaptive immune responses, we show that SigE plays an important role in evading the innate immune response during lethal infections of mice lacking B cells and T cells. SigE is not required, however, for colonization of the respiratory tract of immunocompetent mice. The sigE mutant is more efficiently phagocytosed and killed by peripheral blood polymorphonuclear leukocytes (PMNs than RB50, and exhibits decreased cytotoxicity toward macrophages. These altered interactions with phagocytes could contribute to the defects observed during lethal infection. Conclusions Much of the work on transcriptional regulation during infection in B. bronchiseptica has focused on the BvgAS two-component system. This study reveals that the Sig
CHI 2013 Human Work Interaction Design (HWID) SIG
DEFF Research Database (Denmark)
Clemmensen, Torkil; Campos, Pedro F.; Katre, Dinesh S.
2013-01-01
In this SIG we aim to introduce the IFIP 13.6 Human Work Interaction Design (HWID) approach to the CHI audience. The HWID working group aims at establishing relationships between extensive empirical work-domain studies and HCI design. We invite participants from industry and academia with an inte...
Feeney, Morgan A; Chandra, Govind; Findlay, Kim C; Paget, Mark S B; Buttner, Mark J
2017-06-13
The major oxidative stress response in Streptomyces is controlled by the sigma factor SigR and its cognate antisigma factor RsrA, and SigR activity is tightly controlled through multiple mechanisms at both the transcriptional and posttranslational levels. Here we show that sigR has a highly unusual GTC start codon and that this leads to another level of SigR regulation, in which SigR translation is repressed by translation initiation factor 3 (IF3). Changing the GTC to a canonical start codon causes SigR to be overproduced relative to RsrA, resulting in unregulated and constitutive expression of the SigR regulon. Similarly, introducing IF3* mutations that impair its ability to repress SigR translation has the same effect. Thus, the noncanonical GTC sigR start codon and its repression by IF3 are critical for the correct and proper functioning of the oxidative stress regulatory system. sigR and rsrA are cotranscribed and translationally coupled, and it had therefore been assumed that SigR and RsrA are produced in stoichiometric amounts. Here we show that RsrA can be transcribed and translated independently of SigR, present evidence that RsrA is normally produced in excess of SigR, and describe the factors that determine SigR-RsrA stoichiometry. IMPORTANCE In all sigma factor-antisigma factor regulatory switches, the relative abundance of the two proteins is critical to the proper functioning of the system. Many sigma-antisigma operons are cotranscribed and translationally coupled, leading to a generic assumption that the sigma and antisigma factors are produced in a fixed 1:1 ratio. In the case of sigR - rsrA , we show instead that the antisigma factor is produced in excess over the sigma factor, providing a buffer to prevent spurious release of sigma activity. This excess arises in part because sigR has an extremely rare noncanonical GTC start codon, and as a result, SigR translation initiation is repressed by IF3. This finding highlights the potential significance
Incremental Query Rewriting with Resolution
Riazanov, Alexandre; Aragão, Marcelo A. T.
We address the problem of semantic querying of relational databases (RDB) modulo knowledge bases using very expressive knowledge representation formalisms, such as full first-order logic or its various fragments. We propose to use a resolution-based first-order logic (FOL) reasoner for computing schematic answers to deductive queries, with the subsequent translation of these schematic answers to SQL queries which are evaluated using a conventional relational DBMS. We call our method incremental query rewriting, because an original semantic query is rewritten into a (potentially infinite) series of SQL queries. In this chapter, we outline the main idea of our technique - using abstractions of databases and constrained clauses for deriving schematic answers, and provide completeness and soundness proofs to justify the applicability of this technique to the case of resolution for FOL without equality. The proposed method can be directly used with regular RDBs, including legacy databases. Moreover, we propose it as a potential basis for an efficient Web-scale semantic search technology.
Region 7 Significant Ecological Resource Areas (ECO_RES.SIG_REGIONS)
U.S. Environmental Protection Agency — SIG_REGIONS is a boundary layer that displays Region 7's Significant Ecological Resource Areas. This layer represents large areas within which different ecosystem...
Abrahamsen, M.; de Berg, M.T.; Buchin, K.A.; Mehr, M.; Mehrabi, A.D.
2017-01-01
In a geometric k -clustering problem the goal is to partition a set of points in R d into k subsets such that a certain cost function of the clustering is minimized. We present data structures for orthogonal range-clustering queries on a point set S : given a query box Q and an integer k>2 , compute
SPARK: Adapting Keyword Query to Semantic Search
Zhou, Qi; Wang, Chong; Xiong, Miao; Wang, Haofen; Yu, Yong
Semantic search promises to provide more accurate result than present-day keyword search. However, progress with semantic search has been delayed due to the complexity of its query languages. In this paper, we explore a novel approach of adapting keywords to querying the semantic web: the approach automatically translates keyword queries into formal logic queries so that end users can use familiar keywords to perform semantic search. A prototype system named 'SPARK' has been implemented in light of this approach. Given a keyword query, SPARK outputs a ranked list of SPARQL queries as the translation result. The translation in SPARK consists of three major steps: term mapping, query graph construction and query ranking. Specifically, a probabilistic query ranking model is proposed to select the most likely SPARQL query. In the experiment, SPARK achieved an encouraging translation result.
Sigma Factor SigB Is Crucial to Mediate Staphylococcus aureus Adaptation during Chronic Infections.
Directory of Open Access Journals (Sweden)
Lorena Tuchscherr
2015-04-01
Full Text Available Staphylococcus aureus is a major human pathogen that causes a range of infections from acute invasive to chronic and difficult-to-treat. Infection strategies associated with persisting S. aureus infections are bacterial host cell invasion and the bacterial ability to dynamically change phenotypes from the aggressive wild-type to small colony variants (SCVs, which are adapted for intracellular long-term persistence. The underlying mechanisms of the bacterial switching and adaptation mechanisms appear to be very dynamic, but are largely unknown. Here, we analyzed the role and the crosstalk of the global S. aureus regulators agr, sarA and SigB by generating single, double and triple mutants, and testing them with proteome analysis and in different in vitro and in vivo infection models. We were able to demonstrate that SigB is the crucial factor for adaptation in chronic infections. During acute infection, the bacteria require the simultaneous action of the agr and sarA loci to defend against invading immune cells by causing inflammation and cytotoxicity and to escape from phagosomes in their host cells that enable them to settle an infection at high bacterial density. To persist intracellularly the bacteria subsequently need to silence agr and sarA. Indeed agr and sarA deletion mutants expressed a much lower number of virulence factors and could persist at high numbers intracellularly. SigB plays a crucial function to promote bacterial intracellular persistence. In fact, ΔsigB-mutants did not generate SCVs and were completely cleared by the host cells within a few days. In this study we identified SigB as an essential factor that enables the bacteria to switch from the highly aggressive phenotype that settles an acute infection to a silent SCV-phenotype that allows for long-term intracellular persistence. Consequently, the SigB-operon represents a possible target to develop preventive and therapeutic strategies against chronic and therapy
Sigma Factor SigB Is Crucial to Mediate Staphylococcus aureus Adaptation during Chronic Infections.
Tuchscherr, Lorena; Bischoff, Markus; Lattar, Santiago M; Noto Llana, Mariangeles; Pförtner, Henrike; Niemann, Silke; Geraci, Jennifer; Van de Vyver, Hélène; Fraunholz, Martin J; Cheung, Ambrose L; Herrmann, Mathias; Völker, Uwe; Sordelli, Daniel O; Peters, Georg; Löffler, Bettina
2015-04-01
Staphylococcus aureus is a major human pathogen that causes a range of infections from acute invasive to chronic and difficult-to-treat. Infection strategies associated with persisting S. aureus infections are bacterial host cell invasion and the bacterial ability to dynamically change phenotypes from the aggressive wild-type to small colony variants (SCVs), which are adapted for intracellular long-term persistence. The underlying mechanisms of the bacterial switching and adaptation mechanisms appear to be very dynamic, but are largely unknown. Here, we analyzed the role and the crosstalk of the global S. aureus regulators agr, sarA and SigB by generating single, double and triple mutants, and testing them with proteome analysis and in different in vitro and in vivo infection models. We were able to demonstrate that SigB is the crucial factor for adaptation in chronic infections. During acute infection, the bacteria require the simultaneous action of the agr and sarA loci to defend against invading immune cells by causing inflammation and cytotoxicity and to escape from phagosomes in their host cells that enable them to settle an infection at high bacterial density. To persist intracellularly the bacteria subsequently need to silence agr and sarA. Indeed agr and sarA deletion mutants expressed a much lower number of virulence factors and could persist at high numbers intracellularly. SigB plays a crucial function to promote bacterial intracellular persistence. In fact, ΔsigB-mutants did not generate SCVs and were completely cleared by the host cells within a few days. In this study we identified SigB as an essential factor that enables the bacteria to switch from the highly aggressive phenotype that settles an acute infection to a silent SCV-phenotype that allows for long-term intracellular persistence. Consequently, the SigB-operon represents a possible target to develop preventive and therapeutic strategies against chronic and therapy-refractory infections.
Effective Filtering of Query Results on Updated User Behavioral Profiles in Web Mining
Directory of Open Access Journals (Sweden)
S. Sadesh
2015-01-01
Full Text Available Web with tremendous volume of information retrieves result for user related queries. With the rapid growth of web page recommendation, results retrieved based on data mining techniques did not offer higher performance filtering rate because relationships between user profile and queries were not analyzed in an extensive manner. At the same time, existing user profile based prediction in web data mining is not exhaustive in producing personalized result rate. To improve the query result rate on dynamics of user behavior over time, Hamilton Filtered Regime Switching User Query Probability (HFRS-UQP framework is proposed. HFRS-UQP framework is split into two processes, where filtering and switching are carried out. The data mining based filtering in our research work uses the Hamilton Filtering framework to filter user result based on personalized information on automatic updated profiles through search engine. Maximized result is fetched, that is, filtered out with respect to user behavior profiles. The switching performs accurate filtering updated profiles using regime switching. The updating in profile change (i.e., switches regime in HFRS-UQP framework identifies the second- and higher-order association of query result on the updated profiles. Experiment is conducted on factors such as personalized information search retrieval rate, filtering efficiency, and precision ratio.
Querying and Mining Strings Made Easy
Sahli, Majed
2017-10-13
With the advent of large string datasets in several scientific and business applications, there is a growing need to perform ad-hoc analysis on strings. Currently, strings are stored, managed, and queried using procedural codes. This limits users to certain operations supported by existing procedural applications and requires manual query planning with limited tuning opportunities. This paper presents StarQL, a generic and declarative query language for strings. StarQL is based on a native string data model that allows StarQL to support a large variety of string operations and provide semantic-based query optimization. String analytic queries are too intricate to be solved on one machine. Therefore, we propose a scalable and efficient data structure that allows StarQL implementations to handle large sets of strings and utilize large computing infrastructures. Our evaluation shows that StarQL is able to express workloads of application-specific tools, such as BLAST and KAT in bioinformatics, and to mine Wikipedia text for interesting patterns using declarative queries. Furthermore, the StarQL query optimizer shows an order of magnitude reduction in query execution time.
Secure Skyline Queries on Cloud Platform.
Liu, Jinfei; Yang, Juncheng; Xiong, Li; Pei, Jian
2017-04-01
Outsourcing data and computation to cloud server provides a cost-effective way to support large scale data storage and query processing. However, due to security and privacy concerns, sensitive data (e.g., medical records) need to be protected from the cloud server and other unauthorized users. One approach is to outsource encrypted data to the cloud server and have the cloud server perform query processing on the encrypted data only. It remains a challenging task to support various queries over encrypted data in a secure and efficient way such that the cloud server does not gain any knowledge about the data, query, and query result. In this paper, we study the problem of secure skyline queries over encrypted data. The skyline query is particularly important for multi-criteria decision making but also presents significant challenges due to its complex computations. We propose a fully secure skyline query protocol on data encrypted using semantically-secure encryption. As a key subroutine, we present a new secure dominance protocol, which can be also used as a building block for other queries. Finally, we provide both serial and parallelized implementations and empirically study the protocols in terms of efficiency and scalability under different parameter settings, verifying the feasibility of our proposed solutions.
Smollett, Katherine L.; Dawson, Lisa F.; Davis, Elaine O.
2010-01-01
Expression of the Mycobacterium tuberculosis sigG sigma factor was induced by a variety of DNA-damaging agents, but inactivation of sigG did not affect induction of gene expression or bacterial survival under these conditions. Therefore, SigG does not control the DNA repair response of M. tuberculosis H37Rv.
Schuers, Matthieu; Joulakian, Mher; Kerdelhué, Gaetan; Segas, Léa; Grosjean, Julien; Darmoni, Stéfan J; Griffon, Nicolas
2017-07-03
MEDLINE is the most widely used medical bibliographic database in the world. Most of its citations are in English and this can be an obstacle for some researchers to access the information the database contains. We created a multilingual query builder to facilitate access to the PubMed subset using a language other than English. The aim of our study was to assess the impact of this multilingual query builder on the quality of PubMed queries for non-native English speaking physicians and medical researchers. A randomised controlled study was conducted among French speaking general practice residents. We designed a multi-lingual query builder to facilitate information retrieval, based on available MeSH translations and providing users with both an interface and a controlled vocabulary in their own language. Participating residents were randomly allocated either the French or the English version of the query builder. They were asked to translate 12 short medical questions into MeSH queries. The main outcome was the quality of the query. Two librarians blind to the arm independently evaluated each query, using a modified published classification that differentiated eight types of errors. Twenty residents used the French version of the query builder and 22 used the English version. 492 queries were analysed. There were significantly more perfect queries in the French group vs. the English group (respectively 37.9% vs. 17.9%; p PubMed queries in particular for researchers whose first language is not English.
Multi-Dimensional Path Queries
DEFF Research Database (Denmark)
Bækgaard, Lars
1998-01-01
to create nested path structures. We present an SQL-like query language that is based on path expressions and we show how to use it to express multi-dimensional path queries that are suited for advanced data analysis in decision support environments like data warehousing environments......We present the path-relationship model that supports multi-dimensional data modeling and querying. A path-relationship database is composed of sets of paths and sets of relationships. A path is a sequence of related elements (atoms, paths, and sets of paths). A relationship is a binary path...
Alabdulmohsin, Ibrahim
2017-01-01
Active learning is a subfield of machine learning that has been successfully used in many applications. One of the main branches of active learning is query synthe- sis, where the learning agent constructs artificial queries from scratch in order
SIG EN LA NUBE: WEBSIG PARA LA ENSEÑANZA DE LA GEOGRAFÍA
Directory of Open Access Journals (Sweden)
Andrew J. Milson
Full Text Available RESUMEN:Para la mayoría de los profesores dedicados a la enseñanza de la Geografía no hay duda de que el SIG es una herramienta importante en el proceso de enseñanza-aprendizaje, pero su uso se ha ido retrasando por problemas tales como el coste del software y la gestión de grandes archivos de datos espaciales. El movimiento hacia la computación en la nube, conocida como la nube de Internet, es una tendencia prometedora para los SIG en la educación. La "nube" se refiere a una red virtual que ofrece a los usuarios acceso a archivos, servicios y aplicaciones. En este artículo se pone de manifiesto que la nube de Internet y el WebSIG tienen un gran potencial para enriquecer la educación geográfica. Se presentan tres experiencias sustentadas en el uso de estas nuevas herramientas en las aulas en los EE.UU. con las conclusiones de carácter didáctico derivadas de cada caso. PALABRAS CLAVE WEBSIG; SIG; enseñanza de la geografía; la nube de Internet; ArcGIS Online; ArcGIS Explorer Desktop (AGX. ABSTRACT There is no doubt among most geography educators that GIS is an important tool for teaching and learning, but its use has been slowed by issues such as the cost of the software and the management of large spatial data files. The move to cloud computingis one trend that is promising for GIS in education. The "cloud" refers to a virtual network that provides many users with access to files, services, and applications. In this article I argue that cloud computing and WebGIS have the potential to transform geography education. I will describe three case studies that make use of these emerging tools in classrooms in the US, and discuss the lessons that we can learn from these cases. KEY WORDS WEBGIS; GIS; cloud computing; ArcGIS Online; ArcGIS Explorer Desktop (AGX. RÉSUMÉ Il n'ya aucun doute parmi les éducateurs les plus géographie que le SIG est un outil important pour l'enseignement et l'apprentissage, mais son utilisation a
Child-Computer Interaction SIG: Ethics and Values
DEFF Research Database (Denmark)
Hourcade, Juan Pablo; Zeising, Anja; Iversen, Ole Sejer
2017-01-01
This SIG will provide child computer interaction researchers and practitioners an opportunity to discuss topics related to ethical challenges in the design, and use of interactive technologies for children. Topics include the role of big data, the impact of technology in children’s social...... and physical ecosystem, and the consideration of ethics in children’s participation in the design of technologies, and in the conceptualization of technologies for children....
Uso di gvSIG e SEXTANTE per la perimetrazione degli ambiti periurbani
Directory of Open Access Journals (Sweden)
Gabriele Nolè
2010-03-01
Full Text Available Use of gvSIG and SEXTANTE for the perimetration of periurban areasThe periurban fringe is the portion of land with characteristics of urbanization that cannot be considered neither urban nor rural. These areas are often characterized by a building expectancy, whose detection requires careful consideration of several territorial and environmental variables. It was implemented using a model of spatial analysis based on kernel Density Estimation (KDE for the detection of periurban areas. The model is tested in the province of Potenza using gvSIG and SEXTANTE on Ubuntu Linux.
Uso di gvSIG e SEXTANTE per la perimetrazione degli ambiti periurbani
Directory of Open Access Journals (Sweden)
Gabriele Nolè
2010-03-01
Full Text Available Use of gvSIG and SEXTANTE for the perimetration of periurban areas The periurban fringe is the portion of land with characteristics of urbanization that cannot be considered neither urban nor rural. These areas are often characterized by a building expectancy, whose detection requires careful consideration of several territorial and environmental variables. It was implemented using a model of spatial analysis based on kernel Density Estimation (KDE for the detection of periurban areas. The model is tested in the province of Potenza using gvSIG and SEXTANTE on Ubuntu Linux.
Truth Space Method for Caching Database Queries
Directory of Open Access Journals (Sweden)
S. V. Mosin
2015-01-01
Full Text Available We propose a new method of client-side data caching for relational databases with a central server and distant clients. Data are loaded into the client cache based on queries executed on the server. Every query has the corresponding DB table – the result of the query execution. These queries have a special form called "universal relational query" based on three fundamental Relational Algebra operations: selection, projection and natural join. We have to mention that such a form is the closest one to the natural language and the majority of database search queries can be expressed in this way. Besides, this form allows us to analyze query correctness by checking lossless join property. A subsequent query may be executed in a client’s local cache if we can determine that the query result is entirely contained in the cache. For this we compare truth spaces of the logical restrictions in a new user’s query and the results of the queries execution in the cache. Such a comparison can be performed analytically , without need in additional Database queries. This method may be used to define lacking data in the cache and execute the query on the server only for these data. To do this the analytical approach is also used, what distinguishes our paper from the existing technologies. We propose four theorems for testing the required conditions. The first and the third theorems conditions allow us to define the existence of required data in cache. The second and the fourth theorems state conditions to execute queries with cache only. The problem of cache data actualizations is not discussed in this paper. However, it can be solved by cataloging queries on the server and their serving by triggers in background mode. The article is published in the author’s wording.
Optimizing Temporal Queries: Efficient Handling of Duplicates
DEFF Research Database (Denmark)
Toman, David; Bowman, Ivan Thomas
2001-01-01
, these query languages are implemented by translating temporal queries into standard relational queries. However, the compiled queries are often quite cumbersome and expensive to execute even using state-of-the- art relational products. This paper presents an optimization technique that produces more efficient...... translated SQL queries by taking into account the properties of the encoding used for temporal attributes. For concreteness, this translation technique is presented in the context of SQL/TP; however, these techniques are also applicable to other temporal query languages....
Griffon, N; Schuers, M; Dhombres, F; Merabti, T; Kerdelhué, G; Rollin, L; Darmoni, S J
2016-08-02
Despite international initiatives like Orphanet, it remains difficult to find up-to-date information about rare diseases. The aim of this study is to propose an exhaustive set of queries for PubMed based on terminological knowledge and to evaluate it versus the queries based on expertise provided by the most frequently used resource in Europe: Orphanet. Four rare disease terminologies (MeSH, OMIM, HPO and HRDO) were manually mapped to each other permitting the automatic creation of expended terminological queries for rare diseases. For 30 rare diseases, 30 citations retrieved by Orphanet expert query and/or query based on terminological knowledge were assessed for relevance by two independent reviewers unaware of the query's origin. An adjudication procedure was used to resolve any discrepancy. Precision, relative recall and F-measure were all computed. For each Orphanet rare disease (n = 8982), there was a corresponding terminological query, in contrast with only 2284 queries provided by Orphanet. Only 553 citations were evaluated due to queries with 0 or only a few hits. There were no significant differences between the Orpha query and terminological query in terms of precision, respectively 0.61 vs 0.52 (p = 0.13). Nevertheless, terminological queries retrieved more citations more often than Orpha queries (0.57 vs. 0.33; p = 0.01). Interestingly, Orpha queries seemed to retrieve older citations than terminological queries (p < 0.0001). The terminological queries proposed in this study are now currently available for all rare diseases. They may be a useful tool for both precision or recall oriented literature search.
Monitoring the ethanol stress response of a sigM deletion strain of B. cereus ATCC 14579.
Voort, van der M.
2008-01-01
Here, the role of σM and its regulon in stress response and survival of B. cereus ATCC 14579 was assessed by comparative transciptome and phenotypic analysis of this strain and its sigM deletion strain. Exposure of B. cereus ATCC 14579 to a wide range of stresses revealed expression of sigM,
Lengstorf, Jason
2010-01-01
This book is for intermediate programmers interested in building AJAX web applications using jQuery and PHP. Along with teaching some advanced PHP techniques, it will teach you how to take your dynamic applications to the next level by adding a JavaScript layer with jQuery. * Learn to utilize built-in PHP functions to build calendar tools.* Learn how jQuery can be used for AJAX, animation, client-side validation, and more.What you'll learn* Use PHP to build a calendar application that allows users to post, view, edit, and delete events.* Use jQuery to allow the calendar app to be viewed and ed
BioSig3D: High Content Screening of Three-Dimensional Cell Culture Models.
Directory of Open Access Journals (Sweden)
Cemal Cagatay Bilgin
Full Text Available BioSig3D is a computational platform for high-content screening of three-dimensional (3D cell culture models that are imaged in full 3D volume. It provides an end-to-end solution for designing high content screening assays, based on colony organization that is derived from segmentation of nuclei in each colony. BioSig3D also enables visualization of raw and processed 3D volumetric data for quality control, and integrates advanced bioinformatics analysis. The system consists of multiple computational and annotation modules that are coupled together with a strong use of controlled vocabularies to reduce ambiguities between different users. It is a web-based system that allows users to: design an experiment by defining experimental variables, upload a large set of volumetric images into the system, analyze and visualize the dataset, and either display computed indices as a heatmap, or phenotypic subtypes for heterogeneity analysis, or download computed indices for statistical analysis or integrative biology. BioSig3D has been used to profile baseline colony formations with two experiments: (i morphogenesis of a panel of human mammary epithelial cell lines (HMEC, and (ii heterogeneity in colony formation using an immortalized non-transformed cell line. These experiments reveal intrinsic growth properties of well-characterized cell lines that are routinely used for biological studies. BioSig3D is being released with seed datasets and video-based documentation.
Dalton, KA; Thibessard, A; Hunter, JI; Kelemen, GH
2007-01-01
Streptomyces coelicolor has nine SigB-like RNA polymerase sigma factors, several of them implicated in morphological differentiation and/or responses to different stresses. One of the nine, SigN, is the focus of this article. A constructed sigN null mutant was delayed in development and exhibited a bald phenotype when grown on minimal medium containing glucose as carbon source. One of two distinct sigN promoters, sigNP1, was active only during growth on solid medium, when its activation coinc...
Tao, Shiqiang; Cui, Licong; Wu, Xi; Zhang, Guo-Qiang
2017-01-01
To help researchers better access clinical data, we developed a prototype query engine called DataSphere for exploring large-scale integrated clinical data repositories. DataSphere expedites data importing using a NoSQL data management system and dynamically renders its user interface for concept-based querying tasks. DataSphere provides an interactive query-building interface together with query translation and optimization strategies, which enable users to build and execute queries effectively and efficiently. We successfully loaded a dataset of one million patients for University of Kentucky (UK) Healthcare into DataSphere with more than 300 million clinical data records. We evaluated DataSphere by comparing it with an instance of i2b2 deployed at UK Healthcare, demonstrating that DataSphere provides enhanced user experience for both query building and execution. PMID:29854239
Tao, Shiqiang; Cui, Licong; Wu, Xi; Zhang, Guo-Qiang
2017-01-01
To help researchers better access clinical data, we developed a prototype query engine called DataSphere for exploring large-scale integrated clinical data repositories. DataSphere expedites data importing using a NoSQL data management system and dynamically renders its user interface for concept-based querying tasks. DataSphere provides an interactive query-building interface together with query translation and optimization strategies, which enable users to build and execute queries effectively and efficiently. We successfully loaded a dataset of one million patients for University of Kentucky (UK) Healthcare into DataSphere with more than 300 million clinical data records. We evaluated DataSphere by comparing it with an instance of i2b2 deployed at UK Healthcare, demonstrating that DataSphere provides enhanced user experience for both query building and execution.
Towards Verbalizing SPARQL Queries in Arabic
Directory of Open Access Journals (Sweden)
I. Al Agha
2016-04-01
Full Text Available With the wide spread of Open Linked Data and Semantic Web technologies, a larger amount of data has been published on the Web in the RDF and OWL formats. This data can be queried using SPARQL, the Semantic Web Query Language. SPARQL cannot be understood by ordinary users and is not directly accessible to humans, and thus they will not be able to check whether the retrieved answers truly correspond to the intended information need. Driven by this challenge, natural language generation from SPARQL data has recently attracted a considerable attention. However, most existing solutions to verbalize SPARQL in natural language focused on English and Latin-based languages. Little effort has been made on the Arabic language which has different characteristics and morphology. This work aims to particularly help Arab users to perceive SPARQL queries on the Semantic Web by translating SPARQL to Arabic. It proposes an approach that gets a SPARQL query as an input and generates a query expressed in Arabic as an output. The translation process combines both morpho-syntactic analysis and language dependencies to generate a legible and understandable Arabic query. The approach was preliminary assessed with a sample query set, and results indicated that 75% of the queries were correctly translated into Arabic.
Directory of Open Access Journals (Sweden)
Keith Al-Hasani
Full Text Available BACKGROUND: We have previously shown that the enterotoxin SigA which resides on the she pathogenicity island (PAI of S. flexneri 2a is an autonomously secreted serine protease capable of degrading casein. We have also demonstrated that SigA is cytopathic for HEp-2 cells and plays a role in the intestinal fluid accumulation associated with S. flexneri infections. METHODS/PRINCIPAL FINDINGS: In this work we show that SigA binds specifically to HEp-2 cells and degrades recombinant human alphaII spectrin (alpha-fodrin in vitro, suggesting that the cytotoxic and enterotoxic effects mediated by SigA are likely associated with the degradation of epithelial fodrin. Consistent with our data, this study also demonstrates that SigA cleaves intracellular fodrin in situ, causing its redistribution within cells. These results strongly implicate SigA in altering the cytoskeleton during the pathogenesis of shigellosis. On the basis of these findings, cleavage of fodrin is a novel mechanism of cellular intoxication for a Shigella toxin. Furthermore, information regarding immunogenicity to SigA in infected patients is lacking. We studied the immune response of SigA from day 28 post-challenge serum of one volunteer from S. flexneri 2a challenge studies. Our results demonstrate that SigA is immunogenic following infection with S. flexneri 2a. CONCLUSIONS: This work shows that SigA binds to epithelial HEp-2 cells as well as being able to induce fodrin degradation in vitro and in situ, further extending its documented role in the pathogenesis of Shigella infections.
A Framework for WWW Query Processing
Wu, Binghui Helen; Wharton, Stephen (Technical Monitor)
2000-01-01
Query processing is the most common operation in a DBMS. Sophisticated query processing has been mainly targeted at a single enterprise environment providing centralized control over data and metadata. Submitting queries by anonymous users on the web is different in such a way that load balancing or DBMS' accessing control becomes the key issue. This paper provides a solution by introducing a framework for WWW query processing. The success of this framework lies in the utilization of query optimization techniques and the ontological approach. This methodology has proved to be cost effective at the NASA Goddard Space Flight Center Distributed Active Archive Center (GDAAC).
Alabdulmohsin, Ibrahim Mansour
2017-05-07
Active learning is a subfield of machine learning that has been successfully used in many applications. One of the main branches of active learning is query synthe- sis, where the learning agent constructs artificial queries from scratch in order to reveal sensitive information about the underlying decision boundary. It has found applications in areas, such as adversarial reverse engineering, automated science, and computational chemistry. Nevertheless, the existing literature on membership query synthesis has, generally, focused on finite concept classes or toy problems, with a limited extension to real-world applications. In this thesis, I develop two spectral algorithms for learning halfspaces via query synthesis. The first algorithm is a maximum-determinant convex optimization method while the second algorithm is a Markovian method that relies on Khachiyan’s classical update formulas for solving linear programs. The general theme of these methods is to construct an ellipsoidal approximation of the version space and to synthesize queries, afterward, via spectral decomposition. Moreover, I also describe how these algorithms can be extended to other settings as well, such as pool-based active learning. Having demonstrated that halfspaces can be learned quite efficiently via query synthesis, the second part of this thesis proposes strategies for mitigating the risk of reverse engineering in adversarial environments. One approach that can be used to render query synthesis algorithms ineffective is to implement a randomized response. In this thesis, I propose a semidefinite program (SDP) for learning a distribution of classifiers, subject to the constraint that any individual classifier picked at random from this distributions provides reliable predictions with a high probability. This algorithm is, then, justified both theoretically and empirically. A second approach is to use a non-parametric classification method, such as similarity-based classification. In this
Libby, Alex
2012-01-01
A practical tutorial with powerful yet simple projects that are quick to implement. This book is aimed at developers who have prior jQuery knowledge, but may not have any prior experience with jQuery Tools. It is possible that they may have started with the basics of jQuery Tools, but want to learn more about how it can be used, as well as get ideas for future projects.
Joint Top-K Spatial Keyword Query Processing
DEFF Research Database (Denmark)
Wu, Dingming; Yiu, Man Lung; Cong, Gao
2012-01-01
Web users and content are increasingly being geopositioned, and increased focus is being given to serving local content in response to web queries. This development calls for spatial keyword queries that take into account both the locations and textual descriptions of content. We study the effici......Web users and content are increasingly being geopositioned, and increased focus is being given to serving local content in response to web queries. This development calls for spatial keyword queries that take into account both the locations and textual descriptions of content. We study...... the efficient, joint processing of multiple top-k spatial keyword queries. Such joint processing is attractive during high query loads and also occurs when multiple queries are used to obfuscate a user's true query. We propose a novel algorithm and index structure for the joint processing of top-k spatial...... keyword queries. Empirical studies show that the proposed solution is efficient on real data sets. We also offer analytical studies on synthetic data sets to demonstrate the efficiency of the proposed solution. Index Terms IEEE Terms Electronic mail , Google , Indexes , Joints , Mobile communication...
Web Page Recommendation Using Web Mining
Modraj Bhavsar; Mrs. P. M. Chavan
2014-01-01
On World Wide Web various kind of content are generated in huge amount, so to give relevant result to user web recommendation become important part of web application. On web different kind of web recommendation are made available to user every day that includes Image, Video, Audio, query suggestion and web page. In this paper we are aiming at providing framework for web page recommendation. 1) First we describe the basics of web mining, types of web mining. 2) Details of each...
Research Issues in Mobile Querying
DEFF Research Database (Denmark)
Breunig, M.; Jensen, Christian Søndergaard; Klein, M.
2004-01-01
This document reports on key aspects of the discussions conducted within the working group. In particular, the document aims to offer a structured and somewhat digested summary of the group's discussions. The document first offers concepts that enable characterization of "mobile queries" as well...... as the types of systems that enable such queries. It explores the notion of context in mobile queries. The document ends with a few observations, mainly regarding challenges....
Automated software system for checking the structure and format of ACM SIG documents
Mirza, Arsalan Rahman; Sah, Melike
2017-04-01
Microsoft (MS) Office Word is one of the most commonly used software tools for creating documents. MS Word 2007 and above uses XML to represent the structure of MS Word documents. Metadata about the documents are automatically created using Office Open XML (OOXML) syntax. We develop a new framework, which is called ADFCS (Automated Document Format Checking System) that takes the advantage of the OOXML metadata, in order to extract semantic information from MS Office Word documents. In particular, we develop a new ontology for Association for Computing Machinery (ACM) Special Interested Group (SIG) documents for representing the structure and format of these documents by using OWL (Web Ontology Language). Then, the metadata is extracted automatically in RDF (Resource Description Framework) according to this ontology using the developed software. Finally, we generate extensive rules in order to infer whether the documents are formatted according to ACM SIG standards. This paper, introduces ACM SIG ontology, metadata extraction process, inference engine, ADFCS online user interface, system evaluation and user study evaluations.
On tractable query evaluation for SPARQL
Mengel, Stefan; Skritek, Sebastian
2017-01-01
Despite much work within the last decade on foundational properties of SPARQL - the standard query language for RDF data - rather little is known about the exact limits of tractability for this language. In particular, this is the case for SPARQL queries that contain the OPTIONAL-operator, even though it is one of the most intensively studied features of SPARQL. The aim of our work is to provide a more thorough picture of tractable classes of SPARQL queries. In general, SPARQL query evaluatio...
Man vs. Machine: Differences in SPARQL Queries
Rietveld, L.; Hoekstra, R.
2014-01-01
Server-side SPARQL query logs have been a topic of study for some time now. The USEWOD collection of query logs is currently the primary source of information for researchers. A recurring problem is that these logs leave application queries and queries created by humans indistinguishable. In this
How Good Are Query Optimizers, Really?
Leis, Viktor; Gubichev, Andrey; Mirchev, Atanas; Boncz, Peter; Kemper, Alfons; Neumann, Thomas
2016-01-01
Finding a good join order is crucial for query performance. In this paper, we introduce the Join Order Benchmark (JOB) and experimentally revisit the main components in the classic query optimizer architecture using a complex, real-world data set and realistic multi-join queries. We investigate the
Bikakis, Nikos; Gioldasis, Nektarios; Tsinaraki, Chrisa; Christodoulakis, Stavros
SPARQL is today the standard access language for Semantic Web data. In the recent years XML databases have also acquired industrial importance due to the widespread applicability of XML in the Web. In this paper we present a framework that bridges the heterogeneity gap and creates an interoperable environment where SPARQL queries are used to access XML databases. Our approach assumes that fairly generic mappings between ontology constructs and XML Schema constructs have been automatically derived or manually specified. The mappings are used to automatically translate SPARQL queries to semantically equivalent XQuery queries which are used to access the XML databases. We present the algorithms and the implementation of SPARQL2XQuery framework, which is used for answering SPARQL queries over XML databases.
U.S. Environmental Protection Agency — The Superfund Query allows users to retrieve data from the Comprehensive Environmental Response, Compensation, and Liability Information System (CERCLIS) database.
Wildsmith-Cromarty, Rosemary
2015-01-01
This report describes ongoing research on reading in African languages. It draws mainly on contributions from two British Association for Applied Linguistics (BAAL) "Language in Africa" (LiA) Special Interest Group (SIG) meetings: the LiA SIG strand at BAAL 2013 and the seminar on "Reading Methodologies in African Languages"…
Optimizing queries in distributed systems
Directory of Open Access Journals (Sweden)
Ion LUNGU
2006-01-01
Full Text Available This research presents the main elements of query optimizations in distributed systems. First, data architecture according with system level architecture in a distributed environment is presented. Then the architecture of a distributed database management system (DDBMS is described on conceptual level followed by the presentation of the distributed query execution steps on these information systems. The research ends with presentation of some aspects of distributed database query optimization and strategies used for that.
Advanced Query Formulation in Deductive Databases.
Niemi, Timo; Jarvelin, Kalervo
1992-01-01
Discusses deductive databases and database management systems (DBMS) and introduces a framework for advanced query formulation for end users. Recursive processing is described, a sample extensional database is presented, query types are explained, and criteria for advanced query formulation from the end user's viewpoint are examined. (31…
Dynamic Planar Range Maxima Queries
DEFF Research Database (Denmark)
Brodal, Gerth Stølting; Tsakalidis, Konstantinos
2011-01-01
We consider the dynamic two-dimensional maxima query problem. Let P be a set of n points in the plane. A point is maximal if it is not dominated by any other point in P. We describe two data structures that support the reporting of the t maximal points that dominate a given query point, and allow...... for insertions and deletions of points in P. In the pointer machine model we present a linear space data structure with O(logn + t) worst case query time and O(logn) worst case update time. This is the first dynamic data structure for the planar maxima dominance query problem that achieves these bounds...... are integers in the range U = {0, …,2 w − 1 }. We present a linear space data structure that supports 3-sided range maxima queries in O(logn/loglogn+t) worst case time and updates in O(logn/loglogn) worst case time. These are the first sublogarithmic worst case bounds for all operations in the RAM model....
Nearest Neighbor Queries in Road Networks
DEFF Research Database (Denmark)
Jensen, Christian Søndergaard; Kolar, Jan; Pedersen, Torben Bach
2003-01-01
in road networks. Such queries may be of use in many services. Specifically, we present an easily implementable data model that serves well as a foundation for such queries. We also present the design of a prototype system that implements the queries based on the data model. The algorithm used...
Kodama, Takeko; Takamatsu, Hiromu; Asai, Kei; Kobayashi, Kazuo; Ogasawara, Naotake; Watabe, Kazuhito
1999-01-01
The expression of 21 novel genes located in the region from dnaA to abrB of the Bacillus subtilis chromosome was analyzed. One of the genes, yaaH, had a predicted promoter sequence conserved among SigE-dependent genes. Northern blot analysis revealed that yaaH mRNA was first detected from 2 h after the cessation of logarithmic growth (T2) of sporulation in wild-type cells and in spoIIIG (SigG−) and spoIVCB (SigK−) mutants but not in spoIIAC (SigF−) and spoIIGAB (SigE−) mutants. The transcript...
Fingerprinting Keywords in Search Queries over Tor
Directory of Open Access Journals (Sweden)
Oh Se Eun
2017-10-01
Full Text Available Search engine queries contain a great deal of private and potentially compromising information about users. One technique to prevent search engines from identifying the source of a query, and Internet service providers (ISPs from identifying the contents of queries is to query the search engine over an anonymous network such as Tor.
Hybrid employment recommendation algorithm based on Spark
Li, Zuoquan; Lin, Yubei; Zhang, Xingming
2017-08-01
Aiming at the real-time application of collaborative filtering employment recommendation algorithm (CF), a clustering collaborative filtering recommendation algorithm (CCF) is developed, which applies hierarchical clustering to CF and narrows the query range of neighbour items. In addition, to solve the cold-start problem of content-based recommendation algorithm (CB), a content-based algorithm with users’ information (CBUI) is introduced for job recommendation. Furthermore, a hybrid recommendation algorithm (HRA) which combines CCF and CBUI algorithms is proposed, and implemented on Spark platform. The experimental results show that HRA can overcome the problems of cold start and data sparsity, and achieve good recommendation accuracy and scalability for employment recommendation.
Adding Query Privacy to Robust DHTs
DEFF Research Database (Denmark)
Backes, Michael; Goldberg, Ian; Kate, Aniket
2011-01-01
intermediate peers that (help to) route the queries towards their destinations. In this paper, we satisfy this requirement by presenting an approach for providing privacy for the keys in DHT queries. We use the concept of oblivious transfer (OT) in communication over DHTs to preserve query privacy without...... of obtaining query privacy over robust DHTs. Finally, we compare the performance of our privacy-preserving protocols with their more privacy-invasive counterparts. We observe that there is no increase in the message complexity and only a small overhead in the computational complexity....
Directory of Open Access Journals (Sweden)
A. Khandelwal
2017-07-01
Full Text Available Generic text-based compression models are simple and fast but there are two issues that needs to be addressed. They cannot leverage the structure that exists in data to achieve better compression and there is an unnecessary decompression step before the user can actually use the data. To address these issues, we came up with GMZ, a lossless compression model aimed at achieving high compression ratios. The decision to design GMZ (Khandelwal and Rajan, 2017 exclusively for GML's Simple Features Profile (SFP seems fair because of the high use of SFP in WFS and that it facilitates high optimisation of the compression model. This is an extension of our work on GMZ. In a typical server-client model such as Web Feature Service, the server is the primary creator and provider of GML, and therefore, requires compression and query capabilities. On the other hand, the client is the primary consumer of GML, and therefore, requires decompression and visualisation capabilities. In the first part of our work, we demonstrated compression using a python script that can be plugged in a server architecture, and decompression and visualisation in a web browser using a Firefox addon. The focus of this work is to develop the already existing tools to provide query capability to server. Our model provides the ability to decompress individual features in isolation, which is an essential requirement for realising query in compressed state. We con - struct an R-Tree index for spatial data and a custom index for non-spatial data and store these in a separate index file to prevent alter - ing the compression model. This facilitates independent use of compressed GMZ file where index can be constructed when required. The focus of this work is the bounding-box or range query commonly used in webGIS with provision for other spatial and non-spatial queries. The decrement in compression ratios due to the new index file is in the range of 1–3 percent which is trivial considering
Ranking Queries on Uncertain Data
Hua, Ming
2011-01-01
Uncertain data is inherent in many important applications, such as environmental surveillance, market analysis, and quantitative economics research. Due to the importance of those applications and rapidly increasing amounts of uncertain data collected and accumulated, analyzing large collections of uncertain data has become an important task. Ranking queries (also known as top-k queries) are often natural and useful in analyzing uncertain data. Ranking Queries on Uncertain Data discusses the motivations/applications, challenging problems, the fundamental principles, and the evaluation algorith
Predecessor queries in dynamic integer sets
DEFF Research Database (Denmark)
Brodal, Gerth Stølting
1997-01-01
We consider the problem of maintaining a set of n integers in the range 0.2w–1 under the operations of insertion, deletion, predecessor queries, minimum queries and maximum queries on a unit cost RAM with word size w bits. Let f (n) be an arbitrary nondecreasing smooth function satisfying n...
Rapport fra projektet ”At skrive sig til læsning”
DEFF Research Database (Denmark)
Labuz, Nadia; Bundsgaard, Jeppe; Kjertmann, Kjeld
Projektet At skrive sig til læsning er et Videnkupon-projekt finansieret af Forsknings- og Innovationsstyrelsen og firmaet Jamus. Projektet havde til formål at bidrage til viden om og kvalificere udviklingen af en app til iPad, som har til formål at støtte børn i deres skrive- og læseudvikling....... Projektet bestod konkret i diskussioner og konsulentbistand under udviklingen af app'en, og i empiriske undersøgelser af den første brug af app'en i to herboende amerikanske børnefamilier. Projektet resulterede i positiv evaluering af app'en og den praksis, der kan udvikle sig omkring brugen af den, og det...... resulterede i en række konkrete forslag til videreudvikling, samt i 10 principper for god brug af app'en....
Flexible Query Answering Systems 2006
DEFF Research Database (Denmark)
-computer interaction. The overall theme of the FQAS conferences is innovative query systems aimed at providing easy, flexible, and intuitive access to information. Such systems are intended to facilitate retrieval from information repositories such as databases, libraries, and the World-Wide Web. These repositories......This volume constitutes the proceedings of the Seventh International Conference on Flexible Query Answering Systems, FQAS 2006, held in Milan, Italy, on June 7--10, 2006. FQAS is the premier conference for researchers and practitioners concerned with the vital task of providing easy, flexible...... are typically equipped with standard query systems which are often inadequate, and the focus of FQAS is the development of query systems that are more expressive, informative, cooperative, and productive. These proceedings contain contributions from invited speakers and 53 original papers out of about 100...
Sistemas integrados de gestión (SIG)
García Pantigozo, Manuel; Quispe Atúncar, Carlos; Ráez Guevara, Luis
2014-01-01
La cultura de la normalización es necesaria para competir globalmente, y para esto la ISO ha elaborado una serie de sistemas de gestión orientados a las calidad, medio ambiente, seguridad en el trabajo y recursos humanos, esto significa la necesidad de integrar sistemas mediante el Sistema integrado de Gestión - SIG The culture of standardization is necessary to compete globally, and to this ISO has developed a number of management systems geared to quality, environment, occupational safe...
SIG XX - A generation of intelligent gamma ray probes
International Nuclear Information System (INIS)
Rusu, Al.; Bartos, D.; Constantin, F.; Caragheopol, Gh.; Cruceru, I.; Lupu, A.; Serbina, L.
2003-01-01
Nowadays, the radioprotection activities are governed by the ALARA principle. To comply with, we have decided to use scintillators, due to their high efficiency. The surface mounted devices allow the design of the entire gross gamma ray measuring system into a volume of about 0.5 liters. The microcontrollers having an EPROM of 4k bytes offer the opportunity to run resident programmes dedicated to: data acquisition, local processing, data transmission, and system supervising. Such an intelligence is embedded into SIG XX probes. By designing an array of such probes, one can easily obtain a portal monitor, an area monitor and so on, each of them under the the control of a PC. A few modifications may transform an intelligent probe into a portable instrument for radioprotection. In such a case, to make the probe shorter, replacing the photomultiplier by a photodiode, is an attractive goal. To reach it, a dedicated charge preamplifier has to be developed. The works and results on SIG XX probes and charge preamplifier are reported. (authors)
SIG XX - a generation of intelligent gamma ray probes
International Nuclear Information System (INIS)
Rusu, Al.; Bartos, D.; Constantin, F.; Caragheopol, Gh; Cruceru, I.; Lupu, A.; Serbina, L.
2005-01-01
Full text: Nowadays, the radioprotection activities are governed by the ALARA principle. To comply with, we have decided to use scintillators, due to their large efficiency. The surface mounted devices allow the design of the entire gross gamma ray measuring system into a volume of about 0.5 liters. The microcontrollers having an EPROM of 4k bytes offer the opportunity to run resident programmes dedicated to data acquisition, local processing, data communication, system supervising. Such an intelligence is embedded into SIG XX probes. By designing an array of such probes, one can easily obtain a portal monitor, an area monitor and so on, each of them under the the control of a PC. A few modifications may transform an intelligent probe into a portable instrument for radioprotection. In such a case, to make the probe shorter, replacing the photomultiplier by a photodiode, is an attractive goal. To reach it, a dedicated charge preamplifier has to be developed. The works and results on SIG XX probes and charge preamplifier are reported. (author)
Spatio-temporal databases complex motion pattern queries
Vieira, Marcos R
2013-01-01
This brief presents several new query processing techniques, called complex motion pattern queries, specifically designed for very large spatio-temporal databases of moving objects. The brief begins with the definition of flexible pattern queries, which are powerful because of the integration of variables and motion patterns. This is followed by a summary of the expressive power of patterns and flexibility of pattern queries. The brief then present the Spatio-Temporal Pattern System (STPS) and density-based pattern queries. STPS databases contain millions of records with information about mobi
CrossQuery: a web tool for easy associative querying of transcriptome data.
Directory of Open Access Journals (Sweden)
Toni U Wagner
Full Text Available Enormous amounts of data are being generated by modern methods such as transcriptome or exome sequencing and microarray profiling. Primary analyses such as quality control, normalization, statistics and mapping are highly complex and need to be performed by specialists. Thereafter, results are handed back to biomedical researchers, who are then confronted with complicated data lists. For rather simple tasks like data filtering, sorting and cross-association there is a need for new tools which can be used by non-specialists. Here, we describe CrossQuery, a web tool that enables straight forward, simple syntax queries to be executed on transcriptome sequencing and microarray datasets. We provide deep-sequencing data sets of stem cell lines derived from the model fish Medaka and microarray data of human endothelial cells. In the example datasets provided, mRNA expression levels, gene, transcript and sample identification numbers, GO-terms and gene descriptions can be freely correlated, filtered and sorted. Queries can be saved for later reuse and results can be exported to standard formats that allow copy-and-paste to all widespread data visualization tools such as Microsoft Excel. CrossQuery enables researchers to quickly and freely work with transcriptome and microarray data sets requiring only minimal computer skills. Furthermore, CrossQuery allows growing association of multiple datasets as long as at least one common point of correlated information, such as transcript identification numbers or GO-terms, is shared between samples. For advanced users, the object-oriented plug-in and event-driven code design of both server-side and client-side scripts allow easy addition of new features, data sources and data types.
CrossQuery: a web tool for easy associative querying of transcriptome data.
Wagner, Toni U; Fischer, Andreas; Thoma, Eva C; Schartl, Manfred
2011-01-01
Enormous amounts of data are being generated by modern methods such as transcriptome or exome sequencing and microarray profiling. Primary analyses such as quality control, normalization, statistics and mapping are highly complex and need to be performed by specialists. Thereafter, results are handed back to biomedical researchers, who are then confronted with complicated data lists. For rather simple tasks like data filtering, sorting and cross-association there is a need for new tools which can be used by non-specialists. Here, we describe CrossQuery, a web tool that enables straight forward, simple syntax queries to be executed on transcriptome sequencing and microarray datasets. We provide deep-sequencing data sets of stem cell lines derived from the model fish Medaka and microarray data of human endothelial cells. In the example datasets provided, mRNA expression levels, gene, transcript and sample identification numbers, GO-terms and gene descriptions can be freely correlated, filtered and sorted. Queries can be saved for later reuse and results can be exported to standard formats that allow copy-and-paste to all widespread data visualization tools such as Microsoft Excel. CrossQuery enables researchers to quickly and freely work with transcriptome and microarray data sets requiring only minimal computer skills. Furthermore, CrossQuery allows growing association of multiple datasets as long as at least one common point of correlated information, such as transcript identification numbers or GO-terms, is shared between samples. For advanced users, the object-oriented plug-in and event-driven code design of both server-side and client-side scripts allow easy addition of new features, data sources and data types.
Multi-Dimensional Top-k Dominating Queries
DEFF Research Database (Denmark)
Yiu, Man Lung; Mamoulis, Nikos
2009-01-01
The top-k dominating query returns k data objects which dominate the highest number of objects in a dataset. This query is an important tool for decision support since it provides data analysts an intuitive way for finding significant objects. In addition, it combines the advantages of top......-k and skyline queries without sharing their disadvantages: (i) the output size can be controlled, (ii) no ranking functions need to be specified by users, and (iii) the result is independent of the scales at different dimensions. Despite their importance, top-k dominating queries have not received adequate...
Query optimization over crowdsourced data
Park, Hyunjung; Widom, Jennifer
2013-01-01
Deco is a comprehensive system for answering declarative queries posed over stored relational data together with data obtained on-demand from the crowd. In this paper we describe Deco's cost-based query optimizer, building on Deco's data model
Physiological roles of sigma factor SigD in Corynebacterium glutamicum
Czech Academy of Sciences Publication Activity Database
Taniguchi, H.; Busche, T.; Patschkowski, T.; Niehaus, K.; Pátek, Miroslav; Kalinowski, J.; Wendisch, V.F.
2017-01-01
Roč. 17, č. 158 (2017), s. 158 ISSN 1471-2180 R&D Projects: GA ČR(CZ) GA17-06991S Institutional support: RVO:61388971 Keywords : Corynebacterium glutamicum * Sigma factor * SigD Subject RIV: EE - Microbiology, Virology OBOR OECD: Microbiology Impact factor: 2.644, year: 2016
Query Optimizations over Decentralized RDF Graphs
Abdelaziz, Ibrahim
2017-05-18
Applications in life sciences, decentralized social networks, Internet of Things, and statistical linked dataspaces integrate data from multiple decentralized RDF graphs via SPARQL queries. Several approaches have been proposed to optimize query processing over a small number of heterogeneous data sources by utilizing schema information. In the case of schema similarity and interlinks among sources, these approaches cause unnecessary data retrieval and communication, leading to poor scalability and response time. This paper addresses these limitations and presents Lusail, a system for scalable and efficient SPARQL query processing over decentralized graphs. Lusail achieves scalability and low query response time through various optimizations at compile and run times. At compile time, we use a novel locality-aware query decomposition technique that maximizes the number of query triple patterns sent together to a source based on the actual location of the instances satisfying these triple patterns. At run time, we use selectivity-awareness and parallel query execution to reduce network latency and to increase parallelism by delaying the execution of subqueries expected to return large results. We evaluate Lusail using real and synthetic benchmarks, with data sizes up to billions of triples on an in-house cluster and a public cloud. We show that Lusail outperforms state-of-the-art systems by orders of magnitude in terms of scalability and response time.
jQuery UI 1.10 the user interface library for jQuery
Libby, Alex
2013-01-01
This book consists of an easy-to-follow, example-based approach that leads you step-by-step through the implementation and customization of each library component.This book is for frontend designers and developers who need to learn how to use jQuery UI quickly. To get the most out of this book, you should have a good working knowledge of HTML, CSS, and JavaScript, and should ideally be comfortable using jQuery.
Optimal Planar Orthogonal Skyline Counting Queries
DEFF Research Database (Denmark)
Brodal, Gerth Stølting; Larsen, Kasper Green
2014-01-01
counting queries, i.e. given a query rectangle R to report the size of the skyline of P\\cap R. We present a data structure for storing n points with integer coordinates having query time O(lg n/lglg n) and space usage O(n). The model of computation is a unit cost RAM with logarithmic word size. We prove...
SIG-VISA: Signal-based Vertically Integrated Seismic Monitoring
Moore, D.; Mayeda, K. M.; Myers, S. C.; Russell, S.
2013-12-01
Traditional seismic monitoring systems rely on discrete detections produced by station processing software; however, while such detections may constitute a useful summary of station activity, they discard large amounts of information present in the original recorded signal. We present SIG-VISA (Signal-based Vertically Integrated Seismic Analysis), a system for seismic monitoring through Bayesian inference on seismic signals. By directly modeling the recorded signal, our approach incorporates additional information unavailable to detection-based methods, enabling higher sensitivity and more accurate localization using techniques such as waveform matching. SIG-VISA's Bayesian forward model of seismic signal envelopes includes physically-derived models of travel times and source characteristics as well as Gaussian process (kriging) statistical models of signal properties that combine interpolation of historical data with extrapolation of learned physical trends. Applying Bayesian inference, we evaluate the model on earthquakes as well as the 2009 DPRK test event, demonstrating a waveform matching effect as part of the probabilistic inference, along with results on event localization and sensitivity. In particular, we demonstrate increased sensitivity from signal-based modeling, in which the SIGVISA signal model finds statistical evidence for arrivals even at stations for which the IMS station processing failed to register any detection.
PAQ: Persistent Adaptive Query Middleware for Dynamic Environments
Rajamani, Vasanth; Julien, Christine; Payton, Jamie; Roman, Gruia-Catalin
Pervasive computing applications often entail continuous monitoring tasks, issuing persistent queries that return continuously updated views of the operational environment. We present PAQ, a middleware that supports applications' needs by approximating a persistent query as a sequence of one-time queries. PAQ introduces an integration strategy abstraction that allows composition of one-time query responses into streams representing sophisticated spatio-temporal phenomena of interest. A distinguishing feature of our middleware is the realization that the suitability of a persistent query's result is a function of the application's tolerance for accuracy weighed against the associated overhead costs. In PAQ, programmers can specify an inquiry strategy that dictates how information is gathered. Since network dynamics impact the suitability of a particular inquiry strategy, PAQ associates an introspection strategy with a persistent query, that evaluates the quality of the query's results. The result of introspection can trigger application-defined adaptation strategies that alter the nature of the query. PAQ's simple API makes developing adaptive querying systems easily realizable. We present the key abstractions, describe their implementations, and demonstrate the middleware's usefulness through application examples and evaluation.
Pareto-depth for multiple-query image retrieval.
Hsiao, Ko-Jen; Calder, Jeff; Hero, Alfred O
2015-02-01
Most content-based image retrieval systems consider either one single query, or multiple queries that include the same object or represent the same semantic information. In this paper, we consider the content-based image retrieval problem for multiple query images corresponding to different image semantics. We propose a novel multiple-query information retrieval algorithm that combines the Pareto front method with efficient manifold ranking. We show that our proposed algorithm outperforms state of the art multiple-query retrieval algorithms on real-world image databases. We attribute this performance improvement to concavity properties of the Pareto fronts, and prove a theoretical result that characterizes the asymptotic concavity of the fronts.
EquiX-A Search and Query Language for XML.
Cohen, Sara; Kanza, Yaron; Kogan, Yakov; Sagiv, Yehoshua; Nutt, Werner; Serebrenik, Alexander
2002-01-01
Describes EquiX, a search language for XML that combines querying with searching to query the data and the meta-data content of Web pages. Topics include search engines; a data model for XML documents; search query syntax; search query semantics; an algorithm for evaluating a query on a document; and indexing EquiX queries. (LRW)
QUERY RESPONSE TIME COMPARISON NOSQLDB MONGODB WITH SQLDB ORACLE
Directory of Open Access Journals (Sweden)
Humasak T. A. Simanjuntak
2015-01-01
Full Text Available Penyimpanan data saat ini terdapat dua jenis yakni relational database dan non-relational database. Kedua jenis DBMS (Database Managemnet System tersebut berbeda dalam berbagai aspek seperti per-formansi eksekusi query, scalability, reliability maupun struktur penyimpanan data. Kajian ini memiliki tujuan untuk mengetahui perbandingan performansi DBMS antara Oracle sebagai jenis relational data-base dan MongoDB sebagai jenis non-relational database dalam mengolah data terstruktur. Eksperimen dilakukan untuk mengetahui perbandingan performansi kedua DBMS tersebut untuk operasi insert, select, update dan delete dengan menggunakan query sederhana maupun kompleks pada database Northwind. Untuk mencapai tujuan eksperimen, 18 query yang terdiri dari 2 insert query, 10 select query, 2 update query dan 2 delete query dieksekusi. Query dieksekusi melalui sebuah aplikasi .Net yang dibangun sebagai perantara antara user dengan basis data. Eksperimen dilakukan pada tabel dengan atau tanpa relasi pada Oracle dan embedded atau bukan embedded dokumen pada MongoDB. Response time untuk setiap eksekusi query dibandingkan dengan menggunakan metode statistik. Eksperimen menunjukkan response time query untuk proses select, insert, dan update pada MongoDB lebih cepatdaripada Oracle. MongoDB lebih cepat 64.8 % untuk select query;MongoDB lebihcepat 72.8 % untuk insert query dan MongoDB lebih cepat 33.9 % untuk update query. Pada delete query, Oracle lebih cepat 96.8 % daripada MongoDB untuk table yang berelasi, tetapi MongoDB lebih cepat 83.8 % daripada Oracle untuk table yang tidak memiliki relasi.Untuk query kompleks dengan Map Reduce pada MongoDB lebih lambat 97.6% daripada kompleks query dengan aggregate function pada Oracle.
Chaffer, Jonathan
2013-01-01
Step through each of the core concepts of the jQuery library, building an overall picture of its capabilities. Once you have thoroughly covered the basics, the book returns to each concept to cover more advanced examples and techniques.This book is for web designers who want to create interactive elements for their designs, and for developers who want to create the best user interface for their web applications. Basic JavaScript programming and knowledge of HTML and CSS is required. No knowledge of jQuery is assumed, nor is experience with any other JavaScript libraries.
Knowledge Query Language (KQL)
2016-02-12
described as a sparse, distributed multidimensional sorted map. Unlike a relational database , BigTable has no multicolumn primary keys or constraints. The...in query languages such as SQL. Figure 3. Address expression-based querying. Each circled step in Figure 3 is described below. Datastore/ Database ...implementation we describe in later sections stores the instance of registry ontology in JSON files. 7 Throughout the rest of this report, we use the
Um framework para a avaliação de interfaces de aplicações SIG Web no dominio agricola
Juliano Schimiguel
2006-01-01
Resumo: Sistemas de Informação Geográfica (SIG) são categorias de software que permitem a manipulação, gerenciamento e visualização de dados geo referenciados. O termo georeferenciado denota associação a um sistema de coordenadas geográficas. Existem inúmeras categorias de aplicações SIG. em diferentes escalas e domínios, abrangendo desde temas urbanos até ambientais. Aplicações de Sistemas de Informação Geográfica na Web, neste trabalho denominadas "aplicações SIG Web", são sistemas onde a '...
Yoo, Ji-Sun; Oh, Gyeong-Seok; Ryoo, Sungweon; Roe, Jung-Hye
2016-01-01
Antibiotic-producing streptomycetes are rich sources of resistance mechanisms against endogenous and exogenous antibiotics. An ECF sigma factor ?R (SigR) is known to govern the thiol-oxidative stress response in Streptomyces coelicolor. Amplification of this response is achieved by producing an unstable isoform of ?R called ?R?. In this work, we present evidence that antibiotics induce the SigR regulon via a redox-independent pathway, leading to antibiotic resistance. The translation-inhibiti...
Enhancing Recall in Semantic Querying
DEFF Research Database (Denmark)
Rouces, Jacobo
2013-01-01
lexically and structurally different, which we will introduce in the next section. As RDF graphs from different sources are expected to be linked, the modeling heterogeneities will make the federated graph become sparser and inconsistent. This is detrimental to the recall of SPARQL queries, as the query...
Location-Dependent Query Processing Under Soft Real-Time Constraints
Directory of Open Access Journals (Sweden)
Zoubir Mammeri
2009-01-01
Full Text Available In recent years, mobile devices and applications achieved an increasing development. In database field, this development required methods to consider new query types like location-dependent queries (i.e. the query results depend on the query issuer location. Although several researches addressed problems related to location-dependent query processing, a few works considered timing requirements that may be associated with queries (i.e., the query results must be delivered to mobile clients on time. The main objective of this paper is to propose a solution for location-dependent query processing under soft real-time constraints. Hence, we propose methods to take into account client location-dependency and to maximize the percentage of queries respecting their deadlines. We validate our proposal by implementing a prototype based on Oracle DBMS. Performance evaluation results show that the proposed solution optimizes the percentage of queries meeting their deadlines and the communication cost.
SCRY: Enabling quantitative reasoning in SPARQL queries
Meroño-Peñuela, A.; Stringer, Bas; Loizou, Antonis; Abeln, Sanne; Heringa, Jaap
2015-01-01
The inability to include quantitative reasoning in SPARQL queries slows down the application of Semantic Web technology in the life sciences. SCRY, our SPARQL compatible service layer, improves this by executing services at query time and making their outputs query-accessible, generating RDF data on
Answering SPARQL queries modulo RDF Schema with paths
Alkhateeb, Faisal; Euzenat, Jérôme
2013-01-01
alkhateeb2013a; SPARQL is the standard query language for RDF graphs. In its strict instantiation, it only offers querying according to the RDF semantics and would thus ignore the semantics of data expressed with respect to (RDF) schemas or (OWL) ontologies. Several extensions to SPARQL have been proposed to query RDF data modulo RDFS, i.e., interpreting the query with RDFS semantics and/or considering external ontologies. We introduce a general framework which allows for expressing query ans...
Directory of Open Access Journals (Sweden)
Suzuki Motoyuki
2009-01-01
Full Text Available Abstract We are developing a method of Web-based unsupervised language model adaptation for recognition of spoken documents. The proposed method chooses keywords from the preliminary recognition result and retrieves Web documents using the chosen keywords. A problem is that the selected keywords tend to contain misrecognized words. The proposed method introduces two new ideas for avoiding the effects of keywords derived from misrecognized words. The first idea is to compose multiple queries from selected keyword candidates so that the misrecognized words and correct words do not fall into one query. The second idea is that the number of Web documents downloaded for each query is determined according to the "query relevance." Combining these two ideas, we can alleviate bad effect of misrecognized keywords by decreasing the number of downloaded Web documents from queries that contain misrecognized keywords. Finally, we examine a method of determining the number of iterative adaptations based on the recognition likelihood. Experiments have shown that the proposed stopping criterion can determine almost the optimum number of iterations. In the final experiment, the word accuracy without adaptation (55.29% was improved to 60.38%, which was 1.13 point better than the result of the conventional unsupervised adaptation method (59.25%.
Directory of Open Access Journals (Sweden)
Akinori Ito
2009-01-01
Full Text Available We are developing a method of Web-based unsupervised language model adaptation for recognition of spoken documents. The proposed method chooses keywords from the preliminary recognition result and retrieves Web documents using the chosen keywords. A problem is that the selected keywords tend to contain misrecognized words. The proposed method introduces two new ideas for avoiding the effects of keywords derived from misrecognized words. The first idea is to compose multiple queries from selected keyword candidates so that the misrecognized words and correct words do not fall into one query. The second idea is that the number of Web documents downloaded for each query is determined according to the “query relevance.” Combining these two ideas, we can alleviate bad effect of misrecognized keywords by decreasing the number of downloaded Web documents from queries that contain misrecognized keywords. Finally, we examine a method of determining the number of iterative adaptations based on the recognition likelihood. Experiments have shown that the proposed stopping criterion can determine almost the optimum number of iterations. In the final experiment, the word accuracy without adaptation (55.29% was improved to 60.38%, which was 1.13 point better than the result of the conventional unsupervised adaptation method (59.25%.
Implementation of Quantum Private Queries Using Nuclear Magnetic Resonance
International Nuclear Information System (INIS)
Wang Chuan; Hao Liang; Zhao Lian-Jie
2011-01-01
We present a modified protocol for the realization of a quantum private query process on a classical database. Using one-qubit query and CNOT operation, the query process can be realized in a two-mode database. In the query process, the data privacy is preserved as the sender would not reveal any information about the database besides her query information, and the database provider cannot retain any information about the query. We implement the quantum private query protocol in a nuclear magnetic resonance system. The density matrix of the memory registers are constructed. (general)
SPARQL Query Re-writing Using Partonomy Based Transformation Rules
Jain, Prateek; Yeh, Peter Z.; Verma, Kunal; Henson, Cory A.; Sheth, Amit P.
Often the information present in a spatial knowledge base is represented at a different level of granularity and abstraction than the query constraints. For querying ontology's containing spatial information, the precise relationships between spatial entities has to be specified in the basic graph pattern of SPARQL query which can result in long and complex queries. We present a novel approach to help users intuitively write SPARQL queries to query spatial data, rather than relying on knowledge of the ontology structure. Our framework re-writes queries, using transformation rules to exploit part-whole relations between geographical entities to address the mismatches between query constraints and knowledge base. Our experiments were performed on completely third party datasets and queries. Evaluations were performed on Geonames dataset using questions from National Geographic Bee serialized into SPARQL and British Administrative Geography Ontology using questions from a popular trivia website. These experiments demonstrate high precision in retrieval of results and ease in writing queries.
Mobile Information Access with Spoken Query Answering
DEFF Research Database (Denmark)
Brøndsted, Tom; Larsen, Henrik Legind; Larsen, Lars Bo
2006-01-01
window focused over the part which most likely contains an answer to the query. The two systems are integrated into a full spoken query answering system. The prototype can answer queries and questions within the chosen football (soccer) test domain, but the system has the flexibility for being ported...
On the formulation of performant sparql queries
Loizou, A.; Angles, R.; Groth, P.T.
2014-01-01
Abstract The combination of the flexibility of RDF and the expressiveness of SPARQL provides a powerful mechanism to model, integrate and query data. However, these properties also mean that it is nontrivial to write performant SPARQL queries. Indeed, it is quite easy to create queries that tax even
Evaluation of Sub Query Performance in SQL Server
Oktavia, Tanty; Sujarwo, Surya
2014-03-01
The paper explores several sub query methods used in a query and their impact on the query performance. The study uses experimental approach to evaluate the performance of each sub query methods combined with indexing strategy. The sub query methods consist of in, exists, relational operator and relational operator combined with top operator. The experimental shows that using relational operator combined with indexing strategy in sub query has greater performance compared with using same method without indexing strategy and also other methods. In summary, for application that emphasized on the performance of retrieving data from database, it better to use relational operator combined with indexing strategy. This study is done on Microsoft SQL Server 2012.
Responsive web design with jQuery
Carlos, Gilberto
2013-01-01
Responsive Web Design with jQuery follows a standard tutorial-based approach, covering various aspects of responsive web design by building a comprehensive website.""Responsive Web Design with jQuery"" is aimed at web designers who are interested in building device-agnostic websites. You should have a grasp of standard HTML, CSS, and JavaScript development, and have a familiarity with graphic design. Some exposure to jQuery and HTML5 will be beneficial but isn't essential.
Adding query privacy to robust DHTs
DEFF Research Database (Denmark)
Backes, Michael; Goldberg, Ian; Kate, Aniket
2012-01-01
intermediate peers that (help to) route the queries towards their destinations. In this paper, we satisfy this requirement by presenting an approach for providing privacy for the keys in DHT queries. We use the concept of oblivious transfer (OT) in communication over DHTs to preserve query privacy without...... privacy over robust DHTs. Finally, we compare the performance of our privacy-preserving protocols with their more privacy-invasive counterparts. We observe that there is no increase in the message complexity...
SPARQL Assist language-neutral query composer
2012-01-01
Background SPARQL query composition is difficult for the lay-person, and even the experienced bioinformatician in cases where the data model is unfamiliar. Moreover, established best-practices and internationalization concerns dictate that the identifiers for ontological terms should be opaque rather than human-readable, which further complicates the task of synthesizing queries manually. Results We present SPARQL Assist: a Web application that addresses these issues by providing context-sensitive type-ahead completion during SPARQL query construction. Ontological terms are suggested using their multi-lingual labels and descriptions, leveraging existing support for internationalization and language-neutrality. Moreover, the system utilizes the semantics embedded in ontologies, and within the query itself, to help prioritize the most likely suggestions. Conclusions To ensure success, the Semantic Web must be easily available to all users, regardless of locale, training, or preferred language. By enhancing support for internationalization, and moreover by simplifying the manual construction of SPARQL queries through the use of controlled-natural-language interfaces, we believe we have made some early steps towards simplifying access to Semantic Web resources. PMID:22373327
SPARQL assist language-neutral query composer.
McCarthy, Luke; Vandervalk, Ben; Wilkinson, Mark
2012-01-25
SPARQL query composition is difficult for the lay-person, and even the experienced bioinformatician in cases where the data model is unfamiliar. Moreover, established best-practices and internationalization concerns dictate that the identifiers for ontological terms should be opaque rather than human-readable, which further complicates the task of synthesizing queries manually. We present SPARQL Assist: a Web application that addresses these issues by providing context-sensitive type-ahead completion during SPARQL query construction. Ontological terms are suggested using their multi-lingual labels and descriptions, leveraging existing support for internationalization and language-neutrality. Moreover, the system utilizes the semantics embedded in ontologies, and within the query itself, to help prioritize the most likely suggestions. To ensure success, the Semantic Web must be easily available to all users, regardless of locale, training, or preferred language. By enhancing support for internationalization, and moreover by simplifying the manual construction of SPARQL queries through the use of controlled-natural-language interfaces, we believe we have made some early steps towards simplifying access to Semantic Web resources.
Query Optimizations over Decentralized RDF Graphs
Abdelaziz, Ibrahim; Mansour, Essam; Ouzzani, Mourad; Aboulnaga, Ashraf; Kalnis, Panos
2017-01-01
Applications in life sciences, decentralized social networks, Internet of Things, and statistical linked dataspaces integrate data from multiple decentralized RDF graphs via SPARQL queries. Several approaches have been proposed to optimize query
Machine learning for recommendation systems in job postings selection
Marcos Santamarta, Victor
2016-01-01
Recommendation is a particular form of information filtering, that exploits past behaviors and user similarities to generate a list of information items that is personally tailored to an end-user?s preferences. Recommender systems have become extremely common in recent years, and are applied in a variety of applications. The most popular ones are probably movies, music, news, books, research articles, search queries, social tags, and products in general. However, there are also recommender sy...
Peute, Linda W P; de Keizer, Nicolette F; Jaspers, Monique W M
2015-06-01
To compare the performance of the Concurrent (CTA) and Retrospective (RTA) Think Aloud method and to assess their value in a formative usability evaluation of an Intensive Care Registry-physician data query tool designed to support ICU quality improvement processes. Sixteen representative intensive care physicians participated in the usability evaluation study. Subjects were allocated to either the CTA or RTA method by a matched randomized design. Each subject performed six usability-testing tasks of varying complexity in the query tool in a real-working context. Methods were compared with regard to number and type of problems detected. Verbal protocols of CTA and RTA were analyzed in depth to assess differences in verbal output. Standardized measures were applied to assess thoroughness in usability problem detection weighted per problem severity level and method overall effectiveness in detecting usability problems with regard to the time subjects spent per method. The usability evaluation of the data query tool revealed a total of 43 unique usability problems that the intensive care physicians encountered. CTA detected unique usability problems with regard to graphics/symbols, navigation issues, error messages, and the organization of information on the query tool's screens. RTA detected unique issues concerning system match with subjects' language and applied terminology. The in-depth verbal protocol analysis of CTA provided information on intensive care physicians' query design strategies. Overall, CTA performed significantly better than RTA in detecting usability problems. CTA usability problem detection effectiveness was 0.80 vs. 0.62 (pusability problems of a moderate (0.85 vs. 0.7) and severe nature (0.71 vs. 0.57). In this study, the CTA is more effective in usability-problem detection and provided clarification of intensive care physician query design strategies to inform redesign of the query tool. However, CTA does not outperform RTA. The RTA
PERANGKAT BANTU UNTUK OPTIMASI QUERY PADA ORACLE DENGAN RESTRUKTURISASI SQL
Directory of Open Access Journals (Sweden)
Darlis Heru Murti
2006-07-01
Full Text Available Query merupakan bagian dari bahasa pemrograman SQL (Structured Query Language yang berfungsi untuk mengambil data (read dalam DBMS (Database Management System, termasuk Oracle [3]. Pada Oracle, ada tiga tahap proses yang dilakukan dalam pengeksekusian query, yaitu Parsing, Execute dan Fetch. Sebelum proses execute dijalankan, Oracle terlebih dahulu membuat execution plan yang akan menjadi skenario dalam proses excute.Dalam proses pengeksekusian query, terdapat faktor-faktor yang mempengaruhi kinerja query, di antaranya access path (cara pengambilan data dari sebuah tabel dan operasi join (cara menggabungkan data dari dua tabel. Untuk mendapatkan query dengan kinerja optimal, maka diperlukan pertimbangan-pertimbangan dalam menyikapi faktor-faktor tersebut. Optimasi query merupakan suatu cara untuk mendapatkan query dengan kinerja seoptimal mungkin, terutama dilihat dari sudut pandang waktu. Ada banyak metode untuk mengoptimasi query, tapi pada Penelitian ini, penulis membuat sebuah aplikasi untuk mengoptimasi query dengan metode restrukturisasi SQL statement. Pada metode ini, objek yang dianalisa adalah struktur klausa yang membangun sebuah query. Aplikasi ini memiliki satu input dan lima jenis output. Input dari aplikasi ini adalah sebuah query sedangkan kelima jenis output aplikasi ini adalah berupa query hasil optimasi, saran perbaikan, saran pembuatan indeks baru, execution plan dan data statistik. Cara kerja aplikasi ini dibagi menjadi empat tahap yaitu mengurai query menjadi sub query, mengurai query per-klausa, menentukan access path dan operasi join dan restrukturisasi query.Dari serangkaian ujicoba yang dilakukan penulis, aplikasi telah dapat berjalan sesuai dengan tujuan pembuatan Penelitian ini, yaitu mendapatkan query dengan kinerja optimal.Kata Kunci : Query, SQL, DBMS, Oracle, Parsing, Execute, Fetch, Execution Plan, Access Path, Operasi Join, Restrukturisasi SQL statement.
Evaluating SPARQL queries on massive RDF datasets
Al-Harbi, Razen; Abdelaziz, Ibrahim; Kalnis, Panos; Mamoulis, Nikos
2015-01-01
In this paper, we propose AdHash, a distributed RDF system which addresses the shortcomings of previous work. First, AdHash initially applies lightweight hash partitioning, which drastically minimizes the startup cost, while favoring the parallel processing of join patterns on subjects, without any data communication. Using a locality-aware planner, queries that cannot be processed in parallel are evaluated with minimal communication. Second, AdHash monitors the data access patterns and adapts dynamically to the query load by incrementally redistributing and replicating frequently accessed data. As a result, the communication cost for future queries is drastically reduced or even eliminated. Our experiments with synthetic and real data verify that AdHash (i) starts faster than all existing systems, (ii) processes thousands of queries before other systems become online, and (iii) gracefully adapts to the query load, being able to evaluate queries on billion-scale RDF data in sub-seconds. In this demonstration, audience can use a graphical interface of AdHash to verify its performance superiority compared to state-of-the-art distributed RDF systems.
Anti-dentine Salivary SIgA in young adults with a history of dental trauma in deciduous teeth
Directory of Open Access Journals (Sweden)
Gabriela Fleury SEIXAS
2015-01-01
Full Text Available Anti-dentin autoantibodies are associated with inflammatory root resorption in permanent teeth and are modulated by dental trauma and orthodontic force. However, it is not known whether deciduous tooth trauma can stimulate the development of a humoral immune response against dentin. The aim of this study was to evaluate the levels of salivary SIgA reactivity against human dentin extract in young adults with a history of trauma in the primary dentition. A sample of 78 patients, aged 18 to 25, who had completed an early childhood (0 to 5 years old caries prevention program years earlier at the Universidade Estadual de LondrinaPediatric Clinic, underwent radiographic examination and salivary sampling. Anti-dentin SIgA levels were analyzed by immunoenzymatic assay and Western blotting. Although dental trauma to deciduous teeth had occurred in 34 (43.6% of the patients, no differences in SIgA levels were detected between individuals who had experienced trauma and those who had not (p > 0.05. Multivariate regression analysis showed no association between dental trauma and SIgA levels (p > 0.05. Patients with a history of deciduous trauma presented low levels of anti-dentin antibodies, associated with orthodontic root resorption (p
Evaluating SPARQL queries on massive RDF datasets
Al-Harbi, Razen
2015-08-01
Distributed RDF systems partition data across multiple computer nodes. Partitioning is typically based on heuristics that minimize inter-node communication and it is performed in an initial, data pre-processing phase. Therefore, the resulting partitions are static and do not adapt to changes in the query workload; as a result, existing systems are unable to consistently avoid communication for queries that are not favored by the initial data partitioning. Furthermore, for very large RDF knowledge bases, the partitioning phase becomes prohibitively expensive, leading to high startup costs. In this paper, we propose AdHash, a distributed RDF system which addresses the shortcomings of previous work. First, AdHash initially applies lightweight hash partitioning, which drastically minimizes the startup cost, while favoring the parallel processing of join patterns on subjects, without any data communication. Using a locality-aware planner, queries that cannot be processed in parallel are evaluated with minimal communication. Second, AdHash monitors the data access patterns and adapts dynamically to the query load by incrementally redistributing and replicating frequently accessed data. As a result, the communication cost for future queries is drastically reduced or even eliminated. Our experiments with synthetic and real data verify that AdHash (i) starts faster than all existing systems, (ii) processes thousands of queries before other systems become online, and (iii) gracefully adapts to the query load, being able to evaluate queries on billion-scale RDF data in sub-seconds. In this demonstration, audience can use a graphical interface of AdHash to verify its performance superiority compared to state-of-the-art distributed RDF systems.
Vaucouleur, Sebastien
2011-02-01
We introduce code query by example for customisation of evolvable software products in general and of enterprise resource planning systems (ERPs) in particular. The concept is based on an initial empirical study on practices around ERP systems. We motivate our design choices based on those empirical results, and we show how the proposed solution helps with respect to the infamous upgrade problem: the conflict between the need for customisation and the need for upgrade of ERP systems. We further show how code query by example can be used as a form of lightweight static analysis, to detect automatically potential defects in large software products. Code query by example as a form of lightweight static analysis is particularly interesting in the context of ERP systems: it is often the case that programmers working in this field are not computer science specialists but more of domain experts. Hence, they require a simple language to express custom rules.
Ferramenta SIG para Modelos de Propagação de Ondas. Desenvolvimentos Preliminares
Zózimo, A. C.; Charneca, N.; Gonçalves, A.; Fortes, C. J. E. M.
2005-01-01
SIMAR é um Sistema Integrado de Modelação da Agitação maRítima, para estudos de propagação e deformação da agitação marítima em zonas costeiras e portuárias. Este Sistema é baseado num Sistema de Informação Geográfica (SIG) e inclui um conjunto de módulos correspondentes aos modelos de propagação e deformação de ondas. A comunicação entre o SIG e os módulos é efectuada por uma interface gráfica construída para o efeito. Nesta comunicação, apresenta-se uma versão preliminar do SIMAR e das s...
Efficient Approximate OLAP Querying Over Time Series
DEFF Research Database (Denmark)
Perera, Kasun Baruhupolage Don Kasun Sanjeewa; Hahmann, Martin; Lehner, Wolfgang
2016-01-01
The ongoing trend for data gathering not only produces larger volumes of data, but also increases the variety of recorded data types. Out of these, especially time series, e.g. various sensor readings, have attracted attention in the domains of business intelligence and decision making. As OLAP...... queries play a major role in these domains, it is desirable to also execute them on time series data. While this is not a problem on the conceptual level, it can become a bottleneck with regards to query run-time. In general, processing OLAP queries gets more computationally intensive as the volume...... of data grows. This is a particular problem when querying time series data, which generally contains multiple measures recorded at fine time granularities. Usually, this issue is addressed either by scaling up hardware or by employing workload based query optimization techniques. However, these solutions...
Algebraic Optimization of Recursive Database Queries
DEFF Research Database (Denmark)
Hansen, Michael Reichhardt
1988-01-01
Queries are expressed by relational algebra expressions including a fixpoint operation. A condition is presented under which a natural join commutes with a fixpoint operation. This condition is a simple check of attribute sets of sub-expressions of the query. The work may be considered a generali......Queries are expressed by relational algebra expressions including a fixpoint operation. A condition is presented under which a natural join commutes with a fixpoint operation. This condition is a simple check of attribute sets of sub-expressions of the query. The work may be considered...... a generalization of Aho and Ullman, (1979). The result is interpreted in function free logic database terms as a transformation of the recursively defined predicate involving: (a) elimination of an argument, and (b) propagation of selections (instantiations) to the extensionally defined predicates. A collection...
The effect of query complexity on Web searching results
Directory of Open Access Journals (Sweden)
B.J. Jansen
2000-01-01
Full Text Available This paper presents findings from a study of the effects of query structure on retrieval by Web search services. Fifteen queries were selected from the transaction log of a major Web search service in simple query form with no advanced operators (e.g., Boolean operators, phrase operators, etc. and submitted to 5 major search engines - Alta Vista, Excite, FAST Search, Infoseek, and Northern Light. The results from these queries became the baseline data. The original 15 queries were then modified using the various search operators supported by each of the 5 search engines for a total of 210 queries. Each of these 210 queries was also submitted to the applicable search service. The results obtained were then compared to the baseline results. A total of 2,768 search results were returned by the set of all queries. In general, increasing the complexity of the queries had little effect on the results with a greater than 70% overlap in results, on average. Implications for the design of Web search services and directions for future research are discussed.
Mining the SDSS SkyServer SQL queries log
Hirota, Vitor M.; Santos, Rafael; Raddick, Jordan; Thakar, Ani
2016-05-01
SkyServer, the Internet portal for the Sloan Digital Sky Survey (SDSS) astronomic catalog, provides a set of tools that allows data access for astronomers and scientific education. One of SkyServer data access interfaces allows users to enter ad-hoc SQL statements to query the catalog. SkyServer also presents some template queries that can be used as basis for more complex queries. This interface has logged over 330 million queries submitted since 2001. It is expected that analysis of this data can be used to investigate usage patterns, identify potential new classes of queries, find similar queries, etc. and to shed some light on how users interact with the Sloan Digital Sky Survey data and how scientists have adopted the new paradigm of e-Science, which could in turn lead to enhancements on the user interfaces and experience in general. In this paper we review some approaches to SQL query mining, apply the traditional techniques used in the literature and present lessons learned, namely, that the general text mining approach for feature extraction and clustering does not seem to be adequate for this type of data, and, most importantly, we find that this type of analysis can result in very different queries being clustered together.
Unge er en flok egoistiske individualister, der har nok i sig selv
DEFF Research Database (Denmark)
Kofod, Anne
2005-01-01
Nutidens unge har som oftest en tvetydig identitet: På én og samme tid søger de sammen i sociale fællesskaber og gør alt for at skille sig ud som individualister. Udgivelsesdato: 27. september...
Fragger: a protein fragment picker for structural queries.
Berenger, Francois; Simoncini, David; Voet, Arnout; Shrestha, Rojan; Zhang, Kam Y J
2017-01-01
Protein modeling and design activities often require querying the Protein Data Bank (PDB) with a structural fragment, possibly containing gaps. For some applications, it is preferable to work on a specific subset of the PDB or with unpublished structures. These requirements, along with specific user needs, motivated the creation of a new software to manage and query 3D protein fragments. Fragger is a protein fragment picker that allows protein fragment databases to be created and queried. All fragment lengths are supported and any set of PDB files can be used to create a database. Fragger can efficiently search a fragment database with a query fragment and a distance threshold. Matching fragments are ranked by distance to the query. The query fragment can have structural gaps and the allowed amino acid sequences matching a query can be constrained via a regular expression of one-letter amino acid codes. Fragger also incorporates a tool to compute the backbone RMSD of one versus many fragments in high throughput. Fragger should be useful for protein design, loop grafting and related structural bioinformatics tasks.
GMB: An Efficient Query Processor for Biological Data
Directory of Open Access Journals (Sweden)
Taha Kamal
2011-06-01
Full Text Available Bioinformatics applications manage complex biological data stored into distributed and often heterogeneous databases and require large computing power. These databases are too big and complicated to be rapidly queried every time a user submits a query, due to the overhead involved in decomposing the queries, sending the decomposed queries to remote databases, and composing the results. There is also considerable communication costs involved. This study addresses the mentioned problems in Grid-based environment for bioinformatics. We propose a Grid middleware called GMB that alleviates these problems by caching the results of Frequently Used Queries (FUQ. Queries are classified based on their types and frequencies. FUQ are answered from the middleware, which improves their response time. GMB acts as a gateway to TeraGrid Grid: it resides between users’ applications and TeraGrid Grid. We evaluate GMB experimentally.
The Data Cyclotron query processing scheme
Goncalves, R.; Kersten, M.
2011-01-01
A grand challenge of distributed query processing is to devise a self-organizing architecture which exploits all hardware resources optimally to manage the database hot set, minimize query response time, and maximize throughput without single point global coordination. The Data Cyclotron
Approximate furthest neighbor with application to annulus query
DEFF Research Database (Denmark)
Pagh, Rasmus; Silvestri, Francesco; Sivertsen, Johan von Tangen
2016-01-01
-dimensional Euclidean space. The method builds on the technique of Indyk (SODA 2003), storing random projections to provide sublinear query time for AFN. However, we introduce a different query algorithm, improving on Indyk׳s approximation factor and reducing the running time by a logarithmic factor. We also present......, the query-dependent approach is used for deriving a data structure for the approximate annulus query problem, which is defined as follows: given an input set S and two parameters r>0 and w≥1, construct a data structure that returns for each query point q a point p∈S such that the distance between p and q...
Manchester visual query language
Oakley, John P.; Davis, Darryl N.; Shann, Richard T.
1993-04-01
We report a database language for visual retrieval which allows queries on image feature information which has been computed and stored along with images. The language is novel in that it provides facilities for dealing with feature data which has actually been obtained from image analysis. Each line in the Manchester Visual Query Language (MVQL) takes a set of objects as input and produces another, usually smaller, set as output. The MVQL constructs are mainly based on proven operators from the field of digital image analysis. An example is the Hough-group operator which takes as input a specification for the objects to be grouped, a specification for the relevant Hough space, and a definition of the voting rule. The output is a ranked list of high scoring bins. The query could be directed towards one particular image or an entire image database, in the latter case the bins in the output list would in general be associated with different images. We have implemented MVQL in two layers. The command interpreter is a Lisp program which maps each MVQL line to a sequence of commands which are used to control a specialized database engine. The latter is a hybrid graph/relational system which provides low-level support for inheritance and schema evolution. In the paper we outline the language and provide examples of useful queries. We also describe our solution to the engineering problems associated with the implementation of MVQL.
A structural query system for Han characters
DEFF Research Database (Denmark)
Skala, Matthew
2016-01-01
The IDSgrep structural query system for Han character dictionaries is presented. This dictionary search system represents the spatial structure of Han characters using Extended Ideographic Description Sequences (EIDSes), a data model and syntax based on the Unicode IDS concept. It includes a query...... language for EIDS databases, with a freely available implementation and format translation from popular third-party IDS and XML character databases. The system is designed to suit the needs of font developers and foreign language learners. The search algorithm includes a bit vector index inspired by Bloom...... filters to support faster query operations. Experimental results are presented, evaluating the effect of the indexing on query performance....
Domogis: prototipo de un interfaz del sistema de control de un edificio integrado en un SIG
Directory of Open Access Journals (Sweden)
Álvarez, M.
2010-06-01
Full Text Available This paper deals with of a the use of Geographical Information Systems (GIS for domotic control. The foccus is put on the communication interface between the building control system (BCS integrated in a GIS. For get this aim, the GIS of the Montegancedo Campus where is located the Facultad de infomatica of UPM and the creation of an interface is needed. The implemented interface in Microsoft C# language allows the control, monotorizing and management of the sensors data installated in the Campus.
Este trabajo trata de la utilización de los Sistemas de Información Geográfica (SIG en uno de las nuevos requerimientos de la arquitectura, el control domótico. El objetivo es el desarrollo de un interfaz de comunicación del Sistema de Control de un Edificio (SCE integrado en un SIG. La consecución de este objetivo implica previamente el desarrollo del SIG del Campus de Montegancedo sede de la Facultad de Informática de la UPM y la creación de un interfaz integrado en el SIG, desarrollado en lenguaje de programacion C# de Microsoft. Este interfaz dirige al usuario en la realización de ciertas tareas de control domótico de las instalaciones urbanas y edificios del Campus universitario, como evaluar, monotorizar y gestionar datos procedentes de sensores estratégicamente situados en dicho Campus.
Enabling Incremental Query Re-Optimization.
Liu, Mengmeng; Ives, Zachary G; Loo, Boon Thau
2016-01-01
As declarative query processing techniques expand to the Web, data streams, network routers, and cloud platforms, there is an increasing need to re-plan execution in the presence of unanticipated performance changes. New runtime information may affect which query plan we prefer to run. Adaptive techniques require innovation both in terms of the algorithms used to estimate costs , and in terms of the search algorithm that finds the best plan. We investigate how to build a cost-based optimizer that recomputes the optimal plan incrementally given new cost information, much as a stream engine constantly updates its outputs given new data. Our implementation especially shows benefits for stream processing workloads. It lays the foundations upon which a variety of novel adaptive optimization algorithms can be built. We start by leveraging the recently proposed approach of formulating query plan enumeration as a set of recursive datalog queries ; we develop a variety of novel optimization approaches to ensure effective pruning in both static and incremental cases. We further show that the lessons learned in the declarative implementation can be equally applied to more traditional optimizer implementations.
Directory of Open Access Journals (Sweden)
Busche Tobias
2012-09-01
Full Text Available Abstract Background The expression of genes in Corynebacterium glutamicum, a Gram-positive non-pathogenic bacterium used mainly for the industrial production of amino acids, is regulated by seven different sigma factors of RNA polymerase, including the stress-responsive ECF-sigma factor SigH. The sigH gene is located in a gene cluster together with the rshA gene, putatively encoding an anti-sigma factor. The aim of this study was to analyze the transcriptional regulation of the sigH and rshA gene cluster and the effects of RshA on the SigH regulon, in order to refine the model describing the role of SigH and RshA during stress response. Results Transcription analyses revealed that the sigH gene and rshA gene are cotranscribed from four sigH housekeeping promoters in C. glutamicum. In addition, a SigH-controlled rshA promoter was found to only drive the transcription of the rshA gene. To test the role of the putative anti-sigma factor gene rshA under normal growth conditions, a C. glutamicum rshA deletion strain was constructed and used for genome-wide transcription profiling with DNA microarrays. In total, 83 genes organized in 61 putative transcriptional units, including those previously detected using sigH mutant strains, exhibited increased transcript levels in the rshA deletion mutant compared to its parental strain. The genes encoding proteins related to disulphide stress response, heat stress proteins, components of the SOS-response to DNA damage and proteasome components were the most markedly upregulated gene groups. Altogether six SigH-dependent promoters upstream of the identified genes were determined by primer extension and a refined consensus promoter consisting of 45 original promoter sequences was constructed. Conclusions The rshA gene codes for an anti-sigma factor controlling the function of the stress-responsive sigma factor SigH in C. glutamicum. Transcription of rshA from a SigH-dependent promoter may serve to quickly
Busche, Tobias; Silar, Radoslav; Pičmanová, Martina; Pátek, Miroslav; Kalinowski, Jörn
2012-09-03
The expression of genes in Corynebacterium glutamicum, a Gram-positive non-pathogenic bacterium used mainly for the industrial production of amino acids, is regulated by seven different sigma factors of RNA polymerase, including the stress-responsive ECF-sigma factor SigH. The sigH gene is located in a gene cluster together with the rshA gene, putatively encoding an anti-sigma factor. The aim of this study was to analyze the transcriptional regulation of the sigH and rshA gene cluster and the effects of RshA on the SigH regulon, in order to refine the model describing the role of SigH and RshA during stress response. Transcription analyses revealed that the sigH gene and rshA gene are cotranscribed from four sigH housekeeping promoters in C. glutamicum. In addition, a SigH-controlled rshA promoter was found to only drive the transcription of the rshA gene. To test the role of the putative anti-sigma factor gene rshA under normal growth conditions, a C. glutamicum rshA deletion strain was constructed and used for genome-wide transcription profiling with DNA microarrays. In total, 83 genes organized in 61 putative transcriptional units, including those previously detected using sigH mutant strains, exhibited increased transcript levels in the rshA deletion mutant compared to its parental strain. The genes encoding proteins related to disulphide stress response, heat stress proteins, components of the SOS-response to DNA damage and proteasome components were the most markedly upregulated gene groups. Altogether six SigH-dependent promoters upstream of the identified genes were determined by primer extension and a refined consensus promoter consisting of 45 original promoter sequences was constructed. The rshA gene codes for an anti-sigma factor controlling the function of the stress-responsive sigma factor SigH in C. glutamicum. Transcription of rshA from a SigH-dependent promoter may serve to quickly shutdown the SigH-dependent stress response after the cells have
Proceedings of the 10th ASIS SIG/CR classification research workshop
DEFF Research Database (Denmark)
This volume is a working copy of the papers presented at the 9th ASIS SIG/CR workshop on classification research, held in Washington, DC, at the ASIS Annual Meeting on Sunday 31 October 1999. The contributions printed here are working papers, and thus, not necessarily in their final form...
Spatial Keyword Query Processing
DEFF Research Database (Denmark)
Chen, Lisi; Jensen, Christian S.; Wu, Dingming
2013-01-01
Geo-textual indices play an important role in spatial keyword query- ing. The existing geo-textual indices have not been compared sys- tematically under the same experimental framework. This makes it difficult to determine which indexing technique best supports specific functionality. We provide...... an all-around survey of 12 state- of-the-art geo-textual indices. We propose a benchmark that en- ables the comparison of the spatial keyword query performance. We also report on the findings obtained when applying the bench- mark to the indices, thus uncovering new insights that may guide index...
RDF-GL: A SPARQL-Based Graphical Query Language for RDF
Hogenboom, Frederik; Milea, Viorel; Frasincar, Flavius; Kaymak, Uzay
This chapter presents RDF-GL, a graphical query language (GQL) for RDF. The GQL is based on the textual query language SPARQL and mainly focuses on SPARQL SELECT queries. The advantage of a GQL over textual query languages is that complexity is hidden through the use of graphical symbols. RDF-GL is supported by a Java-based editor, SPARQLinG, which is presented as well. The editor does not only allow for RDF-GL query creation, but also converts RDF-GL queries to SPARQL queries and is able to subsequently execute these. Experiments show that using the GQL in combination with the editor makes RDF querying more accessible for end users.
The Data Cyclotron query processing scheme.
R.A. Goncalves (Romulo); M.L. Kersten (Martin)
2011-01-01
htmlabstractA grand challenge of distributed query processing is to devise a self-organizing architecture which exploits all hardware resources optimally to manage the database hot set, minimize query response time, and maximize throughput without single point global coordination. The Data Cyclotron
A Multi-Query Optimizer for Monet
S. Manegold (Stefan); A.J. Pellenkoft (Jan); M.L. Kersten (Martin)
2000-01-01
textabstractDatabase systems allow for concurrent use of several applications (and query interfaces). Each application generates an ``optimal'' plan---a sequence of low-level database operators---for accessing the database. The queries posed by users through the same application can be optimized
A multi-query optimizer for Monet
S. Manegold (Stefan); A.J. Pellenkoft (Jan); M.L. Kersten (Martin)
2000-01-01
textabstractDatabase systems allow for concurrent use of several applications (and query interfaces). Each application generates an ``optimal'' plan---a sequence of low-level database operators---for accessing the database. The queries posed by users through the same application can be optimized
Path-based Queries on Trajectory Data
DEFF Research Database (Denmark)
Krogh, Benjamin Bjerre; Pelekis, Nikos; Theodoridis, Yannis
2014-01-01
In traffic research, management, and planning a number of path-based analyses are heavily used, e.g., for computing turn-times, evaluating green waves, or studying traffic flow. These analyses require retrieving the trajectories that follow the full path being analyzed. Existing path queries cannot...... sufficiently support such path-based analyses because they retrieve all trajectories that touch any edge in the path. In this paper, we define and formalize the strict path query. This is a novel query type tailored to support path-based analysis, where trajectories must follow all edges in the path...... a specific path by only retrieving data from the first and last edge in the path. To correctly answer strict path queries existing network-constrained trajectory indexes must retrieve data from all edges in the path. An extensive performance study of NETTRA using a very large real-world trajectory data set...
Result Diversification Based on Query-Specific Cluster Ranking
J. He (Jiyin); E. Meij; M. de Rijke (Maarten)
2011-01-01
htmlabstractResult diversification is a retrieval strategy for dealing with ambiguous or multi-faceted queries by providing documents that cover as many facets of the query as possible. We propose a result diversification framework based on query-specific clustering and cluster ranking,
SIG-TUR: UMA HERRAMIENTA PARA LA PLANIFICACIÓN, GESTIÓN Y CONTROL DE LOS DESTINOS TURISTICOS
Ghedin, Leila Marcia; Alves da Silva, Moivan; Duarte Sevalho, Carla Danielle; da Silva Level, Tainah
2012-01-01
El Sistema de Información Geográfica Aplicado al Turismo – SIG-Tur es una herramienta tecnológica que optimiza los estudios turísticos desarrollados por los planificadores del sector. En el estudio se buscó evidenciar el instrumento de planificación SIG-Tur, que dispone de funciones de registro, almacenamiento y manipulación de informaciones, además de considerar las dificultades enfrentadas al respecto de la representación de fenómenos espaciales que el turismo abarca. Para el estudio se ado...
Visual Querying in Chemical Databases using SMARTS Patterns
Šípek, Vojtěch
2014-01-01
The purpose of this thesis is to create framework for visual querying in chemical databases which will be implemented as a web application. By using graphical editor, which is a part of client side, the user creates queries which are translated into chemical query language SMARTS. This query is parsed on the application server which is connected to the chemical database. This framework also contains tooling for creating the database and index structure above it. 1
DrugSig: A resource for computational drug repositioning utilizing gene expression signatures.
Directory of Open Access Journals (Sweden)
Hongyu Wu
Full Text Available Computational drug repositioning has been proved as an effective approach to develop new drug uses. However, currently existing strategies strongly rely on drug response gene signatures which scattered in separated or individual experimental data, and resulted in low efficient outputs. So, a fully drug response gene signatures database will be very helpful to these methods. We collected drug response microarray data and annotated related drug and targets information from public databases and scientific literature. By selecting top 500 up-regulated and down-regulated genes as drug signatures, we manually established the DrugSig database. Currently DrugSig contains more than 1300 drugs, 7000 microarray and 800 targets. Moreover, we developed the signature based and target based functions to aid drug repositioning. The constructed database can serve as a resource to quicken computational drug repositioning. Database URL: http://biotechlab.fudan.edu.cn/database/drugsig/.
Result diversification based on query-specific cluster ranking
He, J.; Meij, E.; de Rijke, M.
2011-01-01
Result diversification is a retrieval strategy for dealing with ambiguous or multi-faceted queries by providing documents that cover as many facets of the query as possible. We propose a result diversification framework based on query-specific clustering and cluster ranking, in which diversification
Cumulative query method for influenza surveillance using search engine data.
Seo, Dong-Woo; Jo, Min-Woo; Sohn, Chang Hwan; Shin, Soo-Yong; Lee, JaeHo; Yu, Maengsoo; Kim, Won Young; Lim, Kyoung Soo; Lee, Sang-Il
2014-12-16
Internet search queries have become an important data source in syndromic surveillance system. However, there is currently no syndromic surveillance system using Internet search query data in South Korea. The objective of this study was to examine correlations between our cumulative query method and national influenza surveillance data. Our study was based on the local search engine, Daum (approximately 25% market share), and influenza-like illness (ILI) data from the Korea Centers for Disease Control and Prevention. A quota sampling survey was conducted with 200 participants to obtain popular queries. We divided the study period into two sets: Set 1 (the 2009/10 epidemiological year for development set 1 and 2010/11 for validation set 1) and Set 2 (2010/11 for development Set 2 and 2011/12 for validation Set 2). Pearson's correlation coefficients were calculated between the Daum data and the ILI data for the development set. We selected the combined queries for which the correlation coefficients were .7 or higher and listed them in descending order. Then, we created a cumulative query method n representing the number of cumulative combined queries in descending order of the correlation coefficient. In validation set 1, 13 cumulative query methods were applied, and 8 had higher correlation coefficients (min=.916, max=.943) than that of the highest single combined query. Further, 11 of 13 cumulative query methods had an r value of ≥.7, but 4 of 13 combined queries had an r value of ≥.7. In validation set 2, 8 of 15 cumulative query methods showed higher correlation coefficients (min=.975, max=.987) than that of the highest single combined query. All 15 cumulative query methods had an r value of ≥.7, but 6 of 15 combined queries had an r value of ≥.7. Cumulative query method showed relatively higher correlation with national influenza surveillance data than combined queries in the development and validation set.
Query Health: standards-based, cross-platform population health surveillance.
Klann, Jeffrey G; Buck, Michael D; Brown, Jeffrey; Hadley, Marc; Elmore, Richard; Weber, Griffin M; Murphy, Shawn N
2014-01-01
Understanding population-level health trends is essential to effectively monitor and improve public health. The Office of the National Coordinator for Health Information Technology (ONC) Query Health initiative is a collaboration to develop a national architecture for distributed, population-level health queries across diverse clinical systems with disparate data models. Here we review Query Health activities, including a standards-based methodology, an open-source reference implementation, and three pilot projects. Query Health defined a standards-based approach for distributed population health queries, using an ontology based on the Quality Data Model and Consolidated Clinical Document Architecture, Health Quality Measures Format (HQMF) as the query language, the Query Envelope as the secure transport layer, and the Quality Reporting Document Architecture as the result language. We implemented this approach using Informatics for Integrating Biology and the Bedside (i2b2) and hQuery for data analytics and PopMedNet for access control, secure query distribution, and response. We deployed the reference implementation at three pilot sites: two public health departments (New York City and Massachusetts) and one pilot designed to support Food and Drug Administration post-market safety surveillance activities. The pilots were successful, although improved cross-platform data normalization is needed. This initiative resulted in a standards-based methodology for population health queries, a reference implementation, and revision of the HQMF standard. It also informed future directions regarding interoperability and data access for ONC's Data Access Framework initiative. Query Health was a test of the learning health system that supplied a functional methodology and reference implementation for distributed population health queries that has been validated at three sites. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under
Interference Measurements in the European 868 MHz ISM Band with Focus on LoRa and SigFox
DEFF Research Database (Denmark)
Lauridsen, Mads; Vejlgaard, Benny; Kovács, István
2017-01-01
In this measurement study the signal activity and power levels are measured in the European Industrial, Scientific, and Medical band 863-870 MHz in the city of Aalborg, Denmark. The target is to determine if there is any interference, which may impact deployment of Internet of Things devices....... The focus is on the Low Power Wide Area technologies LoRa and SigFox. The measurements show that there is a 22-33 % probability of interfering signals above -105 dBm within the mandatory LoRa and SigFox 868.0-868.6 MHz band in a shopping area and a business park in downtown Aalborg, which thus limits...... the potential coverage and capacity of LoRa and SigFox. However, the probability of interference is less than 3 % in the three other measurement locations in Aalborg. Finally, a hospital and an industrial area are shown to experience high activity in the RFID subband 865-868 MHz, while the wireless audio band...
Fallstudie SIG - Supply Chain Prototyp mit Coca Cola Beverages
Senger, Enrico
2003-01-01
SIG, ein führender Lieferant von Verpackungsmaterialien für Getränke, hat mit Coca Cola CPFR, collaborative planning, forecasting and replenishment realisiert. Das Unternehmen kann elektronisch und ohne Zeitverzug (vorher mit bis zu 15 Tagen Verspätung) auf die Lagerbestände und Verkaufsvorhersagen bei Coca Cola zugreifen. Coca Cola erhält die exakten Liefermengen und zeiten. Coca Cola konnte dadurch den Lagerbestand von Verpackungsmaterial um 50% senken. Dies reduziert die Bearbeitungszeit d...
A general approach to query flattening
van Ruth, J.
The translation of queries from complex data models to simpler data models is a recurring theme in the construction of efficient data management systems. In this paper we propose a general framework to guide the translation from data models with nested types to a flat relational model (query
Exploiting External Collections for Query Expansion
Weerkamp, W.; Balog, K.; de Rijke, M.
2012-01-01
A persisting challenge in the field of information retrieval is the vocabulary mismatch between a user’s information need and the relevant documents. One way of addressing this issue is to apply query modeling: to add terms to the original query and reweigh the terms. In social media, where
Incorporating Early Learning Strategies in the School Improvement Grants (SIG) Program
Connors-Tadros, Lori; Dunn, Lenay; Martella, Jana; McCauley, Carlas
2015-01-01
The Center on Enhancing Early Learning Outcomes (CEELO) and the Center on School Turnaround (CST) collaborated to develop case studies of three selected schools receiving SIG funds that have, with the support of their districts, promoted the use of early childhood programming (PK-3) as a key strategy in their schools' turnaround models. The goal…
Sonata: Query-Driven Network Telemetry
Gupta, Arpit; Harrison, Rob; Pawar, Ankita; Birkner, Rü diger; Canini, Marco; Feamster, Nick; Rexford, Jennifer; Willinger, Walter
2017-01-01
Operating networks depends on collecting and analyzing measurement data. Current technologies do not make it easy to do so, typically because they separate data collection (e.g., packet capture or flow monitoring) from analysis, producing either too much data to answer a general question or too little data to answer a detailed question. In this paper, we present Sonata, a network telemetry system that uses a uniform query interface to drive the joint collection and analysis of network traffic. Sonata takes the advantage of two emerging technologies---streaming analytics platforms and programmable network devices---to facilitate joint collection and analysis. Sonata allows operators to more directly express network traffic analysis tasks in terms of a high-level language. The underlying runtime partitions each query into a portion that runs on the switch and another that runs on the streaming analytics platform iteratively refines the query to efficiently capture only the traffic that pertains to the operator's query, and exploits sketches to reduce state in switches in exchange for more approximate results. Through an evaluation of a prototype implementation, we demonstrate that Sonata can support a wide range of network telemetry tasks with less state in the network, and lower data rates to streaming analytics systems, than current approaches can achieve.
Sonata: Query-Driven Network Telemetry
Gupta, Arpit
2017-05-02
Operating networks depends on collecting and analyzing measurement data. Current technologies do not make it easy to do so, typically because they separate data collection (e.g., packet capture or flow monitoring) from analysis, producing either too much data to answer a general question or too little data to answer a detailed question. In this paper, we present Sonata, a network telemetry system that uses a uniform query interface to drive the joint collection and analysis of network traffic. Sonata takes the advantage of two emerging technologies---streaming analytics platforms and programmable network devices---to facilitate joint collection and analysis. Sonata allows operators to more directly express network traffic analysis tasks in terms of a high-level language. The underlying runtime partitions each query into a portion that runs on the switch and another that runs on the streaming analytics platform iteratively refines the query to efficiently capture only the traffic that pertains to the operator\\'s query, and exploits sketches to reduce state in switches in exchange for more approximate results. Through an evaluation of a prototype implementation, we demonstrate that Sonata can support a wide range of network telemetry tasks with less state in the network, and lower data rates to streaming analytics systems, than current approaches can achieve.
Firtman, Maximiliano
2012-01-01
Would you like to build one mobile web application that works on iPad and Kindle Fire as well as iPhone and Android smartphones? This introductory guide to jQuery Mobile shows you how. Through a series of hands-on exercises, you'll learn the best ways to use this framework's many interface components to build customizable, multiplatform apps. You don't need any programming skills or previous experience with jQuery to get started. By the time you finish this book, you'll know how to create responsive, Ajax-based interfaces that work on a variety of smartphones and tablets, using jQuery Mobile
jQuery for designers beginner's guide
MacLees, Natalie
2014-01-01
A step-by-step guide that spices up your web pages and designs them in the way you want using the most widely used JavaScript library, jQuery. The beginner-friendly and easy-to-understand approach of the book will help get to grips with jQuery in no time. If you know the fundamentals of HTML and CSS, and want to extend your knowledge by learning to use JavaScript, then this is just the book for you. jQuery makes JavaScript straightforward and approachable - you'll be surprised at how easy it can be to add animations and special effects to your beautifully designed pages.
Querying Business Process Models with VMQL
DEFF Research Database (Denmark)
Störrle, Harald; Acretoaie, Vlad
2013-01-01
The Visual Model Query Language (VMQL) has been invented with the objectives (1) to make it easier for modelers to query models effectively, and (2) to be universally applicable to all modeling languages. In previous work, we have applied VMQL to UML, and validated the first of these two claims. ...
Goetz, Matthew B; Bowman, Candice; Hoang, Tuyen; Anaya, Henry; Osborn, Teresa; Gifford, Allen L; Asch, Steven M
2008-03-19
We describe how we used the framework of the U.S. Department of Veterans Affairs (VA) Quality Enhancement Research Initiative (QUERI) to develop a program to improve rates of diagnostic testing for the Human Immunodeficiency Virus (HIV). This venture was prompted by the observation by the CDC that 25% of HIV-infected patients do not know their diagnosis - a point of substantial importance to the VA, which is the largest provider of HIV care in the United States. Following the QUERI steps (or process), we evaluated: 1) whether undiagnosed HIV infection is a high-risk, high-volume clinical issue within the VA, 2) whether there are evidence-based recommendations for HIV testing, 3) whether there are gaps in the performance of VA HIV testing, and 4) the barriers and facilitators to improving current practice in the VA.Based on our findings, we developed and initiated a QUERI step 4/phase 1 pilot project using the precepts of the Chronic Care Model. Our improvement strategy relies upon electronic clinical reminders to provide decision support; audit/feedback as a clinical information system, and appropriate changes in delivery system design. These activities are complemented by academic detailing and social marketing interventions to achieve provider activation. Our preliminary formative evaluation indicates the need to ensure leadership and team buy-in, address facility-specific barriers, refine the reminder, and address factors that contribute to inter-clinic variances in HIV testing rates. Preliminary unadjusted data from the first seven months of our program show 3-5 fold increases in the proportion of at-risk patients who are offered HIV testing at the VA sites (stations) where the pilot project has been undertaken; no change was seen at control stations. This project demonstrates the early success of the application of the QUERI process to the development of a program to improve HIV testing rates. Preliminary unadjusted results show that the coordinated use of
Pentoney, Christopher; Harwell, Jeff; Leroy, Gondy
2014-01-01
Searching for medical information online is a common activity. While it has been shown that forming good queries is difficult, Google's query suggestion tool, a type of query expansion, aims to facilitate query formation. However, it is unknown how this expansion, which is based on what others searched for, affects the information gathering of the online community. To measure the impact of social-based query expansion, this study compared it with content-based expansion, i.e., what is really in the text. We used 138,906 medical queries from the AOL User Session Collection and expanded them using Google's Autocomplete method (social-based) and the content of the Google Web Corpus (content-based). We evaluated the specificity and ambiguity of the expansion terms for trigram queries. We also looked at the impact on the actual results using domain diversity and expansion edit distance. Results showed that the social-based method provided more precise expansion terms as well as terms that were less ambiguous. Expanded queries do not differ significantly in diversity when expanded using the social-based method (6.72 different domains returned in the first ten results, on average) vs. content-based method (6.73 different domains, on average).
Research in Mobile Database Query Optimization and Processing
Directory of Open Access Journals (Sweden)
Agustinus Borgy Waluyo
2005-01-01
Full Text Available The emergence of mobile computing provides the ability to access information at any time and place. However, as mobile computing environments have inherent factors like power, storage, asymmetric communication cost, and bandwidth limitations, efficient query processing and minimum query response time are definitely of great interest. This survey groups a variety of query optimization and processing mechanisms in mobile databases into two main categories, namely: (i query processing strategy, and (ii caching management strategy. Query processing includes both pull and push operations (broadcast mechanisms. We further classify push operation into on-demand broadcast and periodic broadcast. Push operation (on-demand broadcast relates to designing techniques that enable the server to accommodate multiple requests so that the request can be processed efficiently. Push operation (periodic broadcast corresponds to data dissemination strategies. In this scheme, several techniques to improve the query performance by broadcasting data to a population of mobile users are described. A caching management strategy defines a number of methods for maintaining cached data items in clients' local storage. This strategy considers critical caching issues such as caching granularity, caching coherence strategy and caching replacement policy. Finally, this survey concludes with several open issues relating to mobile query optimization and processing strategy.
Hvorfor støtter det eleverne at bevæge sig fysisk på en tallinje?
DEFF Research Database (Denmark)
Ejersbo, Lisser Rye
2014-01-01
Som reaktion på min forrige blok, vil jeg her uddybe og begrunde, hvorfor det er en god ide, at lade eleverne få kropslige oplevelser med tallinjen. Det drejer sig bl.a. om fænomenet SNARC, der er en forkortelse for ‘Spatical Numerical Association of Response Codes’, og som på godt dansk betyder,......, at venstre hånd er hurtigere til at reagere på små tal i forhold til at placere dem på en tallinje, mens højre hånd er hurtigere, når det drejer sig om større tal...
Selenide isotope generators for the Galileo Mission: SIG hermetic bimetal weld transition joint
International Nuclear Information System (INIS)
Barnett, W.J.
1979-08-01
The successful development of the commercial 6061-T651/Silver/304L explosive clad plate material as a bimetal weld transition joint material, as described herein, satisfies all SIG Galileo design requirements for hermetic weld attachment of stainless steel subassemblies to aluminum alloy generator housing or end cover structures. The application of this type weld transition joint to the hermetic attachment of stainless steel shell connectors is well-developed and tested. Based on on-going life tests of stainless steel receptacle/bimetal ring attachment assemblies and metallurgical characterization studies of this transition joint material, it appears evident that this transition joint material has more than adequate capability to meet the 250 to 300 0 F and 50,000 hr. design life of the SIG/Galileo mission. Its extended life temperture capability may well approach 350 to 400 0 F
Federal School Improvement Grants (SIGs): How Capacity and Local Conditions Matter
Yatsko, Sarah; Lake, Robin; Bowen, Melissa; Cooley Nelson, Elizabeth
2015-01-01
In 2009, the federal government committed over $3 billion nationwide to help states and districts turn around their worst-performing schools. The U.S. Department of Education intended for the School Improvement Grants (SIGs) to spur dramatic change.This report looks at the results of a field study of the first-year implementation of those grants…
Reformulating XQuery queries using GLAV mapping and complex unification
Directory of Open Access Journals (Sweden)
Saber Benharzallah
2016-01-01
Full Text Available This paper describes an algorithm for reformulation of XQuery queries. The mediation is based on an essential component called mediator. Its main role is to reformulate a user query, written in terms of global schema, into queries written in terms of source schemas. Our algorithm is based on the principle of logical equivalence, simple and complex unification, to obtain a better reformulation. It takes XQuery query, global schema (written in XMLSchema, and mappings GLAV as input parameters and provides resultant query written in terms of source schemas. The results of implementation show the proper functioning of the algorithm.
Towards Optimal Multi-Dimensional Query Processing with BitmapIndices
Energy Technology Data Exchange (ETDEWEB)
Rotem, Doron; Stockinger, Kurt; Wu, Kesheng
2005-09-30
Bitmap indices have been widely used in scientific applications and commercial systems for processing complex, multi-dimensional queries where traditional tree-based indices would not work efficiently. This paper studies strategies for minimizing the access costs for processing multi-dimensional queries using bitmap indices with binning. Innovative features of our algorithm include (a) optimally placing the bin boundaries and (b) dynamically reordering the evaluation of the query terms. In addition, we derive several analytical results concerning optimal bin allocation for a probabilistic query model. Our experimental evaluation with real life data shows an average I/O cost improvement of at least a factor of 10 for multi-dimensional queries on datasets from two different applications. Our experiments also indicate that the speedup increases with the number of query dimensions.
ASIST SIG/CR Classification Workshop 2000: Classification for User Support and Learning.
Soergel, Dagobert
2001-01-01
Reports on papers presented at the 62nd Annual Meeting of ASIST (American Society for Information Science and Technology) for the Special Interest Group in Classification Research (SIG/CR). Topics include types of knowledge; developing user-oriented classifications, including domain analysis; classification in the user interface; and automatic…
AQBE — QBE Style Queries for Archetyped Data
Sachdeva, Shelly; Yaginuma, Daigo; Chu, Wanming; Bhalla, Subhash
Large-scale adoption of electronic healthcare applications requires semantic interoperability. The new proposals propose an advanced (multi-level) DBMS architecture for repository services for health records of patients. These also require query interfaces at multiple levels and at the level of semi-skilled users. In this regard, a high-level user interface for querying the new form of standardized Electronic Health Records system has been examined in this study. It proposes a step-by-step graphical query interface to allow semi-skilled users to write queries. Its aim is to decrease user effort and communication ambiguities, and increase user friendliness.
Group-by Skyline Query Processing in Relational Engines
DEFF Research Database (Denmark)
Yiu, Man Lung; Luk, Ming-Hay; Lo, Eric
2009-01-01
the missing cost model for the BBS algorithm. Experimental results show that our techniques are able to devise the best query plans for a variety of group-by skyline queries. Our focus is on algorithms that can be directly implemented in today's commercial database systems without the addition of new access......The skyline operator was first proposed in 2001 for retrieving interesting tuples from a dataset. Since then, 100+ skyline-related papers have been published; however, we discovered that one of the most intuitive and practical type of skyline queries, namely, group-by skyline queries remains...
Benedetti, Ryan
2011-01-01
Want to add more interactivity and polish to your websites? Discover how jQuery can help you build complex scripting functionality in just a few lines of code. With Head First jQuery, you'll quickly get up to speed on this amazing JavaScript library by learning how to navigate HTML documents while handling events, effects, callbacks, and animations. By the time you've completed the book, you'll be incorporating Ajax apps, working seamlessly with HTML and CSS, and handling data with PHP, MySQL and JSON. If you want to learn-and understand-how to create interactive web pages, unobtrusive scrip
ALGORITMA RC4 DALAM PROTEKSI TRANSMISI DAN HASIL QUERY UNTUK ORDBMS POSTGRESQL
Directory of Open Access Journals (Sweden)
Yuri Ariyanto
2009-01-01
Full Text Available In this research will be worked through about how cryptography RC4's algorithm implementation in protection to query result and of query, security by encryption and descryption up to both is in network. Implementation of this research which is build software in client that function access databases that is placed by the side of server. Software that building to have facility for encryption and descryption query result and of query that is sent from client goes to server and. transmission query result and of query can secure its security. Well guaranted transmission security him of query result and of query can be told to succeed if success software can encryption query result and of query which transmission so that in the event of scanning to both, scanning will not understand data content. Conclusion of this research that is woke up software succeed encryption query and result of query which transmission between application of client and of server databases. Abstract in Bahasa Indonesia: Pada penelitian ini dibahas mengenai bagaimana mengimplementasikan algoritma kriptografi RC4 dalam proteksi terhadap query dan hasil query, pengamanan dilakukan dengan cara melakukan enkripsi dan dekripsi selama keduanya berada di dalam jaringan. Pengimplementasian dari penelitian ini yaitu membangun sebuah software yang akan diletakkan di sisi client yang berfungsi mengakses database yang diletakkan di sisi server. Software yang dibangun memiliki fasilitas untuk mengenkripsi dan mendektipsi query dan hasil query yang dikirimkan dari client ke server dan juga sebaliknya. Dengan demikian tramsmisi query dan hasil query dapat terjamin keamanannya.Terjaminnya keamanan transmisi query dan hasil query dapat dikatakan berhasil jika software berhasil mengenkripsi query dan hasil query yang ditransmisikan sehingga apabila terjadi penyadapan terhadap keduanya, penyadap tidak akan mengerti isi data tersebut. Kesimpulan dari penelitian ini yaitu software yang dibangun
Relative aggregation operator in database fuzzy querying
Directory of Open Access Journals (Sweden)
Luminita DUMITRIU
2005-12-01
Full Text Available Fuzzy selection criteria querying relational databases include vague terms; they usually refer linguistic values form the attribute linguistic domains, defined as fuzzy sets. Generally, when a vague query is processed, the definitions of vague terms must already exist in a knowledge base. But there are also cases when vague terms must be dynamically defined, when a particular operation is used to aggregate simple criteria in a complex selection. The paper presents a new aggregation operator and the corresponding algorithm to evaluate the fuzzy query.
Query-Time Optimization Techniques for Structured Queries in Information Retrieval
Cartright, Marc-Allen
2013-01-01
The use of information retrieval (IR) systems is evolving towards larger, more complicated queries. Both the IR industrial and research communities have generated significant evidence indicating that in order to continue improving retrieval effectiveness, increases in retrieval model complexity may be unavoidable. From an operational perspective,…
Improving Web Search for Difficult Queries
Wang, Xuanhui
2009-01-01
Search engines have now become essential tools in all aspects of our life. Although a variety of information needs can be served very successfully, there are still a lot of queries that search engines can not answer very effectively and these queries always make users feel frustrated. Since it is quite often that users encounter such "difficult…
Matching health information seekers' queries to medical terms.
Soualmia, Lina F; Prieur-Gaston, Elise; Moalla, Zied; Lecroq, Thierry; Darmoni, Stéfan J
2012-01-01
The Internet is a major source of health information but most seekers are not familiar with medical vocabularies. Hence, their searches fail due to bad query formulation. Several methods have been proposed to improve information retrieval: query expansion, syntactic and semantic techniques or knowledge-based methods. However, it would be useful to clean those queries which are misspelled. In this paper, we propose a simple yet efficient method in order to correct misspellings of queries submitted by health information seekers to a medical online search tool. In addition to query normalizations and exact phonetic term matching, we tested two approximate string comparators: the similarity score function of Stoilos and the normalized Levenshtein edit distance. We propose here to combine them to increase the number of matched medical terms in French. We first took a sample of query logs to determine the thresholds and processing times. In the second run, at a greater scale we tested different combinations of query normalizations before or after misspelling correction with the retained thresholds in the first run. According to the total number of suggestions (around 163, the number of the first sample of queries), at a threshold comparator score of 0.3, the normalized Levenshtein edit distance gave the highest F-Measure (88.15%) and at a threshold comparator score of 0.7, the Stoilos function gave the highest F-Measure (84.31%). By combining Levenshtein and Stoilos, the highest F-Measure (80.28%) is obtained with 0.2 and 0.7 thresholds respectively. However, queries are composed by several terms that may be combination of medical terms. The process of query normalization and segmentation is thus required. The highest F-Measure (64.18%) is obtained when this process is realized before spelling-correction. Despite the widely known high performance of the normalized edit distance of Levenshtein, we show in this paper that its combination with the Stoilos algorithm improved
Lau, Steven K M; Patel, Kunal; Kim, Teddy; Knipprath, Erik; Kim, Gwe-Ya; Cerviño, Laura I; Lawson, Joshua D; Murphy, Kevin T; Sanghvi, Parag; Carter, Bob S; Chen, Clark C
2017-04-01
Frameless, surface imaging guided radiosurgery (SIG-RS) is a novel platform for stereotactic radiosurgery (SRS) wherein patient positioning is monitored in real-time through infra-red camera tracking of facial topography. Here we describe our initial clinical experience with SIG-RS for the treatment of benign neoplasms of the skull base. We identified 48 patients with benign skull base tumors consecutively treated with SIG-RS at a single institution between 2009 and 2011. Patients were diagnosed with meningioma (n = 22), vestibular schwannoma (n = 20), or nonfunctional pituitary adenoma (n = 6). Local control and treatment-related toxicity were retrospectively assessed. Median follow-up was 65 months (range 61-72 months). Prescription doses were 12-13 Gy in a single fraction (n = 18), 8 Gy × 3 fractions (n = 6), and 5 Gy × 5 fractions (n = 24). Actuarial tumor control rate at 5 years was 98%. No grade ≥3 treatment-related toxicity was observed. Grade ≤2 toxicity was associated with symptomatic lesions (p = 0.049) and single fraction treatment (p = 0.005). SIG-RS for benign skull base tumors produces clinical outcomes comparable to conventional frame-based SRS techniques while enhancing patient comfort.
RCQ-GA: RDF Chain Query Optimization Using Genetic Algorithms
Hogenboom, Alexander; Milea, Viorel; Frasincar, Flavius; Kaymak, Uzay
The application of Semantic Web technologies in an Electronic Commerce environment implies a need for good support tools. Fast query engines are needed for efficient querying of large amounts of data, usually represented using RDF. We focus on optimizing a special class of SPARQL queries, the so-called RDF chain queries. For this purpose, we devise a genetic algorithm called RCQ-GA that determines the order in which joins need to be performed for an efficient evaluation of RDF chain queries. The approach is benchmarked against a two-phase optimization algorithm, previously proposed in literature. The more complex a query is, the more RCQ-GA outperforms the benchmark in solution quality, execution time needed, and consistency of solution quality. When the algorithms are constrained by a time limit, the overall performance of RCQ-GA compared to the benchmark further improves.
A Streams-Based Framework for Defining Location-Based Queries
DEFF Research Database (Denmark)
Jensen, Christian Søndergaard; Xuegang, Huang
2007-01-01
n infrastructure is emerging that supports the delivery of on-line, location-enabled services to mobile users. Such services involve novel database queries, and the database research community is quite active in proposing techniques for the efficient processing of such queries. In parallel to this......, the management of data streams has become an active area of research. While most research in mobile services concerns performance issues, this paper aims to establish a formal framework for defining the semantics of queries encountered in mobile services, most notably the so-called continuous queries...... that are particularly relevant in this context. Rather than inventing an entirely new framework, the paper proposes a framework that builds on concepts from data streams and temporal databases. Definitions of example queries demonstrates how the framework enables clear formulation of query semantics and the comparison...
Query Expansion: Is It Necessary In Textual Case-Based Reasoning ...
African Journals Online (AJOL)
Query expansion (QE) is the process of transforming a seed query to improve retrieval performance in information retrieval operations. It is often intended to overcome a vocabulary mismatch between the query and the document collection. Query expansion is known to improve retrieval effectiveness of some information ...
Determinacy in Static Analysis of jQuery
DEFF Research Database (Denmark)
Andreasen, Esben; Møller, Anders
2014-01-01
Static analysis for JavaScript can potentially help programmers find errors early during development. Although much progress has been made on analysis techniques, a major obstacle is the prevalence of libraries, in particular jQuery, which apply programming patterns that have detrimental conseque......Static analysis for JavaScript can potentially help programmers find errors early during development. Although much progress has been made on analysis techniques, a major obstacle is the prevalence of libraries, in particular jQuery, which apply programming patterns that have detrimental...... present a static dataflow analysis for JavaScript that infers and exploits determinacy information on-the-fly, to enable analysis of some of the most complex parts of jQuery. The techniques are implemented in the TAJS analysis tool and evaluated on a collection of small programs that use jQuery. Our...
Experimental quantum private queries with linear optics
International Nuclear Information System (INIS)
De Martini, Francesco; Giovannetti, Vittorio; Lloyd, Seth; Maccone, Lorenzo; Nagali, Eleonora; Sansoni, Linda; Sciarrino, Fabio
2009-01-01
The quantum private query is a quantum cryptographic protocol to recover information from a database, preserving both user and data privacy: the user can test whether someone has retained information on which query was asked and the database provider can test the amount of information released. Here we discuss a variant of the quantum private query algorithm that admits a simple linear optical implementation: it employs the photon's momentum (or time slot) as address qubits and its polarization as bus qubit. A proof-of-principle experimental realization is implemented.
Evaluating XML-Extended OLAP Queries Based on a Physical Algebra
DEFF Research Database (Denmark)
Yin, Xuepeng; Pedersen, Torben Bach
2006-01-01
. In this paper, we extend previous work on the logical federation of OLAP and XML data sources by presenting a simplified query semantics, a physical query algebra and a robust OLAP-XML query engine as well as the query evaluation techniques. Performance experiments with a prototypical implementation suggest...
Information Retrieval and Graph Analysis Approaches for Book Recommendation
Chahinez Benkoussas; Patrice Bellot
2015-01-01
A combination of multiple information retrieval approaches is proposed for the purpose of book recommendation. In this paper, book recommendation is based on complex user's query. We used different theoretical retrieval models: probabilistic as InL2 (Divergence from Randomness model) and language model and tested their interpolated combination. Graph analysis algorithms such as PageRank have been successful in Web environments. We consider the application of this algorithm in a new retrieval ...
Parallelizing Federated SPARQL Queries in Presence of Replicated Data
DEFF Research Database (Denmark)
Minier, Thomas; Montoya, Gabriela; Skaf-Molli, Hala
2017-01-01
Federated query engines have been enhanced to exploit new data localities created by replicated data, e.g., Fedra. However, existing replication aware federated query engines mainly focus on pruning sources during the source selection and query decomposition in order to reduce intermediate result...
Modeling Large Time Series for Efficient Approximate Query Processing
DEFF Research Database (Denmark)
Perera, Kasun S; Hahmann, Martin; Lehner, Wolfgang
2015-01-01
query statistics derived from experiments and when running the system. Our approach can also reduce communication load by exchanging models instead of data. To allow seamless integration of model-based querying into traditional data warehouses, we introduce a SQL compatible query terminology. Our...
Approximating terminological queries
Stuckenschmidt, Heiner; Van Harmelen, Frank
2002-01-01
Current proposals for languages to encode terminological knowledge in intelligent systems support logical reasoning for answering user queries about objects and classes. An application of these languages on the World Wide Web, however, is hampered by the limitations of logical reasoning in terms
A study of medical and health queries to web search engines.
Spink, Amanda; Yang, Yin; Jansen, Jim; Nykanen, Pirrko; Lorence, Daniel P; Ozmutlu, Seda; Ozmutlu, H Cenk
2004-03-01
This paper reports findings from an analysis of medical or health queries to different web search engines. We report results: (i). comparing samples of 10000 web queries taken randomly from 1.2 million query logs from the AlltheWeb.com and Excite.com commercial web search engines in 2001 for medical or health queries, (ii). comparing the 2001 findings from Excite and AlltheWeb.com users with results from a previous analysis of medical and health related queries from the Excite Web search engine for 1997 and 1999, and (iii). medical or health advice-seeking queries beginning with the word 'should'. Findings suggest: (i). a small percentage of web queries are medical or health related, (ii). the top five categories of medical or health queries were: general health, weight issues, reproductive health and puberty, pregnancy/obstetrics, and human relationships, and (iii). over time, the medical and health queries may have declined as a proportion of all web queries, as the use of specialized medical/health websites and e-commerce-related queries has increased. Findings provide insights into medical and health-related web querying and suggests some implications for the use of the general web search engines when seeking medical/health information.
Développement d'un outil SIG d'estimation des dommages ...
African Journals Online (AJOL)
Pour atteindre cet objectif, la méthodologie de simulation présentée dans le programme "HAZUS” en combinaison avec les systèmes d'information géographique (SIG) ont été utilisés. La ville a été divisée en plusieurs parties qui reflètent la typologie du tissu urbain qui à son tour n'est pas uniforme, puisque les structures ...
Macromolecular query language (MMQL): prototype data model and implementation.
Shindyalov, I N; Chang, W; Pu, C; Bourne, P E
1994-11-01
Macromolecular query language (MMQL) is an extensible interpretive language in which to pose questions concerning the experimental or derived features of the 3-D structure of biological macromolecules. MMQL portends to be intuitive with a simple syntax, so that from a user's perspective complex queries are easily written. A number of basic queries and a more complex query--determination of structures containing a five-strand Greek key motif--are presented to illustrate the strengths and weaknesses of the language. The predominant features of MMQL are a filter and pattern grammar which are combined to express a wide range of interesting biological queries. Filters permit the selection of object attributes, for example, compound name and resolution, whereas the patterns currently implemented query primary sequence, close contacts, hydrogen bonding, secondary structure, conformation and amino acid properties (volume, polarity, isoelectric point, hydrophobicity and different forms of exposure). MMQL queries are processed by MMQLlib; a C++ class library, to which new query methods and pattern types are easily added. The prototype implementation described uses PDBlib, another C(++)-based class library from representing the features of biological macromolecules at the level of detail parsable from a PDB file. Since PDBlib can represent data stored in relational and object-oriented databases, as well as PDB files, once these data are loaded they too can be queried by MMQL. Performance metrics are given for queries of PDB files for which all derived data are calculated at run time and compared to a preliminary version of OOPDB, a prototype object-oriented database with a schema based on a persistent version of PDBlib which offers more efficient data access and the potential to maintain derived information. MMQLlib, PDBlib and associated software are available via anonymous ftp from cuhhca.hhmi.columbia.edu.
RDF-GL : a SPARQL-based graphical query language for RDF
Hogenboom, F.P.; Milea, D.V.; Frasincar, F.; Kaymak, U.; Chbeir, R.; Badr, Y.; Abraham, A.; Hassanien, A.-E.
2010-01-01
This chapter presents RDF-GL, a graphical query language (GQL) for RDF. The GQL is based on the textual query language SPARQL and mainly focuses on SPARQL SELECT queries. The advantage of a GQL over textual query languages is that complexity is hidden through the use of graphical symbols. RDF-GL is
Executing SPARQL Queries over the Web of Linked Data
Hartig, Olaf; Bizer, Christian; Freytag, Johann-Christoph
The Web of Linked Data forms a single, globally distributed dataspace. Due to the openness of this dataspace, it is not possible to know in advance all data sources that might be relevant for query answering. This openness poses a new challenge that is not addressed by traditional research on federated query processing. In this paper we present an approach to execute SPARQL queries over the Web of Linked Data. The main idea of our approach is to discover data that might be relevant for answering a query during the query execution itself. This discovery is driven by following RDF links between data sources based on URIs in the query and in partial results. The URIs are resolved over the HTTP protocol into RDF data which is continuously added to the queried dataset. This paper describes concepts and algorithms to implement our approach using an iterator-based pipeline. We introduce a formalization of the pipelining approach and show that classical iterators may cause blocking due to the latency of HTTP requests. To avoid blocking, we propose an extension of the iterator paradigm. The evaluation of our approach shows its strengths as well as the still existing challenges.
A Fuzzy Query Mechanism for Human Resource Websites
Lai, Lien-Fu; Wu, Chao-Chin; Huang, Liang-Tsung; Kuo, Jung-Chih
Users' preferences often contain imprecision and uncertainty that are difficult for traditional human resource websites to deal with. In this paper, we apply the fuzzy logic theory to develop a fuzzy query mechanism for human resource websites. First, a storing mechanism is proposed to store fuzzy data into conventional database management systems without modifying DBMS models. Second, a fuzzy query language is proposed for users to make fuzzy queries on fuzzy databases. User's fuzzy requirement can be expressed by a fuzzy query which consists of a set of fuzzy conditions. Third, each fuzzy condition associates with a fuzzy importance to differentiate between fuzzy conditions according to their degrees of importance. Fourth, the fuzzy weighted average is utilized to aggregate all fuzzy conditions based on their degrees of importance and degrees of matching. Through the mutual compensation of all fuzzy conditions, the ordering of query results can be obtained according to user's preference.
Evaluating Trajectory Queries over Imprecise Location Data
DEFF Research Database (Denmark)
Xie, Scott, Xike; Cheng, Reynold; Yiu, Man Lung
2012-01-01
Trajectory queries, which retrieve nearby objects for every point of a given route, can be used to identify alerts of potential threats along a vessel route, or monitor the adjacent rescuers to a travel path. However, the locations of these objects (e.g., threats, succours) may not be precisely...... obtained due to hardware limitations of measuring devices, as well as the constantly-changing nature of the external environment. Ignoring data uncertainty can render low query quality, and cause undesirable consequences such as missing alerts of threats and poor response time in rescue operations. Also......, the query is quite time-consuming, since all the points on the trajectory are considered. In this paper, we study how to efficiently evaluate trajectory queries over imprecise location data, by proposing a new concept called the u-bisector. In general, the u-bisector is an extension of bisector to handle...
Semantic querying of data guided by Formal Concept Analysis
Codocedo , Victor; Lykourentzou , Ioanna; Napoli , Amedeo
2012-01-01
International audience; In this paper we present a novel approach to handle querying over a concept lattice of documents and annotations. We focus on the problem of "non-matching documents", which are those that, despite being semantically relevant to the user query, do not contain the query's elements and hence cannot be retrieved by typical string matching approaches. In order to find these documents, we modify the initial user query using the concept lattice as a guide. We achieve this by ...
Unemployment Insurance Query (UIQ)
Social Security Administration — The Unemployment Insurance Query (UIQ) provides State Unemployment Insurance agencies real-time online access to SSA data. This includes SSN verification and Title...
A Relational Algebra Query Language for Programming Relational Databases
McMaster, Kirby; Sambasivam, Samuel; Anderson, Nicole
2011-01-01
In this paper, we describe a Relational Algebra Query Language (RAQL) and Relational Algebra Query (RAQ) software product we have developed that allows database instructors to teach relational algebra through programming. Instead of defining query operations using mathematical notation (the approach commonly taken in database textbooks), students…
Instant MDX queries for SQL Server 2012
Emond, Nicholas
2013-01-01
Get to grips with a new technology, understand what it is and what it can do for you, and then get to work with the most important features and tasks. This short, focused guide is a great way to get stated with writing MDX queries. New developers can use this book as a reference for how to use functions and the syntax of a query as well as how to use Calculated Members and Named Sets.This book is great for new developers who want to learn the MDX query language from scratch and install SQL Server 2012 with Analysis Services
Enabling Semantic Queries Against the Spatial Database
Directory of Open Access Journals (Sweden)
PENG, X.
2012-02-01
Full Text Available The spatial database based upon the object-relational database management system (ORDBMS has the merits of a clear data model, good operability and high query efficiency. That is why it has been widely used in spatial data organization and management. However, it cannot express the semantic relationships among geospatial objects, making the query results difficult to meet the user's requirement well. Therefore, this paper represents an attempt to combine the Semantic Web technology with the spatial database so as to make up for the traditional database's disadvantages. In this way, on the one hand, users can take advantages of ORDBMS to store and manage spatial data; on the other hand, if the spatial database is released in the form of Semantic Web, the users could describe a query more concisely with the cognitive pattern which is similar to that of daily life. As a consequence, this methodology enables the benefits of both Semantic Web and the object-relational database (ORDB available. The paper discusses systematically the semantic enriched spatial database's architecture, key technologies and implementation. Subsequently, we demonstrate the function of spatial semantic queries via a practical prototype system. The query results indicate that the method used in this study is feasible.
Extracting Rankings for Spatial Keyword Queries from GPS Data
DEFF Research Database (Denmark)
Keles, Ilkcan; Jensen, Christian Søndergaard; Saltenis, Simonas
2018-01-01
Studies suggest that many search engine queries have local intent. We consider the evaluation of ranking functions important for such queries. The key challenge is to be able to determine the “best” ranking for a query, as this enables evaluation of the results of ranking functions. We propose...
An Object-Oriented Approach of Keyword Querying over Fuzzy XML
Directory of Open Access Journals (Sweden)
Ting Li
2016-09-01
Full Text Available As the fuzzy data management has become one of the main research topics and directions, the question of how to obtain the useful information by means of keyword query from fuzzy XML documents is becoming a subject of an increasing needed investigation. Considering the keyword query methods on crisp XML documents, smallest lowest common ancestor (SLCA semantics is one of the most widely accepted semantics. When users propose the keyword query on fuzzy XML documents with the SLCA semantics, the query results are always incomplate, with low precision, and with no possibilities values returned. Most of keyword query semantics on XML documents only consider query results matching all keywords, yet users may also be interested in the query results matching partial keywords. To overcome these limitations, in this paper, we investigate how to obtain more comprehensive and meaningful results of keyword querying on fuzzy XML documents. We propose a semantics of object-oriented keyword querying on fuzzy XML documents. First, we introduce the concept of "object tree", analyze different types of matching result object trees and find the "minimum result object trees" which contain all keywords and "result object trees" which contain partial keywords. Then an object-oriented keyword query algorithm ROstack is proposed to obtain the root nodes of these matching result object trees, together with their possibilities. At last, experiments are conducted to verify the effectiveness and efficiency of our proposed algorithm.
Directory of Open Access Journals (Sweden)
Osborn Teresa
2008-03-01
Full Text Available Abstract Background We describe how we used the framework of the U.S. Department of Veterans Affairs (VA Quality Enhancement Research Initiative (QUERI to develop a program to improve rates of diagnostic testing for the Human Immunodeficiency Virus (HIV. This venture was prompted by the observation by the CDC that 25% of HIV-infected patients do not know their diagnosis – a point of substantial importance to the VA, which is the largest provider of HIV care in the United States. Methods Following the QUERI steps (or process, we evaluated: 1 whether undiagnosed HIV infection is a high-risk, high-volume clinical issue within the VA, 2 whether there are evidence-based recommendations for HIV testing, 3 whether there are gaps in the performance of VA HIV testing, and 4 the barriers and facilitators to improving current practice in the VA. Based on our findings, we developed and initiated a QUERI step 4/phase 1 pilot project using the precepts of the Chronic Care Model. Our improvement strategy relies upon electronic clinical reminders to provide decision support; audit/feedback as a clinical information system, and appropriate changes in delivery system design. These activities are complemented by academic detailing and social marketing interventions to achieve provider activation. Results Our preliminary formative evaluation indicates the need to ensure leadership and team buy-in, address facility-specific barriers, refine the reminder, and address factors that contribute to inter-clinic variances in HIV testing rates. Preliminary unadjusted data from the first seven months of our program show 3–5 fold increases in the proportion of at-risk patients who are offered HIV testing at the VA sites (stations where the pilot project has been undertaken; no change was seen at control stations. Discussion This project demonstrates the early success of the application of the QUERI process to the development of a program to improve HIV testing rates
Genetic algorithms for RDF chain query optimization
Hogenboom, A.C.; Milea, D.V.; Frasincar, F.; Kaymak, U.; Calders, T.; Tuyls, K.; Pechenizkiy, M.
2009-01-01
The application of Semantic Web technologies in an Electronic Commerce environment implies a need for good support tools. Fast query engines are required for efficient real-time querying of large amounts of data, usually represented using RDF. We focus on optimizing a special class of SPARQL
Directory of Open Access Journals (Sweden)
Isna Nur Mahmud
2015-03-01
Full Text Available Perubahan dari sistem televisi analog menjadi sistem televisi digital terestrial di Indonesia tinggal menunggu waktu. Namun masih banyak infrastruktur yang masih perlu dibangun untuk menunjang sistem televisi digital terestrial agar dapat beroperasi dengan baik. Belum meratanya sistem pemancar televisi digital terestrial yang keberadaannya masih terbenturnya undang – undang yang berlaku di negara ini menjadi salah satu permasalahannya. Salah satu solusinya adalah memetakannya dalam sebuah SIG. Pemetaan pemancar tv digital terestrial ini dibuat untuk mempermudah KPI dalam melakukan identifikasi letak pemancar televisi digital terestrial serta memberikan informasi yang berkaitan dengan daya pemancar, spesifikasi pemancar televisi digital terestrial di Indonesia dalam kondisi offline. Dari pengujian didapatkan hasil antara lain, untuk pengujian black-box, didapatkan hasil yang sesuai dengan fungsionalitas sistem. Untuk nilai MOS, kemudahan menu aplikasi 3.9, kemudahan dlm navigasi aplikasi 4.1, kemudahan dlm menggunakan tools 4.05, penilaian tampilan interface 3.952, penilaian keseluruhan aplikasi SIG 4.hasil SUS yang dilakukan didapatkan nilai 65.71
Error Checking for Chinese Query by Mining Web Log
Directory of Open Access Journals (Sweden)
Jianyong Duan
2015-01-01
Full Text Available For the search engine, error-input query is a common phenomenon. This paper uses web log as the training set for the query error checking. Through the n-gram language model that is trained by web log, the queries are analyzed and checked. Some features including query words and their number are introduced into the model. At the same time data smoothing algorithm is used to solve data sparseness problem. It will improve the overall accuracy of the n-gram model. The experimental results show that it is effective.
Query construction, entropy, and generalization in neural-network models
Sollich, Peter
1994-05-01
We study query construction algorithms, which aim at improving the generalization ability of systems that learn from examples by choosing optimal, nonredundant training sets. We set up a general probabilistic framework for deriving such algorithms from the requirement of optimizing a suitable objective function; specifically, we consider the objective functions entropy (or information gain) and generalization error. For two learning scenarios, the high-low game and the linear perceptron, we evaluate the generalization performance obtained by applying the corresponding query construction algorithms and compare it to training on random examples. We find qualitative differences between the two scenarios due to the different structure of the underlying rules (nonlinear and ``noninvertible'' versus linear); in particular, for the linear perceptron, random examples lead to the same generalization ability as a sequence of queries in the limit of an infinite number of examples. We also investigate learning algorithms which are ill matched to the learning environment and find that, in this case, minimum entropy queries can in fact yield a lower generalization ability than random examples. Finally, we study the efficiency of single queries and its dependence on the learning history, i.e., on whether the previous training examples were generated randomly or by querying, and the difference between globally and locally optimal query construction.
Accelerating SPARQL Queries and Analytics on RDF Data
Al-Harbi, Razen
2016-11-09
The complexity of SPARQL queries and RDF applications poses great challenges on distributed RDF management systems. SPARQL workloads are dynamic and con- sist of queries with variable complexities. Hence, systems that use static partitioning su↵er from communication overhead for workloads that generate excessive communi- cation. Concurrently, RDF applications are becoming more sophisticated, mandating analytical operations that extend beyond SPARQL queries. Being primarily designed and optimized to execute SPARQL queries, which lack procedural capabilities, exist- ing systems are not suitable for rich RDF analytics. This dissertation tackles the problem of accelerating SPARQL queries and RDF analytics on distributed shared-nothing RDF systems. First, a distributed RDF en- gine, coined AdPart, is introduced. AdPart uses lightweight hash partitioning for sharding triples using their subject values; rendering its startup overhead very low. The locality-aware query optimizer of AdPart takes full advantage of the partition- ing to (i) support the fully parallel processing of join patterns on subjects and (ii) minimize data communication for general queries by applying hash distribution of intermediate results instead of broadcasting, wherever possible. By exploiting hash- based locality, AdPart achieves better or comparable performance to systems that employ sophisticated partitioning schemes. To cope with workloads dynamism, AdPart is extended to dynamically adapt to workload changes. AdPart monitors the data access patterns and dynamically redis- tributes and replicates the instances of the most frequent patterns among workers.Consequently, the communication cost for future queries is drastically reduced or even eliminated. Experiments with synthetic and real data verify that AdPart starts faster than all existing systems and gracefully adapts to the query load. Finally, to support and accelerate rich RDF analytical tasks, a vertex-centric RDF analytics framework is
A semantic perspective on query log analysis
Hofmann, K.; de Rijke, M.; Huurnink, B.; Meij, E.
2009-01-01
We present our views on the CLEF log file analysis task. We argue for a task definition that focuses on the semantic enrichment of query logs. In addition, we discuss how additional information about the context in which queries are being made could further our understanding of users’ information
GeoSpark SQL: An Effective Framework Enabling Spatial Queries on Spark
Directory of Open Access Journals (Sweden)
Zhou Huang
2017-09-01
Full Text Available In the era of big data, Internet-based geospatial information services such as various LBS apps are deployed everywhere, followed by an increasing number of queries against the massive spatial data. As a result, the traditional relational spatial database (e.g., PostgreSQL with PostGIS and Oracle Spatial cannot adapt well to the needs of large-scale spatial query processing. Spark is an emerging outstanding distributed computing framework in the Hadoop ecosystem. This paper aims to address the increasingly large-scale spatial query-processing requirement in the era of big data, and proposes an effective framework GeoSpark SQL, which enables spatial queries on Spark. On the one hand, GeoSpark SQL provides a convenient SQL interface; on the other hand, GeoSpark SQL achieves both efficient storage management and high-performance parallel computing through integrating Hive and Spark. In this study, the following key issues are discussed and addressed: (1 storage management methods under the GeoSpark SQL framework, (2 the spatial operator implementation approach in the Spark environment, and (3 spatial query optimization methods under Spark. Experimental evaluation is also performed and the results show that GeoSpark SQL is able to achieve real-time query processing. It should be noted that Spark is not a panacea. It is observed that the traditional spatial database PostGIS/PostgreSQL performs better than GeoSpark SQL in some query scenarios, especially for the spatial queries with high selectivity, such as the point query and the window query. In general, GeoSpark SQL performs better when dealing with compute-intensive spatial queries such as the kNN query and the spatial join query.
Adaptive and Optimized RDF Query Interface for Distributed WFS Data
Directory of Open Access Journals (Sweden)
Tian Zhao
2017-04-01
Full Text Available Web Feature Service (WFS is a protocol for accessing geospatial data stores such as databases and Shapefiles over the Web. However, WFS does not provide direct access to data distributed in multiple servers. In addition, WFS features extracted from their original sources are not convenient for user access due to the lack of connection to high-level concepts. Users are facing the choices of either querying each WFS server first and then integrating the results, or converting the data from all WFS servers to a more expressive format such as RDF (Resource Description Framework and then querying the integrated data. The first choice requires additional programming while the second choice is not practical for large or frequently updated datasets. The new contribution of this paper is that we propose a novel adaptive and optimized RDF query interface to overcome the aforementioned limitation. Specifically, in this paper, we propose a novel algorithm to query and synthesize distributed WFS data through an RDF query interface, where users can specify data requests to multiple WFS servers using a single RDF query. Users can also define a simple configuration to associate WFS feature types, attributes, and values with RDF classes, properties, and values so that user queries can be written using a more uniform and informative vocabulary. The algorithm translates each RDF query written in SPARQL-like syntax to multiple WFS GetFeature requests, and then converts and integrates the multiple WFS results to get the answers to the original query. The generated GetFeature requests are sent asynchronously and simultaneously to WFS servers to take advantage of the server parallelism. The results of each GetFeature request are cached to improve query response time for subsequent queries that involve one or more of the cached requests. A JavaScript-based prototype is implemented and experimental results show that the query response time can be greatly reduced through
Recommender engine for continuous-time quantum Monte Carlo methods
Huang, Li; Yang, Yi-feng; Wang, Lei
2017-03-01
Recommender systems play an essential role in the modern business world. They recommend favorable items such as books, movies, and search queries to users based on their past preferences. Applying similar ideas and techniques to Monte Carlo simulations of physical systems boosts their efficiency without sacrificing accuracy. Exploiting the quantum to classical mapping inherent in the continuous-time quantum Monte Carlo methods, we construct a classical molecular gas model to reproduce the quantum distributions. We then utilize powerful molecular simulation techniques to propose efficient quantum Monte Carlo updates. The recommender engine approach provides a general way to speed up the quantum impurity solvers.
Towards A Streams-Based Framework for Defining Location-Based Queries
DEFF Research Database (Denmark)
Huang, Xuegang; Jensen, Christian S.
2004-01-01
An infrastructure is emerging that supports the delivery of on-line, location-enabled services to mobile users. Such services involve novel database queries, and the database research community is quite active in proposing techniques for the effi- cient processing of such queries. In parallel...... to this, the management of data streams has become an active area of research. While most research in mobile services concerns performance issues, this paper aims to establish a formal framework for defining the semantics of queries encountered in mobile services, most notably the so-called continuous...... queries that are particularly relevant in this context. Rather than inventing an entirely new framework, the paper proposes a framework that builds on concepts from data streams and temporal databases. Definitions of example queries demonstrates how the framework enables clear formulation of query...
TIIREC: A Tensor Approach for Tag-Driven Item Recommendation with Sparse User Generated Content
Yu, Lu
2017-05-17
In recent years, tagging system has become a building block o summarize the content of items for further functions like retrieval or personalized recommendation in various web applications. One nontrivial requirement is to precisely deliver a list of suitable items when users interact with the systems via inputing a specific tag (i.e. a query term). Different from traditional recommender systems, we need deal with a collaborative retrieval (CR) problem, where both characteristics of retrieval and recommendation should be considered to model a ternary relationship involved with query× user× item. Recently, several works are proposed to study CR task from users’ perspective. However, they miss a significant challenge raising from the sparse content of items. In this work, we argue that items will suffer from the sparsity problem more severely than users, since items are usually observed with fewer features to support a feature-based or content-based algorithm. To tackle this problem, we aim to sufficiently explore the sophisticated relationship of each query× user× item triple from items’ perspective. By integrating item-based collaborative information for this joint task, we present an alternative factorized model that could better evaluate the ranks of those items with sparse information for the given query-user pair. In addition, we suggest to employ a recently proposed bayesian personalized ranking (BPR) algorithm to optimize latent collaborative retrieval problem from pairwise learning perspective. The experimental results on two real-world datasets, (i.e. Last.fm, Yelp), verified the efficiency and effectiveness of our proposed approach at top-k ranking metric.
TIIREC: A Tensor Approach for Tag-Driven Item Recommendation with Sparse User Generated Content
Yu, Lu; Huang, Junming; Zhou, Ge; Liu, Chuang; Zhang, Zi-Ke
2017-01-01
In recent years, tagging system has become a building block o summarize the content of items for further functions like retrieval or personalized recommendation in various web applications. One nontrivial requirement is to precisely deliver a list of suitable items when users interact with the systems via inputing a specific tag (i.e. a query term). Different from traditional recommender systems, we need deal with a collaborative retrieval (CR) problem, where both characteristics of retrieval and recommendation should be considered to model a ternary relationship involved with query× user× item. Recently, several works are proposed to study CR task from users’ perspective. However, they miss a significant challenge raising from the sparse content of items. In this work, we argue that items will suffer from the sparsity problem more severely than users, since items are usually observed with fewer features to support a feature-based or content-based algorithm. To tackle this problem, we aim to sufficiently explore the sophisticated relationship of each query× user× item triple from items’ perspective. By integrating item-based collaborative information for this joint task, we present an alternative factorized model that could better evaluate the ranks of those items with sparse information for the given query-user pair. In addition, we suggest to employ a recently proposed bayesian personalized ranking (BPR) algorithm to optimize latent collaborative retrieval problem from pairwise learning perspective. The experimental results on two real-world datasets, (i.e. Last.fm, Yelp), verified the efficiency and effectiveness of our proposed approach at top-k ranking metric.
A Policy Language for Modelling Recommendations
Abou El Kalam, Anas; Balbiani, Philippe
While current and emergent applications become more and more complex, most of existing security policies and models only consider a yes/no response to the access requests. Consequently, modelling, formalizing and implementing permissions, obligations and prohibitions do not cover the richness of all the possible scenarios. In fact, several applications have access rules with the recommendation access modality. In this paper we focus on the problem of formalizing security policies with recommendation needs. The aim is to provide a generic domain-independent formal system for modelling not only permissions, prohibitions and obligations, but also recommendations. In this respect, we present our logic-based language, the semantics, the truth conditions, our axiomatic as well as inference rules. We also give a representative use case with our specification of recommendation requirements. Finally, we explain how our logical framework could be used to query the security policy and to check its consistency.
Mendez, Rebecca; Gutierrez, Alba; Reyes, Jasmin; Márquez-Magaña, Leticia
2012-06-01
Many strains of the soil bacterium Bacillus subtilis are capable of producing and being resistant to the antibiotic sublancin because they harbor the Spβ prophage. This 135 kb viral genome is integrated into the circular DNA chromosome of B. subtilis, and contains genes for the production of and resistance to sublancin. We investigated the role of SigY in sublancin production and resistance, finding that it is important for efficient maintenance of the Spβ prophage. We were unable to detect the prophage in mutants lacking SigY. Additionally, these mutants were no longer able to produce sublancin, were sensitive to killing by this factor, and displayed a delay in sporulation. Wild-type cells with normal SigY activity were found to partially lose the Spβ prophage during growth and early sporulation, suggesting a mechanism for the bistable outcome of sibling cells capable of killing and of being killed. The appropriate regulation of SigY appears to be essential for growth as evidenced by the inability to disrupt the gene for its putative antisigma. Our results confirm a role for SigY in antibiotic production and resistance, as has been found for other members of the extracytoplasmic function sigma factor family in B. subtilis, and shows that this role is achieved by affecting maintenance of the Spβ prophage.
Orwin, Robert G; Stein-Seroussi, Alan; Edwards, Jessica M; Landy, Ann L; Flewelling, Robert L
2014-06-01
The Strategic Prevention Framework State Incentive Grant (SPF SIG) program is a national public health initiative sponsored by the U.S. Substance Abuse and Mental Health Services Administration's Center for Substance Abuse Prevention to prevent substance abuse and its consequences. State grantees used a data-driven planning model to allocate resources to 450 communities, which in turn launched over 2,200 intervention strategies to target prevention priorities in their respective populations. An additional goal was to build prevention capacity and infrastructure at the state and community levels. This paper addresses whether the state infrastructure goal was achieved, and what contextual and implementation factors were associated with success. The findings are consistent with claims that, overall, the SPF SIG program met its goal of increasing prevention capacity and infrastructure across multiple infrastructure domains, though the mediating effects of implementation were evident only in the evaluation/monitoring domain. The results also show that an initiative like the SPF SIG, which could easily have been compartmentalized within the states, has the potential to permeate more broadly throughout state prevention systems.
Les SIG-P au service d'une gestion durable des ressources ...
International Development Research Centre (IDRC) Digital Library (Canada)
Les SIG-P au service d'une gestion durable des ressources naturelles et de la sécurité alimentaire en Afrique. Les pays africains ont besoin de données convenables aux fins de la formulation et de la mise en oeuvre de politiques et de stratégies de sécurité alimentaire systématiques et cohérentes. Ces pays disposent de ...
At Forberede sig til den mundtlige prøve ( kap12)
DEFF Research Database (Denmark)
Nørgaard, Britta Kusk
2015-01-01
Et opslagsværk for studerende til støtte for bacheloropgaveskrivning. Klæd dine studerende på til at skrive den store afsluttende opgave. De studerende får med denne bog et gedigent opslagsværk, som giver svar på alle de spørgsmål, det typisk trænger sig på, når bacheloropgaven skal påbegyndes. B...
SM4MQ: A Semantic Model for Multidimensional Queries
DEFF Research Database (Denmark)
Varga, Jovan; Dobrokhotova, Ekaterina; Romero, Oscar
2017-01-01
On-Line Analytical Processing (OLAP) is a data analysis approach to support decision-making. On top of that, Exploratory OLAP is a novel initiative for the convergence of OLAP and the Semantic Web (SW) that enables the use of OLAP techniques on SW data. Moreover, OLAP approaches exploit different......, sharing, and reuse on the SW. As OLAP is based on the underlying multidimensional (MD) data model we denote such queries as MD queries and define SM4MQ: A Semantic Model for Multidimensional Queries. Furthermore, we propose a method to automate the exploitation of queries by means of SPARQL. We apply...
Accelerating SPARQL Queries and Analytics on RDF Data
Al-Harbi, Razen
2016-01-01
The complexity of SPARQL queries and RDF applications poses great challenges on distributed RDF management systems. SPARQL workloads are dynamic and con- sist of queries with variable complexities. Hence, systems that use static partitioning su
Towards Analogy-based Recommendation : Benchmarking of Perceived Analogy Semantics
Lofi, C.; Tintarev, N.; Bogers, T.; Koolen, M.; Mobasher, B.; Said, A.; Tuzhilin, A.
2017-01-01
Requests for recommendation can be seen as a form of query for candidate items, ranked by relevance. Users are however o‰en
unable to crisply de€ne what they are looking for. One of the core concepts of natural communication for describing and explaining
complex information needs in an
Query Language for Location-Based Services: A Model Checking Approach
Hoareau, Christian; Satoh, Ichiro
We present a model checking approach to the rationale, implementation, and applications of a query language for location-based services. Such query mechanisms are necessary so that users, objects, and/or services can effectively benefit from the location-awareness of their surrounding environment. The underlying data model is founded on a symbolic model of space organized in a tree structure. Once extended to a semantic model for modal logic, we regard location query processing as a model checking problem, and thus define location queries as hybrid logicbased formulas. Our approach is unique to existing research because it explores the connection between location models and query processing in ubiquitous computing systems, relies on a sound theoretical basis, and provides modal logic-based query mechanisms for expressive searches over a decentralized data structure. A prototype implementation is also presented and will be discussed.
Energy-aware SQL query acceleration through FPGA-based dynamic partial reconfiguration
Becher, Andreas; Bauer, Florian; Ziener, Daniel; Teich, Jürgen
2014-01-01
In this paper, we propose an approach for energy-aware FPGA-based query acceleration for databases on embedded devices. After the analysis of an incoming query, a query-specific hardware accelerator is generated on-the-fly and loaded on the FPGA for subsequent query execution using partial dynamic
How Do Children Reformulate Their Search Queries?
Rutter, Sophie; Ford, Nigel; Clough, Paul
2015-01-01
Introduction: This paper investigates techniques used by children in year 4 (age eight to nine) of a UK primary school to reformulate their queries, and how they use information retrieval systems to support query reformulation. Method: An in-depth study analysing the interactions of twelve children carrying out search tasks in a primary school…
ConnectomeExplorer: Query-guided visual analysis of large volumetric neuroscience data
Beyer, Johanna
2013-12-01
This paper presents ConnectomeExplorer, an application for the interactive exploration and query-guided visual analysis of large volumetric electron microscopy (EM) data sets in connectomics research. Our system incorporates a knowledge-based query algebra that supports the interactive specification of dynamically evaluated queries, which enable neuroscientists to pose and answer domain-specific questions in an intuitive manner. Queries are built step by step in a visual query builder, building more complex queries from combinations of simpler queries. Our application is based on a scalable volume visualization framework that scales to multiple volumes of several teravoxels each, enabling the concurrent visualization and querying of the original EM volume, additional segmentation volumes, neuronal connectivity, and additional meta data comprising a variety of neuronal data attributes. We evaluate our application on a data set of roughly one terabyte of EM data and 750 GB of segmentation data, containing over 4,000 segmented structures and 1,000 synapses. We demonstrate typical use-case scenarios of our collaborators in neuroscience, where our system has enabled them to answer specific scientific questions using interactive querying and analysis on the full-size data for the first time. © 1995-2012 IEEE.
A distributed query execution engine of big attributed graphs.
Batarfi, Omar; Elshawi, Radwa; Fayoumi, Ayman; Barnawi, Ahmed; Sakr, Sherif
2016-01-01
A graph is a popular data model that has become pervasively used for modeling structural relationships between objects. In practice, in many real-world graphs, the graph vertices and edges need to be associated with descriptive attributes. Such type of graphs are referred to as attributed graphs. G-SPARQL has been proposed as an expressive language, with a centralized execution engine, for querying attributed graphs. G-SPARQL supports various types of graph querying operations including reachability, pattern matching and shortest path where any G-SPARQL query may include value-based predicates on the descriptive information (attributes) of the graph edges/vertices in addition to the structural predicates. In general, a main limitation of centralized systems is that their vertical scalability is always restricted by the physical limits of computer systems. This article describes the design, implementation in addition to the performance evaluation of DG-SPARQL, a distributed, hybrid and adaptive parallel execution engine of G-SPARQL queries. In this engine, the topology of the graph is distributed over the main memory of the underlying nodes while the graph data are maintained in a relational store which is replicated on the disk of each of the underlying nodes. DG-SPARQL evaluates parts of the query plan via SQL queries which are pushed to the underlying relational stores while other parts of the query plan, as necessary, are evaluated via indexless memory-based graph traversal algorithms. Our experimental evaluation shows the efficiency and the scalability of DG-SPARQL on querying massive attributed graph datasets in addition to its ability to outperform the performance of Apache Giraph, a popular distributed graph processing system, by orders of magnitudes.
Path Minima Queries in Dynamic Weighted Trees
DEFF Research Database (Denmark)
Davoodi, Pooya; Brodal, Gerth Stølting; Satti, Srinivasa Rao
2011-01-01
In the path minima problem on a tree, each edge is assigned a weight and a query asks for the edge with minimum weight on a path between two nodes. For the dynamic version of the problem, where the edge weights can be updated, we give data structures that achieve optimal query time\\todo{what about...
Secure Nearest Neighbor Query on Crowd-Sensing Data
Directory of Open Access Journals (Sweden)
Ke Cheng
2016-09-01
Full Text Available Nearest neighbor queries are fundamental in location-based services, and secure nearest neighbor queries mainly focus on how to securely and quickly retrieve the nearest neighbor in the outsourced cloud server. However, the previous big data system structure has changed because of the crowd-sensing data. On the one hand, sensing data terminals as the data owner are numerous and mistrustful, while, on the other hand, in most cases, the terminals find it difficult to finish many safety operation due to computation and storage capability constraints. In light of they Multi Owners and Multi Users (MOMU situation in the crowd-sensing data cloud environment, this paper presents a secure nearest neighbor query scheme based on the proxy server architecture, which is constructed by protocols of secure two-party computation and secure Voronoi diagram algorithm. It not only preserves the data confidentiality and query privacy but also effectively resists the collusion between the cloud server and the data owners or users. Finally, extensive theoretical and experimental evaluations are presented to show that our proposed scheme achieves a superior balance between the security and query performance compared to other schemes.
73 Utilisation du SIG pour une réorganisation urbaine du centre-ville ...
African Journals Online (AJOL)
TOHOZIN
restructuration des routes, du chemin de fer, des bâtis et des cours d'eaux. ... aussi la réalisation d'une mappe foncière qui est un instrument capital en matière ..... confirmer que le SIG, en tant qu'outil de décision et de planification a ...
Fernández, José R; Webb, Corey; Rouzard, Karl; Voronkov, Michael; Huber, Kristen L; Stock, Jeffry B; Stock, Maxwell; Gordon, Joel S; Perez, Eduardo
2017-03-01
Isoprenylcysteine (IPC) small molecules were discovered as signal transduction modulating compounds ~25 years ago. More recently, IPC molecules have demonstrated antioxidant and anti-inflammatory properties in a variety of dermal cells as well as antimicrobial activity, representing a novel class of compounds to ameliorate skin conditions and disease. Here, we demonstrate a new IPC compound, N-acetylglutaminoyl-S-farnesyl-L-cysteine (SIG-1191), which inhibits UVB-induced inflammation blocking pro-inflammatory cytokine interleukin-6 (IL-6) and tumor necrosis factor alpha (TNF-α) production. To investigate further the previously reported hydrating potential of IPC compounds, SIG-1191 was tested for its ability to modulate aquaporin expression. Specifically, aquaporin 3 (AQP3) the most abundant aquaporin found in skin has been reported to play a key role in skin hydration, elasticity and barrier repair. Results show here for the first time that SIG-1191 increases AQP3 expression in both cultured normal human epidermal keratinocytes as well as when applied topically in a three-dimensional (3D) reconstructed human skin equivalent. Additionally, SIG-1191 dose dependently increased AQP3 protein levels, as determined by specific antibody staining, in the epidermis of the 3D skin equivalents. To begin to elucidate which signaling pathways SIG-1191 may be modulating to increase AQP3 levels, we used several pharmacological pathway inhibitors and determined that AQP3 expression is mediated by the Mitogen-activated protein kinase/Extracellular signal-regulated kinase kinase (MEK) pathway. Altogether, these data suggest SIG-1191 represents a new IPC derivative with anti-inflammatory activity that may also promote increased skin hydration based on its ability to increase AQP3 levels.
Kodama, Takeko; Takamatsu, Hiromu; Asai, Kei; Kobayashi, Kazuo; Ogasawara, Naotake; Watabe, Kazuhito
1999-01-01
The expression of 21 novel genes located in the region from dnaA to abrB of the Bacillus subtilis chromosome was analyzed. One of the genes, yaaH, had a predicted promoter sequence conserved among SigE-dependent genes. Northern blot analysis revealed that yaaH mRNA was first detected from 2 h after the cessation of logarithmic growth (T2) of sporulation in wild-type cells and in spoIIIG (SigG−) and spoIVCB (SigK−) mutants but not in spoIIAC (SigF−) and spoIIGAB (SigE−) mutants. The transcription start point was determined by primer extension analysis; the −10 and −35 regions are very similar to the consensus sequences recognized by SigE-containing RNA polymerase. A YaaH-His tag fusion encoded by a plasmid with a predicted promoter for the yaaH gene was produced from T2 of sporulation in a B. subtilis transformant and extracted from mature spores, indicating that the yaaH gene product is a spore protein. Inactivation of the yaaH gene by insertion of an erythromycin resistance gene did not affect vegetative growth or spore resistance to heat, chloroform, and lysozyme. The germination of yaaH mutant spores in a mixture of l-asparagine, d-glucose, d-fructose, and potassium chloride was almost the same as that of wild-type spores, but the mutant spores were defective in l-alanine-stimulated germination. These results suggest that yaaH is a novel gene encoding a spore protein produced in the mother cell compartment from T2 of sporulation and that it is required for the l-alanine-stimulated germination pathway. PMID:10419957
CSRQ: Communication-Efficient Secure Range Queries in Two-Tiered Sensor Networks
Directory of Open Access Journals (Sweden)
Hua Dai
2016-02-01
Full Text Available In recent years, we have seen many applications of secure query in two-tiered wireless sensor networks. Storage nodes are responsible for storing data from nearby sensor nodes and answering queries from Sink. It is critical to protect data security from a compromised storage node. In this paper, the Communication-efficient Secure Range Query (CSRQ—a privacy and integrity preserving range query protocol—is proposed to prevent attackers from gaining information of both data collected by sensor nodes and queries issued by Sink. To preserve privacy and integrity, in addition to employing the encoding mechanisms, a novel data structure called encrypted constraint chain is proposed, which embeds the information of integrity verification. Sink can use this encrypted constraint chain to verify the query result. The performance evaluation shows that CSRQ has lower communication cost than the current range query protocols.
In-route skyline querying for location-based services
DEFF Research Database (Denmark)
Xuegang, Huang; Jensen, Kristian S.
2005-01-01
With the emergence of an infrastructure for location-aware mobile services, the processing of advanced, location-based queries that are expected to underlie such services is gaining in relevance, While much work has assumed that users move in Euclidean space, this paper assumes that movement...... their efficient computation. The queries take into account several spatial preferences. and they intuitively return a set of most interesting results for each result returned by the corresponding non-skyline queries. The paper also covers a performance study of the proposed techniques based on real point...
jQuery 2.0 animation techniques beginner's guide
Culpepper, Adam
2013-01-01
This book is a guide to help you create attractive web page animations using jQuery. Written in a friendly and engaging approach this book is designed to be placed alongside your computer as a mentor.If you are a web designer or a frontend developer or if you want to learn how to animate the user interface of your web applications with jQuery, this book is for you. Experience with jQuery or Javascript would be helpful but solid knowledge base of HTML and CSS is assumed.
An Adaptive Directed Query Dissemination Scheme for Wireless Sensor Networks
Chatterjea, Supriyo; De Luigi, Simone; Havinga, Paul J.M.; Sun, M.T.
This paper describes a directed query dissemination scheme, DirQ that routes queries to the appropriate source nodes based on both constant and dynamicvalued attributes such as sensor types and sensor values. Unlike certain other query dissemination schemes, location information is not essential for
Querying Natural Logic Knowledge Bases
DEFF Research Database (Denmark)
Andreasen, Troels; Bulskov, Henrik; Jensen, Per Anker
2017-01-01
This paper describes the principles of a system applying natural logic as a knowledge base language. Natural logics are regimented fragments of natural language employing high level inference rules. We advocate the use of natural logic for knowledge bases dealing with querying of classes...... in ontologies and class-relationships such as are common in life-science descriptions. The paper adopts a version of natural logic with recursive restrictive clauses such as relative clauses and adnominal prepositional phrases. It includes passive as well as active voice sentences. We outline a prototype...... for partial translation of natural language into natural logic, featuring further querying and conceptual path finding in natural logic knowledge bases....
Memory aware query scheduling in a database cluster
F. Waas; M.L. Kersten (Martin)
2000-01-01
textabstractQuery throughput is one of the primary optimization goals in interactive web-based information systems in order to achieve the performance necessary to serve large user communities. Queries in this application domain differ significantly from those in traditional database applications:
Templates and Queries in Contextual Hypermedia
DEFF Research Database (Denmark)
Anderson, Kenneth Mark; Hansen, Frank Allan; Bouvin, Niels Olof
2006-01-01
discuss a framework, HyConSC, that implements this model and describe how it can be used to build new contextual hypermedia systems. Our framework aids the developer in the iterative development of contextual queries (via a dynamic query browser) and offers support for con-text matching, a key feature...... of contextual hypermedia. We have tested the framework with data and sensors taken from the HyCon contextual hypermedia system and are now migrating HyCon to this new framework....
Ho, Kwok M; Lan, Norris S H; Williams, Teresa A; Harahsheh, Yusra; Chapman, Andrew R; Dobb, Geoffrey J; Magder, Sheldon
2016-01-01
This cohort study compared the prognostic significance of strong ion gap (SIG) with other acid-base markers in the critically ill. The relationships between SIG, lactate, anion gap (AG), anion gap albumin-corrected (AG-corrected), base excess or strong ion difference-effective (SIDe), all obtained within the first hour of intensive care unit (ICU) admission, and the hospital mortality of 6878 patients were analysed. The prognostic significance of each acid-base marker, both alone and in combination with the Admission Mortality Prediction Model (MPM0 III) predicted mortality, were assessed by the area under the receiver operating characteristic curve (AUROC). Of the 6878 patients included in the study, 924 patients (13.4 %) died after ICU admission. Except for plasma chloride concentrations, all acid-base markers were significantly different between the survivors and non-survivors. SIG (with lactate: AUROC 0.631, confidence interval [CI] 0.611-0.652; without lactate: AUROC 0.521, 95 % CI 0.500-0.542) only had a modest ability to predict hospital mortality, and this was no better than using lactate concentration alone (AUROC 0.701, 95 % 0.682-0.721). Adding AG-corrected or SIG to a combination of lactate and MPM0 III predicted risks also did not substantially improve the latter's ability to differentiate between survivors and non-survivors. Arterial lactate concentrations explained about 11 % of the variability in the observed mortality, and it was more important than SIG (0.6 %) and SIDe (0.9 %) in predicting hospital mortality after adjusting for MPM0 III predicted risks. Lactate remained as the strongest predictor for mortality in a sensitivity multivariate analysis, allowing for non-linearity of all acid-base markers. The prognostic significance of SIG was modest and inferior to arterial lactate concentration for the critically ill. Lactate concentration should always be considered regardless whether physiological, base excess or physical-chemical approach
DEFF Research Database (Denmark)
Cao, Xin; Chen, Lisi; Cong, Gao
2012-01-01
The web is increasingly being used by mobile users. In addition, it is increasingly becoming possible to accurately geo-position mobile users and web content. This development gives prominence to spatial web data management. Specifically, a spatial keyword query takes a user location and user-sup...... different kinds of functionality as well as the ideas underlying their definition....
MetSigDis: a manually curated resource for the metabolic signatures of diseases.
Cheng, Liang; Yang, Haixiu; Zhao, Hengqiang; Pei, Xiaoya; Shi, Hongbo; Sun, Jie; Zhang, Yunpeng; Wang, Zhenzhen; Zhou, Meng
2017-08-22
Complex diseases cannot be understood only on the basis of single gene, single mRNA transcript or single protein but the effect of their collaborations. The combination consequence in molecular level can be captured by the alterations of metabolites. With the rapidly developing of biomedical instruments and analytical platforms, a large number of metabolite signatures of complex diseases were identified and documented in the literature. Biologists' hardship in the face of this large amount of papers recorded metabolic signatures of experiments' results calls for an automated data repository. Therefore, we developed MetSigDis aiming to provide a comprehensive resource of metabolite alterations in various diseases. MetSigDis is freely available at http://www.bio-annotation.cn/MetSigDis/. By reviewing hundreds of publications, we collected 6849 curated relationships between 2420 metabolites and 129 diseases across eight species involving Homo sapiens and model organisms. All of these relationships were used in constructing a metabolite disease network (MDN). This network displayed scale-free characteristics according to the degree distribution (power-law distribution with R2 = 0.909), and the subnetwork of MDN for interesting diseases and their related metabolites can be visualized in the Web. The common alterations of metabolites reflect the metabolic similarity of diseases, which is measured using Jaccard index. We observed that metabolite-based similar diseases are inclined to share semantic associations of Disease Ontology. A human disease network was then built, where a node represents a disease, and an edge indicates similarity of pair-wise diseases. The network validated the observation that linked diseases based on metabolites should have more overlapped genes. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Parallel Index and Query for Large Scale Data Analysis
Energy Technology Data Exchange (ETDEWEB)
Chou, Jerry; Wu, Kesheng; Ruebel, Oliver; Howison, Mark; Qiang, Ji; Prabhat,; Austin, Brian; Bethel, E. Wes; Ryne, Rob D.; Shoshani, Arie
2011-07-18
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-art index and query technologies are critical for facilitating interactive exploration of large datasets, but numerous challenges remain in terms of designing a system for process- ing general scientific datasets. The system needs to be able to run on distributed multi-core platforms, efficiently utilize underlying I/O infrastructure, and scale to massive datasets. We present FastQuery, a novel software framework that address these challenges. FastQuery utilizes a state-of-the-art index and query technology (FastBit) and is designed to process mas- sive datasets on modern supercomputing platforms. We apply FastQuery to processing of a massive 50TB dataset generated by a large scale accelerator modeling code. We demonstrate the scalability of the tool to 11,520 cores. Motivated by the scientific need to search for inter- esting particles in this dataset, we use our framework to reduce search time from hours to tens of seconds.
Conceptual querying through ontologies
DEFF Research Database (Denmark)
Andreasen, Troels; Bulskov, Henrik
2009-01-01
is motivated by an obvious need for users to survey huge volumes of objects in query answers. An ontology formalism and a special notion of-instantiated ontology" are introduced. The latter is a structure reflecting the content in the document collection in that; it is a restriction of a general world......We present here ail approach to conceptual querying where the aim is, given a collection of textual database objects or documents, to target an abstraction of the entire database content in terms of the concepts appearing in documents, rather than the documents in the collection. The approach...... knowledge ontology to the concepts instantiated in the collection. The notion of ontology-based similarity is briefly described, language constructs for direct navigation and retrieval of concepts in the ontology are discussed and approaches to conceptual summarization are presented....
Expression of the Arabidopsis Sigma Factor SIG5 Is Photoreceptor and Photosynthesis Controlled
Directory of Open Access Journals (Sweden)
Marina Mellenthin
2014-08-01
Full Text Available Two collections of Arabidopsis GAL4 enhancer trap lines were screened for light-intensity dependent reporter gene activation. Line N9313 was isolated for its strong light-intensity regulation. The T-DNA element trapped distant enhancers of the SIG5 promoter, which drives expression of a sigma factor involved in regulation of chloroplast genes for photosystem II core proteins. The T-DNA insertion 715 bp upstream of the transcription initiation site splits the promoter in a distal and proximal part. Both parts are sensitive to blue and red light and depend on photosynthetic electron transport activity between photosystem II and the plastoquinone pool. The mainblue-light sensitivity is localized within a 196-bp sequence (–887 to –691 bp in the proximal promoter region It is preferentially CRY1 and PHYB controlled. Type-I and type-II phytochromes mediate red-light sensitivity via various promoter elements spread over the proximal and distal upstream region. This work characterizes SIG5 as an anterograde control factor of chloroplast gene expression, which is controlled by chloroplast signals in a retrograde manner.
Constraint-based query distribution framework for an integrated global schema
DEFF Research Database (Denmark)
Malik, Ahmad Kamran; Qadir, Muhammad Abdul; Iftikhar, Nadeem
2009-01-01
and replicated data sources. The provided system is all XML-based which poses query in XML form, transforms, and integrates local results in an XML document. Contributions include the use of constraints in our existing global schema which help in source selection and query optimization, and a global query...
Evaluating XML-Extended OLAP Queries Based on a Physical Algebra
DEFF Research Database (Denmark)
Yin, Xuepeng; Pedersen, Torben Bach
2004-01-01
is desirable. In this paper, we extend previous work on the logical federation of OLAP and XML data sources by presenting a simplified query semantics,a physical query algebra and a robust OLAP-XML query engine.Performance experiments with a prototypical implementation suggest that the performance for OLAP...
Parasol: An Architecture for Cross-Cloud Federated Graph Querying
Energy Technology Data Exchange (ETDEWEB)
Lieberman, Michael; Choudhury, Sutanay; Hughes, Marisa; Patrone, Dennis; Hider, Sandy; Piatko, Christine; Chapman, Matthew; Marple, JP; Silberberg, David
2014-06-22
Large scale data fusion of multiple datasets can often provide in- sights that examining datasets individually cannot. However, when these datasets reside in different data centers and cannot be collocated due to technical, administrative, or policy barriers, a unique set of problems arise that hamper querying and data fusion. To ad- dress these problems, a system and architecture named Parasol is presented that enables federated queries over graph databases residing in multiple clouds. Parasol’s design is flexible and requires only minimal assumptions for participant clouds. Query optimization techniques are also described that are compatible with Parasol’s lightweight architecture. Experiments on a prototype implementation of Parasol indicate its suitability for cross-cloud federated graph queries.
Labeling RDF Graphs for Linear Time and Space Querying
Furche, Tim; Weinzierl, Antonius; Bry, François
Indices and data structures for web querying have mostly considered tree shaped data, reflecting the view of XML documents as tree-shaped. However, for RDF (and when querying ID/IDREF constraints in XML) data is indisputably graph-shaped. In this chapter, we first study existing indexing and labeling schemes for RDF and other graph datawith focus on support for efficient adjacency and reachability queries. For XML, labeling schemes are an important part of the widespread adoption of XML, in particular for mapping XML to existing (relational) database technology. However, the existing indexing and labeling schemes for RDF (and graph data in general) sacrifice one of the most attractive properties of XML labeling schemes, the constant time (and per-node space) test for adjacency (child) and reachability (descendant). In the second part, we introduce the first labeling scheme for RDF data that retains this property and thus achieves linear time and space processing of acyclic RDF queries on a significantly larger class of graphs than previous approaches (which are mostly limited to tree-shaped data). Finally, we show how this labeling scheme can be applied to (acyclic) SPARQL queries to obtain an evaluation algorithm with time and space complexity linear in the number of resources in the queried RDF graph.
Concept-based query language approach to enterprise information systems
Niemi, Timo; Junkkari, Marko; Järvelin, Kalervo
2014-01-01
In enterprise information systems (EISs) it is necessary to model, integrate and compute very diverse data. In advanced EISs the stored data often are based both on structured (e.g. relational) and semi-structured (e.g. XML) data models. In addition, the ad hoc information needs of end-users may require the manipulation of data-oriented (structural), behavioural and deductive aspects of data. Contemporary languages capable of treating this kind of diversity suit only persons with good programming skills. In this paper we present a concept-oriented query language approach to manipulate this diversity so that the programming skill requirements are considerably reduced. In our query language, the features which need technical knowledge are hidden in application-specific concepts and structures. Therefore, users need not be aware of the underlying technology. Application-specific concepts and structures are represented by the modelling primitives of the extended RDOOM (relational deductive object-oriented modelling) which contains primitives for all crucial real world relationships (is-a relationship, part-of relationship, association), XML documents and views. Our query language also supports intensional and extensional-intensional queries, in addition to conventional extensional queries. In its query formulation, the end-user combines available application-specific concepts and structures through shared variables.
Efficient Processing of Multiple DTW Queries in Time Series Databases
DEFF Research Database (Denmark)
Kremer, Hardy; Günnemann, Stephan; Ivanescu, Anca-Maria
2011-01-01
. In many of today’s applications, however, large numbers of queries arise at any given time. Existing DTW techniques do not process multiple DTW queries simultaneously, a serious limitation which slows down overall processing. In this paper, we propose an efficient processing approach for multiple DTW...... for multiple DTW queries....
Advanced SPARQL querying in small molecule databases.
Galgonek, Jakub; Hurt, Tomáš; Michlíková, Vendula; Onderka, Petr; Schwarz, Jan; Vondrášek, Jiří
2016-01-01
In recent years, the Resource Description Framework (RDF) and the SPARQL query language have become more widely used in the area of cheminformatics and bioinformatics databases. These technologies allow better interoperability of various data sources and powerful searching facilities. However, we identified several deficiencies that make usage of such RDF databases restrictive or challenging for common users. We extended a SPARQL engine to be able to use special procedures inside SPARQL queries. This allows the user to work with data that cannot be simply precomputed and thus cannot be directly stored in the database. We designed an algorithm that checks a query against data ontology to identify possible user errors. This greatly improves query debugging. We also introduced an approach to visualize retrieved data in a user-friendly way, based on templates describing visualizations of resource classes. To integrate all of our approaches, we developed a simple web application. Our system was implemented successfully, and we demonstrated its usability on the ChEBI database transformed into RDF form. To demonstrate procedure call functions, we employed compound similarity searching based on OrChem. The application is publicly available at https://bioinfo.uochb.cas.cz/projects/chemRDF.
Algebra-Based Optimization of XML-Extended OLAP Queries
DEFF Research Database (Denmark)
Yin, Xuepeng; Pedersen, Torben Bach
In today’s OLAP systems, integrating fast changing data, e.g., stock quotes, physically into a cube is complex and time-consuming. The widespread use of XML makes it very possible that this data is available in XML format on the WWW; thus, making XML data logically federated with OLAP systems...... is desirable. This report presents a complete foundation for such OLAP-XML federations. This includes a prototypical query engine, a simplified query semantics based on previous work, and a complete physical algebra which enables precise modeling of the execution tasks of an OLAP-XML query. Effective algebra...
Directory of Open Access Journals (Sweden)
Giorgio Agugiaro
2011-12-01
Full Text Available Constant improvements in the field of surveying, computing and distribution of digital-content are reshaping the way Cultural Heritage can be digitised and virtually accessed, even remotely via web. A traditional 2D approach for data access, exploration, retrieval and exploration may generally suffice, however more complex analyses concerning spatial and temporal features require 3D tools, which, in some cases, have not yet been implemented or are not yet generally commercially available. Efficient organisation and integration strategies applicable to the wide array of heterogeneous data in the field of Cultural Heritage represent a hot research topic nowadays. This article presents a visualisation and query tool (QueryArch3D conceived to deal with multi-resolution 3D models. Geometric data are organised in successive levels of detail (LoD, provided with geometric and semantic hierarchies and enriched with attributes coming from external data sources. The visualisation and query front-end enables the 3D navigation of the models in a virtual environment, as well as the interaction with the objects by means of queries based on attributes or on geometries. The tool can be used as a standalone application, or served through the web. The characteristics of the research work, along with some implementation issues and the developed QueryArch3D tool will be discussed and presented.
The SQL++ Query Language: Configurable, Unifying and Semi-structured
Ong, Kian Win; Papakonstantinou, Yannis; Vernoux, Romain
2014-01-01
NoSQL databases support semi-structured data, typically modeled as JSON. They also provide limited (but expanding) query languages. Their idiomatic, non-SQL language constructs, the many variations, and the lack of formal semantics inhibit deep understanding of the query languages, and also impede progress towards clean, powerful, declarative query languages. This paper specifies the syntax and semantics of SQL++, which is applicable to both JSON native stores and SQL databases. The SQL++ sem...
Path Index Based Keywords to SPARQL Query Transformation for Semantic Data Federations
Directory of Open Access Journals (Sweden)
Thilini Cooray
2016-06-01
Full Text Available Semantic web is a highly emerging research domain. Enhancing the ability of keyword query processing on Semantic Web data provides a huge support for familiarizing the usefulness of Semantic Web to the general public. Most of the existing approaches focus on just user keyword matching to RDF graphs and output the connecting elements as results. Semantic Web consists of SPARQL query language which can process queries more accurately and efficiently than general keyword matching. There are only about a couple of approaches available for transforming keyword queries to SPARQL. They basically rely on real time graph traversals? for identifying subgraphs which can connect user keywords. Those approaches are either limited to query processing on a single data store or a set of interlinked data sets. They have not focused on query processing on a federation of independent data sets which belongs to the same domain. This research proposes a Path Index based approach eliminating real time graph traversal for transforming keyword queries to SPARQL. We have introduced an ontology alignment based approach for keyword query transforming on a federation of RDF data stored using multiple heterogeneous vocabularies. Evaluation shows that the proposed approach have the ability to generate SPARQL queries which can provide highly relevant results for user keyword queries. The Path Index based query transformation approach has also achieved high efficiency compared to the existing approach.
Lazy Toggle PRM: A single-query approach to motion planning
Denny, Jory
2013-05-01
Probabilistic RoadMaps (PRMs) are quite suc-cessful in solving complex and high-dimensional motion plan-ning problems. While particularly suited for multiple-query scenarios and expansive spaces, they lack efficiency in both solving single-query scenarios and mapping narrow spaces. Two PRM variants separately tackle these gaps. Lazy PRM reduces the computational cost of roadmap construction for single-query scenarios by delaying roadmap validation until query time. Toggle PRM is well suited for mapping narrow spaces by mapping both Cfree and Cobst, which gives certain theoretical benefits. However, fully validating the two resulting roadmaps can be costly. We present a strategy, Lazy Toggle PRM, for integrating these two approaches into a method which is both suited for narrow passages and efficient single-query calculations. This simultaneously addresses two challenges of PRMs. Like Lazy PRM, Lazy Toggle PRM delays validation of roadmaps until query time, but if no path is found, the algorithm augments the roadmap using the Toggle PRM methodology. We demonstrate the effectiveness of Lazy Toggle PRM in a wide range of scenarios, including those with narrow passages and high descriptive complexity (e.g., those described by many triangles), concluding that it is more effective than existing methods in solving difficult queries. © 2013 IEEE.
Efficient external memory structures for range-aggregate queries
DEFF Research Database (Denmark)
Agarwal, P.K.; Yang, J.; Arge, L.
2013-01-01
We present external memory data structures for efficiently answering range-aggregate queries. The range-aggregate problem is defined as follows: Given a set of weighted points in Rd, compute the aggregate of the weights of the points that lie inside a d-dimensional orthogonal query rectangle. The...
Entropy Based Analysis of DNS Query Traffic in the Campus Network
Directory of Open Access Journals (Sweden)
Dennis Arturo Ludeña Romaña
2008-10-01
Full Text Available We carried out the entropy based study on the DNS query traffic from the campus network in a university through January 1st, 2006 to March 31st, 2007. The results are summarized, as follows: (1 The source IP addresses- and query keyword-based entropies change symmetrically in the DNS query traffic from the outside of the campus network when detecting the spam bot activity on the campus network. On the other hand (2, the source IP addresses- and query keywordbased entropies change similarly each other when detecting big DNS query traffic caused by prescanning or distributed denial of service (DDoS attack from the campus network. Therefore, we can detect the spam bot and/or DDoS attack bot by only watching DNS query access traffic.
Top-k aggregation queries in large-scale distributed systems
Michel, Sebastian
2007-01-01
Distributed top-k query processing has recently become an essential functionality in a large number of emerging application classes like Internet traffic monitoring and Peer-to-Peer Web search. This work addresses efficient algorithms for distributed top-k queries in wide-area networks where the index lists for the attribute values (or text terms) of a query are distributed across a number of data peers. More precisely, in this thesis, we make the following distributions: We present the fa...
Fast Inbound Top-K Query for Random Walk with Restart.
Zhang, Chao; Jiang, Shan; Chen, Yucheng; Sun, Yidan; Han, Jiawei
2015-09-01
Random walk with restart (RWR) is widely recognized as one of the most important node proximity measures for graphs, as it captures the holistic graph structure and is robust to noise in the graph. In this paper, we study a novel query based on the RWR measure, called the inbound top-k (Ink) query. Given a query node q and a number k , the Ink query aims at retrieving k nodes in the graph that have the largest weighted RWR scores to q . Ink queries can be highly useful for various applications such as traffic scheduling, disease treatment, and targeted advertising. Nevertheless, none of the existing RWR computation techniques can accurately and efficiently process the Ink query in large graphs. We propose two algorithms, namely Squeeze and Ripple, both of which can accurately answer the Ink query in a fast and incremental manner. To identify the top- k nodes, Squeeze iteratively performs matrix-vector multiplication and estimates the lower and upper bounds for all the nodes in the graph. Ripple employs a more aggressive strategy by only estimating the RWR scores for the nodes falling in the vicinity of q , the nodes outside the vicinity do not need to be evaluated because their RWR scores are propagated from the boundary of the vicinity and thus upper bounded. Ripple incrementally expands the vicinity until the top- k result set can be obtained. Our extensive experiments on real-life graph data sets show that Ink queries can retrieve interesting results, and the proposed algorithms are orders of magnitude faster than state-of-the-art method.
Query-by-example surgical activity detection.
Gao, Yixin; Vedula, S Swaroop; Lee, Gyusung I; Lee, Mija R; Khudanpur, Sanjeev; Hager, Gregory D
2016-06-01
Easy acquisition of surgical data opens many opportunities to automate skill evaluation and teaching. Current technology to search tool motion data for surgical activity segments of interest is limited by the need for manual pre-processing, which can be prohibitive at scale. We developed a content-based information retrieval method, query-by-example (QBE), to automatically detect activity segments within surgical data recordings of long duration that match a query. The example segment of interest (query) and the surgical data recording (target trial) are time series of kinematics. Our approach includes an unsupervised feature learning module using a stacked denoising autoencoder (SDAE), two scoring modules based on asymmetric subsequence dynamic time warping (AS-DTW) and template matching, respectively, and a detection module. A distance matrix of the query against the trial is computed using the SDAE features, followed by AS-DTW combined with template scoring, to generate a ranked list of candidate subsequences (substrings). To evaluate the quality of the ranked list against the ground-truth, thresholding conventional DTW distances and bipartite matching are applied. We computed the recall, precision, F1-score, and a Jaccard index-based score on three experimental setups. We evaluated our QBE method using a suture throw maneuver as the query, on two tool motion datasets (JIGSAWS and MISTIC-SL) captured in a training laboratory. We observed a recall of 93, 90 and 87 % and a precision of 93, 91, and 88 % with same surgeon same trial (SSST), same surgeon different trial (SSDT) and different surgeon (DS) experiment setups on JIGSAWS, and a recall of 87, 81 and 75 % and a precision of 72, 61, and 53 % with SSST, SSDT and DS experiment setups on MISTIC-SL, respectively. We developed a novel, content-based information retrieval method to automatically detect multiple instances of an activity within long surgical recordings. Our method demonstrated adequate recall
Geometric Representations of Condition Queries on Three-Dimensional Vector Fields
Henze, Chris
1999-01-01
Condition queries on distributed data ask where particular conditions are satisfied. It is possible to represent condition queries as geometric objects by plotting field data in various spaces derived from the data, and by selecting loci within these derived spaces which signify the desired conditions. Rather simple geometric partitions of derived spaces can represent complex condition queries because much complexity can be encapsulated in the derived space mapping itself A geometric view of condition queries provides a useful conceptual unification, allowing one to intuitively understand many existing vector field feature detection algorithms -- and to design new ones -- as variations on a common theme. A geometric representation of condition queries also provides a simple and coherent basis for computer implementation, reducing a wide variety of existing and potential vector field feature detection techniques to a few simple geometric operations.
Efficient processing of containment queries on nested sets
Ibrahim, A.; Fletcher, G.H.L.
2013-01-01
We study the problem of computing containment queries on sets which can have both atomic and set-valued objects as elements, i.e., nested sets. Containment is a fundamental query pattern with many basic applications. Our study of nested set containment is motivated by the ubiquity of nested data in
Ontology Based Queries - Investigating a Natural Language Interface
van der Sluis, Ielka; Hielkema, F.; Mellish, C.; Doherty, G.
2010-01-01
In this paper we look at what may be learned from a comparative study examining non-technical users with a background in social science browsing and querying metadata. Four query tasks were carried out with a natural language interface and with an interface that uses a web paradigm with hyperlinks.
DISPAQ: Distributed Profitable-Area Query from Big Taxi Trip Data.
Putri, Fadhilah Kurnia; Song, Giltae; Kwon, Joonho; Rao, Praveen
2017-09-25
One of the crucial problems for taxi drivers is to efficiently locate passengers in order to increase profits. The rapid advancement and ubiquitous penetration of Internet of Things (IoT) technology into transportation industries enables us to provide taxi drivers with locations that have more potential passengers (more profitable areas) by analyzing and querying taxi trip data. In this paper, we propose a query processing system, called Distributed Profitable-Area Query ( DISPAQ ) which efficiently identifies profitable areas by exploiting the Apache Software Foundation's Spark framework and a MongoDB database. DISPAQ first maintains a profitable-area query index (PQ-index) by extracting area summaries and route summaries from raw taxi trip data. It then identifies candidate profitable areas by searching the PQ-index during query processing. Then, it exploits a Z-Skyline algorithm, which is an extension of skyline processing with a Z-order space filling curve, to quickly refine the candidate profitable areas. To improve the performance of distributed query processing, we also propose local Z-Skyline optimization, which reduces the number of dominant tests by distributing killer profitable areas to each cluster node. Through extensive evaluation with real datasets, we demonstrate that our DISPAQ system provides a scalable and efficient solution for processing profitable-area queries from huge amounts of big taxi trip data.
Incentives for Delay-Constrained Data Query and Feedback in Mobile Opportunistic Crowdsensing
Directory of Open Access Journals (Sweden)
Yang Liu
2016-07-01
Full Text Available In this paper, we propose effective data collection schemes that stimulate cooperation between selfish users in mobile opportunistic crowdsensing. A query issuer generates a query and requests replies within a given delay budget. When a data provider receives the query for the first time from an intermediate user, the former replies to it and authorizes the latter as the owner of the reply. Different data providers can reply to the same query. When a user that owns a reply meets the query issuer that generates the query, it requests the query issuer to pay credits. The query issuer pays credits and provides feedback to the data provider, which gives the reply. When a user that carries a feedback meets the data provider, the data provider pays credits to the user in order to adjust its claimed expertise. Queries, replies and feedbacks can be traded between mobile users. We propose an effective mechanism to define rewards for queries, replies and feedbacks. We formulate the bargain process as a two-person cooperative game, whose solution is found by using the Nash theorem. To improve the credit circulation, we design an online auction process, in which the wealthy user can buy replies and feedbacks from the starving one using credits. We have carried out extensive simulations based on real-world traces to evaluate the proposed schemes.
General Services Administration — Schedule Sales Query presents sales volume figures as reported to GSA by contractors. The reports are generated as quarterly reports for the current year and the...
Improving accuracy for identifying related PubMed queries by an integrated approach.
Lu, Zhiyong; Wilbur, W John
2009-10-01
PubMed is the most widely used tool for searching biomedical literature online. As with many other online search tools, a user often types a series of multiple related queries before retrieving satisfactory results to fulfill a single information need. Meanwhile, it is also a common phenomenon to see a user type queries on unrelated topics in a single session. In order to study PubMed users' search strategies, it is necessary to be able to automatically separate unrelated queries and group together related queries. Here, we report a novel approach combining both lexical and contextual analyses for segmenting PubMed query sessions and identifying related queries and compare its performance with the previous approach based solely on concept mapping. We experimented with our integrated approach on sample data consisting of 1539 pairs of consecutive user queries in 351 user sessions. The prediction results of 1396 pairs agreed with the gold-standard annotations, achieving an overall accuracy of 90.7%. This demonstrates that our approach is significantly better than the previously published method. By applying this approach to a one day query log of PubMed, we found that a significant proportion of information needs involved more than one PubMed query, and that most of the consecutive queries for the same information need are lexically related. Finally, the proposed PubMed distance is shown to be an accurate and meaningful measure for determining the contextual similarity between biological terms. The integrated approach can play a critical role in handling real-world PubMed query log data as is demonstrated in our experiments.
Flexible Query Answering Systems
DEFF Research Database (Denmark)
This book constitutes the refereed proceedings of the 12th International Conference on Flexible Query Answering Systems, FQAS 2017, held in London, UK, in June 2017. The 21 full papers presented in this book together with 4 short papers were carefully reviewed and selected from 43 submissions...
Querying Sentiment Development over Time
DEFF Research Database (Denmark)
Andreasen, Troels; Christiansen, Henning; Have, Christian Theil
2013-01-01
A new language is introduced for describing hypotheses about fluctuations of measurable properties in streams of timestamped data, and as prime example, we consider trends of emotions in the constantly flowing stream of Twitter messages. The language, called EmoEpisodes, has a precise semantics...... that measures how well a hypothesis characterizes a given time interval; the semantics is parameterized so it can be adjusted to different views of the data. EmoEpisodes is extended to a query language with variables standing for unknown topics and emotions, and the query-answering mechanism will return...... instantiations for topics and emotions as well as time intervals that provide the largest deflections in this measurement. Experiments are performed on a selection of Twitter data to demonstrates the usefulness of the approach....
Multiple k Nearest Neighbor Query Processing in Spatial Network Databases
DEFF Research Database (Denmark)
Xuegang, Huang; Jensen, Christian Søndergaard; Saltenis, Simonas
2006-01-01
This paper concerns the efficient processing of multiple k nearest neighbor queries in a road-network setting. The assumed setting covers a range of scenarios such as the one where a large population of mobile service users that are constrained to a road network issue nearest-neighbor queries...... for points of interest that are accessible via the road network. Given multiple k nearest neighbor queries, the paper proposes progressive techniques that selectively cache query results in main memory and subsequently reuse these for query processing. The paper initially proposes techniques for the case...... where an upper bound on k is known a priori and then extends the techniques to the case where this is not so. Based on empirical studies with real-world data, the paper offers insight into the circumstances under which the different proposed techniques can be used with advantage for multiple k nearest...
Parallel main-memory indexing for moving-object query and update workloads
DEFF Research Database (Denmark)
Sidlauskas, Darius; Saltenis, Simonas; Jensen, Christian Søndergaard
2012-01-01
of supporting the location-related query and update workloads generated by very large populations of such moving objects. This paper presents a main-memory indexing technique that aims to support such workloads. The technique, called PGrid, uses a grid structure that is capable of exploiting the parallelism...... offered by modern processors. Unlike earlier proposals that maintain separate structures for updates and queries, PGrid allows both long-running queries and rapid updates to operate on a single data structure and thus offers up-to-date query results. Because PGrid does not rely on creating snapshots...... on the same current data-store state, PGrid outperforms snapshot-based techniques in terms of both query freshness and CPU cycle-wise efficiency....
VPipe: Virtual Pipelining for Scheduling of DAG Stream Query Plans
Wang, Song; Gupta, Chetan; Mehta, Abhay
There are data streams all around us that can be harnessed for tremendous business and personal advantage. For an enterprise-level stream processing system such as CHAOS [1] (Continuous, Heterogeneous Analytic Over Streams), handling of complex query plans with resource constraints is challenging. While several scheduling strategies exist for stream processing, efficient scheduling of complex DAG query plans is still largely unsolved. In this paper, we propose a novel execution scheme for scheduling complex directed acyclic graph (DAG) query plans with meta-data enriched stream tuples. Our solution, called Virtual Pipelined Chain (or VPipe Chain for short), effectively extends the "Chain" pipelining scheduling approach to complex DAG query plans.
A hierarchical recurrent encoder-decoder for generative context-aware query suggestion
DEFF Research Database (Denmark)
Sordoni, Alessandro; Bengio, Yoshua; Vahabi, Hossein
2015-01-01
Users may strive to formulate an adequate textual query for their information need. Search engines assist the users by presenting query suggestions. To preserve the original search intent, suggestions should be context-aware and account for the previous queries issued by the user. Achieving context...
Bat-Inspired Algorithm Based Query Expansion for Medical Web Information Retrieval.
Khennak, Ilyes; Drias, Habiba
2017-02-01
With the increasing amount of medical data available on the Web, looking for health information has become one of the most widely searched topics on the Internet. Patients and people of several backgrounds are now using Web search engines to acquire medical information, including information about a specific disease, medical treatment or professional advice. Nonetheless, due to a lack of medical knowledge, many laypeople have difficulties in forming appropriate queries to articulate their inquiries, which deem their search queries to be imprecise due the use of unclear keywords. The use of these ambiguous and vague queries to describe the patients' needs has resulted in a failure of Web search engines to retrieve accurate and relevant information. One of the most natural and promising method to overcome this drawback is Query Expansion. In this paper, an original approach based on Bat Algorithm is proposed to improve the retrieval effectiveness of query expansion in medical field. In contrast to the existing literature, the proposed approach uses Bat Algorithm to find the best expanded query among a set of expanded query candidates, while maintaining low computational complexity. Moreover, this new approach allows the determination of the length of the expanded query empirically. Numerical results on MEDLINE, the on-line medical information database, show that the proposed approach is more effective and efficient compared to the baseline.
Active Learning by Querying Informative and Representative Examples.
Huang, Sheng-Jun; Jin, Rong; Zhou, Zhi-Hua
2014-10-01
Active learning reduces the labeling cost by iteratively selecting the most valuable data to query their labels. It has attracted a lot of interests given the abundance of unlabeled data and the high cost of labeling. Most active learning approaches select either informative or representative unlabeled instances to query their labels, which could significantly limit their performance. Although several active learning algorithms were proposed to combine the two query selection criteria, they are usually ad hoc in finding unlabeled instances that are both informative and representative. We address this limitation by developing a principled approach, termed QUIRE, based on the min-max view of active learning. The proposed approach provides a systematic way for measuring and combining the informativeness and representativeness of an unlabeled instance. Further, by incorporating the correlation among labels, we extend the QUIRE approach to multi-label learning by actively querying instance-label pairs. Extensive experimental results show that the proposed QUIRE approach outperforms several state-of-the-art active learning approaches in both single-label and multi-label learning.
SPARQLGraph: a web-based platform for graphically querying biological Semantic Web databases.
Schweiger, Dominik; Trajanoski, Zlatko; Pabinger, Stephan
2014-08-15
Semantic Web has established itself as a framework for using and sharing data across applications and database boundaries. Here, we present a web-based platform for querying biological Semantic Web databases in a graphical way. SPARQLGraph offers an intuitive drag & drop query builder, which converts the visual graph into a query and executes it on a public endpoint. The tool integrates several publicly available Semantic Web databases, including the databases of the just recently released EBI RDF platform. Furthermore, it provides several predefined template queries for answering biological questions. Users can easily create and save new query graphs, which can also be shared with other researchers. This new graphical way of creating queries for biological Semantic Web databases considerably facilitates usability as it removes the requirement of knowing specific query languages and database structures. The system is freely available at http://sparqlgraph.i-med.ac.at.
A Novel Visual Data Mining Module for the Geographical Information System gvSIG
Directory of Open Access Journals (Sweden)
Romel Vázquez-Rodríguez
2013-01-01
Full Text Available The exploration of large GIS models containing spatio-temporal information is a challenge. In this paper we propose the integration of scientific visualization (ScVis techniques into geographic information systems (GIS as an alternative for the visual analysis of data. Providing GIS with such tools improves the analysis and understanding of datasets with very low spatial density and allows to find correlations between variables in time and space. In this regard, we present a new visual data mining tool for the GIS gvSIG. This tool has been implemented as a gvSIG module and contains several ScVis techniques for multiparameter data with a wide range of possibilities to explore interactively the data. The developed module is a powerful visual data mining and data visualization tool to obtain knowledge from multiple datasets in time and space. A real case study with meteorological data from Villa Clara province (Cuba is presented, where the implemented visualization techniques were used to analyze the available datasets. Although it is tested with meteorological data, the developed module is of general application in the sense that it can be used in multiple application fields related with Earth Sciences.
Graphical modeling and query language for hospitals.
Barzdins, Janis; Barzdins, Juris; Rencis, Edgars; Sostaks, Agris
2013-01-01
So far there has been little evidence that implementation of the health information technologies (HIT) is leading to health care cost savings. One of the reasons for this lack of impact by the HIT likely lies in the complexity of the business process ownership in the hospitals. The goal of our research is to develop a business model-based method for hospital use which would allow doctors to retrieve directly the ad-hoc information from various hospital databases. We have developed a special domain-specific process modelling language called the MedMod. Formally, we define the MedMod language as a profile on UML Class diagrams, but we also demonstrate it on examples, where we explain the semantics of all its elements informally. Moreover, we have developed the Process Query Language (PQL) that is based on MedMod process definition language. The purpose of PQL is to allow a doctor querying (filtering) runtime data of hospital's processes described using MedMod. The MedMod language tries to overcome deficiencies in existing process modeling languages, allowing to specify the loosely-defined sequence of the steps to be performed in the clinical process. The main advantages of PQL are in two main areas - usability and efficiency. They are: 1) the view on data through "glasses" of familiar process, 2) the simple and easy-to-perceive means of setting filtering conditions require no more expertise than using spreadsheet applications, 3) the dynamic response to each step in construction of the complete query that shortens the learning curve greatly and reduces the error rate, and 4) the selected means of filtering and data retrieving allows to execute queries in O(n) time regarding the size of the dataset. We are about to continue developing this project with three further steps. First, we are planning to develop user-friendly graphical editors for the MedMod process modeling and query languages. The second step is to do evaluation of usability the proposed language and tool
Secretory IgA (SIgA: designed for antimicrobial defence
Directory of Open Access Journals (Sweden)
Per eBrandtzaeg
2013-08-01
Full Text Available Prevention of infections by vaccination remains a compelling goal to improve public health. Mucosal vaccines would make immunization procedures easier, be better suited for mass administration, and most efficiently induce immune exclusion -- a term coined for non-inflammatory antibody shielding of internal body surfaces, mediated principally by secretory immunoglobulin A (SIgA. The exported antibodies are polymeric, mainly IgA dimers (pIgA, produced by local plasma cells stimulated by antigens that target the mucosae. SIgA was early shown to be complexed with an epithelial glycoprotein -- the secretory component (SC. A common SC-dependent transport mechanism for pIgA and pentameric IgM was then proposed, implying that membrane SC acts as a receptor, now usually called the polymeric Ig receptor (pIgR. From the basolateral surface, pIg-pIgR complexes are taken up by endocytosis and then extruded into the lumen after apical cleavage of the receptor -- bound SC having stabilizing and innate functions in the secretory antibodies. Mice deficient for pIgR show that this is the only receptor responsible for epithelial export of IgA and IgM. These knockout mice show a variety of defects in their mucosal defensce and changes in their intestinal microbiota. In the gut, induction of B cells occurs in gut-associated lymphoid tissue (GALT, particularly the Peyer’s patches and isolated lymphoid follicles, but also in mesenteric lymph nodes. Plasma cell differentiation is accomplished in the lamina propria to which the activated memory/effector B cells home. The airways also receive such cells from nasopharynx-associated lymphoid tissue (NALT but by different homing receptors. This compartmentalization is a challenge for mucosal vaccination, as are the mechanisms used by the mucosal immune system to discriminate between commensal symbionts (mutualism, pathobionts and overt pathogens (elimination.
On (dynamic) range minimum queries in external memory
DEFF Research Database (Denmark)
Arge, L.; Fischer, Johannes; Sanders, Peter
2013-01-01
We study the one-dimensional range minimum query (RMQ) problem in the external memory model. We provide the first space-optimal solution to the batched static version of the problem. On an instance with N elements and Q queries, our solution takes Θ(sort(N + Q)) = Θ( N+QB log M /B N+QB ) I...
Menangkal Serangan SQL Injection Dengan Parameterized Query
Directory of Open Access Journals (Sweden)
Yulianingsih Yulianingsih
2016-06-01
Full Text Available Semakin meningkat pertumbuhan layanan informasi maka semakin tinggi pula tingkat kerentanan keamanan dari suatu sumber informasi. Melalui tulisan ini disajikan penelitian yang dilakukan secara eksperimen yang membahas tentang kejahatan penyerangan database secara SQL Injection. Penyerangan dilakukan melalui halaman autentikasi dikarenakan halaman ini merupakan pintu pertama akses yang seharusnya memiliki pertahanan yang cukup. Kemudian dilakukan eksperimen terhadap metode Parameterized Query untuk mendapatkan solusi terhadap permasalahan tersebut. Kata kunci— Layanan Informasi, Serangan, eksperimen, SQL Injection, Parameterized Query.
A Revisit of Query Expansion with Different Semantic Levels
DEFF Research Database (Denmark)
Zhang, Ce; Cui, Bin; Cong, Gao
2009-01-01
Query expansion has received extensive attention in information retrieval community. Although semantic based query expansion appears to be promising in improving retrieval performance, previous research has shown that it cannot consistently improve retrieval performance. It is a tricky problem to...
Node Query Preservation for Deterministic Linear Top-Down Tree Transducers
Directory of Open Access Journals (Sweden)
Kazuki Miyahara
2013-11-01
Full Text Available This paper discusses the decidability of node query preservation problems for XML document transformations. We assume a transformation given by a deterministic linear top-down data tree transducer (abbreviated as DLT^V and an n-ary query based on runs of a tree automaton. We say that a DLT^V Tr strongly preserves a query Q if there is a query Q' such that for every document t, the answer set of Q' for Tr(t is equal to the answer set of Q for t. Also we say that Tr weakly preserves Q if there is a query Q' such that for every t_d in the range of Tr, the answer set of Q' for t_d is equal to the union of the answer set of Q for t such that t_d = Tr(t. We show that the weak preservation problem is coNP-complete and the strong preservation problem is in 2-EXPTIME.
Predicting Drug Recalls From Internet Search Engine Queries.
Yom-Tov, Elad
2017-01-01
Batches of pharmaceuticals are sometimes recalled from the market when a safety issue or a defect is detected in specific production runs of a drug. Such problems are usually detected when patients or healthcare providers report abnormalities to medical authorities. Here, we test the hypothesis that defective production lots can be detected earlier by monitoring queries to Internet search engines. We extracted queries from the USA to the Bing search engine, which mentioned one of the 5195 pharmaceutical drugs during 2015 and all recall notifications issued by the Food and Drug Administration (FDA) during that year. By using attributes that quantify the change in query volume at the state level, we attempted to predict if a recall of a specific drug will be ordered by FDA in a time horizon ranging from 1 to 40 days in future. Our results show that future drug recalls can indeed be identified with an AUC of 0.791 and a lift at 5% of approximately 6 when predicting a recall occurring one day ahead. This performance degrades as prediction is made for longer periods ahead. The most indicative attributes for prediction are sudden spikes in query volume about a specific medicine in each state. Recalls of prescription drugs and those estimated to be of medium-risk are more likely to be identified using search query data. These findings suggest that aggregated Internet search engine data can be used to facilitate in early warning of faulty batches of medicines.
BioSig: A bioinformatic system for studying the mechanism of intra-cell signaling
Parvin, B.; Cong, G.; Fontenay, G.; Taylor, J.; Henshall, R.; Barcellos-Hoff, M.H.
2000-01-01
Mapping inter-cell signaling pathways requires an integrated view of experimental and informatic protocols. BioSig provides the foundation of cataloging inter-cell responses as a function of particular conditioning, treatment, staining, etc. for either in vivo or in vitro experiments. This paper outlines the system architecture, a functional data model for representing experimental protocols, algorithms for image analysis, and the requried statistical analysis. The architecture provides...
External Data Structures for Shortest Path Queries on Planar Digraphs
DEFF Research Database (Denmark)
Arge, Lars; Toma, Laura
2005-01-01
In this paper we present space-query trade-offs for external memory data structures that answer shortest path queries on planar directed graphs. For any S = Ω(N 1 + ε) and S = O(N2/B), our main result is a family of structures that use S space and answer queries in O(N2/ S B) I/Os, thus obtaining...... optimal space-query product O(N2/B). An S space structure can be constructed in O(√S · sort(N)) I/Os, where sort(N) is the number of I/Os needed to sort N elements, B is the disk block size, and N is the size of the graph....
Minimizing I/O Costs of Multi-Dimensional Queries with BitmapIndices
Energy Technology Data Exchange (ETDEWEB)
Rotem, Doron; Stockinger, Kurt; Wu, Kesheng
2006-03-30
Bitmap indices have been widely used in scientific applications and commercial systems for processing complex,multi-dimensional queries where traditional tree-based indices would not work efficiently. A common approach for reducing the size of a bitmap index for high cardinality attributes is to group ranges of values of an attribute into bins and then build a bitmap for each bin rather than a bitmap for each value of the attribute. Binning reduces storage costs,however, results of queries based on bins often require additional filtering for discarding it false positives, i.e., records in the result that do not satisfy the query constraints. This additional filtering,also known as ''candidate checking,'' requires access to the base data on disk and involves significant I/O costs. This paper studies strategies for minimizing the I/O costs for ''candidate checking'' for multi-dimensional queries. This is done by determining the number of bins allocated for each dimension and then placing bin boundaries in optimal locations. Our algorithms use knowledge of data distribution and query workload. We derive several analytical results concerning optimal bin allocation for a probabilistic query model. Our experimental evaluation with real life data shows an average I/O cost improvement of at least a factor of 10 for multi-dimensional queries on datasets from two different applications. Our experiments also indicate that the speedup increases with the number of query dimensions.
Video Stream Retrieval of Unseen Queries using Semantic Memory
Cappallo, S.; Mensink, T.; Snoek, C.G.M.; Wilson, R.C.; Hancock, E.R.; Smith, W.A.P.
2016-01-01
Retrieval of live, user-broadcast video streams is an under-addressed and increasingly relevant challenge. The on-line nature of the problem requires temporal evaluation and the unforeseeable scope of potential queries motivates an approach which can accommodate arbitrary search queries. To account
A framework for query optimization to support data mining
S.R. Choenni (Sunil); A.P.J.M. Siebes (Arno)
1996-01-01
textabstractIn order to extract knowledge from databases, data mining algorithms heavily query the databases. Inefficient processing of these queries will inevitably have its impact on the performance of these algorithms, making them less valuable. In this paper, we describe an optimization
Representation and alignment of sung queries for music information retrieval
Adams, Norman H.; Wakefield, Gregory H.
2005-09-01
The pursuit of robust and rapid query-by-humming systems, which search melodic databases using sung queries, is a common theme in music information retrieval. The retrieval aspect of this database problem has received considerable attention, whereas the front-end processing of sung queries and the data structure to represent melodies has been based on musical intuition and historical momentum. The present work explores three time series representations for sung queries: a sequence of notes, a ``smooth'' pitch contour, and a sequence of pitch histograms. The performance of the three representations is compared using a collection of naturally sung queries. It is found that the most robust performance is achieved by the representation with highest dimension, the smooth pitch contour, but that this representation presents a formidable computational burden. For all three representations, it is necessary to align the query and target in order to achieve robust performance. The computational cost of the alignment is quadratic, hence it is necessary to keep the dimension small for rapid retrieval. Accordingly, iterative deepening is employed to achieve both robust performance and rapid retrieval. Finally, the conventional iterative framework is expanded to adapt the alignment constraints based on previous iterations, further expediting retrieval without degrading performance.
Design and analysis of stochastic DSS query optimizers in a distributed database system
Directory of Open Access Journals (Sweden)
Manik Sharma
2016-07-01
Full Text Available Query optimization is a stimulating task of any database system. A number of heuristics have been applied in recent times, which proposed new algorithms for substantially improving the performance of a query. The hunt for a better solution still continues. The imperishable developments in the field of Decision Support System (DSS databases are presenting data at an exceptional rate. The massive volume of DSS data is consequential only when it is able to access and analyze by distinctive researchers. Here, an innovative stochastic framework of DSS query optimizer is proposed to further optimize the design of existing query optimization genetic approaches. The results of Entropy Based Restricted Stochastic Query Optimizer (ERSQO are compared with the results of Exhaustive Enumeration Query Optimizer (EAQO, Simple Genetic Query Optimizer (SGQO, Novel Genetic Query Optimizer (NGQO and Restricted Stochastic Query Optimizer (RSQO. In terms of Total Costs, EAQO outperforms SGQO, NGQO, RSQO and ERSQO. However, stochastic approaches dominate in terms of runtime. The Total Costs produced by ERSQO is better than SGQO, NGQO and RGQO by 12%, 8% and 5% respectively. Moreover, the effect of replicating data on the Total Costs of DSS query is also examined. In addition, the statistical analysis revealed a 2-tailed significant correlation between the number of join operations and the Total Costs of distributed DSS query. Finally, in regard to the consistency of stochastic query optimizers, the results of SGQO, NGQO, RSQO and ERSQO are 96.2%, 97.2%, 97.45 and 97.8% consistent respectively.
Efficient and Flexible KNN Query Processing in Real-Life Road Networks
DEFF Research Database (Denmark)
Lu, Yang; Bui, Bin; Zhao, Jiakui
2008-01-01
are included into the RNG index, which enables the index to support both distance-based and time-based KNN queries and continuous KNN queries. Our work extends previous ones by taking into account more practical scenarios, such as complexities in real-life road networks and time-based KNN queries. Extensive......Along with the developments of mobile services, effectively modeling road networks and efficiently indexing and querying network constrained objects has become a challenging problem. In this paper, we first introduce a road network model which captures real-life road networks better than previous...
PubMedReco: A Real-Time Recommender System for PubMed Citations.
Samuel, Hamman W; Zaïane, Osmar R
2017-01-01
We present a recommender system, PubMedReco, for real-time suggestions of medical articles from PubMed, a database of over 23 million medical citations. PubMedReco can recommend medical article citations while users are conversing in a synchronous communication environment such as a chat room. Normally, users would have to leave their chat interface to open a new web browser window, and formulate an appropriate search query to retrieve relevant results. PubMedReco automatically generates the search query and shows relevant citations within the same integrated user interface. PubMedReco analyzes relevant keywords associated with the conversation and uses them to search for relevant citations using the PubMed E-utilities programming interface. Our contributions include improvements to the user experience for searching PubMed from within health forums and chat rooms, and a machine learning model for identifying relevant keywords. We demonstrate the feasibility of PubMedReco using BMJ's Doc2Doc forum discussions.
Two Dimensional Range Minimum Queries and Fibonacci Lattices
DEFF Research Database (Denmark)
Brodal, Gerth Stølting; Davoodi, Pooya; Lewenstein, Moshe
2012-01-01
technique—the discrepancy properties of Fibonacci lattices—we give an indexing data structure for 2D-RMQs that uses O(N/c) bits additional space with O(clogc(loglogc)2) query time, for any parameter c, 4 ≤ c ≤ N. Also, when the entries of the input matrix are from {0,1}, we show that the query time can...
Can Internet search queries help to predict stock market volatility?
Dimpfl, Thomas; Jank, Stephan
2011-01-01
This paper studies the dynamics of stock market volatility and retail investor attention measured by internet search queries. We find a strong co-movement of stock market indices’ realized volatility and the search queries for their names. Furthermore, Granger causality is bi-directional: high searches follow high volatility, and high volatility follows high searches. Using the latter feedback effect to predict volatility we find that search queries contain additional information about market...
A high performance, ad-hoc, fuzzy query processing system for relational databases
Mansfield, William H., Jr.; Fleischman, Robert M.
1992-01-01
Database queries involving imprecise or fuzzy predicates are currently an evolving area of academic and industrial research. Such queries place severe stress on the indexing and I/O subsystems of conventional database environments since they involve the search of large numbers of records. The Datacycle architecture and research prototype is a database environment that uses filtering technology to perform an efficient, exhaustive search of an entire database. It has recently been modified to include fuzzy predicates in its query processing. The approach obviates the need for complex index structures, provides unlimited query throughput, permits the use of ad-hoc fuzzy membership functions, and provides a deterministic response time largely independent of query complexity and load. This paper describes the Datacycle prototype implementation of fuzzy queries and some recent performance results.
On the Suitability of Skyline Queries for Data Exploration
DEFF Research Database (Denmark)
Chester, Sean; Mortensen, Michael Lind; Assent, Ira
2014-01-01
The skyline operator has been studied in database research for multi-criteria decision making. Until now the focus has been on the efficiency or accuracy of single queries. In practice, however, users are increasingly confronted with unknown data collections, where precise query formulation proves...
Intelligent query processing for semantic mediation of information systems
Directory of Open Access Journals (Sweden)
Saber Benharzallah
2011-11-01
Full Text Available We propose an intelligent and an efficient query processing approach for semantic mediation of information systems. We propose also a generic multi agent architecture that supports our approach. Our approach focuses on the exploitation of intelligent agents for query reformulation and the use of a new technology for the semantic representation. The algorithm is self-adapted to the changes of the environment, offers a wide aptitude and solves the various data conflicts in a dynamic way; it also reformulates the query using the schema mediation method for the discovered systems and the context mediation for the other systems.
Relaxing rdf queries based on user and domain preferences
DEFF Research Database (Denmark)
Dolog, Peter; Stueckenschmidt, Heiner; Wache, Holger
2009-01-01
Research in cooperative query answering is triggered by the observation that users are often not able to correctly formulate queries to databases such that they return the intended result. Due to lacking knowledge about the contents and the structure of a database, users will often only be able t...... application in the context of e-learning systems....... knowledge and user preferences. We describe a framework for information access that combines query refinement and relaxation in order to provide robust, personalized access to heterogeneous resource description framework data as well as an implementation in terms of rewriting rules and explain its...
AN EFFECTIVE RECOMMENDATIONS BY DIFFUSION ALGORITHM FOR WEB GRAPH MINING
Directory of Open Access Journals (Sweden)
S. Vasukipriya
2013-04-01
Full Text Available The information on the World Wide Web grows in an explosive rate. Societies are relying more on the Web for their miscellaneous needs of information. Recommendation systems are active information filtering systems that attempt to present the information items like movies, music, images, books recommendations, tags recommendations, query suggestions, etc., to the users. Various kinds of data bases are used for the recommendations; fundamentally these data bases can be molded in the form of many types of graphs. Aiming at provided that a general framework on effective DR (Recommendations by Diffusion algorithm for web graphs mining. First introduce a novel graph diffusion model based on heat diffusion. This method can be applied to both undirected graphs and directed graphs. Then it shows how to convert different Web data sources into correct graphs in our models.
Blink and it's done: Interactive queries on very large data
Agarwal, Sameer; Iyer, Anand P.; Panda, Aurojit; Mozafari, Barzan; Stoica, Ion; Madden, Samuel R.
2012-01-01
In this demonstration, we present BlinkDB, a massively parallel, sampling-based approximate query processing framework for running interactive queries on large volumes of data. The key observation in BlinkDB is that one can make reasonable decisions in the absence of perfect answers. BlinkDB extends the Hive/HDFS stack and can handle the same set of SPJA (selection, projection, join and aggregate) queries as supported by these systems. BlinkDB provides real-time answers along with statistical...
Accelerating SPARQL queries by exploiting hash-based locality and adaptive partitioning
Al-Harbi, Razen
2016-02-08
State-of-the-art distributed RDF systems partition data across multiple computer nodes (workers). Some systems perform cheap hash partitioning, which may result in expensive query evaluation. Others try to minimize inter-node communication, which requires an expensive data preprocessing phase, leading to a high startup cost. Apriori knowledge of the query workload has also been used to create partitions, which, however, are static and do not adapt to workload changes. In this paper, we propose AdPart, a distributed RDF system, which addresses the shortcomings of previous work. First, AdPart applies lightweight partitioning on the initial data, which distributes triples by hashing on their subjects; this renders its startup overhead low. At the same time, the locality-aware query optimizer of AdPart takes full advantage of the partitioning to (1) support the fully parallel processing of join patterns on subjects and (2) minimize data communication for general queries by applying hash distribution of intermediate results instead of broadcasting, wherever possible. Second, AdPart monitors the data access patterns and dynamically redistributes and replicates the instances of the most frequent ones among workers. As a result, the communication cost for future queries is drastically reduced or even eliminated. To control replication, AdPart implements an eviction policy for the redistributed patterns. Our experiments with synthetic and real data verify that AdPart: (1) starts faster than all existing systems; (2) processes thousands of queries before other systems become online; and (3) gracefully adapts to the query load, being able to evaluate queries on billion-scale RDF data in subseconds.
Query-dependent banding (QDB for faster RNA similarity searches.
Directory of Open Access Journals (Sweden)
Eric P Nawrocki
2007-03-01
Full Text Available When searching sequence databases for RNAs, it is desirable to score both primary sequence and RNA secondary structure similarity. Covariance models (CMs are probabilistic models well-suited for RNA similarity search applications. However, the computational complexity of CM dynamic programming alignment algorithms has limited their practical application. Here we describe an acceleration method called query-dependent banding (QDB, which uses the probabilistic query CM to precalculate regions of the dynamic programming lattice that have negligible probability, independently of the target database. We have implemented QDB in the freely available Infernal software package. QDB reduces the average case time complexity of CM alignment from LN(2.4 to LN(1.3 for a query RNA of N residues and a target database of L residues, resulting in a 4-fold speedup for typical RNA queries. Combined with other improvements to Infernal, including informative mixture Dirichlet priors on model parameters, benchmarks also show increased sensitivity and specificity resulting from improved parameterization.
Directory of Open Access Journals (Sweden)
André Le Jeune
Full Text Available BACKGROUND: Enterococcus faecalis is one of the leading agents of nosocomial infections. To cause diseases, pathogens or opportunistic bacteria have to adapt and survive to the defense systems encountered in the host. One of the most important compounds of the host innate defense response against invading microorganisms is lysozyme. It is found in a wide variety of body fluids, as well as in cells of the innate immune system. Lysozyme could act either as a muramidase and/or as a cationic antimicrobial peptide. Like Staphylococcus aureus, E. faecalis is one of the few bacteria that are completely lysozyme resistant. RESULTS: This study revealed that oatA (O-acetyl transferase and dlt (D-Alanylation of lipoteicoic acids genes contribute only partly to the lysozyme resistance of E. faecalis and that a specific transcriptional regulator, the extracytoplasmic function SigV sigma factor plays a key role in this event. Indeed, the sigV single mutant is as sensitive as the oatA/dltA double mutant, and the sigV/oatA/dltA triple mutant displays the highest level of lysozyme sensitivity suggesting synergistic effects of these genes. In S. aureus, mutation of both oatA and dlt genes abolishes completely the lysozyme resistance, whereas this is not the case in E. faecalis. Interestingly SigV does not control neither oatA nor dlt genes. Moreover, the sigV mutants clearly showed a reduced capacity to colonize host tissues, as they are significantly less recovered than the parental JH2-2 strain from organs of mice subjected to intravenous or urinary tract infections. CONCLUSIONS: This work led to the discovery of an original model of lysozyme resistance mechanism which is obviously more complex than those described for other Gram positive pathogens. Moreover, our data provide evidences for a direct link between lysozyme resistance and virulence of E. faecalis.
Robust Optimization of Database Queries
Indian Academy of Sciences (India)
JAYANT
2011-07-06
Jul 6, 2011 ... Based on first-order logic. ○ Edgar ... Cost-based Query Optimizer s choice of execution plan ... Determines the values of goods shipped between nations in a time period select ..... Born: 1881 Elected: 1934 Section: Medicine.
Fuzzy Querying: Issues and Perspectives..
Czech Academy of Sciences Publication Activity Database
Kacprzyk, J.; Pasi, G.; Vojtáš, Peter; Zadrozny, S.
2000-01-01
Roč. 36, č. 6 (2000), s. 605-616 ISSN 0023-5954 Institutional research plan: AV0Z1030915 Keywords : flexible querying * information retrieval * fuzzy databases Subject RIV: BA - General Mathematics http://dml.cz/handle/10338.dmlcz/135376
Using Bitmap Indexing Technology for Combined Numerical and TextQueries
Energy Technology Data Exchange (ETDEWEB)
Stockinger, Kurt; Cieslewicz, John; Wu, Kesheng; Rotem, Doron; Shoshani, Arie
2006-10-16
In this paper, we describe a strategy of using compressedbitmap indices to speed up queries on both numerical data and textdocuments. By using an efficient compression algorithm, these compressedbitmap indices are compact even for indices with millions of distinctterms. Moreover, bitmap indices can be used very efficiently to answerBoolean queries over text documents involving multiple query terms.Existing inverted indices for text searches are usually inefficient forcorpora with a very large number of terms as well as for queriesinvolving a large number of hits. We demonstrate that our compressedbitmap index technology overcomes both of those short-comings. In aperformance comparison against a commonly used database system, ourindices answer queries 30 times faster on average. To provide full SQLsupport, we integrated our indexing software, called FastBit, withMonetDB. The integrated system MonetDB/FastBit provides not onlyefficient searches on a single table as FastBit does, but also answersjoin queries efficiently. Furthermore, MonetDB/FastBit also provides avery efficient retrieval mechanism of result records.
BioFed: federated query processing over life sciences linked open data.
Hasnain, Ali; Mehmood, Qaiser; Sana E Zainab, Syeda; Saleem, Muhammad; Warren, Claude; Zehra, Durre; Decker, Stefan; Rebholz-Schuhmann, Dietrich
2017-03-15
Biomedical data, e.g. from knowledge bases and ontologies, is increasingly made available following open linked data principles, at best as RDF triple data. This is a necessary step towards unified access to biological data sets, but this still requires solutions to query multiple endpoints for their heterogeneous data to eventually retrieve all the meaningful information. Suggested solutions are based on query federation approaches, which require the submission of SPARQL queries to endpoints. Due to the size and complexity of available data, these solutions have to be optimised for efficient retrieval times and for users in life sciences research. Last but not least, over time, the reliability of data resources in terms of access and quality have to be monitored. Our solution (BioFed) federates data over 130 SPARQL endpoints in life sciences and tailors query submission according to the provenance information. BioFed has been evaluated against the state of the art solution FedX and forms an important benchmark for the life science domain. The efficient cataloguing approach of the federated query processing system 'BioFed', the triple pattern wise source selection and the semantic source normalisation forms the core to our solution. It gathers and integrates data from newly identified public endpoints for federated access. Basic provenance information is linked to the retrieved data. Last but not least, BioFed makes use of the latest SPARQL standard (i.e., 1.1) to leverage the full benefits for query federation. The evaluation is based on 10 simple and 10 complex queries, which address data in 10 major and very popular data sources (e.g., Dugbank, Sider). BioFed is a solution for a single-point-of-access for a large number of SPARQL endpoints providing life science data. It facilitates efficient query generation for data access and provides basic provenance information in combination with the retrieved data. BioFed fully supports SPARQL 1.1 and gives access to the
Dataflow Query Execution in a Parallel Main-Memory Environment
Wilschut, A.N.; Apers, Peter M.G.
1991-01-01
The performance and characteristics of the execution of various join-trees on a parallel DBMS are studied. The results are a step in the direction of the design of a query optimization strategy that is fit for parallel execution of complex queries. Among others, synchronization issues are identified
DISPAQ: Distributed Profitable-Area Query from Big Taxi Trip Data †
Putri, Fadhilah Kurnia; Song, Giltae; Rao, Praveen
2017-01-01
One of the crucial problems for taxi drivers is to efficiently locate passengers in order to increase profits. The rapid advancement and ubiquitous penetration of Internet of Things (IoT) technology into transportation industries enables us to provide taxi drivers with locations that have more potential passengers (more profitable areas) by analyzing and querying taxi trip data. In this paper, we propose a query processing system, called Distributed Profitable-Area Query (DISPAQ) which efficiently identifies profitable areas by exploiting the Apache Software Foundation’s Spark framework and a MongoDB database. DISPAQ first maintains a profitable-area query index (PQ-index) by extracting area summaries and route summaries from raw taxi trip data. It then identifies candidate profitable areas by searching the PQ-index during query processing. Then, it exploits a Z-Skyline algorithm, which is an extension of skyline processing with a Z-order space filling curve, to quickly refine the candidate profitable areas. To improve the performance of distributed query processing, we also propose local Z-Skyline optimization, which reduces the number of dominant tests by distributing killer profitable areas to each cluster node. Through extensive evaluation with real datasets, we demonstrate that our DISPAQ system provides a scalable and efficient solution for processing profitable-area queries from huge amounts of big taxi trip data. PMID:28946679
SIG et risques naturels: le glissement de terrain de Séchilienne (Isère
Directory of Open Access Journals (Sweden)
Jean-Pierre ASTÉ
1993-12-01
Full Text Available À Séchilienne (massif de l’Oisans s’est produit un glissement de terrain qui, selon les experts, peut devenir glissement de versant entier. Leurs scénarios ont été traduits cartographiquement en un SIG qui, malgré ses limites, constitue un premier outil de prise de conscience et d’aide à la décision.
Directory of Open Access Journals (Sweden)
Arafat Rahman Oany
2017-01-01
Full Text Available Shigellosis, a bacillary dysentery, is closely associated with diarrhoea in human and causes infection of 165 million people worldwide per year. Casein-degrading serine protease autotransporter of enterobacteriaceae (SPATE subfamily protein SigA, an outer membrane protein, exerts both cytopathic and enterotoxic effects especially cytopathic to human epithelial cell type-2 (HEp-2 and is shown to be highly immunogenic. In the present study, we have tried to impose the vaccinomics approach for designing a common peptide vaccine candidate against the immunogenic SigA of Shigella spp. At first, 44 SigA proteins from different variants of S. flexneri, S. dysenteriae, S. boydii, and S. sonnei were assessed to find the most antigenic protein. We retrieved 12 peptides based on the highest score for human leukocyte antigen (HLA supertypes analysed by NetCTL. Initially, these peptides were assessed for the affinity with MHC class I and class II alleles, and four potential core epitopes VTARAGLGY, FHTVTVNTL, HTTWTLTGY, and IELAGTLTL were selected. From these, FHTVTVNTL and IELAGTLTL peptides were shown to have 100% conservancy. Finally, IELAGTLTL was shown to have the highest population coverage (83.86% among the whole world population. In vivo study of the proposed epitope might contribute to the development of functional and unique widespread vaccine, which might be an operative alleyway to thwart dysentery from the world.
A new weighted fuzzy grammar on object oriented database queries
Directory of Open Access Journals (Sweden)
Ali Haroonabadi
2012-08-01
Full Text Available The fuzzy object oriented database model is often used to handle the existing imprecise and complicated objects for many real-world applications. The main focus of this paper is on fuzzy queries and tries to analyze a complicated and complex query to get more meaningful and closer responses. The method permits the user to provide the possibility of allocating the weight to various parts of the query, which makes it easier to follow better goals and return the target objects.
A few examples go a long way: Constructing query models from elaborate query formulations
Balog, K.; Weerkamp, W.; de Rijke, M.; Myaeng, S.-H.; Oard, D.W.; Sebastiani, F.; Chua, T.-S.; Leong, M.-K.
2008-01-01
We address a specific enterprise document search scenario, where the information need is expressed in an elaborate manner. In our scenario, information needs are expressed using a short query (of a few keywords) together with examples of key reference pages. Given this setup, we investigate how the
Dataflow Query Execution in a Parallel, Main-memory Environment
Wilschut, A.N.; Apers, Peter M.G.
In this paper, the performance and characteristics of the execution of various join-trees on a parallel DBMS are studied. The results of this study are a step into the direction of the design of a query optimization strategy that is fit for parallel execution of complex queries. Among others,
An XML-Enabled Data Mining Query Language XML-DMQL
Feng, L.; Dillon, T.
2005-01-01
Inspired by the good work of Han et al. (1996) and Elfeky et al. (2001) on the design of data mining query languages for relational and object-oriented databases, in this paper, we develop an expressive XML-enabled data mining query language by extension of XQuery. We first describe some
Study of Query Expansion Techniques and Their Application in the Biomedical Information Retrieval
Directory of Open Access Journals (Sweden)
A. R. Rivas
2014-01-01
retrieval systems. These techniques help to overcome vocabulary mismatch issues by expanding the original query with additional relevant terms and reweighting the terms in the expanded query. In this paper, different text preprocessing and query expansion approaches are combined to improve the documents initially retrieved by a query in a scientific documental database. A corpus belonging to MEDLINE, called Cystic Fibrosis, is used as a knowledge source. Experimental results show that the proposed combinations of techniques greatly enhance the efficiency obtained by traditional queries.
A System for Recommending Rental Properties
Directory of Open Access Journals (Sweden)
Bernard Shibwabo Kasamani
2017-07-01
Full Text Available This paper presents an implementation of recommender technology to online search of rental properties. In particular, the paper uses the preference-based search approach combined with a technique called example-critiquing. Rather than perform a query against the database, this approach prompts the user to express some preferences on rental properties, uses them to construct a preference model for the user, and finally generates a list of properties that best match that preferences. The system is developed as Web application using the Ruby on Rails framework
Directory of Open Access Journals (Sweden)
Wibisono Adianto
2008-08-01
Full Text Available Abstract Background Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome browsers. To perform in silico experimentation conveniently with this genomics data, biologists need tools to process and compare datasets routinely and explore the obtained results interactively. The complexity of such experimentation requires these tools to be based on an e-Science approach, hence generic, modular, and reusable. A virtual laboratory environment with workflows, workflow management systems, and Grid computation are therefore essential. Findings Here we apply an e-Science approach to develop SigWin-detector, a workflow-based tool that can detect significantly enriched windows of (genomic features in a (DNA sequence in a fast and reproducible way. For proof-of-principle, we utilize a biological use case to detect regions of increased and decreased gene expression (RIDGEs and anti-RIDGEs in human transcriptome maps. We improved the original method for RIDGE detection by replacing the costly step of estimation by random sampling with a faster analytical formula for computing the distribution of the null hypothesis being tested and by developing a new algorithm for computing moving medians. SigWin-detector was developed using the WS-VLAM workflow management system and consists of several reusable modules that are linked together in a basic workflow. The configuration of this basic workflow can be adapted to satisfy the requirements of the specific in silico experiment. Conclusion As we show with the results from analyses in the biological use case on RIDGEs, SigWin-detector is an efficient and reusable Grid-based tool for discovering windows enriched for features of a particular type in any sequence of values. Thus, SigWin-detector provides the
Inda, Márcia A; van Batenburg, Marinus F; Roos, Marco; Belloum, Adam S Z; Vasunin, Dmitry; Wibisono, Adianto; van Kampen, Antoine H C; Breit, Timo M
2008-08-08
Chromosome location is often used as a scaffold to organize genomic information in both the living cell and molecular biological research. Thus, ever-increasing amounts of data about genomic features are stored in public databases and can be readily visualized by genome browsers. To perform in silico experimentation conveniently with this genomics data, biologists need tools to process and compare datasets routinely and explore the obtained results interactively. The complexity of such experimentation requires these tools to be based on an e-Science approach, hence generic, modular, and reusable. A virtual laboratory environment with workflows, workflow management systems, and Grid computation are therefore essential. Here we apply an e-Science approach to develop SigWin-detector, a workflow-based tool that can detect significantly enriched windows of (genomic) features in a (DNA) sequence in a fast and reproducible way. For proof-of-principle, we utilize a biological use case to detect regions of increased and decreased gene expression (RIDGEs and anti-RIDGEs) in human transcriptome maps. We improved the original method for RIDGE detection by replacing the costly step of estimation by random sampling with a faster analytical formula for computing the distribution of the null hypothesis being tested and by developing a new algorithm for computing moving medians. SigWin-detector was developed using the WS-VLAM workflow management system and consists of several reusable modules that are linked together in a basic workflow. The configuration of this basic workflow can be adapted to satisfy the requirements of the specific in silico experiment. As we show with the results from analyses in the biological use case on RIDGEs, SigWin-detector is an efficient and reusable Grid-based tool for discovering windows enriched for features of a particular type in any sequence of values. Thus, SigWin-detector provides the proof-of-principle for the modular e-Science based concept
References and arrow notation instead of join operation in query languages
Directory of Open Access Journals (Sweden)
Alexandr Savinov
2012-10-01
Full Text Available We study properties of the join operation in query languages and describe some of its major drawbacks. We provide strong arguments against using joins as a main construct for retrieving related data elements in general purpose query languages and argue for using references instead. Since conventional references are quite restrictive when applied to data modeling and query languages, we propose to use generalized references as they are defined in the concept-oriented model (COM. These references are used by two new operations, called projection and de-projection, which are denoted by right and left arrows and therefore this access method is referred to as arrow notation. We demonstrate advantages of the arrow notation in comparison to joins and argue that it makes queries simpler, more natural, easier to understand, and the whole query writing process more productive and less error-prone.
Sharing-Aware Horizontal Partitioning for Exploiting Correlations during Query Processing
DEFF Research Database (Denmark)
Tzoumas, Kostas; Deshpande, Amol; Jensen, Christian Søndergaard
2010-01-01
Optimization of join queries based on average selectivities is suboptimal in highly correlated databases. In such databases, relations are naturally divided into partitions, each partition having substantially different statistical characteristics. It is very compelling to discover such data...... partitions during query optimization and create multiple plans for a given query, one plan being optimal for a particular combination of data partitions. This scenario calls for the sharing of state among plans, so that common intermediate results are not recomputed. We study this problem in a setting...
Processing Incomplete Query Specifications in a Context-Dependent Reasoning Framework
Directory of Open Access Journals (Sweden)
Neli P. Zlatareva
2013-04-01
Full Text Available Search is the most prominent web service, which is about to change dramatically with the transition to the Semantic Web. Semantic Web applications are expected to deal with complex conjunctive queries, and not always such queries can be completely and precisely defined. Current Semantic Web reasoners built upon Description Logics have limited processing power in such environments. We discuss some of their limitations, and show how an alternative logical framework utilizing context-dependent rules can be extended to handle incomplete or imprecise query specifications.
A Typed Text Retrieval Query Language for XML Documents.
Colazzo, Dario; Sartiani, Carlo; Albano, Antonio; Manghi, Paolo; Ghelli, Giorgio; Lini, Luca; Paoli, Michele
2002-01-01
Discussion of XML focuses on a description of Tequyla-TX, a typed text retrieval query language for XML documents that can search on both content and structures. Highlights include motivations; numerous examples; word-based and char-based searches; tag-dependent full-text searches; text normalization; query algebra; data models and term language;…
Directory of Open Access Journals (Sweden)
Hyung-Ju Cho
2012-01-01
Full Text Available Given two positive parameters k and r, a constrained k-nearest neighbor (CkNN query returns the k closest objects within a network distance r of the query location in road networks. In terms of the scalability of monitoring these CkNN queries, existing solutions based on central processing at a server suffer from a sudden and sharp rise in server load as well as messaging cost as the number of queries increases. In this paper, we propose a distributed and scalable scheme called DAEMON for the continuous monitoring of CkNN queries in road networks. Our query processing is distributed among clients (query objects and server. Specifically, the server evaluates CkNN queries issued at intersections of road segments, retrieves the objects on the road segments between neighboring intersections, and sends responses to the query objects. Finally, each client makes its own query result using this server response. As a result, our distributed scheme achieves close-to-optimal communication costs and scales well to large numbers of monitoring queries. Exhaustive experimental results demonstrate that our scheme substantially outperforms its competitor in terms of query processing time and messaging cost.
Application of Machine Learning Algorithms for the Query Performance Prediction
Directory of Open Access Journals (Sweden)
MILICEVIC, M.
2015-08-01
Full Text Available This paper analyzes the relationship between the system load/throughput and the query response time in a real Online transaction processing (OLTP system environment. Although OLTP systems are characterized by short transactions, which normally entail high availability and consistent short response times, the need for operational reporting may jeopardize these objectives. We suggest a new approach to performance prediction for concurrent database workloads, based on the system state vector which consists of 36 attributes. There is no bias to the importance of certain attributes, but the machine learning methods are used to determine which attributes better describe the behavior of the particular database server and how to model that system. During the learning phase, the system's profile is created using multiple reference queries, which are selected to represent frequent business processes. The possibility of the accurate response time prediction may be a foundation for automated decision-making for database (DB query scheduling. Possible applications of the proposed method include adaptive resource allocation, quality of service (QoS management or real-time dynamic query scheduling (e.g. estimation of the optimal moment for a complex query execution.
An Approach to Assist Designers With Their Queries and Designs
DEFF Research Database (Denmark)
Ahmed, Saeema
2006-01-01
Recent research investigating how engineers search for information has concluded that engineering designers acquire assistance when formulating queries. An approach to assist designers with their queries is presented. This approach forms part of a knowledge management system, where indexed...... documents are entered into the system (or are automatically indexed by tools within a system). The method builds up a network based upon indices assigned to documents. The network (or chunk) is presented back to the user once a search for knowledge has been completed. The network is build up as indexed...... documents are entered in to a knowledge-based system and is generated dynamically. The network can be used to assist a designer in searching for information; reformulating a query and; to prompt design tasks. This paper presents an approach to prompt designers with their design queries, along with some...
Indexing, Query Processing, and Clustering of Spatio-Temporal Text Objects
DEFF Research Database (Denmark)
Skovsgaard, Anders
With the increasing mobile use of the web from geo-positioned devices, the Internet is increasingly acquiring a spatial aspect, with still more types of content being geo-tagged. As a result of this development, a wide range of location-aware queries and applications have emerged. The large amounts...... of data available coupled with the increasing number of location-aware queries calls for efficient indexing and query processing techniques. This dissertation investigates how to manage geo-tagged text content to support these workloads in three specific areas: (i) grouping of spatio-textual objects, (ii......, the grouping of spatio-textual objects is done without considering query locations, and a clustering approach is proposed that takes into account both the spatial and textual attributes of the objects. The technique expands clusters based on a proposed quality function that enables clusters of arbitrary shape...
Object-Oriented Query Language For Events Detection From Images Sequences
Ganea, Ion Eugen
2015-09-01
In this paper is presented a method to represent the events extracted from images sequences and the query language used for events detection. Using an object oriented model the spatial and temporal relationships between salient objects and also between events are stored and queried. This works aims to unify the storing and querying phases for video events processing. The object oriented language syntax used for events processing allow the instantiation of the indexes classes in order to improve the accuracy of the query results. The experiments were performed on images sequences provided from sport domain and it shows the reliability and the robustness of the proposed language. To extend the language will be added a specific syntax for constructing the templates for abnormal events and for detection of the incidents as the final goal of the research.
Inductive queries for a drug designing robot scientist
King, Ross D.; Schierz, Amanda; Clare, Amanda; Rowland, Jem; Sparkes, Andrew; Nijssen, Siegfried; Ramon, Jan
2010-01-01
It is increasingly clear that machine learning algorithms need to be integrated in an iterative scientific discovery loop, in which data is queried repeatedly by means of inductive queries and where the computer provides guidance to the experiments that are being performed. In this chapter, we summarise several key challenges in achieving this integration of machine learning and data mining algorithms in methods for the discovery of Quantitative Structure Activity Relationships (QSARs). We in...
Four queries concerning the metaphysics of early human embryogenesis.
Howsepian, A A
2008-04-01
In this essay, I attempt to provide answers to the following four queries concerning the metaphysics of early human embryogenesis. (1) Following its first cellular fission, is it coherent to claim that one and only one of two "blastomeric" twins of a human zygote is identical with that zygote? (2) Following the fusion of two human pre-embryos, is it coherent to claim that one and only one pre-fusion pre-embryo is identical with that postfusion pre-embryo? (3) Does a live human being come into existence only when its brain comes into existence? (4) At implantation, does a pre-embryo become a mere part of its mother? I argue that either if things have quidditative properties or if criterialism is false, then queries (1) and (2) can be answered in the affirmative; that in light of recent developments in theories of human death and in light of a more "functional" theory of brains, query (3) can be answered in the negative; and that plausible mereological principles require a negative answer to query (4).
QuerySpaces on Hadoop for the ATLAS EventIndex
Hrivnac, Julius; The ATLAS collaboration; Cranshaw, Jack; Favareto, Andrea; Prokoshin, Fedor; Glasman, Claudia; Toebbicke, Rainer
2015-01-01
A Hadoop-based implementation of the adaptive query engine serving as the back-end for the ATLAS EventIndex. The QuerySpaces implementation handles both original data and search results providing fast and efficient mechanisms for new user queries using already accumulated knowledge for optimization. Detailed descriptions and statistics about user requests are collected in HBase tables and HDFS files. Requests are associated to their results and a graph of relations between them is created to be used to find the most efficient way of providing answers to new requests The environment is completely transparent to users and is accessible over several command-line interfaces, a Web Service and a programming API.
Improving the Usability of OCL as an Ad-hoc Model Querying Language
DEFF Research Database (Denmark)
Störrle, Harald
2013-01-01
from our research and make it accessible to the OCL community, we propose the OCL Query API (OQAPI), a library of query-predicates to improve the user-friendliness of OCL for ad-hoc querying. The usability of OQAPI is studied using controlled experiments. We nd considerable evidence to support our...
Directory of Open Access Journals (Sweden)
María Victoria Zapata Pardo
1999-05-01
Full Text Available El objetivo de este proyecto fue generar una herramienta para mejorar el manejo, conservación y administración del Parque Nacional Natural Farallones de Cali adscrito a la Unidad Administrativa Especial del Sistema de Parques Nacionales Naturales, UAESPNN, dependiente del Ministerio del Medio Ambiente. Con este propósito se implementó un sistema de información geográfica, SIG, como modelo metodológico. El SIG Farallones de CaIi utilizó una base de datos relacional, desarroUada con el software ACCESTM, compatible con los SIG utilizados ARC/INFOTM y ARCI/VIEWTM (para estación de trabajo. Los datos espaciales accesados a la base de datos fueron los de topografía, hidrología, zonas de vida Holdridge, geología, limite, frentes, zonificación con fmes de manejo, precipitación, ocupación indígena, veredas y corregimientos; los cuales contaron con información alfanumérica relacionada, que abarcaba el manejo administrativo, socioeconómíco y tísico entre otros.
Memory-Aware Query Routing in Interactive Web-based Information Systems
F. Waas; M.L. Kersten (Martin)
2001-01-01
textabstractQuery throughput is one of the primary optimization goals in interactive web-based information systems in order to achieve the performance necessary to serve large user communities. Queries in this application domain differ significantly from those in traditional database applications:
Schedule Sales Query Report Generation System
General Services Administration — Schedule Sales Query presents sales volume figures as reported to GSA by contractors. The reports are generated as quarterly reports for the current year and the...
Query Classification and Study of University Students' Search Trends
Maabreh, Majdi A.; Al-Kabi, Mohammed N.; Alsmadi, Izzat M.
2012-01-01
Purpose: This study is an attempt to develop an automatic identification method for Arabic web queries and divide them into several query types using data mining. In addition, it seeks to evaluate the impact of the academic environment on using the internet. Design/methodology/approach: The web log files were collected from one of the higher…
Linked data querying through FCA-based schema indexing
Brosius, Dominik; Staab, Steffen
2016-01-01
The effciency of SPARQL query evaluation against Linked Open Data may benefit from schema-based indexing. However, many data items come with incomplete schema information or lack schema descriptions entirely. In this position paper, we outline an approach to an indexing of linked data graphs based on schemata induced through Formal Concept Analysis. We show how to map queries onto RDF graphs based on such derived schema information. We sketch next steps for realizing and optimizing the sugges...
Towards Intelligible Query Processing in Relevance Feedback-Based Image Retrieval Systems
Mohammed, Belkhatir
2008-01-01
We have specified within the scope of this paper a framework combining semantics and relational (spatial) characterizations within a coupled architecture in order to address the semantic gap. This framework is instantiated by an operational model based on a sound logic-based formalism, allowing to define a representation for image documents and a matching function to compare index and query structures. We have specified a query framework coupling keyword-based querying with a relevance feedba...
A Modular Design for Geo-Distributed Querying : Work in Progress Report
Vasilas , Dimitrios; Shapiro , Marc; King , Bradley
2018-01-01
International audience; Most distributed storage systems provide limited abilities for querying data by attributes other than their primary keys. Supporting efficient search on secondary attributes is challenging as applications pose varying requirements to query processing systems, and no single system design can be suitable for all needs. In this paper, we show how to overcome these challenges in order to extend distributed data stores to support queries on secondary attributes. We propose ...
Automatically Preparing Safe SQL Queries
Bisht, Prithvi; Sistla, A. Prasad; Venkatakrishnan, V. N.
We present the first sound program source transformation approach for automatically transforming the code of a legacy web application to employ PREPARE statements in place of unsafe SQL queries. Our approach therefore opens the way for eradicating the SQL injection threat vector from legacy web applications.
Efficient Verifiable Range and Closest Point Queries in Zero-Knowledge
Directory of Open Access Journals (Sweden)
Ghosh Esha
2016-10-01
Full Text Available We present an efficient method for answering one-dimensional range and closest-point queries in a verifiable and privacy-preserving manner. We consider a model where a data owner outsources a dataset of key-value pairs to a server, who answers range and closest-point queries issued by a client and provides proofs of the answers. The client verifies the correctness of the answers while learning nothing about the dataset besides the answers to the current and previous queries. Our work yields for the first time a zero-knowledge privacy assurance to authenticated range and closest-point queries. Previous work leaked the size of the dataset and used an inefficient proof protocol. Our construction is based on hierarchical identity-based encryption. We prove its security and analyze its efficiency both theoretically and with experiments on synthetic and real data (Enron email and Boston taxi datasets.
Towards Hybrid Online On-Demand Querying of Realtime Data with Stateful Complex Event Processing
Energy Technology Data Exchange (ETDEWEB)
Zhou, Qunzhi; Simmhan, Yogesh; Prasanna, Viktor K.
2013-10-09
Emerging Big Data applications in areas like e-commerce and energy industry require both online and on-demand queries to be performed over vast and fast data arriving as streams. These present novel challenges to Big Data management systems. Complex Event Processing (CEP) is recognized as a high performance online query scheme which in particular deals with the velocity aspect of the 3-V’s of Big Data. However, traditional CEP systems do not consider data variety and lack the capability to embed ad hoc queries over the volume of data streams. In this paper, we propose H2O, a stateful complex event processing framework, to support hybrid online and on-demand queries over realtime data. We propose a semantically enriched event and query model to address data variety. A formal query algebra is developed to precisely capture the stateful and containment semantics of online and on-demand queries. We describe techniques to achieve the interactive query processing over realtime data featured by efficient online querying, dynamic stream data persistence and on-demand access. The system architecture is presented and the current implementation status reported.
Feary, Michael; Palanque, Philippe; Martinie, Célia; Tscheligi, Manfred
2016-01-01
This SIG focuses on the engineering of automation in interactive critical systems. Automation has already been studied in a number of (sub-) disciplines and application fields: design, human factors, psychology, (software) engineering, aviation, health care, games. One distinguishing feature of the area we are focusing on is that in the field of interactive critical systems properties such as reliability, dependability, fault tolerance are as important as usability, user experience or overall acceptance issues. The SIG targets at two problem areas: first the engineering of the user interaction with (partly-) autonomous systems: how to design, build and assess autonomous behavior, especially in cases where there is a need to represent on the user interface both autonomous and interactive objects. An example of such integration is the representation of an unmanned aerial vehicle (UAV) (where no direct interaction is possible), together with aircrafts (that have to be instructed by an air traffic controller to avoid the UAV). Second the design and engineering of user interaction in general for autonomous objects/systems (for example a cruise control in a car or an autopilot in an aircraft). The goal of the SIG is to raise interest in the CHI community on the general aspects of automation and to identify a community of researchers and practitioners interested in those increasingly prominent issues of interfaces towards (semi)-autonomous systems. The expected audience should be interested in addressing the issues of integration of mainly unconnected research domains to formulate a new joint research agenda.
Fragger: a protein fragment picker for structural queries [version 2; referees: 2 approved
Directory of Open Access Journals (Sweden)
Francois Berenger
2018-04-01
Full Text Available Protein modeling and design activities often require querying the Protein Data Bank (PDB with a structural fragment, possibly containing gaps. For some applications, it is preferable to work on a specific subset of the PDB or with unpublished structures. These requirements, along with specific user needs, motivated the creation of a new software to manage and query 3D protein fragments. Fragger is a protein fragment picker that allows protein fragment databases to be created and queried. All fragment lengths are supported and any set of PDB files can be used to create a database. Fragger can efficiently search a fragment database with a query fragment and a distance threshold. Matching fragments are ranked by distance to the query. The query fragment can have structural gaps and the allowed amino acid sequences matching a query can be constrained via a regular expression of one-letter amino acid codes. Fragger also incorporates a tool to compute the backbone RMSD of one versus many fragments in high throughput. Fragger should be useful for protein design, loop grafting and related structural bioinformatics tasks.
A web-based data-querying tool based on ontology-driven methodology and flowchart-based model.
Ping, Xiao-Ou; Chung, Yufang; Tseng, Yi-Ju; Liang, Ja-Der; Yang, Pei-Ming; Huang, Guan-Tarn; Lai, Feipei
2013-10-08
Because of the increased adoption rate of electronic medical record (EMR) systems, more health care records have been increasingly accumulating in clinical data repositories. Therefore, querying the data stored in these repositories is crucial for retrieving the knowledge from such large volumes of clinical data. The aim of this study is to develop a Web-based approach for enriching the capabilities of the data-querying system along the three following considerations: (1) the interface design used for query formulation, (2) the representation of query results, and (3) the models used for formulating query criteria. The Guideline Interchange Format version 3.5 (GLIF3.5), an ontology-driven clinical guideline representation language, was used for formulating the query tasks based on the GLIF3.5 flowchart in the Protégé environment. The flowchart-based data-querying model (FBDQM) query execution engine was developed and implemented for executing queries and presenting the results through a visual and graphical interface. To examine a broad variety of patient data, the clinical data generator was implemented to automatically generate the clinical data in the repository, and the generated data, thereby, were employed to evaluate the system. The accuracy and time performance of the system for three medical query tasks relevant to liver cancer were evaluated based on the clinical data generator in the experiments with varying numbers of patients. In this study, a prototype system was developed to test the feasibility of applying a methodology for building a query execution engine using FBDQMs by formulating query tasks using the existing GLIF. The FBDQM-based query execution engine was used to successfully retrieve the clinical data based on the query tasks formatted using the GLIF3.5 in the experiments with varying numbers of patients. The accuracy of the three queries (ie, "degree of liver damage," "degree of liver damage when applying a mutually exclusive setting
Evolutionary Multiobjective Query Workload Optimization of Cloud Data Warehouses
Dokeroglu, Tansel; Sert, Seyyit Alper; Cinar, Muhammet Serkan
2014-01-01
With the advent of Cloud databases, query optimizers need to find paretooptimal solutions in terms of response time and monetary cost. Our novel approach minimizes both objectives by deploying alternative virtual resources and query plans making use of the virtual resource elasticity of the Cloud. We propose an exact multiobjective branch-and-bound and a robust multiobjective genetic algorithm for the optimization of distributed data warehouse query workloads on the Cloud. In order to investigate the effectiveness of our approach, we incorporate the devised algorithms into a prototype system. Finally, through several experiments that we have conducted with different workloads and virtual resource configurations, we conclude remarkable findings of alternative deployments as well as the advantages and disadvantages of the multiobjective algorithms we propose. PMID:24892048
An Application of Multivariate Statistical Analysis for Query-Driven Visualization
Energy Technology Data Exchange (ETDEWEB)
Gosink, Luke J. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Garth, Christoph [Univ. of California, Davis, CA (United States); Anderson, John C. [Univ. of California, Davis, CA (United States); Bethel, E. Wes [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Joy, Kenneth I. [Univ. of California, Davis, CA (United States)
2011-03-01
Driven by the ability to generate ever-larger, increasingly complex data, there is an urgent need in the scientific community for scalable analysis methods that can rapidly identify salient trends in scientific data. Query-Driven Visualization (QDV) strategies are among the small subset of techniques that can address both large and highly complex datasets. This paper extends the utility of QDV strategies with a statistics-based framework that integrates non-parametric distribution estimation techniques with a new segmentation strategy to visually identify statistically significant trends and features within the solution space of a query. In this framework, query distribution estimates help users to interactively explore their query's solution and visually identify the regions where the combined behavior of constrained variables is most important, statistically, to their inquiry. Our new segmentation strategy extends the distribution estimation analysis by visually conveying the individual importance of each variable to these regions of high statistical significance. We demonstrate the analysis benefits these two strategies provide and show how they may be used to facilitate the refinement of constraints over variables expressed in a user's query. We apply our method to datasets from two different scientific domains to demonstrate its broad applicability.
DEFF Research Database (Denmark)
Wilkowski, Bartlomiej; Szewczyk, Marcin Marek; Rasmussen, Peter Mondrup
2010-01-01
Query offers a direct link from SPM to the Brede Database coordinate-based search engine. BredeQuery is able to ‘grab’ brain location coordinates from the SPM windows and enter them as a query for the Brede Database. Moreover, results of the query can be displayed in a MATLAB window and/or exported directly...
A Database Query Processing Model in Peer-To-Peer Network ...
African Journals Online (AJOL)
Peer-to-peer databases are becoming more prevalent on the internet for sharing and distributing applications, documents, files, and other digital media. The problem associated with answering large-scale ad hoc analysis queries, aggregation queries, on these databases poses unique challenges. This paper presents an ...
Clean Air Markets - Allowances Query Wizard
U.S. Environmental Protection Agency — The Allowances Query Wizard is part of a suite of Clean Air Markets-related tools that are accessible at http://camddataandmaps.epa.gov/gdm/index.cfm. The Allowances...
Clean Air Markets - Compliance Query Wizard
U.S. Environmental Protection Agency — The Compliance Query Wizard is part of a suite of Clean Air Markets-related tools that are accessible at http://ampd.epa.gov/ampd/. The Compliance module provides...
Chung, EunKyung; Yoon, JungWon
2009-01-01
Introduction: The purpose of this study is to compare characteristics and features of user supplied tags and search query terms for images on the "Flickr" Website in terms of categories of pictorial meanings and level of term specificity. Method: This study focuses on comparisons between tags and search queries using Shatford's categorization…
Linking Health Records for Federated Query Processing
Directory of Open Access Journals (Sweden)
Dewri Rinku
2016-07-01
Full Text Available A federated query portal in an electronic health record infrastructure enables large epidemiology studies by combining data from geographically dispersed medical institutions. However, an individual’s health record has been found to be distributed across multiple carrier databases in local settings. Privacy regulations may prohibit a data source from revealing clear text identifiers, thereby making it non-trivial for a query aggregator to determine which records correspond to the same underlying individual. In this paper, we explore this problem of privately detecting and tracking the health records of an individual in a distributed infrastructure. We begin with a secure set intersection protocol based on commutative encryption, and show how to make it practical on comparison spaces as large as 1010 pairs. Using bigram matching, precomputed tables, and data parallelism, we successfully reduced the execution time to a matter of minutes, while retaining a high degree of accuracy even in records with data entry errors. We also propose techniques to prevent the inference of identifier information when knowledge of underlying data distributions is known to an adversary. Finally, we discuss how records can be tracked utilizing the detection results during query processing.
Query-Biased Preview over Outsourced and Encrypted Data
Directory of Open Access Journals (Sweden)
Ningduo Peng
2013-01-01
document to check if it contains the desired content. An informative query-biased preview feature, as applied in modern search engine, could help the users to learn about the content without downloading the entire document. However, when the data are encrypted, securely extracting a keyword-in-context snippet from the data as a preview becomes a challenge. Based on private information retrieval protocol and the core concept of searchable encryption, we propose a single-server and two-round solution to securely obtain a query-biased snippet over the encrypted data from the server. We achieve this novel result by making a document (plaintext previewable under any cryptosystem and constructing a secure index to support dynamic computation for a best matched snippet when queried by some keywords. For each document, the scheme has O(d storage complexity and O(log(d/s+s+d/s communication complexity, where d is the document size and s is the snippet length.
Using search engine query data to track pharmaceutical utilization: a study of statins.
Schuster, Nathaniel M; Rogers, Mary A M; McMahon, Laurence F
2010-08-01
To examine temporal and geographic associations between Google queries for health information and healthcare utilization benchmarks. Retrospective longitudinal study. Using Google Trends and Google Insights for Search data, the search terms Lipitor (atorvastatin calcium; Pfizer, Ann Arbor, MI) and simvastatin were evaluated for change over time and for association with Lipitor revenues. The relationship between query data and community-based resource use per Medicare beneficiary was assessed for 35 US metropolitan areas. Google queries for Lipitor significantly decreased from January 2004 through June 2009 and queries for simvastatin significantly increased (P patent (P global revenues from 2004 to 2008 (P search engine queries for medical information correlate with pharmaceutical revenue and with overall healthcare utilization in a community. This suggests that search query data can track community-wide characteristics in healthcare utilization and have the potential for informing payers and policy makers regarding trends in utilization.
a Novel Approach of Indexing and Retrieving Spatial Polygons for Efficient Spatial Region Queries
Zhao, J. H.; Wang, X. Z.; Wang, F. Y.; Shen, Z. H.; Zhou, Y. C.; Wang, Y. L.
2017-10-01
Spatial region queries are more and more widely used in web-based applications. Mechanisms to provide efficient query processing over geospatial data are essential. However, due to the massive geospatial data volume, heavy geometric computation, and high access concurrency, it is difficult to get response in real time. Spatial indexes are usually used in this situation. In this paper, based on k-d tree, we introduce a distributed KD-Tree (DKD-Tree) suitbable for polygon data, and a two-step query algorithm. The spatial index construction is recursive and iterative, and the query is an in memory process. Both the index and query methods can be processed in parallel, and are implemented based on HDFS, Spark and Redis. Experiments on a large volume of Remote Sensing images metadata have been carried out, and the advantages of our method are investigated by comparing with spatial region queries executed on PostgreSQL and PostGIS. Results show that our approach not only greatly improves the efficiency of spatial region query, but also has good scalability, Moreover, the two-step spatial range query algorithm can also save cluster resources to support a large number of concurrent queries. Therefore, this method is very useful when building large geographic information systems.
A NOVEL APPROACH OF INDEXING AND RETRIEVING SPATIAL POLYGONS FOR EFFICIENT SPATIAL REGION QUERIES
Directory of Open Access Journals (Sweden)
J. H. Zhao
2017-10-01
Full Text Available Spatial region queries are more and more widely used in web-based applications. Mechanisms to provide efficient query processing over geospatial data are essential. However, due to the massive geospatial data volume, heavy geometric computation, and high access concurrency, it is difficult to get response in real time. Spatial indexes are usually used in this situation. In this paper, based on k-d tree, we introduce a distributed KD-Tree (DKD-Tree suitbable for polygon data, and a two-step query algorithm. The spatial index construction is recursive and iterative, and the query is an in memory process. Both the index and query methods can be processed in parallel, and are implemented based on HDFS, Spark and Redis. Experiments on a large volume of Remote Sensing images metadata have been carried out, and the advantages of our method are investigated by comparing with spatial region queries executed on PostgreSQL and PostGIS. Results show that our approach not only greatly improves the efficiency of spatial region query, but also has good scalability, Moreover, the two-step spatial range query algorithm can also save cluster resources to support a large number of concurrent queries. Therefore, this method is very useful when building large geographic information systems.
Vectorization vs. compilation in query execution
J. Sompolski (Juliusz); M. Zukowski (Marcin); P.A. Boncz (Peter)
2011-01-01
textabstractCompiling database queries into executable (sub-) programs provides substantial benefits comparing to traditional interpreted execution. Many of these benefits, such as reduced interpretation overhead, better instruction code locality, and providing opportunities to use SIMD
A Query System Implementation Case Study.
Hiser, Judith N.; Neil, M. Elizabeth
1985-01-01
The Department of Administrative Programming Services of Clemson University investigated products available in user-friendly retrieval systems. The test of INTELLECT, a natural language query system written by Artifical Intelligence Corporation, is described. (Author/MLW)
Querying Large Biological Network Datasets
Gulsoy, Gunhan
2013-01-01
New experimental methods has resulted in increasing amount of genetic interaction data to be generated every day. Biological networks are used to store genetic interaction data gathered. Increasing amount of data available requires fast large scale analysis methods. Therefore, we address the problem of querying large biological network datasets.…
Abdulla, Ahmed AbdoAziz Ahmed; Lin, Hongfei; Xu, Bo; Banbhrani, Santosh Kumar
2016-07-25
Biomedical literature retrieval is becoming increasingly complex, and there is a fundamental need for advanced information retrieval systems. Information Retrieval (IR) programs scour unstructured materials such as text documents in large reserves of data that are usually stored on computers. IR is related to the representation, storage, and organization of information items, as well as to access. In IR one of the main problems is to determine which documents are relevant and which are not to the user's needs. Under the current regime, users cannot precisely construct queries in an accurate way to retrieve particular pieces of data from large reserves of data. Basic information retrieval systems are producing low-quality search results. In our proposed system for this paper we present a new technique to refine Information Retrieval searches to better represent the user's information need in order to enhance the performance of information retrieval by using different query expansion techniques and apply a linear combinations between them, where the combinations was linearly between two expansion results at one time. Query expansions expand the search query, for example, by finding synonyms and reweighting original terms. They provide significantly more focused, particularized search results than do basic search queries. The retrieval performance is measured by some variants of MAP (Mean Average Precision) and according to our experimental results, the combination of best results of query expansion is enhanced the retrieved documents and outperforms our baseline by 21.06 %, even it outperforms a previous study by 7.12 %. We propose several query expansion techniques and their combinations (linearly) to make user queries more cognizable to search engines and to produce higher-quality search results.
An algorithm to transform natural language into SQL queries for relational databases
Directory of Open Access Journals (Sweden)
Garima Singh
2016-09-01
Full Text Available Intelligent interface, to enhance efficient interactions between user and databases, is the need of the database applications. Databases must be intelligent enough to make the accessibility faster. However, not every user familiar with the Structured Query Language (SQL queries as they may not aware of structure of the database and they thus require to learn SQL. So, non-expert users need a system to interact with relational databases in their natural language such as English. For this, Database Management System (DBMS must have an ability to understand Natural Language (NL. In this research, an intelligent interface is developed using semantic matching technique which translates natural language query to SQL using set of production rules and data dictionary. The data dictionary consists of semantics sets for relations and attributes. A series of steps like lower case conversion, tokenization, speech tagging, database element and SQL element extraction is used to convert Natural Language Query (NLQ to SQL Query. The transformed query is executed and the results are obtained by the user. Intelligent Interface is the need of database applications to enhance efficient interaction between user and DBMS.
Jung, HaRim; Song, MoonBae; Youn, Hee Yong; Kim, Ung Mo
2015-09-18
A content-matched (CM) rangemonitoring query overmoving objects continually retrieves the moving objects (i) whose non-spatial attribute values are matched to given non-spatial query values; and (ii) that are currently located within a given spatial query range. In this paper, we propose a new query indexing structure, called the group-aware query region tree (GQR-tree) for efficient evaluation of CMrange monitoring queries. The primary role of the GQR-tree is to help the server leverage the computational capabilities of moving objects in order to improve the system performance in terms of the wireless communication cost and server workload. Through a series of comprehensive simulations, we verify the superiority of the GQR-tree method over the existing methods.
Directory of Open Access Journals (Sweden)
HaRim Jung
2015-09-01
Full Text Available A content-matched (CM rangemonitoring query overmoving objects continually retrieves the moving objects (i whose non-spatial attribute values are matched to given non-spatial query values; and (ii that are currently located within a given spatial query range. In this paper, we propose a new query indexing structure, called the group-aware query region tree (GQR-tree for efficient evaluation of CMrange monitoring queries. The primary role of the GQR-tree is to help the server leverage the computational capabilities of moving objects in order to improve the system performance in terms of the wireless communication cost and server workload. Through a series of comprehensive simulations, we verify the superiority of the GQR-tree method over the existing methods.
An Experimental Investigation of Complexity in Database Query Formulation Tasks
Casterella, Gretchen Irwin; Vijayasarathy, Leo
2013-01-01
Information Technology professionals and other knowledge workers rely on their ability to extract data from organizational databases to respond to business questions and support decision making. Structured query language (SQL) is the standard programming language for querying data in relational databases, and SQL skills are in high demand and are…
Research on presentation and query service of geo-spatial data based on ontology
Li, Hong-wei; Li, Qin-chao; Cai, Chang
2008-10-01
The paper analyzed the deficiency on presentation and query of geo-spatial data existed in current GIS, discussed the advantages that ontology possessed in formalization of geo-spatial data and the presentation of semantic granularity, taken land-use classification system as an example to construct domain ontology, and described it by OWL; realized the grade level and category presentation of land-use data benefited from the thoughts of vertical and horizontal navigation; and then discussed query mode of geo-spatial data based on ontology, including data query based on types and grade levels, instances and spatial relation, and synthetic query based on types and instances; these methods enriched query mode of current GIS, and is a useful attempt; point out that the key point of the presentation and query of spatial data based on ontology is to construct domain ontology that can correctly reflect geo-concept and its spatial relation and realize its fine formalization description.
VIGOR: Interactive Visual Exploration of Graph Query Results.
Pienta, Robert; Hohman, Fred; Endert, Alex; Tamersoy, Acar; Roundy, Kevin; Gates, Chris; Navathe, Shamkant; Chau, Duen Horng
2018-01-01
Finding patterns in graphs has become a vital challenge in many domains from biological systems, network security, to finance (e.g., finding money laundering rings of bankers and business owners). While there is significant interest in graph databases and querying techniques, less research has focused on helping analysts make sense of underlying patterns within a group of subgraph results. Visualizing graph query results is challenging, requiring effective summarization of a large number of subgraphs, each having potentially shared node-values, rich node features, and flexible structure across queries. We present VIGOR, a novel interactive visual analytics system, for exploring and making sense of query results. VIGOR uses multiple coordinated views, leveraging different data representations and organizations to streamline analysts sensemaking process. VIGOR contributes: (1) an exemplar-based interaction technique, where an analyst starts with a specific result and relaxes constraints to find other similar results or starts with only the structure (i.e., without node value constraints), and adds constraints to narrow in on specific results; and (2) a novel feature-aware subgraph result summarization. Through a collaboration with Symantec, we demonstrate how VIGOR helps tackle real-world problems through the discovery of security blindspots in a cybersecurity dataset with over 11,000 incidents. We also evaluate VIGOR with a within-subjects study, demonstrating VIGOR's ease of use over a leading graph database management system, and its ability to help analysts understand their results at higher speed and make fewer errors.
Towards Building a High Performance Spatial Query System for Large Scale Medical Imaging Data.
Aji, Ablimit; Wang, Fusheng; Saltz, Joel H
2012-11-06
Support of high performance queries on large volumes of scientific spatial data is becoming increasingly important in many applications. This growth is driven by not only geospatial problems in numerous fields, but also emerging scientific applications that are increasingly data- and compute-intensive. For example, digital pathology imaging has become an emerging field during the past decade, where examination of high resolution images of human tissue specimens enables more effective diagnosis, prediction and treatment of diseases. Systematic analysis of large-scale pathology images generates tremendous amounts of spatially derived quantifications of micro-anatomic objects, such as nuclei, blood vessels, and tissue regions. Analytical pathology imaging provides high potential to support image based computer aided diagnosis. One major requirement for this is effective querying of such enormous amount of data with fast response, which is faced with two major challenges: the "big data" challenge and the high computation complexity. In this paper, we present our work towards building a high performance spatial query system for querying massive spatial data on MapReduce. Our framework takes an on demand index building approach for processing spatial queries and a partition-merge approach for building parallel spatial query pipelines, which fits nicely with the computing model of MapReduce. We demonstrate our framework on supporting multi-way spatial joins for algorithm evaluation and nearest neighbor queries for microanatomic objects. To reduce query response time, we propose cost based query optimization to mitigate the effect of data skew. Our experiments show that the framework can efficiently support complex analytical spatial queries on MapReduce.
Real SQL queries 50 challenges : practice for reporting and analysis
Cohen, Brian; Mishra, Neerja
2015-01-01
Queries improve when challenges are authentic. This book sets your learning on the fast track with realistic problems to solve. Topics span sales, marketing, human resources, purchasing, and production. Real SQL Queries: 50 Challenges is perfect for analysts, report writers, or anyone searching for a hands-on approach to learning SQL Server.
Des SIG-P pour sauver le complexe forestier de la basse rivière Tana?
International Development Research Centre (IDRC) Digital Library (Canada)
L'objectif général de ce programme panafricain est de contribuer à rendre disponibles des systèmes d'information de bonne qualité, fiables et accessibles grâce à l'usage de SIG-P en vue d'améliorer la gestion des ressources naturelles (eau, terres, forêts, etc.) et de promouvoir la sécurité alimentaire. Le programme ...
On a Fuzzy Algebra for Querying Graph Databases
Pivert , Olivier; Thion , Virginie; Jaudoin , Hélène; Smits , Grégory
2014-01-01
International audience; This paper proposes a notion of fuzzy graph database and describes a fuzzy query algebra that makes it possible to handle such database, which may be fuzzy or not, in a flexible way. The algebra, based on fuzzy set theory and the concept of a fuzzy graph, is composed of a set of operators that can be used to express preference queries on fuzzy graph databases. The preferences concern i) the content of the vertices of the graph and ii) the structure of the graph. In a s...
Application of discriminative models for interactive query refinement in video retrieval
Srivastava, Amit; Khanwalkar, Saurabh; Kumar, Anoop
2013-12-01
The ability to quickly search for large volumes of videos for specific actions or events can provide a dramatic new capability to intelligence agencies. Example-based queries from video are a form of content-based information retrieval (CBIR) where the objective is to retrieve clips from a video corpus, or stream, using a representative query sample to find more like this. Often, the accuracy of video retrieval is largely limited by the gap between the available video descriptors and the underlying query concept, and such exemplar queries return many irrelevant results with relevant ones. In this paper, we present an Interactive Query Refinement (IQR) system which acts as a powerful tool to leverage human feedback and allow intelligence analyst to iteratively refine search queries for improved precision in the retrieved results. In our approach to IQR, we leverage discriminative models that operate on high dimensional features derived from low-level video descriptors in an iterative framework. Our IQR model solicits relevance feedback on examples selected from the region of uncertainty and updates the discriminating boundary to produce a relevance ranked results list. We achieved 358% relative improvement in Mean Average Precision (MAP) over initial retrieval list at a rank cutoff of 100 over 4 iterations. We compare our discriminative IQR model approach to a naïve IQR and show our model-based approach yields 49% relative improvement over the no model naïve system.
Advances in nowcasting influenza-like illness rates using search query logs
Lampos, Vasileios; Miller, Andrew C.; Crossan, Steve; Stefansen, Christian
2015-08-01
User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various places all over the world. However, the original model significantly mispredicted influenza-like illness rates in the US during the 2012-13 flu season. In this work, we build on the previous modeling attempt, proposing substantial improvements. Firstly, we investigate the performance of a widely used linear regularized regression solver, known as the Elastic Net. Then, we expand on this model by incorporating the queries selected by the Elastic Net into a nonlinear regression framework, based on a composite Gaussian Process. Finally, we augment the query-only predictions with an autoregressive model, injecting prior knowledge about the disease. We assess predictive performance using five consecutive flu seasons spanning from 2008 to 2013 and qualitatively explain certain shortcomings of the previous approach. Our results indicate that a nonlinear query modeling approach delivers the lowest cumulative nowcasting error, and also suggest that query information significantly improves autoregressive inferences, obtaining state-of-the-art performance.
Practical private database queries based on a quantum-key-distribution protocol
International Nuclear Information System (INIS)
Jakobi, Markus; Simon, Christoph; Gisin, Nicolas; Bancal, Jean-Daniel; Branciard, Cyril; Walenta, Nino; Zbinden, Hugo
2011-01-01
Private queries allow a user, Alice, to learn an element of a database held by a provider, Bob, without revealing which element she is interested in, while limiting her information about the other elements. We propose to implement private queries based on a quantum-key-distribution protocol, with changes only in the classical postprocessing of the key. This approach makes our scheme both easy to implement and loss tolerant. While unconditionally secure private queries are known to be impossible, we argue that an interesting degree of security can be achieved by relying on fundamental physical principles instead of unverifiable security assumptions in order to protect both the user and the database. We think that the scope exists for such practical private queries to become another remarkable application of quantum information in the footsteps of quantum key distribution.
An Adaptive Genetic Algorithm with Dynamic Population Size for Optimizing Join Queries
Vellev, Stoyan
2008-01-01
The problem of finding the optimal join ordering executing a query to a relational database management system is a combinatorial optimization problem, which makes deterministic exhaustive solution search unacceptable for queries with a great number of joined relations. In this work an adaptive genetic algorithm with dynamic population size is proposed for optimizing large join queries. The performance of the algorithm is compared with that of several classical non-determinis...
Persistent Identifiers for Improved Accessibility for Linked Data Querying
Shepherd, A.; Chandler, C. L.; Arko, R. A.; Fils, D.; Jones, M. B.; Krisnadhi, A.; Mecum, B.
2016-12-01
The adoption of linked open data principles within the geosciences has increased the amount of accessible information available on the Web. However, this data is difficult to consume for those who are unfamiliar with Semantic Web technologies such as Web Ontology Language (OWL), Resource Description Framework (RDF) and SPARQL - the RDF query language. Consumers would need to understand the structure of the data and how to efficiently query it. Furthermore, understanding how to query doesn't solve problems of poor precision and recall in search results. For consumers unfamiliar with the data, full-text searches are most accessible, but not ideal as they arrest the advantages of data disambiguation and co-reference resolution efforts. Conversely, URI searches across linked data can deliver improved search results, but knowledge of these exact URIs may remain difficult to obtain. The increased adoption of Persistent Identifiers (PIDs) can lead to improved linked data querying by a wide variety of consumers. Because PIDs resolve to a single entity, they are an excellent data point for disambiguating content. At the same time, PIDs are more accessible and prominent than a single data provider's linked data URI. When present in linked open datasets, PIDs provide balance between the technical and social hurdles of linked data querying as evidenced by the NSF EarthCube GeoLink project. The GeoLink project, funded by NSF's EarthCube initiative, have brought together data repositories include content from field expeditions, laboratory analyses, journal publications, conference presentations, theses/reports, and funding awards that span scientific studies from marine geology to marine ecosystems and biogeochemistry to paleoclimatology.
Secure quantum private information retrieval using phase-encoded queries
Energy Technology Data Exchange (ETDEWEB)
Olejnik, Lukasz [CERN, 1211 Geneva 23, Switzerland and Poznan Supercomputing and Networking Center, Noskowskiego 12/14, PL-61-704 Poznan (Poland)
2011-08-15
We propose a quantum solution to the classical private information retrieval (PIR) problem, which allows one to query a database in a private manner. The protocol offers privacy thresholds and allows the user to obtain information from a database in a way that offers the potential adversary, in this model the database owner, no possibility of deterministically establishing the query contents. This protocol may also be viewed as a solution to the symmetrically private information retrieval problem in that it can offer database security (inability for a querying user to steal its contents). Compared to classical solutions, the protocol offers substantial improvement in terms of communication complexity. In comparison with the recent quantum private queries [Phys. Rev. Lett. 100, 230502 (2008)] protocol, it is more efficient in terms of communication complexity and the number of rounds, while offering a clear privacy parameter. We discuss the security of the protocol and analyze its strengths and conclude that using this technique makes it challenging to obtain the unconditional (in the information-theoretic sense) privacy degree; nevertheless, in addition to being simple, the protocol still offers a privacy level. The oracle used in the protocol is inspired both by the classical computational PIR solutions as well as the Deutsch-Jozsa oracle.
Secure quantum private information retrieval using phase-encoded queries
International Nuclear Information System (INIS)
Olejnik, Lukasz
2011-01-01
We propose a quantum solution to the classical private information retrieval (PIR) problem, which allows one to query a database in a private manner. The protocol offers privacy thresholds and allows the user to obtain information from a database in a way that offers the potential adversary, in this model the database owner, no possibility of deterministically establishing the query contents. This protocol may also be viewed as a solution to the symmetrically private information retrieval problem in that it can offer database security (inability for a querying user to steal its contents). Compared to classical solutions, the protocol offers substantial improvement in terms of communication complexity. In comparison with the recent quantum private queries [Phys. Rev. Lett. 100, 230502 (2008)] protocol, it is more efficient in terms of communication complexity and the number of rounds, while offering a clear privacy parameter. We discuss the security of the protocol and analyze its strengths and conclude that using this technique makes it challenging to obtain the unconditional (in the information-theoretic sense) privacy degree; nevertheless, in addition to being simple, the protocol still offers a privacy level. The oracle used in the protocol is inspired both by the classical computational PIR solutions as well as the Deutsch-Jozsa oracle.
The Ontology Lookup Service: more data and better tools for controlled vocabulary queries.
Côté, Richard G; Jones, Philip; Martens, Lennart; Apweiler, Rolf; Hermjakob, Henning
2008-07-01
The Ontology Lookup Service (OLS) (http://www.ebi.ac.uk/ols) provides interactive and programmatic interfaces to query, browse and navigate an ever increasing number of biomedical ontologies and controlled vocabularies. The volume of data available for querying has more than quadrupled since it went into production and OLS functionality has been integrated into several high-usage databases and data entry tools. Improvements have been made to both OLS query interfaces, based on user feedback and requirements, to improve usability and service interoperability and provide novel ways to perform queries.
Efficient processing of 3-sided range queries with probabilistic guarantees
DEFF Research Database (Denmark)
Kaporis, Alexis; Papadopoulos, Apostolos; Sioutas, Spyros
2010-01-01
This work studies the problem of 2-dimensional searching for the 3-sided range query of the form [a, b] x (-∞, c] in both main and external memory, by considering a variety of input distributions. A dynamic linear main memory solution is proposed, which answers 3-sided queries in O(log n + t) worst...
Mathematical Formula Search using Natural Language Queries
Directory of Open Access Journals (Sweden)
YANG, S.
2014-11-01
Full Text Available This paper presents how to search mathematical formulae written in MathML when given plain words as a query. Since the proposed method allows natural language queries like the traditional Information Retrieval for the mathematical formula search, users do not need to enter any complicated math symbols and to use any formula input tool. For this, formula data is converted into plain texts, and features are extracted from the converted texts. In our experiments, we achieve an outstanding performance, a MRR of 0.659. In addition, we introduce how to utilize formula classification for formula search. By using class information, we finally achieve an improved performance, a MRR of 0.690.
The data cyclotron query processing scheme
R.A. Goncalves (Romulo); M.L. Kersten (Martin)
2010-01-01
htmlabstractDistributed database systems exploit static workload characteristics to steer data fragmentation and data allocation schemes. However, the grand challenge of distributed query processing is to come up with a self-organizing architecture, which exploits all resources to manage the hot
Optimizing Cost of Continuous Overlapping Queries over Data Streams by Filter Adaption
Xie, Qing
2016-01-12
The problem we aim to address is the optimization of cost management for executing multiple continuous queries on data streams, where each query is defined by several filters, each of which monitors certain status of the data stream. Specially the filter can be shared by different queries and expensive to evaluate. The conventional objective for such a problem is to minimize the overall execution cost to solve all queries, by planning the order of filter evaluation in shared strategy. However, in streaming scenario, the characteristics of data items may change in process, which can bring some uncertainty to the outcome of individual filter evaluation, and affect the plan of query execution as well as the overall execution cost. In our work, considering the influence of the uncertain variation of data characteristics, we propose a framework to deal with the dynamic adjustment of filter ordering for query execution on data stream, and focus on the issues of cost management. By incrementally monitoring and analyzing the results of filter evaluation, our proposed approach can be effectively adaptive to the varied stream behavior and adjust the optimal ordering of filter evaluation, so as to optimize the execution cost. In order to achieve satisfactory performance and efficiency, we also discuss the trade-off between the adaptivity of our framework and the overhead incurred by filter adaption. The experimental results on synthetic and two real data sets (traffic and multimedia) show that our framework can effectively reduce and balance the overall query execution cost and keep high adaptivity in streaming scenario.
STARS 2.0: 2nd-generation open-source archiving and query software
Winegar, Tom
2008-07-01
The Subaru Telescope is in process of developing an open-source alternative to the 1st-generation software and databases (STARS 1) used for archiving and query. For STARS 2, we have chosen PHP and Python for scripting and MySQL as the database software. We have collected feedback from staff and observers, and used this feedback to significantly improve the design and functionality of our future archiving and query software. Archiving - We identified two weaknesses in 1st-generation STARS archiving software: a complex and inflexible table structure and uncoordinated system administration for our business model: taking pictures from the summit and archiving them in both Hawaii and Japan. We adopted a simplified and normalized table structure with passive keyword collection, and we are designing an archive-to-archive file transfer system that automatically reports real-time status and error conditions and permits error recovery. Query - We identified several weaknesses in 1st-generation STARS query software: inflexible query tools, poor sharing of calibration data, and no automatic file transfer mechanisms to observers. We are developing improved query tools and sharing of calibration data, and multi-protocol unassisted file transfer mechanisms for observers. In the process, we have redefined a 'query': from an invisible search result that can only transfer once in-house right now, with little status and error reporting and no error recovery - to a stored search result that can be monitored, transferred to different locations with multiple protocols, reporting status and error conditions and permitting recovery from errors.
Bioqueries: a collaborative environment to create, explore and share SPARQL queries in Life Sciences
García-Godoy, Maria Jesús; López-Camacho, Esteban; Navas-Delgado, Ismael; Aldana-Montes, Jose Francisco
2016-01-01
Bioqueries provides a collaborative environment to create, explore, execute, clone and share SPARQL queries (including Federated Queries). Federated SPARQL queries can retrieve information from more than one data source. Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tech.
Web-Based Distributed XML Query Processing
Smiljanic, M.; Feng, L.; Jonker, Willem; Blanken, Henk; Grabs, T.; Schek, H-J.; Schenkel, R.; Weikum, G.
2003-01-01
Web-based distributed XML query processing has gained in importance in recent years due to the widespread popularity of XML on the Web. Unlike centralized and tightly coupled distributed systems, Web-based distributed database systems are highly unpredictable and uncontrollable, with a rather
Web search queries can predict stock market volumes.
Bordino, Ilaria; Battiston, Stefano; Caldarelli, Guido; Cristelli, Matthieu; Ukkonen, Antti; Weber, Ingmar
2012-01-01
We live in a computerized and networked society where many of our actions leave a digital trace and affect other people's actions. This has lead to the emergence of a new data-driven research field: mathematical methods of computer science, statistical physics and sociometry provide insights on a wide range of disciplines ranging from social science to human mobility. A recent important discovery is that search engine traffic (i.e., the number of requests submitted by users to search engines on the www) can be used to track and, in some cases, to anticipate the dynamics of social phenomena. Successful examples include unemployment levels, car and home sales, and epidemics spreading. Few recent works applied this approach to stock prices and market sentiment. However, it remains unclear if trends in financial markets can be anticipated by the collective wisdom of on-line users on the web. Here we show that daily trading volumes of stocks traded in NASDAQ-100 are correlated with daily volumes of queries related to the same stocks. In particular, query volumes anticipate in many cases peaks of trading by one day or more. Our analysis is carried out on a unique dataset of queries, submitted to an important web search engine, which enable us to investigate also the user behavior. We show that the query volume dynamics emerges from the collective but seemingly uncoordinated activity of many users. These findings contribute to the debate on the identification of early warnings of financial systemic risk, based on the activity of users of the www.
Web search queries can predict stock market volumes.
Directory of Open Access Journals (Sweden)
Ilaria Bordino
Full Text Available We live in a computerized and networked society where many of our actions leave a digital trace and affect other people's actions. This has lead to the emergence of a new data-driven research field: mathematical methods of computer science, statistical physics and sociometry provide insights on a wide range of disciplines ranging from social science to human mobility. A recent important discovery is that search engine traffic (i.e., the number of requests submitted by users to search engines on the www can be used to track and, in some cases, to anticipate the dynamics of social phenomena. Successful examples include unemployment levels, car and home sales, and epidemics spreading. Few recent works applied this approach to stock prices and market sentiment. However, it remains unclear if trends in financial markets can be anticipated by the collective wisdom of on-line users on the web. Here we show that daily trading volumes of stocks traded in NASDAQ-100 are correlated with daily volumes of queries related to the same stocks. In particular, query volumes anticipate in many cases peaks of trading by one day or more. Our analysis is carried out on a unique dataset of queries, submitted to an important web search engine, which enable us to investigate also the user behavior. We show that the query volume dynamics emerges from the collective but seemingly uncoordinated activity of many users. These findings contribute to the debate on the identification of early warnings of financial systemic risk, based on the activity of users of the www.
SeqWare Query Engine: storing and searching sequence data in the cloud
Directory of Open Access Journals (Sweden)
Merriman Barry
2010-12-01
Full Text Available Abstract Background Since the introduction of next-generation DNA sequencers the rapid increase in sequencer throughput, and associated drop in costs, has resulted in more than a dozen human genomes being resequenced over the last few years. These efforts are merely a prelude for a future in which genome resequencing will be commonplace for both biomedical research and clinical applications. The dramatic increase in sequencer output strains all facets of computational infrastructure, especially databases and query interfaces. The advent of cloud computing, and a variety of powerful tools designed to process petascale datasets, provide a compelling solution to these ever increasing demands. Results In this work, we present the SeqWare Query Engine which has been created using modern cloud computing technologies and designed to support databasing information from thousands of genomes. Our backend implementation was built using the highly scalable, NoSQL HBase database from the Hadoop project. We also created a web-based frontend that provides both a programmatic and interactive query interface and integrates with widely used genome browsers and tools. Using the query engine, users can load and query variants (SNVs, indels, translocations, etc with a rich level of annotations including coverage and functional consequences. As a proof of concept we loaded several whole genome datasets including the U87MG cell line. We also used a glioblastoma multiforme tumor/normal pair to both profile performance and provide an example of using the Hadoop MapReduce framework within the query engine. This software is open source and freely available from the SeqWare project (http://seqware.sourceforge.net. Conclusions The SeqWare Query Engine provided an easy way to make the U87MG genome accessible to programmers and non-programmers alike. This enabled a faster and more open exploration of results, quicker tuning of parameters for heuristic variant calling filters
SeqWare Query Engine: storing and searching sequence data in the cloud
2010-01-01
Background Since the introduction of next-generation DNA sequencers the rapid increase in sequencer throughput, and associated drop in costs, has resulted in more than a dozen human genomes being resequenced over the last few years. These efforts are merely a prelude for a future in which genome resequencing will be commonplace for both biomedical research and clinical applications. The dramatic increase in sequencer output strains all facets of computational infrastructure, especially databases and query interfaces. The advent of cloud computing, and a variety of powerful tools designed to process petascale datasets, provide a compelling solution to these ever increasing demands. Results In this work, we present the SeqWare Query Engine which has been created using modern cloud computing technologies and designed to support databasing information from thousands of genomes. Our backend implementation was built using the highly scalable, NoSQL HBase database from the Hadoop project. We also created a web-based frontend that provides both a programmatic and interactive query interface and integrates with widely used genome browsers and tools. Using the query engine, users can load and query variants (SNVs, indels, translocations, etc) with a rich level of annotations including coverage and functional consequences. As a proof of concept we loaded several whole genome datasets including the U87MG cell line. We also used a glioblastoma multiforme tumor/normal pair to both profile performance and provide an example of using the Hadoop MapReduce framework within the query engine. This software is open source and freely available from the SeqWare project (http://seqware.sourceforge.net). Conclusions The SeqWare Query Engine provided an easy way to make the U87MG genome accessible to programmers and non-programmers alike. This enabled a faster and more open exploration of results, quicker tuning of parameters for heuristic variant calling filters, and a common data
SeqWare Query Engine: storing and searching sequence data in the cloud.
O'Connor, Brian D; Merriman, Barry; Nelson, Stanley F
2010-12-21
Since the introduction of next-generation DNA sequencers the rapid increase in sequencer throughput, and associated drop in costs, has resulted in more than a dozen human genomes being resequenced over the last few years. These efforts are merely a prelude for a future in which genome resequencing will be commonplace for both biomedical research and clinical applications. The dramatic increase in sequencer output strains all facets of computational infrastructure, especially databases and query interfaces. The advent of cloud computing, and a variety of powerful tools designed to process petascale datasets, provide a compelling solution to these ever increasing demands. In this work, we present the SeqWare Query Engine which has been created using modern cloud computing technologies and designed to support databasing information from thousands of genomes. Our backend implementation was built using the highly scalable, NoSQL HBase database from the Hadoop project. We also created a web-based frontend that provides both a programmatic and interactive query interface and integrates with widely used genome browsers and tools. Using the query engine, users can load and query variants (SNVs, indels, translocations, etc) with a rich level of annotations including coverage and functional consequences. As a proof of concept we loaded several whole genome datasets including the U87MG cell line. We also used a glioblastoma multiforme tumor/normal pair to both profile performance and provide an example of using the Hadoop MapReduce framework within the query engine. This software is open source and freely available from the SeqWare project (http://seqware.sourceforge.net). The SeqWare Query Engine provided an easy way to make the U87MG genome accessible to programmers and non-programmers alike. This enabled a faster and more open exploration of results, quicker tuning of parameters for heuristic variant calling filters, and a common data interface to simplify development of
GeoVanet: A Routing Protocol for Query Processing in Vehicular Networks
Directory of Open Access Journals (Sweden)
Thierry Delot
2011-01-01
Full Text Available In a vehicular ad hoc network (VANET, cars can exchange information by using short-range wireless communications. Along with the opportunities offered by vehicular networks, a number of challenges also arise. In particular, most works so far have focused on a push model, where potentially useful data are pushed towards vehicles. The use of pull models, that would allow users to send queries to a set of cars in order to find the desired information, has not been studied in depth. The main challenge for pull models is the difficulty to route the different results towards the query originator in a highly dynamic network where the nodes move very quickly. To solve this issue, we propose GeoVanet, an anonymous and non-intrusive geographic routing protocol which ensures that the sender of a query can get a consistent answer. Our goal is to ensure that the user will be able to retrieve the query results within a bounded time. To prove the effectiveness of GeoVanet, an extensive experimental evaluation has been performed, that proves the interest of the proposal for both rural and urban areas. It shows that up to 80% of the available query results are delivered to the user.
Evolutionary Algorithms for Boolean Queries Optimization
Czech Academy of Sciences Publication Activity Database
Húsek, Dušan; Snášel, Václav; Neruda, Roman; Owais, S.S.J.; Krömer, P.
2006-01-01
Roč. 3, č. 1 (2006), s. 15-20 ISSN 1790-0832 R&D Projects: GA AV ČR 1ET100300414 Institutional research plan: CEZ:AV0Z10300504 Keywords : evolutionary algorithms * genetic algorithms * information retrieval * Boolean query Subject RIV: BA - General Mathematics
Flattening Queries over Nested Data Types
van Ruth, J.
2006-01-01
The theory developed in this thesis provides a method to improve the efficiency of querying nested data. The roots of this research lie in the tension between data model expressiveness and performance. Obviously, more expressive data models are more convenient for application programmers. For many
An Ontology-Based Reasoning Framework for Querying Satellite Images for Disaster Monitoring.
Alirezaie, Marjan; Kiselev, Andrey; Längkvist, Martin; Klügl, Franziska; Loutfi, Amy
2017-11-05
This paper presents a framework in which satellite images are classified and augmented with additional semantic information to enable queries about what can be found on the map at a particular location, but also about paths that can be taken. This is achieved by a reasoning framework based on qualitative spatial reasoning that is able to find answers to high level queries that may vary on the current situation. This framework called SemCityMap, provides the full pipeline from enriching the raw image data with rudimentary labels to the integration of a knowledge representation and reasoning methods to user interfaces for high level querying. To illustrate the utility of SemCityMap in a disaster scenario, we use an urban environment-central Stockholm-in combination with a flood simulation. We show that the system provides useful answers to high-level queries also with respect to the current flood status. Examples of such queries concern path planning for vehicles or retrieval of safe regions such as "find all regions close to schools and far from the flooded area". The particular advantage of our approach lies in the fact that ontological information and reasoning is explicitly integrated so that queries can be formulated in a natural way using concepts on appropriate level of abstraction, including additional constraints.
An Ontology-Based Reasoning Framework for Querying Satellite Images for Disaster Monitoring
Directory of Open Access Journals (Sweden)
Marjan Alirezaie
2017-11-01
Full Text Available This paper presents a framework in which satellite images are classified and augmented with additional semantic information to enable queries about what can be found on the map at a particular location, but also about paths that can be taken. This is achieved by a reasoning framework based on qualitative spatial reasoning that is able to find answers to high level queries that may vary on the current situation. This framework called SemCityMap, provides the full pipeline from enriching the raw image data with rudimentary labels to the integration of a knowledge representation and reasoning methods to user interfaces for high level querying. To illustrate the utility of SemCityMap in a disaster scenario, we use an urban environment—central Stockholm—in combination with a flood simulation. We show that the system provides useful answers to high-level queries also with respect to the current flood status. Examples of such queries concern path planning for vehicles or retrieval of safe regions such as “find all regions close to schools and far from the flooded area”. The particular advantage of our approach lies in the fact that ontological information and reasoning is explicitly integrated so that queries can be formulated in a natural way using concepts on appropriate level of abstraction, including additional constraints.
FTree query construction for virtual screening: a statistical analysis.
Gerlach, Christof; Broughton, Howard; Zaliani, Andrea
2008-02-01
FTrees (FT) is a known chemoinformatic tool able to condense molecular descriptions into a graph object and to search for actives in large databases using graph similarity. The query graph is classically derived from a known active molecule, or a set of actives, for which a similar compound has to be found. Recently, FT similarity has been extended to fragment space, widening its capabilities. If a user were able to build a knowledge-based FT query from information other than a known active structure, the similarity search could be combined with other, normally separate, fields like de-novo design or pharmacophore searches. With this aim in mind, we performed a comprehensive analysis of several databases in terms of FT description and provide a basic statistical analysis of the FT spaces so far at hand. Vendors' catalogue collections and MDDR as a source of potential or known "actives", respectively, have been used. With the results reported herein, a set of ranges, mean values and standard deviations for several query parameters are presented in order to set a reference guide for the users. Applications on how to use this information in FT query building are also provided, using a newly built 3D-pharmacophore from 57 5HT-1F agonists and a published one which was used for virtual screening for tRNA-guanine transglycosylase (TGT) inhibitors.
Adverse Reactions Associated With Cannabis Consumption as Evident From Search Engine Queries.
Yom-Tov, Elad; Lev-Ran, Shaul
2017-10-26
Cannabis is one of the most widely used psychoactive substances worldwide, but adverse drug reactions (ADRs) associated with its use are difficult to study because of its prohibited status in many countries. Internet search engine queries have been used to investigate ADRs in pharmaceutical drugs. In this proof-of-concept study, we tested whether these queries can be used to detect the adverse reactions of cannabis use. We analyzed anonymized queries from US-based users of Bing, a widely used search engine, made over a period of 6 months and compared the results with the prevalence of cannabis use as reported in the US National Survey on Drug Use in the Household (NSDUH) and with ADRs reported in the Food and Drug Administration's Adverse Drug Reporting System. Predicted prevalence of cannabis use was estimated from the fraction of people making queries about cannabis, marijuana, and 121 additional synonyms. Predicted ADRs were estimated from queries containing layperson descriptions to 195 ICD-10 symptoms list. Our results indicated that the predicted prevalence of cannabis use at the US census regional level reaches an R 2 of .71 NSDUH data. Queries for ADRs made by people who also searched for cannabis reveal many of the known adverse effects of cannabis (eg, cough and psychotic symptoms), as well as plausible unknown reactions (eg, pyrexia). These results indicate that search engine queries can serve as an important tool for the study of adverse reactions of illicit drugs, which are difficult to study in other settings. ©Elad Yom-Tov, Shaul Lev-Ran. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 26.10.2017.
Visually defining and querying consistent multi-granular clinical temporal abstractions.
Combi, Carlo; Oliboni, Barbara
2012-02-01
The main goal of this work is to propose a framework for the visual specification and query of consistent multi-granular clinical temporal abstractions. We focus on the issue of querying patient clinical information by visually defining and composing temporal abstractions, i.e., high level patterns derived from several time-stamped raw data. In particular, we focus on the visual specification of consistent temporal abstractions with different granularities and on the visual composition of different temporal abstractions for querying clinical databases. Temporal abstractions on clinical data provide a concise and high-level description of temporal raw data, and a suitable way to support decision making. Granularities define partitions on the time line and allow one to represent time and, thus, temporal clinical information at different levels of detail, according to the requirements coming from the represented clinical domain. The visual representation of temporal information has been considered since several years in clinical domains. Proposed visualization techniques must be easy and quick to understand, and could benefit from visual metaphors that do not lead to ambiguous interpretations. Recently, physical metaphors such as strips, springs, weights, and wires have been proposed and evaluated on clinical users for the specification of temporal clinical abstractions. Visual approaches to boolean queries have been considered in the last years and confirmed that the visual support to the specification of complex boolean queries is both an important and difficult research topic. We propose and describe a visual language for the definition of temporal abstractions based on a set of intuitive metaphors (striped wall, plastered wall, brick wall), allowing the clinician to use different granularities. A new algorithm, underlying the visual language, allows the physician to specify only consistent abstractions, i.e., abstractions not containing contradictory conditions on
DirQ: A Directed Query Dissemination Scheme for Wireless Sensor Networks
Chatterjea, Supriyo; De Luigi, Simone; Havinga, Paul J.M.; Kaminska, B
This paper describes a Directed Query Dissemination Scheme, DirQ that routes queries to the appropriate source nodes based on both constant and dynamic-valued attributes such as sensor types and sensor values. Location information is not essential for the operation of DirQ. DirQ only uses locally
Boolean Queries Optimization by Genetic Algorithms
Czech Academy of Sciences Publication Activity Database
Húsek, Dušan; Owais, S.S.J.; Krömer, P.; Snášel, Václav
2005-01-01
Roč. 15, - (2005), s. 395-409 ISSN 1210-0552 R&D Projects: GA AV ČR 1ET100300414 Institutional research plan: CEZ:AV0Z10300504 Keywords : evolutionary algorithms * genetic algorithms * genetic programming * information retrieval * Boolean query Subject RIV: BB - Applied Statistics, Operational Research
Location-Based Top-k Term Querying over Sliding Window
Xu, Ying
2017-10-03
In part due to the proliferation of GPS-equipped mobile devices, massive svolumes of geo-tagged streaming text messages are becoming available on social media. It is of great interest to discover most frequent nearby terms from such tremendous stream data. In this paper, we present novel indexing, updating, and query processing techniques that are capable of discovering top-k locally popular nearby terms over a sliding window. Specifically, given a query location and a set of geo-tagged messages within a sliding window, we study the problem of searching for the top-k terms by considering both the term frequency and the proximities between the messages containing the term and the query location. We develop a novel and efficient mechanism to solve the problem, including a quad-tree based indexing structure, indexing update technique, and a best-first based searching algorithm. An empirical study is conducted to show that our proposed techniques are efficient and fit for users’ requirements through varying a number of parameters.
Regular paths in SparQL: querying the NCI Thesaurus.
Detwiler, Landon T; Suciu, Dan; Brinkley, James F
2008-11-06
OWL, the Web Ontology Language, provides syntax and semantics for representing knowledge for the semantic web. Many of the constructs of OWL have a basis in the field of description logics. While the formal underpinnings of description logics have lead to a highly computable language, it has come at a cognitive cost. OWL ontologies are often unintuitive to readers lacking a strong logic background. In this work we describe GLEEN, a regular path expression library, which extends the RDF query language SparQL to support complex path expressions over OWL and other RDF-based ontologies. We illustrate the utility of GLEEN by showing how it can be used in a query-based approach to defining simpler, more intuitive views of OWL ontologies. In particular we show how relatively simple GLEEN-enhanced SparQL queries can create views of the OWL version of the NCI Thesaurus that match the views generated by the web-based NCI browser.
Querying Large Physics Data Sets Over an Information Grid
Baker, N; Kovács, Z; Le Goff, J M; McClatchey, R
2001-01-01
Optimising use of the Web (WWW) for LHC data analysis is a complex problem and illustrates the challenges arising from the integration of and computation across massive amounts of information distributed worldwide. Finding the right piece of information can, at times, be extremely time-consuming, if not impossible. So-called Grids have been proposed to facilitate LHC computing and many groups have embarked on studies of data replication, data migration and networking philosophies. Other aspects such as the role of 'middleware' for Grids are emerging as requiring research. This paper positions the need for appropriate middleware that enables users to resolve physics queries across massive data sets. It identifies the role of meta-data for query resolution and the importance of Information Grids for high-energy physics analysis rather than just Computational or Data Grids. This paper identifies software that is being implemented at CERN to enable the querying of very large collaborating HEP data-sets, initially...
Federated querying architecture with clinical & translational health IT application.
Livne, Oren E; Schultz, N Dustin; Narus, Scott P
2011-10-01
We present a software architecture that federates data from multiple heterogeneous health informatics data sources owned by multiple organizations. The architecture builds upon state-of-the-art open-source Java and XML frameworks in innovative ways. It consists of (a) federated query engine, which manages federated queries and result set aggregation via a patient identification service; and (b) data source facades, which translate the physical data models into a common model on-the-fly and handle large result set streaming. System modules are connected via reusable Apache Camel integration routes and deployed to an OSGi enterprise service bus. We present an application of our architecture that allows users to construct queries via the i2b2 web front-end, and federates patient data from the University of Utah Enterprise Data Warehouse and the Utah Population database. Our system can be easily adopted, extended and integrated with existing SOA Healthcare and HL7 frameworks such as i2b2 and caGrid.
Location-Based Top-k Term Querying over Sliding Window
Xu, Ying; Chen, Lisi; Yao, Bin; Shang, Shuo; Zhu, Shunzhi; Zheng, Kai; Li, Fang
2017-01-01
In part due to the proliferation of GPS-equipped mobile devices, massive svolumes of geo-tagged streaming text messages are becoming available on social media. It is of great interest to discover most frequent nearby terms from such tremendous stream data. In this paper, we present novel indexing, updating, and query processing techniques that are capable of discovering top-k locally popular nearby terms over a sliding window. Specifically, given a query location and a set of geo-tagged messages within a sliding window, we study the problem of searching for the top-k terms by considering both the term frequency and the proximities between the messages containing the term and the query location. We develop a novel and efficient mechanism to solve the problem, including a quad-tree based indexing structure, indexing update technique, and a best-first based searching algorithm. An empirical study is conducted to show that our proposed techniques are efficient and fit for users’ requirements through varying a number of parameters.
Investigation in Query System Framework for High Energy Physics
Jatuphattharachat, Thanat
2017-01-01
We summarize an investigation in query system framework for HEP (High Energy Physics). Our work was an investigation on distributed server part of Femtocode, which is a query language that provides the ability for physicists to make plots and other aggregations in real-time. To make the system more robust and capable of processing large amount of data quickly, it is necessary to deploy the system on a redundant and distributed computing cluster. This project aims to investigate third party coordination and resource management frameworks which fit into the design of real-time distributed query system. Zookeeper, Mesos and Marathon are the main frameworks for this investigation. The results indicate that Zookeeper is good for job coordinator and job tracking as it provides robust, fast, simple and transparent read and write process for all connecting client across distributed Zookeeper server. Furthermore, it also supports high availability access and consistency guarantee within specific time bound.