WorldWideScience

Sample records for query sig recommendations

  1. Query recommendation for children

    NARCIS (Netherlands)

    Duarte Torres, Sergio; Hiemstra, Djoerd; Weber, Ingmar; Serdyukov, Pavel

    2012-01-01

    One of the biggest problems that children experience while searching the web occurs during the query formulation process. Children have been found to struggle formulating queries based on keywords given their limited vocabulary and their difficulty to choose the right keywords. In this work we propo

  2. Recommendation Sets and Choice Queries

    DEFF Research Database (Denmark)

    Viappiani, Paolo Renato; Boutilier, Craig

    2011-01-01

    Utility elicitation is an important component of many applications, such as decision support systems and recommender systems. Such systems query users about their preferences and offer recommendations based on the system's belief about the user's utility function. We analyze the connection between...... the problem of generating optimal recommendation sets and the problem of generating optimal choice queries, considering both Bayesian and regret-based elicitation. Our results show that, somewhat surprisingly, under very general circumstances, the optimal recommendation set coincides with the optimal query....

  3. Query Recommendation by Coupling Personalization with Clustering for Search Engine

    Directory of Open Access Journals (Sweden)

    Dhiliphanrajkumar.Thambidurai

    2016-11-01

    Full Text Available In the present world internet and web search engines have become an important part in one’s day-today life. For a user query, more than few thousand web pages are retrieved but most of them are irrelevant. A major problem in search engine is that the user queries are usually short and ambiguous, and they are not sufficient to satisfy the precise user needs. Also listing more number of results according to user make them worry about searching the desired results and it takes large amount of time to search from the huge list of results. To overcome all the problems, an effective approach is developed by capturing the users’ click through and bookmarking data to provide personalized query recommendation. For retrieving the results, Google API is used. Experimental results show that the proposed method is providing better query recommendation results than the existing query suggestion methods.

  4. Query recommendation in the information domain of children

    NARCIS (Netherlands)

    Duarte Torres, Sergio Raúl; Hiemstra, Djoerd; Weber, Ingmar; Serdyukov, Pavel

    2014-01-01

    Children represent an increasing part of web users. One of the key problems that hamper their search experience is their limited vocabulary, their difficulty to use the right keywords, and the inappropriateness of general- purpose query suggestions. In this work, we propose a method that uses tags f

  5. Recommendation Boosted Query Propagation in the Social Network

    Science.gov (United States)

    Kukła, Grzegorz; Kazienko, Przemysław; Bródka, Piotr; Filipowski, Tomasz

    Every single company or institution wants to utilize its resources in the most efficient way and one of the most important resources is knowledge. In the paper, a new SocLaKE system is introduced. It exploits the social network existing within the organization together with information about expertise of community members to recommend the best way to get the answer over the chains of acquaintances. The explanation how the system recommends people and experiments on the sample social network are presented as well.

  6. Sig'natur 3

    DEFF Research Database (Denmark)

    Wadskjær, Lise Fabricius; Guldager, Ida; Jepsen, Peter;

    Sig´natur 3 arbejder med: - kortlære - sundhed - dyrs overlevelse - tid - elektricitet - Danmark og Norden - affald - Livet i en sø......Sig´natur 3 arbejder med: - kortlære - sundhed - dyrs overlevelse - tid - elektricitet - Danmark og Norden - affald - Livet i en sø...

  7. Abstracts of SIG Sessions.

    Science.gov (United States)

    Proceedings of the ASIS Annual Meeting, 1997

    1997-01-01

    Presents abstracts of SIG Sessions. Highlights include digital collections; information retrieval methods; public interest/fair use; classification and indexing; electronic publication; funding; globalization; information technology projects; interface design; networking in developing countries; metadata; multilingual databases; networked…

  8. Smart Query Answering for Marine Sensor Data

    Directory of Open Access Journals (Sweden)

    Paulo de Souza

    2011-03-01

    Full Text Available We review existing query answering systems for sensor data. We then propose an extended query answering approach termed smart query, specifically for marine sensor data. The smart query answering system integrates pattern queries and continuous queries. The proposed smart query system considers both streaming data and historical data from marine sensor networks. The smart query also uses query relaxation technique and semantics from domain knowledge as a recommender system. The proposed smart query benefits in building data and information systems for marine sensor networks.

  9. Abstracts of SIG Sessions.

    Science.gov (United States)

    Proceedings of the ASIS Annual Meeting, 1995

    1995-01-01

    Presents abstracts of 15 special interest group (SIG) sessions. Topics include navigation and information utilization in the Internet, natural language processing, automatic indexing, image indexing, classification, users' models of database searching, online public access catalogs, education for information professions, information services,…

  10. At studere sig selv

    DEFF Research Database (Denmark)

    Beedholm, Kirsten; Frederiksen, Kirsten

    1999-01-01

    Hvad ville der ske, hvis alle eskimoer begyndte at studere eskimologi på bekostning af at leve som en 'rigtig' eskimo? Det spørger forfatterne om i denne artikel. De underviser begge på sygeplejeskoler og finder det en tankevækkende tendens, at det i sygeplejerskernes grunduddannelse er faget sel......, der studeres frem for det, som faget retter sig imod ­ nemlig patienterne....

  11. Child Computer Interaction SIG

    DEFF Research Database (Denmark)

    Read, Janet; Hourcade, Juan Pablo; Markopoulos, Panos

    The discipline of Child Computer Interaction (CCI) has been steadily growing and it is now firmly established as a community in its own right, having the annual IDC (Interaction and Design for Children) conference and its own journal and also enjoying its role as a highly recognisable and vibrant...... contributor to the ACM CHI conference. Having recently been given status as an IFIP (International Federation for Information Processing) TC13 working group, the community now needs to make plans around its academic themes and its coherence as a developing academic community. The CCI SIG at CHI aims to use...

  12. LocSigDB: a database of protein localization signals.

    Science.gov (United States)

    Negi, Simarjeet; Pandey, Sanjit; Srinivasan, Satish M; Mohammed, Akram; Guda, Chittibabu

    2015-01-01

    LocSigDB (http://genome.unmc.edu/LocSigDB/) is a manually curated database of experimental protein localization signals for eight distinct subcellular locations; primarily in a eukaryotic cell with brief coverage of bacterial proteins. Proteins must be localized at their appropriate subcellular compartment to perform their desired function. Mislocalization of proteins to unintended locations is a causative factor for many human diseases; therefore, collection of known sorting signals will help support many important areas of biomedical research. By performing an extensive literature study, we compiled a collection of 533 experimentally determined localization signals, along with the proteins that harbor such signals. Each signal in the LocSigDB is annotated with its localization, source, PubMed references and is linked to the proteins in UniProt database along with the organism information that contain the same amino acid pattern as the given signal. From LocSigDB webserver, users can download the whole database or browse/search for data using an intuitive query interface. To date, LocSigDB is the most comprehensive compendium of protein localization signals for eight distinct subcellular locations. Database URL: http://genome.unmc.edu/LocSigDB/

  13. Superfund Query

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Superfund Query allows users to retrieve data from the Comprehensive Environmental Response, Compensation, and Liability Information System (CERCLIS) database.

  14. Query responses

    Directory of Open Access Journals (Sweden)

    Paweł Łupkowski

    2017-05-01

    Full Text Available In this article we consider the phenomenon of answering a query with a query. Although such answers are common, no large scale, corpus-based characterization exists, with the exception of clarification requests. After briefly reviewing different theoretical approaches on this subject, we present a corpus study of query responses in the British National Corpus and develop a taxonomy for query responses. We point at a variety of response categories that have not been formalized in previous dialogue work, particularly those relevant to adversarial interaction. We show that different response categories have significantly different rates of subsequent answer provision. We provide a formal analysis of the response categories in the framework of KoS.

  15. Ungdomslitteratur former(er) sig

    DEFF Research Database (Denmark)

    Henkel, Ayoe Qvist

    2016-01-01

    Igennem en 'mediesensitiv' analyse af romanen "Akavet" af Ronnie Andersen (2014) og perspektivering til andre aktuelle romaner for og med unge undersøger artiklen, hvordan ungdomslitteratur udvikler sig i dialog med digitale og mediebaserede impulser, og hvilke konsekvenser for udsigelserne om...... ungdomsliv og ungdomslitteraturens æstetik og mulige egenart, denne udvikling har. Artiklen baserer sig på en materialitetstilgang særligt inspireret af N. Katherine Hayles, som ikke tidligere har fået opmærksomhed i læsninger af ungdomslitteratur eller i diskussioner af ungdomslitteraturens mulige egenart......, at ungdomslitteratur drejer sig om unges udviklingsproces fra barndom og til voksenhed og dermed skildrer en overgang præget af linearitet, modning og vækst. Artiklen konkluderer, at der er ungdomslitteratur, der realiseres på andre præmisser, og Akavet kan ses som eksponent for en bevægelse fra ungdomslitteratur som...

  16. Child-Computer Interaction SIG

    DEFF Research Database (Denmark)

    Hourcade, Juan Pablo; Revelle, Glenda; Zeising, Anja

    2016-01-01

    This SIG will provide child-computer interaction researchers and practitioners an opportunity to discuss four topics that represent new challenges and opportunities for the community. The four areas are: interactive technologies for children under the age of five, technology for inclusion, privacy...

  17. Baseline Analyses of SIG Applications and SIG-Eligible and SIG-Awarded Schools. NCEE 2011-4019

    Science.gov (United States)

    Hurlburt, Steven; Le Floch, Kerstin Carlson; Therriault, Susan Bowles; Cole, Susan

    2011-01-01

    The Study of School Turnaround is an examination of the implementation of School Improvement Grants (SIG) authorized under Title I section 1003(g) of the "Elementary and Secondary Education Act" and supplemented by the "American Recovery and Reinvestment Act of 2009." "Baseline Analyses of SIG Applications and SIG-Eligible…

  18. Recommendations

    Science.gov (United States)

    Brazelton, G. Blue; Renn, Kristen A.; Stewart, Dafina-Lazarus

    2015-01-01

    In this chapter, the editors provide a summary of the information shared in this sourcebook about the success of students who have minoritized identities of sexuality or gender and offer recommendations for policy, practice, and further research.

  19. Når fremtiden tegner sig

    DEFF Research Database (Denmark)

    Nielsen, Mette Lykke; Katznelson, Noemi

    Når fremtiden tegner sig er en rapport om unge i et yderkantsområder i Danmark. Den giver indblik i nogle af de subjektive, kulturelle og samfundsmæssige mekanismer der kan få betydning for, hvordan fremtiden tegner sig for unge.......Når fremtiden tegner sig er en rapport om unge i et yderkantsområder i Danmark. Den giver indblik i nogle af de subjektive, kulturelle og samfundsmæssige mekanismer der kan få betydning for, hvordan fremtiden tegner sig for unge....

  20. Når fremtiden tegner sig

    DEFF Research Database (Denmark)

    Katznelson, Noemi; Nielsen, Mette Lykke

    Når fremtiden tegner sig er en rapport om unge i et yderkantsområder i Danmark. Den giver indblik i nogle af de subjektive, kulturelle og samfundsmæssige mekanismer der kan få betydning for, hvordan fremtiden tegner sig for unge.......Når fremtiden tegner sig er en rapport om unge i et yderkantsområder i Danmark. Den giver indblik i nogle af de subjektive, kulturelle og samfundsmæssige mekanismer der kan få betydning for, hvordan fremtiden tegner sig for unge....

  1. SigE Is a Chaperone for the Salmonella enterica Serovar Typhimurium Invasion Protein SigD

    OpenAIRE

    Darwin, K Heran; Robinson, Lloyd S.; Miller, Virginia L.

    2001-01-01

    SigD is translocated into eucaryotic cells by a type III secretion system. In this work, evidence that the putative chaperone SigE directly interacts with SigD is presented. A bacterial two-hybrid system demonstrated that SigE can interact with itself and SigD. In addition, SigD was specifically copurified with SigE-His6 on a nickel column.

  2. Approximate dictionary queries

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Gasieniec, Leszek

    1996-01-01

    Given a set of n binary strings of length m each. We consider the problem of answering d-queries. Given a binary query string of length m, a d-query is to report if there exists a string in the set within Hamming distance d of . We present a data structure of size O(nm) supporting 1-queries in ti...

  3. Alternative Sigma Factors SigF, SigE, and SigG Are Essential for Sporulation in Clostridium botulinum ATCC 3502

    OpenAIRE

    Kirk, David G.; Zhang, Zhen; Korkeala, Hannu; Lindström, Miia

    2014-01-01

    Clostridium botulinum produces heat-resistant endospores that may germinate and outgrow into neurotoxic cultures in foods. Sporulation is regulated by the transcription factor Spo0A and the alternative sigma factors SigF, SigE, SigG, and SigK in most spore formers studied to date. We constructed mutants of sigF, sigE, and sigG in C. botulinum ATCC 3502 and used quantitative reverse transcriptase PCR and electron microscopy to assess their expression of the sporulation pathway on transcription...

  4. Sigma factor genes sigC, sigE, and sigG are upregulated in heterocysts of the cyanobacterium Anabaena sp. strain PCC 7120.

    Science.gov (United States)

    Aldea, M Ramona; Mella-Herrera, Rodrigo A; Golden, James W

    2007-11-01

    We used gfp transcriptional fusions to investigate the regulation of eight sigma factor genes during heterocyst development in the cyanobacterium Anabaena sp. strain PCC 7120. Reporter strains containing gfp fusions with the upstream regions of sigB2, sigD, sigI, and sigJ did not show developmental regulation. Time-lapse microscopy of sigC, sigE, and sigG reporter strains showed increased green fluorescent protein fluorescence in differentiating cells at 4 h, 16 h, and 9 h, respectively, after nitrogen step down.

  5. Federated query processing for the semantic web

    CERN Document Server

    Buil-Aranda, C

    2014-01-01

    During the last years, the amount of RDF data has increased exponentially over the Web, exposed via SPARQL endpoints. These SPARQL endpoints allow users to direct SPARQL queries to the RDF data. Federated SPARQL query processing allows to query several of these RDF databases as if they were a single one, integrating the results from all of them. This is a key concept in the Web of Data and it is also a hot topic in the community. Besides of that, the W3C SPARQL-WG has standardized it in the new Recommendation SPARQL 1.1.This book provides a formalisation of the W3C proposed recommendation. Thi

  6. Alternative sigma factors SigF, SigE, and SigG are essential for sporulation in Clostridium botulinum ATCC 3502.

    Science.gov (United States)

    Kirk, David G; Zhang, Zhen; Korkeala, Hannu; Lindström, Miia

    2014-08-01

    Clostridium botulinum produces heat-resistant endospores that may germinate and outgrow into neurotoxic cultures in foods. Sporulation is regulated by the transcription factor Spo0A and the alternative sigma factors SigF, SigE, SigG, and SigK in most spore formers studied to date. We constructed mutants of sigF, sigE, and sigG in C. botulinum ATCC 3502 and used quantitative reverse transcriptase PCR and electron microscopy to assess their expression of the sporulation pathway on transcriptional and morphological levels. In all three mutants the expression of spo0A was disrupted. The sigF and sigE mutants failed to induce sigG and sigK beyond exponential-phase levels and halted sporulation during asymmetric cell division. In the sigG mutant, peak transcription of sigE was delayed and sigK levels remained lower than that in the parent strain. The sigG mutant forespore was engulfed by the mother cell and possessed a spore coat but no peptidoglycan cortex. The findings suggest that SigF and SigE of C. botulinum ATCC 3502 are essential for early sporulation and late-stage induction of sigK, whereas SigG is essential for spore cortex formation but not for coat formation, as opposed to previous observations in B. subtilis sigG mutants. Our findings add to a growing body of evidence that regulation of sporulation in C. botulinum ATCC 3502, and among the clostridia, differs from the B. subtilis model.

  7. Optimizing Temporal Queries

    DEFF Research Database (Denmark)

    Toman, David; Bowman, Ivan Thomas

    2003-01-01

    Recent research in the area of temporal databases has proposed a number of query languages that vary in their expressive power and the semantics they provide to users. These query languages represent a spectrum of solutions to the tension between clean semantics and efficient evaluation. Often......, these query languages are implemented by translating temporal queries into standard relational queries. However, the compiled queries are often quite cumbersome and expensive to execute even using state-of-the-art relational products. This paper presents an optimization technique that produces more efficient...... translated SQL queries by taking into account the properties of the encoding used for temporal attributes. For concreteness, this translation technique is presented in the context of SQL/TP; however, these techniques are also applicable to other temporal query languages....

  8. Optimizing Temporal Queries

    DEFF Research Database (Denmark)

    Toman, David; Bowman, Ivan Thomas

    2003-01-01

    translated SQL queries by taking into account the properties of the encoding used for temporal attributes. For concreteness, this translation technique is presented in the context of SQL/TP; however, these techniques are also applicable to other temporal query languages......., these query languages are implemented by translating temporal queries into standard relational queries. However, the compiled queries are often quite cumbersome and expensive to execute even using state-of-the-art relational products. This paper presents an optimization technique that produces more efficient......Recent research in the area of temporal databases has proposed a number of query languages that vary in their expressive power and the semantics they provide to users. These query languages represent a spectrum of solutions to the tension between clean semantics and efficient evaluation. Often...

  9. Efficient Query Rewrite for Structured Web Queries

    CERN Document Server

    Gollapudi, Sreenivas; Ntoulas, Alexandros; Paparizos, Stelios

    2011-01-01

    Web search engines and specialized online verticals are increasingly incorporating results from structured data sources to answer semantically rich user queries. For example, the query \\WebQuery{Samsung 50 inch led tv} can be answered using information from a table of television data. However, the users are not domain experts and quite often enter values that do not match precisely the underlying data. Samsung makes 46- or 55- inch led tvs, but not 50-inch ones. So a literal execution of the above mentioned query will return zero results. For optimal user experience, a search engine would prefer to return at least a minimum number of results as close to the original query as possible. Furthermore, due to typical fast retrieval speeds in web-search, a search engine query execution is time-bound. In this paper, we address these challenges by proposing algorithms that rewrite the user query in a principled manner, surfacing at least the required number of results while satisfying the low-latency constraint. We f...

  10. Learning semantic query suggestions

    NARCIS (Netherlands)

    E. Meij; M. Bron; L. Hollink; B. Huurnink; M. de Rijke

    2009-01-01

    An important application of semantic web technology is recognizing human-defined concepts in text. Query transformation is a strategy often used in search engines to derive queries that are able to return more useful search results than the original query and most popular search engines provide faci

  11. Query Expansion Using SNOMED-CT and Weighing Schemes

    Science.gov (United States)

    2014-11-01

    recommend using the full capacity of the different Ontology that they used such as MeSH. Martinez et al. [2] from University of Melbourne, Australia and...for the first query. Query #1 58-year-old woman with hypertension and obesity presents with exercise-related episodic chest pain radiating to the...were then included with the original query as following. Query#1.0 <Summary>58-year-old woman with hypertension and obesity presents with exercise

  12. Et gespenst har sat sig fast

    DEFF Research Database (Denmark)

    Post, Bettina

    2011-01-01

    Det omsiggribende bureaukratiske monster æder sig ind på - og kvæler - alle forsøg på sund faglig fornuft. Og selv om alle kan se problemet, bliver det bare ved og ved.......Det omsiggribende bureaukratiske monster æder sig ind på - og kvæler - alle forsøg på sund faglig fornuft. Og selv om alle kan se problemet, bliver det bare ved og ved....

  13. Collective spatial keyword querying

    DEFF Research Database (Denmark)

    Cao, Xin; Cong, Gao; Jensen, Christian S.;

    2011-01-01

    With the proliferation of geo-positioning and geo-tagging, spatial web objects that possess both a geographical location and a textual description are gaining in prevalence, and spatial keyword queries that exploit both location and textual description are gaining in prominence. However......, the queries studied so far generally focus on finding individual objects that each satisfy a query rather than finding groups of objects where the objects in a group collectively satisfy a query. We define the problem of retrieving a group of spatial web objects such that the group's keywords cover the query...

  14. Roles of SigB and SigF in the Mycobacterium tuberculosis Sigma Factor Network▿ †

    OpenAIRE

    Lee, Jong-Hee; Karakousis, Petros C.; Bishai, William R.

    2007-01-01

    To characterize the roles of SigB and SigF in sigma factor regulation in Mycobacterium tuberculosis, we used chemically inducible recombinant strains to conditionally overexpress sigB and sigF. Using whole genomic microarray analysis and quantitative reverse transcription-PCR, we investigated the resulting global transcriptional changes after sigB induction, and we specifically tested the relative expression of other sigma factor genes after knock-in expression of sigB and sigF. Overexpressio...

  15. [Genome similarity of Baikal omul and sig].

    Science.gov (United States)

    Bychenko, O S; Sukhanova, L V; Ukolova, S S; Skvortsov, T A; Potapov, V K; Azhikina, T L; Sverdlov, E D

    2009-01-01

    Two members of the Baikal sig family, a lake sig (Coregonus lavaretus baicalensis Dybovsky) and omul (C. autumnalis migratorius Georgi), are close relatives that diverged from the same ancestor 10-20 thousand years ago. In this work, we studied genomic polymorphism of these two fish species. The method of subtraction hybridization (SH) did not reveal the presence of extended sequences in the sig genome and their absence in the omul genome. All the fragments found by SH corresponded to polymorphous noncoding genome regions varying in mononucleotide substitutions and short deletions. Many of them are mapped close to genes of the immune system and have regions identical to the Tc-1-like transposons abundant among fish, whose transcription activity may affect the expression of adjacent genes. Thus, we showed for the first time that genetic differences between Baikal sig family members are extremely small and cannot be revealed by the SH method. This is another endorsement of the hypothesis on the close relationship between Baikal sig and omul and their evolutionarily recent divergence from a common ancestor.

  16. Query Language for Complex Similarity Queries

    CERN Document Server

    Budikova, Petra; Zezula, Pavel

    2012-01-01

    For complex data types such as multimedia, traditional data management methods are not suitable. Instead of attribute matching approaches, access methods based on object similarity are becoming popular. Recently, this resulted in an intensive research of indexing and searching methods for the similarity-based retrieval. Nowadays, many efficient methods are already available, but using them to build an actual search system still requires specialists that tune the methods and build the system manually. Several attempts have already been made to provide a more convenient high-level interface in a form of query languages for such systems, but these are limited to support only basic similarity queries. In this paper, we propose a new language that allows to formulate content-based queries in a flexible way, taking into account the functionality offered by a particular search engine in use. To ensure this, the language is based on a general data model with an abstract set of operations. Consequently, the language s...

  17. Studerende - sig ja til international studieaktivitet

    DEFF Research Database (Denmark)

    Nielsen, Tove Lise; Steffens, Freja

    2010-01-01

    Vi vil gerne vække interesse for udfordring af faglige vaner og rutiner, et internationalt netværk, sproglig udvikling i arbejdstiden og et kulturelt boost. Læs fire indlæg om udveksling og sig ja til mere international studieaktivitet og erfaring.......Vi vil gerne vække interesse for udfordring af faglige vaner og rutiner, et internationalt netværk, sproglig udvikling i arbejdstiden og et kulturelt boost. Læs fire indlæg om udveksling og sig ja til mere international studieaktivitet og erfaring....

  18. Kan 1500 skoler skille sig ud?

    DEFF Research Database (Denmark)

    Mehlsen, Camilla

    2012-01-01

    Featureartikel: Kampen om opmærksomhed er skudt i gang, og uddannelsesinstitutioner er med i ræset om at være noget særligt. Men kan alle 1500 folkeskoler skille sig ud? Og betyder det, at vi mister den fælles skole?......Featureartikel: Kampen om opmærksomhed er skudt i gang, og uddannelsesinstitutioner er med i ræset om at være noget særligt. Men kan alle 1500 folkeskoler skille sig ud? Og betyder det, at vi mister den fælles skole?...

  19. Using Social Impact Games (SIGS) to Support Constructivist Learning: Creating a Foundation for Effective Use in the Secondary Social Studies Education

    Science.gov (United States)

    Ray, Beverly; Faure, Caroline; Kelle, Fay

    2013-01-01

    This paper examines how Social Impact Games (SIGs) can provide important instructional support in secondary social studies classrooms. When used within the framework of the constructivist teaching philosophy and teaching methods, as recommended by the NCSS (2010), SIGs have the potential to hone critical thinking, collaboration, and problem…

  20. Using Social Impact Games (SIGS) to Support Constructivist Learning: Creating a Foundation for Effective Use in the Secondary Social Studies Education

    Science.gov (United States)

    Ray, Beverly; Faure, Caroline; Kelle, Fay

    2013-01-01

    This paper examines how Social Impact Games (SIGs) can provide important instructional support in secondary social studies classrooms. When used within the framework of the constructivist teaching philosophy and teaching methods, as recommended by the NCSS (2010), SIGs have the potential to hone critical thinking, collaboration, and problem…

  1. Indexing for summary queries

    DEFF Research Database (Denmark)

    Yi, Ke; Wang, Lu; Wei, Zhewei

    2014-01-01

    ), of a particular attribute of these records. Aggregation queries are especially useful in business intelligence and data analysis applications where users are interested not in the actual records, but some statistics of them. They can also be executed much more efficiently than reporting queries, by embedding...

  2. WATERS Expert Query Tool

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Expert Query Tool is a web-based reporting tool using the EPA’s WATERS database.There are just three steps to using Expert Query:1. View Selection – Choose what...

  3. Mastering jQuery

    CERN Document Server

    Libby, Alex

    2015-01-01

    If you are a developer who is already familiar with using jQuery and wants to push your skill set further, then this book is for you. The book assumes an intermediate knowledge level of jQuery, JavaScript, HTML5, and CSS.

  4. Indexing for summary queries

    DEFF Research Database (Denmark)

    Yi, Ke; Wang, Lu; Wei, Zhewei

    2014-01-01

    ), of a particular attribute of these records. Aggregation queries are especially useful in business intelligence and data analysis applications where users are interested not in the actual records, but some statistics of them. They can also be executed much more efficiently than reporting queries, by embedding...

  5. The Semantic Retrieval of Spatial Data Service Based on Ontology in SIG

    Science.gov (United States)

    Sun, S.; Liu, D.; Li, G.; Yu, W.

    2011-08-01

    The research of SIG (Spatial Information Grid) mainly solves the problem of how to connect different computing resources, so that users can use all the resources in the Grid transparently and seamlessly. In SIG, spatial data service is described in some kinds of specifications, which use different meta-information of each kind of services. This kind of standardization cannot resolve the problem of semantic heterogeneity, which may limit user to obtain the required resources. This paper tries to solve two kinds of semantic heterogeneities (name heterogeneity and structure heterogeneity) in spatial data service retrieval based on ontology, and also, based on the hierarchical subsumption relationship among concept in ontology, the query words can be extended and more resource can be matched and found for user. These applications of ontology in spatial data resource retrieval can help to improve the capability of keyword matching, and find more related resources.

  6. SigWinR; the SigWin-detector updated and ported to R

    Directory of Open Access Journals (Sweden)

    Breit Timo M

    2009-10-01

    Full Text Available Abstract Background Our SigWin-detector discovers significantly enriched windows of (genomic elements in any sequence of values (genes or other genomic elements in a DNA sequence in a fast and reproducible way. However, since it is grid based, only (life scientists with access to the grid can use this tool. Therefore and on request, we have developed the SigWinR package which makes the SigWin-detector available to a much wider audience. At the same time, we have introduced several improvements to its algorithm as well as its functionality, based on the feedback of SigWin-detector end users. Findings To allow usage of the SigWin-detector on a desktop computer, we have rewritten it as a package for R: SigWinR. R is a free and widely used multi platform software environment for statistical computing and graphics. The package can be installed and used on all platforms for which R is available. The improvements involve: a visualization of the input-sequence values supporting the interpretation of Ridgeograms; a visualization allowing for an easy interpretation of enriched or depleted regions in the sequence using windows of pre-defined size; an option that allows the analysis of circular sequences, which results in rectangular Ridgeograms; an application to identify regions of co-altered gene expression (ROCAGEs with a real-life biological use-case; adaptation of the algorithm to allow analysis of non-regularly sampled data using a constant window size in physical space without resampling the data. To achieve this, support for analysis of windows with an even number of elements was added. Conclusion By porting the SigWin-detector as an R package, SigWinR, improving its algorithm and functionality combined with adequate performance, we have made SigWin-detector more useful as well as more easily accessible to scientists without a grid infrastructure.

  7. Når landbruget idylliserer sig selv

    DEFF Research Database (Denmark)

    Borkfelt, Sune; Kondrup, Sara Vincentzen; Gjerris, Mickey

    2013-01-01

    Den politiske forbruger bliver ganske enkelt efterladt uden magt, hvis virksomheder gennem reklamekampagner markedsfører produkter som tættere på naturen, når det ved nærmere eftersyn viser sig, at disse produkter i langt højere grad er tættere på fabrikken....

  8. Declarative Visualization Queries

    Science.gov (United States)

    Pinheiro da Silva, P.; Del Rio, N.; Leptoukh, G. G.

    2011-12-01

    In an ideal interaction with machines, scientists may prefer to write declarative queries saying "what" they want from a machine than to write code stating "how" the machine is going to address the user request. For example, in relational database, users have long relied on specifying queries using Structured Query Language (SQL), a declarative language to request data results from a database management system. In the context of visualizations, we see that users are still writing code based on complex visualization toolkit APIs. With the goal of improving the scientists' experience of using visualization technology, we have applied this query-answering pattern to a visualization setting, where scientists specify what visualizations they want generated using a declarative SQL-like notation. A knowledge enhanced management system ingests the query and knows the following: (1) know how to translate the query into visualization pipelines; and (2) how to execute the visualization pipelines to generate the requested visualization. We define visualization queries as declarative requests for visualizations specified in an SQL like language. Visualization queries specify what category of visualization to generate (e.g., volumes, contours, surfaces) as well as associated display attributes (e.g., color and opacity), without any regards for implementation, thus allowing scientists to remain partially unaware of a wide range of visualization toolkit (e.g., Generic Mapping Tools and Visualization Toolkit) specific implementation details. Implementation details are only a concern for our knowledge-based visualization management system, which uses both the information specified in the query and knowledge about visualization toolkit functions to construct visualization pipelines. Knowledge about the use of visualization toolkits includes what data formats the toolkit operates on, what formats they output, and what views they can generate. Visualization knowledge, which is not

  9. Orthogonal Query Expansion

    CERN Document Server

    Ackerman, Margareta; Lopez-Ortiz, Alejandro

    2011-01-01

    Over the last fifteen years, web searching has seen tremendous improvements. Starting from a nearly random collection of matching pages in 1995, today, search engines tend to satisfy the user's informational need on well-formulated queries. One of the main remaining challenges is to satisfy the users' needs when they provide a poorly formulated query. When the pages matching the user's original keywords are judged to be unsatisfactory, query expansion techniques are used to alter the result set. These techniques find keywords that are similar to the keywords given by the user, which are then appended to the original query leading to a perturbation of the result set. However, when the original query is sufficiently ill-posed, the user's informational need is best met using entirely different keywords, and a small perturbation of the original result set is bound to fail. We propose a novel approach that is not based on the keywords of the original query. We intentionally seek out orthogonal queries, which are r...

  10. Moving Spatial Keyword Queries

    DEFF Research Database (Denmark)

    Wu, Dingming; Yiu, Man Lung; Jensen, Christian S.

    2013-01-01

    Web users and content are increasingly being geo-positioned. This development gives prominence to spatial keyword queries, which involve both the locations and textual descriptions of content. We study the efficient processing of continuously moving top-k spatial keyword (MkSK) queries over spatial...... text data. State-of-the-art solutions for moving queries employ safe zones that guarantee the validity of reported results as long as the user remains within the safe zone associated with a result. However, existing safe-zone methods focus solely on spatial locations and ignore text relevancy. We...

  11. jQuery cookbook

    CERN Document Server

    2010-01-01

    jQuery simplifies building rich, interactive web frontends. Getting started with this JavaScript library is easy, but it can take years to fully realize its breadth and depth; this cookbook shortens the learning curve considerably. With these recipes, you'll learn patterns and practices from 19 leading developers who use jQuery for everything from integrating simple components into websites and applications to developing complex, high-performance user interfaces. Ideal for newcomers and JavaScript veterans alike, jQuery Cookbook starts with the basics and then moves to practical use cases w

  12. Range-Clustering Queries

    OpenAIRE

    Abrahamsen, Mikkel; de Berg, Mark; Buchin, Kevin; Mehr, Mehran; Mehrabi, Ali D.

    2017-01-01

    In a geometric $k$-clustering problem the goal is to partition a set of points in $\\mathbb{R}^d$ into $k$ subsets such that a certain cost function of the clustering is minimized. We present data structures for orthogonal range-clustering queries on a point set $S$: given a query box $Q$ and an integer $k>2$, compute an optimal $k$-clustering for $S\\setminus Q$. We obtain the following results. We present a general method to compute a $(1+\\epsilon)$-approximation to a range-clustering query, ...

  13. Benchmarking Query Execution Robustness

    Science.gov (United States)

    Wiener, Janet L.; Kuno, Harumi; Graefe, Goetz

    Benchmarks that focus on running queries on a well-tuned database system ignore a long-standing problem: adverse runtime conditions can cause database system performance to vary widely and unexpectedly. When the query execution engine does not exhibit resilience to these adverse conditions, addressing the resultant performance problems can contribute significantly to the total cost of ownership for a database system in over-provisioning, lost efficiency, and increased human administrative costs. For example, focused human effort may be needed to manually invoke workload management actions or fine-tune the optimization of specific queries.

  14. Localized Geometric Query Problems

    CERN Document Server

    Augustine, John; Maheshwari, Anil; Nandy, Subhas C; Roy, Sasanka; Sarvattomananda, Swami

    2011-01-01

    A new class of geometric query problems are studied in this paper. We are required to preprocess a set of geometric objects $P$ in the plane, so that for any arbitrary query point $q$, the largest circle that contains $q$ but does not contain any member of $P$, can be reported efficiently. The geometric sets that we consider are point sets and boundaries of simple polygons.

  15. Querying JSON Streams

    OpenAIRE

    Bo, Yang

    2010-01-01

    A data stream management system (DSMS) is similar to a database management system (DBMS) but can search data directly in on-line streams. Using its mediator-wrapper approach, the extensible database system, Amos II, allows different kinds of distributed data resource to be queried. It has been extended with a stream datatype to query possibly infinite streams, which provides DSMS functionality. Nowadays, more and more web applications start to offer their services in JSON format which is a te...

  16. Critical Role of a Single Position in the −35 Element for Promoter Recognition by Mycobacterium tuberculosis SigE and SigH▿

    OpenAIRE

    Song, Taeksun; Song, Seung-Eun; Raman, Sahadevan; Anaya, Mauricio; Robert N Husson

    2008-01-01

    Mycobacterial SigE and SigH both initiate transcription from the sigB promoter, suggesting that they recognize similar sequences. Through mutational and primer extension analyses, we determined that SigE and SigH recognize nearly identical promoters, with differences at the 3′ end of the −35 element distinguishing between SigE- and SigH-dependent promoters.

  17. Inverse Queries For Multidimensional Spaces

    CERN Document Server

    Bernecker, Thomas; Kriegel, Hans-Peter; Mamoulis, Nikos; Renz, Matthias; Zhang, Shiming; Züfle, Andreas

    2011-01-01

    Traditional spatial queries return, for a given query object $q$, all database objects that satisfy a given predicate, such as epsilon range and $k$-nearest neighbors. This paper defines and studies {\\em inverse} spatial queries, which, given a subset of database objects $Q$ and a query predicate, return all objects which, if used as query objects with the predicate, contain $Q$ in their result. We first show a straightforward solution for answering inverse spatial queries for any query predicate. Then, we propose a filter-and-refinement framework that can be used to improve efficiency. We show how to apply this framework on a variety of inverse queries, using appropriate space pruning strategies. In particular, we propose solutions for inverse epsilon range queries, inverse $k$-nearest neighbor queries, and inverse skyline queries. Our experiments show that our framework is significantly more efficient than naive approaches.

  18. Orion Scripted Interface Generator (OrionSIG)

    Science.gov (United States)

    Dooling, Robert J.

    2013-01-01

    The Orion spacecraft undergoing development at NASA and Lockheed Martin aims to launch the first humans to set foot on asteroids and Mars.' Sensors onboard Orion must transmit back to Earth astronomical amounts of data recording almost everything in 50,231 lb. (22,784 kg)2 of spacecraft, down to the temperatures, voltages, or torsions of even the most minor components. This report introduces the new Orion Scripted Interface Generator (OrionSIG) software created by summer 2013 NASA interns Robert Dooling and Samuel Harris. OrionSIG receives a list of Orion variables and produces a script to graph these measurements regardless of their size or type. The program also accepts many other input options to manipulate displays, such as limits on the graph's range or commands to graph different values in a reverse sawtooth wave. OrionSIG paves the way for monitoring stations on Earth to process, display, and test Orion data much more efficiently, a helpful asset in preparation for Orion's first test mission in 2014. Figure I.

  19. jQuery Mobile

    CERN Document Server

    Reid, Jon

    2011-01-01

    Native apps have distinct advantages, but the future belongs to mobile web apps that function on a broad range of smartphones and tablets. Get started with jQuery Mobile, the touch-optimized framework for creating apps that look and behave consistently across many devices. This concise book provides HTML5, CSS3, and JavaScript code examples, screen shots, and step-by-step guidance to help you build a complete working app with jQuery Mobile. If you're already familiar with the jQuery JavaScript library, you can use your existing skills to build cross-platform mobile web apps right now. This b

  20. XPath Whole Query Optimization

    CERN Document Server

    Maneth, Sebastian

    2010-01-01

    Previous work reports about SXSI, a fast XPath engine which executes tree automata over compressed XML indexes. Here, reasons are investigated why SXSI is so fast. It is shown that tree automata can be used as a general framework for fine grained XML query optimization. We define the "relevant nodes" of a query as those nodes that a minimal automaton must touch in order to answer the query. This notion allows to skip many subtrees during execution, and, with the help of particular tree indexes, even allows to skip internal nodes of the tree. We efficiently approximate runs over relevant nodes by means of on-the-fly removal of alternation and non-determinism of (alternating) tree automata. We also introduce many implementation techniques which allows us to efficiently evaluate tree automata, even in the absence of special indexes. Through extensive experiments, we demonstrate the impact of the different optimization techniques.

  1. Code query by example

    Science.gov (United States)

    Vaucouleur, Sebastien

    2011-02-01

    We introduce code query by example for customisation of evolvable software products in general and of enterprise resource planning systems (ERPs) in particular. The concept is based on an initial empirical study on practices around ERP systems. We motivate our design choices based on those empirical results, and we show how the proposed solution helps with respect to the infamous upgrade problem: the conflict between the need for customisation and the need for upgrade of ERP systems. We further show how code query by example can be used as a form of lightweight static analysis, to detect automatically potential defects in large software products. Code query by example as a form of lightweight static analysis is particularly interesting in the context of ERP systems: it is often the case that programmers working in this field are not computer science specialists but more of domain experts. Hence, they require a simple language to express custom rules.

  2. Learning jQuery

    CERN Document Server

    Chaffer, Jonathan

    2013-01-01

    Step through each of the core concepts of the jQuery library, building an overall picture of its capabilities. Once you have thoroughly covered the basics, the book returns to each concept to cover more advanced examples and techniques.This book is for web designers who want to create interactive elements for their designs, and for developers who want to create the best user interface for their web applications. Basic JavaScript programming and knowledge of HTML and CSS is required. No knowledge of jQuery is assumed, nor is experience with any other JavaScript libraries.

  3. KoralQuery -- A General Corpus Query Protocol

    DEFF Research Database (Denmark)

    Bingel, Joachim; Diewald, Nils

    2015-01-01

    The task-oriented and format-driven development of corpus query systems has led to the creation of numerous corpus query languages (QLs) that vary strongly in expressiveness and syntax. This is a severe impediment for the interoperability of corpus analysis systems, which lack a common protocol....... In this paper, we present KoralQuery, a JSON-LD based general corpus query protocol, aiming to be independent of particular QLs, tasks and corpus formats. In addition to describing the system of types and operations that KoralQuery is built on, we exemplify the representation of corpus queries in the serialized...

  4. User perspectives on query difficulty

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Schütze, Hinrich

    2011-01-01

    , or to statistical and linguistic features of the queries that may render them difficult. This work addresses query difficulty from a different angle, namely the users’ own perspectives on query difficulty. Two research questions are asked: (1) Are users aware that the query they submit to an IR system may......The difficulty of a user query can affect the performance of Information Retrieval (IR) systems. What makes a query difficult and how one may predict this is an active research area, focusing mainly on factors relating to the retrieval algorithm, to the properties of the retrieval data...... for synthesising the user-assessed causes of query difficulty through opinion fusion into an overall assessment of query difficulty. The resulting assessments of query difficulty are found to agree notably more to the TREC categories than the direct user assessments....

  5. Spatial Keyword Querying

    DEFF Research Database (Denmark)

    Cao, Xin; Chen, Lisi; Cong, Gao;

    2012-01-01

    The web is increasingly being used by mobile users. In addition, it is increasingly becoming possible to accurately geo-position mobile users and web content. This development gives prominence to spatial web data management. Specifically, a spatial keyword query takes a user location and user-sup...... different kinds of functionality as well as the ideas underlying their definition....

  6. Spatial Keyword Query Processing

    DEFF Research Database (Denmark)

    Chen, Lisi; Jensen, Christian S.; Wu, Dingming

    2013-01-01

    an all-around survey of 12 state- of-the-art geo-textual indices. We propose a benchmark that en- ables the comparison of the spatial keyword query performance. We also report on the findings obtained when applying the bench- mark to the indices, thus uncovering new insights that may guide index...

  7. Conceptual querying through ontologies

    DEFF Research Database (Denmark)

    Andreasen, Troels; Bulskov, Henrik

    2009-01-01

    We present here ail approach to conceptual querying where the aim is, given a collection of textual database objects or documents, to target an abstraction of the entire database content in terms of the concepts appearing in documents, rather than the documents in the collection. The approach is ...

  8. XIRAF: Ultimate Forensic Querying

    NARCIS (Netherlands)

    Alink, W.; Bhoedjang, R.; Vries, A.P. de; Boncz, P.A.

    2006-01-01

    This paper describes a novel, XML-based approach towards managing and querying forensic traces extracted from digital evidence. This approach has been implemented in XIRAF, a prototype system for forensic analysis. XIRAF systematically applies forensic analysis tools to evidence files (e.g., hard di

  9. Query Driven Visualization

    CERN Document Server

    Buddelmeijer, Hugo

    2011-01-01

    The request driven way of deriving data in Astro-WISE is extended to a query driven way of visualization. This allows scientists to focus on the science they want to perform, because all administration of their data is automated. This can be done over an abstraction layer that enhances control and flexibility for the scientist.

  10. Flexible Query Answering Systems

    DEFF Research Database (Denmark)

    -computer interaction. The special track covers some some specific and, typically, newer fields, namely: environmental scanning for strategic early warning; generating linguistic descriptions of data; advances in fuzzy querying and fuzzy databases: theory and applications; fusion and ensemble techniques for on......-line learning on data streams; and intelligent information extraction from texts....

  11. Flexible Query Answering Systems

    DEFF Research Database (Denmark)

    This book constitutes the refereed proceedings of the 12th International Conference on Flexible Query Answering Systems, FQAS 2017, held in London, UK, in June 2017. The 21 full papers presented in this book together with 4 short papers were carefully reviewed and selected from 43 submissions...

  12. Learning via Query Synthesis

    KAUST Repository

    Alabdulmohsin, Ibrahim Mansour

    2017-05-07

    Active learning is a subfield of machine learning that has been successfully used in many applications. One of the main branches of active learning is query synthe- sis, where the learning agent constructs artificial queries from scratch in order to reveal sensitive information about the underlying decision boundary. It has found applications in areas, such as adversarial reverse engineering, automated science, and computational chemistry. Nevertheless, the existing literature on membership query synthesis has, generally, focused on finite concept classes or toy problems, with a limited extension to real-world applications. In this thesis, I develop two spectral algorithms for learning halfspaces via query synthesis. The first algorithm is a maximum-determinant convex optimization method while the second algorithm is a Markovian method that relies on Khachiyan’s classical update formulas for solving linear programs. The general theme of these methods is to construct an ellipsoidal approximation of the version space and to synthesize queries, afterward, via spectral decomposition. Moreover, I also describe how these algorithms can be extended to other settings as well, such as pool-based active learning. Having demonstrated that halfspaces can be learned quite efficiently via query synthesis, the second part of this thesis proposes strategies for mitigating the risk of reverse engineering in adversarial environments. One approach that can be used to render query synthesis algorithms ineffective is to implement a randomized response. In this thesis, I propose a semidefinite program (SDP) for learning a distribution of classifiers, subject to the constraint that any individual classifier picked at random from this distributions provides reliable predictions with a high probability. This algorithm is, then, justified both theoretically and empirically. A second approach is to use a non-parametric classification method, such as similarity-based classification. In this

  13. Google BigQuery analytics

    CERN Document Server

    Tigani, Jordan

    2014-01-01

    How to effectively use BigQuery, avoid common mistakes, and execute sophisticated queries against large datasets Google BigQuery Analytics is the perfect guide for business and data analysts who want the latest tips on running complex queries and writing code to communicate with the BigQuery API. The book uses real-world examples to demonstrate current best practices and techniques, and also explains and demonstrates streaming ingestion, transformation via Hadoop in Google Compute engine, AppEngine datastore integration, and using GViz with Tableau to generate charts of query results. In addit

  14. SigWinR; the SigWin-detector updated and ported to R

    NARCIS (Netherlands)

    de Leeuw, W.C.; Rauwerda, H.; Inda, M.A.; Bruning, O.; Breit, T.M.

    2009-01-01

    Background Our SigWin-detector discovers significantly enriched windows of (genomic) elements in any sequence of values (genes or other genomic elements in a DNA sequence) in a fast and reproducible way. However, since it is grid based, only (life) scientists with access to the grid can use this too

  15. A Network SIG is born, DECUS (Switzerland) Newsletter, May 1990

    CERN Document Server

    Heagerty, Denise

    1990-01-01

    This article announces the formation of a Swiss DECUS (DEC Users Group) Network SIG in May 1990. The goal of this SIG is to help Swiss DECnet managers to plan transition from their proprietary DECnet Phase IV networks (e.g. the HEP/SPAN DECnet) to open networks based on DECnet Phase V/OSI. The SIG also proposes to address integration with UNIX based workstations using the Internet's TCP/IP protocols.

  16. User perspectives on query difficulty

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Schütze, Hinrich

    2011-01-01

    The difficulty of a user query can affect the performance of Information Retrieval (IR) systems. What makes a query difficult and how one may predict this is an active research area, focusing mainly on factors relating to the retrieval algorithm, to the properties of the retrieval data, or to sta......The difficulty of a user query can affect the performance of Information Retrieval (IR) systems. What makes a query difficult and how one may predict this is an active research area, focusing mainly on factors relating to the retrieval algorithm, to the properties of the retrieval data......, or to statistical and linguistic features of the queries that may render them difficult. This work addresses query difficulty from a different angle, namely the users’ own perspectives on query difficulty. Two research questions are asked: (1) Are users aware that the query they submit to an IR system may...

  17. COMPLEX QUERY AND METADATA

    OpenAIRE

    Nakatoh, Tetsuya; Omori, Keisuke; Yamada, Yasuhiro; Hirokawa, Sachio

    2003-01-01

    We are developing a search system DAISEn which integrates multiple search engines and generates a metasearch engine automatically. The target search engines of DAISEn are not general search engines, but are search engines specialized in some area. Integration of such engines yields efficiency and quality. There are search engines of new type which accept complex query and return structured data. Integration of such search engines is much harder than that of simple search engines which accept ...

  18. Querying genomic databases

    Energy Technology Data Exchange (ETDEWEB)

    Baehr, A.; Hagstrom, R.; Joerg, D.; Overbeek, R.

    1991-09-01

    A natural-language interface has been developed that retrieves genomic information by using a simple subset of English. The interface spares the biologist from the task of learning database-specific query languages and computer programming. Currently, the interface deals with the E. coli genome. It can, however, be readily extended and shows promise as a means of easy access to other sequenced genomic databases as well.

  19. A Semantic Graph Query Language

    Energy Technology Data Exchange (ETDEWEB)

    Kaplan, I L

    2006-10-16

    Semantic graphs can be used to organize large amounts of information from a number of sources into one unified structure. A semantic query language provides a foundation for extracting information from the semantic graph. The graph query language described here provides a simple, powerful method for querying semantic graphs.

  20. Query optimization over crowdsourced data

    KAUST Repository

    Park, Hyunjung

    2013-08-26

    Deco is a comprehensive system for answering declarative queries posed over stored relational data together with data obtained on-demand from the crowd. In this paper we describe Deco\\'s cost-based query optimizer, building on Deco\\'s data model, query language, and query execution engine presented earlier. Deco\\'s objective in query optimization is to find the best query plan to answer a query, in terms of estimated monetary cost. Deco\\'s query semantics and plan execution strategies require several fundamental changes to traditional query optimization. Novel techniques incorporated into Deco\\'s query optimizer include a cost model distinguishing between "free" existing data versus paid new data, a cardinality estimation algorithm coping with changes to the database state during query execution, and a plan enumeration algorithm maximizing reuse of common subplans in a setting that makes reuse challenging. We experimentally evaluate Deco\\'s query optimizer, focusing on the accuracy of cost estimation and the efficiency of plan enumeration.

  1. Mastering jQuery mobile

    CERN Document Server

    Lambert, Chip

    2015-01-01

    You've started down the path of jQuery Mobile, now begin mastering some of jQuery Mobile's higher level topics. Go beyond jQuery Mobile's documentation and master one of the hottest mobile technologies out there. Previous JavaScript and PHP experience can help you get the most out of this book.

  2. A query index for continuous queries on RFID streaming data

    Institute of Scientific and Technical Information of China (English)

    Jaekwan PARK; Bonghee HONG; Chaehoon BAN

    2008-01-01

    RFID middleware collects and filters RFID streaming data to process applications' requests called continuous queries, because they are executed continuously during tag movement. Several approaches to building an index on queries rather than data records, called a query index, have been proposed to evaluate continuous queries over streaming data. EPCgiobal proposed an Event Cycle Specification (ECSpec) model, which is a de facto standard query interface for RFID applications. Continuous queries based on ECSpec consist of a large number of segments that represent the query conditions. The problem when using any of the existing query indexes on these continuous queries is that it takes a long time to build the index, because it is necessary to insert a large number of segments into the index. To solve this problem, we propose a transform method that converts a group of segments into compressed data. We also propose an efficient query index scheme for the transformed space. Comparing with existing query indexes, the performance of proposed index outperforms the others on various datasets.

  3. TopSig: Topology Preserving Document Signatures

    CERN Document Server

    Geva, Shlomo

    2012-01-01

    Performance comparisons between File Signatures and Inverted Files for text retrieval have previously shown several significant shortcomings of file signatures relative to inverted files. The inverted file approach underpins most state-of-the-art search engine algorithms, such as Language and Probabilistic models. It has been widely accepted that traditional file signatures are inferior alternatives to inverted files. This paper describes TopSig, a new approach to the construction of file signatures. Many advances in semantic hashing and dimensionality reduction have been made in recent times, but these were not so far linked to general purpose, signature file based, search engines. This paper introduces a different signature file approach that builds upon and extends these recent advances. We are able to demonstrate significant improvements in the performance of signature file based indexing and retrieval, performance that is comparable to that of state of the art inverted file based systems, including Langu...

  4. Bayesian Query-Focused Summarization

    CERN Document Server

    Daumé, Hal

    2009-01-01

    We present BayeSum (for ``Bayesian summarization''), a model for sentence extraction in query-focused summarization. BayeSum leverages the common case in which multiple documents are relevant to a single query. Using these documents as reinforcement for query terms, BayeSum is not afflicted by the paucity of information in short queries. We show that approximate inference in BayeSum is possible on large data sets and results in a state-of-the-art summarization system. Furthermore, we show how BayeSum can be understood as a justified query expansion technique in the language modeling for IR framework.

  5. Kan migration føre noget godt med sig?

    DEFF Research Database (Denmark)

    Pærregaard, Karsten

    2008-01-01

    Som regel har den globale migration rod i økonomisk ulighed, social uretfærdighed eller politisk forfølgelse i de lande, hvor migranterne kommer fra og for mange migranter. Mange år efter, at disse har fundet sig til rette og tilpasset sig de samfund, som har migreret til, fortsætter deres families...

  6. Query log analysis of an electronic health record search engine.

    Science.gov (United States)

    Yang, Lei; Mei, Qiaozhu; Zheng, Kai; Hanauer, David A

    2011-01-01

    We analyzed a longitudinal collection of query logs of a full-text search engine designed to facilitate information retrieval in electronic health records (EHR). The collection, 202,905 queries and 35,928 user sessions recorded over a course of 4 years, represents the information-seeking behavior of 533 medical professionals, including frontline practitioners, coding personnel, patient safety officers, and biomedical researchers for patient data stored in EHR systems. In this paper, we present descriptive statistics of the queries, a categorization of information needs manifested through the queries, as well as temporal patterns of the users' information-seeking behavior. The results suggest that information needs in medical domain are substantially more sophisticated than those that general-purpose web search engines need to accommodate. Therefore, we envision there exists a significant challenge, along with significant opportunities, to provide intelligent query recommendations to facilitate information retrieval in EHR.

  7. Instant Cassandra query language

    CERN Document Server

    Singh, Amresh

    2013-01-01

    Get to grips with a new technology, understand what it is and what it can do for you, and then get to work with the most important features and tasks. It's an Instant Starter guide.Instant Cassandra Query Language is great for those who are working with Cassandra databases and who want to either learn CQL to check data from the console or build serious applications using CQL. If you're looking for something that helps you get started with CQL in record time and you hate the idea of learning a new language syntax, then this book is for you.

  8. An Efficient Algorithm for Query Transformation in Semantic Query Optimization

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Semantic query optimization (SQO) is comparatively a recent approach for the transformation of given query into equivalent alternative query using matching rules in order to select an optimal query based on the costs of executing alternative queries. The key aspect of the algorithm proposed here is that previous proposed SQO techniques can be considered equally in the uniform cost model, with which optimization opportunities will not be missed. At the same time, the authors used the implication closure to guarantee that any matched rule will not be lost. The authors implemented their algorithm for the optimization of decomposed sub-query in local database in Multi-Database Integrator (MDBI), which is a multidatabase project. The experimental results verify that this algorithm is effective in the process of SQO.

  9. Visualizing multidimensional query results using animation

    Science.gov (United States)

    Sawant, Amit P.; Healey, Christopher G.

    2008-01-01

    Effective representation of large, complex collections of information (datasets) presents a difficult challenge. Visualization is a solution that uses a visual interface to support efficient analysis and discovery within the data. Our primary goal in this paper is a technique that allows viewers to compare multiple query results representing user-selected subsets of a multidimensional dataset. We present an algorithm that visualizes multidimensional information along a space-filling spiral. Graphical glyphs that vary their position, color, and texture appearance are used to represent attribute values for the data elements in each query result. Guidelines from human perception allow us to construct glyphs that are specifically designed to support exploration, facilitate the discovery of trends and relationships both within and between data elements, and highlight exceptions. A clustering algorithm applied to a user-chosen ranking attribute bundles together similar data elements. This encapsulation is used to show relationships across different queries via animations that morph between query results. We apply our techniques to the MovieLens recommender system, to demonstrate their applicability in a real-world environment, and then conclude with a simple validation experiment to identify the strengths and limitations of our design, compared to a traditional side-by-side visualization.

  10. Optimizing Phylogenetic Queries for Performance.

    Science.gov (United States)

    Jamil, Hasan M

    2017-08-24

    The vast majority of phylogenetic databases do not support declarative querying using which their contents can be flexibly and conveniently accessed and the template based query interfaces they support do not allow arbitrary speculative queries. They therefore also do not support query optimization leveraging unique phylogeny properties. While a small number of graph query languages such as XQuery, Cypher and GraphQL exist for computer savvy users, most are too general and complex to be useful for biologists, and too inefficient for large phylogeny querying. In this paper, we discuss a recently introduced visual query language, called PhyQL, that leverages phylogeny specific properties to support essential and powerful constructs for a large class of phylogentic queries. We develop a range of pruning aids, and propose a substantial set of query optimization strategies using these aids suitable for large phylogeny querying. A hybrid optimization technique that exploits a set of indices and ``graphlet" partitioning is discussed. A ``fail soonest" strategy is used to avoid hopeless processing and is shown to produce dividends. Possible novel optimization techniques yet to be explored are also discussed.

  11. Cooperative Answering of Fuzzy Queries

    Institute of Scientific and Technical Information of China (English)

    Narjes Hachani; Mohamed Ali Ben Hassine; Hanène Chettaoui; Habib Ounelli

    2009-01-01

    The majority of existing information systems deals with crisp data through crisp database systems. Traditional Database Management Systems (DBMS) have not taken into account imprecision so one can say there is some sort of lack of flexibility. The reason is that queries retrieve only elements which precisely match to the given Boolean query. That is, an element belongs to the result if the query is true for this element; otherwise, no answers are returned to the user. The aim of this paper is to present a cooperative approach to handling empty answers of fuzzy conjunctive queries by referring to the Formal Concept Analysis (FCA) theory and fuzzy logic. We present an architecture which combines FCA and databases. The processing of fuzzy queries allows detecting the minimal reasons of empty answers. We also use concept lattice in order to provide the user with the nearest answers in the case of a query failure.

  12. Ranking Queries on Uncertain Data

    CERN Document Server

    Hua, Ming

    2011-01-01

    Uncertain data is inherent in many important applications, such as environmental surveillance, market analysis, and quantitative economics research. Due to the importance of those applications and rapidly increasing amounts of uncertain data collected and accumulated, analyzing large collections of uncertain data has become an important task. Ranking queries (also known as top-k queries) are often natural and useful in analyzing uncertain data. Ranking Queries on Uncertain Data discusses the motivations/applications, challenging problems, the fundamental principles, and the evaluation algorith

  13. Research Issues in Mobile Querying

    DEFF Research Database (Denmark)

    Breunig, M.; Jensen, Christian Søndergaard; Klein, M.

    2004-01-01

    This document reports on key aspects of the discussions conducted within the working group. In particular, the document aims to offer a structured and somewhat digested summary of the group's discussions. The document first offers concepts that enable characterization of "mobile queries" as well...... as the types of systems that enable such queries. It explores the notion of context in mobile queries. The document ends with a few observations, mainly regarding challenges....

  14. Optimizing queries in distributed systems

    Directory of Open Access Journals (Sweden)

    Ion LUNGU

    2006-01-01

    Full Text Available This research presents the main elements of query optimizations in distributed systems. First, data architecture according with system level architecture in a distributed environment is presented. Then the architecture of a distributed database management system (DDBMS is described on conceptual level followed by the presentation of the distributed query execution steps on these information systems. The research ends with presentation of some aspects of distributed database query optimization and strategies used for that.

  15. Modeling Array Stations in SIG-VISA

    Science.gov (United States)

    Ding, N.; Moore, D.; Russell, S.

    2013-12-01

    We add support for array stations to SIG-VISA, a system for nuclear monitoring using probabilistic inference on seismic signals. Array stations comprise a large portion of the IMS network; they can provide increased sensitivity and more accurate directional information compared to single-component stations. Our existing model assumed that signals were independent at each station, which is false when lots of stations are close together, as in an array. The new model removes that assumption by jointly modeling signals across array elements. This is done by extending our existing Gaussian process (GP) regression models, also known as kriging, from a 3-dimensional single-component space of events to a 6-dimensional space of station-event pairs. For each array and each event attribute (including coda decay, coda height, amplitude transfer and travel time), we model the joint distribution across array elements using a Gaussian process that learns the correlation lengthscale across the array, thereby incorporating information of array stations into the probabilistic inference framework. To evaluate the effectiveness of our model, we perform ';probabilistic beamforming' on new events using our GP model, i.e., we compute the event azimuth having highest posterior probability under the model, conditioned on the signals at array elements. We compare the results from our probabilistic inference model to the beamforming currently performed by IMS station processing.

  16. Data Caching for XML Query

    Institute of Scientific and Technical Information of China (English)

    SU Fei; CI Lin-lin; ZHU Li-ping; ZHAO Xin-xin

    2006-01-01

    In order to apply the technique of data cache to extensible markup language (XML) database system, the XML-cache system to support data cache for XQuery is presented. According to the character of XML, the queries with nesting are normalized to facilitate the following operation. Based on the idea of incomplete tree, using the document type definition (DTD) schema tree and conditions from normalized XQuery, the results of previous queries are maintained to answer new queries, at the same time, the remainder queries are sent to XML database at the back. The results of experiment show all applications supported by XML database can use this technique to cache data for future use.

  17. From Questions to Queries

    Directory of Open Access Journals (Sweden)

    M. Drlík

    2007-12-01

    Full Text Available The extension of (Internet databases forceseveryone to become more familiar with techniques of datastorage and retrieval because users’ success often dependson their ability to pose right questions and to be able tointerpret their answers. University programs pay moreattention to developing database programming skills than todata exploitation skills. To educate our students to become“database users”, the authors intensively exploit supportivetools simplifying the production of database elements astables, queries, forms, reports, web pages, and macros.Videosequences demonstrating “standard operations” forcompleting them have been prepared to enhance out-ofclassroomlearning. The use of SQL and other professionaltools is reduced to the cases when the wizards are unable togenerate the intended construct.

  18. KoralQuery -- A General Corpus Query Protocol

    DEFF Research Database (Denmark)

    Bingel, Joachim; Diewald, Nils

    2015-01-01

    The task-oriented and format-driven development of corpus query systems has led to the creation of numerous corpus query languages (QLs) that vary strongly in expressiveness and syntax. This is a severe impediment for the interoperability of corpus analysis systems, which lack a common protocol...... format and illustrate use cases in the KorAP project....

  19. Salivary SIgA and dental caries activity.

    Science.gov (United States)

    Chawda, Jyoti G; Chaduvula, Nandini; Patel, Hemali R; Jain, Shikha S; Lala, Arti K

    2011-09-01

    This case-control study was conducted to determine the protective role of salivary secretory immunoglobulin A (SIgA) levels in the unstimulated whole saliva of dental caries active (Group I and II) and caries free children (Group III). Thirty children aged 4-8 years were selected. Their DMFT (Decayed Missing Filled teeth for permanent teeth) and/or df-t (decayed, filled teeth for deciduous teeth) scores were determined and the salivary SIgA levels were measured using Immunoturbidometry. SIgA levels of all three groups were in the the normal range of 4-30 mg/dL. The SIgA levels for both Group I and II were less than that in Group III (P=0.018 and P=0.0013, respectively).

  20. LocSigDB: a database of protein localization signals

    OpenAIRE

    Negi, Simarjeet; Pandey, Sanjit; Srinivasan, Satish M; Mohammed, Akram; Guda, Chittibabu

    2015-01-01

    LocSigDB (http://genome.unmc.edu/LocSigDB/) is a manually curated database of experimental protein localization signals for eight distinct subcellular locations; primarily in a eukaryotic cell with brief coverage of bacterial proteins. Proteins must be localized at their appropriate subcellular compartment to perform their desired function. Mislocalization of proteins to unintended locations is a causative factor for many human diseases; therefore, collection of known sorting signals will hel...

  1. … vandt sig Danmark al … - hvad mente Harald?

    DEFF Research Database (Denmark)

    Roesdahl, Else

    2013-01-01

    Diskussion af meningen med sætningen 'vandt sig Danmark al' på den store Jellingsten. Ordet 'vandt' tolkes i retning af nudansk 'vandt' og ikke som 'samlede', hvilket tit fremføres.......Diskussion af meningen med sætningen 'vandt sig Danmark al' på den store Jellingsten. Ordet 'vandt' tolkes i retning af nudansk 'vandt' og ikke som 'samlede', hvilket tit fremføres....

  2. Usability of XML Query Languages

    NARCIS (Netherlands)

    Graaumans, J.P.M.

    2005-01-01

    The eXtensible Markup Language (XML) is a markup language which enables re-use of information. Specific query languages for XML are developed to facilitate this. There are large differences between history, design goal, and syntax of the XML query languages. However, in practice these languages are

  3. The Semantics of Query Modification

    NARCIS (Netherlands)

    Hollink, V.; Tsikrika, T.; Vries, A.P. de

    2010-01-01

    We present a method that exploits `linked data' to determine semantic relations between consecutive user queries. Our method maps queries onto concepts in linked data and searches the linked data graph for direct or indirect relations between the concepts. By comparing relations between large number

  4. Querying Sentiment Development over Time

    DEFF Research Database (Denmark)

    Andreasen, Troels; Christiansen, Henning; Have, Christian Theil

    2013-01-01

    that measures how well a hypothesis characterizes a given time interval; the semantics is parameterized so it can be adjusted to different views of the data. EmoEpisodes is extended to a query language with variables standing for unknown topics and emotions, and the query-answering mechanism will return...

  5. Priming the Query Specification Process.

    Science.gov (United States)

    Toms, Elaine G.; Freund, Luanne

    2003-01-01

    Tests the use of questions as a technique in the query specification process. Using a within-subjects design, 48 people interacted with a modified Google interface to solve four information problems in four domains. Half the tasks were entered as typical keyword queries, and half as questions or statements. Results suggest the typical search box…

  6. jQuery Pocket Reference

    CERN Document Server

    Flanagan, David

    2010-01-01

    "As someone who uses jQuery on a regular basis, it was surprising to discover how much of the library I'm not using. This book is indispensable for anyone who is serious about using jQuery for non-trivial applications."-- Raffaele Cecco, longtime developer of video games, including Cybernoid, Exolon, and Stormlord jQuery is the "write less, do more" JavaScript library. Its powerful features and ease of use have made it the most popular client-side JavaScript framework for the Web. This book is jQuery's trusty companion: the definitive "read less, learn more" guide to the library. jQuery P

  7. Instant jQuery selectors

    CERN Document Server

    De Rosa, Aurelio

    2013-01-01

    Filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks. Instant jQuery Selectors follows a simple how-to format with recipes aimed at making you well versed with the wide range of selectors that jQuery has to offer through a myriad of examples.Instant jQuery Selectors is for web developers who want to delve into jQuery from its very starting point: selectors. Even if you're already familiar with the framework and its selectors, you could find several tips and tricks that you aren't aware of, especially about performance and how jQuery ac

  8. jQuery UI cookbook

    CERN Document Server

    Boduch, Adam

    2013-01-01

    Filled with a practical collection of recipes, jQuery UI Cookbook is full of clear, step-by-step instructions that will help you harness the powerful UI framework in jQuery. Depending on your needs, you can dip in and out of the Cookbook and its recipes, or follow the book from start to finish.If you are a jQuery UI developer looking to improve your existing applications, extract ideas for your new application, or to better understand the overall widget architecture, then jQuery UI Cookbook is a must-have for you. The reader should at least have a rudimentary understanding of what jQuery UI is

  9. Rediscovering Sig Socransky, the genius and his legacy.

    Science.gov (United States)

    Teles, R P; Teles, F R F; Loesche, W J; Listgarten, M; Fine, D; Lindhe, J; Malament, K; Haffajee, A D

    2012-05-01

    Some individuals make contributions so vital to their field of knowledge that their names become almost synonymous with that field. This is the case of Sig Socransky and the field of periodontal microbiology. Sig Socransky, or simply Sig, was born in Toronto, Canada and received his DDS degree from the University of Toronto in 1957. He studied microbiology and periodontology at Harvard, receiving a certificate in 1961. That same year he was recruited to work as a Research Associate at the Forsyth Dental Center. In 1968, he was nominated Senior Member of the Staff and Head of the Department of Periodontology. During his 50-year career at Forsyth, Sig published over 300 manuscripts, keeping an average of 7 publications per year. His work had an indelible impact in the fields of periodontology and oral microbiology. All these accomplishments pale in comparison with the impact that Sig had on a personal level. We have collected testimonials from some of his former students, closest collaborators, and friends in an attempt to give readers an insight into Sig's personality. We hope we can offer those who knew him through his work a glimpse of how it felt to interact with this remarkable individual.

  10. Query auto completion in information retrieval

    NARCIS (Netherlands)

    Cai, Fei

    2016-01-01

    Query auto completion is an important feature embedded into today's search engines. It can help users formulate queries which other people have searched for when he/she finishes typing the query prefix. Today's most sophisticated query auto completion approaches are based on the collected query logs

  11. In-context query reformulation for failing SPARQL queries

    Science.gov (United States)

    Viswanathan, Amar; Michaelis, James R.; Cassidy, Taylor; de Mel, Geeth; Hendler, James

    2017-05-01

    Knowledge bases for decision support systems are growing increasingly complex, through continued advances in data ingest and management approaches. However, humans do not possess the cognitive capabilities to retain a bird's-eyeview of such knowledge bases, and may end up issuing unsatisfiable queries to such systems. This work focuses on the implementation of a query reformulation approach for graph-based knowledge bases, specifically designed to support the Resource Description Framework (RDF). The reformulation approach presented is instance-and schema-aware. Thus, in contrast to relaxation techniques found in the state-of-the-art, the presented approach produces in-context query reformulation.

  12. The Query-commit Problem

    CERN Document Server

    Molinaro, Marco

    2011-01-01

    In the query-commit problem we are given a graph where edges have distinct probabilities of existing. It is possible to query the edges of the graph, and if the queried edge exists then its endpoints are irrevocably matched. The goal is to find a querying strategy which maximizes the expected size of the matching obtained. This stochastic matching setup is motivated by applications in kidney exchanges and online dating. In this paper we address the query-commit problem from both theoretical and experimental perspectives. First, we show that a simple class of edges can be queried without compromising the optimality of the strategy. This property is then used to obtain in polynomial time an optimal querying strategy when the input graph is sparse. Next we turn our attentions to the kidney exchange application, focusing on instances modeled over real data from existing exchange programs. We prove that, as the number of nodes grows, almost every instance admits a strategy which matches almost all nodes. This resu...

  13. Multi-Dimensional Path Queries

    DEFF Research Database (Denmark)

    Bækgaard, Lars

    1998-01-01

    We present the path-relationship model that supports multi-dimensional data modeling and querying. A path-relationship database is composed of sets of paths and sets of relationships. A path is a sequence of related elements (atoms, paths, and sets of paths). A relationship is a binary path...... to create nested path structures. We present an SQL-like query language that is based on path expressions and we show how to use it to express multi-dimensional path queries that are suited for advanced data analysis in decision support environments like data warehousing environments...

  14. The role of economics in the QUERI program: QUERI Series

    Directory of Open Access Journals (Sweden)

    Smith Mark W

    2008-04-01

    Full Text Available Abstract Background The United States (U.S. Department of Veterans Affairs (VA Quality Enhancement Research Initiative (QUERI has implemented economic analyses in single-site and multi-site clinical trials. To date, no one has reviewed whether the QUERI Centers are taking an optimal approach to doing so. Consistent with the continuous learning culture of the QUERI Program, this paper provides such a reflection. Methods We present a case study of QUERI as an example of how economic considerations can and should be integrated into implementation research within both single and multi-site studies. We review theoretical and applied cost research in implementation studies outside and within VA. We also present a critique of the use of economic research within the QUERI program. Results Economic evaluation is a key element of implementation research. QUERI has contributed many developments in the field of implementation but has only recently begun multi-site implementation trials across multiple regions within the national VA healthcare system. These trials are unusual in their emphasis on developing detailed costs of implementation, as well as in the use of business case analyses (budget impact analyses. Conclusion Economics appears to play an important role in QUERI implementation studies, only after implementation has reached the stage of multi-site trials. Economic analysis could better inform the choice of which clinical best practices to implement and the choice of implementation interventions to employ. QUERI economics also would benefit from research on costing methods and development of widely accepted international standards for implementation economics.

  15. jQuery For Dummies

    CERN Document Server

    Beighley, Lynn

    2010-01-01

    Learn how jQuery can make your Web page or blog stand out from the crowd!. jQuery is free, open source software that allows you to extend and customize Joomla!, Drupal, AJAX, and WordPress via plug-ins. Assuming no previous programming experience, Lynn Beighley takes you through the basics of jQuery from the very start. You'll discover how the jQuery library separates itself from other JavaScript libraries through its ease of use, compactness, and friendliness if you're a beginner programmer. Written in the easy-to-understand style of the For Dummies brand, this book demonstrates how you can a

  16. XML Multidimensional Modelling and Querying

    CERN Document Server

    Boucher, Serge; Zimányi, Esteban

    2009-01-01

    As XML becomes ubiquitous and XML storage and processing becomes more efficient, the range of use cases for these technologies widens daily. One promising area is the integration of XML and data warehouses, where an XML-native database stores multidimensional data and processes OLAP queries written in the XQuery interrogation language. This paper explores issues arising in the implementation of such a data warehouse. We first compare approaches for multidimensional data modelling in XML, then describe how typical OLAP queries on these models can be expressed in XQuery. We then show how, regardless of the model, the grouping features of XQuery 1.1 improve performance and readability of these queries. Finally, we evaluate the performance of query evaluation in each modelling choice using the eXist database, which we extended with a grouping clause implementation.

  17. Schedule Sales Query Raw Data

    Data.gov (United States)

    General Services Administration — Schedule Sales Query presents sales volume figures as reported to GSA by contractors. The reports are generated as quarterly reports for the current year and the...

  18. Characterization of Mycobacterium smegmatis sigF mutant and its regulon: overexpression of SigF antagonist (MSMEG_1803) in M. smegmatis mimics sigF mutant phenotype, loss of pigmentation, and sensitivity to oxidative stress.

    Science.gov (United States)

    Singh, Anirudh K; Dutta, Debashis; Singh, Vandana; Srivastava, Vishal; Biswas, Rajesh K; Singh, Bhupendra N

    2015-12-01

    In Mycobacterium smegmatis, sigF is widely expressed during different growth stages and plays role in adaptation to stationary phase and oxidative stress. Using a sigF deletion mutant of M. smegmatis mc(2) 155, we demonstrate that SigF is not essential for growth of bacterium. Deletion of sigF results in loss of carotenoid pigmentation which rendered increased susceptibility to H2 O2 induced oxidative stress in M. smegmatis. SigF modulates the cell surface architecture and lipid biosynthesis extending the repertoire of SigF function in this species. M. smegmatis SigF regulon included variety of genes expressed during exponential and stationary phases of growth and those responsible for oxidative stress, lipid biosynthesis, energy, and central intermediary metabolism. Furthermore, we report the identification of a SigF antagonist, an anti-sigma factor (RsbW), which upon overexpression in M. smegmatis wild type strain produced a phenotype similar to M. smegmatis mc(2) 155 ΔsigF strain. The SigF-anti-SigF interaction is duly validated using bacterial two-hybrid and pull down assays. In addition, anti-sigma factor antagonists, RsfA and RsfB were identified and their interactions with anti-sigma factor were experimentally validated. Identification of these proteins will help decode regulatory circuit of this alternate sigma factor.

  19. Ontological Queries: Rewriting and Optimization (Extended Version)

    CERN Document Server

    Gottlob, Georg; Pieris, Andreas

    2011-01-01

    Ontological queries are evaluated against an ontology rather than directly on a database. The evaluation and optimization of such queries is an intriguing new problem for database research. In this paper we discuss two important aspects of this problem: query rewriting and query optimization. Query rewriting consists of the compilation of an ontological query into an equivalent query against the underlying relational database. The focus here is on soundness and completeness. We review previous results and present a new rewriting algorithm for rather general types of ontological constraints. In particular, we show how a conjunctive query against an ontology can be compiled into a union of conjunctive queries against the underlying database. Ontological query optimization, in this context, attempts to improve this process so to produce possibly small and cost-effective UCQ rewritings for an input query. We review existing optimization methods, and propose an effective new method that works for linear Datalog+/-...

  20. Research on the architecture and key technologies of SIG

    Science.gov (United States)

    Fu, Zhongliang; Meng, Qingxiang; Huang, Yan; Liu, Shufan

    2007-06-01

    Along with the development of computer network, Grid has become one of the hottest issues of researches on sharing and cooperation of Internet resources throughout the world. This paper illustrates a new architecture of SIG-a five-hierarchy architecture (including Data Collecting Layer, Grid Layer, Service Layer, Application Layer and Client Layer) of SIG from the traditional three hierarchies (only including resource layer, service layer and client layer). In the paper, the author proposes a new mixed network mode of Spatial Information Grid which integrates CAG (Certificate Authority of Grid) and P2P (Peer to Peer) in the Grid Layer, besides, the author discusses some key technologies of SIG and analysis the functions of these key technologies.

  1. Improved query difficulty prediction for the web

    NARCIS (Netherlands)

    Hauff, C.; Murdock, V.; Baeza-Yates, R.

    2008-01-01

    Query performance prediction aims to predict whether a query will have a high average precision given retrieval from a particular collection, or low average precision. An accurate estimator of the quality of search engine results can allow the search engine to decide to which queries to apply query

  2. Effective Density Queries of Continuously Moving Objects

    DEFF Research Database (Denmark)

    Jensen, Christian Søndergaard; Lin, D.; Ooi, B.C.

    2006-01-01

    In this paper, we study a newly emerging type of queries on moving objects - the density query. Basically, this query locates regions in the data space where the density of the objects is high. This type of queries is especially useful in Location Based Services (LBS). For example, in a traffic...

  3. Privacy Preserving Moving KNN Queries

    CERN Document Server

    Hashem, Tanzima; Zhang, Rui

    2011-01-01

    We present a novel approach that protects trajectory privacy of users who access location-based services through a moving k nearest neighbor (MkNN) query. An MkNN query continuously returns the k nearest data objects for a moving user (query point). Simply updating a user's imprecise location such as a region instead of the exact position to a location-based service provider (LSP) cannot ensure privacy of the user for an MkNN query: continuous disclosure of regions enables the LSP to follow a user's trajectory. We identify the problem of trajectory privacy that arises from the overlap of consecutive regions while requesting an MkNN query and provide the first solution to this problem. Our approach allows a user to specify the confidence level that represents a bound of how much more the user may need to travel than the actual kth nearest data object. By hiding a user's required confidence level and the required number of nearest data objects from an LSP, we develop a technique to prevent the LSP from tracking...

  4. Dynamic Planar Range Maxima Queries

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Tsakalidis, Konstantinos

    2011-01-01

    We consider the dynamic two-dimensional maxima query problem. Let P be a set of n points in the plane. A point is maximal if it is not dominated by any other point in P. We describe two data structures that support the reporting of the t maximal points that dominate a given query point, and allow...... update time, using O(nlogn) space, where t is the size of the output. This improves the worst case deletion time of the dynamic rectangular visibility query problem from O(log^3 n) to O(log^2 n). We adapt the data structure to the RAM model with word size w, where the coordinates of the points...... in the worst case. The data structure also supports the more general query of reporting the maximal points among the points that lie in a given 3-sided orthogonal range unbounded from above in the same complexity. We can support 4-sided queries in O(log^2 n + t) worst case time, and O(log^2 n) worst case...

  5. Bottom-up mining of XML query patterns to improve XML querying

    Institute of Scientific and Technical Information of China (English)

    Yi-jun BEI; Gang CHEN; Jin-xiang DONG; Ke CHEN

    2008-01-01

    Querying XML data is a computationally expensive process due to the complex nature of both the XML data and the XML queries. In this paper we propose an approach to expedite XML query processing by caching the results of frequent queries. We discover frequent query patterns from user-issued queries using an efficient bottom-up mining approach called VBUXMiner. VBUXMiner consists of two main steps. First, all queries are merged into a summary structure named "compressed global tree guide" (CGTG). Second, a bottom-up traversal scheme based on the CGTG is employed to generate frequent query patterns. We use the frequent query patterns in a cache mechanism to improve the XML query performance. Experimental results show that our proposed mining approach outperforms the previous mining algorithms for XML queries, such as XQPMinerTID and FastXMiner, and that by caching the results of frequent query patterns, XML query performance can be dramatically improved.

  6. CHI 2013 Human Work Interaction Design (HWID) SIG

    DEFF Research Database (Denmark)

    Clemmensen, Torkil; Campos, Pedro F.; Katre, Dinesh S.

    2013-01-01

    In this SIG we aim to introduce the IFIP 13.6 Human Work Interaction Design (HWID) approach to the CHI audience. The HWID working group aims at establishing relationships between extensive empirical work-domain studies and HCI design. We invite participants from industry and academia with an inte...

  7. Child-Computer Interaction SIG: New Challenges and Opportunities

    DEFF Research Database (Denmark)

    Hourcade, Juan Pablo; Iversen, Ole Sejer; Revelle, Glenda

    2016-01-01

    This SIG will provide child-computer interaction researchers and practitioners an opportunity to discuss four topics that represent new challenges and opportunities for the community. The four areas are: interactive technologies for children under the age of five, technology for inclusion, privacy...

  8. Det betaler sig at investere i egen sundhed

    DEFF Research Database (Denmark)

    Larsen, Kristian

    2009-01-01

    , drikkende eller den lade krop (KRAM). Den konjunktur har bidt sig fast, sikkert ind til dominerende grupper i samfundet ændrer kropskonstitutioner. Pointen er, at sundhedskapitalen eksisterer - fordi der er afsætning for den. Først på arbejdsmarkedet, siden på ægteskabsmarkedet, boligmarkedet og...

  9. Condorcet query engine: A query engine for coordinated index terms

    NARCIS (Netherlands)

    van der Vet, P.E.; Mars, Nicolaas

    1999-01-01

    On-line information retrieval systems often offer their users some means to tune the query to match the level of granularity of the information request. Users can be offered a far greater range of possibilities, however, if documents are indexed with coordinated index concepts. Coordinated index

  10. Head First jQuery

    CERN Document Server

    Benedetti, Ryan

    2011-01-01

    Want to add more interactivity and polish to your websites? Discover how jQuery can help you build complex scripting functionality in just a few lines of code. With Head First jQuery, you'll quickly get up to speed on this amazing JavaScript library by learning how to navigate HTML documents while handling events, effects, callbacks, and animations. By the time you've completed the book, you'll be incorporating Ajax apps, working seamlessly with HTML and CSS, and handling data with PHP, MySQL and JSON. If you want to learn-and understand-how to create interactive web pages, unobtrusive scrip

  11. Querying and Manipulating Temporal Databases

    Directory of Open Access Journals (Sweden)

    Mohamed Mkaouar

    2011-03-01

    Full Text Available Many works have focused, for over twenty five years, on the integration of the time dimension indatabases (DB. However, the standard SQL3 does not yet allow easy definition, manipulation andquerying of temporal DBs. In this paper, we study how we can simplify querying and manipulatingtemporal facts in SQL3, using a model that integrates time in a native manner. To do this, we proposenew keywords and syntax to define different temporal versions for many relational operators andfunctions used in SQL. It then becomes possible to perform various queries and updates appropriate totemporal facts. We illustrate the use of these proposals on many examples from a real application.

  12. Preference Elicitation in Prioritized Skyline Queries

    CERN Document Server

    Mindolin, Denis

    2010-01-01

    Preference queries incorporate the notion of binary preference relation into relational database querying. Instead of returning all the answers, such queries return only the best answers, according to a given preference relation. Preference queries are a fast growing area of database research. Skyline queries constitute one of the most thoroughly studied classes of preference queries. A well known limitation of skyline queries is that skyline preference relations assign the same importance to all attributes. In this work, we study p-skyline queries that generalize skyline queries by allowing varying attribute importance in preference relations. We perform an in-depth study of the properties of p-skyline preference relations. In particular,we study the problems of containment and minimal extension. We apply the obtained results to the central problem of the paper: eliciting relative importance of attributes. Relative importance is implicit in the constructed p-skyline preference relation. The elicitation is ba...

  13. Scalable Social Coordination using Enmeshed Queries

    CERN Document Server

    Chen, Jianjun; Varghese, George

    2012-01-01

    Social coordination allows users to move beyond awareness of their friends to efficiently coordinating physical activities with others. While specific forms of social coordination can be seen in tools such as Evite, Meetup and Groupon, we introduce a more general model using what we call {\\em enmeshed queries}. An enmeshed query allows users to declaratively specify an intent to coordinate by specifying social attributes such as the desired group size and who/what/when, and the database returns matching queries. Enmeshed queries are continuous, but new queries (and not data) answer older queries; the variable group size also makes enmeshed queries different from entangled queries, publish-subscribe systems, and dating services. We show that even offline group coordination using enmeshed queries is NP-hard. We then introduce efficient heuristics that use selective indices such as location and time to reduce the space of possible matches; we also add refinements such as delayed evaluation and using the relative...

  14. Query Expansion Using Heterogeneous Thesauri.

    Science.gov (United States)

    Mandala, Rila; Tokunaga, Takenobu; Tanaka, Hozumi

    2000-01-01

    Proposes a method to improve the performance of information retrieval systems by expanding queries using heterogeneous thesauri. Experiments show that using heterogeneous thesauri with an appropriate weighting method results in better retrieval performance than using only one type of thesaurus. (Author/LRW)

  15. Accomplishing Deterministic XML Query Optimization

    Institute of Scientific and Technical Information of China (English)

    Dun-Ren Che

    2005-01-01

    As the popularity of XML (eXtensible Markup Language) keeps growing rapidly, the management of XML compliant structured-document databases has become a very interesting and compelling research area. Query optimization for XML structured-documents stands out as one of the most challenging research issues in this area because of the much enlarged optimization (search) space, which is a consequence of the intrinsic complexity of the underlying data model of XML data. We therefore propose to apply deterministic transformations on query expressions to most aggressively prune the search space and fast achieve a sufficiently improved alternative (if not the optimal) for each incoming query expression. This idea is not just exciting but practically attainable. This paper first provides an overview of our optimization strategy, and then focuses on the key implementation issues of our rule-based transformation system for XML query optimization in a database environment. The performance results we obtained from experimentation show that our approach is a valid and effective one.

  16. Query Expansion Using Heterogeneous Thesauri.

    Science.gov (United States)

    Mandala, Rila; Tokunaga, Takenobu; Tanaka, Hozumi

    2000-01-01

    Proposes a method to improve the performance of information retrieval systems by expanding queries using heterogeneous thesauri. Experiments show that using heterogeneous thesauri with an appropriate weighting method results in better retrieval performance than using only one type of thesaurus. (Author/LRW)

  17. Querying Large Biological Network Datasets

    Science.gov (United States)

    Gulsoy, Gunhan

    2013-01-01

    New experimental methods has resulted in increasing amount of genetic interaction data to be generated every day. Biological networks are used to store genetic interaction data gathered. Increasing amount of data available requires fast large scale analysis methods. Therefore, we address the problem of querying large biological network datasets.…

  18. Explanations for Skyline Query Results

    DEFF Research Database (Denmark)

    Chester, Sean; Assent, Ira

    2015-01-01

    Skyline queries are a well-studied problem for multidimensional data, wherein points are returned to the user iff no other point is preferable across all attributes. This leaves only the points most likely to appeal to an arbitrary user. However, some dominated points may still be interesting, an...

  19. Logical Querying of Relational Databases

    Directory of Open Access Journals (Sweden)

    Luminita Pistol

    2016-12-01

    Full Text Available This paper aims to demonstrate the usefulness of formal logic and lambda calculus in database programming. After a short introduction in propositional and first order logic, we implement dynamically a small database and translate some SQL queries in filtered java 8 streams, enhanced with Tuples facilities from jOOλ library.

  20. Enhancing Recall in Semantic Querying

    DEFF Research Database (Denmark)

    Rouces, Jacobo

    2013-01-01

    RDF and SPARQL are currently state-of-the-art W3C standards to respectively represent and query structured information, especially when information from different sources must be federated. However, there are various reasons for which the same knowledge can be modeled in RDF graphs that are both ...

  1. Faecal SIgA secretion in infants fed on pre- or probiotic infant formula

    NARCIS (Netherlands)

    Bakker-Zierikzee, A.; Tol, van E.A.F.; Kroes, H.; Alles, M.S.; Kok, F.J.; Bindels, J.G.

    2006-01-01

    Secretory immunoglobulin A (SIgA) plays an important role in the defence of the gastrointestinal tract. The level of faecal SIgA antibody is associated with increased neutralization and clearance of viruses. Formula-fed infants who lack the transfer of protective maternal SIgA from breast milk may b

  2. Faecal SIgA secretion in infants fed on pre- or probiotic infant formula

    NARCIS (Netherlands)

    Bakker-Zierikzee, A.; Tol, van E.A.F.; Kroes, H.; Alles, M.S.; Kok, F.J.; Bindels, J.G.

    2006-01-01

    Secretory immunoglobulin A (SIgA) plays an important role in the defence of the gastrointestinal tract. The level of faecal SIgA antibody is associated with increased neutralization and clearance of viruses. Formula-fed infants who lack the transfer of protective maternal SIgA from breast milk may

  3. Faecal SIgA secretion in infants fed on pre- or probiotic infant formula

    NARCIS (Netherlands)

    Bakker-Zierikzee, A.; Tol, van E.A.F.; Kroes, H.; Alles, M.S.; Kok, F.J.; Bindels, J.G.

    2006-01-01

    Secretory immunoglobulin A (SIgA) plays an important role in the defence of the gastrointestinal tract. The level of faecal SIgA antibody is associated with increased neutralization and clearance of viruses. Formula-fed infants who lack the transfer of protective maternal SIgA from breast milk may b

  4. Large Catalogue Query Performance in Relational Databases

    Science.gov (United States)

    Power, Robert A.

    2007-05-01

    The performance of the mysql and oracle database systems have been compared for a selection of astronomy queries using large catalogues of up to a billion objects. The queries tested are those expected from the astronomy community: general database queries, cone searches, neighbour finding and cross matching. The catalogue preparation, sql query formulation and database performance is presented. Most of the general queries perform adequately when appropriate indexes are present in the database. Each system performs well for cone search queries when the Hierarchical Triangular Mesh spatial index is used. Neighbour finding and cross matching are not well supported in a database environment when compared to software specifically developed to solve these problems.

  5. Measuring persistence of implementation: QUERI Series

    Directory of Open Access Journals (Sweden)

    Asch Steven M

    2008-04-01

    Full Text Available Abstract As more quality improvement programs are implemented to achieve gains in performance, the need to evaluate their lasting effects has become increasingly evident. However, such long-term follow-up evaluations are scarce in healthcare implementation science, being largely relegated to the "need for further research" section of most project write-ups. This article explores the variety of conceptualizations of implementation sustainability, as well as behavioral and organizational factors that influence the maintenance of gains. It highlights the finer points of design considerations and draws on our own experiences with measuring sustainability, framed within the rich theoretical and empirical contributions of others. In addition, recommendations are made for designing sustainability analyses. This article is one in a Series of articles documenting implementation science frameworks and approaches developed by the U.S. Department of Veterans Affairs Quality Enhancement Research Initiative (QUERI.

  6. Genome-wide definition of the SigF regulon in Mycobacterium tuberculosis.

    Science.gov (United States)

    Hartkoorn, Ruben C; Sala, Claudia; Uplekar, Swapna; Busso, Philippe; Rougemont, Jacques; Cole, Stewart T

    2012-04-01

    In Mycobacterium tuberculosis the alternative sigma factor SigF controls the expression of a particular subset of genes by altering RNA polymerase specificity. Here, we utilize two genome-wide approaches to identify SigF-binding sites: chromatin immunoprecipitation (ChIP-on-chip) and microarray analysis of SigF-mediated transcripts. Since SigF is not an abundant protein in the logarithmic phase of growth, a pristinamyin IA-inducible system was used to control its expression. We identified 67 high-affinity SigF-binding sites and 16 loci where a SigF promoter directs the expression of a transcript. These loci include sigF itself, genes involved in lipid and intermediary metabolism and virulence, and at least one transcriptional regulator (Rv2884), possibly acting downstream of SigF. In addition, SigF was also found to direct the transcription of the gene for small RNA F6. Many loci were also found where SigF may be involved in antisense transcription, and in two cases (Rv1358 and Rv1870c) the SigF-dependent promoter was located within the predicted coding sequence. Quantitative PCR confirmed the microarray findings and 5'-rapid amplification of cDNA ends was used to map the SigF-specific transcriptional start points. A canonical SigF consensus promoter sequence GGTTT-N((15-17))-GGGTA was found prior to 11 genes. Together, these data help to define the SigF regulon and show that SigF not only governs expression of proteins such as the virulence factor, HbhA, but also impacts novel functions, such as noncoding RNAs and antisense transcripts.

  7. Optimizing Temporal Queries: Efficient Handling of Duplicates

    DEFF Research Database (Denmark)

    Toman, David; Bowman, Ivan Thomas

    2001-01-01

    Recent research in the area of temporal databases has proposed a number of query languages that vary in their expressive power and the semantics they provide to users. These query languages represent a spectrum of solutions to the tension between clean semantics and efficient evaluation. Often......, these query languages are implemented by translating temporal queries into standard relational queries. However, the compiled queries are often quite cumbersome and expensive to execute even using state-of-the- art relational products. This paper presents an optimization technique that produces more efficient...... translated SQL queries by taking into account the properties of the encoding used for temporal attributes. For concreteness, this translation technique is presented in the context of SQL/TP; however, these techniques are also applicable to other temporal query languages....

  8. Optimizing Temporal Queries: Efficient Handling of Duplicates

    DEFF Research Database (Denmark)

    Toman, David; Bowman, Ivan Thomas

    2001-01-01

    translated SQL queries by taking into account the properties of the encoding used for temporal attributes. For concreteness, this translation technique is presented in the context of SQL/TP; however, these techniques are also applicable to other temporal query languages......., these query languages are implemented by translating temporal queries into standard relational queries. However, the compiled queries are often quite cumbersome and expensive to execute even using state-of-the- art relational products. This paper presents an optimization technique that produces more efficient......Recent research in the area of temporal databases has proposed a number of query languages that vary in their expressive power and the semantics they provide to users. These query languages represent a spectrum of solutions to the tension between clean semantics and efficient evaluation. Often...

  9. Format SPARQL Query Results into HTML Report

    Directory of Open Access Journals (Sweden)

    Dr Sunitha Abburu

    2013-07-01

    Full Text Available SPARQL is one of the powerful query language for querying semantic data. It is recognized by the W3C as a query language for RDF. As an efficient query language for RDF, it has defined several query result formats such as CSV, TSV and XML etc. These formats are not attractive, understandable and readable. The results need to be converted in an appropriate format so that user can easily understand. The above formats require additional transformations or tool support to represent the query result in user readable format. The main aim of this paper is to propose a method to build HTML report dynamically for SPARQL query results. This enables SPARQL query result display, in HTML report format easily, in an attractive understandable format without the support of any additional or external tools or transformation.

  10. Kanonen rør på sig

    DEFF Research Database (Denmark)

    Kaspersen, Peter; Hägerfelth, Gun; Skarstein, Dag

    2009-01-01

    Ved gymnasiereformen i Danmark 2005 indførtes krav om kanonisk læsning i dansk. I Norge fjernedes de hidtidige krav om kanonisk læsning ved reformen i 2006. I Sverige har der ikke været krav om kanonisk læsning siden 1960'erne. Dette er et blandt flere eksempler på at de nordiske landes skolesyst...... skolesystemer har fjernet sig fra hinanden siden 1960'erne. Udgivelsesdato: December...

  11. Betænkelig bankunion betaler sig ikke

    DEFF Research Database (Denmark)

    Thomsen, Steen

    2014-01-01

    Allerede fra november 2014 planlægges Den Europæiske Bankunion at træde i kraft. ... Altså endnu en bailout, som den vi ser, når ECB køber de sydeuropæiske landes usælgelige statsobligationer. Er det i Danmarks interesseSpørgsmålet er, om det er i Danmarks interesse at melde sig ind i en sådan...

  12. Child-Computer Interaction SIG: Ethics and Values

    DEFF Research Database (Denmark)

    Hourcade, Juan Pablo; Zeising, Anja; Iversen, Ole Sejer

    2017-01-01

    This SIG will provide child computer interaction researchers and practitioners an opportunity to discuss topics related to ethical challenges in the design, and use of interactive technologies for children. Topics include the role of big data, the impact of technology in children’s social...... and physical ecosystem, and the consideration of ethics in children’s participation in the design of technologies, and in the conceptualization of technologies for children....

  13. Kan det amerikanske demokrati rejse sig fra valgkampens aske?

    DEFF Research Database (Denmark)

    Rosenbaum, Philip

    2016-01-01

    For både Trump og Clinton handler det i høj grad om at tilkendegive deres utilfredshed og intolerance overfor modparten, men i ligeså høj grad også om at definere og manifestere deres egne politiske positioner i partiernes fremadrettede planer. Sidstnævnte har vist sig at være en utrolig vanskelig...

  14. Identifying Aspects for Web-Search Queries

    OpenAIRE

    Wu, Fei; Madhavan, Jayant; Halevy, Alon

    2014-01-01

    Many web-search queries serve as the beginning of an exploration of an unknown space of information, rather than looking for a specific web page. To answer such queries effec- tively, the search engine should attempt to organize the space of relevant information in a way that facilitates exploration. We describe the Aspector system that computes aspects for a given query. Each aspect is a set of search queries that together represent a distinct information need relevant to the original search...

  15. Broadcast-Based Spatial Queries

    Institute of Scientific and Technical Information of China (English)

    Kwang-Jin Park; Moon-Bae Song; Chong-Sun Hwang

    2005-01-01

    Indexing techniques have been developed for wireless data broadcast environments, in order to conserve the scarce power resources of the mobile clients. However, the use of interleaved index segments in a broadcast cycle increases the average access latency for the clients. In this paper, the broadcast-based spatial query processing methods (BBS)are presented for the location-based services. In the BBS, broadcasted data objects are sorted sequentially based on their locations, and the server broadcasts the location dependent data along with an index segment. Then, a sequential prefetching and caching scheme is designed to reduce the query response time. The performance of this scheme is investigated in relation to various environmental variables, such as the distributions of the data objects, the average speed of the clients and the size of the service area.

  16. Querying Sentiment Development over Time

    DEFF Research Database (Denmark)

    Andreasen, Troels; Christiansen, Henning; Have, Christian Theil

    2013-01-01

    A new language is introduced for describing hypotheses about fluctuations of measurable properties in streams of timestamped data, and as prime example, we consider trends of emotions in the constantly flowing stream of Twitter messages. The language, called EmoEpisodes, has a precise semantics...... that measures how well a hypothesis characterizes a given time interval; the semantics is parameterized so it can be adjusted to different views of the data. EmoEpisodes is extended to a query language with variables standing for unknown topics and emotions, and the query-answering mechanism will return...... instantiations for topics and emotions as well as time intervals that provide the largest deflections in this measurement. Experiments are performed on a selection of Twitter data to demonstrates the usefulness of the approach....

  17. User Oriented Trajectory Search for Trip Recommendation

    KAUST Repository

    Ding, Ruogu

    2012-09-08

    Trajectory sharing and searching have received significant attention in recent years. In this thesis, we propose and investigate the methods to find and recommend the best trajectory to the traveler, and mainly focus on a novel technique named User Oriented Trajectory Search (UOTS) query processing. In contrast to conventional trajectory search by locations (spatial domain only), we consider both spatial and textual domains in the new UOTS query. Given a trajectory data set, the query input contains a set of intended places given by the traveler and a set of textual attributes describing the traveler’s preference. If a trajectory is connecting/close to the specified query locations, and the textual attributes of the trajectory are similar to the traveler’s preference, it will be recommended to the traveler. This type of queries can enable many popular applications such as trip planning and recommendation. There are two challenges in UOTS query processing, (i) how to constrain the searching range in two domains and (ii) how to schedule multiple query sources effectively. To overcome the challenges and answer the UOTS query efficiently, a novel collaborative searching approach is developed. Conceptually, the UOTS query processing is conducted in the spatial and textual domains alternately. A pair of upper and lower bounds are devised to constrain the searching range in two domains. In the meantime, a heuristic searching strategy based on priority ranking is adopted for scheduling the multiple query sources, which can further reduce the searching range and enhance the query efficiency notably. Furthermore, the devised collaborative searching approach can be extended to situations where the query locations are or- dered. Extensive experiments are conducted on both real and synthetic trajectory data in road networks. Our approach is verified to be effective in reducing both CPU time and disk I/O time.

  18. User Oriented Trajectory Search for Trip Recommendation

    KAUST Repository

    Ding, Ruogu

    2012-07-08

    Trajectory sharing and searching have received significant attention in recent years. In this thesis, we propose and investigate the methods to find and recommend the best trajectory to the traveler, and mainly focus on a novel technique named User Oriented Trajectory Search (UOTS) query processing. In contrast to conventional trajectory search by locations (spatial domain only), we consider both spatial and textual domains in the new UOTS query. Given a trajectory data set, the query input contains a set of intended places given by the traveler and a set of textual attributes describing the traveler’s preference. If a trajectory is connecting/close to the specified query locations, and the textual attributes of the trajectory are similar to the traveler’s preference, it will be recommended to the traveler. This type of queries can enable many popular applications such as trip planning and recommendation. There are two challenges in UOTS query processing, (i) how to constrain the searching range in two domains and (ii) how to schedule multiple query sources effectively. To overcome the challenges and answer the UOTS query efficiently, a novel collaborative searching approach is developed. Conceptually, the UOTS query processing is conducted in the spatial and textual domains alternately. A pair of upper and lower bounds are devised to constrain the searching range in two domains. In the meantime, a heuristic searching strategy based on priority ranking is adopted for scheduling the multiple query sources, which can further reduce the searching range and enhance the query efficiency notably. Furthermore, the devised collaborative searching approach can be extended to situations where the query locations are ordered. Extensive experiments are conducted on both real and synthetic trajectory data in road networks. Our approach is verified to be effective in reducing both CPU time and disk I/O time.

  19. Lightweight query authentication on streams

    OpenAIRE

    2014-01-01

    We consider a stream outsourcing setting, where a data owner delegates the management of a set of disjoint data streams to an untrusted server. The owner authenticates his streams via signatures. The server processes continuous queries on the union of the streams for clients trusted by the owner. Along with the results, the server sends proofs of result correctness derived from the owner's signatures, which are easily verifiable by the clients. We design novel constructions for a collection o...

  20. Building interactive queries with LINQPad

    CERN Document Server

    Finot, Sébastien

    2013-01-01

    A step-by-step practical guide that will introduce you to LINQPad's key features, thereby helping you to query databases interactively.This book is aimed at C#/.Net developers who wish to learn LINQ programming and leverage the easy way of using LINQPad. No prior knowledge of LINQ or LINQPad is expected. A basic knowledge of SQL and XML is required for some chapters.

  1. Flexible Query Answering Systems 2006

    DEFF Research Database (Denmark)

    submissions, relating to the topic of users posing queries and systems producing answers. The papers cover the fields: Database Management, Information Retrieval, Domain Modeling, Knowledge Representation and Ontologies, Knowledge Discovery and Data Mining, Artificial Intelligence, Classical and Non......-classical Logics, Computational Linguistics and Natural Language Processing, Multimedia Information Systems, and Human--Computer Interaction, including reports of interesting applications. We wish to thank the contributors for their excellent papers and the referees, publisher, and sponsors for their effort...

  2. SCRY: Enabling quantitative reasoning in SPARQL queries

    NARCIS (Netherlands)

    Meroño-Peñuela, A.; Stringer, Bas; Loizou, Antonis; Abeln, Sanne; Heringa, Jaap

    2015-01-01

    The inability to include quantitative reasoning in SPARQL queries slows down the application of Semantic Web technology in the life sciences. SCRY, our SPARQL compatible service layer, improves this by executing services at query time and making their outputs query-accessible, generating RDF data on

  3. Predecessor queries in dynamic integer sets

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting

    1997-01-01

    We consider the problem of maintaining a set of n integers in the range 0.2w–1 under the operations of insertion, deletion, predecessor queries, minimum queries and maximum queries on a unit cost RAM with word size w bits. Let f (n) be an arbitrary nondecreasing smooth function satisfying n...

  4. Heuristics-based query optimisation for SPARQL

    NARCIS (Netherlands)

    P. Tsialiamanis (Petros); E. Sidirourgos (Eleftherios); I. Fundulaki; V. Christophides; P.A. Boncz (Peter)

    2012-01-01

    textabstractQuery optimization in RDF Stores is a challenging problem as SPARQL queries typically contain many more joins than equivalent relational plans, and hence lead to a large join order search space. In such cases, cost-based query optimization often is not possible. One practical reason for

  5. Query Adaptive Image Retrieval System

    Directory of Open Access Journals (Sweden)

    Amruta Dubewar

    2014-03-01

    Full Text Available Images play a crucial role in various fields such as art gallery, medical, journalism and entertainment. Increasing use of image acquisition and data storage technologies have enabled the creation of large database. So, it is necessary to develop appropriate information management system to efficiently manage these collections and needed a system to retrieve required images from these collections. This paper proposed query adaptive image retrieval system (QAIRS to retrieve images similar to the query image specified by user from database. The goal of this system is to support image retrieval based on content properties such as colour and texture, usually encoded into feature vectors. In this system, colour feature extracted by various techniques such as colour moment, colour histogram and autocorrelogram and texture feature extracted by using gabor wavelet. Hashing technique is used to embed high dimensional image features into hamming space, where search can be performed by hamming distance of compact hash codes. Depending upon minimum hamming distance it returns the similar image to query image.

  6. Inactivation of the extracytoplasmic function sigma factor Sig6 stimulates avermectin production in Streptomyces avermitilis.

    Science.gov (United States)

    Jiang, Libin; Liu, Yanping; Wang, Pan; Wen, Ying; Song, Yuan; Chen, Zhi; Li, Jilun

    2011-10-01

    The role of the extracytoplasmic function (ECF) σ factor Sig6 (SAV663) in avermectin production by Streptomyces avermitilis was investigated by gene-deletion, complementation and over-expression experiments. Inactivation of Sig6 had no major effect on growth, stress responses, or morphology. Avermectin yield was increased 2- to 2.7-fold (~680 μg/ml) relative to the wild-type strain by deletion of the sig6 gene, and was restored to the wild-type level by introduction of a single copy of sig6. Introduction of extra multi-copy or integrative sig6 vectors into the wild-type decreased avermectin yield by 56-63%. Taken together, these findings indicate that Sig6 plays a negative regulatory role in avermectin production in S. avermitilis. RT-PCR analysis demonstrated that this role of Sig6 is mediated by the pathway-specific activator gene aveR.

  7. Disruption of the sigS gene attenuates the local innate immune response to Staphylococcus aureus in a mouse mastitis model.

    Science.gov (United States)

    Peton, Vincent; Breyne, Koen; Rault, Lucie; Demeyere, Kristel; Berkova, Nadia; Meyer, Evelyne; Even, Sergine; Le Loir, Yves

    2016-04-15

    Staphylococcus aureus (S. aureus) is a major pathogen involved in ruminant mastitis and present worldwide. Clinical signs of S. aureus mastitis vary considerably and are largely dependent on strain-specific factors. A comparison of two S. aureus strains that reproducibly induced either severe (O11) or mild (O46) mastitis in ewes revealed that the transcriptional regulator sigS was mutated in O46 (Le Maréchal et al., 2011. PLoS One. 6 (11) e27354. doi:10.1371/journal.pone.0027354). In the present paper, we analysed the sigS sequence in 18 other S. aureus strains isolated from goat or ewe mastitis and found a 4-bp deletion similar to that of the O46 sigS gene in three strains associated with subclinical ewe mastitis. This sigS gene was disrupted in strain O11 (O11ΔsigS), so our aim was to investigate its involvement in the severity of infections in the context of mastitis. The wild type (wt) and mutant strains were then characterized in vitro to determine the involvement of sigS in the response S. aureus under various stress conditions, and assess its influence on the cytotoxicity of the pathogen, its invasive capacity and biofilm formation. The strains were compared in vivo in an experimental mouse mastitis model in which clinical signs and cytokine production were evaluated at 24h post-infection. While no significant differences in the effect on bacterial growth between O11 and O11ΔsigS were observed either in vitro or in vivo, a significantly weaker in vivo production of interleukin (IL)-1α, IL-1β, and Tumor Necrosis Factor (TNF)-α was measured in the mammary glands infected with the mutant strain, suggesting that infection with O11ΔsigS induced an attenuated local innate immune response. These results suggest an impact of sigS disruption on S. aureus pathogenesis in a ruminant mastitis context. This disruption is probably involved in, and may partly explain, the milder symptoms previously observed in S. aureus O46-induced mastitis in ewes. Copyright

  8. Boolean queries for news monitoring: Suggesting new query terms to expert users

    NARCIS (Netherlands)

    Verberne, S.; Wabeke, T.; Kaptein, R.

    2016-01-01

    In this paper, we evaluate query suggestion for Boolean queries in a news monitoring system. Users of this system receive news articles that match their running query on a daily basis. Because the news for a topic continuously changes, the queries need regular updating. We first investigated the

  9. Truth Space Method for Caching Database Queries

    Directory of Open Access Journals (Sweden)

    S. V. Mosin

    2015-01-01

    Full Text Available We propose a new method of client-side data caching for relational databases with a central server and distant clients. Data are loaded into the client cache based on queries executed on the server. Every query has the corresponding DB table – the result of the query execution. These queries have a special form called "universal relational query" based on three fundamental Relational Algebra operations: selection, projection and natural join. We have to mention that such a form is the closest one to the natural language and the majority of database search queries can be expressed in this way. Besides, this form allows us to analyze query correctness by checking lossless join property. A subsequent query may be executed in a client’s local cache if we can determine that the query result is entirely contained in the cache. For this we compare truth spaces of the logical restrictions in a new user’s query and the results of the queries execution in the cache. Such a comparison can be performed analytically , without need in additional Database queries. This method may be used to define lacking data in the cache and execute the query on the server only for these data. To do this the analytical approach is also used, what distinguishes our paper from the existing technologies. We propose four theorems for testing the required conditions. The first and the third theorems conditions allow us to define the existence of required data in cache. The second and the fourth theorems state conditions to execute queries with cache only. The problem of cache data actualizations is not discussed in this paper. However, it can be solved by cataloging queries on the server and their serving by triggers in background mode. The article is published in the author’s wording.

  10. A solution of spatial query processing and query optimization for spatial databases

    Institute of Scientific and Technical Information of China (English)

    YUAN Jie; XIE Kun-qing; MA Xiu-jun; ZHANG Min; SUN Le-bin

    2004-01-01

    Recently, attention has been focused on spatial query language which is used to query spatial databases. A design of spatial query language has been presented in this paper by extending the standard relational database query language SQL. It recognizes the significantly different requirements of spatial data handling and overcomes the inherent problems of the application of conventional database query languages. This design is based on an extended spatial data model, including the spatial data types and the spatial operators on them. The processing and optimization of spatial queries have also been discussed in this design. In the end, an implementation of this design is given in a spatial query subsystem.

  11. Cache-Based Aggregate Query Shipping: An Efficient Scheme of Distributed OLAP Query Processing

    Institute of Scientific and Technical Information of China (English)

    Hua-Ming Liao; Guo-Shun Pei

    2008-01-01

    Our study introduces a novel distributed query plan refinement phase in an enhanced architecture of distributed query processing engine (DQPE). Query plan refinement generates potentially efficient distributed query plan by reusable aggregate query shipping (RAQS) approach. The approach improves response time at the cost of pre-processing time. If theoverheads could not be compensated by query results reusage, RAQS is no more favorable. Therefore a global cost estimation model is employed to get proper operators: RR_Agg, R_Agg, or R_Scan. For the purpose of reusing results of queries with aggregate function in distributed query processing, a multi-level hybrid view caching (HVC) scheme is introduced. The scheme retains the advantages of partial match and aggregate query results caching. By our solution, evaluations with distributed TPC-H queries show significant improvement on average response time.

  12. Web development with jQuery

    CERN Document Server

    York, Richard

    2015-01-01

    Newly revised and updated resource on jQuery's many features and advantages Web Development with jQuery offers a major update to the popular Beginning JavaScript and CSS Development with jQuery from 2009. More than half of the content is new or updated, and reflects recent innovations with regard to mobile applications, jQuery mobile, and the spectrum of associated plugins. Readers can expect thorough revisions with expanded coverage of events, CSS, AJAX, animation, and drag and drop. New chapters bring developers up to date on popular features like jQuery UI, navigation, tables, interacti

  13. Structured Query Language for Virtual Observatory

    CERN Document Server

    Shirasaki, Y; Mizumoto, Y; Tanaka, M; Honda, S; Oe, M; Yasuda, N; Masunaga, Y; Shirasaki, Yuji; Ohishi, Masatoshi; Mizumoto, Yoshihiko; Tanaka, Masahiro; Honda, Satoshi; Oe, Masafumi; Yasuda, Naoki; Masunaga, Yoshifumi

    2004-01-01

    Currently two query languages are defined as standards for the Virtual Observatory (VO). Astronomical Data Query Language (ADQL) is used for catalog data query and Simple Image Access Protocol (SIAP) is for image data query. As a result, when we query each data service, we need to know in advance which language is supported and then construct a query language accordingly. The construct of SIAP is simple, but they have a limited capability. For example, there is no way to specify multiple regions in one query, and it is difficult to specify complex query conditions. In this paper, we propose a unified query language for any kind of astronomical database on the basis of SQL99. SQL is a query language optimized for a table data, so to apply the SQL to the image and spectrum data set, the data structure need to be mapped to a table like structure. We present specification of this query language and an example of the architecture for the database system.

  14. User oriented trajectory search for trip recommendation

    KAUST Repository

    Shang, Shuo

    2012-01-01

    Trajectory sharing and searching have received significant attentions in recent years. In this paper, we propose and investigate a novel problem called User Oriented Trajectory Search (UOTS) for trip recommendation. In contrast to conventional trajectory search by locations (spatial domain only), we consider both spatial and textual domains in the new UOTS query. Given a trajectory data set, the query input contains a set of intended places given by the traveler and a set of textual attributes describing the traveler\\'s preference. If a trajectory is connecting/close to the specified query locations, and the textual attributes of the trajectory are similar to the traveler\\'e preference, it will be recommended to the traveler for reference. This type of queries can bring significant benefits to travelers in many popular applications such as trip planning and recommendation. There are two challenges in the UOTS problem, (i) how to constrain the searching range in two domains and (ii) how to schedule multiple query sources effectively. To overcome the challenges and answer the UOTS query efficiently, a novel collaborative searching approach is developed. Conceptually, the UOTS query processing is conducted in the spatial and textual domains alternately. A pair of upper and lower bounds are devised to constrain the searching range in two domains. In the meantime, a heuristic searching strategy based on priority ranking is adopted for scheduling the multiple query sources, which can further reduce the searching range and enhance the query efficiency notably. Furthermore, the devised collaborative searching approach can be extended to situations where the query locations are ordered. The performance of the proposed UOTS query is verified by extensive experiments based on real and synthetic trajectory data in road networks. © 2012 ACM.

  15. Fallstudie SIG - Supply Chain Prototyp mit Coca Cola Beverages

    OpenAIRE

    Senger, Enrico

    2003-01-01

    SIG, ein führender Lieferant von Verpackungsmaterialien für Getränke, hat mit Coca Cola CPFR, collaborative planning, forecasting and replenishment realisiert. Das Unternehmen kann elektronisch und ohne Zeitverzug (vorher mit bis zu 15 Tagen Verspätung) auf die Lagerbestände und Verkaufsvorhersagen bei Coca Cola zugreifen. Coca Cola erhält die exakten Liefermengen und zeiten. Coca Cola konnte dadurch den Lagerbestand von Verpackungsmaterial um 50% senken. Dies reduziert die Bearbeitungszeit d...

  16. Fallstudie SIG - Supply Chain Prototyp mit Coca Cola Beverages

    OpenAIRE

    Senger, Enrico

    2003-01-01

    SIG, ein führender Lieferant von Verpackungsmaterialien für Getränke, hat mit Coca Cola CPFR, collaborative planning, forecasting and replenishment realisiert. Das Unternehmen kann elektronisch und ohne Zeitverzug (vorher mit bis zu 15 Tagen Verspätung) auf die Lagerbestände und Verkaufsvorhersagen bei Coca Cola zugreifen. Coca Cola erhält die exakten Liefermengen und zeiten. Coca Cola konnte dadurch den Lagerbestand von Verpackungsmaterial um 50% senken. Dies reduziert die Bearbeitungszeit d...

  17. Research of SIP Compression Based on SigComp

    Directory of Open Access Journals (Sweden)

    Derong Du

    2013-05-01

    Full Text Available SIP (Session Initiation Protocol has been chosen as the core signaling protocol of the NGN (Next Generation Network, but the large SIP message which is text-based is an obstacle with the planned usage of SIP in wireless mobile networks. Based on the SigComp (Signaling Compression framework, some further improvements are made to the Deflate algorithm according to the characteristics of SIP in this study. Experiments show that the improved Deflate algorithm can compress the SIP message greatly and reduce the bandwidth requirements signally, so it is highly valued in IMS (IP Multimedia Subsystem, PTT (Push To Talk and other wireless real-time SIP applications.

  18. EquiX-A Search and Query Language for XML.

    Science.gov (United States)

    Cohen, Sara; Kanza, Yaron; Kogan, Yakov; Sagiv, Yehoshua; Nutt, Werner; Serebrenik, Alexander

    2002-01-01

    Describes EquiX, a search language for XML that combines querying with searching to query the data and the meta-data content of Web pages. Topics include search engines; a data model for XML documents; search query syntax; search query semantics; an algorithm for evaluating a query on a document; and indexing EquiX queries. (LRW)

  19. Monitoring nearest neighbor queries with cache strategies

    Institute of Scientific and Technical Information of China (English)

    PAN Peng; LU Yan-sheng

    2007-01-01

    The problem of continuously monitoring multiple K-nearest neighbor (K-NN) queries with dynamic object and query dataset is valuable for many location-based applications. A practical method is to partition the data space into grid cells, with both object and query table being indexed by this grid structure, while solving the problem by periodically joining cells of objects with queries having their influence regions intersecting the cells. In the worst case, all cells of objects will be accessed once. Object and query cache strategies are proposed to further reduce the I/O cost. With object cache strategy, queries remaining static in current processing cycle seldom need I/O cost, they can be returned quickly. The main I/O cost comes from moving queries, the query cache strategy is used to restrict their search-regions, which uses current results of queries in the main memory buffer. The queries can share not only the accessing of object pages, but also their influence regions. Theoretical analysis of the expected I/O cost is presented, with the I/O cost being about 40% that of the SEA-CNN method in the experiment results.

  20. Adding query privacy to robust DHTs

    DEFF Research Database (Denmark)

    Backes, Michael; Goldberg, Ian; Kate, Aniket

    2012-01-01

    Interest in anonymous communication over distributed hash tables (DHTs) has increased in recent years. However, almost all known solutions solely aim at achieving sender or requestor anonymity in DHT queries. In many application scenarios, it is crucial that the queried key remains secret from...... intermediate peers that (help to) route the queries towards their destinations. In this paper, we satisfy this requirement by presenting an approach for providing privacy for the keys in DHT queries. We use the concept of oblivious transfer (OT) in communication over DHTs to preserve query privacy without...... compromising spam resistance. Although our OT-based approach can work over any DHT, we concentrate on robust DHTs that can tolerate Byzantine faults and resist spam. We choose the best-known robust DHT construction, and employ an efficient OT protocol well-suited for achieving our goal of obtaining query...

  1. jQuery for ASPNET Developers

    CERN Document Server

    Brinkman, Joe

    2009-01-01

    This Wrox Blox teaches you how to use jQuery with your ASP.NET-based websites.  jQuery greatly simplifies JavaScript development and allows you to create highly interactive and responsive websites using the latest JavaScript and AJAX techniques. The author walks you through the jQuery API using a simple ASP.NET MVC application to highlight major topics, and shows how you can apply jQuery to your own applications. After learning the basics of using jQuery, you'll discover how easy it is to use within your own ASP.NET projects.  Whether you are using WebForms or the MVC framework, jQuery will gr

  2. GQL: Extending XQuery to Query GML Documents

    Institute of Scientific and Technical Information of China (English)

    GUAN Jihong; ZHU Fubao; ZHOU Jiaogen; NIU Liping

    2006-01-01

    GML is becoming the de facto standard for electronic data exchange among the applications of Web and distributed geographic information systems. However, the conventional query languages (e.g. SQL and its extended versions) are not suitable for direct querying and updating of GML documents. Even the effective approaches working well with XML could not guarantee good results when applied to GML documents. Although XQuery is a powerful standard query language for XML, it is not proposed for querying spatial features, which constitute the most important components in GML documents. We propose GQL, a query language specification to support spatial queries over GML documents by extending XQuery. The data model, algebra, and formal semantics as well as various spatial functions and operations of GQL are presented in detail.

  3. SIG-VISA: Signal-based Vertically Integrated Seismic Monitoring

    Science.gov (United States)

    Moore, D.; Mayeda, K. M.; Myers, S. C.; Russell, S.

    2013-12-01

    Traditional seismic monitoring systems rely on discrete detections produced by station processing software; however, while such detections may constitute a useful summary of station activity, they discard large amounts of information present in the original recorded signal. We present SIG-VISA (Signal-based Vertically Integrated Seismic Analysis), a system for seismic monitoring through Bayesian inference on seismic signals. By directly modeling the recorded signal, our approach incorporates additional information unavailable to detection-based methods, enabling higher sensitivity and more accurate localization using techniques such as waveform matching. SIG-VISA's Bayesian forward model of seismic signal envelopes includes physically-derived models of travel times and source characteristics as well as Gaussian process (kriging) statistical models of signal properties that combine interpolation of historical data with extrapolation of learned physical trends. Applying Bayesian inference, we evaluate the model on earthquakes as well as the 2009 DPRK test event, demonstrating a waveform matching effect as part of the probabilistic inference, along with results on event localization and sensitivity. In particular, we demonstrate increased sensitivity from signal-based modeling, in which the SIGVISA signal model finds statistical evidence for arrivals even at stations for which the IMS station processing failed to register any detection.

  4. jQuery Tools UI Library

    CERN Document Server

    Libby, Alex

    2012-01-01

    A practical tutorial with powerful yet simple projects that are quick to implement. This book is aimed at developers who have prior jQuery knowledge, but may not have any prior experience with jQuery Tools. It is possible that they may have started with the basics of jQuery Tools, but want to learn more about how it can be used, as well as get ideas for future projects.

  5. Evaluating Multidimensional Queries by Diamond Dicing

    CERN Document Server

    Webb, Hazel; Lemire, Daniel

    2010-01-01

    Queries that constrain multiple dimensions simultaneously are difficult to express and compute efficiently in both Structured Query Language (SQL) and multidimensional languages. We introduce the diamond cube operator to facilitate the expression of one such class of multidimensional query. We have developed, implemented and tested algorithms to compute diamonds on both real and synthetic large data sets. We show that our custom implementation is more than twenty-five times faster, on a large data set, than popular database engines.

  6. Determinants of Proteolysis and Cell-Binding for the Shigella flexneri Cytotoxin, SigA.

    Science.gov (United States)

    Chua, Eng Guan; Al-Hasani, Keith; Scanlon, Martin; Adler, Ben; Sakellaris, Harry

    2015-11-01

    Shigella flexneri secretes an enterotoxic, SPATE family autotransporter (AT), SigA, which has cytopathic activity towards cultured epithelial cells. Its cytopathic activity is due to its ability to degrade the cytoskeletal protein, α-fodrin. The mechanisms by which AT toxins target cells and tissues differ and the details of how SigA acts are not known. In the current study, the determinants of proteolysis and cell-targeting for SigA were determined. We demonstrate that the SigA passenger or α-domain consists of two functionally distinct domains, designated α1 and α2, which are sufficient to specify proteolytic and cell-binding activities, respectively.

  7. Queries with Guarded Negation (full version)

    CERN Document Server

    Barany, Vince; Otto, Martin

    2012-01-01

    A well-established and fundamental insight in database theory is that negation (also known as complementation) tends to make queries difficult to process and difficult to reason about. Many basic problems are decidable and admit practical algorithms in the case of unions of conjunctive queries, but become difficult or even undecidable when queries are allowed to contain negation. Inspired by recent results in finite model theory, we consider a restricted form of negation, guarded negation. We introduce a fragment of SQL, called GN-SQL, as well as a fragment of Datalog with stratified negation, called GN-Datalog, that allow only guarded negation, and we show that these query languages are computationally well behaved, in terms of testing query containment, query evaluation, open-world query answering, and boundedness. GN-SQL and GN-Datalog subsume a number of well known query languages and constraint languages, such as unions of conjunctive queries, monadic Datalog, and frontier-guarded tgds. In addition, an a...

  8. Oceanographic ontology-based spatial knowledge query

    Institute of Scientific and Technical Information of China (English)

    2005-01-01

    The construction of oceanographic ontologies is fundamental to the "digital ocean". Therefore, on the basis of introduction of new concept of oceanographic ontology, an oceanographic ontology-based spatial knowledge query (OOBSKQ) method was proposed and developed. Because the method uses a natural language to describe query conditions and the query result is highly integrated knowledge,it can provide users with direct answers while hiding the complicated computation and reasoning processes, and achieves intelligent,automatic oceanographic spatial information query on the level of knowledge and semantics. A case study of resource and environmental application in bay has shown the implementation process of the method and its feasibility and usefulness.

  9. Querying moving objects detected by sensor networks

    CERN Document Server

    Bestehorn, Markus

    2012-01-01

    Declarative query interfaces to Sensor Networks (SN) have become a commodity. These interfaces allow access to SN deployed for collecting data using relational queries. However, SN are not confined to data collection, but may track object movement, e.g., wildlife observation or traffic monitoring. While rational approaches are well suited for data collection, research on ""Moving Object Databases"" (MOD) has shown that relational operators are unsuitable to express information needs on object movement, i.e., spatio-temporal queries. ""Querying Moving Objects Detected by Sensor Networks"" studi

  10. Topic Level Disambiguation for Weak Queries

    Directory of Open Access Journals (Sweden)

    Zhang, Hui

    2013-09-01

    Full Text Available Despite limited success, today's information retrieval (IR systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries. Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queries. However, existing IR approaches such as query expansion are not overly effective because they make little effort to analyze and exploit the meanings of the queries. Furthermore, word sense disambiguation approaches, which rely on textual context, are ineffective against weak queries that are typically short. Motivated by the demand for a robust IR system that can consistently provide highly accurate results, the proposed study implemented a novel topic detection that leveraged both the language model and structural knowledge of Wikipedia and systematically evaluated the effect of query disambiguation and topic-based retrieval approaches on TREC collections. The results not only confirm the effectiveness of the proposed topic detection and topic-based retrieval approaches but also demonstrate that query disambiguation does not improve IR as expected.

  11. Effective Density Queries of Continuously Moving Objects

    DEFF Research Database (Denmark)

    Jensen, Christian Søndergaard; Lin, D.; Ooi, B.C.

    2006-01-01

    control system, we need to identify the places that are or would be affected by a traffic jam, and report this information to drivers so that they can choose a less congested route. As a naive way to solve the problem is prohibitively expensive, we first introduce a framework which makes the problem......In this paper, we study a newly emerging type of queries on moving objects - the density query. Basically, this query locates regions in the data space where the density of the objects is high. This type of queries is especially useful in Location Based Services (LBS). For example, in a traffic...

  12. Adding Query Privacy to Robust DHTs

    DEFF Research Database (Denmark)

    Backes, Michael; Goldberg, Ian; Kate, Aniket

    2011-01-01

    of obtaining query privacy over robust DHTs. Finally, we compare the performance of our privacy-preserving protocols with their more privacy-invasive counterparts. We observe that there is no increase in the message complexity and only a small overhead in the computational complexity....... intermediate peers that (help to) route the queries towards their destinations. In this paper, we satisfy this requirement by presenting an approach for providing privacy for the keys in DHT queries. We use the concept of oblivious transfer (OT) in communication over DHTs to preserve query privacy without...

  13. Database queries and constraints via lifting problems

    CERN Document Server

    Spivak, David I

    2012-01-01

    Previous work has shown a tight relationship between databases and categories. In the present paper we extend that connection to show that certain queries and constraints correspond to the algebro-topological notion of lifting problems. In our formulation, each so-called SPARQL graph pattern query corresponds to a lifting problem, and each solution to the query corresponds to a lift. We interpret constraints within the same formalism and then investigate some formal properties of queries and constraints, e.g. their behavior under data migration functors.

  14. Object-Extended OLAP Querying

    DEFF Research Database (Denmark)

    Pedersen, Torben Bach; Gu, Junmin; Shoshani, Arie

    2009-01-01

    On-line analytical processing (OLAP) systems based on a dimensional view of data have found widespread use in business applications and are being used increasingly in non-standard applications. These systems provide good performance and ease-of-use. However, the complex structures and relationshi...... with performance measurements that show that the approach is a viable alternative to a physically integrated data warehouse.......On-line analytical processing (OLAP) systems based on a dimensional view of data have found widespread use in business applications and are being used increasingly in non-standard applications. These systems provide good performance and ease-of-use. However, the complex structures and relationships...... inherent in data in non-standard applications are not accommodated well by OLAP systems. In contrast, object database systems are built to handle such complexity, but do not support OLAP-type querying well. This paper presents the concepts and techniques underlying a flexible, "multi-model" federated...

  15. Query Optimization for Deductive Databases

    Institute of Scientific and Technical Information of China (English)

    周傲英; 施伯乐

    1995-01-01

    A systematic,efficient compilation method for query evaluation of Deductive Databases (DeDB) is proposed in this paper.In order to eliminate redundancy and to minimize the potentially relevant facts,which are two key issues to the efficiency of a DeDB,the compilation process is decomposed into two phases.The first is the pre-compilation phase,which is responsible for the minimization of the potentially relevant facts.The second,which we refer to as the general compilation phase,is responsible for the elimination of redundancy.The rule/goal graph devised by J.D.Ullman is appropriately extended and used as a uniform formalism.Two general algorithms corresponding to the two phases respectively are described intuitively and formally.

  16. Object-Extended OLAP Querying

    DEFF Research Database (Denmark)

    Pedersen, Torben Bach; Gu, Junmin; Shoshani, Arie

    2009-01-01

    inherent in data in non-standard applications are not accommodated well by OLAP systems. In contrast, object database systems are built to handle such complexity, but do not support OLAP-type querying well. This paper presents the concepts and techniques underlying a flexible, "multi-model" federated...... system that enables OLAP users to exploit simultaneously the features of OLAP and object systems. The system allows data to be handled using the most appropriate data model and technology: OLAP systems for dimensional data and object database systems for more complex, general data. This allows data...... analysis on the OLAP data to be significantly enriched by the use of additional object data. Additionally, physical integration of the OLAP and the object data can be avoided. As a vehicle for demonstrating the capabilities of the system, a prototypical OLAP language is defined and extended to naturally...

  17. Querying Business Process Models with VMQL

    DEFF Research Database (Denmark)

    Störrle, Harald; Acretoaie, Vlad

    2013-01-01

    The Visual Model Query Language (VMQL) has been invented with the objectives (1) to make it easier for modelers to query models effectively, and (2) to be universally applicable to all modeling languages. In previous work, we have applied VMQL to UML, and validated the first of these two claims. ...

  18. Path Minima Queries in Dynamic Weighted Trees

    DEFF Research Database (Denmark)

    Davoodi, Pooya; Brodal, Gerth Stølting; Satti, Srinivasa Rao

    2011-01-01

    In the path minima problem on a tree, each edge is assigned a weight and a query asks for the edge with minimum weight on a path between two nodes. For the dynamic version of the problem, where the edge weights can be updated, we give data structures that achieve optimal query time\\todo{what about...

  19. Meet Charles, big data query advisor

    NARCIS (Netherlands)

    Sellam, T.; Kersten, M.

    2013-01-01

    In scientific data management and business analytics, the most informative queries are a holy grail. Data collection becomes increasingly simpler, yet data exploration gets significantly harder. Exploratory querying is likely to return an empty or an overwhelming result set. On the other hand, data

  20. Meet Charles, big data query advisor

    NARCIS (Netherlands)

    Sellam, T.; Kersten, M.

    2013-01-01

    In scientific data management and business analytics, the most informative queries are a holy grail. Data collection becomes increasingly simpler, yet data exploration gets significantly harder. Exploratory querying is likely to return an empty or an overwhelming result set. On the other hand, data

  1. Quantum associative memory with improved distributed queries

    CERN Document Server

    Njafa, J -P Tchapet; Woafo, Paul

    2012-01-01

    The paper proposes an improved quantum associative algorithm with distributed query based on model proposed by Ezhov et al. We introduce two modifications of the query that optimized data retrieval of correct multi-patterns simultaneously for any rate of the number of the recognition pattern on the total patterns. Simulation results are given.

  2. Improving Web Search for Difficult Queries

    Science.gov (United States)

    Wang, Xuanhui

    2009-01-01

    Search engines have now become essential tools in all aspects of our life. Although a variety of information needs can be served very successfully, there are still a lot of queries that search engines can not answer very effectively and these queries always make users feel frustrated. Since it is quite often that users encounter such "difficult…

  3. Efficient caching for constrained skyline queries

    DEFF Research Database (Denmark)

    Mortensen, Michael Lind; Chester, Sean; Assent, Ira;

    2015-01-01

    Constrained skyline queries retrieve all points that optimize some user’s preferences subject to orthogonal range constraints, but at significant computational cost. This paper is the first to propose caching to improve constrained skyline query response time. Because arbitrary range constraints ...

  4. Exploring features for automatic identification of news queries through query logs

    Institute of Scientific and Technical Information of China (English)

    Xiaojuan; ZHANG; Jian; LI

    2014-01-01

    Purpose:Existing researches of predicting queries with news intents have tried to extract the classification features from external knowledge bases,this paper tries to present how to apply features extracted from query logs for automatic identification of news queries without using any external resources.Design/methodology/approach:First,we manually labeled 1,220 news queries from Sogou.com.Based on the analysis of these queries,we then identified three features of news queries in terms of query content,time of query occurrence and user click behavior.Afterwards,we used 12 effective features proposed in literature as baseline and conducted experiments based on the support vector machine(SVM)classifier.Finally,we compared the impacts of the features used in this paper on the identification of news queries.Findings:Compared with baseline features,the F-score has been improved from 0.6414 to0.8368 after the use of three newly-identified features,among which the burst point(bst)was the most effective while predicting news queries.In addition,query expression(qes)was more useful than query terms,and among the click behavior-based features,news URL was the most effective one.Research limitations:Analyses based on features extracted from query logs might lead to produce limited results.Instead of short queries,the segmentation tool used in this study has been more widely applied for long texts.Practical implications:The research will be helpful for general-purpose search engines to address search intents for news events.Originality/value:Our approach provides a new and different perspective in recognizing queries with news intent without such large news corpora as blogs or Twitter.

  5. Adding Query Privacy to Robust DHTs

    DEFF Research Database (Denmark)

    Backes, Michael; Goldberg, Ian; Kate, Aniket

    2011-01-01

    Interest in anonymous communication over distributed hash tables (DHTs) has increased in recent years. However, almost all known solutions solely aim at achieving sender or requestor anonymity in DHT queries. In many application scenarios, it is crucial that the queried key remains secret from...... intermediate peers that (help to) route the queries towards their destinations. In this paper, we satisfy this requirement by presenting an approach for providing privacy for the keys in DHT queries. We use the concept of oblivious transfer (OT) in communication over DHTs to preserve query privacy without...... compromising spam resistance. Although our OT-based approach can work over any DHT, we concentrate on communication over robust DHTs that can tolerate Byzantine faults and resist spam. We choose the best-known robust DHT construction, and employ an efficient OT protocol well-suited for achieving our goal...

  6. An Effective Information Retrieval for Ambiguous Query

    CERN Document Server

    Roul, R K

    2012-01-01

    Search engine returns thousands of web pages for a single user query, in which most of them are not relevant. In this context, effective information retrieval from the expanding web is a challenging task, in particular, if the query is ambiguous. The major question arises here is that how to get the relevant pages for an ambiguous query. We propose an approach for the effective result of an ambiguous query by forming community vector based on association concept of data minning using vector space model and the freedictionary. We develop clusters by computing the similarity between community vectors and document vectors formed from the extracted web pages by the search engine. We use Gensim package to implement the algorithm because of its simplicity and robust nature. Analysis shows that our approach is an effective way to form clusters for an ambiguous query.

  7. Ensuring Query Compatibility with Evolving XML Schemas

    CERN Document Server

    Genevès, Pierre; Quint, Vincent

    2008-01-01

    During the life cycle of an XML application, both schemas and queries may change from one version to another. Schema evolutions may affect query results and potentially the validity of produced data. Nowadays, a challenge is to assess and accommodate the impact of theses changes in rapidly evolving XML applications. This article proposes a logical framework and tool for verifying forward/backward compatibility issues involving schemas and queries. First, it allows analyzing relations between schemas. Second, it allows XML designers to identify queries that must be reformulated in order to produce the expected results across successive schema versions. Third, it allows examining more precisely the impact of schema changes over queries, therefore facilitating their reformulation.

  8. Conservation of thiol-oxidative stress responses regulated by SigR orthologues in actinomycetes

    Science.gov (United States)

    Kim, Min-Sik; Dufour, Yann S.; Yoo, Ji Sun; Cho, Yoo-Bok; Park, Joo-Hong; Nam, Gi-Baeg; Kim, Hae Min; Lee, Kang-Lok; Donohue, Timothy J.; Roe, Jung-Hye

    2015-01-01

    Summary Numerous thiol-reactive compounds cause oxidative stress where cells counteract by activation of survival strategies regulated by thiol-based sensors. In Streptomyces coelicolor, a model actinomycete, a sigma/antisigma pair SigR/RsrA controls the response to thiol-oxidative stress. To unravel its full physiological functions, chromatin immuno-precipitation combined with sequence and transcript analyses were employed to identify 108 SigR target genes in S. coelicolor and to predict orthologous regulons across actinomycetes. In addition to reported genes for thiol homeostasis, protein degradation and ribosome modulation, 64 additional operons were identified suggesting new functions of this global regulator. We demonstrate that SigR maintains the level and activity of the housekeeping sigma factor HrdB during thiol-oxidative stress, a novel strategy for stress responses. We also found that SigR defends cells against UV and thiol-reactive damages, in which repair UvrA takes a part. Using a refined SigR-binding sequence model, SigR orthologues and their targets were predicted in 42 actinomycetes. This revealed a conserved core set of SigR targets to function for thiol homeostasis, protein quality control, possible modulation of transcription and translation, flavin-mediated redox reactions, and Fe-S delivery. The composition of the SigR regulon reveals a robust conserved physiological mechanism to deal with thiol-oxidative stress from bacteria to human. PMID:22651816

  9. Snapshot of SIG: A Look at Four States' Approaches to School Turnaround

    Science.gov (United States)

    Quillin, Jessica

    2012-01-01

    Thousands of schools across the country are chronically low performing, and they operate within districts and states that are struggling to help them improve. The School Improvement Grants (SIG) program is designed to channel federal funds to states and districts facing the task of turning around struggling schools. SIG, a part of the Elementary…

  10. SIgA binding to mucosal surfaces is mediated by mucin-mucin interactions.

    Directory of Open Access Journals (Sweden)

    Hannah L Gibbins

    Full Text Available The oral mucosal pellicle is a layer of absorbed salivary proteins, including secretory IgA (SIgA, bound onto the surface of oral epithelial cells and is a useful model for all mucosal surfaces. The mechanism by which SIgA concentrates on mucosal surfaces is examined here using a tissue culture model with real saliva. Salivary mucins may initiate the formation of the mucosal pellicle through interactions with membrane-bound mucins on cells. Further protein interactions with mucins may then trigger binding of other pellicle proteins. HT29 colon cell lines, which when treated with methotrexate (HT29-MTX produce a gel-forming mucin, were used to determine the importance of these mucin-mucin interactions. Binding of SIgA to cells was then compared using whole mouth saliva, parotid (mucin-free saliva and a source of purified SIgA. Greatest SIgA binding occurred when WMS was incubated with HT29-MTX expressing mucus. Since salivary MUC5B was only able to bind to cells which produced mucus and purified SIgA showed little binding to the same cells we conclude that most SIgA binding to mucosal cells occurs because SIgA forms complexes with salivary mucins which then bind to cells expressing membrane-bound mucins. This work highlights the importance of mucin interactions in the development of the mucosal pellicle.

  11. Deletion in sigB in Bacillus cereus affects spore properties

    NARCIS (Netherlands)

    Vries, de Y.P.; Hornstra, L.M.; Atmadja, R.D.; Schaik, van W.; Vos, de W.M.; Abee, T.

    2005-01-01

    In Bacillus cereus and other gram-positive bacteria the alternative sigma factor ¿B is an important regulator of the stress response. Deletion of the sigB gene generally leads to a stress-sensitive phenotype of vegetative cells. In this study, we describe the effect of the deletion of the sigB gene

  12. Querying Schemas With Access Restrictions

    CERN Document Server

    Benedikt, Michael; Ley, Clemens

    2012-01-01

    We study verification of systems whose transitions consist of accesses to a Web-based data-source. An access is a lookup on a relation within a relational database, fixing values for a set of positions in the relation. For example, a transition can represent access to a Web form, where the user is restricted to filling in values for a particular set of fields. We look at verifying properties of a schema describing the possible accesses of such a system. We present a language where one can describe the properties of an access path, and also specify additional restrictions on accesses that are enforced by the schema. Our main property language, AccLTL, is based on a first-order extension of linear-time temporal logic, interpreting access paths as sequences of relational structures. We also present a lower-level automaton model, Aautomata, which AccLTL specifications can compile into. We show that AccLTL and A-automata can express static analysis problems related to "querying with limited access patterns" that h...

  13. BioSig: the free and open source software library for biomedical signal processing.

    Science.gov (United States)

    Vidaurre, Carmen; Sander, Tilmann H; Schlögl, Alois

    2011-01-01

    BioSig is an open source software library for biomedical signal processing. The aim of the BioSig project is to foster research in biomedical signal processing by providing free and open source software tools for many different application areas. Some of the areas where BioSig can be employed are neuroinformatics, brain-computer interfaces, neurophysiology, psychology, cardiovascular systems, and sleep research. Moreover, the analysis of biosignals such as the electroencephalogram (EEG), electrocorticogram (ECoG), electrocardiogram (ECG), electrooculogram (EOG), electromyogram (EMG), or respiration signals is a very relevant element of the BioSig project. Specifically, BioSig provides solutions for data acquisition, artifact processing, quality control, feature extraction, classification, modeling, and data visualization, to name a few. In this paper, we highlight several methods to help students and researchers to work more efficiently with biomedical signals.

  14. Query-By-Keywords (QBK): Query Formulation Using Semantics and Feedback

    Science.gov (United States)

    Telang, Aditya; Chakravarthy, Sharma; Li, Chengkai

    The staples of information retrieval have been querying and search, respectively, for structured and unstructured repositories. Processing queries over known, structured repositories (e.g., Databases) has been well-understood, and search has become ubiquitous when it comes to unstructured repositories (e.g., Web). Furthermore, searching structured repositories has been explored to a limited extent. However, there is not much work in querying unstructured sources. We argue that querying unstructured sources is the next step in performing focused retrievals. This paper proposed a new approach to generate queries from search-like inputs for unstructured repositories. Instead of burdening the user with schema details, we believe that pre-discovered semantic information in the form of taxonomies, relationship of keywords based on context, and attribute & operator compatibility can be used to generate query skeletons. Furthermore, progressive feedback from users can be used to improve the accuracy of query skeletons generated.

  15. Antibiotic resistance of canine Staphylococcus intermedius group (SIG)--practical implications.

    Science.gov (United States)

    Chrobak, D; Kizerwetter-Swida, M; Rzewuska, M; Binek, M

    2011-01-01

    A total of 221 SIG strains were isolated from clinical samples of canine origin submitted to the Diagnostic Laboratory of the Division of Bacteriology and Molecular Biology at the Warsaw University of Life Sciences in Warsaw during the period 2006-2010. The aim of the study was to investigate the frequency of prevalence of methicillin-resistant SIG strains and to determine the MIC values of cephalotin, amoxicillin/clavulanic acid, ciprofloxacin, clindamycin, gentamicin, chloramphenicol, mupirocin for a collection of randomly selected 79 strains belonging to Staphylococcus intermedius group (SIG), including 23 mecA-positive and 56 mecA-negative strains. All isolates were identified as belonging to SIG based on their phenotypic properties and PCR amplification of S. intermedius-specific fragment of the 16S rRNA gene. The mecA gene was detected in 26 (12%) of 221 SIG strains. All tested mecA-negative SIG strains were susceptible to amoxicillin/clavulanic acid and cephalotin. One of the 56 mecA-negative SIG strains was resistant to ciprofloxacin, six (11%) to gentamicin. It was found that sixteen (29%) of 56 mecA-negative SIG strains were resistant to clindamycin. Most of the mecA-positive SIG strains were resistant to ciprofloxacin (96%), clindamycin (96%), and gentamicin (96%). Only one MRSIG strain was resistant to chloramphenicol. All examined mecA-positive SIG strains were found to be susceptible to mupirocin. Our results imply that staphylococcal multidrug resistance has become more prevalent, which could lead to difficulties in effective treatment. With some resistant strains the only therapeutic possibility are antimicrobial agents important in human medicine. New regulations for veterinary medicine concerning appropriate therapy of infections caused by multidrug-resistat staphylococci are needed.

  16. Swelling Index of Glutenin (SIG) Related to Protein Quality,Dough Characters and Noodle Quality

    Institute of Scientific and Technical Information of China (English)

    HU Xin-zhong; WEI Yi-min; M I P Kovacs; WANG Chun

    2004-01-01

    The swelling index of glutenin (SIG) and the protein fraction of 25 Chinese wheat varieties were studied with new protein fractions extracting method. The protein fractions compose of monomeric protein, soluble glutenin and insoluble glutenin. The relations between other protein index, dough character, and fresh noodle quality were also discussed. The SIG results at different time is positively and highly significantly related to the insoluble glutenin content (r=0.808 -0.867, P< 0.01). The SIG result can reflect the insoluble glutenin content. The protein content, gluten index, farinograph stability time, extensibility length and extensigramenergy were positively and significantly correlated with SIG5 and SIG20 (r=0.516- 0.734, P<0.05, 0.01).SIG proved to be applicable in Chinese wheat dough evaluation. Fresh noodle making quality parameters were evaluated by fresh noodle length, thickness, maximum resistance to extension, extension area and extension distance, while cooked noodle texture was determined by cutting firmness, compression recovery, surface firmness and TPA by using a texture analyzer of TA.XT2i.The noodle cooking quality was significantly correlated with SIG value. The surface firmness and cutting firmness were more desirable for predicating the quality difference than TPA test and compression. Cooking loss and water absorption were negatively related to SIG value and insoluble content (r=-0.556 - - 0.787, P< 0.05, 0.01) . The results showed that SIG test was also suitable in evaluating noodle making and cooking quality in very small sample size, which was very important in wheat breeding programs. Therefore,SIG test, as a small scale test, is suitable to evaluate dough rheological properties for Chinese wheat varieties, and will be helpful in cereal research and wheat breeding program, especially, in early generations.

  17. Order Theoretical Semantic Recommendation

    Energy Technology Data Exchange (ETDEWEB)

    Joslyn, Cliff A.; Hogan, Emilie A.; Paulson, Patrick R.; Peterson, Elena S.; Stephan, Eric G.; Thomas, Dennis G.

    2013-07-23

    Mathematical concepts of order and ordering relations play multiple roles in semantic technologies. Discrete totally ordered data characterize both input streams and top-k rank-ordered recommendations and query output, while temporal attributes establish numerical total orders, either over time points or in the more complex case of startend temporal intervals. But also of note are the fully partially ordered data, including both lattices and non-lattices, which actually dominate the semantic strcuture of ontological systems. Scalar semantic similarities over partially-ordered semantic data are traditionally used to return rank-ordered recommendations, but these require complementation with true metrics available over partially ordered sets. In this paper we report on our work in the foundations of partial order measurement in ontologies, with application to top-k semantic recommendation in workflows.

  18. Spatial information semantic query based on SPARQL

    Science.gov (United States)

    Xiao, Zhifeng; Huang, Lei; Zhai, Xiaofang

    2009-10-01

    How can the efficiency of spatial information inquiries be enhanced in today's fast-growing information age? We are rich in geospatial data but poor in up-to-date geospatial information and knowledge that are ready to be accessed by public users. This paper adopts an approach for querying spatial semantic by building an Web Ontology language(OWL) format ontology and introducing SPARQL Protocol and RDF Query Language(SPARQL) to search spatial semantic relations. It is important to establish spatial semantics that support for effective spatial reasoning for performing semantic query. Compared to earlier keyword-based and information retrieval techniques that rely on syntax, we use semantic approaches in our spatial queries system. Semantic approaches need to be developed by ontology, so we use OWL to describe spatial information extracted by the large-scale map of Wuhan. Spatial information expressed by ontology with formal semantics is available to machines for processing and to people for understanding. The approach is illustrated by introducing a case study for using SPARQL to query geo-spatial ontology instances of Wuhan. The paper shows that making use of SPARQL to search OWL ontology instances can ensure the result's accuracy and applicability. The result also indicates constructing a geo-spatial semantic query system has positive efforts on forming spatial query and retrieval.

  19. Query Optimizations over Decentralized RDF Graphs

    KAUST Repository

    Abdelaziz, Ibrahim

    2017-05-18

    Applications in life sciences, decentralized social networks, Internet of Things, and statistical linked dataspaces integrate data from multiple decentralized RDF graphs via SPARQL queries. Several approaches have been proposed to optimize query processing over a small number of heterogeneous data sources by utilizing schema information. In the case of schema similarity and interlinks among sources, these approaches cause unnecessary data retrieval and communication, leading to poor scalability and response time. This paper addresses these limitations and presents Lusail, a system for scalable and efficient SPARQL query processing over decentralized graphs. Lusail achieves scalability and low query response time through various optimizations at compile and run times. At compile time, we use a novel locality-aware query decomposition technique that maximizes the number of query triple patterns sent together to a source based on the actual location of the instances satisfying these triple patterns. At run time, we use selectivity-awareness and parallel query execution to reduce network latency and to increase parallelism by delaying the execution of subqueries expected to return large results. We evaluate Lusail using real and synthetic benchmarks, with data sizes up to billions of triples on an in-house cluster and a public cloud. We show that Lusail outperforms state-of-the-art systems by orders of magnitude in terms of scalability and response time.

  20. Complexity of temporal query abduction in DL-Lite

    CSIR Research Space (South Africa)

    Klarman, S

    2014-07-01

    Full Text Available and Temporal Query Language, based on the combination of LTL with conjunctive queries. In this defined setting, we study the complexity of temporal query abduction, assuming different restrictions on the problem and minimality criteria for abductive solutions...

  1. Interaction of Mycobacterium tuberculosis RshA and SigH is mediated by salt bridges.

    Directory of Open Access Journals (Sweden)

    Shiva Kumar

    Full Text Available The alternate sigma factor sigH of Mycobacterium tuberculosis is expressed under stress and acts as a major regulator of several genes, including some other sigma factors and redox systems. While it is auto-regulated by its own promoter at the transcriptional level, its regulation at the post-translational level is through its cognate protein, an anti-sigma factor, RshA. Hither before RshA was believed to be a zinc-associated anti-sigma factor (ZAS and the binding of RshA to SigH is redox dependent. Here, we show that RshA coordinates a [2Fe-2S] cluster using cysteines as ligands and native RshA has more affinity to [2Fe-2S] cluster than to zinc. Furthermore, we used amide hydrogen deuterium exchange mass spectrometry (HDX-MS, followed by site-directed mutagenesis in SigH and RshA, to elucidate the interaction mechanism of RshA and SigH and the potential role of metal ion clustering in SigH regulation. Three regions in SigH, comprising of residues 1-25, 58-69, 90-111, 115-132 and 157-196 and residues 35-57 of RshA show decreased deuterium exchange and reflect decreased solvent accessibility upon complexation with SigH. Of the three RshA mutants, created based on the HDX results, the RsHA E37A mutant shows stronger interaction with SigH, relative to WT RshA, while the H49A mutant abolishes interactions and the C(53XXC(56AXXA mutant has no effect on complexation with SigH. The D22A, D160A and E162 SigH mutants show significantly decreased binding to RshA and the E168A mutant completely abolished interactions with RshA, indicating that the SigH-RshA interaction is mediated by salt bridges. In addition, SigH-RshA interaction does not require clustering of metal ions. Based on our results, we propose a molecular model of the SigH-RshA interaction.

  2. Query Through Heterogeneous Ontologies Using Association Matrix

    Institute of Scientific and Technical Information of China (English)

    KANG Da-zhou; XU Bao-wen; LU Jian-jiang; WANG Peng; LI Yan-hui

    2004-01-01

    This paper introduces the definition and calculation of the association matrix between ontologies.It uses the association matrix to describe the relations between concepts in different ontologies and uses concept vectors to represent queries; then computes the vectors with the association matrix in order to rewrite queries.This paper proposes a simple method of querying through heterogeneous Ontology using association matrix.This method is based on the correctness of approximate information filtering theory; and it is simple to be implemented and expected to run quite fast.

  3. Instant MDX queries for SQL Server 2012

    CERN Document Server

    Emond, Nicholas

    2013-01-01

    Get to grips with a new technology, understand what it is and what it can do for you, and then get to work with the most important features and tasks. This short, focused guide is a great way to get stated with writing MDX queries. New developers can use this book as a reference for how to use functions and the syntax of a query as well as how to use Calculated Members and Named Sets.This book is great for new developers who want to learn the MDX query language from scratch and install SQL Server 2012 with Analysis Services

  4. Relative aggregation operator in database fuzzy querying

    Directory of Open Access Journals (Sweden)

    Luminita DUMITRIU

    2005-12-01

    Full Text Available Fuzzy selection criteria querying relational databases include vague terms; they usually refer linguistic values form the attribute linguistic domains, defined as fuzzy sets. Generally, when a vague query is processed, the definitions of vague terms must already exist in a knowledge base. But there are also cases when vague terms must be dynamically defined, when a particular operation is used to aggregate simple criteria in a complex selection. The paper presents a new aggregation operator and the corresponding algorithm to evaluate the fuzzy query.

  5. Provenance Storage, Querying, and Visualization in PBase

    Energy Technology Data Exchange (ETDEWEB)

    Kianmajd, Parisa [University of California, Davis; Ludascher, Bertram [University of California, Davis; Missier, Paolo [Newcastle University, UK; Chirigati, Fernando [New York University; Wei, Yaxing [ORNL; Koop, David [New York University; Dey, Saumen [University of California, Davis

    2015-01-01

    We present PBase, a repository for scientific workflows and their corresponding provenance information that facilitates the sharing of experiments among the scientific community. PBase is interoperable since it uses ProvONE, a standard provenance model for scientific workflows. Workflows and traces are stored in RDF, and with the support of SPARQL and the tree cover encoding, the repository provides a scalable infrastructure for querying the provenance data. Furthermore, through its user interface, it is possible to: visualize workflows and execution traces; visualize reachability relations within these traces; issue SPARQL queries; and visualize query results.

  6. Query Load Balancing For Visible Object Extraction

    DEFF Research Database (Denmark)

    Bukauskas, Linas; Bøhlen, Michael Hanspeter

    2004-01-01

    Interactive visual data explorations impose rigid real-time requirements on the extraction of visible objects. Often these requirements are met by deploying powerful hardware that maintains the entire data set in huge main memory structures. In this paper we propose an approach that retrieves...... objects along the path. The visible objects are retrieved incrementally, and it is possible to precisely control the query load and the number of retrieved objects. The minimal distance path method issues frequent queries and retrieves the lowest possible number of objects at each query point. The end...

  7. Evaluating Trajectory Queries over Imprecise Location Data

    DEFF Research Database (Denmark)

    Xie, Scott, Xike; Cheng, Reynold; Yiu, Man Lung

    2012-01-01

    Trajectory queries, which retrieve nearby objects for every point of a given route, can be used to identify alerts of potential threats along a vessel route, or monitor the adjacent rescuers to a travel path. However, the locations of these objects (e.g., threats, succours) may not be precisely......, the query is quite time-consuming, since all the points on the trajectory are considered. In this paper, we study how to efficiently evaluate trajectory queries over imprecise location data, by proposing a new concept called the u-bisector. In general, the u-bisector is an extension of bisector to handle...

  8. Responsive web design with jQuery

    CERN Document Server

    Carlos, Gilberto

    2013-01-01

    Responsive Web Design with jQuery follows a standard tutorial-based approach, covering various aspects of responsive web design by building a comprehensive website.""Responsive Web Design with jQuery"" is aimed at web designers who are interested in building device-agnostic websites. You should have a grasp of standard HTML, CSS, and JavaScript development, and have a familiarity with graphic design. Some exposure to jQuery and HTML5 will be beneficial but isn't essential.

  9. OntoQuery: easy-to-use web-based OWL querying.

    Science.gov (United States)

    Tudose, Ilinca; Hastings, Janna; Muthukrishnan, Venkatesh; Owen, Gareth; Turner, Steve; Dekker, Adriano; Kale, Namrata; Ennis, Marcus; Steinbeck, Christoph

    2013-11-15

    The Web Ontology Language (OWL) provides a sophisticated language for building complex domain ontologies and is widely used in bio-ontologies such as the Gene Ontology. The Protégé-OWL ontology editing tool provides a query facility that allows composition and execution of queries with the human-readable Manchester OWL syntax, with syntax checking and entity label lookup. No equivalent query facility such as the Protégé Description Logics (DL) query yet exists in web form. However, many users interact with bio-ontologies such as chemical entities of biological interest and the Gene Ontology using their online Web sites, within which DL-based querying functionality is not available. To address this gap, we introduce the OntoQuery web-based query utility.  The source code for this implementation together with instructions for installation is available at http://github.com/IlincaTudose/OntoQuery. OntoQuery software is fully compatible with all OWL-based ontologies and is available for download (CC-0 license). The ChEBI installation, ChEBI OntoQuery, is available at http://www.ebi.ac.uk/chebi/tools/ontoquery. hastings@ebi.ac.uk.

  10. Alternative sigma factor SigK has a role in stress tolerance of group I Clostridium botulinum strain ATCC 3502.

    Science.gov (United States)

    Dahlsten, Elias; Kirk, David; Lindström, Miia; Korkeala, Hannu

    2013-06-01

    The role of the alternative sigma factor SigK in cold and osmotic stress tolerance of Clostridium botulinum ATCC 3502 was demonstrated by induction of sigK after temperature downshift and exposure to hyperosmotic conditions and by impaired growth of the sigK mutants under the respective conditions.

  11. A National Research Project Revitalizes and Strengthens a SIG's Membership, Leadership, and the Quality of Research in the Field

    Science.gov (United States)

    Educational Researcher, 2011

    2011-01-01

    A recent moratorium has temporarily halted the creation of new Special Interest Groups (SIGs) in the American Educational Research Association (AERA). The AERA SIG Executive Committee, the official governance body that oversees approximately 160 SIGs, requested this moratorium, which was subsequently approved by AERA Council. The purpose of the…

  12. Schedule Sales Query Report Generation System

    Data.gov (United States)

    General Services Administration — Schedule Sales Query presents sales volume figures as reported to GSA by contractors. The reports are generated as quarterly reports for the current year and the...

  13. Clean Air Markets - Compliance Query Wizard

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Compliance Query Wizard is part of a suite of Clean Air Markets-related tools that are accessible at http://ampd.epa.gov/ampd/. The Compliance module provides...

  14. Business information query expansion through semantic network

    Science.gov (United States)

    Gong, Zhiguo; Muyeba, Maybin; Guo, Jingzhi

    2010-02-01

    In this article, we propose a method for business information query expansions. In our approach, hypernym/hyponymy and synonym relations in WordNet are used as the basic expansion rules. Then we use WordNet Lexical Chains and WordNet semantic similarity to assign terms in the same query into different groups with respect to their semantic similarities. For each group, we expand the highest terms in the WordNet hierarchies with hypernym and synonym, the lowest terms with hyponym and synonym and all other terms with only synonym. In this way, the contradictory caused by full expansion can be well controlled. Furthermore, we use collection-related term semantic network to further improve the expansion performance. And our experiment reveals that our solution for query expansion can improve the query performance dramatically.

  15. Mining tree-query associations in graphs

    CERN Document Server

    Hoekx, Eveline

    2010-01-01

    New applications of data mining, such as in biology, bioinformatics, or sociology, are faced with large datasetsstructured as graphs. We introduce a novel class of tree-shapedpatterns called tree queries, and present algorithms for miningtree queries and tree-query associations in a large data graph. Novel about our class of patterns is that they can containconstants, and can contain existential nodes which are not counted when determining the number of occurrences of the patternin the data graph. Our algorithms have a number of provableoptimality properties, which are based on the theory of conjunctive database queries. We propose a practical, database-oriented implementation in SQL, and show that the approach works in practice through experiments on data about food webs, protein interactions, and citation analysis.

  16. Medical Expenditure Panel Survey (MEPS) Query Tool

    Data.gov (United States)

    U.S. Department of Health & Human Services — MEPSnet HC Query Tool MEPSnet/Household Component provides easy access to nationally representative statistics of health care use, expenditures, sources of payment,...

  17. Range Query Processing in Multidisk Systems

    Institute of Scientific and Technical Information of China (English)

    李建中

    1992-01-01

    In order to reduce the disk access time,a database can be stored on several simultaneously accessible disks.In this paper,we are concerned with the dynamic d-attribute database allocation problem for range queries,An allocation method,called coordinate moule allocation method,is proposed to allocate data in a d-attribute database among disks so that the maximum disk accessing concurrency can be achieved for range queries.Our analysis and experiments show that the method achieves the optimum or near-optimum parallelism for range queries.The paper offers the conditions under which the method is optimal .The worst case bounds of the performance of the method are also given.In addition,the parallel algorithm of processing range queries in described at the end of the paper.The method has been used in the statistic and scientific database management system whic is being designed by us.

  18. Efficient Probabilistic Inference with Partial Ranking Queries

    CERN Document Server

    Huang, Jonathan; Guestrin, Carlos E

    2012-01-01

    Distributions over rankings are used to model data in various settings such as preference analysis and political elections. The factorial size of the space of rankings, however, typically forces one to make structural assumptions, such as smoothness, sparsity, or probabilistic independence about these underlying distributions. We approach the modeling problem from the computational principle that one should make structural assumptions which allow for efficient calculation of typical probabilistic queries. For ranking models, "typical" queries predominantly take the form of partial ranking queries (e.g., given a user's top-k favorite movies, what are his preferences over remaining movies?). In this paper, we argue that riffled independence factorizations proposed in recent literature [7, 8] are a natural structural assumption for ranking distributions, allowing for particularly efficient processing of partial ranking queries.

  19. Mobile Information Access with Spoken Query Answering

    DEFF Research Database (Denmark)

    Brøndsted, Tom; Larsen, Henrik Legind; Larsen, Lars Bo

    2006-01-01

    This paper addresses the problem of information and service accessibility in mobile devices with limited resources. A solution is developed and tested through a prototype that applies state-of-the-art Distributed Speech Recognition (DSR) and knowledge-based Information Retrieval (IR) processing...... for spoken query answering. For the DSR part, a configurable DSR system is implemented on the basis of the ETSI-DSR advanced front-end and the SPHINX IV recognizer. For the knowledge-based IR part, a distributed system solution is developed for fast retrieval of the most relevant documents, with a text...... window focused over the part which most likely contains an answer to the query. The two systems are integrated into a full spoken query answering system. The prototype can answer queries and questions within the chosen football (soccer) test domain, but the system has the flexibility for being ported...

  20. Querying temporal databases via OWL 2 QL

    CSIR Research Space (South Africa)

    Klarman, S

    2014-06-01

    Full Text Available SQL:2011, the most recently adopted version of the SQL query language, has unprecedentedly standardized the representation of temporal data in relational databases. Following the successful paradigm of ontology-based data access, we develop a...

  1. Search Result Diversification Based on Query Facets

    Institute of Scientific and Technical Information of China (English)

    胡莎; 窦志成; 王晓捷; 继荣

    2015-01-01

    In search engines, different users may search for different information by issuing the same query. To satisfy more users with limited search results, search result diversification re-ranks the results to cover as many user intents as possible. Most existing intent-aware diversification algorithms recognize user intents as subtopics, each of which is usually a word, a phrase, or a piece of description. In this paper, we leverage query facets to understand user intents in diversification, where each facet contains a group of words or phrases that explain an underlying intent of a query. We generate subtopics based on query facets and propose faceted diversification approaches. Experimental results on the public TREC 2009 dataset show that our faceted approaches outperform state-of-the-art diversification models.

  2. A Query Language for Formal Mathematical Libraries

    CERN Document Server

    Rabe, Florian

    2012-01-01

    One of the most promising applications of mathematical knowledge management is search: Even if we restrict attention to the tiny fragment of mathematics that has been formalized, the amount exceeds the comprehension of an individual human. Based on the generic representation language MMT, we introduce the mathematical query language QMT: It combines simplicity, expressivity, and scalability while avoiding a commitment to a particular logical formalism. QMT can integrate various search paradigms such as unification, semantic web, or XQuery style queries, and QMT queries can span different mathematical libraries. We have implemented QMT as a part of the MMT API. This combination provides a scalable indexing and query engine that can be readily applied to any library of mathematical knowledge. While our focus here is on libraries that are available in a content markup language, QMT naturally extends to presentation and narration markup languages.

  3. Clean Air Markets - Allowances Query Wizard

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Allowances Query Wizard is part of a suite of Clean Air Markets-related tools that are accessible at http://camddataandmaps.epa.gov/gdm/index.cfm. The Allowances...

  4. Evaluating SPARQL queries on massive RDF datasets

    KAUST Repository

    Al-Harbi, Razen

    2015-08-01

    Distributed RDF systems partition data across multiple computer nodes. Partitioning is typically based on heuristics that minimize inter-node communication and it is performed in an initial, data pre-processing phase. Therefore, the resulting partitions are static and do not adapt to changes in the query workload; as a result, existing systems are unable to consistently avoid communication for queries that are not favored by the initial data partitioning. Furthermore, for very large RDF knowledge bases, the partitioning phase becomes prohibitively expensive, leading to high startup costs. In this paper, we propose AdHash, a distributed RDF system which addresses the shortcomings of previous work. First, AdHash initially applies lightweight hash partitioning, which drastically minimizes the startup cost, while favoring the parallel processing of join patterns on subjects, without any data communication. Using a locality-aware planner, queries that cannot be processed in parallel are evaluated with minimal communication. Second, AdHash monitors the data access patterns and adapts dynamically to the query load by incrementally redistributing and replicating frequently accessed data. As a result, the communication cost for future queries is drastically reduced or even eliminated. Our experiments with synthetic and real data verify that AdHash (i) starts faster than all existing systems, (ii) processes thousands of queries before other systems become online, and (iii) gracefully adapts to the query load, being able to evaluate queries on billion-scale RDF data in sub-seconds. In this demonstration, audience can use a graphical interface of AdHash to verify its performance superiority compared to state-of-the-art distributed RDF systems.

  5. Nearest Neighbor Queries in Road Networks

    DEFF Research Database (Denmark)

    Jensen, Christian Søndergaard; Kolar, Jan; Pedersen, Torben Bach

    2003-01-01

    With wireless communications and geo-positioning being widely available, it becomes possible to offer new e-services that provide mobile users with information about other mobile objects. This paper concerns active, ordered k-nearest neighbor queries for query and data objects that are moving...... for the nearest neighbor search in the prototype is presented in detail. In addition, the paper reports on results from experiments with the prototype system....

  6. Processing keyword queries under access limitations

    OpenAIRE

    Calì, Andrea; Martinenghi, D.; Torlone, R.

    2015-01-01

    The Deep Web is constituted by data accessible through Web pages, but not readily indexable by search engines, as they are returned in dynamic pages. In this paper we propose a framework for accessing Deep Web sources, represented as relational tables with so-called access limitations, with keyword-based queries. We formalize the notion of optimal answer and propose methods for query processing. To the best of our knowledge, ours is the first systematic approach to keyword search in such cont...

  7. Managing and querying whole slide images

    Science.gov (United States)

    Wang, Fusheng; Oh, Tae W.; Vergara-Niedermayr, Cristobal; Kurc, Tahsin; Saltz, Joel

    2012-02-01

    High-resolution pathology images provide rich information about the morphological and functional characteristics of biological systems, and are transforming the field of pathology into a new era. To facilitate the use of digital pathology imaging for biomedical research and clinical diagnosis, it is essential to manage and query both whole slide images (WSI) and analytical results generated from images, such as annotations made by humans and computed features and classifications made by computer algorithms. There are unique requirements on modeling, managing and querying whole slide images, including compatibility with standards, scalability, support of image queries at multiple granularities, and support of integrated queries between images and derived results from the images. In this paper, we present our work on developing the Pathology Image Database System (PIDB), which is a standard oriented image database to support retrieval of images, tiles, regions and analytical results, image visualization and experiment management through a unified interface and architecture. The system is deployed for managing and querying whole slide images for In Silico brain tumor studies at Emory University. PIDB is generic and open source, and can be easily used to support other biomedical research projects. It has the potential to be integrated into a Picture Archiving and Communications System (PACS) with powerful query capabilities to support pathology imaging.

  8. Implementing Graph Pattern Queries on a Relational Database

    Energy Technology Data Exchange (ETDEWEB)

    Kaplan, I L; Abdulla, G M; Brugger, S T; Kohn, S R

    2007-12-26

    When a graph database is implemented on top of a relational database, queries in the graph query language are translated into relational SQL queries. Graph pattern queries are an important feature of a graph query language. Translating graph pattern queries into single SQL statements results in very poor query performance. By taking into account the pattern query structure and generating multiple SQL statements, pattern query performance can be dramatically improved. The performance problems encountered with the single SQL statements generated for pattern queries reflects a problem in the SQL query planner and optimizer. Addressing this problem would allow relational databases to better support semantic graph databases. Relational database systems that provide good support for graph databases may also be more flexible platforms for data warehouses.

  9. k-Nearest Neighbor Query Processing Algorithms for a Query Region in Road Networks

    Institute of Scientific and Technical Information of China (English)

    Hyeong-Il Kim; Jae-Woo Chang

    2013-01-01

    Recent development of wireless communication technologies and the popularity of smart phones are making location-based services (LBS) popular.However,requesting queries to LBS servers with users' exact locations may threat the privacy of users.Therefore,there have been many researches on generating a cloaked query region for user privacy protection.Consequently,an efficient query processing algorithm for a query region is required.So,in this paper,we propose k-nearest neighbor query (k-NN) processing algorithms for a query region in road networks.To efficiently retrieve k-NN points of interest (POIs),we make use of the Island index.We also propose a method that generates an adaptive Island index to improve the query processing performance and storage usage.Finally,we show by our performance analysis that our k-NN query processing algorithms outperform the existing k-Range Nearest Neighbor (kRNN) algorithm in terms of network expansion cost and query processing time.

  10. An Efficient Query Rewriting Approach for Web Cached Data Management

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    With the internet development, querying data on the Web is an attention problem of involving information from distributed, and often dynamically, related Web sources. Basically, some sub-queries can be effectively cached from previous queries or materialized views in order to achieve a better query performance based on the notion of rewriting queries. In this paper, we propose a novel query-rewriting model, called Hierarchical Query Tree, for representing Web queries. Hierarchical Query Tree is a labeled tree that is suitable for representing the inherent hierarchy feature of data on the Web. Based on Hierarchical Query Tree, we use case-based approach to determine what the query results should be. The definitions of queries and query results are both represented as labeled trees. Thus, we can use the same model for representing cases and the medium query results can also be dynamically updated by the user queries. We show that our case-based method can be used to answer a new query based on the combination of previous queries, including changes of requirements and various information sources.

  11. Identification and Analysis of Multi-tasking Product Information Search Sessions with Query Logs

    Directory of Open Access Journals (Sweden)

    Xiang Zhou

    2016-09-01

    Full Text Available Purpose: This research aims to identify product search tasks in online shopping and analyze the characteristics of consumer multi-tasking search sessions. Design/methodology/approach: The experimental dataset contains 8,949 queries of 582 users from 3,483 search sessions. A sequential comparison of the Jaccard similarity coefficient between two adjacent search queries and hierarchical clustering of queries is used to identify search tasks. Findings: (1 Users issued a similar number of queries (1.43 to 1.47 with similar lengths (7.3-7.6 characters per task in mono-tasking and multi-tasking sessions, and (2 Users spent more time on average in sessions with more tasks, but spent less time for each task when the number of tasks increased in a session. Research limitations: The task identification method that relies only on query terms does not completely reflect the complex nature of consumer shopping behavior. Practical implications: These results provide an exploratory understanding of the relationships among multiple shopping tasks, and can be useful for product recommendation and shopping task prediction. Originality/value: The originality of this research is its use of query clustering with online shopping task identification and analysis, and the analysis of product search session characteristics.

  12. Region 7 Significant Ecological Resource Areas (ECO_RES.SIG_REGIONS)

    Data.gov (United States)

    U.S. Environmental Protection Agency — SIG_REGIONS is a boundary layer that displays Region 7's Significant Ecological Resource Areas. This layer represents large areas within which different ecosystem...

  13. ASIS '99 Knowledge: Creation, Organization and Use, Part II: SIG Sessions.

    Science.gov (United States)

    Proceedings of the ASIS Annual Meeting, 1999

    1999-01-01

    Abstracts and descriptions of Special Interest Group (SIG) sessions include such topics as: knowledge management tools, knowledge organization, information retrieval, information seeking behavior, metadata, indexing, library service for distance education, electronic books, future information workforce needs, technological developments, and…

  14. Sådan klarer en progressiv sig i en mørk tid

    DEFF Research Database (Denmark)

    Ibsen, Malte Frøslee

    2016-01-01

    Der er stor forskel på at være progressiv i en progressiv tid – og at være det i en tid, hvor mørket sænker sig, og fortvivlelsen truer med at stå i vejen for vreden og handlingen. Fem dyder, den progressive skal til at lære sig oven på Trumps sejr i USA......Der er stor forskel på at være progressiv i en progressiv tid – og at være det i en tid, hvor mørket sænker sig, og fortvivlelsen truer med at stå i vejen for vreden og handlingen. Fem dyder, den progressive skal til at lære sig oven på Trumps sejr i USA...

  15. Visionen som drivkraft - hvordan kommer en vision til syne for sig selv?

    DEFF Research Database (Denmark)

    Andersen, Frode Boye

    2012-01-01

    En vision er ikke nødvendigvis visionær men er først og fremmest noget som kommer til syne - noget man ser for sig - og typisk i et fremtidsblik. En vision der ikke kommer til syne for sig selv vil kun vanskeligt kunne bære billeder frem, og i organisationer vil visioner der ikke genkendes som vi...

  16. Distributed Top-k Queries in E-commerce Environment

    Institute of Scientific and Technical Information of China (English)

    JiangZhan; YiqingSong; HaixiaZhang

    2004-01-01

    This paper focus on how to make distributed top-k query in e-commerce environment through web service. We first give the query process in such environment, then we present an algorithms for processing such queries, which based on the query model we defined. Experimental results show that the algorithms is efficient.

  17. Comprehensive Definition of the SigH Regulon of Mycobacterium tuberculosis Reveals Transcriptional Control of Diverse Stress Responses.

    Directory of Open Access Journals (Sweden)

    Jared D Sharp

    Full Text Available Expression of SigH, one of 12 Mycobacterium tuberculosis alternative sigma factors, is induced by heat, oxidative and nitric oxide stresses. SigH activation has been shown to increase expression of several genes, including genes involved in maintaining redox equilibrium and in protein degradation. However, few of these are known to be directly regulated by SigH. The goal of this project is to comprehensively define the Mycobacterium tuberculosis genes and operons that are directly controlled by SigH in order to gain insight into the role of SigH in regulating M. tuberculosis physiology. We used ChIP-Seq to identify in vivo SigH binding sites throughout the M. tuberculosis genome, followed by quantification of SigH-dependent expression of genes linked to these sites and identification of SigH-regulated promoters. We identified 69 SigH binding sites, which are located both in intergenic regions and within annotated coding sequences in the annotated M. tuberculosis genome. 41 binding sites were linked to genes that showed greater expression following heat stress in a SigH-dependent manner. We identified several genes not previously known to be regulated by SigH, including genes involved in DNA repair, cysteine biosynthesis, translation, and genes of unknown function. Experimental and computational analysis of SigH-regulated promoter sequences within these binding sites identified strong consensus -35 and -10 promoter sequences, but with tolerance for non-consensus bases at specific positions. This comprehensive identification and validation of SigH-regulated genes demonstrates an extended SigH regulon that controls an unexpectedly broad range of stress response functions.

  18. Fast Discovering Frequent Patterns for Incremental XML Queries

    Institute of Scientific and Technical Information of China (English)

    PENG Dun-lu; QIU Yang

    2004-01-01

    It is nontrivial to maintain such discovered frequent query patterns in real XML-DBMS because the transaction database of queries may allow frequent updates and such updates may not only invalidate some existing frequent query patterns but also generate some new frequent query patterns.In this paper, two incremental updating algorithms, FUXQMiner and FUFXQMiner, are proposed for efficient maintenance of discovered frequent query patterns and generation the new frequent query patterns when new XML queries are added into the database.Experimental results from our implementation show that the proposed algorithms have good performance.

  19. Implementation of Quantum Private Queries Using Nuclear Magnetic Resonance

    Institute of Scientific and Technical Information of China (English)

    WANG Chuan; HAO Liang; ZHAO Lian-Jie

    2011-01-01

    @@ We present a modified protocol for the realization of a quantum private query process on a classical database.Using one-qubit query and CNOT operation,the query process can be realized in a two-mode database.In the query process,the data privacy is preserved as the sender would not reveal any information about the database besides her query information,and the database provider cannot retain any information about the query.We implement the quantum private query protocol in a nuclear magnetic resonance system.The density matrix of the memory registers are constructed.

  20. Compressed Data Cube for Approximate OLAP Query Processing

    Institute of Scientific and Technical Information of China (English)

    冯玉; 王珊

    2002-01-01

    Approximate query processing has emerged as an approach to dealing with thehuge data volume and complex queries in the environment of data warehouse. In this paper,we present a novel method that provides approximate answers to OLAP queries. Our methodis based on building a compressed (approximate) data cube by a clustering technique and usingthis compressed data cube to provide answers to queries directly, so it improves the performanceof the queries. We also provide the algorithm of the OLAP queries and the confidence intervalsof query results. An extensive experimental study with the OLAP council benchmark showsthe effectiveness and scalability of our cluster-based approach compared to sampling.

  1. Index and query methods in road networks

    CERN Document Server

    Feng, Jun

    2015-01-01

    This book presents the index and query techniques on road network and moving objects which are limited to road network. Here, the road network of non-Euclidean space has its unique characteristics such that two moving objects may be very close in a straight line distance. The index used in two-dimensional Euclidean space is not always appropriate for moving objects on road network. Therefore, the index structure needs to be improved in order to obtain suitable indexing methods, explore the shortest path and acquire nearest neighbor query and aggregation query methods under the new index structures. Chapter 1 of this book introduces the present situation of intelligent traffic and index in road network, Chapter 2 introduces the relevant existing spatial indexing methods. Chapter 3-5 focus on several issues of road network and query, they involves: traffic road network models (see Chapter 3), index structures (see Chapter 4) and aggregate query methods (see Chapter 5). Finally, in Chapter 6, the book briefly de...

  2. Indexing Reverse Top-k Queries

    CERN Document Server

    Chester, Sean; Venkatesh, S; Whitesides, Sue

    2012-01-01

    We consider the recently introduced monochromatic reverse top-k queries which ask for, given a new tuple q and a dataset D, all possible top-k queries on D union {q} for which q is in the result. Towards this problem, we focus on designing indexes in two dimensions for repeated (or batch) querying, a novel but practical consideration. We present the insight that by representing the dataset as an arrangement of lines, a critical k-polygon can be identified and used exclusively to respond to reverse top-k queries. We construct an index based on this observation which has guaranteed worst-case query cost that is logarithmic in the size of the k-polygon. We implement our work and compare it to related approaches, demonstrating that our index is fast in practice. Furthermore, we demonstrate through our experiments that a k-polygon is comprised of a small proportion of the original data, so our index structure consumes little disk space.

  3. EHR query language (EQL)--a query language for archetype-based health records.

    Science.gov (United States)

    Ma, Chunlan; Frankel, Heath; Beale, Thomas; Heard, Sam

    2007-01-01

    OpenEHR specifications have been developed to standardise the representation of an international electronic health record (EHR). The language used for querying EHR data is not as yet part of the specification. To fill in this gap, Ocean Informatics has developed a query language currently known as EHR Query Language (EQL), a declarative language supporting queries on EHR data. EQL is neutral to EHR systems, programming languages and system environments and depends only on the openEHR archetype model and semantics. Thus, in principle, EQL can be used in any archetype-based computational context. In the EHR context described here, particular queries mention concepts from the openEHR EHR Reference Model (RM). EQL can be used as a common query language for disparate archetype-based applications. The use of a common RM, archetypes, and a companion query language, such as EQL, semantic interoperability of EHR information is much closer. This paper introduces the EQL syntax and provides example clinical queries to illustrate the syntax. Finally, current implementations and future directions are outlined.

  4. A Preliminary Mapping of Web Queries Using Existing Image Query Schemes.

    Science.gov (United States)

    Jansen, Bernard J.

    End user searching on the Web has become the primary method of locating images for many people. This study investigates the nature of Web image queries by attempting to map them to known image classification schemes. In this study, approximately 100,000 image queries from a major Web search engine were collected in 1997, 1999, and 2001. A…

  5. SPARQL Query Re-writing Using Partonomy Based Transformation Rules

    Science.gov (United States)

    Jain, Prateek; Yeh, Peter Z.; Verma, Kunal; Henson, Cory A.; Sheth, Amit P.

    Often the information present in a spatial knowledge base is represented at a different level of granularity and abstraction than the query constraints. For querying ontology's containing spatial information, the precise relationships between spatial entities has to be specified in the basic graph pattern of SPARQL query which can result in long and complex queries. We present a novel approach to help users intuitively write SPARQL queries to query spatial data, rather than relying on knowledge of the ontology structure. Our framework re-writes queries, using transformation rules to exploit part-whole relations between geographical entities to address the mismatches between query constraints and knowledge base. Our experiments were performed on completely third party datasets and queries. Evaluations were performed on Geonames dataset using questions from National Geographic Bee serialized into SPARQL and British Administrative Geography Ontology using questions from a popular trivia website. These experiments demonstrate high precision in retrieval of results and ease in writing queries.

  6. jQuery for designers beginner's guide

    CERN Document Server

    MacLees, Natalie

    2014-01-01

    A step-by-step guide that spices up your web pages and designs them in the way you want using the most widely used JavaScript library, jQuery. The beginner-friendly and easy-to-understand approach of the book will help get to grips with jQuery in no time. If you know the fundamentals of HTML and CSS, and want to extend your knowledge by learning to use JavaScript, then this is just the book for you. jQuery makes JavaScript straightforward and approachable - you'll be surprised at how easy it can be to add animations and special effects to your beautifully designed pages.

  7. Extending OLAP Querying to External Object

    DEFF Research Database (Denmark)

    Pedersen, Torben Bach; Shoshani, Arie; Gu, Junmin

    inherent in data in nonstandard applications are not accommodated well by OLAP systems. In contrast, object database systems are built to handle such complexity, but do not support OLAP-type querying well. This paper presents the concepts and techniques underlying a flexible, multi-model federated system...... that enables OLAP users to exploit simultaneously the features of OLAP and object systems. The system allows data to be handled using the most appropriate data model and technology: OLAP systems for dimensional data and object database systems for more complex, general data. Additionally, physical data...... integration can be avoided. As a vehicle for demonstrating the capabilities of the system, a prototypical OLAP language is defined and extended to naturally support queries that involve data in object databases. The language permits selection criteria that reference object data, queries that return...

  8. Animating the Web with jQuery

    Directory of Open Access Journals (Sweden)

    Asokan M

    2013-02-01

    Full Text Available World globalization and present day technology increases the web users rapidly. Every website is trying to attract the web users. The web site creators /developers add different kind of animations to their websites. There are many softwares available to create animation. jQuery can be used to create interactive and powerful web pages with animations. JQuery is a JavaScript library intendedto make Java Script programming easier and more fun. A JavaScript library is a complex JavaScript program that both simplifies difficult tasks and solves cross-browser problems. With jQuery, we canaccomplish tasks in a single line of code. JQuery is used on millions of websites. This paper discuss about the advantages and usage statistics of jQuery on the web. A complete procedure to create a slider and banner plug-ins are also included. They are tested with different browsers.

  9. Optimal Planar Orthogonal Skyline Counting Queries

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Larsen, Kasper Green

    2014-01-01

    The skyline of a set of points in the plane is the subset of maximal points, where a point (x,y) is maximal if no other point (x',y') satisfies x'≥ x and y'≥ x. We consider the problem of preprocessing a set P of n points into a space efficient static data structure supporting orthogonal skyline...... counting queries, i.e. given a query rectangle R to report the size of the skyline of P\\cap R. We present a data structure for storing n points with integer coordinates having query time O(lg n/lglg n) and space usage O(n). The model of computation is a unit cost RAM with logarithmic word size. We prove...

  10. How Do Search Engines Handle Chinese Queries?

    Directory of Open Access Journals (Sweden)

    Hong Cui

    2005-10-01

    Full Text Available The use of languages other than English has been growing exponentially on the Web. However, the major search engines have been lagging behind in providing indexes and search features to handle these languages. This article explores the characteristics of the Chinese language and how queries in this language are handled by different search engines. Queries were entered in two major search engines (Google and AlltheWeb and two search engines developed for Chinese (Sohu and Baidu. Criteria such as handling word segmentation, number of retrieved documents, and correct display and identification of Chinese characters were used to examine how the search engines handled the queries. The results showed that the performance of the two major search engines was not on a par with that of the search engines developed for Chinese.

  11. Path-based Queries on Trajectory Data

    DEFF Research Database (Denmark)

    Krogh, Benjamin Bjerre; Pelekis, Nikos; Theodoridis, Yannis

    2014-01-01

    a specific path by only retrieving data from the first and last edge in the path. To correctly answer strict path queries existing network-constrained trajectory indexes must retrieve data from all edges in the path. An extensive performance study of NETTRA using a very large real-world trajectory data set....... To efficiently support strict path queries, we present a novel NETwork-constrained TRAjectory index (NETTRA). This index enables very efficient retrieval of trajectories that follow a specific path, i.e., strict path queries. NETTRA uses a new path encoding scheme that can determine if a trajectory follows......, consisting of 1.7 million trajectories (941 million GPS records) and a road network with 1.3 million edges, shows a speed-up of two orders of magnitude compared to state-of-the-art trajectory indexes....

  12. jQuery Mobile Up and Running

    CERN Document Server

    Firtman, Maximiliano

    2012-01-01

    Would you like to build one mobile web application that works on iPad and Kindle Fire as well as iPhone and Android smartphones? This introductory guide to jQuery Mobile shows you how. Through a series of hands-on exercises, you'll learn the best ways to use this framework's many interface components to build customizable, multiplatform apps. You don't need any programming skills or previous experience with jQuery to get started. By the time you finish this book, you'll know how to create responsive, Ajax-based interfaces that work on a variety of smartphones and tablets, using jQuery Mobile

  13. Query strategy for sequential ontology debugging

    CERN Document Server

    Shchekotykhina, Kostyantyn; Fleiss, Philipp; Rodler, Patrick

    2011-01-01

    Debugging of ontologies is an important prerequisite for their wide-spread application, especially in areas that rely upon everyday users to create and maintain knowledge bases, as in the case of the Semantic Web. Recent approaches use diagnosis methods to identify causes of inconsistent or incoherent ontologies. However, in most debugging scenarios these methods return many alternative diagnoses, thus placing the burden of fault localization on the user. This paper demonstrates how the target diagnosis can be identified by performing a sequence of observations, that is, by querying an oracle about entailments of the target ontology. We exploit a-priori probabilities of typical user errors to formulate information-theoretic concepts for query selection. Our evaluation showed that the proposed method significantly reduces the number of required queries compared to myopic strategies. We experimented with different probability distributions of user errors and different qualities of the a-priori probabilities. Ou...

  14. Automatic Building Information Model Query Generation

    Energy Technology Data Exchange (ETDEWEB)

    Jiang, Yufei; Yu, Nan; Ming, Jiang; Lee, Sanghoon; DeGraw, Jason; Yen, John; Messner, John I.; Wu, Dinghao

    2015-12-01

    Energy efficient building design and construction calls for extensive collaboration between different subfields of the Architecture, Engineering and Construction (AEC) community. Performing building design and construction engineering raises challenges on data integration and software interoperability. Using Building Information Modeling (BIM) data hub to host and integrate building models is a promising solution to address those challenges, which can ease building design information management. However, the partial model query mechanism of current BIM data hub collaboration model has several limitations, which prevents designers and engineers to take advantage of BIM. To address this problem, we propose a general and effective approach to generate query code based on a Model View Definition (MVD). This approach is demonstrated through a software prototype called QueryGenerator. By demonstrating a case study using multi-zone air flow analysis, we show how our approach and tool can help domain experts to use BIM to drive building design with less labour and lower overhead cost.

  15. Transfer active learning by querying committee

    Institute of Scientific and Technical Information of China (English)

    Hao SHAO; Feng TAO; Rui XU

    2014-01-01

    In real applications of inductive learning for classifi cation, labeled instances are often defi cient, and labeling them by an oracle is often expensive and time-consuming. Active learning on a single task aims to select only informative unlabeled instances for querying to improve the classifi cation accuracy while decreasing the querying cost. However, an inevitable problem in active learning is that the informative measures for selecting queries are commonly based on the initial hypotheses sampled from only a few labeled instances. In such a circumstance, the initial hypotheses are not reliable and may deviate from the true distribution underlying the target task. Consequently, the informative measures will possibly select irrelevant instances. A promising way to compensate this problem is to borrow useful knowledge from other sources with abundant labeled information, which is called transfer learning. However, a signifi cant challenge in transfer learning is how to measure the similarity between the source and the target tasks. One needs to be aware of different distributions or label assignments from unrelated source tasks;otherwise, they will lead to degenerated performance while transferring. Also, how to design an effective strategy to avoid selecting irrelevant samples to query is still an open question. To tackle these issues, we propose a hybrid algorithm for active learning with the help of transfer learning by adopting a divergence measure to alleviate the negative transfer caused by distribution differences. To avoid querying irrelevant instances, we also present an adaptive strategy which could eliminate unnecessary instances in the input space and models in the model space. Extensive experiments on both the synthetic and the real data sets show that the proposed algorithm is able to query fewer instances with a higher accuracy and that it converges faster than the state-of-the-art methods.

  16. A structural query system for Han characters

    DEFF Research Database (Denmark)

    Skala, Matthew

    2016-01-01

    The IDSgrep structural query system for Han character dictionaries is presented. This dictionary search system represents the spatial structure of Han characters using Extended Ideographic Description Sequences (EIDSes), a data model and syntax based on the Unicode IDS concept. It includes a query...... language for EIDS databases, with a freely available implementation and format translation from popular third-party IDS and XML character databases. The system is designed to suit the needs of font developers and foreign language learners. The search algorithm includes a bit vector index inspired by Bloom...

  17. Approximate Distance Oracles with Improved Query Time

    CERN Document Server

    Wulff-Nilsen, Christian

    2012-01-01

    Given an undirected graph $G$ with $m$ edges, $n$ vertices, and non-negative edge weights, and given an integer $k\\geq 2$, we show that a $(2k-1)$-approximate distance oracle for $G$ of size $O(kn^{1 + 1/k})$ and with $O(\\log k)$ query time can be constructed in $O(\\min\\{kmn^{1/k},\\sqrt km + kn^{1 + c/\\sqrt k}\\})$ time for some constant $c$. This improves the $O(k)$ query time of Thorup and Zwick. For any $0 0$ and $k = O(\\log n/\\log\\log n)$.

  18. Role of the Porphyromonas gingivalis ECF sigma factor, SigH

    Science.gov (United States)

    Yanamandra, Sai S.; Sarrafee, Sara S.; Anaya-Bergman, Cecilia; Jones, Kevin; Lewis, Janina P.

    2012-01-01

    Little is known about the regulatory mechanisms that allow Porphyromonas gingivalis to survive in the oral cavity. Here we characterize the sigma factor SigH, one of six extracytoplasmic (ECF) sigma (σ) factors encoded in the P. gingivalis genome. Our results indicate that sigH expression is upregulated by exposure to molecular oxygen, suggesting that sigH plays a role in adaptation of P. gingivalis to oxygen. Furthermore, several genes involved in oxidative stress protection, such as sod, trx, tpx, ftn, feoB2 and the hemin uptake hmu locus, are downregulated in mutant deficient in SigH designated as V2948. ECF σ consensus sequences were identified upstream of the transcriptional start sites of these genes, consistent with the SigH-dependent regulation of these genes. Growth of V2948 was inhibited in the presence of 6% oxygen when compared to the wild-type W83 strain, while in anaerobic conditions both strains were able to grow. In addition, reduced growth of V2948 was observed in the presence of peroxide and thiol-oxidizing reagent, diamide when compared to the W83 strain. The SigH-deficient strain V2948 also exhibited reduced hemin uptake, consistent with the observed reduced expression of genes involved in hemin uptake. Finally, survival of V2948 was reduced in the presence of host cells compared to the wild-type W83 strain. Collectively, our studies demonstrate that SigH is a positive regulator of gene expression required for survival of the bacterium in the presence of oxygen and oxidative stress, hemin uptake, and virulence. PMID:22520389

  19. Optimization and Evaluation of Nested Queries and Procedures

    CERN Document Server

    Guravannavar, Ravindra

    2009-01-01

    Many database applications perform complex data retrieval and update tasks. Nested queries, and queries that invoke user-defined functions, which are written using a mix of procedural and SQL constructs, are often used in such applications. A straight-forward evaluation of such queries involves repeated execution of parameterized sub-queries or blocks containing queries and procedural code. An important problem that arises while optimizing nested queries as well as queries with joins, aggregates and set operations is the problem of finding an optimal sort order from a factorial number of possible sort orders. We show that even a special case of this problem is NP-Hard, and present practical heuristics that are effective and easy to incorporate in existing query optimizers. We also consider iterative execution of queries and updates inside complex procedural blocks such as user-defined functions and stored procedures. Parameter batching is an important means of improving performance as it enables set-orientate...

  20. Study on Summarization of SIG and Its Effect on Information War in Future%空间信息栅格(SIG)概述及对未来信息战的影响

    Institute of Scientific and Technical Information of China (English)

    邱伟

    2006-01-01

    空间信息栅格(SIG)技术是国家重点研发项目之一.本文简要介绍了空间信息栅格(SIG)的功能与框架结构组成;重点对空间信息栅格(SIG)在未来信息战中的作用进行了分析论述.

  1. Hybrid Filtering in Semantic Query Processing

    Science.gov (United States)

    Jeong, Hanjo

    2011-01-01

    This dissertation presents a hybrid filtering method and a case-based reasoning framework for enhancing the effectiveness of Web search. Web search may not reflect user needs, intent, context, and preferences, because today's keyword-based search is lacking semantic information to capture the user's context and intent in posing the search query.…

  2. Beginning SQL queries from novice to professional

    CERN Document Server

    Churcher, Clare

    2016-01-01

    Anyone who does any work at all with databases needs to know something of SQL. This is a friendly and easy-to-read guide to writing queries with the all-important - in the database world - SQL language. The author writes with exceptional clarity.

  3. Anytime skyline query processing for interactive systems

    DEFF Research Database (Denmark)

    Magnani, Matteo; Assent, Ira; Mortensen, Michael L.

    In this paper we introduce the concept of anytime skyline query . The skyline database operator returns the top-1record for every possible monotone record scoring function.However, computing a skyline can be very time-consumingdepending on the size, distribution and dimensionality of the data, ma...

  4. Parallel hierarchical evaluation of transitive closure queries

    NARCIS (Netherlands)

    Houtsma, M.A.W.; Cacace, F.; Ceri, S.

    1991-01-01

    Presents a new approach to parallel computation of transitive closure queries using a semantic data fragmentation. Tuples of a large base relation denote edges in a graph, which models a transportation network. A fragmentation algorithm is proposed which produces a partitioning of the base relation

  5. Parallel evaluation of multi-join queries

    NARCIS (Netherlands)

    Wilschut, A.N.; Flokstra, Jan; Apers, Peter M.G.

    1995-01-01

    A number of execution strategies for parallel evaluation of multi-join queries have been proposed in the literature; their performance was evaluated by simulation. In this paper we give a comparative performance evaluation of four execution strategies by implementing all of them on the same parallel

  6. Query term suggestion in academic search

    NARCIS (Netherlands)

    Verberne, S.; Sappelli, M.; Kraaij, W.

    2014-01-01

    In this paper, we evaluate query term suggestion in the context of academic professional search. Our overall goal is to support scientists in their information seeking tasks. We set up an interactive search system in which terms are extracted from clicked documents and suggested to the user before e

  7. Querying Source Code with Natural Language

    CERN Document Server

    Kimmig, Markus; Mezini, Mira

    2012-01-01

    One common task of developing or maintaining software is searching the source code for information like specific method calls or write accesses to certain fields. This kind of information is required to correctly implement new features and to solve bugs. This paper presents an approach for querying source code with natural language.

  8. Exploiting cost distributions for query optimization

    NARCIS (Netherlands)

    Waas, F.; Pellenkoft, A.J.

    1998-01-01

    Large-scale query optimization is, besides its practical relevance, a hard test case for optimization techniques. Since exact methods cannot be applied due to the combinatorial explosion of the search space, heuristics and probabilistic strategies have been deployed for more than a decade. However,

  9. Enabling Incremental Query Re-Optimization

    Science.gov (United States)

    Liu, Mengmeng; Ives, Zachary G.; Loo, Boon Thau

    2017-01-01

    As declarative query processing techniques expand to the Web, data streams, network routers, and cloud platforms, there is an increasing need to re-plan execution in the presence of unanticipated performance changes. New runtime information may affect which query plan we prefer to run. Adaptive techniques require innovation both in terms of the algorithms used to estimate costs, and in terms of the search algorithm that finds the best plan. We investigate how to build a cost-based optimizer that recomputes the optimal plan incrementally given new cost information, much as a stream engine constantly updates its outputs given new data. Our implementation especially shows benefits for stream processing workloads. It lays the foundations upon which a variety of novel adaptive optimization algorithms can be built. We start by leveraging the recently proposed approach of formulating query plan enumeration as a set of recursive datalog queries; we develop a variety of novel optimization approaches to ensure effective pruning in both static and incremental cases. We further show that the lessons learned in the declarative implementation can be equally applied to more traditional optimizer implementations. PMID:28659658

  10. Cooperative Scalable Moving Continuous Query Processing

    DEFF Research Database (Denmark)

    Li, Xiaohui; Karras, Panagiotis; Jensen, Christian S.

    2012-01-01

    A range of applications call for a mobile client to continuously monitor others in close proximity. Past research on such problems has covered two extremes: It has offered totally centralized solutions, where a server takes care of all queries, and totally distributed solutions, in which there is...

  11. Using temporal bursts for query modeling

    NARCIS (Netherlands)

    Peetz, M.H.; Meij, E.; de Rijke, M.

    2014-01-01

    We present an approach to query modeling that leverages the temporal distribution of documents in an initially retrieved set of documents. In news-related document collections such distributions tend to exhibit bursts. Here, we define a burst to be a time period where unusually many documents are pu

  12. Parallel hierarchical evaluation of transitive closure queries

    NARCIS (Netherlands)

    Houtsma, M.A.W.; Houtsma, M.A.W.; Cacace, F.; Ceri, S.

    1991-01-01

    Presents a new approach to parallel computation of transitive closure queries using a semantic data fragmentation. Tuples of a large base relation denote edges in a graph, which models a transportation network. A fragmentation algorithm is proposed which produces a partitioning of the base relation

  13. Adapting Query Expansion to Search Proficiency

    NARCIS (Netherlands)

    C. Boscarino (Corrado); V. Hollink (Vera); A.P. de Vries (Arjen); B. Carterette; E. Kanoulas; P. Clough; M. Sanderson

    2012-01-01

    htmlabstractWe argue that query expansion (QE) based on the full ses- sion improves the overall search experience provided that we know how to adapt the QE weighting schema to a user's search proficiency. We propose a strategy to predict search ability from session parameters. Us- ing an

  14. Enriching a Descriptive Grammar with Treebank Queries

    NARCIS (Netherlands)

    Bouma, G.; van Koppen, J.M.|info:eu-repo/dai/nl/203188934; Landsbergen, Frank; Odijk, J.E.J.M.|info:eu-repo/dai/nl/082781710; van der Wouden, Ton; van de Camp, Matje

    2015-01-01

    The Syntax of Dutch (SoD) is a descriptive and detailed grammar of Dutch, that provides data for many issues raised in linguistic theory. We present the results of a pilot project that investigated the possibility of enriching the online version of the text with links to queries that provide

  15. Query term suggestion in academic search

    NARCIS (Netherlands)

    Verberne, S.; Sappelli, M.; Kraaij, W.

    2014-01-01

    In this paper, we evaluate query term suggestion in the context of academic professional search. Our overall goal is to support scientists in their information seeking tasks. We set up an interactive search system in which terms are extracted from clicked documents and suggested to the user before

  16. Enabling Incremental Query Re-Optimization.

    Science.gov (United States)

    Liu, Mengmeng; Ives, Zachary G; Loo, Boon Thau

    2016-01-01

    As declarative query processing techniques expand to the Web, data streams, network routers, and cloud platforms, there is an increasing need to re-plan execution in the presence of unanticipated performance changes. New runtime information may affect which query plan we prefer to run. Adaptive techniques require innovation both in terms of the algorithms used to estimate costs, and in terms of the search algorithm that finds the best plan. We investigate how to build a cost-based optimizer that recomputes the optimal plan incrementally given new cost information, much as a stream engine constantly updates its outputs given new data. Our implementation especially shows benefits for stream processing workloads. It lays the foundations upon which a variety of novel adaptive optimization algorithms can be built. We start by leveraging the recently proposed approach of formulating query plan enumeration as a set of recursive datalog queries; we develop a variety of novel optimization approaches to ensure effective pruning in both static and incremental cases. We further show that the lessons learned in the declarative implementation can be equally applied to more traditional optimizer implementations.

  17. A novel methodology for querying web images

    Science.gov (United States)

    Prabhakara, Rashmi; Lee, Ching Cheng

    2005-01-01

    Ever since the advent of Internet, there has been an immense growth in the amount of image data that is available on the World Wide Web. With such a magnitude of image availability, an efficient and effective image retrieval system is required to make use of this information. This research presents an effective image matching and indexing technique that improvises on existing integrated image retrieval methods. The proposed technique follows a two-phase approach, integrating query by topic and query by example specification methods. The first phase consists of topic-based image retrieval using an improved text information retrieval (IR) technique that makes use of the structured format of HTML documents. It consists of a focused crawler that not only provides for the user to enter the keyword for the topic-based search but also, the scope in which the user wants to find the images. The second phase uses the query by example specification to perform a low-level content-based image match for the retrieval of smaller and relatively closer results of the example image. Information related to the image feature is automatically extracted from the query image by the image processing system. A technique that is not computationally intensive based on color feature is used to perform content-based matching of images. The main goal is to develop a functional image search and indexing system and to demonstrate that better retrieval results can be achieved with this proposed hybrid search technique.

  18. Approximate Nearest Neighbor Queries among Parallel Segments

    DEFF Research Database (Denmark)

    Emiris, Ioannis Z.; Malamatos, Theocharis; Tsigaridas, Elias

    2010-01-01

    We develop a data structure for answering efficiently approximate nearest neighbor queries over a set of parallel segments in three dimensions. We connect this problem to approximate nearest neighbor searching under weight constraints and approximate nearest neighbor searching on historical data...

  19. Templates and Queries in Contextual Hypermedia

    DEFF Research Database (Denmark)

    Anderson, Kenneth Mark; Hansen, Frank Allan; Bouvin, Niels Olof

    2006-01-01

    This paper presents a new definition of context for context-aware computing based on a model that relies on dynamic queries over structured objects. This new model enables developers to flexibly specify the relationship between context and context data for their context-aware applications. We dis...

  20. Salivary SIgA responses to acute moderate-vigorous exercise in monophasic oral contraceptive users.

    Science.gov (United States)

    Hayashida, Harumi; Dolan, Nicola J; Hounsome, Charlotte; Alajmi, Nawal; Bishop, Nicolette C

    2015-09-01

    The purpose of this study was to examine the effect of oral contraceptive (OC) use on salivary secretory immunoglobulin A (SIgA) levels at rest and in response to an acute bout of moderate-vigorous exercise during 2 phases of the 4-week OC cycle corresponding to different phases of the synthetic menstrual cycle. Ten healthy active females completed a cycling at 70% peak oxygen uptake for 45 min at 2 time points of an OC cycle: during the equivalent in time to the mid-follicular phase (day 8 ± 2) and the mid-luteal phase (day 20 ± 2). Timed unstimulated saliva samples were obtained before, immediately postexercise, and 1 h postexercise and analyzed for salivary SIgA. Salivary SIgA secretion rate was 26% (95% confidence limits (CI) 6-46) lower at postexercise compared with pre-exercise during the synthetic follicular phase (p = 0.019) but no differences were observed during the synthetic luteal trial. Saliva flow rate was 11% (95% CI, 8-30) lower at postexercise compared with pre-exercise (main effect for time; p = 0.025). In conclusion, the pattern of salivary SIgA secretion rate response to moderate-vigorous exercise varies across the early and late phases of a monophasic OC cycle, with a transient reduction in salivary SIgA responses during the synthetic follicular phase. These findings indicate that monophasic OC use should be considered when assessing mucosal immune responses to acute exercise.

  1. Evaluation of the NCPDP Structured and Codified Sig Format for e-prescriptions.

    Science.gov (United States)

    Liu, Hangsheng; Burkhart, Q; Bell, Douglas S

    2011-01-01

    To evaluate the ability of the structure and code sets specified in the National Council for Prescription Drug Programs Structured and Codified Sig Format to represent ambulatory electronic prescriptions. We parsed the Sig strings from a sample of 20,161 de-identified ambulatory e-prescriptions into variables representing the fields of the Structured and Codified Sig Format. A stratified random sample of these representations was then reviewed by a group of experts. For codified Sig fields, we attempted to map the actual words used by prescribers to the equivalent terms in the designated terminology. Proportion of prescriptions that the Format could fully represent; proportion of terms used that could be mapped to the designated terminology. The fields defined in the Format could fully represent 95% of Sigs (95% CI 93% to 97%), but ambiguities were identified, particularly in representing multiple-step instructions. The terms used by prescribers could be codified for only 60% of dose delivery methods, 84% of dose forms, 82% of vehicles, 95% of routes, 70% of sites, 33% of administration timings, and 93% of indications. The findings are based on a retrospective sample of ambulatory prescriptions derived mostly from primary care physicians. The fields defined in the Format could represent most of the patient instructions in a large prescription sample, but prior to its mandatory adoption, further work is needed to ensure that potential ambiguities are addressed and that a complete set of terms is available for the codified fields.

  2. Semantic – Based Querying Using Ontology in Relational Database of Library Management System

    Directory of Open Access Journals (Sweden)

    Ayesha Banu

    2011-11-01

    Full Text Available The traditional Web stores huge amount of data in the form of Relational Databases (RDB as it is good atstoring objects and relationships between them. Relational Databases are dynamic in nature which allowsbringing tables together helping user to search for related material across multiple tables. RDB arescalable to expand as the data grows. The RDB uses a Structured Query Language called SQL to accessthe databases for several data retrieval purposes. As the world is moving today from the Syntactic form toSemantic form and the Web is also taking its new form of Semantic Web. The Structured Query of the RDBon web can be a Semantic Query on Semantic Web. The SPARQL is the Query Language recommended byW3C for the RDF(Resource Description Framework. RDF is a directed, labeled graph data format forrepresenting information in the Web and is a very important layer of the Semantic Web Architecture. In thispaper we consider the Library Management System (LMS database, taking some tuples of the LMSRelational Schema. We discuss how the RDF code is scripted and validated using RDF Validator and howRDF Triples are generated. Later we give the graphical representation of the RDF triples and see theprocess of extracting ontology from the RDF Schema and application of the Semantic Query.

  3. Effective Filtering of Query Results on Updated User Behavioral Profiles in Web Mining.

    Science.gov (United States)

    Sadesh, S; Suganthe, R C

    2015-01-01

    Web with tremendous volume of information retrieves result for user related queries. With the rapid growth of web page recommendation, results retrieved based on data mining techniques did not offer higher performance filtering rate because relationships between user profile and queries were not analyzed in an extensive manner. At the same time, existing user profile based prediction in web data mining is not exhaustive in producing personalized result rate. To improve the query result rate on dynamics of user behavior over time, Hamilton Filtered Regime Switching User Query Probability (HFRS-UQP) framework is proposed. HFRS-UQP framework is split into two processes, where filtering and switching are carried out. The data mining based filtering in our research work uses the Hamilton Filtering framework to filter user result based on personalized information on automatic updated profiles through search engine. Maximized result is fetched, that is, filtered out with respect to user behavior profiles. The switching performs accurate filtering updated profiles using regime switching. The updating in profile change (i.e., switches) regime in HFRS-UQP framework identifies the second- and higher-order association of query result on the updated profiles. Experiment is conducted on factors such as personalized information search retrieval rate, filtering efficiency, and precision ratio.

  4. Enhancing XML Data Warehouse Query Performance by Fragmentation

    CERN Document Server

    Mahboubi, Hadj

    2009-01-01

    XML data warehouses form an interesting basis for decision-support applications that exploit heterogeneous data from multiple sources. However, XML-native database systems currently suffer from limited performances in terms of manageable data volume and response time for complex analytical queries. Fragmenting and distributing XML data warehouses (e.g., on data grids) allow to address both these issues. In this paper, we work on XML warehouse fragmentation. In relational data warehouses, several studies recommend the use of derived horizontal fragmentation. Hence, we propose to adapt it to the XML context. We particularly focus on the initial horizontal fragmentation of dimensions' XML documents and exploit two alternative algorithms. We experimentally validate our proposal and compare these alternatives with respect to a unified XML warehouse model we advocate for.

  5. jQuery UI 1.10 the user interface library for jQuery

    CERN Document Server

    Libby, Alex

    2013-01-01

    This book consists of an easy-to-follow, example-based approach that leads you step-by-step through the implementation and customization of each library component.This book is for frontend designers and developers who need to learn how to use jQuery UI quickly. To get the most out of this book, you should have a good working knowledge of HTML, CSS, and JavaScript, and should ideally be comfortable using jQuery.

  6. Automated Query Learning with Wikipedia and Genetic Programming

    CERN Document Server

    Malo, Pekka; Sinha, Ankur

    2010-01-01

    Most of the existing information retrieval systems are based on bag of words model and are not equipped with common world knowledge. Work has been done towards improving the efficiency of such systems by using intelligent algorithms to generate search queries, however, not much research has been done in the direction of incorporating human-and-society level knowledge in the queries. This paper is one of the first attempts where such information is incorporated into the search queries using Wikipedia semantics. The paper presents an essential shift from conventional token based queries to concept based queries, leading to an enhanced efficiency of information retrieval systems. To efficiently handle the automated query learning problem, we propose Wikipedia-based Evolutionary Semantics (Wiki-ES) framework where concept based queries are learnt using a co-evolving evolutionary procedure. Learning concept based queries using an intelligent evolutionary procedure yields significant improvement in performance whic...

  7. An Approach to Assist Designers With Their Queries and Designs

    DEFF Research Database (Denmark)

    Ahmed, Saeema

    2006-01-01

    Recent research investigating how engineers search for information has concluded that engineering designers acquire assistance when formulating queries. An approach to assist designers with their queries is presented. This approach forms part of a knowledge management system, where indexed docume...

  8. A Revisit of Query Expansion with Different Semantic Levels

    DEFF Research Database (Denmark)

    Zhang, Ce; Cui, Bin; Cong, Gao;

    2009-01-01

    Query expansion has received extensive attention in information retrieval community. Although semantic based query expansion appears to be promising in improving retrieval performance, previous research has shown that it cannot consistently improve retrieval performance. It is a tricky problem to...

  9. Chlorella intake attenuates reduced salivary SIgA secretion in kendo training camp participants

    Directory of Open Access Journals (Sweden)

    Otsuki Takeshi

    2012-12-01

    Full Text Available Abstract Background The green alga Chlorella contains high levels of proteins, vitamins, and minerals. We previously reported that a chlorella-derived multicomponent supplement increased the secretion rate of salivary secretory immunoglobulin A (SIgA in humans. Here, we investigated whether intake of this chlorella-derived supplement attenuated the reduced salivary SIgA secretion rate during a kendo training camp. Methods Ten female kendo athletes participated in inter-university 6-day spring and 4-day summer camps. They were randomized into two groups; one took placebo tablets during the spring camp and chlorella tablets during the summer camp, while the other took chlorella tablets during the spring camp and placebo tablets during the summer camp. Subjects took these tablets starting 4 weeks before the camp until post-camp saliva sampling. Salivary SIgA concentrations were measured by ELISA. Results All subjects participated in nearly all training programs, and body-mass changes and subjective physical well-being scores during the camps were comparable between the groups. However, salivary SIgA secretion rate changes were different between these groups. Salivary SIgA secretion rates decreased during the camp in the placebo group (before vs. second, middle, and final day of camp, and after the camp: 146 ± 89 vs. 87 ± 56, 70 ± 45, 94 ± 58, and 116 ± 71 μg/min, whereas no such decreases were observed in the chlorella group (121 ± 53 vs. 113 ± 68, 98 ± 69,115 ± 80, and 128 ± 59 μg/min. Conclusion Our results suggest that a use of a chlorella-derived dietary supplement attenuates reduced salivary SIgA secretion during a training camp for a competitive sport.

  10. Uso di gvSIG e SEXTANTE per la perimetrazione degli ambiti periurbani

    Directory of Open Access Journals (Sweden)

    Gabriele Nolè

    2010-03-01

    Full Text Available Use of gvSIG and SEXTANTE for the perimetration of periurban areasThe periurban fringe is the portion of land with characteristics of urbanization that cannot be considered neither urban nor rural. These areas are often characterized by a building expectancy, whose detection requires careful consideration of several territorial and environmental variables. It was implemented using a model of spatial analysis based on kernel Density Estimation (KDE for the detection of periurban areas. The model is tested in the province of Potenza using gvSIG and SEXTANTE on Ubuntu Linux.

  11. Desarrollo de prototipo de sensor IoT usando la red SigFox

    OpenAIRE

    Pérez Hernández, Ramón

    2015-01-01

    Este proyecto, en colaboración con Wellness Smart Cities, está centrado en el estudio de las posibilidades que puede ofrecer la red SigFox para la integración y comunicación de dispositivos a través del Internet de las Cosas (IoT). Para ello, se hará uso de un modelo concreto de módem para realizar las pruebas; el TD1204 (de Telecom Design). Dicho dispositivo, usado como un sensor de la red SigFox, permitirá monitorizar diversas alarmas definidas en los casos de uso a probar; c...

  12. Research and application of mineral resources assessment by weights of evidence model based on SIG

    Institute of Scientific and Technical Information of China (English)

    Yuanyuan Chuai; Keyan Xiao; Yihua Xuan; Shaobin Zhan

    2006-01-01

    Geological data are usually of the characteristics of multi-source, large amount and multi-scale. The construction of Spatial Information Grid overcomes the shortages of personal computers when dealing with geological data. The authors introduce the definition, architecture and flow of mineral resources assessment by weights of evidence model based on Spatial Information Grid (SIG). Meanwhile, a case study on the prediction of copper mineral occurrence in the Middle-Lower Yangtze metallogenic belt is given. The results show that mineral resources assessement based on SIG is an effective new method which provides a way of sharing and integrating distributed geospatial information and improves the efficiency greatly.

  13. QUERY RESPONSE TIME COMPARISON NOSQLDB MONGODB WITH SQLDB ORACLE

    Directory of Open Access Journals (Sweden)

    Humasak T. A. Simanjuntak

    2015-01-01

    Full Text Available Penyimpanan data saat ini terdapat dua jenis yakni relational database dan non-relational database. Kedua jenis DBMS (Database Managemnet System tersebut berbeda dalam berbagai aspek seperti per-formansi eksekusi query, scalability, reliability maupun struktur penyimpanan data. Kajian ini memiliki tujuan untuk mengetahui perbandingan performansi DBMS antara Oracle sebagai jenis relational data-base dan MongoDB sebagai jenis non-relational database dalam mengolah data terstruktur. Eksperimen dilakukan untuk mengetahui perbandingan performansi kedua DBMS tersebut untuk operasi insert, select, update dan delete dengan menggunakan query sederhana maupun kompleks pada database Northwind. Untuk mencapai tujuan eksperimen, 18 query yang terdiri dari 2 insert query, 10 select query, 2 update query dan 2 delete query dieksekusi. Query dieksekusi melalui sebuah aplikasi .Net yang dibangun sebagai perantara antara user dengan basis data. Eksperimen dilakukan pada tabel dengan atau tanpa relasi pada Oracle dan embedded atau bukan embedded dokumen pada MongoDB. Response time untuk setiap eksekusi query dibandingkan dengan menggunakan metode statistik. Eksperimen menunjukkan response time query untuk proses select, insert, dan update pada MongoDB lebih cepatdaripada Oracle. MongoDB lebih cepat 64.8 % untuk select query;MongoDB lebihcepat 72.8 % untuk insert query dan MongoDB lebih cepat 33.9 % untuk update query. Pada delete query, Oracle lebih cepat 96.8 % daripada MongoDB untuk table yang berelasi, tetapi MongoDB lebih cepat 83.8 % daripada Oracle untuk table yang tidak memiliki relasi.Untuk query kompleks dengan Map Reduce pada MongoDB lebih lambat 97.6% daripada kompleks query dengan aggregate function pada Oracle.

  14. Constructing a Relational Query Optimizer for Non-Relational Languages

    OpenAIRE

    Rittinger, Jan

    2010-01-01

    Flat, unordered table data and a declarative query language established today’s success of relational database systems. Provided with the freedom to choose the evaluation order and underlying algorithms, their complex query optimizers are geared to come up with the best execution plan for a given query. With over 30 years of development and research, relational database management systems belong to the most mature and efficient query processors (especially for substantial amounts of data). ...

  15. Web Page Recommendation Using Web Mining

    Directory of Open Access Journals (Sweden)

    Modraj Bhavsar

    2014-07-01

    Full Text Available On World Wide Web various kind of content are generated in huge amount, so to give relevant result to user web recommendation become important part of web application. On web different kind of web recommendation are made available to user every day that includes Image, Video, Audio, query suggestion and web page. In this paper we are aiming at providing framework for web page recommendation. 1 First we describe the basics of web mining, types of web mining. 2 Details of each web mining technique.3We propose the architecture for the personalized web page recommendation.

  16. A Faceted Query Engine Applied to Archaeology

    Directory of Open Access Journals (Sweden)

    Kenneth A. Ross

    2007-04-01

    Full Text Available In this article we present the Faceted Query Engine, a system developed at Columbia University under the aegis of the inter-disciplinary project Computational Tools for Modeling, Visualizing and Analyzing Historic and Archaeological Sites. Our system is based on novel Database Systems research that has been published in Computer Science venues (Ross and Janevski, 2004 and Ross et al., 2005. The goal of this article is to introduce our system to the target user audience - the archaeology community. We demonstrate the use of the Faceted Query Engine on a previously unpublished dataset: the Thulamela (South Africa collection. This dataset is comprised of iron-age finds from the Thulamela site at the Kruger National Park. Our project is the first to systematically compile and classify this dataset. We also use a larger dataset, a collection of ancient Egyptian artifacts from the Memphis site (Giddy,1999, to demonstrate some of the features of our system.

  17. Virtual Solar Observatory Distributed Query Construction

    Science.gov (United States)

    Gurman, J. B.; Dimitoglou, G.; Bogart, R.; Davey, A.; Hill, F.; Martens, P.

    2003-01-01

    Through a prototype implementation (Tian et al., this meeting) the VSO has already demonstrated the capability of unifying geographically distributed data sources following the Web Services paradigm and utilizing mechanisms such as the Simple Object Access Protocol (SOAP). So far, four participating sites (Stanford, Montana State University, National Solar Observatory and the Solar Data Analysis Center) permit Web-accessible, time-based searches that allow browse access to a number of diverse data sets. Our latest work includes the extension of the simple, time-based queries to include numerous other searchable observation parameters. For VSO users, this extended functionality enables more refined searches. For the VSO, it is a proof of concept that more complex, distributed queries can be effectively constructed and that results from heterogeneous, remote sources can be synthesized and presented to users as a single, virtual data product.

  18. Mathematical Formula Search using Natural Language Queries

    Directory of Open Access Journals (Sweden)

    YANG, S.

    2014-11-01

    Full Text Available This paper presents how to search mathematical formulae written in MathML when given plain words as a query. Since the proposed method allows natural language queries like the traditional Information Retrieval for the mathematical formula search, users do not need to enter any complicated math symbols and to use any formula input tool. For this, formula data is converted into plain texts, and features are extracted from the converted texts. In our experiments, we achieve an outstanding performance, a MRR of 0.659. In addition, we introduce how to utilize formula classification for formula search. By using class information, we finally achieve an improved performance, a MRR of 0.690.

  19. Exploiting Conceptual Knowledge for Querying Information Systems

    OpenAIRE

    Selke, Joachim; Balke, Wolf-Tilo

    2011-01-01

    Whereas today's information systems are well-equipped for efficient query handling, their strict mathematical foundations hamper their use for everyday tasks. In daily life, people expect information to be offered in a personalized and focused way. But currently, personalization in digital systems still only takes explicit knowledge into account and does not yet process conceptual information often naturally implied by users. We discuss how to bridge the gap between users and today's systems,...

  20. Exploiting Conceptual Knowledge for Querying Information Systems

    CERN Document Server

    Selke, Joachim

    2011-01-01

    Whereas today's information systems are well-equipped for efficient query handling, their strict mathematical foundations hamper their use for everyday tasks. In daily life, people expect information to be offered in a personalized and focused way. But currently, personalization in digital systems still only takes explicit knowledge into account and does not yet process conceptual information often naturally implied by users. We discuss how to bridge the gap between users and today's systems, building on results from cognitive psychology.

  1. Date restricted queries in web search engines

    OpenAIRE

    Lewandowski, Dirk

    2004-01-01

    Search engines usually offer a date restricted search on their advanced search pages. But determining the actual update of a web page is not without problems. We conduct a study testing date restricted queries on the search engines Google, Teoma and Yahoo!. We find that these searches fail to work properly in the examined engines. We discuss implications of this for further research and search engine development.

  2. Query Reformulation for Clinical Decision Support Search

    Science.gov (United States)

    2014-11-01

    general purpose search engines: case reports are much longer than traditional queries and present a narrative structure. Our system, initially...relevance feedback (PRF). The advantage of using such technique is that it is able to expand the case report not only by adding relevant medical terms...v.4.8. The following fields were indexed and used for document retrieval (unless otherwise stated): article title, article abstract, and article text

  3. MQ-2 A Tool for Prolog-based Model Querying

    DEFF Research Database (Denmark)

    Acretoaie, Vlad; Störrle, Harald

    2012-01-01

    MQ-2 integrates a Prolog console into the MagicDraw1 modeling environment and equips this console with features targeted specifically to the task of querying models. The vision of MQ-2 is to make Prolog-based model querying accessible to both student and expert modelers by offering powerful query...

  4. Efficient Processing of Multiple DTW Queries in Time Series Databases

    DEFF Research Database (Denmark)

    Kremer, Hardy; Günnemann, Stephan; Ivanescu, Anca-Maria

    2011-01-01

    . In many of today’s applications, however, large numbers of queries arise at any given time. Existing DTW techniques do not process multiple DTW queries simultaneously, a serious limitation which slows down overall processing. In this paper, we propose an efficient processing approach for multiple DTW...... for multiple DTW queries....

  5. Multiple Query Evaluation Based on an Enhanced Genetic Algorithm.

    Science.gov (United States)

    Tamine, Lynda; Chrisment, Claude; Boughanem, Mohand

    2003-01-01

    Explains the use of genetic algorithms to combine results from multiple query evaluations to improve relevance in information retrieval. Discusses niching techniques, relevance feedback techniques, and evolution heuristics, and compares retrieval results obtained by both genetic multiple query evaluation and classical single query evaluation…

  6. Result Diversification Based on Query-Specific Cluster Ranking

    NARCIS (Netherlands)

    J. He (Jiyin); E. Meij; M. de Rijke

    2011-01-01

    htmlabstractResult diversification is a retrieval strategy for dealing with ambiguous or multi-faceted queries by providing documents that cover as many facets of the query as possible. We propose a result diversification framework based on query-specific clustering and cluster ranking,

  7. Result diversification based on query-specific cluster ranking

    NARCIS (Netherlands)

    He, J.; Meij, E.; de Rijke, M.

    2011-01-01

    Result diversification is a retrieval strategy for dealing with ambiguous or multi-faceted queries by providing documents that cover as many facets of the query as possible. We propose a result diversification framework based on query-specific clustering and cluster ranking, in which diversification

  8. Annotating URLs with query terms: What factors predict reliable annotations?

    NARCIS (Netherlands)

    Verberne, S.; Hinne, M.; Heijden, M. van der; Kraaij, W.; D'hondt, E.; Weide, T. van der

    2009-01-01

    A number of recent studies have investigated the relation be-ween URLs and associated query terms from search engine log files. In [5], the query terms associated with the domain of a URL were used as features for a URL classification task. The idea is that query terms that lead to successful classi

  9. A NEW TOP-K CONDITIONAL XML PREFERENCE QUERIES

    Directory of Open Access Journals (Sweden)

    Shaikhah Alhazmi

    2014-09-01

    Full Text Available Preference querying technology is a very important issue in a variety of applications ranging from ecommerce to personalized search engines. Most of recent research works have been dedicated to this topic in the Artificial Intelligence and Database fields. Several formalisms allowing preference reasoning and specification have been proposed in the Artificial Intelligence domain. On the other hand, in the Database field the interest has been focused mainly in extending standard Structured Query Language (SQL and also eXtensible Markup Language (XML with preference facilities in order to provide personalized query answering. More precisely, the interest in the database context focuses on the notion of Top-k preference query and on the development of efficient methods for evaluating these queries. A Top-k preference query returns k data tuples which are the most preferred according to the user’s preferences. Of course, Top-k preference query answering is closely dependent on the particular preference model underlying the semantics of the operators responsible for selecting the best tuples. In this paper, we consider the Conditional Preference queries (CP-queries where preferences are specified by a set of rules expressed in a logical formalism. We introduce Top-k conditional preference queries (Top-k CP-queries, and the operators BestK-Match and Best-Match for evaluating these queries will be presented.

  10. A comparison of user and system query performance predictions

    NARCIS (Netherlands)

    Hauff, C.; Kelly, Diane; Azzopardi, Leif

    2010-01-01

    Query performance prediction methods are usually applied to estimate the retrieval effectiveness of queries, where the evaluation is largely system sided. However, little work has been conducted to understand query performance prediction from the user's perspective. The question we consider is,

  11. Enabling Ontology Based Semantic Queries in Biomedical Database Systems.

    Science.gov (United States)

    Zheng, Shuai; Wang, Fusheng; Lu, James

    2014-03-01

    There is a lack of tools to ease the integration and ontology based semantic queries in biomedical databases, which are often annotated with ontology concepts. We aim to provide a middle layer between ontology repositories and semantically annotated databases to support semantic queries directly in the databases with expressive standard database query languages. We have developed a semantic query engine that provides semantic reasoning and query processing, and translates the queries into ontology repository operations on NCBO BioPortal. Semantic operators are implemented in the database as user defined functions extended to the database engine, thus semantic queries can be directly specified in standard database query languages such as SQL and XQuery. The system provides caching management to boosts query performance. The system is highly adaptable to support different ontologies through easy customizations. We have implemented the system DBOntoLink as an open source software, which supports major ontologies hosted at BioPortal. DBOntoLink supports a set of common ontology based semantic operations and have them fully integrated with a database management system IBM DB2. The system has been deployed and evaluated with an existing biomedical database for managing and querying image annotations and markups (AIM). Our performance study demonstrates the high expressiveness of semantic queries and the high efficiency of the queries.

  12. Tomograph: Highlighting query parallelism in a multi-core system

    NARCIS (Netherlands)

    Gawade, M.M.; Kersten, M.L.

    2013-01-01

    Query parallelism improves serial query execution performance by orders of magnitude. Getting optimal performance from an already parallelized query plan is however difficult due to its dependency on run time factors such as correct operator scheduling, memory pressure, disk io performance, and oper

  13. Tomograph: highlighting query parallelism in a multi-core system

    NARCIS (Netherlands)

    M. Gawade; M. Kersten

    2013-01-01

    Query parallelism improves serial query execution performance by orders of magnitude. Getting optimal performance from an already parallelized query plan is however difficult due to its dependency on run time factors such as correct operator scheduling, memory pressure, disk io performance, and oper

  14. Result diversification based on query-specific cluster ranking

    NARCIS (Netherlands)

    He, J.; Meij, E.; de Rijke, M.

    2011-01-01

    Result diversification is a retrieval strategy for dealing with ambiguous or multi-faceted queries by providing documents that cover as many facets of the query as possible. We propose a result diversification framework based on query-specific clustering and cluster ranking, in which diversification

  15. Semantic vs term-based query modification analysis

    NARCIS (Netherlands)

    V. Hollink (Vera); T. Tsikrika (Theodora); A.P. de Vries (Arjen)

    2010-01-01

    htmlabstractPrevious research has studied query modifications on a syntactic level by focusing on the addition, elimination and substitution of terms between consecutive queries that have at least one term in common. In this paper, we determine semantic relations between queries by first mapping

  16. A Relational Algebra Query Language for Programming Relational Databases

    Science.gov (United States)

    McMaster, Kirby; Sambasivam, Samuel; Anderson, Nicole

    2011-01-01

    In this paper, we describe a Relational Algebra Query Language (RAQL) and Relational Algebra Query (RAQ) software product we have developed that allows database instructors to teach relational algebra through programming. Instead of defining query operations using mathematical notation (the approach commonly taken in database textbooks), students…

  17. Discrete-query quantum algorithm for NAND trees

    CERN Document Server

    Childs, A M; Jordan, S P; Yeung, D; Childs, Andrew M.; Cleve, Richard; Jordan, Stephen P.; Yeung, David

    2007-01-01

    Recently, Farhi, Goldstone, and Gutmann gave a quantum algorithm for evaluating NAND trees that runs in time O(sqrt(N log N)) in the Hamiltonian query model. In this note, we point out that their algorithm can be converted into an algorithm using O(N^{1/2 + epsilon}) queries in the conventional quantum query model, for any fixed epsilon > 0.

  18. Predicting the Effectiveness of Queries and Retrieval Systems

    NARCIS (Netherlands)

    Hauff, C.

    2010-01-01

    In this thesis we consider users' attempts to express their information needs through queries, or search requests and try to predict whether those requests will be of high or low quality. Intuitively, a query's quality is determined by the outcome of the query, that is, whether the retrieved search

  19. Optimal Succinctness for Range Minimum Queries

    CERN Document Server

    Fischer, Johannes

    2008-01-01

    For an array A of n objects from a totally ordered universe, a range minimum query (RMQ) asks for the position of the minimum element in the sub-array A[i,j]. We focus on the setting where the array $A$ is static and known in advance, and can hence be preprocessed into a scheme in order to answer future queries faster. We make the further assumption that the input array A cannot be used at query time. Under this assumption, a natural lower bound of 2n bits for RMQ-schemes exists. We give the first truly succinct preprocessing scheme for O(1)-RMQs. Its final space consumption is 2n+o(n) bits, thus being asymptotically optimal. We also give a simple linear-time construction algorithm for this scheme that needs only n+o(n) bits of space in addition to the 2n+o(n) bits needed for the final data structure, thereby lowering the peak space consumption of previous schemes from O(n\\log n) to O(n) bits. We also improve on LCA-computation in BPS- and DFUDS-encoded trees.

  20. DETERMINING SIgA IN SALIVA BY SANDWICH ENZYME-LINKED ASSAY WITH AHL-HRP%采用白桂木凝集素-HRP测定唾液SIgA方法的建立

    Institute of Scientific and Technical Information of China (English)

    朱其芳; 吴耀生; 邓勇; 周素芳

    1999-01-01

    目的:建立白桂木凝集素-HRP夹心法检测唾液中分泌型IgA(SIgA).方法:制备AHL-HRP作为酶标凝集素,AHL为包被物,确定最适包被浓度及AHL-HRP工作稀释度,夹心酶联检测SIgA,进行检出限、精密度、准确性等实验并制备标准曲线,初步测定 71 例正常人唾液SIgA含量.结果:AHL-HRP夹心法检测SIgA标准曲线线性检测范围5~337mg/L. 本法测定71例健康人唾液SIgA含量为( 509.15±296.75 )mg/L.其中5份标本批内平均变异7.4% ,批间变异10.56%,回收率平均94%.结论:AHL-HRP夹心法测定SIgA的作用模式为 AHL→SIgA→AHL-HRP,可用于唾液中的SIgA含量的测定.方法简便、准确、便于推广.

  1. Report on BAAL "Language in Africa" SIG Meetings Reading in African Languages: Developing Literacies and Reading Methodologies

    Science.gov (United States)

    Wildsmith-Cromarty, Rosemary

    2015-01-01

    This report describes ongoing research on reading in African languages. It draws mainly on contributions from two British Association for Applied Linguistics (BAAL) "Language in Africa" (LiA) Special Interest Group (SIG) meetings: the LiA SIG strand at BAAL 2013 and the seminar on "Reading Methodologies in African Languages"…

  2. Report on BAAL "Language in Africa" SIG Meetings Reading in African Languages: Developing Literacies and Reading Methodologies

    Science.gov (United States)

    Wildsmith-Cromarty, Rosemary

    2015-01-01

    This report describes ongoing research on reading in African languages. It draws mainly on contributions from two British Association for Applied Linguistics (BAAL) "Language in Africa" (LiA) Special Interest Group (SIG) meetings: the LiA SIG strand at BAAL 2013 and the seminar on "Reading Methodologies in African Languages"…

  3. Spatio-temporal databases complex motion pattern queries

    CERN Document Server

    Vieira, Marcos R

    2013-01-01

    This brief presents several new query processing techniques, called complex motion pattern queries, specifically designed for very large spatio-temporal databases of moving objects. The brief begins with the definition of flexible pattern queries, which are powerful because of the integration of variables and motion patterns. This is followed by a summary of the expressive power of patterns and flexibility of pattern queries. The brief then present the Spatio-Temporal Pattern System (STPS) and density-based pattern queries. STPS databases contain millions of records with information about mobi

  4. Nearest and reverse nearest neighbor queries for moving objects

    DEFF Research Database (Denmark)

    Benetis, R.; Jensen, Christian Søndergaard; Karciauskas, G.

    2006-01-01

    With the continued proliferation of wireless communications and advances in positioning technologies, algorithms for efficiently answering queries about large populations of moving objects are gaining in interest. This paper proposes algorithms for k nearest and reverse k nearest neighbor queries...... on the current and anticipated future positions of points moving continuously in the plane. The former type of query returns k objects nearest to a query object for each time point during a time interval, while the latter returns the objects that have a specified query object as one of their k closest neighbors...

  5. Algebra-Based Optimization of XML-Extended OLAP Queries

    DEFF Research Database (Denmark)

    Yin, Xuepeng; Pedersen, Torben Bach

    is desirable. This report presents a complete foundation for such OLAP-XML federations. This includes a prototypical query engine, a simplified query semantics based on previous work, and a complete physical algebra which enables precise modeling of the execution tasks of an OLAP-XML query. Effective algebra......-based and cost-based query optimization and implementation are also proposed, as well as the execution techniques. Finally, experiments with the prototypical query engine w.r.t. federation performance, optimization effectiveness, and feasibility suggest that our approach, unlike the physical integration...

  6. Recommender Systems

    CERN Document Server

    Lü, Linyuan; Yeung, Chi Ho; Zhang, Yi-Cheng; Zhang, Zi-Ke; Zhou, Tao

    2012-01-01

    The ongoing rapid expansion of the Internet greatly increases the necessity of effective recommender systems for filtering the abundant information. Extensive research for recommender systems is conducted by a broad range of communities including social and computer scientists, physicists, and interdisciplinary researchers. Despite substantial theoretical and practical achievements, unification and comparison of different approaches are lacking, which impedes further advances. In this article, we review recent developments in recommender systems and discuss the major challenges. We compare and evaluate available algorithms and examine their roles in the future developments. In addition to algorithms, physical aspects are described to illustrate macroscopic behavior of recommender systems. Potential impacts and future directions are discussed. We emphasize that recommendation has a great scientific depth and combines diverse research fields which makes it of interests for physicists as well as interdisciplinar...

  7. BioSig3D: High Content Screening of Three-Dimensional Cell Culture Models.

    Directory of Open Access Journals (Sweden)

    Cemal Cagatay Bilgin

    Full Text Available BioSig3D is a computational platform for high-content screening of three-dimensional (3D cell culture models that are imaged in full 3D volume. It provides an end-to-end solution for designing high content screening assays, based on colony organization that is derived from segmentation of nuclei in each colony. BioSig3D also enables visualization of raw and processed 3D volumetric data for quality control, and integrates advanced bioinformatics analysis. The system consists of multiple computational and annotation modules that are coupled together with a strong use of controlled vocabularies to reduce ambiguities between different users. It is a web-based system that allows users to: design an experiment by defining experimental variables, upload a large set of volumetric images into the system, analyze and visualize the dataset, and either display computed indices as a heatmap, or phenotypic subtypes for heterogeneity analysis, or download computed indices for statistical analysis or integrative biology. BioSig3D has been used to profile baseline colony formations with two experiments: (i morphogenesis of a panel of human mammary epithelial cell lines (HMEC, and (ii heterogeneity in colony formation using an immortalized non-transformed cell line. These experiments reveal intrinsic growth properties of well-characterized cell lines that are routinely used for biological studies. BioSig3D is being released with seed datasets and video-based documentation.

  8. Proceedings of the 10th ASIS SIG/CR classification research workshop

    DEFF Research Database (Denmark)

    This volume is a working copy of the papers presented at the 9th ASIS SIG/CR workshop on classification research, held in Washington, DC, at the ASIS Annual Meeting on Sunday 31 October 1999. The contributions printed here are working papers, and thus, not necessarily in their final form. For thi...

  9. Federal School Improvement Grants (SIGs): How Capacity and Local Conditions Matter

    Science.gov (United States)

    Yatsko, Sarah; Lake, Robin; Bowen, Melissa; Cooley Nelson, Elizabeth

    2015-01-01

    In 2009, the federal government committed over $3 billion nationwide to help states and districts turn around their worst-performing schools. The U.S. Department of Education intended for the School Improvement Grants (SIGs) to spur dramatic change.This report looks at the results of a field study of the first-year implementation of those grants…

  10. Incorporating Early Learning Strategies in the School Improvement Grants (SIG) Program

    Science.gov (United States)

    Connors-Tadros, Lori; Dunn, Lenay; Martella, Jana; McCauley, Carlas

    2015-01-01

    The Center on Enhancing Early Learning Outcomes (CEELO) and the Center on School Turnaround (CST) collaborated to develop case studies of three selected schools receiving SIG funds that have, with the support of their districts, promoted the use of early childhood programming (PK-3) as a key strategy in their schools' turnaround models. The goal…

  11. 77 FR 24192 - SIG Energy, LLLP v. California Independent System Operator Corporation; Notice of Complaint

    Science.gov (United States)

    2012-04-23

    ... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF ENERGY Federal Energy Regulatory Commission SIG Energy, LLLP v. California Independent System Operator Corporation; Notice of Complaint Take notice that on April 4, 2012, pursuant to section 206 of the Federal...

  12. SIG中的资源监控系统%Resource Monitoring System in SIG

    Institute of Scientific and Technical Information of China (English)

    田鸣华; 肖侬; 刘波; 黄斌

    2005-01-01

    空间信息网格(SIG)是一个广域分布的信息基础设施,发现与共享闲置资源是其主要功能之一.为了实现这一目标,必须实现资源监控系统来聚集和分发资源的状态信息.介绍了在SIG中实现的资源监控系统--SIG-RMS.该系统遵循GMA架构,实现了局部监控服务(LMS)和全局监控服务(GMS).系统采用模块化设计,具有灵活的可配置性,可扩展性和高可用性.详细介绍了SIG-RMS模块化的体系结构、灵活可扩展的信息模板和推拉式的通信方式,并描述了它的实现,最后总结了SIG-RMS的特点.

  13. BioSig3D: High Content Screening of Three-Dimensional Cell Culture Models.

    Science.gov (United States)

    Bilgin, Cemal Cagatay; Fontenay, Gerald; Cheng, Qingsu; Chang, Hang; Han, Ju; Parvin, Bahram

    2016-01-01

    BioSig3D is a computational platform for high-content screening of three-dimensional (3D) cell culture models that are imaged in full 3D volume. It provides an end-to-end solution for designing high content screening assays, based on colony organization that is derived from segmentation of nuclei in each colony. BioSig3D also enables visualization of raw and processed 3D volumetric data for quality control, and integrates advanced bioinformatics analysis. The system consists of multiple computational and annotation modules that are coupled together with a strong use of controlled vocabularies to reduce ambiguities between different users. It is a web-based system that allows users to: design an experiment by defining experimental variables, upload a large set of volumetric images into the system, analyze and visualize the dataset, and either display computed indices as a heatmap, or phenotypic subtypes for heterogeneity analysis, or download computed indices for statistical analysis or integrative biology. BioSig3D has been used to profile baseline colony formations with two experiments: (i) morphogenesis of a panel of human mammary epithelial cell lines (HMEC), and (ii) heterogeneity in colony formation using an immortalized non-transformed cell line. These experiments reveal intrinsic growth properties of well-characterized cell lines that are routinely used for biological studies. BioSig3D is being released with seed datasets and video-based documentation.

  14. ASIST 2001. Information in a Networked World: Harnessing the Flow. Part II: SIG and Panel Sessions.

    Science.gov (United States)

    Proceedings of the ASIST Annual Meeting, 2001

    2001-01-01

    Abstracts and descriptions of Special Interest Group (SIG) and Panel Sessions include such topics as: knowledge management; information networks; courseware; accreditation; Web site design; information retrieval and processing; digital imaging; electronic journals; human computer interaction; digital libraries; digital divide; information science;…

  15. Domogis: prototipo de un interfaz del sistema de control de un edificio integrado en un SIG

    Directory of Open Access Journals (Sweden)

    Álvarez, M.

    2010-06-01

    Full Text Available This paper deals with of a the use of Geographical Information Systems (GIS for domotic control. The foccus is put on the communication interface between the building control system (BCS integrated in a GIS. For get this aim, the GIS of the Montegancedo Campus where is located the Facultad de infomatica of UPM and the creation of an interface is needed. The implemented interface in Microsoft C# language allows the control, monotorizing and management of the sensors data installated in the Campus.

    Este trabajo trata de la utilización de los Sistemas de Información Geográfica (SIG en uno de las nuevos requerimientos de la arquitectura, el control domótico. El objetivo es el desarrollo de un interfaz de comunicación del Sistema de Control de un Edificio (SCE integrado en un SIG. La consecución de este objetivo implica previamente el desarrollo del SIG del Campus de Montegancedo sede de la Facultad de Informática de la UPM y la creación de un interfaz integrado en el SIG, desarrollado en lenguaje de programacion C# de Microsoft. Este interfaz dirige al usuario en la realización de ciertas tareas de control domótico de las instalaciones urbanas y edificios del Campus universitario, como evaluar, monotorizar y gestionar datos procedentes de sensores estratégicamente situados en dicho Campus.

  16. Increased bioplastic production with an RNA polymerase sigma factor SigE during nitrogen starvation in Synechocystis sp. PCC 6803.

    Science.gov (United States)

    Osanai, Takashi; Numata, Keiji; Oikawa, Akira; Kuwahara, Ayuko; Iijima, Hiroko; Doi, Yoshiharu; Tanaka, Kan; Saito, Kazuki; Hirai, Masami Yokota

    2013-12-01

    Because cyanobacteria directly harvest CO2 and light energy, their carbon metabolism is important for both basic and applied sciences. Here, we show that overexpression of the sigma factor sigE in Synechocystis sp. PCC 6803 widely changes sugar catabolism and increases production of the biodegradable polyester polyhydroxybutyrate (PHB) during nitrogen starvation. sigE overexpression elevates the levels of proteins implicated in glycogen catabolism, the oxidative pentose phosphate pathway, and polyhydroxyalkanoate biosynthesis. PHB accumulation is enhanced by sigE overexpression under nitrogen-limited conditions, yet the molecular weights of PHBs synthesized by the parental glucose-tolerant and sigE overexpression strain are similar. Although gene expression induced by nitrogen starvation is changed and other metabolites (such as GDP-mannose and citrate) accumulate under sigE overexpression, genetic engineering of this sigma factor altered the metabolic pathway from glycogen to PHB during nitrogen starvation.

  17. OPTIMIZATION OF LOCATION BASED QUERIES USING SPATIAL INDEXING

    Directory of Open Access Journals (Sweden)

    S. Geetha

    2014-04-01

    Full Text Available The recent development in the technology leads to the introduction of various mobile terminals and there is a demand that the client requires effective location based services. The valid regions expand and also query retrieval time increases which lead to poor performance of query processing. The spatial indexing techniques are one of the most effective optimization methods to improve the quality of services. In existing system NN queries and window queries are used. In that R-tree and grid indexing has been used for increasing the query efficiency. But the Grid-index technique support low memory and thus large databases cannot be handled effectively. In the proposed system we are using Ordered grid index and EVR-tree to minimize the query retrieval time and to decrease the depth of the search index. The Ordered grid index and EVR-tree to speed up the spatial query processing.

  18. A Comprehensive Trainable Error Model for Sung Music Queries

    CERN Document Server

    Birmingham, W P; 10.1613/jair.1334

    2011-01-01

    We propose a model for errors in sung queries, a variant of the hidden Markov model (HMM). This is a solution to the problem of identifying the degree of similarity between a (typically error-laden) sung query and a potential target in a database of musical works, an important problem in the field of music information retrieval. Similarity metrics are a critical component of query-by-humming (QBH) applications which search audio and multimedia databases for strong matches to oral queries. Our model comprehensively expresses the types of error or variation between target and query: cumulative and non-cumulative local errors, transposition, tempo and tempo changes, insertions, deletions and modulation. The model is not only expressive, but automatically trainable, or able to learn and generalize from query examples. We present results of simulations, designed to assess the discriminatory potential of the model, and tests with real sung queries, to demonstrate relevance to real-world applications.

  19. A Grammar Analysis Model for the Unified Multimedia Query Language

    Institute of Scientific and Technical Information of China (English)

    Zhong-Sheng Cao; Zong-Da Wu; Yuan-Zhen Wang

    2008-01-01

    The unified multimedia query language (UMQL) is a powerful general-purpose multimedia query language, and it is very suitable for multimedia information retrieval. The paper proposes a grammar analysis model to implement an effective grammatical processing for the language. It separates the grammar analysis of a UMQL query specification into two phases: syntactic analysis and semantic analysis, and then respectively uses Backus-Naur form (EBNF) and logical algebra to specify both restrictive grammar rules. As a result, the model can present error guiding information for a query specification which owns incorrect grammar. The model not only suits well the processing of UMQL queries, but also has a guiding significance for other projects concerning query processings of descriptive query languages.

  20. Improve Performance of Data Warehouse by Query Cache

    Science.gov (United States)

    Gour, Vishal; Sarangdevot, S. S.; Sharma, Anand; Choudhary, Vinod

    2010-11-01

    The primary goal of data warehouse is to free the information locked up in the operational database so that decision makers and business analyst can make queries, analysis and planning regardless of the data changes in operational database. As the number of queries is large, therefore, in certain cases there is reasonable probability that same query submitted by the one or multiple users at different times. Each time when query is executed, all the data of warehouse is analyzed to generate the result of that query. In this paper we will study how using query cache improves performance of Data Warehouse and try to find the common problems faced. These kinds of problems are faced by Data Warehouse administrators which are minimizes response time and improves the efficiency of query in data warehouse overall, particularly when data warehouse is updated at regular interval.

  1. Performance Oriented Query Processing In GEO Based Location Search Engines

    CERN Document Server

    Umamaheswari, M

    2010-01-01

    Geographic location search engines allow users to constrain and order search results in an intuitive manner by focusing a query on a particular geographic region. Geographic search technology, also called location search, has recently received significant interest from major search engine companies. Academic research in this area has focused primarily on techniques for extracting geographic knowledge from the web. In this paper, we study the problem of efficient query processing in scalable geographic search engines. Query processing is a major bottleneck in standard web search engines, and the main reason for the thousands of machines used by the major engines. Geographic search engine query processing is different in that it requires a combination of text and spatial data processing techniques. We propose several algorithms for efficient query processing in geographic search engines, integrate them into an existing web search query processor, and evaluate them on large sets of real data and query traces.

  2. Query Specific Rank Fusion for Image Retrieval.

    Science.gov (United States)

    Zhang, Shaoting; Yang, Ming; Cour, Timothee; Yu, Kai; Metaxas, Dimitris N

    2015-04-01

    Recently two lines of image retrieval algorithms demonstrate excellent scalability: 1) local features indexed by a vocabulary tree, and 2) holistic features indexed by compact hashing codes. Although both of them are able to search visually similar images effectively, their retrieval precision may vary dramatically among queries. Therefore, combining these two types of methods is expected to further enhance the retrieval precision. However, the feature characteristics and the algorithmic procedures of these methods are dramatically different, which is very challenging for the feature-level fusion. This motivates us to investigate how to fuse the ordered retrieval sets, i.e., the ranks of images, given by multiple retrieval methods, to boost the retrieval precision without sacrificing their scalability. In this paper, we model retrieval ranks as graphs of candidate images and propose a graph-based query specific fusion approach, where multiple graphs are merged and reranked by conducting a link analysis on a fused graph. The retrieval quality of an individual method is measured on-the-fly by assessing the consistency of the top candidates' nearest neighborhoods. Hence, it is capable of adaptively integrating the strengths of the retrieval methods using local or holistic features for different query images. This proposed method does not need any supervision, has few parameters, and is easy to implement. Extensive and thorough experiments have been conducted on four public datasets, i.e., the UKbench, Corel-5K, Holidays and the large-scale San Francisco Landmarks datasets. Our proposed method has achieved very competitive performance, including state-of-the-art results on several data sets, e.g., the N-S score 3.83 for UKbench.

  3. Deep web query interface understanding and integration

    CERN Document Server

    Dragut, Eduard C; Yu, Clement T

    2012-01-01

    There are millions of searchable data sources on the Web and to a large extent their contents can only be reached through their own query interfaces. There is an enormous interest in making the data in these sources easily accessible. There are primarily two general approaches to achieve this objective. The first is to surface the contents of these sources from the deep Web and add the contents to the index of regular search engines. The second is to integrate the searching capabilities of these sources and support integrated access to them. In this book, we introduce the state-of-the-art tech

  4. SPARQL Assist Language-Neutral Query Composer

    CERN Document Server

    McCarthy, Luke; Wilkinson, Mark

    2010-01-01

    SPARQL query composition is difficult for the lay-person or even the experienced bioinformatician in cases where the data model is unfamiliar. Established best-practices and internationalization concerns dictate that semantic web ontologies should use terms with opaque identifiers, further complicating the task. We present SPARQL Assist: a web application that addresses these issues by providing context-sensitive type-ahead completion to existing web forms. Ontological terms are suggested using their labels and descriptions, leveraging existing XML support for internationalization and language-neutrality.

  5. Immune Algorithm For Document Query Optimization

    Institute of Scientific and Technical Information of China (English)

    WangZiqiang; FengBoqin

    2005-01-01

    To efficiently retrieve relevant document from the rapid proliferation of large information collections, a novel immune algorithm for document query optimization is proposed. The essential ideal of the immune algorithm is that the crossover and mutation of operator are constructed according to its own characteristics of information retrieval. Immune operator is adopted to avoid degeneracy. Relevant documents retrieved am merged to a single document list according to rank formula. Experimental results show that the novel immune algorithm can lead to substantial improvements of relevant document retrieval effectiveness.

  6. Downloading Multiple Records Using Query Strings

    Directory of Open Access Journals (Sweden)

    Adam Crymble

    2012-11-01

    Full Text Available Downloading a single record from a website is easy, but downloading many records at a time – an increasingly frequent need for a historian – is much more efficient using a programming language such as Python. In this lesson, we will write a program that will download a series of records from the Old Bailey Online using custom search criteria, and save them to a directory on our computer. This process involves interpreting and manipulating URL Query Strings. In this case, the tutorial will seek to download sources that contain references to people of African descent that were published in the Old Bailey Proceedings between 1700 and 1750.

  7. An Optimal Labeling Scheme for Ancestry Queries

    OpenAIRE

    2009-01-01

    An ancestry labeling scheme assigns labels (bit strings) to the nodes of rooted trees such that ancestry queries between any two nodes in a tree can be answered merely by looking at their corresponding labels. The quality of an ancestry labeling scheme is measured by its label size, that is the maximal number of bits in a label of a tree node. In addition to its theoretical appeal, the design of efficient ancestry labeling schemes is motivated by applications in web search engines. For this p...

  8. Recommended Wilderness

    Data.gov (United States)

    National Park Service, Department of the Interior — Recommended wilderness is an Arcview shapefile representing the porposed wilderness areas throughout the park. The boundaries for this data set were digitized by...

  9. PRESY: A Context Based Query Reformulation Tool for Information Retrieval on the Web

    Directory of Open Access Journals (Sweden)

    Abdelkrim Bouramoul

    2010-01-01

    Full Text Available Problem statement: The huge number of information on the web as well as the growth of new inexperienced users creates new challenges for information retrieval. It has become increasingly difficult for these users to find relevant documents that satisfy their individual needs. Certainly the current search engines (such as Google, Bing and Yahoo offer an efficient way to browse the web content. However, the result quality is highly based on uses queries which need to be more precise to find relevant documents. This task still complicated for the majority of inept users who cannot express their needs with significant words in the query. For that reason, we believe that a reformulation of the initial user's query can be a good alternative to improve the information selectivity. This study proposed a novel approach and presents a prototype system called Profile-based Reformulation System (PRESY for information retrieval on the web. Approach: It used an incremental approach to categorize users by constructing a contextual base. The latter was composed of two types of context (static and dynamic obtained using the users' profiles. The architecture proposed was implemented using .Net environment to perform queries reformulating tests. Results: The experiments gave at the end of this article show that the precision of the returned content is effectively improved. The tests were performed with the most popular searching engine (i.e., Google, Bind and Yahoo selected in particular for their high selectivity. Among the given results, we found that query reformulation improve the first three results by 10.7 and 11.7% of the next seven returned elements. So as we could see the reformulation of users' initial queries improves the pertinence of returned content. Conclusion/Recommendations: Therefore, we believed that the exploitation of contextual data based on users' profiles could be a very good way to reformulate user query. This complementary mechanism would

  10. Recommender systems

    CERN Document Server

    Kembellec, Gérald; Saleh, Imad

    2014-01-01

    Acclaimed by various content platforms (books, music, movies) and auction sites online, recommendation systems are key elements of digital strategies. If development was originally intended for the performance of information systems, the issues are now massively moved on logical optimization of the customer relationship, with the main objective to maximize potential sales. On the transdisciplinary approach, engines and recommender systems brings together contributions linking information science and communications, marketing, sociology, mathematics and computing. It deals with the understan

  11. Partial match queries in random quadtrees

    CERN Document Server

    Broutin, Nicolas; Sulzbach, Henning

    2011-01-01

    We consider the problem of recovering items matching a partially specified pattern in multidimensional trees (quad trees and k-d trees). We assume the traditional model where the data consist of independent and uniform points in the unit square. For this model, in a structure on $n$ points, it is known that the number of nodes $C_n(\\xi)$ to visit in order to report the items matching an independent and uniformly on $[0,1]$ random query $\\xi$ satisfies $\\Ec{C_n(\\xi)}\\sim \\kappa n^{\\beta}$, where $\\kappa$ and $\\beta$ are explicit constants. We develop an approach based on the analysis of the cost $C_n(x)$ of any fixed query $x\\in [0,1]$, and give precise estimates for the variance and limit distribution of the cost $C_n(x)$. Our results permit to describe a limit process for the costs $C_n(x)$ as $x$ varies in $[0,1]$; one of the consequences is that $E{\\max_{x\\in [0,1]} C_n(x)} \\sim \\gamma n^\\beta$.

  12. Query-Time Optimization Techniques for Structured Queries in Information Retrieval

    Science.gov (United States)

    Cartright, Marc-Allen

    2013-01-01

    The use of information retrieval (IR) systems is evolving towards larger, more complicated queries. Both the IR industrial and research communities have generated significant evidence indicating that in order to continue improving retrieval effectiveness, increases in retrieval model complexity may be unavoidable. From an operational perspective,…

  13. SISTEM INFORMASI GEOGRAFIS (SIG DALAM BIDANG KESEHATAN MASYARAKAT

    Directory of Open Access Journals (Sweden)

    Endang Indriasih

    2012-11-01

    , and just a few cases were found in areas at more than 100meters above sea level. Finally, the capability of GIS to manage spatial data is enable to perform various scenarios as conducting strategic analysis. In order to minimaze bias and uncertain results, it recommends to conduct GIS analysis by multidiscipline members. It should be reminded that the final objective of GIS application in health sector is to improve the ability in plannmg, diagnosing, and intervention of health problems in many governmental administration level due tosupport the achievement of health development goals. Key words: geographic information system, public health, policy making process

  14. CrossQuery: a web tool for easy associative querying of transcriptome data.

    Directory of Open Access Journals (Sweden)

    Toni U Wagner

    Full Text Available Enormous amounts of data are being generated by modern methods such as transcriptome or exome sequencing and microarray profiling. Primary analyses such as quality control, normalization, statistics and mapping are highly complex and need to be performed by specialists. Thereafter, results are handed back to biomedical researchers, who are then confronted with complicated data lists. For rather simple tasks like data filtering, sorting and cross-association there is a need for new tools which can be used by non-specialists. Here, we describe CrossQuery, a web tool that enables straight forward, simple syntax queries to be executed on transcriptome sequencing and microarray datasets. We provide deep-sequencing data sets of stem cell lines derived from the model fish Medaka and microarray data of human endothelial cells. In the example datasets provided, mRNA expression levels, gene, transcript and sample identification numbers, GO-terms and gene descriptions can be freely correlated, filtered and sorted. Queries can be saved for later reuse and results can be exported to standard formats that allow copy-and-paste to all widespread data visualization tools such as Microsoft Excel. CrossQuery enables researchers to quickly and freely work with transcriptome and microarray data sets requiring only minimal computer skills. Furthermore, CrossQuery allows growing association of multiple datasets as long as at least one common point of correlated information, such as transcript identification numbers or GO-terms, is shared between samples. For advanced users, the object-oriented plug-in and event-driven code design of both server-side and client-side scripts allow easy addition of new features, data sources and data types.

  15. CrossQuery: a web tool for easy associative querying of transcriptome data.

    Science.gov (United States)

    Wagner, Toni U; Fischer, Andreas; Thoma, Eva C; Schartl, Manfred

    2011-01-01

    Enormous amounts of data are being generated by modern methods such as transcriptome or exome sequencing and microarray profiling. Primary analyses such as quality control, normalization, statistics and mapping are highly complex and need to be performed by specialists. Thereafter, results are handed back to biomedical researchers, who are then confronted with complicated data lists. For rather simple tasks like data filtering, sorting and cross-association there is a need for new tools which can be used by non-specialists. Here, we describe CrossQuery, a web tool that enables straight forward, simple syntax queries to be executed on transcriptome sequencing and microarray datasets. We provide deep-sequencing data sets of stem cell lines derived from the model fish Medaka and microarray data of human endothelial cells. In the example datasets provided, mRNA expression levels, gene, transcript and sample identification numbers, GO-terms and gene descriptions can be freely correlated, filtered and sorted. Queries can be saved for later reuse and results can be exported to standard formats that allow copy-and-paste to all widespread data visualization tools such as Microsoft Excel. CrossQuery enables researchers to quickly and freely work with transcriptome and microarray data sets requiring only minimal computer skills. Furthermore, CrossQuery allows growing association of multiple datasets as long as at least one common point of correlated information, such as transcript identification numbers or GO-terms, is shared between samples. For advanced users, the object-oriented plug-in and event-driven code design of both server-side and client-side scripts allow easy addition of new features, data sources and data types.

  16. A Study of Library Databases by Translating Those SQL Queries into Relational Algebra and Generating Query Trees

    Directory of Open Access Journals (Sweden)

    Santhi Lasya

    2011-09-01

    Full Text Available Even in this World Wide Web era where there is unrestricted access to a lot of articles and books at a mouses click, the role of an organized library is immense. It is vital to have effective software to manage various functions in a library and the fundamental for effective software is the underlying database access and the queries used. And hence library databases become our use-case for this study. This paper starts off with considering a basic ER model of a typical library relational database. We would also list all the basic use-cases in a library management system. The next part of the paper deals with the sql queries used for performing certain functions in a library database management system. Along with the queries, we would generate reports for some of the use cases. The final section of the paper forms the crux of this library database study, wherein we would dwell on the concepts of query processing and query optimization in the relational database domain. We would analyze the above mentioned queries, by translating the query into a relational algebra expression and generating a query tree for the same. By converting algebra, we look at optimizing the query, and by generating a query tree, we would come up a cheapest cost plan.

  17. Multidimensional Data Querying on Tree-Structured Overlay

    Institute of Scientific and Technical Information of China (English)

    XU Lizhen; WANG Shiyuan

    2006-01-01

    Multidimensional data query has been gaining much interest in database research communities in recent years, yet many of the existing studies focus mainly on centralized systems.A solution to querying in Peer-to-Peer(P2P) environment was proposed to achieve both low processing cost in terms of the number of peers accessed and search messages and balanced query loads among peers.The system is based on a balanced tree structured P2P network.By partitioning the query space intelligently, the amount of query forwarding is effectively controlled, and the number of peers involved and search messages are also limited.Dynamic load balancing can be achieved during space partitioning and query resolving.Extensive experiments confirm the effectiveness and scalability of our algorithms on P2P networks.

  18. Structured Query Translation in Peer to Peer Database Sharing Systems

    Directory of Open Access Journals (Sweden)

    Mehedi Masud

    2009-10-01

    Full Text Available This paper presents a query translation mechanism between heterogeneous peers in Peer to Peer Database Sharing Systems (PDSSs. A PDSS combines a database management system with P2P functionalities. The local databases on peers are called peer databases. In a PDSS, each peer chooses its own data model and schema and maintains data independently without any global coordinator. One of the problems in such a system is translating queries between peers, taking into account both the schema and data heterogeneity. Query translation is the problem of rewriting a query posed in terms of one peer schema to a query in terms of another peer schema. This paper proposes a query translation mechanism between peers where peers are acquainted in data sharing systems through data-level mappings for sharing data.

  19. Query-Based Outlier Detection in Heterogeneous Information Networks

    Science.gov (United States)

    Kuck, Jonathan; Zhuang, Honglei; Yan, Xifeng; Cam, Hasan; Han, Jiawei

    2015-01-01

    Outlier or anomaly detection in large data sets is a fundamental task in data science, with broad applications. However, in real data sets with high-dimensional space, most outliers are hidden in certain dimensional combinations and are relative to a user’s search space and interest. It is often more effective to give power to users and allow them to specify outlier queries flexibly, and the system will then process such mining queries efficiently. In this study, we introduce the concept of query-based outlier in heterogeneous information networks, design a query language to facilitate users to specify such queries flexibly, define a good outlier measure in heterogeneous networks, and study how to process outlier queries efficiently in large data sets. Our experiments on real data sets show that following such a methodology, interesting outliers can be defined and uncovered flexibly and effectively in large heterogeneous networks. PMID:27064397

  20. Goal Directed Relative Skyline Queries in Time Dependent Road Networks

    CERN Document Server

    Iyer, K B Priya

    2012-01-01

    The Wireless GIS technology is progressing rapidly in the area of mobile communications. Location-based spatial queries are becoming an integral part of many new mobile applications. The Skyline queries are latest apps under Location-based services. In this paper we introduce Goal Directed Relative Skyline queries on Time dependent (GD-RST) road networks. The algorithm uses travel time as a metric in finding the data object by considering multiple query points (multi-source skyline) relative to user location and in the user direction of travelling. We design an efficient algorithm based on Filter phase, Heap phase and Refine Skyline phases. At the end, we propose a dynamic skyline caching (DSC) mechanism which helps to reduce the computation cost for future skyline queries. The experimental evaluation reflects the performance of GD-RST algorithm over the traditional branch and bound algorithm for skyline queries in real road networks.

  1. Querying and Extracting Timeline Information from Road Traffic Sensor Data.

    Science.gov (United States)

    Imawan, Ardi; Indikawati, Fitri Indra; Kwon, Joonho; Rao, Praveen

    2016-08-23

    The escalation of traffic congestion in urban cities has urged many countries to use intelligent transportation system (ITS) centers to collect historical traffic sensor data from multiple heterogeneous sources. By analyzing historical traffic data, we can obtain valuable insights into traffic behavior. Many existing applications have been proposed with limited analysis results because of the inability to cope with several types of analytical queries. In this paper, we propose the QET (querying and extracting timeline information) system-a novel analytical query processing method based on a timeline model for road traffic sensor data. To address query performance, we build a TQ-index (timeline query-index) that exploits spatio-temporal features of timeline modeling. We also propose an intuitive timeline visualization method to display congestion events obtained from specified query parameters. In addition, we demonstrate the benefit of our system through a performance evaluation using a Busan ITS dataset and a Seattle freeway dataset.

  2. Multi-Dimensional Top-k Dominating Queries

    DEFF Research Database (Denmark)

    Yiu, Man Lung; Mamoulis, Nikos

    2009-01-01

    The top-k dominating query returns k data objects which dominate the highest number of objects in a dataset. This query is an important tool for decision support since it provides data analysts an intuitive way for finding significant objects. In addition, it combines the advantages of top...... attention from the research community. This paper is an extensive study on the evaluation of topk dominating queries. First, we propose a set of algorithms that apply on indexed multi-dimensional data. Second, we investigate query evaluation on data that are not indexed. Finally, we study a relaxed variant...... of the query which considers dominance in dimensional subspaces. Experiments using synthetic and real datasets demonstrate that our algorithms significantly outperform a previous skyline-based approach. We also illustrate the applicability of this multi-dimensional analysis query by studying the meaningfulness...

  3. A novel adaptive Cuckoo search for optimal query plan generation.

    Science.gov (United States)

    Gomathi, Ramalingam; Sharmila, Dhandapani

    2014-01-01

    The emergence of multiple web pages day by day leads to the development of the semantic web technology. A World Wide Web Consortium (W3C) standard for storing semantic web data is the resource description framework (RDF). To enhance the efficiency in the execution time for querying large RDF graphs, the evolving metaheuristic algorithms become an alternate to the traditional query optimization methods. This paper focuses on the problem of query optimization of semantic web data. An efficient algorithm called adaptive Cuckoo search (ACS) for querying and generating optimal query plan for large RDF graphs is designed in this research. Experiments were conducted on different datasets with varying number of predicates. The experimental results have exposed that the proposed approach has provided significant results in terms of query execution time. The extent to which the algorithm is efficient is tested and the results are documented.

  4. An adaptive range-query optimization technique with distributed replicas

    Institute of Scientific and Technical Information of China (English)

    Sayar Ahmet; Pierce Marlon; Fox C.Geoffrey

    2014-01-01

    Replication is an approach often used to speed up the execution of queries submitted to a large dataset. A compile-time/run-time approach is presented for minimizing the response time of 2-dimensional range when a distributed replica of a dataset exists. The aim is to partition the query payload (and its range) into subsets and distribute those to the replica nodes in a way that minimizes a client’s response time. However, since query size and distribution characteristics of data (data dense/sparse regions) in varying ranges are not known a priori, performing efficient load balancing and parallel processing over the unpredictable workload is difficult. A technique based on the creation and manipulation of dynamic spatial indexes for query payload estimation in distributed queries was proposed. The effectiveness of this technique was demonstrated on queries for analysis of archived earthquake-generated seismic data records.

  5. Querying Big Data:Bridging Theory and Practice

    Institute of Scientific and Technical Information of China (English)

    樊文飞; 怀进鹏

    2014-01-01

    Big data introduces challenges to query answering, from theory to practice. A number of questions arise. What queries are “tractable” on big data? How can we make big data “small” so that it is feasible to find exact query answers? When exact answers are beyond reach in practice, what approximation theory can help us strike a balance between the quality of approximate query answers and the costs of computing such answers? To get sensible query answers in big data, what else do we necessarily do in addition to coping with the size of the data? This position paper aims to provide an overview of recent advances in the study of querying big data. We propose approaches to tackling these challenging issues, and identify open problems for future research.

  6. A Novel Adaptive Cuckoo Search for Optimal Query Plan Generation

    Directory of Open Access Journals (Sweden)

    Ramalingam Gomathi

    2014-01-01

    Full Text Available The emergence of multiple web pages day by day leads to the development of the semantic web technology. A World Wide Web Consortium (W3C standard for storing semantic web data is the resource description framework (RDF. To enhance the efficiency in the execution time for querying large RDF graphs, the evolving metaheuristic algorithms become an alternate to the traditional query optimization methods. This paper focuses on the problem of query optimization of semantic web data. An efficient algorithm called adaptive Cuckoo search (ACS for querying and generating optimal query plan for large RDF graphs is designed in this research. Experiments were conducted on different datasets with varying number of predicates. The experimental results have exposed that the proposed approach has provided significant results in terms of query execution time. The extent to which the algorithm is efficient is tested and the results are documented.

  7. Querying and Extracting Timeline Information from Road Traffic Sensor Data

    Directory of Open Access Journals (Sweden)

    Ardi Imawan

    2016-08-01

    Full Text Available The escalation of traffic congestion in urban cities has urged many countries to use intelligent transportation system (ITS centers to collect historical traffic sensor data from multiple heterogeneous sources. By analyzing historical traffic data, we can obtain valuable insights into traffic behavior. Many existing applications have been proposed with limited analysis results because of the inability to cope with several types of analytical queries. In this paper, we propose the QET (querying and extracting timeline information system—a novel analytical query processing method based on a timeline model for road traffic sensor data. To address query performance, we build a TQ-index (timeline query-index that exploits spatio-temporal features of timeline modeling. We also propose an intuitive timeline visualization method to display congestion events obtained from specified query parameters. In addition, we demonstrate the benefit of our system through a performance evaluation using a Busan ITS dataset and a Seattle freeway dataset.

  8. Implementing and evaluating a regional strategy to improve testing rates in VA patients at risk for HIV, utilizing the QUERI process as a guiding framework: QUERI Series

    Directory of Open Access Journals (Sweden)

    Osborn Teresa

    2008-03-01

    Full Text Available Abstract Background We describe how we used the framework of the U.S. Department of Veterans Affairs (VA Quality Enhancement Research Initiative (QUERI to develop a program to improve rates of diagnostic testing for the Human Immunodeficiency Virus (HIV. This venture was prompted by the observation by the CDC that 25% of HIV-infected patients do not know their diagnosis – a point of substantial importance to the VA, which is the largest provider of HIV care in the United States. Methods Following the QUERI steps (or process, we evaluated: 1 whether undiagnosed HIV infection is a high-risk, high-volume clinical issue within the VA, 2 whether there are evidence-based recommendations for HIV testing, 3 whether there are gaps in the performance of VA HIV testing, and 4 the barriers and facilitators to improving current practice in the VA. Based on our findings, we developed and initiated a QUERI step 4/phase 1 pilot project using the precepts of the Chronic Care Model. Our improvement strategy relies upon electronic clinical reminders to provide decision support; audit/feedback as a clinical information system, and appropriate changes in delivery system design. These activities are complemented by academic detailing and social marketing interventions to achieve provider activation. Results Our preliminary formative evaluation indicates the need to ensure leadership and team buy-in, address facility-specific barriers, refine the reminder, and address factors that contribute to inter-clinic variances in HIV testing rates. Preliminary unadjusted data from the first seven months of our program show 3–5 fold increases in the proportion of at-risk patients who are offered HIV testing at the VA sites (stations where the pilot project has been undertaken; no change was seen at control stations. Discussion This project demonstrates the early success of the application of the QUERI process to the development of a program to improve HIV testing rates

  9. A sporulation-specific, sigF-dependent protein, SspA, affects septum positioning in Streptomyces coelicolor

    OpenAIRE

    2013-01-01

    The RNA polymerase sigma factor SigF controls late development during sporulation in the filamentous bacterium Streptomyces coelicolor. The only known SigF-dependent gene identified so far, SCO5321, is found in the biosynthetic cluster encoding spore pigment synthesis. Here we identify the first direct target for SigF, the gene sspA, encoding a sporulation-specific protein. Bioinformatic analysis suggests that SspA is a secreted lipoprotein with two PepSY signature domains. The sspA deletion ...

  10. Towards the Formalization of Fuzzy Relational Database Queries

    Directory of Open Access Journals (Sweden)

    Aleksandar Perović

    2009-03-01

    Full Text Available The aim of this paper is to give guidelines on how to formalize fuzzy relationaldatabase queries using 1LΠ 2 fuzzy logic. After the short introduction, we give anoverview of the1LΠ 2 logic. In the continuation we give a brief overview of the FRDBqueries and query-database similarity relation. We conclude the paper with the descriptionof FRDB query formalization using presented definitions.

  11. Northeastern University in TREC 2009. Million Query Track

    Science.gov (United States)

    2009-11-01

    queries in the collection had at least one click on documents in the .gov collection. Given the computational complexity of feature extraction and...ranking functions (by employing SVM) over two different collections, (a) the Million Query 2008 (MQ08) collection (GOV2 corpus and queries with at least one ... click on doc- uments in the .gov domain), and (b) a Bing generated collection (described in Section 2.1) and employed the constructed ranking

  12. Learning from minimum entropy queries in a large committee machine

    CERN Document Server

    Sollich, P

    1996-01-01

    In supervised learning, the redundancy contained in random examples can be avoided by learning from queries. Using statistical mechanics, we study learning from minimum entropy queries in a large tree-committee machine. The generalization error decreases exponentially with the number of training examples, providing a significant improvement over the algebraic decay for random examples. The connection between entropy and generalization error in multi-layer networks is discussed, and a computationally cheap algorithm for constructing queries is suggested and analysed.

  13. SIG et évaluation des risques naturels: application aux risques sismiques de Quito

    Directory of Open Access Journals (Sweden)

    Jean-Luc CHATELAIN

    1995-09-01

    Full Text Available L'article retrace rapidement les principales étapes de la réalisation d'un scénario sismique sur la ville de Quito. Les croisements nécessaires entre les données provenant de domaines variés (sciences de la Terre, ingénierie civile, et sociodémographie ont pu être effectués rapidement grâce à l'utilisation du SIG SAVANE. Le SIG a permis l'édition de documents graphiques décrivant de façon concrète la vulnérabilité sismique de la ville, facilitant ainsi la prise de conscience des responsables politiques et économiques.

  14. SIG EN LA NUBE: WEBSIG PARA LA ENSEÑANZA DE LA GEOGRAFÍA

    Directory of Open Access Journals (Sweden)

    Andrew J. Milson

    Full Text Available RESUMEN:Para la mayoría de los profesores dedicados a la enseñanza de la Geografía no hay duda de que el SIG es una herramienta importante en el proceso de enseñanza-aprendizaje, pero su uso se ha ido retrasando por problemas tales como el coste del software y la gestión de grandes archivos de datos espaciales. El movimiento hacia la computación en la nube, conocida como la nube de Internet, es una tendencia prometedora para los SIG en la educación. La "nube" se refiere a una red virtual que ofrece a los usuarios acceso a archivos, servicios y aplicaciones. En este artículo se pone de manifiesto que la nube de Internet y el WebSIG tienen un gran potencial para enriquecer la educación geográfica. Se presentan tres experiencias sustentadas en el uso de estas nuevas herramientas en las aulas en los EE.UU. con las conclusiones de carácter didáctico derivadas de cada caso. PALABRAS CLAVE WEBSIG; SIG; enseñanza de la geografía; la nube de Internet; ArcGIS Online; ArcGIS Explorer Desktop (AGX. ABSTRACT There is no doubt among most geography educators that GIS is an important tool for teaching and learning, but its use has been slowed by issues such as the cost of the software and the management of large spatial data files. The move to cloud computingis one trend that is promising for GIS in education. The "cloud" refers to a virtual network that provides many users with access to files, services, and applications. In this article I argue that cloud computing and WebGIS have the potential to transform geography education. I will describe three case studies that make use of these emerging tools in classrooms in the US, and discuss the lessons that we can learn from these cases. KEY WORDS WEBGIS; GIS; cloud computing; ArcGIS Online; ArcGIS Explorer Desktop (AGX. RÉSUMÉ Il n'ya aucun doute parmi les éducateurs les plus géographie que le SIG est un outil important pour l'enseignement et l'apprentissage, mais son utilisation a

  15. Error Checking for Chinese Query by Mining Web Log

    Directory of Open Access Journals (Sweden)

    Jianyong Duan

    2015-01-01

    Full Text Available For the search engine, error-input query is a common phenomenon. This paper uses web log as the training set for the query error checking. Through the n-gram language model that is trained by web log, the queries are analyzed and checked. Some features including query words and their number are introduced into the model. At the same time data smoothing algorithm is used to solve data sparseness problem. It will improve the overall accuracy of the n-gram model. The experimental results show that it is effective.

  16. The effect of query complexity on Web searching results

    Directory of Open Access Journals (Sweden)

    B.J. Jansen

    2000-01-01

    Full Text Available This paper presents findings from a study of the effects of query structure on retrieval by Web search services. Fifteen queries were selected from the transaction log of a major Web search service in simple query form with no advanced operators (e.g., Boolean operators, phrase operators, etc. and submitted to 5 major search engines - Alta Vista, Excite, FAST Search, Infoseek, and Northern Light. The results from these queries became the baseline data. The original 15 queries were then modified using the various search operators supported by each of the 5 search engines for a total of 210 queries. Each of these 210 queries was also submitted to the applicable search service. The results obtained were then compared to the baseline results. A total of 2,768 search results were returned by the set of all queries. In general, increasing the complexity of the queries had little effect on the results with a greater than 70% overlap in results, on average. Implications for the design of Web search services and directions for future research are discussed.

  17. Wild Card Queries for Searching Resources on the Web

    CERN Document Server

    Rafiei, Davood

    2009-01-01

    We propose a domain-independent framework for searching and retrieving facts and relationships within natural language text sources. In this framework, an extraction task over a text collection is expressed as a query that combines text fragments with wild cards, and the query result is a set of facts in the form of unary, binary and general $n$-ary tuples. A significance of our querying mechanism is that, despite being both simple and declarative, it can be applied to a wide range of extraction tasks. A problem in querying natural language text though is that a user-specified query may not retrieve enough exact matches. Unlike term queries which can be relaxed by removing some of the terms (as is done in search engines), removing terms from a wild card query without ruining its meaning is more challenging. Also, any query expansion has the potential to introduce false positives. In this paper, we address the problem of query expansion, and also analyze a few ranking alternatives to score the results and to r...

  18. Web Database Schema Identification through Simple Query Interface

    Science.gov (United States)

    Lin, Ling; Zhou, Lizhu

    Web databases provide different types of query interfaces to access the data records stored in the backend databases. While most existing works exploit a complex query interface with multiple input fields to perform schema identification of the Web databases, little attention has been paid on how to identify the schema of web databases by simple query interface (SQI), which has only one single query text input field. This paper proposes a new method of instance-based query probing to identify WDBs' interface and result schema for SQI. The interface schema identification problem is defined as generating the fullcondition query of SQI and a novel query probing strategy is proposed. The result schema is also identified based on the result webpages of SQI's full-condition query, and an extended identification of the non-query attributes is proposed to improve the attribute recall rate. Experimental results on web databases of online shopping for book, movie and mobile phone show that our method is effective and efficient.

  19. QVIZ: A FRAMEWORK FOR QUERYING AND VISUALIZING DATA

    Energy Technology Data Exchange (ETDEWEB)

    T. KEAHEY; P. MCCORMICK; ET AL

    2000-12-01

    Qviz is a lightweight, modular,and easy to use parallel system for interactive analytical query processing and visual presentation of large datasets. Qviz allows queries of arbitrary complexity to be easily constructed using a specialized scripting language. Visual presentation of the results is also easily achieved via simple scripted and interactive commands to our query-specific visualization tools. This paper describes our initial experiences with the Qviz system for querying and visualizing scientific datasets, showing how Qviz has been used in two different applications: ocean modeling and linear accelerator simulations.

  20. A Grammar Analysis Model for the Unified Multimedia Query Language

    Institute of Scientific and Technical Information of China (English)

    Zhong-Sheng Cao; Zong-Da Wu; Yuan-Zhen Wang

    2008-01-01

    The unified multimedia query language(UMQL) is a powerful general-purpose multimediaquery language, and it is very suitable for multimediainformation retrieval. The paper proposes a grammaranalysis model to implement an effective grammaticalprocessing for the language. It separates the grammaranalysis of a UMQL query specification into two phases:syntactic analysis and semantic analysis, and thenrespectively uses Backus-Naur form (EBNF) and logicalalgebra to specify both restrictive grammar rules. As aresult, the model can present error guiding informationfor a query specification which owns incorrect grammar.The model not only suits well the processing of UMQLqueries, but also has a guiding significance for otherprojects concerning query processings of descriptivequery languages.

  1. AQBE — QBE Style Queries for Archetyped Data

    Science.gov (United States)

    Sachdeva, Shelly; Yaginuma, Daigo; Chu, Wanming; Bhalla, Subhash

    Large-scale adoption of electronic healthcare applications requires semantic interoperability. The new proposals propose an advanced (multi-level) DBMS architecture for repository services for health records of patients. These also require query interfaces at multiple levels and at the level of semi-skilled users. In this regard, a high-level user interface for querying the new form of standardized Electronic Health Records system has been examined in this study. It proposes a step-by-step graphical query interface to allow semi-skilled users to write queries. Its aim is to decrease user effort and communication ambiguities, and increase user friendliness.

  2. Distributed query plan generation using multiobjective genetic algorithm.

    Science.gov (United States)

    Panicker, Shina; Kumar, T V Vijay

    2014-01-01

    A distributed query processing strategy, which is a key performance determinant in accessing distributed databases, aims to minimize the total query processing cost. One way to achieve this is by generating efficient distributed query plans that involve fewer sites for processing a query. In the case of distributed relational databases, the number of possible query plans increases exponentially with respect to the number of relations accessed by the query and the number of sites where these relations reside. Consequently, computing optimal distributed query plans becomes a complex problem. This distributed query plan generation (DQPG) problem has already been addressed using single objective genetic algorithm, where the objective is to minimize the total query processing cost comprising the local processing cost (LPC) and the site-to-site communication cost (CC). In this paper, this DQPG problem is formulated and solved as a biobjective optimization problem with the two objectives being minimize total LPC and minimize total CC. These objectives are simultaneously optimized using a multiobjective genetic algorithm NSGA-II. Experimental comparison of the proposed NSGA-II based DQPG algorithm with the single objective genetic algorithm shows that the former performs comparatively better and converges quickly towards optimal solutions for an observed crossover and mutation probability.

  3. Distributed Query Plan Generation Using Multiobjective Genetic Algorithm

    Directory of Open Access Journals (Sweden)

    Shina Panicker

    2014-01-01

    Full Text Available A distributed query processing strategy, which is a key performance determinant in accessing distributed databases, aims to minimize the total query processing cost. One way to achieve this is by generating efficient distributed query plans that involve fewer sites for processing a query. In the case of distributed relational databases, the number of possible query plans increases exponentially with respect to the number of relations accessed by the query and the number of sites where these relations reside. Consequently, computing optimal distributed query plans becomes a complex problem. This distributed query plan generation (DQPG problem has already been addressed using single objective genetic algorithm, where the objective is to minimize the total query processing cost comprising the local processing cost (LPC and the site-to-site communication cost (CC. In this paper, this DQPG problem is formulated and solved as a biobjective optimization problem with the two objectives being minimize total LPC and minimize total CC. These objectives are simultaneously optimized using a multiobjective genetic algorithm NSGA-II. Experimental comparison of the proposed NSGA-II based DQPG algorithm with the single objective genetic algorithm shows that the former performs comparatively better and converges quickly towards optimal solutions for an observed crossover and mutation probability.

  4. PAQ: Persistent Adaptive Query Middleware for Dynamic Environments

    Science.gov (United States)

    Rajamani, Vasanth; Julien, Christine; Payton, Jamie; Roman, Gruia-Catalin

    Pervasive computing applications often entail continuous monitoring tasks, issuing persistent queries that return continuously updated views of the operational environment. We present PAQ, a middleware that supports applications' needs by approximating a persistent query as a sequence of one-time queries. PAQ introduces an integration strategy abstraction that allows composition of one-time query responses into streams representing sophisticated spatio-temporal phenomena of interest. A distinguishing feature of our middleware is the realization that the suitability of a persistent query's result is a function of the application's tolerance for accuracy weighed against the associated overhead costs. In PAQ, programmers can specify an inquiry strategy that dictates how information is gathered. Since network dynamics impact the suitability of a particular inquiry strategy, PAQ associates an introspection strategy with a persistent query, that evaluates the quality of the query's results. The result of introspection can trigger application-defined adaptation strategies that alter the nature of the query. PAQ's simple API makes developing adaptive querying systems easily realizable. We present the key abstractions, describe their implementations, and demonstrate the middleware's usefulness through application examples and evaluation.

  5. ODQ: A Fluid Office Document Query Language

    Directory of Open Access Journals (Sweden)

    Xuhong Liu

    2015-06-01

    Full Text Available Fluid office documents, as semi-structured data often represented by Extensible Markup Language (XML are important parts of Big Data. These office documents have different formats, and their matching Application Programming Interfaces (APIs depend on developing platform and versions, which causes difficulty in custom development and information retrieval from them. To solve this problem, we have been developing an office document query (ODQ language which provides a uniform method to retrieve content from documents with different formats and versions. ODQ builds common document model ontology to conceal the format details of documents and provides a uniform operation interface to handle office documents with different formats. The results show that ODQ has advantages in format independence, and can facilitate users in developing documents processing systems with good interoperability.

  6. SIG et risques naturels: le glissement de terrain de Séchilienne (Isère

    Directory of Open Access Journals (Sweden)

    Jean-Pierre ASTÉ

    1993-12-01

    Full Text Available À Séchilienne (massif de l’Oisans s’est produit un glissement de terrain qui, selon les experts, peut devenir glissement de versant entier. Leurs scénarios ont été traduits cartographiquement en un SIG qui, malgré ses limites, constitue un premier outil de prise de conscience et d’aide à la décision.

  7. At Forberede sig til den mundtlige prøve ( kap12)

    DEFF Research Database (Denmark)

    Nørgaard, Britta Kusk

    2015-01-01

    Et opslagsværk for studerende til støtte for bacheloropgaveskrivning. Klæd dine studerende på til at skrive den store afsluttende opgave. De studerende får med denne bog et gedigent opslagsværk, som giver svar på alle de spørgsmål, det typisk trænger sig på, når bacheloropgaven skal påbegyndes. B...

  8. Rapport fra projektet ”At skrive sig til læsning”

    DEFF Research Database (Denmark)

    Labuz, Nadia; Bundsgaard, Jeppe; Kjertmann, Kjeld;

    Projektet At skrive sig til læsning er et Videnkupon-projekt finansieret af Forsknings- og Innovationsstyrelsen og firmaet Jamus. Projektet havde til formål at bidrage til viden om og kvalificere udviklingen af en app til iPad, som har til formål at støtte børn i deres skrive- og læseudvikling...

  9. 悍枪中的贵族——SIG-552

    Institute of Scientific and Technical Information of China (English)

    晓晓; 一平(画)

    2009-01-01

    呀!我们手中昂贵的冲锋抢,比起对面敌人的这款武器来,竟然毫无优势!不仅威力差了一大截,甚至连平时引以为豪的精度也远不如人。咦!敌人的这款武器原来是SIG-552啊!

  10. Ontological Matchmaking in Recommender Systems

    CERN Document Server

    Bonifati, Angela; Sileo, Domenica; Summa, Gianvito

    2010-01-01

    The electronic marketplace offers great potential for the recommendation of supplies. In the so called recommender systems, it is crucial to apply matchmaking strategies that faithfully satisfy the predicates specified in the demand, and take into account as much as possible the user preferences. We focus on real-life ontology-driven matchmaking scenarios and identify a number of challenges, being inspired by such scenarios. A key challenge is that of presenting the results to the users in an understandable and clear-cut fashion in order to facilitate the analysis of the results. Indeed, such scenarios evoke the opportunity to rank and group the results according to specific criteria. A further challenge consists of presenting the results to the user in an asynchronous fashion, i.e. the 'push' mode, along with the 'pull' mode, in which the user explicitly issues a query, and displays the results. Moreover, an important issue to consider in real-life cases is the possibility of submitting a query to multiple p...

  11. SigMol: repertoire of quorum sensing signaling molecules in prokaryotes.

    Science.gov (United States)

    Rajput, Akanksha; Kaur, Karambir; Kumar, Manoj

    2016-01-04

    Quorum sensing is a widespread phenomenon in prokaryotes that helps them to communicate among themselves and with eukaryotes. It is driven through quorum sensing signaling molecules (QSSMs) in a density dependent manner that assists in numerous biological functions like biofilm formation, virulence factors secretion, swarming motility, bioluminescence, etc. Despite immense implications, dedicated resources of QSSMs are lacking. Therefore, we have developed SigMol (http://bioinfo.imtech.res.in/manojk/sigmol), a specialized repository of these molecules in prokaryotes. SigMol harbors information on QSSMs pertaining to different quorum sensing signaling systems namely acylated homoserine lactones (AHLs), diketopiperazines (DKPs), 4-hydroxy-2-alkylquinolines (HAQs), diffusible signal factors (DSFs), autoinducer-2 (AI-2) and others. Database contains 1382: entries of 182: unique signaling molecules from 215: organisms. It encompasses biological as well as chemical aspects of signaling molecules. Biological information includes genes, preliminary bioassays, identification assays and applications, while chemical detail comprises of IUPAC name, SMILES and structure. We have provided user-friendly browsing and searching facilities for easy data retrieval and comparison. We have gleaned information of diverse QSSMs reported in literature at a single platform 'SigMol'. This comprehensive resource will assist the scientific community in understanding intraspecies, interspecies or interkingdom networking and further help to unfold different facets of quorum sensing and related therapeutics.

  12. SigMate: a Matlab-based automated tool for extracellular neuronal signal processing and analysis.

    Science.gov (United States)

    Mahmud, Mufti; Bertoldo, Alessandra; Girardi, Stefano; Maschietto, Marta; Vassanelli, Stefano

    2012-05-30

    Rapid advances in neuronal probe technology for multisite recording of brain activity have posed a significant challenge to neuroscientists for processing and analyzing the recorded signals. To be able to infer meaningful conclusions quickly and accurately from large datasets, automated and sophisticated signal processing and analysis tools are required. This paper presents a Matlab-based novel tool, "SigMate", incorporating standard methods to analyze spikes and EEG signals, and in-house solutions for local field potentials (LFPs) analysis. Available modules at present are - 1. In-house developed algorithms for: data display (2D and 3D), file operations (file splitting, file concatenation, and file column rearranging), baseline correction, slow stimulus artifact removal, noise characterization and signal quality assessment, current source density (CSD) analysis, latency estimation from LFPs and CSDs, determination of cortical layer activation order using LFPs and CSDs, and single LFP clustering; 2. Existing modules: spike detection, sorting and spike train analysis, and EEG signal analysis. SigMate has the flexibility of analyzing multichannel signals as well as signals from multiple recording sources. The in-house developed tools for LFP analysis have been extensively tested with signals recorded using standard extracellular recording electrode, and planar and implantable multi transistor array (MTA) based neural probes. SigMate will be disseminated shortly to the neuroscience community under the open-source GNU-General Public License.

  13. SigMol: repertoire of quorum sensing signaling molecules in prokaryotes

    Science.gov (United States)

    Rajput, Akanksha; Kaur, Karambir; Kumar, Manoj

    2016-01-01

    Quorum sensing is a widespread phenomenon in prokaryotes that helps them to communicate among themselves and with eukaryotes. It is driven through quorum sensing signaling molecules (QSSMs) in a density dependent manner that assists in numerous biological functions like biofilm formation, virulence factors secretion, swarming motility, bioluminescence, etc. Despite immense implications, dedicated resources of QSSMs are lacking. Therefore, we have developed SigMol (http://bioinfo.imtech.res.in/manojk/sigmol), a specialized repository of these molecules in prokaryotes. SigMol harbors information on QSSMs pertaining to different quorum sensing signaling systems namely acylated homoserine lactones (AHLs), diketopiperazines (DKPs), 4-hydroxy-2-alkylquinolines (HAQs), diffusible signal factors (DSFs), autoinducer-2 (AI-2) and others. Database contains 1382 entries of 182 unique signaling molecules from 215 organisms. It encompasses biological as well as chemical aspects of signaling molecules. Biological information includes genes, preliminary bioassays, identification assays and applications, while chemical detail comprises of IUPAC name, SMILES and structure. We have provided user-friendly browsing and searching facilities for easy data retrieval and comparison. We have gleaned information of diverse QSSMs reported in literature at a single platform ‘SigMol’. This comprehensive resource will assist the scientific community in understanding intraspecies, interspecies or interkingdom networking and further help to unfold different facets of quorum sensing and related therapeutics. PMID:26490957

  14. Identification and preliminary characterization of a SigB regulated small non-coding RNA in Listeria monocytogenes

    DEFF Research Database (Denmark)

    Nielsen, Jesper Sejrup; Olsen, Anders Steno; Bonde, Mette;

    by an alternative sigma factor. Using this approach we have successfully identified a novel sRNA of ~75 nucleotides in L. monocytogenes that is specifically regulated by SigB. This sRNA, which we have termed SigB1 is expressed from the 3'-UTR of a large operon. SigB1 is expressed in a SigB dependant manner only...... in response to known SigB associated stresses such as salt- and ethanol-stress or entrance into stationary phase. Using transcriptional promoter-lacZ reporter assays, we have confirmed that SigB1 is not the result of an RNA-processing event. Interestingly, SigB1 does not contain any obvious Hfq binding sites...

  15. Recommender Systems using Graph Theory

    Directory of Open Access Journals (Sweden)

    Vishal Venkatraman

    2013-08-01

    Full Text Available Recommender systems have become one of the important tools in E-Commerce. They combine the ratings of services or products by one user with the ratings from other users to answer similar interest queries with predictions and suggestions. The users thus receive anonymous recommendations from people similar interests. Even though this process seems unobjectionable, it aggregates user preferences, which can be tapped to recognise information about a particular user. Users who rate products or services across different types or domains in the systems are the major victims for this exploitation. We could determine the advantages and risks by performing a detailed analysis with a particular recommendation algorithm, but it would be difficult to draw general conclusions from this approach. In this paper, we aim for an algorithm independent analysis by applying a graph-theoretic model. By employing this model, we show that a user benefits most from recommendations based on similarity between the various products rated by the users. This paper tries to draw a graph through the various items rated by the users and finds the items that are most common among the user and his friends which is then recommended to him.

  16. The Future of UV-Visible Astronomy from Space - the NASA COPAG SIG

    Science.gov (United States)

    Scowen, Paul

    2015-08-01

    The ultraviolet (92-320nm) and visible (320-1000nm) (UVV) regions of the spectrum contain a vital suite of diagnostic lines that can be used to study diverse astronomical objects and phenomena that shape and energize the interstellar medium. It is a critical spectral range for tracing the physics of interstellar and intergalactic gas, the ionization of nebulae, the properties of shocks, the atmospheres and winds of hot stars, energy transfer between galaxies and their surrounding environments, and the engines of active galactic nuclei. This spectral range contains diagnostics that measure gas density, electron temperature, and energy balance between various modes of cooling. It is an unfortunate truth that many, if not most, of these diagnostics can only be observed outside the Earth’s atmosphere, requiring facilities in space. Space-based observations also provide access to diffraction-limited optical performance to achieve high spatial resolution. Such spatial resolutions cannot currently be achieved from the ground over wide fields, a capability that many science programs need for sampling and survey work.In order to provide continuing access in the future, new space-based missions will be needed to provide the core imaging and spectroscopic information in this important part of the electromagnetic spectrum. The technology that enables such access has been a high priority in technology development plans that have been developed by both the Cosmic Origins Program Office and Astrophysics Division at NASA, but a holistic approach to considering what is needed for a long-term technology roadmap has not yet been discussed widely within the community. This UVV Science Interest Group [SIG #2] has been established to collect community input and define long-term Cosmic Origins science objectives of the UVV astronomy community that can be addressed by space-based observations. The SIG facilitates communication to merge the needs and desires of the science community with

  17. Mining the SDSS SkyServer SQL queries log

    Science.gov (United States)

    Hirota, Vitor M.; Santos, Rafael; Raddick, Jordan; Thakar, Ani

    2016-05-01

    SkyServer, the Internet portal for the Sloan Digital Sky Survey (SDSS) astronomic catalog, provides a set of tools that allows data access for astronomers and scientific education. One of SkyServer data access interfaces allows users to enter ad-hoc SQL statements to query the catalog. SkyServer also presents some template queries that can be used as basis for more complex queries. This interface has logged over 330 million queries submitted since 2001. It is expected that analysis of this data can be used to investigate usage patterns, identify potential new classes of queries, find similar queries, etc. and to shed some light on how users interact with the Sloan Digital Sky Survey data and how scientists have adopted the new paradigm of e-Science, which could in turn lead to enhancements on the user interfaces and experience in general. In this paper we review some approaches to SQL query mining, apply the traditional techniques used in the literature and present lessons learned, namely, that the general text mining approach for feature extraction and clustering does not seem to be adequate for this type of data, and, most importantly, we find that this type of analysis can result in very different queries being clustered together.

  18. Classifying queries submitted to a vertical search engine

    NARCIS (Netherlands)

    Berendsen, R.; Kovachev, B.; Meij, E.; de Rijke, M.; Weerkamp, W.

    2011-01-01

    We propose and motivate a scheme for classifying queries submitted to a people search engine. We specify a number of features for automatically classifying people queries into the proposed classes and examine the eectiveness of these features. Our main nding is that classication is feasible and that

  19. The Acoi Algebra: a Query Algebra for Image Retrieval Systems

    NARCIS (Netherlands)

    Nes, N.J.; Kersten, M.L.

    1998-01-01

    Content-based image retrieval systems rely on a query-by-example technique often using a limited set of global image features. This leads to a rather coarse-grain approach to locate images. The next step is to concentrate on queries over spatial relations amongst objects within the images. This call

  20. Group-by Skyline Query Processing in Relational Engines

    DEFF Research Database (Denmark)

    Yiu, Man Lung; Luk, Ming-Hay; Lo, Eric

    2009-01-01

    the missing cost model for the BBS algorithm. Experimental results show that our techniques are able to devise the best query plans for a variety of group-by skyline queries. Our focus is on algorithms that can be directly implemented in today's commercial database systems without the addition of new access...

  1. Ontology Based Queries - Investigating a Natural Language Interface

    NARCIS (Netherlands)

    van der Sluis, Ielka; Hielkema, F.; Mellish, C.; Doherty, G.

    2010-01-01

    In this paper we look at what may be learned from a comparative study examining non-technical users with a background in social science browsing and querying metadata. Four query tasks were carried out with a natural language interface and with an interface that uses a web paradigm with hyperlinks.

  2. Dynamic Query Optimization Approach for Semantic Database Grid

    Institute of Scientific and Technical Information of China (English)

    Xiao-Qing Zheng; Hua-Jun Chen; Zhao-Hui Wu; Yu-Xin Mao

    2006-01-01

    Fundamentally, semantic grid database is about bringing globally distributed databases together in order to coordinate resource sharing and problem solving in which information is given well-defined meaning, and DartGrid Ⅱ is the implemented database gird system whose goal is to provide a semantic solution for integrating database resources on the Web.Although many algorithms have been proposed for optimizing query-processing in order to minimize costs and/or response time, associated with obtaining the answer to query in a distributed database system, database grid query optimization problem is fundamentally different from traditional distributed query optimization. These differences are shown to be the consequences of autonomy and heterogeneity of database nodes in database grid. Therefore, more challenges have arisen for query optimization in database grid than traditional distributed database. Following this observation, the design of a query optimizer in DartGrid Ⅱ is presented, and a heuristic, dynamic and parallel query optimization approach to processing query in database grid is proposed. A set of semantic tools supporting relational database integration and semantic-based information browsing has also been implemented to realize the above vision.

  3. Multidimensional indexing structure for use with linear optimization queries

    Science.gov (United States)

    Bergman, Lawrence David (Inventor); Castelli, Vittorio (Inventor); Chang, Yuan-Chi (Inventor); Li, Chung-Sheng (Inventor); Smith, John Richard (Inventor)

    2002-01-01

    Linear optimization queries, which usually arise in various decision support and resource planning applications, are queries that retrieve top N data records (where N is an integer greater than zero) which satisfy a specific optimization criterion. The optimization criterion is to either maximize or minimize a linear equation. The coefficients of the linear equation are given at query time. Methods and apparatus are disclosed for constructing, maintaining and utilizing a multidimensional indexing structure of database records to improve the execution speed of linear optimization queries. Database records with numerical attributes are organized into a number of layers and each layer represents a geometric structure called convex hull. Such linear optimization queries are processed by searching from the outer-most layer of this multi-layer indexing structure inwards. At least one record per layer will satisfy the query criterion and the number of layers needed to be searched depends on the spatial distribution of records, the query-issued linear coefficients, and N, the number of records to be returned. When N is small compared to the total size of the database, answering the query typically requires searching only a small fraction of all relevant records, resulting in a tremendous speedup as compared to linearly scanning the entire dataset.

  4. Persuasive Elements of 100 Successful Magazine Query Letters.

    Science.gov (United States)

    Jolliffe, Lee

    Building from scholarly works on persuasion and compliance-gaining, a study investigated magazine query letters that attempt to persuade an editor to buy the article offered, examining what message elements make them successful. Forty magazine editors provided copies of 100 recently accepted magazine query letters, which were compared with 50…

  5. Time-sensitive personalized query auto-completion

    NARCIS (Netherlands)

    Cai, F.; Liang, S.; de Rijke, M.; Li, J.; Wang, X.S.

    2014-01-01

    Query auto-completion (QAC) is a prominent feature of modern search engines. It is aimed at saving user's time and enhancing the search experience. Current QAC models mostly rank matching QAC candidates according to their past popularity, i.e., frequency. However, query popularity changes over time

  6. The Imposed Query: Implications for Library Service Evaluation.

    Science.gov (United States)

    Gross, Melissa

    1998-01-01

    Explores the potential impact of imposed query, a new model of information-seeking behavior, on current approaches to library service and system evaluation. Discusses reference service evaluation, user studies, output measures, and relevance as an evaluation tool. Argues that imposed query broadens understanding of the user and of the role that…

  7. Real SQL queries 50 challenges : practice for reporting and analysis

    CERN Document Server

    Cohen, Brian; Mishra, Neerja

    2015-01-01

    Queries improve when challenges are authentic. This book sets your learning on the fast track with realistic problems to solve. Topics span sales, marketing, human resources, purchasing, and production. Real SQL Queries: 50 Challenges is perfect for analysts, report writers, or anyone searching for a hands-on approach to learning SQL Server.

  8. Topology-free querying of protein interaction networks.

    Science.gov (United States)

    Bruckner, Sharon; Hüffner, Falk; Karp, Richard M; Shamir, Ron; Sharan, Roded

    2010-03-01

    In the network querying problem, one is given a protein complex or pathway of species A and a protein-protein interaction network of species B; the goal is to identify subnetworks of B that are similar to the query in terms of sequence, topology, or both. Existing approaches mostly depend on knowledge of the interaction topology of the query in the network of species A; however, in practice, this topology is often not known. To address this problem, we develop a topology-free querying algorithm, which we call Torque. Given a query, represented as a set of proteins, Torque seeks a matching set of proteins that are sequence-similar to the query proteins and span a connected region of the network, while allowing both insertions and deletions. The algorithm uses alternatively dynamic programming and integer linear programming for the search task. We test Torque with queries from yeast, fly, and human, where we compare it to the QNet topology-based approach, and with queries from less studied species, where only topology-free algorithms apply. Torque detects many more matches than QNet, while giving results that are highly functionally coherent.

  9. Low Redundancy in Static Dictionaries with Constant Query Time

    DEFF Research Database (Denmark)

    Pagh, Rasmus

    2001-01-01

    A static dictionary is a data structure for storing subsets of a finite universe U, so that membership queries can be answered efficiently. We study this problem in a unit cost RAM model with word size Ω(log |U|), and show that for n-element subsets, constant worst case query time can be obtained...

  10. Query Classification and Study of University Students' Search Trends

    Science.gov (United States)

    Maabreh, Majdi A.; Al-Kabi, Mohammed N.; Alsmadi, Izzat M.

    2012-01-01

    Purpose: This study is an attempt to develop an automatic identification method for Arabic web queries and divide them into several query types using data mining. In addition, it seeks to evaluate the impact of the academic environment on using the internet. Design/methodology/approach: The web log files were collected from one of the higher…

  11. On the query complexity of finding a local maximum point

    NARCIS (Netherlands)

    Rastsvelaev, A.L.; Beklemishev, L.D.

    2008-01-01

    We calculate the minimal number of queries sufficient to find a local maximum point of a functiun on a discrete interval for a model with M parallel queries, M≥1. Matching upper and lower bounds are obtained. The bounds are formulated in terms of certain Fibonacci type sequences of numbers.

  12. On the Suitability of Skyline Queries for Data Exploration

    DEFF Research Database (Denmark)

    Chester, Sean; Mortensen, Michael Lind; Assent, Ira

    2014-01-01

    The skyline operator has been studied in database research for multi-criteria decision making. Until now the focus has been on the efficiency or accuracy of single queries. In practice, however, users are increasingly confronted with unknown data collections, where precise query formulation prove...

  13. How many functions can be distinguished with k quantum queries?

    CERN Document Server

    Farhi, E; Gutmann, S; Sipser, M

    1999-01-01

    Suppose an oracle is known to hold one of a given set of D two-valued functions. To successfully identify which function the oracle holds with k classical queries, it must be the case that D is at most 2^k. In this paper we derive a bound for how many functions can be distinguished with k quantum queries.

  14. Approximate furthest neighbor with application to annulus query

    DEFF Research Database (Denmark)

    Pagh, Rasmus; Silvestri, Francesco; Sivertsen, Johan von Tangen

    2016-01-01

    -dimensional Euclidean space. The method builds on the technique of Indyk (SODA 2003), storing random projections to provide sublinear query time for AFN. However, we introduce a different query algorithm, improving on Indyk׳s approximation factor and reducing the running time by a logarithmic factor. We also present...

  15. Pareto-depth for multiple-query image retrieval.

    Science.gov (United States)

    Hsiao, Ko-Jen; Calder, Jeff; Hero, Alfred O

    2015-02-01

    Most content-based image retrieval systems consider either one single query, or multiple queries that include the same object or represent the same semantic information. In this paper, we consider the content-based image retrieval problem for multiple query images corresponding to different image semantics. We propose a novel multiple-query information retrieval algorithm that combines the Pareto front method with efficient manifold ranking. We show that our proposed algorithm outperforms state of the art multiple-query retrieval algorithms on real-world image databases. We attribute this performance improvement to concavity properties of the Pareto fronts, and prove a theoretical result that characterizes the asymptotic concavity of the fronts.

  16. Evaluation of Query Generators for Entity Search Engines

    CERN Document Server

    Endrullis, Stefan; Rahm, Erhard

    2010-01-01

    Dynamic web applications such as mashups need efficient access to web data that is only accessible via entity search engines (e.g. product or publication search engines). However, most current mashup systems and applications only support simple keyword searches for retrieving data from search engines. We propose the use of more powerful search strategies building on so-called query generators. For a given set of entities query generators are able to automatically determine a set of search queries to retrieve these entities from an entity search engine. We demonstrate the usefulness of query generators for on-demand web data integration and evaluate the effectiveness and efficiency of query generators for a challenging real-world integration scenario.

  17. Multiple k Nearest Neighbor Query Processing in Spatial Network Databases

    DEFF Research Database (Denmark)

    Xuegang, Huang; Jensen, Christian Søndergaard; Saltenis, Simonas

    2006-01-01

    This paper concerns the efficient processing of multiple k nearest neighbor queries in a road-network setting. The assumed setting covers a range of scenarios such as the one where a large population of mobile service users that are constrained to a road network issue nearest-neighbor queries...... for points of interest that are accessible via the road network. Given multiple k nearest neighbor queries, the paper proposes progressive techniques that selectively cache query results in main memory and subsequently reuse these for query processing. The paper initially proposes techniques for the case...... where an upper bound on k is known a priori and then extends the techniques to the case where this is not so. Based on empirical studies with real-world data, the paper offers insight into the circumstances under which the different proposed techniques can be used with advantage for multiple k nearest...

  18. Cross Lingual Information Retrieval With SMT And Query Mining

    Directory of Open Access Journals (Sweden)

    Suneet Kumar Gupta

    2011-10-01

    Full Text Available In this paper, we have taken the English Corpus and Queries, both translated and transliterated form. We use Statistical Machine Translator to find the result under translated and transliterated queries and then analyzed the result. These queries wise results can then be undergone mining and therefore a new list of queries is created. We have design an experimental setup followed by various steps which calculate Mean Average Precision. We have taken assistance ship of Terrier Open Source for the Information Retrieval. On the basis of created new query list, we calculate the Mean Average Precision and find a significant result i.e. 93.24% which is very close to monolingual results calculated for English language.

  19. A new approach to query expansion in information retrieval

    Institute of Scientific and Technical Information of China (English)

    Li Weijiang; Zhao Tiejun; Wang Xiangang

    2008-01-01

    To eliminate the mismatch between words of relevant documents and user's query and more serious negative effects it has on the performance of information retrieval,a method of query expansion on the basis of new terms co-occurrence representation was put forward by analyzing the process of producing query. The expansion terms were selected according to their correlation to the whole query. At the same time, the position information between terms were considered. The experimental result on test retrieval conference (TREC) data collection shows that the method proposed in the paper has made an improvement of 5%~19% all the time than the language modeling method without expansion. Compared to the popular approach of query expansion, pseudo feedback, the precision of the proposed method is competitive.

  20. Query Intent Disambiguation of Keyword-Based Semantic Entity Search in Dataspaces

    Institute of Scientific and Technical Information of China (English)

    Dan Yang; De-Rong Shen; Ge Yu; Yue Kou; Tie-Zheng Nie

    2013-01-01

    Keyword query has attracted much research attention due to its simplicity and wide applications.The inherent ambiguity of keyword query is prone to unsatisfied query results.Moreover some existing techniques on Web query,keyword query in relational databases and XML databases cannot be completely applied to keyword query in dataspaces.So we propose KeymanticES,a novel keyword-based semantic entity search mechanism in dataspaces which combines both keyword query and semantic query features.And we focus on query intent disambiguation problem and propose a novel three-step approach to resolve it.Extensive experimental results show the effectiveness and correctness of our proposed approach.

  1. Query-Driven Visualization and Analysis

    Energy Technology Data Exchange (ETDEWEB)

    Ruebel, Oliver; Bethel, E. Wes; Prabhat, Mr.; Wu, Kesheng

    2012-11-01

    This report focuses on an approach to high performance visualization and analysis, termed query-driven visualization and analysis (QDV). QDV aims to reduce the amount of data that needs to be processed by the visualization, analysis, and rendering pipelines. The goal of the data reduction process is to separate out data that is "scientifically interesting'' and to focus visualization, analysis, and rendering on that interesting subset. The premise is that for any given visualization or analysis task, the data subset of interest is much smaller than the larger, complete data set. This strategy---extracting smaller data subsets of interest and focusing of the visualization processing on these subsets---is complementary to the approach of increasing the capacity of the visualization, analysis, and rendering pipelines through parallelism. This report discusses the fundamental concepts in QDV, their relationship to different stages in the visualization and analysis pipelines, and presents QDV's application to problems in diverse areas, ranging from forensic cybersecurity to high energy physics.

  2. Query by image example: The CANDID approach

    Energy Technology Data Exchange (ETDEWEB)

    Kelly, P.M.; Cannon, M. [Los Alamos National Lab., NM (United States). Computer Research and Applications Group; Hush, D.R. [Univ. of New Mexico, Albuquerque, NM (United States). Dept. of Electrical and Computer Engineering

    1995-02-01

    CANDID (Comparison Algorithm for Navigating Digital Image Databases) was developed to enable content-based retrieval of digital imagery from large databases using a query-by-example methodology. A user provides an example image to the system, and images in the database that are similar to that example are retrieved. The development of CANDID was inspired by the N-gram approach to document fingerprinting, where a ``global signature`` is computed for every document in a database and these signatures are compared to one another to determine the similarity between any two documents. CANDID computes a global signature for every image in a database, where the signature is derived from various image features such as localized texture, shape, or color information. A distance between probability density functions of feature vectors is then used to compare signatures. In this paper, the authors present CANDID and highlight two results from their current research: subtracting a ``background`` signature from every signature in a database in an attempt to improve system performance when using inner-product similarity measures, and visualizing the contribution of individual pixels in the matching process. These ideas are applicable to any histogram-based comparison technique.

  3. An Optimal Labeling Scheme for Ancestry Queries

    CERN Document Server

    Fraigniaud, Pierre

    2009-01-01

    An ancestry labeling scheme assigns labels (bit strings) to the nodes of rooted trees such that ancestry queries between any two nodes in a tree can be answered merely by looking at their corresponding labels. The quality of an ancestry labeling scheme is measured by its label size, that is the maximal number of bits in a label of a tree node. In addition to its theoretical appeal, the design of efficient ancestry labeling schemes is motivated by applications in web search engines. For this purpose, even small improvements in the label size are important. In fact, the literature about this topic is interested in the exact label size rather than just its order of magnitude. As a result, following the proposal of a simple interval-based ancestry scheme with label size $2\\log_2 n$ bits (Kannan et al., STOC '88), a considerable amount of work was devoted to improve the bound on the size of a label. The current state of the art upper bound is $\\log_2 n + O(\\sqrt{\\log n})$ bits (Abiteboul et al., SODA '02) which is...

  4. Citing Dynamic Data - Research Data Alliance working group recommendations

    Science.gov (United States)

    Asmi, Ari; Rauber, Andreas; Pröll, Stefan; van Uytvanck, Dieter

    2016-04-01

    Geosciences research data sets are typically dynamic: changing over time as new records are added, errors are corrected and obsolete records are deleted from the data sets. Researchers often use only parts of the data sets or data stream, creating specific subsets tailored to their experiments. In order to keep such experiments reproducible and to share and cite the particular data used in a study, researchers need means of identifying the exact version of a subset as it was used during a specific execution of a workflow, even if the data source is continuously evolving. Some geosciences data services have tried to approach this problem by creating static versions of their data sets, and some have simply ignored this issue. The RDA Working Group on Dynamic Data Citation (WGDC) has instead approached the issue with a set of recommendations based upon versioned data, timestamping and a query based subsetting mechanism. The 14 RDA WGDC recommendations on how to adapt a data source for providing identifiable subsets for the long term are: Preparing the Data and the Query Store R1 - Data Versioning R2 - Timestamping R3 - Query Store Facilities Persistently Identifying Specific Data Sets R4 - Query Uniqueness R5 - Stable Sorting R6 - Result Set Verification R7 - Query Timestamping R8 - Query PID R9 - Store the Query R10 - Automated Citation Texts Resolving PIDs and Retrieving the Data - R11 - Landing Page R12 - Machine Actionability Upon modifications to the Data Infrastructure R13 - Technology Migration R14 - Migration Verification We present a detailed discussion of the recommendations, the rationale behind them, and give examples of how to implement them.

  5. ALGORITMA RC4 DALAM PROTEKSI TRANSMISI DAN HASIL QUERY UNTUK ORDBMS POSTGRESQL

    Directory of Open Access Journals (Sweden)

    Yuri Ariyanto

    2009-01-01

    Full Text Available In this research will be worked through about how cryptography RC4's algorithm implementation in protection to query result and of query, security by encryption and descryption up to both is in network. Implementation of this research which is build software in client that function access databases that is placed by the side of server. Software that building to have facility for encryption and descryption query result and of query that is sent from client goes to server and. transmission query result and of query can secure its security. Well guaranted transmission security him of query result and of query can be told to succeed if success software can encryption query result and of query which transmission so that in the event of scanning to both, scanning will not understand data content. Conclusion of this research that is woke up software succeed encryption query and result of query which transmission between application of client and of server databases. Abstract in Bahasa Indonesia: Pada penelitian ini dibahas mengenai bagaimana mengimplementasikan algoritma kriptografi RC4 dalam proteksi terhadap query dan hasil query, pengamanan dilakukan dengan cara melakukan enkripsi dan dekripsi selama keduanya berada di dalam jaringan. Pengimplementasian dari penelitian ini yaitu membangun sebuah software yang akan diletakkan di sisi client yang berfungsi mengakses database yang diletakkan di sisi server. Software yang dibangun memiliki fasilitas untuk mengenkripsi dan mendektipsi query dan hasil query yang dikirimkan dari client ke server dan juga sebaliknya. Dengan demikian tramsmisi query dan hasil query dapat terjamin keamanannya.Terjaminnya keamanan transmisi query dan hasil query dapat dikatakan berhasil jika software berhasil mengenkripsi query dan hasil query yang ditransmisikan sehingga apabila terjadi penyadapan terhadap keduanya, penyadap tidak akan mengerti isi data tersebut. Kesimpulan dari penelitian ini yaitu software yang dibangun

  6. Keyword Query over Error-Tolerant Knowledge Bases

    Institute of Scientific and Technical Information of China (English)

    Yu-Rong Cheng; Ye Yuan; Jia-Yu Li; Lei Chen; Guo-Ren Wang

    2016-01-01

    With more and more knowledge provided by WWW, querying and mining the knowledge bases have attracted much research attention. Among all the queries over knowledge bases, which are usually modelled as graphs, a keyword query is the most widely used one. Although the problem of keyword query over graphs has been deeply studied for years, knowledge bases, as special error-tolerant graphs, lead to the results of the traditional defined keyword queries out of users’ satisfaction. Thus, in this paper, we define a new keyword query, called confident r-clique, specific for knowledge bases based on the r-clique definition for keyword query on general graphs, which has been proved to be the best one. However, as we prove in the paper, finding the confident r-cliques is #P-hard. We propose a filtering-and-verification framework to improve the search efficiency. In the filtering phase, we develop the tightest upper bound of the confident r-clique, and design an index together with its search algorithm, which suits the large scale of knowledge bases well. In the verification phase, we develop an efficient sampling method to verify the final answers from the candidates remaining in the filtering phase. Extensive experiments demonstrate that the results derived from our new definition satisfy the users’ requirement better compared with the traditional r-clique definition, and our algorithms are efficient.

  7. An Architecture for Handling Fuzzy Queries in Data Warehouses

    Science.gov (United States)

    Singh, Manu Pratap; Tiwari, Rajdev; Mahajan, Manish; Dani, Diksha

    This paper presents an augmented architecture of Data Warehouse for fuzzy query handling to improve the performance of Data Mining process. The performance of Data Mining may become worst while mining the fuzzy information from the large Data Warehouses. There are number of preprocessing steps suggested and implemented so far to support the mining process. But querying large Data warehouses for fuzzy information is still a challenging task for the researchers’ community. The model proposed here may provide a more realistic and powerful technique for handling the vague queries directly. The basic idea behind the creation of Data Warehouses is to integrate a large amount of pre-fetched data and information from the distributed sources for direct querying and analysis .But the end user’s queries contain the maximum fuzziness and to handle those queries directly may not yield the desired response. So the model proposed here will create a fuzzy extension of Data warehouse by applying Neuro-Fuzzy technique and the fuzzy queries then will get handled directly by the extension of data warehouse.

  8. Research in Mobile Database Query Optimization and Processing

    Directory of Open Access Journals (Sweden)

    Agustinus Borgy Waluyo

    2005-01-01

    Full Text Available The emergence of mobile computing provides the ability to access information at any time and place. However, as mobile computing environments have inherent factors like power, storage, asymmetric communication cost, and bandwidth limitations, efficient query processing and minimum query response time are definitely of great interest. This survey groups a variety of query optimization and processing mechanisms in mobile databases into two main categories, namely: (i query processing strategy, and (ii caching management strategy. Query processing includes both pull and push operations (broadcast mechanisms. We further classify push operation into on-demand broadcast and periodic broadcast. Push operation (on-demand broadcast relates to designing techniques that enable the server to accommodate multiple requests so that the request can be processed efficiently. Push operation (periodic broadcast corresponds to data dissemination strategies. In this scheme, several techniques to improve the query performance by broadcasting data to a population of mobile users are described. A caching management strategy defines a number of methods for maintaining cached data items in clients' local storage. This strategy considers critical caching issues such as caching granularity, caching coherence strategy and caching replacement policy. Finally, this survey concludes with several open issues relating to mobile query optimization and processing strategy.

  9. jQuery UI 1.7 the user interface library for jQuery

    CERN Document Server

    Wellman, Dan

    2009-01-01

    An example-based approach leads you step-by-step through the implementation and customization of each library component and its associated resources in turn. To emphasize the way that jQuery UI takes the difficulty out of user interface design and implementation, each chapter ends with a 'fun with' section that puts together what you've learned throughout the chapter to make a usable and fun page. In these sections you'll often get to experiment with the latest associated technologies like AJAX and JSON. This book is for front-end designers and developers who need to quickly learn how to use t

  10. jQuery 2.0 animation techniques beginner's guide

    CERN Document Server

    Culpepper, Adam

    2013-01-01

    This book is a guide to help you create attractive web page animations using jQuery. Written in a friendly and engaging approach this book is designed to be placed alongside your computer as a mentor.If you are a web designer or a frontend developer or if you want to learn how to animate the user interface of your web applications with jQuery, this book is for you. Experience with jQuery or Javascript would be helpful but solid knowledge base of HTML and CSS is assumed.

  11. Intelligent query processing for semantic mediation of information systems

    Directory of Open Access Journals (Sweden)

    Saber Benharzallah

    2011-11-01

    Full Text Available We propose an intelligent and an efficient query processing approach for semantic mediation of information systems. We propose also a generic multi agent architecture that supports our approach. Our approach focuses on the exploitation of intelligent agents for query reformulation and the use of a new technology for the semantic representation. The algorithm is self-adapted to the changes of the environment, offers a wide aptitude and solves the various data conflicts in a dynamic way; it also reformulates the query using the schema mediation method for the discovered systems and the context mediation for the other systems.

  12. Joint Top-K Spatial Keyword Query Processing

    DEFF Research Database (Denmark)

    Wu, Dingming; Yiu, Man Lung; Cong, Gao

    2012-01-01

    Web users and content are increasingly being geopositioned, and increased focus is being given to serving local content in response to web queries. This development calls for spatial keyword queries that take into account both the locations and textual descriptions of content. We study...... keyword queries. Empirical studies show that the proposed solution is efficient on real data sets. We also offer analytical studies on synthetic data sets to demonstrate the efficiency of the proposed solution. Index Terms IEEE Terms Electronic mail , Google , Indexes , Joints , Mobile communication...

  13. Design of Intelligent layer for flexible querying in databases

    CERN Document Server

    Nihalani, Mrs Neelu; Motwani, Dr Mahesh

    2009-01-01

    Computer-based information technologies have been extensively used to help many organizations, private companies, and academic and education institutions manage their processes and information systems hereby become their nervous centre. The explosion of massive data sets created by businesses, science and governments necessitates intelligent and more powerful computing paradigms so that users can benefit from this data. Therefore most new-generation database applications demand intelligent information management to enhance efficient interactions between database and the users. Database systems support only a Boolean query model. A selection query on SQL database returns all those tuples that satisfy the conditions in the query.

  14. Processing Constrained K Closest Pairs Query in Spatial Databases

    Institute of Scientific and Technical Information of China (English)

    LIU Xiaofeng; LIU Yunsheng; XIAO Yingyuan

    2006-01-01

    In this paper, constrained K closest pairs query is introduced, which retrieves the K closest pairs satisfying the given spatial constraint from two datasets. For data sets indexed by R-trees in spatial databases, three algorithms are presented for answering this kind of query. Among of them,two-phase Range+Join and Join+Range algorithms adopt the strategy that changes the execution order of range and closest pairs queries, and constrained heap-based algorithm utilizes extended distance functions to prune search space and minimize the pruning distance. Experimental results show that constrained heap-base algorithm has better applicability and performance than two-phase algorithms.

  15. Materialized View Selection by Query Clustering in XML Data Warehouses

    CERN Document Server

    Mahboubi, Hadj; Darmont, Jérôme

    2008-01-01

    XML data warehouses form an interesting basis for decision-support applications that exploit complex data. However, native XML database management systems currently bear limited performances and it is necessary to design strategies to optimize them. In this paper, we propose an automatic strategy for the selection of XML materialized views that exploits a data mining technique, more precisely the clustering of the query workload. To validate our strategy, we implemented an XML warehouse modeled along the XCube specifications. We executed a workload of XQuery decision-support queries on this warehouse, with and without using our strategy. Our experimental results demonstrate its efficiency, even when queries are complex.

  16. Preliminary study into query translation for patent retrieval

    DEFF Research Database (Denmark)

    Jochim, C.; Lioma, Christina; Schütze, H.

    2010-01-01

    boundaries do not hinder their accessibility. This multilingual-ity of patent collections offers opportunities for improving patent retrieval. In this work we exploit these opportunities by applying query translation to patent retrieval. We expand monolingual patent queries with their translations, using......, but not always, and without great improvement compared to standard statistical monolingual query expansion (Rocchio). The improvement is greater when the source language is English, as opposed to French or German, a finding partly due to the effect of the complex French and German morphology upon translation...

  17. A sporulation-specific, sigF-dependent protein, SspA, affects septum positioning in Streptomyces coelicolor.

    Science.gov (United States)

    Tzanis, Angelos; Dalton, Kate A; Hesketh, Andrew; den Hengst, Chris D; Buttner, Mark J; Thibessard, Annabelle; Kelemen, Gabriella H

    2014-01-01

    The RNA polymerase sigma factor SigF controls late development during sporulation in the filamentous bacterium Streptomyces coelicolor. The only known SigF-dependent gene identified so far, SCO5321, is found in the biosynthetic cluster encoding spore pigment synthesis. Here we identify the first direct target for SigF, the gene sspA, encoding a sporulation-specific protein. Bioinformatic analysis suggests that SspA is a secreted lipoprotein with two PepSY signature domains. The sspA deletion mutant exhibits irregular sporulation septation and altered spore shape, suggesting that SspA plays a role in septum formation and spore maturation. The fluorescent translational fusion protein SspA-mCherry localized first to septum sites, then subsequently around the surface of the spores. Both SspA protein and sspA transcription are absent from the sigF null mutant. Moreover, in vitro transcription assay confirmed that RNA polymerase holoenzyme containing SigF is sufficient for initiation of transcription from a single sspA promoter. In addition, in vivo and in vitro experiments showed that sspA is a direct target of BldD, which functions to repress sporulation genes, including whiG, ftsZ and ssgB, during vegetative growth, co-ordinating their expression during sporulation septation.

  18. An Object Oriented Approach to Mining Web Graphs for Recommendations

    Directory of Open Access Journals (Sweden)

    T Murali Mohan

    2012-06-01

    Full Text Available Web mining is the application of data mining techniques to extract knowledge from Web. Web mining has been explored to a vast degree and different techniques have been proposed for a variety of applications that includes music, images, books recommendations, query suggestions, etc. In this paper, we highlight the significance of studying the evolving nature of the Web personalization. Web usage mining is used to discover interesting user navigation patterns and can be applied to many real-world problems, such as improving Web sites/pages, making additional topic or product recommendations, user/customer behavior studies, etc. The proposed framework can be utilized in many recommendation tasks on the World Wide Web, including query suggestions, image recommendations, etc. The experimental analysis on large datasets shows the promising future of our work.

  19. Characterization of the SigD regulon of C. difficile and its positive control of toxin production through the regulation of tcdR.

    Directory of Open Access Journals (Sweden)

    Imane El Meouche

    Full Text Available Clostridium difficile intestinal disease is mediated largely by the actions of toxins A (TcdA and B (TcdB, whose production occurs after the initial steps of colonization involving different surface or flagellar proteins. In B. subtilis, the sigma factor SigD controls flagellar synthesis, motility, and vegetative autolysins. A homolog of SigD encoding gene is present in the C.difficile 630 genome. We constructed a sigD mutant in C. difficile 630 ∆erm to analyze the regulon of SigD using a global transcriptomic approach. A total of 103 genes were differentially expressed between the wild-type and the sigD mutant, including genes involved in motility, metabolism and regulation. In addition, the sigD mutant displayed decreased expression of genes involved in flagellar biosynthesis, and also of genes encoding TcdA and TcdB as well as TcdR, the positive regulator of the toxins. Genomic analysis and RACE-PCR experiments allowed us to characterize promoter sequences of direct target genes of SigD including tcdR and to identify the SigD consensus. We then established that SigD positively regulates toxin expression via direct control of tcdR transcription. Interestingly, the overexpression of FlgM, a putative anti-SigD factor, inhibited the positive regulation of motility and toxin synthesis by SigD. Thus, SigD appears to be the first positive regulator of the toxin synthesis in C. difficile.

  20. Human Cell and Tissue Establishment Registration Public Query

    Data.gov (United States)

    U.S. Department of Health & Human Services — This application provides Human Cell and Tissue registration information for registered, inactive, and pre-registered firms. Query options are by Establishment Name,...

  1. Determinacy in Static Analysis of jQuery

    DEFF Research Database (Denmark)

    Andreasen, Esben; Møller, Anders

    2014-01-01

    Static analysis for JavaScript can potentially help programmers find errors early during development. Although much progress has been made on analysis techniques, a major obstacle is the prevalence of libraries, in particular jQuery, which apply programming patterns that have detrimental conseque......Static analysis for JavaScript can potentially help programmers find errors early during development. Although much progress has been made on analysis techniques, a major obstacle is the prevalence of libraries, in particular jQuery, which apply programming patterns that have detrimental...... present a static dataflow analysis for JavaScript that infers and exploits determinacy information on-the-fly, to enable analysis of some of the most complex parts of jQuery. The techniques are implemented in the TAJS analysis tool and evaluated on a collection of small programs that use jQuery. Our...

  2. An introduction to XML query processing and keyword search

    CERN Document Server

    Lu, Jiaheng

    2013-01-01

    This book systematically and comprehensively covers the latest advances in XML data searching. It presents an extensive overview of the current query processing and keyword search techniques on XML data.

  3. An Efficient Data Dissemination Scheme for Spatial Query Processing

    Institute of Scientific and Technical Information of China (English)

    Kwangjin Park; Hyunseung Choo; Chong-Sun Hwang

    2007-01-01

    Due to the personal portable devices and advances in wireless communication technologies, Location Dependent Information Services (LDISs) have received a lot of attention from both the industrial and academic communities.In LDISs,it is important to reduce the query response time, since a late query response may contain out-of-date information.In this paper, we study the issue of LDISs using a Voronoi Diagram.We introduce a new NN search method, called the Exponential Sequence Scheme (ESS), to support NN query processing in periodic broadcast environment.This paper aims to provide research directions towards minimizing both the access latency and energy consumption for the NN-query processing.

  4. An Adaptive Mechanism for Accurate Query Answering under Differential Privacy

    CERN Document Server

    Li, Chao

    2012-01-01

    We propose a novel mechanism for answering sets of count- ing queries under differential privacy. Given a workload of counting queries, the mechanism automatically selects a different set of "strategy" queries to answer privately, using those answers to derive answers to the workload. The main algorithm proposed in this paper approximates the optimal strategy for any workload of linear counting queries. With no cost to the privacy guarantee, the mechanism improves significantly on prior approaches and achieves near-optimal error for many workloads, when applied under (\\epsilon, \\delta)-differential privacy. The result is an adaptive mechanism which can help users achieve good utility without requiring that they reason carefully about the best formulation of their task.

  5. Comparing and Combining Methods for Automatic Query Expansion

    CERN Document Server

    Pérez-Agüera, José R

    2008-01-01

    Query expansion is a well known method to improve the performance of information retrieval systems. In this work we have tested different approaches to extract the candidate query terms from the top ranked documents returned by the first-pass retrieval. One of them is the cooccurrence approach, based on measures of cooccurrence of the candidate and the query terms in the retrieved documents. The other one, the probabilistic approach, is based on the probability distribution of terms in the collection and in the top ranked set. We compare the retrieval improvement achieved by expanding the query with terms obtained with different methods belonging to both approaches. Besides, we have developed a na\\"ive combination of both kinds of method, with which we have obtained results that improve those obtained with any of them separately. This result confirms that the information provided by each approach is of a different nature and, therefore, can be used in a combined manner.

  6. Cluster Analysis and Fuzzy Query in Ship Maintenance and Design

    Science.gov (United States)

    Che, Jianhua; He, Qinming; Zhao, Yinggang; Qian, Feng; Chen, Qi

    Cluster analysis and fuzzy query win wide-spread applications in modern intelligent information processing. In allusion to the features of ship maintenance data, a variant of hypergraph-based clustering algorithm, i.e., Correlation Coefficient-based Minimal Spanning Tree(CC-MST), is proposed to analyze the bulky data rooting in ship maintenance process, discovery the unknown rules and help ship maintainers make a decision on various device fault causes. At the same time, revising or renewing an existed design of ship or device maybe necessary to eliminate those device faults. For the sake of offering ship designers some valuable hints, a fuzzy query mechanism is designed to retrieve the useful information from large-scale complicated and reluctant ship technical and testing data. Finally, two experiments based on a real ship device fault statistical dataset validate the flexibility and efficiency of the CC-MST algorithm. A fuzzy query prototype demonstrates the usability of our fuzzy query mechanism.

  7. Capturing the Meaning of Internet Search Queries by Taxonomy Mapping

    Science.gov (United States)

    Tikk, Domonkos; Kardkovács, Zsolt T.; Bánsághi, Zoltán

    Capturing the meaning of internet search queries can significantly improve the effectiveness of search retrieval. Users often have problem to find relevant answer to their queries, particularly, when the posted query is ambiguous. The orientation of the user can be greatly facilitated, if answers are grouped into topics of a fixed subject taxonomy. In this manner, the original problem can be transformed to the labelling of queries — and consequently, the answers — with the topic names. Thus the original problem is transformed into a classification set-up. This paper introduces our Ferrety algorithm that performs topic assignment, which also works when there is no directly available training data that describes the semantics of the subject taxonomy. The approach is presented via the example of ACM KDD Cup 2005 problem, where Ferrety was awarded for precision and creativity.

  8. Human Cell and Tissue Establishment Registration Public Query

    Data.gov (United States)

    U.S. Department of Health & Human Services — This application provides Human Cell and Tissue registration information for registered, inactive, and pre-registered firms. Query options are by Establishment Name,...

  9. Optimizing Aggregate SPARQL Queries Using Materialized RDF Views

    DEFF Research Database (Denmark)

    Ibragimov, Dilshod; Hose, Katja; Pedersen, Torben Bach;

    2016-01-01

    During recent years, more and more data has been published as native RDF datasets. In this setup, both the size of the datasets and the need to process aggregate queries represent challenges for standard SPARQL query processing techniques. To overcome these limitations, materialized views can......, this paper proposes MARVEL (MAterialized Rdf Views with Entailment and incompLetness). The approach consists of a view selection algorithm based on an associated RDF-specific cost model, a view definition syntax, and an algorithm for rewriting SPARQL queries using materialized RDF views. The experimental...... be created and used as a source of precomputed partial results during query processing. However, materialized view techniques as proposed for relational databases do not support RDF specifics, such as incompleteness and the need to support implicit (derived) information. To overcome these challenges...

  10. Enhanced Distributed Dynamic Skyline Query for Wireless Sensor Networks

    Directory of Open Access Journals (Sweden)

    Khandakar Ahmed

    2016-02-01

    Full Text Available Dynamic skyline query is one of the most popular and significant variants of skyline query in the field of multi-criteria decision-making. However, designing a distributed dynamic skyline query possesses greater challenge, especially for the distributed data centric storage within wireless sensor networks (WSNs. In this paper, a novel Enhanced Distributed Dynamic Skyline (EDDS approach is proposed and implemented in Disk Based Data Centric Storage (DBDCS architecture. DBDCS is an adaptation of magnetic disk storage platter consisting tracks and sectors. In DBDCS, the disc track and sector analogy is used to map data locations. A distance based indexing method is used for storing and querying multi-dimensional similar data. EDDS applies a threshold based hierarchical approach, which uses temporal correlation among sectors and sector segments to calculate a dynamic skyline. The efficiency and effectiveness of EDDS has been evaluated in terms of latency, energy consumption and accuracy through a simulation model developed in Castalia.

  11. External Data Structures for Shortest Path Queries on Planar Digraphs

    DEFF Research Database (Denmark)

    Arge, Lars; Toma, Laura

    2005-01-01

    In this paper we present space-query trade-offs for external memory data structures that answer shortest path queries on planar directed graphs. For any S = Ω(N 1 + ε) and S = O(N2/B), our main result is a family of structures that use S space and answer queries in O(N2/ S B) I/Os, thus obtaining...... optimal space-query product O(N2/B). An S space structure can be constructed in O(√S · sort(N)) I/Os, where sort(N) is the number of I/Os needed to sort N elements, B is the disk block size, and N is the size of the graph....

  12. Queries, Influencers and Vocational Interests of Junior High School Students

    Science.gov (United States)

    Woal, S. Theodore

    1974-01-01

    The study, based on questionnaire results from 207 ninth grade students, investigates: student familiarity with occupations; influencers of their tentative occupational choices; post high school plans; and student queries and concerns pertinent to preparation for entry into a job. (MW)

  13. A Novel Visual Data Mining Module for the Geographical Information System gvSIG

    Directory of Open Access Journals (Sweden)

    Romel Vázquez-Rodríguez

    2013-01-01

    Full Text Available The exploration of large GIS models containing spatio-temporal information is a challenge. In this paper we propose the integration of scientific visualization (ScVis techniques into geographic information systems (GIS as an alternative for the visual analysis of data. Providing GIS with such tools improves the analysis and understanding of datasets with very low spatial density and allows to find correlations between variables in time and space. In this regard, we present a new visual data mining tool for the GIS gvSIG. This tool has been implemented as a gvSIG module and contains several ScVis techniques for multiparameter data with a wide range of possibilities to explore interactively the data. The developed module is a powerful visual data mining and data visualization tool to obtain knowledge from multiple datasets in time and space. A real case study with meteorological data from Villa Clara province (Cuba is presented, where the implemented visualization techniques were used to analyze the available datasets. Although it is tested with meteorological data, the developed module is of general application in the sense that it can be used in multiple application fields related with Earth Sciences.

  14. Sustainability of State-Level Substance Abuse Prevention Infrastructure After the Completion of the SPF SIG.

    Science.gov (United States)

    Edwards, Jessica M; Stein-Seroussi, Al; Flewelling, Robert L; Orwin, Robert G; Zhang, Lei

    2015-06-01

    Recent national substance abuse prevention efforts that have been disseminated at the state level have provided fertile ground for addressing the dearth of systematic research on state-level substance abuse prevention infrastructure. The Strategic Prevention Framework State Incentive Grant Program (SPF SIG), a national public health initiative sponsored by the US Substance Abuse and Mental Health Services Administration and its Center for Substance Abuse Prevention, is one such effort, providing an opportunity to examine state-level substance abuse prevention infrastructure across the country. The aims of the SPF SIG initiative include reducing substance abuse and its related problems, as well as enhancing state and local prevention infrastructure and capacity. In this article, we describe the status of state-level substance abuse prevention infrastructure and capacity 1 year after the first 26 funded states ended their projects, based on follow-up interviews with state prevention decision-makers. We found that, in five of the six prevention domains we measured, prevention infrastructure capacity increased during the 12-month period after the grants ended. The evidence for further SPF capacity development even after the conclusion of the grants suggests that states recognized the benefits of using the SPF and took deliberate steps to sustain and enhance the integration of this framework into their state prevention systems. In addition, the findings suggest that state agencies and organizations can benefit from time-limited resources aimed at increasing their capacity and that such efforts can have a lasting impact on measures of state prevention system capacity.

  15. Implementing an SIG based platform of application and service for city spatial information in Shanghai

    Science.gov (United States)

    Yu, Bailang; Wu, Jianping

    2006-10-01

    Spatial Information Grid (SIG) is an infrastructure that has the ability to provide the services for spatial information according to users' needs by means of collecting, sharing, organizing and processing the massive distributed spatial information resources. This paper presents the architecture, technologies and implementation of the Shanghai City Spatial Information Application and Service System, a SIG based platform, which is an integrated platform that serves for administration, planning, construction and development of the city. In the System, there are ten categories of spatial information resources, including city planning, land-use, real estate, river system, transportation, municipal facility construction, environment protection, sanitation, urban afforestation and basic geographic information data. In addition, spatial information processing services are offered as a means of GIS Web Services. The resources and services are all distributed in different web-based nodes. A single database is created to store the metadata of all the spatial information. A portal site is published as the main user interface of the System. There are three main functions in the portal site. First, users can search the metadata and consequently acquire the distributed data by using the searching results. Second, some spatial processing web applications that developed with GIS Web Services, such as file format conversion, spatial coordinate transfer, cartographic generalization and spatial analysis etc, are offered to use. Third, GIS Web Services currently available in the System can be searched and new ones can be registered. The System has been working efficiently in Shanghai Government Network since 2005.

  16. Rancang Bangun Sistem Informasi Geografis (SIG Pemetaan Pemancar Televisi Digital Terestrial di Indonesia

    Directory of Open Access Journals (Sweden)

    Isna Nur Mahmud

    2015-03-01

    Full Text Available Perubahan dari sistem televisi analog menjadi sistem televisi digital terestrial di Indonesia tinggal menunggu waktu. Namun masih banyak infrastruktur yang masih perlu dibangun untuk menunjang sistem televisi digital terestrial agar dapat beroperasi dengan baik. Belum meratanya sistem pemancar televisi digital terestrial yang keberadaannya masih terbenturnya undang – undang yang berlaku di negara ini menjadi salah satu permasalahannya. Salah satu solusinya adalah memetakannya dalam sebuah SIG. Pemetaan pemancar tv digital terestrial ini dibuat untuk mempermudah KPI  dalam melakukan identifikasi letak pemancar televisi digital terestrial serta memberikan informasi yang berkaitan dengan daya pemancar, spesifikasi pemancar televisi digital terestrial di Indonesia dalam kondisi offline. Dari pengujian didapatkan hasil antara lain, untuk pengujian black-box, didapatkan hasil yang sesuai dengan fungsionalitas sistem. Untuk nilai MOS, kemudahan menu aplikasi 3.9, kemudahan dlm navigasi aplikasi 4.1, kemudahan dlm menggunakan tools 4.05, penilaian tampilan interface 3.952, penilaian keseluruhan aplikasi SIG 4.hasil SUS yang dilakukan didapatkan nilai 65.71

  17. Alternative Development for Data Migration Using Dynamic Query Generation

    Directory of Open Access Journals (Sweden)

    Romero-Ramírez Johan Alfredo

    2016-05-01

    Full Text Available This article presents an ETL (Extract, Transform, Load prototype called Valery as alternative approach to migration process which includes a compiler for dynamic generation of SQL queries. Its main features involve: SQL dynamic generation, set of configuration commands and environment for file uploading. The tests use the Northwind academic database and an individual environment. The model implementation uses flat files and SQL as query language. Finally, there is an analysis of the results obtained.

  18. MOCQL: A Declarative Language for Ad-Hoc Model Querying

    DEFF Research Database (Denmark)

    Störrle, Harald

    2013-01-01

    This paper starts from the observation that existing model query facilities are not easy to use, and are thus not suitable for users without substantial IT/Computer Science background. In an attempt to highlight this issue and explore alternatives, we have created the Model Constraint and Query L...... with MOCQL than when working with OCL. While MOCQL is currently only implemented and validated for the different notations defined by UML, its concepts should be universally applicable....

  19. Study on consistent query answering in inconsistent databases

    Institute of Scientific and Technical Information of China (English)

    XIE Dong; YANG Luming

    2007-01-01

    Consistent query answering is an approach to retrieving consistent answers over databases that might be inconsistent with respect to some given integrity constraints The approach is based on a concept of repair.This paper surveys several recent researches on obtaining consistent information from inconsistent databases,such as the underlying semantic model,a number of approaches to computing consistent query answers and the computational complexity of this problem.Furthermore,the work outlines potential research directions in this area.

  20. Two Dimensional Range Minimum Queries and Fibonacci Lattices

    DEFF Research Database (Denmark)

    Brodal, Gerth Stølting; Davoodi, Pooya; Lewenstein, Moshe;

    2012-01-01

    technique—the discrepancy properties of Fibonacci lattices—we give an indexing data structure for 2D-RMQs that uses O(N/c) bits additional space with O(clogc(loglogc)2) query time, for any parameter c, 4 ≤ c ≤ N. Also, when the entries of the input matrix are from {0,1}, we show that the query time can...

  1. A distributed query execution engine of big attributed graphs.

    Science.gov (United States)

    Batarfi, Omar; Elshawi, Radwa; Fayoumi, Ayman; Barnawi, Ahmed; Sakr, Sherif

    2016-01-01

    A graph is a popular data model that has become pervasively used for modeling structural relationships between objects. In practice, in many real-world graphs, the graph vertices and edges need to be associated with descriptive attributes. Such type of graphs are referred to as attributed graphs. G-SPARQL has been proposed as an expressive language, with a centralized execution engine, for querying attributed graphs. G-SPARQL supports various types of graph querying operations including reachability, pattern matching and shortest path where any G-SPARQL query may include value-based predicates on the descriptive information (attributes) of the graph edges/vertices in addition to the structural predicates. In general, a main limitation of centralized systems is that their vertical scalability is always restricted by the physical limits of computer systems. This article describes the design, implementation in addition to the performance evaluation of DG-SPARQL, a distributed, hybrid and adaptive parallel execution engine of G-SPARQL queries. In this engine, the topology of the graph is distributed over the main memory of the underlying nodes while the graph data are maintained in a relational store which is replicated on the disk of each of the underlying nodes. DG-SPARQL evaluates parts of the query plan via SQL queries which are pushed to the underlying relational stores while other parts of the query plan, as necessary, are evaluated via indexless memory-based graph traversal algorithms. Our experimental evaluation shows the efficiency and the scalability of DG-SPARQL on querying massive attributed graph datasets in addition to its ability to outperform the performance of Apache Giraph, a popular distributed graph processing system, by orders of magnitudes.

  2. Online Query Scheduling on Source Permutation for Big Data Integration

    OpenAIRE

    Yuan, Zimu; Guo, Shusheng

    2015-01-01

    Big data integration could involve a large number of sources with unpredictable redundancy information between them. The approach of building a central warehousing to integrate big data from all sources then becomes infeasible because of so large number of sources and continuous updates happening. A practical approach is to apply online query scheduling that inquires data from sources at runtime upon receiving a query. In this paper, we address the Time-Cost Minimization Problem for online qu...

  3. The Query Complexity of Finding a Hidden Permutation

    DEFF Research Database (Denmark)

    Afshani, Peyman; Afrawal, Manindra; Benjamin, Doerr;

    2012-01-01

    the score fz(x) defined as fz(x):=maxi[0n]ji:z(j)=x(j); i.e., the length of the longest common prefix of x and z with respect to . The goal is to minimize the number of queries asked. Our main result are matching upper and lower bounds for this problem, both for deterministic and randomized query schemes...

  4. Optimizing Aggregate SPARQL Queries Using Materialized RDF Views

    DEFF Research Database (Denmark)

    Ibragimov, Dilshod; Hose, Katja; Pedersen, Torben Bach

    2016-01-01

    , this paper proposes MARVEL (MAterialized Rdf Views with Entailment and incompLetness). The approach consists of a view selection algorithm based on an associated RDF-specific cost model, a view definition syntax, and an algorithm for rewriting SPARQL queries using materialized RDF views. The experimental...... evaluation shows that MARVEL can improve query response time by more than an order of magnitude while effectively handling RDF specifics....

  5. Privately Releasing Conjunctions and the Statistical Query Barrier

    CERN Document Server

    Gupta, Anupam; Roth, Aaron; Ullman, Jonathan

    2010-01-01

    Suppose we would like to know all answers to a set of statistical queries C on a data set up to small error, but we can only access the data itself using statistical queries. A trivial solution is to exhaustively ask all queries in C. Can we do any better? + We show that the number of statistical queries necessary and sufficient for this task is---up to polynomial factors---equal to the agnostic learning complexity of C in Kearns' statistical query (SQ) model. This gives a complete answer to the question when running time is not a concern. + We then show that the problem can be solved efficiently (allowing arbitrary error on a small fraction of queries) whenever the answers to C can be described by a submodular function. This includes many natural concept classes, such as graph cuts and Boolean disjunctions and conjunctions. In doing so we also give a new learning algorithm for submodular functions that improves upon recent results in a different context. While interesting from a learning theoretic point of v...

  6. Largest Empty Circle Centered on a Query Line

    CERN Document Server

    Augustine, John; Roy, Sasanka

    2008-01-01

    The Largest Empty Circle problem seeks the largest circle centered within the convex hull of a set $P$ of $n$ points in $\\mathbb{R}^2$ and devoid of points from $P$. In this paper, we introduce a query version of this well-studied problem. In our query version, we are required to preprocess $P$ so that when given a query line $Q$, we can quickly compute the largest empty circle centered at some point on $Q$ and within the convex hull of $P$. We present solutions for two special cases and the general case; all our queries run in $O(\\log n)$ time. We restrict the query line to be horizontal in the first special case, which we preprocess in $O(n \\alpha(n) \\log n)$ time and space, where $\\alpha(n)$ is the slow growing inverse of the Ackermann's function. When the query line is restricted to pass through a fixed point, the second special case, our preprocessing takes $O(n \\alpha(n)^{O(\\alpha(n))} \\log n)$ time and space. We use insights from the two special cases to solve the general version of the problem with pr...

  7. A comparison of peer-to-peer query response modes

    CERN Document Server

    Hoschek, W

    2002-01-01

    In a large distributed system spanning many administrative domains such as a Grid (Foster et al., 2001), it is desirable to maintain and query dynamic and timely information about active participants such as services, resources and user communities. However, in such a database system, the set of information tuples in the universe is partitioned over one or more distributed nodes, for reasons including autonomy, scalability, availability, performance and security. This suggests the use of peer-to-peer (P2P) query technology. A variety of query response modes can be used to return matching query results from P2P nodes to an originator. Although from the functional perspective all response modes are equivalent, no mode is optimal under all circumstances. Which query response modes allow to express suitable trade-offs for a wide range ofP2P application? We answer this question by systematically describing and characterizing four query response modes for the unified peer-to-peer database framework (UPDF) proposed ...

  8. QUESEM: Towards building a Meta Search Service utilizing Query Semantics

    Directory of Open Access Journals (Sweden)

    Neelam Duhan

    2011-01-01

    Full Text Available Current Web Search Engines are built to serve needs of all users, independent of the special needs of any individual. The documents are returned by matching their queries with available documents, with no emphasis on the semantics of query. As a result, the generated information is often very large and inaccurate that results in increased user perceived latency. In this paper, a Semantic Search Service is being developed to help users gather relevant documents more efficiently unlike traditional Web search engines. The approach relies on the online web resource such as dictionary based sites to retrieve possible semantics of the query keywords, which are stored in a definition repository. The service works as a meta-layer above the keyword-based search engine to generate sub-queries based on different meanings of user query, which in turn are sent to the keyword-based search engine to perform Web search. This approach relieves the user in finding the desired information content and improves the search quality for certain types of complex queries. Experiments depict its efficiency as it results in reduced search space.

  9. Efficient external memory structures for range-aggregate queries

    DEFF Research Database (Denmark)

    Agarwal, P.K.; Yang, J.; Arge, L.;

    2013-01-01

    We present external memory data structures for efficiently answering range-aggregate queries. The range-aggregate problem is defined as follows: Given a set of weighted points in Rd, compute the aggregate of the weights of the points that lie inside a d-dimensional orthogonal query rectangle....... The aggregates we consider in this paper include count, sum, and max. First, we develop a structure for answering two-dimensional range-count queries that uses O(N/B) disk blocks and answers a query in View the MathML source I/Os, where N is the number of input points and B is the disk block size. The structure...... can be extended to obtain a near-linear-size structure for answering range-sum queries using View the MathML source I/Os, and a linear-size structure for answering range-max queries in View the MathML source I/Os. Our structures can be made dynamic and extended to higher dimensions....

  10. Improving query services of web map by web mining

    Science.gov (United States)

    Huang, Maojun

    2007-11-01

    Web map is the hybrid of map and the World Wide Web (known as Web). It is usually created with WebGIS techniques. With the rapid social development, web maps oriented the public are facing pressure that dissatisfy the increased demanding. The geocoding database plays a key role in supporting query services effectively. The traditional geocoding method is laborious and time-consuming. And there is much online spatial information, which would be the supplementary information source for geocoding. Therefore, this paper discusses how to improve query services by web mining. The improvement can be described from three facets: first, improving location query by discovering and extracting address information from the Web to extend geocoding database. Second, enhancing the ability of optimum path query of public traffic and buffer query by spatial analyzing and reasoning on the extended geocoding database. Third, adjusting strategies of collecting data according to patterns discovered by web map query mining. Finally, this paper presents the designing of the application system and experimental results.

  11. Measuring the achievable error of query sets under differential privacy

    CERN Document Server

    Li, Chao

    2012-01-01

    A common goal of privacy research is to release synthetic data that satisfies a formal privacy guarantee and can be used by an analyst in place of the original data. To achieve reasonable accuracy, a synthetic data set must be tuned to support a specified set of queries accurately, sacrificing fidelity for other queries. This work considers methods for producing synthetic data under differential privacy and investigates what makes a set of queries "easy" or "hard" to answer. We consider answering sets of linear counting queries using the matrix mechanism, a recent differentially-private mechanism that can reduce error by adding complex correlated noise adapted to a specified workload. Our main result is a novel lower bound on the minimum total error required to simultaneously release answers to a set of workload queries. The bound reveals that the hardness of a query workload is related to the spectral properties of the workload when it is represented in matrix form. The bound is tight and, because it satisfi...

  12. Interference Measurements in the European 868 MHz ISM Band with Focus on LoRa and SigFox

    DEFF Research Database (Denmark)

    Lauridsen, Mads; Vejlgaard, Benny; Kovács, István

    2017-01-01

    . The focus is on the Low Power Wide Area technologies LoRa and SigFox. The measurements show that there is a 22-33 % probability of interfering signals above -105 dBm within the mandatory LoRa and SigFox 868.0-868.6 MHz band in a shopping area and a business park in downtown Aalborg, which thus limits...... the potential coverage and capacity of LoRa and SigFox. However, the probability of interference is less than 3 % in the three other measurement locations in Aalborg. Finally, a hospital and an industrial area are shown to experience high activity in the RFID subband 865-868 MHz, while the wireless audio band...

  13. Restricted natural language based querying of clinical databases.

    Science.gov (United States)

    Safari, Leila; Patrick, Jon D

    2014-12-01

    To elevate the level of care to the community it is essential to provide usable tools for healthcare professionals to extract knowledge from clinical data. In this paper a generic translation algorithm is proposed to translate a restricted natural language query (RNLQ) to a standard query language like SQL (Structured Query Language). A special purpose clinical data analytics language (CliniDAL) has been introduced which provides scheme of six classes of clinical questioning templates. A translation algorithm is proposed to translate the RNLQ of users to SQL queries based on a similarity-based Top-k algorithm which is used in the mapping process of CliniDAL. Also a two layer rule-based method is used to interpret the temporal expressions of the query, based on the proposed temporal model. The mapping and translation algorithms are generic and thus able to work with clinical databases in three data design models, including Entity-Relationship (ER), Entity-Attribute-Value (EAV) and XML, however it is only implemented for ER and EAV design models in the current work. It is easy to compose a RNLQ via CliniDAL's interface in which query terms are automatically mapped to the underlying data models of a Clinical Information System (CIS) with an accuracy of more than 84% and the temporal expressions of the query comprising absolute times, relative times or relative events can be automatically mapped to time entities of the underlying CIS and to normalized temporal comparative values. The proposed solution of CliniDAL using the generic mapping and translation algorithms which is enhanced by a temporal analyzer component provides a simple mechanism for composing RNLQ for extracting knowledge from CISs with different data design models for analytics purposes. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. Evaluating XML-Extended OLAP Queries Based on a Physical Algebra

    DEFF Research Database (Denmark)

    Yin, Xuepeng; Pedersen, Torben Bach

    2006-01-01

    . In this paper, we extend previous work on the logical federation of OLAP and XML data sources by presenting a simplified query semantics, a physical query algebra and a robust OLAP-XML query engine as well as the query evaluation techniques. Performance experiments with a prototypical implementation suggest...

  15. Constraint-based query distribution framework for an integrated global schema

    DEFF Research Database (Denmark)

    Malik, Ahmad Kamran; Qadir, Muhammad Abdul; Iftikhar, Nadeem

    2009-01-01

    Distributed heterogeneous data sources need to be queried uniformly using global schema. Query on global schema is reformulated so that it can be executed on local data sources. Constraints in global schema and mappings are used for source selection, query optimization, and querying partitioned...

  16. BioSig: A bioinformatic system for studying the mechanism of intra-cell signaling

    Energy Technology Data Exchange (ETDEWEB)

    Parvin, B.; Cong, G.; Fontenay, G.; Taylor, J.; Henshall, R.; Barcellos-Hoff, M.H.

    2000-12-15

    Mapping inter-cell signaling pathways requires an integrated view of experimental and informatic protocols. BioSig provides the foundation of cataloging inter-cell responses as a function of particular conditioning, treatment, staining, etc. for either in vivo or in vitro experiments. This paper outlines the system architecture, a functional data model for representing experimental protocols, algorithms for image analysis, and the required statistical analysis. The architecture provides remote shared operation of an inverted optical microscope, and couples instrument operation with images acquisition and annotation. The information is stored in an object-oriented database. The algorithms extract structural information such as morphology and organization, and map it to functional information such as inter-cellular responses. An example of usage of this system is included.

  17. Observation spatiale et SIG: des outils pour cartographier les zones sensibles aux mouvements de terrain

    Directory of Open Access Journals (Sweden)

    Jean-Yves SCANVIC

    1994-12-01

    Full Text Available Les niveaux de sensibilité des sols aux mouvements de terrain ont été cartographiés dans différents bassins de risques en Bolivie, en Colombie et à Taïwan, selon une méthodologie développée au BRGM et fondée en partie sur l’extraction visuelle et numérique d’informations contenues dans les données de télédétection spatiale stéréoscopiques et leur gestion-valorisation dans un SIG. Ces cartes font apparaître l’intérêt de l’imagerie Spot pour la gestion du risque naturel.

  18. Observation spatiale et SIG: des outils pour cartographier les zones sensibles aux mouvements de terrain

    OpenAIRE

    Jean-Yves SCANVIC

    1994-01-01

    Les niveaux de sensibilité des sols aux mouvements de terrain ont été cartographiés dans différents bassins de risques en Bolivie, en Colombie et à Taïwan, selon une méthodologie développée au BRGM et fondée en partie sur l’extraction visuelle et numérique d’informations contenues dans les données de télédétection spatiale stéréoscopiques et leur gestion-valorisation dans un SIG. Ces cartes font apparaître l’intérêt de l’imagerie Spot pour la gestion du risque naturel....

  19. Proceedings of the 10th ASIS SIG/CR classification research workshop

    DEFF Research Database (Denmark)

    This volume is a working copy of the papers presented at the 9th ASIS SIG/CR workshop on classification research, held in Washington, DC, at the ASIS Annual Meeting on Sunday 31 October 1999. The contributions printed here are working papers, and thus, not necessarily in their final form....... For this reason, workshop participants and other readers are asked not to quote from the papers in this volume. After completion of the workshops, authors will have the opportunity to edit and/or revise their paper. Participants in the workshop are cordially invited to provide the authors or the editors...... with constructive comments to their papers. These proceedings will subsequently be published by Information Today, Inc., as an ASIS monograph entitled Advances in Classification Research: Volume JO....

  20. SIG de un bicing en el municipio de Montcada i Reixac

    OpenAIRE

    Fernández Reyes, José

    2014-01-01

    Actualmente, y cada vez más, los usuarios de una ciudad se decantan por las alternativas ecológicas en los diferentes ámbitos de su día a día. Es por ello que, las empresas de transporte público en bicicleta o “bicing” están en auge en nuestra sociedad. Este proyecto tratará de facilitar la creación de este servicio y un posterior uso a los ciudadanos del municipio de Montcada. En primer lugar, y gracias a los SIG, en este proyecto se ha elaborado una aplicación web que permitirá al usuari...