WorldWideScience

Sample records for rapid information retrieval

  1. Rapid automatic keyword extraction for information retrieval and analysis

    Science.gov (United States)

    Rose, Stuart J [Richland, WA]; Cowley, Wendy E [Richland, WA]; Crow, Vernon L [Richland, WA]; Cramer, Nicholas O [Richland, WA]

    2012-03-06

    Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.
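
The steps described in this abstract can be sketched in a few lines of Python. This is a minimal illustration rather than the patented implementation: the stop-word list, the function names, and the choice of degree divided by frequency as the word score are all assumptions made for the example (the abstract only says the score is a function of co-occurrence degree, co-occurrence frequency, or both).

```python
import re
from collections import defaultdict

# Hypothetical, deliberately tiny stop-word list; a real list would be far longer.
STOP_WORDS = {"a", "an", "and", "are", "for", "in", "is", "of", "on", "or", "the", "to", "with"}


def candidate_keywords(text, stop_words=STOP_WORDS):
    """Split text at delimiters and stop words into candidate keyword phrases."""
    candidates = []
    # Punctuation acts as a phrase delimiter.
    for fragment in re.split(r'[.,;:!?()"\n]+', text.lower()):
        phrase = []
        for word in re.findall(r"[a-z0-9][a-z0-9'-]*", fragment):
            if word in stop_words:
                # A stop word closes the current candidate phrase.
                if phrase:
                    candidates.append(tuple(phrase))
                phrase = []
            else:
                phrase.append(word)
        if phrase:
            candidates.append(tuple(phrase))
    return candidates


def word_scores(candidates):
    """Score each word as co-occurrence degree / frequency (an assumed choice)."""
    freq = defaultdict(int)
    degree = defaultdict(int)
    for phrase in candidates:
        for word in phrase:
            freq[word] += 1
            # Degree counts every word in the phrase, including the word itself.
            degree[word] += len(phrase)
    return {word: degree[word] / freq[word] for word in freq}


def extract_keywords(text, top_n=3):
    """Return the top_n candidate phrases ranked by the sum of their word scores."""
    candidates = candidate_keywords(text)
    scores = word_scores(candidates)
    keyword_scores = {
        " ".join(phrase): sum(scores[word] for word in phrase)
        for phrase in set(candidates)
    }
    # Highest score first; ties broken alphabetically for a stable order.
    ranked = sorted(keyword_scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_n]
```

Calling `extract_keywords` on a single document returns `(phrase, score)` pairs; because word scores are summed, longer candidate phrases tend to rank higher under this particular scoring choice.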

  2. Episodic Memory Retrieval Functionally Relies on Very Rapid Reactivation of Sensory Information.

    Science.gov (United States)

    Waldhauser, Gerd T; Braun, Verena; Hanslmayr, Simon

    2016-01-06

    Episodic memory retrieval is assumed to rely on the rapid reactivation of sensory information that was present during encoding, a process termed "ecphory." We investigated the functional relevance of this scarcely understood process in two experiments in human participants. We presented stimuli to the left or right of fixation at encoding, followed by an episodic memory test with centrally presented retrieval cues. This allowed us to track the reactivation of lateralized sensory memory traces during retrieval. Successful episodic retrieval led to a very early (∼100-200 ms) reactivation of lateralized alpha/beta (10-25 Hz) electroencephalographic (EEG) power decreases in the visual cortex contralateral to the visual field at encoding. Applying rhythmic transcranial magnetic stimulation to interfere with early retrieval processing in the visual cortex led to decreased episodic memory performance specifically for items encoded in the visual field contralateral to the site of stimulation. These results demonstrate, for the first time, that episodic memory functionally relies on very rapid reactivation of sensory information. Remembering personal experiences requires a "mental time travel" to revisit sensory information perceived in the past. This process is typically described as a controlled, relatively slow process. However, by using electroencephalography to measure neural activity with a high time resolution, we show that such episodic retrieval entails a very rapid reactivation of sensory brain areas. Using transcranial magnetic stimulation to alter brain function during retrieval revealed that this early sensory reactivation is causally relevant for conscious remembering. These results give first neural evidence for a functional, preconscious component of episodic remembering. This provides new insight into the nature of human memory and may help in the understanding of psychiatric conditions that involve the automatic intrusion of unwanted memories.

  3. Private information retrieval

    CERN Document Server

    Yi, Xun; Bertino, Elisa

    2013-01-01

    This book deals with Private Information Retrieval (PIR), a technique allowing a user to retrieve an element from a server in possession of a database without revealing to the server which element is retrieved. PIR has been widely applied to protect the privacy of the user in querying a service provider on the Internet. For example, by PIR, one can query a location-based service provider about the nearest car park without revealing his location to the server. The first PIR approach was introduced by Chor, Goldreich, Kushilevitz and Sudan in 1995 in a multi-server setting, where the user retriev
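
To make the multi-server idea concrete, here is a minimal sketch of a simple two-server XOR scheme. It is an illustration under stated assumptions, not necessarily the construction the book presents: both servers are assumed to hold identical copies of the database and not to collude, records are modelled as integers, and the function names are hypothetical.

```python
import secrets


def make_queries(n, index):
    """Client: build two random bit vectors that differ only at the wanted index."""
    q1 = [secrets.randbelow(2) for _ in range(n)]
    q2 = list(q1)
    q2[index] ^= 1  # flip exactly one bit; each vector alone looks uniformly random
    return q1, q2


def answer(database, query):
    """Server: XOR together the records selected by the query bits."""
    result = 0
    for record, bit in zip(database, query):
        if bit:
            result ^= record
    return result


def retrieve(database, index):
    """Client: query both (assumed non-colluding) servers and combine the answers."""
    q1, q2 = make_queries(len(database), index)
    a1 = answer(database, q1)  # computed by server 1
    a2 = answer(database, q2)  # computed by server 2
    # Records selected by both queries cancel under XOR, leaving only database[index].
    return a1 ^ a2
```

Each server sees only a uniformly random bit vector, so neither learns the index on its own. The price is a query as long as the database; reducing that communication cost is what more elaborate PIR schemes aim at.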

  4. Information Retrieval Models

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Göker, Ayse; Davies, John

    2009-01-01

    Many applications that handle information on the internet would be completely inadequate without the support of information retrieval technology. How would we find information on the world wide web if there were no web search engines? How would we manage our email without spam filtering? Much of the

  5. Topological Aspects of Information Retrieval.

    Science.gov (United States)

    Egghe, Leo; Rousseau, Ronald

    1998-01-01

    Discusses topological aspects of theoretical information retrieval, including retrieval topology; similarity topology; pseudo-metric topology; document spaces as topological spaces; Boolean information retrieval as a subsystem of any topological system; and proofs of theorems. (LRW)

  6. Introduction to information retrieval

    CERN Document Server

    Manning, Christopher D; Schütze, Hinrich

    2008-01-01

    Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced un

  7. Music Information Retrieval.

    Science.gov (United States)

    Downie, J. Stephen

    2003-01-01

    Identifies MIR (Music Information Retrieval) computer system problems, historic influences, current state-of-the-art, and future MIR solutions through an examination of the multidisciplinary approach to MIR. Highlights include pitch; temporal factors; harmonics; tone; editorial, textual, and bibliographic facets; multicultural factors; locating…

  8. Information Retrieval Evaluation

    CERN Document Server

    Harman, Donna

    2011-01-01

    Evaluation has always played a major role in information retrieval, with the early pioneers such as Cyril Cleverdon and Gerard Salton laying the foundations for most of the evaluation methodologies in use today. The retrieval community has been extremely fortunate to have such a well-grounded evaluation paradigm during a period when most of the human language technologies were just developing. This lecture has the goal of explaining where these evaluation methodologies came from and how they have continued to adapt to the vastly changed environment in the search engine world today. The lecture

  9. Interactive Information Retrieval

    DEFF Research Database (Denmark)

    Borlund, Pia

    2013-01-01

    The paper introduces the research area of interactive information retrieval (IIR) from a historical point of view. Further, the focus here is on evaluation, because much research in IR deals with IR evaluation methodology due to the core research interest in IR performance, system interaction and satisfaction with retrieved information. In order to position IIR evaluation, the Cranfield model and the series of tests that led to the Cranfield model are outlined. Three iconic user-oriented studies and projects that all have contributed to how IIR is perceived and understood today are presented. ... As a response to this call the ‘IIR evaluation model’ by Borlund (e.g., 2003a) is introduced. The objective of the IIR evaluation model is to facilitate IIR evaluation as close as possible to actual information searching and IR processes, though still in a relatively controlled evaluation environment, in which...

  10. Information, conservation and retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Eng, T. [Swedish Nuclear Fuel and Waste Management Co., Stockholm (Sweden); Norberg, E. [National Swedish Archives, Stockholm (Sweden); Torbacke, J. [Stockholm Univ. (Sweden). Dept. of History; Jensen, M. [Swedish Radiation Protection Inst., Stockholm (Sweden)

    1996-12-01

    The seminar took place on the Swedish ship for transportation of radioactive wastes, M/S Sigyn, which in summer is used for exhibitions. The seminar treated items related to general information needs in society and questions related to radioactive waste, i.e. how knowledge about a waste repository should be passed on to future generations. Three contributions are contained in the report from the seminar and are indexed separately: 'Active preservation - otherwise no archives'; 'The conservation and dissemination of information - A democratic issue'; and 'Conservation and retrieval of information - Elements of a strategy to inform future societies about nuclear waste repositories'.

  11. Introduction to information retrieval

    CERN Document Server

    Manning, Christopher D; Schütze, Hinrich

    2008-01-01

    Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.

  12. Language Processing in Information Retrieval.

    Science.gov (United States)

    Doszkocs, Tamas E.

    1986-01-01

    Examines role and contributions of natural-language processing in information retrieval and artificial intelligence research in context of large operational information retrieval systems and services. State-of-the-art information retrieval systems combining the functional capabilities of conventional inverted file term adjacency approach with…

  13. Multimedia Information Retrieval

    CERN Document Server

    Rueger, Stefan

    2009-01-01

    At its very core multimedia information retrieval means the process of searching for and finding multimedia documents; the corresponding research field is concerned with building the best possible multimedia search engines. The intriguing bit here is that the query itself can be a multimedia excerpt: For example, when you walk around in an unknown place and stumble across an interesting landmark, would it not be great if you could just take a picture with your mobile phone and send it to a service that finds a similar picture in a database and tells you more about the building -- and about its

  14. Changing Information Retrieval Behaviours

    DEFF Research Database (Denmark)

    Constantiou, Ioanna D.; Lehrer, Christiane; Hess, Thomas

    2014-01-01

    The introduction of smartphones and the accompanying profusion of mobile data services have had a profound effect on individuals' lives. One of the most influential service categories is location-based services (LBS). Based on insights from behavioural decision-making, a conceptual framework is developed to analyse individuals' decisions to use LBS, focusing on the cognitive processes involved in the decision-making. Our research is based on two studies. First, we investigate the use of LBS through semi-structured interviews of smartphone users. Second, we explore daily LBS use through a study ... on the continuance of LBS use and indicate changes in individuals' information retrieval behaviours in everyday life. In particular, the distinct value dimension of LBS in specific contexts of use changes individuals' behaviours towards accessing location-related information....

  15. Advanced Topics in Information Retrieval

    CERN Document Server

    Melucci, Massimo

    2011-01-01

    Information retrieval is the science concerned with the effective and efficient retrieval of documents starting from their semantic content. It is employed to fulfill some information need from a large number of digital documents. Given the ever-growing amount of documents available and the heterogeneous data structures used for storage, information retrieval has recently faced and tackled novel applications. In this book, Melucci and Baeza-Yates present a wide-spectrum illustration of recent research results in advanced areas related to information retrieval. Readers will find chapters on e.g

  16. Information retrieval in cultural heritage

    NARCIS (Netherlands)

    Koolen, M.; Kamps, J.; de Keijzer, V.

    2009-01-01

    This article discusses the opportunities and challenges of applying modern information retrieval techniques to the cultural heritage domain. Although the field of information retrieval is closely associated with computer science, it originally emerged from library science — also one of the main

  17. Contextual Bandits for Information Retrieval

    NARCIS (Netherlands)

    Hofmann, K.; Whiteson, S.; de Rijke, M.

    2011-01-01

    In this paper we give an overview of and outlook on research at the intersection of information retrieval (IR) and contextual bandit problems. A critical problem in information retrieval is online learning to rank, where a search engine strives to improve the quality of the ranked result lists it

  18. Name Searching and Information Retrieval

    CERN Document Server

    Thompson, Paul; Dozier, Christopher C.

    1997-01-01

    The main application of name searching has been name matching in a database of names. This paper discusses a different application: improving information retrieval through name recognition. It investigates name recognition accuracy, and the effect on retrieval performance of indexing and searching personal names differently from non-name terms in the context of ranked retrieval. The main conclusions are: that name recognition in text can be effective; that names occur frequently enough in a variety of domains, including those of legal documents and news databases, to make recognition worthwhile; and that retrieval performance can be improved using name searching.

  19. Ontology-based Information Retrieval

    DEFF Research Database (Denmark)

    Styltsvig, Henrik Bulskov

    In this thesis, we will present methods for introducing ontologies in information retrieval. The main hypothesis is that the inclusion of conceptual knowledge such as ontologies in the information retrieval process can contribute to the solution of major problems currently found in information retrieval. This utilization of ontologies has a number of challenges. Our focus is on the use of similarity measures derived from the knowledge about relations between concepts in ontologies, the recognition of semantic information in texts and the mapping of this knowledge into the ontologies in use, as well as how to fuse together the ideas of ontological similarity and ontological indexing into a realistic information retrieval scenario. To achieve the recognition of semantic knowledge in a text, shallow natural language processing is used during indexing that reveals knowledge to the level of noun...

  20. Topic structure for information retrieval

    NARCIS (Netherlands)

    He, J.; Sanderson, M.; Zhai, C.; Zobel, J.; Allan, J.; Aslam, J.A.

    2009-01-01

    In my research, I propose a coherence measure with the goal of discovering and using topic structures within and between documents, and I explore its extensions and applications in information retrieval.

  1. Hooked on Music Information Retrieval

    National Research Council Canada - National Science Library

    W. Bas de Haas; Frans Wiering

    2011-01-01

    This article provides a reply to 'Lure(d) into listening: The potential of cognition-based music information retrieval,' in which Henkjan Honing discusses the potential impact of his proposed Listen, Lure...

  2. Hooked on Music Information Retrieval

    National Research Council Canada - National Science Library

    de Haas, W Bas

    2010-01-01

    This article provides a reply to 'Lure(d) into listening: The potential of cognition-based music information retrieval,' in which Henkjan Honing discusses the potential impact of his proposed Listen, Lure...

  3. Mobile medical visual information retrieval.

    Science.gov (United States)

    Depeursinge, Adrien; Duc, Samuel; Eggel, Ivan; Müller, Henning

    2012-01-01

    In this paper, we propose mobile access to peer-reviewed medical information based on textual search and content-based visual image retrieval. Web-based interfaces designed for limited screen space were developed to query via web services a medical information retrieval engine optimizing the amount of data to be transferred in wireless form. Visual and textual retrieval engines with state-of-the-art performance were integrated. Results obtained show a good usability of the software. Future use in clinical environments has the potential of increasing quality of patient care through bedside access to the medical literature in context.

  4. Information retrieval in digital environments

    CERN Document Server

    Dinet, Jérôme

    2014-01-01

    Information retrieval is a central and essential activity. It is indeed difficult to find a human activity that does not need to retrieve information in an environment which is often increasingly digital: moving and navigating, learning, having fun, communicating, informing, making a decision, etc. Most human activities are intimately linked to our ability to search quickly and effectively for relevant information; the stakes are sometimes extremely important: passing an exam, voting, finding a job, remaining autonomous, being socially connected, developing a critical spirit, or simply surviv

  5. Information Retrieval for Ecological Syntheses

    Science.gov (United States)

    Bayliss, Helen R.; Beyer, Fiona R.

    2015-01-01

    Research syntheses are increasingly being conducted within the fields of ecology and environmental management. Information retrieval is crucial in any synthesis in identifying data for inclusion whilst potentially reducing biases in the dataset gathered, yet the nature of ecological information provides several challenges when compared with…

  6. Conceptual Information Retrieval.

    Science.gov (United States)

    1980-12-01

    Computer Science, 10 Hillhouse Avenue, New Haven, Connecticut 06520. Advanced Research... Office of Naval Research, Information Systems Program. Unclassified. ...can and should be useful in developing smarter IR systems. Many of the natural language and memory organization problems we have been dealing with are

  7. Challenges in Web Information Retrieval

    Science.gov (United States)

    Arora, Monika; Kanjilal, Uma; Varshney, Dinesh

    The major challenge in information access is the wealth of data available for information retrieval, which has evolved to provide principled approaches or strategies for searching. Search has become the leading paradigm for finding information on the World Wide Web. In building a successful web retrieval search engine model, a number of challenges arise at different levels, where techniques such as Usenet and support vector machines are employed to significant effect. The present investigation explores a number of problems, identified by level, related to finding information on the web. This paper attempts to examine the issues by applying different methods such as web graph analysis, the retrieval and analysis of newsgroup postings, and statistical methods for inferring meaning in text. We also discuss how one can gain control over the vast amounts of data on the web by addressing the problems in innovative ways that can greatly improve on the standard. The proposed model thus assists users in finding the existing data they need. The developed information retrieval model deals with providing access to information available in various modes and media formats and with helping users retrieve relevant and comprehensive information efficiently and effectively as per their requirements. This paper attempts to discuss the parameters that are responsible for efficient searching. These parameters can be classified as important or less important based on the inputs available. The important parameters can be taken into account in the future extension or development of search engines.

  8. Rhetorical relations for information retrieval

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Lu, Wei

    2012-01-01

    Typically, every part in most coherent text has some plausible reason for its presence, some function that it performs to the overall semantics of the text. Rhetorical relations, e.g. contrast, cause, explanation, describe how the parts of a text are linked to each other. Knowledge about this so-called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document’s rhetorical relations be useful to IR? We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness...

  9. Hooked on Music Information Retrieval

    Directory of Open Access Journals (Sweden)

    W. Bas de Haas

    2011-04-01

    This article provides a reply to 'Lure(d) into listening: The potential of cognition-based music information retrieval,' in which Henkjan Honing discusses the potential impact of his proposed Listen, Lure & Locate project on Music Information Retrieval (MIR). Honing presents some critical remarks on data-oriented approaches in MIR, which we endorse. To place these remarks in context, we first give a brief overview of the state of the art of MIR research. Then we present a series of arguments that show why purely data-oriented approaches are unlikely to take MIR research and applications to a more advanced level. Next, we propose our view on MIR research, in which the modelling of musical knowledge has a central role. Finally, we elaborate on the ideas in Honing's paper from an MIR perspective and propose some additions to the Listen, Lure & Locate project.

  10. Information retrieval and individual differences

    Directory of Open Access Journals (Sweden)

    Polona Vilar

    2008-01-01

    The paper presents individual differences found in studies of information retrieval, with emphasis on models of personality traits and cognitive and learning styles. It pays special attention to those models which are most often included in studies of information behaviour, information seeking, perceptions of IR systems, etc., but also brings forward some models which have not yet been included in such studies. Additionally, the relationship between different individual characteristics and an individual’s chosen profession or academic area is discussed. In this context, the paper presents how investigation of individual differences can be useful in the design of IR systems.

  11. Interactive information seeking, behaviour and retrieval

    CERN Document Server

    Ruthven, Ian

    2011-01-01

    Information retrieval (IR) is a complex human activity supported by sophisticated systems. This book covers the whole spectrum of information retrieval, including: history and background information; behaviour and seeking task-based information; searching and retrieval approaches to investigating information; and, evaluation interfaces for IR.

  12. Multimedia information retrieval theory and techniques

    CERN Document Server

    Raieli, Roberto

    2013-01-01

    Novel processing and searching tools for the management of new multimedia documents have been developed. Multimedia Information Retrieval (MMIR) is an organic system made up of Text Retrieval (TR); Visual Retrieval (VR); Video Retrieval (VDR); and Audio Retrieval (AR) systems. So that each type of digital document may be analysed and searched by the elements of language appropriate to its nature, search criteria must be extended. Such an approach is known as the Content Based Information Retrieval (CBIR), and is the core of MMIR. This novel content-based concept of information handling needs to be integrated with more traditional semantics. Multimedia Information Retrieval focuses on the tools of processing and searching applicable to the content-based management of new multimedia documents. Translated from Italian by Giles Smith, the book is divided into two parts. Part one discusses MMIR and related theories, and puts forward new methodologies; part two reviews various experimental and operating MMIR systems, a...

  13. Information Retrieval Methods in Libraries and Information Centers ...

    African Journals Online (AJOL)

    The volumes of information created, generated and stored are so immense that without adequate knowledge of information retrieval methods, the retrieval process for an information user would be cumbersome and frustrating. Studies have further revealed that information retrieval methods are essential in information centers ...

  14. Data Fusion in Information Retrieval

    CERN Document Server

    Wu, Shengli

    2012-01-01

    The technique of data fusion has been used extensively in information retrieval due to the complexity and diversity of tasks involved such as web and social networks, legal, enterprise, and many others. This book presents both a theoretical and empirical approach to data fusion. Several typical data fusion algorithms are discussed, analyzed and evaluated. A reader will find answers to the following questions, among others:
    - What are the key factors that affect the performance of data fusion algorithms significantly?
    - What conditions are favorable to data fusion algorithms?
    - Which is better, CombSum or CombMNZ, and why?
    - What is the rationale of using the linear combination method?
    - How can the best fusion option be found under any given circumstances?
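
For readers unfamiliar with the two methods named in this blurb, the following sketch shows CombSum and CombMNZ as they are commonly described; it is not taken from the book. The run format (a dict mapping document IDs to scores), the min-max normalization step, and the function names are assumptions made for the example.

```python
from collections import defaultdict


def min_max_normalize(run):
    """Rescale one system's scores to [0, 1] so runs are comparable (assumed step)."""
    if not run:
        return {}
    lo, hi = min(run.values()), max(run.values())
    if hi == lo:
        return {doc: 1.0 for doc in run}
    return {doc: (score - lo) / (hi - lo) for doc, score in run.items()}


def comb_sum(runs):
    """CombSum: add up each document's normalized scores across all runs."""
    fused = defaultdict(float)
    for run in runs:
        for doc, score in min_max_normalize(run).items():
            fused[doc] += score
    return dict(fused)


def comb_mnz(runs):
    """CombMNZ: CombSum multiplied by the number of runs that retrieved the document."""
    sums = comb_sum(runs)
    hits = defaultdict(int)
    for run in runs:
        for doc in run:
            # Assumption: a document counts as retrieved if it appears in the run at all.
            hits[doc] += 1
    return {doc: sums[doc] * hits[doc] for doc in sums}
```

The only difference between the two is the multiplier: CombMNZ gives extra weight to documents that several systems agree on, whereas CombSum rewards them only through the added scores.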

  15. Information retrieval implementing and evaluating search engines

    CERN Document Server

    Büttcher, Stefan; Cormack, Gordon V

    2016-01-01

    Information retrieval is the foundation for modern search engines. This textbook offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus -- a multiuser open-source information retrieval system developed by one of the authors and available online -- provides model implementations and a basis for student work. The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. In addition to its classroom use, Information Retrieval will be a valuable reference for professionals in computer science, computer engineering, and software engineering.

  16. Peer to Peer Information Retrieval: An Overview

    NARCIS (Netherlands)

    Tigelaar, A.S.; Hiemstra, D.; Trieschnigg, D.

    2012-01-01

    Peer-to-peer technology is widely used for file sharing. In the past decade a number of prototype peer-to-peer information retrieval systems have been developed. Unfortunately, none of these have seen widespread real-world adoption and thus, in contrast with file sharing, information retrieval is

  17. Peer to Peer Information Retrieval: An Overview

    NARCIS (Netherlands)

    Tigelaar, A.S.; Hiemstra, Djoerd; Trieschnigg, Rudolf Berend

    Peer-to-peer technology is widely used for file sharing. In the past decade a number of prototype peer-to-peer information retrieval systems have been developed. Unfortunately, none of these have seen widespread real-world adoption and thus, in contrast with file sharing, information retrieval is

  18. Information Retrieval Interaction: an Analysis of Models

    Directory of Open Access Journals (Sweden)

    Farahnaz Sadoughi

    2012-03-01

    The information searching process is interactive; users therefore have control over the search process and can manage its results. During this process, the user's question becomes more mature in light of the retrieved results. In addition, on the side of the information retrieval system, there are some processes that cannot be carried out except by the user. In practice, this is most evident in “Interaction”, i.e. the process of connecting the user to other system elements, and in “Relevance judgment”. This paper first looks at the presence of “Interaction” in information retrieval. Then the traditional model of information retrieval and its strengths and weaknesses are reviewed. Finally, the current models of interactive information retrieval, including Belkin's episodic model, Ingwersen's cognitive model, Saracevic's stratified model, and Spink's interactive feedback model, are elucidated.

  19. Bibliometric-enhanced information retrieval

    NARCIS (Netherlands)

    Mayr, Philipp; Scharnhorst, Andrea; Larsen, Birger; Schaer, Philipp; Mutschke, Peter

    2014-01-01

    Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of coauthorship network, can

  20. Parsimonious Language Models for Information Retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Robertson, Stephen; Zaragoza, Hugo

    We systematically investigate a new approach to estimating the parameters of language models for information retrieval, called parsimonious language models. Parsimonious language models explicitly address the relation between levels of language models that are typically used for smoothing. As such,

  1. Current challenges in patent information retrieval

    CERN Document Server

    Lupu, Mihai; Kando, Noriko; Trippe, Anthony J

    2017-01-01

    Intellectual property in the form of patents plays a vital role in today's increasingly knowledge-based economy. This book assembles state-of-the-art research and is intended to illustrate innovative approaches to patent information retrieval.

  2. Ask Alice: an Artificial Retrieval of Information Agent

    NARCIS (Netherlands)

    Valstar, M.; Baur, T.; Cafaro, A.; Ghitulescu, A.; Potard, B.; Wagner, J.; Andre, E.; Durieu, L.; Aylett, M.; Dermouche, P.; Pelachaud, C.; Coutinho, E.; Schuller, B.; Zhang, Yue; Heylen, Dirk K.J.; Theune, Mariet; van Waterschoot, Jelte Barachia

    2016-01-01

    We present a demonstration of the ARIA framework, a modular approach for rapid development of virtual humans for information retrieval that have linguistic, emotional, and social skills and a strong personality. We demonstrate the capabilities of our framework in a scenario where a popular book from

  3. Ask Alice: an Artificial Retrieval of Information Agent

    NARCIS (Netherlands)

    Valstar, M.; Baur, T.; Cafaro, A.; Ghitulescu, A.; Potard, B.; Wagner, J.; Andre, E.; Durieu, L.; Aylett, M.; Dermouche, P.; Pelachaud, C.; Coutinho, E.; Schuller, B.; Zhang, Yue; Heylen, Dirk K.J.; Theune, Mariet; van Waterschoot, Jelte Barachia

    We present a demonstration of the ARIA framework, a modular approach for rapid development of virtual humans for information retrieval that have linguistic, emotional, and social skills and a strong personality. We demonstrate the capabilities of our framework in a scenario where a popular book from

  4. Information Retrieval Research and ESPRIT.

    Science.gov (United States)

    Smeaton, Alan F.

    1987-01-01

    Describes the European Strategic Programme of Research and Development in Information Technology (ESPRIT), and its five programs: advanced microelectronics, software technology, advanced information processing, office systems, and computer integrated manufacturing. The emphasis on logic programming and ESPRIT as the European response to the…

  5. A Personalized Health Information Retrieval System

    OpenAIRE

    Wang, Yunli; Liu, Zhenkai

    2005-01-01

    Consumers face barriers when seeking health information on the Internet. A Personalized Health Information Retrieval System (PHIRS) is proposed to recommend health information for consumers. The system consists of four modules: (1) User modeling module captures user’s preference and health interests; (2) Automatic quality filtering module identifies high quality health information; (3) Automatic text difficulty rating module classifies health information into professional or...

  6. Science information systems: Archive, access, and retrieval

    Science.gov (United States)

    Campbell, William J.

    1991-01-01

    The objective of this research is to develop technology for the automated characterization and interactive retrieval and visualization of very large, complex scientific data sets. Technologies will be developed for the following specific areas: (1) rapidly archiving data sets; (2) automatically characterizing and labeling data in near real-time; (3) providing users with the ability to browse contents of databases efficiently and effectively; (4) providing users with the ability to access and retrieve system independent data sets electronically; and (5) automatically alerting scientists to anomalies detected in data.

  7. AEROMETRIC INFORMATION RETRIEVAL SYSTEM (AIRS) - GRAPHICS

    Science.gov (United States)

    Aerometric Information Retrieval System (AIRS) is a computer-based repository of information about airborne pollution in the United States and various World Health Organization (WHO) member countries. AIRS is administered by the U.S. Environmental Protection Agency, and runs on t...

  8. Using Complexity Measures in Information Retrieval

    NARCIS (Netherlands)

    van der Sluis, Frans; van den Broek, Egon; Belkin, N.J.; Kelly, D.

    2010-01-01

    Although Information Retrieval (IR) is meant to serve its users, surprisingly little IR research is user-centered. In contrast, this article utilizes the concept complexity of information as the determinant of the user's comprehension, not as a formal golden measure. Four aspects of user's

  9. Emergent web intelligence advanced information retrieval

    CERN Document Server

    Badr, Youakim; Abraham, Ajith; Hassanien, Aboul-Ella

    2010-01-01

    Web Intelligence explores the impact of artificial intelligence and advanced information technologies representing the next generation of Web-based systems, services, and environments, and designing hybrid web systems that serve wired and wireless users more efficiently. Multimedia and XML-based data are produced regularly and in increasing volume in our daily digital activities, and their retrieval must be explored and studied in this emergent web-based era. 'Emergent Web Intelligence: Advanced Information Retrieval' provides reviews of the related cutting-edge technologies and insights. It is v

  10. Representation and alignment of sung queries for music information retrieval

    Science.gov (United States)

    Adams, Norman H.; Wakefield, Gregory H.

    2005-09-01

    The pursuit of robust and rapid query-by-humming systems, which search melodic databases using sung queries, is a common theme in music information retrieval. The retrieval aspect of this database problem has received considerable attention, whereas the front-end processing of sung queries and the data structure to represent melodies has been based on musical intuition and historical momentum. The present work explores three time series representations for sung queries: a sequence of notes, a "smooth" pitch contour, and a sequence of pitch histograms. The performance of the three representations is compared using a collection of naturally sung queries. It is found that the most robust performance is achieved by the representation with highest dimension, the smooth pitch contour, but that this representation presents a formidable computational burden. For all three representations, it is necessary to align the query and target in order to achieve robust performance. The computational cost of the alignment is quadratic, hence it is necessary to keep the dimension small for rapid retrieval. Accordingly, iterative deepening is employed to achieve both robust performance and rapid retrieval. Finally, the conventional iterative framework is expanded to adapt the alignment constraints based on previous iterations, further expediting retrieval without degrading performance.
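
    The quadratic alignment cost and the coarse-to-fine idea described in this abstract can be illustrated with a minimal sketch. The abstract does not name its alignment algorithm, so dynamic time warping is used here as an assumption, and the helper names (dtw_distance, downsample, search) and the downsampling factors are hypothetical. This is not the authors' method, only one way such a search might be organised.

```python
def dtw_distance(query, target):
    """Dynamic time warping cost between two pitch sequences.

    Fills an (n+1) x (m+1) table, so time and memory are O(n * m):
    this is the quadratic cost that motivates low-dimensional inputs.
    """
    n, m = len(query), len(target)
    inf = float("inf")
    # cost[i][j] = best cost of aligning query[:i] with target[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(query[i - 1] - target[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def downsample(seq, factor):
    """Keep every `factor`-th sample to get a lower-dimensional contour."""
    return seq[::factor]


def search(query, melodies, factors=(4, 1), keep=2):
    """Coarse-to-fine search: rank all candidates cheaply, refine only the best."""
    candidates = list(melodies)
    for factor in factors:
        q = downsample(query, factor)
        candidates.sort(
            key=lambda name: dtw_distance(q, downsample(melodies[name], factor))
        )
        candidates = candidates[:keep]
    return candidates[0]


# Hypothetical melodies as MIDI pitch sequences; the query is a slowed-down "a".
melodies = {
    "a": [60, 60, 62, 62, 64, 64, 65, 65],
    "b": [72, 71, 69, 67, 65, 64, 62, 60],
    "c": [60, 62, 64, 65, 67, 69, 71, 72],
}
query = [60, 60, 60, 62, 62, 64, 64, 64, 65, 65]
best = search(query, melodies)
```

    The first pass compares heavily downsampled contours, which is cheap, and only the surviving candidates are aligned at full resolution.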

  11. BIR 2014 - Bibliometric-enhanced Information Retrieval

    DEFF Research Database (Denmark)

    This first “Bibliometric-enhanced Information Retrieval” (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. To give an example, recent approaches have shown the possibilities of alternative ranking methods based on citation analysis leading to an enhanced IR. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of co-authorship network, can improve retrieval services for specific communities, as well as for large, cross-domain collections. This workshop aims to raise awareness of the missing link between information retrieval (IR) and bibliometrics / scientometrics and to create a common ground...

  12. Applications Of Informetrics To Information Retrieval Research

    Directory of Open Access Journals (Sweden)

    Dietmar Wolfram

    2000-01-01

    A non-technical overview of two primary areas of study within the discipline of information science, information retrieval (IR) and informetrics, is presented. Informetric properties of IR systems as the basis for understanding IR system structure and generalizing human information seeking in electronic environments are discussed. Applications of informetric study of IR systems for more efficient and effective design and evaluation of IR systems are also presented.

  13. Machine Learning Approaches for Music Information Retrieval

    OpenAIRE

    Li, Tao; Ogihara, Mitsunori; Shao, Bo; Wang, Dingding

    2009-01-01

    We discussed the following machine learning approaches used in music information retrieval: (1) multi-class classification methods for music genre categorization; (2) multi-label classification methods for emotion detection; (3) clustering methods for music style identification; and (4) semi-supervised learning methods for music recommendation. Experimental results are also presented to evaluate the approaches.

  14. Language-based multimedia information retrieval

    NARCIS (Netherlands)

    de Jong, Franciska M.G.; Gauvain, J.L.; Hiemstra, Djoerd; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material

  15. Cross language information retrieval for biomedical literature

    NARCIS (Netherlands)

    Schuemie, M.; Trieschnigg, D.; Kraaij, W.

    2007-01-01

    This workshop report discusses the collaborative work of UT, EMC and TNO on the TREC Genomics Track 2007. The biomedical information retrieval task is approached using cross language methods, in which biomedical concept detection is combined with effective IR based on unigram language models.

  16. Cross Language Information Retrieval for Biomedical Literature

    NARCIS (Netherlands)

    Schuemie, Martijn; Trieschnigg, Rudolf Berend; Kraaij, Wessel; Voorhees, E.M; Buckland, L.P.

    2007-01-01

    This workshop report discusses the collaborative work of UT, EMC and TNO on the TREC Genomics Track 2007. The biomedical information retrieval task is approached using cross language methods, in which biomedical concept detection is combined with effective IR based on unigram language models.

  17. Semantic association ranking schemes for information retrieval ...

    Indian Academy of Sciences (India)

    Most information retrieval (IR) techniques represent documents using the traditional vector space or probabilistic language models, i.e., the bag-of-words model. In this paper, associations among words in the documents are assessed and expressed in a Term Association Graph model to represent ...

  18. Introduction to Data Transmission for Information Retrieval

    Science.gov (United States)

    Kallenbach, P. A.

    1975-01-01

    An introduction is presented to data transmission technology and networks for information retrieval purposes. Data signals are analyzed, modulation techniques are discussed, communication procedures between terminals and the central processing unit are surveyed, and possible network configurations are considered. (Author/PF)

  19. Towards an Information Retrieval Theory of Everything

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Lammerink, J.M.W.; Katoen, Joost P.; Kok, J.N.; van de Pol, Jan Cornelis; Raamsdonk, F.

    2009-01-01

    I present three well-known probabilistic models of information retrieval in tutorial style: The binary independence probabilistic model, the language modeling approach, and Google's page rank. Although all three models are based on probability theory, they are very different in nature. Each model

  20. Click Model-Based Information Retrieval Metrics

    NARCIS (Netherlands)

    Chuklin, A.; Serdyukov, P.; de Rijke, M.

    2013-01-01

    In recent years many models have been proposed that are aimed at predicting clicks of web search users. In addition, some information retrieval evaluation metrics have been built on top of a user model. In this paper we bring these two directions together and propose a common approach to converting

  1. Variations on language modeling for information retrieval.

    NARCIS (Netherlands)

    Kraaij, Wessel

    2004-01-01

    Search engine technology builds on theoretical and empirical research results in the area of information retrieval (IR). This dissertation makes a contribution to the field of language modeling (LM) for IR, which views both queries and documents as instances of a unigram language model and defines

  2. Inductive Information Retrieval Using Parallel Distributed Computation.

    Science.gov (United States)

    Mozer, Michael C.

    This paper reports on an application of parallel models to the area of information retrieval and argues that massively parallel, distributed models of computation, called connectionist, or parallel distributed processing (PDP) models, offer a new approach to the representation and manipulation of knowledge. Although this document focuses on…

  3. Database Optimization Aspects for Information Retrieval

    NARCIS (Netherlands)

    Blok, H.E.

    2002-01-01

    There is a growing need for systems that can process queries, combining both structured data and text. One way to provide such functionality is to integrate information retrieval (IR) techniques in a database management system (DBMS). However, both IR and database research have been separate

  4. Formalizing Evaluation in Music Information Retrieval

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2013-01-01

    We develop a formalism to disambiguate the evaluation of music information retrieval systems. We define a "system," what it means to "analyze" one, and make clear the aims, parts, design, execution, interpretation, and assumptions of its "evaluation." We apply this formalism to discuss the ... the MIREX automatic mood classification task...

  5. Test OSIRIS (On Line Search Information Retrieval Information Storage).

    Science.gov (United States)

    Showalther, A. Kenneth

    The OSIRIS system is a prototype information retrieval system having the following components: an automated microfiche file having a capacity of 5000 punch card sized microfiche with a remote control 21 inch TV console for retrieving, magnifying (0-250X), and displaying any of the images on the microfiche; and a remote computer terminal for the…

  6. Information retrieval models foundations and relationships

    CERN Document Server

    Roelleke, Thomas

    2013-01-01

    Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR). Regarding in
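
    As a concrete illustration of one of the models named in this record, the following is a minimal sketch of BM25 scoring. It assumes documents and queries are already tokenised, uses one common variant of the inverse document frequency term, and takes k1 = 1.2 and b = 0.75 as conventional default parameters. The function name bm25_scores is hypothetical and nothing here is taken from the book itself.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenised document in `docs` against the tokenised `query`."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Rare terms get a higher inverse document frequency.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturates with k1 and is normalised by document length via b.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = [
    ["language", "models", "for", "retrieval"],
    ["music", "retrieval", "by", "humming"],
    ["patent", "law"],
]
scores = bm25_scores(["language", "retrieval"], docs)
```

    In this toy collection the first document matches both query terms and scores highest, while the third matches none and scores zero.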

  7. Order effect in interactive information retrieval evaluation

    DEFF Research Database (Denmark)

    Clemmensen, Melanie Landvad; Borlund, Pia

    2016-01-01

    Purpose – The purpose of this paper is to report a study of order effect in interactive information retrieval (IIR) studies. The phenomenon of order effect is well-known, and it is the main reason why searches are permuted (counter-balanced) between test participants in IIR studies. However ... of such studies. Due to the limited sample of 20 test participants (Library and Information Science (LIS) students) inference statistics is not applicable; hence conclusions can be drawn from this sample of test participants only. Originality/value – Only few studies in LIS focus on order effect and none from the perspective of IIR. Keywords: Evaluation, Research methods, Information retrieval, User studies, Searching, Information searches

  8. Method of and System for Information Retrieval

    DEFF Research Database (Denmark)

    2015-01-01

    This invention relates to a system for and a method (100) of searching a collection of digital information (150) comprising a number of digital documents (110), the method comprising receiving or obtaining (102) a search query, the query comprising a number of search terms, searching (103) an index (300) using the search terms thereby providing information (301) about which digital documents (110) of the collection of digital information (150) that contains a given search term and one or more search related metrics (302; 303; 304; 305; 306), ranking (105) at least a part of the search result ... , a method of and a system for information retrieval or searching is readily provided that enhances the searching quality (i.e. the number of relevant documents retrieved and such documents being ranked high) when (also) using queries containing many search terms.

  9. Advanced Secure Information Retrieval Technology for Multilayer Information Extraction

    Directory of Open Access Journals (Sweden)

    Shoude Chang

    2008-01-01

    Secure information retrieval technology aims at status identification and documentation authentication. Ideally, materials or devices used in these technologies should be hard to find, difficult to counterfeit, and as simple as possible. This manuscript addresses a novel information retrieval technology, with photoluminescent (PL) semiconductor quantum dots (QDs) synthesized via wet chemistry approaches used as its coding materials. Conceptually, these QDs are designed to exhibit emission at Fraunhofer line positions, namely, black lines in the solar spectrum; thus, the retrieval system can extract useful information in sunlit areas. Furthermore, multiphoton excitation (MPE) technology enables the retrieval system to perform multilayer information extraction, with thin films consisting of QDs applied to various substrates, such as military helmets, vehicles, and fingernails. Anticipated applications include security, military, and law enforcement. QD-based security information can be easily destroyed by preset expiration in the presence of timing agents.

  10. Multilevel resistive information storage and retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Lohn, Andrew; Mickel, Patrick R.

    2016-08-09

    The present invention relates to resistive random-access memory (RRAM or ReRAM) systems, as well as methods of employing multiple state variables to form degenerate states in such memory systems. The methods herein allow for precise write and read steps to form multiple state variables, and these steps can be performed electrically. Such an approach allows for multilevel, high density memory systems with enhanced information storage capacity and simplified information retrieval.

  11. New approach to information retrieval problems in separations science

    Energy Technology Data Exchange (ETDEWEB)

    McDowell, W.J.; Corey, B.B.

    1984-01-01

    Retrieving information on specific chemical separations is among the most difficult problems in information management, although the ability to find methods to cleanly and efficiently separate chemical species is of the utmost importance to chemists and chemical engineers. Information on performing specific chemical separations is largely buried in the literature of dozens of branches of science. Most methods of indexing (both hard copy and computer) do not provide good means of retrieving information on specific separations because index terms such as extraction, leaching, chromatography, and even ion exchange have different meanings in different disciplines. Recent attempts to solve some of the problems of information retrieval in separations science have resulted in the concept of a Separations Science Data Base. This data base is designed for chemical separations information and contains unique indexes that allow rapid and accurate retrieval of information about specific separations from specific matrices, and a method of minimizing false returns that result from cross coupling of unrelated terms in multisubject reports. Although the data base is presently only about 20% complete, the success of this work has been encouraging and further work is indicated.

  12. Relating the new language models of information retrieval to the traditional retrieval models

    NARCIS (Netherlands)

    Hiemstra, Djoerd; de Vries, A.P.

    During the last two years, exciting new approaches to information retrieval were introduced by a number of different research groups that use statistical language models for retrieval. This paper relates the retrieval algorithms suggested by these approaches to widely accepted retrieval algorithms

  13. Information Retrieval Using a Middleware Approach

    Directory of Open Access Journals (Sweden)

    Danijela Boberić Krstićev

    2013-03-01

    This paper explores the use of a mediator/wrapper approach to enable the search of an existing library management system using different information retrieval protocols. It proposes an architecture for a software component that will act as an intermediary between the library system and search services. It provides an overview of different approaches to add Z39.50 and Search/Retrieval via URL (SRU) functionality using a middleware approach that is implemented on the BISIS library management system. That wrapper performs transformation of Contextual Query Language (CQL) into Lucene query language. The primary aim of this software component is to enable search and retrieval of bibliographic records using the SRU and Z39.50 protocols, but the proposed architecture of the software components is also suitable for inclusion of the existing library management system into a library portal. The software component provides a single interface to server-side protocols for search and retrieval of records. Additional protocols could be used. This paper provides practical demonstration of interest to developers of library management systems and those who are trying to use open-source solutions to make their local catalog accessible to other systems.

  14. Cross-view Embeddings for Information Retrieval

    OpenAIRE

    GUPTA, PARTH ALOKKUMAR

    2017-01-01

    In this dissertation, we deal with the cross-view tasks related to information retrieval using embedding methods. We study existing methodologies and propose new methods to overcome their limitations. We formally introduce the concept of mixed-script IR, which deals with the challenges faced by an IR system when a language is written in different scripts because of various technological and sociological factors. Mixed-script terms are represented by a small and finite feature space c...

  15. Statistical Language Models and Information Retrieval: Natural Language Processing Really Meets Retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; de Jong, Franciska M.G.

    2001-01-01

    Traditionally, natural language processing techniques for information retrieval have always been studied outside the framework of formal models of information retrieval. In this article, we introduce a new formal model of information retrieval based on the application of statistical language models.

  16. Cognitive approach to information retrieval and communication

    Directory of Open Access Journals (Sweden)

    Saša Zupanič

    1997-01-01

    The cognitive approach (viewpoint/standpoint) to the retrieval and communication of information, as well as to librarianship and information science, started gaining importance in the 1970s. Today, it is present in literary and objective knowledge studies, as well as in studies of users, information brokers and information retrieval systems. The cognitive approach exerts a strong impact on several scientific disciplines that are grouped under the roof of cognitive science. It has caused a split and the formation of a new paradigm, the cognitive paradigm, in many scientific disciplines. Within Kuhn's definition of the concept of paradigm, it is evident that librarianship and information science are at the pre-paradigmatic level. However, some authors mention the existence of at least two paradigms in library and information science, the physical and the cognitive paradigm. A historical overview of the cognitively oriented research of Brookes, De Mey, Belkin, Ingwersen and others gives insight into the development of library and information science thought up to the present.

  17. Geosemantic Information Retrieval Using a Geoontology

    Science.gov (United States)

    Hwang, J.

    2014-12-01

    With the spread of mobile terminals and supporting technologies, most users now prefer the more convenient and dynamic mobile information retrieval services to desktop PC services tied to a fixed location. Mobile information retrieval has the advantage of providing personalized results for a user's information request anytime and anywhere, taking the user's mobility and portability into account. Information retrieval on mobile devices therefore needs context awareness techniques, which are being actively researched. In this thesis, I developed a context awareness ontology model for Geotourism, as a representative context awareness technique, to predict the user's interest and anticipate which retrieval results and which places the user wants. The proposed Geotour ontology model extends the W3C Time Ontology defined in international standards and the spatial geometry feature ontology supported by OGC GeoSPARQL, so it can provide usability and functionality. That is, the GeotourFeature class is a subclass of ogc:Feature defined by OGC, as in Figure 1. The GeotourTime class, which expresses temporal information about a given Geotour feature, is a subclass of TemporalThing of W3C. [Figure 1: Relationship between the international standard ontology and the geotour ontology model.] A Geotour feature and a geotour map describe part of the ontology that represents the GeotourFeature, composed of the GeotourTime class and the GeotourLocation class. The highest class representing GeotourTime and GeotourLocation is the GeotourFeature class. As mentioned in the previous section, our model inherits the temporal ontology of W3C. Figure 2 describes a part of the ontology that represents the GeotourFeature, composed of the GeotourTime class and the GeotourLocation class.
The highest class

  18. Practice of information retrieval technique and limitation of database usage (2) - Limitation of retrieval and retrieval by information broker -

    Science.gov (United States)

    Suzuki, Shigekazu

    Nowadays anyone can carry out information retrieval, thanks to advances in the use of computers for searching the scientific and technical literature. Regarding the limitations of present-day database usage, the author divided searchers into four generations according to their skill in retrieval techniques. This paper considers blind spots from the viewpoint of searchers of the third generation onward, who began to question the content of databases, and emphasizes three blind spots that cannot be overlooked: 1) limitations on input to the database, 2) limitations of keyword and code searching by computer, and 3) limitations on evaluating the fitness of retrieved results.

  19. Image Information Retrieval: An Overview of Current Research

    OpenAIRE

    Goodrum, Abby A.

    2000-01-01

    This paper provides an overview of current research in image information retrieval and provides an outline of areas for future research. The approach is broad and interdisciplinary and focuses on three aspects of image research (IR): text-based retrieval, content-based retrieval, and user interactions with image information retrieval systems. The review concludes with a call for image retrieval evaluation studies similar to TREC.

  20. 108 Information Retrieval Methods in Libraries and Information ...

    African Journals Online (AJOL)

    User

    the organization of library materials and their recordings for use by readers came into being a little more than a century ago. Today's information professionals should know and be conversant with the traditional information retrieval tools and methods like classification, cataloguing, and vocabulary control as well as the ...

  1. EFFICACIOUS GEOSPATIAL INFORMATION RETRIEVAL USING DENSITY PROBABILISTIC DOCUMENT CORRELATION APPROACH

    OpenAIRE

    Uma, R.; Muneeswaran

    2013-01-01

    Information Retrieval (IR) is a profound technique to find information that addresses the need of a query. Processing normal text is easier, and information can be retrieved efficiently; there are plenty of algorithms at hand to carry out normal text retrieval. Retrieving geospatial information, by contrast, is very complex and requires additional operations, since geospatial data contain more complex details than general data, such as location and direction. To handle geographical quer...

  2. Graph-Based Interactive Bibliographic Information Retrieval Systems

    Science.gov (United States)

    Zhu, Yongjun

    2017-01-01

    In the big data era, we have witnessed the explosion of scholarly literature. This explosion has imposed challenges to the retrieval of bibliographic information. Retrieval of intended bibliographic information has become challenging due to the overwhelming search results returned by bibliographic information retrieval systems for given input…

  3. Random walk term weighting for information retrieval

    DEFF Research Database (Denmark)

    Blanco, R.; Lioma, Christina

    2007-01-01

    We present a way of estimating term weights for Information Retrieval (IR), using term co-occurrence as a measure of dependency between terms. We use the random walk graph-based ranking algorithm on a graph that encodes terms and co-occurrence dependencies in text, from which we derive term weights that represent a quantification of how a term contributes to its context. Evaluation on two TREC collections and 350 topics shows that the random walk-based term weights perform at least comparably to the traditional tf-idf term weighting, while they outperform it when the distance between co-occurring terms...
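
    The idea in this abstract can be sketched as follows: build a graph linking terms that co-occur within a small window, then run a random walk over it to obtain one weight per term. The sketch assumes an undirected graph, a PageRank-style power iteration with a damping factor of 0.85, and a window of one position. These choices and the function names are illustrative assumptions, not details confirmed by the abstract.

```python
from collections import defaultdict


def cooccurrence_graph(tokens, window=1):
    """Undirected graph linking terms that co-occur within `window` positions."""
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return graph


def random_walk_weights(graph, damping=0.85, iterations=50):
    """PageRank-style power iteration over the term graph."""
    n = len(graph)
    weights = {term: 1.0 / n for term in graph}
    for _ in range(iterations):
        new = {}
        for term in graph:
            # Each neighbour passes on its weight, split evenly over its edges.
            incoming = sum(weights[nb] / len(graph[nb]) for nb in graph[term])
            new[term] = (1 - damping) / n + damping * incoming
        weights = new
    return weights


# Toy token stream in which "a" co-occurs with every other term.
tokens = "a b a c a d".split()
weights = random_walk_weights(cooccurrence_graph(tokens))
```

    Here the hub term "a" ends up with the largest weight because every other term links to it, which is the sense in which a term's weight reflects its contribution to its context.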

  4. An Effective Information Retrieval for Ambiguous Query

    OpenAIRE

    Roul, R. K.; Sahay, S. K.

    2012-01-01

    A search engine returns thousands of web pages for a single user query, most of which are not relevant. In this context, effective information retrieval from the expanding web is a challenging task, in particular if the query is ambiguous. The major question that arises here is how to get the relevant pages for an ambiguous query. We propose an approach for the effective result of an ambiguous query by forming community vector based on association concept of data mining using vector s...

  5. JANE, A new information retrieval system for the Radiation Shielding Information Center

    Energy Technology Data Exchange (ETDEWEB)

    Trubey, D.K.

    1991-05-01

    A new information storage and retrieval system has been developed for the Radiation Shielding Information Center (RSIC) at Oak Ridge National Laboratory to replace mainframe systems that have become obsolete. The database contains citations and abstracts of literature which were selected by RSIC analysts and indexed with terms from a controlled vocabulary. The database, begun in 1963, has been maintained continuously since that time. The new system, called JANE, incorporates automatic indexing techniques and on-line retrieval using the RSIC Data General Eclipse MV/4000 minicomputer. Automatic indexing and retrieval techniques based on fuzzy-set theory allow the presentation of results in order of Retrieval Status Value. The fuzzy-set membership function depends on term frequency in the titles and abstracts and on Term Discrimination Values which indicate the resolving power of the individual terms. These values are determined by the Cover Coefficient method. The use of a commercial database to store and retrieve the indexing information permits rapid retrieval of the stored documents. Comparisons of the new and presently-used systems for actual searches of the literature indicate that it is practical to replace the mainframe systems with a minicomputer system similar to the present version of JANE. 18 refs., 10 figs.

  6. A semantic medical multimedia retrieval approach using ontology information hiding.

    Science.gov (United States)

    Guo, Kehua; Zhang, Shigeng

    2013-01-01

    Searching for useful information in unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users' query intent. Firstly, semantic annotations will be given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represents semantic information will be hidden in the head of the multimedia documents. The main innovations of this approach are cross-type retrieval support and semantic information preservation. Experimental results indicate good precision and efficiency of our approach for medical multimedia retrieval in comparison with some traditional approaches.

  7. A Semantic Medical Multimedia Retrieval Approach Using Ontology Information Hiding

    Science.gov (United States)

    Guo, Kehua; Zhang, Shigeng

    2013-01-01

    Searching for useful information in unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users' query intent. Firstly, semantic annotations will be given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represents semantic information will be hidden in the head of the multimedia documents. The main innovations of this approach are cross-type retrieval support and semantic information preservation. Experimental results indicate good precision and efficiency of our approach for medical multimedia retrieval in comparison with some traditional approaches. PMID:24082915

  8. Visualization of database structures for information retrieval

    Directory of Open Access Journals (Sweden)

    Grete Lisbjerg Jensen

    1994-12-01

    This paper describes the Book House system, which is designed to support children's information retrieval in libraries as part of their education. It is a shareware program available on CD-ROM or floppy disks, and comprises functionality for database searching as well as for classifying and storing book information in the database. The system concept is based on an understanding of children's domain structures and their capabilities for categorization of information needs in connection with their activities in schools, in school libraries or in public libraries. These structures are visualized in the interface by using metaphors and multimedia technology. Through the use of text, images and animation, the Book House encourages children - even at a very early age - to learn by doing in an enjoyable way, which plays on their previous experiences with computer games. Both words and pictures can be used for searching; this makes the system suitable for all age groups. Even children who have not yet learned to read properly can, by selecting pictures, search for and find those books they would like to have read aloud. Thus, at the very beginning of their school life, they can learn to search for books on their own. For the library community, such a system will provide an extended service which will increase the number of children's own searches and also improve the relevance, quality and utilization of the book collections in the libraries. A market research report on the need for an annual indexing service for books in the Book House format is in preparation by the Danish Library Centre A/S.

  9. A User-Oriented Approach to Music Information Retrieval

    OpenAIRE

    Lesaffre, Micheline; Leman, Marc; Martens, Jean-Pierre

    2006-01-01

    Search and retrieval of specific musical content (e.g. emotion, melody) has become an important aspect of system development but only little research is user-oriented. The success of music information retrieval technology primarily depends on both assessing and meeting the needs of its users. Potential users of music information retrieval systems, however, draw upon various ways of expressing themselves. But, who are the potential users of MIR systems and how would they describe music qualiti...

  10. Online learning to rank for information retrieval: SIGIR 2016 tutorial

    NARCIS (Netherlands)

    Grotov, A.; de Rijke, M.

    2016-01-01

    During the past 10-15 years offline learning to rank has had a tremendous influence on information retrieval, both scientifically and in practice. Recently, as the limitations of offline learning to rank for information retrieval have become apparent, there is increased attention for online

  11. Problems of Music Information Retrieval in the Real World.

    Science.gov (United States)

    Byrd, Donald; Crawford, Tim

    2002-01-01

    Considers some of the most fundamental problems in music information retrieval, challenging the common assumption that searching on pitch alone is likely to be satisfactory for all purposes. Discusses special issues related to polyphonic music, user-interface issues, and the notion of relevance for music information retrieval. (Contains 52…

  12. Innovations in information retrieval perspectives for theory and practice

    CERN Document Server

    Foster, Allen

    2011-01-01

    The advent of various information retrieval (IR) technologies and approaches to storage and retrieval provide communities with opportunities for mass documentation, digitization, and the recording of information in different forms. This book introduces and contextualizes these developments and looks at supporting research in IR.

  13. The Human-Computer Interface for Information Retrieval.

    Science.gov (United States)

    Shaw, Debora

    1991-01-01

    Discusses the human-computer interface as it relates to information technology and retrieval. Principles of interface design are examined, including visual display features and help messages; information retrieval applications are described, including online searching, CD-ROM, online public access catalogs (OPACs), and full-text databases; and…

  14. Perspectives in CD-ROM for Information Storage and Retrieval.

    Science.gov (United States)

    Lunin, Lois F., Ed.; Schipma, Peter B., Ed.

    1988-01-01

    A series of six articles discusses the technology of optical data disks, current and possible future applications of this technology, their potential impact on information retrieval systems, and the potential problems as they apply to information science. (CLB)

  15. Combining Information Sources for Video Retrieval

    NARCIS (Netherlands)

    Westerveld, T.H.W.; Ianeva, T.; Boldareva, L.; de Vries, A.P.; Hiemstra, Djoerd

    The previous video track results demonstrated that it is far from trivial to take advantage of multiple modalities for the video retrieval search task. For almost any query, results based on ASR transcripts have been better than any other run. This year’s main success in our runs is that a

  16. Short Communication The New Information Retrieval Media and the ...

    African Journals Online (AJOL)

    First from the manual method, then to the use of computer software, retrieval is now made from full-text and on-line databases. This paper discusses the transition to these new information retrieval media and the challenges for Nigerian libraries to adopt the two key elements that propel it - computers and Telecommunication ...

  17. An Evaluation of Automatically Constructed Hypertexts for Information Retrieval.

    Science.gov (United States)

    Melucci, Massimo

    1999-01-01

    Assesses the retrieval effectiveness of automatically constructed interdocument hypertext links in information retrieval (IR). Describes experiments using statistical and probabilistic techniques that were designed to obtain evidence concerning the usefulness of querying and browsing automatically constructed IR hypertexts. Results indicate a…

  18. Adapting a Diagnostic Problem-Solving Model to Information Retrieval.

    Science.gov (United States)

    Syu, Inien; Lang, S. D.

    2000-01-01

    Explains how a competition-based connectionist model for diagnostic problem-solving is adapted to information retrieval. Topics include probabilistic causal networks; Bayesian networks; the neural network model; empirical studies of test collections that evaluated retrieval performance; precision results; and the use of a thesaurus to provide…

  19. Understanding information retrieval systems management, types, and standards

    CERN Document Server

    Bates, Marcia J

    2011-01-01

    In order to be effective for their users, information retrieval (IR) systems should be adapted to the specific needs of particular environments. The huge and growing array of types of information retrieval systems in use today is on display in Understanding Information Retrieval Systems: Management, Types, and Standards, which addresses over 20 types of IR systems. These various system types, in turn, present both technical and management challenges, which are also addressed in this volume. In order to be interoperable in a networked environment, IR systems must be able to use various types of

  20. Care episode retrieval: distributional semantic models for information retrieval in the clinical domain.

    Science.gov (United States)

    Moen, Hans; Ginter, Filip; Marsi, Erwin; Peltonen, Laura-Maria; Salakoski, Tapio; Salanterä, Sanna

    2015-01-01

    Patients' health related information is stored in electronic health records (EHRs) by health service providers. These records include sequential documentation of care episodes in the form of clinical notes. EHRs are used throughout the health care sector by professionals, administrators and patients, primarily for clinical purposes, but also for secondary purposes such as decision support and research. The vast amounts of information in EHR systems complicate information management and increase the risk of information overload. Therefore, clinicians and researchers need new tools to manage the information stored in the EHRs. A common use case is, given a (possibly unfinished) care episode, to retrieve the most similar care episodes among the records. This paper presents several methods for information retrieval, focusing on care episode retrieval, based on textual similarity, where similarity is measured through domain-specific modelling of the distributional semantics of words. Models include variants of random indexing and the semantic neural network model word2vec. Two novel methods are introduced that utilize the ICD-10 codes attached to care episodes to better induce domain-specificity in the semantic model. We report on experimental evaluation of care episode retrieval that circumvents the lack of human judgements regarding episode relevance. Results suggest that several of the methods proposed outperform a state-of-the-art search engine (Lucene) on the retrieval task.

  1. On Region Algebras, XML Databases, and Information Retrieval

    NARCIS (Netherlands)

    Mihajlovic, V.; Hiemstra, Djoerd; Apers, Peter M.G.

    2003-01-01

    This paper describes some new ideas on developing a logical algebra for databases that manage textual data and support information retrieval functionality. We describe a first prototype of such a system.

  2. Vector space model for document representation in information retrieval

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2007-12-01

    This paper presents the basics of information retrieval: the vector space model for document representation with Boolean and term weighted models, ranking methods based on the cosine factor and evaluation measures: recall, precision and combined measure.
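
    As an illustration of the vector space model summarized in this record, the following minimal sketch ranks documents by cosine similarity over term-weighted vectors. The TF-IDF weighting variant, the function names (`tf_idf_vectors`, `cosine`, `rank`) and the toy data are assumptions made for this example and are not taken from the paper.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Return one sparse TF-IDF vector (a dict) per tokenized document, plus the IDF table."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    # Assumed weighting: log(n / df) + 1, so terms shared by all documents keep some weight.
    idf = {term: math.log(n / count) + 1.0 for term, count in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]
    return vectors, idf


def cosine(u, v):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Return indices of documents with a non-zero score, best match first."""
    vectors, idf = tf_idf_vectors(docs)
    # Query terms unseen in the collection get weight 0.
    q = {t: tf * idf.get(t, 0.0) for t, tf in Counter(query).items()}
    scored = [(cosine(q, vec), i) for i, vec in enumerate(vectors)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for score, i in scored if score > 0.0]


docs = [
    ["vector", "space", "model"],
    ["boolean", "model"],
    ["music", "retrieval"],
]
ranking = rank(["vector", "model"], docs)
```

    In this toy collection the first document shares both query terms and the second shares one, so they are returned in that order; the third shares none and is dropped.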

  3. GRAMMAR RULE BASED INFORMATION RETRIEVAL MODEL FOR BIG DATA

    Directory of Open Access Journals (Sweden)

    T. Nadana Ravishankar

    2015-07-01

    Though Information Retrieval (IR) in big data has been an active field of research for the past few years, the popularity of native languages presents a unique challenge for big data information retrieval systems. There is a need to retrieve information which is present in English and display it in the native language for users. This aim of cross-language information retrieval is complicated by unique features of native languages such as morphology, compound word formation, word spelling variations, ambiguity, word synonyms, and the influence of other languages. To overcome some of these issues, the native language is modeled using a grammar rule based approach in this work. The advantage of this approach is that the native language is modeled and its unique features are encoded using a set of inference rules. This rule base, coupled with the customized ontological system, shows considerable potential and is found to show better precision and recall.

  4. Comparative Analysis of Sparse Matrix Algorithms For Information Retrieval

    Directory of Open Access Journals (Sweden)

    Nazli Goharian

    2003-02-01

    We evaluate and compare the storage efficiency of different sparse matrix storage formats as index structures for text collections, and their corresponding sparse matrix-vector multiplication algorithms to perform query processing in information retrieval (IR) applications. We show the results of our implementations for several sparse matrix algorithms, such as Coordinate Storage (COO), Compressed Sparse Column (CSC), Compressed Sparse Row (CSR), and Block Sparse Row (BSR), using a standard text collection. Evaluation is based on the storage space requirement for each indexing structure and the efficiency of the query-processing algorithm. Our results demonstrate that CSR is more efficient in terms of storage space requirement and query processing time than the other sparse matrix algorithms for information retrieval applications. Furthermore, we experimentally evaluate the mapping of various existing index compression techniques used to compress indexes in information retrieval (IR) systems onto Compressed Sparse Row Information Retrieval (CSR IR).
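
    To make the CSR idea in this record concrete, here is a minimal sketch of Compressed Sparse Row storage and the sparse matrix-vector product used to score documents against a query. The layout (rows as documents, columns as terms), the function names and the toy matrix are assumptions for illustration; the paper's own implementation is not shown in the abstract.

```python
def to_csr(dense):
    """Convert a dense row-major matrix into CSR arrays (values, col_idx, row_ptr)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for j, x in enumerate(row):
            if x != 0:
                values.append(x)   # non-zero term weight
                col_idx.append(j)  # column (term id) of that weight
        # row_ptr[i + 1] marks where row i ends in values/col_idx.
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def csr_matvec(values, col_idx, row_ptr, query):
    """Multiply the CSR matrix by a dense query vector, giving one score per row."""
    scores = []
    for i in range(len(row_ptr) - 1):
        total = 0
        # Only the stored non-zeros of row i are visited.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            total += values[k] * query[col_idx[k]]
        scores.append(total)
    return scores


# Toy document-by-term matrix: 3 documents, 4 terms (hypothetical weights).
dense = [
    [2, 0, 0, 1],
    [0, 0, 3, 0],
    [0, 0, 0, 0],
]
values, col_idx, row_ptr = to_csr(dense)
scores = csr_matvec(values, col_idx, row_ptr, [1, 0, 1, 1])
```

    Only three of the twelve matrix entries are stored here, and the multiplication touches only those three, which is the storage and query-time saving the abstract refers to.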

  5. Object-Centered Knowledge Representation and Information Retrieval.

    Science.gov (United States)

    Panyr, Jiri

    1996-01-01

    Discusses object-centered knowledge representation and information retrieval. Highlights include semantic networks; frames; predicative (declarative) and associative knowledge; cluster analysis; creation of subconcepts and superconcepts; automatic classification; hierarchies and pseudohierarchies; graph theory; term classification; clustering of…

  6. Interface design for an audio based information retrieval system

    OpenAIRE

    Johnson, James Robert

    1992-01-01

    This project involves a telephone-based information retrieval system. Users interact with the computer by pressing buttons on a telephone keypad and listening to the computer respond by way of a speech synthesizer. The purpose of this project is to redesign and revise an existing information retrieval system. The goals of this project include simplifying the job of the menu designer and providing a way so experience can aid users to perform a given task faster than previously possible. Key...

  7. JavaScript tools for online information retrieval

    OpenAIRE

    Gamage, Ruwan; Dong, Hui

    2006-01-01

    JavaScript has a comparatively long history as an online information retrieval tool. During the last decade SilverPlatter's popular WebSPIRS 4.0 started using JavaScript for its search functions. International Children's Digital Library is a current system that applies JavaScript for category based information retrieval. However, JavaScript capabilities for quick browsing and searching of small collections are underutilized in light of advanced server-side technologies. Focussing on search engin...

  8. Semantic-Based Information Retrieval of Biomedical Data

    Energy Technology Data Exchange (ETDEWEB)

    Jiao, Yu [ORNL; Potok, Thomas E [ORNL; Hurson, Ali R. [Pennsylvania State University; Yan, Peng [Pennsylvania State University

    2006-01-01

    In this paper, we propose to improve the effectiveness of biomedical information retrieval via a medical thesaurus. We analyzed the deficiencies of the existing medical thesauri and reconstructed a new thesaurus, called MEDTHES, which follows the ANSI/NISO Z39.19-2003 standard. MEDTHES also endows the users with fine-grained control of information retrieval by providing functions to calculate the semantic similarity between words. We demonstrate the usage of MEDTHES through an existing data search engine.

  9. Adaptive Visualization for Focused Personalized Information Retrieval

    Science.gov (United States)

    Ahn, Jae-wook

    2010-01-01

    The new trend on the Web has totally changed today's information access environment. The traditional information overload problem has evolved into the qualitative level beyond the quantitative growth. The mode of producing and consuming information is changing and we need a new paradigm for accessing information. Personalized search is one of…

  10. A Process Model for Goal-Based Information Retrieval

    Directory of Open Access Journals (Sweden)

    Harvey Hyman

    2014-12-01

    In this paper we examine the domain of information search and propose a "goal-based" approach to study search strategy. We describe "goal-based information search" using a framework of Knowledge Discovery. We identify two Information Retrieval (IR) goals using the constructs of Knowledge Acquisition (KA) and Knowledge Explanation (KE). We classify these constructs into two specific information problems: an exploration-exploitation problem and an implicit-explicit problem. Our proposed framework is an extension of prior work in this domain, applying an IR Process Model originally developed for Legal-IR and adapted to Medical-IR. The approach in this paper is guided by the recent ACM-SIG Medical Information Retrieval (MedIR) Workshop definition: "methodologies and technologies that seek to improve access to medical information archives via a process of information retrieval."

  11. Adaptive multi-agent system for information retrieval

    Science.gov (United States)

    Maleki-dizaji, Saeedeh; Nyongesa, H. O.; Siddiqqi, J.

    2001-10-01

    The current exponential growth of the Internet precipitates a need for improved tools to help people cope with the volume of information available. Existing search engines such as Yahoo, AltaVista and Excite are efficient in terms of high recall (percentage of relevant documents that are retrieved from the Internet) and fast response time, at the cost of poor precision (percentage of documents retrieved that are considered relevant). The problem is due to the lack of filtering, lack of specialisation, lack of relevance feedback, lack of adaptation and lack of exploration. One solution for the above problems is to use intelligent agents, which can operate autonomously and become better over time. The agents rely on a user model to improve their performance in retrieving the information. This paper presents an adaptive information retrieval (IR) system that learns from user feedback through an evolutionary method, namely genetic algorithms (GA).
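
    The recall and precision definitions given in this record can be written out as a short sketch. The function name `precision_recall` and the example document ids are hypothetical, chosen only for illustration.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) as fractions for one query.

    precision = relevant retrieved documents / all retrieved documents
    recall    = relevant retrieved documents / all relevant documents
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical query: four documents returned, three documents actually relevant.
precision, recall = precision_recall(retrieved=[1, 2, 3, 4], relevant=[2, 4, 5])
```

    Here two of the four returned documents are relevant (precision 0.5) and two of the three relevant documents were found (recall about 0.67), which shows how an engine can score well on one measure and poorly on the other.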

  12. Associative conceptual space-based information retrieval systems

    NARCIS (Netherlands)

    M.J. Schuemie (Martijn); J.H. van den Berg (Jan)

    1998-01-01

    In this 'Information Era', with the availability of large collections of books, articles, journals, CD-ROMs, video films and so on, there exists an increasing need for intelligent information retrieval systems that enable users to find the information desired easily. Many attempts have

  13. Bibliometrics and Information Retrieval - Creating Knowledge through Research Synergies

    NARCIS (Netherlands)

    Bar-Ilan, Judit; Koopman, Rob; Wang, Shenghui; Scharnhorst, Andrea; John, Marcus; Mayr, Philipp; Wolfram, Dietmar

    2016-01-01

    This panel brings together experts in bibliometrics and information retrieval to discuss how each of these two important areas of information science can help to inform the research of the other. There is a growing body of literature that capitalizes on the synergies created by combining

  14. Roogle: an information retrieval engine for clinical data warehouse.

    Science.gov (United States)

    Cuggia, Marc; Garcelon, Nicolas; Campillo-Gimenez, Boris; Bernicot, Thomas; Laurent, Jean-François; Garin, Etienne; Happe, André; Duvauferrier, Régis

    2011-01-01

    A large amount of relevant information is contained in reports stored in electronic patient records and associated metadata. R-oogle is a project aiming at developing information retrieval engines adapted to these reports and designed for clinicians. The system consists of a data warehouse (full-text reports and structured data) imported from two different hospital information systems. Information retrieval is performed using metadata-based semantic and full-text search methods (as in Google). Applications may include biomarker identification in a translational approach, search for specific cases, constitution of cohorts, professional practice evaluation, and quality control assessment.

  15. Locally decodable codes and private information retrieval schemes

    CERN Document Server

    Yekhanin, Sergey

    2010-01-01

    Locally decodable codes (LDCs) are codes that simultaneously provide efficient random access retrieval and high noise resilience by allowing reliable reconstruction of an arbitrary bit of a message by looking at only a small number of randomly chosen codeword bits. Local decodability comes with a certain loss in terms of efficiency - specifically, locally decodable codes require longer codeword lengths than their classical counterparts. Private information retrieval (PIR) schemes are cryptographic protocols designed to safeguard the privacy of database users. They allow clients to retrieve rec

  16. Information Retrieval and Criticality in Parity-Time-Symmetric Systems

    Science.gov (United States)

    Kawabata, Kohei; Ashida, Yuto; Ueda, Masahito

    2017-11-01

    By investigating information flow between a general parity-time (PT-)symmetric non-Hermitian system and an environment, we find that the complete information retrieval from the environment can be achieved in the PT-unbroken phase, whereas no information can be retrieved in the PT-broken phase. The PT-transition point thus marks the reversible-irreversible criticality of information flow, around which many physical quantities such as the recurrence time and the distinguishability between quantum states exhibit power-law behavior. Moreover, by embedding a PT-symmetric system into a larger Hilbert space so that the entire system obeys unitary dynamics, we reveal that behind the information retrieval lies a hidden entangled partner protected by PT symmetry. Possible experimental situations are also discussed.

  17. Semantic association ranking schemes for information retrieval ...

    Indian Academy of Sciences (India)

    ... relevance, multimedia, information, video, image, answer, text}. Doc 9. {google, search, engine, personalization, information, text, multimedia}. Figure 8. Term association graph on real data with 50 nodes. Table 6. User search interest value table. Session ID. Software. Algorithms. Healthcare. Sports. Movies. Music. S1.

  18. Foundations of Large-Scale Multimedia Information Management and Retrieval

    CERN Document Server

    Chang, Edward Y

    2011-01-01

    "Foundations of Large-Scale Multimedia Information Management and Retrieval - Mathematics of Perception" covers knowledge representation and semantic analysis of multimedia data and scalability in signal extraction, data mining, and indexing. The book is divided into two parts: Part I - Knowledge Representation and Semantic Analysis focuses on the key components of mathematics of perception as it applies to data management and retrieval. These include feature selection/reduction, knowledge representation, semantic analysis, distance function formulation for measuring similarity, and

  19. Interdisciplinary perspectives on abstracts for information retrieval

    Directory of Open Access Journals (Sweden)

    Soon Keng Chan

    2004-10-01

    The paper examines the abstract genre from the perspectives of English for Specific Purposes (ESP) practitioners and information professionals. It aims to determine specific interdisciplinary interests in the abstract, and to explore areas of collaboration in terms of research and pedagogical practices. A focus group (FG) comprising information professionals from the Division of Information Studies, Nanyang Technological University, Singapore, convened for a discussion on the subject of abstracts and abstracting. Two major issues that have significant implications for ESP practices emerged during the discussion. While differences in terms of approach to and objectives of the abstract genre are apparent between information professionals and language professionals, the demands for specific cognitive processes involved in abstracting proved to be similar. This area of similarity provides grounds for awareness raising and collaboration between the two disciplines. While ESP practitioners need to consider adding the dimension of information science to the rhetorical and linguistic scaffolding that they have been providing to novice-writers, information professionals can contribute useful insights about the qualities of abstracts that have the greatest impact in meeting the end-users' needs in information search.

  20. Web Information Retrieval System for Technological Forecasting

    OpenAIRE

    Montiel, Raúl; Lezcano Airaldi, Luis; Favret, Fabián; Eckert, Karina

    2017-01-01

    Technological Forecasting and Competitive Intelligence are two different disciplines that, used together, provide the organizations with an invaluable analytic tool for the environment and the competing companies’ behavior. This kind of technology can be used for extracting useful information to make strategic decisions. This paper describes a Web mining system which gathers the users’ information requirements through a series of guided questions, constructs various search keys with the answe...

  1. User's perspective: Information retrieval and usability

    Directory of Open Access Journals (Sweden)

    Salvador Zambrano Silva

    2008-02-01

    The aim is to share some ideas for improving the online database of "Defensor del Pueblo Andaluz", starting from a user study and a bibliographic analysis. Our intention is to create an interface that makes interaction much easier and works as a connecting bridge between the document's information structure and the user's knowledge structure, with the sole purpose of improving the user's level of satisfaction with the results of information searches.

  2. Subsampling phase retrieval for rapid thermal measurements of heated microstructures.

    Science.gov (United States)

    Taylor, Lucas N; Talghader, Joseph J

    2016-07-15

    A subsampling technique for real-time phase retrieval of high-speed thermal signals is demonstrated with heated metal lines such as those found in microelectronic interconnects. The thermal signals were produced by applying a current through aluminum resistors deposited on soda-lime-silica glass, and the resulting refractive index changes were measured using a Mach-Zehnder interferometer with a microscope objective and high-speed camera. The temperatures of the resistors were measured both by the phase-retrieval method and by monitoring the resistance of the aluminum lines. The method used to analyze the phase is at least 60× faster than the state of the art but it maintains a small spatial phase noise of 16 nm, remaining comparable to the state of the art. For slowly varying signals, the system is able to perform absolute phase measurements over time, distinguishing temperature changes as small as 2 K. With angular scanning or structured illumination improvements, the system could also perform fast thermal tomography.

  3. Music information retrieval in compressed audio files: a survey

    Science.gov (United States)

    Zampoglou, Markos; Malamos, Athanasios G.

    2014-07-01

    In this paper, we present an organized survey of the existing literature on music information retrieval systems in which descriptor features are extracted directly from the compressed audio files, without prior decompression to pulse-code modulation format. Avoiding the decompression step and utilizing the readily available compressed-domain information can significantly lighten the computational cost of a music information retrieval system, allowing application to large-scale music databases. We identify a number of systems relying on compressed-domain information and form a systematic classification of the features they extract, the retrieval tasks they tackle and the degree to which they achieve an actual increase in the overall speed, as well as any resulting loss in accuracy. Finally, we discuss recent developments in the field, and the potential research directions they open toward ultra-fast, scalable systems.

  4. Information Retrieval Using Hadoop Big Data Analysis

    Science.gov (United States)

    Motwani, Deepak; Madan, Madan Lal

    This paper concerns big data analysis, the process of probing huge amounts of information in an attempt to uncover unseen patterns. Through big data analytics applications, organizations in the public and private sectors have made a strategic decision to turn big data into competitive advantage. The primary task of extracting value from big data gives rise to a process for pulling information from multiple different sources, known as extract, transform, and load. The approach in this paper extracts information from log files and research papers, which reduces the effort needed for pattern finding and document summarization from several positions. The work helps to better understand basic Hadoop concepts and improves the user experience for research. In this paper, we propose an approach for analyzing log files with Hadoop to find concise information that is useful and time saving. Our proposed approach will be applied to different research papers in a specific domain to obtain summarized content for further improvement and for creating new content.

  5. Acquisition and retrieval of ophthalmology academic information

    Directory of Open Access Journals (Sweden)

    Lei Li

    2014-06-01

    This article discusses how to search for and access ophthalmology information through specialized websites and resources, introducing databases, search engines, electronic journals, electronic books and so on. It aims to help ophthalmic practitioners carry out scientific research and clinical practice.

  6. Dutch Speech Recognition in Multimedia Information Retrieval

    NARCIS (Netherlands)

    Ordelman, Roeland J.F.; Ordelman, Roeland Jacobus Frederik

    2003-01-01

    As data storage capacities grow to nearly unlimited sizes thanks to ever ongoing hardware and software improvements, an increasing amount of information is being stored in multimedia and spoken-word collections. Assuming that the intention of data storage is to use (portions of) it some later time,

  7. Semantic knowledge representation for information retrieval

    CERN Document Server

    Gödert, Winfried; Nagelschmidt, Matthias

    2014-01-01

    This book covers the basics of semantic web technologies and indexing languages, and describes their contribution to improve languages as a tool for subject queries and knowledge exploration. The book is relevant to information scientists, knowledge workers and indexers. It provides a suitable combination of theoretical foundations and practical applications.

  8. Learning to merge search results for efficient Distributed Information Retrieval

    NARCIS (Netherlands)

    Tjin-Kam-Jet, Kien; Hiemstra, Djoerd

    2010-01-01

    Merging search results from different servers is a major problem in Distributed Information Retrieval. We used Regression-SVM and Ranking-SVM which would learn a function that merges results based on information that is readily available: i.e. the ranks, titles, summaries and URLs contained in the

  9. LOGISTIC MANAGEMENT INFORMATION SYSTEM - MANUAL DATA STORAGE AND RETRIEVAL SYSTEM.

    Science.gov (United States)

    Logistics Management Information System. The procedures are applicable to manual storage and retrieval of all data used in the Logistics Management ... Information System (LMIS) and include the following: (1) Action Officer data source file. (2) Action Officer presentation format file. (3) LMI Coordination

  10. Level Search Schemes for Information Filtering and Retrieval.

    Science.gov (United States)

    Zhang, Xiaoyan; Berry, Michael W.; Raghavan, Padma

    2001-01-01

    Discusses latent semantic indexing (LSI); considers the high cost associated with the singular value decomposition (SVD) of the large term-by-document matrix that becomes a barrier for its application to scalable information retrieval; and shows that information filtering using level search techniques can reduce the SVD computation cost for LSI.…

  11. Introduction to Web Information Retrieval: A User Perspective

    Indian Academy of Sciences (India)

    Resonance – Journal of Science Education, Volume 7, Issue 6. Introduction to Web Information Retrieval: A User Perspective - How to get ... Srinath Srinivasa; Pramod Chandra P Bhatt. Indian Institute of Information Technology, International Technology Park, Whitefield Road, Bangalore 560066, India.

  12. Information Retrieval and Graph Analysis Approaches for Book Recommendation

    Directory of Open Access Journals (Sweden)

    Chahinez Benkoussas

    2015-01-01

    A combination of multiple information retrieval approaches is proposed for the purpose of book recommendation. In this paper, book recommendation is based on complex user queries. We used different theoretical retrieval models, the probabilistic InL2 (a Divergence from Randomness model) and a language model, and tested their interpolated combination. Graph analysis algorithms such as PageRank have been successful in Web environments. We consider the application of this algorithm in a new retrieval approach to a related-document network composed of social links. We call the network constructed from documents and the social information provided by each of them a Directed Graph of Documents (DGD). Specifically, this work tackles the problem of book recommendation in the context of the INEX (Initiative for the Evaluation of XML Retrieval) Social Book Search track. A series of reranking experiments demonstrates that combining retrieval models yields significant improvements in terms of standard ranked retrieval metrics. These results extend the applicability of link analysis algorithms to different environments.
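
    The two ideas in this abstract, interpolating the scores of two retrieval models and reranking with PageRank over a document graph, can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the min-max normalization, the weights `lam` and `mu`, the function names, and the toy graph format.

```python
def min_max(scores):
    """Rescale a {doc: score} map to [0, 1] so that two models are comparable."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {d: 0.0 for d in scores}
    return {d: (s - lo) / (hi - lo) for d, s in scores.items()}


def interpolate(scores_a, scores_b, lam=0.5):
    """Linear interpolation lam * a + (1 - lam) * b over the union of documents."""
    a, b = min_max(scores_a), min_max(scores_b)
    docs = set(a) | set(b)
    return {d: lam * a.get(d, 0.0) + (1 - lam) * b.get(d, 0.0) for d in docs}


def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank on a directed graph given as {node: [successors]}."""
    nodes = set(graph) | {v for succ in graph.values() for v in succ}
    n = len(nodes)
    rank = {u: 1.0 / n for u in nodes}
    for _ in range(iterations):
        new = {u: (1 - damping) / n for u in nodes}
        for u in nodes:
            out = graph.get(u, [])
            if out:
                share = damping * rank[u] / len(out)
                for v in out:
                    new[v] += share
            else:
                # Dangling node: spread its rank mass uniformly over all nodes.
                for v in nodes:
                    new[v] += damping * rank[u] / n
        rank = new
    return rank


def rerank(text_scores, graph, mu=0.2):
    """Mix normalized text scores with normalized PageRank (mu is hypothetical)."""
    ts = min_max(text_scores)
    pr = min_max(pagerank(graph))
    combined = {d: (1 - mu) * ts[d] + mu * pr.get(d, 0.0) for d in ts}
    return sorted(combined, key=combined.get, reverse=True)
```

    A call such as `rerank(interpolate(model_a_scores, model_b_scores), graph)` would return document identifiers ordered by the mixed score; how the weights are tuned is left open here.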

  13. Rare disease diagnosis as an information retrieval task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend to be long lists of symptoms, often containing phrases, whereas web IR systems typically expect very short keyword-based queries. Motivated by such differences, this work uses a preliminary study of 30 clinical cases to reflect on rare disease retrieval as an IR task. Initial experiments using both Google web search and offline retrieval from a rare disease collection indicate that the retrieval of rare diseases is an open problem with room for improvement.

  15. MIRANDA - Music Information Retrieval And Data Acquisition

    DEFF Research Database (Denmark)

    Lehn-Schiøler, Tue; Petersen, Kaare Brandt; Hansen, Lars Kai

    2006-01-01

    In this report we present a music data harvesting system based on a plug-in for a popular music player. When a user is playing a song using the plug-in, information about the song is anonymously submitted to a server. The data gathered using MIRANDA is intended to be released to the MIR community. We argue that even though content-based data is of interest to the community, metadata and usage data can also be important for research in music similarity.

  16. Distributed Systems and Applications of Information Filtering and Retrieval

    CERN Document Server

    Giuliani, Alessandro; Semeraro, Giovanni; DART 2012

    2014-01-01

    This volume focuses on new challenges in distributed Information Filtering and Retrieval. It collects invited chapters and extended research contributions from the special session on Information Filtering and Retrieval: Novel Distributed Systems and Applications (DART) of the 4th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2012), held in Barcelona, Spain, on 4-7 October 2012. The main focus of DART was to discuss and compare suitable novel solutions based on intelligent techniques and applied to real-world applications. The chapters of this book present a comprehensive review of related works and the state of the art. Authors, both practitioners and researchers, shared their results on several topics such as "Multi-Agent Systems", "Natural Language Processing", "Automatic Advertisement", "Customer Interaction Analytics" and "Opinion Mining". Contributions have been carefully reviewed by experts in the area, who also gave useful suggestions to improve the quality of the volume.

  17. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Shozo Makino

    2007-01-01

    Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  18. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Suzuki Motoyuki

    2007-01-01

    Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  19. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Science.gov (United States)

    Suzuki, Motoyuki; Hosoya, Toru; Ito, Akinori; Makino, Shozo

    2006-12-01

    Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  20. Web-based multimedia information retrieval for clinical application research

    Science.gov (United States)

    Cao, Xinhua; Hoo, Kent S., Jr.; Zhang, Hong; Ching, Wan; Zhang, Ming; Wong, Stephen T. C.

    2001-08-01

    We described a web-based data warehousing method for retrieving and analyzing neurological multimedia information. The web-based method supports convenient access, effective search and retrieval of clinical textual and image data, and on-line analysis. To improve the flexibility and efficiency of multimedia information query and analysis, a three-tier, multimedia data warehouse for epilepsy research has been built. The data warehouse integrates clinical multimedia data related to epilepsy from disparate sources and archives them into a well-defined data model.

  1. Utilizing Mind-Maps for Information Retrieval and User Modelling

    OpenAIRE

    Beel, Joeran; Langer, Stefan; Genzmehr, Marcel; Gipp, Bela

    2014-01-01

    Mind-maps have been widely neglected by the information retrieval (IR) community. However, there are an estimated two million active mind-map users, who create 5 million mind-maps every year, of which a total of 300,000 is publicly available. We believe this to be a rich source for information retrieval applications, and present eight ideas on how mind-maps could be utilized by them. For instance, mind-maps could be utilized to generate user models for recommender systems or expert search, or...

  2. Improving life sciences information retrieval using semantic web technology.

    Science.gov (United States)

    Quan, Dennis

    2007-05-01

    The ability to retrieve relevant information is at the heart of every aspect of research and development in the life sciences industry. Information is often distributed across multiple systems and recorded in a way that makes it difficult to piece together the complete picture. Differences in data formats, naming schemes and network protocols amongst information sources, both public and private, must be overcome, and user interfaces not only need to be able to tap into these diverse information sources but must also assist users in filtering out extraneous information and highlighting the key relationships hidden within an aggregated set of information. The Semantic Web community has made great strides in proposing solutions to these problems, and many efforts are underway to apply Semantic Web techniques to the problem of information retrieval in the life sciences space. This article gives an overview of the principles underlying a Semantic Web-enabled information retrieval system: creating a unified abstraction for knowledge using the RDF semantic network model; designing semantic lenses that extract contextually relevant subsets of information; and assembling semantic lenses into powerful information displays. Furthermore, concrete examples of how these principles can be applied to life science problems including a scenario involving a drug discovery dashboard prototype called BioDash are provided.

  3. Experiences with automated categorization in e-government information retrieval

    DEFF Research Database (Denmark)

    Jonasen, Tanja Svarre; Lykke, Marianne

    2014-01-01

    High-precision search results are essential for supporting e-government employees’ information tasks. Prior studies have shown that existing features of e-government retrieval systems need improvement in terms of search facilities (e.g., Goh et al. 2008), navigation (e.g., de Jong and Lentz 2006) and metadata (e.g., Kopackova, Michalek and Cejna 2010). This paper investigates how automated categorization can enhance information organization and retrieval, and presents the results of a realistic evaluation that compared automated categorization with free text indexing of the government intranet used ... documents were retrieved. The findings emphasise the importance of simultaneous search options for e-government IR systems, and reveal that automated categorization is valuable in improving search facilities in e-government.

  4. The semantics of similarity in geographic information retrieval

    Directory of Open Access Journals (Sweden)

    Krzysztof Janowicz

    2011-05-01

    Similarity measures have a long tradition in fields such as information retrieval, artificial intelligence, and cognitive science. In recent years, these measures have been extended and reused to measure semantic similarity; i.e., for comparing meanings rather than syntactic differences. Various measures for spatial applications have been developed, but a solid foundation for answering what they measure; how they are best applied in information retrieval; which role contextual information plays; and how similarity values or rankings should be interpreted is still missing. It is therefore difficult to decide which measure should be used for a particular application or to compare results from different similarity theories. Based on a review of existing similarity measures, we introduce a framework to specify the semantics of similarity. We discuss similarity-based information retrieval paradigms as well as their implementation in web-based user interfaces for geographic information retrieval to demonstrate the applicability of the framework. Finally, we formulate open challenges for similarity research.

  5. Information Retrieval in Telemedicine: a Comparative Study on Bibliographic Databases.

    Science.gov (United States)

    Ahmadi, Maryam; Sarabi, Roghayeh Ershad; Orak, Roohangiz Jamshidi; Bahaadinbeigy, Kambiz

    2015-06-01

    The first step in any systematic review is selecting the most valid database, one that can provide the highest number of relevant references. This study was carried out to determine the most suitable database for information retrieval in the field of telemedicine. The CINAHL, PubMed, Web of Science and Scopus databases were searched for telemedicine combined with education, cost benefit and patient satisfaction. After analysis of the obtained results, the accuracy coefficient, sensitivity, uniqueness and overlap of the databases were calculated. The studied databases differed in the number of retrieved articles. PubMed was identified as the most suitable database for retrieving information on the selected topics, with accuracy and sensitivity ratios of 50.7% and 61.4%, respectively. The percentage of unique retrieved articles ranged from 38% for PubMed to 3.0% for CINAHL. The highest overlap rate (18.6%) was found between PubMed and Web of Science. Less than 1% of articles were indexed in all searched databases. PubMed is suggested as the most suitable database for starting a search in telemedicine; after PubMed, Scopus and Web of Science can retrieve about 90% of the relevant articles.
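
    The abstract names four measures but does not define them. The Python sketch below shows one common set-based reading, purely as an assumption: accuracy as precision (relevant retrieved over retrieved), sensitivity as recall against the pooled set of relevant articles, uniqueness as the share of a database's results found in no other database, and overlap as the Jaccard ratio of two result sets. The function names and toy identifiers are hypothetical.

```python
def precision(retrieved, relevant):
    """Share of a database's retrieved articles that are relevant."""
    return len(retrieved & relevant) / len(retrieved) if retrieved else 0.0


def sensitivity(retrieved, relevant):
    """Share of all known relevant articles that the database retrieved."""
    return len(retrieved & relevant) / len(relevant) if relevant else 0.0


def uniqueness(name, results):
    """Share of one database's articles that no other database returned."""
    others = set().union(*(r for n, r in results.items() if n != name))
    own = results[name]
    return len(own - others) / len(own) if own else 0.0


def overlap(a, b):
    """Jaccard overlap between two result sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Toy data: article identifiers per database (hypothetical).
results = {"A": {1, 2, 3, 4}, "B": {3, 4, 5}, "C": {6}}
relevant = {1, 3, 5, 6}
for db, retrieved in results.items():
    print(db, precision(retrieved, relevant), sensitivity(retrieved, relevant),
          uniqueness(db, results))
```

    Other definitions are possible (for example, overlap relative to the smaller set), so the percentages in the abstract should not be expected to follow from these exact formulas.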

  6. Interdisciplinarity and Computer Music Modeling and Information Retrieval

    DEFF Research Database (Denmark)

    Grund, Cynthia M.

    2006-01-01

    This paper takes a look at computer music modeling and information retrieval (CMMIR) from the point of view of the humanities with emphasis upon areas relevant to the philosophy of music. The desire for more interdisciplinary research involving CMMIR and the humanities is expressed...

  7. Disambiguation strategies for cross-language information retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; de Jong, Franciska M.G.

    1999-01-01

    This paper gives an overview of tools and methods for Cross-Language Information Retrieval (CLIR) that are developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue

  8. Scientometrics and information retrieval: weak-links revitalized

    NARCIS (Netherlands)

    Mayr, Philipp; Scharnhorst, Andrea

    This special issue brings together eight papers from experts of communities which often have been perceived as different ones: bibliometrics, scientometrics and informetrics on the one side and information retrieval on the other. The idea of this special issue started at the workshop ‘‘Combining

  9. SLIMMER--A UNIX System-Based Information Retrieval System.

    Science.gov (United States)

    Waldstein, Robert K.

    1988-01-01

    Describes an information retrieval system developed at Bell Laboratories to create and maintain a variety of different but interrelated databases, and to provide controlled access to these databases. The components discussed include the interfaces, indexing rules, display languages, response time, and updating procedures of the system. (6 notes…

  10. A cross-lingual framework for monolingual biomedical information retrieval

    NARCIS (Netherlands)

    Trieschnigg, D.; Hiemstra, D.; Jong, F. de; Kraaij, W.

    2010-01-01

    An important challenge for biomedical information retrieval (IR) is dealing with the complex, inconsistent and ambiguous biomedical terminology. Frequently, a concept-based representation defined in terms of a domain-specific terminological resource is employed to deal with this challenge. In this

  11. Status report on SIRS: sorption information retrieval system

    Energy Technology Data Exchange (ETDEWEB)

    Hostetler, D.D.; Serne, R.J.; Baldwin, A.J.; Petrie, G.M.

    1980-11-01

    Two major uses were identified for the Sorption Information Retrieval System: (1) to aid geochemists in the elucidation of sorption mechanisms; and (2) to aid safety assessment modelers in selection of Kds for any given scenario. Other benefits such as providing an auditable vehicle for the Kd selection were also discussed.

  12. Professional assistance to users of information retrieval tools at the ...

    African Journals Online (AJOL)

    The study investigated the need for professional assistance to users of information retrieval tools at the National Library of Nigeria, Enugu branch. A total of 38 (thirty-eight) users of the library were randomly selected and used for the study. It was found that most of the respondents, 18 (47.3%), consulted the card catalogue ...

  13. Support Vector Machines: Relevance Feedback and Information Retrieval.

    Science.gov (United States)

    Drucker, Harris; Shahrary, Behzad; Gibbon, David C.

    2002-01-01

    Compares support vector machines (SVMs) to Rocchio, Ide regular and Ide dec-hi algorithms in information retrieval (IR) of text documents using relevancy feedback. If the preliminary search is so poor that one has to search through many documents to find at least one relevant document, then SVM is preferred. Includes nine tables. (Contains 24…
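
    For readers unfamiliar with the Rocchio baseline named above, here is a minimal Python sketch of the standard update, q' = alpha*q + beta*mean(relevant) - gamma*mean(nonrelevant), on sparse term-weight dictionaries. The default weights and the clipping of negative term weights are conventional choices assumed here; they are not taken from the paper, and the Ide variants and the SVM approach are not shown.

```python
from collections import defaultdict


def rocchio(query, relevant, nonrelevant, alpha=1.0, beta=0.75, gamma=0.15):
    """Return an updated query vector from relevance feedback.

    query: {term: weight}; relevant / nonrelevant: lists of {term: weight}
    document vectors that the user judged.
    """
    new_query = defaultdict(float)
    for term, weight in query.items():
        new_query[term] += alpha * weight
    # Add the centroid of relevant documents, subtract that of nonrelevant ones.
    for docs, coeff in ((relevant, beta), (nonrelevant, -gamma)):
        if not docs:
            continue
        for doc in docs:
            for term, weight in doc.items():
                new_query[term] += coeff * weight / len(docs)
    # Assumption: terms whose weight becomes non-positive are dropped.
    return {t: w for t, w in new_query.items() if w > 0}
```

    The updated vector would then be used to rerun the search, and the feedback loop can be repeated over several rounds.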

  14. Creating an Information Retrieval test corpus for Dutch

    NARCIS (Netherlands)

    Hiemstra, Djoerd; van Leeuwen, D.A.; Theune, M.; Theune, Mariet; Nijholt, Antinus; Nijholt, A.; Hondorp, G.H.W.; Hondorp, H.

    2002-01-01

    This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual test corpus, and give an overview of the experimental results of

  15. Towards an Intelligent Possibilistic Web Information Retrieval Using Multiagent System

    Science.gov (United States)

    Elayeb, Bilel; Evrard, Fabrice; Zaghdoud, Montaceur; Ahmed, Mohamed Ben

    2009-01-01

    Purpose: The purpose of this paper is to make a scientific contribution to web information retrieval (IR). Design/methodology/approach: A multiagent system for web IR is proposed based on new technologies: Hierarchical Small-Worlds (HSW) and Possibilistic Networks (PN). This system is based on a possibilistic qualitative approach which extends the…

  16. Why Information Retrieval Needs Cognitive Science: A call to arms

    NARCIS (Netherlands)

    Hoenkamp, E.C.M.

    2005-01-01

    Much of today’s success in Information Retrieval (IR) comes from a hard approach: employing blazingly fast machines, ever more refined statistics, and increasingly powerful classification schemes. In recent years, however, the hard approach has entered a phase of diminishing returns. This paper

  17. Design of an indigenous music information storage and retrieval ...

    African Journals Online (AJOL)

    MOI) and the Music Library of the Cultural Affairs (ML-CA) of Eritrea. The main aim of the study was to design an appropriate Indigenous Music Information Storage and Retrieval System for Eritrea. A quantitative approach was mainly used to ...

  18. Bibliometric-enhanced Information Retrieval : 2nd International BIR Workshop

    NARCIS (Netherlands)

    Mayr, Philipp; Frommholz, Ingo; Scharnhorst, Andrea; Mutschke, Peter

    2015-01-01

    This workshop brings together experts of communities which often have been perceived as different ones: bibliometrics / scientometrics / informetrics on the one side and information retrieval on the other. Our motivation as organizers of the workshop started from the observation that main discourses

  19. A Survey of Query Auto Completion in Information Retrieval

    NARCIS (Netherlands)

    Cai, F.; de Rijke, M.

    2016-01-01

    In information retrieval, query auto completion (QAC), also known as type-ahead [Xiao et al., 2013, Cai et al., 2014b] and auto-complete suggestion [Jain and Mishne, 2010], refers to the following functionality: given a prefix consisting of a number of characters entered into a search box, the user
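
    As a rough illustration of the functionality described, the Python sketch below implements a simple popularity-based completer: past queries are counted, kept in sorted order, and the completions of a prefix are ranked by frequency. This baseline, the class name `PrefixCompleter` and the toy query log are assumptions for illustration; the survey covers far more refined models.

```python
import bisect
from collections import Counter


class PrefixCompleter:
    """Suggest the most frequent past queries that start with a typed prefix."""

    def __init__(self, query_log):
        self.counts = Counter(q.strip().lower() for q in query_log)
        self.sorted_queries = sorted(self.counts)

    def complete(self, prefix, k=3):
        prefix = prefix.lower()
        # In sorted order, all queries sharing the prefix form one contiguous run
        # that starts at the insertion point of the prefix itself.
        i = bisect.bisect_left(self.sorted_queries, prefix)
        matches = []
        while i < len(self.sorted_queries) and self.sorted_queries[i].startswith(prefix):
            matches.append(self.sorted_queries[i])
            i += 1
        # Rank by descending frequency, breaking ties alphabetically.
        return sorted(matches, key=lambda q: (-self.counts[q], q))[:k]


log = ["information retrieval", "information retrieval", "information theory",
       "inverted index", "music retrieval"]
completer = PrefixCompleter(log)
print(completer.complete("inf"))
```

    A trie would avoid scanning long runs of matches for very short prefixes; the sorted list is used here only to keep the sketch short.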

  20. Conventional and Knowledge-Based Information Retrieval with Prolog.

    Science.gov (United States)

    Leigh, William; Paz, Noemi

    1988-01-01

    Describes the use of PROLOG to program knowledge-based information retrieval systems, in which the knowledge contained in a document is translated into machine processable logic. Several examples of the resulting search process, and the program rules supporting the process, are given. (10 references) (CLB)

  1. Information Retrieval for Education: Making Search Engines Language Aware

    Science.gov (United States)

    Ott, Niels; Meurers, Detmar

    2010-01-01

    Search engines have been a major factor in making the web the successful and widely used information source it is today. Generally speaking, they make it possible to retrieve web pages on a topic specified by the keywords entered by the user. Yet web searching currently does not take into account which of the search results are comprehensible for…

  2. Proof of Concept: Concept-based Biomedical Information Retrieval

    NARCIS (Netherlands)

    Trieschnigg, Rudolf Berend

    2010-01-01

    In this thesis we investigate the possibility to integrate domain-specific knowledge into biomedical information retrieval (IR). Recent decades have shown a fast growing interest in biomedical research, reflected by an exponential growth in scientific literature. Biomedical IR is concerned with the

  3. Icon Based Information Retrieval and Disease Identification in Agriculture

    OpenAIRE

    Mittal, Namita; Agarwal, Basant; Gupta, Ajay; Madhur, Hemant

    2014-01-01

    Recent developments in the ICT industry over the past few decades have enabled quick and easy access to the information available on the internet. However, digital literacy is a prerequisite for its use. The main purpose of this paper is to provide an interface that lets digitally illiterate users, especially farmers, retrieve information through the Internet efficiently and effectively. In addition, to enable the farmers to identify the disease in their crop, its cause and symptoms using digital image p...

  4. Lower-Cost epsilon-Private Information Retrieval

    OpenAIRE

    Toledo, Raphael R.; Danezis, George; Goldberg, Ian

    2016-01-01

    Private Information Retrieval (PIR), despite being well studied, is computationally costly and arduous to scale. We explore lower-cost relaxations of information-theoretic PIR, based on dummy queries, sparse vectors, and compositions with an anonymity system. We prove the security of each scheme using a flexible differentially private definition for private queries that can capture notions of imperfect privacy. We show that basic schemes are weak, but some of them can be made arbitrarily safe...

  5. Web User Profile Using XUL and Information Retrieval Techniques

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2008-12-01

    This paper presents the importance of the user profile in information retrieval, information filtering and recommender systems using explicit and implicit feedback. A Firefox extension (based on XUL) used for gathering the data needed to infer a web user profile and an example file with collected data are presented. An algorithm for creating and updating the user profile and keeping track of a fixed number k of subjects of interest is also presented.
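
    The abstract does not describe the update algorithm itself. Purely as a hypothetical illustration of "keeping track of a fixed number k of subjects of interest", the Python sketch below decays old subject weights, adds the weights observed on a newly visited page, and keeps only the k strongest subjects. The class name, the decay factor and the weighting scheme are all assumptions, not the paper's method.

```python
class UserProfile:
    """Hypothetical profile that retains at most k weighted subjects of interest."""

    def __init__(self, k=5, decay=0.9):
        self.k = k
        self.decay = decay
        self.weights = {}

    def update(self, page_subjects):
        """page_subjects: {subject: weight} inferred from one visited page."""
        # Older interests fade a little on every update.
        for subject in self.weights:
            self.weights[subject] *= self.decay
        for subject, weight in page_subjects.items():
            self.weights[subject] = self.weights.get(subject, 0.0) + weight
        # Keep only the k strongest subjects (ties broken alphabetically).
        top = sorted(self.weights.items(), key=lambda kv: (-kv[1], kv[0]))[: self.k]
        self.weights = dict(top)

    def subjects(self):
        """Subjects of interest, strongest first."""
        return sorted(self.weights, key=lambda s: (-self.weights[s], s))
```

    With implicit feedback, the per-page weights might come from signals such as time spent on a page; with explicit feedback, from user ratings. Both mappings are left open in this sketch.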

  6. Iterative Filtering of Retrieved Information to Increase Relevance

    Directory of Open Access Journals (Sweden)

    Robert Zeidman

    2007-12-01

    Efforts have been underway for years to find more effective ways to retrieve information from large knowledge domains. This effort is now being driven particularly by the Internet and the vast amount of information that is available to unsophisticated users. In the early days of the Internet, some effort involved allowing users to enter Boolean equations of search terms into search engines, for example, rather than just a list of keywords. More recently, effort has focused on understanding a user's desires from past search histories in order to narrow searches. Also there has been much effort to improve the ranking of results based on some measure of relevancy. This paper discusses using iterative filtering of retrieved information to focus in on useful information. This work was done for finding source code correlation and the author extends his findings to Internet searching and e-commerce. The paper presents specific information about a particular filtering application and then generalizes it to other forms of information retrieval.

  7. Use of information-retrieval languages in automated retrieval of experimental data from long-term storage

    Science.gov (United States)

    Khovanskiy, Y. D.; Kremneva, N. I.

    1975-01-01

    Problems and methods of automating information retrieval operations in a data bank used for long-term storage and retrieval of data from scientific experiments are discussed. Existing information retrieval languages are analyzed along with those being developed. The results of studies on the application of the descriptive 'Kristall' language used in the 'ASIOR' automated information retrieval system are presented. The development and use of a specialized language of the classification-descriptive type, using universal decimal classification indices as the main descriptors, is described.

  8. Speech-recognition interfaces for music information retrieval

    Science.gov (United States)

    Goto, Masataka

    2005-09-01

    This paper describes two hands-free music information retrieval (MIR) systems that enable a user to retrieve and play back a musical piece by saying its title or the artist's name. Although various interfaces for MIR have been proposed, speech-recognition interfaces suitable for retrieving musical pieces have not been studied. Our MIR-based jukebox systems employ two different speech-recognition interfaces for MIR, speech completion and speech spotter, which exploit intentionally controlled nonverbal speech information in original ways. The first is a music retrieval system with the speech-completion interface that is suitable for music stores and car-driving situations. When a user only remembers part of the name of a musical piece or an artist and utters only a remembered fragment, the system helps the user recall and enter the name by completing the fragment. The second is a background-music playback system with the speech-spotter interface that can enrich human-human conversation. When a user is talking to another person, the system allows the user to enter voice commands for music playback control by spotting a special voice-command utterance in face-to-face or telephone conversations. Experimental results from use of these systems have demonstrated the effectiveness of the speech-completion and speech-spotter interfaces. (Video clips: http://staff.aist.go.jp/m.goto/MIR/speech-if.html)

  9. Application of a regularized model inversion system (REGFLEC) to multi-temporal RapidEye imagery for retrieving vegetation characteristics

    Science.gov (United States)

    Houborg, Rasmus; McCabe, Matthew F.

    2015-10-01

    Accurate retrieval of canopy biophysical and leaf biochemical constituents from space observations is critical to diagnosing the functioning and condition of vegetation canopies across spatio-temporal scales. Retrieved vegetation characteristics may serve as important inputs to precision farming applications and as constraints in spatially and temporally distributed model simulations of water and carbon exchange processes. However, significant challenges remain in the translation of composite remote sensing signals into useful biochemical, physiological or structural quantities and treatment of confounding factors in spectrum-trait relations. Bands in the red-edge spectrum have particular potential for improving the robustness of retrieved vegetation properties. The development of observationally based vegetation retrieval capacities, effectively constrained by the enhanced information content afforded by bands in the red-edge, is a needed investment towards optimizing the benefit of current and future satellite sensor systems. In this study, a REGularized canopy reFLECtance model (REGFLEC) for joint leaf chlorophyll (Chll) and leaf area index (LAI) retrieval is extended to sensor systems with a band in the red-edge region for the first time. Application to time-series of 5 m resolution multi-spectral RapidEye data is demonstrated over an irrigated agricultural region in central Saudi Arabia, showcasing the value of satellite-derived crop information at this fine scale for precision management. Validation against in-situ measurements in fields of alfalfa, Rhodes grass, carrot and maize indicates improved accuracy of retrieved vegetation properties when exploiting red-edge information in the model inversion process.

  10. Application of a regularized model inversion system (REGFLEC) to multi-temporal RapidEye imagery for retrieving vegetation characteristics

    KAUST Repository

    Houborg, Rasmus

    2015-10-14

    Accurate retrieval of canopy biophysical and leaf biochemical constituents from space observations is critical to diagnosing the functioning and condition of vegetation canopies across spatio-temporal scales. Retrieved vegetation characteristics may serve as important inputs to precision farming applications and as constraints in spatially and temporally distributed model simulations of water and carbon exchange processes. However, significant challenges remain in the translation of composite remote sensing signals into useful biochemical, physiological or structural quantities and treatment of confounding factors in spectrum-trait relations. Bands in the red-edge spectrum have particular potential for improving the robustness of retrieved vegetation properties. The development of observationally based vegetation retrieval capacities, effectively constrained by the enhanced information content afforded by bands in the red-edge, is a needed investment towards optimizing the benefit of current and future satellite sensor systems. In this study, a REGularized canopy reFLECtance model (REGFLEC) for joint leaf chlorophyll (Chll) and leaf area index (LAI) retrieval is extended to sensor systems with a band in the red-edge region for the first time. Application to time-series of 5 m resolution multi-spectral RapidEye data is demonstrated over an irrigated agricultural region in central Saudi Arabia, showcasing the value of satellite-derived crop information at this fine scale for precision management. Validation against in-situ measurements in fields of alfalfa, Rhodes grass, carrot and maize indicates improved accuracy of retrieved vegetation properties when exploiting red-edge information in the model inversion process. © (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE).

  11. Agent Community based Peer-to-Peer Information Retrieval

    Science.gov (United States)

    Mine, Tsunenori; Matsuno, Daisuke; Amamiya, Makoto

    This paper proposes an agent community based information retrieval method, which uses agent communities to manage and look up information related to users. An agent works as a delegate of its user and searches for information that the user wants by communicating with other agents. The communication between agents is carried out in a peer-to-peer computing architecture. In order to retrieve information related to a user query, an agent uses two histories: a query/retrieved document history (Q/RDH) and a query/sender agent history (Q/SAH). The former is a list of pairs of a query and retrieved documents, where the queries were sent by the agent itself. The latter is a list of pairs of a query and sender agents and shows "who sent what query to the agent". This is useful for finding a new information source. Making use of the Q/SAH is expected to cause a collaborative filtering effect, which gradually creates virtual agent communities, where agents with the same interests stay together. Our hypothesis is that a virtual agent community reduces the communication load needed to perform a search. As an agent receives more queries, it acquires more links to new knowledge. From this behavior, a "give and take" (or positive feedback) effect for agents seems to emerge. We implemented this method with Multi-Agents Kodama, which has been developed in our laboratory, and conducted preliminary experiments to test the hypothesis. The empirical results showed that the method was much more efficient than a naive method that uses only 'broadcast' techniques to look up a target agent.

  12. A Novel Fuzzy Document Based Information Retrieval Model for Forecasting

    Directory of Open Access Journals (Sweden)

    Partha Roy

    2017-06-01

    Information retrieval systems are generally used to find the documents that are most appropriate to a query that comes dynamically from users. In this paper a novel Fuzzy Document based Information Retrieval Model (FDIRM) is proposed for the purpose of Stock Market Index forecasting. The novelty of the proposed approach is a modified tf-idf scoring scheme to predict the future trend of the stock market index. The contribution of this paper has two dimensions: (1) in the proposed system the simple time series is converted to an enriched fuzzy linguistic time series with a unique approach of incorporating market sentiment related information along with the price, and (2) a unique approach is followed while modeling the information retrieval (IR) system, which converts a simple IR system into a forecasting system. From the performance comparison of FDIRM with standard benchmark models it can be affirmed that the proposed model has the potential to become a good forecasting model. The stock market data provided by the Standard & Poor’s CRISIL NSE Index 50 (CNX NIFTY-50) index of the National Stock Exchange of India (NSE) is used to experiment with and validate the proposed model. The authentic data for validation and experimentation is obtained from http://www.nseindia.com, which is the official website of NSE. A Java program is under construction to implement the model in real time with a graphical user interface.
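
    For orientation, the standard tf-idf weighting that the abstract says FDIRM modifies can be sketched as follows. This is the textbook scheme only; the paper's modified scoring and its fuzzy linguistic time series are not reproduced here. The function name `tf_idf` and the toy documents are assumptions made for illustration.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document (textbook tf-idf)."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            # Relative term frequency times log inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Hypothetical toy corpus, not data from the paper.
docs = [
    ["index", "rises", "on", "positive", "sentiment"],
    ["index", "falls", "on", "negative", "sentiment"],
    ["index", "flat"],
]
weights = tf_idf(docs)
```

    A term that occurs in every document ("index" above) receives weight zero, while terms confined to a single document receive the largest weights.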

  13. 8th International Workshop on Information Filtering and Retrieval

    CERN Document Server

    Giuliani, Alessandro; Semeraro, Giovanni

    2017-01-01

    This book focuses on new research challenges in intelligent information filtering and retrieval. It collects invited chapters and extended research contributions from DART 2014 (the 8th International Workshop on Information Filtering and Retrieval), held in Pisa (Italy), on December 10, 2014, and co-hosted with the XIII AI*IA Symposium on Artificial Intelligence. The main focus of DART was to discuss and compare suitable novel solutions based on intelligent techniques and applied to real-world contexts. The chapters of this book present a comprehensive review of related works and the current state of the art. The contributions from both practitioners and researchers have been carefully reviewed by experts in the area, who also gave useful suggestions to improve the quality of the book.

  14. Informative Top-k Retrieval for Advanced Skill Management

    Science.gov (United States)

    Colucci, Simona; di Noia, Tommaso; Ragone, Azzurra; Ruta, Michele; Straccia, Umberto; Tinelli, Eufemia

    The paper presents a knowledge-based framework for skills and talent management based on advanced matchmaking between profiles of candidates and available job positions. The informative content of top-k retrieval is enriched through semantic capabilities. The proposed approach makes it possible to: (1) express a requested profile in terms of both hard constraints and soft ones; (2) provide a ranking function based also on qualitative attributes of a profile; (3) explain the resulting outcomes (given a job request, a motivation for the obtained score of each selected profile is provided). Top-k retrieval selects the most promising candidates according to an ontology formalizing the domain knowledge. Such knowledge is further exploited to provide a semantic-based explanation of missing or conflicting features in retrieved profiles. These explanations also indicate additional profile characteristics emerging from the retrieval procedure, which can be used for further request refinement. A concrete case study followed by an exhaustive experimental campaign is reported to prove the effectiveness of the approach.

  15. The Use of a Context-Based Information Retrieval Technique

    Science.gov (United States)

    2009-07-01

    Carlson, 2004). However, in order to reduce plagiarism and manipulation, the specific details of these algorithms are closely protected and changed...age, academic background and gender can affect performance using information retrieval systems (Borgman, 1989). These factors can result in...and academic qualifications, a large proportion of the sample were recruited from a third year level or higher. 2.2 Materials 2.2.1 Demographic

  16. Latest Trends in Web Information Retrieval and in SEO Factors

    Directory of Open Access Journals (Sweden)

    Carlos Gonzalo

    2015-07-01

    Latest trends in web information retrieval and in SEO factors are increasingly focused on signals from users, such as the profile of who performs the search and the interpretation of user intent. The objective of search engines is twofold: focusing as much as possible on users while making the composition of the search engine results page (SERP) ever less predictable, and combating spam.

  17. [SIBIL: an information tool for the information retrieval on bioethics].

    Science.gov (United States)

    Dracos, Adriana

    2004-01-01

    The article describes the main features of the website SIBIL (Sistema Informativo per la Bioetica In Linea) implemented within the framework of a research project of the ISS for collecting, indexing and disseminating Italian literature on bioethics since 1995 through an integrated electronic system. The site, addressed to a wide range of people interested at different degrees and levels in bioethics, offers a comprehensive overview of the activities, such as courses and meetings, on the major ethical issues at stake in Italy, as well as a survey of the most important activities both at national and international level. The main feature of SIBIL is a database of a large collection of documents retrieved through sources or exploitation of the most important international electronic databases. A thesaurus of 1,600 terms, available in Italian and English, was created in order to organize documents with standardized criteria currently adopted in the Italian scientific environment. Future trends of the website are also discussed for sharing experiences with other countries and laying the basis for a European portal on bioethics.

  18. Information Retrieval and Text Mining Technologies for Chemistry.

    Science.gov (United States)

    Krallinger, Martin; Rabal, Obdulia; Lourenço, Anália; Oyarzabal, Julen; Valencia, Alfonso

    2017-06-28

    Efficient access to chemical information contained in scientific literature, patents, technical reports, or the web is a pressing need shared by researchers and patent attorneys from different chemical disciplines. Retrieval of important chemical information in most cases starts with finding relevant documents for a particular chemical compound or family. Targeted retrieval of chemical documents is closely connected to the automatic recognition of chemical entities in the text, which commonly involves the extraction of the entire list of chemicals mentioned in a document, including any associated information. In this Review, we provide a comprehensive and in-depth description of fundamental concepts, technical implementations, and current technologies for meeting these information demands. A strong focus is placed on community challenges addressing systems performance, more particularly CHEMDNER and CHEMDNER patents tasks of BioCreative IV and V, respectively. Considering the growing interest in the construction of automatically annotated chemical knowledge bases that integrate chemical information and biological data, cheminformatics approaches for mapping the extracted chemical names into chemical structures and their subsequent annotation together with text mining applications for linking chemistry with biological information are also presented. Finally, future trends and current challenges are highlighted as a roadmap proposal for research in this emerging field.

  19. Lower-Cost ε-Private Information Retrieval

    Directory of Open Access Journals (Sweden)

    Toledo Raphael R.

    2016-10-01

    Private Information Retrieval (PIR), despite being well studied, is computationally costly and arduous to scale. We explore lower-cost relaxations of information-theoretic PIR, based on dummy queries, sparse vectors, and compositions with an anonymity system. We prove the security of each scheme using a flexible differentially private definition for private queries that can capture notions of imperfect privacy. We show that basic schemes are weak, but some of them can be made arbitrarily safe by composing them with large anonymity systems.
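
    As background for "information-theoretic PIR", the classical two-server XOR construction can be sketched as below. This is the well-known baseline scheme, not any of the relaxed schemes the paper proposes, and it assumes two servers that hold the same database and do not collude. The names `make_queries` and `answer` are illustrative assumptions.

```python
import secrets

def make_queries(n_records, wanted):
    """Build two bit masks that differ only at the wanted index."""
    mask_a = [secrets.randbelow(2) for _ in range(n_records)]
    mask_b = list(mask_a)
    mask_b[wanted] ^= 1  # flip exactly one position
    return mask_a, mask_b

def answer(database, mask):
    """Server side: XOR together every record selected by the mask."""
    acc = 0
    for record, bit in zip(database, mask):
        if bit:
            acc ^= record
    return acc

# Hypothetical database of small integer records.
database = [0x11, 0x22, 0x33, 0x44]
mask_a, mask_b = make_queries(len(database), 2)
# Every record selected by both masks cancels; only record 2 survives.
recovered = answer(database, mask_a) ^ answer(database, mask_b)
```

    Each server sees a uniformly random mask on its own, so neither learns the wanted index, but both must process the whole database, which illustrates the cost the abstract refers to.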

  20. An integrated information retrieval and document management system

    Science.gov (United States)

    Coles, L. Stephen; Alvarez, J. Fernando; Chen, James; Chen, William; Cheung, Lai-Mei; Clancy, Susan; Wong, Alexis

    1993-01-01

    This paper describes the requirements and prototype development for an intelligent document management and information retrieval system that will be capable of handling millions of pages of text or other data. Technologies for scanning, Optical Character Recognition (OCR), magneto-optical storage, and multiplatform retrieval using a Structured Query Language (SQL) will be discussed. The semantic ambiguity inherent in the English language is somewhat compensated for through the use of coefficients or weighting factors for partial synonyms. Such coefficients are used both for defining structured query trees for routine queries and for establishing long-term interest profiles that can be used on a regular basis to alert individual users to the presence of relevant documents that may have just arrived from an external source, such as a news wire service. Although this attempt at evidential reasoning is limited in comparison with the latest developments in AI Expert Systems technology, it has the advantage of being commercially available.

  1. Estimating Missing Features to Improve Multimedia Information Retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Bagherjeiran, A; Love, N S; Kamath, C

    2006-09-28

    Retrieval in a multimedia database usually involves combining information from different modalities of data, such as text and images. However, all modalities of the data may not be available to form the query. The retrieval results from such a partial query are often less than satisfactory. In this paper, we present an approach to complete a partial query by estimating the missing features in the query. Our experiments with a database of images and their associated captions show that, with an initial text-only query, our completion method has similar performance to a full query with both image and text features. In addition, when we use relevance feedback, our approach outperforms the results obtained using a full query.

  2. INFORMATION RETRIEVAL SYSTEM USING MULTIWORDS EXPRESSIONS (MWE) AS DESCRIPTORS

    Directory of Open Access Journals (Sweden)

    Edson Marchetti da Silva

    2012-08-01

    This paper proposes an alternative method for retrieving documents using Multiword Expressions (MWE) extracted from a document base as descriptors in searches of an Information Retrieval System (IRS). Unlike methods that treat the text as a set of words (bag of words), we propose a method that takes into account the characteristics of the physical structure of the document when extracting MWE. The resulting set of terms, pre-processed using an exhaustive algorithmic technique proposed by the authors, is compared with the results obtained for thirteen different statistical association measures generated by the software Ngram Statistics Package (NSP). The experiment was set up with a corpus of documents in digital format

  3. Rapid storage and retrieval of genomic intervals from a relational database system using nested containment lists.

    Science.gov (United States)

    Wiley, Laura K; Sivley, R Michael; Bush, William S

    2013-01-01

    Efficient storage and retrieval of genomic annotations based on range intervals is necessary, given the amount of data produced by next-generation sequencing studies. The indexing strategies of relational database systems (such as MySQL) greatly inhibit their use in genomic annotation tasks. This has led to the development of stand-alone applications that are dependent on flat-file libraries. In this work, we introduce MyNCList, an implementation of the NCList data structure within a MySQL database. MyNCList enables the storage, update and rapid retrieval of genomic annotations from the convenience of a relational database system. Range-based annotations of 1 million variants are retrieved in under a minute, making this approach feasible for whole-genome annotation tasks. Database URL: https://github.com/bushlab/mynclist.
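
    The nested containment list idea can be sketched in memory as follows. This is a minimal illustration of the general NCList data structure, not the MyNCList MySQL implementation; the names `Node`, `build_nclist` and `query_nclist`, the half-open coordinates, and the toy annotations are all assumptions. Intervals contained in another interval are pushed into that interval's sublist, so within any one list both starts and ends are sorted and a binary search finds the first candidate.

```python
class Node:
    def __init__(self, start, end, label):
        self.start, self.end, self.label = start, end, label
        self.children = []  # intervals fully contained in this one

def build_nclist(intervals):
    """intervals: iterable of (start, end, label), half-open [start, end)."""
    top, stack = [], []  # stack holds the chain of enclosing intervals
    for start, end, label in sorted(intervals, key=lambda iv: (iv[0], -iv[1])):
        node = Node(start, end, label)
        # Drop enclosing candidates that do not contain the new interval.
        while stack and not (stack[-1].start <= start and end <= stack[-1].end):
            stack.pop()
        (stack[-1].children if stack else top).append(node)
        stack.append(node)
    return top

def _first_ending_after(nodes, qs):
    """Binary search: index of the first node whose end is greater than qs."""
    lo, hi = 0, len(nodes)
    while lo < hi:
        mid = (lo + hi) // 2
        if nodes[mid].end > qs:
            hi = mid
        else:
            lo = mid + 1
    return lo

def query_nclist(nodes, qs, qe):
    """Labels of all intervals overlapping the half-open range [qs, qe)."""
    hits = []
    i = _first_ending_after(nodes, qs)
    while i < len(nodes) and nodes[i].start < qe:
        hits.append(nodes[i].label)
        hits.extend(query_nclist(nodes[i].children, qs, qe))
        i += 1
    return hits

# Hypothetical annotations, not data from the paper.
annotations = [
    (100, 500, "geneA"), (120, 200, "exon1"), (300, 400, "exon2"),
    (350, 351, "snp"), (450, 900, "geneB"),
]
top = build_nclist(annotations)
overlaps = query_nclist(top, 340, 360)
```

    In a relational setting the sublist membership would have to be stored as columns rather than object references; how MyNCList does this is not described in the abstract.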

  4. Working out and the description of the hypertext information retrieval thesaurus on algebra

    Directory of Open Access Journals (Sweden)

    Ирина Викторовна Кузнецова

    2011-09-01

    The article describes the development of a hypertext information retrieval thesaurus on algebra, carried out in the course of designing a hypertext information retrieval thesaurus for the metalanguage of a science.

  5. Retrieving Full Object Information from Partial Object Information using Digital Holography

    Science.gov (United States)

    Jackin, B. J.; Palanisamy, P. K.; Yatagai, T.

    2011-10-01

    Storage and retrieval of object information from a hologram using a partial object as input is reported. This method uses holographic associative memory principles combined with digital image processing techniques. The inclusion of digital image processing helps in eliminating the iterations which are otherwise mandatory when using neural network principles for object information retrieval. The implementation method is explained and simulation results are presented. The reconstructed images agree well with the object chosen.

  6. Information Retrieval Methods in Libraries and Information Centers ...

    African Journals Online (AJOL)

  7. Survey the role of emotions in information retrieval

    Directory of Open Access Journals (Sweden)

    Hassan Behzadi

    2016-03-01

    The present study was conducted to identify users' emotions in the various stages of information retrieval, based on the information retrieval model on the web. Methodologically, the study is experimental and applied in type. The population comprised all MA students majoring in different branches of the humanities and studying at Imam Reza International University. The sample consisted of 30 participants; the sample size was determined through stratified random sampling using G*Power software. Data were collected using a questionnaire on demographics and prior experience of using the internet, a post-search questionnaire, and recorded videos of users' faces. The findings demonstrated that: (1) during the initial stages of searching, apprehension, and during the link-tracking stage negative emotions in general (49.3 percent overall), were more frequent than the other emotions, whereas in the browsing and differentiation stages happiness was more frequent than the other emotions; (2) these variances resulted in significant relations among the different emotions of the users throughout the four stages of information retrieval; (3) in simple search, the respondents displayed happiness most frequently and aversion least frequently, whereas in complicated search, apprehension and aversion were the most and the least frequently cited emotions, respectively. Overall, negative emotions were reported more frequently in complicated search than in simple search. This demonstrated that any change in the difficulty level of the search task would cause users to exhibit different types of emotions.

  8. User-Oriented and Cognitive Models of Information Retrieval

    DEFF Research Database (Denmark)

    Ingwersen, Peter; Järvelin, Kalervo; Skov, Mette

    2017-01-01

    The domain of user-oriented and cognitive information retrieval (IR) is first discussed, followed by a discussion on the dimensions and types of models one may build for the domain. The focus of the present entry is on the models of user-oriented and cognitive IR, not on their empirical applications. Several models with different emphases on user-oriented and cognitive IR are presented, ranging from overall approaches and relevance models to procedural models, cognitive models, and task-based models. The present entry does not discuss empirical findings based on the models.

  9. Utilizing Information Technology to Facilitate Rapid Acquisition

    Science.gov (United States)

    2006-06-01

    ordering systems to facilitate streamlined commercial item acquisitions that reap the benefits of improved efficiency, reduced overall costs, and...PAGES 109 14. SUBJECT TERMS Rapid Acquisition, eCommerce , eProcurement, Information Technology, Contracting, Global Information Network...streamlined commercial item acquisitions that reap the benefits of improved efficiency, reduced overall costs, and timeliness. This thesis

  10. Query-by-Example Music Information Retrieval by Score-Informed Source Separation and Remixing Technologies

    OpenAIRE

    Okuno, Hiroshi G.; Tetsuya Ogata; Kazunori Komatani; Masataka Goto; Katsutoshi Itoyama

    2011-01-01

    We describe a novel query-by-example (QBE) approach in music information retrieval that allows a user to customize query examples by directly modifying the volume of different instrument parts. The underlying hypothesis of this approach is that the musical mood of retrieved results changes in relation to the volume balance of different instruments. On the basis of this hypothesis, we aim to clarify the relationship between the change in the volume balance of a query and the genre of the retr...

  11. An introduction to the Marshall information retrieval and display system

    Science.gov (United States)

    1974-01-01

    An on-line terminal oriented data storage and retrieval system is presented which allows a user to extract and process information from stored data bases. The use of on-line terminals for extracting and displaying data from the data bases provides a fast and responsive method for obtaining needed information. The system consists of general purpose computer programs that provide the overall capabilities of the total system. The system can process any number of data files via a Dictionary (one for each file) which describes the data format to the system. New files may be added to the system at any time, and reprogramming is not required. Illustrations of the system are shown, and sample inquiries and responses are given.

  12. EarthServer: Information Retrieval and Query Language

    Science.gov (United States)

    Perperis, Thanassis; Koltsida, Panagiota; Kakaletris, George

    2013-04-01

    Establishing open, unified, seamless access and ad-hoc analytics on cross-disciplinary, multi-source, multi-dimensional, spatiotemporal Earth Science data of extreme-size and their supporting metadata are the main challenges of the EarthServer project (www.earthserver.eu), funded by the European Commission under its Seventh Framework Program. One of EarthServer's main objectives is to provide users with higher level coverage and metadata search, retrieval and processing capabilities to multi-disciplinary Earth Science data. Six Lighthouse Applications are being established, each one providing access to Cryospheric, Airborne, Atmospheric, Geology, Oceanography and Planetary science raster data repositories through strictly WCS 2.0 standard based service endpoints. EarthServer's information retrieval subsystem aims towards exploiting the WCS endpoints through a physically and logically distributed service oriented architecture, foreseeing the collaboration of several standard compliant services, capable of exploiting modern large grid and cloud infrastructures and of dynamically responding to availability and capabilities of underlying resources. Towards furthering technology for integrated, coherent service provision based on WCS and WCPS, the concept of a query language (QL), unifying coverage and metadata processing and retrieval, is introduced. EarthServer's information retrieval subsystem receives QL requests involving high volumes of all Earth Science data categories, executes them on the services that reside on the infrastructure and sends the results back to the requester through a high performance pipeline. In this contribution we briefly discuss EarthServer's service oriented coverage data and metadata search and retrieval architecture and further elaborate on the potentials of EarthServer's Query Language, called xWCPS (XQuery compliant WCPS). xWCPS aims towards merging the path that the two widely adopted standards (W3C XQuery, OGC WCPS) have paved, into a

  13. Content-Based Information Retrieval from Forensic Databases

    NARCIS (Netherlands)

    Geradts, Z.J.M.H.

    2002-01-01

    In forensic science, the number of image databases is growing rapidly. For this reason, it is necessary to have a proper procedure for searching in these image databases based on content. The use of image databases results in more solved crimes; furthermore, statistical information can be obtained

  14. MPEG-7-Standardized tools for music information retrieval

    Science.gov (United States)

    Herre, Jürgen

    2005-09-01

    Today, many applications in Music Information Retrieval (MIR) employ audio features which have been tailored individually by the algorithm developers. For a broader use also in commercial applications, MIR technology can benefit significantly from a "common language" in audio signal description that can be used to annotate any type of multimedia assets in order to facilitate search and retrieval according to a wide range of conceivable criteria in an interoperable way. The audio part of the ISO/MPEG-7 "Multimedia Content Description Interface" provides such a common signal description language by defining a rather comprehensive set of standardized features [called "low level descriptors" (LLDs)], application-centric subsets, and a unified way of exchanging this data based on XML. The talk provides an overview of the MPEG-7 Audio tool chest, including existing and forthcoming extensions. While the idea is clearly to create a universal platform for any conceivable MIR task, some of the initially conceived applications of MPEG-7 Audio are illustrated.

  15. Source-constrained retrieval influences the encoding of new information.

    Science.gov (United States)

    Danckert, Stacey L; MacLeod, Colin M; Fernandes, Myra A

    2011-11-01

    Jacoby, Shimizu, Daniels, and Rhodes (Psychonomic Bulletin & Review, 12, 852-857, 2005) showed that new words presented as foils among a list of old words that had been deeply encoded were themselves subsequently better recognized than new words presented as foils among a list of old words that had been shallowly encoded. In Experiment 1, by substituting a deep-versus-shallow imagery manipulation for the levels-of-processing manipulation, we demonstrated that the effect is robust and that it generalizes, also occurring with a different type of encoding. In Experiment 2, we provided more direct evidence for context-related encoding during tests of deeply encoded words, showing enhanced priming for foils presented among deeply encoded targets when participants made the same deep-encoding judgments on those items as had been made on the targets during study. In Experiment 3, we established that the findings from Experiment 2 are restricted to this specific deep judgment task and are not a general consequence of these foils being associated with deeply encoded items. These findings provide support for the source-constrained retrieval hypothesis of Jacoby, Shimizu, Daniels, and Rhodes: New information can be influenced by how surrounding items are encoded and retrieved, as long as the surrounding items recruit a coherent mode of processing.

  16. Issues in the use of neural networks in information retrieval

    CERN Document Server

    Iatan, Iuliana F

    2017-01-01

    This book highlights the ability of neural networks (NNs) to be excellent pattern matchers and their importance in information retrieval (IR), which is based on index term matching. The book defines a new NN-based method for learning image similarity and describes how to use fuzzy Gaussian neural networks to predict personality. It introduces the fuzzy Clifford Gaussian network, and two concurrent neural models: (1) concurrent fuzzy nonlinear perceptron modules, and (2) concurrent fuzzy Gaussian neural network modules. Furthermore, it explains the design of a new model of fuzzy nonlinear perceptron based on alpha level sets and describes a recurrent fuzzy neural network model with a learning algorithm based on the improved particle swarm optimization method.

  17. Cross-language information retrieval using PARAFAC2.

    Energy Technology Data Exchange (ETDEWEB)

    Bader, Brett William; Chew, Peter; Abdelali, Ahmed (New Mexico State University, Las Cruces, NM); Kolda, Tamara Gibson

    2007-05-01

    A standard approach to cross-language information retrieval (CLIR) uses Latent Semantic Analysis (LSA) in conjunction with a multilingual parallel aligned corpus. This approach has been shown to be successful in identifying similar documents across languages - or more precisely, retrieving the most similar document in one language to a query in another language. However, the approach has severe drawbacks when applied to a related task, that of clustering documents 'language-independently', so that documents about similar topics end up closest to one another in the semantic space regardless of their language. The problem is that documents are generally more similar to other documents in the same language than they are to documents in a different language, but on the same topic. As a result, when using multilingual LSA, documents will in practice cluster by language, not by topic. We propose a novel application of PARAFAC2 (which is a variant of PARAFAC, a multi-way generalization of the singular value decomposition [SVD]) to overcome this problem. Instead of forming a single multilingual term-by-document matrix which, under LSA, is subjected to SVD, we form an irregular three-way array, each slice of which is a separate term-by-document matrix for a single language in the parallel corpus. The goal is to compute an SVD for each language such that V (the matrix of right singular vectors) is the same across all languages. Effectively, PARAFAC2 imposes the constraint, not present in standard LSA, that the 'concepts' in all documents in the parallel corpus are the same regardless of language. Intuitively, this constraint makes sense, since the whole purpose of using a parallel corpus is that exactly the same concepts are expressed in the translations. We tested this approach by comparing the performance of PARAFAC2 with standard LSA in solving a particular CLIR problem. From our results, we conclude that PARAFAC2 offers a very promising alternative to
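
    The contrast the abstract draws can be written out as below. The symbols $X$, $X_k$, $U_k$, $S_k$ and the language index $k = 1, \dots, K$ are notation assumed here for illustration; only $V$ (the shared matrix of right singular vectors) is named in the abstract. Any further constraints that PARAFAC2 places on the $U_k$ are not spelled out in the abstract and are omitted from this sketch.

```latex
% Standard multilingual LSA: one stacked term-by-document matrix X,
% factored by a single (truncated) SVD.
X \approx U \, S \, V^{\mathsf{T}}

% PARAFAC2 as described in the abstract: one term-by-document slice X_k
% per language k, each with its own U_k and S_k, but with the same V
% shared across all K languages.
X_k \approx U_k \, S_k \, V^{\mathsf{T}}, \qquad k = 1, \dots, K
```

    Because every slice shares $V$, a document and its translations are mapped to the same concept coordinates, which is the constraint the abstract describes as absent from standard LSA.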

  18. Accelerating Information Retrieval from Profile Hidden Markov Model Databases.

    Directory of Open Access Journals (Sweden)

    Ahmad Tamimi

    Profile Hidden Markov Model (Profile-HMM) is an efficient statistical approach to represent protein families. Currently, several databases maintain valuable protein sequence information as profile-HMMs. There is an increasing interest in improving the efficiency of searching Profile-HMM databases to detect sequence-profile or profile-profile homology. However, most efforts to enhance searching efficiency have focused on improving the alignment algorithms. Although the performance of these algorithms is fairly acceptable, the growing size of these databases, as well as the increasing demand for batch query searching, are strong motivations that call for further enhancement of information retrieval from profile-HMM databases. This work presents a heuristic method to accelerate the current profile-HMM homology searching approaches. The method works by cluster-based remodeling of the database to reduce the search space, rather than focusing on the alignment algorithms. Using different clustering techniques, 4284 TIGRFAMs profiles were clustered based on their similarities. A representative for each cluster was assigned. To enhance sensitivity, we proposed an extended step that allows overlapping among clusters. A validation benchmark of 6000 randomly selected protein sequences was used to query the clustered profiles. To evaluate the efficiency of our approach, speed and recall values were measured and compared with the sequential search approach. Using hierarchical, k-means, and connected component clustering techniques followed by the extended overlapping step, we obtained an average reduction in time of 41%, and an average recall of 96%. Our results demonstrate that representation of profile-HMMs using a clustering-based approach can significantly accelerate data retrieval from profile-HMM databases.
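
    The search-space reduction described above can be sketched as a two-stage lookup: score the query against one representative per cluster, then search only the members of the best-scoring clusters. In this sketch, cosine similarity on plain vectors is a stand-in assumption for profile-HMM scoring, and the names `clustered_search` and `top_clusters` and the toy data are hypothetical; the authors' clustering methods and overlapping step are not reproduced.

```python
import math

def cosine(u, v):
    """Cosine similarity; a placeholder for a real profile-HMM score."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def clustered_search(query, clusters, score=cosine, top_clusters=1):
    """clusters: list of (representative, {member_name: vector}) pairs."""
    # Stage 1: one comparison per cluster representative.
    ranked = sorted(clusters, key=lambda c: score(query, c[0]), reverse=True)
    comparisons = len(clusters)
    best_name, best_score = None, float("-inf")
    # Stage 2: full comparison only inside the best-ranked clusters.
    for _, members in ranked[:top_clusters]:
        for name, vector in members.items():
            comparisons += 1
            s = score(query, vector)
            if s > best_score:
                best_name, best_score = name, s
    return best_name, comparisons

# Hypothetical toy "database" of three clusters with two members each.
clusters = [
    ((1.0, 0.0, 0.0), {"fam_A1": (0.9, 0.1, 0.0), "fam_A2": (0.8, 0.0, 0.2)}),
    ((0.0, 1.0, 0.0), {"fam_B1": (0.1, 0.9, 0.0), "fam_B2": (0.0, 0.8, 0.2)}),
    ((0.0, 0.0, 1.0), {"fam_C1": (0.1, 0.0, 0.9), "fam_C2": (0.0, 0.2, 0.8)}),
]
best, comparisons = clustered_search((0.1, 0.9, 0.05), clusters)
```

    Raising `top_clusters` trades speed for recall, since a true match that sits in a lower-ranked cluster is only found if that cluster is searched.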

  19. The role of grammatical category information in spoken word retrieval.

    Science.gov (United States)

    Duràn, Carolina Palma; Pillon, Agnesa

    2011-01-01

    We investigated the role of lexical syntactic information such as grammatical gender and category in spoken word retrieval processes by using a blocking paradigm in picture and written word naming experiments. In Experiments 1, 3, and 4, we found that the naming of target words (nouns) from pictures or written words was faster when these target words were named within a list where only words from the same grammatical category had to be produced (homogeneous category list: all nouns) than when they had to be produced within a list comprising also words from another grammatical category (heterogeneous category list: nouns and verbs). On the other hand, we detected no significant facilitation effect when the target words had to be named within a homogeneous gender list (all masculine nouns) compared to a heterogeneous gender list (both masculine and feminine nouns). In Experiment 2, using the same blocking paradigm by manipulating the semantic category of the items, we found that naming latencies were significantly slower in the semantic category homogeneous in comparison with the semantic category heterogeneous condition. Thus semantic category homogeneity caused an interference, not a facilitation effect like grammatical category homogeneity. Finally, in Experiment 5, nouns in the heterogeneous category condition had to be named just after a verb (category-switching position) or a noun (same-category position). We found a facilitation effect of category homogeneity but no significant effect of position, which showed that the effect of category homogeneity found in Experiments 1, 3, and 4 was not due to a cost of switching between grammatical categories in the heterogeneous grammatical category list. These findings supported the hypothesis that grammatical category information impacts word retrieval processes in speech production, even when words are to be produced in isolation. They are discussed within the context of extant theories of lexical production.

  20. The role of grammatical category information in spoken word retrieval

    Directory of Open Access Journals (Sweden)

    Carolina Palma Duràn

    2011-11-01

    Full Text Available We investigated the role of lexical syntactic information such as grammatical gender and category in spoken word retrieval processes by using a blocking paradigm in picture and written word naming experiments. In Experiments 1, 3, and 4, we found that the naming of target words (nouns) from pictures or written words was faster when these target words were named within a list where only words from the same grammatical category had to be produced (homogeneous category list: all nouns) than when they had to be produced within a list comprising also words from another grammatical category (heterogeneous category list: nouns and verbs). On the other hand, no significant facilitation effect was detected when the target words had to be named within a homogeneous gender list (all masculine nouns) compared to a heterogeneous gender list (both masculine and feminine nouns). In Experiment 2, using the same blocking paradigm by manipulating the semantic category of the items, we found that naming times were significantly slower in the semantic category homogeneous in comparison with the semantic category heterogeneous condition. Thus semantic category homogeneity caused an interference, not a facilitation effect like grammatical category homogeneity. Finally, in Experiment 5, nouns in the heterogeneous category condition had to be named just after a verb (category-switching position) or a noun (same-category position). We found a facilitation effect of category homogeneity but no significant effect of position, which showed that the effect of category homogeneity found in Experiments 1, 3, and 4 was not due to a cost of switching between grammatical categories in the heterogeneous grammatical category list. These findings supported the hypothesis that grammatical category information could impact word retrieval processes in speech production, even when words are to be produced in isolation. They are discussed within the context of extant theories of lexical

  1. Information retrieval pathways for health information exchange in multiple care settings

    DEFF Research Database (Denmark)

    Kierkegaard, Patrick; Kaushal, Rainu; Vest, Joshua R.

    2014-01-01

    Objectives To determine which health information exchange (HIE) technologies and information retrieval pathways healthcare professionals relied on to meet their information needs in the context of laboratory test results, radiological images and reports, and medication histories. Study Design...... Primary data was collected over a 2-month period across 3 emergency departments, 7 primary care practices, and 2 public health clinics in New York state. Methods Qualitative research methods were used to collect and analyze data from semi-structured interviews and participant observation. Results...... The study reveals that healthcare professionals used a complex combination of information retrieval pathways for HIE to obtain clinical information from external organizations. The choice for each approach was setting- and information-specific, but was also highly dynamic across users and their information...

  2. A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering.

    Science.gov (United States)

    Sarrouti, Mourad; Ouatik El Alaoui, Said

    2017-04-01

    Passage retrieval, the identification of top-ranked passages that may contain the answer for a given biomedical question, is a crucial component for any biomedical question answering (QA) system. Passage retrieval in open-domain QA is a longstanding challenge widely studied over the last decades. However, it still requires further efforts in biomedical QA. In this paper, we present a new biomedical passage retrieval method based on Stanford CoreNLP sentence/passage length, probabilistic information retrieval (IR) model and UMLS concepts. In the proposed method, we first use our document retrieval system based on PubMed search engine and UMLS similarity to retrieve relevant documents to a given biomedical question. We then take the abstracts from the retrieved documents and use Stanford CoreNLP for sentence splitter to make a set of sentences, i.e., candidate passages. Using stemmed words and UMLS concepts as features for the BM25 model, we finally compute the similarity scores between the biomedical question and each of the candidate passages and keep the N top-ranked ones. Experimental evaluations performed on large standard datasets, provided by the BioASQ challenge, show that the proposed method achieves good performances compared with the current state-of-the-art methods. The proposed method significantly outperforms the current state-of-the-art methods by an average of 6.84% in terms of mean average precision (MAP). We have proposed an efficient passage retrieval method which can be used to retrieve relevant passages in biomedical QA systems with high mean average precision. Copyright © 2017 Elsevier Inc. All rights reserved.
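
    The final ranking step described in this abstract (BM25 scores between the question and each candidate passage, keeping the N top-ranked ones) can be sketched as follows. This is an illustrative sketch only: the function name `bm25_rank`, the parameter values `k1=1.2` and `b=0.75`, and the particular IDF variant are assumptions, and the tokens here are plain words standing in for the stemmed words and UMLS concepts used as features in the paper.

```python
import math
from collections import Counter


def bm25_rank(question_terms, passages, k1=1.2, b=0.75, top_n=3):
    """Rank tokenized passages against a tokenized question with Okapi BM25.

    Returns up to top_n (score, passage_index) pairs, best first.
    """
    n_docs = len(passages)
    avg_len = sum(len(p) for p in passages) / n_docs
    # Document frequency: number of passages containing each term.
    doc_freq = Counter(term for p in passages for term in set(p))

    scored = []
    for idx, passage in enumerate(passages):
        term_freq = Counter(passage)
        score = 0.0
        for term in set(question_terms):
            tf = term_freq.get(term, 0)
            if tf == 0:
                continue
            df = doc_freq[term]
            # One common smoothed IDF variant (always positive).
            idf = math.log(1.0 + (n_docs - df + 0.5) / (df + 0.5))
            # Length normalization: long passages are penalized via b.
            norm = tf + k1 * (1.0 - b + b * len(passage) / avg_len)
            score += idf * tf * (k1 + 1.0) / norm
        scored.append((score, idx))
    # Highest score first; ties broken by original passage order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return scored[:top_n]
```

    For example, with a question tokenized as `["aspirin", "fever"]`, a passage mentioning both terms ranks above one mentioning only "aspirin", and a passage sharing no terms scores zero.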

  3. Assimilation of SMOS Retrievals in the Land Information System

    Science.gov (United States)

    Blankenship, Clay B.; Case, Jonathan L.; Zavodsky, Bradley T.; Crosson, William L.

    2016-01-01

    The Soil Moisture and Ocean Salinity (SMOS) satellite provides retrievals of soil moisture in the upper 5 cm with a 30-50 km resolution and a mission accuracy requirement of 0.04 cm³ cm⁻³. These observations can be used to improve land surface model soil moisture states through data assimilation. In this paper, SMOS soil moisture retrievals are assimilated into the Noah land surface model via an Ensemble Kalman Filter within the NASA Land Information System. Bias correction is implemented using Cumulative Distribution Function (CDF) matching, with points aggregated by either land cover or soil type to reduce sampling error in generating the CDFs. An experiment was run for the warm season of 2011 to test SMOS data assimilation and to compare assimilation methods. Verification of soil moisture analyses in the 0-10 cm upper layer and root zone (0-1 m) was conducted using in situ measurements from several observing networks in the central and southeastern United States. This experiment showed that SMOS data assimilation significantly increased the anomaly correlation of Noah soil moisture with station measurements from 0.45 to 0.57 in the 0-10 cm layer. Time series at specific stations demonstrate the ability of SMOS DA to increase the dynamic range of soil moisture in a manner consistent with station measurements. Among the bias correction methods, the correction based on soil type performed best at bias reduction but also reduced correlations. The vegetation-based correction did not produce any significant differences compared to using a simple uniform correction curve.
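
    The idea behind CDF matching can be shown with a minimal sketch. It assumes a single pooled sample of past observations and of model values (the abstract instead aggregates points by land cover or soil type), and it uses a simple nearest-rank lookup; the function name `cdf_match` and its arguments are hypothetical, not taken from the Land Information System.

```python
from bisect import bisect_right


def cdf_match(obs_value, obs_climatology, model_climatology):
    """Map an observation onto the model's distribution at the same quantile."""
    obs_sorted = sorted(obs_climatology)
    model_sorted = sorted(model_climatology)
    # Empirical non-exceedance probability of the observation.
    rank = bisect_right(obs_sorted, obs_value)
    quantile = rank / len(obs_sorted)
    # Read the model value at that quantile (nearest rank, clamped to range).
    n = len(model_sorted)
    idx = min(max(int(round(quantile * n)) - 1, 0), n - 1)
    return model_sorted[idx]
```

    After the mapping, the corrected observations share the model's climatological distribution, so the assimilation step sees anomalies rather than a systematic offset between the satellite product and the model.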

  4. Query-Time Optimization Techniques for Structured Queries in Information Retrieval

    Science.gov (United States)

    Cartright, Marc-Allen

    2013-01-01

    The use of information retrieval (IR) systems is evolving towards larger, more complicated queries. Both the IR industrial and research communities have generated significant evidence indicating that in order to continue improving retrieval effectiveness, increases in retrieval model complexity may be unavoidable. From an operational perspective,…

  5. STATUS/IQ: A Semi-Intelligent Information Retrieval System.

    Science.gov (United States)

    Pearsall, Jayne

    1990-01-01

    Provides background on the problems of traditional text retrieval systems and describes STATUS/IQ, an advanced text retrieval system that incorporates a natural language front-end and an advanced relevance ranking facility. The principles, capabilities, and benefits of the system are discussed, and an example of a STATUS/IQ session is presented…

  6. Physicists' Information Tasks: Structure, Length and Retrieval Performance

    DEFF Research Database (Denmark)

    Lykke, Marianne; Ingwersen, Peter; Bogers, Toine

    2010-01-01

    to describe the tasks, 3) what semantic categories were used to express the search facets, and 4) retrieval performance. Results show variety in structure and length across task descriptions and task purposes. The results indicate effect of length and, in particular, of task purpose on retrieval performance...

  7. Analytical Study of Information Retrieval techniques and Modified Model of Search Engine

    OpenAIRE

    Ms. Leena More

    2015-01-01

    The concept of Information Retrieval is very vast and too many models of search engines are available in the market. In this research various information retrieval techniques used in search engines were studied and a modified model of search engine was developed. In web mining most of the web search engines retrieve the documents or information first without knowing the meaning of the keyword and then ask for the relevant meaning of the keyword entered by the users. That means without understan...

  8. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative

    OpenAIRE

    Sebastian Stober

    2017-01-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition–such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, ...

  9. Comparing the quality of accessing medical literature using content-based visual and textual information retrieval

    Science.gov (United States)

    Müller, Henning; Kalpathy-Cramer, Jayashree; Kahn, Charles E., Jr.; Hersh, William

    2009-02-01

    Content-based visual information (or image) retrieval (CBIR) has been an extremely active research domain within medical imaging over the past ten years, with the goal of improving the management of visual medical information. Many technical solutions have been proposed, and application scenarios for image retrieval as well as image classification have been set up. However, in contrast to medical information retrieval using textual methods, visual retrieval has only rarely been applied in clinical practice. This is despite the large amount and variety of visual information produced in hospitals every day. This information overload imposes a significant burden upon clinicians, and CBIR technologies have the potential to help the situation. However, in order for CBIR to become an accepted clinical tool, it must demonstrate a higher level of technical maturity than it has to date. Since 2004, the ImageCLEF benchmark has included a task for the comparison of visual information retrieval algorithms for medical applications. In 2005, a task for medical image classification was introduced and both tasks have been run successfully for the past four years. These benchmarks allow an annual comparison of visual retrieval techniques based on the same data sets and the same query tasks, enabling the meaningful comparison of various retrieval techniques. The datasets used from 2004-2007 contained images and annotations from medical teaching files. In 2008, however, the dataset used was made up of 67,000 images (along with their associated figure captions and the full text of their corresponding articles) from two Radiological Society of North America (RSNA) scientific journals. This article describes the results of the medical image retrieval task of the ImageCLEF 2008 evaluation campaign. We compare the retrieval results of both visual and textual information retrieval systems from 15 research groups on the aforementioned data set. The results show clearly that, currently

  10. Exploiting semantic linkages among multiple sources for semantic information retrieval

    Science.gov (United States)

    Li, JianQiang; Yang, Ji-Jiang; Liu, Chunchen; Zhao, Yu; Liu, Bo; Shi, Yuliang

    2014-07-01

    The vision of the Semantic Web is to build a global Web of machine-readable data to be consumed by intelligent applications. As the first step to make this vision come true, the initiative of linked open data has fostered many novel applications aimed at improving data accessibility in the public Web. Comparably, the enterprise environment is so different from the public Web that most potentially usable business information originates in an unstructured form (typically in free text), which poses a challenge for the adoption of semantic technologies in the enterprise environment. Considering that the business information in a company is highly specific and centred around a set of commonly used concepts, this paper describes a pilot study to migrate the concept of linked data into the development of a domain-specific application, i.e. the vehicle repair support system. The set of commonly used concepts, including the part name of a car and the phenomenon term on the car repairing, are employed to build the linkage between data and documents distributed among different sources, leading to the fusion of documents and data across source boundaries. Then, we describe the approaches of semantic information retrieval to consume these linkages for value creation for companies. The experiments on two real-world data sets show that the proposed approaches outperform the best baseline 6.3-10.8% and 6.4-11.1% in terms of top five and top 10 precisions, respectively. We believe that our pilot study can serve as an important reference for the development of similar semantic applications in an enterprise environment.

  11. How to retrieve additional information from the multiplicity distributions

    Science.gov (United States)

    Wilk, Grzegorz; Włodarczyk, Zbigniew

    2017-01-01

    Multiplicity distributions (MDs) P(N) measured in multiparticle production processes are most frequently described by the negative binomial distribution (NBD). However, with increasing collision energy some systematic discrepancies have become more and more apparent. They are usually attributed to the possible multi-source structure of the production process and described using a multi-NBD form of the MD. We investigate the possibility of keeping a single NBD but with its parameters depending on the multiplicity N. This is done by modifying the widely known clan model of particle production leading to the NBD form of P(N). This is then confronted with the approach based on the so-called cascade-stochastic formalism which is based on different types of recurrence relations defining P(N). We demonstrate that a combination of both approaches allows the retrieval of additional valuable information from the MDs, namely the oscillatory behavior of the counting statistics apparently visible in the high energy data.

  12. Intelligent information retrieval system using automatic thesaurus construction

    Science.gov (United States)

    Song, Wei; Yang, Jucheng; Li, Chenghua; Park, Sooncheol

    2011-05-01

    This paper presents an intelligent information retrieval (IR) system based on automatic thesaurus construction for its applications of document clustering and classification. These two applications are the most influential and widely used fields amongst the IR research community. We apply two biologically inspired algorithms, i.e. genetic algorithm (GA) and neural network (NN), to these two fields. A fuzzy logic controller GA and an adaptive back-propagation NN are proposed in our study, which can validly overcome the problems existing in their archetypes, e.g. slow evolution and being prone to trap into a local optimum. Furthermore, a well-constructed thesaurus has been recognised as a valuable tool in the effective operation of clustering and classification. It solves the problem in document representation organised by a bag of words, where some important relationships between words, e.g. synonymy and polysemy, are ignored. To investigate how our IR system could be used effectively, we conduct experiments on four data sets from the benchmark Reuter-21578 document collection and 20-newsgroup corpus. The results reveal that our IR system enhances the performance in comparison with k-means, common GA, and conventional back-propagation NN.

  13. Rapid and non-enzymatic in vitro retrieval of tumour cells from surgical specimens.

    Directory of Open Access Journals (Sweden)

    Brigitte Mack

    Full Text Available The study of tumourigenesis commonly involves the use of established cell lines or single cell suspensions of primary tumours. Standard methods for the generation of short-term tumour cell cultures include the disintegration of tissue based on enzymatic and mechanical stress. Here, we describe a simple and rapid method for the preparation of single cells from primary carcinomas, which is independent of enzymatic treatment and feeder cells. Tumour biopsies are processed to 1 mm³ cubes termed explants, which are cultured 1-3 days on agarose-coated well plates in specified medium. Through incisions generated in the explants, single cells are retrieved and collected from the culture supernatant and can be used for further analysis including in vitro and in vivo studies. Collected cells retain tumour-forming capacity in xenotransplantation assays, mimic the phenotype of the primary tumour, and facilitate the generation of cell lines.

  14. Generic information can retrieve known biological associations: implications for biomedical knowledge discovery.

    Directory of Open Access Journals (Sweden)

    Herman H H B M van Haagen

    Full Text Available MOTIVATION: Weighted semantic networks built from text-mined literature can be used to retrieve known protein-protein or gene-disease associations, and have been shown to anticipate associations years before they are explicitly stated in the literature. Our text-mining system recognizes over 640,000 biomedical concepts: some are specific (i.e., names of genes or proteins), others generic (e.g., 'Homo sapiens'). Generic concepts may play important roles in automated information retrieval, extraction, and inference but may also result in concept overload and confound retrieval and reasoning with low-relevance or even spurious links. Here, we attempted to optimize the retrieval performance for protein-protein interactions (PPI) by filtering generic concepts (node filtering) or links to generic concepts (edge filtering) from a weighted semantic network. First, we defined metrics based on network properties that quantify the specificity of concepts. Then using these metrics, we systematically filtered generic information from the network while monitoring retrieval performance of known protein-protein interactions. We also systematically filtered specific information from the network (inverse filtering), and assessed the retrieval performance of networks composed of generic information alone. RESULTS: Filtering generic or specific information induced a two-phase response in retrieval performance: initially the effects of filtering were minimal but beyond a critical threshold network performance suddenly drops. Contrary to expectations, networks composed exclusively of generic information demonstrated retrieval performance comparable to unfiltered networks that also contain specific concepts. Furthermore, an analysis using individual generic concepts demonstrated that they can effectively support the retrieval of known protein-protein interactions. For instance the concept "binding" is indicative for PPI retrieval and the concept "mutation abnormality" is

  15. Information Retrieval eXperience (IRX): Towards a Human-Centered Personalized Model of Relevance

    NARCIS (Netherlands)

    van der Sluis, Frans; van den Broek, Egon; van Dijk, Elisabeth M.A.G.; Hoeber, O.; Li, Y.; Huang, X.J.

    2010-01-01

    We approach Information Retrieval (IR) from a User eXperience (UX) perspective. Through introducing a model for Information Retrieval eXperience (IRX), this paper operationalizes a perspective on IR that reaches beyond topicality. Based on a document's topicality, complexity, and emotional value, a

  16. Text mining scientific papers: a survey on FCA-based information retrieval research

    NARCIS (Netherlands)

    Poelmans, J.; Ignatov, D.I.; Viaene, S.; Dedene, G.; Kuznetsov, S.O.

    2012-01-01

    Formal Concept Analysis (FCA) is an unsupervised clustering technique and many scientific papers are devoted to applying FCA in Information Retrieval (IR) research. We collected 103 papers published between 2003 and 2009 which mention FCA and information retrieval in the abstract, title or keywords.

  17. Latent morpho-semantic analysis : multilingual information retrieval with character n-grams and mutual information.

    Energy Technology Data Exchange (ETDEWEB)

    Bader, Brett William; Chew, Peter A.; Abdelali, Ahmed (New Mexico State University)

    2008-08-01

    We describe an entirely statistics-based, unsupervised, and language-independent approach to multilingual information retrieval, which we call Latent Morpho-Semantic Analysis (LMSA). LMSA overcomes some of the shortcomings of related previous approaches such as Latent Semantic Analysis (LSA). LMSA has an important theoretical advantage over LSA: it combines well-known techniques in a novel way to break the terms of LSA down into units which correspond more closely to morphemes. Thus, it has a particular appeal for use with morphologically complex languages such as Arabic. We show through empirical results that the theoretical advantages of LMSA can translate into significant gains in precision in multilingual information retrieval tests. These gains are not matched either when a standard stemmer is used with LSA, or when terms are indiscriminately broken down into n-grams.

  18. An application of weighted transducers to music information retrieval

    Science.gov (United States)

    Basaldella, D.; Orio, N.

    2006-01-01

    This paper proposes a methodology for retrieving music documents using a query-by-example paradigm. The basic idea is that a collection of music documents can be indexed by the set of melodic contours of its documents, and retrieval is carried out using an approximate matching between query and document contours. The approximate matching is based on the use of Weighted Transducers, which model the document contours and are used to compute their similarity with the query. The methodology has been evaluated on a collection of documents and with a set of audio queries.
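
    Approximate matching of melodic contours can be illustrated with a simplified sketch. Two things here are assumptions rather than the paper's method: the contour is encoded as up/down/same steps between consecutive pitches, and plain edit distance is swapped in for the weighted transducers. The function names are hypothetical.

```python
def melodic_contour(pitches):
    """Reduce a pitch sequence to U (up), D (down), S (same) steps."""
    return "".join(
        "U" if b > a else "D" if b < a else "S"
        for a, b in zip(pitches, pitches[1:])
    )


def edit_distance(a, b):
    """Levenshtein distance between two contour strings (unit costs)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(
                prev[j] + 1,               # delete a step from a
                curr[j - 1] + 1,           # insert a step from b
                prev[j - 1] + (ca != cb),  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]
```

    A query whose contour is within a small distance of a document contour counts as an approximate match. A weighted transducer generalizes this by assigning different costs to different edit operations.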

  19. Expert Search Strategies: The Information Retrieval Practices of Healthcare Information Professionals.

    Science.gov (United States)

    Russell-Rose, Tony; Chamberlain, Jon

    2017-10-02

    Healthcare information professionals play a key role in closing the knowledge gap between medical research and clinical practice. Their work involves meticulous searching of literature databases using complex search strategies that can consist of hundreds of keywords, operators, and ontology terms. This process is prone to error and can lead to inefficiency and bias if performed incorrectly. The aim of this study was to investigate the search behavior of healthcare information professionals, uncovering their needs, goals, and requirements for information retrieval systems. A survey was distributed to healthcare information professionals via professional association email discussion lists. It investigated the search tasks they undertake, their techniques for search strategy formulation, their approaches to evaluating search results, and their preferred functionality for searching library-style databases. The popular literature search system PubMed was then evaluated to determine the extent to which their needs were met. The 107 respondents indicated that their information retrieval process relied on the use of complex, repeatable, and transparent search strategies. On average it took 60 minutes to formulate a search strategy, with a search task taking 4 hours and consisting of 15 strategy lines. Respondents reviewed a median of 175 results per search task, far more than they would ideally like (100). The most desired features of a search system were merging search queries and combining search results. Healthcare information professionals routinely address some of the most challenging information retrieval problems of any profession. However, their needs are not fully supported by current literature search systems and there is demand for improved functionality, in particular regarding the development and management of search strategies.

  20. Subjective Probability and Information Retrieval: A Review of the Psychological Literature.

    Science.gov (United States)

    Thompson, Paul

    1988-01-01

    Reviews the subjective probability estimation literature of six schools of human judgement and decision making: decision theory, behavioral decision theory, psychological decision theory, social judgement theory, information integration theory, and attribution theory. Implications for probabilistic information retrieval are discussed, including…

  1. The Role of Ontology in Information Retrieval: Reviewing Current Research and Representing a Conceptual Model

    Directory of Open Access Journals (Sweden)

    Mahdieh Mirzabeigi

    2012-03-01

    Full Text Available The inefficiency of thesauri and other information representation tools in the electronic environment has forced librarians to revise the structure of these tools, and they have tried to develop other information organization tools such as ontologies. In this paper, the performance of ontology in information retrieval was investigated. In addition, by reviewing two basic ontology-based information retrieval models, the Lingpeng model and the Dan model, a new conceptual model was introduced.

  2. Bias-variance analysis in estimating true query model for information retrieval

    OpenAIRE

    Zhang, Peng; Song, Dawei; Wang, Jun; Yue HOU

    2014-01-01

    The estimation of query model is an important task in language modeling (LM) approaches to information retrieval (IR). The ideal estimation is expected to be not only effective in terms of high mean retrieval performance over all queries, but also stable in terms of low variance of retrieval performance across different queries. In practice, however, improving effectiveness can sacrifice stability, and vice versa. In this paper, we propose to study this tradeoff from a new perspective, i.e., ...

  3. An information retrieval system for research file data

    Science.gov (United States)

    Joan E. Lengel; John W. Koning

    1978-01-01

    Research file data have been successfully retrieved at the Forest Products Laboratory through a high-speed cross-referencing system involving the computer program FAMULUS as modified by the Madison Academic Computing Center at the University of Wisconsin. The method of data input, transfer to computer storage, system utilization, and effectiveness are discussed....

  4. Autocorrelation and Regularization of Query-Based Information Retrieval Scores

    Science.gov (United States)

    2008-02-01

    retrieval 2 The dog (Canis lupus familiaris) is a domestic subspecies of the wolf, a mammal of the Canidae family of the order Carnivora. The term...normalized Laplacian. This result suggests that, while degree normalize is important, our data may not exhibit the appropriate characteristics to notice

  5. FORDAT : an information retrieval system for forest economic data

    Science.gov (United States)

    Henry M. Spelter

    1981-01-01

    Time series data frequently used in Forest Service studies of wood products consumption have been stored in a data retrieval system on the computer of the University of Wisconsin. The data cover activity in wood processing from forest to end use. Prices and costs at succeeding stages, historical usage, production rates, and other relevant data to wood use analysis were...

  6. Information Retrieval in an Office Filing Facility and Future Work in Project Minstrel.

    Science.gov (United States)

    Smeaton, A. F.; van Rijsbergen, C. J.

    1986-01-01

    Review of office filing facility filing and retrieval mechanisms for unstructured and mixed media information focuses on free text methods. Also discussed are the state of the art in handling voice and image data, problems with searching text surrogates to implement free text content retrieval, and work of Project Minstrel. (Author/MBR)

  7. Document control and information retrieval system for the Fast Flux Test Facility (FFTF)

    Energy Technology Data Exchange (ETDEWEB)

    Theo, M.G.

    1976-03-01

    A description is given of the FFTF Document Control and Information Retrieval System. The system utilizes a mini-computer along with various microfilm equipment and is designed to accommodate an anticipated 50 million pages of text and 750,000 drawings. The system is simple, uncluttered, eliminates duplication, and provides quick retrievability of documents for all technical and administrative personnel.

  8. Editorial for the Bibliometric-enhanced Information Retrieval Workshop at ECIR 2014

    NARCIS (Netherlands)

    Mayr, Philipp; Schaer, Philipp; Scharnhorst, Andrea; Mutschke, Peter

    2014-01-01

    This first "Bibliometric-enhanced Information Retrieval" (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they

  9. Entropy Associated with Information Storage and Its Retrieval

    Directory of Open Access Journals (Sweden)

    Abu Mohamed Alhasan

    2015-08-01

    We provide an entropy analysis for light storage and light retrieval. In this analysis, entropy extraction and reduction in a typical light storage experiment are identified. The spatiotemporal behavior of entropy is presented for D1 transition in cold sodium atoms. The governing equations are the reduced Maxwell field equations and the Liouville–von Neumann equation for the density matrix of the dressed atom.

  10. Query-by-Example Music Information Retrieval by Score-Informed Source Separation and Remixing Technologies

    Directory of Open Access Journals (Sweden)

    Goto Masataka

    2010-01-01

    We describe a novel query-by-example (QBE) approach in music information retrieval that allows a user to customize query examples by directly modifying the volume of different instrument parts. The underlying hypothesis of this approach is that the musical mood of retrieved results changes in relation to the volume balance of different instruments. On the basis of this hypothesis, we aim to clarify the relationship between the change in the volume balance of a query and the genre of the retrieved pieces, called genre classification shift. Such an understanding would allow us to instruct users in how to generate alternative queries without finding other appropriate pieces. Our QBE system first separates all instrument parts from the audio signal of a piece with the help of its musical score, and then it allows users to remix these parts to change the acoustic features that represent the musical mood of the piece. Experimental results showed that the genre classification shift was actually caused by the volume change in the vocal, guitar, and drum parts.

  11. Rapid Retrieval of Lung Nodule CT Images Based on Hashing and Pruning Methods

    Directory of Open Access Journals (Sweden)

    Ling Pan

    2016-01-01

    The similarity-based retrieval of lung nodule computed tomography (CT) images is an important task in the computer-aided diagnosis of lung lesions. It can provide similar clinical cases for physicians and help them make reliable clinical diagnostic decisions. However, when handling large-scale lung images with a general-purpose computer, traditional image retrieval methods may not be efficient. In this paper, a new retrieval framework based on a hashing method for lung nodule CT images is proposed. This method can translate high-dimensional image features into a compact hash code, so the retrieval time and required memory space can be reduced greatly. Moreover, a pruning algorithm is presented to further improve the retrieval speed, and a pruning-based decision rule is presented to improve the retrieval precision. Finally, the proposed retrieval method is validated on 2,450 lung nodule CT images selected from the public Lung Image Database Consortium (LIDC) database. The experimental results show that the proposed pruning algorithm effectively reduces the retrieval time of lung nodule CT images and improves the retrieval precision. In addition, the retrieval framework is evaluated by differentiating benign and malignant nodules, and the classification accuracy can reach 86.62%, outperforming other commonly used classification methods.
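
    As a rough illustration of why compact hash codes speed up similarity search, the sketch below hashes feature vectors with random hyperplanes and ranks candidates by Hamming distance. It is not the method of the paper: the abstract does not specify the hashing function or the pruning algorithm, so the random-projection hash, the `max_distance` cut-off, and all function names here are assumptions chosen for illustration only.

```python
import random


def make_projections(dim, n_bits, seed=0):
    """Draw n_bits random hyperplanes in a dim-dimensional feature space."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def hash_code(features, projections):
    """Map a feature vector to an integer whose bits are the signs of its projections."""
    code = 0
    for plane in projections:
        dot = sum(f * w for f, w in zip(features, plane))
        code = (code << 1) | (1 if dot >= 0 else 0)
    return code


def hamming(a, b):
    """Number of differing bits between two integer hash codes."""
    return bin(a ^ b).count("1")


def retrieve(query_code, database_codes, k=3, max_distance=None):
    """Return indices of the k nearest codes by Hamming distance.

    Codes farther than max_distance are skipped; this simple radius cut-off
    is a stand-in for pruning, not the paper's pruning algorithm.
    """
    candidates = []
    for idx, code in enumerate(database_codes):
        d = hamming(query_code, code)
        if max_distance is not None and d > max_distance:
            continue
        candidates.append((d, idx))
    candidates.sort()
    return [idx for _, idx in candidates[:k]]
```

    Comparing integer codes with XOR and a bit count is much cheaper than computing distances between high-dimensional feature vectors, which is the source of the time and memory savings the abstract mentions.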

  12. Rapid Retrieval of Lung Nodule CT Images Based on Hashing and Pruning Methods.

    Science.gov (United States)

    Pan, Ling; Qiang, Yan; Yuan, Jie; Wu, Lidong

    2016-01-01

    The similarity-based retrieval of lung nodule computed tomography (CT) images is an important task in the computer-aided diagnosis of lung lesions. It can provide similar clinical cases for physicians and help them make reliable clinical diagnostic decisions. However, when handling large-scale lung images with a general-purpose computer, traditional image retrieval methods may not be efficient. In this paper, a new retrieval framework based on a hashing method for lung nodule CT images is proposed. This method can translate high-dimensional image features into a compact hash code, so the retrieval time and required memory space can be reduced greatly. Moreover, a pruning algorithm is presented to further improve the retrieval speed, and a pruning-based decision rule is presented to improve the retrieval precision. Finally, the proposed retrieval method is validated on 2,450 lung nodule CT images selected from the public Lung Image Database Consortium (LIDC) database. The experimental results show that the proposed pruning algorithm effectively reduces the retrieval time of lung nodule CT images and improves the retrieval precision. In addition, the retrieval framework is evaluated by differentiating benign and malignant nodules, and the classification accuracy can reach 86.62%, outperforming other commonly used classification methods.

  13. Improving biomedical information retrieval by linear combinations of different query expansion techniques.

    Science.gov (United States)

    Abdulla, Ahmed AbdoAziz Ahmed; Lin, Hongfei; Xu, Bo; Banbhrani, Santosh Kumar

    2016-07-25

    Biomedical literature retrieval is becoming increasingly complex, and there is a fundamental need for advanced information retrieval systems. Information Retrieval (IR) programs scour unstructured materials such as text documents in large reserves of data that are usually stored on computers. IR is related to the representation, storage, and organization of information items, as well as to access. In IR, one of the main problems is to determine which documents are relevant to the user's needs and which are not. Under the current regime, users cannot construct queries precisely enough to retrieve particular pieces of data from large reserves of data. Basic information retrieval systems are producing low-quality search results. In this paper we present a new technique to refine Information Retrieval searches to better represent the user's information need and thereby enhance retrieval performance: we use different query expansion techniques and apply linear combinations between them, where each combination is formed linearly between two expansion results at a time. Query expansions expand the search query, for example, by finding synonyms and reweighting original terms. They provide significantly more focused, particularized search results than do basic search queries. Retrieval performance is measured by some variants of MAP (Mean Average Precision). According to our experimental results, the combination of the best query expansion results enhances the retrieved documents and outperforms our baseline by 21.06%; it even outperforms a previous study by 7.12%. We propose several query expansion techniques and their (linear) combinations to make user queries more cognizable to search engines and to produce higher-quality search results.
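
    One plausible reading of a linear combination between two expansion results is a weighted sum of the document scores produced by two expanded queries. The sketch below shows that idea under stated assumptions: the min-max normalization, the weight `alpha`, and the function names are hypothetical and are not taken from the paper, which does not describe its combination formula in the abstract.

```python
def min_max_normalize(scores):
    """Rescale a {doc_id: score} mapping to the range [0, 1]."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def combine_linear(scores_a, scores_b, alpha=0.5):
    """Fuse two result lists as alpha * a + (1 - alpha) * b.

    Documents missing from one list contribute 0 for that list.
    Returns (doc_id, fused_score) pairs, best first; ties break by doc_id.
    """
    a = min_max_normalize(scores_a)
    b = min_max_normalize(scores_b)
    fused = {
        doc: alpha * a.get(doc, 0.0) + (1.0 - alpha) * b.get(doc, 0.0)
        for doc in set(a) | set(b)
    }
    return sorted(fused.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    # Hypothetical scores from two different query expansion runs.
    run_a = {"d1": 2.0, "d2": 1.0, "d3": 0.0}
    run_b = {"d2": 4.0, "d3": 2.0, "d4": 0.0}
    for doc, score in combine_linear(run_a, run_b, alpha=0.5):
        print(doc, round(score, 3))
```

    Normalizing before mixing keeps one run from dominating simply because its scores are on a larger scale; in practice `alpha` would be tuned on held-out queries.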

  14. Retrieving XCO2 from GOSAT FTS over East Asia Using Simultaneous Aerosol Information from CAI

    Directory of Open Access Journals (Sweden)

    Woogyung Kim

    2016-12-01

    In East Asia, where aerosol concentrations are persistently high throughout the year, most satellite CO2 retrieval algorithms screen out many measurements during quality control in order to reduce retrieval errors. To reduce the retrieval errors associated with aerosols, we have modified the YCAR (Yonsei Carbon Retrieval) algorithm to YCAR-CAI to retrieve XCO2 from GOSAT FTS measurements using aerosol retrievals from simultaneous Cloud and Aerosol Imager (CAI) measurements. The CAI aerosol algorithm provides aerosol type and optical depth information simultaneously for the same geometry and optical path as FTS. The YCAR-CAI XCO2 retrieval algorithm has been developed based on the optimal estimation method. The algorithm uses the VLIDORT V2.6 radiative transfer model to calculate radiances and Jacobian functions. The XCO2 results retrieved using the YCAR-CAI algorithm were evaluated by comparing them with ground-based TCCON measurements and current operational GOSAT XCO2 retrievals. The retrievals show a clear annual cycle, with an increasing trend of 2.02 to 2.39 ppm per year, which is higher than that measured at Mauna Loa, Hawaii. The YCAR-CAI results were validated against the Tsukuba and Saga TCCON sites and show a root mean square error of 2.25, a bias of −0.81 ppm, and a regression line closer to the linear identity function compared with other current algorithms. Even after post-screening, the YCAR-CAI algorithm provides an XCO2 dataset that is 21% to 67% larger than those of other retrieval algorithms, which could be substantially advantageous in validation and data analysis for East Asia. The retrieval uncertainty is 1.39 to 1.48 ppm at the TCCON sites. Using Carbon Tracker-Asia (CT-A) data, the sampling error was analyzed and was found to be between 0.32 and 0.36 ppm for each individual sounding.

  15. Which user interaction for cross-language information retrieval? Design issues and reflections

    OpenAIRE

    Petrelli, Daniela; Levin, Stephen; Beaulieu, Micheline; Sanderson, Mark

    2006-01-01

    A novel and complex form of information access is cross-language information retrieval: searching for texts written in foreign languages based on native language queries. Although the underlying technology for achieving such a search is relatively well understood, the appropriate interface design is not. The authors present three user evaluations undertaken during the iterative design of Clarity, a cross-language retrieval system for low-density languages, and show how the user-interaction d...

  16. Visual working memory buffers information retrieved from visual long-term memory.

    Science.gov (United States)

    Fukuda, Keisuke; Woodman, Geoffrey F

    2017-05-16

    Human memory is thought to consist of long-term storage and short-term storage mechanisms, the latter known as working memory. Although it has long been assumed that information retrieved from long-term memory is represented in working memory, we lack neural evidence for this and need neural measures that allow us to watch this retrieval into working memory unfold with high temporal resolution. Here, we show that human electrophysiology can be used to track information as it is brought back into working memory during retrieval from long-term memory. Specifically, we found that the retrieval of information from long-term memory was limited to just a few simple objects' worth of information at once, and elicited a pattern of neurophysiological activity similar to that observed when people encode new information into working memory. Our findings suggest that working memory is where information is buffered when being retrieved from long-term memory and reconcile current theories of memory retrieval with classic notions about the memory mechanisms involved.

  17. Study of query expansion techniques and their application in the biomedical information retrieval.

    Science.gov (United States)

    Rivas, A R; Iglesias, E L; Borrajo, L

    2014-01-01

    Information Retrieval focuses on finding documents whose content matches with a user query from a large document collection. As formulating well-designed queries is difficult for most users, it is necessary to use query expansion to retrieve relevant information. Query expansion techniques are widely applied for improving the efficiency of the textual information retrieval systems. These techniques help to overcome vocabulary mismatch issues by expanding the original query with additional relevant terms and reweighting the terms in the expanded query. In this paper, different text preprocessing and query expansion approaches are combined to improve the documents initially retrieved by a query in a scientific documental database. A corpus belonging to MEDLINE, called Cystic Fibrosis, is used as a knowledge source. Experimental results show that the proposed combinations of techniques greatly enhance the efficiency obtained by traditional queries.

  18. Study of Query Expansion Techniques and Their Application in the Biomedical Information Retrieval

    Science.gov (United States)

    Rivas, A. R.; Iglesias, E. L.; Borrajo, L.

    2014-01-01

    Information Retrieval focuses on finding documents whose content matches with a user query from a large document collection. As formulating well-designed queries is difficult for most users, it is necessary to use query expansion to retrieve relevant information. Query expansion techniques are widely applied for improving the efficiency of the textual information retrieval systems. These techniques help to overcome vocabulary mismatch issues by expanding the original query with additional relevant terms and reweighting the terms in the expanded query. In this paper, different text preprocessing and query expansion approaches are combined to improve the documents initially retrieved by a query in a scientific documental database. A corpus belonging to MEDLINE, called Cystic Fibrosis, is used as a knowledge source. Experimental results show that the proposed combinations of techniques greatly enhance the efficiency obtained by traditional queries. PMID:24723793

  19. The validation of the Yonsei CArbon Retrieval algorithm with improved aerosol information using GOSAT measurements

    Science.gov (United States)

    Jung, Yeonjin; Kim, Jhoon; Kim, Woogyung; Boesch, Hartmut; Goo, Tae-Young; Cho, Chunho

    2017-04-01

    Although several CO2 retrieval algorithms have been developed to improve our understanding of the carbon cycle, limitations in spatial coverage and uncertainties due to aerosols and thin cirrus clouds remain a problem for monitoring CO2 concentration globally. Based on an optimal estimation method, the Yonsei CArbon Retrieval (YCAR) algorithm was developed to retrieve the column-averaged dry-air mole fraction of carbon dioxide (XCO2) using Greenhouse Gases Observing SATellite (GOSAT) measurements with optimized a priori CO2 profiles and aerosol models over East Asia. In previous studies, the aerosol optical properties (AOPs) were the most important factors in CO2 retrievals, since AOPs were assumed to be fixed parameters during the retrieval process, resulting in significant XCO2 retrieval errors of up to 2.5 ppm. In this study, to reduce the errors caused by inaccurate aerosol optical information, the YCAR algorithm was improved by taking into account aerosol optical properties and aerosol vertical distribution simultaneously. The CO2 retrievals with the two different aerosol approaches have been analyzed using the GOSAT spectra and evaluated through comparison with collocated ground-based observations at several Total Carbon Column Observing Network (TCCON) sites. The improved YCAR algorithm has biases of 0.59±0.48 ppm and 2.16±0.87 ppm at the Saga and Tsukuba sites, respectively, with smaller biases and higher correlation coefficients compared to the GOSAT operational algorithm. In addition, the XCO2 retrievals will be validated at other TCCON sites and an error analysis will be carried out. These results reveal that better aerosol information can improve the accuracy of the CO2 retrieval algorithm and provide more useful XCO2 information with reduced uncertainties. This study is expected to provide useful information for estimating carbon sources and sinks.

  20. Knowledge Maps and Information Retrieval (KMIR) : Organization of a workshop

    NARCIS (Netherlands)

    Mutschke, Peter; Scharnhorst, Andrea; Guéret, Christophe; Mayr, Philipp; Hansen, Preben; Slavic, Aida

    2014-01-01

    Information systems usually show as a particular point of failure the vagueness between user search terms and the knowledge orders of the information space in question. Some kind of guided searching therefore becomes more and more important in order to precisely discover information without knowing

  1. Adding information may increase overconfidence in accuracy of knowledge retrieval.

    Science.gov (United States)

    Fleisig, Dida

    2011-04-01

    Feelings of retrospective confidence concerning the accuracy of a chosen answer might rely, among other things, on the amount of available information, regardless of its correctness. 43 participants, 26 women and 17 men (M age = 23.4 yr., SD = 3.5) in an intact group design, answered nine easy and nine difficult binary forced-choice questions and rated their confidence regarding the correctness of their choices. Participants were randomly assigned to one of three groups, differing in the additional information provided regarding the questions: a control group provided with no additional information, a correct information group, and a misleading information group. Performance was worst in the misleading information group, yet no difference in confidence was found between the correct and misleading information groups. The findings were interpreted as supporting the hypothesis that feelings of confidence partly reflect peripheral factors, indirectly related to choice processes.

  2. Computer Information Search and Retrieval: A Guide for the Music Educator.

    Science.gov (United States)

    Williams, David Brian; Beasley, L. Sue

    This report examines the features of computer information systems, differentiating between data files, search systems, and computer information organizations. An annotated listing is provided of those computer information retrieval files of interest to the music education researcher. This listing is divided into five categories: (1) music and…

  3. Designing and Implementing a Cross-Language Information Retrieval System Using Linguistic Corpora

    Directory of Open Access Journals (Sweden)

    Amin Nezarat

    2012-03-01

    Information retrieval (IR) is a crucial area of natural language processing (NLP) and can be defined as finding documents whose content is relevant to the query need of a user. Cross-language information retrieval (CLIR) refers to a kind of information retrieval in which the language of the query and that of the searched documents are different. In fact, it is a retrieval process where the user presents queries in one language to retrieve documents in another language. This paper attempted to construct a bilingual lexicon of parallel chunks of English and Persian from two very large monolingual corpora and an English-Persian parallel corpus, which could be directly applied to cross-language information retrieval tasks. For this purpose, a statistical measure known as Association Score (AS) was used to compute the association value between every two corresponding chunks in the corpus using a couple of complicated algorithms. Once the CLIR system was developed using this bilingual lexicon, an experiment was performed on a set of one hundred English and Persian phrases and collocations to see to what extent this system was effective in helping users find the most relevant and suitable equivalents of their queries in either language.

  4. Retrieval of air quality information using image processing technique.

    Science.gov (United States)

    Lim, H. S.; MatJafri, M. Z.; Abdullah, K.; Saleh, N. M.

    2007-04-01

    This paper presents an approach to retrieving the concentration of particulate matter smaller than 10 microns (PM10) from Landsat TM data over Penang Island. The objective of this study is to test the feasibility of using Landsat TM for PM10 mapping with our proposed algorithm. The algorithm was developed based on the aerosol characteristics in the atmosphere. PM10 measurements were collected using a DustTrak Aerosol Monitor 8520 simultaneously with the image acquisition. The station locations of the PM10 measurements were determined using a handheld GPS. The digital numbers were extracted corresponding to the ground-truth locations for each band and then converted into radiance and reflectance values. The surface reflectance was subtracted from the reflectance measured from the satellite [reflectance at the top of the atmosphere, ρ(TOA)] to obtain the atmospheric reflectance. The atmospheric reflectance was then related to the PM10 using regression analysis. The surface reflectance values were created using the ATCOR2 image correction software in the PCI Geomatica 9.1.8 image processing software. The proposed algorithm produced high accuracy and showed good agreement (R = 0.8406) between the measured and estimated PM10. This study indicates that it is feasible to use Landsat TM data for mapping PM10 using the proposed algorithm.
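
    The two computational steps named in this abstract, subtracting surface reflectance from top-of-atmosphere reflectance and then regressing PM10 on the result, can be sketched as follows. This is a minimal single-band illustration under assumptions: the abstract does not give the regression form or the bands used, so the simple least-squares line, the function names, and the sample numbers below are hypothetical and are not data from the study.

```python
import math


def atmospheric_reflectance(toa, surface):
    """Atmospheric reflectance as top-of-atmosphere minus surface reflectance."""
    return toa - surface


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical values for three ground stations (not from the paper).
    toa = [0.15, 0.27, 0.39]
    surface = [0.05, 0.07, 0.09]
    pm10 = [30.0, 50.0, 70.0]
    atm = [atmospheric_reflectance(t, s) for t, s in zip(toa, surface)]
    slope, intercept = fit_line(atm, pm10)
    print("slope:", slope, "intercept:", intercept, "R:", pearson_r(atm, pm10))
```

    The correlation between measured PM10 and the values predicted by the fitted line is the kind of agreement statistic the abstract reports as R.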

  5. Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

    DEFF Research Database (Denmark)

    Cao, Xin; Cong, Gao; Cui, Bin

    2012-01-01

    Community Question Answering (CQA) is a popular type of service where users ask questions and where answers are obtained from other users or from historical question-answer pairs. CQA archives contain large volumes of questions organized into a hierarchy of categories. As an essential function of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. This article presents several new approaches to exploiting the category information of questions for improving the performance of question retrieval...

  6. Image retrieval by information fusion based on scalable vocabulary tree and robust Hausdorff distance

    Science.gov (United States)

    Che, Chang; Yu, Xiaoyang; Sun, Xiaoming; Yu, Boyang

    2017-12-01

    In recent years, Scalable Vocabulary Tree (SVT) has been shown to be effective in image retrieval. However, for general images where the foreground is the object to be recognized while the background is cluttered, the performance of the current SVT framework is restricted. In this paper, a new image retrieval framework that incorporates a robust distance metric and information fusion is proposed, which improves the retrieval performance relative to the baseline SVT approach. First, the visual words that represent the background are diminished by using a robust Hausdorff distance between different images. Second, image matching results based on three image signature representations are fused, which enhances the retrieval precision. We conducted intensive experiments on small-scale to large-scale image datasets: Corel-9, Corel-48, and PKU-198, where the proposed Hausdorff metric and information fusion outperform the state-of-the-art methods by about 13, 15, and 15%, respectively.

  7. Advances in metabolome information retrieval: turning chemistry into biology. Part II: biological information recovery.

    Science.gov (United States)

    Tebani, Abdellah; Afonso, Carlos; Bekri, Soumeya

    2017-08-25

    This work reports the second part of a review intending to give the state of the art of major metabolic phenotyping strategies. It particularly deals with inherent advantages and limits regarding data analysis issues and biological information retrieval tools along with translational challenges. This Part starts with introducing the main data preprocessing strategies of the different metabolomics data. Then, it describes the main data analysis techniques including univariate and multivariate aspects. It also addresses the challenges related to metabolite annotation and characterization. Finally, functional analysis including pathway and network strategies are discussed. The last section of this review is devoted to practical considerations and current challenges and pathways to bring metabolomics into clinical environments.

  8. Factors influencing user ability to retrieve information from the ...

    African Journals Online (AJOL)

    Based on these findings, recommendations were made urging for a clearly defined and well articulated set of policies to guide reference service as well as strengthening information literacy interventions in university libraries to enhance user ability to locate, evaluate and effectively use required information for academic ...

  9. Evaluation of some Information Retrieval models for Gujarati Ad hoc Monolingual Tasks

    OpenAIRE

    J., Joshi Hardik; Jyoti, Pareek

    2012-01-01

    This paper describes the work towards Gujarati Ad hoc Monolingual Retrieval task for widely used Information Retrieval (IR) models. We present an indexing baseline for the Gujarati Language represented by Mean Average Precision (MAP) values. Our objective is to obtain a relative picture of a better IR model for Gujarati Language. Results show that classical IR models like Term Frequency Inverse Document Frequency (TF_IDF) perform better when compared to a few recent probabilistic IR models. Th...

  10. Music to knowledge: A visual programming environment for the development and evaluation of music information retrieval techniques

    Science.gov (United States)

    Ehmann, Andreas F.; Downie, J. Stephen

    2005-09-01

    The objective of the International Music Information Retrieval Systems Evaluation Laboratory (IMIRSEL) project is the creation of a large, secure corpus of audio and symbolic music data accessible to the music information retrieval (MIR) community for the testing and evaluation of various MIR techniques. As part of the IMIRSEL project, a cross-platform JAVA based visual programming environment called Music to Knowledge (M2K) is being developed for a variety of music information retrieval related tasks. The primary objective of M2K is to supply the MIR community with a toolset that provides the ability to rapidly prototype algorithms, as well as foster the sharing of techniques within the MIR community through the use of a standardized set of tools. Due to the relatively large size of audio data and the computational costs associated with some digital signal processing and machine learning techniques, M2K is also designed to support distributed computing across computing clusters. In addition, facilities to allow the integration of non-JAVA based (e.g., C/C++, MATLAB, etc.) algorithms and programs are provided within M2K. [Work supported by the Andrew W. Mellon Foundation and NSF Grants No. IIS-0340597 and No. IIS-0327371.]

  11. Using Bayesian networks to support decision-focused information retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Lehner, P.; Elsaesser, C.; Seligman, L. [Mitre Corp., McLean, VA (United States)

    1996-12-31

    This paper describes an approach to controlling the process of pulling data/information from distributed databases in a way that is specific to a person's specific decision-making context. Our prototype implementation of this approach uses a knowledge-based planner to generate a plan, an automatically constructed Bayesian network to evaluate the plan, specialized processing of the network to derive key information items that would substantially impact the evaluation of the plan (e.g., determine that replanning is needed), and automated construction of Standing Requests for Information (SRIs), which are automated functions that monitor changes and trends in distributed databases that are relevant to the key information items. The emphasis of this paper is on how Bayesian networks are used.

  12. User centered and ontology based information retrieval system for life sciences

    Directory of Open Access Journals (Sweden)

    Sy Mohameth-François

    2012-01-01

    Background: Because of the increasing number of electronic resources, designing efficient tools to retrieve and exploit them is a major challenge. Some improvements have been offered by semantic Web technologies and applications based on domain ontologies. In life science, for instance, the Gene Ontology is widely exploited in genomic applications and the Medical Subject Headings is the basis of the biomedical publication indexing and information retrieval process proposed by PubMed. However, current search engines suffer from two main drawbacks: there is limited user interaction with the list of retrieved resources, and no explanation for their adequacy to the query is provided. Users may thus be confused by the selection and have no idea how to adapt their queries so that the results match their expectations. Results: This paper describes an information retrieval system that relies on a domain ontology to widen the set of relevant documents that is retrieved and that uses a graphical rendering of query results to favor user interactions. Semantic proximities between ontology concepts and aggregating models are used to assess document adequacy with respect to a query. The selection of documents is displayed in a semantic map to provide graphical indications that make explicit to what extent they match the user's query; this man/machine interface favors a more interactive and iterative exploration of the data corpus by facilitating query concept weighting and visual explanation. We illustrate the benefit of using this information retrieval system on two case studies, one of which aims at collecting human genes related to transcription factors involved in the hemopoiesis pathway. Conclusions: The ontology-based information retrieval system described in this paper (OBIRS) is freely available at: http://www.ontotoolkit.mines-ales.fr/ObirsClient/. This environment is a first step towards a user-centred application in which the system enlightens

  13. 'Meatball searching' - The adversarial approach to online information retrieval

    Science.gov (United States)

    Jack, R. F.

    1985-01-01

    It is proposed that the different styles of online searching can be described as either formal (highly precise) or informal, with the needs of the client dictating which is most applicable at a particular moment. The background and personality of the searcher also come into play. Particular attention is focused on meatball searching, which is a form of online searching characterized by deliberate vagueness. It requires generally comprehensive searches, often on unusual topics and with tight deadlines. It is most likely to occur in search centers serving many different disciplines and levels of client information sophistication. Various information needs are outlined as well as the laws of meatball searching and the adversarial approach. Traits and characteristics important to successful searching include: (1) concept analysis, (2) flexibility of thinking, (3) ability to think in synonyms and (4) anticipation of variant word forms and spellings.

  14. Information Extraction and Linking in a Retrieval Context

    NARCIS (Netherlands)

    Moens, M.F.; Hiemstra, Djoerd

    We witness a growing interest and capabilities of automatic content recognition (often referred to as information extraction) in various media sources that identify entities (e.g. persons, locations and products) and their semantic attributes (e.g., opinions expressed towards persons or products,

  15. Professional assistance to users of information retrieval tools at the ...

    African Journals Online (AJOL)


  16. Fast and reliable online learning to rank for information retrieval

    NARCIS (Netherlands)

    Hofmann, K.

    2013-01-01

    The amount of digital data we produce every day far surpasses our ability to process this data, and finding useful information in this constant flow of data has become one of the major challenges of the 21st century. Search engines are one way of accessing large data collections. Their algorithms

  17. Dublin Core and Electronic Information Retrieval | Gbaje | Samaru ...

    African Journals Online (AJOL)

    Samaru Journal of Information Studies, Vol 6, No 1 (2006).

  18. Perspectives on Adaptivity in Information Retrieval Interaction (PAIRI)

    DEFF Research Database (Denmark)

    Ingwersen, Peter; Larsen, Birger; Kelly, Diane

    2010-01-01

    Adaptivity in IR interactions requires the IR systems adapting to users’ situations and the users adapting to the systems. System adaption entails dynamic user modeling, effective information architecture and enhanced search features such as search integration and relevance feedback; user adaptat...

  19. Systematisierung und Evaluierung von Clustering-Verfahren im Information Retrieval

    OpenAIRE

    Kürsten, Jens

    2006-01-01

    This diploma thesis examines methods of cluster analysis and their possible applications for optimizing the search results of information retrieval systems. The comparative evaluation of promising cluster analysis approaches, carried out on the Domain Specific Monolingual Tasks of the Cross-Language Evaluation Forum 2006, is based on a systematic analysis of the cluster analysis methods established in research. The implementation ...

  20. The Use of Metadata Visualisation Assist Information Retrieval

    Science.gov (United States)

    2007-10-01

    centred issues have been identified and they include; usability, prior knowledge, understanding of elementary perceptual-cognitive tasks and education ...pertain to information visualisation is required. • Education and Training The problems associated with education and training can be overcome... customised data. A coordinated visualisation interface consists of a set of visualisations, which can interact, portraying the relationship that

  1. Information Retrieval Strategies of Millennial Undergraduate Students in Web and Library Database Searches

    Science.gov (United States)

    Porter, Brandi

    2009-01-01

    Millennial students make up a large portion of undergraduate students attending colleges and universities, and they have a variety of online resources available to them to complete academically related information searches, primarily Web based and library-based online information retrieval systems. The content, ease of use, and required search…

  2. 42 CFR 433.116 - FFP for operation of mechanized claims processing and information retrieval systems.

    Science.gov (United States)

    2010-10-01

    ... FISCAL ADMINISTRATION Mechanized Claims Processing and Information Retrieval Systems § 433.116 FFP for... Medicaid fraud control unit certified under section 1903(q) of the Act and § 455.300 of this chapter, the Medicaid agency must have procedures to assure that information on probable fraud or abuse that is obtained...

  3. Information Visualization and Proposing New Interface for Movie Retrieval System (IMDB)

    Science.gov (United States)

    Etemadpour, Ronak; Masood, Mona; Belaton, Bahari

    2010-01-01

    This research studies the development of a new prototype of visualization in support of movie retrieval. The goal of information visualization is unveiling of large amounts of data or abstract data set using visual presentation. With this knowledge the main goal is to develop a 2D presentation of information on movies from the IMDB (Internet Movie…

  4. A Workshop on Qualitative Information Retrieval, November 18-20, 1980,

    Science.gov (United States)

    1981-09-28

    traditional library with card catalog. It does not accurately reflect the functions that most information systems must perform in the 1980’s. It...defined using emerging technologies for storing and retrieving both digital and visual information. For example, optical videodisc technology will

  5. Aiming for User Experience in Information Retrieval: Towards User-Centered Relevance (USR)

    NARCIS (Netherlands)

    van der Sluis, Frans; van Dijk, Elisabeth M.A.G.; van den Broek, Egon; Chen, Hsin-Hsi; Efthimiadis, Efthimis N.; Savoy, Jacques; Crestani, Fabio; Marchand-Maillet, Stephane

    2010-01-01

    As widely recognized, there is more to relevance than topicality. By looking at the user experience of Information Retrieval (IR), this proposal takes a broader perspective on relevance. Several facets of relevance are structured according to how the user will experience an information object. In

  6. Evaluation of Multi Layers Web-based GIS Approach in Retrieving Tourist Related Information

    OpenAIRE

    Rosilawati Zainol; Zainab Abu Bakar

    2014-01-01

    Geo-based information is getting greater importance among tourists. However, retrieving this information on the web depends heavily on the methods of dissemination. Therefore, this study intends to evaluate methods used in disseminating tourist related geo-based information on the web using partial match query, firstly, in default system which is a single layer approach and secondly, using multi layer web-based Geographic Information System (GIS) approaches. Shah Alam tourist related data are...

  7. Energy for agriculture. A computerized information retrieval system

    Energy Technology Data Exchange (ETDEWEB)

    Stout, B.A.; Myers, C.A. (comps.)

    1979-12-01

    Energy may come from the sun or the earth or be the product of plant materials or agricultural wastes. Whatever its source, energy is indispensable to our way of life, beginning with the production, processing, and distribution of abundant, high quality food and fiber supplies. This specialized bibliography on the subject of energy for agriculture contains 2613 citations to the literature for 1973 through May 1979. Originally issued by Michigan State University (MSU), it is being reprinted and distributed by the U.S. Department of Agriculture. The literature citations will be incorporated into AGRICOLA (Agricultural On-Line Access), the comprehensive bibliographic data base maintained by Technical Information Systems (TIS), a component of USDA's Science and Education Administration (SEA). The citations and the listing of research projects will be combined with other relevant references to provide a continuously updated source of information on energy programs in the agricultural field. No abstracts are included.

  8. Public Health Information Retrieval from Non-health Databases

    Directory of Open Access Journals (Sweden)

    Thumeka Mgwigwi

    2012-06-01

    This study examines the extent to which non-health databases index public health and healthcare related journals. The field of public health and healthcare is unique and multidisciplinary and therefore presents some challenges for researchers looking for published literature in the field. This challenge forces researchers to look beyond databases like Medline and search a wide array of databases in various fields. A list of journal titles from non-health databases in various fields was used to compare title coverage in Medline (Ovid). Databases used in this study are Canadian Business & Current Affairs (CBCA) Complete, which is a multidisciplinary database; ABI/Inform, covering business literature; Public Affairs Information Services (PAIS); EconLit; PsycInfo, focusing only on public health journals and eliminating psychology-specific journals; Sociological Abstracts; and Women’s Studies International.

  9. Information retrieval and pedagogy in adapted physical activity.

    Science.gov (United States)

    O'Connor, J; Sherrill, C; French, R

    2001-06-01

    The purpose was to address which databases would be most productive for literature searches by professionals seeking information on adapted physical activity pedagogy. Four databases were searched using 126 pedagogy and 66 disability terms. The results of the searches (4,130 hits) support the use of Sport Discus (n= 2,442 hits) as the most productive database for searches on adapted physical activity pedagogy.

  10. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative

    Directory of Open Access Journals (Sweden)

    Sebastian Stober

    2017-08-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition–such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, electroencephalography (EEG) recordings of music perception and imagination have been made publicly available, enabling MIR researchers to easily test and adapt their existing approaches for music analysis like fingerprinting, beat tracking or tempo estimation on this new kind of data. This paper reports on first results of MIIR experiments using these OpenMIIR datasets and points out how these findings could drive new research in cognitive neuroscience.

  11. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative.

    Science.gov (United States)

    Stober, Sebastian

    2017-01-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition–such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, electroencephalography (EEG) recordings of music perception and imagination have been made publicly available, enabling MIR researchers to easily test and adapt their existing approaches for music analysis like fingerprinting, beat tracking or tempo estimation on this new kind of data. This paper reports on first results of MIIR experiments using these OpenMIIR datasets and points out how these findings could drive new research in cognitive neuroscience.

  12. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative

    Science.gov (United States)

    Stober, Sebastian

    2017-01-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition–such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, electroencephalography (EEG) recordings of music perception and imagination have been made publicly available, enabling MIR researchers to easily test and adapt their existing approaches for music analysis like fingerprinting, beat tracking or tempo estimation on this new kind of data. This paper reports on first results of MIIR experiments using these OpenMIIR datasets and points out how these findings could drive new research in cognitive neuroscience. PMID:28824478

  13. Better late than never: information retrieval from black holes.

    Science.gov (United States)

    Braunstein, Samuel L; Pirandola, Stefano; Życzkowski, Karol

    2013-03-08

    We show that, in order to preserve the equivalence principle until late times in unitarily evaporating black holes, the thermodynamic entropy of a black hole must be primarily entropy of entanglement across the event horizon. For such black holes, we show that the information entering a black hole becomes encoded in correlations within a tripartite quantum state, the quantum analogue of a one-time pad, and is only decoded into the outgoing radiation very late in the evaporation. This behavior generically describes the unitary evaporation of highly entangled black holes and requires no specially designed evolution. Our work suggests the existence of a matter-field sum rule for any fundamental theory.

  14. Information Storage and Retrieval using Macromolecules as Storage Media

    OpenAIRE

    Mansuripur, M.; Khulbe, P. K.; Kuebler, S. M.; Perry, J W; Giridhar, M. S.; Erwin, J. Kevin; Seong, Kibyung; Marder, Seth; Peyghambarian, N

    2017-01-01

    To store information at extremely high-density and data-rate, we propose to adapt, integrate, and extend the techniques developed by chemists and molecular biologists for the purpose of manipulating biological and other macromolecules. In principle, volumetric densities in excess of 10^21 bits/cm^3 can be achieved when individual molecules having dimensions below a nanometer or so are used to encode the 0's and 1's of a binary string of data. In practice, however, given the limitations of ele...
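    The quoted density can be checked with a short unit conversion. The sketch below assumes, purely for illustration, that each bit occupies a cube roughly 1 nm on a side; the abstract itself says only that the molecules have dimensions below a nanometer or so.

```latex
\[
1\ \text{cm} = 10^{7}\ \text{nm}
\quad\Longrightarrow\quad
1\ \text{cm}^{3} = \left(10^{7}\right)^{3}\ \text{nm}^{3} = 10^{21}\ \text{nm}^{3}
\]
\[
\frac{1\ \text{bit}}{1\ \text{nm}^{3}} \times 10^{21}\ \frac{\text{nm}^{3}}{\text{cm}^{3}}
= 10^{21}\ \text{bits/cm}^{3}
\]
```

    Under that assumption, molecules smaller than 1 nm would pack more tightly, which is consistent with the phrase "in excess of" in the abstract.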

  15. Practical Side of the Bibliographic Information Retrieval System in the National Museum of Ethnology

    Science.gov (United States)

    Kondo, Katsuichi

    The information retrieval system of the National Museum of Ethnology made its debut in 1979 and now makes it possible to search for books held not only in the Museum but also elsewhere in Japan and abroad by means of JAPAN MARC and LC MARC. The author first briefly outlines the information management system, including the above, and its development, and then presents a practical case of using the retrieval system. Problems to be solved in future plans are also mentioned.

  16. Improving information retrieval using Medical Subject Headings Concepts: a test case on rare and chronic diseases.

    Science.gov (United States)

    Darmoni, Stéfan J; Soualmia, Lina F; Letord, Catherine; Jaulent, Marie-Christine; Griffon, Nicolas; Thirion, Benoît; Névéol, Aurélie

    2012-07-01

    As more scientific work is published, it is important to improve access to the biomedical literature. Since 2000, when Medical Subject Headings (MeSH) Concepts were introduced, the MeSH Thesaurus has been concept based. Nevertheless, information retrieval is still performed at the MeSH Descriptor or Supplementary Concept level. The study assesses the benefit of using MeSH Concepts for indexing and information retrieval. Three sets of queries were built for thirty-two rare diseases and twenty-two chronic diseases: (1) using PubMed Automatic Term Mapping (ATM), (2) using Catalog and Index of French-language Health Internet (CISMeF) ATM, and (3) extrapolating the MEDLINE citations that should be indexed with a MeSH Concept. Type 3 queries retrieve significantly fewer results than type 1 or type 2 queries (about 18,000 citations versus 200,000 for rare diseases; about 300,000 citations versus 2,000,000 for chronic diseases). CISMeF ATM also provides better precision than PubMed ATM for both disease categories. Using MeSH Concept indexing instead of ATM could, in theory, improve retrieval performance under the current indexing policy. However, using MeSH Concept information retrieval and indexing rules would be a fundamentally better approach. These modifications have already been implemented in the CISMeF search engine.

  17. What Automaticity Deficit? Activation of Lexical Information by Readers with Dyslexia in a Rapid Automatized Naming Stroop-Switch Task

    Science.gov (United States)

    Jones, Manon W.; Snowling, Margaret J.; Moll, Kristina

    2016-01-01

    Reading fluency is often predicted by rapid automatized naming (RAN) speed, which as the name implies, measures the automaticity with which familiar stimuli (e.g., letters) can be retrieved and named. Readers with dyslexia are considered to have less "automatized" access to lexical information, reflected in longer RAN times compared with…

  18. Probabilistic Information Integration and Retrieval in the Semantic Web

    Science.gov (United States)

    Predoiu, Livia

    The Semantic Web (SW) has been envisioned to enable software tools or Web Services, respectively, to process information provided on the Web automatically. For this purpose, languages for representing the semantics of data by means of ontologies have been proposed such as RDF(S) and OWL. While the semantics of RDF(S) requires a non-standard model-theory that goes beyond first order logics, OWL is intended to model subsets of first order logics. OWL consists of three variants that are layered on each other. The less expressive variants OWL Lite and OWL DL correspond to the Description Logics SHIF(D) and SHOIN(D) [1], respectively, and thus to subsets of First Order Logics [2].

  19. Information storage and retrieval in a single levitating colloidal particle

    Science.gov (United States)

    Myers, Christopher J.; Celebrano, Michele; Krishnan, Madhavi

    2015-10-01

    The binary switch is a basic component of digital information. From phase-change alloys to nanomechanical beams, molecules and atoms, new strategies for controlled bistability hold great interest for emerging technologies. We present a generic methodology for precise and parallel spatiotemporal control of nanometre-scale matter in a fluid, and demonstrate the ability to attain digital functionalities such as switching, gating and data storage in a single colloid, with further implications for signal amplification and logic operations. This fluid-phase bit can be arrayed at high densities, manipulated by either electrical or optical fields, supports low-energy, high-speed operation and marks a first step toward ‘colloidal information’. The principle generalizes to any system where spatial perturbation of a particle elicits a differential response amenable to readout.

  20. Flexible patient information search and retrieval framework: pilot implementation

    Science.gov (United States)

    Erdal, Selnur; Catalyurek, Umit V.; Saltz, Joel; Kamal, Jyoti; Gurcan, Metin N.

    2007-03-01

    Medical centers collect and store a significant amount of valuable data pertaining to patients' visits in the form of medical free-text. In addition, standardized diagnosis codes (International Classification of Diseases, Ninth Revision, Clinical Modification: ICD9-CM) related to those dictated reports are usually available. In this work, we have created a framework where image searches could be initiated through a combination of free-text reports as well as ICD9 codes. This framework enables more comprehensive search on existing large sets of patient data in a systematic way. The free text search is enriched by computer-aided inclusion of additional search terms enhanced by a thesaurus. This combination of enriched search allows users to access a larger set of relevant results from a patient-centric PACS in a simpler way. Therefore, such a framework is of particular use in tasks such as gathering images for desired patient populations, building disease models, and so on. As the motivating application of our framework, we implemented a search engine. This search engine processed two years of patient data from the OSU Medical Center's Information Warehouse and identified lung nodule location information using a combination of UMLS Meta-Thesaurus enhanced text report searches along with ICD9 code searches on patients that have been discharged. Five different queries with various ICD9 codes involving lung cancer were carried out on 172552 cases. Each search was completed in under a minute on average per ICD9 code and the inclusion of UMLS thesaurus increased the number of relevant cases by 45% on average.
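    The idea of combining thesaurus-enriched free-text search with diagnosis-code filtering can be illustrated with a minimal sketch. It is not the authors' implementation: the synonym table is a hypothetical stand-in for a thesaurus lookup (it is not UMLS data and calls no UMLS API), the report records and code values are invented for illustration, and requiring both a text match and a code match is an assumption, since the abstract does not say how the two criteria are combined.

```python
from typing import Dict, Iterable, List, Set

# Hypothetical synonym table standing in for a thesaurus lookup.
EXAMPLE_THESAURUS: Dict[str, Set[str]] = {
    "lung nodule": {"pulmonary nodule"},
}

# Invented report records; the code values are illustrative only.
EXAMPLE_REPORTS: List[dict] = [
    {"id": 1, "text": "Pulmonary nodule in right upper lobe.", "icd9": ["162.9"]},
    {"id": 2, "text": "Lung nodule noted.", "icd9": ["401.9"]},
    {"id": 3, "text": "No acute findings.", "icd9": ["162.9"]},
]


def expand_terms(terms: Iterable[str], thesaurus: Dict[str, Set[str]]) -> Set[str]:
    """Return the query terms plus any synonyms listed in the thesaurus."""
    expanded: Set[str] = set()
    for term in terms:
        key = term.lower()
        expanded.add(key)
        expanded.update(s.lower() for s in thesaurus.get(key, set()))
    return expanded


def search(reports: List[dict], terms: Iterable[str], icd9_codes: Set[str],
           thesaurus: Dict[str, Set[str]]) -> List[int]:
    """Return ids of reports matching an expanded term AND one of the codes."""
    expanded = expand_terms(terms, thesaurus)
    matches = []
    for report in reports:
        text = report["text"].lower()
        text_hit = any(term in text for term in expanded)
        code_hit = bool(icd9_codes & set(report["icd9"]))
        if text_hit and code_hit:  # assumption: both criteria must hold
            matches.append(report["id"])
    return matches
```

    In this toy data, report 1 is found only when the synonym table is supplied, which mirrors the abstract's claim that thesaurus expansion increases the number of relevant cases retrieved.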

  1. Hospital nurses' information retrieval behaviours in relation to evidence based nursing: a literature review.

    Science.gov (United States)

    Alving, Berit Elisabeth; Christiansen, Janne Buck; Thrysoe, Lars

    2018-01-12

    The purpose of this literature review is to provide an overview of the information retrieval behaviour of clinical nurses, in terms of the use of databases and other information resources and their frequency of use. Systematic searches carried out in five databases and handsearching were used to identify the studies from 2010 to 2016, with a populations, exposures and outcomes (PEO) search strategy, focusing on the question: In which databases or other information resources do hospital nurses search for evidence based information, and how often? Of 5272 titles retrieved based on the search strategy, only nine studies fulfilled the criteria for inclusion. The studies are from the United States, Canada, Taiwan and Nigeria. The results show that hospital nurses' primary choice of source for evidence based information is Google and peers, while bibliographic databases such as PubMed are secondary choices. Data on frequency are only included in four of the studies, and data are heterogeneous. The reasons for choosing Google and peers are primarily lack of time; lack of information; lack of retrieval skills; or lack of training in database searching. Only a few studies are published on clinical nurses' retrieval behaviours, and more studies are needed from Europe and Australia. © 2018 Health Libraries Group.

  2. Large-scale distributed foraging, gathering, and matching for information retrieval: assisting the geospatial intelligence analyst

    Science.gov (United States)

    Santos, Eugene, Jr.; Santos, Eunice E.; Nguyen, Hien; Pan, Long; Korah, John

    2005-03-01

    With the proliferation of online resources, there is an increasing need to effectively and efficiently retrieve data and knowledge from distributed geospatial databases. One of the key challenges of this problem is the fact that geospatial databases are usually large and dynamic. In this paper, we address this problem by developing a large scale distributed intelligent foraging, gathering and matching (I-FGM) framework for massive and dynamic information spaces. We assess the effectiveness of our approach by comparing a prototype I-FGM against two simple control systems (randomized selection and partially intelligent systems). We designed and employed a medium-sized testbed to get an accurate measure of retrieval precision and recall for each system. The results obtained show that I-FGM retrieves relevant information more quickly than the two other control approaches.

  3. Comparing the Scale of Web Subject Directories Precision in Technical-Engineering Information Retrieval

    Directory of Open Access Journals (Sweden)

    Mehrdokht Wazirpour Keshmiri

    2012-07-01

    The main purpose of this research was to compare the precision of web subject directories in retrieving technical-engineering information. Data were gathered by documentary and webometric methods. Keywords in twenty technical-engineering subjects were chosen from IEEE (Institute of Electrical and Electronics Engineers) and from engineering magazines hosted on the ScienceDirect site. These keywords were searched in five heavily used web subject directories: Yahoo, Google, Infomine, Intute, and Dmoz. Because the first results returned by a search tool are usually the most closely related to the search keywords, the first ten results of every search were evaluated. The assessments covered precision, error, and the ratio of items retrieved in technical-engineering categories to items retrieved overall. The criteria for determining precision, drawn from widely used standards in different documents, were the presence of the keywords in the title, the appearance of the keywords in the body of the retrieved pages, keyword adjacency, the URL of the page, the page description, and the subject categories. The data were analyzed with the Kruskal-Wallis test and Fisher's LSD. The results revealed a significant difference in the precision of web subject directories in retrieving technical-engineering information, so the hypothesis was confirmed. Ranked by precision, the directories were Google, Yahoo, Intute, Dmoz, and Infomine. The error observed in the first results was another criterion used to compare the directories: Yahoo had the lowest error and Infomine the highest. The research also compared the items retrieved across all categories of the web subject directories with the items retrieved in technical-engineering categories, and the results revealed a significant difference between them.
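    Evaluating only the first ten results of each search corresponds to what is commonly called precision at k. The sketch below shows that standard measure under stated assumptions: the function name, the result identifiers, and the relevance judgments are hypothetical, and the study's own scoring criteria (title match, keyword adjacency, and so on) are not reproduced here.

```python
from typing import Sequence, Set


def precision_at_k(ranked_results: Sequence[str], relevant: Set[str], k: int = 10) -> float:
    """Return the fraction of the top-k results that were judged relevant.

    Divides by k (the usual convention), so a directory that returns fewer
    than k results is not rewarded for the shortfall.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    hits = sum(1 for item in ranked_results[:k] if item in relevant)
    return hits / k


# Hypothetical data: ten results for one keyword and the subset judged relevant.
example_results = [f"page{i}" for i in range(1, 11)]
example_relevant = {"page1", "page2", "page4", "page7"}

if __name__ == "__main__":
    print(precision_at_k(example_results, example_relevant))
```

    Averaging this value over all keywords for each directory would give one score per directory, which could then be compared with a rank-based test such as Kruskal-Wallis, as the abstract describes.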

  4. Web Information Seeking and Retrieval in Digital Library Contexts: Towards an Intelligent Agent Solution.

    Science.gov (United States)

    Detlor, Brian; Arsenault, Clement

    2002-01-01

    Discusses the role of intelligent agents in facilitating the seeking and retrieval of information in Web-based library environments. Highlights include an overview of agents; current applications in library domains; an agent-based model for libraries; the design of interface agents; and implications for library policy and digital collections.…

  5. Applying Information-Retrieval Methods to Software Reuse: A Case Study.

    Science.gov (United States)

    Stierna, Eric J.; Rowe, Neil C.

    2003-01-01

    Discusses reuse of existing software for new purposes as a key aspect of efficient software engineering by matching formal written requirements used to define the new and the old software. Explores two matching methodologies that use information retrieval techniques and describes test results from a comparison of two military systems. (Author/LRW)

  6. On Using Genetic Algorithms for Multimodal Relevance Optimization in Information Retrieval.

    Science.gov (United States)

    Boughanem, M.; Christment, C.; Tamine, L.

    2002-01-01

    Presents a genetic relevance optimization process performed in an information retrieval system that uses genetic techniques for solving multimodal problems (niching) and query reformulation techniques. Explains that the niching technique allows the process to reach different relevance regions of the document space, and that query reformulations…

  7. System Scope for Library Automation and Generalized Information Storage and Retrieval at Stanford University.

    Science.gov (United States)

    Cady, Glee; And Others

    The scope of a manual-automated system serving the 40 libraries and the teaching and research community of Stanford University is defined. Also defined are the library operations to be supported and the bibliographic information storage and retrieval capabilities to be provided in the system. Two major projects have been working jointly on library…

  8. Proceedings of the 9th Dutch-Belgian Information Retrieval Workshop

    NARCIS (Netherlands)

    Aly, Robin; Hauff, C.; den Hamer, Ida; Hiemstra, Djoerd; Huibers, Theo W.C.; de Jong, Franciska M.G.

    Welcome to the 9th Dutch-Belgian Information Retrieval Workshop (DIR). I very well remember the DIR workshop in 2001 that was also organized in Twente. It took place exactly one day before my PhD defense, to give us the opportunity to have one of the PhD committee members, Stephen Robertson, as the

  9. Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval

    NARCIS (Netherlands)

    Kraaij, W.; Nie, J.Y.; Simard, M.

    2003-01-01

    Although more and more language pairs are covered by machine translation (MT) services, there are still many pairs that lack translation resources. Cross-language information retrieval (CLIR) is an application that needs translation functionality of a relatively low level of sophistication, since

  10. An Introduction to Genetic Algorithms and to Their Use in Information Retrieval.

    Science.gov (United States)

    Jones, Gareth; And Others

    1994-01-01

    Genetic algorithms, a class of nondeterministic algorithms in which the role of chance makes the precise nature of a solution impossible to guarantee, seem to be well suited to combinatorial-optimization problems in information retrieval. Provides an introduction to techniques and characteristics of genetic algorithms and illustrates their…

  11. Search Result Caching in Peer-to-Peer Information Retrieval Networks

    NARCIS (Netherlands)

    Tigelaar, A.S.; Hiemstra, Djoerd; Trieschnigg, Rudolf Berend

    2011-01-01

    For peer-to-peer web search engines it is important to quickly process queries and return search results. How to keep the perceived latency low is an open challenge. In this paper we explore the solution potential of search result caching in large-scale peer-to-peer information retrieval networks by

  12. TRECVID: evaluating the effectiveness of information retrieval tasks on digital video

    NARCIS (Netherlands)

    Smeaton, A.F.; Over, P.; Kraaij, W.

    2004-01-01

    TRECVID is an annual exercise which encourages research in information retrieval from digital video by providing a large video test collection, uniform scoring procedures, and a forum for organizations interested in comparing their results. TRECVID benchmarking covers both interactive and manual

  13. Degree of Agreement in Naming Objects and Concepts for Information Retrieval.

    Science.gov (United States)

    Collantes, Lourdes Y.

    1995-01-01

    Discussion of users and information retrieval systems highlights a study that investigated the representation of users' knowledge by examining their names for objects and concepts, agreement on names, the Library of Congress Subject Headings representation for similar objects and concepts, and measurement of the similarity between these…

  14. Making Explicit the Formalism Underlying Evaluation in Music Information Retrieval Research

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2014-01-01

    We make explicit the formalism underlying evaluation in music information retrieval research. We define a "system," what it means to "analyze" one, and make clear the aims, parts, design, execution, interpretation, assumptions and limitations of its "evaluation." We apply this formalism...

  15. Overview of the Living Labs for Information Retrieval Evaluation (LL4IR) CLEF Lab 2015

    NARCIS (Netherlands)

    Schuth, A.; Balog, K.; Kelly, L.; Mothe, J.; Savoy, J.; Kamps, J.; Pinel-Sauvagnat, K.; Jones, G.J.F.; SanJuan, E.; Cappellato, L.; Ferro, N.

    2015-01-01

    In this paper we report on the first Living Labs for Information Retrieval Evaluation (LL4IR) CLEF Lab. Our main goal with the lab is to provide a benchmarking platform for researchers to evaluate their ranking systems in a live setting with real users in their natural task environments. For this

  16. Adaptation of Statistical Machine Translation Model for Cross-Lingual Information Retrieval in a Service Context

    NARCIS (Netherlands)

    Nikoulina, V.; Kovachev, B.; Lagos, N.; Monz, C.

    2012-01-01

    This work proposes to adapt an existing general SMT model for the task of translating queries that are subsequently going to be used to retrieve information from a target language collection. In the scenario that we focus on, access to the document collection itself is not available and changes to

  17. Millennial Undergraduate Research Strategies in Web and Library Information Retrieval Systems

    Science.gov (United States)

    Porter, Brandi

    2011-01-01

    This article summarizes the author's dissertation regarding search strategies of millennial undergraduate students in Web and library online information retrieval systems. Millennials bring a unique set of search characteristics and strategies to their research since they have never known a world without the Web. Through the use of search engines,…

  18. FIRES: Fire Information Retrieval and Evaluation System - A program for fire danger rating analysis

    Science.gov (United States)

    Patricia L. Andrews; Larry S. Bradshaw

    1997-01-01

    A computer program, FIRES: Fire Information Retrieval and Evaluation System, provides methods for evaluating the performance of fire danger rating indexes. The relationship between fire danger indexes and historical fire occurrence and size is examined through logistic regression and percentiles. Historical seasonal trends of fire danger and fire occurrence can be...

  19. Hybrid Ontology for Semantic Information Retrieval Model Using Keyword Matching Indexing System

    Directory of Open Access Journals (Sweden)

    K. R. Uthayan

    2015-01-01

    Ontology is the process of growth and elucidation of concepts of an information domain being common for a group of users. Establishing ontology into information retrieval is a normal method to develop searching effects of relevant information users require. Keywords matching process with historical or information domain is significant in recent calculations for assisting the best match for specific input queries. This research presents a better querying mechanism for information retrieval which integrates the ontology queries with keyword search. The ontology-based query is changed into a primary order to predicate logic uncertainty which is used for routing the query to the appropriate servers. Matching algorithms characterize warm area of researches in computer science and artificial intelligence. In text matching, it is more dependable to study semantics model and query for conditions of semantic matching. This research develops the semantic matching results between input queries and information in ontology field. The contributed algorithm is a hybrid method that is based on matching extracted instances from the queries and information field. The queries and information domain is focused on semantic matching, to discover the best match and to progress the executive process. In conclusion, the hybrid ontology in semantic web is sufficient to retrieve the documents when compared to standard ontology.

  20. Hybrid ontology for semantic information retrieval model using keyword matching indexing system.

    Science.gov (United States)

    Uthayan, K R; Mala, G S Anandha

    2015-01-01

    Ontology is the process of growth and elucidation of concepts of an information domain being common for a group of users. Establishing ontology into information retrieval is a normal method to develop searching effects of relevant information users require. Keywords matching process with historical or information domain is significant in recent calculations for assisting the best match for specific input queries. This research presents a better querying mechanism for information retrieval which integrates the ontology queries with keyword search. The ontology-based query is changed into a primary order to predicate logic uncertainty which is used for routing the query to the appropriate servers. Matching algorithms characterize warm area of researches in computer science and artificial intelligence. In text matching, it is more dependable to study semantics model and query for conditions of semantic matching. This research develops the semantic matching results between input queries and information in ontology field. The contributed algorithm is a hybrid method that is based on matching extracted instances from the queries and information field. The queries and information domain is focused on semantic matching, to discover the best match and to progress the executive process. In conclusion, the hybrid ontology in semantic web is sufficient to retrieve the documents when compared to standard ontology.

  1. Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus

    Directory of Open Access Journals (Sweden)

    Çağdaş Çapkın

    2016-12-01

    Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate the impact of these contents on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with the default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate the performance of these three systems, “precision - recall” and “normalized recall” tests were conducted. Experimental findings showed that there were no significant differences between MIR and FIR in mean average precision (MAP) performance. On the other hand, MAP performance of HIR was significantly higher in comparison to MIR and FIR. When information retrieval performance was evaluated as user-centered, the “normalized recall” performances of MIR and HIR were significantly higher than FIR. Additionally, there were no significant differences between the systems in retrieved relevant document means. Processing different types of contents such as metadata and full-text had some advantages and disadvantages for information retrieval systems in terms of term management. The advantages were brought together in hybrid content processing (HIR) and information retrieval performance improved.
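    The mean average precision (MAP) measure named in this abstract can be illustrated with a short sketch. This is the textbook definition only, not the authors' evaluation code; the function names and the toy document identifiers are assumptions made for illustration.

```python
def average_precision(ranked_ids, relevant_ids):
    """Average precision for one query.

    Averages precision@k over every rank k that holds a relevant document,
    dividing by the total number of relevant documents for the query.
    """
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """MAP over several queries; runs is a list of (ranked_ids, relevant_ids)."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Hypothetical toy data: two queries with hand-made rankings.
runs = [
    (["d1", "d2", "d3", "d4"], {"d1", "d3"}),
    (["d2", "d1"], {"d1"}),
]
print(round(mean_average_precision(runs), 4))
```

    Comparing systems such as MIR, FIR and HIR would then amount to computing this value once per system over the same query set; the significance testing reported in the abstract is a separate step not shown here.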

  2. Medical Information & Technology: Rapidly Expanding Vast Horizons

    Science.gov (United States)

    Sahni, Anil K.

    2012-12-01

    During "Medical Council Of India", Platinum Jubilee Year (1933-2008) Celebrations, In Year 2008, Several Scientific Meeting/Seminar/Symposium, On Various Topics Of Contemporary Importance And Relevance In The Field Of "Medical Education And Ethics", Were Organized, By Different Medical Colleges At Various Local, State, National Levels. The Present Discussion, Is A Comprehensive Summary Of Various Different Aspects of "Medical Information Communication Technology", Especially Useful For The Audience Stratum Group Of Those Amateur Medical & Paramedical Staff, With No Previous Work Experience Knowledge Of Computronics Applications. Outlining The, i. Administration Applications: Medical Records Etc, ii. Clinical Applications: Prospective Scope Of TeleMedicine Applicabilities Etc, iii. Other Applications: Efforts To Augment Improvement Of Medical Education, Medical Presentations, Medical Education And Research Etc. "Medical Transcription" & Related Recent Study Fields e.g. "Modern Pharmaceuticals", "Bio-Engineering", "Bio-Mechanics", "Bio-Technology" Etc., Along With Important Aspects Of Computers-General Considerations, Computer Ergonomics Assembled To Summarize, The Awareness Regarding Basic Fundamentals Of Medical Computronics & Its Practically Successful Utilities.

  3. The role of automated categorization in e-government information retrieval

    DEFF Research Database (Denmark)

    Jonasen, Tanja Svarre; Lykke, Marianne

    2013-01-01

    categorization can enhance information organization and retrieval and presents the results of a controlled evaluation that compared automated categorization and free text indexing of the government intranet used by Danish tax authorities. Thirty-two individuals participated in the evaluation, conducting...... knowledge was present, categorization was used to support the assumptions of a correct search. On the other hand, however, test participants avoided using automated categorization if high-precision documents were among the top results or if few documents were retrieved. The findings emphasize the importance...

  4. MAC Protocols for Optimal Information Retrieval Pattern in Sensor Networks with Mobile Access

    Directory of Open Access Journals (Sweden)

    Dong Min

    2005-01-01

    In signal field reconstruction applications of sensor networks, the locations from which the measurements are retrieved affect the reconstruction performance. In this paper, we consider the design of medium access control (MAC) protocols in sensor networks with mobile access for the desirable information retrieval pattern to minimize the reconstruction distortion. Taking both performance and implementation complexity into consideration, besides the optimal centralized scheduler, we propose three decentralized MAC protocols, namely, decentralized scheduling through carrier sensing, Aloha scheduling, and adaptive Aloha scheduling. Design parameters for the proposed protocols are optimized. Finally, performance comparison among these protocols is provided via simulations.

  5. Information retrieval for the Cochrane systematic reviews: the case of breast cancer surgery

    Directory of Open Access Journals (Sweden)

    Gaetana Cognetti

    2015-03-01

    Introduction. Systematic reviews are fundamental sources of knowledge on the state-of-the-art interventions for various clinical problems. One of the essential components in carrying out a systematic review is that of developing a comprehensive literature search. Materials and methods. Three Cochrane systematic reviews published in 2012 were retrieved using the MeSH descriptor breast neoplasms/surgery, and analyzed with respect to the information sources used and the search strategies adopted. In March 2014, an update of one of the reviews retrieved was also considered in the study. Results. The number of databases queried for each review ranged between three and seven. All the reviews reported the search strategies adopted, although some only partially. All the reviews explicitly claimed that the searches applied no language restriction, although sources such as the free database Lilacs (in Spanish and Portuguese) were not consulted. Conclusion. To improve quality, it is necessary to apply standards in carrying out systematic reviews (as laid down in the MECIR project). To meet these standards concerning literature searching, professional information retrieval specialist staff should be involved. The peer review committee in charge of evaluating the publication of a systematic review should also include specialists in information retrieval for assessing the quality of the literature search.

  6. Citation Index: an indispensable information retrieval tool for research and evaluation

    OpenAIRE

    Kademani, B. S.; Vijai Kumar, *

    2002-01-01

    This paper highlights the information explosion, the need for bibliographic control, the need for information retrieval tools. Explains the emergence of Citation Index, concept of citation indexing, reasons for citing, its structure (print and electronic versions of Science citation Index and Social Science Citation Index ), and application of citation index. It also discusses the search effectiveness, factors taken into consideration for coverage of journals in citation indexes, Journal Cita...

  7. Multimedia Retrieval

    NARCIS (Netherlands)

    Blanken, Henk; de Vries, A.P.; Blok, H.E.; Feng, L.

    2007-01-01

    Retrieval of multimedia data is different from retrieval of structured data. A key problem in multimedia databases is search, and the proposed solutions to the problem of multimedia information retrieval span a rather wide spectrum of topics outside the traditional database area, ranging from

  8. KAGIANA: An Excel-Based Tool for Retrieving Summary Information on Arabidopsis Genes

    Science.gov (United States)

    Ogata, Yoshiyuki; Sakurai, Nozomu; Aoki, Koh; Suzuki, Hideyuki; Okazaki, Koei; Saito, Kazuki; Shibata, Daisuke

    2009-01-01

    Various public databases provide Arabidopsis gene information via the internet. It is useful to abstract information obtained from such databases. We have developed the KAGIANA tool, which allows a user to retrieve summary information obtained from selective databases and to access pages for a gene of interest in those databases. The tool is based on Microsoft Excel and provides several macro programs for gene expression analyses. It can assist plant biologists in accessing omics information for plant biology. The KAGIANA tool is freely available at http://pmnedo.kazusa.or.jp/kagiana/. PMID:19043069

  9. KAGIANA: an excel-based tool for retrieving summary information on Arabidopsis genes.

    Science.gov (United States)

    Ogata, Yoshiyuki; Sakurai, Nozomu; Aoki, Koh; Suzuki, Hideyuki; Okazaki, Koei; Saito, Kazuki; Shibata, Daisuke

    2009-01-01

    Various public databases provide Arabidopsis gene information via the internet. It is useful to abstract information obtained from such databases. We have developed the KAGIANA tool, which allows a user to retrieve summary information obtained from selective databases and to access pages for a gene of interest in those databases. The tool is based on Microsoft Excel and provides several macro programs for gene expression analyses. It can assist plant biologists in accessing omics information for plant biology. The KAGIANA tool is freely available at http://pmnedo.kazusa.or.jp/kagiana/.

  10. The use of ICF codes for information retrieval in rehabilitation research: an empirical study.

    Science.gov (United States)

    Sundar, Vidyalakshmi; Daumen, Marcia E; Conley, Daniel J; Stone, John H

    2008-01-01

    Rehabilitation research information can be obtained from various bibliographic sources. Nevertheless, search strategies and terminologies differ from one database to another, making it challenging for the novice user or users of multiple databases. This paper discusses a novel approach of using the International Classification of Functioning, Disability and Health (ICF) codes to retrieve rehabilitation research information. A crosswalk was created by mapping the Center for International Rehabilitation Research and Information Exchange's (CIRRIE) subject headings to the two-level ICF codes and a search interface was developed (available at: http://cirrie.buffalo.edu/icf/crosswalk.php) so that users can input ICF codes instead of conventional subject headings. About 62% of all CIRRIE subject headings were mapped to equivalent ICF codes. Among the CIRRIE subject headings that were mapped, 43% were mapped to the Environmental Factors, followed by 34% mapped to the Activities and Participation component of the ICF. Although the ICF was not conceived or developed as a system of formal terminology, it can be used effectively for information retrieval in conjunction with an existing vocabulary. This paper describes the first attempt in implementing the use of ICF for information retrieval.

  11. One of the Methods of Organizing the Information Storage Unit of a System of Data Retrieval and Processing (SPOD).

    Science.gov (United States)

    Askinazi, R. B.; Papina, I. L.

    The paper deals with one method of organizing the storage unit of a descriptor IPS (information retrieval system) of the SPOD type, the information array of which constitutes the totality of uniform documents with ordered disposition of data within each of them. Three categories of data composing the retrieval form of document were defined…

  12. Information retrieval for systematic reviews in food and feed topics: a narrative review.

    Science.gov (United States)

    Wood, Hannah; O'Connor, Annette; Sargeant, Jan; Glanville, Julie

    2018-01-09

    Systematic review methods are now being used for reviews of food production, food safety and security, plant health, and animal health and welfare. Information retrieval methods in this context have been informed by human healthcare approaches and ideally should be based on relevant research and experience. This narrative review seeks to identify and summarise current research-based evidence and experience on information retrieval for systematic reviews in food and feed topics. MEDLINE (Ovid), Science Citation Index (Web of Science) and ScienceDirect (http://www.sciencedirect.com/) were searched in 2012 and 2016. We also contacted topic experts and undertook citation searches. We selected and summarised studies reporting research on information retrieval, as well as published guidance and experience. There is little published evidence on the most efficient way to conduct searches for food and feed topics. There are few available study design search filters, and their use may be problematic given poor or inconsistent reporting of study methods. Food and feed research makes use of a wide range of study designs so it might be best to focus strategy development on capturing study populations, although this also has challenges. There is limited guidance on which resources should be searched and whether publication bias in disciplines relevant to food and feed necessitates extensive searching of the grey literature. There is some limited evidence on information retrieval approaches, but more research is required to inform effective and efficient approaches to searching to populate food and feed reviews.

  13. A novel architecture for information retrieval system based on semantic web

    Science.gov (United States)

    Zhang, Hui

    2011-12-01

    Nowadays, the web has enabled an explosive growth of information sharing (there are currently over 4 billion pages covering most areas of human endeavor), so the web faces a new challenge of information overload. The challenge now before us is not only to help people locate relevant information precisely but also to access and aggregate a variety of information from different resources automatically. Current web documents are in human-oriented formats that are suitable for presentation, but machines cannot understand their meaning. To address this issue, Berners-Lee proposed the concept of the semantic web. With semantic web technology, web information can be understood and processed by machines, which provides new possibilities for automatic web information processing. A main problem of semantic web information retrieval is that when there is not enough knowledge available to such an information retrieval system, the system returns a large number of meaningless results to users because of the huge amount of information. In this paper, we present the architecture of an information retrieval system based on the semantic web. In addition, our system employs an inference engine to check whether a query should be posed to the keyword-based search engine or to the semantic search engine.

  14. A semi-synthetic organism that stores and retrieves increased genetic information.

    Science.gov (United States)

    Zhang, Yorke; Ptacin, Jerod L; Fischer, Emil C; Aerni, Hans R; Caffaro, Carolina E; San Jose, Kristine; Feldman, Aaron W; Turner, Court R; Romesberg, Floyd E

    2017-11-29

    Since at least the last common ancestor of all life on Earth, genetic information has been stored in a four-letter alphabet that is propagated and retrieved by the formation of two base pairs. The central goal of synthetic biology is to create new life forms and functions, and the most general route to this goal is the creation of semi-synthetic organisms whose DNA harbours two additional letters that form a third, unnatural base pair. Previous efforts to generate such semi-synthetic organisms culminated in the creation of a strain of Escherichia coli that, by virtue of a nucleoside triphosphate transporter from Phaeodactylum tricornutum, imports the requisite unnatural triphosphates from its medium and then uses them to replicate a plasmid containing the unnatural base pair dNaM-dTPT3. Although the semi-synthetic organism stores increased information when compared to natural organisms, retrieval of the information requires in vivo transcription of the unnatural base pair into mRNA and tRNA, aminoacylation of the tRNA with a non-canonical amino acid, and efficient participation of the unnatural base pair in decoding at the ribosome. Here we report the in vivo transcription of DNA containing dNaM and dTPT3 into mRNAs with two different unnatural codons and tRNAs with cognate unnatural anticodons, and their efficient decoding at the ribosome to direct the site-specific incorporation of natural or non-canonical amino acids into superfolder green fluorescent protein. The results demonstrate that interactions other than hydrogen bonding can contribute to every step of information storage and retrieval. The resulting semi-synthetic organism both encodes and retrieves increased information and should serve as a platform for the creation of new life forms and functions.

  15. Applied information retrieval and multidisciplinary research: new mechanistic hypotheses in Complex Regional Pain Syndrome

    Science.gov (United States)

    Hettne, Kristina M; de Mos, Marissa; de Bruijn, Anke GJ; Weeber, Marc; Boyer, Scott; van Mulligen, Erik M; Cases, Montserrat; Mestres, Jordi; van der Lei, Johan

    2007-01-01

    Background: Collaborative efforts of physicians and basic scientists are often necessary in the investigation of complex disorders. Difficulties can arise, however, when large amounts of information need to be reviewed. Advanced information retrieval can be beneficial in combining and reviewing data obtained from the various scientific fields. In this paper, a team of investigators with varying backgrounds has applied advanced information retrieval methods, in the form of text mining and entity relationship tools, to review the current literature, with the intention to generate new insights into the molecular mechanisms underlying a complex disorder. As an example of such a disorder the Complex Regional Pain Syndrome (CRPS) was chosen. CRPS is a painful and debilitating syndrome with a complex etiology that is still largely unresolved, resulting in suboptimal diagnosis and treatment. Results: A text mining based approach combined with a simple network analysis identified Nuclear Factor kappa B (NFκB) as a possible central mediator in both the initiation and progression of CRPS. Conclusion: The result shows the added value of a multidisciplinary approach combined with information retrieval in hypothesis discovery in biomedical research. The new hypothesis, which was derived in silico, provides a framework for further mechanistic studies into the underlying molecular mechanisms of CRPS and requires evaluation in clinical and epidemiological studies. PMID:17480215

  16. A Notation for Rapid Specification of Information Visualization

    Science.gov (United States)

    Lee, Sang Yun

    2013-01-01

    This thesis describes a notation for rapid specification of information visualization, which can be used as a theoretical framework of integrating various types of information visualization, and its applications at a conceptual level. The notation is devised to codify the major characteristics of data/visual structures in conventionally-used…

  17. Parsed and fixed block representations of visual information for image retrieval

    Science.gov (United States)

    Bae, Soo Hyun; Juang, Biing-Hwang

    2009-02-01

    The theory of linguistics teaches us the existence of a hierarchical structure in linguistic expressions, from letter to word root, and on to word and sentences. By applying syntax and semantics beyond words, one can further recognize the grammatical relationship among words and the meaning of a sequence of words. This layered view of a spoken language is useful for effective analysis and automated processing. Thus, it is interesting to ask if a similar hierarchy of representation of visual information does exist. A class of techniques that have a similar nature to the linguistic parsing is found in the Lempel-Ziv incremental parsing scheme. Based on a new class of multidimensional incremental parsing algorithms extended from the Lempel-Ziv incremental parsing, a new framework for image retrieval, which takes advantage of the source characterization property of the incremental parsing algorithm, was proposed recently. With the incremental parsing technique, a given image is decomposed into a number of patches, called a parsed representation. This representation can be thought of as a morphological interface between elementary pixels and a higher level representation. In this work, we examine the properties of two-dimensional parsed representation in the context of imagery information retrieval and in contrast to vector quantization; i.e. fixed square-block representations and minimum average distortion criteria. We implemented four image retrieval systems for the comparative study; three, called IPSILON image retrieval systems, use parsed representation with different perceptual distortion thresholds and one uses the conventional vector quantization for visual pattern analysis. We observe that different perceptual distortion in visual pattern matching does not have serious effects on the retrieval precision, although allowing looser perceptual thresholds in image compression results in poor reconstruction fidelity.
We compare the effectiveness of the use of the
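
    The Lempel-Ziv incremental parsing that this abstract builds on can be sketched for the one-dimensional case. The sketch below is the classic LZ78-style rule applied to a string, offered only as an illustration; the multidimensional extension and the perceptual distortion thresholds described in the abstract are not reproduced, and the function name is an assumption.

```python
def incremental_parse(sequence):
    """LZ78-style incremental parsing of a string.

    The input is split into phrases such that each phrase is the shortest
    prefix of the remaining input that has not appeared as a phrase before.
    """
    seen = set()
    phrases = []
    current = ""
    for symbol in sequence:
        current += symbol
        if current not in seen:
            # First time this phrase occurs: record it and start a new one.
            seen.add(current)
            phrases.append(current)
            current = ""
    if current:
        # Input ended inside a phrase that duplicates an earlier one.
        phrases.append(current)
    return phrases


print(incremental_parse("aababcabcd"))
```

    In the image setting described above, the phrases correspond to variable-size patches rather than substrings, which is what distinguishes a parsed representation from the fixed square blocks of vector quantization.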

  18. Theory and approach of information retrievals from electromagnetic scattering and remote sensing

    CERN Document Server

    Jin, Ya-Qiu

    2006-01-01

    Covers several hot topics in current research of electromagnetic scattering, and radiative transfer in complex and random media, polarimetric scattering and SAR imagery technology, data validation and information retrieval from space-borne remote sensing, computational electromagnetics, etc. Including both forward modelling and inverse problems, analytic theory and numerical approaches. An overall summary of the author's works during most recent years. Also presents some insight for future research topics.

  19. Design and usability study of an iconic user interface to ease information retrieval of medical guidelines.

    Science.gov (United States)

    Griffon, Nicolas; Kerdelhué, Gaétan; Hamek, Saliha; Hassler, Sylvain; Boog, César; Lamy, Jean-Baptiste; Duclos, Catherine; Venot, Alain; Darmoni, Stéfan J

    2014-10-01

    Doc'CISMeF (DC) is a semantic search engine used to find resources in CISMeF-BP, a quality controlled health gateway, which gathers guidelines available on the internet in French. Visualization of Concepts in Medicine (VCM) is an iconic language that may ease information retrieval tasks. This study aimed to describe the creation and evaluation of an interface integrating VCM in DC in order to make this search engine much easier to use. Focus groups were organized to suggest ways to enhance information retrieval tasks using VCM in DC. A VCM interface was created and improved using the ergonomic evaluation approach. 20 physicians were recruited to compare the VCM interface with the non-VCM one. Each evaluator answered two different clinical scenarios in each interface. The ability and time taken to select a relevant resource were recorded and compared. A usability analysis was performed using the System Usability Scale (SUS). The VCM interface contains a filter based on icons, and icons describing each resource according to focus group recommendations. Some ergonomic issues were resolved before evaluation. Use of VCM significantly increased the success of information retrieval tasks (OR=11; 95% CI 1.4 to 507). Nonetheless, it took significantly more time to find a relevant resource with VCM interface (101 vs 65 s; p=0.02). SUS revealed 'good' usability with an average score of 74/100. VCM was successfully implemented in DC as an option. It increased the success rate of information retrieval tasks, despite requiring slightly more time, and was well accepted by end-users.

  20. Administrative professional's role in the processing, retrieval, dissemination and repackaging of information in the networked enterprise

    OpenAIRE

    2008-01-01

    The purpose of this research was to establish the administrative professional's role in the processing, retrieval, dissemination and repackaging of digital information in the networked enterprise, and to determine how the administrative professional can add value to the organisation and enhance its competitive position in industry. The digital economy has changed business practices to such an extent that research of the digital office environment and the administrative professional’s role in ...

  1. Opportunistic Carrier Sensing for Energy-Efficient Information Retrieval in Sensor Networks

    Directory of Open Access Journals (Sweden)

    Zhao Qing

    2005-01-01

    We consider distributed information retrieval for sensor networks with cluster heads or mobile access points. The performance metric used in the design is energy efficiency defined as the ratio of the average number of bits reliably retrieved by the access point to the total amount of energy consumed. A distributed opportunistic transmission protocol is proposed using a combination of carrier sensing and backoff strategy that incorporates channel state information (CSI) of individual sensors. By selecting a set of sensors with the best channel states to transmit, the proposed protocol achieves the upper bound on energy efficiency when the signal propagation delay is negligible. For networks with substantial propagation delays, a backoff function optimized for energy efficiency is proposed. The design of this backoff function utilizes properties of extreme statistics and is shown to have mild performance loss in practical scenarios. We also demonstrate that opportunistic strategies that use CSI may not be optimal when channel acquisition at individual sensors consumes substantial energy. We show further that there is an optimal sensor density for which the opportunistic information retrieval is the most energy efficient. This observation leads to the design of the optimal sensor duty cycle.
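
    The core idea of the opportunistic protocol, that a sensor with a better channel waits less before transmitting so that the others hear the carrier and stay silent, can be sketched as follows. The backoff function here is a hypothetical placeholder chosen only because it decreases with channel gain; it is not the optimized function derived in the paper, and all names and numbers are assumptions.

```python
def backoff(gain, max_delay=1.0):
    """Hypothetical backoff delay: a stronger channel gives a shorter wait.

    Any strictly decreasing function of the gain would show the same idea.
    """
    return max_delay / (1.0 + gain)


def select_transmitter(gains):
    """Index of the sensor whose backoff timer expires first.

    With negligible propagation delay, all other sensors sense the carrier
    and defer, so this sensor is the only one that transmits.
    """
    delays = [backoff(g) for g in gains]
    return min(range(len(gains)), key=lambda i: delays[i])


def energy_efficiency(bits_retrieved, energy_consumed):
    """Metric from the abstract: bits reliably retrieved per unit energy."""
    return bits_retrieved / energy_consumed


# Hypothetical channel gains for three sensors.
gains = [0.2, 1.5, 0.7]
print(select_transmitter(gains))
```

    Because the timers are driven only by each sensor's own channel state, no central scheduler is needed, which is the sense in which the abstract calls the protocol distributed.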

  2. Machine Learning or Information Retrieval Techniques for Bug Triaging: Which is better?

    Directory of Open Access Journals (Sweden)

    Anjali Goyal

    2017-07-01

    Bugs are the inevitable part of a software system. Nowadays, large software development projects even release beta versions of their products to gather bug reports from users. The collected bug reports are then worked upon by various developers in order to resolve the defects and make the final software product more reliable. The high frequency of incoming bugs makes the bug handling a difficult and time consuming task. Bug assignment is an integral part of bug triaging that aims at the process of assigning a suitable developer for the reported bug who corrects the source code in order to resolve the bug. There are various semi and fully automated techniques to ease the task of bug assignment. This paper presents the current state of the art of various techniques used for bug report assignment. Through exhaustive research, the authors have observed that machine learning and information retrieval based bug assignment approaches are most popular in literature. A deeper investigation has shown that the trend of techniques is taking a shift from machine learning based approaches towards information retrieval based approaches. Therefore, the focus of this work is to find the reason behind the observed drift and thus a comparative analysis is conducted on the bug reports of the Mozilla, Eclipse, Gnome and Open Office projects in the Bugzilla repository. The results of the study show that the information retrieval based technique yields better efficiency in recommending the developers for bug reports.
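
    An information retrieval based approach to bug assignment can be illustrated with a small sketch. The abstract does not say which retrieval model the study used, so TF-IDF weighting with cosine similarity is assumed here as one common choice; the developer names, report texts and function names are all hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer; real systems also stem and drop stop words."""
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict) per document, plus the IDF table."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter(term for tokens in tokenized for term in set(tokens))
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in Counter(tokens).items()}
               for tokens in tokenized]
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend_developer(history, new_report):
    """Pick the developer whose previously fixed reports best match the new one.

    history maps a developer name to the concatenated text of reports they fixed.
    """
    developers = list(history)
    vectors, idf = tfidf_vectors([history[d] for d in developers])
    query = {t: tf * idf[t]
             for t, tf in Counter(tokenize(new_report)).items() if t in idf}
    scores = {d: cosine(query, v) for d, v in zip(developers, vectors)}
    return max(scores, key=scores.get)


# Hypothetical history of fixed reports.
history = {
    "alice": "crash in rendering engine when loading svg image",
    "bob": "login page password reset email not sent",
}
print(recommend_developer(history, "svg image crash on load"))
```

    A machine learning based approach would instead train a classifier on labelled reports; the sketch ranks developers by textual similarity alone, which is the distinction the abstract draws between the two families.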

  3. Centronuclear myopathy in Labrador retrievers: a recent founder mutation in the PTPLA gene has rapidly disseminated worldwide.

    Directory of Open Access Journals (Sweden)

    Marie Maurer

    Centronuclear myopathies (CNM) are inherited congenital disorders characterized by an excessive number of internalized nuclei. In humans, CNM results from ~70 mutations in three major genes from the myotubularin, dynamin and amphiphysin families. Analysis of animal models with altered expression of these genes revealed common defects in all forms of CNM, paving the way for unified pathogenic and therapeutic mechanisms. Despite these efforts, some CNM cases remain genetically unresolved. We previously identified an autosomal recessive form of CNM in French Labrador retrievers from an experimental pedigree, and showed that a loss-of-function mutation in the protein tyrosine phosphatase-like A (PTPLA) gene segregated with CNM. Around the world, client-owned Labrador retrievers with a similar clinical presentation and histopathological changes in muscle biopsies have been described. We hypothesized that these Labradors share the same PTPLA(cnm) mutation. Genotyping of an international panel of 7,426 Labradors led to the identification of PTPLA(cnm) carriers in 13 countries. Haplotype analysis demonstrated that the PTPLA(cnm) allele resulted from a single and recent mutational event that may have rapidly disseminated through the extensive use of popular sires. PTPLA-deficient Labradors will help define the integrated role of PTPLA in the existing CNM gene network. They will be valuable complementary large animal models to test innovative therapies in CNM.

  4. Regulatory and Permitting Information Desktop (RAPID) Toolkit (Poster)

    Energy Technology Data Exchange (ETDEWEB)

    Young, K. R.; Levine, A.

    2014-09-01

    The Regulatory and Permitting Information Desktop (RAPID) Toolkit combines the former Geothermal Regulatory Roadmap, National Environmental Policy Act (NEPA) Database, and other resources into a Web-based tool that gives the regulatory and utility-scale geothermal developer communities rapid and easy access to permitting information. RAPID currently comprises five tools - Permitting Atlas, Regulatory Roadmap, Resource Library, NEPA Database, and Best Practices. A beta release of an additional tool, the Permitting Wizard, is scheduled for late 2014. Because of the huge amount of information involved, RAPID was developed in a wiki platform to allow industry and regulatory agencies to maintain the content in the future so that it continues to provide relevant and accurate information to users. In 2014, the content was expanded to include regulatory requirements for utility-scale solar and bulk transmission development projects. Going forward, development of the RAPID Toolkit will focus on expanding the capabilities of current tools, developing additional tools, including additional technologies, and continuing to increase stakeholder involvement.

  5. Exploring interdisciplinary relationships between linguistics and information retrieval from the 1960s to today

    DEFF Research Database (Denmark)

    Engerer, Volkmar Paul

    2017-01-01

    This article explores how linguistics has influenced information retrieval (IR) and attempts to explain the impact of linguistics through an analysis of internal developments in information science generally, and IR in particular. It notes that information science/IR has been evolving from a case science into a fully fledged, “disciplined”/disciplinary science. The article establishes correspondences between linguistics and information science/IR using the three established IR paradigms (physical, cognitive, and computational) as a frame of reference. The current relationship between information science/IR and linguistics is elucidated through discussion of some recent information science publications dealing with linguistic topics, and a novel technique, “keyword collocation analysis,” is introduced. Insights from interdisciplinarity research and case theory are also discussed. It is demonstrated...

  6. Understanding the aerosol information content in multi-spectral reflectance measurements using a synergetic retrieval algorithm

    Directory of Open Access Journals (Sweden)

    D. Martynenko

    2010-11-01

    An information content analysis for the multi-wavelength SYNergetic AErosol Retrieval algorithm SYNAER was performed to quantify the number of independent pieces of information that can be retrieved. In particular, the capability of SYNAER to discern various aerosol types is assessed. This information content depends on the aerosol optical depth, the surface albedo spectrum and the observation geometry. The theoretical analysis is performed for a large number of scenarios with various geometries and surface albedo spectra for ocean, soil and vegetation. When the surface albedo spectrum and its accuracy are known under cloud-free conditions, the reflectance measurements used in SYNAER are able to provide 2–4 degrees of freedom that can be attributed to the retrieval parameters: aerosol optical depth, aerosol type and surface albedo.

    The focus of this work is placed on an information content analysis with emphasis on the aerosol type classification. This analysis is applied to synthetic reflectance measurements for 40 predefined aerosol mixtures of different basic components, given by sea salt, mineral dust, biomass burning and diesel aerosols, water soluble and water insoluble aerosols. The range of aerosol parameters considered through the 40 mixtures covers the natural variability of tropospheric aerosols. After the information content analysis performed in Holzer-Popp et al. (2008), there was a need to compare the derived degrees of freedom with the retrieved aerosol optical depth for different aerosol types, which is the main focus of this paper.

    Principal component analysis was used to determine the correspondence between degrees of freedom for signal in the retrieval and derived aerosol types. The main results of the analysis indicate correspondence between the major groups of the aerosol types, which are: water soluble aerosol, soot, mineral dust and sea salt and degrees of freedom in the algorithm and show the ability of the SYNAER to

  7. Barriers to retrieving patient information from electronic health record data: failure analysis from the TREC Medical Records Track.

    Science.gov (United States)

    Edinger, Tracy; Cohen, Aaron M; Bedrick, Steven; Ambert, Kyle; Hersh, William

    2012-01-01

    Secondary use of electronic health record (EHR) data relies on the ability to retrieve accurate and complete information about desired patient populations. The Text Retrieval Conference (TREC) 2011 Medical Records Track was a challenge evaluation allowing comparison of systems and algorithms to retrieve patients eligible for clinical studies from a corpus of de-identified medical records, grouped by patient visit. Participants retrieved cohorts of patients relevant to 35 different clinical topics, and visits were judged for relevance to each topic. This study identified the most common barriers to identifying specific clinic populations in the test collection. Using the runs from track participants and judged visits, we analyzed the five non-relevant visits most often retrieved and the five relevant visits most often overlooked. Categories were developed iteratively to group the reasons for incorrect retrieval for each of the 35 topics. Reasons fell into nine categories for non-relevant visits and five categories for relevant visits. Non-relevant visits were most often retrieved because they contained a non-relevant reference to the topic terms. Relevant visits were most often infrequently retrieved because they used a synonym for a topic term. This failure analysis provides insight into areas for future improvement in EHR-based retrieval with techniques such as more widespread and complete use of standardized terminology in retrieval and data entry systems.

  8. Concept similarity and related categories in information retrieval using formal concept analysis

    Science.gov (United States)

    Eklund, P.; Ducrou, J.; Dau, F.

    2012-11-01

    The application of formal concept analysis to the problem of information retrieval has been shown useful but has lacked any real analysis of the idea of relevance ranking of search results. SearchSleuth is a program developed to experiment with the automated local analysis of Web search using formal concept analysis. SearchSleuth extends a standard search interface to include a conceptual neighbourhood centred on a formal concept derived from the initial query. This neighbourhood of the concept derived from the search terms is decorated with its upper and lower neighbours representing more general and special concepts, respectively. SearchSleuth is in many ways an archetype of search engines based on formal concept analysis with some novel features. In SearchSleuth, the notion of related categories - which are themselves formal concepts - is also introduced. This allows the retrieval focus to shift to a new formal concept called a sibling. This movement across the concept lattice needs to relate one formal concept to another in a principled way. This paper presents the issues concerning exploring, searching, and ordering the space of related categories. The focus is on understanding the use and meaning of proximity and semantic distance in the context of information retrieval using formal concept analysis.
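
    The step of deriving a formal concept from the initial query can be sketched with the two standard derivation operators of formal concept analysis. This is a minimal illustration, not SearchSleuth's implementation: the toy context, the function names, and the way more general concepts are found (dropping one attribute and re-closing) are all assumptions made for the example.

```python
# Toy formal context: each document (object) maps to its index terms (attributes).
# The documents and terms are invented for illustration.
CONTEXT = {
    "doc1": {"search", "lattice", "ranking"},
    "doc2": {"search", "lattice"},
    "doc3": {"search", "ranking"},
    "doc4": {"lattice", "ontology"},
}


def extent(attributes):
    """Objects that carry every attribute in `attributes`."""
    return {obj for obj, attrs in CONTEXT.items() if attributes <= attrs}


def intent(objects):
    """Attributes shared by every object in `objects`."""
    shared = set().union(*CONTEXT.values())
    for obj in objects:
        shared &= CONTEXT[obj]
    return shared


def concept_from_query(query_terms):
    """Close a set of query terms into a formal concept (extent, intent)."""
    objs = extent(set(query_terms))
    return objs, intent(objs)


def generalisations(concept):
    """More general concepts reached by dropping one attribute and re-closing.

    Hypothetical helper: these are not guaranteed to be immediate upper
    neighbours in the lattice, only concepts above the given one.
    """
    _, attrs = concept
    found = []
    for attr in sorted(attrs):
        candidate = concept_from_query(attrs - {attr})
        if candidate != concept and candidate not in found:
            found.append(candidate)
    return found


query_concept = concept_from_query({"ranking"})
more_general = generalisations(query_concept)
```

    In this toy context the query "ranking" closes to a concept whose intent also contains "search", because every document indexed with "ranking" is also indexed with "search".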

  9. Ontology Mapping: An Information Retrieval and Interactive Activation Network Based Approach

    Science.gov (United States)

    Mao, Ming

    Ontology mapping is to find semantic correspondences between similar elements of different ontologies. It is critical to achieve semantic interoperability in the WWW. This paper proposes a new generic and scalable ontology mapping approach based on propagation theory, information retrieval technique and artificial intelligence model. The approach utilizes both linguistic and structural information, measures the similarity of different elements of ontologies in a vector space model, and deals with constraints using the interactive activation network. The results of pilot study, the PRIOR, are promising and scalable.

  10. Experiments with Cross-Language Information Retrieval on a Health Portal for Psychology and Psychotherapy.

    Science.gov (United States)

    Andrenucci, Andrea

    2016-01-01

    Few studies have been performed within cross-language information retrieval (CLIR) in the field of psychology and psychotherapy. The aim of this paper is to analyze and assess the quality of available query translation methods for CLIR on a health portal for psychology. A test base of 100 user queries, 50 Multi Word Units (WUs) and 50 Single WUs, was used. Swedish was the source language and English the target language. Query translation methods based on machine translation (MT) and dictionary look-up were utilized in order to submit query translations to two search engines: Google Site Search and Quick Ask. Standard IR evaluation measures and a qualitative analysis were utilized to assess the results. The lexicon extracted with word alignment of the portal's parallel corpus provided better statistical results among dictionary look-ups. Google Translate provided more linguistically correct translations overall and also delivered better retrieval results in MT.

  11. Creation of reliable relevance judgments in information retrieval systems evaluation experimentation through crowdsourcing: a review.

    Science.gov (United States)

    Samimi, Parnia; Ravana, Sri Devi

    2014-01-01

    Test collection is used to evaluate the information retrieval systems in laboratory-based evaluation experimentation. In a classic setting, generating relevance judgments involves human assessors and is a costly and time consuming task. Researchers and practitioners are still being challenged in performing reliable and low-cost evaluation of retrieval systems. Crowdsourcing as a novel method of data acquisition is broadly used in many research fields. It has been proven that crowdsourcing is an inexpensive and quick solution as well as a reliable alternative for creating relevance judgments. One of the crowdsourcing applications in IR is to judge relevancy of query document pair. In order to have a successful crowdsourcing experiment, the relevance judgment tasks should be designed precisely to emphasize quality control. This paper is intended to explore different factors that have an influence on the accuracy of relevance judgments accomplished by workers and how to intensify the reliability of judgments in crowdsourcing experiment.

  12. Medical Information Retrieval Enhanced with User's Query Expanded with Tag-Neighbors

    DEFF Research Database (Denmark)

    Durao, Frederico; Bayyapu, Karunakar Reddy; Xu, Guandong

    2013-01-01

    Under-specified queries often lead to undesirable search results that do not contain the information needed. This problem gets worse when it comes to medical information, a natural human demand everywhere. Existing search engines on the Web often are unable to handle medical search well because they do not consider its special requirements. Often a medical information searcher is uncertain about his exact questions and unfamiliar with medical terminology. To overcome the limitations of under-specified queries, we utilize tags to enhance information retrieval capabilities by expanding users’ original queries with context-relevant information. We compute a set of significant tag neighbor candidates based on the neighbor frequency and weight, and utilize the qualified tag neighbors to expand an entry query. The proposed approach is evaluated by using MedWorm medical article collection...
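
    The tag-neighbor expansion described in this abstract can be illustrated roughly as follows. This is a sketch under stated assumptions: the abstract does not define how neighbor frequency and weight are computed, so plain co-occurrence counts stand in for both, and the tagged documents, `min_freq`, and `top_k` are invented for the example.

```python
from collections import Counter

# Hypothetical tagged articles: each set holds the tags attached to one article.
TAGGED_DOCS = [
    {"migraine", "headache", "neurology"},
    {"migraine", "headache", "triptan"},
    {"headache", "stress"},
    {"migraine", "aura", "neurology"},
]


def tag_neighbors(tag, docs):
    """Count how often every other tag co-occurs with `tag` on the same article."""
    counts = Counter()
    for tags in docs:
        if tag in tags:
            counts.update(tags - {tag})
    return counts


def expand_query(terms, docs, min_freq=2, top_k=2):
    """Append the most frequent tag neighbors of each query term.

    Assumption: a neighbor "qualifies" if it is among the top_k neighbors
    and co-occurs with the term at least min_freq times.
    """
    expanded = list(terms)
    for term in terms:
        ranked = sorted(tag_neighbors(term, docs).items(),
                        key=lambda item: (-item[1], item[0]))
        for neighbor, freq in ranked[:top_k]:
            if freq >= min_freq and neighbor not in expanded:
                expanded.append(neighbor)
    return expanded


expanded_query = expand_query(["migraine"], TAGGED_DOCS)
```

    With the toy data, "migraine" gains the neighbors "headache" and "neurology", each of which shares two articles with it.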

  13. Development of digital dashboard system for medical practice: maximizing efficiency of medical information retrieval and communication.

    Science.gov (United States)

    Lee, Kee Hyuck; Yoo, Sooyoung; Shin, HoGyun; Baek, Rong-Min; Chung, Chin Youb; Hwang, Hee

    2013-01-01

    It is reported that digital dashboard systems in hospitals provide a user interface (UI) that can centrally manage and retrieve various information related to patients in a single screen, support the decision-making of medical professionals on a real time basis by integrating the scattered medical information systems and core work flows, enhance the competence and decision-making ability of medical professionals, and reduce the probability of misdiagnosis. However, the digital dashboard systems of hospitals reported to date have some limitations when medical professionals use them to generally treat inpatients, because those were limitedly used for the work process of certain departments or developed to improve specific disease-related indicators. Seoul National University Bundang Hospital developed a new concept of EMR system to overcome such limitations. The system allows medical professionals to easily access all information on inpatients and effectively retrieve important information from any part of the hospital by displaying inpatient information in the form of digital dashboard. In this study, we would like to introduce the structure, development methodology and the usage of our new concept.

  14. Bat-Inspired Algorithm Based Query Expansion for Medical Web Information Retrieval.

    Science.gov (United States)

    Khennak, Ilyes; Drias, Habiba

    2017-02-01

    With the increasing amount of medical data available on the Web, looking for health information has become one of the most widely searched topics on the Internet. Patients and people of several backgrounds are now using Web search engines to acquire medical information, including information about a specific disease, medical treatment or professional advice. Nonetheless, due to a lack of medical knowledge, many laypeople have difficulties in forming appropriate queries to articulate their inquiries, which renders their search queries imprecise due to the use of unclear keywords. The use of these ambiguous and vague queries to describe the patients' needs has resulted in a failure of Web search engines to retrieve accurate and relevant information. One of the most natural and promising methods to overcome this drawback is Query Expansion. In this paper, an original approach based on the Bat Algorithm is proposed to improve the retrieval effectiveness of query expansion in the medical field. In contrast to the existing literature, the proposed approach uses the Bat Algorithm to find the best expanded query among a set of expanded query candidates, while maintaining low computational complexity. Moreover, this new approach allows the determination of the length of the expanded query empirically. Numerical results on MEDLINE, the on-line medical information database, show that the proposed approach is more effective and efficient compared to the baseline.

  15. Hydropower Regulatory and Permitting Information Desktop (RAPID) Toolkit

    Energy Technology Data Exchange (ETDEWEB)

    Levine, Aaron L [National Renewable Energy Laboratory (NREL), Golden, CO (United States)

    2017-12-19

    Hydropower Regulatory and Permitting Information Desktop (RAPID) Toolkit presentation from the WPTO FY14-FY16 Peer Review. The toolkit is aimed at regulatory agencies, consultants, project developers, the public, and any other party interested in learning more about the hydropower regulatory process.

  16. Developing Collective Learning Extension for Rapidly Evolving Information System Courses

    Science.gov (United States)

    Agarwal, Nitin; Ahmed, Faysal

    2017-01-01

    Due to rapidly evolving Information System (IS) technologies, instructors find themselves stuck in the constant game of catching up. On the same hand students find their skills obsolete almost as soon as they graduate. As part of IS curriculum and education, we need to emphasize more on teaching the students "how to learn" while keeping…

  17. Discrepancy between mRNA and protein abundance: insight from information retrieval process in computers.

    Science.gov (United States)

    Wang, Degeng

    2008-12-01

    Discrepancy between the abundance of cognate protein and RNA molecules is frequently observed. A theoretical understanding of this discrepancy remains elusive, and it is frequently described as surprises and/or technical difficulties in the literature. Protein and RNA represent different steps of the multi-stepped cellular genetic information flow process, in which they are dynamically produced and degraded. This paper explores a comparison with a similar process in computers: multi-step information flow from the storage level to the execution level. Functional similarities can be found in almost every facet of the retrieval process. Firstly, common architecture is shared, as the ribonome (RNA space) and the proteome (protein space) are functionally similar to the computer primary memory and the computer cache memory, respectively. Secondly, the retrieval process functions, in both systems, to support the operation of dynamic networks: biochemical regulatory networks in cells and, in computers, the virtual networks (of CPU instructions) that the CPU travels through while executing computer programs. Moreover, many regulatory techniques are implemented in computers at each step of the information retrieval process, with a goal of optimizing system performance. Cellular counterparts can be easily identified for these regulatory techniques. In other words, this comparative study attempted to utilize theoretical insight from computer system design principles as catalysis to sketch an integrative view of the gene expression process, that is, how it functions to ensure efficient operation of the overall cellular regulatory network. In context of this bird's-eye view, discrepancy between protein and RNA abundance became a logical observation one would expect. It was suggested that this discrepancy, when interpreted in the context of system operation, serves as a potential source of information to decipher regulatory logics underneath biochemical network operation.

  18. A Point-Set-Based Footprint Model and Spatial Ranking Method for Geographic Information Retrieval

    Directory of Open Access Journals (Sweden)

    Yong Gao

    2016-07-01

    In the recent big data era, massive spatially related data are continuously generated and gathered from various sources. Acquiring accurate geographic information is also urgently demanded. How to accurately retrieve desired geographic information has become a prominent issue that needs to be resolved with high priority. The key technologies in geographic information retrieval are modeling document footprints and ranking documents based on their similarity evaluation. The traditional spatial similarity evaluation methods are mainly performed using an MBR (Minimum Bounding Rectangle) footprint model. However, due to its nature of simplification and roughness, the results of traditional methods tend to be isotropic and space-redundant. In this paper, a new model that constructs the footprints in the form of point-sets is presented. The point-set-based footprint coincides with the nature of place names in web pages, so it is redundancy-free, consistent, accurate, and anisotropic in describing the spatial extents of documents, and can handle multi-scale geographic information. The corresponding spatial ranking method is also presented based on the point-set-based model. The new similarity evaluation algorithm of this method first measures multiple distances for the spatial proximity across different scales, and then combines the frequency of place names to improve the accuracy and precision. The experimental results show that the proposed method outperforms the traditional methods with higher accuracies under different searching scenarios.

  19. Assimilation of SMOS Retrieved Soil Moisture into the Land Information System

    Science.gov (United States)

    Blankenship, Clay; Case, Jonathan; Zavodsky, Bradley; Jedlovec, Gary

    2014-01-01

    Soil moisture retrievals from the Soil Moisture and Ocean Salinity (SMOS) instrument are assimilated into the Noah land surface model (LSM) within the NASA Land Information System (LIS). Before assimilation, SMOS retrievals are bias-corrected to match the model climatological distribution using a Cumulative Distribution Function (CDF) matching approach. Data assimilation is done via the Ensemble Kalman Filter. The goal is to improve the representation of soil moisture within the LSM, and ultimately to improve numerical weather forecasts through better land surface initialization. We present a case study showing a large area of irrigation in the lower Mississippi River Valley, in an area with extensive rice agriculture. High soil moisture values in this region are observed by SMOS, but not captured in the forcing data. After assimilation, the model fields reflect the observed geographic patterns of soil moisture. Plans for a modeling experiment and operational use of the data are given. This work helps prepare for the assimilation of Soil Moisture Active/Passive (SMAP) retrievals in the near future.
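
    The CDF-matching bias correction mentioned in this abstract can be sketched in a few lines. This is a generic empirical-quantile version, not the LIS implementation: the function name, the nearest-rank quantile rule, and the sample climatologies are assumptions made for illustration.

```python
from bisect import bisect_right


def cdf_match(value, obs_climatology, model_climatology):
    """Map an observed value onto the model's climatological distribution.

    The observation's empirical quantile in its own climatology is looked up,
    and the model value at the same quantile is returned (nearest-rank rule).
    """
    obs_sorted = sorted(obs_climatology)
    model_sorted = sorted(model_climatology)
    m, n = len(obs_sorted), len(model_sorted)
    # Number of climatological observations less than or equal to the value
    rank = bisect_right(obs_sorted, value)
    # Index of the same quantile in the model sample: ceil(rank * n / m) - 1,
    # computed with integers to avoid floating-point rounding
    idx = max((rank * n + m - 1) // m - 1, 0)
    return model_sorted[idx]


# Hypothetical volumetric soil moisture climatologies; the model runs wetter
obs = [0.10, 0.20, 0.30, 0.40]
model = [0.15, 0.25, 0.35, 0.45]
corrected = cdf_match(0.30, obs, model)
```

    Here an observation at the 75th percentile of the observed climatology is replaced by the model value at the same percentile, which removes the systematic offset before assimilation.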

  20. Evaluating A Priori Ozone Profile Information Used in TEMPO Tropospheric Ozone Retrievals

    Science.gov (United States)

    Johnson, M. S.; Sullivan, J. T.; Liu, X.; Newchurch, M.; Kuang, S.; McGee, T. J.; Langford, A. O.; Senff, C. J.; Leblanc, T.; Berkoff, T.; Gronoff, G.; Chen, G.; Strawbridge, K. B.

    2016-12-01

    Ozone (O3) is a greenhouse gas and toxic pollutant which plays a major role in air quality. Typically, monitoring of surface air quality and O3 mixing ratios is primarily conducted using in situ measurement networks. This is partially due to high-quality information related to air quality being limited from space-borne platforms due to coarse spatial resolution, limited temporal frequency, and minimal sensitivity to lower tropospheric and surface-level O3. The Tropospheric Emissions: Monitoring of Pollution (TEMPO) satellite is designed to address these limitations of current space-based platforms and to improve our ability to monitor North American air quality. TEMPO will provide hourly data of total column and vertical profiles of O3 with high spatial resolution to be used as a near-real-time air quality product. TEMPO O3 retrievals will apply the Smithsonian Astrophysical Observatory profile algorithm developed based on work from GOME, GOME-2, and OMI. This algorithm uses a priori O3 profile information from a climatological data-base developed from long-term ozone-sonde measurements (tropopause-based (TB) O3 climatology). It has been shown that satellite O3 retrievals are sensitive to a priori O3 profiles and covariance matrices. During this work we investigate the climatological data to be used in TEMPO algorithms (TB O3) and simulated data from the NASA GMAO Goddard Earth Observing System (GEOS-5) Forward Processing (FP) near-real-time (NRT) model products. These two data products will be evaluated with ground-based lidar data from the Tropospheric Ozone Lidar Network (TOLNet) at various locations of the US. This study evaluates the TB climatology, GEOS-5 climatology, and 3-hourly GEOS-5 data compared to lower tropospheric observations to demonstrate the accuracy of a priori information to potentially be used in TEMPO O3 algorithms. Here we present our initial analysis and the theoretical impact on TEMPO retrievals in the lower troposphere.

  1. Harnessing the Power of Education Research Databases with the Pearl-Harvesting Methodological Framework for Information Retrieval

    Science.gov (United States)

    Sandieson, Robert W.; Kirkpatrick, Lori C.; Sandieson, Rachel M.; Zimmerman, Walter

    2010-01-01

    Digital technologies enable the storage of vast amounts of information, accessible with remarkable ease. However, along with this facility comes the challenge to find pertinent information from the volumes of nonrelevant information. The present article describes the pearl-harvesting methodological framework for information retrieval. Pearl…

  2. A two-level cache for distributed information retrieval in search engines.

    Science.gov (United States)

    Zhang, Weizhe; He, Hui; Ye, Jianwei

    2013-01-01

    To improve the performance of distributed information retrieval in search engines, we propose a two-level cache structure based on the queries of the users' logs. We extract the highest rank queries of users from the static cache, in which the queries are the most popular. We adopt the dynamic cache as an auxiliary to optimize the distribution of the cache data. We propose a distribution strategy of the cache data. The experiments prove that the hit rate, the efficiency, and the time consumption of the two-level cache have advantages compared with other structures of cache.
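
    A two-level cache of the kind outlined in this abstract might look like the sketch below. It reads the abstract as filling the static level with the most popular queries from the log; the LRU eviction policy for the dynamic level, the class name, and the `backend` callable are assumptions, since the abstract does not specify them.

```python
from collections import Counter, OrderedDict


class TwoLevelCache:
    """Static cache of popular logged queries plus a small dynamic LRU cache."""

    def __init__(self, query_log, static_size, dynamic_size, backend):
        self.backend = backend  # hypothetical function: query -> results
        # Static level: results for the most frequent queries in the log
        popular = Counter(query_log).most_common(static_size)
        self.static = {query: backend(query) for query, _ in popular}
        # Dynamic level: least-recently-used eviction (an assumed policy)
        self.dynamic = OrderedDict()
        self.dynamic_size = dynamic_size
        self.hits = 0
        self.misses = 0

    def get(self, query):
        if query in self.static:
            self.hits += 1
            return self.static[query]
        if query in self.dynamic:
            self.hits += 1
            self.dynamic.move_to_end(query)  # mark as most recently used
            return self.dynamic[query]
        self.misses += 1
        result = self.backend(query)
        self.dynamic[query] = result
        if len(self.dynamic) > self.dynamic_size:
            self.dynamic.popitem(last=False)  # evict the oldest entry
        return result


log = ["a", "a", "b", "a", "c", "b"]
cache = TwoLevelCache(log, static_size=1, dynamic_size=1, backend=str.upper)
answers = [cache.get(q) for q in ["a", "b", "b", "c", "b"]]
```

    The hit rate reported in such experiments corresponds to `hits / (hits + misses)` over a replayed query stream.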

  3. Automatic generation of stop word lists for information retrieval and analysis

    Science.gov (United States)

    Rose, Stuart J

    2013-01-08

    Methods and systems for automatically generating lists of stop words for information retrieval and analysis. Generation of the stop words can include providing a corpus of documents and a plurality of keywords. From the corpus of documents, a term list of all terms is constructed and both a keyword adjacency frequency and a keyword frequency are determined. If a ratio of the keyword adjacency frequency to the keyword frequency for a particular term on the term list is less than a predetermined value, then that term is excluded from the term list. The resulting term list is truncated based on predetermined criteria to form a stop word list.
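
    The ratio test in this abstract can be sketched as follows. This is an illustrative reading, not the patented method: the tokenizer, the `min_ratio` threshold, and truncation by adjacency frequency with a `max_size` cutoff are assumed stand-ins for the "predetermined value" and "predetermined criteria", and terms that never occur next to a keyword are left out of the candidates.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-letters (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def build_stop_list(documents, keywords, min_ratio=1.0, max_size=5):
    adjacency = Counter()   # times a term sits directly next to a keyword
    in_keyword = Counter()  # times a term occurs inside a keyword
    keyword_tokens = [toks for toks in map(tokenize, keywords) if toks]
    for doc in documents:
        tokens = tokenize(doc)
        for kw in keyword_tokens:
            n = len(kw)
            for i in range(len(tokens) - n + 1):
                if tokens[i:i + n] != kw:
                    continue
                in_keyword.update(kw)
                if i > 0:
                    adjacency[tokens[i - 1]] += 1
                if i + n < len(tokens):
                    adjacency[tokens[i + n]] += 1
    candidates = []
    for term, adj in adjacency.items():
        kw_freq = in_keyword[term]
        # Exclude terms whose adjacency-to-keyword ratio falls below the threshold
        if kw_freq == 0 or adj / kw_freq >= min_ratio:
            candidates.append((adj, term))
    # Assumed truncation rule: keep the terms most often adjacent to keywords
    candidates.sort(key=lambda pair: (-pair[0], pair[1]))
    return [term for _, term in candidates[:max_size]]


docs = ["the cache of the search engine",
        "a cache for the search engine is the key"]
stop_words = build_stop_list(docs, ["cache", "search engine"], max_size=3)
```

    The intuition is that words which frequently border keywords but rarely appear inside them, such as "the" in the toy corpus, behave like delimiters and are good stop word candidates.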

  4. The use of categorization information in language models for question retrieval

    DEFF Research Database (Denmark)

    Cao, Xin; Cong, Gao; Cui, Bin

    2009-01-01

    and have become important information resources on the Web. To make the body of knowledge accumulated in CQA archives accessible, effective and efficient question search is required. Question search in a CQA archive aims to retrieve historical questions that are relevant to new questions posed by users. This paper proposes a category-based framework for search in CQA archives. The framework embodies several new techniques that use language models to exploit categories of questions for improving question-answer search. Experiments conducted on real data from Yahoo! Answers demonstrate that the proposed...

  5. Transportable, university-level educational programs in interactive information storage and retrieval systems

    Science.gov (United States)

    Dominick, Wayne D.; Roquemore, Leroy

    1984-01-01

    Pursuant to the specifications of a research contract entered into in December, 1983 with NASA, the Computer Science Departments of the University of Southwestern Louisiana and Southern University will be working jointly to address a variety of research and educational issues relating to the use, by non-computer professionals, of some of the largest and most sophisticated interactive information storage and retrieval systems available. Over the projected 6 to 8 year life of the project, in addition to NASA/RECON, the following systems will be examined: Lockheed DIALOG, DOE/RECON, DOD/DTIC, EPA/CSIN, and LLNL/TIS.

  6. A Simple Method to Determine if a Music Information Retrieval System is a "Horse"

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2014-01-01

    We propose and demonstrate a simple method to determine if a music information retrieval (MIR) system is using factors irrelevant to the task for which it is designed. This is of critical importance to certain use cases, but cannot be accomplished using standard approaches to evaluation in MIR. Akin to the controlled experiments designed to test the intellect of the famous horse "Clever Hans", we perform two experiments to show how three state-of-the-art music genre recognition (MGR) and music emotion recognition (MER) systems are relying on factors confounded with the "ground truth...

  7. Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval

    Directory of Open Access Journals (Sweden)

    Marcelo SCHIESSL

    The proposal presented in this study seeks to properly represent natural language to ontologies and vice-versa. Therefore, the semi-automatic creation of a lexical database in Brazilian Portuguese containing morphological, syntactic, and semantic information that can be read by machines was proposed, allowing the link between structured and unstructured data and its integration into an information retrieval model to improve precision. The results obtained demonstrated that the methodology can be used in the risco financeiro (financial risk) domain in Portuguese for the construction of an ontology and the lexical-semantic database and the proposal of a semantic information retrieval model. In order to evaluate the performance of the proposed model, documents containing the main definitions of the financial risk domain were selected and indexed with and without semantic annotation. To enable the comparison between the approaches, two databases were created. The first represents the traditional search, and the second contains the index built from the texts with the semantic annotations to represent the semantic search. The evaluation of the proposal was based on recall and precision. The queries submitted to the model showed that the semantic search outperforms the traditional search and validates the methodology used. Although more complex, the procedure proposed can be used in all kinds of domains.
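
    For reference, the two evaluation measures named in this abstract follow the standard set-based definitions, sketched below. The document identifiers are invented, and nothing here reproduces the study's data or results.

```python
def precision_recall(retrieved, relevant):
    """Set-based precision and recall for a single query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    # Precision: share of retrieved documents that are relevant
    precision = hits / len(retrieved) if retrieved else 0.0
    # Recall: share of relevant documents that were retrieved
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical result list and relevance judgments for one query
precision, recall = precision_recall(["d1", "d2", "d3", "d4"], ["d1", "d3", "d5"])
```

    Comparing the two indexes then amounts to running the same queries against each and comparing these two numbers per query.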

  8. Does the traditional thesaurus have a place in modern information retrieval?

    DEFF Research Database (Denmark)

    Hjørland, Birger

    2016-01-01

    The thesaurus has been - and still is - very important in the self-images of library and information professionals and scientists. However, as indicated by the recent debate in the ISKO UK (2015) the role of the thesaurus in modern information retrieval seemingly has shrunk from what it once...... was (although it won the day in the final voting of this debate). Why is this the case? What is the future prospect for thesauri? The three main points of this paper are: (1) Any knowledge organization system (KOS) is today threatened by Google-like systems, and it is therefore important to consider...... relations are most fruitful is thus an open question. (3) A thesaurus is today mostly considered a standardized tool but different domains may need different kinds of KOS including different sets of relations between terms. It is urgent that progress in information science and KOS is evaluated from proper...

  9. Global polar geospatial information service retrieval based on search engine and ontology reasoning

    Science.gov (United States)

    Chen, Nengcheng; E, Dongcheng; Di, Liping; Gong, Jianya; Chen, Zeqiang

    2007-01-01

    In order to improve the access precision of polar geospatial information services on the web, a new methodology for retrieving global spatial information services based on geospatial service search and ontology reasoning is proposed. The geospatial service search is implemented to find coarse services on the web, and the ontology reasoning is designed to find refined services among the coarse services. The proposed framework includes standardized distributed geospatial web services, a geospatial service search engine, an extended UDDI registry, and a multi-protocol geospatial information service client. Key technologies addressed include service discovery based on a search engine, and service ontology modeling and reasoning in the Antarctic geospatial context. Finally, an Antarctica multi-protocol OWS portal prototype based on the proposed methodology is introduced.

  10. Immediate-Early Gene Transcriptional Activation in Hippocampus CA1 and CA3 Does Not Accurately Reflect Rapid, Pattern Completion-Based Retrieval of Context Memory

    Science.gov (United States)

    Pevzner, Aleksandr; Guzowski, John F.

    2015-01-01

    No studies to date have examined whether immediate-early gene (IEG) activation is driven by context memory recall. To address this question, we utilized the context preexposure facilitation effect (CPFE) paradigm. In CPFE, animals acquire contextual fear conditioning through hippocampus-dependent rapid retrieval of a previously formed contextual…

  11. On the functional significance of retrieval mode: Task switching disrupts the recollection of conceptual stimulus information from episodic memory.

    Science.gov (United States)

    Küper, Kristina

    2018-01-01

    Episodic memory retrieval is assumed to be associated with the tonic cognitive state of retrieval mode. Despite extensive research into the neurophysiological correlates of retrieval mode, as of yet, relatively little is known about its functional significance. The present event-related potential (ERP) study was aimed at examining the impact of retrieval mode on the specificity of memory content retrieved in the course of familiarity and recollection processes. In two experiments, participants performed a recognition memory inclusion task in which they had to distinguish identically repeated and re-colored versions of study items from new items. In Experiment 1, participants had to alternate between the episodic memory task and a semantic task requiring a natural/artificial decision. In Experiment 2, the two tasks were instead performed in separate blocks. ERPs locked to the preparatory cues in the test phases indicated that participants did not establish retrieval mode on switch trials in Experiment 1. In the absence of retrieval mode, neither type of studied item elicited ERP correlates of familiarity-based retrieval (FN400). Recollection-related late positive complex (LPC) old/new effects emerged only for identically repeated but not for conceptually identical but perceptually changed versions of study items. With blocked retrieval in Experiment 2, both types of old items instead elicited equivalent FN400 and LPC old/new effects. The LPC data indicate that retrieval mode may play an important role in the successful recollection of conceptual stimulus information. The FN400 results additionally suggest that task switching may have a detrimental effect on familiarity-based memory retrieval. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Adaptation to Pronunciation Variations in Indonesian Spoken Query-Based Information Retrieval

    Science.gov (United States)

    Lestari, Dessi Puji; Furui, Sadaoki

    Recognition errors of proper nouns and foreign words significantly decrease the performance of ASR-based speech applications such as voice dialing systems, speech summarization, spoken document retrieval, and spoken query-based information retrieval (IR). The reason is that proper nouns and words that come from other languages are usually the most important key words. The loss of such words due to misrecognition in turn leads to a loss of significant information from the speech source. This paper focuses on how to improve the performance of Indonesian ASR by alleviating the problem of pronunciation variation of proper nouns and foreign words (English words in particular). To improve the proper noun recognition accuracy, proper-noun specific acoustic models are created by supervised adaptation using maximum likelihood linear regression (MLLR). To improve English word recognition, the pronunciation of English words contained in the lexicon is fixed by using rule-based English-to-Indonesian phoneme mapping. The effectiveness of the proposed method was confirmed through spoken query based Indonesian IR. We used Inference Network-based (IN-based) IR and compared its results with those of the classical Vector Space Model (VSM) IR, both using a tf-idf weighting schema. Experimental results show that IN-based IR outperforms VSM IR.

  13. Retrieval of Legal Information Through Discovery Layers: A Case Study Related to Indian Law Libraries

    Directory of Open Access Journals (Sweden)

    Kushwah, Shivpal Singh

    2016-09-01

    Full Text Available Purpose. The purpose of this paper is to analyze and evaluate discovery layer search tools for retrieval of legal information in Indian law libraries. This paper covers current practices in legal information retrieval with special reference to Indian academic law libraries, and analyses its importance in the domain of law. Design/Methodology/Approach. A web survey and an observational study method are used to collect the data. Data related to the discovery tools were collected by email, and further discussions were held with the discovery layer/tool/product developers and their representatives. Findings. Results show that most of the Indian law libraries subscribe to bundles of legal information resources such as Hein Online, JSTOR, LexisNexis Academic, Manupatra, Westlaw India, SCC web, AIR Online (CD-ROM), and so on. International legal and academic resources are compatible with discovery tools because they support various standards related to online publishing and dissemination such as OAI-PMH, OpenURL, MARC21, and Z39.50, but Indian legal resources such as Manupatra, AIR, and SCC are not compatible with the discovery layers. The central index is one of the important components of a discovery search interface, and discovery layer services/tools could also be useful for Indian law libraries if they can include multiple legal and academic resources in their central index. But present practices and observations reveal that discovery layers do not provide a facility to cover legal information resources. Therefore, in their present form, discovery tools are not very useful; they are an incomplete, partial solution for Indian libraries because not all Indian legal resources available in the law libraries are covered. Originality/Value. Very limited research or published literature is available in the area of discovery layers and their compatibility with legal information resources.

  14. Degenerative Encephalopathy in Nova Scotia Duck Tolling Retrievers Presenting with a Rapid Eye Movement Sleep Behavior Disorder.

    Science.gov (United States)

    Barker, E N; Dawson, L J; Rose, J H; Van Meervenne, S; Frykman, O; Rohdin, C; Leijon, A; Soerensen, K E; Järnegren, J; Johnson, G C; O'Brien, D P; Granger, N

    2016-09-01

    Neurodegenerative diseases are a heterogeneous group of disorders characterized by loss of neurons and are commonly associated with a genetic mutation. To characterize the clinical and histopathological features of a novel degenerative neurological disease affecting the brain of young adult Nova Scotia Duck Tolling Retrievers (NSDTRs). Nine, young adult, related NSDTRs were evaluated for neurological dysfunction and rapid eye movement sleep behavior disorder. Case series review. Clinical signs of neurological dysfunction began between 2 months and 5 years of age and were progressive in nature. They were characterized by episodes of marked movements during sleep, increased anxiety, noise phobia, and gait abnormalities. Magnetic resonance imaging documented symmetrical, progressively increasing, T2-weighted image intensity, predominantly within the caudate nuclei, consistent with necrosis secondary to gray matter degeneration. Abnormalities were not detected on clinicopathological analysis of blood and cerebrospinal fluid, infectious disease screening or urine metabolite screening in most cases. Postmortem examination of brain tissue identified symmetrical malacia of the caudate nuclei and axonal dystrophy within the brainstem and spinal cord. Genealogical analysis supports an autosomal recessive mode of inheritance. A degenerative encephalopathy was identified in young adult NSDTRs consistent with a hereditary disease. The prognosis is guarded due to the progressive nature of the disease, which is minimally responsive to empirical treatment. Copyright © 2016 The Authors. Journal of Veterinary Internal Medicine published by Wiley Periodicals, Inc. on behalf of the American College of Veterinary Internal Medicine.

  15. Middle-School Students' Online Information Problem Solving Behaviors on the Information Retrieval Interface

    Science.gov (United States)

    Yeh, Yi-Fen; Hsu, Ying-Shao; Chuang, Fu-Tai; Hwang, Fu-Kwun

    2014-01-01

    With the near-overload of online information, it is necessary to equip our students with the skills necessary to deal with Information Problem Solving (IPS). This study also intended to help students develop major IPS strategies with the assistance of an instructor's scaffolding in a designed IPS course as well as on an Online Information…

  16. A study of the use of simulated work task situations in interactive information retrieval evaluations

    DEFF Research Database (Denmark)

    Borlund, Pia

    2016-01-01

    Purpose – The purpose of this paper is to report a study of how the test instrument of a simulated work task situation is used in empirical evaluations of interactive information retrieval (IIR) and reported in the research literature. In particular, the author is interested to learn whether...... partly via citation analysis by use of Web of Science®, and partly by systematic search of online repositories. On this basis, 67 individual publications were identified and they constitute the sample of analysis. Findings – The analysis reveals a need for clarifications of how to use simulated work task...... situations in IIR evaluations. In particular, with respect to the design and creation of realistic simulated work task situations. There is a lack of tailoring of the simulated work task situations to the test participants. Likewise, the requirement to include the test participants’ personal information...

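  17. Information retrieval of final projects and document similarity calculation based on abstracts using the vector space model [INFORMATION RETRIEVAL TUGAS AKHIR DAN PERHITUNGAN KEMIRIPAN DOKUMEN MENGACU PADA ABSTRAK MENGGUNAKAN VECTOR SPACE MODEL]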

    Directory of Open Access Journals (Sweden)

    Putri Elfa Mas'udia

    2017-04-01

    Full Text Available The database search that students usually perform can only find matching titles based on the keywords entered. For example, if the keyword entered is "sistem cerdas" (intelligent system), all documents containing the words "sistem cerdas" are displayed, but the system cannot measure which document is the most similar. To search by the most similar substance, there is a technology called information (text) retrieval. In this study, an information retrieval system for final-project titles, with document similarity calculation using the vector space model, is developed. The system automatically performs indexing offline and retrieval in real time. The retrieval process begins by taking a query from the user and applying stop word removal to produce keywords that are compact but still represent the query; the system then calculates the similarity between the keywords and the list of documents represented by the terms in the index. The documents are displayed in order of similarity, most similar first. Test results show that when the keyword "android" is entered, four documents are displayed in order of similarity: docId 3 with a similarity of 0.9512, docId 4 with 0.5020, docId 2 with 0.2671, and docId 8 with 0.1522.
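
    The retrieval process this abstract describes (offline indexing, stop word removal on the query, then ranking by similarity) can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the stop-word list, the tf-idf weighting with a smoothed idf, the cosine measure, and the four sample titles are all assumptions, since the abstract does not specify them.

```python
import math
from collections import Counter

# Illustrative stop-word list (an assumption, not the paper's list).
STOP_WORDS = {"a", "an", "and", "for", "of", "on", "the", "using"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stop words."""
    return [w for w in text.lower().split() if w not in STOP_WORDS]


def build_index(docs):
    """Offline step: build one tf-idf vector per document plus the idf table."""
    tokens = {doc_id: tokenize(text) for doc_id, text in docs.items()}
    n_docs = len(docs)
    df = Counter(t for toks in tokens.values() for t in set(toks))
    idf = {t: math.log(n_docs / count) + 1.0 for t, count in df.items()}
    vectors = {
        doc_id: {t: tf * idf[t] for t, tf in Counter(toks).items()}
        for doc_id, toks in tokens.items()
    }
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def search(query, vectors, idf):
    """Online step: rank documents by similarity to the query, best first."""
    q_vec = {t: tf * idf[t] for t, tf in Counter(tokenize(query)).items() if t in idf}
    scored = ((doc_id, cosine(q_vec, vec)) for doc_id, vec in vectors.items())
    return sorted((p for p in scored if p[1] > 0.0), key=lambda p: p[1], reverse=True)


# Hypothetical final-project titles.
DOCS = {
    1: "Android attendance application",
    2: "Expert system for disease diagnosis",
    3: "Android",
    4: "Android game for learning",
}
VECTORS, IDF = build_index(DOCS)
for doc_id, score in search("android", VECTORS, IDF):
    print(doc_id, round(score, 4))
```

    Documents that share no term with the query score zero and are omitted, which mirrors the abstract's example in which only a subset of documents is returned for "android".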

  18. Construction of a bibliographic information database and development of retrieval system for research reports in nuclear science and technology (II)

    Energy Technology Data Exchange (ETDEWEB)

    Han, Duk Haeng; Kim, Tae Whan; Choi, Kwang; Yoo, An Na; Keum, Jong Yong; Kim, In Kwon [Korea Atomic Energy Research Institute, Taejon (Korea, Republic of)

    1996-05-01

    The major goal of this project is to construct a bibliographic information database in nuclear engineering and to develop a prototype retrieval system. To give easy access to microfiche research reports, this project has accomplished the construction of a microfiche research reports database and the development of a retrieval system. The results of the project are as follows: 1. A microfiche research reports database was constructed by downloading from DOE Energy, NTIS, and INIS. 2. The retrieval system was developed in host and web versions using access points such as title, abstract, keyword, and report number. 6 tabs., 8 figs., 11 refs. (Author)

  19. A real-time in-memory discovery service leveraging hierarchical packaging information in a unique identifier network to retrieve track and trace information

    CERN Document Server

    Müller, Jürgen

    2014-01-01

    This book examines how to efficiently retrieve track and trace information for an item that took a certain path through a complex network of manufacturers, wholesalers, retailers and consumers. It includes valuable tips on in-memory data management.

  20. Assimilation of SMOS Soil Moisture Retrievals in the Land Information System

    Science.gov (United States)

    Blankenship, Clay; Case, Jonathan L.; Zavodsky, Brad

    2014-01-01

    Soil moisture is a crucial variable for weather prediction because of its influence on evaporation. It is of critical importance for drought and flood monitoring and prediction and for public health applications. The NASA Short-term Prediction Research and Transition Center (SPoRT) has implemented a new module in the NASA Land Information System (LIS) to assimilate observations from the ESA's Soil Moisture and Ocean Salinity (SMOS) satellite. SMOS Level 2 retrievals from the Microwave Imaging Radiometer using Aperture Synthesis (MIRAS) instrument are assimilated into the Noah LSM within LIS via an Ensemble Kalman Filter. The retrievals have a target volumetric accuracy of 4% at a resolution of 35-50 km. Parallel runs with and without SMOS assimilation are performed with precipitation forcing from intentionally degraded observations, and then validated against a model run using the best available precipitation data, as well as against selected station observations. The goal is to demonstrate how SMOS data assimilation can improve modeled soil states in the absence of dense rain gauge and radar networks.
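
    For readers unfamiliar with the Ensemble Kalman Filter mentioned above, the analysis step can be sketched for a single soil-moisture value that is observed directly. This is a textbook scalar form with perturbed observations, not the LIS implementation; the ensemble values, the observation, and the reading of the 4% accuracy target as an observation error standard deviation of 0.04 are illustrative assumptions.

```python
import random
from statistics import mean, variance


def enkf_update(ensemble, obs, obs_var, perturbations):
    """Scalar EnKF analysis step for a directly observed state.

    ensemble      -- forecast ensemble members (e.g. volumetric soil moisture)
    obs           -- the retrieved observation
    obs_var       -- observation error variance
    perturbations -- one observation perturbation per member
    Returns (analysis_ensemble, kalman_gain).
    """
    forecast_var = variance(ensemble)            # sample variance (n - 1)
    gain = forecast_var / (forecast_var + obs_var)
    analysis = [x + gain * (obs + e - x) for x, e in zip(ensemble, perturbations)]
    return analysis, gain


# Hypothetical forecast ensemble and retrieval (m3/m3).
FORECAST = [0.20, 0.25, 0.30]
OBS = 0.35
OBS_VAR = 0.04 ** 2   # assumption: 4% accuracy target used as error std dev

rng = random.Random(0)
noise = [rng.gauss(0.0, OBS_VAR ** 0.5) for _ in FORECAST]
analysis, gain = enkf_update(FORECAST, OBS, OBS_VAR, noise)
print("gain:", gain)
print("analysis mean:", mean(analysis))
```

    The gain weights the observation by the ratio of forecast spread to total uncertainty, so a tight ensemble is moved less by the same retrieval than a widely spread one.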

  1. Imagery and retrieval of auditory and visual information: Neural correlates of successful and unsuccessful performance

    NARCIS (Netherlands)

    Huijbers, W.; Pennartz, C.M.A.; Rubin, D.C.; Daselaar, S.M.

    2011-01-01

    Remembering past events - or episodic retrieval - consists of several components. There is evidence that mental imagery plays an important role in retrieval and that the brain regions supporting imagery overlap with those supporting retrieval. An open issue is to what extent these regions support

  2. A rapid review of consumer health information needs and preferences.

    Science.gov (United States)

    Ramsey, Imogen; Corsini, Nadia; Peters, Micah D J; Eckert, Marion

    2017-09-01

    This rapid review summarizes best available evidence on consumers' needs and preferences for information about healthcare, with a focus on the Australian context. Three questions are addressed: 1) Where do consumers find and what platform do they use to access information about healthcare? 2) How do consumers use the healthcare information that they find? 3) About which topics or subjects do consumers need healthcare information? A hierarchical approach was adopted with evidence first sought from reviews then high quality studies using Medline (via PubMed), CINAHL, Embase, the JBI Database of Systematic Reviews and Implementation Reports, the Campbell Collaboration Library of Systematic Reviews, EPPI-Centre, and Epistemonikos. Twenty-eight articles were included; four systematic reviews, three literature reviews, thirteen quantitative studies, six qualitative studies, and two mixed methods studies. Consumers seek health information at varying times along the healthcare journey and through various modes of delivery. Complacency with historical health information modes is no longer appropriate and flexibility is essential to suit growing consumer demands. Health information should be readily available in different formats and not exclusive to any single medium. Copyright © 2017. Published by Elsevier B.V.

  3. Information retrieval and terminology extraction in online resources for patients with diabetes.

    Science.gov (United States)

    Seljan, Sanja; Baretić, Maja; Kucis, Vlasta

    2014-06-01

    Terminology use, as a means of information retrieval or document indexing, plays an important role in health literacy. Specific types of users, such as patients with diabetes, need access to various online resources (in a foreign and/or native language) when searching for information on self-education in basic diabetic knowledge, on self-care activities regarding the importance of dietetic food, medications, and physical exercise, and on self-management of insulin pumps. Automatic extraction of corpus-based terminology from online texts, manuals, or professional papers can help in building terminology lists or lists of "browsing phrases" useful in information retrieval or in document indexing. Specific terminology lists represent an intermediate step between free text search and controlled vocabulary, and between users' demands and existing online resources in native and foreign languages. The research, which aims to detect the role of terminology in online resources, is conducted on English and Croatian manuals and Croatian online texts, and is divided into three interrelated parts: i) comparison of professional and popular terminology use; ii) evaluation of automatic statistically-based terminology extraction on English and Croatian texts; iii) comparison and evaluation of extracted terminology performed on an English manual using statistical and hybrid approaches. Extracted terminology candidates are evaluated by comparison with three types of reference lists: a list created by a medical professional, a list of highly professional vocabulary contained in MeSH, and a list created by non-medical persons, made as the intersection of 15 lists. Results report on the use of popular and professional terminology in online diabetes resources, on the evaluation of automatically extracted terminology candidates in English and Croatian texts, and on the comparison of statistical and hybrid extraction methods in English text. Evaluation of automatic and semi-automatic terminology extraction methods is performed by recall

  4. OntoTrader: an ontological Web trading agent approach for environmental information retrieval.

    Science.gov (United States)

    Iribarne, Luis; Padilla, Nicolás; Ayala, Rosa; Asensio, José A; Criado, Javier

    2014-01-01

    Modern Web-based Information Systems (WIS) are becoming increasingly necessary to provide support for users who are in different places with different types of information, by facilitating their access to the information, decision making, workgroups, and so forth. Design of these systems requires the use of standardized methods and techniques that enable a common vocabulary to be defined to represent the underlying knowledge. Thus, mediation elements such as traders enrich the interoperability of web components in open distributed systems. These traders must operate with other third-party traders and/or agents in the system, which must also use a common vocabulary for communication between them. This paper presents the OntoTrader architecture, an Ontological Web Trading agent based on the OMG ODP trading standard. It also presents the ontology needed by some system agents to communicate with the trading agent and the behavioral framework for the SOLERES OntoTrader agent, an Environmental Management Information System (EMIS). This framework implements a "Query-Searching/Recovering-Response" information retrieval model using a trading service, SPARQL notation, and the JADE platform. The paper also presents reflection, delegation, and federation mediation models and describes formalization, an experimental testing environment in three scenarios, and a tool which allows our proposal to be evaluated and validated.

  5. OntoTrader: An Ontological Web Trading Agent Approach for Environmental Information Retrieval

    Directory of Open Access Journals (Sweden)

    Luis Iribarne

    2014-01-01

    Full Text Available Modern Web-based Information Systems (WIS) are becoming increasingly necessary to provide support for users who are in different places with different types of information, by facilitating their access to the information, decision making, workgroups, and so forth. Design of these systems requires the use of standardized methods and techniques that enable a common vocabulary to be defined to represent the underlying knowledge. Thus, mediation elements such as traders enrich the interoperability of web components in open distributed systems. These traders must operate with other third-party traders and/or agents in the system, which must also use a common vocabulary for communication between them. This paper presents the OntoTrader architecture, an Ontological Web Trading agent based on the OMG ODP trading standard. It also presents the ontology needed by some system agents to communicate with the trading agent and the behavioral framework for the SOLERES OntoTrader agent, an Environmental Management Information System (EMIS). This framework implements a “Query-Searching/Recovering-Response” information retrieval model using a trading service, SPARQL notation, and the JADE platform. The paper also presents reflection, delegation, and federation mediation models and describes formalization, an experimental testing environment in three scenarios, and a tool which allows our proposal to be evaluated and validated.

  6. Geographic information metadata for spatial data infrastructures: resources, interoperability and information retrieval

    National Research Council Canada - National Science Library

    Nogueras-Iso, Javier; Zarazaga-Soria, F. Javier; Muro-Medrano, Pedro R

    2005-01-01

    ... of this information grows day by day thanks to important technology advances in high-resolution satellite remote sensors, Global Positioning Systems (GPS), databases and geo-processing software notwithstanding an increasing interest by individuals and institutions. Even more, it is possible to georeference complex collections of a broad rang...

  7. Self-Referential Information Alleviates Retrieval Inhibition of Directed Forgetting Effects—An ERP Evidence of Source Memory

    Directory of Open Access Journals (Sweden)

    Xinrui Mao

    2017-10-01

    Full Text Available Directed forgetting (DF) assists in preventing outdated information from interfering with cognitive processing. Previous studies indicated that self-referential items alleviated DF effects due to the elaboration of encoding processes. However, the retrieval mechanism of this phenomenon remains unknown. Based on the dual-process framework of recognition, the retrieval of self-referential information involves familiarity and recollection. Using source memory tasks combined with event-related potential (ERP) recording, our research investigated the retrieval processes of the alleviated DF effects elicited by self-referential information. The FN400 (frontal negativity at 400 ms) is a frontal potential at 300–500 ms related to familiarity, and the late positive complex (LPC) is a later parietal potential at 500–800 ms related to recollection. The FN400 effects of source memory suggested that familiarity processes were promoted by self-referential effects without the modulation of the to-be-forgotten (TBF) instruction. The ERP results of DF effects involved the LPCs of source memory, which indexed retrieval processing of recollection. The other-referential source memory under TBF instruction showed no LPC effects, while the self-referential source memory under TBF instruction still elicited significant LPC effects. Therefore, our neural findings suggested that self-referential processing improved both familiarity and recollection. Furthermore, the self-referential processing advantage caused by autobiographical retrieval alleviated the retrieval inhibition of DF, supporting the view that self-referential source memory alleviated DF effects.

  8. Linear information retrieval method in X-ray grating-based phase contrast imaging and its interchangeability with tomographic reconstruction

    Science.gov (United States)

    Wu, Z.; Gao, K.; Wang, Z. L.; Shao, Q. G.; Hu, R. F.; Wei, C. X.; Zan, G. B.; Wali, F.; Luo, R. H.; Zhu, P. P.; Tian, Y. C.

    2017-06-01

    In X-ray grating-based phase contrast imaging, information retrieval is necessary for quantitative research, especially for phase tomography. However, numerous and repetitive processes have to be performed for tomographic reconstruction. In this paper, we report a novel information retrieval method, which enables retrieving phase and absorption information by means of a linear combination of two mutually conjugate images. Thanks to the distributive law of the multiplication as well as the commutative law and associative law of the addition, the information retrieval can be performed after tomographic reconstruction, thus simplifying the information retrieval procedure dramatically. The theoretical model of this method is established in both parallel beam geometry for Talbot interferometer and fan beam geometry for Talbot-Lau interferometer. Numerical experiments are also performed to confirm the feasibility and validity of the proposed method. In addition, we discuss its possibility in cone beam geometry and its advantages compared with other methods. Moreover, this method can also be employed in other differential phase contrast imaging methods, such as diffraction enhanced imaging, non-interferometric imaging, and edge illumination.
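
    The interchangeability claimed in this abstract can be stated compactly. The following is a sketch in our own notation, not the paper's: $I_1$ and $I_2$ denote the two mutually conjugate images, $a$ and $b$ are the combination coefficients (assumed not to depend on the projection angle), and $\mathcal{R}$ is a tomographic reconstruction operator assumed to be linear, as filtered back-projection is.

```latex
\begin{align}
  S &= a\,I_{1} + b\,I_{2}, \\
  \mathcal{R}[S] &= \mathcal{R}\left[a\,I_{1} + b\,I_{2}\right]
                  = a\,\mathcal{R}[I_{1}] + b\,\mathcal{R}[I_{2}].
\end{align}
```

    Under these assumptions, reconstructing first and combining afterwards gives the same result as combining the projections first, which is why the retrieval step can be moved after reconstruction.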

  9. Mining biomedical images towards valuable information retrieval in biomedical and life sciences

    Science.gov (United States)

    Ahmed, Zeeshan; Zeeshan, Saman; Dandekar, Thomas

    2016-01-01

    Biomedical images are helpful sources for scientists and practitioners in drawing significant hypotheses, exemplifying approaches, and describing experimental results in published biomedical literature. In recent decades, there has been an enormous increase in the amount of heterogeneous biomedical image production and publication, which results in a need for bioimaging platforms for feature extraction and analysis of text and content in biomedical images that can support effective information retrieval systems. In this review, we summarize technologies related to data mining of figures. We describe and compare the potential of different approaches in terms of their developmental aspects, methodologies used, results produced, accuracies achieved, and limitations. Our comparative conclusions include current challenges for bioimaging software with selective image mining, embedded text extraction, and processing of complex natural language queries. PMID:27538578

  10. A Hybrid Information Retrieval System for Medical Field Using MeSH Ontology

    Science.gov (United States)

    Jalali, Vahid; Borujerdi, Mohammad Reza Matash

    Using semantic relations between different terms alongside their syntactic similarities in a search engine would result in systems with better overall precision. One major problem in achieving such systems is finding an appropriate way of calculating semantic similarity scores and combining them with those of classic methods. In this paper, we propose a hybrid approach for information retrieval in the medical field using the MeSH ontology. Our approach consists of proposing a new semantic similarity measure and eliminating records with a semantic score below a specific threshold from the syntactic results. The proposed approach outperforms VSM, graph comparison, neural network, Bayesian network, and latent semantic indexing based approaches in terms of precision vs. recall.
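
    The combination step described in this abstract, removing records whose semantic score falls below a threshold from the syntactic results, can be sketched as below. The scores, record ids, and threshold are hypothetical, and the paper's own semantic similarity measure is not reproduced here; the sketch only shows the filtering logic.

```python
def hybrid_filter(syntactic_results, semantic_scores, threshold):
    """Keep the syntactic ranking but drop semantically weak records.

    syntactic_results -- list of (record_id, syntactic_score), best first
    semantic_scores   -- dict record_id -> semantic similarity to the query
    threshold         -- minimum semantic score a record needs to survive
    Records without a semantic score are treated as scoring 0.0.
    """
    return [
        (record_id, score)
        for record_id, score in syntactic_results
        if semantic_scores.get(record_id, 0.0) >= threshold
    ]


# Hypothetical scores for one query.
SYNTACTIC = [("r1", 0.91), ("r2", 0.80), ("r3", 0.64), ("r4", 0.40)]
SEMANTIC = {"r1": 0.70, "r2": 0.10, "r3": 0.55}

print(hybrid_filter(SYNTACTIC, SEMANTIC, threshold=0.5))
```

    Because the filter only removes records, it can raise precision but never recall; the threshold therefore trades one against the other.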

  11. Information retrieval system: impacts of water-level changes on uses of federal storage reservoirs of the Columbia River.

    Energy Technology Data Exchange (ETDEWEB)

    Fickeisen, D.H.; Cowley, P.J.; Neitzel, D.A.; Simmons, M.A.

    1982-09-01

    A project undertaken to provide the Bonneville Power Administration (BPA) with information needed to conduct environmental assessments and meet requirements of the National Environmental Policy Act (NEPA) and the Pacific Northwest Electric Power Planning and Conservation Act (Regional Act) is described. Access to information on environmental effects would help BPA fulfill its responsibilities to coordinate power generation on the Columbia River system, protect uses of the river system (e.g., irrigation, recreation, navigation), and enhance fish and wildlife production. Staff members at BPA identified the need to compile and index information resources that would help answer environmental impact questions. A computer retrieval system that would provide ready access to the information was envisioned. This project was supported by BPA to provide an initial step toward a compilation of environmental impact information. Scientists at Pacific Northwest Laboratory (PNL) identified, gathered, and evaluated information related to environmental effects of water level on uses of five study reservoirs and developed and implemented an environmental data retrieval system, which provides for automated storage and retrieval of annotated citations to published and unpublished information. The data retrieval system is operating on BPA's computer facility and includes the reservoir water-level environmental data. This project was divided into several tasks, some of which were conducted simultaneously to meet project deadlines. The tasks were to identify uses of the five study reservoirs, compile and evaluate reservoir information, develop a data entry and retrieval system, identify and analyze research needs, and document the data retrieval system and train users. Additional details of the project are described in several appendixes.

  12. Indexing strategic retrieval of colour information with event-related potentials.

    Science.gov (United States)

    Wilding, E L; Fraser, C S; Herron, J E

    2005-09-01

    Event-related potentials (ERPs) were acquired during two experiments in order to determine boundary conditions for when recollection of colour information can be controlled strategically. In initial encoding phases, participants saw an equal number of words presented in red or green. In subsequent retrieval phases, all words were shown in white. Participants were asked to endorse old words that had been shown at encoding in one colour (targets), and to reject new test words as well as old words shown in the alternate colour (non-targets). Study and test lists were longer in Experiment 1, and as a result, the accuracy of memory judgments was superior in Experiment 2. The left-parietal ERP old/new effect--the electrophysiological signature of recollection--was reliable for targets in both experiments, and reliable for non-targets in Experiment 1 only. These findings are consistent with the view that participants were able to restrict recollection to targets in Experiment 2, while recollecting information about targets as well as non-targets in Experiment 1. The fact that this selective strategy was implemented in Experiment 2 despite the close correspondence between the kinds of information associated with targets and non-targets indicates that participants were able to exert considerable control over the conditions under which recollection of task-relevant information occurred.

  13. Conservation and retrieval of information - Elements of a strategy to inform future societies about nuclear waste repositories

    Energy Technology Data Exchange (ETDEWEB)

    Jensen, M. [ed.] [National Inst. of Radiation Protection, Stockholm (Sweden)]

    1996-12-01

    Two main strategies exist for long-term information transfer, one which links information through successive transfers of archived material and other forms of knowledge in society, and one - such as marking the site with a monument - relying upon a direct link from the present to the distant future. Digital methods are not recommended for long-term storage, but digital processing may be a valuable tool to structure information summaries, and in the creation of better long-lasting records. Advances in archive management should also be pursued to widen the choice of information carriers of high durability. In the Nordic countries, during the first few thousand years, and perhaps up to the next period of glaciation, monuments at a repository site may be used to warn the public of the presence of dangerous waste. But messages from such markers may pose interpretation problems as we have today for messages left by earlier societies such as rune inscriptions. Since the national borders may change in the time scale relevant for nuclear waste, the creation of an international archive for all radioactive wastes would represent an improvement as regards conservation and retrieval of information. (EG).

  14. The Informative Documentation and Retrieval of Written Information. New Competences for Cyberspace

    Directory of Open Access Journals (Sweden)

    Pilar Beltrán Orenes

    2015-07-01

    Cyberspace has opened to the journalist the possibility of accessing a lot of documentation sources without the direct mediation of an information science professional. However, this autonomy is more a dream than a reality given the vast ocean that the Internet represents and, especially, those deep areas (the deep Internet) which are very difficult to reach without the right skills. This article will attempt to show the reality of documentation sources on the Internet and the need for literacy training for future journalists, as well as for the presence of documentalists, now more than ever, in the media.

  15. Editorial for the Proceedings of the Workshop Knowledge Maps and Information Retrieval (KMIR2014) at Digital Libraries 2014

    NARCIS (Netherlands)

    Mutschke, Peter; Mayr, Philipp; Scharnhorst, Andrea

    2014-01-01

    Knowledge maps are promising tools for visualizing the structure of large-scale information spaces, but still far away from being applicable for searching. The first international workshop on "Knowledge Maps and Information Retrieval (KMIR)", held as part of the International Conference on

  16. An Examination of Natural Language as a Query Formation Tool for Retrieving Information on E-Health from Pub Med.

    Science.gov (United States)

    Peterson, Gabriel M.; Su, Kuichun; Ries, James E.; Sievert, Mary Ellen C.

    2002-01-01

    Discussion of Internet use for information searches on health-related topics focuses on a study that examined complexity and variability of natural language in using search terms that express the concept of electronic health (e-health). Highlights include precision of retrieved information; shift in terminology; and queries using the Pub Med…

  17. Incorporating indel information into phylogeny estimation for rapidly emerging pathogens

    Directory of Open Access Journals (Sweden)

    Suchard Marc A

    2007-03-01

    Background: Phylogenies of rapidly evolving pathogens can be difficult to resolve because of the small number of substitutions that accumulate in the short times since divergence. To improve resolution of such phylogenies we propose using insertion and deletion (indel) information in addition to substitution information. We accomplish this through joint estimation of alignment and phylogeny in a Bayesian framework, drawing inference using Markov chain Monte Carlo. Joint estimation of alignment and phylogeny sidesteps biases that stem from conditioning on a single alignment by taking into account the ensemble of near-optimal alignments. Results: We introduce a novel Markov chain transition kernel that improves computational efficiency by proposing non-local topology rearrangements and by block sampling alignment and topology parameters. In addition, we extend our previous indel model to increase biological realism by placing indels preferentially on longer branches. We demonstrate the ability of indel information to increase phylogenetic resolution in examples drawn from within-host viral sequence samples. We also demonstrate the importance of taking alignment uncertainty into account when using such information. Finally, we show that codon-based substitution models can significantly affect alignment quality and phylogenetic inference by unrealistically forcing indels to begin and end between codons. Conclusion: These results indicate that indel information can improve phylogenetic resolution of recently diverged pathogens and that alignment uncertainty should be considered in such analyses.

  18. Does the mind map learning strategy facilitate information retrieval and critical thinking in medical students?

    Science.gov (United States)

    D'Antoni, Anthony V; Zipp, Genevieve Pinto; Olson, Valerie G; Cahill, Terrence F

    2010-09-16

    retrieve information in the short term, and does not put them at a disadvantage compared to SNT students. Future studies should explore longitudinal effects of mind-map proficiency training on both short- and long-term information retrieval and critical thinking.

  19. Does the mind map learning strategy facilitate information retrieval and critical thinking in medical students?

    Directory of Open Access Journals (Sweden)

    Olson Valerie G

    2010-09-01

    demonstrates that medical students using mind maps can successfully retrieve information in the short term, and does not put them at a disadvantage compared to SNT students. Future studies should explore longitudinal effects of mind-map proficiency training on both short- and long-term information retrieval and critical thinking.

  20. Storage and retrieval of chemical mutagenesis information. [Operation of the Environmental Mutagen Information Center]

    Energy Technology Data Exchange (ETDEWEB)

    Wassom, J.S.

    1979-01-01

    The imminent risks to human health which may result from exposure to environmental agents add urgency to the task of providing access to the genetic toxicology literature. Through the efforts at EMIC during the last ten years, workers in the field of genetic toxicology have an excellent means of acquiring needed information. In this regard, researchers in this field have an edge over their colleagues in other branches of toxicology. This paper has addressed the current state of the genetic toxicology literature and has reviewed several of the techniques employed by EMIC to control the literature in this area. Procedures for the development of a program to make this file more readily available in all countries where there is research in genetic toxicology are proposed.

  1. Generalization of prior information for rapid Bayesian time estimation.

    Science.gov (United States)

    Roach, Neil W; McGraw, Paul V; Whitaker, David J; Heron, James

    2017-01-10

    To enable effective interaction with the environment, the brain combines noisy sensory information with expectations based on prior experience. There is ample evidence showing that humans can learn statistical regularities in sensory input and exploit this knowledge to improve perceptual decisions and actions. However, fundamental questions remain regarding how priors are learned and how they generalize to different sensory and behavioral contexts. In principle, maintaining a large set of highly specific priors may be inefficient and restrict the speed at which expectations can be formed and updated in response to changes in the environment. However, priors formed by generalizing across varying contexts may not be accurate. Here, we exploit rapidly induced contextual biases in duration reproduction to reveal how these competing demands are resolved during the early stages of prior acquisition. We show that observers initially form a single prior by generalizing across duration distributions coupled with distinct sensory signals. In contrast, they form multiple priors if distributions are coupled with distinct motor outputs. Together, our findings suggest that rapid prior acquisition is facilitated by generalization across experiences of different sensory inputs but organized according to how that sensory information is acted on.

  2. Searching for evidence or approval? A commentary on database search in systematic reviews and alternative information retrieval methodologies.

    Science.gov (United States)

    Delaney, Aogán; Tamás, Peter A

    2017-11-04

    Despite recognition that database search alone is inadequate even within the health sciences, it appears that reviewers in fields that have adopted systematic review are choosing to rely primarily, or only, on database search for information retrieval. This commentary reminds readers of factors that call into question the appropriateness of default reliance on database searches particularly as systematic review is adapted for use in new and lower consensus fields. It then discusses alternative methods for information retrieval that require development, formalisation, and evaluation. Our goals are to encourage reviewers to reflect critically and transparently on their choice of information retrieval methods and to encourage investment in research on alternatives.

  3. Adaptation of machine translation for multilingual information retrieval in the medical domain.

    Science.gov (United States)

    Pecina, Pavel; Dušek, Ondřej; Goeuriot, Lorraine; Hajič, Jan; Hlaváčová, Jaroslava; Jones, Gareth J F; Kelly, Liadh; Leveling, Johannes; Mareček, David; Novák, Michal; Popel, Martin; Rosa, Rudolf; Tamchyna, Aleš; Urešová, Zdeňka

    2014-07-01

    We investigate machine translation (MT) of user search queries in the context of cross-lingual information retrieval (IR) in the medical domain. The main focus is on techniques to adapt MT to increase translation quality; however, we also explore MT adaptation to improve effectiveness of cross-lingual IR. Our MT system is Moses, a state-of-the-art phrase-based statistical machine translation system. The IR system is based on the BM25 retrieval model implemented in the Lucene search engine. The MT techniques employed in this work include in-domain training and tuning, intelligent training data selection, optimization of phrase table configuration, compound splitting, and exploiting synonyms as translation variants. The IR methods include morphological normalization and using multiple translation variants for query expansion. The experiments are performed and thoroughly evaluated on three language pairs: Czech-English, German-English, and French-English. MT quality is evaluated on data sets created within the Khresmoi project and IR effectiveness is tested on the CLEF eHealth 2013 data sets. The search query translation results achieved in our experiments are outstanding: our systems outperform not only our strong baselines, but also Google Translate and Microsoft Bing Translator in direct comparison carried out on all the language pairs. The baseline BLEU scores increased from 26.59 to 41.45 for Czech-English, from 23.03 to 40.82 for German-English, and from 32.67 to 40.82 for French-English. This is a 55% improvement on average. In terms of the IR performance on this particular test collection, a significant improvement over the baseline is achieved only for French-English. For Czech-English and German-English, the increased MT quality does not lead to better IR results. Most of the MT techniques employed in our experiments improve MT of medical search queries. The intelligent training data selection in particular proves to be very successful for domain adaptation of
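
    As an illustration of the BM25 retrieval model named in this record, the following is a minimal sketch, not the authors' Lucene configuration. The function name `bm25_scores`, the toy documents, and the parameter values `k1=1.2` and `b=0.75` are assumptions chosen for the example.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document in `docs` against the tokenized `query`.

    k1 and b are illustrative values, not settings taken from the record.
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents that contain each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Smoothed inverse document frequency (always positive).
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

# Hypothetical toy collection; a translated query would be tokenized the same way.
docs = [
    "chest pain treatment options".split(),
    "treatment of chronic back pain".split(),
    "weather forecast for tomorrow".split(),
]
scores = bm25_scores("chest pain".split(), docs)
```

    Query expansion with several translation variants, as the abstract describes, could be approximated in this sketch by concatenating the tokens of each variant into one query list.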

  4. A Part-Of-Speech term weighting scheme for biomedical information retrieval.

    Science.gov (United States)

    Wang, Yanshan; Wu, Stephen; Li, Dingcheng; Mehrabi, Saeed; Liu, Hongfang

    2016-10-01

    In the era of digitalization, information retrieval (IR), which retrieves and ranks documents from large collections according to users' search queries, has been popularly applied in the biomedical domain. Building patient cohorts using electronic health records (EHRs) and searching literature for topics of interest are some IR use cases. Meanwhile, natural language processing (NLP), such as tokenization or Part-Of-Speech (POS) tagging, has been developed for processing clinical documents or biomedical literature. We hypothesize that NLP can be incorporated into IR to strengthen the conventional IR models. In this study, we propose two NLP-empowered IR models, POS-BoW and POS-MRF, which incorporate automatic POS-based term weighting schemes into bag-of-word (BoW) and Markov Random Field (MRF) IR models, respectively. In the proposed models, the POS-based term weights are iteratively calculated by utilizing a cyclic coordinate method where golden section line search algorithm is applied along each coordinate to optimize the objective function defined by mean average precision (MAP). In the empirical experiments, we used the data sets from the Medical Records track in Text REtrieval Conference (TREC) 2011 and 2012 and the Genomics track in TREC 2004. The evaluation on TREC 2011 and 2012 Medical Records tracks shows that, for the POS-BoW models, the mean improvement rates for IR evaluation metrics, MAP, bpref, and P@10, are 10.88%, 4.54%, and 3.82%, compared to the BoW models; and for the POS-MRF models, these rates are 13.59%, 8.20%, and 8.78%, compared to the MRF models. Additionally, we experimentally verify that the proposed weighting approach is superior to the simple heuristic and frequency based weighting approaches, and validate our POS category selection. Using the optimal weights calculated in this experiment, we tested the proposed models on the TREC 2004 Genomics track and obtained average of 8.63% and 10.04% improvement rates for POS-BoW and POS
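
    The optimization loop described in this record (a cyclic coordinate method with a golden section line search along each coordinate) can be sketched as follows. This is a hedged illustration only: `toy_objective` is a made-up stand-in for mean average precision, and the function names, search interval [0, 1], and number of sweeps are assumptions, not details from the paper.

```python
import math

INV_PHI = (math.sqrt(5) - 1) / 2  # reciprocal of the golden ratio

def golden_section_max(f, lo, hi, tol=1e-6):
    """Locate the maximizer of a unimodal function f on [lo, hi]."""
    c = hi - INV_PHI * (hi - lo)
    d = lo + INV_PHI * (hi - lo)
    while hi - lo > tol:
        if f(c) > f(d):
            hi = d  # the maximum lies in [lo, d]
        else:
            lo = c  # the maximum lies in [c, hi]
        c = hi - INV_PHI * (hi - lo)
        d = lo + INV_PHI * (hi - lo)
    return (lo + hi) / 2

def cyclic_coordinate_max(objective, weights, lo=0.0, hi=1.0, sweeps=20):
    """Optimize one weight at a time, cycling through all coordinates."""
    weights = list(weights)
    for _ in range(sweeps):
        for i in range(len(weights)):
            def along_axis(x, i=i):
                trial = weights[:i] + [x] + weights[i + 1:]
                return objective(trial)
            weights[i] = golden_section_max(along_axis, lo, hi)
    return weights

def toy_objective(w):
    """Hypothetical concave surrogate for MAP, peaking at (0.8, 0.3)."""
    dx, dy = w[0] - 0.8, w[1] - 0.3
    return -(dx * dx + dy * dy + 0.5 * dx * dy)

best = cyclic_coordinate_max(toy_objective, [0.5, 0.5])
```

    In a real setting the objective would run retrieval with the candidate term weights and return MAP on training queries; such an objective is generally not unimodal, so the line search would find a local rather than a global optimum.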

  5. A Semantic Enhanced Model for Effective Spatial Information Retrieval : Un modèle sémantique améliorée for Effective Information Retrieval spatiale

    OpenAIRE

    Akanbi, Adeyinka; Agunbiade, Olusanya; Dehinbo, Olumuyiwa; Kuti, Sadiq

    2014-01-01

    A lot of information on the web is geographically referenced. Discovering and retrieving this geographic information to satisfy various users' needs across both open and distributed Spatial Data Infrastructures (SDI) poses eminent research challenges. This is mostly caused by semantic heterogeneity in users' queries and a lack of semantic referencing of the Geographic Information (GI) metadata. To address these challenges, this paper discusses an ontology-based ...

  6. Assimilation of SMOS Retrieved Soil Moisture into the Land Information System

    Science.gov (United States)

    Blankenship, Clay B.; Case, Jonathan L.; Zavodsky, Bradley T.

    2014-01-01

    Soil moisture is a crucial variable for weather prediction because of its influence on evaporation and surface heat fluxes. It is also of critical importance for drought and flood monitoring and prediction and for public health applications such as monitoring vector-borne diseases. Land surface modeling benefits greatly from regular updates with soil moisture observations via data assimilation. Satellite remote sensing is the only practical observation type for this purpose in most areas due to its worldwide coverage. The newest operational satellite sensor for soil moisture is the Microwave Imaging Radiometer using Aperture Synthesis (MIRAS) instrument aboard the Soil Moisture and Ocean Salinity (SMOS) satellite. The NASA Short-term Prediction Research and Transition Center (SPoRT) has implemented the assimilation of SMOS soil moisture observations into the NASA Land Information System (LIS), an integrated modeling and data assimilation software platform. We present results from assimilating SMOS observations into the Noah 3.2 land surface model within LIS. The SMOS MIRAS is an L-band radiometer launched by the European Space Agency in 2009, from which we assimilate Level 2 retrievals [1] into LIS-Noah. The measurements are sensitive to soil moisture concentration in roughly the top 2.5 cm of soil. The retrievals have a target volumetric accuracy of 4% at a resolution of 35-50 km. Sensitivity is reduced where precipitation, snowcover, frozen soil, or dense vegetation is present. Due to the satellite's polar orbit, the instrument achieves global coverage twice daily at most mid- and low-latitude locations, with only small gaps between swaths.

  7. Automatic natural acquisition of a semantic network for information retrieval systems

    Science.gov (United States)

    Enguehard, Chantal; Malvache, Pierre; Trigano, Philippe

    1992-03-01

    The amount of information is becoming greater and greater. In industries where complex processes are performed, it is becoming increasingly difficult to profit from all the documents produced when fresh knowledge becomes available (reports, experiments, findings). This situation causes a considerable and expensive waste of precious time lost searching for documents or, quite simply, results in outright repetition of what has been done. One solution is to transform all paper information into computerized information. We might imagine that we are in a science-fiction world and that we have the perfect computer. We tell it everything we know, we make it read all the books, and if we ask it any question, it will find the response if that response exists. But unfortunately, we are in the real world and the last four decades have taught us to minimize our expectations of computers. During the 1960s, the information retrieval systems appeared. Their purpose is to provide access to any desired documents, in response to a question about a subject, even if it is not known to exist. Here we focus on the problem of selecting items to index the documents. In 1966, Salton identified this problem as crucial when he saw that his system, Medlars, did not find a relevant text because of the wrong indexation. Faced with this problem, he imagined a guide to help authors choose the correct indexation, but he anticipated the automation of this operation with the SMART system. It was stated previously that a manual language analysis for information items by subject experts is likely to prove impractical in the long run. After a brief survey of the existing responses to the index choice problem, we shall present the system automatic natural acquisition (ANA), which chooses items to index texts by using as little knowledge as possible, just by learning the language. This system does not use any grammar or lexicon, so the selected indexes will be very close to the field concerned in the texts.

  8. A systematic review of interventions promoting clinical information retrieval technology (CIRT) adoption by healthcare professionals.

    Science.gov (United States)

    Gagnon, M-P; Pluye, P; Desmartis, M; Car, J; Pagliari, C; Labrecque, M; Frémont, P; Gagnon, J; Njoya, M; Légaré, F

    2010-10-01

    This paper presents the evidence on the effectiveness of interventions promoting the use of clinical information retrieval technologies (CIRTs) by healthcare professionals. We electronically searched articles published between January 1990 and March 2008 using the following inclusion criteria: (1) participants were healthcare professionals; (2) a specific intervention promoted CIRT adoption; (3) studies were randomised controlled trials, controlled clinical trials, controlled before and after studies or interrupted time series analyses; and (4) they reported objectively measured outcomes on CIRT use. We found nine studies focusing on CIRT use. Main outcomes measured were searching skills and/or frequency of use of electronic databases by healthcare professionals. Three studies reported a positive effect of the intervention on CIRT use, one showed a positive impact post-intervention, and four studies failed to demonstrate a significant intervention effect. The ninth study examined financial disincentives, and found a significant negative effect of introducing user fees for searching MEDLINE in clinical settings. A meta-analysis showed that educational meetings were the only type of intervention reporting consistent positive effects on CIRT adoption. CIRT is an information and communication technology commonly used in healthcare settings. Interventions promoting CIRT adoption by healthcare professionals have shown some success in improving searching skills and use of electronic databases. However, the effectiveness of these interventions remains uncertain and more rigorous studies are needed.

  9. Domain-Specific Thesaurus as a Tool for Information Retrieval and Collection of Knowledge

    Directory of Open Access Journals (Sweden)

    Vladimir N. Boikov

    2013-01-01

    This paper reports basic approaches to the constructive creation of an open resource named "Domain-specified thesaurus of poetics", which is one of the levels of an information-analytical system of the Russian poetry (IAS RP). Poetics is a group of disciplines focused on a comprehensive theoretical and historical study of poetry. IAS RP will be used as a tool for a wide range of studies that determine the characteristic features of the analyzed works of poetry. Consequently, the thesaurus is the knowledge base from which one can borrow input data for training the system. The aim of our research requires a specific approach to formatting the knowledge base. The thesaurus is a web-based resource which includes a domain-specific directory, information retrieval tools and tools for further analyses. The study of a glossary consisting of three thousand terms and a set of semantic fields is reviewed in this paper. An RDF graph of the domain-specified thesaurus of poetics is presented, containing 9 types of objects and different kinds of relationships among them. Wiki technologies are used for implementing a resource which allows data to be stored in Semantic Web formats.

  10. Rapid sampling of molecular motions with prior information constraints.

    Directory of Open Access Journals (Sweden)

    Barak Raveh

    2009-02-01

    Proteins are active, flexible machines that perform a range of different functions. Innovative experimental approaches may now provide limited partial information about conformational changes along motion pathways of proteins. There is therefore a need for computational approaches that can efficiently incorporate prior information into motion prediction schemes. In this paper, we present PathRover, a general setup designed for the integration of prior information into the motion planning algorithm of rapidly exploring random trees (RRT). Each suggested motion pathway comprises a sequence of low-energy clash-free conformations that satisfy an arbitrary number of prior information constraints. These constraints can be derived from experimental data or from expert intuition about the motion. The incorporation of prior information is very straightforward and significantly narrows down the vast search in the typically high-dimensional conformational space, leading to dramatic reduction in running time. To allow the use of state-of-the-art energy functions and conformational sampling, we have integrated this framework into Rosetta, an accurate protocol for diverse types of structural modeling. The suggested framework can serve as an effective complementary tool for molecular dynamics, Normal Mode Analysis, and other prevalent techniques for predicting motion in proteins. We applied our framework to three different model systems. We show that a limited set of experimentally motivated constraints may effectively bias the simulations toward diverse predicates in an outright fashion, from distance constraints to enforcement of loop closure. In particular, our analysis sheds light on mechanisms of protein domain swapping and on the role of different residues in the motion.

  11. Information content analysis: the potential for methane isotopologue retrieval from GOSAT-2

    Directory of Open Access Journals (Sweden)

    E. Malina

    2018-02-01

    Atmospheric methane comprises multiple isotopologues, the most abundant being 12CH4 and 13CH4, which make up 98 and 1.1 % of atmospheric methane respectively. It has been shown that it is possible to distinguish between sources of methane (biogenic methane, e.g. marshland, or abiogenic methane, e.g. fracking) via a ratio of these main methane isotopologues, otherwise known as the δ13C value. δ13C values typically range between −10 and −80 ‰, with abiogenic sources closer to zero and biogenic sources showing more negative values. Initially, we suggest that a δ13C difference of 10 ‰ is sufficient to differentiate between methane source types; based on this, we derive that a precision of 0.2 ppbv on 13CH4 retrievals may achieve the target δ13C variance. Using an application of the well-established information content analysis (ICA) technique for assumed clear-sky conditions, this paper shows that using a combination of the shortwave infrared (SWIR) bands on the planned Greenhouse gases Observing SATellite (GOSAT-2) mission, 13CH4 can be measured with sufficient information content to a precision of between 0.7 and 1.2 ppbv from a single sounding (assuming a total column average value of 19.14 ppbv), which can then be reduced to the target precision through spatial and temporal averaging techniques. We therefore suggest that GOSAT-2 can be used to differentiate between methane source types. We find that large unconstrained covariance matrices are required in order to achieve sufficient information content, while the solar zenith angle has limited impact on the information content.
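
    The arithmetic behind the averaging argument in this record can be sketched as below. The sketch rests on two assumptions that are not stated in the abstract: that a relative error on the 13CH4 column translates roughly one-to-one into a δ13C error in per mil, and that single-sounding errors are independent so that averaging N soundings reduces the error by a factor of sqrt(N). The function names are hypothetical.

```python
import math

def relative_precision_permil(sigma_ppbv, column_ppbv):
    """Express an absolute 13CH4 precision as a per mil fraction of the column."""
    return 1000.0 * sigma_ppbv / column_ppbv

def soundings_needed(single_sigma, target_sigma):
    """Soundings to average, assuming independent errors (1/sqrt(N) scaling)."""
    ratio = single_sigma / target_sigma
    # The small offset guards against floating-point noise just above an integer.
    return math.ceil(ratio ** 2 - 1e-9)

column_13ch4 = 19.14     # ppbv, total column average quoted in the abstract
target_sigma = 0.2       # ppbv, target precision quoted in the abstract

target_permil = relative_precision_permil(target_sigma, column_13ch4)
n_best = soundings_needed(0.7, target_sigma)    # best single-sounding case
n_worst = soundings_needed(1.2, target_sigma)   # worst single-sounding case
```

    Under these assumptions the 0.2 ppbv target corresponds to roughly 10 ‰ of the column, and on the order of 13 to 36 soundings would need to be averaged; correlated errors would raise that number.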

  12. Tailoring online information retrieval to user's needs based on a logical semantic approach to natural language processing and UMLS mapping.

    Science.gov (United States)

    Kossman, Susan; Jones, Josette; Brennan, Patricia Flatley

    2007-10-11

    Depression can derail teenagers' lives and cause serious chronic health problems. Acquiring pertinent knowledge and skills supports care management, but retrieving appropriate information can be difficult. This poster presents a strategy to tailor online information to user attributes using a logical semantic approach to natural language processing (NLP) and mapping propositions to UMLS terms. This approach capitalizes on existing NLM resources and presents a potentially sustainable plan for meeting consumers' and providers' information needs.

  13. Retrieval Practice Is an Efficient Method of Enhancing the Retention of Anatomy and Physiology Information

    Science.gov (United States)

    Dobson, John L.

    2013-01-01

    Although a great deal of empirical evidence has indicated that retrieval practice is an effective means of promoting learning and memory, very few studies have investigated the strategy in the context of an actual class. The primary purpose of this study was to determine if a series of very brief retrieval quizzes could significantly improve the…

  14. Implementation of the common phrase index method on the phrase query for information retrieval

    Science.gov (United States)

    Fatmawati, Triyah; Zaman, Badrus; Werdiningsih, Indah

    2017-08-01

    As technology has developed, finding information in news text has become easy, because news text is distributed not only in print media, such as newspapers, but also in electronic media that can be accessed using a search engine. When searching for relevant documents with a search engine, a phrase is often used as the query. The number of words that make up the phrase query and their positions affect the relevance of the documents returned and, as a result, the accuracy of the information obtained. The purpose of this research was therefore to analyze the implementation of the common phrase index method in information retrieval. The research was conducted on English news text and implemented in a prototype to determine the relevance level of the documents produced. The system is built in the stages of pre-processing, indexing, term weighting calculation, and cosine similarity calculation, and displays the document search results ranked by cosine similarity. System testing was conducted using 100 documents and 20 queries, and the results were used for the evaluation stage: first, the relevant documents were determined using a kappa statistic calculation; second, the system success rate was determined using precision, recall, and F-measure calculations. The kappa statistic was 0.71, so the relevance judgments are eligible for system evaluation. The evaluation yields a precision of 0.37, a recall of 0.50, and an F-measure of 0.43. From this result it can be said that the success rate of the system in producing relevant documents is low.
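
    The ranking and evaluation measures named in this record can be sketched as follows. This is an illustrative sketch with hypothetical function names and raw term counts as weights; the record does not specify the term weighting scheme, so that choice is an assumption.

```python
import math
from collections import Counter

def cosine_similarity(tokens_a, tokens_b):
    """Cosine of the angle between two bag-of-words count vectors."""
    va, vb = Counter(tokens_a), Counter(tokens_b)
    dot = sum(va[t] * vb[t] for t in va)
    norm_a = math.sqrt(sum(v * v for v in va.values()))
    norm_b = math.sqrt(sum(v * v for v in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def precision_recall(retrieved, relevant):
    """Precision and recall of a retrieved set against relevance judgments."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

def f_measure(precision, recall):
    """Harmonic mean of precision and recall (balanced F-measure)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Values reported in the abstract: precision 0.37, recall 0.50.
reported_f = f_measure(0.37, 0.50)
```

    With the reported precision and recall, the balanced F-measure rounds to 0.43, which agrees with the figure given in the abstract.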

  15. Searching for evidence or approval? A commentary on database search in systematic reviews and alternative information retrieval methodologies

    NARCIS (Netherlands)

    Delaney, Aogán; Tamás, Peter A.

    2017-01-01

    Despite recognition that database search alone is inadequate even within the health sciences, it appears that reviewers in fields that have adopted systematic review are choosing to rely primarily, or only, on database search for information retrieval. This commentary reminds readers of factors that

  16. Synthesizer: Expediting synthesis studies from context-free data with information retrieval techniques.

    Directory of Open Access Journals (Sweden)

    Lisa M Gandy

    Scientists have unprecedented access to a wide variety of high-quality datasets. These datasets, which are often independently curated, commonly use unstructured spreadsheets to store their data. Standardized annotations are essential to perform synthesis studies across investigators, but are often not used in practice. Therefore, accurately combining records in spreadsheets from differing studies requires tedious and error-prone human curation. These efforts result in a significant time and cost barrier to synthesis research. We propose an information retrieval inspired algorithm, Synthesize, that merges unstructured data automatically based on both column labels and values. Application of the Synthesize algorithm to cancer and ecological datasets had high accuracy (on the order of 85-100%). We further implement Synthesize in an open source web application, Synthesizer (https://github.com/lisagandy/synthesizer). The software accepts input as spreadsheets in comma separated value (CSV) format, visualizes the merged data, and outputs the results as a new spreadsheet. Synthesizer includes an easy to use graphical user interface, which enables the user to finish combining data and obtain perfect accuracy. Future work will allow detection of units to automatically merge continuous data and application of the algorithm to other data formats, including databases.

  17. VaProS: a database-integration approach for protein/genome information retrieval

    KAUST Repository

    Gojobori, Takashi

    2016-12-24

    Life science research now relies heavily on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein–protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that helps life science researchers retrieve experts’ knowledge stored in the databases and build new hypotheses about the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as the structural effect of a gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of the search, exemplified by an investigation of the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/.

  18. Band target entropy minimization for retrieving the information of individual components from overlapping chromatographic data.

    Science.gov (United States)

    Xia, Zhenzhen; Liu, Yan; Cai, Wensheng; Shao, Xueguang

    2015-09-11

    Band target entropy minimization (BTEM) is a self-modeling curve resolution (SMCR) approach relying on a non-negativity criterion and minimization of Shannon entropy. In this study, the BTEM algorithm was applied to retrieve the information of individual components from overlapping gas chromatography-mass spectrometry (GC-MS) data. The algorithm starts by dividing the whole dataset into bands along the retention time. In each band, singular value decomposition (SVD) is used to decompose the data into scores and loadings. Because the pure chromatographic signal possesses the lowest Shannon entropy, the chromatographic signal of each component can be constructed by optimizing the combination of the loadings for minimal Shannon entropy under the non-negativity criterion. To show the efficiency of the algorithm, a simulated four-component overlapping GC-MS dataset and an experimental GC-MS dataset of a mixture of 18 organophosphorus pesticides are investigated. The results show that both the chromatographic profiles and mass spectra of the components can be successfully extracted from the overlapping signals. Copyright © 2015 Elsevier B.V. All rights reserved.
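
    The entropy criterion mentioned in this abstract can be illustrated with a small sketch. It is not the BTEM implementation: it only assumes that a profile is normalised to unit sum before its Shannon entropy is taken, and it compares a hypothetical sharp single-component peak with a hypothetical overlapped signal. The function name and both profiles are illustrative assumptions.

```python
import math


def shannon_entropy(profile):
    """Shannon entropy of a non-negative profile normalised to sum to 1."""
    if any(v < 0 for v in profile):
        raise ValueError("profile violates the non-negativity criterion")
    total = sum(profile)
    if total == 0:
        raise ValueError("profile is all zeros")
    probs = [v / total for v in profile if v > 0]
    return -sum(p * math.log(p) for p in probs)


# Hypothetical elution profiles sampled at six retention times.
pure_peak = [0.0, 0.1, 1.0, 0.1, 0.0, 0.0]   # one narrow component
overlapped = [0.2, 0.6, 1.0, 0.9, 0.7, 0.3]  # several co-eluting components

h_pure = shannon_entropy(pure_peak)
h_mixed = shannon_entropy(overlapped)
print(h_pure < h_mixed)
```

    The narrow peak concentrates its intensity in few points and so has the lower entropy, which is the property the abstract relies on when it selects the combination of loadings with minimal entropy.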

  19. Application of Musical Information Retrieval (MIR) Techniques to Seismic Facies Classification. Examples in Hydrocarbon Exploration

    Directory of Open Access Journals (Sweden)

    Paolo Dell’Aversana

    2016-12-01

    In this paper, we introduce a novel approach for automatic pattern recognition and classification of geophysical data based on digital music technology. We import and apply in the geophysical domain the same approaches commonly used for Musical Information Retrieval (MIR). After accurate conversion from geophysical formats (example: SEG-Y) to musical formats (example: Musical Instrument Digital Interface, or briefly MIDI), we extract musical features from the converted data. These can be single-valued attributes, such as pitch and sound intensity, or multi-valued attributes, such as pitch histograms, melodic, harmonic and rhythmic paths. Using a real data set, we show that these musical features can be diagnostic for seismic facies classification in a complex exploration area. They can be complementary with respect to “conventional” seismic attributes. Using a supervised machine learning approach based on the k-Nearest Neighbors algorithm and on Automatic Neural Networks, we classify three gas-bearing channels. The good performance of our classification approach is confirmed by borehole data available in the same area.

  20. A general UNIX interface for biocomputing and network information retrieval software.

    Science.gov (United States)

    Kiong, B K; Tan, T W

    1993-10-01

    We describe a UNIX program, HYBROW, which can integrate without modification a wide range of UNIX biocomputing and network information retrieval software. HYBROW works in conjunction with a separate set of ASCII files containing embedded hypertext-like links. The program operates like a hypertext browser featuring five basic links: file link, execute-only link, execute-display link, directory-browse link and field-filling link. Useful features of the interface may be developed using combinations of these links with simple shell scripts and examples of these are briefly described. The system manager who supports biocomputing users should find the program easy to maintain, and useful in assisting new and infrequent users; it is also simple to incorporate new programs. Moreover, the individual user can customize the interface, create dynamic menus, hypertext a document, invoke shell scripts and new programs simply with a basic understanding of the UNIX operating system and any text editor. This program was written in C language and uses the UNIX curses and termcap libraries. It is freely available as a tar compressed file (by anonymous FTP from nuscc.nus.sg).

  1. Matrix computations for information retrieval and major and outlier cluster detection

    Science.gov (United States)

    Kobayashi, M.; Aono, M.; Takeuchi, H.; Samukawa, H.

    2002-12-01

    In this paper we introduce COV, a novel information retrieval (IR) algorithm for massive databases based on vector space modeling and spectral analysis of the covariance matrix of the document vectors, to reduce the scale of the problem. Since the dimension of the covariance matrix depends on the attribute space and is independent of the number of documents, COV can be applied to databases that are too massive for methods based on the singular value decomposition of the document-attribute matrix, such as latent semantic indexing (LSI). In addition to improved scalability, theoretical considerations indicate that results from our algorithm tend to be more accurate than those from LSI, particularly in detecting subtle differences in document vectors. We demonstrate the power and accuracy of COV through an important topic in data mining, known as outlier cluster detection. We propose two new algorithms for detecting major and outlier clusters in databases--the first is based on LSI, and the second on COV. Our implementation studies indicate that our cluster detection algorithms outperform the basic LSI and COV algorithms in detecting outlier clusters.
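
    The scalability argument in this abstract (the covariance matrix is sized by the attribute space, not by the number of documents) can be checked with a short sketch. This is not the COV algorithm itself: it only builds a population covariance matrix from document vectors, and the normalisation by the number of documents, the function name, and the toy vectors are assumptions.

```python
def covariance_matrix(doc_vectors):
    """Population covariance; rows are documents, columns are attributes."""
    n_docs = len(doc_vectors)
    n_attrs = len(doc_vectors[0])
    means = [sum(doc[j] for doc in doc_vectors) / n_docs for j in range(n_attrs)]
    cov = [[0.0] * n_attrs for _ in range(n_attrs)]
    for doc in doc_vectors:
        centred = [doc[j] - means[j] for j in range(n_attrs)]
        for i in range(n_attrs):
            for j in range(n_attrs):
                cov[i][j] += centred[i] * centred[j] / n_docs
    return cov


# Hypothetical document vectors over three attributes (terms).
small = [[1, 0, 2], [0, 1, 1]]   # 2 documents
large = small * 500              # 1000 documents, same attribute space

cov_small = covariance_matrix(small)
cov_large = covariance_matrix(large)
print(len(cov_small), len(cov_large))
```

    Both matrices are 3 x 3, so the subsequent spectral analysis works on a problem whose size does not grow with the collection.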

  2. Semantic information retrieval for geoscience resources : results and analysis of an online questionnaire of current web search experiences

    OpenAIRE

    Nkisi-Orji, I.

    2016-01-01

    An online questionnaire “Semantic web searches for geoscience resources” was completed by 35 staff of British Geological Survey (BGS) between 28th July 2015 and 28th August 2015. The questionnaire was designed to better understand current web search habits, preferences, and the reception of semantic search features in order to inform PhD research into the use of domain ontologies for semantic information retrieval. The key findings were that relevance ranking is important in fo...

  3. MRML: An Extensible Communication Protocol for Interoperability and Benchmarking of Multimedia Information Retrieval Systems

    OpenAIRE

    Muller, Wolfgang; Pecenovic, Zoran; Muller, Henning; Marchand-Maillet, Stéphane; Pun, Thierry; Squire, David; De Vries, Arjen; Giess, Christof

    2000-01-01

    While in the area of relational databases interoperability is ensured by common communication protocols (e.g. ODBC/JDBC using SQL), Content Based Image Retrieval Systems (CBIRS) and other multimedia retrieval systems are lacking both a common query language and a common communication protocol. Besides its obvious short term convenience, interoperability of systems is crucial for the exchange and analysis of user data. In this paper, we present and describe an extensible XML-based query markup...

  4. [Retrieval of Copper Pollution Information from Hyperspectral Satellite Data in a Vegetation Cover Mining Area].

    Science.gov (United States)

    Qu, Yong-hua; Jiao, Si-hong; Liu, Su-hong; Zhu, Ye-qing

    2015-11-01

    Heavy metal mining activities have had a complex influence on the ecological environment of mining regions. For example, a large amount of acidic waste water containing heavy metal ions is produced in the process of copper mining, which can seriously pollute the ecological environment of the region. In previous research, bare soil was mainly taken as the research target when monitoring environmental pollution, and thus the effects of land surface vegetation were ignored. It is well known that vegetation condition is one of the most important indicators of ecological change in a region, and there is a significant linkage between vegetation spectral characteristics and heavy metal content when the vegetation is affected by heavy metal pollution. This means that vegetation is sensitive to heavy metal pollution through its physiological response to changes in its growing environment. The conventional methods, which often rely on large amounts of field survey data and laboratory chemical analysis, are time consuming and resource intensive. The spectrum analysis method using remote sensing technology can acquire information on the heavy metal content of the vegetation without touching it. However, retrieving that information from hyperspectral data is not easy, because it is difficult to identify, among a huge number of hyperspectral bands, the specific band that is sensitive to a specific heavy metal. Thus the selection of the sensitive band is the key to the spectrum analysis method. This paper proposes a statistical analysis method to find the feature band sensitive to heavy metal ions in the hyperspectral data and then to retrieve the metal content using field survey data and hyperspectral images from the China Environment Satellite HJ-1. This method selected copper ion content in the leaves as the indicator of copper pollution

  5. An evaluation of concept based latent semantic indexing for clinical information retrieval.

    Science.gov (United States)

    Chute, C. G.; Yang, Y.

    1992-01-01

    Latent Semantic Indexing (LSI) of surgical case report text using ICD-9-CM procedure codes and index terms was evaluated. The precision-recall performance of this two-step matrix retrieval process was compared with the SMART Document retrieval system, surface word matching, and humanly assigned procedure codes. Human coding performed best; two-step LSI did less well than surface matching or SMART. This evaluation suggests that concept-based LSI may be compromised by its two-stage nature and its dependence upon a robust term database linked to main concepts. However, the potential elegance of partial-credit concept matching merits the continued evaluation of LSI for clinical case retrieval. PMID:1482949

  6. On-demand information retrieval in sensor networks with localised query and energy-balanced data collection.

    Science.gov (United States)

    Teng, Rui; Zhang, Bing

    2011-01-01

    On-demand information retrieval enables users to query and collect up-to-date sensing information from sensor nodes. Since high energy efficiency is required in a sensor network, it is desirable to disseminate query messages with small traffic overhead and to collect sensing data with low energy consumption. However, on-demand query messages are generally forwarded to sensor nodes in network-wide broadcasts, which create large traffic overhead. In addition, since on-demand information retrieval may introduce intermittent and spatial data collections, the construction and maintenance of conventional aggregation structures such as clusters and chains come at a high cost. In this paper, we propose an on-demand information retrieval approach that exploits the name resolution of data queries according to the attribute and location of each sensor node. The proposed approach localises each query dissemination and enables localised data collection with maximised aggregation. To illustrate the effectiveness of the proposed approach, an analytical model that describes the criteria of sink proxy selection is provided. The evaluation results reveal that the proposed scheme significantly reduces energy consumption and improves the balance of energy consumption among sensor nodes by alleviating heavy traffic near the sink.

  7. Information Retrieval in a Work Setting: A Case Study of the Documentation Part of Chemists’ Work

    DEFF Research Database (Denmark)

    Hertzum, Morten

    1993-01-01

    The purpose of this study is to gain insight into a group of chemists’ documentation work in a large, international enterprise. The main concern is how filing is organized to support subsequent retrieval without overloading the primary work. The chemists’ documentation work is based on individual......, partial systems, such as piles with urgent things. Mostly, the final documentation work where documents are made part of the archive is delegated to the secretaries who act as intermediaries between the chemists and the archive. Recently, a comprehensive computer-based filing and retrieval system...

  8. Risk Informed Design Using Integrated Vehicle Rapid Assessment Tools Project

    Data.gov (United States)

    National Aeronautics and Space Administration — A successful proof of concept was performed in FY 2012 integrating the Envision tool for parametric estimates of vehicle mass and the Rapid Response Risk Assessment...

  9. Evaluating a Priori Ozone Profile Information Used in TEMPO (Tropospheric Emissions: Monitoring of Pollution) Tropospheric Ozone Retrievals

    Science.gov (United States)

    Johnson, Matthew Stephen

    2017-01-01

    A primary objective for TOLNet is the evaluation and validation of space-based tropospheric O3 retrievals from future systems such as the Tropospheric Emissions: Monitoring of Pollution (TEMPO) satellite. This study is designed to evaluate the tropopause-based O3 climatology (TB-Clim) dataset which will be used as the a priori profile information in TEMPO O3 retrievals. This study also evaluates model simulated O3 profiles, which could potentially serve as a priori O3 profile information in TEMPO retrievals, from near-real-time (NRT) data assimilation model products (NASA Global Modeling and Assimilation Office (GMAO) Goddard Earth Observing System (GEOS-5) Forward Processing (FP) and Modern-Era Retrospective analysis for Research and Applications version 2 (MERRA2)) and full chemical transport model (CTM), GEOS-Chem, simulations. The TB-Clim dataset and model products are evaluated with surface (0-2 km) and tropospheric (0-10 km) TOLNet observations to demonstrate the accuracy of the suggested a priori dataset and information which could potentially be used in TEMPO O3 algorithms. This study also presents the impact of individual a priori profile sources on the accuracy of theoretical TEMPO O3 retrievals in the troposphere and at the surface. Preliminary results indicate that while the TB-Clim climatological dataset can replicate seasonally-averaged tropospheric O3 profiles observed by TOLNet, model-simulated profiles from a full CTM (GEOS-Chem is used as a proxy for CTM O3 predictions) resulted in more accurate tropospheric and surface-level O3 retrievals from TEMPO when compared to hourly (diurnal cycle evaluation) and daily-averaged (daily variability evaluation) TOLNet observations. Furthermore, it was determined that when large daily-averaged surface O3 mixing ratios are observed (65 ppb), which are important for air quality purposes, TEMPO retrieval values at the surface display higher correlations and less bias when applying CTM a priori profile information

  10. Practical Private Information Retrieval from a Time-Varying, Multi-attribute, and Multiple-Occurrence Database

    OpenAIRE

    De Crescenzo, Giovanni; Cook, Debra; McIntosh, Allen; Panagos, Euthimios

    2014-01-01

    We study the problem of privately performing database queries (i.e., keyword searches and conjunctions over them), where a server provides its own database for client query-based access. We propose a cryptographic model for the study of such protocols, by expanding previous well-studied models of keyword search and private information retrieval to incorporate a more practical data model: a time-varying, multi-attribute and multiple-occurrence database table. Our first re...

  11. Design and Development of the MusicMoo Application Using the MIR (Music Information Retrieval) Method for the Mood, Genre Recognition, and Tempo Estimation Modules

    OpenAIRE

    Ridoean, Johanes Andre; Sarno, Riyanarto; Sunaryono, Dwi

    2017-01-01

    Methods for retrieving information about a piece of music, commonly called Music Information Retrieval (MIR), are now widely applied, for example in applications such as Shazam and SoundHound. These two applications only identify the title of a song when it is played. The aim of this research is therefore to develop MIR in a more specific direction, namely retrieving related song information together with song details, including mood...

  12. Design and development of semantic web-based system for computer science domain-specific information retrieval

    Directory of Open Access Journals (Sweden)

    Ritika Bansal

    2016-09-01

    In a semantic web-based system, the concept of ontology is used to search results by the contextual meaning of the input query instead of keyword matching. From the research literature, there seems to be a need for a tool that provides an easy interface for complex natural language queries and can retrieve domain-specific information from the ontology. This research paper proposes the IRSCSD system (Information retrieval system for computer science domain) as a solution. This system offers advanced querying and browsing of structured data, with search results automatically aggregated and rendered directly in a consistent user interface, thus reducing the manual effort of users. The main objective of this research is therefore the design and development of a semantic web-based system for integrating ontology towards domain-specific retrieval support. The methodology followed is piecemeal research involving the following stages. The first stage involves designing the framework for the semantic web-based system. The second stage builds the prototype for the framework using the Protégé tool. The third stage deals with converting natural language queries into the SPARQL query language using the Python-based QUEPY framework. The fourth stage involves firing the converted SPARQL queries at the ontology through Apache's Jena API to fetch the results. Lastly, the prototype was evaluated to ensure its efficiency and usability. Thus, this research paper throws light on framework development for a semantic web-based system that assists in efficient retrieval of domain-specific information, interpretation of natural language queries into semantic web language, and creation of a domain-specific ontology and its mapping to related ontologies. This research paper also provides approaches and metrics for ontology evaluation, applied to the prototype ontology to study its performance based on the accessibility of the required domain-related information.

  13. On the Performance of Medical Information Retrieval using MeSH Terms – A Survey

    Directory of Open Access Journals (Sweden)

    Swetha S

    2014-09-01

    The number of Internet users has increased everywhere, and searching for and retrieving documents is now commonplace. Retrieving related documents from search engines is a difficult task. To retrieve the correct documents, knowledge about the search topic is essential. Even though dedicated search engines exist for retrieving medical documents, users are not familiar with MeSH (Medical Subject Headings) terms. So, the search browser and the MeSH terms have to be integrated to make the search effective and efficient. To implement this integration, SimpleMed and MeSHMed were introduced. The MeSH terms have to be ranked to show how frequently they have been used and how important they are. To rank them, a semi-automated tool called MeSHy was developed. The terms were extracted, filtered, ranked and displayed to the user. Classifiers have to be constructed to label documents as health or non-health; three strategies were used to classify them. The errors commonly made by users also have to be identified; these were calculated based on the queries the users presented to the search browser.

  14. An Information Retrieval Model Based on Vector Space Method by Supervised Learning.

    Science.gov (United States)

    Tai, Xiaoying; Ren, Fuji; Kita, Kenji

    2002-01-01

    Proposes a method to improve retrieval performance of the vector space model by using users' relevance feedback. Discusses the use of singular value decomposition and the latent semantic indexing model, and reports the results of two experiments that show the effectiveness of the proposed method. (Author/LRW)

  15. Assimilation of SMOS (and SMAP) Retrieved Soil Moisture into the Land Information System

    Science.gov (United States)

    Blankenship, Clay; Zavodsky, Bradley; Case, Jonathan; Stano, Geoffrey

    2016-01-01

    Goal: Accurate, high-resolution (approx. 3 km) soil moisture in near-real time. Situational awareness (drought assessment, flood and fire threat). Local modeling applications (to improve sfc-PBL exchanges). Method: Assimilate satellite soil moisture retrievals into a land surface model. Combines high-resolution geophysical model data with latest satellite observations.

  16. Evaluating the Testing Effect in the Classroom: An Effective Way to Retrieve Learned Information

    Science.gov (United States)

    Atabek Yigit, Elif; Balkan Kiyici, Fatime; Çetinkaya, Gamze

    2014-01-01

    Problem statement: Evaluation, an important step in educational settings, is usually understood as a process to measure what students know or what they have learned. A variety of methods can be used for assessment and tests are one of the most important and widely-used. While being tested, one may learn or retrieve previously learned information…

  17. MRML: an extensible communication protocol for interoperability and benchmarking of multimedia information retrieval systems

    Science.gov (United States)

    Mueller, Wolfgang; Mueller, Henning; Marchand-Maillet, Stephane; Pun, Thierry; Squire, David M.; Pecenovic, Zoran; Giess, Christoph; de Vries, Arjen P.

    2000-10-01

    While in the area of relational databases interoperability is ensured by common communication protocols (e.g. ODBC/JDBC using SQL), Content Based Image Retrieval Systems (CBIRS) and other multimedia retrieval systems are lacking both a common query language and a common communication protocol. Besides its obvious short term convenience, interoperability of systems is crucial for the exchange and analysis of user data. In this paper, we present and describe an extensible XML-based query markup language, called MRML (Multimedia Retrieval Markup Language). MRML is primarily designed so as to ensure interoperability between different content-based multimedia retrieval systems. Further, MRML allows researchers to preserve their freedom in extending their system as needed. MRML encapsulates multimedia queries in a way that enables multimedia (MM) query languages, MM content descriptions, MM query engines, and MM user interfaces to grow independently from each other, reaching a maximum of interoperability while ensuring a maximum of freedom for the developer. To benefit from this, only a few simple design principles have to be respected when extending MRML for one's private needs. The design of extensions within the MRML framework will be described in detail in the paper. MRML has been implemented and tested for the CBIRS Viper, using the user interface Snake Charmer. Both are part of the GNU project and can be downloaded at our site.

  18. Horizontal Saccadic Eye Movements Enhance the Retrieval of Landmark Shape and Location Information

    Science.gov (United States)

    Brunye, Tad T.; Mahoney, Caroline R.; Augustyn, Jason S.; Taylor, Holly A.

    2009-01-01

    Recent work has demonstrated that horizontal saccadic eye movements enhance verbal episodic memory retrieval, particularly in strongly right-handed individuals. The present experiments test three primary assumptions derived from this research. First, horizontal eye movements should facilitate episodic memory for both verbal and non-verbal…

  19. The impact of note taking style and note availability at retrieval on mock jurors' recall and recognition of trial information.

    Science.gov (United States)

    Thorley, Craig; Baxter, Rebecca E; Lorek, Joanna

    2016-01-01

    Jurors forget critical trial information and what they do recall can be inaccurate. Jurors' recall of trial information can be enhanced by permitting them to take notes during a trial onto blank sheets of paper (henceforth called freestyle note taking). A recent innovation is the trial-ordered-notebook (TON) for jurors, which is a notebook containing headings outlining the trial proceedings and which has space beneath each heading for notes. In a direct comparison, TON note takers recalled more trial information than freestyle note takers. This study investigated whether note taking improves recall as a result of enhanced encoding or as a result of note access at retrieval. To assess this, mock jurors watched and freely recalled a trial video with one-fifth taking no notes, two-fifths taking freestyle notes and two-fifths using TONs. During retrieval, half of the freestyle and TON note takers could access their notes. Note taking enhanced recall, with the freestyle note takers and TON note takers without note access performing equally well. Note taking therefore enhances encoding. Recall was greatest for the TON note takers with note access, suggesting a retrieval enhancement unique to this condition. The theoretical and applied implications of these findings are discussed.

  20. Characterizing the information content of cloud thermodynamic phase retrievals from the notional PACE OCI shortwave reflectance measurements

    Science.gov (United States)

    Coddington, O. M.; Vukicevic, T.; Schmidt, K. S.; Platnick, S.

    2017-08-01

    We rigorously quantify the probability of liquid or ice thermodynamic phase using only shortwave spectral channels specific to the National Aeronautics and Space Administration's Moderate Resolution Imaging Spectroradiometer, Visible Infrared Imaging Radiometer Suite, and the notional future Plankton, Aerosol, Cloud, ocean Ecosystem imager. The results show that two shortwave-infrared channels (2135 and 2250 nm) provide more information on cloud thermodynamic phase than either channel alone; in one case, the probability of ice phase retrieval increases from 65 to 82% by combining 2135 and 2250 nm channels. The analysis is performed with a nonlinear statistical estimation approach, the GEneralized Nonlinear Retrieval Analysis (GENRA). The GENRA technique has previously been used to quantify the retrieval of cloud optical properties from passive shortwave observations, for an assumed thermodynamic phase. Here we present the methodology needed to extend the utility of GENRA to a binary thermodynamic phase space (i.e., liquid or ice). We apply formal information content metrics to quantify our results; two of these (mutual and conditional information) have not previously been used in the field of cloud studies.
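
    As a rough illustration of why a higher phase probability means more information, the sketch below computes the Shannon entropy of a binary (liquid or ice) distribution for the two probabilities quoted in the abstract. This is not the GENRA method and not the mutual or conditional information metrics used in the paper; it is only an assumed, simplified measure of the remaining uncertainty, and the function name is illustrative.

```python
import math


def binary_entropy(p):
    """Shannon entropy in bits of a two-outcome distribution (p, 1 - p)."""
    if p in (0.0, 1.0):
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


# Probabilities of ice phase quoted in the abstract.
h_single = binary_entropy(0.65)    # one channel alone
h_combined = binary_entropy(0.82)  # 2135 and 2250 nm channels combined

print(round(h_single, 3), round(h_combined, 3))
```

    Under this simplified measure, the uncertainty about the phase is lower for the combined channels than for the single channel, in line with the abstract's conclusion.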

  1. Refined repetitive sequence searches utilizing a fast hash function and cross species information retrievals

    Directory of Open Access Journals (Sweden)

    Reneker Jeff

    2005-05-01

    Background: Searching for small tandem/disperse repetitive DNA sequences streamlines many biomedical research processes. For instance, whole genomic array analysis in yeast has revealed 22 PHO-regulated genes. The promoter regions of all but one of them contain at least one of the two core Pho4p binding sites, CACGTG and CACGTT. In humans, microsatellites play a role in a number of rare neurodegenerative diseases such as spinocerebellar ataxia type 1 (SCA1). SCA1 is a hereditary neurodegenerative disease caused by an expanded CAG repeat in the coding sequence of the gene. In bacterial pathogens, microsatellites are proposed to regulate expression of some virulence factors. For example, bacteria commonly generate intra-strain diversity through phase variation which is strongly associated with virulence determinants. A recent analysis of the complete sequences of the Helicobacter pylori strains 26695 and J99 has identified 46 putative phase-variable genes among the two genomes through their association with homopolymeric tracts and dinucleotide repeats. Life scientists are increasingly interested in studying the function of small sequences of DNA. However, current search algorithms often generate thousands of matches, most of which are irrelevant to the researcher. Results: We present our hash function as well as our search algorithm to locate small sequences of DNA within multiple genomes. Our system applies information retrieval algorithms to discover knowledge of cross-species conservation of repeat sequences. We discuss our incorporation of the Gene Ontology (GO) database into these algorithms. We conduct an exhaustive time analysis of our system for various repetitive sequence lengths. For instance, a search for eight bases of sequence within 3.224 GBases on 49 different chromosomes takes 1.147 seconds on average. To illustrate the relevance of the search results, we conduct a search with and without added annotation terms for the
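
    The abstract does not describe the authors' hash function, so the following is only a generic sketch of hash-based lookup of short DNA sequences: each k-mer is packed into an integer with two bits per base, and an index maps each hash to its start positions. The encoding, the function names, and the toy sequence are all assumptions. The two query patterns are the Pho4p core sites named in the abstract.

```python
from collections import defaultdict

# Two-bit code per base; sequences containing other symbols (e.g. N) are not handled.
BASE_CODE = {"A": 0, "C": 1, "G": 2, "T": 3}


def kmer_hash(kmer):
    """Pack a DNA k-mer into an integer, two bits per base (collision-free)."""
    value = 0
    for base in kmer:
        value = (value << 2) | BASE_CODE[base]
    return value


def build_index(sequence, k):
    """Map each k-mer hash to the 0-based positions where that k-mer starts."""
    index = defaultdict(list)
    if len(sequence) < k:
        return index
    mask = (1 << (2 * k)) - 1
    value = kmer_hash(sequence[:k])
    index[value].append(0)
    for pos in range(1, len(sequence) - k + 1):
        # Rolling update: shift out the oldest base, append the newest one.
        value = ((value << 2) | BASE_CODE[sequence[pos + k - 1]]) & mask
        index[value].append(pos)
    return index


def find(index, pattern):
    """Return start positions of a pattern whose length equals the index k."""
    return index.get(kmer_hash(pattern), [])


# Hypothetical toy sequence, not real genomic data.
genome = "TTCACGTGAACACGTTGGCACGTG"
index = build_index(genome, 6)
hits_cacgtg = find(index, "CACGTG")
hits_cacgtt = find(index, "CACGTT")
print(hits_cacgtg, hits_cacgtt)
```

    Once the index is built, each lookup is a single dictionary access, which is the general reason hash-based approaches suit repeated searches for many short patterns.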

  2. Content-based image retrieval using spatial layout information in brain tumor T1-weighted contrast-enhanced MR images.

    Directory of Open Access Journals (Sweden)

    Meiyan Huang

    This study aims to develop a content-based image retrieval (CBIR) system for the retrieval of T1-weighted contrast-enhanced MR (CE-MR) images of brain tumors. When a tumor region is fed to the CBIR system as a query, the system attempts to retrieve tumors of the same pathological category. The bag-of-visual-words (BoVW) model with partition learning is incorporated into the system to extract informative features for representing the image contents. Furthermore, a distance metric learning algorithm called Rank Error-based Metric Learning (REML) is proposed to reduce the semantic gap between low-level visual features and high-level semantic concepts. The effectiveness of the proposed method is evaluated on a brain T1-weighted CE-MR dataset with three types of brain tumors (i.e., meningioma, glioma, and pituitary tumor). Using the BoVW model with partition learning, the mean average precision (mAP) of retrieval increases beyond 4.6% with the learned distance metrics compared with the spatial pyramid BoVW method. The distance metric learned by REML significantly outperforms three other existing distance metric learning methods in terms of mAP. The mAP of the CBIR system is as high as 91.8% using the proposed method, and the precision can reach 93.1% when the top 10 images are returned by the system. These preliminary results demonstrate that the proposed method is effective and feasible for the retrieval of brain tumors in T1-weighted CE-MR images.
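    The two figures quoted above, mAP and precision at the top 10, can be computed from binary relevance lists as in the sketch below. It assumes each query's ranked list is encoded as 1 (same tumor category as the query) or 0, and that average precision is normalized by the number of relevant items that appear in the list; the function names are illustrative and not taken from the paper.

```python
def precision_at_k(relevance, k):
    """Fraction of the top-k returned items that are relevant."""
    return sum(relevance[:k]) / k


def average_precision(relevance):
    """Mean of the precision values measured at each relevant item's rank."""
    hits = 0
    total = 0.0
    for rank, rel in enumerate(relevance, start=1):
        if rel:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def mean_average_precision(runs):
    """Average of the per-query average precision over all queries."""
    return sum(average_precision(r) for r in runs) / len(runs)


# Two toy queries: 1 marks a retrieved image of the query's tumor category.
runs = [[1, 0, 1, 0], [0, 1]]
map_score = mean_average_precision(runs)
p_at_2 = precision_at_k(runs[0], 2)
```

    Precision at k only looks at the head of the list, whereas mAP rewards placing every relevant image early, which is why papers like this one usually report both.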

  3. Retrieving accurate temporal and spatial information about Taylor slug flows from non-invasive NIR photometry measurements

    Science.gov (United States)

    Helmers, Thorben; Thöming, Jorg; Mießner, Ulrich

    2017-11-01

    In this article, we introduce a novel approach to retrieve spatial- and time-resolved Taylor slug flow information from a single non-invasive photometric flow sensor. The presented approach uses disperse phase surface properties to retrieve the instantaneous velocity information from a single sensor's time-scaled signal. For this purpose, a photometric sensor system is simulated using a ray-tracing algorithm to calculate spatially resolved near-infrared transmission signals. At the signal position corresponding to the rear droplet cap, a correlation factor of the droplet's geometric properties is retrieved and used to extract the instantaneous droplet velocity from the real sensor's temporal transmission signal. Furthermore, a correlation for the rear cap geometry based on the a priori known total superficial flow velocity is developed, because the cap curvature is velocity sensitive itself. Our model for velocity derivation is validated, and measurements of a first prototype showcase the capability of the device. Long-term measurements visualize systematic fluctuations in droplet lengths, velocities, and frequencies that could otherwise, without the observation on a larger timescale, have been identified as measurement errors and not systematic phenomena.

  4. Domainwise Web Page Optimization Based On Clustered Query Sessions Using Hybrid Of Trust And ACO For Effective Information Retrieval

    Directory of Open Access Journals (Sweden)

    Dr. Suruchi Chawla

    2015-08-01

    In this paper, a hybrid of Ant Colony Optimization (ACO) and trust is used for domainwise web page optimization in clustered query sessions for effective information retrieval. The trust of a web page identifies its degree of relevance in satisfying a specific information need of the user. The trusted web pages, when optimized using pheromone updates in ACO, identify the trusted colonies of web pages that are relevant to the user's information need in a given domain. Hence, in this paper the hybrid of trust and ACO is applied to clustered query sessions to identify a larger number of relevant documents in a given domain and so better satisfy the information need of the user. An experiment was conducted on a data set of web query sessions to test the effectiveness of the proposed approach in three selected domains (Academics, Entertainment, and Sports), and the results confirm the improvement in the precision of search results.
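    The abstract does not state the update rule, so the sketch below uses the textbook ACO form (evaporate every trail by a factor 1 - rho, then deposit on the visited items) and, as an assumption made purely for illustration, uses a page's trust score as the deposit amount. The function names, the `rho` default, and the toy values are all hypothetical.

```python
def update_pheromone(pheromone, clicked_pages, trust, rho=0.1):
    """Evaporate all trails, then deposit trust-weighted pheromone on clicked pages.

    Assumption: the deposit equals the page's trust score; the paper may
    combine trust and pheromone differently.
    """
    updated = {page: (1.0 - rho) * tau for page, tau in pheromone.items()}
    for page in clicked_pages:
        updated[page] = updated.get(page, 0.0) + trust.get(page, 0.0)
    return updated


def rank_pages(pheromone):
    """Order pages from strongest to weakest pheromone trail."""
    return sorted(pheromone, key=pheromone.get, reverse=True)


# Toy cluster of three pages; pages "a" and "b" were clicked in a session.
pheromone = {"a": 1.0, "b": 1.0, "c": 1.0}
trust = {"a": 0.2, "b": 0.9}
updated = update_pheromone(pheromone, ["a", "b"], trust)
ranking = rank_pages(updated)
```

    Under this rule a page that is clicked but has low trust gains little, while an unclicked page decays steadily, so repeated sessions concentrate pheromone on pages that are both visited and trusted.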

  5. Lure(d) into listening: The potential of cognition-based music information retrieval

    OpenAIRE

    Henkjan Honing

    2011-01-01

    This paper argues for the potential of cognition-based music retrieval by introducing the notion of a musical ‘hook’ as a key memorization, recall, and search mechanism. A hook is considered the most salient, memorable, and easy to recall moment of a musical phrase or song. Next to its role in searching large data-bases of music, it is proposed as a way to understand and identify which cognitively relevant musical features affect the appreciation, memorization and recall of music. To illustra...

  6. Error sources in the retrieval of aerosol information over bright surfaces from satellite measurements in the oxygen A band

    Science.gov (United States)

    Nanda, Swadhin; de Graaf, Martin; Sneep, Maarten; de Haan, Johan F.; Stammes, Piet; Sanders, Abram F. J.; Tuinder, Olaf; Pepijn Veefkind, J.; Levelt, Pieternel F.

    2018-01-01

    Retrieving aerosol optical thickness and aerosol layer height over a bright surface from measured top-of-atmosphere reflectance spectrum in the oxygen A band is known to be challenging, often resulting in large errors. In certain atmospheric conditions and viewing geometries, a loss of sensitivity to aerosol optical thickness has been reported in the literature. This loss of sensitivity has been attributed to a phenomenon known as critical surface albedo regime, which is a range of surface albedos for which the top-of-atmosphere reflectance has minimal sensitivity to aerosol optical thickness. This paper extends the concept of critical surface albedo for aerosol layer height retrievals in the oxygen A band, and discusses its implications. The underlying physics are introduced by analysing the top-of-atmosphere reflectance spectrum as a sum of atmospheric path contribution and surface contribution, obtained using a radiative transfer model. Furthermore, error analysis of an aerosol layer height retrieval algorithm is conducted over dark and bright surfaces to show the dependence on surface reflectance. The analysis shows that the derivative with respect to aerosol layer height of the atmospheric path contribution to the top-of-atmosphere reflectance is opposite in sign to that of the surface contribution - an increase in surface brightness results in a decrease in information content. In the case of aerosol optical thickness, these derivatives are anti-correlated, leading to large retrieval errors in high surface albedo regimes. The consequence of this anti-correlation is demonstrated with measured spectra in the oxygen A band from the GOME-2 instrument on board the Metop-A satellite over the 2010 Russian wildfires incident.

  7. Facilitating medical information search using Google Glass connected to a content-based medical image retrieval system.

    Science.gov (United States)

    Widmer, Antoine; Schaer, Roger; Markonis, Dimitrios; Muller, Henning

    2014-01-01

    Wearable computing devices are starting to change the way users interact with computers and the Internet. Among them, Google Glass includes a small screen located in front of the right eye, a camera filming in front of the user and a small computing unit. Google Glass has the advantage of providing online services while allowing the user to perform tasks with his/her hands. These augmented glasses open up many useful applications, including in the medical domain. For example, Google Glass can easily provide video conferencing between medical doctors to discuss a live case. Using these glasses can also facilitate medical information search by allowing access to a large number of annotated medical cases during a consultation in a non-disruptive fashion for medical staff. In this paper, we developed a Google Glass application able to take a photo and send it to a medical image retrieval system along with keywords in order to retrieve similar cases. As a preliminary assessment of the usability of the application, we tested the application under three conditions (images of the skin; printed CT scans and MRI images; and CT and MRI images acquired directly from an LCD screen) to explore whether using Google Glass affects the accuracy of the results returned by the medical image retrieval system. The preliminary results show that despite minor problems due to the relative stability of the Google Glass, images can be sent to and processed by the medical image retrieval system and similar images are returned to the user, potentially helping in the decision making process.

  8. Information operator approach applied to the retrieval of vertical distributions of atmospheric constituents from ground-based FTIR measurements

    Science.gov (United States)

    Senten, Cindy; de Mazière, Martine; Vanhaelewyn, Gauthier; Vigouroux, Corinne; Delmas, Robert

    2010-05-01

    The retrieval of information about the vertical distribution of an atmospheric absorber from high spectral resolution ground-based Fourier Transform infrared (FTIR) solar absorption spectra is an important issue in remote sensing. A frequently used technique at present is the optimal estimation method. This work introduces the application of an alternative method, namely the information operator approach (Doicu et al., 2007; Hoogen et al., 1999), for extracting the available information from such FTIR measurements. This approach has been implemented within the well-known retrieval code SFIT2, by adapting the optimal estimation method such as to take into account only the significant contributions to the solution. In particular, we demonstrate the feasibility of the method when applied to ground-based FTIR spectra taken at the southern (sub)tropical site Ile de La Réunion (21° S, 55° E) in 2007. A thorough comparison has been made between the retrieval results obtained with the original optimal estimation method and the ones obtained with the information operator approach, regarding profile and column stability, information content and corresponding full error budget evaluation. This has been done for the target species ozone (O3), methane (CH4), nitrous oxide (N2O), and carbon monoxide (CO). It is shown that the information operator approach performs well and is capable of achieving the same accuracy as optimal estimation, with a gain of stability and with the additional advantage of being less sensitive to the choice of a priori information as well as to the actual signal-to-noise ratio. Keywords: ground-based FTIR, solar absorption spectra, greenhouse gases, information operator approach References Doicu, A., Hilgers, S., von Bargen, A., Rozanov, A., Eichmann, K.-U., von Savigny, C., and Burrows, J.P.: Information operator approach and iterative regularization methods for atmospheric remote sensing, J. Quant. Spectrosc. Radiat. Transfer, 103, 340-350, 2007

  9. Natural Language Object Retrieval

    OpenAIRE

    Hu, Ronghang; Xu, Huazhe; Rohrbach, Marcus; Feng, Jiashi; Saenko, Kate; Darrell, Trevor

    2015-01-01

    In this paper, we address the task of natural language object retrieval, to localize a target object within a given image based on a natural language query of the object. Natural language object retrieval differs from text-based image retrieval task as it involves spatial information about objects within the scene and global scene context. To address this issue, we propose a novel Spatial Context Recurrent ConvNet (SCRC) model as scoring function on candidate boxes for object retrieval, integ...

  10. A note on a fatal error of optimized LFC private information retrieval scheme and its corrected results

    DEFF Research Database (Denmark)

    Tamura, Jim; Kobara, Kazukuni; Fathi, Hanane

    2010-01-01

    A number of lightweight PIR (Private Information Retrieval) schemes have been proposed in recent years. In JWIS2006, Kwon et al. proposed a new scheme (optimized LFCPIR, or OLFCPIR), which aimed at reducing the communication cost of Lipmaa's O(log^2 n) PIR (LFCPIR) to O(log n). However, in this paper, we point out a fatal error of overflow contained in OLFCPIR and show how the error can be corrected. Finally, we compare with LFCPIR to show that the communication cost of our corrected OLFCPIR is asymptotically the same as the previous LFCPIR....

  11. An innovative, multidisciplinary educational program in interactive information storage and retrieval. Presentation visuals. M.S. Thesis Final Report, 1 Jul. 1985 - 31 Dec. 1987

    Science.gov (United States)

    Dominick, Wayne D. (Editor); Gallagher, Mary C.

    1985-01-01

    This Working Paper Series entry represents a collection of presentation visuals associated with the companion report entitled An Innovative, Multidisciplinary Educational Program in Interactive Information Storage and Retrieval, USL/DBMS NASA/RECON Working Paper Series report number DBMS.NASA/RECON-12. The project objectives are to develop a set of transportable, hands-on, data base management courses for science and engineering students to facilitate their utilization of information storage and retrieval programs.

  12. On the Estimation and Use of Statistical Modelling in Information Retrieval

    DEFF Research Database (Denmark)

    Petersen, Casper

    distribution, and to the fact that (ii) making such assumptions does not seem to impact IR effectiveness. However, if such assumptions are not validated, any subsequent calculations, deductions or modelling become less accurate for the task at hand. To remove the need for such assumptions, this thesis first introduces a statistically principled method for selecting the best-fitting distribution. The thesis then demonstrates that integrating knowledge about the best-fitting distribution into IR leads to superior results compared to existing strong baselines on multiple datasets. Overall, this thesis concludes that assumptions regarding the distribution of dataset properties can be replaced with an effective, efficient and principled method for determining the best-fitting distribution and that using this distribution can lead to improved retrieval performance....

  13. Ontology driven framework for multimedia information retrieval in P2P network

    CERN Document Server

    Sokhn, Maria

    During the last decade we have witnessed an exponential growth of digital documents and multimedia resources, including a vast amount of video resources. Videos are becoming one of the most popular media thanks to the rich audio, visual and textual content they may convey. The recent technological advances have made this large amount of multimedia resources available to users in a variety of areas, including the academic and scientific realms. However, without adequate techniques for effective content based multimedia retrieval, this large and valuable body of data is barely accessible and remains in effect unusable. This thesis explores semantic approaches to content based management browsing and visualization of the multimedia resources generated for and during scientific conferences. Indeed, a so-called semantic gap exists between the explicit knowledge representation required by users who search the multimedia resources and the implicit knowledge conveyed within a conference life cycle. The aim of this wo...

  14. Available Methods in Farsi-English Cross Language Information Retrieval Using Machine-readable, Bilingual Glossary

    Directory of Open Access Journals (Sweden)

    Hamid Alizadeh

    2009-12-01

    In this paper, the scope of the impact of Natural Language Processing (NLP) on translating search statements was determined by testing research hypotheses. The NLP techniques employed for search statement processing included text parsing, linguistic forms identification, stopword removal, morphological analysis, and tokenization. Examination of the hypotheses indicated: that translating only the first equivalent term selected, rather than selecting all equivalent terms, would contribute to increased efficiency of the review; that while morphological analysis of the terms not translated by the glossary would increase the retrieval precision cutoff, the lack of such analysis would make no significant difference; and that sentence translation, as opposed to term-by-term translation, would increase the efficiency of Farsi-English proofreading. Other findings are also presented.
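    The contrast between translating with the first glossary equivalent and with all equivalents can be sketched as follows. This is a toy illustration only: the transliterated glossary entries, the stopword list, and the function name `translate_query` are assumptions, and the paper's parsing and morphological analysis steps are not modelled.

```python
# Toy transliterated Farsi-English glossary and stopword list (hypothetical).
STOPWORDS = {"dar", "va"}  # "in", "and"
GLOSSARY = {
    "ketab": ["book"],
    "khaneh": ["house", "home"],
}


def translate_query(query, first_only=True):
    """Tokenize on whitespace, drop stopwords, and map terms through the glossary.

    Terms missing from the glossary are passed through unchanged.
    """
    terms = []
    for token in query.lower().split():
        if token in STOPWORDS:
            continue
        equivalents = GLOSSARY.get(token, [token])
        terms.extend(equivalents[:1] if first_only else equivalents)
    return terms


first = translate_query("ketab dar khaneh")
expanded = translate_query("ketab dar khaneh", first_only=False)
```

    Taking only the first equivalent keeps the translated query short, while taking all equivalents widens it; which choice retrieves better is the kind of question the hypotheses above were testing.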

  15. Searching to Translate and Translating to Search: When Information Retrieval Meets Machine Translation

    Science.gov (United States)

    Ture, Ferhan

    2013-01-01

    With the adoption of web services in daily life, people have access to tremendous amounts of information, beyond any human's reading and comprehension capabilities. As a result, search technologies have become a fundamental tool for accessing information. Furthermore, the web contains information in multiple languages, introducing another barrier…

  16. How You Store Information Affects How You Can Retrieve It: A Fundamental Principle for Business Students Studying Information Systems and Technology

    Science.gov (United States)

    Silver, Mark S.

    2017-01-01

    During the current period of rapid technological change, business students need to emerge from their introductory course in Information Systems (IS) with a set of fundamental principles to help them "think about Information Technology (IT)" in future courses and the workplace. Given the digital revolution, they also need to appreciate…

  17. Can global variation of nasopharynx cancer be retrieved from the combined analyses of IARC Cancer Information (CIN) databases?

    Directory of Open Access Journals (Sweden)

    Xin Sun

    BACKGROUND: The international nasopharynx cancer (NPC) burdens are masked due to the lack of integrated studies that examine epidemiological data based on up-to-date international disease databases such as the Cancer Information (CIN) databases provided by the International Agency for Research on Cancer (IARC). METHODS: By analyzing the most recently updated NPC epidemiological data available from IARC, we tried to retrieve the worldwide NPC burden and patterns from combined analysis with GLOBOCAN2008 and the Cancer Incidence in Five Continents (CI5) databases. We provide age-standardized rates (ASR) for NPC mortality in the 20 highest cancer registries from GLOBOCAN2008 and the World Health Organization (WHO) mortality databases, respectively. However, NPC incidence data cannot be retrieved since it is not individually listed in the CI5 database. The trend of NPC mortality was investigated with Joinpoint analysis in the selected countries/regions with high ASR. RESULTS: GLOBOCAN 2008 revealed that the highest NPC incidence rates in 2008 were in registries from South-Eastern Asia, Micronesia and Southern Africa, with Malaysia, Indonesia and Singapore ranking as the top 3. WHO mortality database analysis revealed that China Hong Kong, Singapore and Malta rank as the top 3 regions with the highest 5-year mortality rates. CONCLUSIONS: The NPC mortality rate is about 2-3 times higher in males than in females, and shows a decreasing tendency in the selected countries/regions during the analyzed periods. However, the integrated analyses of the current IARC CIN databases may not be suitable to retrieve epidemiological data of NPC. Much effort is required to improve the local cancer entry and regional death-reporting systems so as to aid similar studies.

  18. Lure(d) into listening: The potential of cognition-based music information retrieval

    Directory of Open Access Journals (Sweden)

    Henkjan Honing

    2011-04-01

    This paper argues for the potential of cognition-based music retrieval by introducing the notion of a musical ‘hook’ as a key memorization, recall, and search mechanism. A hook is considered the most salient, memorable, and easy to recall moment of a musical phrase or song. Next to its role in searching large data-bases of music, it is proposed as a way to understand and identify which cognitively relevant musical features affect the appreciation, memorization and recall of music. To illustrate the potential of this idea for the computational humanities (Willekens et al., 2010), in the second half of the paper a pilot research project is described. This project, named Listen, Lure & Locate, aims to study the cultural phenomenon of being lured to listen to new unfamiliar music, and especially the role that recent internet-mediated technologies can have in this process. It is argued that a combination of crowd annotation (i.e., social- or crowd-tagging) and marking the specific moment (the hook) in one’s favorite music has great potential for improving search engines for music. In addition, these annotations will provide a rich empirical source to music cognition research in determining what makes certain melodic fragments more sticky than others.

  19. A Framework for Information Retrieval and Knowledge Discovery from Online Healthcare Forums

    Science.gov (United States)

    Sampathkumar, Hariprasad

    2016-01-01

    Information used to assist biomedical and clinical research has largely comprised of data available in published sources like scientific papers and journals, or in clinical sources like patient health records, lab reports and discharge summaries. Information from such sources, though extensive and organized, is often not readily available due to…

  20. An Empirical Comparison of Visualization Tools To Assist Information Retrieval on the Web.

    Science.gov (United States)

    Heo, Misook; Hirtle, Stephen C.

    2001-01-01

    Discusses problems with navigation in hypertext systems, including cognitive overload, and describes a study that tested information visualization techniques to see which best represented the underlying structure of Web space. Considers the effects of visualization techniques on user performance on information searching tasks and the effects of…

  1. Exploring topic-based language models for effective web information retrieval

    NARCIS (Netherlands)

    Li, R.; Kaptein, R.; Hiemstra, D.; Kamps, J.

    2008-01-01

    The main obstacle for providing focused search is the relative opaqueness of search request—searchers tend to express their complex information needs in only a couple of keywords. Our overall aim is to find out if, and how, topic-based language models can lead to more effective web information

  2. Exploring Topic-based Language Models for Effective Web Information Retrieval

    NARCIS (Netherlands)

    Li, R.; Kaptein, Rianne; Hiemstra, Djoerd; Kamps, Jaap; Hoenkamp, E.; De Cock, M.; Hoste, V.

    2008-01-01

    The main obstacle for providing focused search is the relative opaqueness of search request -- searchers tend to express their complex information needs in only a couple of keywords. Our overall aim is to find out if, and how, topic-based language models can lead to more effective web information

  3. Behind the scenes of the digital museum of information retrieval research

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Pothoven, Tristan; van Vliet, Marijn; Harman, Donna

    As more and more of the world becomes digital, and documents become easily available over the Internet, we are suddenly able to access all kinds of information. The downside of this however is that information that is not digital becomes less accessed, and is liable to be lost to us and to future

  4. Assessing Website Quality in Context: Retrieving Information about Genetically Modified Food on the Web

    Science.gov (United States)

    McInerney, Claire R.; Bird, Nora J.

    2005-01-01

    Introduction: Knowing the credibility of information about genetically modified food on the Internet is critical to the everyday life information seeking of consumers as they form opinions about this nascent agricultural technology. The Website Quality Evaluation Tool (WQET) is a valuable instrument that can be used to determine the credibility of…

  5. Altered retrieval of melodic information in congenital amusia: insights from dynamic causal modeling of MEG data.

    Science.gov (United States)

    Albouy, Philippe; Mattout, Jérémie; Sanchez, Gaëtan; Tillmann, Barbara; Caclin, Anne

    2015-01-01

    Congenital amusia is a neuro-developmental disorder that primarily manifests as a difficulty in the perception and memory of pitch-based materials, including music. Recent findings have shown that the amusic brain exhibits altered functioning of a fronto-temporal network during pitch perception and short-term memory. Within this network, during the encoding of melodies, a decreased right backward frontal-to-temporal connectivity was reported in amusia, along with an abnormal connectivity within and between auditory cortices. The present study investigated whether connectivity patterns between these regions were affected during the short-term memory retrieval of melodies. Amusics and controls had to indicate whether sequences of six tones that were presented in pairs were the same or different. When melodies were different only one tone changed in the second melody. Brain responses to the changed tone in "Different" trials and to its equivalent (original) tone in "Same" trials were compared between groups using Dynamic Causal Modeling (DCM). DCM results confirmed that congenital amusia is characterized by an altered effective connectivity within and between the two auditory cortices during sound processing. Furthermore, right temporal-to-frontal message passing was altered in comparison to controls, with notably an increase in "Same" trials. An additional analysis in control participants emphasized that the detection of an unexpected event in the typically functioning brain is supported by right fronto-temporal connections. The results can be interpreted in a predictive coding framework as reflecting an abnormal prediction error sent by temporal auditory regions towards frontal areas in the amusic brain.

  6. The relational luring effect: Retrieval of relational information during associative recognition.

    Science.gov (United States)

    Popov, Vencislav; Hristova, Penka; Anders, Royce

    2017-05-01

    Here we argue that semantic relations (e.g., works in: nurse-hospital) have abstract independent representations in long-term memory (LTM) and that the same representation is accessed by all exemplars of a specific relation. We present evidence from 2 associative recognition experiments that uncovered a novel relational luring effect (RLE) in recognition memory. Participants studied word pairs, and then discriminated between intact (old) pairs and recombined lures. In the first experiment participants responded more slowly to lures that were relationally similar (table-cloth) to studied pairs (floor-carpet), in contrast to relationally dissimilar lures (pipe-water). Experiment 2 extended the RLE by showing a continuous effect of relational lure strength on recognition times (RTs), false alarms, and hits. It used a continuous pair recognition task, where each recombined lure or target could be preceded by 0, 1, 2, 3 or 4 different exemplars of the same relation. RTs and false alarms increased linearly with the number of different previously seen relationally similar pairs. Moreover, more typical exemplars of a given relation lead to a stronger RLE. Finally, hits for intact pairs also rose with the number of previously studied different relational instances. These results suggest that semantic relations exist as independent representations in LTM and that during associative recognition these representations can be a spurious source of familiarity. We discuss the implications of the RLE for current models of semantic and episodic memory, unitization in associative recognition, analogical reasoning and retrieval, as well as constructive memory research. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  7. Popular song and lyrics synchronization and its application to music information retrieval

    Science.gov (United States)

    Chen, Kai; Gao, Sheng; Zhu, Yongwei; Sun, Qibin

    2006-01-01

    An automatic synchronization system for popular songs and their lyrics is presented in this paper. The system includes two main components: a) automatically detecting vocal/non-vocal segments in the audio signal and b) automatically aligning the acoustic signal of the song with its lyrics using speech recognition techniques and positioning the boundaries of the lyrics in their acoustic realization at multiple levels simultaneously (e.g. the word/syllable level and the phrase level). The GMM models and a set of HMM-based acoustic model units are carefully designed and trained for the detection and alignment. To eliminate the severe mismatch due to the diversity of musical signals and the sparse training data available, an unsupervised adaptation technique, maximum likelihood linear regression (MLLR), is exploited for tailoring the models to the real environment, which improves the robustness of the synchronization system. To further reduce the effect of missed non-vocal music on alignment, a novel grammar net is built to direct the alignment. To our knowledge, this is the first automatic synchronization system based only on low-level acoustic features such as MFCC. We evaluate the system on a Chinese song dataset collected from 3 popular singers. We obtain 76.1% for the boundary accuracy at the syllable level (BAS) and 81.5% for the boundary accuracy at the phrase level (BAP) using fully automatic vocal/non-vocal detection and alignment. The synchronization system has many applications such as multi-modality (audio and textual) content-based popular song browsing and retrieval. Through the study, we would like to open up the discussion of some challenging problems in developing a robust synchronization system for large-scale databases.

  8. Can Music Foster Learning – Effects of Different Text Modalities on Learning and Information Retrieval

    Science.gov (United States)

    Lehmann, Janina A. M.; Seufert, Tina

    2018-01-01

    This study investigates the possibilities of fostering learning based on differences in recall and comprehension after learning with texts which were presented in one of three modalities: either in a spoken, written, or sung version. All three texts differ regarding their processing, especially when considering working memory. Overall, we assume the best recall performance after learning with the written text and the best comprehension performance after learning with the sung text, respectively, compared to both other text modalities. We also analyzed whether the melody of the sung material functions as a mnemonic aid for the learners in the sung text condition. If melody and text of the sung version are closely linked, presentation of the melody during the post-test phase could foster text retrieval. 108 students either learned from a sung text performed by a professional singer, a printed text, or the same text read out loud. Half of the participants worked on the post-test while listening to the melody used for the musical learning material and the other half did not listen to a melody. The written learning modality led to significantly better recall than with the spoken (d = 0.97) or sung text (d = 0.78). However, comprehension after learning with the sung modality was significantly superior compared to when learning with the written learning modality (d = 0.40). Reading leads to more focus on details, which is required to answer recall questions, while listening fosters a general understanding of the text, leading to higher levels of comprehension. Listening to the melody during the post-test phase negatively affected comprehension, irrespective of the modality during the learning phase. This can be explained by the seductive detail effect, as listening to the melody during the post-test phase may distract learners from their main task. In closing, theoretical and practical implications are discussed.

  9. Altered retrieval of melodic information in congenital amusia: Insights from Dynamic Causal Modeling of MEG data

    Directory of Open Access Journals (Sweden)

    Philippe Albouy

    2015-02-01

    Congenital amusia is a neuro-developmental disorder that primarily manifests as a difficulty in the perception and memory of pitch-based materials, including music. Recent findings have shown that the amusic brain exhibits altered functioning of a fronto-temporal network during pitch perception and memory. Within this network, during the encoding of melodies, a decreased right backward frontal-to-temporal connectivity was reported in amusia, along with an abnormal connectivity within and between auditory cortices. The present study investigated whether connectivity patterns between these regions were affected during the retrieval of melodies. Amusics and controls had to indicate whether sequences of six tones that were presented in pairs were the same or different. When melodies were different, only one tone changed in the second melody. Brain responses to the changed tone in Different trials and to its equivalent (original) tone in Same trials were compared between groups using Dynamic Causal Modeling (DCM). DCM results confirmed that congenital amusia is characterized by an altered effective connectivity within and between the two auditory cortices during sound processing. Furthermore, right temporal-to-frontal message passing was altered in comparison to controls, with an increase in Same trials and a decrease in Different trials. An additional analysis in control participants emphasized that the detection of an unexpected event in the typically functioning brain is supported by right fronto-temporal connections. The results can be interpreted in a predictive coding framework as reflecting an abnormal prediction error sent by temporal auditory regions towards frontal areas in the amusic brain.

  10. A New Cellular Architecture for Information Retrieval from Sensor Networks through Embedded Service and Security Protocols.

    Science.gov (United States)

    Shahzad, Aamir; Landry, René; Lee, Malrey; Xiong, Naixue; Lee, Jongho; Lee, Changhoon

    2016-06-14

    Substantial changes have occurred in the Information Technology (IT) sectors and with these changes, the demand for remote access to field sensor information has increased. This allows visualization, monitoring, and control through various electronic devices, such as laptops, tablets, i-Pads, PCs, and cellular phones. The smart phone is considered as a more reliable, faster and efficient device to access and monitor industrial systems and their corresponding information interfaces anywhere and anytime. This study describes the deployment of a protocol whereby industrial system information can be securely accessed by cellular phones via a Supervisory Control And Data Acquisition (SCADA) server. To achieve the study goals, proprietary protocol interconnectivity with non-proprietary protocols and the usage of interconnectivity services are considered in detail. They support the visualization of the SCADA system information, and the related operations through smart phones. The intelligent sensors are configured and designated to process real information via cellular phones by employing information exchange services between the proprietary protocol and non-proprietary protocols. SCADA cellular access raises the issue of security flaws. For these challenges, a cryptography-based security method is considered and deployed, and it could be considered as a part of a proprietary protocol. Subsequently, transmission flows from the smart phones through a cellular network.

  11. A New Cellular Architecture for Information Retrieval from Sensor Networks through Embedded Service and Security Protocols

    Directory of Open Access Journals (Sweden)

    Aamir Shahzad

    2016-06-01

    Substantial changes have occurred in the Information Technology (IT) sectors and with these changes, the demand for remote access to field sensor information has increased. This allows visualization, monitoring, and control through various electronic devices, such as laptops, tablets, i-Pads, PCs, and cellular phones. The smart phone is considered as a more reliable, faster and efficient device to access and monitor industrial systems and their corresponding information interfaces anywhere and anytime. This study describes the deployment of a protocol whereby industrial system information can be securely accessed by cellular phones via a Supervisory Control And Data Acquisition (SCADA) server. To achieve the study goals, proprietary protocol interconnectivity with non-proprietary protocols and the usage of interconnectivity services are considered in detail. They support the visualization of the SCADA system information, and the related operations through smart phones. The intelligent sensors are configured and designated to process real information via cellular phones by employing information exchange services between the proprietary protocol and non-proprietary protocols. SCADA cellular access raises the issue of security flaws. For these challenges, a cryptography-based security method is considered and deployed, and it could be considered as a part of a proprietary protocol. Subsequently, transmission flows from the smart phones through a cellular network.

  12. Query Enhancement with Topic Detection and Disambiguation for Robust Retrieval

    Science.gov (United States)

    Zhang, Hui

    2013-01-01

    With the rapid increase in the amount of available information, people nowadays rely heavily on information retrieval (IR) systems such as web search engines to fulfill their information needs. However, due to the lack of domain knowledge and the limitations of natural language such as synonyms and polysemes, many system users cannot formulate their…

  13. A Java-based multi-institutional medical information retrieval system.

    Science.gov (United States)

    Wang, K; van Wingerde, F J; Bradshaw, K; Szolovits, P; Kohane, I

    1997-01-01

    JAMI (Java-based Agglutination of Medical Information) is designed as a framework for integrating heterogeneous information systems used in healthcare-related institutions. It is one of the implementations under the W3-EMRS project, which aims to use the World Wide Web (Web) to unify different hospital information systems. JAMI inherited several design decisions from the first W3-EMRS implementation, including using the Web as the communication infrastructure and HL7 as the communication protocol between the heterogeneous systems and the W3-EMRS systems. In addition, JAMI incorporates the growing Java technologies and has a more flexible and efficient architecture. This paper describes JAMI's architecture and implementation. It also presents two instances of JAMI, one for the integration of different hospital information systems and another for the integration of two heterogeneous systems within a single hospital. Some important issues for the further development of JAMI, including security and confidentiality, data input, and decision support, are discussed.

  14. Linking genes to literature: text mining, information extraction, and retrieval applications for biology.

    Science.gov (United States)

    Krallinger, Martin; Valencia, Alfonso; Hirschman, Lynette

    2008-01-01

    Efficient access to information contained in online scientific literature collections is essential for life science research, playing a crucial role from the initial stage of experiment planning to the final interpretation and communication of the results. The biological literature also constitutes the main information source for manual literature curation used by expert-curated databases. Following the increasing popularity of web-based applications for analyzing biological data, new text-mining and information extraction strategies are being implemented. These systems exploit existing regularities in natural language to extract biologically relevant information from electronic texts automatically. The aim of the BioCreative challenge is to promote the development of such tools and to provide insight into their performance. This review presents a general introduction to the main characteristics and applications of currently available text-mining systems for life sciences in terms of the following: the type of biological information demands being addressed; the level of information granularity of both user queries and results; and the features and methods commonly exploited by these applications. The current trend in biomedical text mining points toward an increasing diversification in terms of application types and techniques, together with integration of domain-specific resources such as ontologies. Additional descriptions of some of the systems discussed here are available on the internet http://zope.bioinfo.cnio.es/bionlp_tools/.

  15. Effects of load and maintenance duration on the time course of information encoding and retrieval in working memory: from perceptual analysis to post-categorization processes.

    Science.gov (United States)

    Pinal, Diego; Zurrón, Montserrat; Díaz, Fernando

    2014-01-01

    Working memory (WM) involves three cognitive events: information encoding, maintenance, and retrieval; these are supported by brain activity in a network of frontal, parietal and temporal regions. Manipulation of WM load and duration of the maintenance period can modulate this activity. Although such modulations have been widely studied using the event-related potentials (ERP) technique, a precise description of the time course of brain activity during encoding and retrieval is still required. Here, we used this technique and principal component analysis to assess the time course of brain activity during encoding and retrieval in a delayed match to sample task. We also investigated the effects of memory load and duration of the maintenance period on ERP activity. Brain activity was similar during information encoding and retrieval and comprised six temporal factors, which closely matched the latency and scalp distribution of some ERP components: P1, N1, P2, N2, P300, and a slow wave. Changes in memory load modulated task performance and yielded variations in frontal lobe activation. Moreover, the P300 amplitude was smaller in the high than in the low load condition during encoding and retrieval. Conversely, the slow wave amplitude was higher in the high than in the low load condition during encoding, and the same was true for the N2 amplitude during retrieval. Thus, during encoding, memory load appears to modulate the processing resources for context updating and post-categorization processes, and during retrieval it modulates resources for stimulus classification and context updating. Besides, despite the lack of differences in task performance related to duration of the maintenance period, larger N2 amplitude and stronger activation of the left temporal lobe after long than after short maintenance periods were found during information retrieval. Thus, results regarding the duration of maintenance period were complex, and future work is required to test the time-based decay theory predictions.

  16. Effects of load and maintenance duration on the time course of information encoding and retrieval in working memory: from perceptual analysis to post-categorization processes.

    Directory of Open Access Journals (Sweden)

    Diego Pinal

    2014-04-01

    Working memory (WM) involves three cognitive events: information encoding, maintenance and retrieval; these are supported by brain activity in a network of frontal, parietal and temporal regions. Manipulation of WM load and duration of the maintenance period can modulate this activity. Although such modulations have been widely studied using the ERP technique, a precise description of the time course of brain activity during encoding and retrieval is still required. Here, we used this technique and principal component analysis to assess the time course of brain activity during encoding and retrieval in a delayed match to sample task. We also investigated the effects of memory load and duration of the maintenance period on ERP activity. Brain activity was similar during information encoding and retrieval and comprised six temporal factors, which closely matched the latency and scalp distribution of some ERP components: P1, N1, P2, N2, P300 and a slow wave. Changes in memory load modulated task performance and yielded variations in frontal lobe activation. Moreover, the P300 amplitude was smaller in the high than in the low load condition during encoding and retrieval. Conversely, the slow wave amplitude was higher in the high than in the low load condition during encoding, and the same was true for the N2 amplitude during retrieval. Thus, during encoding, memory load appears to modulate the processing resources for context updating and post-categorization processes, and during retrieval it modulates resources for stimulus classification and context updating. Besides, despite the lack of differences in task performance related to duration of the maintenance period, larger N2 amplitude and stronger activation of the left temporal lobe after long than after short maintenance periods were found during information retrieval. Thus, results regarding the duration of maintenance period were complex, and future work is required to test the time-based decay theory predictions.

  17. Enhancing computer literacy and information retrieval skills: A rural and remote nursing and midwifery workforce study.

    Science.gov (United States)

    Mills, Jane; Francis, Karen; McLeod, Margaret; Al-Motlaq, Mohammad

    2015-01-01

    Nurses and midwives collectively represent the largest workforce category in rural and remote areas of Australia. Maintaining currency of practice and attaining annual licensure with the Australian Health Practitioners Regulatory Authority (AHPRA) present challenges for individual nurses and midwives and for their health service managers. Engagement with information and communication technologies, in order for geographically isolated clinicians to access ongoing education and training, is considered a useful strategy to address such challenges. This paper presents a pre- and post-test study design. It examines the impact of an online continuing professional development (CPD) program on Australian rural nurses and midwives. The aims of the program were to increase basic skill acquisition in the utilisation of common computer software, the use of the Internet and the enhancement of email communication. Findings from the study demonstrate that participants who complete a relevant CPD program gain confidence in the use of information and communication technologies. Further, increased confidence leads to increased access to contemporary, reliable and important health care information on the Internet, in addition to clinicians adopting email as a regular method of communication. Health care employers commonly assume employees are skilled users of information and communication technologies. However, findings from this study contradict such assumptions. It is argued in the recommendations that health care employees should be given regular access to CPD programs designed to introduce them to information and communication technologies. Developing knowledge and skills in this area has the potential to improve staff productivity, raise health care standards and improve patient outcomes.

  18. LinkHub: a Semantic Web system that facilitates cross-database queries and information retrieval in proteomics

    Directory of Open Access Journals (Sweden)

    Cheung Kei-Hoi

    2007-05-01

    Background: A key abstraction in representing proteomics knowledge is the notion of unique identifiers for individual entities (e.g. proteins) and the massive graph of relationships among them. These relationships are sometimes simple (e.g. synonyms) but are often more complex (e.g. one-to-many relationships in protein family membership). Results: We have built a software system called LinkHub using Semantic Web RDF that manages the graph of identifier relationships and allows exploration with a variety of interfaces. For efficiency, we also provide relational-database access and translation between the relational and RDF versions. LinkHub is practically useful in creating small, local hubs on common topics and then connecting these to major portals in a federated architecture; we have used LinkHub to establish such a relationship between UniProt and the North East Structural Genomics Consortium. LinkHub also facilitates queries and access to information and documents related to identifiers spread across multiple databases, acting as "connecting glue" between different identifier spaces. We demonstrate this with example queries discovering "interologs" of yeast protein interactions in the worm and exploring the relationship between gene essentiality and pseudogene content. We also show how "protein family based" retrieval of documents can be achieved. LinkHub is available at hub.gersteinlab.org and hub.nesg.org with supplement, database models and full-source code. Conclusion: LinkHub leverages Semantic Web standards-based integrated data to provide novel information retrieval to identifier-related documents through relational graph queries, simplifies and manages connections to major hubs such as UniProt, and provides useful interactive and query interfaces for exploring the integrated data.
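
    The idea of a graph of identifier relationships spanning several databases can be illustrated with a minimal sketch. This is not LinkHub's implementation (the abstract says LinkHub uses Semantic Web RDF and a relational database); it assumes a plain in-memory adjacency map, and the function names and the identifiers in the usage note are hypothetical placeholders.

```python
from collections import deque


def add_link(graph, a, b):
    """Record an undirected relationship between two (database, id) nodes."""
    graph.setdefault(a, set()).add(b)
    graph.setdefault(b, set()).add(a)


def related_identifiers(graph, start, max_hops=2):
    """Breadth-first search: map each identifier reachable from `start`
    within `max_hops` links to its hop distance."""
    distance = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if distance[node] == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in graph.get(node, ()):
            if neighbor not in distance:
                distance[neighbor] = distance[node] + 1
                queue.append(neighbor)
    del distance[start]  # the start node is not "related" to itself
    return distance
```

    With made-up nodes such as ("dbA", "id1") linked to ("dbB", "id2"), which is in turn linked to ("dbC", "id3"), a two-hop query from the first node would return the other two with distances 1 and 2. The hop limit is one simple way to keep a cross-database query from wandering through the whole graph.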

  19. A Methodology for Retrieving Information from Malware Encrypted Output Files: Brazilian Case Studies

    Directory of Open Access Journals (Sweden)

    Nelson Uto

    2013-04-01

    This article presents and explains a methodology based on cryptanalytic and reverse engineering techniques that can be employed to quickly recover information from encrypted files generated by malware. The objective of the methodology is to minimize the effort spent on static and dynamic analysis by using cryptanalysis and related knowledge as much as possible. To illustrate how it works, we present three case studies, taken from a large Brazilian company that was victimized by targeted attacks focused on stealing information from special-purpose hardware used in its environment.

  20. Expanding user’s query with tag-neighbors for effective medical information retrieval

    DEFF Research Database (Denmark)

    Durao, Frederico; Bayyapu, Karunakar Reddy; Xu, Guandong

    2014-01-01

    Medical information is a natural human demand. Existing search engines on the Web often are unable to handle medical search well because they do not consider its special requirements. Often a medical information searcher is uncertain about his exact questions and unfamiliar with medical terminology... compute a set of significant tag neighbor candidates based on the neighbor frequency and weight, and utilize the qualified tag neighbors to expand an entry query. The proposed approach is evaluated by using MedWorm medical article collection and results show considerable precision improvements over state...
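
    Query expansion with tag neighbors can be sketched roughly as follows. This is an illustrative assumption, not the authors' method: here a "neighbor" is any tag assigned to the same document as a query term, candidates are ranked by co-occurrence frequency only (the abstract also mentions a weight, which is not modeled), and the names `tag_neighbors`, `expand_query`, `min_freq` and `top_k` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def tag_neighbors(tagged_docs):
    """Count, for every tag, how often each other tag shares a document with it."""
    neighbors = {}
    for tags in tagged_docs:
        for a, b in combinations(sorted(set(tags)), 2):
            neighbors.setdefault(a, Counter())[b] += 1
            neighbors.setdefault(b, Counter())[a] += 1
    return neighbors


def expand_query(query_terms, neighbors, min_freq=2, top_k=2):
    """Append up to `top_k` neighbors per query term whose co-occurrence
    frequency is at least `min_freq` (assumed qualification rule)."""
    expanded = list(query_terms)
    for term in query_terms:
        # Highest frequency first; ties broken alphabetically for determinism.
        candidates = sorted(neighbors.get(term, {}).items(),
                            key=lambda item: (-item[1], item[0]))
        added = 0
        for tag, freq in candidates:
            if freq < min_freq or added == top_k:
                break
            if tag not in expanded:
                expanded.append(tag)
                added += 1
    return expanded
```

    For a toy collection in which "diabetes" is tagged twice together with "insulin", twice with "glucose" and once with "diet", the query ["diabetes"] would be expanded with "glucose" and "insulin" but not "diet". The frequency threshold is what keeps rare, possibly accidental co-occurrences out of the expanded query.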

  1. Intelligent multimedia indexing and retrieval through multi-source information extraction and merging

    NARCIS (Netherlands)

    Kuper, Jan; Saggion, H.; Cunningham, H.; Declerck, T.; de Jong, Franciska M.G.; Reidsma, Dennis; Wilks, Y.; Wittenburg, P.

    This paper reports work on automated meta-data creation for multimedia content. The approach results in the generation of a conceptual index of the content which may then be searched via semantic categories instead of keywords. The novelty of the work is to exploit multiple sources of information

  2. Excerpta Medica Automated Storage and Retrieval Program of Biomedical Information. Excerpta Mark I System.

    Science.gov (United States)

    Excerpta Medica Foundation, Amsterdam (Netherlands).

    This is a report of the international operations of the Excerpta Medica Foundation whose aim is to further the progress of medical knowledge by making information available to the medical and related professions on all significant basic research and clinical findings reported in any language, anywhere in the world. To accomplish this task,…

  3. The impact of named entity normalization on information retrieval for question answering

    NARCIS (Netherlands)

    Khalid, M.A.; Jijkoun, V.; de Rijke, M.

    2008-01-01

    In the named entity normalization task, a system identifies a canonical unambiguous referent for names like Bush or Alabama. Resolving synonymy and ambiguity of such names can benefit end-to-end information access tasks. We evaluate two entity normalization methods based on Wikipedia in the context

  4. Polyphonic Music Information Retrieval Based on Multi-Label Cascade Classification System

    Science.gov (United States)

    Jiang, Wenxin

    2009-01-01

    Recognition and separation of sounds played by various instruments is very useful in labeling audio files with semantic information. This is a non-trivial task requiring sound analysis, but the results can aid automatic indexing and browsing music data when searching for melodies played by user specified instruments. Melody match based on pitch…

  5. PROJECT EVALUATION OF TECHNICAL CONDITION OF SHIP STRUCTURES WITH USE OF INFORMATION RETRIEVAL SYSTEMS

    Directory of Open Access Journals (Sweden)

    Юлия Алексеевна КАЗИМИРЕНКО

    2015-06-01

    The mechanisms for evaluating the technical condition of ship structures were investigated, and a new specialized information retrieval system was developed for collecting, analyzing, and processing data on defects of new materials during the design, construction, and operation of ships and floating structures for transporting goods of hazard classes 1, 4, and 6-8.

  6. Turning text into research networks: information retrieval and computational ontologies in the creation of scientific databases.

    Science.gov (United States)

    Ceci, Flávio; Pietrobon, Ricardo; Gonçalves, Alexandre Leopoldo

    2012-01-01

    Web-based, free-text documents on science and technology have been growing in number on the web. However, most of these documents are not immediately processable by computers, slowing down the acquisition of useful information. Computational ontologies might represent a possible solution by enabling semantically machine-readable data sets. However, the process of ontology creation, instantiation and maintenance is still based on manual methodologies and is thus time and cost intensive. We focused on a large corpus containing information on researchers, research fields, and institutions. We based our strategy on traditional entity recognition, social computing and correlation. We devised a semi-automatic approach for the recognition, correlation and extraction of named entities and relations from textual documents, which are then used to create, instantiate, and maintain an ontology. We present a prototype demonstrating the applicability of the proposed strategy, along with a case study describing how direct and indirect relations can be extracted from academic and professional activities registered in a database of curricula vitae in free-text format. We present evidence that this system can identify entities to assist in the process of knowledge extraction and representation to support ontology maintenance. We also demonstrate the extraction of relationships among ontology classes and their instances. We have demonstrated that our system can be used for the conversion of research information in free-text format into a database with a semantic structure. Future studies should test this system using the growing amount of free-text information available at the institutional and national levels.

  7. Exploring the Use of Concept Spaces to Improve Medical Information Retrieval

    Science.gov (United States)

    2000-01-01

    170, February. [6] G.C. Chute, Y. Yang, D.A. Evans, Latent semantic indexing of medical diagnoses using UMLS semantic structures, Proceedings of...Furnas, T.K. Landauer, R. Harshman, Indexing by latent semantic analysis, Journal of the American Society for Information Science 41 (6) (1990) 391–407

  8. Ontology-based retrieval of bio-medical information based on microarray text corpora

    DEFF Research Database (Denmark)

    Hansen, Kim Allan; Zambach, Sine; Have, Christian Theil

    are exponentially growing, the text corpora are sparse and inconsistent in spite of attempts to standardize the format. Ordinary keyword search may in some cases be insufficient to find relevant information and the potential benefit of using a semantic approach in this context has only been investigated to a limited...

  9. SWHi system description : A case study in information retrieval, inference, and visualization in the Semantic Web

    NARCIS (Netherlands)

    Fahmi, Ismail; Zhang, Junte; Ellermann, Henk; Bouma, Gosse; Franconi, E; Kifer, M; May, W

    2007-01-01

    Search engines have become the most popular tools for finding information on the Internet. A real-world Semantic Web application can benefit from this by combining its features with some features from search engines. In this paper, we describe methods for indexing and searching a populated ontology

  10. What Should Users Expect from Information Storage and Retrieval Systems of the 1980’s

    Science.gov (United States)

    1981-12-01

    Col Woodrow Dunlop, in his time head of ASTIA, the information agency of the U.S. Armed Forces. His most courageous approach in tackling the problems...arid procedures. All this leads to weariness, dissatisfaction and eventual frustration. The 80’s should bring a significant improvement in the

  11. Exploiting Structure and Conventions of Movie Scripts for Information Retrieval and Text Mining

    DEFF Research Database (Denmark)

    Jhala, Arnav

    2008-01-01

    Movie scripts are documents that describe the story, stage direction for actors and camera, and dialogue. Script writers, directors, and cinematographers have standardized the format and language that is used in script writing. Scripts contain a wealth of information about narrative patterns, cha...

  12. 76 FR 52581 - Automated Data Processing and Information Retrieval System Requirements

    Science.gov (United States)

    2011-08-23

    ... quality, utility and clarity of the information to be collected; and (4) ways to minimize the burden of..., professional test management, repeatable test processes, specific pass/fail metrics, adequate time allotted for... severity levels, error tracking software, results reporting, and regression testing. The system should be...

  13. The Computerized Cataloguing of Historic Watercraft: A Case Study in Information Retrieval in Museology.

    Science.gov (United States)

    Summers, John E.; Summers, Edward G.

    1989-01-01

    Discusses the application of information science concepts to problems in museum collections management and describes a project in which the Vancouver Maritime Museum recataloged its historic watercraft collection. A list of fields to be used in computerized cataloging of historical watercraft is proposed, as well as cataloging protocol. (29…

  14. Use of an Information Retrieval Service in an Obstetrics/Gynecology Residency Program.

    Science.gov (United States)

    Gunning, John E.; And Others

    1980-01-01

    A program that uses the clinical librarian as a member of the patient care team has been developed by an obstetrics and gynecology department of a university medical center to keep faculty and hospital house staff knowledgeable about current developments and research. Program objectives, methodology, costs, evaluation, and information utilization…

  15. Machine Learning for Information Retrieval: Neural Networks, Symbolic Learning, and Genetic Algorithms.

    Science.gov (United States)

    Chen, Hsinchun

    1995-01-01

    Presents an overview of artificial-intelligence-based inductive learning techniques and their use in information science research. Three methods are discussed: the connectionist Hopfield network; the symbolic ID3/ID5R; evolution-based genetic algorithms. The knowledge representations and algorithms of these methods are examined in the context of…
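
    As a rough illustration of the connectionist method named above, the sketch below implements a classic Hopfield associative memory with Hebbian weights and asynchronous updates. It is a generic textbook formulation under stated assumptions (bipolar +1/-1 units, zero thresholds), not the specific knowledge representation or algorithm examined in the article, and the function names are hypothetical.

```python
def train_hopfield(patterns):
    """Build a symmetric weight matrix with the Hebbian rule
    (sum of outer products of the stored patterns, zero diagonal)."""
    n = len(patterns[0])
    weights = [[0] * n for _ in range(n)]
    for pattern in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    weights[i][j] += pattern[i] * pattern[j]
    return weights


def recall(weights, state, max_sweeps=10):
    """Update units one at a time until no unit changes (or the sweep limit)."""
    state = list(state)
    n = len(state)
    for _ in range(max_sweeps):
        changed = False
        for i in range(n):
            field = sum(weights[i][j] * state[j] for j in range(n))
            new_value = 1 if field >= 0 else -1
            if new_value != state[i]:
                state[i] = new_value
                changed = True
        if not changed:
            break  # the network has settled into a stable state
    return state
```

    Storing the pattern [1, -1, 1, -1, 1, -1] and presenting a copy with its last unit flipped lets `recall` restore the stored pattern. In a retrieval setting the units could stand for terms or concepts, so that a partial query settles toward a stored association; that mapping is an assumption made here for illustration.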

  16. Using a Reference Corpus as a User Model for Focused Information Retrieval

    NARCIS (Netherlands)

    Mishne, G.A.; de Rijke, M.; Jijkoun, V.; van Zwol, R.

    2005-01-01

    We propose a method for ranking short information nuggets extracted from a text corpus, using another, reliable reference corpus as a user model. We argue that the availability and usage of such additional corpora is common in a number of IR tasks, and apply the method to answering a form of

  17. Using a Reference Corpus as a User Model for Focused Information Retrieval

    NARCIS (Netherlands)

    Mishne, G.A.; de Rijke, M.; Jijkoun, V.

    2005-01-01

    We propose a method for ranking short information nuggets extracted from a text corpus, using another, reliable reference corpus as a user model. We argue that the availability and usage of such additional corpora is common in a number of IR tasks, and apply the method to answering a form of

  18. Integration of top-down and bottom-up information for audio organization and retrieval

    DEFF Research Database (Denmark)

    Jensen, Bjørn Sand

    The increasing availability of digital audio and music calls for methods and systems to analyse and organize these digital objects. This thesis investigates three elements related to such systems focusing on the ability to represent and elicit the user's view on the multimedia object and the system output. The aim is to provide organization and processing, which aligns with the understanding and needs of the users. Audio and music is often characterized by the large amount of heterogeneous information. The first aspect investigated is the integration of such multi-variate and multi-modal information... is applied in the field of music emotion modelling and optimization of a parametric audio system with high-dimensional input spaces. The final aspect, considered in the thesis, concerns the general context of users, such as location and social context. This is important in understanding user behavior...

  19. Turning text into research networks: information retrieval and computational ontologies in the creation of scientific databases.

    Directory of Open Access Journals (Sweden)

    Flávio Ceci

    BACKGROUND: Web-based, free-text documents on science and technology have been growing in number on the web. However, most of these documents are not immediately processable by computers, slowing down the acquisition of useful information. Computational ontologies might represent a possible solution by enabling semantically machine-readable data sets. However, the process of ontology creation, instantiation and maintenance is still based on manual methodologies and is thus time and cost intensive. METHOD: We focused on a large corpus containing information on researchers, research fields, and institutions. We based our strategy on traditional entity recognition, social computing and correlation. We devised a semi-automatic approach for the recognition, correlation and extraction of named entities and relations from textual documents, which are then used to create, instantiate, and maintain an ontology. RESULTS: We present a prototype demonstrating the applicability of the proposed strategy, along with a case study describing how direct and indirect relations can be extracted from academic and professional activities registered in a database of curricula vitae in free-text format. We present evidence that this system can identify entities to assist in the process of knowledge extraction and representation to support ontology maintenance. We also demonstrate the extraction of relationships among ontology classes and their instances. CONCLUSION: We have demonstrated that our system can be used for the conversion of research information in free-text format into a database with a semantic structure. Future studies should test this system using the growing amount of free-text information available at the institutional and national levels.

  20. Foundations for context-aware information retrieval for proactive decision support

    Science.gov (United States)

    Mittu, Ranjeev; Lin, Jessica; Li, Qingzhe; Gao, Yifeng; Rangwala, Huzefa; Shargo, Peter; Robinson, Joshua; Rose, Carolyn; Tunison, Paul; Turek, Matt; Thomas, Stephen; Hanselman, Phil

    2016-05-01

    Intelligence analysts and military decision makers are faced with an onslaught of information. From the now ubiquitous presence of intelligence, surveillance, and reconnaissance (ISR) platforms providing large volumes of sensor data, to vast amounts of open source data in the form of news reports, blog postings, or social media postings, the amount of information available to a modern decision maker is staggering. Whether tasked with leading a military campaign or providing support for a humanitarian mission, being able to make sense of all the information available is a challenge. Due to the volume and velocity of this data, automated tools are required to help support reasoned, human decisions. In this paper we describe several automated techniques that are targeted at supporting decision making. Our approaches include modeling the kinematics of moving targets as motifs; developing normalcy models and detecting anomalies in kinematic data; automatically classifying the roles of users in social media; and modeling geo-spatial regions based on the behavior that takes place in them. These techniques cover a wide range of potential decision maker needs.

  1. Scientometric Indicators and Webometrics - and the Polyrepresentation Principle in Information Retrieval

    DEFF Research Database (Denmark)

    Ingwersen, Peter

    This book contains the text of three lectures from the 28th Sarada Ranganathan Endowment Lectures, held in Bangalore in December 2010. The lectures were delivered by Dr. Peter Ingwersen, Professor at the Danish School of Library and Information Science, Copenhagen. The first lecture on scientometric indicators presented two fundamental models of scientific communication: the classic one - mainly providing access to document records in library catalogues and bibliographic databases - and the digitized one - relying on open access and diversified document access potentials. The lecture...

  2. Optical encryption and QR codes: secure and noise-free information retrieval.

    Science.gov (United States)

    Barrera, John Fredy; Mira, Alejandro; Torroba, Roberto

    2013-03-11

    We introduce for the first time the concept of an information "container" before a standard optical encrypting procedure. The "container" selected is a QR code, which offers the main advantage of being tolerant to pollutant speckle noise. Besides, the QR code can be read by smartphones, a massively used device. Additionally, the QR code adds another security step to the encryption benefits that optical methods provide. The QR code is generated by means of freely available software. The concept development proves that the speckle noise polluting the outcomes of normal optical encrypting procedures can be avoided, making the adoption of these techniques more attractive. Actual smartphone-collected results are shown to validate our proposal.

  3. A digital social network for rapid collection of earthquake disaster information

    Science.gov (United States)

    Xu, J. H.; Nie, G. Z.; Xu, X.

    2013-02-01

    Acquiring disaster information quickly after an earthquake is crucial for disaster and emergency rescue management. This study examines a digital social network - an earthquake disaster information reporting network - for rapid collection of earthquake disaster information. Based on the network, the disaster information rapid collection method is expounded in this paper. The structure and components of the reporting network are introduced. Then the work principles of the reporting network are discussed, in which the rapid collection of disaster information is realised by using Global System for Mobile Communications (GSM) messages to report the disaster information and Geographic information system (GIS) to analyse and extract useful disaster information. This study introduces some key technologies for the work principles, including the methods of mass sending and receiving of SMS for disaster management, the reporting network grouping management method, brief disaster information codes, and the GIS modelling of the reporting network. Finally, a city earthquake disaster information quick reporting system is developed and with the support of this system the reporting network obtained good results in a real earthquake and earthquake drills. This method is a semi-real time disaster information collection method which extends current SMS based method and meets the need of small and some moderate earthquakes.

  4. AN INFORMATION-THEORETIC APPROACH TO OPTIMIZE JWST OBSERVATIONS AND RETRIEVALS OF TRANSITING EXOPLANET ATMOSPHERES

    Energy Technology Data Exchange (ETDEWEB)

    Howe, Alex R.; Burrows, Adam [Department of Astronomy, University of Michigan, 1085 S. University, Ann Arbor, MI 48109 (United States); Deming, Drake, E-mail: arhowe@umich.edu, E-mail: burrows@astro.princeton.edu, E-mail: ddeming@astro.umd.edu [Department of Astronomy, University of Maryland College Park, MD 20742 (United States)

    2017-01-20

    We provide an example of an analysis to explore the optimization of observations of transiting hot Jupiters with the James Webb Space Telescope (JWST) to characterize their atmospheres based on a simple three-parameter forward model. We construct expansive forward model sets for 11 hot Jupiters, 10 of which are relatively well characterized, exploring a range of parameters such as equilibrium temperature and metallicity, as well as considering host stars over a wide range in brightness. We compute posterior distributions of our model parameters for each planet with all of the available JWST spectroscopic modes and several programs of combined observations and compute their effectiveness using the metric of estimated mutual information per degree of freedom. From these simulations, clear trends emerge that provide guidelines for designing a JWST observing program. We demonstrate that these guidelines apply over a wide range of planet parameters and target brightnesses for our simple forward model.

  5. The ADAM project: a generic web interface for retrieval and display of ATLAS TDAQ information.

    CERN Document Server

    Harwood, A; The ATLAS collaboration; Magnoni, L; Vandelli, W; Savu, D

    2011-01-01

    This paper describes a new approach to the visualization of stored information about the operation of the ATLAS Trigger and Data Acquisition system. ATLAS is one of the two general purpose detectors positioned along the Large Hadron Collider at CERN. Its data acquisition system consists of several thousand computers interconnected via multiple gigabit Ethernet networks, that are constantly monitored via different tools. Operational parameters ranging from the temperature of the computers to the network utilization are stored in several databases for later analysis. Although the ability to view these data-sets individually is already in place, currently there is no way to view this data together, in a uniform format, from one location. The ADAM project has been launched in order to overcome this limitation. It defines a uniform web interface to collect data from multiple providers that have different structures. It is capable of aggregating and correlating the data according to user defined criteria. Finally, ...

  6. ADAM Project – A generic web interface for retrieval and display of ATLAS TDAQ information.

    CERN Document Server

    Harwood, A; The ATLAS collaboration; Lehmann Miotto, G

    2011-01-01

    This paper describes a new approach to the visualization of stored information about the operation of the ATLAS Trigger and Data Acquisition system. ATLAS is one of the two general purpose detectors positioned along the Large Hadron Collider at CERN. Its data acquisition system consists of several thousand computers interconnected via multiple gigabit Ethernet networks, that are constantly monitored via different tools. Operational parameters ranging from the temperature of the computers to the network utilization are stored in several databases for a posteriori analysis. Although the ability to view these data-sets individually is already in place, there currently is no way to view this data together, in a uniform format, from one location. The ADAM project has been launched in order to overcome this limitation. It defines a uniform web interface to collect data from multiple diversely structured providers. It is capable of aggregating and correlating the data according to user defined criteria. Finally it v...

  7. The Design of PC/MISI, a PC-Based Common User Interface to Remote Information Storage and Retrieval Systems. M.S. Thesis Final Report, 1 Jul. 1985 - 31 Dec. 1987

    Science.gov (United States)

    Dominick, Wayne D. (Editor); Hall, Philip P.

    1985-01-01

    The amount of information contained in the data bases of large-scale information storage and retrieval systems is very large and growing at a rapid rate. The methods available for assessing this information have not been successful in making the information easily available to the people who have the greatest need for it. This thesis describes the design of a personal computer based system which will provide a means for these individuals to retrieve this data through one standardized interface. The thesis identifies each of the major problems associated with providing access to casual users of IS and R systems and describes the manner in which these problems are to be solved by the utilization of the local processing power of a PC. Additional capabilities, not available with standard access methods, are also provided to improve the user's ability to make use of this information. The design of PC/MISI is intended to facilitate its use as a research vehicle. Evaluation mechanisms and possible areas of future research are described. The PC/MISI development effort is part of a larger research effort directed at improving access to remote IS and R systems. This research effort, supported in part by NASA, is also reviewed.

  8. Biomedical information from a national collection of spine x-rays: film to content-based retrieval

    Science.gov (United States)

    Long, L. Rodney; Antani, Sameer; Lee, Dah-Jye; Krainak, Daniel M.; Thoma, George R.

    2003-05-01

    We summarize research and development for the extraction and distribution of biomedical information from a collection of 17,000 spine x-ray images collected by the second National Health and Nutrition Examination Survey (NHANES II). We present a history of the technical milestones of this work, including the data collection as film, digitization, quality control, archiving technology, database organization, medical expert content evaluation, and Web data distribution. We conclude by presenting our current work in content-based image retrieval (CBIR) to exploit the information content of these images directly by using image processing. We provide an overview and current research results from this CBIR work, which includes: extensive segmentation research, focusing on Active Shape Modeling and Active Contour methods; alternative techniques for shape representation, including invariant moments, simple polygon approximation, and Fourier descriptors; neural network classification of shapes into biomedical categories, such as "anterior osteophytes present/not present" and the implementation of a prototype CBIR system for the vertebrae that supports hybrid text/image queries using MATLAB and the MySQL relational database system.
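
    Among the shape representations listed above, Fourier descriptors are easy to illustrate. The following is a minimal sketch only, not the authors' implementation: it assumes the vertebra boundary is available as an ordered list of (x, y) points traversed counter-clockwise, and the function name and the number of retained coefficients are hypothetical choices.

```python
import cmath
import math


def fourier_descriptors(points, n_coeffs=8):
    """Return Fourier descriptors of a closed contour.

    points: ordered (x, y) boundary samples, assumed counter-clockwise.
    The result is unchanged by translating, rotating, or uniformly
    scaling the contour.
    """
    z = [complex(x, y) for x, y in points]
    n = len(z)
    if n_coeffs > n - 2:
        raise ValueError("contour has too few points for n_coeffs")
    # Discrete Fourier transform of the complex boundary signal.
    coeffs = [
        sum(z[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)) / n
        for k in range(n)
    ]
    # coeffs[0] is the centroid, so skipping it removes translation.
    # Magnitudes discard rotation; dividing by |coeffs[1]| removes scale.
    scale = abs(coeffs[1])
    if scale == 0:
        raise ValueError("degenerate contour: first harmonic is zero")
    return [abs(coeffs[k]) / scale for k in range(2, 2 + n_coeffs)]


if __name__ == "__main__":
    # Hypothetical test shape: a slightly perturbed ellipse.
    angles = [2 * math.pi * i / 32 for i in range(32)]
    shape = [((2 + 0.5 * math.cos(3 * a)) * math.cos(a),
              (1 + 0.3 * math.sin(2 * a)) * math.sin(a)) for a in angles]
    print(fourier_descriptors(shape))
```

    Because the descriptors are a short fixed-length vector, they could serve as input to a shape classifier or as an index key for shape queries. This sketch keeps only positive-frequency harmonics, which is a simplification.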

  9. Invasive brain-machine interfaces: a survey of paralyzed patients’ attitudes, knowledge and methods of information retrieval

    Science.gov (United States)

    Lahr, Jacob; Schwartz, Christina; Heimbach, Bernhard; Aertsen, Ad; Rickert, Jörn; Ball, Tonio

    2015-08-01

    Objective. Brain-machine interfaces (BMI) are an emerging therapeutic option that can allow paralyzed patients to gain control over assistive technology devices (ATDs). BMI approaches can be broadly classified into invasive (based on intracranially implanted electrodes) and noninvasive (based on skin electrodes or extracorporeal sensors). Invasive BMIs have a favorable signal-to-noise ratio, and thus allow for the extraction of more information than noninvasive BMIs, but they are also associated with the risks related to neurosurgical device implantation. Current noninvasive BMI approaches are typically concerned, among other issues, with long setup times and/or intensive training. Recent studies have investigated the attitudes of paralyzed patients eligible for BMIs, particularly patients affected by amyotrophic lateral sclerosis (ALS). These studies indicate that paralyzed patients are indeed interested in BMIs. Little is known, however, about the degree of knowledge among paralyzed patients concerning BMI approaches or about how patients retrieve information on ATDs. Furthermore, it is not yet clear if paralyzed patients would accept intracranial implantation of BMI electrodes with the premise of decoding improvements, and what the attitudes of a broader range of patients with diseases such as stroke or spinal cord injury are towards this new kind of treatment. Approach. Using a questionnaire, we surveyed 131 paralyzed patients for their opinions on invasive BMIs and their attitude toward invasive BMI treatment options. Main results. The majority of the patients knew about and had a positive attitude toward invasive BMI approaches. The group of ALS patients was especially open to the concept of BMIs. The acceptance of invasive BMI technology depended on the improvements expected from the technology. Furthermore, the survey revealed that for paralyzed patients, the Internet is an important source of information on ATDs. Significance. Websites tailored to

  10. Uncertainty of soil reflectance retrieval from SPOT and RapidEye multispectral satellite images using a per-pixel bootstrapped empirical line atmospheric correction over an agricultural region

    Science.gov (United States)

    Vaudour, E.; Gilliot, J. M.; Bel, L.; Bréchet, L.; Hamiache, J.; Hadjar, D.; Lemonnier, Y.

    2014-02-01

    Many authors have reported the use of empirical line regression between field target sites and image pixels in order to perform atmospheric correction of multispectral images. However, few studies were dedicated to the specific reflectance retrieval for cultivated bare soils from multispectral satellite images, from a large number (≥15) of bare field targets spread over a region. Even fewer were oriented towards additional field targets for validation and uncertainty assessment of reflectance error. This study aimed at assessing ELM validation accuracy and uncertainty for predicting topsoil reflectance over a wide area (221 km²) with contrasting soils and tillage practices using a set of six multispectral images at very high (supermode SPOT5, 2.5 m), high (RapidEye, 6.5 m) and medium (SPOT4, 20 m) spatial resolutions. For each image and each spectral band, linear regression (LR) models were constructed through a series of 1000 bootstrap datasets of training/validation samples generated amongst a total of about 30 field sites used as targets, the reflectance measurements of which were made between -6 days/+7 days around acquisition date. The achieved models had an average coefficient of variation of validation errors of ∼14%, which indicates that the composition of training field sites does influence performance results of ELM. However, according to median LR-models, our approach mostly resulted in accurate predictions with low standard errors of estimation around 1-2% reflectance, validation errors of 2-3% reflectance, low validation bias (March: in agricultural areas, images programmed during periods when most field tillage operations have resulted in smooth seedbed conditions (April in this study) are in favour of better performances of soil reflectance prediction. Nevertheless, directional effects appear to mainly and moderately affect the global performance of near-infrared and SWIR bands-models except for oblique viewing images (viewing angle > |20°|). The
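
    The bootstrapped empirical line procedure described in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' code: the sites left out of each bootstrap resample are used as the validation sample (the abstract does not say how the split was made), the data in the demo are synthetic, and all function names are hypothetical.

```python
import math
import random
import statistics


def fit_line(xs, ys):
    """Least-squares fit of y = gain * x + offset."""
    mean_x, mean_y = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    gain = sxy / sxx
    return gain, mean_y - gain * mean_x


def bootstrap_empirical_line(pixels, reflectances, n_boot=1000, seed=0):
    """Fit one line per bootstrap resample and validate on left-out sites.

    Returns a list of (gain, offset, validation_rmse) tuples.
    """
    rng = random.Random(seed)
    n = len(pixels)
    models = []
    for _ in range(n_boot):
        train = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        chosen = set(train)
        held_out = [i for i in range(n) if i not in chosen]
        # Skip degenerate resamples: nothing to validate on, or one pixel value.
        if not held_out or len({pixels[i] for i in train}) < 2:
            continue
        gain, offset = fit_line([pixels[i] for i in train],
                                [reflectances[i] for i in train])
        sq_err = [(gain * pixels[i] + offset - reflectances[i]) ** 2
                  for i in held_out]
        models.append((gain, offset, math.sqrt(statistics.fmean(sq_err))))
    return models


def summarize(models):
    """Median model and coefficient of variation of the validation errors."""
    gains = [m[0] for m in models]
    offsets = [m[1] for m in models]
    rmses = [m[2] for m in models]
    return {
        "median_gain": statistics.median(gains),
        "median_offset": statistics.median(offsets),
        "median_rmse": statistics.median(rmses),
        "cv_rmse": statistics.stdev(rmses) / statistics.fmean(rmses),
    }


if __name__ == "__main__":
    # Synthetic stand-in for about 30 field sites in one spectral band.
    rng = random.Random(42)
    pixels = [rng.uniform(200, 800) for _ in range(30)]
    reflectances = [0.0005 * p - 0.02 + rng.gauss(0, 0.01) for p in pixels]
    print(summarize(bootstrap_empirical_line(pixels, reflectances)))
```

    The spread of validation errors across resamples (`cv_rmse` here) corresponds to the kind of sensitivity to training-site composition that the abstract reports.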

  11. Contextual and serial discriminations: a new learning paradigm to assess simultaneously the effects of acute stress on retrieval of flexible or stable information in mice.

    Science.gov (United States)

    Célérier, Aurélie; Piérard, Christophe; Rachbauer, Dagmar; Sarrieau, Alain; Béracochéa, Daniel

    2004-01-01

    The present study was aimed at simultaneously determining on the same subject, the effects of stress on retrieval of flexible (contextual or temporal) or stable (spatial) information. Three behavioral paradigms carried out in a four-hole board were designed as follows: (1) Simple Discrimination (SD), in which mice learned a single discrimination; (2) Contextual and Serial Discriminations (CSD), in which mice learned two successive discriminations on two different internal contexts; (3) Spatial Serial Discriminations (SSD), in which mice learned two successive discriminations on an identical internal context. The stressor (three inescapable electric footshocks) was delivered 5 min before retention, occurring 5 min or 24 h after acquisition. Results showed that this stressor increased plasmatic corticosterone levels and fear reactivity in an elevated-plus-maze, as compared with nonstressed mice. The stressor reversed the normal pattern of retrieval observed in nonstressed controls in the CSD task, this effect being context dependent, as it was not observed in the SSD task. Overall, our study shows that stress affected the retrieval of flexible and old information, but spared the retrieval of stable or recent ones. Therefore, these behavioral paradigms allow us to study simultaneously, on the same animal, the effects of stress on distinct forms of memory retrieval.

  12. Retrieval of Ice Cloud Properties Using an Optimal Estimation Algorithm and MODIS Infrared Observations. Part I: Forward Model, Error Analysis, and Information Content

    Science.gov (United States)

    Wang, Chenxi; Platnick, Steven; Zhang, Zhibo; Meyer, Kerry; Yang, Ping

    2016-01-01

    An optimal estimation (OE) retrieval method is developed to infer three ice cloud properties simultaneously: optical thickness (τ), effective radius (r_eff), and cloud top height (h). This method is based on a fast radiative transfer (RT) model and infrared (IR) observations from the MODerate resolution Imaging Spectroradiometer (MODIS). This study conducts thorough error and information content analyses to understand the error propagation and performance of retrievals from various MODIS band combinations under different cloud/atmosphere states. Specifically, the algorithm takes into account four error sources: measurement uncertainty, fast RT model uncertainty, uncertainties in ancillary data sets (e.g., atmospheric state), and assumed ice crystal habit uncertainties. It is found that the ancillary and ice crystal habit error sources dominate the MODIS IR retrieval uncertainty and cannot be ignored. The information content analysis shows that for a given ice cloud, the use of four MODIS IR observations is sufficient to retrieve the three cloud properties. However, the selection of MODIS IR bands that provide the most information and their order of importance varies with both the ice cloud properties and the ambient atmospheric and the surface states. As a result, this study suggests the inclusion of all MODIS IR bands in practice since little a priori information is available.
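
    For orientation, the quantities usually involved in an optimal estimation retrieval and its information content analysis are written out below. The abstract does not give its formulation, so this is the standard textbook form and the notation is an assumption: x is the state vector (here the three cloud properties), x_a and S_a are the a priori state and its covariance, y is the vector of observed IR radiances, F is the forward model with Jacobian K, and S_ε is the combined observation-error covariance (the four error sources named above would enter here, for instance summed if treated as independent).

```latex
\begin{aligned}
J(\mathbf{x}) &= [\mathbf{y}-F(\mathbf{x})]^{T}\,\mathbf{S}_{\epsilon}^{-1}\,[\mathbf{y}-F(\mathbf{x})]
  + (\mathbf{x}-\mathbf{x}_a)^{T}\,\mathbf{S}_a^{-1}\,(\mathbf{x}-\mathbf{x}_a) \\
\hat{\mathbf{S}} &= \left(\mathbf{K}^{T}\mathbf{S}_{\epsilon}^{-1}\mathbf{K} + \mathbf{S}_a^{-1}\right)^{-1} \\
\mathbf{A} &= \hat{\mathbf{S}}\,\mathbf{K}^{T}\mathbf{S}_{\epsilon}^{-1}\mathbf{K},
  \qquad d_s = \operatorname{tr}(\mathbf{A}) \\
H &= \tfrac{1}{2}\log_2\left|\mathbf{S}_a\,\hat{\mathbf{S}}^{-1}\right|
  = -\tfrac{1}{2}\log_2\left|\mathbf{I}-\mathbf{A}\right|
\end{aligned}
```

    Here J is the cost function minimized by the retrieval, Ŝ the posterior covariance, A the averaging kernel, d_s the degrees of freedom for signal (at most 3 for a three-element state), and H the Shannon information content in bits. Comparing d_s or H across band combinations is one common way to rank which observations contribute the most information.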

  13. ISMIR 2010 Proceedings of the 11th International Society for Music Information Retrieval Conference, August 9-13, 2010 Utrecht, Netherlands

    NARCIS (Netherlands)

    Downie, J. Stephen; Veltkamp, Remco C.

    2010-01-01

    Welcome to the 11th International Society for Music Information Retrieval Conference (ISMIR 2010). ISMIR 2010 will be convened in Utrecht, Netherlands, 9-13 August 2010 and is jointly organised by Utrecht University, the Utrecht School of the Arts, the Meertens Institute and Philips Research. The

  14. Natural language query system design for interactive information storage and retrieval systems. Presentation visuals. M.S. Thesis Final Report, 1 Jul. 1985 - 31 Dec. 1987

    Science.gov (United States)

    Dominick, Wayne D. (Editor); Liu, I-Hsiung

    1985-01-01

    This Working Paper Series entry represents a collection of presentation visuals associated with the companion report entitled Natural Language Query System Design for Interactive Information Storage and Retrieval Systems, USL/DBMS NASA/RECON Working Paper Series report number DBMS.NASA/RECON-17.

  15. Design and Development of the MusicMoo Application Using the MIR (Music Information Retrieval) Method in the Mood, Genre Recognition, and Tempo Estimation Modules

    Directory of Open Access Journals (Sweden)

    Johanes Andre Ridoean

    2017-03-01

    Full Text Available Currently, methods for retrieving information about a piece of music, commonly called Music Information Retrieval (MIR), are widely applied, for example in applications such as Shazam and SoundHound. These two applications only identify the title of a song when it is played to them. The aim of this research is therefore to develop MIR further in a more specific direction, namely retrieving the matching song together with song details, including its mood, genre, and tempo. This research uses MPEG-7-based feature extraction performed by a Java library named MPEG7AudioEnc. The result of the feature extraction is metadata in the form of digital numbers that represent the characteristics of a signal for each feature. Once the features are obtained, the next step is to select the features required by each module using XQuery, implemented by a Java library named BaseX. The selected features are processed with the Discrete Wavelet Transform (DWT) at the best decomposition level, using a Python library named Pywt. After the features are processed, they are combined into a list and equalized in length for the classification process. The final stage is classification using a Support Vector Machine (SVM), which consists of two stages: training and prediction. The accuracy achieved in this research is 75% for the mood module, 87.5% for genre, and 80% for tempo.
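
    The wavelet and length-equalization steps of this pipeline can be sketched as below. This is only an illustration under assumptions: the abstract names neither the wavelet nor the equalization rule, so a Haar wavelet and zero-padding/truncation are used here, the transform is written out in plain Python rather than calling the library the authors used, and all function names are hypothetical. The SVM stage is omitted.

```python
import math


def haar_dwt(signal):
    """One level of the Haar DWT: returns (approximation, detail)."""
    signal = list(signal)
    if len(signal) % 2:
        signal.append(signal[-1])  # repeat the last sample for odd lengths
    root2 = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / root2
              for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / root2
              for i in range(0, len(signal), 2)]
    return approx, detail


def haar_wavedec(signal, level):
    """Multi-level decomposition, ordered [cA_level, cD_level, ..., cD_1]."""
    coeffs = []
    approx = list(signal)
    for _ in range(level):
        approx, detail = haar_dwt(approx)
        coeffs.insert(0, detail)
    coeffs.insert(0, approx)
    return coeffs


def fit_length(vector, length):
    """Truncate or zero-pad a vector so every sample has the same length."""
    vector = list(vector)[:length]
    return vector + [0.0] * (length - len(vector))


def build_feature_vector(features, level, length):
    """Decompose each feature series, concatenate, and equalize the length."""
    merged = []
    for series in features:
        for band in haar_wavedec(series, level):
            merged.extend(band)
    return fit_length(merged, length)


if __name__ == "__main__":
    # Hypothetical numeric feature series standing in for MPEG-7 descriptors.
    features = [[0.1, 0.4, 0.3, 0.9, 0.2], [5.0, 5.5, 6.0, 5.8]]
    print(build_feature_vector(features, level=2, length=16))
```

    Vectors built this way all have the same length, which is what a classifier such as an SVM requires for its training and prediction stages.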

  16. Design and Development of the MusicMoo Application Using the MIR (Music Information Retrieval) Method in the Mood, Genre Recognition, and Tempo Estimation Modules

    Directory of Open Access Journals (Sweden)

    Johanes Andre Ridoean

    2017-03-01

    Full Text Available Currently, methods for retrieving information about a piece of music, commonly called Music Information Retrieval (MIR), are widely applied, for example in applications such as Shazam and SoundHound. However, these two applications only identify which song is being played to them. The aim of this research is therefore to develop MIR further in a more specific direction, namely retrieving the matching song together with song details, including its mood, genre, and tempo. This research uses MPEG-7-based feature extraction performed by a Java library named MPEG7AudioEnc. The result of the feature extraction is metadata containing the features as digital numbers that represent the characteristics of a signal. The features required by each module are then selected using XQuery, implemented by a Java library named BaseX. The selected features are processed with the Discrete Wavelet Transform (DWT) at the best decomposition level, using a Python library named Pywt. After the DWT has been applied to the features, they are combined into a list and equalized in length for the classification process. The final stage is classification using a Support Vector Machine (SVM), which consists of two stages: training and prediction. The accuracy achieved in this research is 75% for the mood module, 87.5% for genre, and 80% for tempo.

  17. BioTCM-SE: A Semantic Search Engine for the Information Retrieval of Modern Biology and Traditional Chinese Medicine

    Directory of Open Access Journals (Sweden)

    Xi Chen

    2014-01-01

    Full Text Available Understanding the functional mechanisms of the complex biological system as a whole is drawing more and more attention in global health care management. Traditional Chinese Medicine (TCM), essentially different from Western Medicine (WM), is gaining increasing attention due to its emphasis on individual wellness and natural herbal medicine, which satisfies the goal of integrative medicine. However, with the explosive growth of biomedical data on the Web, biomedical researchers are now confronted with the problem of large-scale data analysis and data query. Besides that, biomedical data also has a wide coverage which usually comes from multiple heterogeneous data sources and has different taxonomies, making it hard to integrate and query the big biomedical data. Embedded with domain knowledge from different disciplines all regarding human biological systems, the heterogeneous data repositories are implicitly connected by human expert knowledge. Traditional search engines cannot provide accurate and comprehensive search results for the semantically associated knowledge since they only support keywords-based searches. In this paper, we present BioTCM-SE, a semantic search engine for the information retrieval of modern biology and TCM, which provides biologists with a comprehensive and accurate associated knowledge query platform to greatly facilitate the implicit knowledge discovery between WM and TCM.

  18. Applications of the BIOPHYS Algorithm for Physically-Based Retrieval of Biophysical, Structural and Forest Disturbance Information

    Science.gov (United States)

    Peddle, Derek R.; Huemmrich, K. Fred; Hall, Forrest G.; Masek, Jeffrey G.; Soenen, Scott A.; Jackson, Chris D.

    2011-01-01

    Canopy reflectance model inversion using look-up table approaches provides powerful and flexible options for deriving improved forest biophysical structural information (BSI) compared with traditional statistical empirical methods. The BIOPHYS algorithm is an improved, physically-based inversion approach for deriving BSI for independent use and validation and for monitoring, inventory and quantifying forest disturbance as well as input to ecosystem, climate and carbon models. Based on the multiple-forward mode (MFM) inversion approach, BIOPHYS results were summarized from different studies (Minnesota/NASA COVER; Virginia/LEDAPS; Saskatchewan/BOREAS), sensors (airborne MMR; Landsat; MODIS) and models (GeoSail; GOMS). Applications output included forest density, height, crown dimension, branch and green leaf area, canopy cover, disturbance estimates based on multi-temporal chronosequences, and structural change following recovery from forest fires over the last century. Good correspondences with validation field data were obtained. Integrated analyses of multiple solar and view angle imagery further improved retrievals compared with single pass data. Quantifying ecosystem dynamics such as the area and percent of forest disturbance, early regrowth and succession provide essential inputs to process-driven models of carbon flux. BIOPHYS is well suited for large-area, multi-temporal applications involving multiple image sets and mosaics for assessing vegetation disturbance and quantifying biophysical structural dynamics and change. It is also suitable for integration with forest inventory, monitoring, updating, and other programs.

  19. BioTCM-SE: a semantic search engine for the information retrieval of modern biology and traditional Chinese medicine.

    Science.gov (United States)

    Chen, Xi; Chen, Huajun; Bi, Xuan; Gu, Peiqin; Chen, Jiaoyan; Wu, Zhaohui

    2014-01-01

    Understanding the functional mechanisms of the complex biological system as a whole is drawing more and more attention in global health care management. Traditional Chinese Medicine (TCM), essentially different from Western Medicine (WM), is gaining increasing attention due to its emphasis on individual wellness and natural herbal medicine, which satisfies the goal of integrative medicine. However, with the explosive growth of biomedical data on the Web, biomedical researchers are now confronted with the problem of large-scale data analysis and data query. Besides that, biomedical data also has a wide coverage which usually comes from multiple heterogeneous data sources and has different taxonomies, making it hard to integrate and query the big biomedical data. Embedded with domain knowledge from different disciplines all regarding human biological systems, the heterogeneous data repositories are implicitly connected by human expert knowledge. Traditional search engines cannot provide accurate and comprehensive search results for the semantically associated knowledge since they only support keywords-based searches. In this paper, we present BioTCM-SE, a semantic search engine for the information retrieval of modern biology and TCM, which provides biologists with a comprehensive and accurate associated knowledge query platform to greatly facilitate the implicit knowledge discovery between WM and TCM.

  20. Interactive Information Retrieval:

    DEFF Research Database (Denmark)

    Borlund, Pia

    theoretical framework to describe partly the various types of IIR, and partly how IIR nowadays often is carried out in a seamless task switching IT environment on various platforms, including via apps. This type of environment furthermore calls for new methodologies to study the IIR behaviour in the habitat...... advantage....