WorldWideScience

Sample records for web information retrieval

  1. Challenges in Web Information Retrieval

    Science.gov (United States)

    Arora, Monika; Kanjilal, Uma; Varshney, Dinesh

    The major challenge in information access is the wealth of data available for retrieval, which has driven the development of principled approaches and strategies for searching. Search has become the leading paradigm for finding information on the World Wide Web. Building a successful web retrieval search engine model raises a number of challenges at different levels, where techniques such as Usenet and support vector machines are employed to significant effect. The present investigation explores a number of problems, identified by level, related to finding information on the web. This paper attempts to examine these issues by applying different methods such as web graph analysis, the retrieval and analysis of newsgroup postings, and statistical methods for inferring meaning in text. We also discuss how one can gain control over the vast amounts of data on the web by addressing these problems in innovative ways that can greatly improve on standard approaches. The proposed model thus assists users in finding the existing data they need. The developed information retrieval model provides access to information available in various modes and media formats, and helps users retrieve relevant and comprehensive information efficiently and effectively according to their requirements. This paper also discusses the parameters and factors responsible for efficient searching. These parameters can be classified as more or less important based on the available inputs. The important parameters can be taken into account in the future extension or development of search engines.

  2. Emergent web intelligence advanced information retrieval

    CERN Document Server

    Badr, Youakim; Abraham, Ajith; Hassanien, Aboul-Ella

    2010-01-01

    Web Intelligence explores the impact of artificial intelligence and advanced information technologies representing the next generation of Web-based systems, services, and environments, and designing hybrid web systems that serve wired and wireless users more efficiently. Multimedia and XML-based data are produced regularly and in increasing volume in our daily digital activities, and their retrieval must be explored and studied in this emergent web-based era. 'Emergent Web Intelligence: Advanced Information Retrieval' provides reviews of the related cutting-edge technologies and insights. It is v

  3. Web Information Retrieval System for Technological Forecasting

    OpenAIRE

    Montiel, Raúl; Lezcano Airaldi, Luis; Favret, Fabián; Eckert, Karina

    2017-01-01

    Technological Forecasting and Competitive Intelligence are two different disciplines that, used together, provide the organizations with an invaluable analytic tool for the environment and the competing companies’ behavior. This kind of technology can be used for extracting useful information to make strategic decisions. This paper describes a Web mining system which gathers the users’ information requirements through a series of guided questions, constructs various search keys with the answe...

  4. Towards an Intelligent Possibilistic Web Information Retrieval Using Multiagent System

    Science.gov (United States)

    Elayeb, Bilel; Evrard, Fabrice; Zaghdoud, Montaceur; Ahmed, Mohamed Ben

    2009-01-01

    Purpose: The purpose of this paper is to make a scientific contribution to web information retrieval (IR). Design/methodology/approach: A multiagent system for web IR is proposed based on new technologies: Hierarchical Small-Worlds (HSW) and Possibilistic Networks (PN). This system is based on a possibilistic qualitative approach which extends the…

  5. Introduction to Web Information Retrieval: A User Perspective

    Indian Academy of Sciences (India)

    Resonance – Journal of Science Education, Volume 7, Issue 6. Introduction to Web Information Retrieval: A User Perspective - How to get ... Srinath Srinivasa; Pramod Chandra P Bhatt. Indian Institute of Information Technology, International Technology Park, Whitefield Road, Bangalore 560066, India.

  6. Web-based multimedia information retrieval for clinical application research

    Science.gov (United States)

    Cao, Xinhua; Hoo, Kent S., Jr.; Zhang, Hong; Ching, Wan; Zhang, Ming; Wong, Stephen T. C.

    2001-08-01

    We described a web-based data warehousing method for retrieving and analyzing neurological multimedia information. The web-based method supports convenient access, effective search and retrieval of clinical textual and image data, and on-line analysis. To improve the flexibility and efficiency of multimedia information query and analysis, a three-tier, multimedia data warehouse for epilepsy research has been built. The data warehouse integrates clinical multimedia data related to epilepsy from disparate sources and archives them into a well-defined data model.

  7. Improving life sciences information retrieval using semantic web technology.

    Science.gov (United States)

    Quan, Dennis

    2007-05-01

    The ability to retrieve relevant information is at the heart of every aspect of research and development in the life sciences industry. Information is often distributed across multiple systems and recorded in a way that makes it difficult to piece together the complete picture. Differences in data formats, naming schemes and network protocols amongst information sources, both public and private, must be overcome, and user interfaces not only need to be able to tap into these diverse information sources but must also assist users in filtering out extraneous information and highlighting the key relationships hidden within an aggregated set of information. The Semantic Web community has made great strides in proposing solutions to these problems, and many efforts are underway to apply Semantic Web techniques to the problem of information retrieval in the life sciences space. This article gives an overview of the principles underlying a Semantic Web-enabled information retrieval system: creating a unified abstraction for knowledge using the RDF semantic network model; designing semantic lenses that extract contextually relevant subsets of information; and assembling semantic lenses into powerful information displays. Furthermore, concrete examples of how these principles can be applied to life science problems including a scenario involving a drug discovery dashboard prototype called BioDash are provided.

  8. Web User Profile Using XUL and Information Retrieval Techniques

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2008-12-01

    This paper presents the importance of the user profile in information retrieval, information filtering and recommender systems using explicit and implicit feedback. A Firefox extension (based on XUL) used for gathering the data needed to infer a web user profile and an example file with collected data are presented. An algorithm for creating and updating the user profile and keeping track of a fixed number k of subjects of interest is also presented.

  9. Latest Trends in Web Information Retrieval and in SEO Factors

    Directory of Open Access Journals (Sweden)

    Carlos Gonzalo

    2015-07-01

    Latest trends in web information retrieval and in SEO factors are increasingly focused on signals from users, such as the profile of who performs the search and the interpretation of user intent. The objective of search engines is twofold: focusing as much as possible on users while making the composition of the search engine results page (SERP) ever less predictable, and combating spam.

  10. Probabilistic Information Integration and Retrieval in the Semantic Web

    Science.gov (United States)

    Predoiu, Livia

    The Semantic Web (SW) has been envisioned to enable software tools or Web Services, respectively, to process information provided on the Web automatically. For this purpose, languages for representing the semantics of data by means of ontologies have been proposed, such as RDF(S) and OWL. While the semantics of RDF(S) requires a non-standard model theory that goes beyond first order logics, OWL is intended to model subsets of first order logics. OWL consists of three variants that are layered on each other. The less expressive variants OWL Lite and OWL DL correspond to the Description Logics SHIF(D) and SHOIN(D) [1], respectively, and thus to subsets of First Order Logics [2].

  11. Evaluation of Multi Layers Web-based GIS Approach in Retrieving Tourist Related Information

    OpenAIRE

    Rosilawati Zainol; Zainab Abu Bakar

    2014-01-01

    Geo-based information is gaining importance among tourists. However, retrieving this information on the web depends heavily on the methods of dissemination. Therefore, this study intends to evaluate methods used in disseminating tourist related geo-based information on the web using partial match query, firstly in the default system, which is a single-layer approach, and secondly using multi-layer web-based Geographic Information System (GIS) approaches. Shah Alam tourist related data are...

  12. Millennial Undergraduate Research Strategies in Web and Library Information Retrieval Systems

    Science.gov (United States)

    Porter, Brandi

    2011-01-01

    This article summarizes the author's dissertation regarding search strategies of millennial undergraduate students in Web and library online information retrieval systems. Millennials bring a unique set of search characteristics and strategies to their research since they have never known a world without the Web. Through the use of search engines,…

  13. A novel architecture for information retrieval system based on semantic web

    Science.gov (United States)

    Zhang, Hui

    2011-12-01

    Nowadays, the web has enabled an explosive growth of information sharing (there are currently over 4 billion pages covering most areas of human endeavor), so that the web faces a new challenge of information overload. The challenge now before us is not only to help people locate relevant information precisely but also to access and aggregate a variety of information from different resources automatically. Current web documents are in human-oriented formats that are suitable for presentation, but machines cannot understand the meaning of a document. To address this issue, Berners-Lee proposed the concept of the semantic web. With semantic web technology, web information can be understood and processed by machines, which provides new possibilities for automatic web information processing. A main problem of semantic web information retrieval is that when there is not enough knowledge available to such an information retrieval system, the system will return a large number of meaningless results to users because of the huge number of results. In this paper, we present the architecture of an information retrieval system based on the semantic web. In addition, our system employs an inference engine to check whether a query should be posed to the keyword-based search engine or to the semantic search engine.
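
    The routing step mentioned at the end of this abstract can be illustrated with a minimal sketch. The record does not say how the inference engine decides, so the rule used here (send a query to the semantic engine only when it mentions at least one concept the knowledge base knows) is an assumption, and `KNOWN_CONCEPTS`, `route_query`, and `RoutingDecision` are hypothetical names, not part of the described system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RoutingDecision:
    engine: str              # "semantic" or "keyword"
    matched_concepts: tuple  # known concepts found in the query


# Hypothetical vocabulary; a real system would load concepts from its ontology.
KNOWN_CONCEPTS = frozenset({"protein", "enzyme", "gene"})


def route_query(query, concepts=KNOWN_CONCEPTS, min_matches=1):
    """Assumed rule: use the semantic engine only when the query mentions
    at least `min_matches` concepts the knowledge base knows about."""
    terms = [t.strip(".,?!").lower() for t in query.split()]
    matched = tuple(sorted({t for t in terms if t in concepts}))
    engine = "semantic" if len(matched) >= min_matches else "keyword"
    return RoutingDecision(engine, matched)


if __name__ == "__main__":
    print(route_query("Which enzyme acts on this protein?"))
    print(route_query("cheap flights to Paris"))
```

    Falling back to keyword search when no concept matches is one way to avoid the meaningless results the abstract attributes to missing knowledge; a real inference engine would presumably reason over the ontology rather than match surface terms.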

  14. Information Retrieval Strategies of Millennial Undergraduate Students in Web and Library Database Searches

    Science.gov (United States)

    Porter, Brandi

    2009-01-01

    Millennial students make up a large portion of undergraduate students attending colleges and universities, and they have a variety of online resources available to them to complete academically related information searches, primarily Web based and library-based online information retrieval systems. The content, ease of use, and required search…

  15. Comparing the Scale of Web Subject Directories Precision in Technical-Engineering Information Retrieval

    Directory of Open Access Journals (Sweden)

    Mehrdokht Wazirpour Keshmiri

    2012-07-01

    The main purpose of this research was to compare the precision of web subject directories in retrieving technical-engineering information. Information gathering was documentary and webometric. Keywords in technical-engineering science were chosen for twenty different subjects from IEEE (Institute of Electrical and Electronics Engineers) and from engineering magazines on the ScienceDirect site. These keywords were used in five heavily used web subject directories: Yahoo, Google, Infomine, Intute, and Dmoz. The first results returned by search tools are usually the most closely related to the search keywords; for this reason, the first ten results of every search were evaluated. These assessments consisted of the degree of precision, the degree of error, and the ratio of items retrieved in technical-engineering categories to all items retrieved. The criteria used for determining precision, which followed standards widely used in the literature, consisted of the presence of the keywords in the title, the appearance of keywords in the body of retrieved web pages, keyword adjacency, the URL of the page, the page description, and subject categories. Information was analyzed using the Kruskal-Wallis test and Fisher's LSD. Results revealed a meaningful difference in the precision of web subject directories in retrieving technical-engineering information; therefore the hypothesis was confirmed. Web subject directories ranked by precision as follows: Google, Yahoo, Intute, Dmoz, and Infomine. The degree of error observed in the first results was another criterion used for comparing web subject directories. In this research, Yahoo had the lowest degree of error and Infomine the highest. This research also compared the number of items retrieved across all categories of the web subject directories with the number retrieved in technical-engineering categories; results revealed a meaningful difference between them. And
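
    The evaluation in this record judges the first ten results of every search. A minimal sketch of that kind of measurement is given below, assuming each result has already been judged relevant or not; how the study combined its criteria (keywords in title, adjacency, URL, and so on) into a judgment is not given in the record, and the function names and directory labels here are hypothetical.

```python
from statistics import mean


def precision_at_k(judgments, k=10):
    """Share of the first k results judged relevant.

    Divides by k even when fewer than k results came back, so a short
    result list is penalised (a convention chosen for this sketch).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    top = list(judgments)[:k]
    return sum(1 for judged_relevant in top if judged_relevant) / k


def rank_by_mean_precision(runs, k=10):
    """Rank directories by mean precision@k over their searches, best first."""
    scores = {
        name: mean(precision_at_k(search, k) for search in searches)
        for name, searches in runs.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Made-up judgments for two placeholder directories, not data from the study.
    runs = {
        "directory_a": [[True] * 10, [True] * 5 + [False] * 5],
        "directory_b": [[False] * 10],
    }
    print(rank_by_mean_precision(runs))
```

    The significance tests named in the record (Kruskal-Wallis, Fisher's LSD) would then be applied to the per-search scores; they are left out of this sketch.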

  16. Web Information Seeking and Retrieval in Digital Library Contexts: Towards an Intelligent Agent Solution.

    Science.gov (United States)

    Detlor, Brian; Arsenault, Clement

    2002-01-01

    Discusses the role of intelligent agents in facilitating the seeking and retrieval of information in Web-based library environments. Highlights include an overview of agents; current applications in library domains; an agent-based model for libraries; the design of interface agents; and implications for library policy and digital collections.…

  17. Bat-Inspired Algorithm Based Query Expansion for Medical Web Information Retrieval.

    Science.gov (United States)

    Khennak, Ilyes; Drias, Habiba

    2017-02-01

    With the increasing amount of medical data available on the Web, looking for health information has become one of the most widely searched topics on the Internet. Patients and people of several backgrounds are now using Web search engines to acquire medical information, including information about a specific disease, medical treatment or professional advice. Nonetheless, due to a lack of medical knowledge, many laypeople have difficulties in forming appropriate queries to articulate their inquiries, which renders their search queries imprecise due to the use of unclear keywords. The use of these ambiguous and vague queries to describe the patients' needs has resulted in a failure of Web search engines to retrieve accurate and relevant information. One of the most natural and promising methods to overcome this drawback is Query Expansion. In this paper, an original approach based on the Bat Algorithm is proposed to improve the retrieval effectiveness of query expansion in the medical field. In contrast to the existing literature, the proposed approach uses the Bat Algorithm to find the best expanded query among a set of expanded query candidates, while maintaining low computational complexity. Moreover, this new approach allows the length of the expanded query to be determined empirically. Numerical results on MEDLINE, the on-line medical information database, show that the proposed approach is more effective and efficient than the baseline.

  18. Information Retrieval Models

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Göker, Ayse; Davies, John

    2009-01-01

    Many applications that handle information on the internet would be completely inadequate without the support of information retrieval technology. How would we find information on the world wide web if there were no web search engines? How would we manage our email without spam filtering? Much of the

  19. Intelligent web image retrieval system

    Science.gov (United States)

    Hong, Sungyong; Lee, Chungwoo; Nah, Yunmook

    2001-07-01

    Recently, web sites such as e-business sites and shopping mall sites have come to deal with large amounts of image information. To find a specific image from these image sources, we usually use web search engines or image database engines, which rely on keyword-only retrieval or color-based retrieval with limited search capabilities. This paper presents an intelligent web image retrieval system. We propose the system architecture, the texture and color based image classification and indexing techniques, and representation schemes of user usage patterns. The query can be given by providing keywords, by selecting one or more sample texture patterns, by assigning color values within positional color blocks, or by combining some or all of these factors. The system keeps track of users' preferences by generating user query logs and automatically adds more search information to subsequent user queries. To show the usefulness of the proposed system, some experimental results showing recall and precision are also explained.

  20. OntoTrader: an ontological Web trading agent approach for environmental information retrieval.

    Science.gov (United States)

    Iribarne, Luis; Padilla, Nicolás; Ayala, Rosa; Asensio, José A; Criado, Javier

    2014-01-01

    Modern Web-based Information Systems (WIS) are becoming increasingly necessary to provide support for users who are in different places with different types of information, by facilitating their access to the information, decision making, workgroups, and so forth. Design of these systems requires the use of standardized methods and techniques that enable a common vocabulary to be defined to represent the underlying knowledge. Thus, mediation elements such as traders enrich the interoperability of web components in open distributed systems. These traders must operate with other third-party traders and/or agents in the system, which must also use a common vocabulary for communication between them. This paper presents the OntoTrader architecture, an Ontological Web Trading agent based on the OMG ODP trading standard. It also presents the ontology needed by some system agents to communicate with the trading agent and the behavioral framework for the SOLERES OntoTrader agent, an Environmental Management Information System (EMIS). This framework implements a "Query-Searching/Recovering-Response" information retrieval model using a trading service, SPARQL notation, and the JADE platform. The paper also presents reflection, delegation, and federation mediation models and describes formalization, an experimental testing environment in three scenarios, and a tool which allows our proposal to be evaluated and validated.

  1. OntoTrader: An Ontological Web Trading Agent Approach for Environmental Information Retrieval

    Directory of Open Access Journals (Sweden)

    Luis Iribarne

    2014-01-01

    Modern Web-based Information Systems (WIS) are becoming increasingly necessary to provide support for users who are in different places with different types of information, by facilitating their access to the information, decision making, workgroups, and so forth. Design of these systems requires the use of standardized methods and techniques that enable a common vocabulary to be defined to represent the underlying knowledge. Thus, mediation elements such as traders enrich the interoperability of web components in open distributed systems. These traders must operate with other third-party traders and/or agents in the system, which must also use a common vocabulary for communication between them. This paper presents the OntoTrader architecture, an Ontological Web Trading agent based on the OMG ODP trading standard. It also presents the ontology needed by some system agents to communicate with the trading agent and the behavioral framework for the SOLERES OntoTrader agent, an Environmental Management Information System (EMIS). This framework implements a “Query-Searching/Recovering-Response” information retrieval model using a trading service, SPARQL notation, and the JADE platform. The paper also presents reflection, delegation, and federation mediation models and describes formalization, an experimental testing environment in three scenarios, and a tool which allows our proposal to be evaluated and validated.

  2. Semantic information retrieval for geoscience resources : results and analysis of an online questionnaire of current web search experiences

    OpenAIRE

    Nkisi-Orji, I.

    2016-01-01

    An online questionnaire “Semantic web searches for geoscience resources” was completed by 35 staff of British Geological Survey (BGS) between 28th July 2015 and 28th August 2015. The questionnaire was designed to better understand current web search habits, preferences, and the reception of semantic search features in order to inform PhD research into the use of domain ontologies for semantic information retrieval. The key findings were that relevance ranking is important in fo...

  3. Introduction to information retrieval

    CERN Document Server

    Manning, Christopher D; Schütze, Hinrich

    2008-01-01

    Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced un

  4. Domainwise Web Page Optimization Based On Clustered Query Sessions Using Hybrid Of Trust And ACO For Effective Information Retrieval

    Directory of Open Access Journals (Sweden)

    Dr. Suruchi Chawla

    2015-08-01

    In this paper, a hybrid of Ant Colony Optimization (ACO) and trust has been used for domainwise web page optimization in clustered query sessions for effective information retrieval. The trust of a web page identifies its degree of relevance in satisfying a specific information need of the user. The trusted web pages, when optimized using pheromone updates in ACO, will identify the trusted colonies of web pages which will be relevant to the user's information need in a given domain. Hence, in this paper the hybrid of trust and ACO has been used on clustered query sessions to identify a larger number of relevant documents in a given domain in order to better satisfy the information need of the user. An experiment was conducted on a data set of web query sessions to test the effectiveness of the proposed approach in three selected domains (Academics, Entertainment, and Sports), and the results confirm the improvement in the precision of search results.

  5. Using the open Web as an information resource and scholarly Web search engines as retrieval tools for academic and research purposes

    Directory of Open Access Journals (Sweden)

    Filistea Naude

    2010-08-01

    This study provided insight into the significance of the open Web as an information resource and Web search engines as research tools amongst academics. The academic staff establishment of the University of South Africa (Unisa) was invited to participate in a questionnaire survey and included 1188 staff members from five colleges. This study culminated in a PhD dissertation in 2008. One hundred and eighty seven respondents participated in the survey, which gave a response rate of 15.7%. The results of this study show that academics have indeed accepted the open Web as a useful information resource and Web search engines as retrieval tools when seeking information for academic and research work. The majority of respondents used the open Web and Web search engines on a daily or weekly basis to source academic and research information. The main obstacles presented by using the open Web and Web search engines included lack of time to search and browse the Web, information overload, poor network speed and the slow downloading speed of webpages.

  6. Using the open Web as an information resource and scholarly Web search engines as retrieval tools for academic and research purposes

    Directory of Open Access Journals (Sweden)

    Filistea Naude

    2010-12-01

    This study provided insight into the significance of the open Web as an information resource and Web search engines as research tools amongst academics. The academic staff establishment of the University of South Africa (Unisa) was invited to participate in a questionnaire survey and included 1188 staff members from five colleges. This study culminated in a PhD dissertation in 2008. One hundred and eighty seven respondents participated in the survey, which gave a response rate of 15.7%. The results of this study show that academics have indeed accepted the open Web as a useful information resource and Web search engines as retrieval tools when seeking information for academic and research work. The majority of respondents used the open Web and Web search engines on a daily or weekly basis to source academic and research information. The main obstacles presented by using the open Web and Web search engines included lack of time to search and browse the Web, information overload, poor network speed and the slow downloading speed of webpages.

  7. Design and development of semantic web-based system for computer science domain-specific information retrieval

    Directory of Open Access Journals (Sweden)

    Ritika Bansal

    2016-09-01

    In a semantic web-based system, the concept of ontology is used to retrieve results by the contextual meaning of the input query instead of keyword matching. From the research literature, there seems to be a need for a tool which can provide an easy interface for complex queries in natural language that can retrieve the domain-specific information from the ontology. This research paper proposes an IRSCSD system (Information retrieval system for computer science domain) as a solution. This system offers advanced querying and browsing of structured data with search results automatically aggregated and rendered directly in a consistent user interface, thus reducing the manual effort of users. So, the main objective of this research is the design and development of a semantic web-based system for integrating ontology towards domain-specific retrieval support. The methodology followed is a piecemeal research which involves the following stages. The first stage involves designing the framework for the semantic web-based system. The second stage builds the prototype for the framework using the Protégé tool. The third stage deals with the conversion of natural language queries into the SPARQL query language using the Python-based QUEPY framework. The fourth stage involves firing the converted SPARQL queries at the ontology through Apache's Jena API to fetch the results. Lastly, the prototype has been evaluated in order to ensure its efficiency and usability. Thus, this research paper throws light on framework development for a semantic web-based system that assists in efficient retrieval of domain-specific information, natural language query interpretation into semantic web language, creation of a domain-specific ontology and its mapping with related ontologies. This research paper also provides approaches and metrics for ontology evaluation on the prototype ontology developed, to study the performance based on accessibility of required domain-related information.

  8. Introduction to information retrieval

    CERN Document Server

    Manning, Christopher D; Schütze, Hinrich

    2008-01-01

    Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.

  9. Embedding Web-Based Statistical Translation Models in Cross-Language Information Retrieval

    NARCIS (Netherlands)

    Kraaij, W.; Nie, J.Y.; Simard, M.

    2003-01-01

    Although more and more language pairs are covered by machine translation (MT) services, there are still many pairs that lack translation resources. Cross-language information retrieval (CLIR) is an application that needs translation functionality of a relatively low level of sophistication, since

  10. An Empirical Comparison of Visualization Tools To Assist Information Retrieval on the Web.

    Science.gov (United States)

    Heo, Misook; Hirtle, Stephen C.

    2001-01-01

    Discusses problems with navigation in hypertext systems, including cognitive overload, and describes a study that tested information visualization techniques to see which best represented the underlying structure of Web space. Considers the effects of visualization techniques on user performance on information searching tasks and the effects of…

  11. Exploring topic-based language models for effective web information retrieval

    NARCIS (Netherlands)

    Li, R.; Kaptein, R.; Hiemstra, D.; Kamps, J.

    2008-01-01

    The main obstacle for providing focused search is the relative opaqueness of search requests: searchers tend to express their complex information needs in only a couple of keywords. Our overall aim is to find out if, and how, topic-based language models can lead to more effective web information

  12. Exploring Topic-based Language Models for Effective Web Information Retrieval

    NARCIS (Netherlands)

    Li, R.; Kaptein, Rianne; Hiemstra, Djoerd; Kamps, Jaap; Hoenkamp, E.; De Cock, M.; Hoste, V.

    2008-01-01

    The main obstacle for providing focused search is the relative opaqueness of search request -- searchers tend to express their complex information needs in only a couple of keywords. Our overall aim is to find out if, and how, topic-based language models can lead to more effective web information

  13. SWHi system description : A case study in information retrieval, inference, and visualization in the Semantic Web

    NARCIS (Netherlands)

    Fahmi, Ismail; Zhang, Junte; Ellermann, Henk; Bouma, Gosse; Franconi, E; Kifer, M; May, W

    2007-01-01

    Search engines have become the most popular tools for finding information on the Internet. A real-world Semantic Web application can benefit from this by combining its features with some features from search engines. In this paper, we describe methods for indexing and searching a populated ontology

  14. Mobile medical visual information retrieval.

    Science.gov (United States)

    Depeursinge, Adrien; Duc, Samuel; Eggel, Ivan; Müller, Henning

    2012-01-01

    In this paper, we propose mobile access to peer-reviewed medical information based on textual search and content-based visual image retrieval. Web-based interfaces designed for limited screen space were developed to query via web services a medical information retrieval engine optimizing the amount of data to be transferred in wireless form. Visual and textual retrieval engines with state-of-the-art performance were integrated. Results obtained show a good usability of the software. Future use in clinical environments has the potential of increasing quality of patient care through bedside access to the medical literature in context.

  15. The ADAM project: a generic web interface for retrieval and display of ATLAS TDAQ information.

    CERN Document Server

    Harwood, A; The ATLAS collaboration; Magnoni, L; Vandelli, W; Savu, D

    2011-01-01

    This paper describes a new approach to the visualization of stored information about the operation of the ATLAS Trigger and Data Acquisition system. ATLAS is one of the two general purpose detectors positioned along the Large Hadron Collider at CERN. Its data acquisition system consists of several thousand computers interconnected via multiple gigabit Ethernet networks, that are constantly monitored via different tools. Operational parameters ranging from the temperature of the computers to the network utilization are stored in several databases for later analysis. Although the ability to view these data-sets individually is already in place, currently there is no way to view this data together, in a uniform format, from one location. The ADAM project has been launched in order to overcome this limitation. It defines a uniform web interface to collect data from multiple providers that have different structures. It is capable of aggregating and correlating the data according to user defined criteria. Finally, ...

  16. ADAM Project – A generic web interface for retrieval and display of ATLAS TDAQ information.

    CERN Document Server

    Harwood, A; The ATLAS collaboration; Lehmann Miotto, G

    2011-01-01

    This paper describes a new approach to the visualization of stored information about the operation of the ATLAS Trigger and Data Acquisition system. ATLAS is one of the two general purpose detectors positioned along the Large Hadron Collider at CERN. Its data acquisition system consists of several thousand computers interconnected via multiple gigabit Ethernet networks, that are constantly monitored via different tools. Operational parameters ranging from the temperature of the computers to the network utilization are stored in several databases for a posteriori analysis. Although the ability to view these data-sets individually is already in place, there currently is no way to view this data together, in a uniform format, from one location. The ADAM project has been launched in order to overcome this limitation. It defines a uniform web interface to collect data from multiple diversely structured providers. It is capable of aggregating and correlating the data according to user defined criteria. Finally it v...

  17. LinkHub: a Semantic Web system that facilitates cross-database queries and information retrieval in proteomics

    Directory of Open Access Journals (Sweden)

    Cheung Kei-Hoi

    2007-05-01

    Background: A key abstraction in representing proteomics knowledge is the notion of unique identifiers for individual entities (e.g. proteins) and the massive graph of relationships among them. These relationships are sometimes simple (e.g. synonyms) but are often more complex (e.g. one-to-many relationships in protein family membership). Results: We have built a software system called LinkHub using Semantic Web RDF that manages the graph of identifier relationships and allows exploration with a variety of interfaces. For efficiency, we also provide relational-database access and translation between the relational and RDF versions. LinkHub is practically useful in creating small, local hubs on common topics and then connecting these to major portals in a federated architecture; we have used LinkHub to establish such a relationship between UniProt and the North East Structural Genomics Consortium. LinkHub also facilitates queries and access to information and documents related to identifiers spread across multiple databases, acting as "connecting glue" between different identifier spaces. We demonstrate this with example queries discovering "interologs" of yeast protein interactions in the worm and exploring the relationship between gene essentiality and pseudogene content. We also show how "protein family based" retrieval of documents can be achieved. LinkHub is available at hub.gersteinlab.org and hub.nesg.org with supplement, database models and full-source code. Conclusion: LinkHub leverages Semantic Web standards-based integrated data to provide novel information retrieval to identifier-related documents through relational graph queries, simplifies and manages connections to major hubs such as UniProt, and provides useful interactive and query interfaces for exploring the integrated data.

  18. Blueprint of a Cross-Lingual Web Retrieval Collection

    NARCIS (Netherlands)

    Sigurbjörnsson, B.; Kamps, J.; de Rijke, M.; van Zwol, R.

    2005-01-01

    The world wide web is a natural setting for cross-lingual information retrieval; web content is essentially multilingual, and web searchers are often polyglots. Even though English has emerged as the lingua franca of the web, planning for a business trip or holiday usually involves digesting pages

  19. Towards Distributed Information Retrieval in the Semantic Web: Query Reformulation Using the oMAP Framework

    NARCIS (Netherlands)

    U. Straccia; R. Troncy (Raphael)

    2006-01-01

    This paper introduces a general methodology for performing distributed search in the Semantic Web. We propose to define this task as a three-step process, namely resource selection, query reformulation/ontology alignment and rank aggregation/data fusion. For the second problem, we have

  20. Indexing and Retrieval for the Web.

    Science.gov (United States)

    Rasmussen, Edie M.

    2003-01-01

    Explores current research on indexing and ranking as retrieval functions of search engines on the Web. Highlights include measuring search engine stability; evaluation of Web indexing and retrieval; Web crawlers; hyperlinks for indexing and ranking; ranking for metasearch; document structure; citation indexing; relevance; query evaluation;…

  1. Assessing Website Quality in Context: Retrieving Information about Genetically Modified Food on the Web

    Science.gov (United States)

    McInerney, Claire R.; Bird, Nora J.

    2005-01-01

    Introduction: Knowing the credibility of information about genetically modified food on the Internet is critical to the everyday life information seeking of consumers as they form opinions about this nascent agricultural technology. The Website Quality Evaluation Tool (WQET) is a valuable instrument that can be used to determine the credibility of…

  2. Private information retrieval

    CERN Document Server

    Yi, Xun; Bertino, Elisa

    2013-01-01

    This book deals with Private Information Retrieval (PIR), a technique allowing a user to retrieve an element from a server in possession of a database without revealing to the server which element is retrieved. PIR has been widely applied to protect the privacy of the user in querying a service provider on the Internet. For example, by PIR, one can query a location-based service provider about the nearest car park without revealing his location to the server. The first PIR approach was introduced by Chor, Goldreich, Kushilevitz and Sudan in 1995 in a multi-server setting, where the user retriev

  3. Topological Aspects of Information Retrieval.

    Science.gov (United States)

    Egghe, Leo; Rousseau, Ronald

    1998-01-01

    Discusses topological aspects of theoretical information retrieval, including retrieval topology; similarity topology; pseudo-metric topology; document spaces as topological spaces; Boolean information retrieval as a subsystem of any topological system; and proofs of theorems. (LRW)

  4. Music Information Retrieval.

    Science.gov (United States)

    Downie, J. Stephen

    2003-01-01

    Identifies MIR (Music Information Retrieval) computer system problems, historic influences, current state-of-the-art, and future MIR solutions through an examination of the multidisciplinary approach to MIR. Highlights include pitch; temporal factors; harmonics; tone; editorial, textual, and bibliographic facets; multicultural factors; locating…

  5. Information Retrieval Evaluation

    CERN Document Server

    Harman, Donna

    2011-01-01

    Evaluation has always played a major role in information retrieval, with the early pioneers such as Cyril Cleverdon and Gerard Salton laying the foundations for most of the evaluation methodologies in use today. The retrieval community has been extremely fortunate to have such a well-grounded evaluation paradigm during a period when most of the human language technologies were just developing. This lecture has the goal of explaining where these evaluation methodologies came from and how they have continued to adapt to the vastly changed environment in the search engine world today. The lecture

  6. Interactive Information Retrieval

    DEFF Research Database (Denmark)

    Borlund, Pia

    2013-01-01

    The paper introduces the research area of interactive information retrieval (IIR) from a historical point of view. Further, the focus here is on evaluation, because much research in IR deals with IR evaluation methodology due to the core research interest in IR performance, system interaction...... and satisfaction with retrieved information. In order to position IIR evaluation, the Cranfield model and the series of tests that led to the Cranfield model are outlined. Three iconic user-oriented studies and projects that all have contributed to how IIR is perceived and understood today are presented....... As a response to this call the ‘IIR evaluation model’ by Borlund (e.g., 2003a) is introduced. The objective of the IIR evaluation model is to facilitate IIR evaluation as close as possible to actual information searching and IR processes, though still in a relatively controlled evaluation environment, in which...

  7. Information, conservation and retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Eng, T. [Swedish Nuclear Fuel and Waste Management Co., Stockholm (Sweden); Norberg, E. [National Swedish Archives, Stockholm (Sweden); Torbacke, J. [Stockholm Univ. (Sweden). Dept. of History; Jensen, M. [Swedish Radiation Protection Inst., Stockholm (Sweden)

    1996-12-01

    The seminar took place on the Swedish ship for transportation of radioactive wastes, M/S Sigyn, which in summertime is used for exhibitions. The seminar treated items related to general information needs in society and questions related to radioactive waste, i.e. how knowledge about a waste repository should be passed on to future generations. Three contributions are contained in the report from the seminar and are indexed separately: 'Active preservation - otherwise no archives'; 'The conservation and dissemination of information - A democratic issue'; and 'Conservation and retrieval of information - Elements of a strategy to inform future societies about nuclear waste repositories'.

  8. Using the open Web as an information resource and scholarly Web search engines as retrieval tools for academic and research purposes

    OpenAIRE

    Filistea Naude; Chris Rensleigh; Adeline S.A. du Toit

    2010-01-01

    This study provided insight into the significance of the open Web as an information resource and Web search engines as research tools amongst academics. The academic staff establishment of the University of South Africa (Unisa) was invited to participate in a questionnaire survey and included 1188 staff members from five colleges. This study culminated in a PhD dissertation in 2008. One hundred and eighty-seven respondents participated in the survey, which gave a response rate of 15.7%. The re...

  9. Information retrieval implementing and evaluating search engines

    CERN Document Server

    Büttcher, Stefan; Cormack, Gordon V

    2016-01-01

    Information retrieval is the foundation for modern search engines. This textbook offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus -- a multiuser open-source information retrieval system developed by one of the authors and available online -- provides model implementations and a basis for student work. The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. In addition to its classroom use, Information Retrieval will be a valuable reference for professionals in computer science, computer engineering, and software engineering.

  10. Nuclear expert web mining system: monitoring and analysis of nuclear acceptance by information retrieval and opinion extraction on the Internet

    Energy Technology Data Exchange (ETDEWEB)

    Reis, Thiago; Barroso, Antonio C.O.; Imakuma, Kengo, E-mail: thiagoreis@usp.b, E-mail: barroso@ipen.b, E-mail: kimakuma@ipen.b [Instituto de Pesquisas Energeticas e Nucleares (IPEN/CNEN-SP), Sao Paulo, SP (Brazil)

    2011-07-01

    This paper presents a research initiative that aims to collect nuclear-related information and to analyze opinionated texts by mining the hypertextual data environment and social network web sites on the Internet. Unlike previous approaches that employed traditional statistical techniques, a novel Web Mining approach is proposed, built using the concept of Expert Systems, for massive and autonomous data collection and analysis. The initial step has been accomplished, resulting in a framework design that is able to gradually encompass a set of evolving techniques, methods, and theories in such a way that this work will build a platform upon which new research can be performed more easily by just substituting modules or plugging in new ones. Upon completion it is expected that this research will contribute to the understanding of the population's views on nuclear technology and its acceptance. (author)

  11. Language Processing in Information Retrieval.

    Science.gov (United States)

    Doszkocs, Tamase

    1986-01-01

    Examines role and contributions of natural-language processing in information retrieval and artificial intelligence research in context of large operational information retrieval systems and services. State-of-the-art information retrieval systems combining the functional capabilities of conventional inverted file term adjacency approach with…

  12. Multimedia Information Retrieval

    CERN Document Server

    Rueger, Stefan

    2009-01-01

    At its very core multimedia information retrieval means the process of searching for and finding multimedia documents; the corresponding research field is concerned with building the best possible multimedia search engines. The intriguing bit here is that the query itself can be a multimedia excerpt: For example, when you walk around in an unknown place and stumble across an interesting landmark, would it not be great if you could just take a picture with your mobile phone and send it to a service that finds a similar picture in a database and tells you more about the building -- and about its

  13. Changing Information Retrieval Behaviours

    DEFF Research Database (Denmark)

    Constantiou, Ioanna D.; Lehrer, Christiane; Hess, Thomas

    2014-01-01

    The introduction of smartphones and the accompanying profusion of mobile data services have had a profound effect on individuals' lives. One of the most influential service categories is location-based services (LBS). Based on insights from behavioural decision-making, a conceptual framework...... is developed to analyse individuals' decisions to use LBS, focusing on the cognitive processes involved in the decision-making. Our research is based on two studies. First, we investigate the use of LBS through semi-structured interviews of smartphone users. Second, we explore daily LBS use through a study...... on the continuance of LBS use and indicate changes in individuals' information retrieval behaviours in everyday life. In particular, the distinct value dimension of LBS in specific contexts of use changes individuals' behaviours towards accessing location-related information....

  14. [Review of:] Dirk Lewandowski, Web Information Retrieval: Technologien zur Informationssuche im Internet. Frankfurt am Main: DGI, 2005. 248 S. (DGI-Schrift; Informationswissenschaft 7), ISBN 3-925474-55-2

    OpenAIRE

    Oberhauser, Otto

    2005-01-01

    Book review of Dirk Lewandowski, Web Information Retrieval: Technologien zur Informationssuche im Internet. Frankfurt am Main: DGI, 2005. 248 p. (DGI-Schrift; Informationswissenschaft 7), ISBN 3-925474-55-2. A well-investigated and easy to read state-of-the-art report on web search engines.

  15. Advanced Topics in Information Retrieval

    CERN Document Server

    Melucci, Massimo

    2011-01-01

    Information retrieval is the science concerned with the effective and efficient retrieval of documents starting from their semantic content. It is employed to fulfill some information need from a large number of digital documents. Given the ever-growing amount of documents available and the heterogeneous data structures used for storage, information retrieval has recently faced and tackled novel applications. In this book, Melucci and Baeza-Yates present a wide-spectrum illustration of recent research results in advanced areas related to information retrieval. Readers will find chapters on e.g

  16. Click Model-Based Information Retrieval Metrics

    NARCIS (Netherlands)

    Chuklin, A.; Serdyukov, P.; de Rijke, M.

    2013-01-01

    In recent years many models have been proposed that are aimed at predicting clicks of web search users. In addition, some information retrieval evaluation metrics have been built on top of a user model. In this paper we bring these two directions together and propose a common approach to converting

  17. Application of Google Maps API service for creating web map of information retrieved from CORINE land cover databases

    Directory of Open Access Journals (Sweden)

    Kilibarda Milan

    2010-01-01

    Today, the Google Maps API, a standard web service based on Ajax technology, lets users publish interactive web maps, opening new possibilities compared with classical analogue maps. CORINE land cover databases are recognized as the fundamental reference data sets for numerous spatial analyses. The theoretical and applied aspects of the Google Maps API cartographic service are considered through the case of creating a web map of change in urban areas in Belgrade and its surroundings from 2000 to 2006, obtained from CORINE databases.

  18. Information retrieval in cultural heritage

    NARCIS (Netherlands)

    Koolen, M.; Kamps, J.; de Keijzer, V.

    2009-01-01

    This article discusses the opportunities and challenges of applying modern information retrieval techniques to the cultural heritage domain. Although the field of information retrieval is closely associated with computer science, it originally emerged from library science — also one of the main

  19. Contextual Bandits for Information Retrieval

    NARCIS (Netherlands)

    Hofmann, K.; Whiteson, S.; de Rijke, M.

    2011-01-01

    In this paper we give an overview of and outlook on research at the intersection of information retrieval (IR) and contextual bandit problems. A critical problem in information retrieval is online learning to rank, where a search engine strives to improve the quality of the ranked result lists it

  20. Introduction to the JASIST Special Topic Issue on Web Retrieval and Mining: A Machine Learning Perspective.

    Science.gov (United States)

    Chen, Hsinchun

    2003-01-01

    Discusses information retrieval techniques used on the World Wide Web. Topics include machine learning in information extraction; relevance feedback; information filtering and recommendation; text classification and text clustering; Web mining, based on data mining techniques; hyperlink structure; and Web size. (LRW)

  1. Name Searching and Information Retrieval

    CERN Document Server

    Thompson, P; Thompson, Paul; Dozier, Christopher C.

    1997-01-01

    The main application of name searching has been name matching in a database of names. This paper discusses a different application: improving information retrieval through name recognition. It investigates name recognition accuracy, and the effect on retrieval performance of indexing and searching personal names differently from non-name terms in the context of ranked retrieval. The main conclusions are: that name recognition in text can be effective; that names occur frequently enough in a variety of domains, including those of legal documents and news databases, to make recognition worthwhile; and that retrieval performance can be improved using name searching.

  2. Ontology-based Information Retrieval

    DEFF Research Database (Denmark)

    Styltsvig, Henrik Bulskov

    In this thesis, we will present methods for introducing ontologies in information retrieval. The main hypothesis is that the inclusion of conceptual knowledge such as ontologies in the information retrieval process can contribute to the solution of major problems currently found in information...... retrieval. This utilization of ontologies has a number of challenges. Our focus is on the use of similarity measures derived from the knowledge about relations between concepts in ontologies, the recognition of semantic information in texts and the mapping of this knowledge into the ontologies in use......, as well as how to fuse together the ideas of ontological similarity and ontological indexing into a realistic information retrieval scenario. To achieve the recognition of semantic knowledge in a text, shallow natural language processing is used during indexing that reveals knowledge to the level of noun...

  3. Topic structure for information retrieval

    NARCIS (Netherlands)

    He, J.; Sanderson, M.; Zhai, C.; Zobel, J.; Allan, J.; Aslam, J.A.

    2009-01-01

    In my research, I propose a coherence measure, with the goal of discovering and using topic structures within and between documents, and I explore its extensions and applications in information retrieval.

  4. Hooked on Music Information Retrieval

    National Research Council Canada - National Science Library

    W. Bas de Haas; Frans Wiering

    2011-01-01

    This article provides a reply to 'Lure(d) into listening: The potential of cognition-based music information retrieval,' in which Henkjan Honing discusses the potential impact of his proposed Listen, Lure...

  5. Hooked on Music Information Retrieval

    National Research Council Canada - National Science Library

    de Haas, W Bas

    2010-01-01

    This article provides a reply to 'Lure(d) into listening: The potential of cognition-based music information retrieval,' in which Henkjan Honing discusses the potential impact of his proposed Listen, Lure...

  6. Data Fusion in Information Retrieval

    CERN Document Server

    Wu, Shengli

    2012-01-01

    The technique of data fusion has been used extensively in information retrieval due to the complexity and diversity of tasks involved such as web and social networks, legal, enterprise, and many others. This book presents both a theoretical and empirical approach to data fusion. Several typical data fusion algorithms are discussed, analyzed and evaluated. A reader will find answers to the following questions, among others:
    - What are the key factors that affect the performance of data fusion algorithms significantly?
    - What conditions are favorable to data fusion algorithms?
    - Which is better, CombSum or CombMNZ, and why?
    - What is the rationale of using the linear combination method?
    - How can the best fusion option be found under any given circumstances?
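
    The two fusion rules named in this abstract can be illustrated with a minimal sketch. It is not taken from the book: it assumes the commonly cited definitions (CombSum adds a document's normalized scores across runs; CombMNZ multiplies that sum by the number of runs giving the document a non-zero score) and assumes min-max normalization per run. The function names and the dictionary-based run format are hypothetical choices made for illustration.

```python
from collections import defaultdict


def min_max_normalize(scores):
    """Rescale one run's scores to [0, 1] (an assumed normalization choice)."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores equal: treat every document as equally (fully) relevant.
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def comb_sum(runs):
    """CombSum: sum of a document's normalized scores over all runs."""
    fused = defaultdict(float)
    for run in runs:
        for doc, score in min_max_normalize(run).items():
            fused[doc] += score
    return dict(fused)


def comb_mnz(runs):
    """CombMNZ: CombSum score times the number of runs with a non-zero score."""
    sums = comb_sum(runs)
    hits = defaultdict(int)
    for run in runs:
        for doc, score in min_max_normalize(run).items():
            if score > 0:
                hits[doc] += 1
    return {doc: sums[doc] * hits[doc] for doc in sums}


if __name__ == "__main__":
    # Two hypothetical runs mapping document IDs to raw retrieval scores.
    runs = [
        {"d1": 3.0, "d2": 1.0, "d3": 2.0},
        {"d3": 4.0, "d1": 2.5, "d4": 2.0},
    ]
    for name, fuse in (("CombSum", comb_sum), ("CombMNZ", comb_mnz)):
        ranking = sorted(fuse(runs).items(), key=lambda kv: kv[1], reverse=True)
        print(name, ranking)
```

    Under these assumptions CombMNZ gives extra weight to documents retrieved by several systems, which is the usual intuition behind preferring it when the runs overlap heavily.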

  7. Information Retrieval for Education: Making Search Engines Language Aware

    Science.gov (United States)

    Ott, Niels; Meurers, Detmar

    2010-01-01

    Search engines have been a major factor in making the web the successful and widely used information source it is today. Generally speaking, they make it possible to retrieve web pages on a topic specified by the keywords entered by the user. Yet web searching currently does not take into account which of the search results are comprehensible for…

  8. An Effective Information Retrieval for Ambiguous Query

    OpenAIRE

    Roul, R. K.; Sahay, S. K.

    2012-01-01

    A search engine returns thousands of web pages for a single user query, most of which are not relevant. In this context, effective information retrieval from the expanding web is a challenging task, in particular if the query is ambiguous. The major question that arises here is how to get the relevant pages for an ambiguous query. We propose an approach for the effective result of an ambiguous query by forming community vector based on association concept of data mining using vector s...

  9. Information retrieval in digital environments

    CERN Document Server

    Dinet, Jérôme

    2014-01-01

    Information retrieval is a central and essential activity. It is indeed difficult to find a human activity that does not need to retrieve information in an environment which is often increasingly digital: moving and navigating, learning, having fun, communicating, informing, making a decision, etc. Most human activities are intimately linked to our ability to search quickly and effectively for relevant information; the stakes are sometimes extremely important: passing an exam, voting, finding a job, remaining autonomous, being socially connected, developing a critical spirit, or simply surviv

  10. Information Retrieval for Ecological Syntheses

    Science.gov (United States)

    Bayliss, Helen R.; Beyer, Fiona R.

    2015-01-01

    Research syntheses are increasingly being conducted within the fields of ecology and environmental management. Information retrieval is crucial in any synthesis in identifying data for inclusion whilst potentially reducing biases in the dataset gathered, yet the nature of ecological information provides several challenges when compared with…

  11. Conceptual Information Retrieval.

    Science.gov (United States)

    1980-12-01

    Computer Science, 10 Hillhouse Avenue, New Haven, Connecticut 06520. Advanced Research... Office of Naval Research, Information Systems Program. Unclassified. ...can and should be useful in developing smarter IR systems. Many of the natural language and memory organization problems we have been dealing with are

  12. Rhetorical relations for information retrieval

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Lu, Wei

    2012-01-01

    Typically, every part in most coherent text has some plausible reason for its presence, some function that it performs to the overall semantics of the text. Rhetorical relations, e.g. contrast, cause, explanation, describe how the parts of a text are linked to each other. Knowledge about this so-called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document’s rhetorical relations be useful to IR? We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness...

  13. Rare disease diagnosis as an information retrieval task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend...... to be long lists of symptoms, often containing phrases, whereas web IR systems typically expect very short keyword-based queries. Motivated by such differences, this work uses a preliminary study of 30 clinical cases to reflect on rare disease retrieval as an IR task. Initial experiments using both Google web...... search and offline retrieval from a rare disease collection indicate that the retrieval of rare diseases is an open problem with room for improvement....

  14. Rare disease diagnosis as an information retrieval task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend...... to be long lists of symptoms, often containing phrases, whereas web IR systems typically expect very short keyword-based queries. Motivated by such differences, this work uses a preliminary study of 30 clinical cases to reflect on rare disease retrieval as an IR task. Initial experiments using both Google...... web search and offline retrieval from a rare disease collection indicate that the retrieval of rare diseases is an open problem with room for improvement....

  15. Hooked on Music Information Retrieval

    Directory of Open Access Journals (Sweden)

    W. Bas de Haas

    2011-04-01

    This article provides a reply to 'Lure(d) into listening: The potential of cognition-based music information retrieval,' in which Henkjan Honing discusses the potential impact of his proposed Listen, Lure & Locate project on Music Information Retrieval (MIR). Honing presents some critical remarks on data-oriented approaches in MIR, which we endorse. To place these remarks in context, we first give a brief overview of the state of the art of MIR research. Then we present a series of arguments that show why purely data-oriented approaches are unlikely to take MIR research and applications to a more advanced level. Next, we propose our view on MIR research, in which the modelling of musical knowledge has a central role. Finally, we elaborate on the ideas in Honing's paper from a MIR perspective in this paper and propose some additions to the Listen, Lure & Locate project.

  16. Information retrieval and individual differences

    Directory of Open Access Journals (Sweden)

    Polona Vilar

    2008-01-01

    Full Text Available The paper presents individual differences, which are found in studies of information retrieval with emphasis on models of personality traits, cognitive and learning styles. It pays special attention to those models which are most often included in studies of information behaviour, information seeking, perceptions of IR systems, etc., but also brings forward some models which have not yet been included in such studies. Additionally, the relationship between different individual characteristics and individual’s chosen profession or academic area is discussed. In this context, the paper presents how investigation of individual differences can be useful in the design of IR systems.

  17. Interactive information seeking, behaviour and retrieval

    CERN Document Server

    Ruthven, Ian

    2011-01-01

    Information retrieval (IR) is a complex human activity supported by sophisticated systems. This book covers the whole spectrum of information retrieval, including: history and background information; behaviour and seeking task-based information; searching and retrieval approaches to investigating information; and, evaluation interfaces for IR.

  18. Engineering a Multi-Purpose Test Collection for Web Retrieval Experiments.

    Science.gov (United States)

    Bailey, Peter; Craswell, Nick; Hawking, David

    2003-01-01

    Describes a test collection that was developed as a multi-purpose testbed for experiments on the Web in distributed information retrieval, hyperlink algorithms, and conventional ad hoc retrieval. Discusses inter-server connectivity, integrity of server holdings, inclusion of documents related to a wide spread of likely queries, and distribution of…

  19. Dynamic “Inline” Images: Context-Sensitive Retrieval and Integration of Images into Web Documents

    OpenAIRE

    Kahn, Charles E.

    2008-01-01

    Integrating relevant images into web-based information resources adds value for research and education. This work sought to evaluate the feasibility of using “Web 2.0” technologies to dynamically retrieve and integrate pertinent images into a radiology web site. An online radiology reference of 1,178 textual web documents was selected as the set of target documents. The ARRS GoldMiner™ image search engine, which incorporated 176,386 images from 228 peer-reviewed journals, retrieved images on ...

  20. JavaScript tools for online information retrieval

    OpenAIRE

    Gamage, Ruwan; Dong, Hui

    2006-01-01

    JavaScript has a comparatively long history as an online information retrieval tool. During the last decade SilverPlatter's popular WebSPIRS 4.0 started using JavaScript for its search functions. International Children's Digital Library is a current system that applies JavaScript for category based information retrieval. However, JavaScript capabilities for quick browsing and searching of small collections are underutilized in light of advanced server-side technologies. Focussing on search engin...

  1. Dynamic "inline" images: context-sensitive retrieval and integration of images into Web documents.

    Science.gov (United States)

    Kahn, Charles E

    2008-09-01

    Integrating relevant images into web-based information resources adds value for research and education. This work sought to evaluate the feasibility of using "Web 2.0" technologies to dynamically retrieve and integrate pertinent images into a radiology web site. An online radiology reference of 1,178 textual web documents was selected as the set of target documents. The ARRS GoldMiner image search engine, which incorporated 176,386 images from 228 peer-reviewed journals, retrieved images on demand and integrated them into the documents. At least one image was retrieved in real-time for display as an "inline" image gallery for 87% of the web documents. Each thumbnail image was linked to the full-size image at its original web site. Review of 20 randomly selected Collaborative Hypertext of Radiology documents found that 69 of 72 displayed images (96%) were relevant to the target document. Users could click on the "More" link to search the image collection more comprehensively and, from there, link to the full text of the article. A gallery of relevant radiology images can be inserted easily into web pages on any web server. Indexing by concepts and keywords allows context-aware image retrieval, and searching by document title and subject metadata yields excellent results. These techniques allow web developers to incorporate easily a context-sensitive image gallery into their documents.

  2. Multimedia information retrieval theory and techniques

    CERN Document Server

    Raieli, Roberto

    2013-01-01

    Novel processing and searching tools for the management of new multimedia documents have developed. Multimedia Information Retrieval (MMIR) is an organic system made up of Text Retrieval (TR); Visual Retrieval (VR); Video Retrieval (VDR); and Audio Retrieval (AR) systems. So that each type of digital document may be analysed and searched by the elements of language appropriate to its nature, search criteria must be extended. Such an approach is known as the Content Based Information Retrieval (CBIR), and is the core of MMIR. This novel content-based concept of information handling needs to be integrated with more traditional semantics. Multimedia Information Retrieval focuses on the tools of processing and searching applicable to the content-based management of new multimedia documents. Translated from Italian by Giles Smith, the book is divided into two parts. Part one discusses MMIR and related theories, and puts forward new methodologies; part two reviews various experimental and operating MMIR systems, a...

  3. Information Retrieval Methods in Libraries and Information Centers ...

    African Journals Online (AJOL)

    The volumes of information created, generated and stored are so immense that without adequate knowledge of information retrieval methods, the retrieval process for an information user would be cumbersome and frustrating. Studies have further revealed that information retrieval methods are essential in information centers ...

  4. Analytical Study of Information Retrieval techniques and Modified Model of Search Engine

    OpenAIRE

    Ms. Leena More

    2015-01-01

    The concept of Information Retrieval is very vast and too many models of search engines are available in the market. In this research various information retrieval techniques used in search engines were studied and a modified model of a search engine was developed. In web mining most of the web search engines retrieve the documents or information first without knowing the meaning of the keyword and then ask for the relevant meaning of the keyword entered by the users. That means without understan...

  5. Peer to Peer Information Retrieval: An Overview

    NARCIS (Netherlands)

    Tigelaar, A.S.; Hiemstra, D.; Trieschnigg, D.

    2012-01-01

    Peer-to-peer technology is widely used for file sharing. In the past decade a number of prototype peer-to-peer information retrieval systems have been developed. Unfortunately, none of these have seen widespread real-world adoption and thus, in contrast with file sharing, information retrieval is

  6. Peer to Peer Information Retrieval: An Overview

    NARCIS (Netherlands)

    Tigelaar, A.S.; Hiemstra, Djoerd; Trieschnigg, Rudolf Berend

    Peer-to-peer technology is widely used for file sharing. In the past decade a number of prototype peer-to-peer information retrieval systems have been developed. Unfortunately, none of these have seen widespread real-world adoption and thus, in contrast with file sharing, information retrieval is

  7. Information Retrieval Interaction: an Analysis of Models

    Directory of Open Access Journals (Sweden)

    Farahnaz Sadoughi

    2012-03-01

    Full Text Available Information searching is an interactive process: users have control over the search and can manage its results. During this process, the user's question matures in response to the retrieved results. In addition, on the side of the information retrieval system, some processes cannot be carried out except by the user. In practice, this is most evident in “Interaction” (i.e. the process of connecting the user to the other system elements) and in “Relevance judgment”. This paper first looks at the role of “Interaction” in information retrieval. Then the traditional model of information retrieval and its strengths and weaknesses are reviewed. Finally, the current models of interactive information retrieval are elucidated, including the Belkin episodic model, the Ingwersen cognitive model, the Saracevic stratified model, and the Spink interactive feedback model.

  8. Information Retrieval in Telemedicine: a Comparative Study on Bibliographic Databases.

    Science.gov (United States)

    Ahmadi, Maryam; Sarabi, Roghayeh Ershad; Orak, Roohangiz Jamshidi; Bahaadinbeigy, Kambiz

    2015-06-01

    The first step in each systematic review is selection of the most valid database that can provide the highest number of relevant references. This study was carried out to determine the most suitable database for information retrieval in the telemedicine field. CINAHL, PubMed, Web of Science and Scopus databases were searched for telemedicine matched with education, cost benefit and patient satisfaction. After analysis of the obtained results, the accuracy coefficient, sensitivity, uniqueness and overlap of databases were calculated. The studied databases differed in the number of retrieved articles. PubMed was identified as the most suitable database for retrieving information on the selected topics with the accuracy and sensitivity ratios of 50.7% and 61.4% respectively. The uniqueness percent of retrieved articles ranged from 38% for PubMed to 3.0% for CINAHL. The highest overlap rate (18.6%) was found between PubMed and Web of Science. Less than 1% of articles have been indexed in all searched databases. PubMed is suggested as the most suitable database for starting search in telemedicine and after PubMed, Scopus and Web of Science can retrieve about 90% of the relevant articles.

  9. Bibliometric-enhanced information retrieval

    NARCIS (Netherlands)

    Mayr, Philipp; Scharnhorst, Andrea; Larsen, Birger; Schaer, Philipp; Mutschke, Peter

    2014-01-01

    Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of coauthorship network, can

  10. World-Wide Web the information universe

    CERN Document Server

    Berners-Lee, Tim; Groff, Jean-Francois; Pollermann, Bernd

    1992-01-01

    Purpose - The World-Wide Web (W-3) initiative is a practical project designed to bring a global information universe into existence using available technology. This paper seeks to describe the aims, data model, and protocols needed to implement the "web" and to compare them with various contemporary systems. Design/methodology/approach - Since Vannevar Bush's article, men have dreamed of extending their intellect by making their collective knowledge available to each individual by using machines. Computers provide us two practical techniques for human-knowledge interface. One is hypertext, in which links between pieces of text (or other media) mimic human association of ideas. The other is text retrieval, which allows associations to be deduced from the content of text. The W-3 ideal world allows both operations and provides access from any browsing platform. Findings - Various server gateways to other information systems have been produced, and the total amount of information available on the web is...

  11. Parsimonious Language Models for Information Retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Robertson, Stephen; Zaragoza, Hugo

    We systematically investigate a new approach to estimating the parameters of language models for information retrieval, called parsimonious language models. Parsimonious language models explicitly address the relation between levels of language models that are typically used for smoothing. As such,

  12. Current challenges in patent information retrieval

    CERN Document Server

    Lupu, Mihai; Kando, Noriko; Trippe, Anthony J

    2017-01-01

    Intellectual property in the form of patents plays a vital role in today's increasingly knowledge-based economy. This book assembles state-of-the-art research and is intended to illustrate innovative approaches to patent information retrieval.

  13. A unified relevance feedback framework for web image retrieval.

    Science.gov (United States)

    Cheng, En; Jing, Feng; Zhang, Lei

    2009-06-01

    Although relevance feedback (RF) has been extensively studied in the content-based image retrieval community, no commercial Web image search engines support RF because of scalability, efficiency, and effectiveness issues. In this paper, we propose a unified relevance feedback framework for Web image retrieval. Our framework shows advantage over traditional RF mechanisms in the following three aspects. First, during the RF process, both textual feature and visual feature are used in a sequential way. To seamlessly combine textual feature-based RF and visual feature-based RF, a query concept-dependent fusion strategy is automatically learned. Second, the textual feature-based RF mechanism employs an effective search result clustering (SRC) algorithm to obtain salient phrases, based on which we could construct an accurate and low-dimensional textual space for the resulting Web images. Thus, we could integrate RF into Web image retrieval in a practical way. Last, a new user interface (UI) is proposed to support implicit RF. On the one hand, unlike traditional RF UI which enforces users to make explicit judgment on the results, the new UI regards the users' click-through data as implicit relevance feedback in order to release burden from the users. On the other hand, unlike traditional RF UI which hardily substitutes subsequent results for previous ones, a recommendation scheme is used to help the users better understand the feedback process and to mitigate the possible waiting caused by RF. Experimental results on a database consisting of nearly three million Web images show that the proposed framework is wieldy, scalable, and effective.

  14. Information Retrieval and Graph Analysis Approaches for Book Recommendation

    Directory of Open Access Journals (Sweden)

    Chahinez Benkoussas

    2015-01-01

    Full Text Available A combination of multiple information retrieval approaches is proposed for the purpose of book recommendation. In this paper, book recommendation is based on complex user queries. We used different theoretical retrieval models, namely probabilistic models such as InL2 (a Divergence from Randomness model) and a language model, and tested their interpolated combination. Graph analysis algorithms such as PageRank have been successful in Web environments. We consider the application of this algorithm in a new retrieval approach to a related document network comprised of social links. We call a network constructed from documents and the social information provided by each of them a Directed Graph of Documents (DGD). Specifically, this work tackles the problem of book recommendation in the context of the INEX (Initiative for the Evaluation of XML retrieval) Social Book Search track. A series of reranking experiments demonstrate that combining retrieval models yields significant improvements in terms of standard ranked retrieval metrics. These results extend the applicability of link analysis algorithms to different environments.
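
    As an illustration of what an "interpolated combination" of retrieval models can look like, the sketch below linearly mixes two score lists and reranks. The abstract does not give the actual formula, so the min-max normalisation, the weight `alpha`, and the function names are assumptions made here for illustration only.

```python
def min_max(scores):
    """Rescale a {doc_id: score} dict to the range [0, 1]."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores are equal, so no document is preferred.
        return {doc: 0.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def interpolate(scores_a, scores_b, alpha=0.5):
    """Rank documents by alpha * A + (1 - alpha) * B on normalised scores.

    scores_a and scores_b are hypothetical outputs of two retrieval
    models; a document missing from one model contributes 0 for it.
    """
    a, b = min_max(scores_a), min_max(scores_b)
    combined = {
        doc: alpha * a.get(doc, 0.0) + (1 - alpha) * b.get(doc, 0.0)
        for doc in set(a) | set(b)
    }
    # Sort by descending combined score, breaking ties by document id.
    return sorted(combined, key=lambda doc: (-combined[doc], doc))
```

    Setting `alpha` to 1.0 or 0.0 recovers the ranking of a single model, which makes it easy to compare the combination against each model on its own.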

  15. Information Retrieval Research and ESPRIT.

    Science.gov (United States)

    Smeaton, Alan F.

    1987-01-01

    Describes the European Strategic Programme of Research and Development in Information Technology (ESPRIT), and its five programs: advanced microelectronics, software technology, advanced information processing, office systems, and computer integrated manufacturing. The emphasis on logic programming and ESPRIT as the European response to the…

  16. A Personalized Health Information Retrieval System

    OpenAIRE

    Wang, Yunli; Liu, Zhenkai

    2005-01-01

    Consumers face barriers when seeking health information on the Internet. A Personalized Health Information Retrieval System (PHIRS) is proposed to recommend health information for consumers. The system consists of four modules: (1) User modeling module captures user’s preference and health interests; (2) Automatic quality filtering module identifies high quality health information; (3) Automatic text difficulty rating module classifies health information into professional or...

  17. Adaptive Visualization for Focused Personalized Information Retrieval

    Science.gov (United States)

    Ahn, Jae-wook

    2010-01-01

    The new trend on the Web has totally changed today's information access environment. The traditional information overload problem has evolved into the qualitative level beyond the quantitative growth. The mode of producing and consuming information is changing and we need a new paradigm for accessing information. Personalized search is one of…

  18. AEROMETRIC INFORMATION RETRIEVAL SYSTEM (AIRS) - GRAPHICS

    Science.gov (United States)

    Aerometric Information Retrieval System (AIRS) is a computer-based repository of information about airborne pollution in the United States and various World Health Organization (WHO) member countries. AIRS is administered by the U.S. Environmental Protection Agency, and runs on t...

  19. Using Complexity Measures in Information Retrieval

    NARCIS (Netherlands)

    van der Sluis, Frans; van den Broek, Egon; Belkin, N.J.; Kelly, D.

    2010-01-01

    Although Information Retrieval (IR) is meant to serve its users, surprisingly little IR research is user-centered. In contrast, this article utilizes the concept complexity of information as the determinant of the user's comprehension, not as a formal golden measure. Four aspects of user's

  20. Quality issues in the management of web information

    CERN Document Server

    Bordogna, Gloria; Jain, Lakhmi

    2013-01-01

    This research volume presents a sample of recent contributions related to the issue of quality-assessment for Web Based information in the context of information access, retrieval, and filtering systems. The advent of the Web and the uncontrolled process of documents' generation have raised the problem of declining quality assessment to information on the Web, by considering both the nature of documents (texts, images, video, sounds, and so on), the genre of documents (news, geographic information, ontologies, medical records, products records, and so on), the reputation of information sources and sites, and, last but not least, the actions performed on documents (content indexing, retrieval and ranking, collaborative filtering, and so on). The volume constitutes a compendium of both heterogeneous approaches and sample applications focusing on specific aspects of the quality assessment for Web-based information for researchers, PhD students and practitioners carrying out their research activity in the field of W...

  1. A specialized framework for data retrieval Web applications

    Energy Technology Data Exchange (ETDEWEB)

    Jerzy Nogiec; Kelley Trombly-Freytag; Dana Walbridge

    2004-07-12

    Although many general-purpose frameworks have been developed to aid in web application development, they typically tend to be both comprehensive and complex. To address this problem, a specialized server-side Java framework designed specifically for data retrieval and visualization has been developed. The framework's focus is on maintainability and data security. The functionality is rich with features necessary for simplifying data display design, deployment, user management and application debugging, yet the scope is deliberately kept limited to allow for easy comprehension and rapid application development. The system clearly decouples the application processing and visualization, which in turn allows for clean separation of layout and processing development. Duplication of standard web page features such as toolbars and navigational aids is therefore eliminated. The framework employs the popular Model-View-Controller (MVC) architecture, but it also uses the filter mechanism for several of its base functionalities, which permits easy extension of the provided core functionality of the system.

  2. A Specialized Framework for Data Retrieval Web Applications

    Directory of Open Access Journals (Sweden)

    Jerzy Nogiec

    2005-06-01

    Full Text Available Although many general-purpose frameworks have been developed to aid in web application development, they typically tend to be both comprehensive and complex. To address this problem, a specialized server-side Java framework designed specifically for data retrieval and visualization has been developed. The framework's focus is on maintainability and data security. The functionality is rich with features necessary for simplifying data display design, deployment, user management and application debugging, yet the scope is deliberately kept limited to allow for easy comprehension and rapid application development. The system clearly decouples the application processing and visualization, which in turn allows for clean separation of layout and processing development. Duplication of standard web page features such as toolbars and navigational aids is therefore eliminated. The framework employs the popular Model-View-Controller (MVC) architecture, but it also uses the filter mechanism for several of its base functionalities, which permits easy extension of the provided core functionality of the system.

  3. The semantics of similarity in geographic information retrieval

    Directory of Open Access Journals (Sweden)

    Krzysztof Janowicz

    2011-05-01

    Full Text Available Similarity measures have a long tradition in fields such as information retrieval, artificial intelligence, and cognitive science. Within the last years, these measures have been extended and reused to measure semantic similarity; i.e., for comparing meanings rather than syntactic differences. Various measures for spatial applications have been developed, but a solid foundation for answering what they measure; how they are best applied in information retrieval; which role contextual information plays; and how similarity values or rankings should be interpreted is still missing. It is therefore difficult to decide which measure should be used for a particular application or to compare results from different similarity theories. Based on a review of existing similarity measures, we introduce a framework to specify the semantics of similarity. We discuss similarity-based information retrieval paradigms as well as their implementation in web-based user interfaces for geographic information retrieval to demonstrate the applicability of the framework. Finally, we formulate open challenges for similarity research.

  4. BIR 2014 - Bibliometric-enhanced Information Retrieval

    DEFF Research Database (Denmark)

    This first “Bibliometric-enhanced Information Retrieval” (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. To give an example, recent approaches have shown the possibilities of alternative ranking methods based on citation analysis leading to an enhanced IR. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of co-authorship network, can improve retrieval services for specific communities, as well as for large, cross-domain collections. This workshop aims to raise awareness of the missing link between information retrieval (IR) and bibliometrics / scientometrics and to create a common ground...

  5. Applications Of Informetrics To Information Retrieval Research

    Directory of Open Access Journals (Sweden)

    Dietmar Wolfram

    2000-01-01

    Full Text Available A non-technical overview of two primary areas of study within the discipline of information science, information retrieval (IR) and informetrics, is presented. Informetric properties of IR systems as the basis for understanding IR system structure and generalizing human information seeking in electronic environments are discussed. Applications of informetric study of IR systems for more efficient and effective design and evaluation of IR systems are also presented.

  6. Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects

    DEFF Research Database (Denmark)

    Cong, Gao; Jensen, Christian Søndergaard; Wu, Dingming

    2009-01-01

    The conventional Internet is acquiring a geo-spatial dimension. Web documents are being geo-tagged, and geo-referenced objects such as points of interest are being associated with descriptive text documents. The resulting fusion of geo-location and documents enables a new kind of top-k query that takes into account both location proximity and text relevancy. To our knowledge, only naive techniques exist that are capable of computing a general web information retrieval query while also taking location into account. This paper proposes a new indexing framework for location-aware top-k text...
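
    To make the idea of a top-k query over both location proximity and text relevancy concrete, here is a minimal brute-force sketch. It is a naive full scan, not the indexing framework the paper proposes; the scoring formula, the weight `alpha`, the cut-off `max_dist`, and the `(name, (x, y), description)` object layout are all assumptions for illustration.

```python
import heapq
import math


def text_relevance(query_terms, description):
    """Fraction of query terms that occur in the description."""
    terms = [t.lower() for t in query_terms]
    if not terms:
        return 0.0
    words = set(description.lower().split())
    return sum(t in words for t in terms) / len(terms)


def top_k(objects, query_loc, query_terms, k, alpha=0.5, max_dist=10.0):
    """Return the k objects with the best combined score.

    Each object is a hypothetical (name, (x, y), description) tuple.
    The score mixes spatial proximity and text relevance with weight
    alpha; distances beyond max_dist count as zero proximity.
    """
    def score(obj):
        _, (x, y), description = obj
        dist = math.hypot(x - query_loc[0], y - query_loc[1])
        proximity = 1.0 - min(dist, max_dist) / max_dist
        relevance = text_relevance(query_terms, description)
        return alpha * proximity + (1 - alpha) * relevance

    # Scans every object; an index would avoid this full pass.
    return heapq.nlargest(k, objects, key=score)
```

    Because every object is scored on every query, the cost grows linearly with the collection size, which is the kind of overhead a dedicated index aims to avoid.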

  7. Machine Learning Approaches for Music Information Retrieval

    OpenAIRE

    Li, Tao; Ogihara, Mitsunori; Shao, Bo; Wang, Dingding

    2009-01-01

    We discussed the following machine learning approaches used in music information retrieval: (1) multi-class classification methods for music genre categorization; (2) multi-label classification methods for emotion detection; (3) clustering methods for music style identification; and (4) semi-supervised learning methods for music recommendation. Experimental results are also presented to evaluate the approaches.

  8. Language-based multimedia information retrieval

    NARCIS (Netherlands)

    de Jong, Franciska M.G.; Gauvain, J.L.; Hiemstra, Djoerd; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material

  9. Cross language information retrieval for biomedical literature

    NARCIS (Netherlands)

    Schuemie, M.; Trieschnigg, D.; Kraaij, W.

    2007-01-01

    This workshop report discusses the collaborative work of UT, EMC and TNO on the TREC Genomics Track 2007. The biomedical information retrieval task is approached using cross language methods, in which biomedical concept detection is combined with effective IR based on unigram language models.

  10. Cross Language Information Retrieval for Biomedical Literature

    NARCIS (Netherlands)

    Schuemie, Martijn; Trieschnigg, Rudolf Berend; Kraaij, Wessel; Voorhees, E.M.; Buckland, L.P.

    2007-01-01

    This workshop report discusses the collaborative work of UT, EMC and TNO on the TREC Genomics Track 2007. The biomedical information retrieval task is approached using cross language methods, in which biomedical concept detection is combined with effective IR based on unigram language models.

  11. Semantic association ranking schemes for information retrieval ...

    Indian Academy of Sciences (India)

    Most of the Information Retrieval (IR) techniques are based on representing the documents using the traditional vector space and probabilistic language model i.e., bag-of-words model. In this paper, associations among words in the documents are assessed and it is expressed in Term Association Graph model to represent ...

  12. Introduction to Data Transmission for Information Retrieval

    Science.gov (United States)

    Kallenbach, P. A.

    1975-01-01

    An introduction is presented to data transmission technology and networks for information retrieval purposes. Data signals are analyzed, modulation techniques are discussed, communication procedures between terminals and the central processing unit are surveyed, and possible network configurations are considered. (Author/PF)

  13. Towards an Information Retrieval Theory of Everything

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Lammerink, J.M.W.; Katoen, Joost P.; Kok, J.N.; van de Pol, Jan Cornelis; Raamsdonk, F.

    2009-01-01

    I present three well-known probabilistic models of information retrieval in tutorial style: The binary independence probabilistic model, the language modeling approach, and Google's page rank. Although all three models are based on probability theory, they are very different in nature. Each model
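
    Of the three models named in this abstract, PageRank is the easiest to show in a few lines. The sketch below is a standard power-iteration formulation and is not taken from the tutorial itself; the damping factor 0.85, the fixed iteration count, and the even redistribution of rank from pages without outgoing links are common conventions assumed here.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Approximate PageRank by power iteration.

    links maps each page to the list of pages it links to. Pages that
    appear only as link targets are included automatically.
    """
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    if not nodes:
        return {}
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Every page receives the random-jump share first.
        new_rank = {v: (1.0 - damping) / n for v in nodes}
        for v in nodes:
            targets = links.get(v, [])
            if targets:
                share = damping * rank[v] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[v] / n
                for t in nodes:
                    new_rank[t] += share
        rank = new_rank
    return rank
```

    The returned values sum to 1, so they can be read as the probability that a random surfer is on each page.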

  14. Variations on language modeling for information retrieval.

    NARCIS (Netherlands)

    Kraaij, Wessel

    2004-01-01

    Search engine technology builds on theoretical and empirical research results in the area of information retrieval (IR). This dissertation makes a contribution to the field of language modeling (LM) for IR, which views both queries and documents as instances of a unigram language model and defines

  15. Inductive Information Retrieval Using Parallel Distributed Computation.

    Science.gov (United States)

    Mozer, Michael C.

    This paper reports on an application of parallel models to the area of information retrieval and argues that massively parallel, distributed models of computation, called connectionist, or parallel distributed processing (PDP) models, offer a new approach to the representation and manipulation of knowledge. Although this document focuses on…

  16. Database Optimization Aspects for Information Retrieval

    NARCIS (Netherlands)

    Blok, H.E.

    2002-01-01

    There is a growing need for systems that can process queries, combining both structured data and text. One way to provide such functionality is to integrate information retrieval (IR) techniques in a database management system (DBMS). However, both IR and database research have been separate

  17. Formalizing Evaluation in Music Information Retrieval

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2013-01-01

    We develop a formalism to disambiguate the evaluation of music information retrieval systems. We define a “system,” what it means to “analyze” one, and make clear the aims, parts, design, execution, interpretation, and assumptions of its “evaluation.” We apply this formalism to discuss the ... the MIREX automatic mood classification task.

  18. Test OSIRIS (On Line Search Information Retrieval Information Storage).

    Science.gov (United States)

    Showalther, A. Kenneth

    The OSIRIS system is a prototype information retrieval system having the following components: an automated microfiche file having a capacity of 5000 punch card sized microfiche with a remote control 21 inch TV console for retrieving, magnifying (0-250X), and displaying any of the images on the microfiche; and a remote computer terminal for the…

  19. Information retrieval models foundations and relationships

    CERN Document Server

    Roelleke, Thomas

    2013-01-01

    Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR). Regarding in
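
    For readers unfamiliar with BM25, the following sketch scores a handful of documents against a query. It uses one common variant of the formula (a non-negative IDF of the form log(1 + (N - df + 0.5) / (df + 0.5)), with k1 = 1.2 and b = 0.75) and whitespace tokenisation; these choices are assumptions for illustration and are not taken from the book.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Return one BM25 score per document for a whitespace-tokenised query."""
    tokenized = [doc.lower().split() for doc in docs]
    n = len(tokenized)
    if n == 0:
        return []
    avgdl = sum(len(doc) for doc in tokenized) / n
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in tokenized for term in set(doc))
    scores = []
    for doc in tokenized:
        tf = Counter(doc)
        score = 0.0
        for term in query.lower().split():
            if tf[term] == 0:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            # Length normalisation penalises longer-than-average documents.
            norm = tf[term] + k1 * (1 - b + b * len(doc) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores
```

    The parameter `b` controls how strongly document length is normalised, and `k1` controls how quickly repeated occurrences of a term stop adding to the score.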

  20. Order effect in interactive information retrieval evaluation

    DEFF Research Database (Denmark)

    Clemmensen, Melanie Landvad; Borlund, Pia

    2016-01-01

    Purpose – The purpose of this paper is to report a study of order effect in interactive information retrieval (IIR) studies. The phenomenon of order effect is well-known, and it is the main reason why searches are permuted (counter-balanced) between test participants in IIR studies. However...... of such studies. Due to the limited sample of 20 test participants (Library and Information Science (LIS) students) inference statistics is not applicable; hence conclusions can be drawn from this sample of test participants only. Originality/value – Only few studies in LIS focus on order effect and none from...... the perspective of IIR. Keywords Evaluation, Research methods, Information retrieval, User studies, Searching, Information searches...

  1. Method of and System for Information Retrieval

    DEFF Research Database (Denmark)

    2015-01-01

    This invention relates to a system for and a method (100) of searching a collection of digital information (150) comprising a number of digital documents (110), the method comprising receiving or obtaining (102) a search query, the query comprising a number of search terms, searching (103) an index...... (300) using the search terms thereby providing information (301) about which digital documents (110) of the collection of digital information (150) that contains a given search term and one or more search related metrics (302; 303; 304; 305; 306), ranking (105) at least a part of the search result......, a method of and a system for information retrieval or searching is readily provided that enhances the searching quality (i.e. the number of relevant documents retrieved and such documents being ranked high) when (also) using queries containing many search terms.

  2. Advanced Secure Information Retrieval Technology for Multilayer Information Extraction

    Directory of Open Access Journals (Sweden)

    Shoude Chang

    2008-01-01

    Secure information retrieval technology aims at status identification and documentation authentication. Ideally, materials or devices used in these technologies should be hard to find, difficult to counterfeit, and as simple as possible. This manuscript addresses a novel information retrieval technology, with photoluminescent (PL) semiconductor quantum dots (QDs) synthesized via wet chemistry approaches used as its coding materials. Conceptually, these QDs are designed to exhibit emission at Fraunhofer line positions, namely, black lines in the solar spectrum; thus, the retrieval system can extract useful information in sunlit areas. Furthermore, multiphoton excitation (MPE) technology enables the retrieval system to perform multilayer information extraction, with thin films consisting of QDs applied to various substrates, such as military helmets, vehicles, and fingernails. Anticipated applications include security, military, and law enforcement. QD-based security information can be easily destroyed by preset expiration in the presence of timing agents.

  3. Multilevel resistive information storage and retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Lohn, Andrew; Mickel, Patrick R.

    2016-08-09

    The present invention relates to resistive random-access memory (RRAM or ReRAM) systems, as well as methods of employing multiple state variables to form degenerate states in such memory systems. The methods herein allow for precise write and read steps to form multiple state variables, and these steps can be performed electrically. Such an approach allows for multilevel, high density memory systems with enhanced information storage capacity and simplified information retrieval.

  4. Applying Semantic Web technologies to improve the retrieval, credibility and use of health-related web resources.

    Science.gov (United States)

    Mayer, Miguel A; Karampiperis, Pythagoras; Kukurikos, Antonis; Karkaletsis, Vangelis; Stamatakis, Kostas; Villarroel, Dagmar; Leis, Angela

    2011-06-01

    The number of health-related websites is increasing day-by-day; however, their quality is variable and difficult to assess. Various "trust marks" and filtering portals have been created in order to assist consumers in retrieving quality medical information. Consumers are using search engines as the main tool to get health information; however, the major problem is that the meaning of the web content is not machine-readable in the sense that computers cannot understand words and sentences as humans can. In addition, trust marks are invisible to search engines, thus limiting their usefulness in practice. During the last five years there have been different attempts to use Semantic Web tools to label health-related web resources to help internet users identify trustworthy resources. This paper discusses how Semantic Web technologies can be applied in practice to generate machine-readable labels and display their content, as well as to empower end-users by providing them with the infrastructure for expressing and sharing their opinions on the quality of health-related web resources.

  5. Indexing and retrieving Web documents as direct manipulation of images

    Science.gov (United States)

    Ferri, Fernando; Grifoni, Patrizia; Mussio, Piero; Padula, Marco

    2000-12-01

    The rapid growth of network communication through the World Wide Web has encouraged a large diffusion of connections to the Internet, due to the heavily interactive services which are offered for accessing, using and producing the incredible mass of information and more general resources which is now available. People communicating in this environment are usually end users who are not skilled in computer science and are experienced in a specific area; they are generally interested in searching, producing information, and accessibility. The phenomenon of the World Wide Web is producing a significant change in the concept of document, which is becoming strongly visual and dynamically arranged. A document is an image, and an image is a document. This change requires a new approach in presenting, authoring, indexing and querying a web document. In the paper we propose a visual language defined to reach the previously introduced goals, discussing the case of an Information Base containing clinical data. Notwithstanding the amount and the heterogeneity of the data available, it is quite difficult to access truly interesting information and to suitably exploit it; this is due to the poor usability of tools which offer an interaction style still limited with respect to WIMP (Window, Icons, Menu, Pointers) interfaces and to the indexing techniques usually adopted to organize the web pages by means of robots and search engines.

  6. Relating the new language models of information retrieval to the traditional retrieval models

    NARCIS (Netherlands)

    Hiemstra, Djoerd; de Vries, A.P.

    During the last two years, exciting new approaches to information retrieval were introduced by a number of different research groups that use statistical language models for retrieval. This paper relates the retrieval algorithms suggested by these approaches to widely accepted retrieval algorithms

  7. Semantic knowledge representation for information retrieval

    CERN Document Server

    Gödert, Winfried; Nagelschmidt, Matthias

    2014-01-01

    This book covers the basics of semantic web technologies and indexing languages, and describes their contribution to improve languages as a tool for subject queries and knowledge exploration. The book is relevant to information scientists, knowledge workers and indexers. It provides a suitable combination of theoretical foundations and practical applications.

  8. Folksonomies indexing and retrieval in web 2.0

    CERN Document Server

    Peters, Isabella

    2009-01-01

    In Web 2.0 users not only make heavy use of Collaborative Information Services in order to create, publish and share digital information resources - what is more, they index and represent these resources via own keywords, so-called tags. The sum of this user-generated metadata of a Collaborative Information Service is also called Folksonomy. In contrast to professionally created and highly structured metadata, e.g. subject headings, thesauri, classification systems or ontologies, which are applied in libraries, corporate information architectures or commercial databases and which were deve

  9. Information Retrieval Using a Middleware Approach

    Directory of Open Access Journals (Sweden)

    Danijela Boberić Krstićev

    2013-03-01

    This paper explores the use of a mediator/wrapper approach to enable the search of an existing library management system using different information retrieval protocols. It proposes an architecture for a software component that will act as an intermediary between the library system and search services. It provides an overview of different approaches to adding Z39.50 and Search/Retrieval via URL (SRU) functionality using a middleware approach that is implemented on the BISIS library management system. That wrapper transforms Contextual Query Language (CQL) into the Lucene query language. The primary aim of this software component is to enable search and retrieval of bibliographic records using the SRU and Z39.50 protocols, but the proposed architecture of the software components is also suitable for inclusion of the existing library management system into a library portal. The software component provides a single interface to server-side protocols for search and retrieval of records. Additional protocols could be used. This paper provides a practical demonstration of interest to developers of library management systems and those who are trying to use open-source solutions to make their local catalog accessible to other systems.
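
    As a rough illustration of the CQL-to-Lucene transformation this abstract mentions, the sketch below handles only a flat, space-separated subset of CQL (`index = term` clauses joined by `and`, `or`, `not`). It is not the BISIS wrapper: the `FIELD_MAP` entries and the function name `cql_to_lucene` are hypothetical, and parentheses, other relations, and proximity operators are left out.

```python
import re

# Hypothetical mapping from CQL index names to Lucene field names.
FIELD_MAP = {"dc.title": "title", "dc.creator": "author"}

# A token is either a double-quoted phrase or a run of non-space characters.
TOKEN = re.compile(r'"[^"]*"|\S+')

def cql_to_lucene(cql):
    """Translate a flat CQL query into Lucene classic query syntax."""
    tokens = TOKEN.findall(cql)
    out = []
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok.lower() in ("and", "or", "not"):
            # Lucene expects upper-case boolean operators.
            out.append(tok.upper())
            i += 1
        elif i + 2 < len(tokens) and tokens[i + 1] == "=":
            # "index = term" becomes "field:term".
            field = FIELD_MAP.get(tok, tok)
            out.append(f"{field}:{tokens[i + 2]}")
            i += 3
        else:
            # Bare term: pass through unchanged.
            out.append(tok)
            i += 1
    return " ".join(out)
```

    For example, `cql_to_lucene('dc.title = "information retrieval" and dc.creator = smith')` yields `title:"information retrieval" AND author:smith`. A production wrapper would more likely build the query from a parsed CQL tree than from a token scan.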

  10. Cross-view Embeddings for Information Retrieval

    OpenAIRE

    GUPTA, PARTH ALOKKUMAR

    2017-01-01

    In this dissertation, we deal with the cross-view tasks related to information retrieval using embedding methods. We study existing methodologies and propose new methods to overcome their limitations. We formally introduce the concept of mixed-script IR, which deals with the challenges faced by an IR system when a language is written in different scripts because of various technological and sociological factors. Mixed-script terms are represented by a small and finite feature space c...

  11. Statistical Language Models and Information Retrieval: Natural Language Processing Really Meets Retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; de Jong, Franciska M.G.

    2001-01-01

    Traditionally, natural language processing techniques for information retrieval have always been studied outside the framework of formal models of information retrieval. In this article, we introduce a new formal model of information retrieval based on the application of statistical language models.
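
    A minimal sketch of ranking with statistical language models follows. It assumes a unigram query-likelihood score with linear interpolation between document and collection statistics; the weight `lam` and the function name are illustrative choices, and the article's own model may differ in detail.

```python
import math
from collections import Counter

def query_log_likelihood(query, doc, collection, lam=0.5):
    """Log-probability of the query under a smoothed unigram document model.

    Assumption: P(t|d) is mixed with P(t|collection) using weight `lam`.
    """
    doc_tf = Counter(doc)
    col_tf = Counter(collection)
    score = 0.0
    for term in query:
        p = lam * doc_tf[term] / len(doc) + (1 - lam) * col_tf[term] / len(collection)
        if p == 0.0:
            # Term unseen in the whole collection: the query has zero probability.
            return float("-inf")
        score += math.log(p)
    return score
```

    Documents are then ranked by this score in descending order; the collection term keeps a document from scoring zero merely because it lacks one query term.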

  12. Cognitive approach to information retrieval and communication

    Directory of Open Access Journals (Sweden)

    Saša Zupanič

    1997-01-01

    The cognitive approach (viewpoint/standpoint) in the retrieval and communication of information, as well as in librarianship and information science, started gaining importance in the 1970s. Today, it is present in literary and objective knowledge studies, as well as in studies of users, information brokers and systems of information retrieval. The cognitive approach exercises a strong impact on several scientific disciplines which are grouped under the roof of cognitive science. The cognitive approach has caused a split and the formation of a new paradigm, i.e. the cognitive paradigm, in many scientific disciplines. In the frames of the definition of Kuhn's concept of paradigm, it is evident that librarianship and information science are on the pre-paradigmatic level. However, some authors mention the existence of at least two paradigms in library and information science, i.e. the physical and the cognitive paradigm. The historical overview of cognitively oriented research works of Brookes, De Mey, Belkin, Ingwersen and others enables insight into the development of library and information scientific thought up to the present.

  13. Geosemantic Information Retrieval Using a Geoontology

    Science.gov (United States)

    Hwang, J.

    2014-12-01

    Currently, as a wide range of mobile terminals has become available through advances in a variety of techniques, most users prefer the more convenient and dynamic mobile information retrieval services to existing desktop PC services confined to a limited space. Information retrieval on mobile terminals has the strength of providing personalized results related to the users' information requests anytime and anywhere, taking the users' mobility and portability into account. Therefore, information retrieval using mobile devices needs the context awareness techniques that have been actively researched. In this thesis, I developed a context awareness ontology model for Geotourism, as a representative context awareness technique, to predict the user's interest and foresee which retrieval results and which places the user wants. The proposed Geotour ontology model is extended and designed from the W3C Time Ontology defined in the international standards and the spatial geometry feature ontology supported by OGC GeoSPARQL, so it can provide the usability and the function. That is, the GeotourFeature class is a subclass of ogc:Feature defined in OGC, as in Figure 1. The GeotourTime class, which expresses the temporal information of a given Geotour feature, is a subclass of TemporalThing of W3C. Figure 1: Relationship between the international standard ontology and the geotour ontology model. A Geotour features and a geotour map describes a part of ontology to represent the GeotourFeature composed of GeotourTime class and GeotourLocation class. The highest class to represent GeotourTime and GeotourLocation is the GeotourFeature class. As mentioned in the previous section, our model inherited the temporal ontology of W3C. Figure 2 describes a part of ontology to represent the GeotourFeature composed of GeotourTime class and GeotourLocation class. The highest class

  14. Practice of information retrieval technique and limitation of database usage (2) - Limitation of retrieval and retrieval by information broker -

    Science.gov (United States)

    Suzuki, Shigekazu

    Nowadays everyone can enjoy information retrieval, thanks to the advancement of computer usage in searching the literature of science and technology. Regarding the limitations of database usage these days, the author divided searchers into four generations according to their degree of skill in retrieval techniques. In this paper, the author considered blind spots from the viewpoint of retrieval by searchers from the third generation onward, who started to have suspicions about the content of databases, and emphasised the following three blind spots that we cannot afford to overlook: 1) limitation on input to database, 2) limitation of keyword and code search in computer searching, 3) limitation on fitness evaluation of retrieved results.

  15. Image Information Retrieval: An Overview of Current Research

    OpenAIRE

    Goodrum, Abby A.

    2000-01-01

    This paper provides an overview of current research in image information retrieval and provides an outline of areas for future research. The approach is broad and interdisciplinary and focuses on three aspects of image research (IR): text-based retrieval, content-based retrieval, and user interactions with image information retrieval systems. The review concludes with a call for image retrieval evaluation studies similar to TREC.

  16. A framework for medical visual information exchange on the web.

    Science.gov (United States)

    Carro, Silvio Antonio; Scharcanski, Jacob

    2006-04-01

    The web has become such an extensive health information repository in the world that it is increasingly difficult to search for relevant medical information. Most medical information available on the web is not peer reviewed, and is retrieved imprecisely by current web search mechanisms (i.e. based on keywords). This paper presents the MedISeek metadata model that allows one to describe medical visual information (i.e. medical images) of different modalities, including their properties, components, relationships and authorship. The model uses the web architecture and supports the international classification of diseases and related health problems (i.e. ICD-10). An RDF schema (Resource Description Framework (RDF), http://www.w3.org/RDF/.) derived from this metadata model is integrated to each medical image, and specifies the semantics of each property in the image. Thus, relevant information can be extracted directly from the images, and data integrity is better preserved in the web. A prototype, presented here, has been built to validate the metadata model, and the mechanism for medical visual information exchange on the web. Our preliminary experimental results indicate that authorized users of our system have been able to describe, store and retrieve medical images and their associated diagnostic information.

  17. 108 Information Retrieval Methods in Libraries and Information ...

    African Journals Online (AJOL)

    User

    the organization of library materials and their recordings for use by readers came into being a little more than a century ago. Today's information professionals should know and be conversant with the traditional information retrieval tools and methods like classification, cataloguing, and vocabulary control as well as the ...

  18. EFFICACIOUS GEOSPATIAL INFORMATION RETRIEVAL USING DENSITY PROBABILISTIC DOCUMENT CORRELATION APPROACH

    OpenAIRE

    Uma, R.; Muneeswaran

    2013-01-01

    Information Retrieval (IR) is a profound technique to find information that addresses the need of a query. Processing of normal text is easier and information can be retrieved efficiently. There are plenty of algorithms in hand to carry out normal text retrieval. Retrieving geospatial information, by contrast, is very complex and requires additional operations, since geospatial data contain more complex details than general data, such as location and direction. To handle geographical quer...

  19. The Nuclear Science References (NSR) Database and Web Retrieval System

    CERN Document Server

    Pritychenko, B; Kellett, M A; Singh, B; Totans, J

    2011-01-01

    The Nuclear Science References (NSR) database, and associated Web interface, is the world's only comprehensive source of easily accessible low- and intermediate-energy nuclear physics bibliographic information for more than 200,000 articles since the beginning of nuclear science. The weekly-updated NSR database provides essential support for nuclear data evaluation, compilation and research activities. The principles of the database and Web application development and maintenance are described. Examples of nuclear structure, reaction and decay applications are specifically included. The complete NSR database is freely available at the websites of the National Nuclear Data Center http://www.nndc.bnl.gov/nsr and the International Atomic Energy Agency http://www-nds.iaea.org/nsr.

  20. The Nuclear Science References (NSR) database and Web Retrieval System

    Science.gov (United States)

    Pritychenko, B.; Běták, E.; Kellett, M. A.; Singh, B.; Totans, J.

    2011-06-01

    The Nuclear Science References (NSR) database together with its associated Web interface is the world's only comprehensive source of easily accessible low- and intermediate-energy nuclear physics bibliographic information for more than 200,000 articles since the beginning of nuclear science. The weekly updated NSR database provides essential support for nuclear data evaluation, compilation and research activities. The principles of the database and Web application development and maintenance are described. Examples of nuclear structure, reaction and decay applications are specifically included. The complete NSR database is freely available at the websites of the National Nuclear Data Center http://www.nndc.bnl.gov/nsr and the International Atomic Energy Agency http://www-nds.iaea.org/nsr.

  1. Information Retrieval and Text Mining Technologies for Chemistry.

    Science.gov (United States)

    Krallinger, Martin; Rabal, Obdulia; Lourenço, Anália; Oyarzabal, Julen; Valencia, Alfonso

    2017-06-28

    Efficient access to chemical information contained in scientific literature, patents, technical reports, or the web is a pressing need shared by researchers and patent attorneys from different chemical disciplines. Retrieval of important chemical information in most cases starts with finding relevant documents for a particular chemical compound or family. Targeted retrieval of chemical documents is closely connected to the automatic recognition of chemical entities in the text, which commonly involves the extraction of the entire list of chemicals mentioned in a document, including any associated information. In this Review, we provide a comprehensive and in-depth description of fundamental concepts, technical implementations, and current technologies for meeting these information demands. A strong focus is placed on community challenges addressing systems performance, more particularly CHEMDNER and CHEMDNER patents tasks of BioCreative IV and V, respectively. Considering the growing interest in the construction of automatically annotated chemical knowledge bases that integrate chemical information and biological data, cheminformatics approaches for mapping the extracted chemical names into chemical structures and their subsequent annotation together with text mining applications for linking chemistry with biological information are also presented. Finally, future trends and current challenges are highlighted as a roadmap proposal for research in this emerging field.

  2. Graph-Based Interactive Bibliographic Information Retrieval Systems

    Science.gov (United States)

    Zhu, Yongjun

    2017-01-01

    In the big data era, we have witnessed the explosion of scholarly literature. This explosion has imposed challenges to the retrieval of bibliographic information. Retrieval of intended bibliographic information has become challenging due to the overwhelming search results returned by bibliographic information retrieval systems for given input…

  3. Random walk term weighting for information retrieval

    DEFF Research Database (Denmark)

    Blanco, R.; Lioma, Christina

    2007-01-01

    We present a way of estimating term weights for Information Retrieval (IR), using term co-occurrence as a measure of dependency between terms. We use the random walk graph-based ranking algorithm on a graph that encodes terms and co-occurrence dependencies in text, from which we derive term weights...... that represent a quantification of how a term contributes to its context. Evaluation on two TREC collections and 350 topics shows that the random walk-based term weights perform at least comparably to the traditional tf-idf term weighting, while they outperform it when the distance between co-occurring terms...
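
    The idea of deriving term weights from a random walk over a co-occurrence graph can be sketched as follows. This is not the authors' exact method: the window size, the undirected unweighted graph, the damping factor, and both function names are assumptions made for illustration, using a PageRank-style iteration.

```python
from collections import defaultdict

def cooccurrence_graph(tokens, window=2):
    """Link each term to the terms that follow it within `window` positions."""
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return graph

def random_walk_weights(graph, damping=0.85, iterations=50):
    """PageRank-style scores on an undirected, unweighted term graph."""
    n = len(graph)
    scores = {t: 1.0 / n for t in graph}
    for _ in range(iterations):
        # Each term receives an equal share of every neighbour's current score.
        scores = {
            t: (1 - damping) / n
            + damping * sum(scores[u] / len(graph[u]) for u in graph[t])
            for t in graph
        }
    return scores
```

    Terms that co-occur with many distinct neighbours accumulate higher scores, which is one way to quantify how much a term contributes to its context.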

  4. Web metrics for library and information professionals

    CERN Document Server

    Stuart, David

    2014-01-01

    This is a practical guide to using web metrics to measure impact and demonstrate value. The web provides an opportunity to collect a host of different metrics, from those associated with social media accounts and websites to more traditional research outputs. This book is a clear guide for library and information professionals as to what web metrics are available and how to assess and use them to make informed decisions and demonstrate value. As individuals and organizations increasingly use the web in addition to traditional publishing avenues and formats, this book provides the tools to unlock web metrics and evaluate the impact of this content. The key topics covered include: bibliometrics, webometrics and web metrics; data collection tools; evaluating impact on the web; evaluating social media impact; investigating relationships between actors; exploring traditional publications in a new environment; web metrics and the web of data; the future of web metrics and the library and information professional. Th...

  5. GIS information organization based on the Semantic Geospatial Web

    Science.gov (United States)

    Li, Shuxia; Su, Xuming; Li, Ke

    2008-10-01

    People typically use geographic names instead of coordinates to find geographic information on the web through a search engine. But the current keyword-based web search engines are poorly adapted to help people find information that relates to a particular geographic name, because they don't incorporate the geospatial semantic during the search process. The Semantic Web is a new semantic-based information-retrieval environment. We propose the information organization framework of the GIS semantic data according to the architecture of the Semantic Web, that is, the ontology, the metadata and the data source. Then we deal with the organization of the semantic data based on the three-layered framework respectively. As a focus, we present a novel method to disambiguate geographical name based on the ontology of the place.

  6. Exploiting semantic linkages among multiple sources for semantic information retrieval

    Science.gov (United States)

    Li, JianQiang; Yang, Ji-Jiang; Liu, Chunchen; Zhao, Yu; Liu, Bo; Shi, Yuliang

    2014-07-01

    The vision of the Semantic Web is to build a global Web of machine-readable data to be consumed by intelligent applications. As the first step to make this vision come true, the initiative of linked open data has fostered many novel applications aimed at improving data accessibility in the public Web. By comparison, the enterprise environment is so different from the public Web that most potentially usable business information originates in an unstructured form (typically in free text), which poses a challenge for the adoption of semantic technologies in the enterprise environment. Considering that the business information in a company is highly specific and centred around a set of commonly used concepts, this paper describes a pilot study to migrate the concept of linked data into the development of a domain-specific application, i.e. the vehicle repair support system. The set of commonly used concepts, including the part names of a car and the phenomenon terms used in car repair, are employed to build the linkage between data and documents distributed among different sources, leading to the fusion of documents and data across source boundaries. Then, we describe the approaches of semantic information retrieval to consume these linkages for value creation for companies. The experiments on two real-world data sets show that the proposed approaches outperform the best baseline by 6.3-10.8% and 6.4-11.1% in terms of top five and top 10 precisions, respectively. We believe that our pilot study can serve as an important reference for the development of similar semantic applications in an enterprise environment.
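
    The top five and top 10 precision figures quoted in this abstract refer to the standard precision-at-k measure. A minimal sketch, with hypothetical document identifiers and function name:

```python
def precision_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the top-k ranked documents that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked_ids[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant_ids)
    # Dividing by k (not by len(top_k)) penalises rankings shorter than k.
    return hits / k
```

    For instance, if two of the first five results are relevant, the top five precision is 0.4.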

  7. Information Architecture for the Web: The IA Matrix Approach to Designing Children's Portals.

    Science.gov (United States)

    Large, Andrew; Beheshti, Jamshid; Cole, Charles

    2002-01-01

    Presents a matrix that can serve as a tool for designing the information architecture of a Web portal in a logical and systematic manner. Highlights include interfaces; metaphors; navigation; interaction; information retrieval; and an example of a children's Web portal to provide access to museum information. (Author/LRW)

  8. Survey the role of emotions in information retrieval

    Directory of Open Access Journals (Sweden)

    Hassan Behzadi

    2016-03-01

    The present study was conducted to identify the users' emotions in various stages of information retrieval based on the information retrieval model in the web. From the methodological perspective, the present study is experimental, and the type of study is practical. The population comprised all MA students majoring in different humanistic science branches and studying at Imam Reza international university. The sample of this research consisted of 30 participants. The sample size was determined through stratified random sampling via G*power software. Data collection was carried out by using a questionnaire on demographics and prior experience of using the internet, a post-search questionnaire and recorded videos of users' faces. The findings of the study demonstrated that: 1) during the initial stages of searching, the emotion of apprehension, and in general during the link tracking stage the negative emotions, with an overall 49.3 percent, were more frequent than the other emotions; in the browsing and differentiation stages, the emotion of happiness was more frequent than the other emotions. 2) These variances resulted in significant relations among different emotions of the users throughout the four stages of information retrieval. 3) In simple search, the respondents displayed the emotion of happiness most frequently and the emotion of aversion least frequently. On the other hand, in complicated search, apprehension and aversion were the most and the least frequently cited emotions, respectively. Overall, the negative emotions were reported more frequently in complicated search in comparison with the simple search. This demonstrated that any change in the difficulty level of the search task would cause users to exhibit different types of emotions.

  9. Sexual information seeking on web search engines.

    Science.gov (United States)

    Spink, Amanda; Koricich, Andrew; Jansen, B J; Cole, Charles

    2004-02-01

    Sexual information seeking is an important element within human information behavior. Seeking sexually related information on the Internet takes many forms and channels, including chat rooms discussions, accessing Websites or searching Web search engines for sexual materials. The study of sexual Web queries provides insight into sexually-related information-seeking behavior, of value to Web users and providers alike. We qualitatively analyzed queries from logs of 1,025,910 Alta Vista and AlltheWeb.com Web user queries from 2001. We compared the differences in sexually-related Web searching between Alta Vista and AlltheWeb.com users. Differences were found in session duration, query outcomes, and search term choices. Implications of the findings for sexual information seeking are discussed.

  10. A framework for efficient spatial web object retrieval

    DEFF Research Database (Denmark)

    Wu, Dinging; Cong, Gao; Jensen, Christian S.

    2012-01-01

    into account both location proximity and text relevancy. This paper proposes a new indexing framework for top-k spatial text retrieval. The framework leverages the inverted file for text retrieval and the R-tree for spatial proximity querying. Several indexing approaches are explored within this framework....... The framework encompasses algorithms that utilize the proposed indexes for computing location-aware as well as region-aware top-k text retrieval queries, thus taking into account both text relevancy and spatial proximity to prune the search space. Results of empirical studies with an implementation...
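
    To make the notion of location-aware top-k text retrieval concrete, here is a brute-force baseline sketch. It uses an inverted file to collect candidates but no R-tree, and it ranks by an assumed weighted sum of proximity and term overlap; the scoring formula, `alpha`, `max_dist`, and the function names are illustrative assumptions, not the indexing framework proposed in the paper.

```python
import heapq
import math
from collections import defaultdict

def build_inverted_file(objects):
    """Map each term to the ids of the objects whose text contains it."""
    index = defaultdict(set)
    for obj_id, (_, _, text) in objects.items():
        for term in text.lower().split():
            index[term].add(obj_id)
    return index

def top_k(objects, index, query_xy, query_terms, k, alpha=0.5, max_dist=10.0):
    """Rank candidates by an assumed weighted sum of proximity and term overlap."""
    # Only objects that match at least one query term are scored.
    candidates = set().union(*(index.get(t, set()) for t in query_terms))
    scored = []
    for obj_id in candidates:
        x, y, text = objects[obj_id]
        dist = math.hypot(x - query_xy[0], y - query_xy[1])
        proximity = max(0.0, 1.0 - dist / max_dist)
        words = set(text.lower().split())
        relevance = sum(t in words for t in query_terms) / len(query_terms)
        scored.append((alpha * proximity + (1 - alpha) * relevance, obj_id))
    return [obj_id for _, obj_id in heapq.nlargest(k, scored)]
```

    A spatial index such as the R-tree named in the abstract would replace the linear scan over candidates, so that distant objects can be pruned without being scored.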

  11. Improving Concept-Based Web Image Retrieval by Mixing Semantically Similar Greek Queries

    Science.gov (United States)

    Lazarinis, Fotis

    2008-01-01

    Purpose: Image searching is a common activity for web users. Search engines offer image retrieval services based on textual queries. Previous studies have shown that web searching is more demanding when the search is not in English and does not use a Latin-based language. The aim of this paper is to explore the behaviour of the major search…

  12. Information Architecture for Bilingual Web Sites.

    Science.gov (United States)

    Cunliffe, Daniel; Jones, Helen; Jarvis, Melanie; Egan, Kevin; Huws, Rhian; Munro, Sian

    2002-01-01

    Discusses creating an information architecture for a bilingual Web site and reports work in progress on the development of a content-based bilingual Web site to facilitate shared resources between speech and language therapists. Considers a structural analysis of existing bilingual Web designs and explains a card-sorting activity conducted with…

  13. A semantic medical multimedia retrieval approach using ontology information hiding.

    Science.gov (United States)

    Guo, Kehua; Zhang, Shigeng

    2013-01-01

    Searching useful information from unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users' query intent. Firstly, semantic annotations will be given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represented semantic information will be hidden in the head of the multimedia documents. The main innovations of this approach are cross-type retrieval support and semantic information preservation. Experimental results indicate a good precision and efficiency of our approach for medical multimedia retrieval in comparison with some traditional approaches.

  14. A Semantic Medical Multimedia Retrieval Approach Using Ontology Information Hiding

    Science.gov (United States)

    Guo, Kehua; Zhang, Shigeng

    2013-01-01

    Searching useful information from unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users' query intent. Firstly, semantic annotations will be given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represented semantic information will be hidden in the head of the multimedia documents. The main innovations of this approach are cross-type retrieval support and semantic information preservation. Experimental results indicate a good precision and efficiency of our approach for medical multimedia retrieval in comparison with some traditional approaches. PMID:24082915

  15. Visualization of database structures for information retrieval

    Directory of Open Access Journals (Sweden)

    Grete Lisbjerg Jensen

    1994-12-01

    This paper describes the Book House system, which is designed to support children's information retrieval in libraries as part of their education. It is a shareware program available on CD-ROM or floppy disks, and comprises functionality for database searching as well as for classifying and storing book information in the database. The system concept is based on an understanding of children's domain structures and their capabilities for categorization of information needs in connection with their activities in schools, in school libraries or in public libraries. These structures are visualized in the interface by using metaphors and multimedia technology. Through the use of text, images and animation, the Book House encourages children - even at a very early age - to learn by doing in an enjoyable way, which plays on their previous experiences with computer games. Both words and pictures can be used for searching; this makes the system suitable for all age groups. Even children who have not yet learned to read properly can, by selecting pictures, search for and find those books they would like to have read aloud. Thus, at the very beginning of their school life, they can learn to search for books on their own. For the library community, such a system will provide an extended service which will increase the number of children's own searches and also improve the relevance, quality and utilization of the book collections in the libraries. A market research report on the need for an annual indexing service for books in the Book House format is in preparation by the Danish Library Centre A/S.

  16. Search Result Caching in Peer-to-Peer Information Retrieval Networks

    NARCIS (Netherlands)

    Tigelaar, A.S.; Hiemstra, Djoerd; Trieschnigg, Rudolf Berend

    2011-01-01

    For peer-to-peer web search engines it is important to quickly process queries and return search results. How to keep the perceived latency low is an open challenge. In this paper we explore the solution potential of search result caching in large-scale peer-to-peer information retrieval networks by

  17. Scatter Matters: Regularities and Implications for the Scatter of Healthcare Information on the Web

    OpenAIRE

    Bhavnani, Suresh K.; Peck, Frederick A.

    2010-01-01

    Despite the development of huge healthcare Web sites and powerful search engines, many searchers end their searches prematurely with incomplete information. Recent studies suggest that users often retrieve incomplete information because of the complex scatter of relevant facts about a topic across Web pages. However, little is understood about regularities underlying such information scatter. To probe regularities within the scatter of facts across Web pages, this article presents the results...

  18. SPIRS: a Web-based image retrieval system for large biomedical databases.

    Science.gov (United States)

    Hsu, William; Antani, Sameer; Long, L Rodney; Neve, Leif; Thoma, George R

    2009-04-01

    With the increasing use of images in disease research, education, and clinical medicine, the need for methods that effectively archive, query, and retrieve these images by their content is underscored. This paper describes the implementation of a Web-based retrieval system called SPIRS (Spine Pathology & Image Retrieval System), which permits exploration of a large biomedical database of digitized spine X-ray images and data from a national health survey using a combination of visual and textual queries. SPIRS is a generalizable framework that consists of four components: a client applet, a gateway, an indexing and retrieval system, and a database of images and associated text data. The prototype system is demonstrated using text and imaging data collected as part of the second U.S. National Health and Nutrition Examination Survey (NHANES II). Users search the image data by providing a sketch of the vertebral outline or selecting an example vertebral image and some relevant text parameters. Pertinent pathology on the image/sketch can be annotated and weighted to indicate importance. During the course of development, we explored different algorithms to perform functions such as segmentation, indexing, and retrieval. Each algorithm was tested individually and then implemented as part of SPIRS. To evaluate the overall system, we first tested the system's ability to return similar vertebral shapes from the database given a query shape. Initial evaluations using visual queries only (no text) have shown that the system achieves up to 68% accuracy in finding images in the database that exhibit similar abnormality type and severity. Relevance feedback mechanisms have been shown to increase accuracy by an additional 22% after three iterations. While we primarily demonstrate this system in the context of retrieving vertebral shape, our framework has also been adapted to search a collection of 100,000 uterine cervix images to study the progression of cervical cancer. 
SPIRS is

  19. A User-Oriented Approach to Music Information Retrieval

    OpenAIRE

    Lesaffre, Micheline; Leman, Marc; Martens, Jean-Pierre

    2006-01-01

    Search and retrieval of specific musical content (e.g. emotion, melody) has become an important aspect of system development but only little research is user-oriented. The success of music information retrieval technology primarily depends on both assessing and meeting the needs of its users. Potential users of music information retrieval systems, however, draw upon various ways of expressing themselves. But, who are the potential users of MIR systems and how would they describe music qualiti...

  20. Informational Value of Museum Web Sites

    OpenAIRE

    Kravchyna, Victoria; Hastings, Sam

    2002-01-01

    What information are virtual visitors looking for on museum Web sites? This paper is a first step in a larger investigation into the informational value of museum Web sites. Scholars, teachers, students, museum staff, and museum visitors are the main categories of visitors examined in this study. Questions were asked of these museum audiences about their use of museum Web sites, museum databases, and other aspects of virtual visits.

  1. Online learning to rank for information retrieval: SIGIR 2016 tutorial

    NARCIS (Netherlands)

    Grotov, A.; de Rijke, M.

    2016-01-01

    During the past 10-15 years offline learning to rank has had a tremendous influence on information retrieval, both scientifically and in practice. Recently, as the limitations of offline learning to rank for information retrieval have become apparent, there is increased attention for online

  2. Problems of Music Information Retrieval in the Real World.

    Science.gov (United States)

    Byrd, Donald; Crawford, Tim

    2002-01-01

    Considers some of the most fundamental problems in music information retrieval, challenging the common assumption that searching on pitch alone is likely to be satisfactory for all purposes. Discusses special issues related to polyphonic music, user-interface issues, and the notion of relevance for music information retrieval. (Contains 52…

  3. Innovations in information retrieval perspectives for theory and practice

    CERN Document Server

    Foster, Allen

    2011-01-01

    The advent of various information retrieval (IR) technologies and approaches to storage and retrieval provides communities with opportunities for mass documentation, digitization, and the recording of information in different forms. This book introduces and contextualizes these developments and looks at supporting research in IR.

  4. The Human-Computer Interface for Information Retrieval.

    Science.gov (United States)

    Shaw, Debora

    1991-01-01

    Discusses the human-computer interface as it relates to information technology and retrieval. Principles of interface design are examined, including visual display features and help messages; information retrieval applications are described, including online searching, CD-ROM, online public access catalogs (OPACs), and full-text databases; and…

  5. Science information systems: Archive, access, and retrieval

    Science.gov (United States)

    Campbell, William J.

    1991-01-01

    The objective of this research is to develop technology for the automated characterization and interactive retrieval and visualization of very large, complex scientific data sets. Technologies will be developed for the following specific areas: (1) rapidly archiving data sets; (2) automatically characterizing and labeling data in near real-time; (3) providing users with the ability to browse contents of databases efficiently and effectively; (4) providing users with the ability to access and retrieve system independent data sets electronically; and (5) automatically alerting scientists to anomalies detected in data.

  6. Factors influencing evaluations of web site information.

    Science.gov (United States)

    Amsbary, Jonathan Howard; Powell, Larry

    2003-08-01

    This study investigated the effect of first-person and third-person perceptions of web site information. Responses from a telephone survey of 226 participants in a stratified random sample indicated that (1) most participants had higher evaluations for television news than for news received on the Internet; (2) a third-person effect was present in that most respondents generally thought that other people found the Internet easier to use than they did, and that other people were more likely to believe Internet information and trust the sources of Internet information than they would. Also, (3), evaluations of information on a particular web site could be increased by providing links to other web sites on the same topic. Perhaps links to other web sites may serve as either a "referencing" function or a social confirmation function to increase evaluations of web site information.

  7. Perspectives in CD-ROM for Information Storage and Retrieval.

    Science.gov (United States)

    Lunin, Lois F., Ed.; Schipma, Peter B., Ed.

    1988-01-01

    A series of six articles discusses the technology of optical data disks, current and possible future applications of this technology, their potential impact on information retrieval systems, and the potential problems as they apply to information science. (CLB)

  8. EPA's Information Architecture and Web Taxonomy

    Science.gov (United States)

    EPA's Information Architecture creates a topical organization of our website, instead of an ownership-based organization. The EPA Web Taxonomy allows audiences easy access to relevant information from EPA programs, by using a common vocabulary.

  9. Combining Information Sources for Video Retrieval

    NARCIS (Netherlands)

    Westerveld, T.H.W.; Ianeva, T.; Boldareva, L.; de Vries, A.P.; Hiemstra, Djoerd

    The previous video track results demonstrated that it is far from trivial to take advantage of multiple modalities for the video retrieval search task. For almost any query, results based on ASR transcripts have been better than any other run. This year’s main success in our runs is that a

  10. Retrieving top-k prestige-based relevant spatial web objects

    DEFF Research Database (Denmark)

    Cao, Xin; Cong, Gao; Jensen, Christian S.

    2010-01-01

    The location-aware keyword query returns ranked objects that are near a query location and that have textual descriptions that match query keywords. This query occurs inherently in many types of mobile and traditional web services and applications, e.g., Yellow Pages and Maps services. Previous... of prestige-based relevance to capture both the textual relevance of an object to a query and the effects of nearby objects. Based on this, a new type of query, the Location-aware top-k Prestige-based Text retrieval (LkPT) query, is proposed that retrieves the top-k spatial web objects ranked according... to both prestige-based relevance and location proximity. We propose two algorithms that compute LkPT queries. Empirical studies with real-world spatial data demonstrate that LkPT queries are more effective in retrieving web objects than a previous approach that does not consider the effects of nearby...

  11. Comparing cosmic web classifiers using information theory

    Science.gov (United States)

    Leclercq, Florent; Lavaux, Guilhem; Jasche, Jens; Wandelt, Benjamin

    2016-08-01

    We introduce a decision scheme for optimally choosing a classifier, which segments the cosmic web into different structure types (voids, sheets, filaments, and clusters). Our framework, based on information theory, accounts for the design aims of different classes of possible applications: (i) parameter inference, (ii) model selection, and (iii) prediction of new observations. As an illustration, we use cosmographic maps of web-types in the Sloan Digital Sky Survey to assess the relative performance of the classifiers T-WEB, DIVA and ORIGAMI for: (i) analyzing the morphology of the cosmic web, (ii) discriminating dark energy models, and (iii) predicting galaxy colors. Our study substantiates a data-supported connection between cosmic web analysis and information theory, and paves the path towards principled design of analysis procedures for the next generation of galaxy surveys. We have made the cosmic web maps, galaxy catalog, and analysis scripts used in this work publicly available.

  12. Automating Information Discovery Within the Invisible Web

    Science.gov (United States)

    Sweeney, Edwina; Curran, Kevin; Xie, Ermai

    A Web crawler or spider crawls through the Web looking for pages to index, and when it locates a new page it passes the page on to an indexer. The indexer identifies links, keywords, and other content and stores these within its database. This database is searched by entering keywords through an interface and suitable Web pages are returned in a results page in the form of hyperlinks accompanied by short descriptions. The Web, however, is increasingly moving away from being a collection of documents to a multidimensional repository for sounds, images, audio, and other formats. This is leading to a situation where certain parts of the Web are invisible or hidden. The term known as the "Deep Web" has emerged to refer to the mass of information that can be accessed via the Web but cannot be indexed by conventional search engines. The concept of the Deep Web makes searches quite complex for search engines. Google states that the claim that conventional search engines cannot find such documents as PDFs, Word, PowerPoint, Excel, or any non-HTML page is not fully accurate and steps have been taken to address this problem by implementing procedures to search items such as academic publications, news, blogs, videos, books, and real-time information. However, Google still only provides access to a fraction of the Deep Web. This chapter explores the Deep Web and the current tools available in accessing it.
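
    The crawler-to-indexer pipeline described above can be pictured with a small inverted index. This is only a sketch under simple assumptions: the pages, their URLs and the whitespace-and-punctuation tokenizer are invented for illustration, and a production indexer would also store links, positions and short descriptions.

```python
import re
from collections import defaultdict

def build_index(pages):
    """Map each lowercase keyword to the set of page URLs that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(url)
    return index

def search(index, query):
    """Return the URLs that contain every keyword in the query."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results

# Hypothetical pages handed over by a crawler.
pages = {
    "http://example.org/a": "Deep Web content hidden from crawlers",
    "http://example.org/b": "Search engines index the surface Web",
}
index = build_index(pages)
```

    Content that the crawler never reaches, or that the tokenizer cannot read, simply never enters `index`, which is the sense in which parts of the Web stay invisible to keyword search.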

  13. Understanding and Supporting Web Developers: Design and Evaluation of a Web Accessibility Information Resource (WebAIR).

    Science.gov (United States)

    Swallow, David; Petrie, Helen; Power, Christopher

    2016-01-01

    This paper describes the design and evaluation of a Web Accessibility Information Resource (WebAIR) for supporting web developers to create and evaluate accessible websites. WebAIR was designed with web developers in mind, recognising their current working practices and acknowledging their existing understanding of web accessibility. We conducted an evaluation with 32 professional web developers in which they used either WebAIR or an existing accessibility information resource, the Web Content Accessibility Guidelines, to identify accessibility problems. The findings indicate that several design decisions made in relation to the language, organisation, and volume of WebAIR were effective in supporting web developers to undertake web accessibility evaluations.

  14. Short Communication The New Information Retrieval Media and the ...

    African Journals Online (AJOL)

    First from the manual method, then to the use of computer software, retrieval is now made from full-text and on-line databases. This paper discusses the transition to these new information retrieval media and the challenges for Nigerian libraries to adopt the two key elements that propel it - computers and telecommunication ...

  15. An Evaluation of Automatically Constructed Hypertexts for Information Retrieval.

    Science.gov (United States)

    Melucci, Massimo

    1999-01-01

    Assesses the retrieval effectiveness of automatically constructed interdocument hypertext links in information retrieval (IR). Describes experiments using statistical and probabilistic techniques that were designed to obtain evidence concerning the usefulness of querying and browsing automatically constructed IR hypertexts. Results indicate a…

  16. Adapting a Diagnostic Problem-Solving Model to Information Retrieval.

    Science.gov (United States)

    Syu, Inien; Lang, S. D.

    2000-01-01

    Explains how a competition-based connectionist model for diagnostic problem-solving is adapted to information retrieval. Topics include probabilistic causal networks; Bayesian networks; the neural network model; empirical studies of test collections that evaluated retrieval performance; precision results; and the use of a thesaurus to provide…

  17. Understanding information retrieval systems management, types, and standards

    CERN Document Server

    Bates, Marcia J

    2011-01-01

    In order to be effective for their users, information retrieval (IR) systems should be adapted to the specific needs of particular environments. The huge and growing array of types of information retrieval systems in use today is on display in Understanding Information Retrieval Systems: Management, Types, and Standards, which addresses over 20 types of IR systems. These various system types, in turn, present both technical and management challenges, which are also addressed in this volume. In order to be interoperable in a networked environment, IR systems must be able to use various types of

  18. Care episode retrieval: distributional semantic models for information retrieval in the clinical domain.

    Science.gov (United States)

    Moen, Hans; Ginter, Filip; Marsi, Erwin; Peltonen, Laura-Maria; Salakoski, Tapio; Salanterä, Sanna

    2015-01-01

    Patients' health-related information is stored in electronic health records (EHRs) by health service providers. These records include sequential documentation of care episodes in the form of clinical notes. EHRs are used throughout the health care sector by professionals, administrators and patients, primarily for clinical purposes, but also for secondary purposes such as decision support and research. The vast amounts of information in EHR systems complicate information management and increase the risk of information overload. Therefore, clinicians and researchers need new tools to manage the information stored in the EHRs. A common use case is, given a (possibly unfinished) care episode, to retrieve the most similar care episodes among the records. This paper presents several methods for information retrieval, focusing on care episode retrieval, based on textual similarity, where similarity is measured through domain-specific modelling of the distributional semantics of words. Models include variants of random indexing and the semantic neural network model word2vec. Two novel methods are introduced that utilize the ICD-10 codes attached to care episodes to better induce domain-specificity in the semantic model. We report on experimental evaluation of care episode retrieval that circumvents the lack of human judgements regarding episode relevance. Results suggest that several of the methods proposed outperform a state-of-the-art search engine (Lucene) on the retrieval task.

  19. Sigma: Web Retrieval Interface for Nuclear Reaction Data

    Energy Technology Data Exchange (ETDEWEB)

    Pritychenko, B.; Sonzogni, A.A.

    2008-06-24

    The authors present Sigma, a Web-rich application which provides user-friendly access for processing and plotting the evaluated and experimental nuclear reaction data stored in the ENDF-6 and EXFOR formats. The main interface includes browsing using a periodic table and a directory tree, basic and advanced search capabilities, interactive plots of cross sections, angular distributions and spectra, comparisons between evaluated and experimental data, and computations between different cross section sets. Interactive energy-angle plots, neutron cross section uncertainty plots and visualization of covariance matrices are under development. Sigma is publicly available at the National Nuclear Data Center website at www.nndc.bnl.gov/sigma.

  20. On Region Algebras, XML Databases, and Information Retrieval

    NARCIS (Netherlands)

    Mihajlovic, V.; Hiemstra, Djoerd; Apers, Peter M.G.

    2003-01-01

    This paper describes some new ideas on developing a logical algebra for databases that manage textual data and support information retrieval functionality. We describe a first prototype of such a system.

  1. Vector space model for document representation in information retrieval

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2007-12-01

    This paper presents the basics of information retrieval: the vector space model for document representation with Boolean and term-weighted models, ranking methods based on the cosine factor, and evaluation measures: recall, precision and a combined measure.
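
    Cosine-based ranking in the vector space model can be sketched as follows. The raw term-frequency weighting and whitespace tokenization are simplifying assumptions chosen for brevity, not necessarily the weighting used in the paper.

```python
import math
from collections import Counter

def tf_vector(text):
    """Represent a text as a sparse vector of raw term frequencies."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine of the angle between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def rank(query, docs):
    """Return document indices ordered by decreasing cosine similarity to the query."""
    q = tf_vector(query)
    scored = [(cosine(q, tf_vector(d)), i) for i, d in enumerate(docs)]
    return [i for _, i in sorted(scored, key=lambda p: (-p[0], p[1]))]

docs = ["information retrieval model", "cooking recipes", "retrieval"]
order = rank("retrieval", docs)
```

    Because the cosine normalizes by vector length, the one-word document that matches the query exactly ranks above the longer document that merely contains the query term.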

  2. GRAMMAR RULE BASED INFORMATION RETRIEVAL MODEL FOR BIG DATA

    Directory of Open Access Journals (Sweden)

    T. Nadana Ravishankar

    2015-07-01

    Though Information Retrieval (IR) in big data has been an active field of research for the past few years, the popularity of native languages presents a unique challenge for big data information retrieval systems. There is a need to retrieve information that is present in English and display it in the native language for users. This aim of cross-language information retrieval is complicated by unique features of native languages such as morphology, compound word formation, spelling variation, ambiguity, synonymy, and the influence of other languages. To overcome some of these issues, the native language is modeled using a grammar-rule-based approach in this work. The advantage of this approach is that the native language is modeled and its unique features are encoded using a set of inference rules. This rule base, coupled with the customized ontological system, shows considerable potential and is found to yield better precision and recall.

  3. Comparative Analysis of Sparse Matrix Algorithms For Information Retrieval

    Directory of Open Access Journals (Sweden)

    Nazli Goharian

    2003-02-01

    We evaluate and compare the storage efficiency of different sparse matrix storage formats as index structures for text collections, and their corresponding sparse matrix-vector multiplication algorithms for query processing in information retrieval (IR) applications. We show the results of our implementations for several sparse matrix algorithms, namely Coordinate Storage (COO), Compressed Sparse Column (CSC), Compressed Sparse Row (CSR), and Block Sparse Row (BSR), using a standard text collection. Evaluation is based on the storage space requirement for each indexing structure and the efficiency of the query-processing algorithm. Our results demonstrate that CSR is more efficient in terms of storage space requirement and query processing time than the other sparse matrix algorithms for IR applications. Furthermore, we experimentally evaluate the mapping of various existing index compression techniques, used to compress indexes in IR systems, onto Compressed Sparse Row Information Retrieval (CSR IR).
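
    To make the CSR format concrete, the sketch below stores a small document-by-term matrix in CSR form and scores a query with a sparse matrix-vector product. The orientation (rows as documents, columns as terms) and the toy weights are assumptions for illustration; the paper's own layout and implementation may differ.

```python
def to_csr(dense):
    """Convert a dense row-major matrix into CSR arrays (values, col_idx, row_ptr)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for j, v in enumerate(row):
            if v != 0:
                values.append(v)      # non-zero weight
                col_idx.append(j)     # its column (term id)
        row_ptr.append(len(values))   # where the next row starts
    return values, col_idx, row_ptr

def csr_matvec(values, col_idx, row_ptr, x):
    """Multiply a CSR matrix by vector x, touching only the stored non-zeros."""
    y = []
    for i in range(len(row_ptr) - 1):
        total = 0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            total += values[k] * x[col_idx[k]]
        y.append(total)
    return y

# Hypothetical 3-document, 3-term weight matrix and a query using terms 0 and 2.
dense = [[1, 0, 2],
         [0, 0, 3],
         [4, 5, 0]]
values, col_idx, row_ptr = to_csr(dense)
scores = csr_matvec(values, col_idx, row_ptr, [1, 0, 1])
```

    Storage is one value and one column index per non-zero plus one pointer per row, which is why CSR is compact for the very sparse matrices that text collections produce.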

  4. Object-Centered Knowledge Representation and Information Retrieval.

    Science.gov (United States)

    Panyr, Jiri

    1996-01-01

    Discusses object-centered knowledge representation and information retrieval. Highlights include semantic networks; frames; predicative (declarative) and associative knowledge; cluster analysis; creation of subconcepts and superconcepts; automatic classification; hierarchies and pseudohierarchies; graph theory; term classification; clustering of…

  5. Using an Automatic Retrieval System in the Web To Assist Co-operative Learning.

    Science.gov (United States)

    Badue, Claudine; Vaz, Wesley; Albuquerque, Eduardo

    This paper presents an information agent and latent semantic-based indexing architecture to retrieve documents on the Internet. The system optimizes the search for documents on the Internet by automatically retrieving relevant links. The information used for the search can be obtained, for instance, from Internet browser caches and from grades of…

  6. Bibliographic information organization in the semantic web

    CERN Document Server

    Willer, Mirna

    2013-01-01

    New technologies will underpin the future generation of library catalogues. To facilitate their role providing information, serving users, and fulfilling their mission as cultural heritage and memory institutions, libraries must take a technological leap; their standards and services must be transformed to those of the Semantic Web. Bibliographic Information Organization in the Semantic Web explores the technologies that may power future library catalogues, and argues the necessity of such a leap. The text introduces international bibliographic standards and models, and fundamental concepts in

  7. Interface design for an audio based information retrieval system

    OpenAIRE

    Johnson, James Robert

    1992-01-01

    This project involves a telephone-based information retrieval system. Users interact with the computer by pressing buttons on a telephone keypad and listening to the computer respond by way of a speech synthesizer. The purpose of this project is to redesign and revise an existing information retrieval system. The goals of this project include simplifying the job of the menu designer and providing a way so experience can aid users to perform a given task faster than previously possible. Key...

  8. Semantic-Based Information Retrieval of Biomedical Data

    Energy Technology Data Exchange (ETDEWEB)

    Jiao, Yu [ORNL; Potok, Thomas E [ORNL; Hurson, Ali R. [Pennsylvania State University; Yan, Peng [Pennsylvania State University

    2006-01-01

    In this paper, we propose to improve the effectiveness of biomedical information retrieval via a medical thesaurus. We analyzed the deficiencies of the existing medical thesauri and reconstructed a new thesaurus, called MEDTHES, which follows the ANSI/NISO Z39.19-2003 standard. MEDTHES also endows the users with fine-grained control of information retrieval by providing functions to calculate the semantic similarity between words. We demonstrate the usage of MEDTHES through an existing data search engine.

  9. A Process Model for Goal-Based Information Retrieval

    Directory of Open Access Journals (Sweden)

    Harvey Hyman

    2014-12-01

    In this paper we examine the domain of information search and propose a "goal-based" approach to study search strategy. We describe "goal-based information search" using a framework of Knowledge Discovery. We identify two Information Retrieval (IR) goals using the constructs of Knowledge Acquisition (KA) and Knowledge Explanation (KE). We classify these constructs into two specific information problems: an exploration-exploitation problem and an implicit-explicit problem. Our proposed framework is an extension of prior work in this domain, applying an IR Process Model originally developed for Legal-IR and adapted to Medical-IR. The approach in this paper is guided by the recent ACM-SIG Medical Information Retrieval (MedIR) Workshop definition: "methodologies and technologies that seek to improve access to medical information archives via a process of information retrieval."

  10. Adaptive multi-agent system for information retrieval

    Science.gov (United States)

    Maleki-dizaji, Saeedeh; Nyongesa, H. O.; Siddiqqi, J.

    2001-10-01

    The current exponential growth of the Internet precipitates a need for improved tools to help people cope with the volume of information available. Existing search engines such as Yahoo, Alta Vista and Excite are efficient in terms of high recall (the percentage of relevant documents that are retrieved from the Internet) and fast response time, at the cost of poor precision (the percentage of retrieved documents that are considered relevant). The problem is due to the lack of filtering, specialisation, relevance feedback, adaptation and exploration. One solution to these problems is to use intelligent agents, which can operate autonomously and improve over time. The agents rely on a user model to improve their performance in retrieving information. This paper presents an adaptive information retrieval (IR) system that learns from user feedback through an evolutionary method, namely genetic algorithms (GA).
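
    The two measures defined in parentheses above can be computed directly from the sets of retrieved and relevant documents. The helper below is a minimal sketch; the function name and the convention of returning 0.0 for an empty set are choices made for this example, and fractions are returned rather than percentages.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) as fractions for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)                        # relevant documents that were retrieved
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

# Hypothetical document ids: four retrieved, three relevant, two in common.
p, r = precision_recall([1, 2, 3, 4], [2, 4, 6])
```

    A system can trade one measure for the other: returning more documents tends to raise recall while lowering precision, which is the imbalance the abstract attributes to general-purpose search engines.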

  11. Research on land information web query service for public

    Science.gov (United States)

    Liang, Dongdong; Li, Lin; Song, Pingchao; Cheng, Yang; Mei, Song; Min, Yuan

    2009-10-01

    With the economy developing rapidly and the internet spreading widely, the public strongly desires access to land information. In particular, the policy "Land registration information available to the public inquiry approach" has been in force since March 1st, 2003, and gives the Land Department guidance on building a public land information web query service. Such a service requires the Land Management Department to provide land registration information, which contains attribute and graphics information. For querying attribute information, precise and fuzzy query methods are commonly used in practical applications. To improve the speed and accuracy of fuzzy query, a Chinese word segmentation method is used; there is no previous example of this method being applied to cadastral information inquiry. For querying the spatial information of land parcels, it is necessary to query attribute information before retrieving the actual graphics information. In the map service, the eagle eye can show which part of the whole cadastral map the specified parcel is located in. However, the display speed of the eagle eye is clearly not as fast as that of the cadastral map. Hence, we implement multi-level query with frame selection on the cadastral map and identify different parcels with different colors, which also accelerates the eagle eye's display and panning speed. The results of our research have been applied to the land information query system of Ningbo. It is hoped that the solutions in this system will help in developing and studying analogous issues.

  12. Associative conceptual space-based information retrieval systems

    NARCIS (Netherlands)

    M.J. Schuemie (Martijn); J.H. van den Berg (Jan)

    1998-01-01

    In this `Information Era' with the availability of large collections of books, articles, journals, CD-ROMs, video films and so on, there exists an increasing need for intelligent information retrieval systems that enable users to find the information desired easily. Many attempts have

  13. Bibliometrics and Information Retrieval - Creating Knowledge through Research Synergies

    NARCIS (Netherlands)

    Bar-Ilan, Judit; Koopman, Rob; Wang, Shenghui; Scharnhorst, Andrea; John, Marcus; Mayr, Philipp; Wolfram, Dietmar

    2016-01-01

    This panel brings together experts in bibliometrics and information retrieval to discuss how each of these two important areas of information science can help to inform the research of the other. There is a growing body of literature that capitalizes on the synergies created by combining

  14. Global polar geospatial information service retrieval based on search engine and ontology reasoning

    Science.gov (United States)

    Chen, Nengcheng; E, Dongcheng; Di, Liping; Gong, Jianya; Chen, Zeqiang

    2007-01-01

    In order to improve the access precision of polar geospatial information services on the web, a new methodology for retrieving global spatial information services based on geospatial service search and ontology reasoning is proposed. The geospatial service search is implemented to find coarse services on the web, and the ontology reasoning is designed to find refined services among the coarse services. The proposed framework includes standardized distributed geospatial web services, a geospatial service search engine, an extended UDDI registry, and a multi-protocol geospatial information service client. Key technologies addressed include service discovery based on a search engine, and service ontology modeling and reasoning in the Antarctic geospatial context. Finally, an Antarctica multi-protocol OWS portal prototype based on the proposed methodology is introduced.

  15. Roogle: an information retrieval engine for clinical data warehouse.

    Science.gov (United States)

    Cuggia, Marc; Garcelon, Nicolas; Campillo-Gimenez, Boris; Bernicot, Thomas; Laurent, Jean-François; Garin, Etienne; Happe, André; Duvauferrier, Régis

    2011-01-01

    A large amount of relevant information is contained in reports stored in electronic patient records and associated metadata. R-oogle is a project aiming at developing information retrieval engines adapted to these reports and designed for clinicians. The system consists of a data warehouse (full-text reports and structured data) imported from two different hospital information systems. Information retrieval is performed using metadata-based semantic and full-text search methods (as in Google). Applications may include biomarker identification in a translational approach, search for specific cases, constitution of cohorts, professional practice evaluation, and quality control assessment.

  16. Locally decodable codes and private information retrieval schemes

    CERN Document Server

    Yekhanin, Sergey

    2010-01-01

    Locally decodable codes (LDCs) are codes that simultaneously provide efficient random access retrieval and high noise resilience by allowing reliable reconstruction of an arbitrary bit of a message by looking at only a small number of randomly chosen codeword bits. Local decodability comes with a certain loss in terms of efficiency - specifically, locally decodable codes require longer codeword lengths than their classical counterparts. Private information retrieval (PIR) schemes are cryptographic protocols designed to safeguard the privacy of database users. They allow clients to retrieve rec
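
    The local decoding idea described in this abstract can be illustrated with the Hadamard code, a textbook two-query locally decodable code. It is chosen here purely as an assumed example and is not taken from the book's text; the function names are hypothetical. The codeword has length 2^k for a k-bit message, which also shows the codeword-length cost the abstract mentions.

```python
import random


def hadamard_encode(msg_bits):
    """Encode k message bits into 2**k bits: position r holds <msg, r> mod 2."""
    k = len(msg_bits)
    codeword = []
    for r in range(2 ** k):
        bit = 0
        for j in range(k):
            if (r >> j) & 1:
                bit ^= msg_bits[j]
        codeword.append(bit)
    return codeword


def local_decode(codeword, k, i, rng):
    """Recover message bit i by reading only two codeword positions.

    For a random r, <msg, r> xor <msg, r xor e_i> equals msg[i].
    """
    r = rng.randrange(2 ** k)
    return codeword[r] ^ codeword[r ^ (1 << i)]


message = [1, 0, 1, 1]
codeword = hadamard_encode(message)
rng = random.Random(0)
decoded = [local_decode(codeword, len(message), i, rng) for i in range(len(message))]
print(decoded == message)
```

    Each of the two positions read is individually uniformly distributed, so if a fraction δ of the codeword is corrupted, a single decoding attempt still returns the correct bit with probability at least 1 - 2δ.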

  17. Information Retrieval and Criticality in Parity-Time-Symmetric Systems

    Science.gov (United States)

    Kawabata, Kohei; Ashida, Yuto; Ueda, Masahito

    2017-11-01

    By investigating information flow between a general parity-time (PT-)symmetric non-Hermitian system and an environment, we find that the complete information retrieval from the environment can be achieved in the PT-unbroken phase, whereas no information can be retrieved in the PT-broken phase. The PT-transition point thus marks the reversible-irreversible criticality of information flow, around which many physical quantities such as the recurrence time and the distinguishability between quantum states exhibit power-law behavior. Moreover, by embedding a PT-symmetric system into a larger Hilbert space so that the entire system obeys unitary dynamics, we reveal that behind the information retrieval lies a hidden entangled partner protected by PT symmetry. Possible experimental situations are also discussed.

  18. Semantic association ranking schemes for information retrieval ...

    Indian Academy of Sciences (India)

    ... relevance, multimedia, information, video, image, answer, text}. Doc 9. {google, search, engine, personalization, information, text, multimedia}. Figure 8. Term association graph on real data with 50 nodes. Table 6. User search interest value table. Session ID. Software. Algorithms. Healthcare. Sports. Movies. Music. S1.

  19. Foundations of Large-Scale Multimedia Information Management and Retrieval

    CERN Document Server

    Chang, Edward Y

    2011-01-01

    "Foundations of Large-Scale Multimedia Information Management and Retrieval - Mathematics of Perception" covers knowledge representation and semantic analysis of multimedia data and scalability in signal extraction, data mining, and indexing. The book is divided into two parts: Part I - Knowledge Representation and Semantic Analysis focuses on the key components of mathematics of perception as it applies to data management and retrieval. These include feature selection/reduction, knowledge representation, semantic analysis, distance function formulation for measuring similarity, and

  20. Interdisciplinary perspectives on abstracts for information retrieval

    Directory of Open Access Journals (Sweden)

    Soon Keng Chan

    2004-10-01

    Full Text Available The paper examines the abstract genre from the perspectives of English for Specific Purposes (ESP) practitioners and information professionals. It aims to determine specific interdisciplinary interests in the abstract, and to explore areas of collaboration in terms of research and pedagogical practices. A focus group (FG) comprising information professionals from the Division of Information Studies, Nanyang Technological University, Singapore, convened for a discussion on the subject of abstracts and abstracting. Two major issues that have significant implications for ESP practices emerged during the discussion. While differences in terms of approach to and objectives of the abstract genre are apparent between information professionals and language professionals, the demands for specific cognitive processes involved in abstracting proved to be similar. This area of similarity provides grounds for awareness raising and collaboration between the two disciplines. While ESP practitioners need to consider adding the dimension of information science to the rhetorical and linguistic scaffolding that they have been providing to novice writers, information professionals can contribute useful insights about the qualities of abstracts that have the greatest impact in meeting end-users' needs in information search.

  1. User's perspective: Information retrieval and usability

    Directory of Open Access Journals (Sweden)

    Salvador Zambrano Silva

    2008-02-01

    Full Text Available The aim is to share some ideas for improving the online database of "Defensor del Pueblo Andaluz", starting from a user study and a bibliographic analysis. Our intention is to create an interface that makes interaction much easier and works as a bridge between the document's information structure and the user's knowledge structure, with the sole purpose of improving the user's level of satisfaction with information search results.

  2. Music information retrieval in compressed audio files: a survey

    Science.gov (United States)

    Zampoglou, Markos; Malamos, Athanasios G.

    2014-07-01

    In this paper, we present an organized survey of the existing literature on music information retrieval systems in which descriptor features are extracted directly from the compressed audio files, without prior decompression to pulse-code modulation format. Avoiding the decompression step and utilizing the readily available compressed-domain information can significantly lighten the computational cost of a music information retrieval system, allowing application to large-scale music databases. We identify a number of systems relying on compressed-domain information and form a systematic classification of the features they extract, the retrieval tasks they tackle and the degree to which they achieve an actual increase in overall speed, as well as any resulting loss in accuracy. Finally, we discuss recent developments in the field, and the potential research directions they open toward ultra-fast, scalable systems.

  3. User centered and ontology based information retrieval system for life sciences

    Directory of Open Access Journals (Sweden)

    Sy Mohameth-François

    2012-01-01

    Full Text Available Background: Because of the increasing number of electronic resources, designing efficient tools to retrieve and exploit them is a major challenge. Some improvements have been offered by semantic Web technologies and applications based on domain ontologies. In life science, for instance, the Gene Ontology is widely exploited in genomic applications, and the Medical Subject Headings are the basis of the indexing of biomedical publications and of the information retrieval process proposed by PubMed. However, current search engines suffer from two main drawbacks: there is limited user interaction with the list of retrieved resources, and no explanation for their adequacy to the query is provided. Users may thus be confused by the selection and have no idea how to adapt their queries so that the results match their expectations. Results: This paper describes an information retrieval system that relies on a domain ontology to widen the set of relevant documents that is retrieved and that uses a graphical rendering of query results to favor user interactions. Semantic proximities between ontology concepts and aggregating models are used to assess document adequacy with respect to a query. The selection of documents is displayed in a semantic map to provide graphical indications that make explicit to what extent they match the user's query; this man/machine interface favors a more interactive and iterative exploration of the data corpus, by facilitating query concept weighting and visual explanation. We illustrate the benefit of using this information retrieval system on two case studies, one of which aims at collecting human genes related to transcription factors involved in the hemopoiesis pathway. Conclusions: The ontology based information retrieval system described in this paper (OBIRS) is freely available at: http://www.ontotoolkit.mines-ales.fr/ObirsClient/. This environment is a first step towards a user centred application in which the system enlightens

  4. Information Retrieval Using Hadoop Big Data Analysis

    Science.gov (United States)

    Motwani, Deepak; Madan, Madan Lal

    This paper concerns big data analysis, the process of examining huge amounts of information in an attempt to uncover hidden patterns. Through big data analytics applications, public and private sector organizations have made a strategic decision to turn big data into competitive advantage. The primary task of extracting value from big data gives rise to a process for pulling information from multiple different sources, known as extract, transform and load (ETL). The approach in this paper extracts information from log files and research papers, which reduces the effort needed for pattern finding and for summarizing documents from several sources. The work helps in better understanding basic Hadoop concepts and improves the user experience for research. We propose an approach for analyzing log files with Hadoop to find concise information that is useful and time saving. The proposed approach will be applied to different research papers in a specific domain to obtain summarized content for further improvement and for creating new content.
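
    The kind of log-file analysis this abstract describes is typically expressed in Hadoop as a map step and a reduce step. The sketch below imitates that pattern in plain Python; it does not use the Hadoop API, and the log line format ("date time LEVEL message") and all function names are assumptions made only for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Emit (log_level, 1) for each well-formed line; skip malformed ones."""
    parts = line.split()
    if len(parts) >= 3:
        yield parts[2], 1  # assumed format: date time LEVEL message


def shuffle(pairs):
    """Group emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Sum the counts collected for one key."""
    return key, sum(values)


def run_job(lines):
    pairs = [kv for line in lines for kv in map_phase(line)]
    return dict(reduce_phase(k, vs) for k, vs in shuffle(pairs).items())


# Hypothetical log excerpt.
sample_log = [
    "2024-01-01 10:00:01 INFO job started",
    "2024-01-01 10:00:02 WARN slow disk",
    "2024-01-01 10:00:03 INFO job finished",
    "malformed",
]
counts = run_job(sample_log)
print(counts)
```

    On a real cluster the map and reduce functions would run in parallel over file splits; the sequential version only shows how the per-line work and the aggregation are separated.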

  5. Acquisition and retrieval of ophthalmology academic information

    Directory of Open Access Journals (Sweden)

    Lei Li

    2014-06-01

    Full Text Available This article discusses how to search and access ophthalmology information based on specialized websites and resources by introducing databases, search engines, electronic journals, electronic books and so on. It is hoped that this will help ophthalmic practitioners to carry out scientific research and clinical practice.

  6. Dutch Speech Recognition in Multimedia Information Retrieval

    NARCIS (Netherlands)

    Ordelman, Roeland J.F.; Ordelman, Roeland Jacobus Frederik

    2003-01-01

    As data storage capacities grow to nearly unlimited sizes thanks to ever ongoing hardware and software improvements, an increasing amount of information is being stored in multimedia and spoken-word collections. Assuming that the intention of data storage is to use (portions of) it some later time,

  7. Learning to merge search results for efficient Distributed Information Retrieval

    NARCIS (Netherlands)

    Tjin-Kam-Jet, Kien; Hiemstra, Djoerd

    2010-01-01

    Merging search results from different servers is a major problem in Distributed Information Retrieval. We used Regression-SVM and Ranking-SVM which would learn a function that merges results based on information that is readily available: i.e. the ranks, titles, summaries and URLs contained in the

  8. LOGISTIC MANAGEMENT INFORMATION SYSTEM - MANUAL DATA STORAGE AND RETRIEVAL SYSTEM.

    Science.gov (United States)

    Logistics Management Information System . The procedures are applicable to manual storage and retrieval of all data used in the Logistics Management ... Information System (LMIS) and include the following: (1) Action Officer data source file. (2) Action Officer presentation format file. (3) LMI Coordination

  9. Level Search Schemes for Information Filtering and Retrieval.

    Science.gov (United States)

    Zhang, Xiaoyan; Berry, Michael W.; Raghavan, Padma

    2001-01-01

    Discusses latent semantic indexing (LSI); considers the high cost associated with the singular value decomposition (SVD) of the large term-by-document matrix that becomes a barrier for its application to scalable information retrieval; and shows that information filtering using level search techniques can reduce the SVD computation cost for LSI.…
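
    For readers unfamiliar with the baseline that this record refers to, the following is a minimal sketch of latent semantic indexing using a truncated SVD of a small term-by-document matrix. It shows plain LSI only, not the level search technique of the article; the toy vocabulary, the matrix, and the function names are assumptions made for illustration.

```python
import numpy as np

# Toy term-by-document matrix (rows: terms, columns: documents d0..d3).
terms = ["car", "auto", "engine", "flower", "petal"]
A = np.array([
    [1, 0, 1, 0],  # car
    [0, 1, 1, 0],  # auto
    [1, 1, 0, 0],  # engine
    [0, 0, 0, 1],  # flower
    [0, 0, 0, 1],  # petal
], dtype=float)


def lsi_fit(matrix, k):
    """Keep the k largest singular triplets of the term-by-document matrix."""
    U, s, Vt = np.linalg.svd(matrix, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


def fold_in_query(q, U_k, s_k):
    """Project a term-space query into the k-dimensional latent space."""
    return (q @ U_k) / s_k


def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


U_k, s_k, Vt_k = lsi_fit(A, k=2)
doc_vecs = Vt_k.T  # one row per document in latent space

query = np.array([1, 0, 0, 0, 0], dtype=float)  # the single term "car"
q_hat = fold_in_query(query, U_k, s_k)
scores = [cosine(q_hat, d) for d in doc_vecs]
print([round(s, 3) for s in scores])
```

    Document d1 contains "auto" and "engine" but not "car", yet it scores as highly as the documents that do contain the query term, because all three load on the same latent dimension. The full SVD computed here is the step whose cost grows with collection size, which is the barrier the abstract mentions.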

  10. MIRANDA - Music Information Retrieval And Data Acquisition

    DEFF Research Database (Denmark)

    Lehn-Schiøler, Tue; Petersen, Kaare Brandt; Hansen, Lars Kai

    2006-01-01

    In this report we present a music data harvesting system based on a plug-in for a popular music player. When a user is playing a song using the plug-in, information about the song is anonymously submitted to a server. The data gathered using MIRANDA is intended to be released to the MIR community. We argue that even though content-based data is of interest to the community, meta data and usage data can also be important for research in music similarity.

  11. A web service for enabling medical image retrieval integrated into a social medical image sharing platform.

    Science.gov (United States)

    Niinimäki, Marko; Zhou, Xin; de la Vega, Enrique; Cabrer, Miguel; Müller, Henning

    2010-01-01

    Content-based visual image access is moving from a research domain towards real applications. So far, most image retrieval applications have been in one specialized domain, such as lung CTs as a diagnosis aid, or for classification of general images based on anatomic region, modality, and view. This article describes the use of a content-based image retrieval system in connection with the medical image sharing platform MEDTING, which provides a data set with very large variety. Similarity retrieval is possible for all cases of the social image sharing platform, so cases can be linked by either visual similarity or similarity in keywords. The visual retrieval search is based on the GIFT (GNU Image Finding Tool). The technology for updating the index with new images added by users employs RSS (Really Simple Syndication) feeds. The ARC (Advanced Resource Connector) middleware is used for the implementation of a web service for similarity retrieval, simplifying the integration of this service. The novelty of this article is the application/integration and the image updating strategy. The retrieval methods themselves employ existing techniques that are all open source and can easily be reproduced.

  12. Distributed Systems and Applications of Information Filtering and Retrieval

    CERN Document Server

    Giuliani, Alessandro; Semeraro, Giovanni; DART 2012

    2014-01-01

    This volume focuses on new challenges in distributed Information Filtering and Retrieval. It collects invited chapters and extended research contributions from the special session on Information Filtering and Retrieval: Novel Distributed Systems and Applications (DART) of the 4th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2012), held in Barcelona, Spain, on 4-7 October 2012. The main focus of DART was to discuss and compare suitable novel solutions based on intelligent techniques and applied to real-world applications. The chapters of this book present a comprehensive review of related works and the state of the art. Authors, both practitioners and researchers, shared their results on several topics such as "Multi-Agent Systems", "Natural Language Processing", "Automatic Advertisement", "Customer Interaction Analytics" and "Opinion Mining". Contributions have been carefully reviewed by experts in the area, who also gave useful suggestions to improve the quality of the volume.

  13. New approach to information retrieval problems in separations science

    Energy Technology Data Exchange (ETDEWEB)

    McDowell, W.J.; Corey, B.B.

    1984-01-01

    Retrieving information on specific chemical separations is among the most difficult problems in information management, although the ability to find methods to cleanly and efficiently separate chemical species is of the utmost importance to chemists and chemical engineers. Information on performing specific chemical separations is largely buried in the literature of dozens of branches of science. Most methods of indexing (both hard copy and computer) do not provide good means of retrieving information on specific separations because index terms such as extraction, leaching, chromatography, and even ion exchange have different meanings in different disciplines. Recent attempts to solve some of the problems of information retrieval in separations science have resulted in the concept of a Separations Science Data Base. This data base is designed for chemical separations information and contains unique indexes that allow rapid and accurate retrieval of information about specific separations from specific matrices, and a method of minimizing false returns that result from cross coupling of unrelated terms in multisubject reports. Although the data base is presently only about 20% complete, the success of this work has been encouraging and further work is indicated.

  14. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Shozo Makino

    2007-01-01

    Full Text Available Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.
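
    The abstract states that a finite state automaton serves as the recognition grammar but does not describe how it is built. As a rough illustration of the general idea only, the sketch below builds a trie-shaped deterministic automaton over words from a set of allowed phrases and checks whether a word sequence is accepted. The phrases, function names, and construction are all hypothetical and are not the authors' method.

```python
def build_fsa(phrases):
    """Build a deterministic word-level automaton; state 0 is the start state."""
    transitions = {}  # (state, word) -> next state
    accepting = set()
    next_state = 1
    for phrase in phrases:
        state = 0
        for word in phrase.split():
            key = (state, word)
            if key not in transitions:
                transitions[key] = next_state
                next_state += 1
            state = transitions[key]
        accepting.add(state)  # the end of each phrase is an accepting state
    return transitions, accepting


def accepts(fsa, words):
    """Return True if the automaton consumes every word and ends in an accepting state."""
    transitions, accepting = fsa
    state = 0
    for word in words:
        state = transitions.get((state, word))
        if state is None:
            return False
    return state in accepting


# Hypothetical five-word queries taken from a nursery rhyme.
fsa = build_fsa(["row row row your boat", "gently down the stream merrily"])
print(accepts(fsa, "row row row your boat".split()))
print(accepts(fsa, "row your boat".split()))
```

    In a speech recognizer such a grammar restricts the word sequences the decoder may hypothesize, which is the role the abstract assigns to the FSA.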

  15. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Suzuki Motoyuki

    2007-01-01

    Full Text Available Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  16. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Science.gov (United States)

    Suzuki, Motoyuki; Hosoya, Toru; Ito, Akinori; Makino, Shozo

    2006-12-01

    Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  17. Utilizing Mind-Maps for Information Retrieval and User Modelling

    OpenAIRE

    Beel, Joeran; Langer, Stefan; Genzmehr, Marcel; Gipp, Bela

    2014-01-01

    Mind-maps have been widely neglected by the information retrieval (IR) community. However, there are an estimated two million active mind-map users, who create 5 million mind-maps every year, of which a total of 300,000 is publicly available. We believe this to be a rich source for information retrieval applications, and present eight ideas on how mind-maps could be utilized by them. For instance, mind-maps could be utilized to generate user models for recommender systems or expert search, or...

  18. Experiences with automated categorization in e-government information retrieval

    DEFF Research Database (Denmark)

    Jonasen, Tanja Svarre; Lykke, Marianne

    2014-01-01

    High-precision search results are essential for supporting e-government employees’ information tasks. Prior studies have shown that existing features of e-government retrieval systems need improvement in terms of search facilities (e.g., Goh et al. 2008), navigation (e.g., de Jong and Lentz 2006......) and metadata (e.g., Kopackova, Michalek and Cejna 2010). This paper investigates how automated categorization can enhance information organization and retrieval, and presents the results of a realistic evaluation that compared automated categorization with free text indexing of the government intranet used...... documents were retrieved. The findings emphasise the importance of simultaneous search options for e-government IR systems, and reveal that automated categorization is valuable in improving search facilities in e-government....

  19. Internet Web Communication Technology (WCT) and Information ...

    African Journals Online (AJOL)

    Internet Web Communication Technology (WCT) and Information Communication Technology (ICT) Development and Use for Veterinary Medicine Education in Nigeria ... Veterinary Medicine Electronic Journals such as Access to Global Online Research in Agriculture (AGORA), African Journal Online (AJOL), and Health ...

  20. Construction of a bibliographic information database and development of retrieval system for research reports in nuclear science and technology (II)

    Energy Technology Data Exchange (ETDEWEB)

    Han, Duk Haeng; Kim, Tae Whan; Choi, Kwang; Yoo, An Na; Keum, Jong Yong; Kim, In Kwon [Korea Atomic Energy Research Institute, Taejon (Korea, Republic of)

    1996-05-01

    The major goal of this project is to construct a bibliographic information database in nuclear engineering and to develop a prototype retrieval system. To give easy access to microfiche research reports, this project has accomplished the construction of a microfiche research reports database and the development of a retrieval system. The results of the project are as follows: 1. A microfiche research reports database was constructed by downloading from DOE Energy, NTIS and INIS. 2. The retrieval system was developed in host and web versions using access points such as title, abstract, keyword and report number. 6 tabs., 8 figs., 11 refs. (Author)

  1. Hybrid Ontology for Semantic Information Retrieval Model Using Keyword Matching Indexing System

    Directory of Open Access Journals (Sweden)

    K. R. Uthayan

    2015-01-01

    Full Text Available Ontology is the process of developing and elucidating the concepts of an information domain shared by a group of users. Introducing ontology into information retrieval is a common way to improve searches for the relevant information users require. Matching keywords against historical or domain information is significant in recent computations for finding the best match for specific input queries. This research presents a better querying mechanism for information retrieval which integrates ontology queries with keyword search. The ontology-based query is translated into a first-order predicate logic query, which is used for routing the query to the appropriate servers. Matching algorithms are an active area of research in computer science and artificial intelligence. In text matching, it is more dependable to study the semantic model and the query under conditions of semantic matching. This research develops semantic matching results between input queries and information in the ontology field. The contributed algorithm is a hybrid method based on matching extracted instances from the queries and the information field. The queries and information domain are focused on semantic matching, to discover the best match and to improve the execution process. In conclusion, the hybrid ontology in the semantic web is sufficient to retrieve the documents when compared to standard ontology.

  2. Hybrid ontology for semantic information retrieval model using keyword matching indexing system.

    Science.gov (United States)

    Uthayan, K R; Mala, G S Anandha

    2015-01-01

    Ontology is the process of developing and elucidating the concepts of an information domain shared by a group of users. Introducing ontology into information retrieval is a common way to improve searches for the relevant information users require. Matching keywords against historical or domain information is significant in recent computations for finding the best match for specific input queries. This research presents a better querying mechanism for information retrieval which integrates ontology queries with keyword search. The ontology-based query is translated into a first-order predicate logic query, which is used for routing the query to the appropriate servers. Matching algorithms are an active area of research in computer science and artificial intelligence. In text matching, it is more dependable to study the semantic model and the query under conditions of semantic matching. This research develops semantic matching results between input queries and information in the ontology field. The contributed algorithm is a hybrid method based on matching extracted instances from the queries and the information field. The queries and information domain are focused on semantic matching, to discover the best match and to improve the execution process. In conclusion, the hybrid ontology in the semantic web is sufficient to retrieve the documents when compared to standard ontology.

  3. Interdisciplinarity and Computer Music Modeling and Information Retrieval

    DEFF Research Database (Denmark)

    Grund, Cynthia M.

    2006-01-01

    Abstract This paper takes a look at computer music modeling and information retrieval (CMMIR) from the point of view of the humanities with emphasis upon areas relevant to the philosophy of music. The desire for more interdisciplinary research involving CMMIR and the humanities is expressed...

  4. Disambiguation strategies for cross-language information retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; de Jong, Franciska M.G.

    1999-01-01

    This paper gives an overview of tools and methods for Cross-Language Information Retrieval (CLIR) that are developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue

  5. Scientometrics and information retrieval: weak-links revitalized

    NARCIS (Netherlands)

    Mayr, Philipp; Scharnhorst, Andrea

    This special issue brings together eight papers from experts of communities which often have been perceived as different ones: bibliometrics, scientometrics and informetrics on the one side and information retrieval on the other. The idea of this special issue started at the workshop “Combining

  6. Ask Alice: an Artificial Retrieval of Information Agent

    NARCIS (Netherlands)

    Valstar, M.; Baur, T.; Cafaro, A.; Ghitulescu, A.; Potard, B.; Wagner, J.; Andre, E.; Durieu, L.; Aylett, M.; Dermouche, P.; Pelachaud, C.; Coutinho, E.; Schuller, B.; Zhang, Yue; Heylen, Dirk K.J.; Theune, Mariet; van Waterschoot, Jelte Barachia

    2016-01-01

    We present a demonstration of the ARIA framework, a modular approach for rapid development of virtual humans for information retrieval that have linguistic, emotional, and social skills and a strong personality. We demonstrate the capabilities of our framework in a scenario where a popular book from

  7. SLIMMER--A UNIX System-Based Information Retrieval System.

    Science.gov (United States)

    Waldstein, Robert K.

    1988-01-01

    Describes an information retrieval system developed at Bell Laboratories to create and maintain a variety of different but interrelated databases, and to provide controlled access to these databases. The components discussed include the interfaces, indexing rules, display languages, response time, and updating procedures of the system. (6 notes…

  8. A cross-lingual framework for monolingual biomedical information retrieval

    NARCIS (Netherlands)

    Trieschnigg, D.; Hiemstra, D.; Jong, F. de; Kraaij, W.

    2010-01-01

    An important challenge for biomedical information retrieval (IR) is dealing with the complex, inconsistent and ambiguous biomedical terminology. Frequently, a concept-based representation defined in terms of a domain-specific terminological resource is employed to deal with this challenge. In this

  9. Status report on SIRS: sorption information retrieval system

    Energy Technology Data Exchange (ETDEWEB)

    Hostetler, D.D.; Serne, R.J.; Baldwin, A.J.; Petrie, G.M.

    1980-11-01

    Two major uses were identified for the Sorption Information Retrieval System: (1) to aid geochemists in the elucidation of sorption mechanisms; and (2) to aid safety assessment modelers in selection of Kds for any given scenario. Other benefits such as providing an auditable vehicle for the Kd selection were also discussed.

  10. Professional assistance to users of information retrieval tools at the ...

    African Journals Online (AJOL)

    The study investigated the need for professional assistance to users of information retrieval tools at the National Library of Nigeria, Enugu branch. A total of 38 (thirty-eight) users of the library were randomly selected and used for the study. It was found that most of the respondents 18(47.3%) consulted the card catalogue ...

  11. Support Vector Machines: Relevance Feedback and Information Retrieval.

    Science.gov (United States)

    Drucker, Harris; Shahrary, Behzad; Gibbon, David C.

    2002-01-01

    Compares support vector machines (SVMs) to Rocchio, Ide regular and Ide dec-hi algorithms in information retrieval (IR) of text documents using relevancy feedback. If the preliminary search is so poor that one has to search through many documents to find at least one relevant document, then SVM is preferred. Includes nine tables. (Contains 24…
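
The Rocchio baseline named in this abstract can be illustrated with a minimal sketch. It assumes documents and queries are already equal-length term-weight vectors; the function name `rocchio` and the default weights `alpha`, `beta` and `gamma` are illustrative choices, not values taken from the paper, and the Ide variants and the SVM approach are not shown.

```python
def rocchio(query, relevant, non_relevant, alpha=1.0, beta=0.75, gamma=0.15):
    """Return a Rocchio-updated query vector.

    query        -- list of term weights for the original query
    relevant     -- list of vectors the user judged relevant
    non_relevant -- list of vectors the user judged non-relevant
    The weights alpha/beta/gamma are illustrative defaults (an assumption).
    """
    dims = len(query)

    def centroid(docs):
        # Mean vector of a set of documents; all zeros if the set is empty.
        if not docs:
            return [0.0] * dims
        return [sum(d[i] for d in docs) / len(docs) for i in range(dims)]

    rel_c = centroid(relevant)
    non_c = centroid(non_relevant)
    # Move the query toward the relevant centroid and away from the other one.
    updated = [alpha * query[i] + beta * rel_c[i] - gamma * non_c[i]
               for i in range(dims)]
    # Negative term weights are commonly clipped to zero.
    return [max(0.0, w) for w in updated]
```

With one round of feedback, terms that occur only in relevant documents gain weight, while terms that occur only in non-relevant documents are suppressed.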

  12. Creating an Information Retrieval test corpus for Dutch

    NARCIS (Netherlands)

    Hiemstra, Djoerd; van Leeuwen, D.A.; Theune, M.; Theune, Mariet; Nijholt, Antinus; Nijholt, A.; Hondorp, G.H.W.; Hondorp, H.

    2002-01-01

    This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual test corpus, and give an overview of the experimental results of

  13. Why Information Retrieval Needs Cognitive Science: A call to arms

    NARCIS (Netherlands)

    Hoenkamp, E.C.M.

    2005-01-01

    Much of today’s success in Information Retrieval (IR) comes from a hard approach: employing blazingly fast machines, ever more refined statistics, and increasingly powerful classification schemes. In recent years, however, the hard approach has entered a phase of diminishing returns. This paper

  14. Design of an indigenous music information storage and retrieval ...

    African Journals Online (AJOL)

    MOI) and the Music Library of the Cultural Affairs (ML-CA) of Eritrea. The main aim of the study was to design an appropriate Indigenous Music Information Storage and Retrieval System for Eritrea. A quantitative approach was mainly used to ...

  15. Bibliometric-enhanced Information Retrieval : 2nd International BIR Workshop

    NARCIS (Netherlands)

    Mayr, Philipp; Frommholz, Ingo; Scharnhorst, Andrea; Mutschke, Peter

    2015-01-01

    This workshop brings together experts of communities which often have been perceived as different ones: bibliometrics / scientometrics / informetrics on the one side and information retrieval on the other. Our motivation as organizers of the workshop started from the observation that main discourses

  16. A Survey of Query Auto Completion in Information Retrieval

    NARCIS (Netherlands)

    Cai, F.; de Rijke, M.

    2016-01-01

    In information retrieval, query auto completion (QAC), also known as type-ahead [Xiao et al., 2013, Cai et al., 2014b] and auto-complete suggestion [Jain and Mishne, 2010], refers to the following functionality: given a prefix consisting of a number of characters entered into a search box, the user
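
The prefix-to-suggestions functionality described here can be sketched as follows. This is a minimal illustration that ranks candidates by past query frequency, which is only one possible ranking signal; the function name `complete` and the `query_log_counts` mapping are hypothetical and do not come from the survey.

```python
from bisect import bisect_left


def complete(prefix, query_log_counts, k=3):
    """Return up to k past queries that start with prefix, most frequent first.

    query_log_counts -- dict mapping a past query string to how often it was issued
    """
    candidates = sorted(query_log_counts)
    # Binary search for the first candidate that is >= prefix.
    start = bisect_left(candidates, prefix)
    matches = []
    for query in candidates[start:]:
        if not query.startswith(prefix):
            break  # sorted order: no later candidate can share the prefix
        matches.append(query)
    # Rank by descending frequency, breaking ties alphabetically.
    matches.sort(key=lambda q: (-query_log_counts[q], q))
    return matches[:k]
```

A production system would keep the candidates in a trie or a pre-sorted index rather than sorting on every call; the sort is kept here only to keep the sketch self-contained.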

  17. Ask Alice: an Artificial Retrieval of Information Agent

    NARCIS (Netherlands)

    Valstar, M.; Baur, T.; Cafaro, A.; Ghitulescu, A.; Potard, B.; Wagner, J.; Andre, E.; Durieu, L.; Aylett, M.; Dermouche, P.; Pelachaud, C.; Coutinho, E.; Schuller, B.; Zhang, Yue; Heylen, Dirk K.J.; Theune, Mariet; van Waterschoot, Jelte Barachia

    We present a demonstration of the ARIA framework, a modular approach for rapid development of virtual humans for information retrieval that have linguistic, emotional, and social skills and a strong personality. We demonstrate the capabilities of our framework in a scenario where a popular book from

  18. Conventional and Knowledge-Based Information Retrieval with Prolog.

    Science.gov (United States)

    Leigh, William; Paz, Noemi

    1988-01-01

    Describes the use of PROLOG to program knowledge-based information retrieval systems, in which the knowledge contained in a document is translated into machine processable logic. Several examples of the resulting search process, and the program rules supporting the process, are given. (10 references) (CLB)

  19. Proof of Concept: Concept-based Biomedical Information Retrieval

    NARCIS (Netherlands)

    Trieschnigg, Rudolf Berend

    2010-01-01

    In this thesis we investigate the possibility to integrate domain-specific knowledge into biomedical information retrieval (IR). Recent decades have shown a fast growing interest in biomedical research, reflected by an exponential growth in scientific literature. Biomedical IR is concerned with the

  20. Millennial Generation Students Search the Web Erratically, with Minimal Evaluation of Information Quality. A Review of: Taylor, A. (2012). A study of the information search behaviour of the millennial generation. Information Research, 17(1), paper 508. Retrieved from http://informationr.net/ir/17-1/paper508.html

    Directory of Open Access Journals (Sweden)

    Dominique Daniel

    2013-03-01

    Objective – To identify how millennial generation students proceed through the information search process and select resources on the web; to determine whether students evaluate the quality of web resources and how they use general information websites. Design – Longitudinal study. Setting – University in the United States. Subjects – 80 undergraduate students of the millennial generation enrolled in a business course. Methods – The students were required to complete a research report with a bibliography in five weeks. They also had to turn in interim assignments during that period (including an abstract, an outline, and a rough draft). Their search behaviour was monitored using a modified Yahoo search engine that allowed subjects to search, and then to fill out surveys integrated directly below their search results. The students were asked to indicate the relevance of the resources they found on the open web, to identify the criteria they used to evaluate relevance, and to specify the stage they were at in the search process. They could choose from five stages defined by the author, based on Wilson (1999): initiation, exploration, differentiation, extracting, and verifying. Data were collected using anonymous user IDs and included URLs for sources selected along with subject answers until completion of all assignments. The students provided 758 distinct web page evaluations. Main Results – Students did not progress in an orderly fashion through the search process, but rather proceeded erratically. A substantial number reported being in fewer than four of the five search stages. Only a small percentage ever declared being in the final stage of verifying previously gathered information, and during preparation of the final report a majority still declared being in the extracting stage. In fact, participants selected documents (extracting stage) throughout the process. In addition, students were not much concerned with the quality, validity, or

  1. Web wisdom how to evaluate and create information quality on the Web

    CERN Document Server

    Alexander, Janet E

    1999-01-01

    Web Wisdom is an essential reference for anyone needing to evaluate or establish information quality on the World Wide Web. The book includes easy to use checklists for step-by-step quality evaluations of virtually any Web page. The checklists can also be used by Web authors to help them ensure quality information on their pages. In addition, Web Wisdom addresses other important issues, such as understanding the ways that advertising and sponsorship may affect the quality of Web information. It features: * a detailed discussion of the items involved in evaluating Web information; * checklists

  2. Seeking health information on the web: positive hypothesis testing.

    Science.gov (United States)

    Kayhan, Varol Onur

    2013-04-01

    The goal of this study is to investigate positive hypothesis testing among consumers of health information when they search the Web. After demonstrating the extent of positive hypothesis testing using Experiment 1, we conduct Experiment 2 to test the effectiveness of two debiasing techniques. A total of 60 undergraduate students searched a tightly controlled online database developed by the authors to test the validity of a hypothesis. The database had four abstracts that confirmed the hypothesis and three abstracts that disconfirmed it. Findings of Experiment 1 showed that the majority of participants (85%) exhibited positive hypothesis testing. In Experiment 2, we found that the recommendation technique was not effective in reducing positive hypothesis testing since none of the participants assigned to this server could retrieve disconfirming evidence. Experiment 2 also showed that the incorporation technique successfully reduced positive hypothesis testing since 75% of the participants could retrieve disconfirming evidence. Positive hypothesis testing on the Web is an understudied topic. More studies are needed to validate the effectiveness of the debiasing techniques discussed in this study and develop new techniques. Search engine developers should consider developing new options for users so that both confirming and disconfirming evidence can be presented in search results as users test hypotheses using search engines.

  3. Representation and alignment of sung queries for music information retrieval

    Science.gov (United States)

    Adams, Norman H.; Wakefield, Gregory H.

    2005-09-01

    The pursuit of robust and rapid query-by-humming systems, which search melodic databases using sung queries, is a common theme in music information retrieval. The retrieval aspect of this database problem has received considerable attention, whereas the front-end processing of sung queries and the data structure to represent melodies has been based on musical intuition and historical momentum. The present work explores three time series representations for sung queries: a sequence of notes, a "smooth" pitch contour, and a sequence of pitch histograms. The performance of the three representations is compared using a collection of naturally sung queries. It is found that the most robust performance is achieved by the representation with highest dimension, the smooth pitch contour, but that this representation presents a formidable computational burden. For all three representations, it is necessary to align the query and target in order to achieve robust performance. The computational cost of the alignment is quadratic, hence it is necessary to keep the dimension small for rapid retrieval. Accordingly, iterative deepening is employed to achieve both robust performance and rapid retrieval. Finally, the conventional iterative framework is expanded to adapt the alignment constraints based on previous iterations, further expediting retrieval without degrading performance.
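
The quadratic alignment cost mentioned in this abstract can be illustrated with dynamic time warping (DTW). The abstract does not name its alignment method, so DTW is an assumption here, as are the function name `dtw_distance` and the use of MIDI-style pitch numbers as the time series.

```python
def dtw_distance(query, target):
    """Dynamic time warping distance between two pitch sequences.

    query, target -- lists of numbers (for example MIDI pitch values; an assumption)
    Fills an (n+1) x (m+1) table, so time and memory grow as O(n * m).
    """
    n, m = len(query), len(target)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(query[i - 1] - target[j - 1])
            # Extend the cheapest of: step in query, step in target, or both.
            cost[i][j] = local + min(cost[i - 1][j],
                                     cost[i][j - 1],
                                     cost[i - 1][j - 1])
    return cost[n][m]
```

Because the table has one cell per (query element, target element) pair, halving the length of both sequences cuts the work to roughly a quarter, which is why a lower-dimensional representation retrieves faster.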

  4. Icon Based Information Retrieval and Disease Identification in Agriculture

    OpenAIRE

    Mittal, Namita; Agarwal, Basant; Gupta, Ajay; Madhur, Hemant

    2014-01-01

    Recent developments in the ICT industry in the past few decades have enabled quick and easy access to the information available on the internet. But digital literacy is the pre-requisite for its use. The main purpose of this paper is to provide an interface for digitally illiterate users, especially farmers, to efficiently and effectively retrieve information through the Internet. In addition, to enable the farmers to identify the disease in their crop, its cause and symptoms using digital image p...

  5. Lower-Cost epsilon-Private Information Retrieval

    OpenAIRE

    Toledo, Raphael R.; Danezis, George; Goldberg, Ian

    2016-01-01

    Private Information Retrieval (PIR), despite being well studied, is computationally costly and arduous to scale. We explore lower-cost relaxations of information-theoretic PIR, based on dummy queries, sparse vectors, and compositions with an anonymity system. We prove the security of each scheme using a flexible differentially private definition for private queries that can capture notions of imperfect privacy. We show that basic schemes are weak, but some of them can be made arbitrarily safe...

  6. A Semantic Enhanced Model for Effective Spatial Information Retrieval : Un modèle sémantique améliorée for Effective Information Retrieval spatiale

    OpenAIRE

    Akanbi, Adeyinka; Agunbiade, Olusanya; Dehinbo, Olumuyiwa; Kuti, Sadiq

    2014-01-01

    A lot of information on the web is geographically referenced. Discovering and retrieving this geographic information to satisfy various users' needs across both open and distributed Spatial Data Infrastructures (SDI) poses eminent research challenges. This is mostly caused by semantic heterogeneity in users' queries and lack of semantic referencing of the Geographic Information (GI) metadata. To address these challenges, this paper discusses an ontology-based ...

  7. Iterative Filtering of Retrieved Information to Increase Relevance

    Directory of Open Access Journals (Sweden)

    Robert Zeidman

    2007-12-01

    Efforts have been underway for years to find more effective ways to retrieve information from large knowledge domains. This effort is now being driven particularly by the Internet and the vast amount of information that is available to unsophisticated users. In the early days of the Internet, some effort involved allowing users to enter Boolean equations of search terms into search engines, for example, rather than just a list of keywords. More recently, effort has focused on understanding a user's desires from past search histories in order to narrow searches. Also there has been much effort to improve the ranking of results based on some measure of relevancy. This paper discusses using iterative filtering of retrieved information to focus in on useful information. This work was done for finding source code correlation and the author extends his findings to Internet searching and e-commerce. The paper presents specific information about a particular filtering application and then generalizes it to other forms of information retrieval.

  8. Use of information-retrieval languages in automated retrieval of experimental data from long-term storage

    Science.gov (United States)

    Khovanskiy, Y. D.; Kremneva, N. I.

    1975-01-01

    Problems and methods are discussed of automating information retrieval operations in a data bank used for long term storage and retrieval of data from scientific experiments. Existing information retrieval languages are analyzed along with those being developed. The results of studies discussing the application of the descriptive 'Kristall' language used in the 'ASIOR' automated information retrieval system are presented. The development and use of a specialized language of the classification-descriptive type, using universal decimal classification indices as the main descriptors, is described.

  9. The Use of Web Search Engines in Information Science Research.

    Science.gov (United States)

    Bar-Ilan, Judit

    2004-01-01

    Reviews the literature on the use of Web search engines in information science research, including: ways users interact with Web search engines; social aspects of searching; structure and dynamic nature of the Web; link analysis; other bibliometric applications; characterizing information on the Web; search engine evaluation and improvement; and…

  10. Enriching the Web of Data with Educational Information Using We-Share

    Science.gov (United States)

    Ruiz-Calleja, Adolfo; Asensio-Pérez, Juan I.; Vega-Gorgojo, Guillermo; Gómez-Sánchez, Eduardo; Bote-Lorenzo, Miguel L.; Alario-Hoyos, Carlos

    2017-01-01

    This paper presents We-Share, a social annotation application that enables educators to publish and retrieve information about educational ICT tools. As a distinctive characteristic, We-Share provides educators data about educational tools already available on the Web of Data while allowing them to enrich such data with their experience using…

  11. Speech-recognition interfaces for music information retrieval

    Science.gov (United States)

    Goto, Masataka

    2005-09-01

    This paper describes two hands-free music information retrieval (MIR) systems that enable a user to retrieve and play back a musical piece by saying its title or the artist's name. Although various interfaces for MIR have been proposed, speech-recognition interfaces suitable for retrieving musical pieces have not been studied. Our MIR-based jukebox systems employ two different speech-recognition interfaces for MIR, speech completion and speech spotter, which exploit intentionally controlled nonverbal speech information in original ways. The first is a music retrieval system with the speech-completion interface that is suitable for music stores and car-driving situations. When a user only remembers part of the name of a musical piece or an artist and utters only a remembered fragment, the system helps the user recall and enter the name by completing the fragment. The second is a background-music playback system with the speech-spotter interface that can enrich human-human conversation. When a user is talking to another person, the system allows the user to enter voice commands for music playback control by spotting a special voice-command utterance in face-to-face or telephone conversations. Experimental results from use of these systems have demonstrated the effectiveness of the speech-completion and speech-spotter interfaces. (Video clips: http://staff.aist.go.jp/m.goto/MIR/speech-if.html)

  12. Hybrid Information Flow Monitoring Against Web Tracking

    OpenAIRE

    Besson, Frédéric; Bielova, Nataliia; Jensen, Thomas

    2013-01-01

    Motivated by the problem of stateless web tracking (fingerprinting), we propose a novel approach to hybrid information flow monitoring by tracking the knowledge about secret variables using logical formulae. This knowledge representation helps to compare and improve precision of hybrid information flow monitors. We define a generic hybrid monitor parametrised by a static analysis and derive sufficient conditions on the static analysis for soundness and relative pre...

  13. A web-accessible content-based cervicographic image retrieval system

    Science.gov (United States)

    Xue, Zhiyun; Long, L. Rodney; Antani, Sameer; Jeronimo, Jose; Thoma, George R.

    2008-03-01

    Content-based image retrieval (CBIR) is the process of retrieving images by directly using image visual characteristics. In this paper, we present a prototype system implemented for CBIR for a uterine cervix image (cervigram) database. This cervigram database is a part of data collected in a multi-year longitudinal effort by the National Cancer Institute (NCI), and archived by the National Library of Medicine (NLM), for the study of the origins of, and factors related to, cervical precancer/cancer. Users may access the system with any Web browser. The system is built with a distributed architecture which is modular and expandable; the user interface is decoupled from the core indexing and retrieving algorithms, and uses open communication standards and open source software. The system tries to bridge the gap between a user's semantic understanding and image feature representation, by incorporating the user's knowledge. Given a user-specified query region, the system returns the most similar regions from the database, with respect to attributes of color, texture, and size. Experimental evaluation of the retrieval performance of the system on "groundtruth" test data illustrates its feasibility to serve as a possible research tool to aid the study of the visual characteristics of cervical neoplasia.

  14. Agent Community based Peer-to-Peer Information Retrieval

    Science.gov (United States)

    Mine, Tsunenori; Matsuno, Daisuke; Amamiya, Makoto

    This paper proposes an agent community based information retrieval method, which uses agent communities to manage and look up information related to users. An agent works as a delegate of its user and searches for information that the user wants by communicating with other agents. The communication between agents is carried out in a peer-to-peer computing architecture. In order to retrieve information related to a user query, an agent uses two histories: a query/retrieved document history (Q/RDH) and a query/sender agent history (Q/SAH). The former is a list of pairs of a query and retrieved documents, where the queries were sent by the agent itself. The latter is a list of pairs of a query and sender agents and shows "who sent what query to the agent". This is useful to find a new information source. Making use of the Q/SAH is expected to cause a collaborative filtering effect, which gradually creates virtual agent communities, where agents with the same interests stay together. Our hypothesis is that a virtual agent community reduces communication loads to perform a search. As an agent receives more queries, it acquires more links to new knowledge. From this behavior, a "give and take" (or positive feedback) effect for agents seems to emerge. We implemented this method with Multi-Agents Kodama, which has been developed in our laboratory, and conducted preliminary experiments to test the hypothesis. The empirical results showed that the method was much more efficient than a naive method employing 'broadcast' techniques only to look up a target agent.

  15. A Novel Fuzzy Document Based Information Retrieval Model for Forecasting

    Directory of Open Access Journals (Sweden)

    Partha Roy

    2017-06-01

    Information retrieval systems are generally used to find the documents that are most appropriate to a query that comes dynamically from users. In this paper a novel Fuzzy Document based Information Retrieval Model (FDIRM) is proposed for the purpose of Stock Market Index forecasting. The novelty of the proposed approach is a modified tf-idf scoring scheme to predict the future trend of the stock market index. The contribution of this paper has two dimensions: (1) in the proposed system the simple time series is converted to an enriched fuzzy linguistic time series with a unique approach of incorporating market sentiment related information along with the price, and (2) a unique approach is followed while modeling the information retrieval (IR) system, which converts a simple IR system into a forecasting system. From the performance comparison of FDIRM with standard benchmark models it can be affirmed that the proposed model has the potential of becoming a good forecasting model. The stock market data provided by the Standard & Poor’s CRISIL NSE Index 50 (CNX NIFTY-50) index of the National Stock Exchange of India (NSE) is used to experiment and validate the proposed model. The authentic data for validation and experimentation is obtained from http://www.nseindia.com, which is the official website of NSE. A Java program is under construction to implement the model in real time with a graphical user interface.
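
For orientation, the standard tf-idf weighting that this abstract says it modifies can be sketched as below. This is the textbook scheme, not the paper's modified version, whose details the abstract does not give; the function name `tf_idf` and the example tokens are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Compute textbook tf-idf weights for a list of tokenised documents.

    docs -- list of documents, each a list of tokens
    Returns one dict per document mapping token -> weight, where
    weight = (term count / document length) * log(N / document frequency).
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))  # count each token once per document
    weights = []
    for doc in docs:
        term_counts = Counter(doc)
        weights.append({
            token: (count / len(doc)) * math.log(n_docs / doc_freq[token])
            for token, count in term_counts.items()
        })
    return weights
```

A token that appears in every document gets weight zero, so only tokens that distinguish one document (here, hypothetically, one period of a linguistic time series) from the others contribute to the score.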

  16. 8th International Workshop on Information Filtering and Retrieval

    CERN Document Server

    Giuliani, Alessandro; Semeraro, Giovanni

    2017-01-01

    This book focuses on new research challenges in intelligent information filtering and retrieval. It collects invited chapters and extended research contributions from DART 2014 (the 8th International Workshop on Information Filtering and Retrieval), held in Pisa (Italy), on December 10, 2014, and co-hosted with the XIII AI*IA Symposium on Artificial Intelligence. The main focus of DART was to discuss and compare suitable novel solutions based on intelligent techniques and applied to real-world contexts. The chapters of this book present a comprehensive review of related works and the current state of the art. The contributions from both practitioners and researchers have been carefully reviewed by experts in the area, who also gave useful suggestions to improve the quality of the book.

  17. Informative Top-k Retrieval for Advanced Skill Management

    Science.gov (United States)

    Colucci, Simona; di Noia, Tommaso; Ragone, Azzurra; Ruta, Michele; Straccia, Umberto; Tinelli, Eufemia

    The paper presents a knowledge-based framework for skills and talent management based on an advanced matchmaking between profiles of candidates and available job positions. Interestingly, informative content of top-k retrieval is enriched through semantic capabilities. The proposed approach makes it possible to: (1) express a requested profile in terms of both hard constraints and soft ones; (2) provide a ranking function based also on qualitative attributes of a profile; (3) explain the resulting outcomes (given a job request, a motivation for the obtained score of each selected profile is provided). Top-k retrieval selects the most promising candidates according to an ontology formalizing the domain knowledge. Such knowledge is further exploited to provide a semantic-based explanation of missing or conflicting features in retrieved profiles. They also indicate additional profile characteristics emerging from the retrieval procedure for a further request refinement. A concrete case study followed by an exhaustive experimental campaign is reported to prove the approach effectiveness.

  18. The Use of a Context-Based Information Retrieval Technique

    Science.gov (United States)

    2009-07-01

    Carlson, 2004). However, in order to reduce plagiarism and manipulation, the specific details of these algorithms are closely protected and changed...age, academic background and gender can affect performance using information retrieval systems (Borgman, 1989). These factors can result in...and academic qualifications, a large proportion of the sample were recruited from a third year level or higher. 2.2 Materials 2.2.1 Demographic

  19. The use of categorization information in language models for question retrieval

    DEFF Research Database (Denmark)

    Cao, Xin; Cong, Gao; Cui, Bin

    2009-01-01

    and have become important information resources on the Web. To make the body of knowledge accumulated in CQA archives accessible, effective and efficient question search is required. Question search in a CQA archive aims to retrieve historical questions that are relevant to new questions posed by users....... This paper proposes a category-based framework for search in CQA archives. The framework embodies several new techniques that use language models to exploit categories of questions for improving question-answer search. Experiments conducted on real data from Yahoo! Answers demonstrate that the proposed...

  20. A STUDY ON RANKING METHOD IN RETRIEVING WEB PAGES BASED ON CONTENT AND LINK ANALYSIS: COMBINATION OF FOURIER DOMAIN SCORING AND PAGERANK SCORING

    Directory of Open Access Journals (Sweden)

    Diana Purwitasari

    2008-01-01

    The ranking module is an important component of the search process that sorts through relevant pages. Since a collection of Web pages has additional information inherent in the hyperlink structure of the Web, that information can be represented as a link score and then combined with the content score from the usual information retrieval techniques. In this paper we report our studies of a ranking score for Web pages that combines link analysis (PageRank Scoring) with content analysis (Fourier Domain Scoring). Our experiments use a collection of Web pages related to the subject of Statistics from Wikipedia, with the objectives of checking correctness and evaluating the performance of the combined ranking method. Evaluation of PageRank Scoring shows that the highest score does not always relate to Statistics. Since links within Wikipedia articles exist so that users are always one click away from more information on any point that has a link attached, it is possible that topics unrelated to Statistics are frequently mentioned in the collection. The combination method shows that a link score given a weight proportional to the content score of Web pages does affect the retrieval results.
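
The idea of combining a link score with a content score can be sketched as follows. The PageRank part is the usual power iteration; the content scores are treated as given numbers (Fourier Domain Scoring itself is not reproduced), and the linear mix in `combined_score`, its `weight` parameter, and both function names are assumptions for illustration rather than the paper's exact formula.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank.

    links -- dict mapping each page to the list of pages it links to;
             every link target must also appear as a key (an assumption).
    """
    pages = list(links)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, outgoing in links.items():
            # A page without out-links spreads its rank over all pages.
            targets = outgoing if outgoing else pages
            share = damping * rank[page] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank


def combined_score(content, link, weight=0.5):
    """Linear mix of link and content scores; weight is the share given to links."""
    return {page: weight * link[page] + (1.0 - weight) * content[page]
            for page in content}
```

Setting `weight` to 0 ranks by content alone and setting it to 1 ranks by links alone, which gives a simple way to observe how much the link structure changes the ordering for a given collection.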

  1. [SIBIL: an information tool for the information retrieval on bioethics].

    Science.gov (United States)

    Dracos, Adriana

    2004-01-01

    The article describes the main features of the website SIBIL (Sistema Informativo per la Bioetica In Linea) implemented within the framework of a research project of the ISS for collecting, indexing and disseminating Italian literature on bioethics since 1995 through an integrated electronic system. The site, addressed to a wide range of people interested at different degrees and levels in bioethics, offers a comprehensive overview of the activities, such as courses and meetings, on the major ethical issues at stake in Italy, as well as a survey of the most important activities both at national and international level. The main feature of SIBIL is a database of a large collection of documents retrieved through sources or exploitation of the most important international electronic databases. A thesaurus of 1,600 terms, available in Italian and English, was created in order to organize documents with standardized criteria currently adopted in the Italian scientific environment. Future trends of the website are also discussed for sharing experiences with other countries and laying the basis for a European portal on bioethics.

  2. Medical Information Retrieval Enhanced with User's Query Expanded with Tag-Neighbors

    DEFF Research Database (Denmark)

    Durao, Frederico; Bayyapu, Karunakar Reddy; Xu, Guandong

    2013-01-01

    Under-specified queries often lead to undesirable search results that do not contain the information needed. This problem gets worse when it comes to medical information, a natural human demand everywhere. Existing search engines on the Web often are unable to handle medical search well because they do not consider its special requirements. Often a medical information searcher is uncertain about his exact questions and unfamiliar with medical terminology. To overcome the limitations of under-specified queries, we utilize tags to enhance information retrieval capabilities by expanding users’ original queries with context-relevant information. We compute a set of significant tag neighbor candidates based on the neighbor frequency and weight, and utilize the qualified tag neighbors to expand an entry query. The proposed approach is evaluated by using MedWorm medical article collection...
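
    The idea of expanding a query with tag neighbors can be sketched as follows. This is a minimal illustration under stated assumptions: a "neighbor" is taken to be any tag that co-occurs with a query term on the same document, its co-occurrence count stands in for the frequency and weight mentioned in the abstract, and the names `tag_neighbors`, `expand_query`, `min_count` and `top_k` are hypothetical.

```python
from collections import Counter


def tag_neighbors(tagged_docs, query_terms, min_count=2, top_k=3):
    """Return the most frequent tags that co-occur with any query term."""
    query = set(query_terms)
    counts = Counter()
    for tags in tagged_docs:
        tags = set(tags)
        if tags & query:
            # Count every other tag on a document that matches the query.
            counts.update(tags - query)
    # Keep only neighbors seen often enough to be considered qualified.
    qualified = [(tag, c) for tag, c in counts.items() if c >= min_count]
    # Most frequent first; ties broken alphabetically for determinism.
    qualified.sort(key=lambda tc: (-tc[1], tc[0]))
    return [tag for tag, _ in qualified[:top_k]]


def expand_query(tagged_docs, query_terms, **kwargs):
    """Append the qualified tag neighbors to the original query terms."""
    return list(query_terms) + tag_neighbors(tagged_docs, query_terms, **kwargs)
```

    With a handful of tagged articles, a one-word query such as `["flu"]` would be extended by whichever tags appear alongside "flu" at least `min_count` times.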

  3. Using web 2.0 for health information

    CERN Document Server

    Younger, Paula

    2011-01-01

    Since it was first formally described in 2004, what is known as Web 2.0 has affected every library and information sector. Web 2.0 has tremendous potential to transform health information delivery. This book offers a cohesive overview of how Web 2.0 is changing health and medical information work.

  4. Web 2.0 and Critical Information Literacy

    Science.gov (United States)

    Dunaway, Michelle

    2011-01-01

    The impact of Web 2.0 upon culture, education, and knowledge is obfuscated by the pervasiveness of Web 2.0 applications and technologies. Web 2.0 is commonly conceptualized in terms of the tools that it makes possible, such as Facebook, Twitter, and Wikipedia. In the context of information literacy instruction, Web 2.0 is frequently conceptualized…

  5. Information retrieval for systematic reviews in food and feed topics: a narrative review.

    Science.gov (United States)

    Wood, Hannah; O'Connor, Annette; Sargeant, Jan; Glanville, Julie

    2018-01-09

    Systematic review methods are now being used for reviews of food production, food safety and security, plant health, and animal health and welfare. Information retrieval methods in this context have been informed by human healthcare approaches and ideally should be based on relevant research and experience. This narrative review seeks to identify and summarise current research-based evidence and experience on information retrieval for systematic reviews in food and feed topics. MEDLINE (Ovid), Science Citation Index (Web of Science) and ScienceDirect (http://www.sciencedirect.com/) were searched in 2012 and 2016. We also contacted topic experts and undertook citation searches. We selected and summarised studies reporting research on information retrieval, as well as published guidance and experience. There is little published evidence on the most efficient way to conduct searches for food and feed topics. There are few available study design search filters, and their use may be problematic given poor or inconsistent reporting of study methods. Food and feed research makes use of a wide range of study designs so it might be best to focus strategy development on capturing study populations, although this also has challenges. There is limited guidance on which resources should be searched and whether publication bias in disciplines relevant to food and feed necessitates extensive searching of the grey literature. There is some limited evidence on information retrieval approaches, but more research is required to inform effective and efficient approaches to searching to populate food and feed reviews.

  6. Web tools for effective retrieval, visualization, and evaluation of cardiology medical images and records

    Science.gov (United States)

    Masseroli, Marco; Pinciroli, Francesco

    2000-12-01

    To provide easy retrieval, integration and evaluation of multimodal cardiology images and data in a web browser environment, distributed application technologies and Java programming were used to implement a client-server architecture based on software agents. The server side manages secure connections and queries to heterogeneous remote databases and file systems containing patient personal and clinical data. The client side is a Java applet running in a web browser and providing a friendly medical user interface to perform queries on patient and medical test data and to integrate and properly visualize the various query results. A set of tools based on the Java Advanced Imaging API makes it possible to process and analyze the retrieved cardiology images and quantify their features in different regions of interest. The platform independence of Java technology makes the developed prototype easy to manage centrally and to provide at each site where an intranet or internet connection is available. By giving healthcare providers effective tools for querying, visualizing and comprehensively evaluating cardiology medical images and records in all locations where they may need them (i.e., emergency departments, operating theaters, wards, or even outpatient clinics), the developed prototype represents an important aid in providing more efficient diagnoses and medical treatments.

  7. Lower-Cost ε-Private Information Retrieval

    Directory of Open Access Journals (Sweden)

    Toledo Raphael R.

    2016-10-01

    Full Text Available Private Information Retrieval (PIR), despite being well studied, is computationally costly and arduous to scale. We explore lower-cost relaxations of information-theoretic PIR, based on dummy queries, sparse vectors, and compositions with an anonymity system. We prove the security of each scheme using a flexible differentially private definition for private queries that can capture notions of imperfect privacy. We show that basic schemes are weak, but some of them can be made arbitrarily safe by composing them with large anonymity systems.
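
    For orientation, the kind of information-theoretic PIR that such relaxations start from can be sketched with the classic two-server XOR construction. This is an assumed baseline for illustration, not one of the paper's relaxed schemes: records are modelled as integers, the two servers are assumed not to collude, and all function names are hypothetical.

```python
import secrets


def make_queries(db_size, index):
    """Return two bit vectors whose XOR is the unit vector for `index`.

    Each vector on its own is uniformly random, so a single server
    learns nothing about `index` (assuming the servers do not collude).
    """
    q1 = [secrets.randbelow(2) for _ in range(db_size)]
    q2 = list(q1)
    q2[index] ^= 1
    return q1, q2


def answer(database, query):
    """Server side: XOR together every record selected by a 1 bit."""
    result = 0
    for record, bit in zip(database, query):
        if bit:
            result ^= record
    return result


def retrieve(database, index):
    """Client side: the XOR of both answers is the requested record."""
    q1, q2 = make_queries(len(database), index)
    return answer(database, q1) ^ answer(database, q2)
```

    Each server touches about half of the database per query, which is the computational cost that motivates cheaper variants such as sparse query vectors.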

  8. An integrated information retrieval and document management system

    Science.gov (United States)

    Coles, L. Stephen; Alvarez, J. Fernando; Chen, James; Chen, William; Cheung, Lai-Mei; Clancy, Susan; Wong, Alexis

    1993-01-01

    This paper describes the requirements and prototype development for an intelligent document management and information retrieval system that will be capable of handling millions of pages of text or other data. Technologies for scanning, Optical Character Recognition (OCR), magneto-optical storage, and multiplatform retrieval using Structured Query Language (SQL) will be discussed. The semantic ambiguity inherent in the English language is somewhat compensated for through the use of coefficients or weighting factors for partial synonyms. Such coefficients are used both for defining structured query trees for routine queries and for establishing long-term interest profiles that can be used on a regular basis to alert individual users to the presence of relevant documents that may have just arrived from an external source, such as a news wire service. Although this attempt at evidential reasoning is limited in comparison with the latest developments in AI Expert Systems technology, it has the advantage of being commercially available.

  9. Estimating Missing Features to Improve Multimedia Information Retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Bagherjeiran, A; Love, N S; Kamath, C

    2006-09-28

    Retrieval in a multimedia database usually involves combining information from different modalities of data, such as text and images. However, all modalities of the data may not be available to form the query. The retrieval results from such a partial query are often less than satisfactory. In this paper, we present an approach to complete a partial query by estimating the missing features in the query. Our experiments with a database of images and their associated captions show that, with an initial text-only query, our completion method has similar performance to a full query with both image and text features. In addition, when we use relevance feedback, our approach outperforms the results obtained using a full query.

  10. INFORMATION RETRIEVAL SYSTEM USING MULTIWORDS EXPRESSIONS (MWE) AS DESCRIPTORS

    Directory of Open Access Journals (Sweden)

    Edson Marchetti da Silva

    2012-08-01

    Full Text Available This paper proposes an alternative method for retrieving documents using Multiwords Expressions (MWE) extracted from a document base as descriptors in the search of an Information Retrieval System (IRS). Unlike methods that treat the text as a set of words (bag of words), the proposed method takes into account the characteristics of the physical structure of the document when extracting MWE. The set of terms pre-processed using an exhaustive algorithmic technique proposed by the authors is compared with the results obtained for thirteen different statistical association measures generated by the software Ngram Statistics Package (NSP). To perform this experiment, a corpus of documents in digital format was set up

  11. Creating a Web-based image database for benchmarking image retrieval systems

    Science.gov (United States)

    Joergensen, Corinne; Srihari, Rohini K.

    1999-05-01

    There is, at present, a critical need within image retrieval research for an image testbed which would enable the objective evaluation of different content-based search engines, indexing and metadata schemes, and search heuristics, as well as research and evaluation in image-based knowledge structures and system architectures, user's needs in image retrieval and the cognitive processes involved in image searching. This paper discusses a pilot project specifying and establishing a prototype testbed for the evaluation of image retrieval techniques. A feasibility study is underway focusing on the development of a large set of standardized test images accessible through a web interface, and researchers in the field are being surveyed for input. Areas being addressed in the feasibility study include technical specifications as well as content issues such as: which specific image domains to include; the useful proportion of images belonging to specific domains to images belonging to a general 'world' domain; types of image attributes and baseline and 'advanced' levels of image description needed, and research needs to be accommodated, as well as development of a standardized set of test queries and the establishment of methods for 'truthing' the database and test queries.

  12. Working out and the description of the hypertext information retrieval thesaurus on algebra

    Directory of Open Access Journals (Sweden)

    Ирина Викторовна Кузнецова

    2011-09-01

    Full Text Available The article describes the development of a hypertext information retrieval thesaurus on algebra, carried out in the course of designing a hypertext information retrieval thesaurus for the metalanguage of a science.

  13. Programmatic access to data and information at the IRIS DMC via web services

    Science.gov (United States)

    Weertman, B. R.; Trabant, C.; Karstens, R.; Suleiman, Y. Y.; Ahern, T. K.; Casey, R.; Benson, R. B.

    2011-12-01

    The IRIS Data Management Center (DMC) has developed a suite of web services that provide access to the DMC's time series holdings, their related metadata and earthquake catalogs. In addition, services are available to perform simple, on-demand time series processing at the DMC prior to being shipped to the user. The primary goal is to provide programmatic access to data and processing services in a manner usable by and useful to the research community. The web services are relatively simple to understand and use and will form the foundation on which future DMC access tools will be built. Based on standard Web technologies they can be accessed programmatically with a wide range of programming languages (e.g. Perl, Python, Java), command line utilities such as wget and curl or with any web browser. We anticipate these services being used for everything from simple command line access, used in shell scripts and higher programming languages to being integrated within complex data processing software. In addition to improving access to our data by the seismological community the web services will also make our data more accessible to other disciplines. The web services available from the DMC include ws-bulkdataselect for the retrieval of large volumes of miniSEED data, ws-timeseries for the retrieval of individual segments of time series data in a variety of formats (miniSEED, SAC, ASCII, audio WAVE, and PNG plots) with optional signal processing, ws-station for station metadata in StationXML format, ws-resp for the retrieval of instrument response in RESP format, ws-sacpz for the retrieval of sensor response in the SAC poles and zeros convention and ws-event for the retrieval of earthquake catalogs. To make the services even easier to use, the DMC is developing a library that allows Java programmers to seamlessly retrieve and integrate DMC information into their own programs. The library will handle all aspects of dealing with the services and will parse the returned

  14. Retrieving Full Object Information from Partial Object Information using Digital Holography

    Science.gov (United States)

    Jackin, B. J.; Palanisamy, P. K.; Yatagai, T.

    2011-10-01

    Storage and retrieval of object information from hologram using partial object as input is reported. This method uses holographic associative memory principles combined with digital image processing techniques. The inclusion of digital image processing helps in eliminating the iterations which are otherwise mandatory when using neural network principles for object information retrieval. The implementation method is explained and simulation results are presented. The reconstructed images agree well with the object chosen.

  15. Extracting Macroscopic Information from Web Links.

    Science.gov (United States)

    Thelwall, Mike

    2001-01-01

    Discussion of Web-based link analysis focuses on an evaluation of Ingwersen's proposed external Web Impact Factor for the original use of the Web, namely the interlinking of academic research. Studies relationships between academic hyperlinks and research activities for British universities and discusses the use of search engines for Web link…

  16. Information Retrieval Methods in Libraries and Information Centers ...

    African Journals Online (AJOL)

  17. Dogslife: A web-based longitudinal study of Labrador Retriever health in the UK

    Directory of Open Access Journals (Sweden)

    Clements Dylan N

    2013-01-01

    Full Text Available Background: Dogslife is the first large-scale internet-based longitudinal study of canine health. The study has been designed to examine how environmental and genetic factors influence the health and development of a birth cohort of UK-based pedigree Labrador Retrievers. Results: In the first 12 months of the study 1,407 Kennel Club (KC) registered eligible dogs were recruited, at a mean age of 119 days of age (SD 69 days, range 3 days – 504 days). Recruitment rates varied depending upon the study team’s ability to contact owners. Where owners authorised the provision of contact details 8.4% of dogs were recruited compared to 1.3% where no direct contact was possible. The proportion of dogs recruited was higher for owners who transferred the registration of their puppy from the breeder to themselves with the KC, and for owners who were sent an e-mail or postcard requesting participation in the project. Compliance with monthly updates was highly variable. For the 280 dogs that were aged 400 days or more on the 30th June 2011, we estimated between 39% and 45% of owners were still actively involved in the project. Initial evaluation suggests that the cohort is representative of the general population of the KC registered Labrador Retrievers eligible to enrol with the project. Clinical signs of illnesses were reported in 44.3% of Labrador Retrievers registered with Dogslife (median age of first illness 138 days), although only 44.1% of these resulted in a veterinary presentation (median age 316 days). Conclusions: The web-based platform has enabled the recruitment of a representative population of KC registered Labrador Retrievers, providing the first large-scale longitudinal population-based study of dog health. The use of multiple different methods (e-mail, post and telephone) of contact with dog owners was essential to maximise recruitment and retention of the cohort.

  18. Dogslife: a web-based longitudinal study of Labrador Retriever health in the UK.

    Science.gov (United States)

    Clements, Dylan N; Handel, Ian G; Rose, Erica; Querry, Damon; Pugh, Carys A; Ollier, William Er; Morgan, Kenton L; Kennedy, Lorna J; Sampson, Jeffery; Summers, Kim M; de Bronsvoort, B Mark C

    2013-01-18

    Dogslife is the first large-scale internet-based longitudinal study of canine health. The study has been designed to examine how environmental and genetic factors influence the health and development of a birth cohort of UK-based pedigree Labrador Retrievers. In the first 12 months of the study 1,407 Kennel Club (KC) registered eligible dogs were recruited, at a mean age of 119 days of age (SD 69 days, range 3 days - 504 days). Recruitment rates varied depending upon the study team's ability to contact owners. Where owners authorised the provision of contact details 8.4% of dogs were recruited compared to 1.3% where no direct contact was possible. The proportion of dogs recruited was higher for owners who transferred the registration of their puppy from the breeder to themselves with the KC, and for owners who were sent an e-mail or postcard requesting participation in the project. Compliance with monthly updates was highly variable. For the 280 dogs that were aged 400 days or more on the 30th June 2011, we estimated between 39% and 45% of owners were still actively involved in the project. Initial evaluation suggests that the cohort is representative of the general population of the KC registered Labrador Retrievers eligible to enrol with the project. Clinical signs of illnesses were reported in 44.3% of Labrador Retrievers registered with Dogslife (median age of first illness 138 days), although only 44.1% of these resulted in a veterinary presentation (median age 316 days). The web-based platform has enabled the recruitment of a representative population of KC registered Labrador Retrievers, providing the first large-scale longitudinal population-based study of dog health. The use of multiple different methods (e-mail, post and telephone) of contact with dog owners was essential to maximise recruitment and retention of the cohort.

  19. User-Oriented and Cognitive Models of Information Retrieval

    DEFF Research Database (Denmark)

    Ingwersen, Peter; Järvelin, Kalervo; Skov, Mette

    2017-01-01

    The domain of user-oriented and cognitive information retrieval (IR) is first discussed, followed by a discussion on the dimensions and types of models one may build for the domain. The focus of the present entry is on the models of user-oriented and cognitive IR, not on their empirical applications. Several models with different emphases on user-oriented and cognitive IR are presented, ranging from overall approaches and relevance models to procedural models, cognitive models, and task-based models. The present entry does not discuss empirical findings based on the models.

  20. Rapid automatic keyword extraction for information retrieval and analysis

    Science.gov (United States)

    Rose, Stuart J [Richland, WA]; Cowley, Wendy E [Richland, WA]; Crow, Vernon L [Richland, WA]; Cramer, Nicholas O [Richland, WA]

    2012-03-06

    Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.
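
    The steps listed in this abstract map onto a short sketch: split the text at delimiters and stop words to get candidate keywords, score each word from its co-occurrence degree and frequency, and sum the word scores per candidate. The version below is a minimal illustration, not the patented implementation. It assumes the degree-to-frequency ratio as the word score (one of the options the abstract allows), a deliberately tiny stop-word list, and hypothetical function names.

```python
import re
from collections import defaultdict

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "and", "are", "for", "in", "is", "of", "on", "the", "to", "with"}


def candidate_keywords(text, stop_words=STOP_WORDS):
    """Split text at delimiters and stop words into candidate phrases."""
    candidates = []
    for fragment in re.split(r"[.,;:!?()\n]+", text.lower()):
        current = []
        for word in re.findall(r"[a-z0-9][a-z0-9'-]*", fragment):
            if word in stop_words:
                if current:
                    candidates.append(tuple(current))
                current = []
            else:
                current.append(word)
        if current:
            candidates.append(tuple(current))
    return candidates


def word_scores(candidates):
    """Score each word as co-occurrence degree divided by frequency."""
    freq = defaultdict(int)
    degree = defaultdict(int)
    for phrase in candidates:
        for word in phrase:
            freq[word] += 1
            # Degree counts the words sharing a candidate, including itself.
            degree[word] += len(phrase)
    return {word: degree[word] / freq[word] for word in freq}


def extract_keywords(text, top_n=3):
    """Return the top_n candidates ranked by the sum of their word scores."""
    candidates = candidate_keywords(text)
    scores = word_scores(candidates)
    ranked = {" ".join(p): sum(scores[w] for w in p) for p in candidates}
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(ranked.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]
```

    Because longer candidates accumulate the scores of several words, multi-word phrases tend to outrank isolated words, which is why only a portion of the highest-scoring candidates is kept as keywords.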

  1. Concept similarity and related categories in information retrieval using formal concept analysis

    Science.gov (United States)

    Eklund, P.; Ducrou, J.; Dau, F.

    2012-11-01

    The application of formal concept analysis to the problem of information retrieval has been shown useful but has lacked any real analysis of the idea of relevance ranking of search results. SearchSleuth is a program developed to experiment with the automated local analysis of Web search using formal concept analysis. SearchSleuth extends a standard search interface to include a conceptual neighbourhood centred on a formal concept derived from the initial query. This neighbourhood of the concept derived from the search terms is decorated with its upper and lower neighbours representing more general and special concepts, respectively. SearchSleuth is in many ways an archetype of search engines based on formal concept analysis with some novel features. In SearchSleuth, the notion of related categories - which are themselves formal concepts - is also introduced. This allows the retrieval focus to shift to a new formal concept called a sibling. This movement across the concept lattice needs to relate one formal concept to another in a principled way. This paper presents the issues concerning exploring, searching, and ordering the space of related categories. The focus is on understanding the use and meaning of proximity and semantic distance in the context of information retrieval using formal concept analysis.
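
    How a formal concept can be derived from the terms of a query is easier to see in a small sketch. It assumes a formal context stored as a dictionary from objects (for example, result documents) to their attribute sets (for example, index terms); the function names are hypothetical, and the computation of upper and lower neighbours in the lattice is left out.

```python
def extent(context, attributes):
    """Objects that have every attribute in `attributes`."""
    attrs = set(attributes)
    return {obj for obj, has in context.items() if attrs <= has}


def intent(context, objects):
    """Attributes shared by every object in `objects`."""
    objects = list(objects)
    if not objects:
        # By convention, the empty object set shares all attributes.
        return set().union(*context.values())
    shared = set(context[objects[0]])
    for obj in objects[1:]:
        shared &= context[obj]
    return shared


def concept_from_query(context, query_terms):
    """Close the query terms into a formal concept (extent, intent)."""
    objects = extent(context, query_terms)
    return objects, intent(context, objects)
```

    The intent returned for a query can contain more terms than the user typed; those extra shared terms are what a conceptual neighbourhood can offer as refinements.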

  2. WebGIS for mapping information derived from paleoenvironmental literature

    OpenAIRE

    小口, 高; Oguchi, Takashi; 近藤, 康久; Kondo, Yasuhisa

    2012-01-01

    Web-based Geographical Information Systems (WebGIS) allow us to distribute interactive maps via the Internet. Users can handle the maps using a web browser to change the scale, contents and extent of a displayed map. WebGIS can also distribute text descriptions for particular locations. We use WebGIS to map information on paleoenvironmental literature published in academic journals. A preliminary system of WebGIS was constructed in the late 1990s and early 2000s, using ArcView IMS from ESRI a...

  3. Introduction to Web Information Retrieval: A User Perspective

    Indian Academy of Sciences (India)

    annotations, bookmarking, etc. on the user's end. Search provided by the researcher is usually more complicated than a simple keyword search. It is necessary for the researcher to be much more precise in formulating search terms. For example, suppose the researcher is looking for literature about IP tunneling in computer ...

  4. A Model Based on Cocitation for Web Information Retrieval

    Directory of Open Access Journals (Sweden)

    Yue Xie

    2014-01-01

    Full Text Available According to the relationship between authority and cocitation in HITS, we propose a new hyperlink weighting scheme to describe the strength of the relevancy between any two webpages. Then we combine hyperlink weight normalization and random surfing schemes as used in PageRank to justify the new model. In the new model based on cocitation (MBCC, the pages with stronger relevancy are assigned higher values, not just depending on the outlinks. This model combines both features of HITS and PageRank. Finally, we present the results of some numerical experiments, showing that the MBCC ranking agrees with the HITS ranking, especially in top 10. Meanwhile, MBCC keeps the superiority of PageRank, that is, existence and uniqueness of ranking vectors.
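
    The cocitation relation that the model builds on can be made concrete with a short sketch: two pages are cocited whenever a third page links to both, and the count of such pages is the corresponding entry of the matrix product used for HITS authority scores. The code below only counts cocitations; the hyperlink weighting scheme and the normalization of MBCC are not given in the abstract and are not reproduced, and the function name is hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def cocitation_counts(links):
    """Count, for each pair of pages, how many pages link to both.

    `links` maps a source page to the pages it links to. Pairs are
    returned as alphabetically ordered tuples.
    """
    counts = defaultdict(int)
    for targets in links.values():
        # Every unordered pair of distinct targets is cocited once.
        for a, b in combinations(sorted(set(targets)), 2):
            counts[(a, b)] += 1
    return dict(counts)
```

    A pair with a high count is cited together often, which is the kind of relevancy signal a weighting scheme could turn into link weights before a PageRank-style iteration.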

  5. Web-Based Interactive Visualization in an Information Retrieval Course.

    Science.gov (United States)

    Brusilovsky, Peter

    Interactive visualization is a powerful educational tool. It has been used to enhance the teaching of various subjects from computer science to chemistry to engineering. In computer science education, this powerful tool is used almost exclusively in programming and data structure courses. This paper suggests that visualization could be very…

  6. Intelligent Information Retrieval and Web Mining Architecture Using SOA

    Science.gov (United States)

    El-Bathy, Naser Ibrahim

    2010-01-01

    The study of this dissertation provides a solution to a very specific problem instance in the area of data mining, data warehousing, and service-oriented architecture in publishing and newspaper industries. The research question focuses on the integration of data mining and data warehousing. The research problem focuses on the development of…

  7. Geographic Information Systems and Web Page Development

    Science.gov (United States)

    Reynolds, Justin

    2004-01-01

    The Facilities Engineering and Architectural Branch is responsible for the design and maintenance of buildings, laboratories, and civil structures. In order to improve efficiency and quality, the FEAB has dedicated itself to establishing a data infrastructure based on Geographic Information Systems, GIS. The value of GIS was explained in an article dating back to 1980 entitled "Need for a Multipurpose Cadastre," which stated, "There is a critical need for a better land-information system in the United States to improve land-conveyance procedures, furnish a basis for equitable taxation, and provide much-needed information for resource management and environmental planning." Scientists and engineers both point to GIS as the solution. What is GIS? According to most textbooks, Geographic Information Systems is a class of software that stores, manages, and analyzes mappable features on, above, or below the surface of the earth. GIS software is basically database management software applied to the management of spatial data and information. Simply put, Geographic Information Systems manage, analyze, chart, graph, and map spatial information. At the outset, I was given goals and expectations from my branch and from my mentor with regard to the further implementation of GIS. Those goals are as follows: (1) Continue the development of GIS for the underground structures. (2) Extract and export annotated data from AutoCAD drawing files and construct a database (to serve as a prototype for future work). (3) Examine existing underground record drawings to determine existing and non-existing underground tanks. Once this data was collected and analyzed, I set out on the task of creating a user-friendly database that could be accessed by all members of the branch. It was important that the database be built using programs that most employees already possess, ruling out most AutoCAD-based viewers. Therefore, I set out to create an Access database that translated onto the web using Internet

  8. Web-Scale Discovery Services Retrieve Relevant Results in Health Sciences Topics Including MEDLINE Content

    Directory of Open Access Journals (Sweden)

    Elizabeth Margaret Stovold

    2017-06-01

    Full Text Available A Review of: Hanneke, R., & O’Brien, K. K. (2016). Comparison of three web-scale discovery services for health sciences research. Journal of the Medical Library Association, 104(2), 109-117. http://dx.doi.org/10.3163/1536-5050.104.2.004 Abstract Objective – To compare the results of health sciences search queries in three web-scale discovery (WSD) services for relevance, duplicate detection, and retrieval of MEDLINE content. Design – Comparative evaluation and bibliometric study. Setting – Six university libraries in the United States of America. Subjects – Three commercial WSD services: Primo, Summon, and EBSCO Discovery Service (EDS). Methods – The authors collected data at six universities, including their own. They tested each of the three WSDs at two data collection sites. However, since one of the sites was using a legacy version of Summon that was due to be upgraded, data collected for Summon at this site were considered obsolete and excluded from the analysis. The authors generated three questions for each of six major health disciplines, then designed simple keyword searches to mimic typical student search behaviours. They captured the first 20 results from each query run at each test site, to represent the first “page” of results, giving a total of 2,086 total search results. These were independently assessed for relevance to the topic. Authors resolved disagreements by discussion, and calculated a kappa inter-observer score. They retained duplicate records within the results so that the duplicate detection by the WSDs could be compared. They assessed MEDLINE coverage by the WSDs in several ways. Using precise strategies to generate a relevant set of articles, they conducted one search from each of the six disciplines in PubMed so that they could compare retrieval of MEDLINE content. These results were cross-checked against the first 20 results from the corresponding query in the WSDs. To aid investigation of overall

  9. StreptomycesInforSys: A web-enabled information repository.

    Science.gov (United States)

    Jain, Chakresh Kumar; Gupta, Vidhi; Gupta, Ashvarya; Gupta, Sanjay; Wadhwa, Gulshan; Sharma, Sanjeev Kumar; Sarethy, Indira P

    2012-01-01

    Members of Streptomyces produce 70% of natural bioactive products. There is considerable amount of information available based on polyphasic approach for classification of Streptomyces. However, this information based on phenotypic, genotypic and bioactive component production profiles is crucial for pharmacological screening programmes. This is scattered across various journals, books and other resources, many of which are not freely accessible. The designed database incorporates polyphasic typing information using combinations of search options to aid in efficient screening of new isolates. This will help in the preliminary categorization of appropriate groups. It is a free relational database compatible with existing operating systems. A cross platform technology with XAMPP Web server has been used to develop, manage, and facilitate the user query effectively with database support. Employment of PHP, a platform-independent scripting language, embedded in HTML and the database management software MySQL will facilitate dynamic information storage and retrieval. The user-friendly, open and flexible freeware (PHP, MySQL and Apache) is foreseen to reduce running and maintenance cost. www.sis.biowaves.org.

  10. SIT-REM: An Interoperable and Interactive Web Geographic Information System for Fauna, Flora and Plant Landscape Data Management

    National Research Council Canada - National Science Library

    Frontoni, Emanuele; Mancini, Adriano; Zingaretti, Primo; Malinverni, Eva; Pesaresi, Simone; Biondi, Edoardo; Pandolfi, Massimo; Marseglia, Maria; Sturari, Mirco; Zabaglia, Claudio

    2014-01-01

      The main goal of the SIT-REM project is the design and the development of an interoperable web-GIS environment for the information retrieval and data editing/updating of the geobotanical and wildlife...

  11. Query-by-Example Music Information Retrieval by Score-Informed Source Separation and Remixing Technologies

    OpenAIRE

    Okuno, Hiroshi G.; Tetsuya Ogata; Kazunori Komatani; Masataka Goto; Katsutoshi Itoyama

    2011-01-01

    We describe a novel query-by-example (QBE) approach in music information retrieval that allows a user to customize query examples by directly modifying the volume of different instrument parts. The underlying hypothesis of this approach is that the musical mood of retrieved results changes in relation to the volume balance of different instruments. On the basis of this hypothesis, we aim to clarify the relationship between the change in the volume balance of a query and the genre of the retr...

  12. An introduction to the Marshall information retrieval and display system

    Science.gov (United States)

    1974-01-01

    An on-line terminal oriented data storage and retrieval system is presented which allows a user to extract and process information from stored data bases. The use of on-line terminals for extracting and displaying data from the data bases provides a fast and responsive method for obtaining needed information. The system consists of general purpose computer programs that provide the overall capabilities of the total system. The system can process any number of data files via a Dictionary (one for each file) which describes the data format to the system. New files may be added to the system at any time, and reprogramming is not required. Illustrations of the system are shown, and sample inquiries and responses are given.

  13. EarthServer: Information Retrieval and Query Language

    Science.gov (United States)

    Perperis, Thanassis; Koltsida, Panagiota; Kakaletris, George

    2013-04-01

    Establishing open, unified, seamless, access and ad-hoc analytics on cross-disciplinary, multi-source, multi-dimensional, spatiotemporal Earth Science data of extreme-size and their supporting metadata are the main challenges of the EarthServer project (www.earthserver.eu), funded by the European Commission under its Seventh Framework Program. One of EarthServer's main objectives is to provide users with higher level coverage and metadata search, retrieval and processing capabilities to multi-disciplinary Earth Science data. Six Lighthouse Applications are being established, each one providing access to Cryospheric, Airborne, Atmospheric, Geology, Oceanography and Planetary science raster data repositories through strictly WCS 2.0 standard based service endpoints. EarthServer's information retrieval subsystem aims towards exploiting the WCS endpoints through a physically and logically distributed service oriented architecture, foreseeing the collaboration of several standard compliant services, capable of exploiting modern large grid and cloud infrastructures and of dynamically responding to availability and capabilities of underlying resources. Towards furthering technology for integrated, coherent service provision based on WCS and WCPS the concept of a query language (QL), unifying coverage and metadata processing and retrieval is introduced. EarthServer's information retrieval subsystem receives QL requests involving high volumes of all Earth Science data categories, executes them on the services that reside on the infrastructure and sends the results back to the requester through a high performance pipeline. In this contribution we briefly discuss EarthServer's service oriented coverage data and metadata search and retrieval architecture and further elaborate on the potentials of EarthServer's Query Language, called xWCPS (XQuery compliant WCPS). xWCPS aims towards merging the path that the two widely adopted standards (W3C XQuery, OGC WCPS) have paved, into a

  14. JANE, A new information retrieval system for the Radiation Shielding Information Center

    Energy Technology Data Exchange (ETDEWEB)

    Trubey, D.K.

    1991-05-01

    A new information storage and retrieval system has been developed for the Radiation Shielding Information Center (RSIC) at Oak Ridge National Laboratory to replace mainframe systems that have become obsolete. The database contains citations and abstracts of literature which were selected by RSIC analysts and indexed with terms from a controlled vocabulary. The database, begun in 1963, has been maintained continuously since that time. The new system, called JANE, incorporates automatic indexing techniques and on-line retrieval using the RSIC Data General Eclipse MV/4000 minicomputer. Automatic indexing and retrieval techniques based on fuzzy-set theory allow the presentation of results in order of Retrieval Status Value. The fuzzy-set membership function depends on term frequency in the titles and abstracts and on Term Discrimination Values which indicate the resolving power of the individual terms. These values are determined by the Cover Coefficient method. The use of a commercial database to store and retrieve the indexing information permits rapid retrieval of the stored documents. Comparisons of the new and presently-used systems for actual searches of the literature indicate that it is practical to replace the mainframe systems with a minicomputer system similar to the present version of JANE. 18 refs., 10 figs.

  15. Retrieval of Legal Information Through Discovery Layers: A Case Study Related to Indian Law Libraries

    Directory of Open Access Journals (Sweden)

    Kushwah, Shivpal Singh

    2016-09-01

    Full Text Available Purpose. The purpose of this paper is to analyze and evaluate discovery layer search tools for retrieval of legal information in Indian law libraries. This paper covers current practices in legal information retrieval with special reference to Indian academic law libraries, and analyses its importance in the domain of law. Design/Methodology/Approach. A web survey and observational study method are used to collect the data. Data related to the discovery tools were collected using email and further discussion held with the discovery layer/tool/product developers and their representatives. Findings. Results show that most of the Indian law libraries are subscribing to bundles of legal information resources such as HeinOnline, JSTOR, LexisNexis Academic, Manupatra, Westlaw India, SCC web, AIR Online (CDROM), and so on. International legal and academic resources are compatible with discovery tools because they support various standards related to online publishing and dissemination such as OAI-PMH, OpenURL, MARC21, and Z39.50, but Indian legal resources such as Manupatra, AIR, and SCC are not compatible with the discovery layers. The central index is one of the important components in a discovery search interface, and discovery layer services/tools could be useful for Indian law libraries also if they can include multiple legal and academic resources in their central index. But present practices and observations reveal that discovery layers do not provide a facility to cover legal information resources. Therefore, in their present form, discovery tools are not very useful; they are an incomplete, partial solution for Indian libraries because not all Indian legal resources available in the law libraries are covered. Originality/Value. Very limited research or published literature is available in the area of discovery layers and their compatibility with legal information resources.

  16. A Point-Set-Based Footprint Model and Spatial Ranking Method for Geographic Information Retrieval

    Directory of Open Access Journals (Sweden)

    Yong Gao

    2016-07-01

    Full Text Available In the recent big data era, massive spatially related data are continuously generated and scrambled from various sources. Acquiring accurate geographic information is also urgently demanded. How to accurately retrieve desired geographic information has become a prominent issue that needs to be resolved with high priority. The key technologies in geographic information retrieval are modeling document footprints and ranking documents based on their similarity evaluation. The traditional spatial similarity evaluation methods are mainly performed using an MBR (Minimum Bounding Rectangle) footprint model. However, due to its nature of simplification and roughness, the results of traditional methods tend to be isotropic and space-redundant. In this paper, a new model that constructs the footprints in the form of point-sets is presented. The point-set-based footprint coincides with the nature of place names in web pages, so it is redundancy-free, consistent, accurate, and anisotropic in describing the spatial extents of documents, and can handle multi-scale geographic information. The corresponding spatial ranking method is also presented based on the point-set-based model. The new similarity evaluation algorithm of this method first measures multiple distances for the spatial proximity across different scales, and then combines the frequency of place names to improve the accuracy and precision. The experimental results show that the proposed method outperforms the traditional methods with higher accuracies under different searching scenarios.

  17. Information Technology and Web use Characteristics of Nigerian ...

    African Journals Online (AJOL)

    The different uses of the Internet by a university to provide and access appropriate web content define the university's Web characteristics. Studies of the Information Technology (IT) and Web characteristics of Nigerian universities so far have focused mostly on public universities. This study, therefore, evaluated the ...

  18. A study of the use of simulated work task situations in interactive information retrieval evaluations

    DEFF Research Database (Denmark)

    Borlund, Pia

    2016-01-01

    Purpose – The purpose of this paper is to report a study of how the test instrument of a simulated work task situation is used in empirical evaluations of interactive information retrieval (IIR) and reported in the research literature. In particular, the author is interested to learn whether...... partly via citation analysis by use of Web of Science®, and partly by systematic search of online repositories. On this basis, 67 individual publications were identified and they constitute the sample of analysis. Findings – The analysis reveals a need for clarifications of how to use simulated work task...... situations in IIR evaluations. In particular, with respect to the design and creation of realistic simulated work task situations. There is a lack of tailoring of the simulated work task situations to the test participants. Likewise, the requirement to include the test participants’ personal information...

  19. Compact Optical Discs and the World Wide Web: Two Mediums in Digitized Information Delivery Services

    Directory of Open Access Journals (Sweden)

    Ziyu Lin

    1999-10-01

    Full Text Available

    Pages: 40-52

    Compact optical discs (CDs) and the World Wide Web (the Web) are two mechanisms that contemporary libraries extensively use for digitized information storage, dissemination, and retrieval. The Web features an unparalleled global accessibility free from many previously known temporal and spatial restrictions. Its real-time update capability is impossible for CDs. Web-based information delivery can reduce the cost in hardware and software ownership and management of a local library, and provide one-to-one customization to better serve the library's clients. The current limitations of the Web include inadequate speed in data transmission, particularly for multimedia applications, and its insufficient reliability, search capabilities, and security. In comparison, speed, quality, portability, and reliability are the current advantages of CDs over the Web. These features, together with the trend in the PC industry and market, suggest that CDs will exist and continue to develop. CD/Web hybrids can combine the best of both developing mechanisms and offer optimal results. Through a comparison of CDs and the Web, it is argued that the functionality and unique features of a technology determine its future.

  20. MPEG-7-Standardized tools for music information retrieval

    Science.gov (United States)

    Herre, Jürgen

    2005-09-01

    Today, many applications in Music Information Retrieval (MIR) employ audio features which have been tailored individually by the algorithm developers. For a broader use also in commercial applications, MIR technology can benefit significantly from a "common language" in audio signal description that can be used to annotate any type of multimedia assets in order to facilitate search and retrieval according to a wide range of conceivable criteria in an interoperable way. The audio part of the ISO/MPEG-7 "Multimedia Content Description Interface" provides such a common signal description language by defining a rather comprehensive set of standardized features [called "low level descriptors" (LLDs)], application-centric subsets, and a unified way of exchanging this data based on XML. The talk provides an overview of the MPEG-7 Audio tool chest, including existing and forthcoming extensions. While the idea is clearly to create a universal platform for any conceivable MIR task, some of the initially conceived applications of MPEG-7 Audio are illustrated.

  1. Source-constrained retrieval influences the encoding of new information.

    Science.gov (United States)

    Danckert, Stacey L; MacLeod, Colin M; Fernandes, Myra A

    2011-11-01

    Jacoby, Shimizu, Daniels, and Rhodes (Psychonomic Bulletin & Review, 12, 852-857, 2005) showed that new words presented as foils among a list of old words that had been deeply encoded were themselves subsequently better recognized than new words presented as foils among a list of old words that had been shallowly encoded. In Experiment 1, by substituting a deep-versus-shallow imagery manipulation for the levels-of-processing manipulation, we demonstrated that the effect is robust and that it generalizes, also occurring with a different type of encoding. In Experiment 2, we provided more direct evidence for context-related encoding during tests of deeply encoded words, showing enhanced priming for foils presented among deeply encoded targets when participants made the same deep-encoding judgments on those items as had been made on the targets during study. In Experiment 3, we established that the findings from Experiment 2 are restricted to this specific deep judgment task and are not a general consequence of these foils being associated with deeply encoded items. These findings provide support for the source-constrained retrieval hypothesis of Jacoby, Shimizu, Daniels, and Rhodes: New information can be influenced by how surrounding items are encoded and retrieved, as long as the surrounding items recruit a coherent mode of processing.

  2. HIDDEN WEB EXTRACTOR DYNAMIC WAY TO UNCOVER THE DEEP WEB

    OpenAIRE

    DR. ANURADHA; BABITA AHUJA

    2012-01-01

    In this era of digital tsunami of information on the web, everyone is completely dependent on the WWW for information retrieval. This has posed a challenging problem in extracting relevant data. Traditional web crawlers focus only on the surface web while the deep web keeps expanding behind the scene. The web databases are hidden behind the query interfaces. In this paper, we propose a Hidden Web Extractor (HWE) that can automatically discover and download data from the Hidden Web databases. ...

  3. Information Systems Weave their Worldwide Web

    CERN Multimedia

    2002-01-01

    Following the example of the Web for HTML documents, information services are developing a common standard, the Open Archives Initiative, to simplify the exchange of documents. Some 140 participants from 27 different countries attended the second Open Archives Initiative Workshop that was held at CERN last October. The abbreviation OAI denotes a standard designed to simplify the exchange of documents between information systems all over the world. The OAI (Open Archives Initiative) has gone from strength to strength since its launch three years ago, as reflected in the turn-out for the second workshop on it held at CERN from 17th to 19th October. The event attracted some 140 participants from 27 different countries, not only from the library sector but also from the academic world in general. The OAI constitutes a mini revolution in the cataloguing of public documents. The definition and use of an extremely simple common protocol means that the contents of any database of texts, photos, videos or other medi...

  4. Issues in the use of neural networks in information retrieval

    CERN Document Server

    Iatan, Iuliana F

    2017-01-01

    This book highlights the ability of neural networks (NNs) to be excellent pattern matchers and their importance in information retrieval (IR), which is based on index term matching. The book defines a new NN-based method for learning image similarity and describes how to use fuzzy Gaussian neural networks to predict personality. It introduces the fuzzy Clifford Gaussian network, and two concurrent neural models: (1) concurrent fuzzy nonlinear perceptron modules, and (2) concurrent fuzzy Gaussian neural network modules. Furthermore, it explains the design of a new model of fuzzy nonlinear perceptron based on alpha level sets and describes a recurrent fuzzy neural network model with a learning algorithm based on the improved particle swarm optimization method.

  5. Cross-language information retrieval using PARAFAC2.

    Energy Technology Data Exchange (ETDEWEB)

    Bader, Brett William; Chew, Peter; Abdelali, Ahmed (New Mexico State University, Las Cruces, NM); Kolda, Tamara Gibson

    2007-05-01

    A standard approach to cross-language information retrieval (CLIR) uses Latent Semantic Analysis (LSA) in conjunction with a multilingual parallel aligned corpus. This approach has been shown to be successful in identifying similar documents across languages - or more precisely, retrieving the most similar document in one language to a query in another language. However, the approach has severe drawbacks when applied to a related task, that of clustering documents 'language-independently', so that documents about similar topics end up closest to one another in the semantic space regardless of their language. The problem is that documents are generally more similar to other documents in the same language than they are to documents in a different language, but on the same topic. As a result, when using multilingual LSA, documents will in practice cluster by language, not by topic. We propose a novel application of PARAFAC2 (which is a variant of PARAFAC, a multi-way generalization of the singular value decomposition [SVD]) to overcome this problem. Instead of forming a single multilingual term-by-document matrix which, under LSA, is subjected to SVD, we form an irregular three-way array, each slice of which is a separate term-by-document matrix for a single language in the parallel corpus. The goal is to compute an SVD for each language such that V (the matrix of right singular vectors) is the same across all languages. Effectively, PARAFAC2 imposes the constraint, not present in standard LSA, that the 'concepts' in all documents in the parallel corpus are the same regardless of language. Intuitively, this constraint makes sense, since the whole purpose of using a parallel corpus is that exactly the same concepts are expressed in the translations. We tested this approach by comparing the performance of PARAFAC2 with standard LSA in solving a particular CLIR problem. From our results, we conclude that PARAFAC2 offers a very promising alternative to

  6. Accelerating Information Retrieval from Profile Hidden Markov Model Databases.

    Directory of Open Access Journals (Sweden)

    Ahmad Tamimi

    Full Text Available Profile Hidden Markov Model (Profile-HMM) is an efficient statistical approach to represent protein families. Currently, several databases maintain valuable protein sequence information as profile-HMMs. There is an increasing interest to improve the efficiency of searching Profile-HMM databases to detect sequence-profile or profile-profile homology. However, most efforts to enhance searching efficiency have been focusing on improving the alignment algorithms. Although the performance of these algorithms is fairly acceptable, the growing size of these databases, as well as the increasing demand for using batch query searching approach, are strong motivations that call for further enhancement of information retrieval from profile-HMM databases. This work presents a heuristic method to accelerate the current profile-HMM homology searching approaches. The method works by cluster-based remodeling of the database to reduce the search space, rather than focusing on the alignment algorithms. Using different clustering techniques, 4284 TIGRFAMs profiles were clustered based on their similarities. A representative for each cluster was assigned. To enhance sensitivity, we proposed an extended step that allows overlapping among clusters. A validation benchmark of 6000 randomly selected protein sequences was used to query the clustered profiles. To evaluate the efficiency of our approach, speed and recall values were measured and compared with the sequential search approach. Using hierarchical, k-means, and connected component clustering techniques followed by the extended overlapping step, we obtained an average reduction in time of 41%, and an average recall of 96%. Our results demonstrate that representation of profile-HMMs using a clustering-based approach can significantly accelerate data retrieval from profile-HMM databases.
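
    The two-stage idea in this abstract (score one representative per cluster, then search only inside the best-matching cluster) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy vectors, the names `similarity` and `clustered_search`, and the use of cosine similarity as a stand-in for a real sequence-profile alignment score. The overlapping-cluster step mentioned in the abstract is not modeled.

```python
def similarity(a, b):
    """Toy stand-in for an alignment score: cosine similarity of two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = (sum(x * x for x in a) ** 0.5) * (sum(y * y for y in b) ** 0.5)
    return dot / norm if norm else 0.0


def clustered_search(query, clusters, top_clusters=1):
    """Score cluster representatives first, then only members of the best clusters."""
    ranked = sorted(
        clusters,
        key=lambda c: similarity(query, c["representative"]),
        reverse=True,
    )
    comparisons = len(clusters)  # one comparison per representative
    best_name, best_score = None, float("-inf")
    for cluster in ranked[:top_clusters]:
        for name, profile in cluster["members"].items():
            comparisons += 1
            score = similarity(query, profile)
            if score > best_score:
                best_name, best_score = name, score
    return best_name, comparisons


# Hypothetical database: 9 profiles grouped into 3 clusters.
clusters = [
    {"representative": (1.0, 0.0, 0.0),
     "members": {"P1": (0.9, 0.1, 0.0), "P2": (1.0, 0.2, 0.1), "P3": (0.8, 0.0, 0.2)}},
    {"representative": (0.0, 1.0, 0.0),
     "members": {"P4": (0.1, 0.9, 0.0), "P5": (0.0, 1.0, 0.3), "P6": (0.2, 0.8, 0.1)}},
    {"representative": (0.0, 0.0, 1.0),
     "members": {"P7": (0.0, 0.1, 0.9), "P8": (0.1, 0.0, 1.0), "P9": (0.2, 0.2, 0.9)}},
]
hit, comparisons = clustered_search((0.1, 0.95, 0.05), clusters)
print(hit, comparisons)
```

    In this toy setup the query is compared with 3 representatives and 3 cluster members instead of all 9 profiles; raising `top_clusters` trades speed for recall.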

  7. The role of grammatical category information in spoken word retrieval.

    Science.gov (United States)

    Duràn, Carolina Palma; Pillon, Agnesa

    2011-01-01

    We investigated the role of lexical syntactic information such as grammatical gender and category in spoken word retrieval processes by using a blocking paradigm in picture and written word naming experiments. In Experiments 1, 3, and 4, we found that the naming of target words (nouns) from pictures or written words was faster when these target words were named within a list where only words from the same grammatical category had to be produced (homogeneous category list: all nouns) than when they had to be produced within a list comprising also words from another grammatical category (heterogeneous category list: nouns and verbs). On the other hand, we detected no significant facilitation effect when the target words had to be named within a homogeneous gender list (all masculine nouns) compared to a heterogeneous gender list (both masculine and feminine nouns). In Experiment 2, using the same blocking paradigm by manipulating the semantic category of the items, we found that naming latencies were significantly slower in the semantic category homogeneous in comparison with the semantic category heterogeneous condition. Thus semantic category homogeneity caused an interference, not a facilitation effect like grammatical category homogeneity. Finally, in Experiment 5, nouns in the heterogeneous category condition had to be named just after a verb (category-switching position) or a noun (same-category position). We found a facilitation effect of category homogeneity but no significant effect of position, which showed that the effect of category homogeneity found in Experiments 1, 3, and 4 was not due to a cost of switching between grammatical categories in the heterogeneous grammatical category list. These findings supported the hypothesis that grammatical category information impacts word retrieval processes in speech production, even when words are to be produced in isolation. They are discussed within the context of extant theories of lexical production.

  8. The role of grammatical category information in spoken word retrieval

    Directory of Open Access Journals (Sweden)

    Carolina Palma Duràn

    2011-11-01

    Full Text Available We investigated the role of lexical syntactic information such as grammatical gender and category in spoken word retrieval processes by using a blocking paradigm in picture and written word naming experiments. In Experiments 1, 3, and 4, we found that the naming of target words (nouns) from pictures or written words was faster when these target words were named within a list where only words from the same grammatical category had to be produced (homogeneous category list: all nouns) than when they had to be produced within a list comprising also words from another grammatical category (heterogeneous category list: nouns and verbs). On the other hand, no significant facilitation effect was detected when the target words had to be named within a homogeneous gender list (all masculine nouns) compared to a heterogeneous gender list (both masculine and feminine nouns). In Experiment 2, using the same blocking paradigm by manipulating the semantic category of the items, we found that naming times were significantly slower in the semantic category homogeneous in comparison with the semantic category heterogeneous condition. Thus semantic category homogeneity caused an interference, not a facilitation effect like grammatical category homogeneity. Finally, in Experiment 5, nouns in the heterogeneous category condition had to be named just after a verb (category-switching position) or a noun (same-category position). We found a facilitation effect of category homogeneity but no significant effect of position, which showed that the effect of category homogeneity found in Experiments 1, 3, and 4 was not due to a cost of switching between grammatical categories in the heterogeneous grammatical category list. These findings supported the hypothesis that grammatical category information could impact word retrieval processes in speech production, even when words are to be produced in isolation. They are discussed within the context of extant theories of lexical

  9. Information retrieval pathways for health information exchange in multiple care settings

    DEFF Research Database (Denmark)

    Kierkegaard, Patrick; Kaushal, Rainu; Vest, Joshua R.

    2014-01-01

    Objectives To determine which health information exchange (HIE) technologies and information retrieval pathways healthcare professionals relied on to meet their information needs in the context of laboratory test results, radiological images and reports, and medication histories. Study Design...... Primary data was collected over a 2-month period across 3 emergency departments, 7 primary care practices, and 2 public health clinics in New York state. Methods Qualitative research methods were used to collect and analyze data from semi-structured interviews and participant observation. Results...... The study reveals that healthcare professionals used a complex combination of information retrieval pathways for HIE to obtain clinical information from external organizations. The choice for each approach was setting- and information-specific, but was also highly dynamic across users and their information...

  10. Has Retrieval Technology in Vertical Site Search Systems Improved over the Years? A Holistic Evaluation for Real Web Systems

    Directory of Open Access Journals (Sweden)

    Mandl, Thomas

    2015-12-01

    Full Text Available Evaluation of retrieval systems is mostly limited to laboratory settings and rarely considers changes of performance over time. This article presents an evaluation of retrieval systems for internal Web site search systems between the years 2006 and 2011. A holistic evaluation methodology for real Web sites was developed which includes tests for functionality, search quality, and user interaction. Among other sites, one set of 20 Web site search systems was evaluated three times in different years and no substantial improvement could be shown. It is surprising that the communication between site and user still leads to very poor results in many cases. Overall, the quality of these search systems could be improved, and several areas for improvement are apparent from our evaluation. For a comparison, Google’s site search function was also tested with the same tasks.

  11. A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering.

    Science.gov (United States)

    Sarrouti, Mourad; Ouatik El Alaoui, Said

    2017-04-01

    Passage retrieval, the identification of top-ranked passages that may contain the answer for a given biomedical question, is a crucial component for any biomedical question answering (QA) system. Passage retrieval in open-domain QA is a longstanding challenge widely studied over the last decades. However, it still requires further efforts in biomedical QA. In this paper, we present a new biomedical passage retrieval method based on Stanford CoreNLP sentence/passage length, a probabilistic information retrieval (IR) model and UMLS concepts. In the proposed method, we first use our document retrieval system based on the PubMed search engine and UMLS similarity to retrieve documents relevant to a given biomedical question. We then take the abstracts from the retrieved documents and use the Stanford CoreNLP sentence splitter to produce a set of sentences, i.e., candidate passages. Using stemmed words and UMLS concepts as features for the BM25 model, we finally compute the similarity scores between the biomedical question and each of the candidate passages and keep the N top-ranked ones. Experimental evaluations performed on large standard datasets, provided by the BioASQ challenge, show that the proposed method achieves good performance compared with the current state-of-the-art methods. The proposed method significantly outperforms the current state-of-the-art methods by an average of 6.84% in terms of mean average precision (MAP). We have proposed an efficient passage retrieval method which can be used to retrieve relevant passages in biomedical QA systems with high mean average precision. Copyright © 2017 Elsevier Inc. All rights reserved.
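
    The final ranking step described above (score each candidate passage against the question with BM25 and keep the N best) might look roughly like the sketch below. It is not the authors' implementation: the function name `bm25_rank`, the parameter values `k1=1.2` and `b=0.75`, and the toy token lists are assumptions, and plain tokens stand in for the stemmed words and UMLS concepts used as features in the paper.

```python
import math
from collections import Counter


def bm25_rank(question_terms, passages, k1=1.2, b=0.75, top_n=3):
    """Rank tokenized passages against a tokenized question with Okapi BM25."""
    n = len(passages)
    avg_len = sum(len(p) for p in passages) / n
    # Document frequency: number of passages that contain each term.
    df = Counter(term for p in passages for term in set(p))
    scores = []
    for idx, passage in enumerate(passages):
        tf = Counter(passage)
        score = 0.0
        for term in set(question_terms):
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(passage) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append((score, idx))
    # Highest score first; ties broken by original passage order.
    scores.sort(key=lambda pair: (-pair[0], pair[1]))
    return scores[:top_n]


# Hypothetical candidate passages, already split into sentences and tokenized.
passages = [
    ["aspirin", "inhibit", "cox", "enzyme"],
    ["cox", "enzyme", "produce", "prostaglandin"],
    ["insulin", "regulate", "blood", "glucose"],
]
ranked = bm25_rank(["aspirin", "cox"], passages, top_n=2)
print(ranked)
```

    Each result is a `(score, passage_index)` pair, so the caller can map the indices back to the original sentences.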

  12. Deep Neural Networks for Web Page Information Extraction

    OpenAIRE

    Gogar, Tomas; Hubacek, Ondrej; Sedivy, Jan

    2016-01-01

    Part 3: Ontology-Web and Social Media AI Modeling (OWESOM); International audience; Web wrappers are systems for extracting structured information from web pages. Currently, wrappers need to be adapted to a particular website template before they can start the extraction process. In this work we present a new method, which uses convolutional neural networks to learn a wrapper that can extract information from previously unseen templates. Therefore, this wrapper does not need any site-specific...

  13. Ontology-Based Information Behaviour to Improve Web Search

    Directory of Open Access Journals (Sweden)

    Silvia Calegari

    2010-10-01

    Full Text Available Web Search Engines provide a huge number of answers in response to a user query, many of which are not relevant, whereas some of the most relevant ones may not be found. In the literature several approaches have been proposed in order to help a user to find the information relevant to his/her real needs on the Web. To achieve this goal the individual Information Behavior can be analyzed to 'keep' track of the user's interests. Keeping information is a type of Information Behavior, and in several works researchers have referred to it as the study of what people do during a search on the Web. Generally, the user's actions (e.g., how the user moves from one Web page to another, or her/his download of a document, etc.) are recorded in Web logs. This paper reports on research activities which aim to exploit the information extracted from Web logs (or query logs) in personalized user ontologies, with the objective to support the user in the process of discovering Web information relevant to her/his information needs. Personalized ontologies are used to improve the quality of Web search by applying two main techniques: query reformulation and re-ranking of query evaluation results. In this paper we analyze various methodologies presented in the literature aimed at using personalized ontologies, defined on the basis of the observation of Information Behaviour to help the user in finding relevant information.

  14. Assimilation of SMOS Retrievals in the Land Information System

    Science.gov (United States)

    Blankenship, Clay B.; Case, Jonathan L.; Zavodsky, Bradley T.; Crosson, William L.

    2016-01-01

    The Soil Moisture and Ocean Salinity (SMOS) satellite provides retrievals of soil moisture in the upper 5 cm with a 30-50 km resolution and a mission accuracy requirement of 0.04 cm³ cm⁻³. These observations can be used to improve land surface model soil moisture states through data assimilation. In this paper, SMOS soil moisture retrievals are assimilated into the Noah land surface model via an Ensemble Kalman Filter within the NASA Land Information System. Bias correction is implemented using Cumulative Distribution Function (CDF) matching, with points aggregated by either land cover or soil type to reduce sampling error in generating the CDFs. An experiment was run for the warm season of 2011 to test SMOS data assimilation and to compare assimilation methods. Verification of soil moisture analyses in the 0-10 cm upper layer and root zone (0-1 m) was conducted using in situ measurements from several observing networks in the central and southeastern United States. This experiment showed that SMOS data assimilation significantly increased the anomaly correlation of Noah soil moisture with station measurements from 0.45 to 0.57 in the 0-10 cm layer. Time series at specific stations demonstrate the ability of SMOS DA to increase the dynamic range of soil moisture in a manner consistent with station measurements. Among the bias correction methods, the correction based on soil type performed best at bias reduction but also reduced correlations. The vegetation-based correction did not produce any significant differences compared to using a simple uniform correction curve.
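
    The CDF-matching step mentioned in this abstract can be illustrated with a minimal sketch. It assumes two plain lists of soil moisture values (a satellite sample and a model sample); the function name `cdf_match` and the sample values are hypothetical and are not taken from the paper or from the NASA Land Information System.

```python
from bisect import bisect_right


def cdf_match(value, source_sample, reference_sample):
    """Map `value` from the source distribution onto the reference one.

    Illustrative only: empirical CDFs built from two small samples.
    """
    src = sorted(source_sample)
    ref = sorted(reference_sample)
    # Number of source values <= value gives the empirical cumulative rank.
    count = bisect_right(src, value)
    # Index in the reference sample at the same cumulative fraction
    # (ceiling division, then shifted to a zero-based index).
    idx = (count * len(ref) + len(src) - 1) // len(src) - 1
    # Clamp so values outside the source range map to the reference extremes.
    idx = min(max(idx, 0), len(ref) - 1)
    return ref[idx]


if __name__ == "__main__":
    satellite = [0.10, 0.12, 0.14, 0.16, 0.18]  # hypothetical retrievals
    model = [0.20, 0.25, 0.30, 0.35, 0.40]      # hypothetical model states
    print(cdf_match(0.14, satellite, model))
```

    Aggregating points by land cover or soil type, as the abstract describes, would correspond to building one pair of samples per class before calling such a function.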

  15. Query-Time Optimization Techniques for Structured Queries in Information Retrieval

    Science.gov (United States)

    Cartright, Marc-Allen

    2013-01-01

    The use of information retrieval (IR) systems is evolving towards larger, more complicated queries. Both the IR industrial and research communities have generated significant evidence indicating that in order to continue improving retrieval effectiveness, increases in retrieval model complexity may be unavoidable. From an operational perspective,…

  16. Ontology-Based Information Visualization: Toward Semantic Web Applications

    NARCIS (Netherlands)

    Fluit, Christiaan; Sabou, Marta; Harmelen, Frank van

    2006-01-01

    The Semantic Web is an extension of the current World Wide Web, based on the idea of exchanging information with explicit, formal, and machine-accessible descriptions of meaning. Providing information with such semantics will enable the construction of applications that have an increased awareness

  17. Evaluation Criteria for the Educational Web-Information System

    Science.gov (United States)

    Seok, Soonhwa; Meyen, Edward; Poggio, John C.; Semon, Sarah; Tillberg-Webb, Heather

    2008-01-01

    This article addresses how evaluation criteria improve educational Web-information system design, and the tangible and intangible benefits of using evaluation criteria, when implemented in an educational Web-information system design. The evaluation criteria were developed by the authors through a content validation study applicable to…

  18. Web accessibility practical advice for the library and information professional

    CERN Document Server

    Craven, Jenny

    2008-01-01

    Offers an introduction to web accessibility and usability for information professionals, offering advice on the concerns relevant to library and information organizations. This book can be used as a resource for developing staff training and awareness activities. It will also be of value to website managers involved in web design and development.

  19. Utilization of Web-Based Information Resources for Researchers in ...

    African Journals Online (AJOL)

    The findings revealed that respondents generally showed a positive attitude towards the use of web-based information resources. The findings imply that university libraries that provide such resources effectively will help to promote academic scholarship and research. Key Words: Web-Based, Information, ...

  20. Sample-based XPath Ranking for Web Information Extraction

    NARCIS (Netherlands)

    Jundt, Oliver; van Keulen, Maurice

    Web information extraction typically relies on a wrapper, i.e., program code or a configuration that specifies how to extract some information from web pages at a specific website. Manually creating and maintaining wrappers is a cumbersome and error-prone task. It may even be prohibitive as some
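
    As a rough illustration of a wrapper expressed as an XPath, the sketch below ranks candidate XPaths by how many user-supplied sample values they retrieve. The scoring rule (hits first, then precision), the function name `rank_xpaths`, and the toy page are assumptions made for this example; the abstract is truncated and does not specify the authors' actual ranking method.

```python
import xml.etree.ElementTree as ET

# Toy, well-formed page used only for this illustration.
PAGE = """<html><body>
<div class="item"><span class="name">Lamp</span><span class="price">12</span></div>
<div class="item"><span class="name">Desk</span><span class="price">80</span></div>
</body></html>"""


def rank_xpaths(page, candidates, samples):
    """Order candidate XPaths by how well they retrieve the sample values."""
    root = ET.fromstring(page)
    wanted = set(samples)
    scored = []
    for xpath in candidates:
        texts = [el.text for el in root.findall(xpath)]
        hits = sum(1 for t in texts if t in wanted)
        precision = hits / len(texts) if texts else 0.0
        scored.append((hits, precision, xpath))
    # More sample hits first; ties broken by precision (fewer extra nodes).
    scored.sort(key=lambda s: (-s[0], -s[1]))
    return [xpath for _, _, xpath in scored]


if __name__ == "__main__":
    candidates = [".//span", ".//span[@class='price']", ".//span[@class='name']"]
    print(rank_xpaths(PAGE, candidates, ["12", "80"]))
```

    The standard-library `ElementTree` supports only a subset of XPath, so a production wrapper would likely need a fuller XPath engine and an HTML-tolerant parser.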

  1. TOWARD SEMANTIC WEB INFRASTRUCTURE FOR SPATIAL FEATURES' INFORMATION

    Directory of Open Access Journals (Sweden)

    R. Arabsheibani

    2015-12-01

    Full Text Available The Web and its capabilities can be employed as a tool for data and information integration if comprehensive datasets and appropriate technologies and standards enable the web with interpretation and easy alignment of data and information. The Semantic Web, along with spatial functionalities, enables the web to deal with the huge amount of data and information. The present study investigates the advantages and limitations of the Spatial Semantic Web and compares its capabilities with relational models in order to build a spatial data infrastructure. An architecture is proposed and a set of criteria is defined for the efficiency evaluation. The results demonstrate that when using data with special characteristics such as schema dynamicity, sparse data or available relations between the features, the spatial semantic web and graph databases with spatial operations are preferable.

  2. STATUS/IQ: A Semi-Intelligent Information Retrieval System.

    Science.gov (United States)

    Pearsall, Jayne

    1990-01-01

    Provides background on the problems of traditional text retrieval systems and describes STATUS/IQ, an advanced text retrieval system that incorporates a natural language front-end and an advanced relevance ranking facility. The principles, capabilities, and benefits of the system are discussed, and an example of a STATUS/IQ session is presented…

  3. Physicists' Information Tasks: Structure, Length and Retrieval Performance

    DEFF Research Database (Denmark)

    Lykke, Marianne; Ingwersen, Peter; Bogers, Toine

    2010-01-01

    to describe the tasks, 3) what semantic categories were used to express the search facets, and 4) retrieval performance. Results show variety in structure and length across task descriptions and task purposes. The results indicate effect of length and, in particular, of task purpose on retrieval performance...

  4. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative

    OpenAIRE

    Sebastian Stober

    2017-01-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition, such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, ...

  5. Domain-Specific Thesaurus as a Tool for Information Retrieval and Collection of Knowledge

    Directory of Open Access Journals (Sweden)

    Vladimir N. Boikov

    2013-01-01

    Full Text Available This paper reports basic approaches to the constructive creation of an open resource named "Domain-specified thesaurus of poetics", which is one of the levels of an information-analytical system of Russian poetry (IAS RP). Poetics is a group of disciplines focused on a comprehensive theoretical and historical study of poetry. IAS RP will be used as a tool for a wide range of studies, making it possible to determine the characteristic features of the analyzed works of poetry. Consequently, the thesaurus is the knowledge base from which one can borrow input data for training the system. The aim of our research requires a specific approach to formatting the knowledge base. The thesaurus is a web-based resource which includes a domain-specific directory, information retrieval tools and tools for further analysis. A study of a glossary consisting of three thousand terms and a set of semantic fields is reviewed in this paper. An RDF graph of the domain-specified thesaurus of poetics is presented, containing 9 types of objects and different kinds of relationships among them. Wiki technologies are used to implement a resource that stores data in Semantic Web formats.

  6. WebCIS: large scale deployment of a Web-based clinical information system.

    Science.gov (United States)

    Hripcsak, G; Cimino, J J; Sengupta, S

    1999-01-01

    WebCIS is a Web-based clinical information system. It sits atop the existing Columbia University clinical information system architecture, which includes a clinical repository, the Medical Entities Dictionary, an HL7 interface engine, and an Arden Syntax based clinical event monitor. WebCIS security features include authentication with secure tokens, authorization maintained in an LDAP server, SSL encryption, permanent audit logs, and application time outs. WebCIS is currently used by 810 physicians at the Columbia-Presbyterian center of New York Presbyterian Healthcare to review and enter data into the electronic medical record. Current deployment challenges include maintaining adequate database performance despite complex queries, replacing large numbers of computers that cannot run modern Web browsers, and training users that have never logged onto the Web. Although the raised expectations and higher goals have increased deployment costs, the end result is a far more functional, far more available system.

  7. Automatic web filtering approach based on multimodal content information

    Science.gov (United States)

    Ming, Wei H.; Rossi, Lorenzo; Li, Ying; Kuo, C.-C. Jay

    2001-07-01

    An automatic web content classification system is proposed in this research for web information filtering. A sample group of web contents is first collected via commercial search engines. The contents are then classified into different subject groups, and more related web pages can be searched for further analysis. The system can free users from the troublesome, routine processing that human beings perform in most search engines, and the clustered information can be updated automatically at any specified time. Preliminary experimental results are used to demonstrate the effectiveness of the proposed system.

  8. Web-based sharing of electrocardiogram: a framework for information publishing.

    Science.gov (United States)

    Yuan, Shizhong; Wei, Daming; Xu, Weimin; Shen, Wenfeng

    2009-01-01

    Network-based data sharing is a current trend in medicine and healthcare. The search and retrieval architecture (SRA) we previously proposed for web-based sharing of electrocardiogram (ECG) facilitates the search and retrieval of ECG across hospitals via the Internet. The SRA has a triangle-like configuration including an ECG metadata registry, an ECG provider and an ECG querist. In this paper, we present a framework for ECG information publishing of an ECG provider. We also introduce a prototype of this framework, which was developed for an experimental scenario for assessment test based on MFER, an IEEE standard proposed from Japan. The assessment shows that the prototype of the framework can effectively publish the ECGs in a group of emulated MFER-conformant electrocardiographs, and the published ECGs can be successfully discovered and retrieved via the Internet.

  9. Comparing the quality of accessing medical literature using content-based visual and textual information retrieval

    Science.gov (United States)

    Müller, Henning; Kalpathy-Cramer, Jayashree; Kahn, Charles E., Jr.; Hersh, William

    2009-02-01

    Content-based visual information (or image) retrieval (CBIR) has been an extremely active research domain within medical imaging over the past ten years, with the goal of improving the management of visual medical information. Many technical solutions have been proposed, and application scenarios for image retrieval as well as image classification have been set up. However, in contrast to medical information retrieval using textual methods, visual retrieval has only rarely been applied in clinical practice. This is despite the large amount and variety of visual information produced in hospitals every day. This information overload imposes a significant burden upon clinicians, and CBIR technologies have the potential to help the situation. However, in order for CBIR to become an accepted clinical tool, it must demonstrate a higher level of technical maturity than it has to date. Since 2004, the ImageCLEF benchmark has included a task for the comparison of visual information retrieval algorithms for medical applications. In 2005, a task for medical image classification was introduced and both tasks have been run successfully for the past four years. These benchmarks allow an annual comparison of visual retrieval techniques based on the same data sets and the same query tasks, enabling the meaningful comparison of various retrieval techniques. The datasets used from 2004-2007 contained images and annotations from medical teaching files. In 2008, however, the dataset used was made up of 67,000 images (along with their associated figure captions and the full text of their corresponding articles) from two Radiological Society of North America (RSNA) scientific journals. This article describes the results of the medical image retrieval task of the ImageCLEF 2008 evaluation campaign. We compare the retrieval results of both visual and textual information retrieval systems from 15 research groups on the aforementioned data set. The results show clearly that, currently

  10. How to retrieve additional information from the multiplicity distributions

    Science.gov (United States)

    Wilk, Grzegorz; Włodarczyk, Zbigniew

    2017-01-01

    Multiplicity distributions (MDs) P(N) measured in multiparticle production processes are most frequently described by the negative binomial distribution (NBD). However, with increasing collision energy some systematic discrepancies have become more and more apparent. They are usually attributed to the possible multi-source structure of the production process and described using a multi-NBD form of the MD. We investigate the possibility of keeping a single NBD but with its parameters depending on the multiplicity N. This is done by modifying the widely known clan model of particle production leading to the NBD form of P(N). This is then confronted with the approach based on the so-called cascade-stochastic formalism which is based on different types of recurrence relations defining P(N). We demonstrate that a combination of both approaches allows the retrieval of additional valuable information from the MDs, namely the oscillatory behavior of the counting statistics apparently visible in the high energy data.
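
    For orientation, the standard single-NBD case can be written as a recurrence relation, which is the kind of relation the abstract refers to. The sketch below assumes the common parametrization with shape k and probability p, for which (N+1) P(N+1) = (alpha + beta N) P(N) with alpha = k p and beta = p. It does not reproduce the multiplicity-dependent parameters or the modified clan model studied in the paper.

```python
import math


def nbd_closed_form(n, k, p):
    """P(N=n) = Gamma(n+k) / (Gamma(n+1) Gamma(k)) * p**n * (1-p)**k."""
    log_coeff = math.lgamma(n + k) - math.lgamma(n + 1) - math.lgamma(k)
    return math.exp(log_coeff + n * math.log(p) + k * math.log(1 - p))


def nbd_by_recurrence(n_max, k, p):
    """Build P(0..n_max) from (N+1) P(N+1) = (alpha + beta*N) P(N)."""
    alpha, beta = k * p, p
    probs = [(1 - p) ** k]  # P(0) fixes the normalization
    for n in range(n_max):
        probs.append(probs[-1] * (alpha + beta * n) / (n + 1))
    return probs


if __name__ == "__main__":
    # Hypothetical parameter values chosen only for illustration.
    for n, prob in enumerate(nbd_by_recurrence(5, k=2.5, p=0.4)):
        print(n, prob, nbd_closed_form(n, 2.5, 0.4))
```

    Letting alpha and beta depend on N in such a recurrence is one way to picture a single NBD whose parameters vary with multiplicity, although the exact functional form used by the authors is not given in the abstract.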

  11. Intelligent information retrieval system using automatic thesaurus construction

    Science.gov (United States)

    Song, Wei; Yang, Jucheng; Li, Chenghua; Park, Sooncheol

    2011-05-01

    This paper presents an intelligent information retrieval (IR) system based on automatic thesaurus construction for its applications of document clustering and classification. These two applications are the most influential and widely used fields amongst the IR research community. We apply two biologically inspired algorithms, i.e. genetic algorithm (GA) and neural network (NN), to these two fields. A fuzzy logic controller GA and an adaptive back-propagation NN are proposed in our study, which can overcome the problems existing in their archetypes, e.g. slow evolution and a tendency to become trapped in a local optimum. Furthermore, a well-constructed thesaurus has been recognised as a valuable tool in the effective operation of clustering and classification. It solves the problem in document representation organised by a bag of words, where some important relationships between words, e.g. synonymy and polysemy, are ignored. To investigate how our IR system could be used effectively, we conduct experiments on four data sets from the benchmark Reuters-21578 document collection and 20 Newsgroups corpus. The results reveal that our IR system enhances the performance in comparison with k-means, common GA, and conventional back-propagation NN.

  12. Web based Library Information System Using PHP and MYSQL

    Directory of Open Access Journals (Sweden)

    Kartika Firdausy

    2008-08-01

    Full Text Available Libraries are commonly used by visitors to search for references and obtain information. A current problem is that few libraries have built web information systems for online services. This research aims to analyze and design a web-based library information system and to test its performance. The results show that the library information system, web-based software built with PHP and MySQL, works over the Internet and can record library visit data on the web, handle member registration so that members gain wider access, provide information on the availability of books, accept and process book orders from members, and handle lending and return transactions directly.

  13. SIDECACHE: Information access, management and dissemination framework for web services.

    Science.gov (United States)

    Doderer, Mark S; Burkhardt, Cory; Robbins, Kay A

    2011-06-14

    Many bioinformatics algorithms and data sets are deployed using web services so that the results can be explored via the Internet and easily integrated into other tools and services. These services often include data from other sites that is accessed either dynamically or through file downloads. Developers of these services face several problems because of the dynamic nature of the information from the upstream services. Many publicly available repositories of bioinformatics data frequently update their information. When such an update occurs, the developers of the downstream service may also need to update. For file downloads, this process is typically performed manually, followed by a web service restart. Requests for information obtained by dynamic access of upstream sources are sometimes subject to rate restrictions. SideCache provides a framework for deploying web services that integrate information extracted from other databases and from web sources that are periodically updated. This situation occurs frequently in biotechnology, where new information is being continuously generated and the latest information is important. SideCache provides several types of services including proxy access and rate control, local caching, and automatic web service updating. We have used the SideCache framework to automate the deployment and updating of a number of bioinformatics web services and tools that extract information from remote primary sources such as NCBI, NCIBI, and Ensembl. The SideCache framework also has been used to share research results through the use of a SideCache-derived web service.
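
    The combination of local caching and rate control described here can be pictured with a small generic sketch. The class `CachingProxy`, its parameters, and the stale-on-limit behaviour are hypothetical choices for illustration; they are not SideCache's API, which the abstract does not describe.

```python
import time


class CachingProxy:
    """Hypothetical sketch: TTL cache in front of a rate-limited upstream call."""

    def __init__(self, fetch, ttl_seconds, min_interval_seconds, clock=time.monotonic):
        self._fetch = fetch                    # callable that queries the upstream source
        self._ttl = ttl_seconds                # how long a cached entry counts as fresh
        self._min_interval = min_interval_seconds  # minimum gap between upstream calls
        self._clock = clock
        self._cache = {}                       # key -> (timestamp, value)
        self._last_call = None                 # time of the most recent upstream request

    def get(self, key):
        now = self._clock()
        entry = self._cache.get(key)
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]                    # fresh cached copy
        too_soon = (self._last_call is not None
                    and now - self._last_call < self._min_interval)
        if too_soon:
            if entry is not None:
                return entry[1]                # stale copy rather than exceed the rate
            raise RuntimeError("rate limit reached and no cached copy available")
        value = self._fetch(key)
        self._last_call = now
        self._cache[key] = (now, value)
        return value
```

    Injecting the clock keeps the sketch easy to exercise without real waiting; a deployed service would also need persistence and a policy for refreshing entries when the upstream repository publishes an update.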

  14. SIDECACHE: Information access, management and dissemination framework for web services

    Directory of Open Access Journals (Sweden)

    Robbins Kay A

    2011-06-01

    Full Text Available Abstract. Background: Many bioinformatics algorithms and data sets are deployed using web services so that the results can be explored via the Internet and easily integrated into other tools and services. These services often include data from other sites that is accessed either dynamically or through file downloads. Developers of these services face several problems because of the dynamic nature of the information from the upstream services. Many publicly available repositories of bioinformatics data frequently update their information. When such an update occurs, the developers of the downstream service may also need to update. For file downloads, this process is typically performed manually, followed by a web service restart. Requests for information obtained by dynamic access of upstream sources are sometimes subject to rate restrictions. Findings: SideCache provides a framework for deploying web services that integrate information extracted from other databases and from web sources that are periodically updated. This situation occurs frequently in biotechnology, where new information is being continuously generated and the latest information is important. SideCache provides several types of services including proxy access and rate control, local caching, and automatic web service updating. Conclusions: We have used the SideCache framework to automate the deployment and updating of a number of bioinformatics web services and tools that extract information from remote primary sources such as NCBI, NCIBI, and Ensembl. The SideCache framework also has been used to share research results through the use of a SideCache-derived web service.

  15. VaProS: a database-integration approach for protein/genome information retrieval

    KAUST Repository

    Gojobori, Takashi

    2016-12-24

    Life science research now relies heavily on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein–protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that helps life science researchers retrieve experts' knowledge stored in the databases and build new hypotheses about the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as the structural effect of a gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of search exemplified in a quest for the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/.

  16. design and implementation of a web based information system for ...

    African Journals Online (AJOL)

    Admin

    The design and implementation of a web-based administrative information system for National Health. Insurance Scheme ... NET framework has been explored for use in designing a web-based working prototype for the scheme with cold fusion mark-up .... licensed Government or Private Health Care Practitioner or facility ...

  17. Quality of Web-Based Information on Cannabis Addiction

    Science.gov (United States)

    Khazaal, Yasser; Chatton, Anne; Cochand, Sophie; Zullino, Daniele

    2008-01-01

    This study evaluated the quality of Web-based information on cannabis use and addiction and investigated particular content quality indicators. Three keywords ("cannabis addiction," "cannabis dependence," and "cannabis abuse") were entered into two popular World Wide Web search engines. Websites were assessed with a standardized proforma designed…

  18. Adapting Web Information to Disabled and Elderly Users.

    Science.gov (United States)

    Kobsa, Alfred

    This paper describes work aimed at catering the content of World Wide Web (WWW) pages to the needs of different users, including elderly people and users with vision and motor impairments. An overview is provided of the AVANTI system, a European WWW-based tourist information system that adapts Web pages to each user's individual needs before…

  19. Information Literacy Instruction in the Web 2.0 Library

    Science.gov (United States)

    Humrickhouse, Elizabeth

    2011-01-01

    This paper examines how library educators can implement Web 2.0 tools in their Information Literacy programs to better prepare students for the rigors of academic research. Additionally, this paper looks at transliteracy and constructivism as the most useful teaching methods in a Web 2.0 classroom and attempts to pinpoint specific educational…

  20. Casos prácticos de Information Seeking en el diseño de sistemas de información web

    OpenAIRE

    Serrano Cobos, Jorge

    2006-01-01

    Information seeking user behaviour on a web site can be analyzed through quantitative and qualitative studies using search logs. Different experiences involving search analytics are described. Its methodology is also studied, along with its advantages for information seeking studies, key performance indicators, and its practical application to Information Architecture, improving Information Retrieval, internationalization of web sites, and as a help on seasonality future predictions using keywords search tr...

  1. Information-Flow-Based Access Control for Web Browsers

    Science.gov (United States)

    Yoshihama, Sachiko; Tateishi, Takaaki; Tabuchi, Naoshi; Matsumoto, Tsutomu

    The emergence of Web 2.0 technologies such as Ajax and mashups has revealed the weakness of the same-origin policy[1], the current de facto standard for the Web browser security model. We propose a new browser security model that allows fine-grained access control in client-side Web applications for secure mashups and user-generated content. The model is based on information-flow-based access control (IBAC), which copes with the dynamic nature of client-side Web applications and accurately determines the privilege of scripts in the event-driven programming model.
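
    For context, the same-origin policy that this abstract criticizes compares the scheme, host, and port of two URLs. The sketch below illustrates only that baseline comparison; the helper names `origin` and `same_origin` are made up for this example, and it does not implement the proposed IBAC model.

```python
from urllib.parse import urlsplit

# Default ports assumed when a URL does not state one explicitly.
DEFAULT_PORTS = {"http": 80, "https": 443}


def origin(url):
    """Return the (scheme, host, port) triple that identifies an origin."""
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()
    port = parts.port if parts.port is not None else DEFAULT_PORTS.get(scheme)
    return (scheme, host, port)


def same_origin(url_a, url_b):
    """True when both URLs share scheme, host, and port."""
    return origin(url_a) == origin(url_b)


if __name__ == "__main__":
    print(same_origin("https://example.com/a", "https://example.com:443/b"))
    print(same_origin("https://example.com/", "https://api.example.com/"))
```

    Because the check is all-or-nothing per origin, a mashup that loads a third-party script into its page grants that script the page's full privileges, which is the coarse granularity the abstract sets out to refine.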

  2. Generic information can retrieve known biological associations: implications for biomedical knowledge discovery.

    Directory of Open Access Journals (Sweden)

    Herman H H B M van Haagen

    Full Text Available MOTIVATION: Weighted semantic networks built from text-mined literature can be used to retrieve known protein-protein or gene-disease associations, and have been shown to anticipate associations years before they are explicitly stated in the literature. Our text-mining system recognizes over 640,000 biomedical concepts: some are specific (i.e., names of genes or proteins), others generic (e.g., 'Homo sapiens'). Generic concepts may play important roles in automated information retrieval, extraction, and inference but may also result in concept overload and confound retrieval and reasoning with low-relevance or even spurious links. Here, we attempted to optimize the retrieval performance for protein-protein interactions (PPI) by filtering generic concepts (node filtering) or links to generic concepts (edge filtering) from a weighted semantic network. First, we defined metrics based on network properties that quantify the specificity of concepts. Then, using these metrics, we systematically filtered generic information from the network while monitoring retrieval performance of known protein-protein interactions. We also systematically filtered specific information from the network (inverse filtering), and assessed the retrieval performance of networks composed of generic information alone. RESULTS: Filtering generic or specific information induced a two-phase response in retrieval performance: initially the effects of filtering were minimal, but beyond a critical threshold network performance suddenly drops. Contrary to expectations, networks composed exclusively of generic information demonstrated retrieval performance comparable to unfiltered networks that also contain specific concepts. Furthermore, an analysis using individual generic concepts demonstrated that they can effectively support the retrieval of known protein-protein interactions. For instance the concept "binding" is indicative for PPI retrieval and the concept "mutation abnormality" is

  3. E-Government Goes Semantic Web: How Administrations Can Transform Their Information Processes

    Science.gov (United States)

    Klischewski, Ralf; Ukena, Stefan

    E-government applications and services are built mainly on access to, retrieval of, integration of, and delivery of relevant information to citizens, businesses, and administrative users. In order to perform such information processing automatically through the Semantic Web, machine-readable enhancements of web resources are needed, based on the understanding of the content and context of the information in focus. While these enhancements are far from trivial to produce, administrations in their role of information and service providers so far find little guidance on how to migrate their web resources and enable a new quality of information processing; even research is still seeking best practices. Therefore, the underlying research question of this chapter is: what are the appropriate approaches which guide administrations in transforming their information processes toward the Semantic Web? In search for answers, this chapter analyzes the challenges and possible solutions from the perspective of administrations: (a) the reconstruction of the information processing in the e-government in terms of how semantic technologies must be employed to support information provision and consumption through the Semantic Web; (b) the required contribution to the transformation is compared to the capabilities and expectations of administrations; and (c) available experience with the steps of transformation are reviewed and discussed as to what extent they can be expected to successfully drive the e-government to the Semantic Web. This research builds on studying the case of Schleswig-Holstein, Germany, where semantic technologies have been used within the frame of the Access-eGov project in order to semantically enhance electronic service interfaces with the aim of providing a new way of accessing and combining e-government services.

  4. Information Retrieval eXperience (IRX): Towards a Human-Centered Personalized Model of Relevance

    NARCIS (Netherlands)

    van der Sluis, Frans; van den Broek, Egon; van Dijk, Elisabeth M.A.G.; Hoeber, O.; Li, Y.; Huang, X.J.

    2010-01-01

    We approach Information Retrieval (IR) from a User eXperience (UX) perspective. Through introducing a model for Information Retrieval eXperience (IRX), this paper operationalizes a perspective on IR that reaches beyond topicality. Based on a document's topicality, complexity, and emotional value, a

  5. Text mining scientific papers: a survey on FCA-based information retrieval research

    NARCIS (Netherlands)

    Poelmans, J.; Ignatov, D.I.; Viaene, S.; Dedene, G.; Kuznetsov, S.O.

    2012-01-01

    Formal Concept Analysis (FCA) is an unsupervised clustering technique and many scientific papers are devoted to applying FCA in Information Retrieval (IR) research. We collected 103 papers published between 2003 and 2009 which mention FCA and information retrieval in the abstract, title or keywords.

  6. Latent morpho-semantic analysis : multilingual information retrieval with character n-grams and mutual information.

    Energy Technology Data Exchange (ETDEWEB)

    Bader, Brett William; Chew, Peter A.; Abdelali, Ahmed (New Mexico State University)

    2008-08-01

    We describe an entirely statistics-based, unsupervised, and language-independent approach to multilingual information retrieval, which we call Latent Morpho-Semantic Analysis (LMSA). LMSA overcomes some of the shortcomings of related previous approaches such as Latent Semantic Analysis (LSA). LMSA has an important theoretical advantage over LSA: it combines well-known techniques in a novel way to break the terms of LSA down into units which correspond more closely to morphemes. Thus, it has a particular appeal for use with morphologically complex languages such as Arabic. We show through empirical results that the theoretical advantages of LMSA can translate into significant gains in precision in multilingual information retrieval tests. These gains are not matched either when a standard stemmer is used with LSA, or when terms are indiscriminately broken down into n-grams.
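
    To make the n-gram comparison in this abstract concrete, the sketch below breaks terms into overlapping character n-grams. This is the indiscriminate n-gram baseline the abstract contrasts with LMSA, not LMSA itself; the function names and the choice of n = 3 are assumptions for illustration.

```python
from collections import Counter


def char_ngrams(term, n):
    """Return the overlapping character n-grams of a term."""
    if len(term) < n:
        return [term]  # short terms are kept whole
    return [term[i:i + n] for i in range(len(term) - n + 1)]


def ngram_profile(text, n=3):
    """Count character n-grams over all whitespace-separated terms."""
    counts = Counter()
    for term in text.lower().split():
        counts.update(char_ngrams(term, n))
    return counts


if __name__ == "__main__":
    print(char_ngrams("retrieval", 3))
    print(ngram_profile("retrieve retrieval").most_common(3))
```

    Related word forms such as "retrieve" and "retrieval" share most of their n-grams, which is why sub-word units can help with morphologically complex languages; according to the abstract, LMSA goes further by choosing units that correspond more closely to morphemes.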

  7. New technologies for publishing information on the World Wide Web a case study at CERN

    CERN Document Server

    Faggian, R

    1999-01-01

    This thesis studies the problem of information retrieval, discovery and integration of resources applied to a real world case study. An initiative to explain High Energy Physics to the general public (outreach) has been started at CERN. The use of the Web has been identified as crucial to the success of this initiative. This study examines the characteristics of the HTML and XML languages and the use of metadata for describing document content in order to improve understanding and discovery. The main part of the work is the study of the RDF standard for representing metadata using the XML syntax. The proposed solution is an information system which collects many different resources on the Web (information published by many European particle physics institutes), and organizes and queries them using their metadata description instead of working directly on their contents.

  8. Information architecture for a planetary 'exploration web'

    Science.gov (United States)

    Lamarra, N.; McVittie, T.

    2002-01-01

    'Web services' is a common way of deploying distributed applications whose software components and data sources may be in different locations, formats, languages, etc. Although such collaboration is not utilized significantly in planetary exploration, we believe there is significant benefit in developing an architecture in which missions could leverage each other's capabilities. We believe that an incremental deployment of such an architecture could significantly contribute to the evolution of increasingly capable, efficient, and even autonomous remote exploration.

  9. Teaching Information Literacy on the Web: A Survey

    OpenAIRE

    Yang, Sharon

    2014-01-01

    This presentation is a summary of a survey of 264 academic libraries and how they conduct information literacy related activities on the web. The presentation was given at the 2012 IFLA meeting in Tampere, Finland.

  10. An application of weighted transducers to music information retrieval

    Science.gov (United States)

    Basaldella, D.; Orio, N.

    2006-01-01

    This paper proposes a methodology for retrieving music documents using a query-by-example paradigm. The basic idea is that a collection of music documents can be indexed by the set of melodic contours of its documents, and retrieval is carried out using an approximate matching between query and document contours. The approximate matching is based on the use of Weighted Transducers, which model the document contours and are used to compute their similarity with the query. The methodology has been evaluated on a collection of documents and with a set of audio queries.
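
    The idea of indexing by melodic contour and matching approximately can be sketched as follows. The abstract does not describe the weighted transducers, so this sketch substitutes plain Levenshtein edit distance as a simpler stand-in; the up/down/repeat contour encoding, the function names, and the toy collection are likewise assumptions for illustration only.

```python
def contour(pitches):
    """Encode a pitch sequence (e.g. MIDI note numbers) as a contour string.

    Each step between consecutive notes becomes U (up), D (down) or R (repeat).
    """
    steps = []
    for prev, cur in zip(pitches, pitches[1:]):
        steps.append("U" if cur > prev else "D" if cur < prev else "R")
    return "".join(steps)


def edit_distance(a, b):
    """Levenshtein distance between two strings, using a rolling row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution or match
        prev = cur
    return prev[-1]


def rank(query_pitches, collection):
    """Order document names by contour distance to the query (closest first)."""
    q = contour(query_pitches)
    return sorted(collection,
                  key=lambda name: edit_distance(q, contour(collection[name])))


# Hypothetical two-document collection and a transposed query.
collection = {"a": [60, 62, 64, 62], "b": [60, 58, 56, 58]}
ranking = rank([50, 52, 54, 51], collection)
```

    Because the contour discards absolute pitch, the transposed query still matches document "a" exactly, which is one reason contour representations suit sung or hummed queries.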

  11. Expert Search Strategies: The Information Retrieval Practices of Healthcare Information Professionals.

    Science.gov (United States)

    Russell-Rose, Tony; Chamberlain, Jon

    2017-10-02

    Healthcare information professionals play a key role in closing the knowledge gap between medical research and clinical practice. Their work involves meticulous searching of literature databases using complex search strategies that can consist of hundreds of keywords, operators, and ontology terms. This process is prone to error and can lead to inefficiency and bias if performed incorrectly. The aim of this study was to investigate the search behavior of healthcare information professionals, uncovering their needs, goals, and requirements for information retrieval systems. A survey was distributed to healthcare information professionals via professional association email discussion lists. It investigated the search tasks they undertake, their techniques for search strategy formulation, their approaches to evaluating search results, and their preferred functionality for searching library-style databases. The popular literature search system PubMed was then evaluated to determine the extent to which their needs were met. The 107 respondents indicated that their information retrieval process relied on the use of complex, repeatable, and transparent search strategies. On average it took 60 minutes to formulate a search strategy, with a search task taking 4 hours and consisting of 15 strategy lines. Respondents reviewed a median of 175 results per search task, far more than they would ideally like (100). The most desired features of a search system were merging search queries and combining search results. Healthcare information professionals routinely address some of the most challenging information retrieval problems of any profession. However, their needs are not fully supported by current literature search systems and there is demand for improved functionality, in particular regarding the development and management of search strategies.

  12. AN OVERVIEW OF SEARCHING AND DISCOVERING WEB BASED INFORMATION RESOURCES

    Directory of Open Access Journals (Sweden)

    Cezar VASILESCU

    2010-01-01

    The Internet has become, for most of us, an instrument of daily use for professional or personal reasons. We do not even remember the times when a computer and a broadband connection were luxury items. More and more people rely on the complicated web network to find the information they need. This paper presents an overview of issues related to Internet search and search engines, and describes the parties and the basic mechanism involved in a search for web-based information resources. It also presents ways to increase the efficiency of web searches through a better understanding of what search engines ignore in website content.

  13. Subjective Probability and Information Retrieval: A Review of the Psychological Literature.

    Science.gov (United States)

    Thompson, Paul

    1988-01-01

    Reviews the subjective probability estimation literature of six schools of human judgement and decision making: decision theory, behavioral decision theory, psychological decision theory, social judgement theory, information integration theory, and attribution theory. Implications for probabilistic information retrieval are discussed, including…

  14. BAGET: a web server for the effortless retrieval of prokaryotic gene context and sequence.

    Science.gov (United States)

    Oberto, Jacques

    2008-02-01

    BAGET (Bacterial and Archaeal Gene Exploration Tool) is a web service designed to facilitate extraction, by molecular geneticists and phylogeneticists, of specific gene and protein sequences from completely determined prokaryotic genomes. Upon selection of a particular prokaryotic organism and gene, two levels of visual gene context information are provided on a single dynamic page: (i) a graphical representation of a user-defined portion of the chromosome centered on the gene of interest and (ii) the DNA sequence of the query gene, of the immediate neighboring genes and the intergenic regions, each identified by a consistent color code. The amino acid sequence is provided for protein-coding query genes. Query results can be exported as a rich text format (RTF) word processor file for printing, archival or further analysis. http://archaea.u-psud.fr/bin/baget.dll.

  15. The Role of Ontology in Information Retrieval: Reviewing Current Research and Representing a Conceptual Model

    Directory of Open Access Journals (Sweden)

    Mahdieh Mirzabeigi

    2012-03-01

    The inefficiency of thesauri and other information representation tools in the electronic environment has forced librarians to revise the structure of these tools, so they have tried to develop other information organization tools such as ontologies. In this paper, the performance of ontology in information retrieval is investigated. In addition, by reviewing two basic ontology-based information retrieval models (the Lingpeng model and the Dan model), a new conceptual model is introduced.

  16. Bias-variance analysis in estimating true query model for information retrieval

    OpenAIRE

    Zhang, Peng; Song, Dawei; Wang, Jun; Yue HOU

    2014-01-01

    The estimation of query model is an important task in language modeling (LM) approaches to information retrieval (IR). The ideal estimation is expected to be not only effective in terms of high mean retrieval performance over all queries, but also stable in terms of low variance of retrieval performance across different queries. In practice, however, improving effectiveness can sacrifice stability, and vice versa. In this paper, we propose to study this tradeoff from a new perspective, i.e., ...

  17. An information retrieval system for research file data

    Science.gov (United States)

    Joan E. Lengel; John W. Koning

    1978-01-01

    Research file data have been successfully retrieved at the Forest Products Laboratory through a high-speed cross-referencing system involving the computer program FAMULUS as modified by the Madison Academic Computing Center at the University of Wisconsin. The method of data input, transfer to computer storage, system utilization, and effectiveness are discussed....

  18. Autocorrelation and Regularization of Query-Based Information Retrieval Scores

    Science.gov (United States)

    2008-02-01

    retrieval 2 The dog (Canis lupus familiaris) is a domestic subspecies of the wolf, a mammal of the Canidae family of the order Carnivora . The term...normalized Laplacian. This result suggests that, while degree normalize is important, our data may not exhibit the appropriate characteristics to notice

  19. FORDAT : an information retrieval system for forest economic data

    Science.gov (United States)

    Henry M. Spelter

    1981-01-01

    Time series data frequently used in Forest Service studies of wood products consumption have been stored in a data retrieval system on the computer of the University of Wisconsin. The data cover activity in wood processing from forest to end use. Prices and costs at succeeding stages, historical usage, production rates, and other relevant data to wood use analysis were...

  20. Position paper: Web tutorials and Information Literacy research

    DEFF Research Database (Denmark)

    Hyldegård, Jette

    2011-01-01

    Position paper on future research challenges regarding web tutorials with the aim of supporting and facilitating Information Literacy in an academic context. Presented and discussed at the workshop: Social media & Information Practices, track on Information literacy practices, University of Borås...

  1. Searching to Translate and Translating to Search: When Information Retrieval Meets Machine Translation

    Science.gov (United States)

    Ture, Ferhan

    2013-01-01

    With the adoption of web services in daily life, people have access to tremendous amounts of information, beyond any human's reading and comprehension capabilities. As a result, search technologies have become a fundamental tool for accessing information. Furthermore, the web contains information in multiple languages, introducing another barrier…

  2. An Investigation of the Academic Information Finding and Re-finding Behavior on the Web

    Directory of Open Access Journals (Sweden)

    Hsiao-Tieh Pu

    2014-12-01

    Academic researchers often need and re-use relevant information some time after first finding it. This preliminary study used various methods, including experiments, interviews, search log analysis, sequential analysis, and observation, to investigate the characteristics of academic information finding and re-finding behavior. Overall, the participants in this study entered short queries in both the finding and re-finding phases. Comparatively speaking, the participants entered a greater number of queries, modified more queries, browsed more web pages, and stayed longer on web pages in the finding phase. In the re-finding phase, on the other hand, they used personal information management tools, such as checking the browsing history, instead of searching again with a search engine; moreover, they tended to enter fewer queries and stayed for a shorter time on web pages. In short, the participants interacted more with the retrieval system during the finding phase, while they increased their use of personal information management tools in the re-finding phase. As to the contextual clues used in the re-finding phase, the participants used fewer clues from the target itself; instead, they used indirect clues more often, especially location-related information. Based on the results of sequential analysis, the state transitions in the re-finding phase were found to be more complex than those in the finding phase. Web information finding and re-finding behavior is an important and novel area of research. The preliminary results would benefit research on Web information re-finding behavior, and provide useful suggestions for developing personal academic information management systems. [Article content in Chinese]

  3. Interoperable Multimedia Annotation and Retrieval for the Tourism Sector

    NARCIS (Netherlands)

    Chatzitoulousis, Antonios; Efraimidis, Pavlos S.; Athanasiadis, I.N.

    2015-01-01

    The Atlas Metadata System (AMS) employs semantic web annotation techniques in order to create an interoperable information annotation and retrieval platform for the tourism sector. AMS adopts state-of-the-art metadata vocabularies, annotation techniques and semantic web technologies.

  4. Information Retrieval in an Office Filing Facility and Future Work in Project Minstrel.

    Science.gov (United States)

    Smeaton, A. F.; van Rijsbergen, C. J.

    1986-01-01

    Review of office filing facility filing and retrieval mechanisms for unstructured and mixed media information focuses on free text methods. Also discussed are the state of the art in handling voice and image data, problems with searching text surrogates to implement free text content retrieval, and work of Project Minstrel. (Author/MBR)

  5. Document control and information retrieval system for the Fast Flux Test Facility (FFTF)

    Energy Technology Data Exchange (ETDEWEB)

    Theo, M.G.

    1976-03-01

    A description is given of the FFTF Document Control and Information Retrieval System. The system utilizes a mini-computer along with various microfilm equipment and is designed to accommodate an anticipated 50 million pages of text and 750,000 drawings. The system is simple, uncluttered, eliminates duplication, and provides quick retrievability of documents for all technical and administrative personnel.

  6. Editorial for the Bibliometric-enhanced Information Retrieval Workshop at ECIR 2014

    NARCIS (Netherlands)

    Mayr, Philipp; Schaer, Philipp; Scharnhorst, Andrea; Mutschke, Peter

    2014-01-01

    This first "Bibliometric-enhanced Information Retrieval" (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they

  7. Health and medication information resources on the World Wide Web.

    Science.gov (United States)

    Grossman, Sara; Zerilli, Tina

    2013-04-01

    Health care practitioners have increasingly used the Internet to obtain health and medication information. The vast number of Internet Web sites providing such information and concerns with their reliability make it essential for users to carefully select and evaluate Web sites prior to use. To this end, this article reviews the general principles to consider in this process. Moreover, as cost may limit access to subscription-based health and medication information resources with established reputability, freely accessible online resources that may serve as an invaluable addition to one's reference collection are highlighted. These include government- and organization-sponsored resources (eg, US Food and Drug Administration Web site and the American Society of Health-System Pharmacists' Drug Shortage Resource Center Web site, respectively) as well as commercial Web sites (eg, Medscape, Google Scholar). Familiarity with such online resources can assist health care professionals in their ability to efficiently navigate the Web and may potentially expedite the information gathering and decision-making process, thereby improving patient care.

  8. Entropy Associated with Information Storage and Its Retrieval

    Directory of Open Access Journals (Sweden)

    Abu Mohamed Alhasan

    2015-08-01

    We provide an entropy analysis for light storage and light retrieval. In this analysis, entropy extraction and reduction in a typical light storage experiment are identified. The spatiotemporal behavior of entropy is presented for D1 transition in cold sodium atoms. The governing equations are the reduced Maxwell field equations and the Liouville–von Neumann equation for the density matrix of the dressed atom.

  9. Query-by-Example Music Information Retrieval by Score-Informed Source Separation and Remixing Technologies

    Directory of Open Access Journals (Sweden)

    Goto Masataka

    2010-01-01

    We describe a novel query-by-example (QBE) approach in music information retrieval that allows a user to customize query examples by directly modifying the volume of different instrument parts. The underlying hypothesis of this approach is that the musical mood of retrieved results changes in relation to the volume balance of different instruments. On the basis of this hypothesis, we aim to clarify the relationship between the change in the volume balance of a query and the genre of the retrieved pieces, called genre classification shift. Such an understanding would allow us to instruct users in how to generate alternative queries without finding other appropriate pieces. Our QBE system first separates all instrument parts from the audio signal of a piece with the help of its musical score, and then it allows users to remix these parts to change the acoustic features that represent the musical mood of the piece. Experimental results showed that the genre classification shift was actually caused by the volume change in the vocal, guitar, and drum parts.

  10. Web content accessibility of consumer health information web sites for people with disabilities: a cross sectional evaluation.

    Science.gov (United States)

    Zeng, Xiaoming; Parmanto, Bambang

    2004-06-21

    The World Wide Web (WWW) has become an increasingly essential resource for health information consumers. The ability to obtain accurate medical information online quickly, conveniently and privately provides health consumers with the opportunity to make informed decisions and participate actively in their personal care. Little is known, however, about whether the content of this online health information is equally accessible to people with disabilities who must rely on special devices or technologies to process online information due to their visual, hearing, mobility, or cognitive limitations. To construct a framework for an automated Web accessibility evaluation; to evaluate the state of accessibility of consumer health information Web sites; and to investigate the possible relationships between accessibility and other features of the Web sites, including function, popularity and importance. We carried out a cross-sectional study of the state of accessibility of health information Web sites to people with disabilities. We selected 108 consumer health information Web sites from the directory service of a Web search engine. A measurement framework was constructed to automatically measure the level of Web Accessibility Barriers (WAB) of Web sites following Web accessibility specifications. We investigated whether there was a difference between WAB scores across various functional categories of the Web sites, and also evaluated the correlation between the WAB and Alexa traffic rank and Google Page Rank of the Web sites. We found that none of the Web sites we looked at are completely accessible to people with disabilities, i.e., there were no sites that had no violation of Web accessibility rules. However, governmental and educational health information Web sites do exhibit better Web accessibility than the other categories of Web sites (P health information Web sites shows that no Web site scrupulously abides by Web accessibility specifications, even for entities

  11. Migrating the facility profile information management system into the world wide web

    Energy Technology Data Exchange (ETDEWEB)

    Kero, R.E.; Swietlik, C.E.

    1994-09-01

    The Department of Energy - Office of Special Projects and Argonne National Laboratory (ANL), along with the Department of Energy - Office of Scientific and Technical Information have previously designed and implemented the Environment, Safety and Health Facility Profile Information Management System (FPIMS) to facilitate greater efficiency in searching, analyzing and disseminating information found within environment, safety and health oversight documents. This information retrieval based system serves as a central repository for full-text electronic oversight documents, as well as a management planning and decision making tool that can assist in trend and root cause analyses. Continuous improvement of environment, safety and health programs is currently aided through this personal computer-based system by providing a means for the open communication of lessons learned across the department. Overall benefits have included reductions in costs and improvements in past information management capabilities. Access to the FPIMS has been possible historically through a headquarters-based local area network equipped with modems. Continued demand for greater accessibility of the system by remote DOE field offices and sites, in conjunction with the Secretary of Energy's call for greater public accessibility to Department of Energy (DOE) information resources, has been the impetus to expand access through the use of Internet technologies. Therefore, the following paper will discuss reasons for migrating the FPIMS system into the World Wide Web (Web), various lessons learned from the FPIMS migration effort, as well as future plans for enhancing the Web-based FPIMS.

  12. [Development of a Web-based laboratory data browser integrated with heterogeneous clinical information].

    Science.gov (United States)

    Fujikawa, Jun

    2009-02-01

    To demonstrate the feasibility of a Web-based laboratory data browser integrated with heterogeneous clinical information in a hospital setting. A Java-based web application was developed in-house, using free open-source software. The server side manages queries to heterogeneous hospital databases containing patient data. Order entry information, including laboratory test results, drug prescriptions, injection orders, physiological test orders, and imaging test orders, was retrieved from a replication database and integrated with nursing data from a nursing system database. The result was visualized in a time-series table format, and accessed by web browsers on computers connected to the hospital intranet. The laboratory data browser system achieved practical response times over huge databases (> 90 million records). The medical personnel accepted the system well, and applied the system to various clinical situations. Integrating heterogeneous data from hospital databases in a Web-based laboratory data browser is a practical approach. Presenting relevant medical information simultaneously added value to the laboratory data, and may promote better medical management.

  13. Information Sharing on the Semantic Web

    NARCIS (Netherlands)

    Stuckenschmidt, Heiner; Harmelen, Frank Van

    2003-01-01

    The large-scale and almost ubiquitous availability of information has become as much of a curse as it is a blessing. The more information is available, the harder it is to locate any particular piece of it. And even when it has been successfully found, it is even harder still to usefully combine it

  14. Improving biomedical information retrieval by linear combinations of different query expansion techniques.

    Science.gov (United States)

    Abdulla, Ahmed AbdoAziz Ahmed; Lin, Hongfei; Xu, Bo; Banbhrani, Santosh Kumar

    2016-07-25

    Biomedical literature retrieval is becoming increasingly complex, and there is a fundamental need for advanced information retrieval systems. Information Retrieval (IR) programs scour unstructured materials such as text documents in large reserves of data that are usually stored on computers. IR is related to the representation, storage, and organization of information items, as well as to access. One of the main problems in IR is to determine which documents are relevant to the user's needs and which are not. Currently, users cannot construct queries precisely enough to retrieve particular pieces of data from large reserves of data, and basic information retrieval systems produce low-quality search results. In this paper we present a new technique that refines Information Retrieval searches to better represent the user's information need and thereby enhance retrieval performance: we use different query expansion techniques and apply linear combinations between them, combining two expansion results at a time. Query expansions expand the search query, for example, by finding synonyms and reweighting original terms. They provide significantly more focused, particularized search results than do basic search queries. The retrieval performance is measured by some variants of MAP (Mean Average Precision). According to our experimental results, the combination of the best query expansion results improves the retrieved documents and outperforms our baseline by 21.06%; it also outperforms a previous study by 7.12%. We propose several query expansion techniques and their linear combinations to make user queries more cognizable to search engines and to produce higher-quality search results.
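
    A pairwise linear combination of two expansion runs, scored with average precision, might look like the sketch below. The abstract gives neither the weighting scheme nor the score normalization, so the interpolation weight `alpha`, the treatment of missing documents as score 0.0, and all names and toy scores here are assumptions for illustration, not the authors' implementation.

```python
def combine(scores_a, scores_b, alpha=0.5):
    """Linearly interpolate two {doc_id: score} maps.

    A document missing from one run is given score 0.0 in that run
    (an assumption; the abstract does not say how this case is handled).
    """
    docs = set(scores_a) | set(scores_b)
    return {d: alpha * scores_a.get(d, 0.0) + (1 - alpha) * scores_b.get(d, 0.0)
            for d in docs}


def average_precision(ranking, relevant):
    """Average precision of one ranked list against a set of relevant doc ids."""
    hits, total = 0, 0.0
    for position, doc in enumerate(ranking, 1):
        if doc in relevant:
            hits += 1
            total += hits / position
    return total / len(relevant) if relevant else 0.0


def mean_average_precision(runs):
    """MAP over a list of (ranking, relevant_set) pairs, one per query."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Hypothetical scores from two query expansion techniques for one query.
expansion_a = {"d1": 1.0, "d2": 0.0}
expansion_b = {"d2": 1.0, "d3": 0.5}
combined = combine(expansion_a, expansion_b, alpha=0.5)
# Rank by descending combined score, breaking ties by document id.
ranking = sorted(combined, key=lambda d: (-combined[d], d))
```

    In practice the weight would be tuned on held-out queries, and the two score lists would usually be normalized to a common range before interpolation so that neither run dominates purely because of its scale.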

  15. Infant Gastroesophageal Reflux Information on the World Wide Web.

    Science.gov (United States)

    Balgowan, Regina; Greer, Leah C; D'Auria, Jennifer P

    2016-01-01

    The purpose of this study was to describe the type and quality of health information about infant gastroesophageal reflux (GER) that a parent may find on the World Wide Web. The data collection tool included evaluation of Web site quality and infant GER-specific content on the 30 sites that met the inclusion criteria. The most commonly found content categories in order of frequency were management strategies, when to call a primary care provider, definition, and clinical features. The most frequently mentioned strategies included feeding changes, infant positioning, and medications. Thirteen of the 30 Web sites included information on both GER and gastroesophageal reflux disease. Mention of the use of medication to lessen infant symptoms was found on 15 of the 30 sites. Only 10 of the 30 sites included information about parent support and coping strategies. Pediatric nurse practitioners (PNPs) should utilize well-child visits to address the normalcy of physiologic infant GER and clarify any misperceptions parents may have about diagnosis and the role of medication from information they may have found on the Internet. It is critical for PNPs to assist in the development of Web sites with accurate content, advise parents on how to identify safe and reliable information, and provide examples of high-quality Web sites about child health topics such as infant GER. Copyright © 2016 National Association of Pediatric Nurse Practitioners. Published by Elsevier Inc. All rights reserved.

  16. African Web-Based Animal Health Information: Analysis Of Online ...

    African Journals Online (AJOL)

    This paper examines the coverage of animal health information published on the web from Africa or about Africa using content analysis method. A total of 27 agricultural academic indexing and abstracting online databases were selected for the study. African animal health information was determined according to the ...

  17. Lifelong Learning: Web-Based Information Literacy Module for Merchandisers

    Science.gov (United States)

    Hines, Jean D.; Frey, Diane K.; Swinker, Mary E.

    2005-01-01

    Universities are strategically positioned to serve as a vital impetus in developing pre-professionals' lifelong learning skills. The development of a Web portal, InfoWIZARD, a tool for integrating information literacy and information technology in problem-based research assignments is described in this article. InfoWIZARD includes 20 modules in…

  18. Geographic Information Systems and Web Page Development

    Science.gov (United States)

    Reynolds, Justin

    2004-01-01

    The Facilities Engineering and Architectural Branch is responsible for the design and maintenance of buildings, laboratories, and civil structures. In order to improve efficiency and quality, the FEAB has dedicated itself to establishing a data infrastructure based on Geographic Information Systems, GIS. The value of GIS was explained in an article dating back to 1980 entitled "Need for a Multipurpose Cadastre" which stated, "There is a critical need for a better land-information system in the United States to improve land-conveyance procedures, furnish a basis for equitable taxation, and provide much-needed information for resource management and environmental planning." Scientists and engineers both point to GIS as the solution. What is GIS? According to most textbooks, Geographic Information Systems is a class of software that stores, manages, and analyzes mappable features on, above, or below the surface of the earth. GIS software is basically database management software applied to the management of spatial data and information. Simply put, Geographic Information Systems manage, analyze, chart, graph, and map spatial information. GIS can be broken down into two main categories, urban GIS and natural resource GIS. Further still, natural resource GIS can be broken down into six sub-categories: agriculture, forestry, wildlife, catchment management, archaeology, and geology/mining. Agriculture GIS has several applications, such as agricultural capability analysis, land conservation, market analysis, or whole farming planning. Forestry GIS can be used for timber assessment and management, harvest scheduling and planning, environmental impact assessment, and pest management. GIS when used in wildlife applications enables the user to assess and manage habitats, identify and track endangered and rare species, and monitor impact assessment.

  19. Retrieving XCO2 from GOSAT FTS over East Asia Using Simultaneous Aerosol Information from CAI

    Directory of Open Access Journals (Sweden)

    Woogyung Kim

    2016-12-01

    In East Asia, where aerosol concentrations are persistently high throughout the year, most satellite CO2 retrieval algorithms screen out many measurements during quality control in order to reduce retrieval errors. To reduce the retrieval errors associated with aerosols, we have modified the YCAR (Yonsei Carbon Retrieval) algorithm to YCAR-CAI to retrieve XCO2 from GOSAT FTS measurements using aerosol retrievals from simultaneous Cloud and Aerosol Imager (CAI) measurements. The CAI aerosol algorithm provides aerosol type and optical depth information simultaneously for the same geometry and optical path as FTS. The YCAR-CAI XCO2 retrieval algorithm has been developed based on the optimal estimation method. The algorithm uses the VLIDORT V2.6 radiative transfer model to calculate radiances and Jacobian functions. The XCO2 results retrieved using the YCAR-CAI algorithm were evaluated by comparing them with ground-based TCCON measurements and current operational GOSAT XCO2 retrievals. The retrievals show a clear annual cycle, with an increasing trend of 2.02 to 2.39 ppm per year, which is higher than that measured at Mauna Loa, Hawaii. The YCAR-CAI results were validated against the Tsukuba and Saga TCCON sites and show a root mean square error of 2.25, a bias of −0.81 ppm, and a regression line closer to the linear identity function compared with other current algorithms. Even after post-screening, the YCAR-CAI algorithm provides a larger dataset of XCO2 compared with other retrieval algorithms by 21% to 67%, which could be substantially advantageous in validation and data analysis for the area of East Asia. The retrieval uncertainty is 1.39 to 1.48 ppm at the TCCON sites. Using Carbon Tracker-Asia (CT-A) data, the sampling error was analyzed and was found to be between 0.32 and 0.36 ppm for each individual sounding.

  20. Web-based Construction Information Management System

    Directory of Open Access Journals (Sweden)

    David Scott

    2012-11-01

    Full Text Available Centralised information systems that are accessible to all parties in a construction project are powerful tools in the quest to improve efficiency and to enhance the flow of information within the construction industry. This report points out the maturity of the necessary IT technology and the availability and suitability of existing commercial products. Some of these products have been studied and analysed. An evaluation and selection process based on the functions offered in the products and their utility is presented. A survey of local construction personnel has been used to collect typical weighting data and performance criteria used in the evaluation process.

  1. Unit 148 - World Wide Web Basics

    OpenAIRE

    148, CC in GIScience; Yeung, Albert K.

    2000-01-01

    This unit explains the characteristics and the working principles of the World Wide Web as the most important protocol of the Internet. Topics covered in this unit include characteristics of the World Wide Web; using the World Wide Web for the dissemination of information on the Internet; and using the World Wide Web for the retrieval of information from the Internet.

  2. Which user interaction for cross-language information retrieval? Design issues and reflections

    OpenAIRE

    Petrelli, Daniela; Levin, Stephen; Beaulieu, Micheline; Sanderson, Mark

    2006-01-01

    A novel and complex form of information access is cross-language information retrieval: searching for texts written in foreign languages based on native language queries. Although the underlying technology for achieving such a search is relatively well understood, the appropriate interface design is not. The authors present three user evaluations undertaken during the iterative design of Clarity, a cross-language retrieval system for low-density languages, and show how the user-interaction d...

  3. Synthesizer: Expediting synthesis studies from context-free data with information retrieval techniques.

    Directory of Open Access Journals (Sweden)

    Lisa M Gandy

    Full Text Available Scientists have unprecedented access to a wide variety of high-quality datasets. These datasets, which are often independently curated, commonly use unstructured spreadsheets to store their data. Standardized annotations are essential to perform synthesis studies across investigators, but are often not used in practice. Therefore, accurately combining records in spreadsheets from differing studies requires tedious and error-prone human curation. These efforts result in a significant time and cost barrier to synthesis research. We propose an information retrieval inspired algorithm, Synthesize, that merges unstructured data automatically based on both column labels and values. Application of the Synthesize algorithm to cancer and ecological datasets had high accuracy (on the order of 85-100%). We further implement Synthesize in an open source web application, Synthesizer (https://github.com/lisagandy/synthesizer). The software accepts input as spreadsheets in comma separated value (CSV) format, visualizes the merged data, and outputs the results as a new spreadsheet. Synthesizer includes an easy to use graphical user interface, which enables the user to finish combining data and obtain perfect accuracy. Future work will allow detection of units to automatically merge continuous data and application of the algorithm to other data formats, including databases.

  4. Ontology-oriented retrieval of putative microRNAs in Vitis vinifera via GrapeMiRNA: a web database of de novo predicted grape microRNAs

    Directory of Open Access Journals (Sweden)

    Fontana Paolo

    2009-06-01

    Full Text Available Abstract Background Two complete genome sequences are available for Vitis vinifera Pinot noir. Based on the sequence and gene predictions produced by the IASMA, we performed an in silico detection of putative microRNA genes and of their targets, and collected the most reliable microRNA predictions in a web database. The application is available at http://www.itb.cnr.it/ptp/grapemirna/. Description The program FindMiRNA was used to detect putative microRNA genes in the grape genome. A very high number of predictions was retrieved, calling for validation. Nine parameters were calculated and, based on the grape microRNAs dataset available at miRBase, thresholds were defined and applied to FindMiRNA predictions having targets in gene exons. In the resulting subset, predictions were ranked according to precursor positions and sequence similarity, and to target identity. To further validate FindMiRNA predictions, comparisons to the Arabidopsis genome, to the grape Genoscope genome, and to the grape EST collection were performed. Results were stored in a MySQL database and a web interface was prepared to query the database and retrieve predictions of interest. Conclusion The GrapeMiRNA database encompasses 5,778 microRNA predictions spanning the whole grape genome. Predictions are integrated with information that can be of use in selection procedures. Tools added in the web interface also allow users to inspect predictions according to gene ontology classes and metabolic pathways of targets. The GrapeMiRNA database can be of help in selecting candidate microRNA genes to be validated.

  5. Visual working memory buffers information retrieved from visual long-term memory.

    Science.gov (United States)

    Fukuda, Keisuke; Woodman, Geoffrey F

    2017-05-16

    Human memory is thought to consist of long-term storage and short-term storage mechanisms, the latter known as working memory. Although it has long been assumed that information retrieved from long-term memory is represented in working memory, we lack neural evidence for this and need neural measures that allow us to watch this retrieval into working memory unfold with high temporal resolution. Here, we show that human electrophysiology can be used to track information as it is brought back into working memory during retrieval from long-term memory. Specifically, we found that the retrieval of information from long-term memory was limited to just a few simple objects' worth of information at once, and elicited a pattern of neurophysiological activity similar to that observed when people encode new information into working memory. Our findings suggest that working memory is where information is buffered when being retrieved from long-term memory and reconcile current theories of memory retrieval with classic notions about the memory mechanisms involved.

  6. Improving data management and dissemination in web based information systems by semantic enrichment of descriptive data aspects

    Science.gov (United States)

    Gebhardt, Steffen; Wehrmann, Thilo; Klinger, Verena; Schettler, Ingo; Huth, Juliane; Künzer, Claudia; Dech, Stefan

    2010-10-01

    The German-Vietnamese water-related information system for the Mekong Delta (WISDOM) project supports business processes in Integrated Water Resources Management in Vietnam. Multiple disciplines bring together earth and ground based observation themes, such as environmental monitoring, water management, demographics, economy, information technology, and infrastructural systems. This paper introduces the components of the web-based WISDOM system including data, logic and presentation tier. It focuses on the data models upon which the database management system is built, including techniques for tagging or linking metadata with the stored information. The model also uses ordered groupings of spatial, thematic and temporal reference objects to semantically tag datasets to enable fast data retrieval, such as finding all data in a specific administrative unit belonging to a specific theme. A spatial database extension is employed by the PostgreSQL database. This object-oriented database was chosen over a relational database to tag spatial objects to tabular data, improving the retrieval of census and observational data at regional, provincial, and local areas. While the spatial database hinders processing raster data, a "work-around" was built into WISDOM to permit efficient management of both raster and vector data. The data model also incorporates styling aspects of the spatial datasets through styled layer descriptions (SLD) and web mapping service (WMS) layer specifications, allowing retrieval of rendered maps. Metadata elements of the spatial data are based on the ISO19115 standard. XML structured information of the SLD and metadata are stored in an XML database. The data models and the data management system are robust for managing the large quantity of spatial objects, sensor observations, census and document data. The operational WISDOM information system prototype contains modules for data management, automatic data integration, and web services for data

  7. WEB STRUCTURE MINING

    Directory of Open Access Journals (Sweden)

    CLAUDIA ELENA DINUCĂ

    2011-01-01

    Full Text Available The World Wide Web has become one of the most valuable resources for information retrieval and knowledge discovery due to the constant increase in the amount of data available online. Given the size of the web, users easily get lost in the web’s rich hyperstructure. Application of data mining methods is the right solution for knowledge discovery on the Web. The knowledge extracted from the Web can be used to improve the performance of Web information retrieval, question answering and Web-based data warehousing. In this paper, I provide an introduction to Web mining categories and I focus on one of these categories: Web structure mining. Web structure mining, one of three categories of web mining for data, is a tool used to identify the relationship between Web pages linked by information or direct link connection. It offers information about how different pages are linked together to form this huge web. Web structure mining finds hidden basic structures and uses hyperlinks for more web applications such as web search.

  8. Bee Swarm Optimization for Medical Web Information Foraging.

    Science.gov (United States)

    Drias, Yassine; Kechid, Samir; Pasi, Gabriella

    2016-02-01

    The present work is related to Web intelligence and more precisely to medical information foraging. We present here a novel approach based on agent technology for information foraging. An architecture is proposed, in which we distinguish two important phases. The first one is a learning process for localizing the most relevant pages that might interest the user. This is performed on a fixed instance of the Web. The second takes into account the openness and the dynamicity of the Web. It consists of incremental learning that starts from the result of the first phase and reshapes the outcomes, taking into account the changes that the Web undergoes. The whole system offers a tool to help the user undertake information foraging. We implemented the system using a group of cooperative reactive agents and more precisely a colony of artificial bees. In order to validate our proposal, experiments were conducted on MedlinePlus, a benchmark dedicated to research in the domain of Health. The results are promising both for those related to Web regularities and for the response time, which is very short and hence complies with the real time constraint.

  9. Ontology-based Query Expansion for Arabic Text Retrieval

    OpenAIRE

    Waseem Alromima; Moawad, Ibrahim F.; Rania Elgohary; Mostafa Aref

    2016-01-01

    Semantic resources are important parts of Information Retrieval (IR) applications such as search engines and Question Answering (QA); these resources should be available, readable and understandable. In the semantic web, the ontology plays a central role in information retrieval, where it is used to retrieve more relevant information from unstructured information. This paper presents a semantic-based retrieval system for the Arabic text, which expands the input query semantically using Arabic domain...

  10. Study of query expansion techniques and their application in the biomedical information retrieval.

    Science.gov (United States)

    Rivas, A R; Iglesias, E L; Borrajo, L

    2014-01-01

    Information Retrieval focuses on finding documents whose content matches with a user query from a large document collection. As formulating well-designed queries is difficult for most users, it is necessary to use query expansion to retrieve relevant information. Query expansion techniques are widely applied for improving the efficiency of the textual information retrieval systems. These techniques help to overcome vocabulary mismatch issues by expanding the original query with additional relevant terms and reweighting the terms in the expanded query. In this paper, different text preprocessing and query expansion approaches are combined to improve the documents initially retrieved by a query in a scientific documental database. A corpus belonging to MEDLINE, called Cystic Fibrosis, is used as a knowledge source. Experimental results show that the proposed combinations of techniques greatly enhance the efficiency obtained by traditional queries.
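
    The idea of expanding a query with additional terms and reweighting them can be sketched as follows. This is a minimal illustration, not the authors' method: the related-term table, the weights, and the function names are all assumptions made for the example, and the scoring is a plain weighted term count.

```python
from collections import Counter

def expand_query(query_terms, related_terms, original_weight=1.0, expansion_weight=0.5):
    """Return {term: weight}; original terms keep full weight, added terms get less."""
    weights = {term: original_weight for term in query_terms}
    for term in query_terms:
        for extra in related_terms.get(term, []):
            # setdefault never lowers the weight of a term the user actually typed.
            weights.setdefault(extra, expansion_weight)
    return weights

def score(document_tokens, weighted_query):
    """Sum of query weight times term frequency in the document."""
    counts = Counter(document_tokens)
    return sum(weight * counts[term] for term, weight in weighted_query.items())

# Hypothetical related-term table; a real system might derive it from a
# thesaurus or from documents retrieved by the initial query.
related = {"lung": ["pulmonary"], "infection": ["pathogen"]}
query = expand_query(["lung", "infection"], related)
doc_a = "pulmonary pathogen found in cystic fibrosis patients".split()
doc_b = "dietary enzyme supplements".split()
print(score(doc_a, query), score(doc_b, query))
```

    Without expansion, doc_a shares no term with the query and scores zero, which is the vocabulary mismatch the abstract refers to.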

  11. Study of Query Expansion Techniques and Their Application in the Biomedical Information Retrieval

    Science.gov (United States)

    Rivas, A. R.; Iglesias, E. L.; Borrajo, L.

    2014-01-01

    Information Retrieval focuses on finding documents whose content matches with a user query from a large document collection. As formulating well-designed queries is difficult for most users, it is necessary to use query expansion to retrieve relevant information. Query expansion techniques are widely applied for improving the efficiency of the textual information retrieval systems. These techniques help to overcome vocabulary mismatch issues by expanding the original query with additional relevant terms and reweighting the terms in the expanded query. In this paper, different text preprocessing and query expansion approaches are combined to improve the documents initially retrieved by a query in a scientific documental database. A corpus belonging to MEDLINE, called Cystic Fibrosis, is used as a knowledge source. Experimental results show that the proposed combinations of techniques greatly enhance the efficiency obtained by traditional queries. PMID:24723793

  12. The validation of the Yonsei CArbon Retrieval algorithm with improved aerosol information using GOSAT measurements

    Science.gov (United States)

    Jung, Yeonjin; Kim, Jhoon; Kim, Woogyung; Boesch, Hartmut; Goo, Tae-Young; Cho, Chunho

    2017-04-01

    Although several CO2 retrieval algorithms have been developed to improve our understanding of the carbon cycle, limitations in spatial coverage and uncertainties due to aerosols and thin cirrus clouds remain a problem for monitoring CO2 concentration globally. Based on an optimal estimation method, the Yonsei CArbon Retrieval (YCAR) algorithm was developed to retrieve the column-averaged dry-air mole fraction of carbon dioxide (XCO2) using the Greenhouse Gases Observing SATellite (GOSAT) measurements with optimized a priori CO2 profiles and aerosol models over East Asia. In previous studies, the aerosol optical properties (AOPs) were the most important factors in CO2 retrievals because they were assumed to be fixed parameters during the retrieval process, resulting in significant XCO2 retrieval errors of up to 2.5 ppm. In this study, to reduce these errors caused by inaccurate aerosol optical information, the YCAR algorithm was improved by taking into account aerosol optical properties and the aerosol vertical distribution simultaneously. The CO2 retrievals with the two different aerosol approaches have been analyzed using the GOSAT spectra and have been evaluated through comparison with collocated ground-based observations at several Total Carbon Column Observing Network (TCCON) sites. The improved YCAR algorithm has biases of 0.59±0.48 ppm and 2.16±0.87 ppm at Saga and Tsukuba sites, respectively, with smaller biases and higher correlation coefficients compared to the GOSAT operational algorithm. In addition, the XCO2 retrievals will be validated at other TCCON sites and error analysis will be evaluated. These results reveal that considering better aerosol information can improve the accuracy of the CO2 retrieval algorithm and provide more useful XCO2 information with reduced uncertainties. This study is expected to provide useful information for estimating carbon sources and sinks.

  13. Towards Web-based representation and processing of health information

    DEFF Research Database (Denmark)

    Gao, S.; Mioc, Darka; Yi, X.L.

    2009-01-01

    at their fingertips. Increasingly complex problems in the health field require increasingly sophisticated computer software, distributed computing power, and standardized data sharing. To address this need, Web-based mapping is now emerging as an important tool to enable health practitioners, policy makers......, and the public to understand spatial health risks, population health trends and vulnerabilities. Today several web-based health applications generate dynamic maps; however, for people to fully interpret the maps they need data source description and the method used in the data analysis or statistical modeling....... For the representation of health information through Web-mapping applications, there still lacks a standard format to accommodate all fixed (such as location) and variable (such as age, gender, health outcome, etc) indicators in the representation of health information. Furthermore, net-centric computing has not been...

  14. Knowledge Maps and Information Retrieval (KMIR) : Organization of a workshop

    NARCIS (Netherlands)

    Mutschke, Peter; Scharnhorst, Andrea; Guéret, Christophe; Mayr, Philipp; Hansen, Preben; Slavic, Aida

    2014-01-01

    Information systems usually show as a particular point of failure the vagueness between user search terms and the knowledge orders of the information space in question. Some kind of guided searching therefore becomes more and more important in order to precisely discover information without knowing

  15. Publication and Retrieval of Computational Chemical-Physical Data Via the Semantic Web. Final Technical Report

    Energy Technology Data Exchange (ETDEWEB)

    Ostlund, Neil [Chemical Semantics, Inc., Gainesville, FL (United States)

    2017-07-20

    This research showed the feasibility of applying the concepts of the Semantic Web to Computational Chemistry. We have created the first web portal (www.chemsem.com) that allows data created in the calculations of quantum chemistry, and other such chemistry calculations to be placed on the web in a way that makes the data accessible to scientists in a semantic form never before possible. The semantic web nature of the portal allows data to be searched, found, and used as an advance over the usual approach of a relational database. The semantic data on our portal has the nature of a Giant Global Graph (GGG) that can be easily merged with related data and searched globally via the SPARQL Protocol and RDF Query Language (SPARQL), which makes global searches for data easier than with traditional methods. Our Semantic Web Portal requires that the data be understood by a computer and hence defined by an ontology (vocabulary). This ontology is used by the computer in understanding the data. We have created such an ontology for computational chemistry (purl.org/gc) that encapsulates a broad knowledge of the field of computational chemistry. We refer to this ontology as the Gainesville Core. While it is perhaps the first ontology for computational chemistry and is used by our portal, it is only a start of what must be a long multi-partner effort to define computational chemistry. In conjunction with the above efforts we have defined a new potential file standard (Common Standard for eXchange, CSX) for computational chemistry data. This CSX file is the precursor of data in the Resource Description Framework (RDF) form that the semantic web requires. Our portal translates CSX files (as well as other computational chemistry data files) into RDF files that are part of the graph database that the semantic web employs. We propose a CSX file as a convenient way to encapsulate computational chemistry data.

  16. Adding information may increase overconfidence in accuracy of knowledge retrieval.

    Science.gov (United States)

    Fleisig, Dida

    2011-04-01

    Feelings of retrospective confidence concerning the accuracy of a chosen answer might rely, among other things, on the amount of available information, regardless of its correctness. 43 participants, 26 women and 17 men (M age = 23.4 yr., SD = 3.5) in an intact group design, answered nine easy and nine difficult binary forced-choice questions and rated their confidence regarding the correctness of their choices. Participants were randomly assigned to one of three groups, differing in the additional information provided regarding the questions: a control group provided with no additional information, a correct information group, and a misleading information group. Performance was worst in the misleading information group, yet no difference in confidence was found between the correct and misleading information groups. The findings were interpreted as supporting the hypothesis that feelings of confidence partly reflect peripheral factors, indirectly related to choice processes.

  17. Computer Information Search and Retrieval: A Guide for the Music Educator.

    Science.gov (United States)

    Williams, David Brian; Beasley, L. Sue

    This report examines the features of computer information systems, differentiating between data files, search systems, and computer information organizations. An annotated listing is provided of those computer information retrieval files of interest to the music education researcher. This listing is divided into five categories: (1) music and…

  18. A Web Browser Interface to Manage the Searching and Organizing of Information on the Web by Learners

    Science.gov (United States)

    Li, Liang-Yi; Chen, Gwo-Dong

    2010-01-01

    Information Gathering is a knowledge construction process. Web learners make a plan for their Information Gathering task based on their prior knowledge. The plan is evolved with new information encountered and their mental model is constructed through continuously assimilating and accommodating new information gathered from different Web pages. In…

  19. Designing and Implementing a Cross-Language Information Retrieval System Using Linguistic Corpora

    Directory of Open Access Journals (Sweden)

    Amin Nezarat

    2012-03-01

    Full Text Available Information retrieval (IR) is a crucial area of natural language processing (NLP) and can be defined as finding documents whose content is relevant to the query need of a user. Cross-language information retrieval (CLIR) refers to a kind of information retrieval in which the language of the query and that of searched document are different. In fact, it is a retrieval process where the user presents queries in one language to retrieve documents in another language. This paper tried to construct a bilingual lexicon of parallel chunks of English and Persian from two very large monolingual corpora an English-Persian parallel corpus which could be directly applied to cross-language information retrieval tasks. For this purpose, a statistical measure known as Association Score (AS) was used to compute the association value between every two corresponding chunks in the corpus using a couple of complicated algorithms. Once the CLIR system was developed using this bilingual lexicon, an experiment was performed on a set of one hundred English and Persian phrases and collocations to see to what extent this system was effective in assisting the users find the most relevant and suitable equivalents of their queries in either language.

  20. Retrieval of air quality information using image processing technique.

    Science.gov (United States)

    Lim, H. S.; MatJafri, M. Z.; Abdullah, K.; Saleh, N. M.

    2007-04-01

    This paper presents and describes an approach to retrieve the concentration of particulate matter of size less than 10 micron (PM10) from Landsat TM data over Penang Island. The objective of this study is to test the feasibility of using Landsat TM for PM10 mapping using our proposed algorithm. The algorithm was developed based on the aerosol characteristics in the atmosphere. PM10 measurements were collected using a DustTrak Aerosol Monitor 8520 simultaneously with the image acquisition. The station locations of the PM10 measurements were determined using a hand held GPS. The digital numbers were extracted corresponding to the ground-truth locations for each band and then converted into radiance and reflectance values. The surface reflectance was subtracted from the reflectance measured from the satellite [reflectance at the top of atmosphere, ρ(TOA)] to obtain the atmospheric reflectance. Then the atmospheric reflectance was related to the PM10 using regression analysis. The surface reflectance values were created using ACTOR2 image correction software in the PCI Geomatica 9.1.8 image processing software. The proposed algorithm produced high accuracy and also showed a good agreement (R = 0.8406) between the measured and estimated PM10. This study indicates that it is feasible to use Landsat TM data for mapping PM10 using the proposed algorithm.
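
    The two computational steps described above, subtracting the surface reflectance from the top-of-atmosphere reflectance and then regressing PM10 on the result, can be sketched as below. This is a simplified single-band illustration under stated assumptions: the abstract does not give the actual regression form or band combination, and all names and numbers here are hypothetical.

```python
import math

def atmospheric_reflectance(toa, surface):
    """Atmospheric reflectance = top-of-atmosphere reflectance minus surface reflectance."""
    return [t - s for t, s in zip(toa, surface)]

def fit_line(x, y):
    """Ordinary least squares fit y = intercept + slope * x; returns (intercept, slope, r)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)  # Pearson correlation coefficient
    return intercept, slope, r

# Hypothetical values at four ground stations (illustrative only).
toa_reflectance = [0.12, 0.15, 0.18, 0.21]
surface_reflectance = [0.05, 0.06, 0.07, 0.08]
pm10_measured = [40.0, 50.0, 60.0, 70.0]

rho_atm = atmospheric_reflectance(toa_reflectance, surface_reflectance)
intercept, slope, r = fit_line(rho_atm, pm10_measured)
pm10_estimated = [intercept + slope * rho for rho in rho_atm]
```

    Applying the fitted line to the atmospheric reflectance of every pixel would then give a PM10 map of the scene.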

  1. Security Vulnerabilities of the Web Based Open Source Information ...

    African Journals Online (AJOL)

    This paper exposes security vulnerabilities of the web based Open Source Information Systems (OSIS) from both system and human perspectives. It shows the extent of risk that is likely to hinder an adopting organization from attaining the full intended benefits of using OSIS software. To undertake this study, a case study ...

  2. Designing a School's Web Site Using Information Architecture.

    Science.gov (United States)

    Vazquez, Gustavo; Victor, Stephen P.

    This paper is a case study of Longfellow Elementary, a K-8 school in San Diego (California) that is using the concepts of information architecture to develop its Web site. The site is intended to be a virtual meeting place for all of the school's constituents: parents, teachers, students, and the community at large. The site is a dynamic, ongoing…

  3. Accountable Information Flow for Java-Based Web Applications

    Science.gov (United States)

    2010-01-01

    industry-standard approach of Java 2 Enterprise Edition (J2EE) Enterprise JavaBeans (EJB) layered over a Microsoft SQL Server or Oracle database, while... Domain Information Sharing GWT Google Web Toolkit FabIL Fabric Intermediate Language J2EE Java 2 Enterprise Edition EJB Enterprise JavaBeans Jif Java

  4. Rational Analyses of Information Foraging on the Web

    Science.gov (United States)

    Pirolli, Peter

    2005-01-01

    This article describes rational analyses and cognitive models of Web users developed within information foraging theory. This is done by following the rational analysis methodology of (a) characterizing the problems posed by the environment, (b) developing rational analyses of behavioral solutions to those problems, and (c) developing cognitive…

  5. Semantic Web Technologies as the Foundation for the Information Infrastructure

    NARCIS (Netherlands)

    Van Oosterom, Peter; Zlatanova, S.; Van Harmelen, Frank; Van Oosterom, Peter; Zlatanova, S

    2008-01-01

    The Semantic Web has arisen over the past few years as a realistic option for a world wide Information Infrastructure, with its promises of semantic interoperability and serendipitous reuse. In this paper we will analyse the essential ingredients of semantic technologies, what makes them suitable as

  6. Design and implementation of a web based information system for ...

    African Journals Online (AJOL)

    The design and implementation of a web-based administrative information system for National Health Insurance Scheme (NHIS) using its guidelines has been carried out. The system allows any NHIS-Registered patient to visit any registered provider anywhere in the country and be assigned to a doctor. To carry out the ...

  7. Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

    DEFF Research Database (Denmark)

    Cao, Xin; Cong, Gao; Cui, Bin

    2012-01-01

    of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. This article presents several new approaches to exploiting the category information of questions for improving the performance of question retrieval......Community Question Answering (CQA) is a popular type of service where users ask questions and where answers are obtained from other users or from historical question-answer pairs. CQA archives contain large volumes of questions organized into a hierarchy of categories. As an essential function...

  8. Image retrieval by information fusion based on scalable vocabulary tree and robust Hausdorff distance

    Science.gov (United States)

    Che, Chang; Yu, Xiaoyang; Sun, Xiaoming; Yu, Boyang

    2017-12-01

    In recent years, Scalable Vocabulary Tree (SVT) has been shown to be effective in image retrieval. However, for general images where the foreground is the object to be recognized while the background is cluttered, the performance of the current SVT framework is restricted. In this paper, a new image retrieval framework that incorporates a robust distance metric and information fusion is proposed, which improves the retrieval performance relative to the baseline SVT approach. First, the visual words that represent the background are diminished by using a robust Hausdorff distance between different images. Second, image matching results based on three image signature representations are fused, which enhances the retrieval precision. We conducted intensive experiments on small-scale to large-scale image datasets: Corel-9, Corel-48, and PKU-198, where the proposed Hausdorff metric and information fusion outperform the state-of-the-art methods by about 13, 15, and 15%, respectively.
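
    To make the notion of a Hausdorff distance that tolerates background clutter concrete, here is a minimal sketch. The abstract does not define its robust variant, so this example substitutes the well-known partial (rank-based) Hausdorff distance; the `fraction` parameter, the function names, and the sample points are assumptions made for illustration.

```python
import math

def directed_hausdorff(a, b, fraction=1.0):
    """Ranked directed distance from point set a to point set b.

    fraction=1.0 gives the classical directed Hausdorff distance (the largest
    nearest-neighbour distance). A smaller fraction takes a lower-ranked
    distance and so ignores the worst-matching points.
    """
    nearest = sorted(min(math.dist(p, q) for q in b) for p in a)
    index = max(0, math.ceil(fraction * len(nearest)) - 1)
    return nearest[index]

def hausdorff(a, b, fraction=1.0):
    """Symmetric distance: the larger of the two directed distances."""
    return max(directed_hausdorff(a, b, fraction), directed_hausdorff(b, a, fraction))

# Hypothetical 2-D feature locations; (10, 10) plays the role of a background outlier.
features_a = [(0, 0), (1, 0), (10, 10)]
features_b = [(0, 0), (1, 0), (1, 1)]
classical = hausdorff(features_a, features_b)          # dominated by the outlier
partial = hausdorff(features_a, features_b, 0.5)       # outlier ignored
```

    The classical distance is driven entirely by the single outlier, whereas the ranked variant reports that the bulk of the two point sets coincide.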

  9. Advances in metabolome information retrieval: turning chemistry into biology. Part II: biological information recovery.

    Science.gov (United States)

    Tebani, Abdellah; Afonso, Carlos; Bekri, Soumeya

    2017-08-25

    This work reports the second part of a review intending to give the state of the art of major metabolic phenotyping strategies. It particularly deals with inherent advantages and limits regarding data analysis issues and biological information retrieval tools along with translational challenges. This Part starts with introducing the main data preprocessing strategies of the different metabolomics data. Then, it describes the main data analysis techniques including univariate and multivariate aspects. It also addresses the challenges related to metabolite annotation and characterization. Finally, functional analysis including pathway and network strategies are discussed. The last section of this review is devoted to practical considerations and current challenges and pathways to bring metabolomics into clinical environments.

  10. Handling Internet-Based Health Information: Improving Health Information Web Site Literacy Among Undergraduate Nursing Students.

    Science.gov (United States)

    Wang, Weiwen; Sun, Ran; Mulvehill, Alice M; Gilson, Courtney C; Huang, Linda L

    2017-02-01

    Patient care problems arise when health care consumers and professionals find health information on the Internet because that information is often inaccurate. To mitigate this problem, nurses can develop Web literacy and share that skill with health care consumers. This study evaluated a Web-literacy intervention for undergraduate nursing students to find reliable Web-based health information. A pre- and postsurvey queried undergraduate nursing students in an informatics course; the intervention comprised lecture, in-class practice, and assignments about health Web site evaluation tools. Data were analyzed using Wilcoxon signed-rank tests and ANOVA. Pre-intervention, 75.9% of participants reported using Web sites to obtain health information. Postintervention, 87.9% displayed confidence in using an evaluation tool. Both the ability to critique health Web sites (p = .005) and confidence in finding reliable Internet-based health information (p = .058) increased. Web-literacy education guides nursing students to find, evaluate, and use reliable Web sites, which improves their ability to deliver safer patient care. [J Nurs Educ. 2017;56(2):110-114.]. Copyright 2017, SLACK Incorporated.

  11. Factors influencing user ability to retrieve information from the ...

    African Journals Online (AJOL)

    Based on these findings, recommendations were made urging for a clearly defined and well articulated set of policies to guide reference service as well as strengthening information literacy interventions in university libraries to enhance user ability to locate, evaluate and effectively use required information for academic ...

  12. Evaluation of some Information Retrieval models for Gujarati Ad hoc Monolingual Tasks

    OpenAIRE

    J., Joshi Hardik; Jyoti, Pareek

    2012-01-01

    This paper describes the work towards Gujarati Ad hoc Monolingual Retrieval task for widely used Information Retrieval (IR) models. We present an indexing baseline for the Gujarati Language represented by Mean Average Precision (MAP) values. Our objective is to obtain a relative picture of a better IR model for Gujarati Language. Results show that Classical IR models like Term Frequency Inverse Document Frequency (TF_IDF) perform better when compared to a few recent probabilistic IR models. Th...
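
    Since the record above reports its baseline as Mean Average Precision, a minimal sketch of how MAP is conventionally computed may help. The document identifiers and relevance judgments below are hypothetical and are not taken from the paper.

```python
def average_precision(ranked_ids, relevant_ids):
    """Average of precision@k taken at each rank k where a relevant document appears."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """`runs` is a list of (ranked_ids, relevant_ids) pairs, one per query."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Two hypothetical queries with their ranked results and relevance judgments.
runs = [
    (["d1", "d2", "d3", "d4"], {"d1", "d3"}),
    (["d5", "d6"], {"d6"}),
]
print(round(mean_average_precision(runs), 4))
```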

  13. Effects of Diacritics on Web Search Engines’ Performance for Retrieval of Yoruba Documents

    Directory of Open Access Journals (Sweden)

    Toluwase Victor Asubiaro

    2014-06-01

    This paper aims to find out the possible effect of the use or nonuse of diacritics in Yoruba search queries on the performance of major search engines, AOL, Bing, Google and Yahoo!, in retrieving documents. 30 Yoruba queries created from the most searched keywords from Nigeria on Google search logs were submitted to the search engines. The search queries were posed to the search engines without diacritics and then with diacritics. All of the search engines retrieved more sites in response to the queries without diacritics. Also, they all retrieved more precise results for queries without diacritics. The search engines also answered more queries without diacritics. There was no significant difference in the precision values of any two of the four search engines for diacritized and undiacritized queries. There was a significant difference in the effectiveness of AOL and Yahoo when diacritics were applied and when they were not applied. The findings of the study indicate that the search engines do not find a relationship between the diacritized Yoruba words and the undiacritized versions. Therefore, there is a need for search engines to add normalization steps to pre-process Yoruba queries and indexes. This study concentrates on a problem with search engines that has not been previously investigated.
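
    One possible form of the normalization step recommended above is to strip combining marks after Unicode decomposition, so that diacritized and undiacritized spellings map to the same index key. This is a minimal sketch using Python's standard `unicodedata` module; the function names are illustrative assumptions, and the paper does not prescribe a specific method.

```python
import unicodedata


def strip_diacritics(text):
    """Decompose characters (NFD), drop combining marks, then recompose (NFC)."""
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return unicodedata.normalize("NFC", stripped)


def normalize_query(query):
    """Key used for both indexing and querying: no diacritics, case-folded."""
    return strip_diacritics(query).casefold()


print(normalize_query("Yorùbá"))
```

    Stripping tone marks and under-dots is lossy: distinct Yoruba words can collapse to the same key, so a real system might keep the original form alongside the normalized one for ranking.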

  14. Using Bayesian networks to support decision-focused information retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Lehner, P.; Elsaesser, C.; Seligman, L. [Mitre Corp., McLean, VA (United States)]

    1996-12-31

    This paper describes an approach to controlling the process of pulling data and information from distributed databases in a way that is specific to a person's decision-making context. Our prototype implementation of this approach uses a knowledge-based planner to generate a plan; an automatically constructed Bayesian network to evaluate the plan; specialized processing of the network to derive key information items that would substantially impact the evaluation of the plan (e.g., determine that replanning is needed); and automated construction of Standing Requests for Information (SRIs), which are automated functions that monitor changes and trends in distributed databases that are relevant to the key information items. The emphasis of this paper is on how Bayesian networks are used.

  15. Has Web 2.0 Revitalized Informal Learning? The Relationship between Web 2.0 and Informal Learning

    Science.gov (United States)

    Song, D.; Lee, J.

    2014-01-01

    Learning is becoming increasingly self-directed and often occurs away from schools and other formal educational settings. The development of a myriad of new technologies for learning has enabled people to learn anywhere and anytime. Web 2.0 technology allows researchers to shed a new light on the importance and prevalence of informal learning.…

  16. A WEB API AND WEB APPLICATION DEVELOPMENT FOR DISSEMINATION OF AIR QUALITY INFORMATION

    Directory of Open Access Journals (Sweden)

    K. Şahin

    2017-11-01

    Various studies have been carried out since 2005 under the leadership of Ministry of Environment and Urbanism of Turkey, in order to observe the quality of air in Turkey, to develop new policies and to develop a sustainable air quality management strategy. For this reason, a national air quality monitoring network has been developed providing air quality indices. By this network, the quality of the air has been continuously monitored and an important information system has been constructed in order to take precautions for preventing a dangerous situation. The biggest handicap in the network is the data access problem for instant and time series data acquisition and processing because of its proprietary structure. Currently, there is no service offered by the current air quality monitoring system for exchanging information with third party applications. Within the context of this work, a web service has been developed to enable location based querying of the current/past air quality data in Turkey. This web service is equipped with up-to-date and widely preferred technologies. In other words, an architecture is chosen in which applications can easily integrate. In the second phase of the study, a web-based application was developed to test the developed web service and this testing application can perform location based acquisition of air-quality data. This makes it possible to easily carry out operations such as screening and examination of the area in the given time-frame which cannot be done with the national monitoring network.

  17. A Web API and Web Application Development for Dissemination of Air Quality Information

    Science.gov (United States)

    Şahin, K.; Işıkdağ, U.

    2017-11-01

    Various studies have been carried out since 2005 under the leadership of Ministry of Environment and Urbanism of Turkey, in order to observe the quality of air in Turkey, to develop new policies and to develop a sustainable air quality management strategy. For this reason, a national air quality monitoring network has been developed providing air quality indices. By this network, the quality of the air has been continuously monitored and an important information system has been constructed in order to take precautions for preventing a dangerous situation. The biggest handicap in the network is the data access problem for instant and time series data acquisition and processing because of its proprietary structure. Currently, there is no service offered by the current air quality monitoring system for exchanging information with third party applications. Within the context of this work, a web service has been developed to enable location based querying of the current/past air quality data in Turkey. This web service is equipped with up-to-date and widely preferred technologies. In other words, an architecture is chosen in which applications can easily integrate. In the second phase of the study, a web-based application was developed to test the developed web service and this testing application can perform location based acquisition of air-quality data. This makes it possible to easily carry out operations such as screening and examination of the area in the given time-frame which cannot be done with the national monitoring network.

  18. Querying Data Providing Web Services

    OpenAIRE

    Sabesan, Manivasakan

    2010-01-01

    Web services are often used for search computing where data is retrieved from servers providing information of different kinds. Such data providing web services return a set of objects for a given set of parameters without any side effects. There is a need to enable general and scalable search capabilities of data from data providing web services, which is the topic of this thesis. The Web Service MEDiator (WSMED) system automatically provides relational views of any data providing web service ...

  19. 'Meatball searching' - The adversarial approach to online information retrieval

    Science.gov (United States)

    Jack, R. F.

    1985-01-01

    It is proposed that the different styles of online searching can be described as either formal (highly precise) or informal with the needs of the client dictating which is most applicable at a particular moment. The background and personality of the searcher also come into play. Particular attention is focused on meatball searching which is a form of online searching characterized by deliberate vagueness. It requires generally comprehensive searches, often on unusual topics and with tight deadlines. It is most likely to occur in search centers serving many different disciplines and levels of client information sophistication. Various information needs are outlined as well as the laws of meatball searching and the adversarial approach. Traits and characteristics important to successful searching include: (1) concept analysis, (2) flexibility of thinking, (3) ability to think in synonyms and (4) anticipation of variant word forms and spellings.

  20. Information Extraction and Linking in a Retrieval Context

    NARCIS (Netherlands)

    Moens, M.F.; Hiemstra, Djoerd

    We witness a growing interest and capabilities of automatic content recognition (often referred to as information extraction) in various media sources that identify entities (e.g. persons, locations and products) and their semantic attributes (e.g., opinions expressed towards persons or products,

  1. Professional assistance to users of information retrieval tools at the ...

    African Journals Online (AJOL)

  2. Fast and reliable online learning to rank for information retrieval

    NARCIS (Netherlands)

    Hofmann, K.

    2013-01-01

    The amount of digital data we produce every day far surpasses our ability to process this data, and finding useful information in this constant flow of data has become one of the major challenges of the 21st century. Search engines are one way of accessing large data collections. Their algorithms

  3. Dublin Core and Electronic Information Retrieval | Gbaje | Samaru ...

    African Journals Online (AJOL)

    Samaru Journal of Information Studies, Vol 6, No 1 (2006).

  4. Perspectives on Adaptivity in Information Retrieval Interaction (PAIRI)

    DEFF Research Database (Denmark)

    Ingwersen, Peter; Larsen, Birger; Kelly, Diane

    2010-01-01

    Adaptivity in IR interactions requires the IR systems adapting to users’ situations and the users adapting to the systems. System adaption entails dynamic user modeling, effective information architecture and enhanced search features such as search integration and relevance feedback; user adaptat...

  5. Systematisierung und Evaluierung von Clustering-Verfahren im Information Retrieval

    OpenAIRE

    Kürsten, Jens

    2006-01-01

    This diploma thesis examines cluster analysis methods and their potential applications for optimizing the search results of information retrieval systems. A systematic analysis of the cluster analysis methods established in research forms the basis for a comparative evaluation of promising clustering approaches using the Domain Specific Monolingual Tasks of the Cross-Language Evaluation Forum 2006. The implementation ...

  6. Content-Based Information Retrieval from Forensic Databases

    NARCIS (Netherlands)

    Geradts, Z.J.M.H.

    2002-01-01

    In forensic science, the number of image databases is growing rapidly. For this reason, it is necessary to have a proper procedure for searching in these images databases based on content. The use of image databases results in more solved crimes; furthermore, statistical information can be obtained

  7. The Use of Metadata Visualisation Assist Information Retrieval

    Science.gov (United States)

    2007-10-01

    centred issues have been identified and they include; usability, prior knowledge, understanding of elementary perceptual-cognitive tasks and education ...pertain to information visualisation is required. • Education and Training The problems associated with education and training can be overcome... customised data. A coordinated visualisation interface consists of a set of visualisations, which can interact, portraying the relationship that

  8. 42 CFR 433.116 - FFP for operation of mechanized claims processing and information retrieval systems.

    Science.gov (United States)

    2010-10-01

    ... FISCAL ADMINISTRATION Mechanized Claims Processing and Information Retrieval Systems § 433.116 FFP for... Medicaid fraud control unit certified under section 1903(q) of the Act and § 455.300 of this chapter, the Medicaid agency must have procedures to assure that information on probable fraud or abuse that is obtained...

  9. Information Visualization and Proposing New Interface for Movie Retrieval System (IMDB)

    Science.gov (United States)

    Etemadpour, Ronak; Masood, Mona; Belaton, Bahari

    2010-01-01

    This research studies the development of a new prototype of visualization in support of movie retrieval. The goal of information visualization is unveiling of large amounts of data or abstract data set using visual presentation. With this knowledge the main goal is to develop a 2D presentation of information on movies from the IMDB (Internet Movie…

  10. A Workshop on Qualitative Information Retrieval, November 18-20, 1980,

    Science.gov (United States)

    1981-09-28

    traditional library with card catalog. It does not accurately reflect the functions that most information systems must perform in the 1980's. It...defined using emerging technologies for storing and retrieving both digital and visual information. For example, optical videodisc technology will

  11. Aiming for User Experience in Information Retrieval: Towards User-Centered Relevance (USR)

    NARCIS (Netherlands)

    van der Sluis, Frans; van Dijk, Elisabeth M.A.G.; van den Broek, Egon; Chen, Hsin-Hsi; Efthimiadis, Efthimis N.; Savoy, Jacques; Crestani, Fabio; Marchand-Maillet, Stephane

    2010-01-01

    As widely recognized, there is more to relevance than topicality. By looking at the user experience of Information Retrieval (IR), this proposal takes a broader perspective on relevance. Several facets of relevance are structured according to how the user will experience an information object. In

  12. The Evolution of Web Searching.

    Science.gov (United States)

    Green, David

    2000-01-01

    Explores the interrelation between Web publishing and information retrieval technologies and lists new approaches to Web indexing and searching. Highlights include Web directories; search engines; portalisation; Internet service providers; browser providers; meta search engines; popularity based analysis; natural language searching; links-based…

  13. Energy for agriculture. A computerized information retrieval system

    Energy Technology Data Exchange (ETDEWEB)

    Stout, B.A.; Myers, C.A. (comps.)

    1979-12-01

    Energy may come from the sun or the earth or be the product of plant materials or agricultural wastes. Whatever its source, energy is indispensable to our way of life, beginning with the production, processing, and distribution of abundant, high quality food and fiber supplies. This specialized bibliography on the subject of energy for agriculture contains 2613 citations to the literature for 1973 through May 1979. Originally issued by Michigan State University (MSU), it is being reprinted and distributed by the U.S. Department of Agriculture. The literature citations will be incorporated into AGRICOLA (Agricultural On-Line Access), the comprehensive bibliographic data base maintained by Technical Information Systems (TIS), a component of USDA's Science and Education Administration (SEA). The citations and the listing of research projects will be combined with other relevant references to provide a continuously updated source of information on energy programs in the agricultural field. No abstracts are included.

  14. Public Health Information Retrieval from Non-health Databases

    Directory of Open Access Journals (Sweden)

    Thumeka Mgwigwi

    2012-06-01

    This study examines the extent to which non-health databases index public health and healthcare related journals. The field of public health and healthcare is unique and multidisciplinary and therefore presents some challenges for researchers looking for published literature in the field. This challenge forces researchers to look beyond databases like Medline and search a wide array of databases in various fields. A list of journal titles from non-health databases in various fields was used to compare title coverage in Medline (Ovid). Databases used in this study are Canadian Business & Current Affairs (CBCA) Complete, which is a multidisciplinary database; ABI/Inform covering business literature; Public Affairs Information Services (PAIS); EconLit; PsycInfo focusing only on public health journals and eliminating psychology specific journals; Sociological Abstracts; and Women’s Studies International.

  15. Information retrieval and pedagogy in adapted physical activity.

    Science.gov (United States)

    O'Connor, J; Sherrill, C; French, R

    2001-06-01

    The purpose was to address which databases would be most productive for literature searches by professionals seeking information on adapted physical activity pedagogy. Four databases were searched using 126 pedagogy and 66 disability terms. The results of the searches (4,130 hits) support the use of Sport Discus (n = 2,442 hits) as the most productive database for searches on adapted physical activity pedagogy.

  16. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative

    Directory of Open Access Journals (Sweden)

    Sebastian Stober

    2017-08-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition–such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, electroencephalography (EEG) recordings of music perception and imagination have been made publicly available, enabling MIR researchers to easily test and adapt their existing approaches for music analysis like fingerprinting, beat tracking or tempo estimation on this new kind of data. This paper reports on first results of MIIR experiments using these OpenMIIR datasets and points out how these findings could drive new research in cognitive neuroscience.

  17. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative.

    Science.gov (United States)

    Stober, Sebastian

    2017-01-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition–such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, electroencephalography (EEG) recordings of music perception and imagination have been made publicly available, enabling MIR researchers to easily test and adapt their existing approaches for music analysis like fingerprinting, beat tracking or tempo estimation on this new kind of data. This paper reports on first results of MIIR experiments using these OpenMIIR datasets and points out how these findings could drive new research in cognitive neuroscience.

  18. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative

    Science.gov (United States)

    Stober, Sebastian

    2017-01-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition–such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, electroencephalography (EEG) recordings of music perception and imagination have been made publicly available, enabling MIR researchers to easily test and adapt their existing approaches for music analysis like fingerprinting, beat tracking or tempo estimation on this new kind of data. This paper reports on first results of MIIR experiments using these OpenMIIR datasets and points out how these findings could drive new research in cognitive neuroscience. PMID:28824478

  19. Better late than never: information retrieval from black holes.

    Science.gov (United States)

    Braunstein, Samuel L; Pirandola, Stefano; Życzkowski, Karol

    2013-03-08

    We show that, in order to preserve the equivalence principle until late times in unitarily evaporating black holes, the thermodynamic entropy of a black hole must be primarily entropy of entanglement across the event horizon. For such black holes, we show that the information entering a black hole becomes encoded in correlations within a tripartite quantum state, the quantum analogue of a one-time pad, and is only decoded into the outgoing radiation very late in the evaporation. This behavior generically describes the unitary evaporation of highly entangled black holes and requires no specially designed evolution. Our work suggests the existence of a matter-field sum rule for any fundamental theory.

  20. Information Storage and Retrieval using Macromolecules as Storage Media

    OpenAIRE

    Mansuripur, M.; Khulbe, P. K.; Kuebler, S. M.; Perry, J W; Giridhar, M. S.; Erwin, J. Kevin; Seong, Kibyung; Marder, Seth; Peyghambarian, N

    2017-01-01

    To store information at extremely high-density and data-rate, we propose to adapt, integrate, and extend the techniques developed by chemists and molecular biologists for the purpose of manipulating biological and other macromolecules. In principle, volumetric densities in excess of 10^21 bits/cm^3 can be achieved when individual molecules having dimensions below a nanometer or so are used to encode the 0's and 1's of a binary string of data. In practice, however, given the limitations of ele...
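
    The quoted density can be checked to an order of magnitude. Assuming, purely for illustration, that one bit occupies about one cubic nanometre (the abstract says only that the molecules have dimensions "below a nanometer or so"):

```latex
\[
1\,\text{cm} = 10^{7}\,\text{nm}
\quad\Longrightarrow\quad
1\,\text{cm}^{3} = \left(10^{7}\right)^{3}\,\text{nm}^{3} = 10^{21}\,\text{nm}^{3},
\qquad
\frac{1\,\text{bit}}{1\,\text{nm}^{3}} = 10^{21}\,\text{bits/cm}^{3}.
\]
```

    Molecules smaller than a nanometre on a side would therefore exceed this figure, consistent with "in excess of 10^21 bits/cm^3".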

  1. Implementation of Web-based Information Systems in Distributed Organizations

    DEFF Research Database (Denmark)

    Bødker, Keld; Pors, Jens Kaaber; Simonsen, Jesper

    2004-01-01

    This article presents results elicited from studies conducted in relation to implementing a web-based information system throughout a large distributed organization. We demonstrate the kind of expectations and conditions for change that management face in relation to open-ended, configurable, and context specific web-based information systems like Lotus QuickPlace. Our synthesis from the empirical findings is related to two recent models, the improvisational change management model suggested by Orlikowski and Hofman (1997), and Gallivan's (2001) model for organizational adoption and assimilation. In line with comparable approaches from the knowledge management area (Dixon 2000; Markus 2001), we relate to, refine, and operationalize the models from an overall organizational view by identifying and characterizing four different and general implementation contexts...

  2. Episodic Memory Retrieval Functionally Relies on Very Rapid Reactivation of Sensory Information.

    Science.gov (United States)

    Waldhauser, Gerd T; Braun, Verena; Hanslmayr, Simon

    2016-01-06

    Episodic memory retrieval is assumed to rely on the rapid reactivation of sensory information that was present during encoding, a process termed "ecphory." We investigated the functional relevance of this scarcely understood process in two experiments in human participants. We presented stimuli to the left or right of fixation at encoding, followed by an episodic memory test with centrally presented retrieval cues. This allowed us to track the reactivation of lateralized sensory memory traces during retrieval. Successful episodic retrieval led to a very early (∼100-200 ms) reactivation of lateralized alpha/beta (10-25 Hz) electroencephalographic (EEG) power decreases in the visual cortex contralateral to the visual field at encoding. Applying rhythmic transcranial magnetic stimulation to interfere with early retrieval processing in the visual cortex led to decreased episodic memory performance specifically for items encoded in the visual field contralateral to the site of stimulation. These results demonstrate, for the first time, that episodic memory functionally relies on very rapid reactivation of sensory information. Remembering personal experiences requires a "mental time travel" to revisit sensory information perceived in the past. This process is typically described as a controlled, relatively slow process. However, by using electroencephalography to measure neural activity with a high time resolution, we show that such episodic retrieval entails a very rapid reactivation of sensory brain areas. Using transcranial magnetic stimulation to alter brain function during retrieval revealed that this early sensory reactivation is causally relevant for conscious remembering. These results give first neural evidence for a functional, preconscious component of episodic remembering. This provides new insight into the nature of human memory and may help in the understanding of psychiatric conditions that involve the automatic intrusion of unwanted memories.

  3. Practical Side of the Bibliographic Information Retrieval System in the National Museum of Ethnology

    Science.gov (United States)

    Kondo, Katsuichi

    The information retrieval system of the National Museum of Ethnology made its debut in 1979 and now makes it possible to search for books held not only in the Museum but also elsewhere in the country and abroad by means of JAPAN MARC and LC MARC. The author first briefly outlines the information management system and its development, including the above, and then presents practical cases of using the retrieval system. Problems to be solved in the course of future plans are also mentioned.

  4. Improving information retrieval using Medical Subject Headings Concepts: a test case on rare and chronic diseases.

    Science.gov (United States)

    Darmoni, Stéfan J; Soualmia, Lina F; Letord, Catherine; Jaulent, Marie-Christine; Griffon, Nicolas; Thirion, Benoît; Névéol, Aurélie

    2012-07-01

    As more scientific work is published, it is important to improve access to the biomedical literature. Since 2000, when Medical Subject Headings (MeSH) Concepts were introduced, the MeSH Thesaurus has been concept based. Nevertheless, information retrieval is still performed at the MeSH Descriptor or Supplementary Concept level. The study assesses the benefit of using MeSH Concepts for indexing and information retrieval. Three sets of queries were built for thirty-two rare diseases and twenty-two chronic diseases: (1) using PubMed Automatic Term Mapping (ATM), (2) using Catalog and Index of French-language Health Internet (CISMeF) ATM, and (3) extrapolating the MEDLINE citations that should be indexed with a MeSH Concept. Type 3 queries retrieve significantly fewer results than type 1 or type 2 queries (about 18,000 citations versus 200,000 for rare diseases; about 300,000 citations versus 2,000,000 for chronic diseases). CISMeF ATM also provides better precision than PubMed ATM for both disease categories. Using MeSH Concept indexing instead of ATM is theoretically possible to improve retrieval performance with the current indexing policy. However, using MeSH Concept information retrieval and indexing rules would be a fundamentally better approach. These modifications have already been implemented in the CISMeF search engine.

  5. Web Services and Widgets for Library Information Systems

    Directory of Open Access Journals (Sweden)

    Godmar Back

    2010-06-01

    As more libraries integrate information from web services to enhance their online public displays, techniques that facilitate this integration are needed. This paper presents a technique for such integration that is based on HTML widgets. We discuss three example systems (Google Book Classes, Tictoclookup, and MAJAX) that implement this technique. These systems can be easily adapted without requiring programming experience or expensive hosting.

  6. Information storage and retrieval in a single levitating colloidal particle

    Science.gov (United States)

    Myers, Christopher J.; Celebrano, Michele; Krishnan, Madhavi

    2015-10-01

    The binary switch is a basic component of digital information. From phase-change alloys to nanomechanical beams, molecules and atoms, new strategies for controlled bistability hold great interest for emerging technologies. We present a generic methodology for precise and parallel spatiotemporal control of nanometre-scale matter in a fluid, and demonstrate the ability to attain digital functionalities such as switching, gating and data storage in a single colloid, with further implications for signal amplification and logic operations. This fluid-phase bit can be arrayed at high densities, manipulated by either electrical or optical fields, supports low-energy, high-speed operation and marks a first step toward ‘colloidal information’. The principle generalizes to any system where spatial perturbation of a particle elicits a differential response amenable to readout.

  7. Flexible patient information search and retrieval framework: pilot implementation

    Science.gov (United States)

    Erdal, Selnur; Catalyurek, Umit V.; Saltz, Joel; Kamal, Jyoti; Gurcan, Metin N.

    2007-03-01

    Medical centers collect and store a significant amount of valuable data pertaining to patients' visits in the form of medical free text. In addition, standardized diagnosis codes (International Classification of Diseases, Ninth Revision, Clinical Modification: ICD9-CM) related to those dictated reports are usually available. In this work, we have created a framework where image searches could be initiated through a combination of free-text reports as well as ICD9 codes. This framework enables more comprehensive search on existing large sets of patient data in a systematic way. The free-text search is enriched by computer-aided inclusion of additional search terms enhanced by a thesaurus. This combination of enriched search allows users to access a larger set of relevant results from a patient-centric PACS in a simpler way. Therefore, such a framework is of particular use in tasks such as gathering images for desired patient populations, building disease models, and so on. As the motivating application of our framework, we implemented a search engine. This search engine processed two years of patient data from the OSU Medical Center's Information Warehouse and identified lung nodule location information using a combination of UMLS Meta-Thesaurus enhanced text report searches along with ICD9 code searches on patients that have been discharged. Five different queries with various ICD9 codes involving lung cancer were carried out on 172552 cases. Each search was completed in under a minute on average per ICD9 code, and the inclusion of the UMLS thesaurus increased the number of relevant cases by 45% on average.
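
    The combination of thesaurus-enriched free-text search with diagnosis-code filtering described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the tiny `THESAURUS` dictionary stands in for a real resource such as the UMLS Metathesaurus, the `REPORTS` records are invented, and the substring matching is far simpler than whatever the authors' framework actually does.

```python
# Hypothetical synonym table standing in for a real thesaurus (assumption).
THESAURUS = {
    "lung nodule": {"pulmonary nodule", "lung lesion"},
}

# Invented example records: each has a set of diagnosis codes and a free-text report.
REPORTS = [
    {"id": 1, "icd9": {"162.9"}, "text": "CT shows a pulmonary nodule in the right upper lobe."},
    {"id": 2, "icd9": {"162.9"}, "text": "No acute findings."},
    {"id": 3, "icd9": {"486"}, "text": "Lung nodule noted; pneumonia suspected."},
    {"id": 4, "icd9": {"162.3"}, "text": "Lung nodule, left upper lobe."},
]


def expand(term, thesaurus):
    """Return the query term together with its thesaurus synonyms, lowercased."""
    term = term.lower()
    return {term} | {s.lower() for s in thesaurus.get(term, set())}


def search(reports, term, icd9_codes, thesaurus=None):
    """Return ids of reports that carry one of the codes and mention the term.

    When a thesaurus is given, synonyms of the term also count as a mention.
    """
    terms = expand(term, thesaurus) if thesaurus else {term.lower()}
    hits = []
    for report in reports:
        if not (report["icd9"] & icd9_codes):
            continue  # diagnosis-code filter
        text = report["text"].lower()
        if any(t in text for t in terms):
            hits.append(report["id"])
    return hits


codes = {"162.9", "162.3"}
plain = search(REPORTS, "lung nodule", codes)
enriched = search(REPORTS, "lung nodule", codes, THESAURUS)
print(plain, enriched)
```

    In this toy data the enriched query additionally finds the report that says "pulmonary nodule", which mirrors, in miniature, the reported gain in relevant cases from thesaurus expansion.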

  8. Hospital nurses' information retrieval behaviours in relation to evidence based nursing: a literature review.

    Science.gov (United States)

    Alving, Berit Elisabeth; Christiansen, Janne Buck; Thrysoe, Lars

    2018-01-12

    The purpose of this literature review is to provide an overview of the information retrieval behaviour of clinical nurses, in terms of the use of databases and other information resources and their frequency of use. Systematic searches carried out in five databases and handsearching were used to identify the studies from 2010 to 2016, with a populations, exposures and outcomes (PEO) search strategy, focusing on the question: In which databases or other information resources do hospital nurses search for evidence based information, and how often? Of 5272 titles retrieved based on the search strategy, only nine studies fulfilled the criteria for inclusion. The studies are from the United States, Canada, Taiwan and Nigeria. The results show that hospital nurses' primary choice of source for evidence based information is Google and peers, while bibliographic databases such as PubMed are secondary choices. Data on frequency are only included in four of the studies, and data are heterogeneous. The reasons for choosing Google and peers are primarily lack of time; lack of information; lack of retrieval skills; or lack of training in database searching. Only a few studies are published on clinical nurses' retrieval behaviours, and more studies are needed from Europe and Australia. © 2018 Health Libraries Group.

  9. An information filtering system prototype for world wide web

    Energy Technology Data Exchange (ETDEWEB)

    Bordoni, L. [ENEA Centro Ricerche Casaccia, S. Maria di Galeria, RM (Italy). Funzione Centrale Studi]

    1999-07-01

    In this report the architecture of an adaptive information filtering system for the world wide web, developed by the Rome Third University (Italy) for ENEA (National Agency for New Technology, Energy and the Environment), is described. This prototype selects documents in text/HTML format from the web according to the characteristics and interests of users. A user modeling shell makes it possible to build a model of the user's interests, obtained during the interaction. The experimental results support the choice of user modeling methods for this kind of application.

  10. Large-scale distributed foraging, gathering, and matching for information retrieval: assisting the geospatial intelligence analyst

    Science.gov (United States)

    Santos, Eugene, Jr.; Santos, Eunice E.; Nguyen, Hien; Pan, Long; Korah, John

    2005-03-01

    With the proliferation of online resources, there is an increasing need to effectively and efficiently retrieve data and knowledge from distributed geospatial databases. One of the key challenges of this problem is the fact that geospatial databases are usually large and dynamic. In this paper, we address this problem by developing a large scale distributed intelligent foraging, gathering and matching (I-FGM) framework for massive and dynamic information spaces. We assess the effectiveness of our approach by comparing a prototype I-FGM against two simple control systems (randomized selection and partially intelligent systems). We designed and employed a medium-sized testbed to get an accurate measure of retrieval precision and recall for each system. The results obtained show that I-FGM retrieves relevant information more quickly than the two other control approaches.

  11. Web application for recording learners’ mouse trajectories and retrieving their study logs for data analysis

    Directory of Open Access Journals (Sweden)

    Yoshinori Miyazaki

    2012-03-01

    With the accelerated implementation of e-learning systems in educational institutions, it has become possible to record learners’ study logs in recent years. It must be admitted that little research has been conducted upon the analysis of the study logs that are obtained. In addition, there is no software that traces the mouse movements of learners during their learning processes, which the authors believe would enable teachers to better understand their students’ behaviors. The objective of this study is to develop a Web application that records students’ study logs, including their mouse trajectories, and to devise an IR tool that can summarize such diversified data. The results of an experiment are also scrutinized to provide an analysis of the relationship between learners’ activities and their study logs.

  12. 77 FR 74278 - Proposed Information Collection (Internet Student CPR Web Registration Application); Comment Request

    Science.gov (United States)

    2012-12-13

    ... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF VETERANS AFFAIRS Proposed Information Collection (Internet Student CPR Web Registration Application); Comment... use of other forms of information technology. Title: Internet Student CPR Web Registration Application...

  13. ScolioMedIS: web-oriented information system for idiopathic scoliosis visualization and monitoring.

    Science.gov (United States)

    Devedžić, Goran; Cuković, Saša; Luković, Vanja; Milošević, Danijela; Subburaj, K; Luković, Tanja

    2012-11-01

    Adolescent idiopathic scoliosis is the most common type of abnormal curvature observed in the spine, and it progresses rapidly during puberty. The most widely followed clinical method of assessing the spinal deformity is subjective: it measures the characteristic angles of the spinal curve from a set of radiographic images. This paper presents a web-based information system (called ScolioMedIS) based on parameterized 3D anatomical models of the spine to quantitatively assess the deformity and to minimize the amount of radiation exposure by reducing the number of radiographs required. The main components of the system are a 3D parametric solid model of the spine, back surfaces, relevant clinical information and a scoliosis ontology. The patient-specific spine model is regenerated from the parametric model and surface data using anatomical information extracted from radiographic images. The system is designed to take inherent advantage of the Web for facilitating multi-center data collection and collaborative clinical decisions. The preliminary analysis of patient data showed promising results, which involve an improved documentation standard, a clinical decision knowledge base record, facilitated exchange and retrieval of medical data between institutions in multi-center clinical studies, 3D visualization of spinal deformity, and permanent monitoring of treatments. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  14. Impact of the web on citation and information-seeking behaviour of academics

    OpenAIRE

    2012-01-01

    D.Litt. et Phil. This study investigated the impact of the Web on the information-seeking and citation behaviour of Unisa academics. The research study was executed in two phases. Phase 1 consisted of a Web citation analysis and phase 2 a questionnaire. Phase 1 explored how the availability of Web information resources affected the scholarly citation behaviour of Unisa academics by determining the relationship between Web-based references and non-Web-based references in the reference lists...

  15. Quality of Web-based information on obsessive compulsive disorder

    Directory of Open Access Journals (Sweden)

    Klila H

    2013-11-01

    Hedi Klila,1 Anne Chatton,2 Ariane Zermatten,2 Riaz Khan,2 Martin Preisig,1,3 Yasser Khazaal2,4 1Department of Psychiatry, Lausanne University Hospital, Lausanne, Switzerland; 2Department of Mental Health and Psychiatry, Geneva University Hospitals, Geneva, Switzerland; 3Lausanne University, Lausanne, Switzerland; 4Geneva University, Geneva, Switzerland Background: The Internet is increasingly used as a source of information for mental health issues. The burden of obsessive compulsive disorder (OCD) may lead persons with diagnosed or undiagnosed OCD, and their relatives, to search for good quality information on the Web. This study aimed to evaluate the quality of Web-based information on English-language sites dealing with OCD and to compare the quality of websites found through a general and a medically specialized search engine. Methods: Keywords related to OCD were entered into Google and OmniMedicalSearch. Websites were assessed on the basis of accountability, interactivity, readability, and content quality. The "Health on the Net" (HON) quality label and the Brief DISCERN scale score were used as possible content quality indicators. Of the 235 links identified, 53 websites were analyzed. Results: The content quality of the OCD websites examined was relatively good. The use of a specialized search engine did not offer an advantage in finding websites with better content quality. A score ≥16 on the Brief DISCERN scale is associated with better content quality. Conclusion: This study shows the acceptability of the content quality of OCD websites. There is no advantage in searching for information with a specialized search engine rather than a general one. Practical implications: The Internet offers a number of high quality OCD websites. It remains critical, however, to have a provider–patient talk about the information found on the Web. Keywords: Internet, quality indicators, anxiety disorders, OCD, search engine

  16. A Java-based multi-institutional medical information retrieval system.

    Science.gov (United States)

    Wang, K; van Wingerde, F J; Bradshaw, K; Szolovits, P; Kohane, I

    1997-01-01

    JAMI (Java-based Agglutination of Medical Information) is designed as a framework for integrating heterogeneous information systems used in healthcare-related institutions. It is one of the implementations under the W3-EMRS project, which is aimed at using the World Wide Web (Web) to unify different hospital information systems. JAMI inherited several design decisions from the first W3-EMRS implementation, including using the Web as the communication infrastructure and HL7 as the communication protocol between the heterogeneous systems and the W3-EMRS systems. In addition, JAMI incorporates the growing Java technologies and has a more flexible and efficient architecture. This paper describes JAMI's architecture and implementation. It also presents two instances of JAMI, one for the integration of different hospital information systems and another for the integration of two heterogeneous systems within a single hospital. Some important issues for the further development of JAMI, including security and confidentiality, data input and decision support, are discussed.

  17. Functional Requirements for Information Resource Provenance on the Web

    Energy Technology Data Exchange (ETDEWEB)

    McCusker, James P.; Lebo, Timothy; Graves, Alvaro; Difranzo, Dominic; Pinheiro da Silva, Paulo; McGuinness, Deborah L.

    2012-06-19

    We provide a means to formally explain the relationship between HTTP URLs and the representations returned when they are requested. According to existing World Wide Web architecture, the URL serves as an identifier for a semiotic referent while the document returned via HTTP serves as a representation of the same referent. This begins with two sides of a semiotic triangle; the third side is the relationship between the URL and the representation received. We complete this description by extending the library science resource model Functional Requirements for Bibliographic Resources (FRBR) with cryptographic message and content digests to create a Functional Requirements for Information Resources (FRIR). We show how applying the FRIR model to HTTP GET and POST transactions disambiguates the many relationships between a given URL and all representations received from its request, provides fine-grained explanations that are complementary to existing explanations of web resources, and integrates easily into the emerging W3C provenance standard.
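
    The idea of telling apart the representations returned for one URL by their cryptographic digests can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say which hash function or which canonicalization FRIR uses, so SHA-256, the header normalization, and the function names `content_digest` and `message_digest` are all hypothetical choices made here.

```python
import hashlib


def content_digest(body: bytes) -> str:
    """Digest of the payload bytes only (what was said, regardless of framing)."""
    return hashlib.sha256(body).hexdigest()


def message_digest(headers: dict, body: bytes) -> str:
    """Digest of headers plus payload (the specific message that was received).

    Header names are lowercased and sorted so that ordering differences do not
    change the digest. This normalization is an assumption of this sketch.
    """
    h = hashlib.sha256()
    for name in sorted(headers, key=str.lower):
        h.update(f"{name.lower()}:{headers[name].strip()}\n".encode("utf-8"))
    h.update(b"\n")
    h.update(body)
    return h.hexdigest()


# Two hypothetical responses for the same URL: identical payload, different headers.
body = b"<html><body>Hello</body></html>"
first = {"Content-Type": "text/html", "Date": "Mon, 18 Jun 2012 10:00:00 GMT"}
second = {"Content-Type": "text/html", "Date": "Tue, 19 Jun 2012 10:00:00 GMT"}

same_content = content_digest(body) == content_digest(body)
same_message = message_digest(first, body) == message_digest(second, body)
print(same_content, same_message)
```

    Under these assumptions, the two responses share a content digest but differ in message digest, which is the kind of distinction between levels of "the same representation" that the abstract attributes to the digest-based extension.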

  18. Applying Information-Retrieval Methods to Software Reuse: A Case Study.

    Science.gov (United States)

    Stierna, Eric J.; Rowe, Neil C.

    2003-01-01

    Discusses reuse of existing software for new purposes as a key aspect of efficient software engineering by matching formal written requirements used to define the new and the old software. Explores two matching methodologies that use information retrieval techniques and describes test results from a comparison of two military systems. (Author/LRW)

  19. On Using Genetic Algorithms for Multimodal Relevance Optimization in Information Retrieval.

    Science.gov (United States)

    Boughanem, M.; Christment, C.; Tamine, L.

    2002-01-01

    Presents a genetic relevance optimization process performed in an information retrieval system that uses genetic techniques for solving multimodal problems (niching) and query reformulation techniques. Explains that the niching technique allows the process to reach different relevance regions of the document space, and that query reformulations…

  20. System Scope for Library Automation and Generalized Information Storage and Retrieval at Stanford University.

    Science.gov (United States)

    Cady, Glee; And Others

    The scope of a manual-automated system serving the 40 libraries and the teaching and research community of Stanford University is defined. Also defined are the library operations to be supported and the bibliographic information storage and retrieval capabilities to be provided in the system. Two major projects have been working jointly on library…

  1. Proceedings of the 9th Dutch-Belgian Information Retrieval Workshop

    NARCIS (Netherlands)

    Aly, Robin; Hauff, C.; den Hamer, Ida; Hiemstra, Djoerd; Huibers, Theo W.C.; de Jong, Franciska M.G.

    Welcome to the 9th Dutch-Belgian Information Retrieval Workshop (DIR). I very well remember the DIR workshop in 2001 that was also organized in Twente. It took place exactly one day before my PhD defense, to give us the opportunity to have one of the PhD committee members, Stephen Robertson, as the

  2. An Introduction to Genetic Algorithms and to Their Use in Information Retrieval.

    Science.gov (United States)

    Jones, Gareth; And Others

    1994-01-01

    Genetic algorithms, a class of nondeterministic algorithms in which the role of chance makes the precise nature of a solution impossible to guarantee, seem to be well suited to combinatorial-optimization problems in information retrieval. Provides an introduction to techniques and characteristics of genetic algorithms and illustrates their…

  3. TRECVID: evaluating the effectiveness of information retrieval tasks on digital video

    NARCIS (Netherlands)

    Smeaton, A.F.; Over, P.; Kraaij, W.

    2004-01-01

    TRECVID is an annual exercise which encourages research in information retrieval from digital video by providing a large video test collection, uniform scoring procedures, and a forum for organizations interested in comparing their results. TRECVID benchmarking covers both interactive and manual

  4. Degree of Agreement in Naming Objects and Concepts for Information Retrieval.

    Science.gov (United States)

    Collantes, Lourdes Y.

    1995-01-01

    Discussion of users and information retrieval systems highlights a study that investigated the representation of users' knowledge by examining their names for objects and concepts, agreement on names, the Library of Congress Subject Headings representation for similar objects and concepts, and measurement of the similarity between these…

  5. Making Explicit the Formalism Underlying Evaluation in Music Information Retrieval Research

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2014-01-01

    We make explicit the formalism underlying evaluation in music information retrieval research. We define a ``system,'' what it means to ``analyze'' one, and make clear the aims, parts, design, execution, interpretation, assumptions and limitations of its ``evaluation.'' We apply this formalism...

  6. Overview of the Living Labs for Information Retrieval Evaluation (LL4IR) CLEF Lab 2015

    NARCIS (Netherlands)

    Schuth, A.; Balog, K.; Kelly, L.; Mothe, J.; Savoy, J.; Kamps, J.; Pinel-Sauvagnat, K.; Jones, G.J.F.; SanJuan, E.; Cappellato, L.; Ferro, N.

    2015-01-01

    In this paper we report on the first Living Labs for Information Retrieval Evaluation (LL4IR) CLEF Lab. Our main goal with the lab is to provide a benchmarking platform for researchers to evaluate their ranking systems in a live setting with real users in their natural task environments. For this

  7. Adaptation of Statistical Machine Translation Model for Cross-Lingual Information Retrieval in a Service Context

    NARCIS (Netherlands)

    Nikoulina, V.; Kovachev, B.; Lagos, N.; Monz, C.

    2012-01-01

    This work proposes to adapt an existing general SMT model for the task of translating queries that are subsequently going to be used to retrieve information from a target language collection. In the scenario that we focus on access to the document collection itself is not available and changes to

  8. FIRES: Fire Information Retrieval and Evaluation System - A program for fire danger rating analysis

    Science.gov (United States)

    Patricia L. Andrews; Larry S. Bradshaw

    1997-01-01

    A computer program, FIRES: Fire Information Retrieval and Evaluation System, provides methods for evaluating the performance of fire danger rating indexes. The relationship between fire danger indexes and historical fire occurrence and size is examined through logistic regression and percentiles. Historical seasonal trends of fire danger and fire occurrence can be...

  9. FedWeb Greatest Hits: Presenting the New Test Collection for Federated Web Search

    NARCIS (Netherlands)

    Demeester, Thomas; Trieschnigg, Rudolf Berend; Zhou, Ke; Nguyen, Dong-Phuong; Hiemstra, Djoerd

    This paper presents 'FedWeb Greatest Hits', a large new test collection for research in web information retrieval. As a combination and extension of the datasets used in the TREC Federated Web Search Track, this collection opens up new research possibilities on federated web search challenges, as

  10. Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus

    Directory of Open Access Journals (Sweden)

    Çağdaş Çapkın

    2016-12-01

    Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate the impact of these contents on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with the default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate the performance of these three systems, “precision - recall” and “normalized recall” tests were conducted. Experimental findings showed that there were no significant differences between MIR and FIR in mean average precision (MAP) performance. On the other hand, MAP performance of HIR was significantly higher in comparison to MIR and FIR. When information retrieval performance was evaluated as user-centered, the “normalized recall” performances of MIR and HIR were significantly higher than FIR. Additionally, there were no significant differences between the systems in retrieved relevant document means. Processing different types of contents such as metadata and full-text had some advantages and disadvantages for information retrieval systems in terms of term management. The advantages were brought together in hybrid content processing (HIR), and information retrieval performance improved.
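
    Mean average precision, the measure used above to compare the systems, can be computed as in the following minimal sketch. It uses the standard textbook definition; the document identifiers and relevance judgments are invented, and nothing here reproduces the study's data or its Lucene setup.

```python
def average_precision(ranked, relevant):
    """Average of the precision values at each rank where a relevant document appears.

    `ranked` is a system's result list in rank order; `relevant` is the set of
    documents judged relevant for the query. Relevant documents that are never
    retrieved contribute zero, because the sum is divided by len(relevant).
    """
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank  # precision at this rank
    return total / len(relevant)


def mean_average_precision(runs):
    """Mean of average precision over (ranked list, relevant set) pairs, one per query."""
    return sum(average_precision(ranked, rel) for ranked, rel in runs) / len(runs)


# Invented judgments for two queries.
runs = [
    (["a", "b", "c"], {"a", "c"}),  # AP = (1/1 + 2/3) / 2
    (["x", "y"], {"y"}),            # AP = (1/2) / 1
]
print(round(mean_average_precision(runs), 4))
```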

  11. Internet use in pregnancy informs women's decision making: a web-based survey.

    Science.gov (United States)

    Lagan, Briege M; Sinclair, Marlene; Kernohan, W George

    2010-06-01

    Internet access and usage is almost ubiquitous, providing new opportunities and increasing challenges for health care practitioners and users. With pregnant women reportedly turning to the Internet for information during pregnancy, a better understanding of this behavior is needed. The objective of this study was to ascertain why and how pregnant women use the Internet as a health information source, and the overall effect it had on their decision making. Kuhlthau's (1993) information-seeking model was adapted to provide the underpinning theoretical framework for the study. The design was exploratory and descriptive. Data were collected using a valid and reliable web-based questionnaire. Over a 12-week period, 613 women from 24 countries who had confirmed that they had used the Internet for pregnancy-related information during their pregnancy completed and submitted a questionnaire. Most women (97%) used search engines such as Google to identify online web pages to access a large variety of pregnancy-related information and to use the Internet for pregnancy-related social networking, support, and electronic commerce (i.e., e-commerce). Almost 94 percent of women used the Internet to supplement information already provided by health professionals and 83 percent used it to influence their pregnancy decision making. Nearly half of the respondents reported dissatisfaction with information given by health professionals (48.6%) and lack of time to ask health professionals questions (46.5%) as key factors influencing them to access the Internet. Statistically, women's confidence levels significantly increased with respect to making decisions about their pregnancy after Internet usage (p < 0.05). In this study, the Internet played a significant part in the respondents' health information seeking and decision making in pregnancy. Health professionals need to be ready to support pregnant women in online data retrieval, interpretation, and application.

  12. The role of automated categorization in e-government information retrieval

    DEFF Research Database (Denmark)

    Jonasen, Tanja Svarre; Lykke, Marianne

    2013-01-01

    categorization can enhance information organization and retrieval and presents the results of a controlled evaluation that compared automated categorization and free text indexing of the government intranet used by Danish tax authorities. Thirty-two individuals participated in the evaluation, conducting...... knowledge was present, categorization was used to support the assumptions of a correct search. On the other hand, however, test participants avoided using automated categorization if high-precision documents were among the top results or if few documents were retrieved. The findings emphasize the importance...

  13. MAC Protocols for Optimal Information Retrieval Pattern in Sensor Networks with Mobile Access

    Directory of Open Access Journals (Sweden)

    Dong Min

    2005-01-01

    In signal field reconstruction applications of sensor networks, the locations from which the measurements are retrieved affect the reconstruction performance. In this paper, we consider the design of medium access control (MAC) protocols in sensor networks with mobile access for the desirable information retrieval pattern to minimize the reconstruction distortion. Taking both performance and implementation complexity into consideration, besides the optimal centralized scheduler, we propose three decentralized MAC protocols, namely, decentralized scheduling through carrier sensing, Aloha scheduling, and adaptive Aloha scheduling. Design parameters for the proposed protocols are optimized. Finally, performance comparison among these protocols is provided via simulations.

  14. Collect Meaningful Information about Stock Markets from the Web

    Directory of Open Access Journals (Sweden)

    Saleem Abuleil

    2015-02-01

    Events represent a significant source of information on the web; they deliver information about events that occur around the world in all subjects and areas. These events can be collected and organized to provide valuable and useful information for decision makers, researchers, as well as for any person seeking knowledge. In this paper, we discuss ongoing research that targets the stock market domain to observe and record changes (events) when they happen, collect them, understand the meaning of each one of them, and organize the information along with its meaning in a well-structured format. Using the Semantic Role Labeling (SRL) technique, we identify four factors for each event: the verb of action and the three roles associated with it, namely entity name, attribute, and attribute value. We have generated a set of rules and techniques to support our approach to analyze and understand the meaning of the events that take place in stock markets.

  15. Review of extracting information from the Social Web for health personalization.

    Science.gov (United States)

    Fernandez-Luque, Luis; Karlsen, Randi; Bonander, Jason

    2011-01-28

    In recent years the Web has come into its own as a social platform where health consumers are actively creating and consuming Web content. Moreover, as the Web matures, consumers are gaining access to personalized applications adapted to their health needs and interests. The creation of personalized Web applications relies on extracted information about the users and the content to personalize. The Social Web itself provides many sources of information that can be used to extract information for personalization apart from traditional Web forms and questionnaires. This paper provides a review of different approaches for extracting information from the Social Web for health personalization. We reviewed research literature across different fields addressing the disclosure of health information in the Social Web, techniques to extract that information, and examples of personalized health applications. In addition, the paper includes a discussion of technical and socioethical challenges related to the extraction of information for health personalization.

  16. Effects of Surrounding Information and Line Length on Text Comprehension from the Web

    Directory of Open Access Journals (Sweden)

    Jess McMullin

    2002-02-01

    The World Wide Web (Web) is becoming a popular medium for transmission of information and online learning. We need to understand how people comprehend information from the Web to design Web sites that maximize the acquisition of information. We examined two features of Web page design that are easily modified by developers, namely line length and the amount of surrounding information, or whitespace. Undergraduate university student participants read text and answered comprehension questions on the Web. Comprehension was affected by whitespace; participants had better comprehension for information surrounded by whitespace than for information surrounded by meaningless information. Participants were not affected by line length. These findings demonstrate that reading from the Web is not the same as reading print and have implications for instructional Web design.

  17. A Holistic, Similarity-Based Approach for Personalized Ranking in Web Databases

    Science.gov (United States)

    Telang, Aditya

    2011-01-01

    With the advent of the Web, the notion of "information retrieval" has acquired a completely new connotation and currently encompasses several disciplines ranging from traditional forms of text and data retrieval in unstructured and structured repositories to retrieval of static and dynamic information from the contents of the surface and deep Web.…

  18. Application of information theory methods to food web reconstruction

    Science.gov (United States)

    Moniz, L.J.; Cooch, E.G.; Ellner, S.P.; Nichols, J.D.; Nichols, J.M.

    2007-01-01

    In this paper we use information theory techniques on time series of abundances to determine the topology of a food web. At the outset, the food web participants (two consumers, two resources) are known; in addition we know that each consumer prefers one of the resources over the other. However, we do not know which consumer prefers which resource, and if this preference is absolute (i.e., whether or not the consumer will consume the non-preferred resource). Although the consumers and resources are identified at the beginning of the experiment, we also provide evidence that the consumers are not resources for each other, and the resources do not consume each other. We do show that there is significant mutual information between resources; the model is seasonally forced and some shared information between resources is expected. Similarly, because the model is seasonally forced, we expect shared information between consumers as they respond to the forcing of the resources. The model that we consider does include noise, and in an effort to demonstrate that these methods may be of some use in other than model data, we show the efficacy of our methods with decreasing time series size; in this particular case we obtain reasonably clear results with a time series length of 400 points. This approaches ecological time series lengths from real systems.
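
    The mutual information between two abundance time series, the quantity the abstract relies on, can be estimated with a simple histogram (plug-in) estimator, sketched below. The equal-width binning, the default of four bins, and the function names are assumptions of this sketch; the abstract does not specify the estimator the authors used, and plug-in estimates are known to be biased for short series.

```python
import math
from collections import Counter


def discretize(series, bins):
    """Map each value to an equal-width bin index in the range 0..bins-1."""
    lo, hi = min(series), max(series)
    if hi == lo:
        return [0] * len(series)  # constant series: everything in one bin
    width = (hi - lo) / bins
    return [min(int((x - lo) / width), bins - 1) for x in series]


def mutual_information(x, y, bins=4):
    """Plug-in estimate of I(X;Y) in bits from two equally long time series."""
    if len(x) != len(y):
        raise ValueError("series must have the same length")
    dx, dy = discretize(x, bins), discretize(y, bins)
    n = len(dx)
    joint = Counter(zip(dx, dy))
    px, py = Counter(dx), Counter(dy)
    mi = 0.0
    for (a, b), count in joint.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) ), written with raw counts
        mi += (count / n) * math.log2(count * n / (px[a] * py[b]))
    return mi


# Toy series: a four-level periodic signal compared with itself and with a constant.
signal = [0, 1, 2, 3] * 25
print(mutual_information(signal, signal), mutual_information(signal, [5] * 100))
```

    In this toy case a series shares all of its information with itself and none with a constant series; with real abundance data, shared seasonal forcing of the kind the abstract mentions would show up as nonzero mutual information even between series that do not interact directly.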

  19. Build, Buy, Open Source, or Web 2.0?: Making an Informed Decision for Your Library

    Science.gov (United States)

    Fagan, Jody Condit; Keach, Jennifer A.

    2010-01-01

    When improving a web presence, today's libraries have a choice: using a free Web 2.0 application, opting for open source, buying a product, or building a web application. This article discusses how to make an informed decision for one's library. The authors stress that deciding whether to use a free Web 2.0 application, to choose open source, to…

  20. Three types of children’s informational web sites: an inventory of design conventions

    NARCIS (Netherlands)

    Jochmann-Mannak, Hanna; Lentz, Leo; Huibers, Theo W.C.; Sanders, Ted

    "Purpose: Research on Web design conventions has an almost exclusive focus on Web design for adults. There is far less knowledge about Web design for children. For the first time, an overview is presented of the current design conventions for children's informational Web sites. Method: In this study

  1. Design of a web portal for interdisciplinary image retrieval from multiple online image resources.

    Science.gov (United States)

    Kammerer, F J; Frankewitsch, T; Prokosch, H-U

    2009-01-01

    Images play an important role in medicine. Finding the desired images within the multitude of online image databases is a time-consuming and frustrating process. Existing websites do not meet all the requirements for an ideal learning environment for medical students. This work intends to establish a new web portal providing a centralized access point to a selected number of online image databases. A back-end system locates images on given websites and extracts relevant metadata. The images are indexed using UMLS and the MetaMap system provided by the US National Library of Medicine. Specially developed functions allow users to create individual navigation structures. The front-end system suits the specific needs of medical students. A navigation structure consisting of several medical fields, university curricula and the ICD-10 was created. The images may be accessed via the given navigation structure or using different search functions. Cross-references are provided by the semantic relations of the UMLS. Over 25,000 images were identified and indexed. A pilot evaluation among medical students showed good first results concerning the acceptance of the developed navigation structures and search features. The integration of the images from different sources into the UMLS semantic network offers a quick and easy-to-use learning environment.

  2. Quality of health information on acute myocardial infarction and stroke in the world wide web.

    Science.gov (United States)

    Bastos, Ana; Paiva, Dagmara; Azevedo, Ana

    2014-01-01

    The quality of health information on the Internet may be low. This is a concerning issue in cardiovascular diseases, which warrant patient self-management. We aimed to assess the quality of Portuguese websites as a source of health information on acute myocardial infarction and stroke. We used the search terms 'enfarte miocardio' and 'acidente vascular cerebral' (Portuguese terms for myocardial infarction and stroke) on Google®, on April 5th and 7th 2011, respectively, using Internet Explorer®. The first 200 URLs retrieved in each search were independently visited and Portuguese websites in Portuguese language were selected. We analysed and classified 121 websites for structural characteristics, information coverage and accuracy of the web pages with items defined a priori, trustworthiness in general according to the Health on the Net Foundation and regarding treatments using the DISCERN instrument (48 websites). Websites were most frequently commercial (49.5%), not exclusively dedicated to acute myocardial infarction/stroke (94.2%), and with information on medical facts (59.5%), using images, video or animation (60.3%). Websites' trustworthiness was low. None of the websites displayed the Health on the Net Foundation seal. Acute myocardial infarction/stroke websites differed in information coverage but the accuracy of the information was acceptable, although often incomplete. The quality of information on acute myocardial infarction/stroke in Portuguese websites was acceptable. Trustworthiness was low, impairing users' capability of identifying potentially more reliable content.

  3. Information retrieval for the Cochrane systematic reviews: the case of breast cancer surgery

    Directory of Open Access Journals (Sweden)

    Gaetana Cognetti

    2015-03-01

    Full Text Available Introduction. Systematic reviews are fundamental sources of knowledge on the state-of-the-art interventions for various clinical problems. One of the essential components in carrying out a systematic review is that of developing a comprehensive literature search. Materials and methods. Three Cochrane systematic reviews published in 2012 were retrieved using the MeSH descriptor breast neoplasms/surgery, and analyzed with respect to the information sources used and the search strategies adopted. In March 2014, an update of one of the reviews retrieved was also considered in the study. Results. The number of databases queried for each review ranged between three and seven. All the reviews reported the search strategies adopted, however some only partially. All the reviews explicitly claimed that the searches applied no language restriction although sources such as the free database Lilacs (in Spanish and Portuguese) were not consulted. Conclusion. To improve the quality it is necessary to apply standards in carrying out systematic reviews (as laid down in the MECIR project). To meet these standards concerning literature searching, professional information retrieval specialist staff should be involved. The peer review committee in charge of evaluating the publication of a systematic review should also include specialists in information retrieval for assessing the quality of the literature search.

  4. Information Organisation Practices on the Web: Tagging and the Social Organisation of Information

    OpenAIRE

    Kipp, Margaret E. I.

    2009-01-01

    This talk (the public talk for my thesis) examines the phenomenon of social tagging from its early beginnings to its current level of prominence on a wide variety of websites in a series of linked studies examining the structures and patterns of tag term use to determine whether regular patterns appear that would support information organisation and retrieval.

  5. Citation Index: an indispensable information retrieval tool for research and evaluation

    OpenAIRE

    Kademani, B. S.; Vijai Kumar, *

    2002-01-01

    This paper highlights the information explosion, the need for bibliographic control, and the need for information retrieval tools. It explains the emergence of the Citation Index, the concept of citation indexing, reasons for citing, its structure (print and electronic versions of the Science Citation Index and the Social Science Citation Index), and the application of citation indexes. It also discusses the search effectiveness, factors taken into consideration for coverage of journals in citation indexes, Journal Cita...

  6. WEB STRUCTURE MINING USING PAGERANK, IMPROVED PAGERANK – AN OVERVIEW

    Directory of Open Access Journals (Sweden)

    V. Lakshmi Praba

    2011-03-01

    Full Text Available Web Mining is the extraction of interesting and potentially useful patterns and information from the Web. It covers Web documents, hyperlinks between documents, and usage logs of web sites. The significant tasks for web mining can be listed as Information Retrieval, Information Selection / Extraction, Generalization and Analysis. Web information retrieval tools consider only the text on pages and ignore information in the links. The goal of Web structure mining is to explore a structural summary of the web. Web structure mining, which focuses on link information, is an important aspect of web data. This paper presents an overview of PageRank, Improved PageRank and their working functionality in web structure mining.
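
    For orientation, the link-based ranking that the abstract above surveys can be sketched as a power iteration over a small link graph. This is a minimal illustration of the standard PageRank formulation with a damping factor of 0.85 (a commonly used value); it does not reproduce the "Improved PageRank" variant discussed in the paper, and the graph, function name and parameters are assumptions made for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict mapping page -> list of outlinks."""
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without outlinks is spread evenly over all pages.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new = {p: (1 - damping) / n + damping * dangling / n for p in pages}
        for page, targets in links.items():
            for target in targets:
                # Each page splits its rank equally among its outlinks.
                new[target] += damping * rank[page] / len(targets)
        rank = new
    return rank

# Hypothetical four-page web: "c" is linked from "a", "b" and "d".
graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(graph)
best = max(ranks, key=ranks.get)
```

    In this toy graph the page with the most incoming links ends up with the highest score, and the scores sum to one, which shows how link structure alone, independent of page text, can drive a ranking.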

  7. A Web-Based Information System for Field Data Management

    Science.gov (United States)

    Weng, Y. H.; Sun, F. S.

    2014-12-01

    A web-based field data management system has been designed and developed to allow field geologists to store, organize, manage, and share field data online. System requirements were analyzed and clearly defined first regarding what data are to be stored, who the potential users are, and what system functions are needed in order to deliver the right data in the right way to the right user. A 3-tiered architecture was adopted to create this secure, scalable system, which consists of a web browser at the front end, a database at the back end, and a functional logic server in the middle. Specifically, HTML, CSS, and JavaScript were used to implement the user interface in the front-end tier, the Apache web server runs PHP scripts in the middle tier, and a MySQL server is used for the back-end database. The system accepts various types of field information, including image, audio, video, numeric, and text. It allows users to select data and display them on either Google Earth or Google Maps to examine their spatial relations. It also makes the sharing of field data easy by converting them into XML format that is both human-readable and machine-readable, and thus ready for reuse.
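
    The XML export step mentioned in the abstract above can be pictured with a short sketch. The system described uses PHP and MySQL; the Python code below is only an illustration of the idea, and the record fields, element names and function name are hypothetical rather than taken from the system.

```python
import xml.etree.ElementTree as ET

def record_to_xml(record):
    """Serialize a flat dict describing one field observation to an XML string.

    Keys are used as element names, so they must be valid XML names.
    """
    root = ET.Element("fieldRecord")
    for key, value in record.items():
        child = ET.SubElement(root, key)
        child.text = str(value)
    return ET.tostring(root, encoding="unicode")

# Hypothetical field observation.
sample = {
    "station": "ST-01",
    "latitude": 42.45,
    "longitude": -79.33,
    "note": "Shale outcrop, bedding dips gently south",
}
xml_text = record_to_xml(sample)

# The same text can be parsed back by any XML-aware tool.
parsed = ET.fromstring(xml_text)
```

    Because the output is plain tagged text, it can be read by a person and parsed again by a program, which is the reuse property the abstract points to.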

  8. Using open-source programs to create a web-based portal for hydrologic information

    Science.gov (United States)

    Kim, H.

    2013-12-01

    Some hydrologic data sets, such as basin climatology, precipitation, and terrestrial water storage, are not easily obtainable and distributable due to their size and complexity. We present a Hydrologic Information Portal (HIP) that has been implemented at the University of California Center for Hydrologic Modeling (UCCHM) and that has been organized around the large river basins of North America. This portal can be easily accessed through a modern web browser that enables easy access and visualization of such hydrologic data sets. Some of the main features of our HIP include a set of data visualization features so that users can search, retrieve, analyze, integrate, organize, and map data within large river basins. Recent information technologies such as Google Maps, Tornado (Python asynchronous web server), NumPy/SciPy (Scientific Library for Python) and d3.js (Visualization library for JavaScript) were incorporated into the HIP to create ease in navigating large data sets. With such open source libraries, HIP can give public users a way to combine and explore various data sets by generating multiple chart types (Line, Bar, Pie, Scatter plot) directly from the Google Maps viewport. Every rendered object such as a basin shape on the viewport is clickable, and this is the first step to access the visualization of data sets.

  9. Design and Implementation of Domain based Semantic Hidden Web Crawler

    OpenAIRE

    Manvi; Bhatia, Komal Kumar; Dixit, Ashutosh

    2015-01-01

    The Web is a wide term which mainly consists of the surface web and the hidden web. One can easily access the surface web using traditional web crawlers, but they are not able to crawl the hidden portion of the web. These traditional crawlers retrieve content from web pages that are linked by hyperlinks, ignoring the information hidden behind form pages, which cannot be extracted using simple hyperlink structure. Thus, they ignore a large amount of data hidden behind search forms. This paper emphasizes o...

  10. Communicating climate change adaptation information using web-based platforms

    Science.gov (United States)

    Karali, Eleni; Mattern, Kati

    2017-07-01

    To facilitate progress in climate change adaptation policy and practice, it is important not only to ensure the production of accurate, comprehensive and relevant information, but also the easy, timely and affordable access to it. This can contribute to better-informed decisions and improve the design and implementation of adaptation policies and other relevant initiatives. Web-based platforms can play an important role in communicating and distributing data, information and knowledge that become constantly available, reaching out to a large group of potential users. Indeed in the last decade there has been an extensive increase in the number of platforms developed for this purpose in many fields including climate change adaptation. This short paper concentrates on the web-based adaptation platforms developed in Europe. It provides an overview of the recently emerged landscape, examines the basic characteristics of a set of platforms that operate at national, transnational and European level, and discusses some of the key challenges related to their development, maintenance and overall management. Findings presented in this short paper are discussed in greater detail in the European Environment Agency Technical Report "Overview of climate change adaptation platforms in Europe".

  11. Multimedia Retrieval

    NARCIS (Netherlands)

    Blanken, Henk; de Vries, A.P.; Blok, H.E.; Feng, L.

    2007-01-01

    Retrieval of multimedia data is different from retrieval of structured data. A key problem in multimedia databases is search, and the proposed solutions to the problem of multimedia information retrieval span a rather wide spectrum of topics outside the traditional database area, ranging from

  12. Information organization on the web site of medical library

    Directory of Open Access Journals (Sweden)

    Marjeta Oven

    2005-01-01

    Full Text Available A library home page should be a document providing information about the library and its various services and activities for users. The purpose of the present research was to evaluate the current state of the home pages of Slovene medical libraries, and a set of suggestions is offered for their improvement. The results of the research, based on a sample of medical libraries pertaining to the Ljubljana medical circle, indicate that the possibilities offered by databases accessible on the internet are unutilized. Medical libraries mostly use their web sites to present their activities in a fairly rigid and unchanging format, including general information on the library, e-mail and mailing addresses, and library regulations, but without much interactive information. The article indicates the necessity of improving the present state and offers some advice on how to achieve a positive change.

  13. The invisible Web uncovering information sources search engines can't see

    CERN Document Server

    Sherman, Chris

    2001-01-01

    Enormous expanses of the Internet are unreachable with standard web search engines. This book provides the key to finding these hidden resources by identifying how to uncover and use invisible web resources. Mapping the invisible Web, when and how to use it, assessing the validity of the information, and the future of Web searching are topics covered in detail. Only 16 percent of Net-based information can be located using a general search engine. The other 84 percent is what is referred to as the invisible Web, made up of information stored in databases. Unlike pages on the visible Web, informa

  14. Information-Seeking Behaviors of Medical Students: A Cross-Sectional Web-Based Survey.

    Science.gov (United States)

    O'Carroll, Aoife Marie; Westby, Erin Patricia; Dooley, Joseph; Gordon, Kevin E

    2015-06-29

    Medical students face an information-rich environment in which retrieval and appraisal strategies are increasingly important. To describe medical students' current pattern of health information resource use and characterize their experience of instruction on information search and appraisal. We conducted a cross-sectional web-based survey of students registered in the four-year MD Program at Dalhousie University (Halifax, Nova Scotia, and Saint John, New Brunswick, sites), Canada. We collected self-reported data on information-seeking behavior, instruction, and evaluation of resources in the context of their medical education. Data were analyzed using descriptive statistics. Surveys were returned by 213 of 462 eligible students (46.1%). Most respondents (165/204, 80.9%) recalled receiving formal instruction regarding information searches, but this seldom included nontraditional tools such as Google (23/107, 11.1%), Wikipedia, or social media. In their daily practice, however, they reported heavy use of these tools, as well as EBM summaries. Accessibility, understandability, and overall usefulness were common features of highly used resources. Students identified challenges managing information and/or resource overload and source accessibility. Medical students receive instruction primarily on searching and assessing primary medical literature. In their daily practice, however, they rely heavily on nontraditional tools as well as EBM summaries. Attention to appropriate use and appraisal of nontraditional sources might enhance the current EBM curriculum.

  15. KAGIANA: An Excel-Based Tool for Retrieving Summary Information on Arabidopsis Genes

    Science.gov (United States)

    Ogata, Yoshiyuki; Sakurai, Nozomu; Aoki, Koh; Suzuki, Hideyuki; Okazaki, Koei; Saito, Kazuki; Shibata, Daisuke

    2009-01-01

    Various public databases provide Arabidopsis gene information via the internet. It is useful to abstract information obtained from such databases. We have developed the KAGIANA tool, which allows a user to retrieve summary information obtained from selective databases and to access pages for a gene of interest in those databases. The tool is based on Microsoft Excel and provides several macro programs for gene expression analyses. It can assist plant biologists in accessing omics information for plant biology. The KAGIANA tool is freely available at http://pmnedo.kazusa.or.jp/kagiana/. PMID:19043069

  16. KAGIANA: an excel-based tool for retrieving summary information on Arabidopsis genes.

    Science.gov (United States)

    Ogata, Yoshiyuki; Sakurai, Nozomu; Aoki, Koh; Suzuki, Hideyuki; Okazaki, Koei; Saito, Kazuki; Shibata, Daisuke

    2009-01-01

    Various public databases provide Arabidopsis gene information via the internet. It is useful to abstract information obtained from such databases. We have developed the KAGIANA tool, which allows a user to retrieve summary information obtained from selective databases and to access pages for a gene of interest in those databases. The tool is based on Microsoft Excel and provides several macro programs for gene expression analyses. It can assist plant biologists in accessing omics information for plant biology. The KAGIANA tool is freely available at http://pmnedo.kazusa.or.jp/kagiana/.

  17. The use of ICF codes for information retrieval in rehabilitation research: an empirical study.

    Science.gov (United States)

    Sundar, Vidyalakshmi; Daumen, Marcia E; Conley, Daniel J; Stone, John H

    2008-01-01

    Rehabilitation research information can be obtained from various bibliographic sources. Nevertheless, search strategies and terminologies differ from one database to another, making it challenging for the novice user or users of multiple databases. This paper discusses a novel approach of using the International Classification of Functioning, Disability and Health (ICF) codes to retrieve rehabilitation research information. A crosswalk was created by mapping the Center for International Rehabilitation Research and Information Exchange's (CIRRIE) subject headings to the two-level ICF codes and a search interface was developed (available at: http://cirrie.buffalo.edu/icf/crosswalk.php) so that users can input ICF codes instead of conventional subject headings. About 62% of all CIRRIE subject headings were mapped to equivalent ICF codes. Among the CIRRIE subject headings that were mapped, 43% were mapped to the Environmental Factors, followed by 34% mapped to the Activities and Participation component of the ICF. Although the ICF was not conceived or developed as a system of formal terminology, it can be used effectively for information retrieval in conjunction with an existing vocabulary. This paper describes the first attempt in implementing the use of ICF for information retrieval.

  18. Web-based surveillance of public information needs for informing preconception interventions.

    Directory of Open Access Journals (Sweden)

    Angelo D'Ambrosio

    Full Text Available The risk of adverse pregnancy outcomes can be minimized through the adoption of healthy lifestyles before pregnancy by women of childbearing age. Initiatives for promotion of preconception health may be difficult to implement. Internet can be used to build tailored health interventions through identification of the public's information needs. To this aim, we developed a semi-automatic web-based system for monitoring Google searches, web pages and activity on social networks, regarding preconception health. Based on the American College of Obstetricians and Gynecologists guidelines and on the actual search behaviors of Italian Internet users, we defined a set of keywords targeting preconception care topics. Using these keywords, we analyzed the usage of Google search engine and identified web pages containing preconception care recommendations. We also monitored how the selected web pages were shared on social networks. We analyzed discrepancies between searched and published information and the sharing pattern of the topics. We identified 1,807 Google search queries which generated a total of 1,995,030 searches during the study period. Less than 10% of the reviewed pages contained preconception care information and in 42.8% information was consistent with ACOG guidelines. Facebook was the most used social network for sharing. Nutrition, Chronic Diseases and Infectious Diseases were the most published and searched topics. Regarding Genetic Risk and Folic Acid, a high search volume was not associated with a high web page production, while Medication pages were more frequently published than searched. Vaccinations elicited high sharing although web page production was low; this effect was quite variable in time. Our study represents a resource to prioritize communication on specific topics on the web, to address misconceptions, and to tailor interventions to specific populations.

  19. Web-based surveillance of public information needs for informing preconception interventions.

    Science.gov (United States)

    D'Ambrosio, Angelo; Agricola, Eleonora; Russo, Luisa; Gesualdo, Francesco; Pandolfi, Elisabetta; Bortolus, Renata; Castellani, Carlo; Lalatta, Faustina; Mastroiacovo, Pierpaolo; Tozzi, Alberto Eugenio

    2015-01-01

    The risk of adverse pregnancy outcomes can be minimized through the adoption of healthy lifestyles before pregnancy by women of childbearing age. Initiatives for promotion of preconception health may be difficult to implement. Internet can be used to build tailored health interventions through identification of the public's information needs. To this aim, we developed a semi-automatic web-based system for monitoring Google searches, web pages and activity on social networks, regarding preconception health. Based on the American College of Obstetricians and Gynecologists guidelines and on the actual search behaviors of Italian Internet users, we defined a set of keywords targeting preconception care topics. Using these keywords, we analyzed the usage of Google search engine and identified web pages containing preconception care recommendations. We also monitored how the selected web pages were shared on social networks. We analyzed discrepancies between searched and published information and the sharing pattern of the topics. We identified 1,807 Google search queries which generated a total of 1,995,030 searches during the study period. Less than 10% of the reviewed pages contained preconception care information and in 42.8% information was consistent with ACOG guidelines. Facebook was the most used social network for sharing. Nutrition, Chronic Diseases and Infectious Diseases were the most published and searched topics. Regarding Genetic Risk and Folic Acid, a high search volume was not associated with a high web page production, while Medication pages were more frequently published than searched. Vaccinations elicited high sharing although web page production was low; this effect was quite variable in time. Our study represents a resource to prioritize communication on specific topics on the web, to address misconceptions, and to tailor interventions to specific populations.

  20. Informal Learning through Expertise Mining in the Social Web

    Science.gov (United States)

    Valencia-Garcia, Rafael; Garcia-Sanchez, Francisco; Casado-Lumbreras, Cristina; Castellanos-Nieves, Dagoberto; Fernandez-Breis, Jesualdo Tomas

    2012-01-01

    The advent of Web 2.0, also called the Social Web, has changed the way people interact with the Web. Assisted by the technologies associated with this new trend, users now play a much more active role as content providers. This Web paradigm shift has also changed how companies operate and interact with their employees, partners and customers. The…