WorldWideScience

Sample records for websites information retrieval

  1. Assessing Website quality in context: retrieving information about genetically modified food on the Web

    Directory of Open Access Journals (Sweden)

    Claire R. McInerney

    2005-01-01

    Introduction. Knowing the credibility of information about genetically modified food on the Internet is critical to the everyday life information seeking of consumers as they form opinions about this nascent agricultural technology. The Website Quality Evaluation Tool (WQET) is a valuable instrument that can be used to determine the credibility of Websites on any topic. Method. This study sought to use the WQET to determine the quality of Websites in the context of biotechnology or genetically modified food and to seek one or more easily identified characteristics, such as bias, commitment, use of metatags and site update-access interval (length of time between last update of the site and the date reviewed), that might be used as a quick discriminator of a Website's quality. Analysis. Using SPSS, ANOVA and regression analyses were performed with the website variables of a population of one hundred Websites about genetically modified food. Results. Only the site update-access interval was determined to be a shortcut quality indicator, with an inverse relationship: the longer the interval, the lower the quality score. Conclusion. The study established a model for Website quality evaluation. The update-access interval proved to be the single clear-cut indicator to judge Website quality in everyday information seeking.

  2. Assessing Website Quality in Context: Retrieving Information about Genetically Modified Food on the Web

    Science.gov (United States)

    McInerney, Claire R.; Bird, Nora J.

    2005-01-01

    Introduction: Knowing the credibility of information about genetically modified food on the Internet is critical to the everyday life information seeking of consumers as they form opinions about this nascent agricultural technology. The Website Quality Evaluation Tool (WQET) is a valuable instrument that can be used to determine the credibility of…

  3. Information Classification on University Websites

    DEFF Research Database (Denmark)

    Nawaz, Ather; Clemmensen, Torkil; Hertzum, Morten

    2011-01-01

    Websites are increasingly used as a medium for providing information to university students. The quality of a university website depends on how well the students’ information classification fits with the structure of the information on the website. This paper investigates the information classification of 14 Danish and 14 Pakistani students and compares it with the information classification of their university website. Brainstorming, card sorting, and task exploration activities were used to discover similarities and differences in the participating students’ classification of website information and their ability to navigate the websites. The results of the study indicate group differences in user classification and related task-performance differences. The main implications of the study are that (a) the edit distance appears a useful measure in cross-country HCI research and practice...

  6. Information Classification on University Websites

    DEFF Research Database (Denmark)

    Nawaz, Ather; Clemmensen, Torkil; Hertzum, Morten

    2011-01-01

    Websites are increasingly used as a medium for providing information to university students. The quality of a university website depends on how well the students’ information classification fits with the structure of the information on the website. This paper investigates the information classification of 14 Danish and 14 Pakistani students and compares it with the information classification of their university website. Brainstorming, card sorting and task exploration activities were used to discover similarities and differences in the participating students’ classification of website information and their ability to navigate the websites. The results of the study indicated group differences in user classification and related task-performance differences. The main implications of the study were that (a) the edit distance appears a useful measure in cross-country HCI research and practice...
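
    The abstract names the edit distance as a measure for comparing classifications but does not say which variant was used. As a minimal sketch, assuming the plain Levenshtein distance over sequences of category labels (the function name and the sample labels below are hypothetical, not taken from the study):

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences.

    Counts the minimum number of insertions, deletions and
    substitutions (each with cost 1) needed to turn a into b.
    """
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


# Hypothetical data: a student's ordering of menu categories
# versus the ordering on the university website.
student_sort = ["courses", "exams", "library", "housing"]
website_order = ["courses", "library", "exams", "sports"]
print(edit_distance(student_sort, website_order))
```

    A smaller distance would indicate a closer fit between a student's classification and the website structure; comparing tree-shaped card sorts would need a tree edit distance instead.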

  7. A Retrieval Performance Study of Research websites

    Directory of Open Access Journals (Sweden)

    G. Charles Babu

    2012-09-01

    Data and databases became of prime importance with the advent of computers, which enabled efficient storage and retrieval of information and advanced scientific research to the next level. Such ease of access to information has been made possible by various stochastic algorithms implemented in data storage and by the search methodologies used to mine data from databases. In this paper we attempt to compare the efficiency of the data retrieval and search options employed in three databases, viz., NCBI (National Center for Biotechnology Information), IEEE (Institute of Electrical and Electronics Engineers), CiteSeer and ACM (Association for Computing Machinery). From the analysis, the PubMed database was found to be much more user-friendly and more advanced than the CiteSeer and ACM databases.

  8. Introduction to information retrieval

    CERN Document Server

    Manning, Christopher D; Schütze, Hinrich

    2008-01-01

    Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.

  9. Connectionist Interaction Information Retrieval.

    Science.gov (United States)

    Dominich, Sandor

    2003-01-01

    Discussion of connectionist views for adaptive clustering in information retrieval focuses on a connectionist clustering technique and activation spreading-based information retrieval model using the interaction information retrieval method. Presents theoretical as well as simulation results as regards computational complexity and includes…

  10. Design and Implementation of Medical Information Retrieval and Utilization Teaching Website

    Institute of Scientific and Technical Information of China (English)

    朱妍昕; 邱君瑞; 徐维; 张静昌; 潘雅玲

    2012-01-01

    The paper introduces the design and implementation of the website for teaching medical information retrieval and utilization at the Library of the Second Military Medical University, and describes the website's main functions, which comprise four sections: subject introduction, teaching resources, classroom tasks and teaching interaction. The website is dynamic, convenient, practical, secure and standardized, and can play an important role in daily teaching.

  11. A framework for automatic information quality ranking of diabetes websites.

    Science.gov (United States)

    Belen Sağlam, Rahime; Taskaya Temizel, Tugba

    2015-01-01

    Objective: When searching for particular medical information on the internet, the challenge lies in distinguishing the websites that are relevant to the topic and contain accurate information. In this article, we propose a framework that automatically identifies and ranks diabetes websites according to their relevance and information quality based on the website content. Design: The proposed framework ranks diabetes websites according to their content quality, relevance and evidence-based medicine. The framework combines information retrieval techniques with a lexical resource based on SentiWordNet, making it possible to work with biased and untrusted websites while, at the same time, ensuring the content relevance. Measurement: The evaluation measurements used were Pearson correlation, true positives, false positives and accuracy. We tested the framework with a benchmark data set consisting of 55 websites with varying degrees of information quality problems. Results: The proposed framework gives good results that are comparable with the non-automated information quality measuring approaches in the literature. The correlation between the results of the proposed automated framework and the ground truth is 0.68 on average with p < 0.001, which is greater than that of the other proposed automated methods in the literature (average r score of 0.33).
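
    The Pearson correlation reported above compares automated quality scores with ground-truth ratings. A minimal sketch of that computation, assuming two equal-length lists of numeric scores (the function name and the sample values are hypothetical and are not data from the study):

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length samples."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two samples of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical scores for five websites: automated ranking vs. expert rating.
automated = [0.9, 0.4, 0.7, 0.2, 0.6]
expert = [0.8, 0.5, 0.9, 0.1, 0.4]
print(round(pearson_r(automated, expert), 2))
```

    Values near 1 mean the automated scores rise and fall with the expert ratings; the sketch does not compute the p-value the abstract also reports.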

  12. Evaluating personal information retrieval

    OpenAIRE

    Kelly, Liadh; Bunbury, Paul; Jones, Gareth J.F.

    2012-01-01

    Evaluation of personal search over an individual’s personal information space on the desktop or elsewhere is problematic for reasons relating both to the personal and private nature of the data and the associated personal information needs of collection owners. Indeed challenges associated with evaluation in this space are recognised as one of the key factors hindering the development of research in personal information retrieval. We present the “personal information retrieval evaluatio...

  13. Private information retrieval

    CERN Document Server

    Yi, Xun; Bertino, Elisa

    2013-01-01

    This book deals with Private Information Retrieval (PIR), a technique allowing a user to retrieve an element from a server in possession of a database without revealing to the server which element is retrieved. PIR has been widely applied to protect the privacy of the user in querying a service provider on the Internet. For example, by PIR, one can query a location-based service provider about the nearest car park without revealing his location to the server. The first PIR approach was introduced by Chor, Goldreich, Kushilevitz and Sudan in 1995 in a multi-server setting, where the user retriev
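
    To illustrate the multi-server setting mentioned above, here is a minimal sketch of a simple two-server XOR scheme over a database of bits. It assumes both servers hold identical copies and do not collude; the function names are hypothetical, and this is an illustration rather than necessarily the construction the book presents:

```python
import secrets


def server_answer(database, subset):
    """What each server computes: the XOR of the bits at the requested indices."""
    answer = 0
    for idx in subset:
        answer ^= database[idx]
    return answer


def make_queries(n, i):
    """Build the two index sets the user sends, one per server.

    s1 is a uniformly random subset of {0, ..., n-1}; s2 is s1 with
    index i toggled. Each set on its own is uniformly random, so
    neither server learns which index the user wants.
    """
    s1 = {j for j in range(n) if secrets.randbits(1)}
    s2 = s1 ^ {i}  # symmetric difference toggles membership of i
    return s1, s2


def retrieve(database, i):
    """Recover bit i by XOR-ing the two servers' answers."""
    s1, s2 = make_queries(len(database), i)
    # Every index except i appears in both sets or in neither, so it cancels.
    return server_answer(database, s1) ^ server_answer(database, s2)


# Hypothetical 8-bit database replicated on both servers.
db = [1, 0, 1, 1, 0, 0, 1, 0]
print(retrieve(db, 3) == db[3])
```

    The queries in this sketch are as long as the database, so it shows only the privacy idea; practical schemes aim for much lower communication cost.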

  14. Information retrieval system

    Science.gov (United States)

    Berg, R. F.; Holcomb, J. E.; Kelroy, E. A.; Levine, D. A.; Mee, C., III

    1970-01-01

    Generalized information storage and retrieval system capable of generating and maintaining a file, gathering statistics, sorting output, and generating final reports for output is reviewed. File generation and file maintenance programs written for the system are general purpose routines.

  15. Arabic Studies’ Progress in Information Retrieval

    Directory of Open Access Journals (Sweden)

    Essam Hanandeh

    2016-02-01

    The field of information retrieval has witnessed tangible progress over the past decades in response to the expanded usage of the internet and the dire need of users to search for massive amounts of digital information. Given the steady increase of Arabic e-content, excellent information retrieval systems must be devised to suit the nature and requirements of the Arabic language. This paper sheds light on the current progress in the field of Arabic information retrieval, identifies the challenges that hinder the progress of this science, and proposes suggestions for further research. This paper uses the descriptive analytical method to examine the reality of Arabic studies in the field of information retrieval and to study the problems that are being faced in this area. Specifically, the previous literature on information retrieval is reviewed by searching the related databases and websites.

  16. The Information Architecture of Behavior Change Websites

    OpenAIRE

    2005-01-01

    The extraordinary growth in Internet use offers researchers important new opportunities to identify and test new ways to deliver effective behavior change programs. The information architecture (IA)—the structure of website information—is an important but often overlooked factor to consider when adapting behavioral strategies developed in office-based settings for Web delivery. Using examples and relevant perspectives from multiple disciplines, we describe a continuum of website IA designs ra...

  17. Information Retrieval Models

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Göker, Ayse; Davies, John

    2009-01-01

    Many applications that handle information on the internet would be completely inadequate without the support of information retrieval technology. How would we find information on the world wide web if there were no web search engines? How would we manage our email without spam filtering? Much of the

  18. Information Retrieval Models

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Göker, Ayse; Davies, John

    2009-01-01

    Many applications that handle information on the internet would be completely inadequate without the support of information retrieval technology. How would we find information on the world wide web if there were no web search engines? How would we manage our email without spam filtering? Much of the

  19. Historic Preservation Information CFM Website

    Data.gov (United States)

    Department of Veterans Affairs — The VA Historic Preservation Office keeps information about VA's programs to comply with Federal preservation requirements, and also interesting information about VA...

  1. Interactive Information Retrieval

    DEFF Research Database (Denmark)

    Borlund, Pia

    2013-01-01

    The paper introduces the research area of interactive information retrieval (IIR) from a historical point of view. Further, the focus here is on evaluation, because much research in IR deals with IR evaluation methodology due to the core research interest in IR performance, system interaction and satisfaction with retrieved information. In order to position IIR evaluation, the Cranfield model and the series of tests that led to the Cranfield model are outlined. Three iconic user-oriented studies and projects that all have contributed to how IIR is perceived and understood today are presented: the MEDLARS test, the Book House fiction retrieval system, and the OKAPI project. On this basis the call for alternative IIR evaluation approaches motivated by the three revolutions (the cognitive, the relevance, and the interactive revolutions) put forward by Robertson & Hancock-Beaulieu (1992) is presented...

  2. Music Information Retrieval.

    Science.gov (United States)

    Downie, J. Stephen

    2003-01-01

    Identifies MIR (Music Information Retrieval) computer system problems, historic influences, current state-of-the-art, and future MIR solutions through an examination of the multidisciplinary approach to MIR. Highlights include pitch; temporal factors; harmonics; tone; editorial, textual, and bibliographic facets; multicultural factors; locating…

  3. Information Retrieval Evaluation

    CERN Document Server

    Harman, Donna

    2011-01-01

    Evaluation has always played a major role in information retrieval, with the early pioneers such as Cyril Cleverdon and Gerard Salton laying the foundations for most of the evaluation methodologies in use today. The retrieval community has been extremely fortunate to have such a well-grounded evaluation paradigm during a period when most of the human language technologies were just developing. This lecture has the goal of explaining where these evaluation methodologies came from and how they have continued to adapt to the vastly changed environment in the search engine world today. The lecture

  4. Quantitative Information on Oncology Prescription Drug Websites.

    Science.gov (United States)

    Sullivan, Helen W; Aikin, Kathryn J; Squiers, Linda B

    2016-09-02

    Our objective was to determine whether and how quantitative information about drug benefits and risks is presented to consumers and healthcare professionals on cancer-related prescription drug websites. We analyzed the content of 65 active cancer-related prescription drug websites. We assessed the inclusion and presentation of quantitative information for two audiences (consumers and healthcare professionals) and two types of information (drug benefits and risks). Websites were equally likely to present quantitative information for benefits (96.9 %) and risks (95.4 %). However, the amount of the information differed significantly: Both consumer-directed and healthcare-professional-directed webpages were more likely to have quantitative information for every benefit (consumer 38.5 %; healthcare professional 86.1 %) compared with every risk (consumer 3.1 %; healthcare professional 6.2 %). The numeric and graphic presentations also differed by audience and information type. Consumers have access to quantitative information about oncology drugs and, in particular, about the benefits of these drugs. Research has shown that using quantitative information to communicate treatment benefits and risks can increase patients' and physicians' understanding and can aid in treatment decision-making, although some numeric and graphic formats are more useful than others.

  5. Information, conservation and retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Eng, T. [Swedish Nuclear Fuel and Waste Management Co., Stockholm (Sweden); Norberg, E. [National Swedish Archives, Stockholm (Sweden); Torbacke, J. [Stockholm Univ. (Sweden). Dept. of History; Jensen, M. [Swedish Radiation Protection Inst., Stockholm (Sweden)

    1996-12-01

    The seminar took place on the Swedish ship for transportation of radioactive wastes, M/S Sigyn, which at summer time is used for exhibitions. The seminar treated items related to general information needs in society and questions related to radioactive waste, i.e. how knowledge about a waste repository should be passed on to future generations. Three contributions are contained in the report from the seminar and are indexed separately: `Active preservation - otherwise no achieves`; `The conservation and dissemination of information - A democratic issue`; and, `Conservation and retrieval of information - Elements of a strategy to inform future societies about nuclear waste repositories`.

  6. Interactive Information Retrieval

    DEFF Research Database (Denmark)

    Borlund, Pia

    2013-01-01

    The paper introduces the research area of interactive information retrieval (IIR) from a historical point of view. Further, the focus here is on evaluation, because much research in IR deals with IR evaluation methodology due to the core research interest in IR performance, system interaction and satisfaction with retrieved information. In order to position IIR evaluation, the Cranfield model and the series of tests that led to the Cranfield model are outlined. Three iconic user-oriented studies and projects that all have contributed to how IIR is perceived and understood today are presented... As a response to this call the ‘IIR evaluation model’ by Borlund (e.g., 2003a) is introduced. The objective of the IIR evaluation model is to facilitate IIR evaluation as close as possible to actual information searching and IR processes, though still in a relatively controlled evaluation environment, in which...

  7. Interactive Information Retrieval:

    DEFF Research Database (Denmark)

    Borlund, Pia

    This presentation addresses methodological issues of interactive information retrieval (IIR) evaluation in terms of what it entails to study users' use and interaction with IR systems, as well as their satisfaction with retrieved information. In particular, the presentation focuses on test design, and it takes a look into the toolbox of IIR test design with reference to data collection methods and test procedure. It calls for careful and well-planned studies to qualify the knowledge base generated as a result of the conducted IIR studies. The presentation further reflects on the need for an updated... IIR from the perspective of search dedication and task load in order to also include everyday life information seeking? With this presentation, the IIR community is invited to an exchange of ideas and is encouraged to engage in collaborations with the solving of these (and other) issues to our joint...

  8. The Effects of Website Information Utility on the Outcomes of User-Website Interactions

    Science.gov (United States)

    Hasley, Joseph Paul

    2010-01-01

    This study investigates the relationships between website information content utility and various outcomes of user interactions with e-tail websites. Although previous research has consistently identified high quality information content as a critical factor of successful e-commerce websites, those studies have not reported how to identify the…

  9. The Effects of Website Information Utility on the Outcomes of User-Website Interactions

    Science.gov (United States)

    Hasley, Joseph Paul

    2010-01-01

    This study investigates the relationships between website information content utility and various outcomes of user interactions with e-tail websites. Although previous research has consistently identified high quality information content as a critical factor of successful e-commerce websites, those studies have not reported how to identify the…

  10. Changing Information Retrieval Behaviours

    DEFF Research Database (Denmark)

    Constantiou, Ioanna D.; Lehrer, Christiane; Hess, Thomas

    2014-01-01

    The introduction of smartphones and the accompanying profusion of mobile data services have had a profound effect on individuals' lives. One of the most influential service categories is location-based services (LBS). Based on insights from behavioural decision-making, a conceptual framework is d...... on the continuance of LBS use and indicate changes in individuals' information retrieval behaviours in everyday life. In particular, the distinct value dimension of LBS in specific contexts of use changes individuals' behaviours towards accessing location-related information....

  11. A Visual Information Retrieval Tool.

    Science.gov (United States)

    Zhang, Jin

    2000-01-01

    Discussion of visualization for information retrieval, that transforms unseen internal semantic representation of a document collection into visible geometric displays, focuses on DARE (Distance Angle Retrieval Environment). Highlights include expression of information need; interpretation and manipulation of information retrieval models; ranking…

  12. Multimedia Information Retrieval

    CERN Document Server

    Rueger, Stefan

    2009-01-01

    At its very core multimedia information retrieval means the process of searching for and finding multimedia documents; the corresponding research field is concerned with building the best possible multimedia search engines. The intriguing bit here is that the query itself can be a multimedia excerpt: For example, when you walk around in an unknown place and stumble across an interesting landmark, would it not be great if you could just take a picture with your mobile phone and send it to a service that finds a similar picture in a database and tells you more about the building -- and about its

  13. Intelligent Information Retrieval

    CERN Document Server

    Kurtz, Michael J.; Eichhorn, Guenther; Accomazzi, Alberto; Grant, Carolyn; Henneken, Edwin; Murray, Stephen S.

    2005-01-01

    Since it was first announced at ADASS 2, the Smithsonian/NASA Astrophysics Data System Abstract Service (ADS) has played a central role in the information seeking behavior of astronomers. Central to the ability of the ADS to act as a search and discovery tool is its role as metadata aggregator. Over the past 13 years the ADS has introduced many new techniques to facilitate information retrieval, broadly defined. We discuss some of these developments, with particular attention to how the ADS might interact with the virtual observatory, and to the new myADS-arXiv customized open access virtual journal. The ADS is at http://ads.harvard.edu

  15. Website for avian flu information and bioinformatics

    Institute of Scientific and Technical Information of China (English)

    LIU Di; LIU Quan-He; WU Lin-Huan; LIU Bin; WU Jun; LAO Yi-Mei; LI Xiao-Jing; GAO George Fu; MA Jun-Cai

    2009-01-01

    Highly pathogenic influenza A virus H5N1 has spread worldwide and raised public concern. This has increased the output of influenza virus sequence data as well as research publications and other reports. In order to fight H5N1 avian flu in a comprehensive way, we designed and began to set up the Website for Avian Flu Information (http://www.avian-flu.info) in 2004. In addition to the available influenza virus database, the website aims to integrate diversified information for both researchers and the public. From 2004 to 2009, we collected information on all aspects, i.e. reports of outbreaks, scientific publications and editorials, policies for prevention, medicines and vaccines, and clinical practice and diagnosis. Except for publications, all information is in Chinese. By April 15, 2009, the cumulative news entries had exceeded 2000 and research papers were approaching 5000. By using the curated data from Influenza Virus Resource, we have set up an influenza virus sequence database and a bioinformatic platform, providing the basic functions for the sequence analysis of influenza virus. We will focus on the collection of experimental data and results as well as the integration of the data from the geological information system and avian influenza epidemiology.

  16. Implementation of the Website Information Publication Using PHP

    Institute of Scientific and Technical Information of China (English)

    PAN Li; CHEN Liaoyuan; WANG Xiuhui; WANG Weiping; DENG Wei; QIAN Lin

    2003-01-01

    With the rapid growth of the internet, fast information communication and comprehensive information publication have become available. To attract visitors' attention and let them learn of breaking news in time, a website is required to update its content constantly. The traditional approach of manually updating a website is no longer suitable given the rapid growth of website content. Moreover,

  17. Advanced Topics in Information Retrieval

    CERN Document Server

    Melucci, Massimo

    2011-01-01

    Information retrieval is the science concerned with the effective and efficient retrieval of documents starting from their semantic content. It is employed to fulfill some information need from a large number of digital documents. Given the ever-growing amount of documents available and the heterogeneous data structures used for storage, information retrieval has recently faced and tackled novel applications. In this book, Melucci and Baeza-Yates present a wide-spectrum illustration of recent research results in advanced areas related to information retrieval. Readers will find chapters on e.g

  18. The information architecture of behavior change websites.

    Science.gov (United States)

    Danaher, Brian G; McKay, H Garth; Seeley, John R

    2005-05-18

    The extraordinary growth in Internet use offers researchers important new opportunities to identify and test new ways to deliver effective behavior change programs. The information architecture (IA), the structure of website information, is an important but often overlooked factor to consider when adapting behavioral strategies developed in office-based settings for Web delivery. Using examples and relevant perspectives from multiple disciplines, we describe a continuum of website IA designs ranging from a matrix design to the tunnel design. The free-form matrix IA design allows users free rein to use multiple hyperlinks to explore available content according to their idiosyncratic interests. The more directive tunnel IA design (commonly used in e-learning courses) guides users step-by-step through a series of Web pages that are arranged in a particular order to improve the chances of achieving a goal that is measurable and consistent. Other IA designs are also discussed, including hierarchical IA and hybrid IA designs. In the hierarchical IA design, program content is arranged in a top-down manner, which helps the user find content of interest. The more complex hybrid IA design incorporates some combination of components that use matrix, tunnel, and/or hierarchical IA designs. Each of these IA designs is discussed in terms of usability, participant engagement, and program tailoring, as well as how they might best be matched with different behavior change goals (using Web-based smoking cessation interventions as examples). Our presentation underscores the role of considering and clearly reporting the use of IA designs when creating effective Web-based interventions. We also encourage the adoption of a multidisciplinary perspective as we move towards a more mature view of Internet intervention research.

  19. Intelligent Information Retrieval: An Introduction.

    Science.gov (United States)

    Gauch, Susan

    1992-01-01

    Discusses the application of artificial intelligence to online information retrieval systems and describes several systems: (1) CANSEARCH, from MEDLINE; (2) Intelligent Interface for Information Retrieval (I3R); (3) Gausch's Query Reformulation; (4) Environmental Pollution Expert (EP-X); (5) PLEXUS (gardening); and (6) SCISOR (corporate…

  20. Information retrieval in cultural heritage

    NARCIS (Netherlands)

    Koolen, M.; Kamps, J.; de Keijzer, V.

    2009-01-01

    This article discusses the opportunities and challenges of applying modern information retrieval techniques to the cultural heritage domain. Although the field of information retrieval is closely associated with computer science, it originally emerged from library science — also one of the main disc

  1. Contextual Bandits for Information Retrieval

    NARCIS (Netherlands)

    Hofmann, K.; Whiteson, S.; de Rijke, M.

    2011-01-01

    In this paper we give an overview of and outlook on research at the intersection of information retrieval (IR) and contextual bandit problems. A critical problem in information retrieval is online learning to rank, where a search engine strives to improve the quality of the ranked result lists it

  2. Ontology-based Information Retrieval

    DEFF Research Database (Denmark)

    Styltsvig, Henrik Bulskov

    of concept similarity in query evaluation is discussed. A semantic expansion approach that incorporates concept similarity is introduced and a generalized fuzzy set retrieval model that applies expansion during query evaluation is presented. While not commonly used in present information retrieval systems......In this thesis, we will present methods for introducing ontologies in information retrieval. The main hypothesis is that the inclusion of conceptual knowledge such as ontologies in the information retrieval process can contribute to the solution of major problems currently found in information...... retrieval. This utilization of ontologies has a number of challenges. Our focus is on the use of similarity measures derived from the knowledge about relations between concepts in ontologies, the recognition of semantic information in texts and the mapping of this knowledge into the ontologies in use...

  3. Design and implementation of website information disclosure assessment system.

    Science.gov (United States)

    Cho, Ying-Chiang; Pan, Jen-Yi

    2015-01-01

    Internet application technologies, such as cloud computing and cloud storage, have increasingly changed people's lives. Websites contain vast amounts of personal privacy information. In order to protect this information, network security technologies, such as database protection and data encryption, attract many researchers. The most serious problems concerning web vulnerability are e-mail address and network database leakages. These leakages have many causes. For example, malicious users can steal database contents, taking advantage of mistakes made by programmers and administrators. In order to mitigate this type of abuse, a website information disclosure assessment system is proposed in this study. This system utilizes a series of technologies, such as web crawler algorithms, SQL injection attack detection, and web vulnerability mining, to assess a website's information disclosure. Thirty websites, randomly sampled from the top 50 world colleges, were used to collect leakage information. This testing showed the importance of increasing the security and privacy of website information for academic websites.

  4. Design and implementation of website information disclosure assessment system.

    Directory of Open Access Journals (Sweden)

    Ying-Chiang Cho

    Internet application technologies, such as cloud computing and cloud storage, have increasingly changed people's lives. Websites contain vast amounts of personal privacy information. In order to protect this information, network security technologies, such as database protection and data encryption, attract many researchers. The most serious problems concerning web vulnerability are e-mail address and network database leakages. These leakages have many causes. For example, malicious users can steal database contents, taking advantage of mistakes made by programmers and administrators. In order to mitigate this type of abuse, a website information disclosure assessment system is proposed in this study. This system utilizes a series of technologies, such as web crawler algorithms, SQL injection attack detection, and web vulnerability mining, to assess a website's information disclosure. Thirty websites, randomly sampled from the top 50 world colleges, were used to collect leakage information. This testing showed the importance of increasing the security and privacy of website information for academic websites.

  5. Bibliometric-enhanced Information Retrieval

    CERN Document Server

    Mayr, Philipp; Larsen, Birger; Schaer, Philipp; Mutschke, Peter

    2013-01-01

    Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of coauthorship networks, can improve retrieval services for specific communities, as well as for large, cross-domain collections. This workshop aims to raise awareness of the missing link between information retrieval (IR) and bibliometrics/scientometrics and to create a common ground for the incorporation of bibliometric-enhanced services into retrieval at the digital library interface.

  6. Mobile medical visual information retrieval.

    Science.gov (United States)

    Depeursinge, Adrien; Duc, Samuel; Eggel, Ivan; Müller, Henning

    2012-01-01

    In this paper, we propose mobile access to peer-reviewed medical information based on textual search and content-based visual image retrieval. Web-based interfaces designed for limited screen space were developed to query via web services a medical information retrieval engine optimizing the amount of data to be transferred in wireless form. Visual and textual retrieval engines with state-of-the-art performance were integrated. Results obtained show a good usability of the software. Future use in clinical environments has the potential of increasing quality of patient care through bedside access to the medical literature in context.

  7. Information retrieval in digital environments

    CERN Document Server

    Dinet, Jérôme

    2014-01-01

    Information retrieval is a central and essential activity. It is indeed difficult to find a human activity that does not need to retrieve information in an environment which is often increasingly digital: moving and navigating, learning, having fun, communicating, informing, making a decision, etc. Most human activities are intimately linked to our ability to search quickly and effectively for relevant information, the stakes are sometimes extremely important: passing an exam, voting, finding a job, remaining autonomous, being socially connected, developing a critical spirit, or simply surviv

  8. Cross-language information retrieval

    CERN Document Server

    Nie, Jian-Yun

    2010-01-01

    Search for information is no longer exclusively limited within the native language of the user, but is more and more extended to other languages. This gives rise to the problem of cross-language information retrieval (CLIR), whose goal is to find relevant information written in a different language to a query. In addition to the problems of monolingual information retrieval (IR), translation is the key problem in CLIR: one should translate either the query or the documents from a language to another. However, this translation problem is not identical to full-text machine translation (MT): the

  9. Rhetorical relations for information retrieval

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Lu, Wei

    2012-01-01

    -called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document’s rhetorical...... relations be useful to IR? We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness...

  10. Information Network Systems and Information Sharing on Administrative Websites

    Institute of Scientific and Technical Information of China (English)

    HIROTA Denjiro

    2004-01-01

    In the Japanese "e-government" policy, called "e-Japan", the "administrative document management system" functions as an information searching system. On the other hand, this system has also created the problem that it does not fully function as a means of information sharing within a governmental agency. The purpose of this research is therefore to find how the administrative document management system can support information sharing in an administrative organization. To this end, this paper first considers the current status and some problems. Second, it proposes an idea and constructs some information systems using an official administrative Website. This is the method and approach of this research. As a conclusion, the proposed information system functions as an information sharing support system.

  11. Information Retrieval for Ecological Syntheses

    Science.gov (United States)

    Bayliss, Helen R.; Beyer, Fiona R.

    2015-01-01

    Research syntheses are increasingly being conducted within the fields of ecology and environmental management. Information retrieval is crucial in any synthesis in identifying data for inclusion whilst potentially reducing biases in the dataset gathered, yet the nature of ecological information provides several challenges when compared with…

  12. The Ecosystem of Information Retrieval

    Science.gov (United States)

    Rodriguez-Munoz, Jose-Vicente; Martinez-Mendez, Francisco-Javier; Pastor-Sanchez, Juan-Antonio

    2012-01-01

    Introduction: This paper presents an initial proposal for a formal framework that, by studying the metric variables involved in information retrieval, can establish the sequence of events involved and how to perform it. Method: A systematic approach from the equations of Shannon and Weaver to establish the decidability of information retrieval…

  13. Evaluation of Information Retrieval Systems

    Directory of Open Access Journals (Sweden)

    Keneilwe Zuva

    2012-07-01

    One of the challenges of modern information retrieval is to adequately evaluate an Information Retrieval System (IRS) in order to estimate future performance in a specified application domain. Since there are many algorithms in the literature, the decision to select one for use depends mostly on the evaluation of the systems' performance in the domain. This paper presents how visual and scalar evaluation methods complement one another to adequately evaluate information retrieval systems. The visual evaluation methods are capable of indicating whether one IRS performs better than another IRS fully or partially. The overall performance of an IRS is revealed using scalar evaluation methods. The use of both types of evaluation methods will give a clear picture of the performance of the IRSs. The Receiver Operator Characteristic (ROC) curve and Precision-Recall (P-R) curve were used to illustrate the visual evaluation methods. Scalar methods, notably precision, recall, Area Under Curve (AUC) and F measure, were used.
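
    The scalar measures named in this abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the function names, the set-based definitions for a single query, and the pairwise formulation of AUC are assumptions chosen for illustration.

```python
def precision_recall_f1(retrieved, relevant):
    """Set-based precision, recall and F1 for a single query.

    `retrieved` and `relevant` are iterables of document ids
    (hypothetical inputs, not data from the paper).
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def roc_auc(scores, labels):
    """AUC as the fraction of (relevant, non-relevant) pairs ranked correctly.

    Ties count as half a correct pair. This is the rank-statistic view of
    the area under the ROC curve.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one relevant and one non-relevant item")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    print(precision_recall_f1(["d1", "d2", "d3", "d4"], ["d1", "d3", "d5"]))
    print(roc_auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]))
```

    Precision, recall and F1 summarise one fixed result set, while the AUC summarises the whole ranking, which is one way to read the abstract's point that scalar measures give an overall picture.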

  14. ORDINAL REGRESSION FOR INFORMATION RETRIEVAL

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    This letter presents a new discriminative model for Information Retrieval (IR), referred to as the Ordinal Regression Model (ORM). ORM differs from most existing models in that it views IR as an ordinal regression problem (i.e. a ranking problem) instead of binary classification. Since the task of IR is to rank documents according to the user's information need, IR can be viewed as an ordinal regression problem. Two parameter learning algorithms for ORM are presented. One is a perceptron-based algorithm. The other is the ranking Support Vector Machine (SVM). The effectiveness of the proposed approach has been evaluated on the task of ad hoc retrieval using three English Text REtrieval Conference (TREC) sets and two Chinese TREC sets. Results show that ORM significantly outperforms the state-of-the-art language model approaches and the OKAPI system on all test sets, and that it is more appropriate to view IR as ordinal regression rather than binary classification.
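
    The idea of treating retrieval as ranking rather than binary classification can be sketched with a generic pairwise perceptron. This is an illustrative Python sketch only: the abstract does not give the details of the ORM learning algorithms, so the update rule, the feature vectors and the function names below are assumptions, not the authors' method.

```python
def score(weights, features):
    """Linear ranking score: higher means ranked earlier."""
    return sum(w * x for w, x in zip(weights, features))


def train_pairwise_perceptron(examples, epochs=10, lr=1.0):
    """Learn weights so that documents with higher grades score higher.

    `examples` is a list of (feature_vector, relevance_grade) pairs for one
    query (hypothetical data). For every pair where document i has a higher
    grade than document j but does not score strictly higher, the weights
    are moved in the direction of the feature difference.
    """
    dim = len(examples[0][0])
    weights = [0.0] * dim
    for _ in range(epochs):
        for xi, yi in examples:
            for xj, yj in examples:
                if yi <= yj:
                    continue  # only pairs with a clear preferred document
                diff = [a - b for a, b in zip(xi, xj)]
                if score(weights, diff) <= 0:  # mis-ordered pair
                    weights = [w + lr * d for w, d in zip(weights, diff)]
    return weights


if __name__ == "__main__":
    data = [([1.0, 0.0], 2), ([0.5, 0.5], 1), ([0.0, 1.0], 0)]
    w = train_pairwise_perceptron(data)
    ranked = sorted(data, key=lambda ex: score(w, ex[0]), reverse=True)
    print([grade for _, grade in ranked])
```

    The training signal here comes from the relative order of graded documents, not from a relevant/non-relevant label per document, which is the distinction the abstract draws.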

  15. INFORMATION RETRIEVAL FOR SHORT DOCUMENTS

    Institute of Scientific and Technical Information of China (English)

    Qi Haoliang; Li Mu; Gao Jianfeng; Li Sheng

    2006-01-01

    The major problem of most current information models is that individual words provide unreliable evidence about the content of texts. When the document is short, e.g. only the abstract is available, the word-use variability problem has a substantial impact on Information Retrieval (IR) performance. To solve the problem, a new approach to short document retrieval named the Reference Document Model (RDM) is put forward in this letter. RDM obtains the statistical semantics of the query/document by pseudo feedback for both the query and the document from reference documents. The contributions of this model are three-fold: (1) pseudo feedback for both the query and the document; (2) building the query model and the document model from reference documents; (3) flexible indexing units, which can be any linguistic elements such as documents, paragraphs, sentences, n-grams, terms or characters. For short document retrieval, RDM achieves significant improvements over the classical probabilistic models on the task of ad hoc retrieval on Text REtrieval Conference (TREC) test sets. Results also show that the shorter the document, the better the RDM performance.

  16. An Introduction to Information Retrieval.

    Science.gov (United States)

    International Business Machines Corp., White Plains, NY. Data Processing Div.

    The ways in which digital computers can be used in information storage and retrieval are presented in the language of the nonspecialist. Indexing methods, file organization, and search strategies are discussed and a brief bibliography containing 30 IBM publications is given. The manual is intended as a first reader for those interested in the…

  17. Information Retrieval in the Classroom.

    Science.gov (United States)

    Oley, Elizabeth

    1989-01-01

    Explores aspects of information retrieval skills such as end user training, indexing, controlled vocabulary systems, search protocol, boolean logic, problem analysis, and decision making. Suggests techniques for classroom instruction using simulations of online databases, CD-ROMs, and DIALOG's classroom instruction program. Describes several…

  18. Information Retrieval in Virtual Universities

    Science.gov (United States)

    Puustjärvi, Juha; Pöyry, Päivi

    2006-01-01

    Information retrieval in the context of virtual universities deals with the representation, organization, and access to learning objects. The representation and organization of learning objects should provide the learner with an easy access to the learning objects. In this article, we give an overview of the ONES system, and analyze the relevance…

  19. Automated information retrieval using CLIPS

    Science.gov (United States)

    Raines, Rodney Doyle, III; Beug, James Lewis

    1991-01-01

    Expert systems have considerable potential to assist computer users in managing the large volume of information available to them. One possible use of an expert system is to model the information retrieval interests of a human user and then make recommendations to the user as to articles of interest. At Cal Poly, a prototype expert system written in the C Language Integrated Production System (CLIPS) serves as an Automated Information Retrieval System (AIRS). AIRS monitors a user's reading preferences, develops a profile of the user, and then evaluates items returned from the information base. When prompted by the user, AIRS returns a list of items of interest to the user. In order to minimize the impact on system resources, AIRS is designed to run in the background during periods of light system use.

  20. Rhetorical relations for information retrieval

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Lu, Wei

    2012-01-01

    Typically, every part in most coherent text has some plausible reason for its presence, some function that it performs to the overall semantics of the text. Rhetorical relations, e.g. contrast, cause, explanation, describe how the parts of a text are linked to each other. Knowledge about this so......-called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document’s rhetorical...... relations be useful to IR? We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness...

  1. Interactive information seeking, behaviour and retrieval

    CERN Document Server

    Ruthven, Ian

    2011-01-01

    Information retrieval (IR) is a complex human activity supported by sophisticated systems. This book covers the whole spectrum of information retrieval, including: history and background information; behaviour and seeking task-based information; searching and retrieval approaches to investigating information; and, evaluation interfaces for IR.

  2. Understanding vaccination resistance: vaccine search term selection bias and the valence of retrieved information.

    Science.gov (United States)

    Ruiz, Jeanette B; Bell, Robert A

    2014-10-07

    Dubious vaccination-related information on the Internet leads some parents to opt out of vaccinating their children. To determine if negative, neutral and positive search terms retrieve vaccination information that differs in valence and confirms searchers' assumptions about vaccination. A content analysis of first-page Google search results was conducted using three negative, three neutral, and three positive search terms for the concepts "vaccine," "vaccination," and "MMR"; 84 of the 90 websites retrieved met inclusion requirements. Two coders independently and reliably coded for the presence or absence of each of 15 myths about vaccination (e.g., "vaccines cause autism"), statements that countered these myths, and recommendations for or against vaccination. Data were analyzed using descriptive statistics. Across all websites, at least one myth was perpetuated on 16.7% of websites and at least one myth was countered on 64.3% of websites. The mean number of myths perpetuated on websites retrieved with negative, neutral, and positive search terms, respectively, was 1.93, 0.53, and 0.40. The mean number of myths countered on websites retrieved with negative, neutral, and positive search terms, respectively, was 3.0, 3.27, and 2.87. Explicit recommendations regarding vaccination were offered on 22.6% of websites. A recommendation against vaccination was more often made on websites retrieved with negative search terms (37.5% of recommendations) than on websites retrieved with neutral (12.5%) or positive (0%) search terms. The concerned parent who seeks information about the risks of childhood immunizations will find more websites that perpetuate vaccine myths and recommend against vaccination than the parent who seeks information about the benefits of vaccination. This suggests that search term valence can lead to online information that supports concerned parents' misconceptions about vaccines. Copyright © 2014 Elsevier Ltd. All rights reserved.

  3. Least Information Modeling for Information Retrieval

    CERN Document Server

    Ke, Weimao

    2012-01-01

    We proposed a Least Information theory (LIT) to quantify meaning of information in probability distribution changes, from which a new information retrieval model was developed. We observed several important characteristics of the proposed theory and derived two quantities in the IR context for document representation. Given probability distributions in a collection as prior knowledge, LI Binary (LIB) quantifies least information due to the binary occurrence of a term in a document whereas LI Frequency (LIF) measures least information based on the probability of drawing a term from a bag of words. Three fusion methods were also developed to combine LIB and LIF quantities for term weighting and document ranking. Experiments on four benchmark TREC collections for ad hoc retrieval showed that LIT-based methods demonstrated very strong performances compared to classic TF*IDF and BM25, especially for verbose queries and hard search topics. The least information theory offers a new approach to measuring semantic qua...

  4. Source Evaluation and Information Literacy: Findings from a Study on Science Websites

    Directory of Open Access Journals (Sweden)

    Nora J. Bird

    2011-03-01

    An essential component of information literacy is the evaluation of information resources. Integral to evaluation are users’ judgments about which Web sources might prove reliable when learning about a particular topic and the ones that they would choose for short term and long term use. Past Website quality studies have used research methods that involved asking participants to recall quality factors without the benefit of concurrent Web searching. Users in this study evaluated Websites during live searching on the “open” or unrestricted Web in a quasi-experimental protocol to determine the quality factors they valued and how these factors relate to gaining knowledge about a particular topic – genetically modified food. Forty users from within a university setting and from the general community were given a pre-test about subject knowledge, were then asked to search and evaluate the most promising sites they found, and, subsequently, were given a post-searching questionnaire related to the quality of the information and the Websites retrieved. The quality factors that participants reported as helpful to them during the search are reported here. Two weeks later participants answered questions about the Websites they visited and what they had learned via an email survey. The participants then reported factors that allowed them to remember a Website or the information contained within it.

  5. Information retrieval and individual differences

    Directory of Open Access Journals (Sweden)

    Polona Vilar

    2008-01-01

    The paper presents individual differences found in studies of information retrieval, with emphasis on models of personality traits and cognitive and learning styles. It pays special attention to those models which are most often included in studies of information behaviour, information seeking, perceptions of IR systems, etc., but also brings forward some models which have not yet been included in such studies. Additionally, the relationship between different individual characteristics and an individual’s chosen profession or academic area is discussed. In this context, the paper presents how investigation of individual differences can be useful in the design of IR systems.

  6. Interactive Information Retrieval: Context and Basic Notions

    Directory of Open Access Journals (Sweden)

    David Robins

    2000-01-01

    This paper provides an introduction to interactive information retrieval, the study of human interaction with information retrieval systems. Interactive information retrieval may be contrasted with the "system-centered" view of information retrieval in which changes to information retrieval system variables are manipulated in isolation from users in laboratory situations. The paper elucidates current models of interactive information retrieval, namely, the episodic model, the stratified model, the interactive feedback and search process model, and the global model of polyrepresentation. Future directions for research in the field are discussed.

  7. Users' information-seeking behavior on a medical library Website.

    Science.gov (United States)

    Rozic-Hristovsk, Anamarija; Hristovski, Dimitar; Todorovski, Ljupco

    2002-04-01

    The Central Medical Library (CMK) at the Faculty of Medicine, University of Ljubljana, Slovenia, started to build a library Website that included a guide to library services and resources in 1997. The evaluation of Website usage plays an important role in its maintenance and development. Analyzing and exploring regularities in the visitors' behavior can be used to enhance the quality and facilitate delivery of information services, identify visitors' interests, and improve the server's performance. The analysis of the CMK Website users' navigational behavior was carried out by analyzing the Web server log files. These files contained information on all user accesses to the Website and provided a great opportunity to learn more about the behavior of visitors to the Website. The majority of the available tools for Web log file analysis provide a predefined set of reports showing the access count and the transferred bytes grouped along several dimensions. In addition to the reports mentioned above, the authors wanted to be able to perform interactive exploration and ad hoc analysis and discover trends in a user-friendly way. Because of that, we developed our own solution for exploring and analyzing the Web logs based on data warehousing and online analytical processing technologies. The analytical solution we developed proved successful, so it may find further application in the field of Web log file analysis. We will apply the findings of the analysis to restructuring the CMK Website.
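
    The kind of report described here (access counts and transferred bytes grouped along a dimension) can be sketched in a few lines of Python. This is not the authors' data-warehouse solution: the sketch assumes the server writes the Common Log Format, and the pattern, function name and sample lines are all hypothetical.

```python
import re
from collections import defaultdict

# Assumed layout: Common Log Format, e.g.
# host ident user [time] "METHOD path PROTOCOL" status size
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\d+|-)'
)


def summarize(lines, key="path"):
    """Group hits and transferred bytes by one named field of the log line.

    `key` may be any group name in LOG_PATTERN ("host", "path", "status", ...).
    Lines that do not match the assumed format are skipped.
    """
    totals = defaultdict(lambda: {"hits": 0, "bytes": 0})
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue
        size = 0 if match["size"] == "-" else int(match["size"])
        bucket = totals[match[key]]
        bucket["hits"] += 1
        bucket["bytes"] += size
    return dict(totals)


if __name__ == "__main__":
    sample = [
        '192.0.2.1 - - [10/Apr/2002:13:55:36 +0200] "GET /index.html HTTP/1.0" 200 2326',
        '192.0.2.2 - - [10/Apr/2002:13:56:01 +0200] "GET /services.html HTTP/1.0" 200 1500',
        '192.0.2.1 - - [10/Apr/2002:13:57:12 +0200] "GET /index.html HTTP/1.0" 304 -',
    ]
    print(summarize(sample, key="path"))
    print(summarize(sample, key="host"))
```

    A fixed grouping like this corresponds to the predefined reports the abstract mentions. The interactive, ad hoc exploration the authors wanted is what led them to data warehousing and online analytical processing instead.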

  8. SEMANTIC TERM BASED INFORMATION RETRIEVAL USING ONTOLOGY

    OpenAIRE

    2014-01-01

    Information searching and retrieval is a challenging task in traditional keyword-based textual information retrieval systems. In the growing information age, with huge amounts of data added every day, the search problem has also grown. Keyword-based retrieval systems return a bulk of junk documents irrelevant to the query. To address these limitations, this paper proposes using query terms along with semantic terms for information retrieval, with reference to multiple ontologies. User query sometimes reflects multiple ...

  9. Crisis pregnancy center websites: Information, misinformation and disinformation.

    Science.gov (United States)

    Bryant, Amy G; Narasimhan, Subasri; Bryant-Comstock, Katelyn; Levi, Erika E

    2014-12-01

    Most states with 24-h waiting periods prior to abortion provide state resource directories to women seeking abortion. Our objective was to evaluate the information on abortion provided on the websites of crisis pregnancy centers listed in these resource directories. We performed a survey of the websites of crisis pregnancy centers referenced in state resource directories for pregnant women. We searched for these state-provided resource directories online. We contacted state Departments of Health and Human Services for a print copy when a directory could not be found online. The crisis pregnancy center websites were evaluated for the information provided on abortion. Standardized data collection tools were used. Descriptive statistics were generated. Resource directories of 12 states were procured. A total of 254 websites referring to 348 crisis pregnancy centers were identified. Overall, a total of 203/254 [80%, 95% confidence interval (CI) 75%-84%] of websites provided at least one false or misleading piece of information. The most common misleading or false information included on the websites were a declared link between abortion and mental health risks (122/254 sites; 48%, 95% CI 42%-54%), preterm birth (54/254; 21%, 95% CI 17%-27%), breast cancer (51/254; 20%, 95% CI 16%-25%) and future infertility (32/254; 13%, 95% CI 9%-17%). Most crisis pregnancy centers listed in state resource directories for pregnant women provide misleading or false information regarding the risks of abortion. States should not list agencies that provide inaccurate information as resources in their directories. Copyright © 2014 Elsevier Inc. All rights reserved.
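
    The proportions and 95% confidence intervals reported above can be reproduced approximately with a standard binomial interval. The abstract does not say which interval method the authors used. The Python sketch below assumes the Wilson score interval, and the function name is hypothetical.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion.

    `z` = 1.96 corresponds to a 95% confidence level. The choice of the
    Wilson method is an assumption; the source abstract does not name one.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half_width, center + half_width


if __name__ == "__main__":
    for label, k in [("any false or misleading item", 203),
                     ("mental health risks", 122)]:
        low, high = wilson_interval(k, 254)
        print(f"{label}: {k / 254:.0%} (95% CI {low:.0%}-{high:.0%})")
```

    Under this assumption, 203/254 gives an interval of roughly 75% to 84% and 122/254 roughly 42% to 54%, in line with the figures in the abstract.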

  10. Computing, Information and Communications Technology (CICT) Website

    Science.gov (United States)

    Hardman, John; Tu, Eugene (Technical Monitor)

    2002-01-01

    The Computing, Information and Communications Technology Program (CICT) was established in 2001 to ensure NASA's continuing leadership in emerging technologies. It is a coordinated, Agency-wide effort to develop and deploy key enabling technologies for a broad range of mission-critical tasks. The NASA CICT program is designed to address Agency-specific computing, information, and communications technology requirements beyond the projected capabilities of commercially available solutions. The areas of technical focus have been chosen for their impact on NASA's missions, their national importance, and the technical challenge they provide to the Program. In order to meet its objectives, the CICT Program is organized into the following four technology-focused projects: 1) Computing, Networking and Information Systems (CNIS); 2) Intelligent Systems (IS); 3) Space Communications (SC); 4) Information Technology Strategic Research (ITSR).

  11. Do You Ignore Information Security in Your Journal Website?

    Science.gov (United States)

    Dadkhah, Mehdi; Borchardt, Glenn; Lagzian, Mohammad

    2016-11-24

    Nowadays, web-based applications extend to all businesses due to their advantages and easy usability. The most important issue in web-based applications is security. Due to their advantages, most academic journals are now using these applications, with papers being submitted and published through their websites. As these websites are resources for knowledge, information security is essential for maintaining their integrity. In this opinion piece, we point out vulnerabilities in certain websites and introduce the potential for future threats. We intend to present how some journals are vulnerable and what will happen if a journal can be infected by attackers. This opinion piece is not a technical manual on information security; it is a short inspection that we carried out to improve the security of academic journals.

  12. Personalized Mobile Information Retrieval System

    Directory of Open Access Journals (Sweden)

    Okkyung Choi

    2012-04-01

    Building global network relations through the Internet has greatly changed personal information systems, and even comments left on a webpage of an SNS (Social Network Service) are regarded as important elements that can provide valuable information to someone. A social network is a relation between individuals or groups, represented as a graph model that converts the concept of psychological and social relations into a logical structure using nodes and links. However, most current personalized systems based on social networks are built mainly for the PC environment and are neither designed nor implemented for the mobile environment. Hence, the objective of this study is to propose methods for providing a Personalized Mobile Information Retrieval System for users of NFC (Near Field Communication) smartphones. In addition, this study aims to verify its efficiency through a comparative analysis with existing studies.

  14. Oil field waste disposal in salt caverns: An information website

    Energy Technology Data Exchange (ETDEWEB)

    Tomasko, D.; Veil, J. A.

    1999-12-10

    Argonne National Laboratory has completed the construction of a Website for the US Department of Energy (DOE) that provides detailed information on salt caverns and their use for disposing of nonhazardous oil field wastes (NOW) and naturally occurring radioactive materials (NORM). Specific topics in the Website include the following: descriptions of salt deposits and salt caverns within the US, salt cavern construction methods, potential types of wastes, waste emplacement, regulatory issues, costs, carcinogenic and noncarcinogenic human health risks associated with postulated cavern release scenarios, new information on cavern disposal (e.g., upcoming meetings, regulatory issues, etc.), other studies supported by the National Petroleum Technology Office (NPTO) (e.g., considerations of site location, cavern stability, development issues, and bedded salt characterization in the Midland Basin), and links to other associated Web sites. In addition, the Website allows downloadable access to reports prepared on the topic that were funded by DOE. Because of the large quantities of NOW and NORM wastes generated annually by the oil industry, information presented on this Website is particularly interesting and valuable to project managers, regulators, and concerned citizens.

  15. Biomedical information retrieval across languages.

    Science.gov (United States)

    Daumke, Philipp; Markó, Kornél; Poprat, Michael; Schulz, Stefan; Klar, Rüdiger

    2007-06-01

    This work presents a new dictionary-based approach to biomedical cross-language information retrieval (CLIR) that addresses many of the general and domain-specific challenges in current CLIR research. Our method is based on a multilingual lexicon that was generated partly manually and partly automatically, and currently covers six European languages. It contains morphologically meaningful word fragments, termed subwords. Using subwords instead of entire words significantly reduces the number of lexical entries necessary to sufficiently cover a specific language and domain. Mediation between queries and documents is based on these subwords as well as on lists of word-n-grams that are generated from large monolingual corpora and constitute possible translation units. The translations are then sent to a standard Internet search engine. This process makes our approach an effective tool for searching the biomedical content of the World Wide Web in different languages. We evaluate this approach using the OHSUMED corpus, a large medical document collection, within a cross-language retrieval setting.
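
    To make the subword idea concrete, the following is a minimal sketch only. The two tiny lexicons, the concept identifiers and the greedy longest-match segmentation are all hypothetical choices made for illustration; the abstract does not describe the authors' actual lexicon format or segmentation procedure.

```python
# Hypothetical toy lexicons: subword -> language-independent concept ID.
# Real lexicons would be far larger; these entries are invented.
SUBWORD_LEXICON = {
    "en": {"gastr": "#stomach", "itis": "#inflammation"},
    "de": {"magen": "#stomach", "entzündung": "#inflammation"},
}


def to_concepts(word: str, language: str) -> list[str]:
    """Segment a word by greedy longest match and map subwords to concept IDs."""
    lexicon = SUBWORD_LEXICON[language]
    word = word.lower()
    concepts = []
    i = 0
    while i < len(word):
        # Try the longest candidate starting at position i first.
        for j in range(len(word), i, -1):
            if word[i:j] in lexicon:
                concepts.append(lexicon[word[i:j]])
                i = j
                break
        else:
            i += 1  # no subword starts here; skip one character
    return concepts


print(to_concepts("Gastritis", "en"))
print(to_concepts("Magenentzündung", "de"))
```

    In this toy setup both words reduce to the same sequence of concept identifiers, which is the property that lets a query in one language match documents in another.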

  16. Information retrieval from black holes

    Science.gov (United States)

    Lochan, Kinjalk; Chakraborty, Sumanta; Padmanabhan, T.

    2016-08-01

    It is generally believed that, when matter collapses to form a black hole, the complete information about the initial state of the matter cannot be retrieved by future asymptotic observers, through local measurements. This is contrary to the expectation from a unitary evolution in quantum theory and leads to (a version of) the black hole information paradox. Classically, nothing else, apart from mass, charge, and angular momentum is expected to be revealed to such asymptotic observers after the formation of a black hole. Semiclassically, black holes evaporate after their formation through the Hawking radiation. The dominant part of the radiation is expected to be thermal and hence one cannot know anything about the initial data from the resultant radiation. However, there can be sources of distortions which make the radiation nonthermal. Although the distortions are not strong enough to make the evolution unitary, these distortions carry some part of information regarding the in-state. In this work, we show how one can decipher the information about the in-state of the field from these distortions. We show that the distortions of a particular kind—which we call nonvacuum distortions—can be used to fully reconstruct the initial data. The asymptotic observer can do this operationally by measuring certain well-defined observables of the quantum field at late times. We demonstrate that a general class of in-states encode all their information content in the correlation of late time out-going modes. Further, using a 1 +1 dimensional dilatonic black hole model to accommodate backreaction self-consistently, we show that observers can also infer and track the information content about the initial data, during the course of evaporation, unambiguously. Implications of such information extraction are discussed.

  17. Personalized Multimedia Information Retrieval based on User Profile Mining

    Directory of Open Access Journals (Sweden)

    Pengyi Zhang

    2013-10-01

    This paper focuses on how to retrieve personalized multimedia information based on user interest, which can be mined from the user profile. After analyzing related work, a general structure of the personalized multimedia information retrieval system is given, which combines an online module and an offline module. Firstly, we collect a large-scale set of photos from multimedia information sharing websites. Then, we record information about the users who upload the multimedia information. For a given user, we save the history data that describe the multimedia data. Secondly, the relationship between the contents of multimedia data and semantic information is analyzed, and the user interest model is constructed by a modified LDA model that can integrate all the influencing factors in the task of multimedia information retrieval. Thirdly, the query distributions of all the topics are estimated by the proposed modified LDA model. Fourthly, based on the above offline computing process, the online personalized multimedia information ranking algorithm is given, which utilizes the user interest model and the query word. Fifthly, multimedia information retrieval results are obtained using the proposed personalized multimedia information ranking algorithm. Finally, performance is evaluated in a series of experiments that compare the proposed algorithm with other methods on different datasets.

  18. Multimedia information retrieval theory and techniques

    CERN Document Server

    Raieli, Roberto

    2013-01-01

    Novel processing and searching tools for the management of new multimedia documents have been developed. Multimedia Information Retrieval (MMIR) is an organic system made up of Text Retrieval (TR); Visual Retrieval (VR); Video Retrieval (VDR); and Audio Retrieval (AR) systems. So that each type of digital document may be analysed and searched by the elements of language appropriate to its nature, search criteria must be extended. Such an approach is known as the Content Based Information Retrieval (CBIR), and is the core of MMIR. This novel content-based concept of information handling needs to be integrated with more traditional semantics. Multimedia Information Retrieval focuses on the tools of processing and searching applicable to the content-based management of new multimedia documents. Translated from Italian by Giles Smith, the book is divided into two parts. Part one discusses MMIR and related theories, and puts forward new methodologies; part two reviews various experimental and operating MMIR systems, a...

  19. Qualitative website analysis of information on birth after caesarean section.

    Science.gov (United States)

    Peddie, Valerie L; Whitelaw, Natalie; Cumming, Grant P; Bhattacharya, Siladitya; Black, Mairead

    2015-08-19

    The United Kingdom (UK) caesarean section (CS) rate is largely determined by reluctance to augment trial of labour and vaginal birth. Choice between repeat CS and attempting vaginal birth after CS (VBAC) in the next pregnancy is challenging, with neither offering clear safety advantages. Women may access online information during the decision-making process. Such information is known to vary in its support for either mode of birth when assessed quantitatively. Therefore, we sought to explore qualitatively, the content and presentation of web-based health care information on birth after caesarean section (CS) in order to identify the dominant messages being conveyed. The search engine Google™ was used to conduct an internet search using terms relating to birth after CS. The ten most frequently returned websites meeting relevant purposive sampling criteria were analysed. Sampling criteria were based upon funding source, authorship and intended audience. Images and written textual content together with presence of links to additional media or external web content were analysed using descriptive and thematic analyses respectively. Ten websites were analysed: five funded by Government bodies or professional membership; one via charitable donations, and four funded commercially. All sites compared the advantages and disadvantages of both repeat CS and VBAC. Commercially funded websites favoured a question and answer format alongside images, 'pop-ups', social media forum links and hyperlinks to third-party sites. The relationship between the parent sites and those being linked to may not be readily apparent to users, risking perception of endorsement of either VBAC or repeat CS whether intended or otherwise. Websites affiliated with Government or health services presented referenced clinical information in a factual manner with podcasts of real life experiences. Many imply greater support for VBAC than repeat CS although this was predominantly conveyed through subtle

  20. A Unified Mathematical Definition of Classical Information Retrieval.

    Science.gov (United States)

    Dominich, Sandor

    2000-01-01

    Presents a unified mathematical definition for the classical models of information retrieval and identifies a mathematical structure behind relevance feedback. Highlights include vector information retrieval; probabilistic information retrieval; and similarity information retrieval. (Contains 118 references.) (Author/LRW)

  1. Information retrieval from black holes

    CERN Document Server

    Lochan, Kinjalk; Padmanabhan, T

    2016-01-01

    It is generally believed that, when matter collapses to form a black hole, the complete information about the initial state of the matter cannot be retrieved by future asymptotic observers, through local measurements. This is contrary to the expectation from a unitary evolution in quantum theory and leads to (a version of) the black hole information paradox. Classically, nothing else, apart from mass, charge and angular momentum is expected to be revealed to such asymptotic observers after the formation of a black hole. Semi-classically, black holes evaporate after their formation through the Hawking radiation. The dominant part of the radiation is expected to be thermal and hence one cannot know anything about the initial data from the resultant radiation. However, there can be sources of distortions which make the radiation non-thermal. Although the distortions are not strong enough to make the evolution unitary, these distortions carry some part of information regarding the in-state. In this work, we show ...

  2. BIR 2014 - Bibliometric-enhanced Information Retrieval

    DEFF Research Database (Denmark)

    This first “Bibliometric-enhanced Information Retrieval” (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although...... for the incorporation of bibliometric-enhanced services into retrieval at the digital library interface. Our interests include information retrieval, information seeking, science modelling, network analysis, and digital libraries. The goal is to apply insights from bibliometrics, scientometrics, and informetrics...

  3. Probabilistic Modeling in Dynamic Information Retrieval

    OpenAIRE

    Sloan, M. C.

    2016-01-01

    Dynamic modeling is used to design systems that are adaptive to their changing environment and is currently poorly understood in information retrieval systems. Common elements in the information retrieval methodology, such as documents, relevance, users and tasks, are dynamic entities that may evolve over the course of several interactions, which is increasingly captured in search log datasets. Conventional frameworks and models in information retrieval treat these elements as static, or only...

  4. A High-Speed Information Retrieval System

    Institute of Scientific and Technical Information of China (English)

    SHI Shu-dong; LI Zhi-tang

    2004-01-01

    We developed a high-speed information retrieval system. The system, based on the IXP 2800, is a dedicated device. Its information retrieval throughput is 6.8 Gb/s. It supports various network protocols, including Telnet, FTP, SMTP, and POP3. Retrieval supports keywords and natural language processing. This paper explains the hardware system, the software system, and the performance indices.

  5. Query space reduction in information retrieval

    OpenAIRE

    Kelledy, Fergus

    1997-01-01

    Today’s rapidly expanding and dynamic information age, coupled with users who are becoming more discerning about what information they want and when they want it, poses a serious challenge to information retrieval systems in their attempt to match users’ information needs with information repositories. To date most research on information retrieval has concentrated on improving system effectiveness. However as the amount of online information and the number of users concurrently accessing t...

  6. Characteristics of international websites with information on developmental disabilities.

    Science.gov (United States)

    Reichow, Brian; Gelbar, Nicholas W; Mouradjian, Keri; Shefcyk, Allison; Smith, Isaac C

    2014-10-01

    The Internet often serves as a primary resource for individuals seeking health-related information, and a large and growing number of websites contain information related to developmental disabilities. This paper presents the results of an international evaluation of the characteristics and content of the top 10 ranked results (i.e., not including sponsored results - pay-per-click) returned when one of five terms related to developmental disabilities (i.e., ADHD, autism, Down syndrome, learning disability, intellectual disability) was entered into one of six country-specific Google online search engines (i.e., Australia (https://www.google.com.au), Canada (https://www.google.ca), Ireland (https://www.google.ie), New Zealand (https://www.google.co.nz), the United Kingdom (https://www.google.co.uk), and the United States (https://www.google.com)) on October 22, 2013. Collectively, we found that international consumers of websites related to developmental disabilities will encounter different websites with differing content and terminology, and should be critical consumers to ensure they locate the information they are seeking.

  7. Information retrieval: implementing and evaluating search engines

    CERN Document Server

    Büttcher, Stefan; Cormack, Gordon V

    2016-01-01

    Information retrieval is the foundation for modern search engines. This textbook offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus -- a multiuser open-source information retrieval system developed by one of the authors and available online -- provides model implementations and a basis for student work. The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. In addition to its classroom use, Information Retrieval will be a valuable reference for professionals in computer science, computer engineering, and software engineering.

  8. Using Induction to Refine Information Retrieval Strategies

    Science.gov (United States)

    Baudin, Catherine; Pell, Barney; Kedar, Smadar

    1994-01-01

    Conceptual information retrieval systems use structured document indices, domain knowledge and a set of heuristic retrieval strategies to match user queries with a set of indices describing the document's content. Such retrieval strategies increase the set of relevant documents retrieved (increase recall), but at the expense of returning additional irrelevant documents (decrease precision). Usually in conceptual information retrieval systems this tradeoff is managed by hand and with difficulty. This paper discusses ways of managing this tradeoff by the application of standard induction algorithms to refine the retrieval strategies in an engineering design domain. We gathered examples of query/retrieval pairs during the system's operation using feedback from a user on the retrieved information. We then fed these examples to the induction algorithm and generated decision trees that refine the existing set of retrieval strategies. We found that (1) induction improved the precision on a set of queries generated by another user, without a significant loss in recall, and (2) in an interactive mode, the decision trees pointed out flaws in the retrieval and indexing knowledge and suggested ways to refine the retrieval strategies.
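
    The recall/precision tradeoff described above can be shown with a small worked example. This is a generic sketch using the standard set-based definitions; the document identifiers and the two result sets are invented for illustration and do not come from the paper.

```python
def precision_recall(retrieved: set[str], relevant: set[str]) -> tuple[float, float]:
    """Set-based precision and recall for one query."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical relevance judgments for a single query.
relevant = {"d1", "d2", "d3", "d4"}

# A strict strategy returns few documents; a broad one returns many.
strict = {"d1", "d2"}
broad = {"d1", "d2", "d3", "d5", "d6", "d7"}

print("strict:", precision_recall(strict, relevant))
print("broad: ", precision_recall(broad, relevant))
```

    Here the broad strategy finds one more relevant document (higher recall) but also returns three irrelevant ones (lower precision), which is the tradeoff the induction step in the paper is meant to manage.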

  9. Modelling and Retrieving Audiovisual Information - A Soccer Video Retrieval System

    NARCIS (Netherlands)

    Woudstra, A.; Velthausz, D.D.; Poot, de H.J.G.; Moelaart El-Hadidy, F.; Jonker, W.; Houtsma, M.A.W.; Heller, R.G.; Heemskerk, J.N.H.

    1998-01-01

    This paper describes the results of an ongoing collaborative project between KPN Research and the Telematics Institute on multimedia information handling. The focus of the paper is the modelling and retrieval of audiovisual information. The paper presents a general framework for modeling multimedia

  10. Data Fusion in Information Retrieval

    CERN Document Server

    Wu, Shengli

    2012-01-01

    The technique of data fusion has been used extensively in information retrieval due to the complexity and diversity of tasks involved such as web and social networks, legal, enterprise, and many others. This book presents both a theoretical and empirical approach to data fusion. Several typical data fusion algorithms are discussed, analyzed and evaluated. A reader will find answers to the following questions, among others:
    - What are the key factors that affect the performance of data fusion algorithms significantly?
    - What conditions are favorable to data fusion algorithms?
    - CombSum and CombMNZ: which one is better, and why?
    - What is the rationale of using the linear combination method?
    - How can the best fusion option be found under any given circumstances?
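
    For readers unfamiliar with the two methods named in the questions above, here is a minimal sketch of CombSUM and CombMNZ as they are commonly defined. The min-max normalization step, the function names and the two example runs are assumptions made for illustration; the book may normalize or define the methods differently.

```python
def min_max_normalize(scores: dict[str, float]) -> dict[str, float]:
    """Rescale one run's scores to [0, 1] (assumed normalization)."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def comb_sum(runs: list[dict[str, float]]) -> dict[str, float]:
    """CombSUM: add up each document's normalized scores across runs."""
    fused: dict[str, float] = {}
    for run in runs:
        for doc, score in min_max_normalize(run).items():
            fused[doc] = fused.get(doc, 0.0) + score
    return fused


def comb_mnz(runs: list[dict[str, float]]) -> dict[str, float]:
    """CombMNZ: CombSUM score times the number of runs returning the document."""
    sums = comb_sum(runs)
    counts = {doc: sum(doc in run for run in runs) for doc in sums}
    return {doc: sums[doc] * counts[doc] for doc in sums}


# Two hypothetical result lists from different retrieval systems.
run_a = {"d1": 3.0, "d2": 1.0, "d3": 2.0}
run_b = {"d2": 8.0, "d3": 6.0, "d4": 4.0}

print(comb_sum([run_a, run_b]))
print(comb_mnz([run_a, run_b]))
```

    In this example CombSUM ties d1, d2 and d3, while CombMNZ ranks d2 and d3 above d1 because both systems returned them. Whether that extra reward helps in practice is one of the questions the book sets out to answer.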

  11. Systematic review of the types of methods and approaches used to assess the effectiveness of healthcare information websites.

    Science.gov (United States)

    Tieman, Jennifer; Bradley, Sandra L

    2013-01-01

    The aim of this systematic review was to identify types of approaches and methods used to evaluate the effectiveness of healthcare information websites. Simple usage data may not be sufficient to assess whether desired healthcare outcomes were achieved or to determine the relative effectiveness of different web resources on the same health topic. To establish the state of the knowledge base on assessment methods used to determine the effectiveness of healthcare websites, a structured search of the literature was conducted in Ovid Medline, resulting in the retrieval of 1611 articles, of which 240 met the inclusion criteria for the present review. The present review found that diverse evaluation methods were used to measure the effectiveness of healthcare websites. These evaluation methods were used during development, before release and after release. Economic assessment was rare and most evaluations looked at content issues, such as readability scores. Several studies did try to assess the usefulness of websites, but few studies looked at behaviour change or knowledge transfer following engagement with the designated health website. To assess the effectiveness of the knowledge transfer of healthcare information through the online environment, multiple methods may need to be used to evaluate healthcare websites and may need to be undertaken at all stages of the website development process.

  12. Trust in health information websites: A systematic literature review on the antecedents of trust.

    Science.gov (United States)

    Kim, Yeolib

    2016-06-01

    Health websites are important sources of information for consumers. In choosing websites, trust in websites largely determines which website to access and how to best utilize the information. Thus, it is critical to understand why consumers trust certain websites and distrust others. A systematic literature review was conducted with the goal of identifying the antecedents of trust in health information websites. After four rounds of screening, 20 articles published between 2000 and 2013 were harvested. Factors that determine trust are classified into individual difference antecedents, website-related antecedents, and consumer-to-website interaction-related antecedents. The most frequently studied antecedents were socio-demographics, information quality, appearance, and perceived reputation of the website. Each antecedent of trust is discussed in detail and future research directions are proposed.

  13. Information Retrieval beyond the Text Document.

    Science.gov (United States)

    Rui, Yong; Ortega, Michael; Huang, Thomas S.; Mehrotra, Sharad

    1999-01-01

    Reports some of the progress made over the years toward exploring information beyond the text domain. Describes the Multimedia Analysis and Retrieval Systems (MARS), developed to increase access to non-textual information. Addresses the following aspects of MARS: (1) visual feature extraction; (2) retrieval models; (3) query reformulation…

  14. Applications of Optical Technology: Information Retrieval.

    Science.gov (United States)

    O'Connor, Mary Ann

    1991-01-01

    Discusses applications of optical technology, especially CD-ROMs, to information management needs. Information retrieval problems are discussed; design questions that concern the format of the data, indexing methods, and retrieval capabilities are presented; the need for updates is considered; access requirements are discussed; and the importance…

  15. Expert Systems and Intelligent Information Retrieval.

    Science.gov (United States)

    Brooks, H. M.

    1987-01-01

    Explores what an intelligent information retrieval system involves and why expert system techniques might interest system designers. Expert systems research is reviewed with emphasis on components, architecture, and computer interaction, and it is concluded that information retrieval is not an ideal problem domain for expert system application at…

  16. Progress in Documentation: Pictorial Information Retrieval.

    Science.gov (United States)

    Enser, P. G. B.

    1995-01-01

    Surveys theoretical and practical issues associated with pictorial information retrieval. Concentrating on still and moving pictorial forms of the visual image, this paper focuses on indexing pictorial material and discusses four models of pictorial information retrieval corresponding with permutations of the verbal and visual modes for the…

  17. Information Retrieval Interaction: an Analysis of Models

    Directory of Open Access Journals (Sweden)

    Farahnaz Sadoughi

    2012-03-01

    Information searching is an interactive process: users have control over the search and can manage its results. During this process, the user's question matures in light of the retrieved results. In addition, on the side of the information retrieval system, some processes cannot be carried out except by the user. In practice, this is most evident in “interaction” (i.e., the process by which the user connects to other system elements) and in “relevance judgment”. This paper first looks at the presence of interaction in information retrieval. It then reviews the traditional model of information retrieval and its strengths and weaknesses. Finally, it describes the current models of interactive information retrieval: Belkin's episodic model, Ingwersen's cognitive model, Saracevic's stratified model, and Spink's interactive feedback model.

  18. SEMANTIC TERM BASED INFORMATION RETRIEVAL USING ONTOLOGY

    Directory of Open Access Journals (Sweden)

    J. Mannar Mannan

    2014-01-01

    Information searching and retrieval is a challenging task in traditional keyword-based textual information retrieval systems. In the growing information age, with huge amounts of data added every day, the search problem has grown as well. Keyword-based retrieval systems return many junk documents that are irrelevant to the query. To address these limitations, this paper proposes using query terms together with semantic terms for information retrieval, with reference to multiple ontologies. A user query sometimes reflects multiple domains of interest, which requires collecting semantically related ontologies. If no related ontology exists, the WordNet ontology is used to retrieve semantic terms related to the query term. In this approach, classes in the ontology are derived as semantically related text keywords, and these keywords are used to rank the documents.
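
    The approach in this abstract can be sketched roughly as follows. Everything here is a hypothetical stand-in: the two dictionaries play the role of a domain ontology and of a general WordNet-style fallback, and the term-count scoring is a placeholder, since the abstract does not give the paper's actual ranking function.

```python
# Hypothetical stand-ins for a domain ontology and a general fallback
# resource; real systems would query actual ontologies and WordNet.
DOMAIN_ONTOLOGY = {"jaguar": ["panthera", "felid"]}
GENERAL_FALLBACK = {"fast": ["quick", "rapid"]}


def expand_query(query: str) -> list[str]:
    """Add semantically related terms, preferring the domain ontology."""
    terms = []
    for term in query.lower().split():
        terms.append(term)
        related = DOMAIN_ONTOLOGY.get(term) or GENERAL_FALLBACK.get(term, [])
        terms.extend(related)
    return terms


def rank(documents: dict[str, str], query: str) -> list[str]:
    """Rank documents by how often expanded query terms occur (toy scoring)."""
    terms = expand_query(query)
    scores = {
        doc_id: sum(text.lower().split().count(t) for t in terms)
        for doc_id, text in documents.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


docs = {
    "a": "The felid Panthera onca is a rapid hunter",
    "b": "Car dealership opening hours",
}
print(expand_query("fast jaguar"))
print(rank(docs, "fast jaguar"))
```

    Document "a" contains none of the original query words, yet it ranks first because it matches the expanded semantic terms, which is the effect the paper is aiming for.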

  19. BIR 2014 - Bibliometric-enhanced Information Retrieval

    DEFF Research Database (Denmark)

    This first “Bibliometric-enhanced Information Retrieval” (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although...... analysis of co-authorship network, can improve retrieval services for specific communities, as well as for large, cross-domain collections. This workshop aims to raise awareness of the missing link between information retrieval (IR) and bibliometrics / scientometrics and to create a common ground...... for the incorporation of bibliometric-enhanced services into retrieval at the digital library interface. Our interests include information retrieval, information seeking, science modelling, network analysis, and digital libraries. The goal is to apply insights from bibliometrics, scientometrics, and informetrics...

  20. Bibliometric-enhanced information retrieval

    NARCIS (Netherlands)

    Mayr, Philipp; Scharnhorst, Andrea; Larsen, Birger; Schaer, Philipp; Mutschke, Peter

    2014-01-01

    Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of coauthorship network, can im

  1. Information Retrieval and the Philosophy of Language.

    Science.gov (United States)

    Blair, David C.

    2003-01-01

    Provides an overview of some of the main ideas in the philosophy of language that have relevance to the issues of information retrieval, focusing on the description of the intellectual content. Highlights include retrieval problems; recall and precision; words and meanings; context; externalism and the philosophy of language; and scaffolding and…

  2. Acquisition and retrieval of ophthalmology academic information

    Directory of Open Access Journals (Sweden)

    Lei Li

    2014-06-01

    This article discusses how to search for and access ophthalmology information through specialized websites and resources, introducing databases, search engines, electronic journals, electronic books, and so on. It aims to help ophthalmic practitioners carry out scientific research and clinical practice.

  3. Enhanced Trustworthy and High-Quality Information Retrieval System for Web Search Engines

    CERN Document Server

    Ramachandran, S; Joseph, S; Ramaraj, V

    2009-01-01

    The WWW is the most important source of information, but there is no guarantee that the information is correct: search engines retrieve a great deal of conflicting information, and its quality varies from low to high. We provide enhanced trustworthiness for both specific (entity) and broad (content) queries in web searching. Trustworthiness filtering is based on five factors: Provenance, Authority, Age, Popularity, and Related Links. The trustworthiness is calculated from these five factors and stored, which improves performance when retrieving trustworthy websites. The calculated trustworthiness is stored only for static websites. Quality is provided based on policies selected by the user. Quality-based ranking of the retrieved trusted information is provided using the WIQA (Web Information Quality Assessment) Framework.
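
    As an illustration only, the five-factor calculation described above could be sketched as a weighted average, with stored scores restricted to static sites. The abstract does not give the actual formula, so the equal default weights, the [0, 1] factor scale, and the names `trust_score` and `TrustCache` are all assumptions made for this sketch.

```python
from typing import Dict, Optional

# The five factors named in the abstract; the [0, 1] scale is an assumption.
FACTORS = ("provenance", "authority", "age", "popularity", "related_links")


def trust_score(factors: Dict[str, float],
                weights: Optional[Dict[str, float]] = None) -> float:
    """Return the weighted mean of the five factor scores (hypothetical formula)."""
    if weights is None:
        weights = {name: 1.0 for name in FACTORS}  # equal weights by default
    for name in FACTORS:
        if name not in factors:
            raise ValueError(f"missing factor: {name}")
        if not 0.0 <= factors[name] <= 1.0:
            raise ValueError(f"factor {name} must lie in [0, 1]")
    total_weight = sum(weights[name] for name in FACTORS)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(weights[name] * factors[name] for name in FACTORS) / total_weight


class TrustCache:
    """Stores computed scores for static sites only, as the abstract describes."""

    def __init__(self) -> None:
        self._scores: Dict[str, float] = {}

    def get_or_compute(self, url: str, factors: Dict[str, float],
                       is_static: bool) -> float:
        if is_static and url in self._scores:
            return self._scores[url]  # reuse the stored score
        score = trust_score(factors)
        if is_static:
            self._scores[url] = score  # scores for dynamic sites are never stored
        return score
```

    With equal weights the score is simply the mean of the five factors; a real system would presumably tune the weights and define how each factor is measured.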

  4. Concept Framework for Audio Information Retrieval: ARF

    Institute of Scientific and Technical Information of China (English)

    LI GuoHui(李国辉); WU DeFeng(武德峰); ZHANG Jun(张军)

    2003-01-01

    The majority of research on content-based retrieval has focused on visual media. However, audio is also an important medium and information carrier from the viewpoint of human auditory perception, so retrieval from audio collections is needed. Audio is handled by conventional methods as an opaque stream medium, which is not suitable for information retrieval by its content. In fact, audio carries rich aural information in the form of speech, music, and sound effects, so it can be retrieved based on its aural content, such as acoustic features, musical melodies and associated semantics. In this paper, a concept framework (ARF) for content-based audio retrieval is proposed from a systematic perspective, which describes the audio content model, audio retrieval architecture and audio query schemes. Audio contents are represented by a hierarchical model and a set of formal descriptions from the physical to the acoustic to the semantic level, which depict the acoustic features, logical structure and semantics of audio and audio objects. The architecture, consisting of an audio meta-database and populating and accessing modules, presents a system structure view of audio information retrieval. The query schemes give generalized approaches and modes concerning how users deliver audio information needs to audio collections. Finally, an implemented audio retrieval example is used to explain and specify the application of the components in the proposed ARF.

  5. Parsimonious Language Models for Information Retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Robertson, Stephen; Zaragoza, Hugo

    2004-01-01

    We systematically investigate a new approach to estimating the parameters of language models for information retrieval, called parsimonious language models. Parsimonious language models explicitly address the relation between levels of language models that are typically used for smoothing. As such,

  6. Current challenges in patent information retrieval

    CERN Document Server

    Lupu, Mihai; Kando, Noriko

    2017-01-01

    Intellectual property in the form of patents plays a vital role in today's increasingly knowledge-based economy. This book assembles state-of-the-art research and is intended to illustrate innovative approaches to patent information retrieval.

  7. Introducing Multimedia Information Retrieval to libraries

    OpenAIRE

    2016-01-01

    The article aims to introduce libraries to the view that operating within the terms of traditional Information Retrieval (IR) through textual language alone is limiting, and that it is instead necessary to consider the broader criteria of Multimedia Information Retrieval (MIR). The article traces the history of the fundamental principles of MIR, from the early years of debate on documentation to today's theories on semantic meaning. ...

  8. Phase retrieval with prior information.

    Science.gov (United States)

    Irwan, R; Lane, R G

    1998-09-01

    An algorithm for phase retrieval with Bayesian statistics is discussed. It is shown how the statistics of Kolmogorov turbulence can be used to compute the likelihood for a particular phase screen. This likelihood is then added to that of the observed data to produce a functional that is maximized directly by use of conjugate gradient maximization. It is shown that although this can significantly improve the quality of the phase estimate, the issue is complicated by local maxima introduced by the possibility of phase wrapping. The causes of the local maxima are analyzed, and a method that increases the likelihood of convergence to the global maximum is presented.
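
    Read in standard Bayesian terms, the functional described above is a log-posterior. The following is a sketch under assumptions not stated in the abstract: $\phi$ denotes the sampled phase screen, $d$ the observed data, and the Kolmogorov statistics are taken to enter as a zero-mean Gaussian prior with covariance $C_\phi$.

```latex
J(\phi) = \ln p(d \mid \phi) + \ln p(\phi),
\qquad
\ln p(\phi) = -\tfrac{1}{2}\,\phi^{\mathsf{T}} C_\phi^{-1}\,\phi + \text{const},
\qquad
\hat{\phi} = \arg\max_{\phi} J(\phi).
```

    If the data depend on the phase only through $e^{i\phi}$, the data term is unchanged when $2\pi$ is added to individual samples while the prior term is not, which is one way to see how phase wrapping can produce local maxima of $J$.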

  9. Using ontology for domain specific information retrieval

    Science.gov (United States)

    Shashirekha, H. L.; Murali, S.; Nagabhushan, P.

    2010-02-01

    This paper presents a system for retrieving information from a domain specific document collection made up of data rich unnatural language text documents. Instead of conventional keyword based retrieval, our system makes use of a domain ontology to retrieve information from a collection of documents. The system addresses the problem of representing unnatural language text documents and constructing a classifier model that helps in the efficient retrieval of relevant information. A query to this system may be either key phrases expressed in terms of concepts or a domain specific unnatural language text document. The classifier used in this system can also be used to assign multiple labels to a previously unseen text document belonging to the same domain. An empirical evaluation of the system is conducted on text documents consisting of classified matrimonial advertisements to determine its performance.

  10. Evaluation of Web-Based Consumer Medication Information: Content and Usability of 4 Australian Websites.

    Science.gov (United States)

    Raban, Magdalena Z; Tariq, Amina; Richardson, Lauren; Byrne, Mary; Robinson, Maureen; Li, Ling; Westbrook, Johanna I; Baysari, Melissa T

    2016-07-21

    Medication is the most common intervention in health care, and written medication information can affect consumers' medication-related behavior. Research has shown that a large proportion of Australians search for medication information on the Internet. This study aimed to evaluate the medication information content, based on consumer medication information needs, and the usability of 4 Australian health websites: Better Health Channel, myDr, healthdirect, and NPS MedicineWise. To assess website content, the most common consumer medication information needs were identified using (1) medication queries to the healthdirect helpline (a telephone helpline available across most of Australia) and (2) the most frequently used medications in Australia. The most frequently used medications were extracted from Australian government statistics on use of subsidized medicines in the community and the National Census of Medicines Use. Each website was assessed to determine whether it covered or partially covered information and advice about these medications. To assess website usability, 16 consumers participated in user testing wherein they were required to locate 2 pieces of medication information on each website. Brief semistructured interviews were also conducted with participants to gauge their opinions of the websites. Information on prescription medication was more comprehensively covered on all websites (3 of 4 websites covered 100% of information) than nonprescription medication (websites covered 0%-67% of information). Most websites relied on consumer medicines information leaflets to convey prescription medication information to consumers. Information about prescription medication classes was less comprehensive, with no website providing all information examined about antibiotics and antidepressants. Participants (n=16) were able to locate medication information on websites in most cases (accuracy ranged from 84% to 91%). However, a number of usability issues relating to website

  11. Web information retrieval based on ontology

    Science.gov (United States)

    Zhang, Jian

    2013-03-01

    The purpose of Information Retrieval (IR) is to find a set of documents that are relevant to a specific information need of a user. The traditional Information Retrieval model commonly used in commercial search engines is based on a keyword indexing system and Boolean logic queries. One big drawback of traditional information retrieval is that it typically retrieves information without an explicitly defined domain of interest to the user, so that a lot of irrelevant information is returned and the user is burdened with picking useful answers out of these irrelevant results. To tackle this issue, many semantic web information retrieval models have been proposed recently. The main advantage of the Semantic Web is that it enhances search mechanisms through the use of ontologies. In this paper, we present our approach to personalizing a web search engine based on ontology. In addition, key techniques are also discussed. Compared to previous research, our work concentrates on semantic similarity and on the whole process, including query submission and information annotation.

  12. A Novel Fuzzy Document Based Information Retrieval Model for Forecasting

    Directory of Open Access Journals (Sweden)

    Partha Roy

    2017-06-01

    Information retrieval systems are generally used to find the documents that are most appropriate to a query that comes dynamically from users. In this paper a novel Fuzzy Document based Information Retrieval Model (FDIRM) is proposed for the purpose of stock market index forecasting. The novelty of the proposed approach is a modified tf-idf scoring scheme to predict the future trend of the stock market index. The contribution of this paper has two dimensions: (1) in the proposed system the simple time series is converted to an enriched fuzzy linguistic time series with a unique approach of incorporating market sentiment related information along with the price, and (2) a unique approach is followed while modeling the information retrieval (IR) system, which converts a simple IR system into a forecasting system. From the performance comparison of FDIRM with standard benchmark models it can be affirmed that the proposed model has the potential of becoming a good forecasting model. The stock market data provided by the Standard & Poor’s CRISIL NSE Index 50 (CNX NIFTY-50) index of the National Stock Exchange of India (NSE) is used to experiment with and validate the proposed model. The authentic data for validation and experimentation is obtained from http://www.nseindia.com, which is the official website of NSE. A Java program is under construction to implement the model in real time with a graphical user interface.
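
    The abstract does not say how the tf-idf scheme is modified, so the following shows only the textbook tf-idf weighting as a baseline, not the FDIRM scoring itself. Treating each "document" as a list of fuzzy linguistic labels (such as "rise" or "fall") is a hypothetical encoding chosen for illustration, and the function name `tf_idf` is likewise an assumption.

```python
import math
from collections import Counter
from typing import Dict, List


def tf_idf(docs: List[List[str]]) -> List[Dict[str, float]]:
    """Textbook tf-idf: relative term frequency times log(N / document frequency)."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs at least once.
    doc_freq: Counter = Counter()
    for doc in docs:
        doc_freq.update(set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical windows of a linguistic time series, one label per trading day.
windows = [["rise", "rise", "fall"], ["rise", "flat"]]
weights = tf_idf(windows)
```

    In this baseline a label that occurs in every window (here "rise") gets weight zero, while labels specific to one window are weighted up.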

  13. Activities of information retrieval in Daicel Corporation : The roles and efforts of information retrieval team

    Science.gov (United States)

    Yamazaki, Towako

    In order to stabilize and improve the quality of its information retrieval service, the information retrieval team of Daicel Corporation has worked on standard operating procedures, an interview sheet for information retrieval requests, a structured format for search reports, and search expressions for some of Daicel's technological fields. These activities will also lead to skill sharing and skill transfer between searchers. In addition, skills need to be improved not only by individual searchers but also by the information retrieval team as a whole as searchers take on new roles.

  14. Editorial: Hyperlinks and Their Roles in Web Information Retrieval

    Directory of Open Access Journals (Sweden)

    Alireza Noruzi

    2005-10-01

    A web page generally includes elements such as text, hyperlinks, images, etc. A hyperlink represents a relationship between two web pages or between sections of the same page. Understanding the hyperlink structure is fundamental to understanding the Web connectivity structure, because hyperlinks have been used in web indexing and information retrieval, as well as page ranking. If the Web were a car, hyperlinks would be the engine, because without them, we are not going anywhere. It can be concluded that search engines consider any words used by other sites to describe a site to be particularly relevant, even if the keywords are not used in the backlinked site/page (the linked target destination). In other words, foreign-language text links give the linked sites a chance to be retrieved as relevant results in response to a search query. Many search engines judge the linking page partly based on the quality of the linked page, and if many sites backlinking to a site use keywords in their link text, search engines will raise its ranking for those keywords. Ultimately, backlinks from popular websites with a higher ranking have a higher weight than backlinks from smaller, unknown websites.

  15. Language-based multimedia information retrieval

    OpenAIRE

    De Jong; Gauvain, J.L.; Hiemstra, D; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material by use of human language technologies. Thus, in contrast to image or sound-based retrieval methods, where both the query language and the indexing methods build on non-linguistic data, these methods...

  16. Exploiting salient semantic analysis for information retrieval

    Science.gov (United States)

    Luo, Jing; Meng, Bo; Quan, Changqin; Tu, Xinhui

    2016-11-01

    Recently, many Wikipedia-based methods have been proposed to improve the performance of different natural language processing (NLP) tasks, such as semantic relatedness computation, text classification and information retrieval. Among these methods, salient semantic analysis (SSA) has been proven to be an effective way to generate conceptual representations for words or documents. However, its feasibility and effectiveness in information retrieval are mostly unknown. In this paper, we study how to efficiently use SSA to improve information retrieval performance, and propose an SSA-based retrieval method under the language model framework. First, the SSA model is adopted to build conceptual representations for documents and queries. Then, these conceptual representations and the bag-of-words (BOW) representations are used in combination to estimate the language models of queries and documents. The proposed method is evaluated on several standard Text REtrieval Conference (TREC) collections. Experimental results show that the proposed models consistently outperform the existing Wikipedia-based retrieval methods.

  17. Hybrid Information Retrieval Model For Web Images

    CERN Document Server

    Bassil, Youssef

    2012-01-01

    The Big Bang of the Internet in the early 90's dramatically increased the number of images being distributed and shared over the web. As a result, image information retrieval systems were developed to index and retrieve image files spread over the Internet. Most of these systems are keyword-based: they search for images based on their textual metadata, and thus they are imprecise, because describing an image in human language is inherently vague. In addition, there are content-based image retrieval systems, which search for images based on their visual information. However, content-based systems are still immature and not very effective, as they suffer from low retrieval recall and precision. This paper proposes a new hybrid image information retrieval model for indexing and retrieving web images published in HTML documents. The distinguishing mark of the proposed model is that it is based on both graphical content and textual metadata. The graphical content is denoted by color features and color histogram of ...

  18. Treatments and services for neurodevelopmental disorders on advocacy websites: Information or evaluation?

    DEFF Research Database (Denmark)

    Di Pietro, Nina C; Whiteley, Louise Emma; Illes, Judy

    2011-01-01

    disorder (FASD)—inform stakeholders about treatment options, and discuss the ethical challenges inherent in providing such information online. We identified major advocacy websites for each disorder and assessed website accountability, the number, attributes, and accessibility of treatments described......, and the valence of treatment information. With the exception of FASD websites, we found that advocacy websites provide a plethora of information about a wide variety of readily available products and services. Treatment information is primarily targeted at families and is overwhelmingly encouraging, regardless...... of the type or conventionality of treatments. Many websites acknowledge corporate sponsors. While the majority do not overtly advertise or endorse specific brands, they also do not prominently display disclaimers about the nature and intent of treatment information. Thus, while advocacy websites are organized...

  19. Intelligent WWW Information Retrieval and Its Application in Voice Website

    Institute of Scientific and Technical Information of China (English)

    李梅; 刘文; 王庆林

    2001-01-01

    A scheme for designing a voice Website-based intelligent WWW information retrieval system is presented. It provides a convenient and rapid information retrieval service for more users. Combining the advantages of full-text retrieval and intelligent retrieval, it can improve the response speed, recall ratio and pertinency ratio. Meanwhile, by using ASR, TIS and natural language processing, it can enlarge the scale of Internet users.

  20. Learning to rank for information retrieval

    CERN Document Server

    Liu, Tie-Yan

    2011-01-01

    Due to the fast growth of the Web and the difficulties in finding desired information, efficient and effective information retrieval systems have become more important than ever, and the search engine has become an essential tool for many people. The ranker, a central component in every search engine, is responsible for the matching between processed queries and indexed documents. Because of its central role, great attention has been paid to the research and development of ranking technologies. In addition, ranking is also pivotal for many other information retrieval applications, such as coll

  1. Website quality, expectation, confirmation, and end user satisfaction: the knowledge-intensive website of the Korean National Cancer Information Center.

    Science.gov (United States)

    Koo, Chulmo; Wati, Yulia; Park, Keeho; Lim, Min Kyung

    2011-11-02

    The fact that patient satisfaction with primary care clinical practices and physician-patient communications has decreased gradually has brought a new opportunity to the online channel as a supplementary service to provide additional information. In this study, our objectives were to examine the process of cognitive knowledge expectation-confirmation from eHealth users and to recommend the attributes of a "knowledge-intensive website." Knowledge expectation can be defined as users' existing attitudes or beliefs regarding expected levels of knowledge they may gain by accessing the website. Knowledge confirmation is the extent to which user's knowledge expectation of information systems use is realized during actual use. In our hypothesized research model, perceived information quality, presentation and attractiveness as well as knowledge expectation influence knowledge confirmation, which in turn influences perceived usefulness and end user satisfaction, which feeds back to knowledge expectation. An empirical study was conducted at the National Cancer Center (NCC), Republic of Korea (South Korea), by evaluating its official website. A user survey was administered containing items to measure subjectively perceived website quality and expectation-confirmation attributes. A study sample of 198 usable responses was used for further analysis. We used the structural equation model to test the proposed research model. Knowledge expectation exhibited a positive effect on knowledge confirmation (beta = .27, P knowledge confirmation were also positive and significant (beta = .24, P knowledge confirmation on perceived usefulness was also positively significant (beta = .64, P Knowledge expectation together with knowledge confirmation and perceived usefulness also significantly affected end user satisfaction (beta = .22 P knowledge-intensive website attributes, (2) enhanced the theoretical foundation of eHealth from the information systems (IS) perspective by adopting the

  2. Teaching a Heuristic Approach to Information Retrieval.

    Science.gov (United States)

    Ury, Connie Jo; And Others

    1997-01-01

    Discusses lifelong learning and the need for information retrieval skills, and describes how Northwest Missouri State University incorporates a heuristic model of library instruction in which students continually evaluate and refine information-seeking practices while progressing through all levels of courses in diverse disciplines. (Author/LRW)

  3. Diversity cues on recruitment websites: investigating the effects on job seekers' information processing.

    Science.gov (United States)

    Walker, H Jack; Feild, Hubert S; Bernerth, Jeremy B; Becton, J Bret

    2012-01-01

    Although job seekers' motivation to process the information encountered during recruitment partially influences recruitment success, little is known about what motivates more thorough information processing. To address this issue, we integrated recruitment and social information processing theories to examine the possibility that diversity cues on recruitment websites influence website viewers' processing of presented information. Utilizing a controlled experiment and a hypothetical organization, Study 1 revealed that both Blacks and Whites spent more time viewing recruitment websites and better recalled website information when the sites included racial diversity cues. These relationships were stronger for Blacks, and organizational attractiveness perceptions mediated these effects for Blacks but not for Whites. Study 2 found similar relationships for Black and White participants viewing real organizational recruitment websites after taking into account perceived organizational attributes and website design effects. Implications of these findings for recruiting organizations are discussed.

  4. BIRS - Bioterrorism Information Retrieval System.

    Science.gov (United States)

    Tewari, Ashish Kumar; Rashi; Wadhwa, Gulshan; Sharma, Sanjeev Kumar; Jain, Chakresh Kumar

    2013-01-01

    Bioterrorism is the intentional use of pathogenic strains of microbes to spread terror in a population. There is a definite need to promote research for the development of vaccines, therapeutics and diagnostic methods as part of preparedness for any bioterror attack in the future. BIRS is an open-access database of collective information on the organisms related to bioterrorism. The architecture of the database utilizes current open-source technology, viz. PHP ver 5.3.19, MySQL and IIS server under the Windows platform, for database design. The database stores information on literature, generic information and unique pathways of about 10 microorganisms involved in bioterrorism. This may serve as a collective repository to accelerate the drug discovery and vaccine design process against such bioterrorist agents (microbes). The available data has been validated from various online resources and literature mining in order to provide the user with a comprehensive information system. The database is freely available at http://www.bioterrorism.biowaves.org.

  5. Emergent web intelligence advanced information retrieval

    CERN Document Server

    Badr, Youakim; Abraham, Ajith; Hassanien, Aboul-Ella

    2010-01-01

    Web Intelligence explores the impact of artificial intelligence and advanced information technologies representing the next generation of Web-based systems, services, and environments, and designing hybrid web systems that serve wired and wireless users more efficiently. Multimedia and XML-based data are produced regularly and in increasing volume in our daily digital activities, and their retrieval must be explored and studied in this emergent web-based era. 'Emergent Web Intelligence: Advanced Information Retrieval' provides reviews of the related cutting-edge technologies and insights. It is v

  6. Role of Ontology in Information Retrieval

    Institute of Scientific and Technical Information of China (English)

    WU Dan; WANG Hui-lin

    2006-01-01

    Based on the comparison between ontology and thesaurus, and the analysis of an ontology-based Information Retrieval (IR) model, the potential advantages that ontology may contribute to IR are analyzed. Then a general architecture of ontology-based Information Retrieval System (IRS) and the approach of constructing it are presented. Based on the researches, the role of ontology in IR is summarized from four aspects and a typical system called Textpresso is analyzed. Finally, a conclusion is drawn that utilizing ontology is the trend of IR and can really improve the IRS.

  7. Evaluation of Quality and Readability of Health Information Websites Identified through India's Major Search Engines.

    Science.gov (United States)

    Raj, S; Sharma, V L; Singh, A J; Goel, S

    2016-01-01

    Background. The health information available on websites should be reliable and accurate so that the community can make informed decisions. This study was done to assess the quality and readability of health information websites on the World Wide Web in India. Methods. This cross-sectional study was carried out in June 2014. The key words "Health" and "Information" were used on the search engines "Google" and "Yahoo." Out of 50 websites (25 from each search engine), after exclusion, 32 websites were evaluated. The LIDA tool was used to assess quality, whereas readability was assessed using the Flesch Reading Ease Score (FRES), Flesch-Kincaid Grade Level (FKGL), and SMOG. Results. Forty percent of websites (n = 13) were sponsored by government. Health On the Net Code of Conduct (HONcode) certification was present on 50% (n = 16) of websites. The mean LIDA score (74.31) was average. Only 3 websites scored high on LIDA score. Only five had readability scores at the recommended sixth-grade level. Conclusion. Most health information websites had average quality, especially in terms of usability and reliability, and were written at high readability levels. Efforts are needed to develop health information websites that can help the general population in informed decision making.
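
    For reference, the three readability measures named above have standard published formulas, which can be sketched as follows. This is not the study's code: the function names are illustrative, and the word, sentence, syllable and polysyllable counts are taken as inputs because automatic syllable counting is heuristic and is left out here.

```python
import math


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch Reading Ease Score (FRES); higher scores mean easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Flesch-Kincaid Grade Level (FKGL); approximates a US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def smog_grade(polysyllables: int, sentences: int) -> float:
    """SMOG grade; polysyllables are words with three or more syllables."""
    return 1.0430 * math.sqrt(polysyllables * 30 / sentences) + 3.1291


# Hypothetical counts for a short passage: 100 words, 5 sentences, 150 syllables.
fres = flesch_reading_ease(100, 5, 150)
fkgl = flesch_kincaid_grade(100, 5, 150)
```

    FRES runs in the opposite direction to the two grade-level measures: a text at the sixth-grade level has a low FKGL or SMOG value but a comparatively high FRES.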

  8. Applications Of Informetrics To Information Retrieval Research

    Directory of Open Access Journals (Sweden)

    Dietmar Wolfram

    2000-01-01

    A non-technical overview of two primary areas of study within the discipline of information science, information retrieval (IR) and informetrics, is presented. Informetric properties of IR systems as the basis for understanding IR system structure and generalizing human information seeking in electronic environments are discussed. Applications of informetric study of IR systems for more efficient and effective design and evaluation of IR systems are also presented.

  9. Test OSIRIS (On Line Search Information Retrieval Information Storage).

    Science.gov (United States)

    Showalther, A. Kenneth

    The OSIRIS system is a prototype information retrieval system having the following components: an automated microfiche file having a capacity of 5000 punch card sized microfiche with a remote control 21 inch TV console for retrieving, magnifying (0-250X), and displaying any of the images on the microfiche; and a remote computer terminal for the…

  10. Teaching Fifth Graders Electronic Information Retrieval Skills.

    Science.gov (United States)

    Christy, Annette

    Fifth graders were taught to use an electronic card catalog to retrieve information and materials for class assignments and leisure reading materials. Groups of 10 or 12 students were seen twice a week for periods lasting up to 30 minutes. At these sessions they were introduced to computer components, proper handling, how to log into a network…

  11. Formalizing Evaluation in Music Information Retrieval

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2013-01-01

    We develop a formalism to disambiguate the evaluation of music information retrieval systems. We define a "system," what it means to "analyze" one, and make clear the aims, parts, design, execution, interpretation, and assumptions of its "evaluation." We apply this formalism to discuss...... the MIREX automatic mood classification task....

  12. Introduction: Natural Language Processing and Information Retrieval.

    Science.gov (United States)

    Smeaton, Alan F.

    1990-01-01

    Discussion of research into information and text retrieval problems highlights the work with automatic natural language processing (NLP) that is reported in this issue. Topics discussed include the occurrences of nominal compounds; anaphoric references; discontinuous language constructs; automatic back-of-the-book indexing; and full-text analysis.…

  13. Language-based multimedia information retrieval

    NARCIS (Netherlands)

    de Jong, Franciska M.G.; Gauvain, J.L.; Hiemstra, Djoerd; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material

  14. Strategies for Building Distributed Information Retrieval Systems.

    Science.gov (United States)

    Macleod, Ian A.; And Others

    1987-01-01

    Discussion of the need for distributed information retrieval systems focuses on a model system, Fulcrum FUL/Text. Differences from distributed database management systems are described; system design is discussed; implementation requirements are explained including remote operation calls (ROC's); and a prototype simulation model based on FUL/Text…

  15. Language-based multimedia information retrieval

    NARCIS (Netherlands)

    Jong, de F.M.G.; Gauvain, J.L.; Hiemstra, D.; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material b

  16. Millennial Students' Mental Models of Information Retrieval

    Science.gov (United States)

    Holman, Lucy

    2009-01-01

    This qualitative study examines first-year college students' online search habits in order to identify patterns in millennials' mental models of information retrieval. The study employed a combination of modified contextual inquiry and concept mapping methodologies to elicit students' mental models. The researcher confirmed previously observed…

  17. Language-based multimedia information retrieval

    NARCIS (Netherlands)

    de Jong, Franciska M.G.; Gauvain, J.L.; Hiemstra, Djoerd; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material b

  18. Challenges in Information Retrieval and Language Modeling

    NARCIS (Netherlands)

    Allen, J.; Aslam, J.; Belkin, N.; Buckley, C.; Callan, J.; Croft, W.B.; Dumais, S.; Fuhr, N.; Harman, D.; Harper, D.J.; Hiemstra, D.; Hofmann, T.; Hovey, E.; Kraaij, W.; Lafferty, J.; Lavrenko, V.; Lewis, D.; Liddy, L.; Manmatha, R.; McCallum, A.; Ponte, J.; Prager, J.; Radev, D.; Resnik, P.; Robertson, S.E.; Rosenfeld, R.; Roukos, S.; Sanderson, M.; Schwartz, R.; Singhal, A.; Smeaton, A.; Turtle, H.; Voorhees, E.M.; Weischedel, R.; Xu, J.; Zhai, B.C.

    2003-01-01

    Information retrieval (IR) research has reached a point where it is appropriate to assess progress and to define a research agenda for the next five to ten years. This report summarizes a discussion of IR research challenges that took place at a recent workshop. The attendees of the workshop conside

  19. Treatments and services for neurodevelopmental disorders on advocacy websites: Information or evaluation?

    DEFF Research Database (Denmark)

    Di Pietro, Nina C; Whiteley, Louise Emma; Illes, Judy

    2011-01-01

    The Internet has quickly gained popularity as a major source of health-related information, but its impact is unclear. Here, we investigate the extent to which advocacy websites for three neurodevelopmental disorders—cerebral palsy (CP), autism spectrum disorder (ASD) and fetal alcohol spectrum...... disorder (FASD)—inform stakeholders about treatment options, and discuss the ethical challenges inherent in providing such information online. We identified major advocacy websites for each disorder and assessed website accountability, the number, attributes, and accessibility of treatments described......, and the valence of treatment information. With the exception of FASD websites, we found that advocacy websites provide a plethora of information about a wide variety of readily available products and services. Treatment information is primarily targeted at families and is overwhelmingly encouraging, regardless...

  20. Information retrieval models foundations and relationships

    CERN Document Server

    Roelleke, Thomas

    2013-01-01

    Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR). Regarding in

  1. MIREX: MapReduce Information Retrieval Experiments

    CERN Document Server

    Hiemstra, Djoerd

    2010-01-01

    We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use a cluster of 15 low-cost machines to search a web crawl of 0.5 billion pages, showing that sequential scanning is a viable approach to running large-scale information retrieval experiments with little effort. The code is available to other researchers at: http://mirex.sourceforge.net
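
    The sequential-scanning idea in this record can be illustrated with a minimal single-machine sketch. It is not the MIREX code: the function names (map_document, reduce_query, sequential_scan), the term-count scoring, and the toy documents are all assumptions made for illustration. The map step scores every document against every query, and the reduce step keeps the top-k documents per query.

```python
import heapq
from collections import defaultdict


def map_document(doc_id, text, queries):
    """Map step (hypothetical): emit (query_id, (score, doc_id)) for matching queries."""
    terms = text.lower().split()
    for query_id, query in queries.items():
        # Toy score: total occurrences of the query terms in the document.
        score = sum(terms.count(term) for term in query.lower().split())
        if score > 0:
            yield query_id, (score, doc_id)


def reduce_query(pairs, k=2):
    """Reduce step (hypothetical): keep the k highest-scoring documents for one query."""
    return heapq.nlargest(k, pairs)


def sequential_scan(docs, queries, k=2):
    """Scan every document once, group the emitted pairs by query, then reduce."""
    grouped = defaultdict(list)
    for doc_id, text in docs.items():
        for query_id, pair in map_document(doc_id, text, queries):
            grouped[query_id].append(pair)
    return {query_id: reduce_query(pairs, k) for query_id, pairs in grouped.items()}


# Toy data, invented for this sketch.
docs = {
    "d1": "information retrieval with mapreduce",
    "d2": "cooking pasta at home",
    "d3": "retrieval experiments on a web crawl",
}
queries = {"q1": "retrieval experiments"}
results = sequential_scan(docs, queries)
```

    In a real MapReduce setting the grouping by query would be done by the framework's shuffle phase across machines; here it is simulated with a dictionary.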

  2. The impact of the introduction and use of an informational website on offline customer buying behavior

    NARCIS (Netherlands)

    van Nierop, J. E. M.; Leeflang, P. S. H.; Teerling, M. L.; Huizingh, K. R. E.

    2011-01-01

    Do customers increase or decrease their spending in response to the introduction of an informational website? To answer this question, this study considers the effects of the introduction and use of an informational website by a large national retailer on offline customer buying behavior. More speci

  3. The impact of the introduction and use of an informational website on offline customer buying behavior

    NARCIS (Netherlands)

    van Nierop, J. E. M.; Leeflang, P. S. H.; Teerling, M. L.; Huizingh, K. R. E.

    Do customers increase or decrease their spending in response to the introduction of an informational website? To answer this question, this study considers the effects of the introduction and use of an informational website by a large national retailer on offline customer buying behavior. More

  4. Method of and System for Information Retrieval

    DEFF Research Database (Denmark)

    2015-01-01

    This invention relates to a system for and a method (100) of searching a collection of digital information (150) comprising a number of digital documents (110), the method comprising receiving or obtaining (102) a search query, the query comprising a number of search terms, searching (103) an index...... (300) using the search terms thereby providing information (301) about which digital documents (110) of the collection of digital information (150) that contains a given search term and one or more search related metrics (302; 303; 304; 305; 306), ranking (105) at least a part of the search result......, a method of and a system for information retrieval or searching is readily provided that enhances the searching quality (i.e. the number of relevant documents retrieved and such documents being ranked high) when (also) using queries containing many search terms....

  5. Multilevel resistive information storage and retrieval

    Science.gov (United States)

    Lohn, Andrew; Mickel, Patrick R.

    2016-08-09

    The present invention relates to resistive random-access memory (RRAM or ReRAM) systems, as well as methods of employing multiple state variables to form degenerate states in such memory systems. The methods herein allow for precise write and read steps to form multiple state variables, and these steps can be performed electrically. Such an approach allows for multilevel, high density memory systems with enhanced information storage capacity and simplified information retrieval.

  6. Web information retrieval for health professionals.

    Science.gov (United States)

    Ting, S L; See-To, Eric W K; Tse, Y K

    2013-06-01

    This paper presents a Web Information Retrieval System (WebIRS), which is designed to help healthcare professionals obtain up-to-date medical knowledge and information via the World Wide Web (WWW). The system leverages document classification and text summarization techniques to deliver highly correlated medical information to physicians. The system architecture of the proposed WebIRS is first discussed, and then a case study on an application of the proposed system in a Hong Kong medical organization is presented to illustrate the adoption process. A questionnaire was administered to collect feedback on the operation and performance of WebIRS in comparison with conventional information retrieval on the WWW. A prototype system has been constructed and implemented on a trial basis in a medical organization. It has proven to be of benefit to healthcare professionals through its automatic classification and summarization of the medical information that physicians need and are interested in. The results of the case study show that the proposed WebIRS significantly reduces searching time and effort while retrieving highly relevant materials.

  7. Electronic publishing and intelligent information retrieval

    Science.gov (United States)

    Heck, A.

    1992-01-01

    Europeans are now taking steps to homogenize policies and standardize procedures in electronic publishing (EP) in astronomy and space sciences. This arose from an open meeting organized in Oct. 1991 at Strasbourg Observatory (France) and another business meeting held late Mar. 1992 with the major publishers and journal editors in astronomy and space sciences. The ultimate aim of EP might be considered as the so-called 'intelligent information retrieval' (IIR) or better named 'advanced information retrieval' (AIR), taking advantage of the fact that the material to be published appears at some stage in a machine-readable form. It is obvious that the combination of desktop and electronic publishing with networking and new structuring of knowledge bases will profoundly reshape not only our ways of publishing, but also our procedures of communicating and retrieving information. It should be noted that a world-wide survey among astronomers and space scientists carried out before the October 1991 colloquium on the various packages and machines used, indicated that TEX-related packages were already in majoritarian use in our community. It has also been stressed at each meeting that the European developments should be carried out in collaboration with what is done in the US (STELLAR project, for instance). American scientists and journal editors actually attended both meetings mentioned above. The paper will offer a review of the status of electronic publishing in astronomy and its possible contribution to advanced information retrieval in this field. It will also report on recent meetings such as the 'Astronomy from Large Databases-2 (ALD-2)' conference dealing with the latest developments in networking, in data, information, and knowledge bases, as well as in the related methodologies.

  8. Overcoming terminology barrier using Web resources for cross-language medical information retrieval.

    Science.gov (United States)

    Lu, Wen-Hsiang; Lin, Ray Shih-Jui; Chan, Yi-Che; Chen, Kuan-Hsi

    2006-01-01

    A number of authoritative medical websites, such as PubMed and MedlinePlus, provide consumers with the most up-to-date health information. However, non-English speakers often encounter not only language barriers (from other languages to English) but also terminology barriers (from laypersons' terms to professional medical terms) when retrieving information from these websites. Our previous work addressed language barriers by developing a multilingual medical thesaurus, Chinese-English MeSH, while this study presents an approach to overcome terminology barriers based on Web resources. Two techniques were utilized in our approach: monolingual concept mapping using approximate string matching and crosslingual concept mapping using Web resources. The evaluation shows that our approach can significantly improve the performance on MeSH concept mapping and cross-language medical information retrieval.
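
    As a rough illustration of monolingual concept mapping by approximate string matching, the sketch below maps a misspelled or lay term to the closest entry in a small concept list. It uses Python's standard difflib module as a stand-in; the abstract does not say which matching algorithm the authors used, and the function name map_to_concept, the cutoff value, and the three-term concept list are assumptions.

```python
import difflib


def map_to_concept(lay_term, concept_terms, cutoff=0.6):
    """Return the concept term closest to lay_term, or None if nothing is similar enough."""
    # Compare case-insensitively but return the original spelling of the concept.
    lowered = {term.lower(): term for term in concept_terms}
    matches = difflib.get_close_matches(
        lay_term.lower(), list(lowered), n=1, cutoff=cutoff
    )
    return lowered[matches[0]] if matches else None


# Hypothetical concept list standing in for a controlled vocabulary.
concepts = ["Myocardial Infarction", "Hypertension", "Diabetes Mellitus"]
best = map_to_concept("hypertention", concepts)
missing = map_to_concept("zzz", concepts)
```

    Character-level similarity of this kind only bridges spelling variation; mapping a true lay synonym to a professional term would need the Web-resource step that the abstract describes.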

  9. The Oklahoma Geographic Information Retrieval System

    Science.gov (United States)

    Blanchard, W. A.

    1982-01-01

    The Oklahoma Geographic Information Retrieval System (OGIRS) is a highly interactive data entry, storage, manipulation, and display software system for use with geographically referenced data. Although originally developed for a project concerned with coal strip mine reclamation, OGIRS is capable of handling any geographically referenced data for a variety of natural resource management applications. A special effort has been made to integrate remotely sensed data into the information system. The timeliness and synoptic coverage of satellite data are particularly useful attributes for inclusion into the geographic information system.

  10. COMPUTATIONALLY EFFICIENT PRIVATE INFORMATION RETRIEVAL PROTOCOL

    Directory of Open Access Journals (Sweden)

    A. V. Afanasyeva

    2016-03-01

    Full Text Available This paper describes a new computationally efficient private information retrieval protocol for retrieving one q-ary symbol. The main advantage of the proposed solution lies in the low computational complexity of the information extraction procedure, as well as the constructive simplicity and flexibility in choosing the system parameters. These results are based on properties of cosets. The proposed protocol has communication complexity slightly worse than that of the best current schemes, which are based on locally decodable codes, but unlike those codes it can be easily built for any system parameters. In comparison with similar solutions based on polynomials, the proposed method gains in computational complexity, which is especially important for servers that must service multiple requests from multiple users.

  11. Users guide for information retrieval using APL

    Science.gov (United States)

    Shapiro, A.

    1974-01-01

    A Programming Language (APL) is a precise, concise, and powerful computer programming language. Several features make APL useful to managers and other potential computer users. APL is interactive; therefore, the user can communicate with his program or data base in near real-time. This, coupled with the fact that APL has excellent debugging features, reduces program checkout time to minutes or hours rather than days or months. Of particular importance is the fact that APL can be utilized as a management science tool using such techniques as operations research, statistical analysis, and forecasting. The gap between the scientist and the manager could be narrowed by showing how APL can be used to do what the scientists and the manager each need to do, retrieve information. Sometimes, the information needs to be retrieved rapidly. In this case APL is ideally suited for this challenge.

  12. An evaluation of telehealth websites for design, literacy, information and content.

    Science.gov (United States)

    Whitten, Pamela; Holtz, Bree; Cornacchione, Jennifer; Wirth, Christina

    2011-01-01

    We examined 62 telehealth websites using four assessment criteria: design, literacy, information and telehealth content. The websites came from the member list of the American Telemedicine Association and the Office for the Advancement of Telehealth and partner sites, and were included if they were currently active and at least three clicks deep. Approximately 130 variables were examined for each website by two independent researchers. The websites reviewed contained most of the design variables (mean 74%, SD 6), but fewer of those relating to literacy (mean 26%, SD 6), website information (mean 35%, SD 16) and telehealth content (mean 37%, SD 18). Only 29% of websites encouraged users to ask about telehealth, and 19% contained information on overcoming telehealth barriers. Nonetheless, 84% promoted awareness of telehealth. All evaluation assessments were significantly correlated with each other except for literacy and information. The present study identified various matters that should be addressed when developing telehealth websites. Although much of this represents simple common sense in website design, our evaluation demonstrates that there is still much room for improvement.

  13. Information at the Nexus: Young People's Perceptions of Government and Government Websites

    Science.gov (United States)

    Taylor, Natalie Greene

    2015-01-01

    This dissertation focuses on the perceptions that young people have of federal government websites and of the U.S. government, as well as exploring possible connections between the perceptions of government and government websites. Not only is this a virtually unstudied area of e-government and youth information behavior, but it is also of…

  14. Information Retrieval Using a Middleware Approach

    Directory of Open Access Journals (Sweden)

    Danijela Boberić Krstićev

    2013-03-01

    Full Text Available This paper explores the use of a mediator/wrapper approach to enable the search of an existing library management system using different information retrieval protocols. It proposes an architecture for a software component that will act as an intermediary between the library system and search services. It provides an overview of different approaches to add Z39.50 and Search/Retrieval via URL (SRU) functionality using a middleware approach that is implemented on the BISIS library management system. That wrapper performs transformation of Contextual Query Language (CQL) into Lucene query language. The primary aim of this software component is to enable search and retrieval of bibliographic records using the SRU and Z39.50 protocols, but the proposed architecture of the software components is also suitable for inclusion of the existing library management system into a library portal. The software component provides a single interface to server-side protocols for search and retrieval of records. Additional protocols could be used. This paper provides practical demonstration of interest to developers of library management systems and those who are trying to use open-source solutions to make their local catalog accessible to other systems.

  15. Stylistic Variation in an Information Retrieval Experiment

    CERN Document Server

    Karlgren, J

    1996-01-01

    Texts exhibit considerable stylistic variation. This paper reports an experiment where a corpus of documents (N= 75 000) is analyzed using various simple stylistic metrics. A subset (n = 1000) of the corpus has been previously assessed to be relevant for answering given information retrieval queries. The experiment shows that this subset differs significantly from the rest of the corpus in terms of the stylistic metrics studied.

  16. Cognitive approach to information retrieval and communication

    Directory of Open Access Journals (Sweden)

    Saša Zupanič

    1997-01-01

    Full Text Available The cognitive approach (viewpoint/standpoint) to the retrieval and communication of information, as well as to librarianship and information science, started gaining importance in the 1970s. Today, it is present in literary and objective knowledge studies, as well as in studies of users, information brokers and information retrieval systems. The cognitive approach exerts a strong impact on several scientific disciplines that are grouped under the roof of cognitive science. It has caused a split and the formation of a new paradigm, i.e. the cognitive paradigm, in many scientific disciplines. Within the frame of Kuhn's concept of paradigm, it is evident that librarianship and information science are at the pre-paradigmatic level. However, some authors mention the existence of at least two paradigms in library and information science, i.e. the physical and the cognitive paradigm. The historical overview of cognitively oriented research by Brookes, De Mey, Belkin, Ingwersen and others gives insight into the development of library and information science thought up to the present.

  17. NLP Meets the Jabberwocky: Natural Language Processing in Information Retrieval.

    Science.gov (United States)

    Feldman, Susan

    1999-01-01

    Focuses on natural language processing (NLP) in information retrieval. Defines the seven levels at which people extract meaning from text/spoken language. Discusses the stages of information processing; how an information retrieval system works; advantages to adding full NLP to information retrieval systems; and common problems with information…

  18. Data Visualization in Information Retrieval and Data Mining (SIG VIS).

    Science.gov (United States)

    Efthimiadis, Efthimis

    2000-01-01

    Presents abstracts that discuss using data visualization for information retrieval and data mining, including immersive information space and spatial metaphors; spatial data using multi-dimensional matrices with maps; TREC (Text Retrieval Conference) experiments; users' information needs in cartographic information retrieval; and users' relevance…

  19. Evaluating the Iran's Governmental Websites: Information, Interaction and Transaction

    Directory of Open Access Journals (Sweden)

    Mehdi Montazer Ghaem

    2015-12-01

    Full Text Available Presenting a model for evaluating governmental websites (Information, Interaction and Transaction), the article compares Iranian governmental websites in two time periods (February 2006, and February and March 2011). The research applied a longitudinal design to observe the development of Iran's electronic government. Websites were treated as the representation of electronic government for Iranian citizens in cyberspace. The three-stage evolutionary model used in the research was inspired by the UNESCO electronic government analysis. In the research model, electronic government is defined as "applying information and communication technologies (ICTs) in the government and created changes in the government structure, nature and performance". Sixteen criteria and 61 sub-criteria were used to evaluate 51 governmental websites in the first period and 41 websites in the second period. According to the results, Iranian governmental websites, especially in the first evaluation, were at the primary stage of e-government. Little evidence was observed of development toward stages beyond publishing information on the websites. Based on the quantitative data, no dramatic changes were observed in Iran's electronic government after more than five years. As before, few websites in the second evaluation applied the interactive capabilities of ICTs

  20. [SIBIL: an information tool for the information retrieval on bioethics].

    Science.gov (United States)

    Dracos, Adriana

    2004-01-01

    The article describes the main features of the website SIBIL (Sistema Informativo per la Bioetica In Linea) implemented within the framework of a research project of the ISS for collecting, indexing and disseminating Italian literature on bioethics since 1995 through an integrated electronic system. The site, addressed to a wide range of people interested at different degrees and levels in bioethics, offers a comprehensive overview of the activities, such as courses and meetings, on the major ethical issues at stake in Italy, as well as a survey of the most important activities both at national and international level. The main feature of SIBIL is a database of a large collection of documents retrieved through sources or exploitation of the most important international electronic databases. A thesaurus of 1,600 terms, available in Italian and English, was created in order to organize documents with standardized criteria currently adopted in the Italian scientific environment. Future trends of the website are also discussed for sharing experiences with other countries and laying the basis for a European portal on bioethics.

  1. An evaluation of the content and quality of tinnitus information on websites preferred by General Practitioners

    Directory of Open Access Journals (Sweden)

    Fackrell Kathryn

    2012-07-01

    Full Text Available Background: Tinnitus is a prevalent and complex medical complaint often co-morbid with stress, anxiety, insomnia, depression, and cognitive or communication difficulties. Its chronicity places a major burden on primary and secondary healthcare services. In our recent national survey of General Practitioners (GPs) from across England, many reported that their awareness of tinnitus was limited and as a result were dissatisfied with the service they currently provide. GPs identified 10 online sources of information they currently use in clinical practice, but welcomed further concise and accurate information on tinnitus assessment and management. The purpose of this study was to assess the content, reliability, and quality of the information related to primary care tinnitus assessment and management on these 10 websites. Methods: Tinnitus related content on each website was assessed using a summative content analysis approach. Reliability and quality of the information was assessed using the DISCERN questionnaire. Results: Quality of information was rated using the validated DISCERN questionnaire. Significant inter-rater reliability was confirmed by Kendall’s coefficient of concordance (Wt), which ranged from 0.48 to 0.92 across websites. The website Map of Medicine achieved the highest overall DISCERN score. However, for information on treatment choice, the British Tinnitus Association was rated best. Content analysis revealed that all websites lacked a number of details relating to either tinnitus assessment or management options. Conclusions: No single website provides comprehensive information for GPs on tinnitus assessment and management and so GPs may need to refer to more than one if they want to maximise their coverage of the topic. From those preferred by GPs we recommend several specific websites as the current ‘best’ sources. Our findings should guide healthcare website providers to improve the quality and inclusiveness of the

  2. Tag Clusters as Information Retrieval Interfaces

    CERN Document Server

    Knautz, Kathrin; Stock, Wolfgang G

    2010-01-01

    The paper presents our design of a next generation information retrieval system based on tag co-occurrences and subsequent clustering. We help users getting access to digital data through information visualization in the form of tag clusters. Current problems like the absence of interactivity and semantics between tags or the difficulty of adding additional search arguments are solved. In the evaluation, based upon SERVQUAL and IT systems quality indicators, we found out that tag clusters are perceived as more useful than tag clouds, are much more trustworthy, and are more enjoyable to use.

  3. Image Information Retrieval: An Overview of Current Research

    Directory of Open Access Journals (Sweden)

    Abby A. Goodrum

    2000-01-01

    Full Text Available This paper provides an overview of current research in image information retrieval and provides an outline of areas for future research. The approach is broad and interdisciplinary and focuses on three aspects of image research (IR): text-based retrieval, content-based retrieval, and user interactions with image information retrieval systems. The review concludes with a call for image retrieval evaluation studies similar to TREC.

  4. Image Information Retrieval: An Overview of Current Research

    OpenAIRE

    Abby A. Goodrum

    2000-01-01

    This paper provides an overview of current research in image information retrieval and provides an outline of areas for future research. The approach is broad and interdisciplinary and focuses on three aspects of image research (IR): text-based retrieval, content-based retrieval, and user interactions with image information retrieval systems. The review concludes with a call for image retrieval evaluation studies similar to TREC.

  5. Ten lessons for developing a health information website.

    Science.gov (United States)

    Ottmann, Goetz; Street, Annette F

    2007-11-01

    This paper outlines ten lessons derived from the development of a palliative care website, www.pallcarevic.asn.au. The following program elements contributed to the success of the project: (1) peer and stakeholder participation; (2) response to a significant need; (3) networking skills; (4) administrative skills; (5) mediation of conflicts; (6) project management skills; (7) sourcing of good evidence; (8) iterative evaluation involving users and stakeholders; (9) iterative expert evaluation; and (10) a well thought through sustainability strategy.

  6. A pilot study of website information regarding aromatase inhibitors: dietary supplement interactions.

    Science.gov (United States)

    McDermott, Cara L; Hsieh, Angela A; Sweet, Erin S; Tippens, Kimberly M; McCune, Jeannine S

    2011-11-01

    Patients who have hormone receptor-positive breast cancer and who are taking aromatase inhibitors (AIs) should understand the benefits and risks of concomitant dietary supplement (DS) use. The International Society for Integrative Oncology (SIO) encourages patients to discuss DS use with their health care practitioners. The objective was to conduct a pilot study rating Internet websites from the perspective of health care practitioners for information about AI-DS interactions. Five (5) Internet websites suggested by SIO were evaluated using the DISCERN instrument rating tool. The available AI-DS information on these websites was rated by 4 evaluators: 2 naturopathic doctors, 1 oncology pharmacy resident, and a pharmacy student. The overall rankings ranged from 1.6 to 3.9, with considerable variability in the type of information available from the websites. The interevaluator rankings of the websites ranged from 0.44 to 0.89. The evaluators consistently found the most reliable, unbiased, and comprehensive information on AI-DS interactions at the Natural Medicines Comprehensive Database and Memorial Sloan-Kettering Cancer Center websites. However, more than one database was needed for provision of optimal patient information on AI-DS interactions. In order to effectively advise patients regarding AI-DS interactions, more than one website should be evaluated to assess the potential efficacy and safety of DS in women whose breast cancer is being treated with an AI. © Mary Ann Liebert, Inc.

  7. Placement and Format of Risk Information on Direct-to-Consumer Prescription Drug Websites.

    Science.gov (United States)

    Sullivan, Helen W; O'Donoghue, Amie C; Rupert, Douglas J; Willoughby, Jessica Fitts; Aikin, Kathryn J

    2017-02-01

    We investigated whether the location and format of risk information on branded prescription drug websites influence consumers' knowledge and perceptions of the drug's risks. Participants (Internet panelists with high cholesterol [n = 2,609] or seasonal allergies [n = 2,637]) were randomly assigned to view a website promoting a fictitious prescription drug for their condition. The website presented risk information at the bottom of the homepage, or at the bottom of the homepage with a signal above indicating that the risk information was located below, or on a linked secondary page. We also varied the format of risk information (paragraph, checklist, bulleted list, highlighted box). Participants then answered questions on risk recall and perceptions. Participants recalled fewer drug risks when the risks were placed on a secondary page. The signal had little effect, and risk information format did not affect outcomes. The location of risk information on prescription drug websites can affect consumer knowledge of drug risks; however, signals and special formatting may not be necessary for websites to adequately inform consumers about drug risks. We recommend that prescription drug websites maintain risk information on their homepages to achieve "fair balance" as required by the U.S. Food and Drug Administration.

  8. Graph-Based Interactive Bibliographic Information Retrieval Systems

    Science.gov (United States)

    Zhu, Yongjun

    2017-01-01

    In the big data era, we have witnessed the explosion of scholarly literature. This explosion has imposed challenges to the retrieval of bibliographic information. Retrieval of intended bibliographic information has become challenging due to the overwhelming search results returned by bibliographic information retrieval systems for given input…

  9. Organizational Schemes of Information Resources in Top 50 Academic Business Library Websites

    Science.gov (United States)

    Kim, Soojung; DeCoster, Elizabeth

    2011-01-01

    This paper analyzes the organizational schemes of information resources found in top 50 academic business library websites through content analysis and discusses the development and evaluation of the identified schemes.

  10. Availability of and ease of access to calorie information on restaurant websites.

    Directory of Open Access Journals (Sweden)

    Gary G Bennett

    Full Text Available OBJECTIVE: Offering calories on restaurant websites might be particularly important for consumer meal planning, but the availability of and ease of accessing this information are unknown. METHODS: We assessed websites for the top 100 U.S. chain restaurants to determine the availability of and ease of access to calorie information as well as website design characteristics. We also examined potential predictors of calorie availability and ease of access. RESULTS: Eighty-two percent of restaurants provided calorie information on their websites; 25% presented calories on a mobile-formatted website. On average, calories could be accessed in 2.35±0.99 clicks. About half of sites (51.2%) linked to calorie information via the homepage. Fewer than half had a separate section identifying healthful options (46.3%) or utilized interactive meal planning tools (35.4%). Quick service/fast casual, larger restaurants, and those with less expensive entrées and lower revenue were more likely to make calorie information available. There were no predictors of ease of access. CONCLUSION: Calorie information is both available and largely accessible on the websites of America's leading restaurants. It is unclear whether consumer behavior is affected by the variability in the presentation of calorie information.

  11. Four Challenges for Music Information Retrieval Researchers

    DEFF Research Database (Denmark)

    Sturm, Bob L.; Collins, Nick

    Exemplified in the substantial amount of published research in music genre recognition, mood recognition and autotagging, content-based music information retrieval (MIR) advances an "engineering approach": build a system producing the most "correct" answers in datasets appearing throughout...... might not even be considering the through it answers "correctly". It could thus be worthless for addressing real-world problems that must consider (e.g., music description). To emphasise the critical points above, and encourage new approaches to research that address real-world problems, we present...

  12. Random walk term weighting for information retrieval

    DEFF Research Database (Denmark)

    Blanco, R.; Lioma, Christina

    2007-01-01

    We present a way of estimating term weights for Information Retrieval (IR), using term co-occurrence as a measure of dependency between terms. We use the random walk graph-based ranking algorithm on a graph that encodes terms and co-occurrence dependencies in text, from which we derive term weights...... that represent a quantification of how a term contributes to its context. Evaluation on two TREC collections and 350 topics shows that the random walk-based term weights perform at least comparably to the traditional tf-idf term weighting, while they outperform it when the distance between co-occurring terms...
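
    The idea of deriving term weights from a random walk over a co-occurrence graph can be illustrated with a minimal sketch. It assumes a PageRank-style walk on an undirected graph; the window size, damping factor, iteration count and function names are illustrative choices, not details taken from the record above.

```python
from collections import defaultdict


def cooccurrence_graph(tokens, window=2):
    """Undirected graph linking terms that occur within `window` positions."""
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return graph


def random_walk_weights(graph, damping=0.85, iterations=50):
    """PageRank-style scores; damping and iteration count are assumptions."""
    n = len(graph)
    scores = {term: 1.0 / n for term in graph}
    for _ in range(iterations):
        updated = {}
        for term in graph:
            # Each neighbour passes on an equal share of its current score.
            incoming = sum(scores[nb] / len(graph[nb]) for nb in graph[term])
            updated[term] = (1 - damping) / n + damping * incoming
        scores = updated
    return scores


tokens = "information retrieval uses term weights for retrieval of information".split()
weights = random_walk_weights(cooccurrence_graph(tokens))
for term, score in sorted(weights.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{term:12s} {score:.3f}")
```

    In this toy text the term with the most distinct co-occurrence partners receives the largest weight, which is the intuition behind using the walk as a measure of how much a term contributes to its context.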

  13. Enhancing genomics information retrieval through dimensional analysis.

    Science.gov (United States)

    Hu, Qinmin; Huang, Jimmy Xiangji

    2013-06-01

    We propose a novel dimensional analysis approach to employing meta information in order to find the relationships within the unstructured or semi-structured document/passages for improving genomics information retrieval performance. First, we make use of the auxiliary information as three basic dimensions, namely "temporal", "journal", and "author". The reference section is treated as a commensurable quantity of the three basic dimensions. Then, the sample space and subspaces are built up and a set of events are defined to meet the basic requirement of dimensional homogeneity to be commensurable quantities. After that, the classic graph analysis algorithm in the Web environments is applied on each dimension respectively to calculate the importance of each dimension. Finally, we integrate all the dimension networks and re-rank the outputs for evaluation. Our experimental results show the proposed approach is superior and promising.

  14. A Theoretical Paradigm of Information Retrieval in Information Science and Computer Science

    Directory of Open Access Journals (Sweden)

    M. S. Saleem Basha

    2012-09-01

    Full Text Available This paper describes the theoretical paradigms of information retrieval in information science and computer science, and constructs a theoretical framework of information retrieval from three perspectives: user, information and technology. It evaluates the research priorities of the two disciplines and the cross-domain aspects of information retrieval theory. Finally, it points out the status and development trends of information retrieval theory in information science and computer science, and suggests directions for further exploration in information retrieval theory.

  15. Polluted online information? Surfing Italian websites dealing with the topic of waste and health

    Science.gov (United States)

    Orizio, G.; Locatelli, M. K.; Caimi, L.; Gelatti, U.

    2011-10-01

    In the field of health communication, a particularly critical issue is communication to the public of environmental risks, especially on topics for which there is still a high degree of scientific uncertainty regarding risk estimates. One such topic is undoubtedly the impact of waste on people's health. The aim of this study was to evaluate the presence and characteristics of Italian websites dealing with the topic of waste and health. The keywords 'waste' and 'health' were entered in 2010 in the three most commonly used search engines, and the first five pages were analysed. The selected websites were coded according to the content analysis method. For websites of interest we evaluated the 'page rank'. Out of the 150 occurrences analysed, the number of websites found to deal with this subject was only 19, four of which were of an institutional nature. The majority of websites gave a message of increased health risk associated with the three kinds of waste disposal tackled. As regards visibility, only one of the four institutional websites maintained its position on the first page of the three search engines. We found that institutional health websites have low visibility, despite extensive media coverage of waste and health issues in Italy as a result of the Naples case, which was debated globally. This indicates that public health institutions' web strategies are basically unable to meet people's health information requirements, which could strengthen rival health information providers.

  16. Recommender Systems by means of Information Retrieval

    CERN Document Server

    Costa, Alberto

    2010-01-01

    In this paper we present a method for reformulating the Recommender Systems problem as an Information Retrieval one. In our tests we have a dataset of users who give ratings for some movies; we hide some values from the dataset, and we try to predict them again using its remaining portion (the so-called "leave-n-out approach"). In order to use an Information Retrieval algorithm, we reformulate this Recommender Systems problem in this way: a user corresponds to a document, a movie corresponds to a term, the active user (whose rating we want to predict) plays the role of the query, and the ratings are used as weights, in place of the weighting schema of the original IR algorithm. The output is the ranking list of the documents ("users") relevant for the query ("active user"). We use the ratings of these users, weighted according to the rank, to predict the rating of the active user. We carry out the comparison by means of a typical metric, namely the accuracy of the predictions returned by the algorithm, and we...
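
    The mapping described above (user as document, movie as term, rating as weight, active user as query) can be sketched as follows. The cosine ranking function and the 1/rank weighting are assumptions made for illustration; the record does not say which IR algorithm or rank weighting the author used, and the data are invented.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse rating vectors (dicts)."""
    dot = sum(a[m] * b[m] for m in set(a) & set(b))
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def rank_users(ratings, active):
    """Treat the active user as the query and rank the other users (documents)."""
    query = ratings[active]
    others = [u for u in ratings if u != active]
    return sorted(others, key=lambda u: cosine(query, ratings[u]), reverse=True)


def predict(ratings, active, movie, top_k=2):
    """Rank-weighted average of the ratings given by the top-ranked users."""
    ranked = [u for u in rank_users(ratings, active) if movie in ratings[u]][:top_k]
    if not ranked:
        return None
    rank_weights = [1.0 / (pos + 1) for pos in range(len(ranked))]  # assumed scheme
    total = sum(w * ratings[u][movie] for w, u in zip(rank_weights, ranked))
    return total / sum(rank_weights)


# Hypothetical toy data: the rating of "ann" for "casablanca" is hidden.
ratings = {
    "ann": {"alien": 5, "brazil": 4},
    "bob": {"alien": 5, "brazil": 4, "casablanca": 2},
    "cat": {"alien": 1, "brazil": 2, "casablanca": 5},
}
prediction = predict(ratings, "ann", "casablanca")
print(rank_users(ratings, "ann"), prediction)
```

    Here the user most similar to the active user is ranked first and therefore contributes most to the predicted rating.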

  17. MATCHING LSI FOR SCALABLE INFORMATION RETRIEVAL

    Directory of Open Access Journals (Sweden)

    Rajagopal Palsonkennedy

    2012-01-01

    Full Text Available Latent Semantic Indexing (LSI) is one of the popular techniques in the information retrieval field. Unlike traditional information retrieval techniques, LSI is not based simply on keyword matching; it uses statistics and algebraic computations. Based on Singular Value Decomposition (SVD), the higher-dimensional matrix is converted to a lower-dimensional approximate matrix, from which noise can be filtered. The issues of synonymy and polysemy in traditional techniques can also be overcome based on investigation of the terms related to the documents. However, LSI notably suffers from a scalability issue due to the computational complexity of SVD. This study presents a distributed LSI algorithm, MR-LSI, which can solve the scalability issue using the Hadoop framework based on the distributed computing model MapReduce. It also solves the overhead issue caused by the involved clustering algorithm by using the k-means algorithm. The evaluations indicate that MR-LSI achieves a noteworthy improvement over the other scheme when processing large-scale document collections. One significant advantage of Hadoop is that it supports various computing environments, so the issue of unbalanced load among nodes is highlighted. Hence, a load balancing algorithm based on a genetic algorithm for balancing load in a static environment is proposed. The results show that it can improve the performance of a cluster according to different levels.

  18. A Concise and Practical Framework for the Development and Usability Evaluation of Patient Information Websites

    Science.gov (United States)

    Knijnenburg, S.L.; Kremer, L.C.; Jaspers, M.W.M.

    2015-01-01

    Background: The Website Developmental Model for the Healthcare Consumer (WDMHC) is an extensive and successfully evaluated framework that incorporates user-centered design principles. However, due to its extensiveness its application is limited. In the current study we apply a subset of the WDMHC framework in a case study concerning the development and evaluation of a website aimed at childhood cancer survivors (CCS). Objective: To assess whether the implementation of a limited subset of the WDMHC-framework is sufficient to deliver a high-quality website with few usability problems, aimed at a specific patient population. Methods: The website was developed using a six-step approach divided into three phases derived from the WDMHC: 1) information needs analysis, mock-up creation and focus group discussion; 2) website prototype development; and 3) heuristic evaluation (HE) and think aloud analysis (TA). The HE was performed by three double experts (knowledgeable both in usability engineering and childhood cancer survivorship), who assessed the site using the Nielsen heuristics. Eight end-users were invited to complete three scenarios covering all functionality of the website by TA. Results: The HE and TA were performed concurrently on the website prototype. The HE resulted in 29 unique usability issues; the end-users performing the TA encountered eleven unique problems. Four issues specifically revealed by HE concerned cosmetic design flaws, whereas two problems revealed by TA were related to website content. Conclusion: Based on the subset of the WDMHC framework we were able to deliver a website that closely matched the expectancy of the end-users and resulted in relatively few usability problems during end-user testing. With the successful application of this subset of the WDMHC, we provide developers with a clear and easily applicable framework for the development of healthcare websites with high usability aimed at specific medical populations. PMID:26171083

  19. Parallel Computing in Information Retrieval--An Updated Review.

    Science.gov (United States)

    Macfarlane, A.; And Others

    1997-01-01

    Reviews the progress of parallel computing in information retrieval (IR) and stresses the importance of the motivation in using parallel computing for text retrieval. Analyzes parallel IR systems using a classification defined by Rasmussen; describes retrieval models used in parallel information processing; and suggests areas of needed research.…

  20. Data Discretization for Novel Relationship Discovery in Information Retrieval.

    Science.gov (United States)

    Benoit, G.

    2002-01-01

    Describes an information retrieval, visualization, and manipulation model which offers the user multiple ways to exploit the retrieval set, based on weighted query terms, via an interactive interface. Outlines the mathematical model and describes an information retrieval application built on the model to search structured and full-text files.…

  1. 46 CFR 520.6 - Retrieval of information.

    Science.gov (United States)

    2010-10-01

    ... Title 46, Shipping, vol. 9 (2010-10-01). FEDERAL MARITIME COMMISSION, REGULATIONS AFFECTING OCEAN SHIPPING IN FOREIGN COMMERCE, CARRIER AUTOMATED TARIFFS. § 520.6 Retrieval of information. (a) General. Tariffs systems shall present retrievers with...

  2. Flexible method for Boolean information retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Salton, G.; Wu, H.

    1983-01-01

    A new flexible retrieval system is described which makes it possible to relax the strict conditions of Boolean query logic thereby retrieving useful items that are rejected in a conventional retrieval situation. The query structure inherent in the Boolean system is preserved, while at the same time weighted terms may be incorporated into both queries and stored documents; the retrieved output can also be ranked in strict similarity order with the user queries. A conventional retrieval system can be modified to make use of the flexible metric system. Laboratory tests indicate that the extended system produces better retrieval output than either the Boolean or the vector processing systems. 11 references.
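
    The record does not state the similarity formula. As a rough illustration, the sketch below uses the p-norm form commonly associated with the extended Boolean model; treating it as the "flexible metric system" mentioned above is an assumption. Query-term weights are omitted for brevity, and the document weights are invented.

```python
def p_norm_or(doc_weights, p=2.0):
    """Similarity of a document to an OR query; term weights lie in [0, 1]."""
    return (sum(w ** p for w in doc_weights) / len(doc_weights)) ** (1.0 / p)


def p_norm_and(doc_weights, p=2.0):
    """Similarity of a document to an AND query; term weights lie in [0, 1]."""
    distance = (sum((1.0 - w) ** p for w in doc_weights) / len(doc_weights)) ** (1.0 / p)
    return 1.0 - distance


# Hypothetical term weights for three stored documents.
docs = {
    "d1": {"boolean": 0.9, "retrieval": 0.0},
    "d2": {"boolean": 0.5, "retrieval": 0.5},
    "d3": {"boolean": 0.0, "retrieval": 0.0},
}
query_terms = ["boolean", "retrieval"]  # query: boolean AND retrieval

ranking = sorted(
    docs,
    key=lambda d: p_norm_and([docs[d][t] for t in query_terms]),
    reverse=True,
)
print(ranking)
```

    A strict Boolean AND would reject d1 because one query term is absent, whereas the relaxed score keeps it in the ranked output below d2. With p = 1 both operators reduce to a plain average of the weights, and larger p values move the behaviour closer to strict Boolean logic.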

  3. Information vs Engagement in parliamentary websites – a case study of Brazil and the UK

    Directory of Open Access Journals (Sweden)

    Cristiane Brum Bernardes

    Full Text Available Parliamentary websites have become the main window of parliament to the outside world. More than a gimmick, they are an essential element in the promotion of a relationship between parliament and citizens. This paper develops a comparative analysis of the websites of the lower chambers of the Brazilian and the British parliaments, respectively the Chamber of Deputies and the House of Commons. We structure this analysis around three dimensions: 1) information about the institution; 2) information about parliamentary activity; and 3) tools to promote engagement with the public. The choice of two very different case studies enables us to consider more clearly the specific purposes of these parliamentary websites. We consider in particular if these parliaments' institutional differences affect their websites. The websites' analysis is complemented by semi-structured elite interviews with parliamentary staff who manage the services provided by these websites. Our analysis shows that both websites achieve much higher levels of complexity in the information area than in engagement. But it also shows that the Brazilian parliament website includes far more tools designed for public interaction than its UK counterpart. The indexes and interviews show that both institutions are highly committed to disseminating data and information to citizens. This is seen as a path towards achieving higher accountability and improving knowledge about parliamentary processes and, consequently, improving public image and levels of trust. Whilst there is a strong focus on the provision of information, there is still little evidence of enabling citizen participation in the legislative process. This is partly due to a tension between conceptions of representative democracy and those of participatory democracy. The articulation between these different types of democracy is still a long way from being resolved, although parliaments are slowly introducing participatory tools.

  4. Evaluating Digital Strategies for Storing and Retrieving Scholarly Information.

    Science.gov (United States)

    Getz, Malcolm

    1997-01-01

    Outlines the advantages of digital documents for scholars and offers considerations for designing systems for storing and retrieving digital information products. Discusses conventional and electronic storage and retrieval; network costs; digital storage; search strategies; acquisition prices; and digital initiatives. (AEF)

  5. The Retrieval Skill of Education Information Resource Based on Web

    Institute of Scientific and Technical Information of China (English)

    李霞

    2011-01-01

    Education information resources on the Internet are gradually increasing. They are mainly distributed across network database websites, educational institution websites, subject-based information gateways, comprehensive websites and personal websites. The methods and skills for retrieving education information on the Internet are described from five aspects: retrieving from educational institution websites, from the category directories of comprehensive websites, through search engines, from online database systems, and from subject web portals.

  6. Preliminary Discussion on the Science and Technology Website Information Construction

    Institute of Scientific and Technical Information of China (English)

    陈晓盼

    2001-01-01

    With an analysis of the characteristics of science and technology websites, the paper puts forward 4 principles for the science and technology website information construction. Ways of carrying out the science and technology website information construction are also discussed.

  7. Is the information about dengue available on Brazilian websites of quality and reliable?

    Directory of Open Access Journals (Sweden)

    Thiago Henrique de Lima

    2016-12-01

    Full Text Available The objective of the present study was to identify and evaluate the content of information about dengue available on Brazilian websites. Thirty-two websites were selected for the analysis. For the evaluation of the content of information about dengue, a form was prepared with 16 topics grouped in six information blocks: etiology/transmission, vector, control and prevention, disease/diagnosis, treatment and epidemiology. The websites were also evaluated according to the following criteria: authorship, update, language, interactivity, scientific basis and graphic elements. The results showed a predominant lack of information in relation to the topics analyzed in each information block. Regarding the technical quality of the websites, only 28.1% showed some indication of scientific basis and 34.3% contained the date of publication or of the last update. Such results attested to the low reliability of the selected websites. Knowing that the internet is an efficient mechanism for disseminating information on health topics, we concluded that the creation of such mechanisms to disseminate correct and comprehensive information about dengue is necessary in order to apply this useful tool in the prevention and control of the disease in Brazil.

  8. [Information quality and health risks in Spanish-language retail websites for Chinese herbal medicine].

    Science.gov (United States)

    Tejedor-García, Noelia; García-Pastor, Coral; Benito-Martínez, Selma; de Lucio-Cazaña, Francisco Javier

    2017-03-16

    The growing use of online purchasing via Internet retailers favours access to potentially toxic natural products. It also contributes to the quick dissemination of the retailers' claims on efficacy and safety, which are not always based upon reliable information. Here, we have conducted an online search to find Spanish-language retail websites for Chinese herbal medicine and we have analysed them for the quality of product information and the potential health risks. i) Online search in Google España to find Spanish-language retail websites for Chinese herbal medicine, in which we analysed both the claims regarding possible health benefits and the adequacy of safe use indications; ii) identification of potentially toxic herbs in the websites; iii) quantification of Chinese herbal medicines withdrawn by the Agencia Española de Medicamentos y Productos Sanitarios (AEMPS). 1) Only one third of the 30 Spanish-language retail websites found that sell Chinese herbal medicine observe the law, given that the other websites include illegal Western disease claims as marketing tools; 2) five websites provide some safety information; 3) two websites offer potentially toxic herbs; and 4) Chinese herbal medicines adulterated with sibutramine, sildenafil or their analogues make up a considerable percentage of the total products withdrawn by the AEMPS. Online health seekers should be warned about misinformation on retail websites for Chinese herbal medicine and directed to a Spanish government Web site for guidance in safely navigating the Internet for buying Chinese herbal medicine. Copyright © 2017 SESPAS. Published by Elsevier España, S.L.U. All rights reserved.

  9. Making the procedure manual come alive: A prototype relational database and dynamic website model for the management of nursing information.

    Science.gov (United States)

    Peace, Jane; Brennan, Patricia Flatley

    2006-01-01

    The nursing procedural manual is an essential resource for clinical practice, yet ensuring its currency and availability at the point of care remains an unresolved information management challenge for nurses. While standard HTML-based web pages offer significant advantage over paper compilations, employing emerging computer science tools offers even greater promise. This paper reports on the creation of a prototypical dynamic web-based nursing procedure manual driven by a relational database. We created a relational database in MySQL to manage, store, and link the procedure information, and developed PHP files to guide content retrieval, content management, and display on demand in browser-viewable format. This database-driven dynamic website model is an important innovation to meet the challenge of content management and dissemination of nursing information.

  10. Formal Concept Analysis for Information Retrieval

    CERN Document Server

    Qadi, Abderrahim El; Ennouary, Yassine

    2010-01-01

    In this paper we describe a mechanism to improve Information Retrieval (IR) on the web. The method is based on Formal Concept Analysis (FCA), which establishes semantic relations during querying and allows the answers provided by a search engine to be reorganized in the form of a concept lattice. We propose an incremental IR algorithm based on the Galois lattice. This algorithm allows a formal clustering of the data sources, and the results it returns are ranked by relevance. Relevance control is exploited in the clustering; we improved the results by using an ontology in the field of image processing and by reformulating the user queries, which makes it possible to return more relevant documents.
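
    To make the notion of a concept lattice concrete, the following sketch enumerates the formal concepts of a tiny document-term context by brute force. It is not the incremental Galois-lattice algorithm the record refers to; the context and all names are invented for illustration, and the approach only scales to very small inputs.

```python
from itertools import combinations


def common_attributes(context, objects):
    """Attributes shared by all given objects (all attributes if none given)."""
    result = set().union(*context.values())
    for obj in objects:
        result &= context[obj]
    return result


def objects_having(context, attributes):
    """Objects whose attribute set contains every given attribute."""
    return {obj for obj, attrs in context.items() if attributes <= attrs}


def formal_concepts(context):
    """All (extent, intent) pairs that are closed under the two maps above."""
    concepts = set()
    objs = sorted(context)
    for size in range(len(objs) + 1):
        for subset in combinations(objs, size):
            intent = common_attributes(context, subset)
            extent = objects_having(context, intent)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


# Hypothetical context: search results (objects) and their index terms (attributes).
context = {
    "doc1": {"image", "segmentation"},
    "doc2": {"image", "retrieval"},
    "doc3": {"image", "retrieval", "lattice"},
}
concepts = formal_concepts(context)
for extent, intent in sorted(concepts, key=lambda c: len(c[0]), reverse=True):
    print(sorted(extent), sorted(intent))
```

    Each printed pair is one node of the lattice: a group of documents together with exactly the terms they all share, which is what allows search results to be clustered and browsed by shared meaning.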

  11. An Effective Information Retrieval for Ambiguous Query

    CERN Document Server

    Roul, R K

    2012-01-01

    A search engine returns thousands of web pages for a single user query, most of which are not relevant. In this context, effective information retrieval from the expanding web is a challenging task, in particular if the query is ambiguous. The major question that arises here is how to get the relevant pages for an ambiguous query. We propose an approach for the effective result of an ambiguous query by forming community vectors based on the association concept of data mining, using the vector space model and the freedictionary. We develop clusters by computing the similarity between community vectors and document vectors formed from the web pages extracted by the search engine. We use the Gensim package to implement the algorithm because of its simplicity and robust nature. Analysis shows that our approach is an effective way to form clusters for an ambiguous query.
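
    The clustering step (comparing community vectors with document vectors) can be sketched with plain term-frequency vectors and cosine similarity. This is only an illustration: the community vectors below are written by hand rather than derived by association mining, the texts are invented, and the standard library is used instead of Gensim.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words term-frequency vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def cluster(documents, communities):
    """Assign each document to the community vector it is most similar to."""
    clusters = {name: [] for name in communities}
    for doc_id, text in documents.items():
        vec = vectorize(text)
        best = max(communities, key=lambda name: cosine(vec, communities[name]))
        clusters[best].append(doc_id)
    return clusters


# Hypothetical senses of the ambiguous query "jaguar".
communities = {
    "animal": vectorize("jaguar cat jungle prey habitat"),
    "car": vectorize("jaguar car engine dealer model"),
}
documents = {
    "p1": "the jaguar is a big cat living in the jungle",
    "p2": "new jaguar car model with a powerful engine",
    "p3": "jaguar dealer offers engine service",
}
clusters = cluster(documents, communities)
print(clusters)
```

    Pages that share the query word but differ in the surrounding vocabulary end up in different clusters, one per sense of the query.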

  12. Gaining access to information at a municipality website: a question of age?

    NARCIS (Netherlands)

    Loos, E.

    2011-01-01

    The number of senior citizens is increasing quickly. The use of new media is also on the rise in our information society. Websites are an important tool for (local) governments to provide information to their citizens. If we want information supply through ICT to remain available to senior citizens

  13. Research on the Application of Content-based Image Retrieval Technology in Shopping Website

    Institute of Scientific and Technical Information of China (English)

    张薷; 李玉海

    2012-01-01

    This paper proposes applying content-based image retrieval technology to shopping websites, based on an analysis of the current situation and existing problems of text-based information retrieval on e-commerce shopping websites and on the characteristics of virtual shopping platforms. It further analyzes the characteristics and methods of content-based image retrieval and the retrieval matching process used in a shopping website.

  14. E-loyalty towards a cancer information website: applying a theoretical framework.

    Science.gov (United States)

    Crutzen, Rik; Beekers, Nienke; van Eenbergen, Mies; Becker, Monique; Jongen, Lilian; van Osch, Liesbeth

    2014-06-01

    To provide more insight into user perceptions related to e-loyalty towards a cancer information website. This is needed to assure adequate provision of high quality information during the full process of cancer treatment, from diagnosis to after care, and is an important first step towards optimizing cancer information websites in order to promote e-loyalty. Participants were cancer patients (n = 63) and informal caregivers (n = 202) that visited a website providing regional information about cancer care for all types of cancer. Subsequently, they filled out a questionnaire assessing e-loyalty towards the website and user perceptions (efficiency, effectiveness, active trust and enjoyment) based on a theoretical framework derived from the field of e-commerce. A structural equation model was constructed to test the relationships between user perceptions and e-loyalty. Participants in general could find the information they were looking for (efficiency), thought it was relevant (effectiveness) and that they could act upon it (active trust) and thought the visit itself was pleasant (enjoyment). Effectiveness and enjoyment were both positively related with e-loyalty, but this was mediated by active trust. Efficiency was positively related with e-loyalty. The explained variance of e-loyalty was high (R² = 0.70). This study demonstrates that the importance of user perceptions is not limited to fields such as e-commerce but is also present within the context of cancer information websites. The high information need among participants might explain the positive relationship between efficiency and e-loyalty. Therefore, cancer information websites need to foster easy search and access of information provided. Copyright © 2014 John Wiley & Sons, Ltd.

  15. Distributed and Cooperative Information Retrieval on the World Wide Web

    Institute of Scientific and Technical Information of China (English)

    王继成; 金翔宇; 杨晓江; 张福炎

    2000-01-01

    A mass of heterogeneous, distributed and dynamic information on the World Wide Web (the Web) has resulted in "information overload". It's an important and urgent research issue to provide users with effective information retrieval service on the Web. Web search engines attempt to solve this problem, yet their effect is far from satisfying. In this paper, a distributed and cooperative strategy for information retrieval on the Web is proposed to substitute the centralized mode adopted by the current search engines. Then a new information retrieval system model IRSM is presented, which supports the retrieval of metadata about Web documents and uses Z39.50 standard protocol to unify the heterogeneous interfaces of different systems. Based on that, a distributed and cooperative information retrieval framework, called DCIRF, is designed to help users in fast and effective information retrieval on the Web.

  16. Visualization for Information Retrieval based on Fast Search Technology

    Directory of Open Access Journals (Sweden)

    Mamoon H. Mamoon

    2013-03-01

    Full Text Available The core of a search engine is its information retrieval technique. An information retrieval system returns many results, some of them more relevant than others and some not relevant at all. While the use of search engines to retrieve information has grown very substantially, there remain problems with information retrieval systems. The interfaces of these systems do not help users to perceive the precision of the results. It is therefore not surprising that graphical visualizations have been employed in search engines to assist users. The main objective of Internet users is to find the required information with high efficiency and effectiveness. In this paper we briefly present aspects of the role of information visualization in enhancing web information retrieval systems, covering techniques such as tree view, title view, map view, bubble view and cloud view, and tools such as highlighting and Colored Query Result.

  17. Retrieving Information from the Invisible Web Using Mobile Agents

    Directory of Open Access Journals (Sweden)

    Fabien-Kenzo Sato

    2005-01-01

    Full Text Available This study proposes a model of information retrieval on the invisible Web by using the mobile agent paradigm. The developed architecture uses the power of a search engine to provide a list of sites of the invisible Web which are likely to be relevant and launches a dynamic search on these sites, thanks to mobile agents. To compare and experiment in real conditions, two versions were implemented: a version using the traditional client/server paradigm and a version using mobile agents. Client/server tests on actual Websites generated satisfactory qualitative results. A series of comparative experiments of the two versions implemented were carried out using a test site. Results show that the mobile agent version generates much less traffic and is thus faster than the client/server version, especially with low bandwidth. Moreover, as the mobile agents carry out calculations on the server rather than on the client’s site, this approach relieves the resources of the client terminal. Thus, the mobile agent approach seems particularly advantageous in the case of weak resource terminals such as PDAs.

  18. Quality Assessment of Information About Pit and Fissure Sealants in Persian Websites in 2012

    Directory of Open Access Journals (Sweden)

    Firoozeh Nilchian Nilchian

    2016-08-01

    Full Text Available Objectives: Despite the increasing use of the Internet, there is no supervision over the accuracy and quality of the information provided on the web. To deal with this problem, health specialists should take part in planning, publishing and supervision of online health-related information. The aim of this study was to evaluate the quality of information related to pit and fissure sealants in Persian websites. Materials and Methods: In this cross-sectional study, Persian websites providing information about fissure sealants were found using the Google search engine. The searched keywords according to the MeSH database were "patient education" and "fissure sealant". After applying the exclusion criteria, 37 websites out of 500 initial links remained in the study. These websites were evaluated based on a researcher-made checklist. The validity and reliability of the checklist were evaluated and confirmed. Descriptive analysis was applied to report the results of our study using SPSS version 11.5. Results: The average score for the quality of information was 22.46 out of 38. The minimum and maximum scores were 16 and 30 and belonged to Pezeshkanemrooz.com and Asa85.blogfa.com, respectively. The results showed that 62.2% of the answers were scored 2-4 and 37.8% were scored 1; therefore, the overall quality of the published content was rated to be moderate for 62.2% and low for 37.8% of the websites. Conclusions: Overall, the quality of information related to fissure sealant provided in Persian websites was good; however, the information given was mostly incomplete and could be improved. The main problems were doubtful credibility and outdated information.

  19. Geothermal Websites

    Energy Technology Data Exchange (ETDEWEB)

    Boyd, Tonya

    2005-03-01

    The Internet has become an important part of our everyday life. It can be used to correspond with people across the world much faster than sending a letter in the mail. The Internet has a wealth of information that is available to anybody just by searching for it. Sometimes you get more information than you ever wanted to know and sometimes you can’t find any information. This paper will only cover a small portion of the websites and their links that have geothermal information concerning reservoir engineering, enhanced geothermal systems, hot dry rock and other aspects of geothermal. Some of the websites below are located in the US and others are international; they include geothermal associations and websites where you can access publications. Most of the websites listed below also have links to other websites for even more information.

  20. TOFIR: A Tool of Facilitating Information Retrieval - Introduce a Visual Retrieval Model.

    Science.gov (United States)

    Zhang, Jin

    2001-01-01

    Introduces a new method for the visualization of information retrieval called TOFIR (Tool of Facilitating Information Retrieval). Discusses the use of angle attributes of a document to construct the angle-based visual space; two-dimensional and three-dimensional visual tools; ambiguity; and future research directions. (Author/LRW)

  1. Pharma Websites and "Professionals-Only" Information: The Implications for Patient Trust and Autonomy.

    Science.gov (United States)

    Graber, Mark Alan; Hershkop, Eliyakim; Graber, Rachel Ilana

    2017-05-24

    Access to information is critical to a patient's valid exercise of autonomy. One increasingly important source of medical information is the Internet. Individuals often turn to drug company ("pharma") websites to look for drug information. The objective of this study was to determine whether there is information on pharma websites that is embargoed: Is there information that is hidden from the patient unless she attests to being a health care provider? We discuss the implications of our findings for health care ethics. We reviewed a convenience sample of 40 pharma websites for "professionals-only" areas and determined whether access to those areas was restricted, requiring attestation that the user is a health care professional in the United States. Of the 40 websites reviewed, 38 had information that was labeled for health care professionals-only. Of these, 24 required the user to certify their status as a health care provider before they were able to access this "hidden" information. Many pharma websites include information in a "professionals-only" section. Of these, the majority require attestation that the user is a health care professional before they can access the information. This leaves patients with two bad choices: (1) not accessing the information or (2) lying about being a health care professional. Both of these outcomes are unacceptable. In the first instance, the patient's access to information is limited, potentially impairing their health and their ability to make reasonable and well-informed decisions. In the second instance, they may be induced to lie in a medical setting. "Teaching" patients to lie may have adverse consequences for the provider-patient relationship.

  2. A semantic medical multimedia retrieval approach using ontology information hiding.

    Science.gov (United States)

    Guo, Kehua; Zhang, Shigeng

    2013-01-01

    Searching useful information from unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users' query intent. Firstly, semantic annotations will be given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represented semantic information will be hidden in the head of the multimedia documents. The main innovations of this approach are cross-type retrieval support and semantic information preservation. Experimental results indicate a good precision and efficiency of our approach for medical multimedia retrieval in comparison with some traditional approaches.

  3. Multimodal medical information retrieval with unsupervised rank fusion.

    Science.gov (United States)

    Mourão, André; Martins, Flávio; Magalhães, João

    2015-01-01

    Modern medical information retrieval systems are paramount to manage the insurmountable quantities of clinical data. These systems empower health care experts in the diagnosis of patients and play an important role in the clinical decision process. However, the ever-growing heterogeneous information generated in medical environments poses several challenges for retrieval systems. We propose a medical information retrieval system with support for multimodal medical case-based retrieval. The system supports medical information discovery by providing multimodal search, through a novel data fusion algorithm, and term suggestions from a medical thesaurus. Our search system compared favorably to other systems in 2013 ImageCLEFMedical.
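    The abstract above does not describe its fusion algorithm, so the following is only a minimal sketch of what unsupervised rank fusion can look like, using the well-known reciprocal rank fusion (RRF) rule as a stand-in rather than the authors' method. The document identifiers, the two input rankings (imagined as a text run and an image run) and the constant k=60 are illustrative assumptions.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists without training data.

    Each document receives 1 / (k + rank) from every list it appears in
    (rank starts at 1); documents are returned by descending fused score.
    This is generic RRF, not the algorithm of the paper summarized above.
    """
    fused = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    return sorted(fused, key=fused.get, reverse=True)

# Hypothetical runs from two modalities over the same case collection.
text_run = ["case_a", "case_b", "case_c"]
image_run = ["case_b", "case_c", "case_a"]
fused_order = reciprocal_rank_fusion([text_run, image_run])
print(fused_order)
```

    Rank-based rules of this kind need no score calibration across modalities, which is one reason they are a common baseline for multimodal search.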

  4. A Virtual Commitment: Disability Services Information on Public Community College Websites

    Science.gov (United States)

    Jackson, Dimitra Lynette; Jones, Stephanie J.

    2014-01-01

    The research on students with disabilities has focused primarily on transition programs and the accessibility of information in the classroom environment. There is a dearth of studies that examine the accessibility of disability services information on community college websites for prospective students with disabilities. A researcher-developed…

  5. A Survey of Stemming Algorithms in Information Retrieval

    Science.gov (United States)

    Moral, Cristian; de Antonio, Angélica; Imbert, Ricardo; Ramírez, Jaime

    2014-01-01

    Background: During the last fifty years, improved information retrieval techniques have become necessary because of the huge amount of information people have available, which continues to increase rapidly due to the use of new technologies and the Internet. Stemming is one of the processes that can improve information retrieval in terms of…

  6. Content-based retrieval of visual information

    NARCIS (Netherlands)

    Oerlemans, Adrianus Antonius Johannes

    2011-01-01

    In this dissertation, I investigate new approaches relevant to content-based image retrieval techniques. First, the MOD paradigm is proposed, a method for detecting salient points in images. These salient points are specifically designed to enhance image retrieval accuracy by maximizing distinctive

  7. Informing, advising, or persuading? An assessment of bone mineral density testing information from consumer health websites.

    Science.gov (United States)

    Green, Carolyn J; Kazanjian, Arminée; Helmer, Diane

    2004-01-01

    Greater access to web-based information on health-care interventions might result in greater participation by patients in care and self-care decisions, but only improve health outcomes if the indicated actions produce the intended benefits. Unbiased research on benefits and harms of health information can provide a basis for evidence-based patient information systems. To evaluate the quality of the information content on bone-mineral density (BMD) testing posted on consumer health websites (CHWS). Five popular engines (Yahoo, MSN, AOL, Lycos, and Go.com) were used to search for patient information on bone densitometry. The fifteen websites that supplied relevant content and were identified by three of the five search engines were selected in order of popularity of the search engine and primacy of placement. Six BMD reports from health technology assessment (HTA) organizations were used as a standard of scientific quality. These were identified from the HTA Database at York University United Kingdom and published between 1996 and 2001. Content was extracted from both document types, and these sets were compared independently by two reviewers. The majority of CHWS identified by popular search engines do not disclose the limited capacity of BMD to discriminate between low-risk individuals and those who will suffer future fractures. CHWS generally present BMD testing as quick, painless, noninvasive, and as being recommended, based on risk factors that are widespread among the general public. BMD testing information is prominently paired on CHWS sites with information on osteoporosis, with an emphasis on "silent disease" and the devastating consequences of advanced disease. Sponsors of CHWS sites are frequently either providers of BMD testing or companion drugs, and consequently in a position of conflict of interest with regard to decisions to undergo BMD testing. HTA organizations have no documented conflict of interest, nor do they invoke emotional arguments. Their

  8. Visualization of database structures for information retrieval

    Directory of Open Access Journals (Sweden)

    Grete Lisbjerg Jensen

    1994-12-01

    Full Text Available This paper describes the Book House system, which is designed to support children's information retrieval in libraries as part of their education. It is a shareware program available on CD-ROM or floppy disks, and comprises functionality for database searching as well as for classifying and storing book information in the database. The system concept is based on an understanding of children's domain structures and their capabilities for categorization of information needs in connection with their activities in schools, in school libraries or in public libraries. These structures are visualized in the interface by using metaphors and multimedia technology. Through the use of text, images and animation, the Book House encourages children - even at a very early age - to learn by doing in an enjoyable way, which plays on their previous experiences with computer games. Both words and pictures can be used for searching; this makes the system suitable for all age groups. Even children who have not yet learned to read properly can, by selecting pictures, search for and find those books they would like to have read aloud. Thus, at the very beginning of their school life, they can learn to search for books on their own. For the library community, such a system will provide an extended service which will increase the number of children's own searches and also improve the relevance, quality and utilization of the book collections in the libraries. A market research report on the need for an annual indexing service for books in the Book House format is in preparation by the Danish Library Centre A/S.

  9. Information on 'Overdiagnosis' in Breast Cancer Screening on Prominent United Kingdom- and Australia-Oriented Health Websites

    OpenAIRE

    Alex Ghanouni; Meisel, Susanne F.; Jolyn Hersch; Jo Waller; Jane Wardle; Cristina Renzi

    2016-01-01

    Objectives: Health-related websites are an important source of information for the public. Increasing public awareness of overdiagnosis and ductal carcinoma in situ (DCIS) in breast cancer screening may facilitate more informed decision-making. This study assessed the extent to which such information was included on prominent health websites oriented towards the general public, and evaluated how it was explained. Design: Cross-sectional study. Setting: Websites identified through Google searc...

  10. Improving information retrieval in functional analysis.

    Science.gov (United States)

    Rodriguez, Juan C; González, Germán A; Fresno, Cristóbal; Llera, Andrea S; Fernández, Elmer A

    2016-12-01

    Transcriptome analysis is essential to understand the mechanisms regulating key biological processes and functions. The first step usually consists of identifying candidate genes; to find out which pathways are affected by those genes, however, functional analysis (FA) is mandatory. The most frequently used strategies for this purpose are Gene Set and Singular Enrichment Analysis (GSEA and SEA) over Gene Ontology. Several statistical methods have been developed and compared in terms of computational efficiency and/or statistical appropriateness. However, whether their results are similar or complementary, the sensitivity to parameter settings, or possible bias in the analyzed terms has not been addressed so far. Here, two GSEA and four SEA methods and their parameter combinations were evaluated in six datasets by comparing two breast cancer subtypes with well-known differences in genetic background and patient outcomes. We show that GSEA and SEA lead to different results depending on the chosen statistic, model and/or parameters. Both approaches provide complementary results from a biological perspective. Hence, an Integrative Functional Analysis (IFA) tool is proposed to improve information retrieval in FA. It provides a common gene expression analytic framework that grants a comprehensive and coherent analysis. Only a minimal user parameter setting is required, since the best SEA/GSEA alternatives are integrated. IFA utility was demonstrated by evaluating four prostate cancer and the TCGA breast cancer microarray datasets, which showed its biological generalization capabilities.

  11. An Abstraction-Based Data Model for Information Retrieval

    Science.gov (United States)

    McAllister, Richard A.; Angryk, Rafal A.

    Language ontologies provide an avenue for automated lexical analysis that may be used to supplement existing information retrieval methods. This paper presents a method of information retrieval that takes advantage of WordNet, a lexical database, to generate paths of abstraction, and uses them as the basis for an inverted index structure to be used in the retrieval of documents from an indexed corpus. We present this method as an entrée to a line of research on using ontologies to perform word-sense disambiguation and improve the precision of existing information retrieval techniques.

  12. Development of a Website Providing Evidence-Based Information About Nutrition and Cancer: Fighting Fiction and Supporting Facts Online.

    Science.gov (United States)

    van Veen, Merel Rebecca; Beijer, Sandra; Adriaans, Anika Maria Alberdina; Vogel-Boezeman, Jeanne; Kampman, Ellen

    2015-09-08

    Although evidence-based information on nutrition and cancer is widely available, the general public, cancer patients, and cancer survivors have difficulties accessing it. It is challenging to distinguish myths from facts, and sometimes conflicting information can be found in different places. The public and patients would benefit from evidence-based, correct, and clear information from an easily recognizable source. The aim of this project is to make scientific information available for the general public, cancer patients, and cancer survivors through a website. The aim of this paper is to describe and evaluate the development of the website as well as related statistics in the 1st year after its launch. To develop the initial content for the website, the website was filled with answers to frequently asked questions provided by cancer organizations and the Dutch Dietetic Oncology Group, and by responding to various fiction and facts published in the media. The website was organized into 3 parts, namely, nutrition before (prevention), during, and after cancer therapy; an opportunity for visitors to submit specific questions regarding nutrition and cancer was included. The website was pretested by patients, health care professionals, and communication experts. After launching the website, visitors' questions were answered by nutritional scientists and dieticians with evidence- or eminence-based information on nutrition and cancer. Once the website was live, question categories and website statistics were recorded. Before launch, the key areas for improvement, such as navigation, categorization, and missing information, were identified and adjusted. In the 1st year after the launch, 90,111 individuals visited the website, and 404 questions were submitted on nutrition and cancer. Most of the questions were on cancer prevention and nutrition during the treatment of cancer. The website provides access to evidence- and eminence-based information on nutrition and cancer. As can be

  13. Are Social Networking Websites Educational? Information Capsule. Volume 0909

    Science.gov (United States)

    Blazer, Christie

    2009-01-01

    More and more school districts across the country are joining social networking sites, such as Facebook and MySpace. This Information Capsule discusses the frequency with which school districts are using social networking sites, how districts are using the sites, and potential drawbacks associated with their use. Issues for districts to consider…

  14. Information content of ozone retrieval algorithms

    Science.gov (United States)

    Rodgers, C.; Bhartia, P. K.; Chu, W. P.; Curran, R.; Deluisi, J.; Gille, J. C.; Hudson, R.; Mateer, C.; Rusch, D.; Thomas, R. J.

    1989-01-01

    The algorithms that were used for production processing by the major suppliers of ozone data are characterized to show quantitatively: how the retrieved profile is related to the actual profile (this characterizes the altitude range and vertical resolution of the data); the nature of systematic errors in the retrieved profiles, including their vertical structure and relation to uncertain instrumental parameters; how trends in the real ozone are reflected in trends in the retrieved ozone profile; and how trends in other quantities (both instrumental and atmospheric) might appear as trends in the ozone profile. No serious deficiencies were found in the algorithms used in generating the major available ozone data sets. As the measurements are all indirect in some way, and the retrieved profiles have different characteristics, data from different instruments are not directly comparable.

  15. Innovations in information retrieval perspectives for theory and practice

    CERN Document Server

    Foster, Allen

    2011-01-01

    The advent of various information retrieval (IR) technologies and approaches to storage and retrieval provide communities with opportunities for mass documentation, digitization, and the recording of information in different forms. This book introduces and contextualizes these developments and looks at supporting research in IR.

  16. User-Centric Multi-Criteria Information Retrieval

    Science.gov (United States)

    Wolfe, Shawn R.; Zhang, Yi

    2009-01-01

    Information retrieval models usually represent content only, and not other considerations, such as authority, cost, and recency. How could multiple criteria be utilized in information retrieval, and how would it affect the results? In our experiments, using multiple user-centric criteria always produced better results than a single criterion.

  17. Brute Force Information Retrieval Experiments using MapReduce

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Hauff, Claudia

    2012-01-01

    MIREX (MapReduce Information Retrieval Experiments) is a software library initially developed by the Database Group of the University of Twente for running large scale information retrieval experiments on clusters of machines. MIREX has been tested on web crawls of up to half a billion web pages, to

  18. Prototyping a Distributed Information Retrieval System That Uses Statistical Ranking.

    Science.gov (United States)

    Harman, Donna; And Others

    1991-01-01

    Built using a distributed architecture, this prototype distributed information retrieval system uses statistical ranking techniques to provide better service to the end user. Distributed architecture was shown to be a feasible alternative to centralized or CD-ROM information retrieval, and user testing of the ranking methodology showed both…

  19. Problems of Music Information Retrieval in the Real World.

    Science.gov (United States)

    Byrd, Donald; Crawford, Tim

    2002-01-01

    Considers some of the most fundamental problems in music information retrieval, challenging the common assumption that searching on pitch alone is likely to be satisfactory for all purposes. Discusses special issues related to polyphonic music, user-interface issues, and the notion of relevance for music information retrieval. (Contains 52…

  20. Counter-intuitive Cases of Data Fusion in Information Retrieval.

    Science.gov (United States)

    Ibraev, Ulukbek; Kantor, Paul; Ng, K. B.

    2001-01-01

    Aspects of Data Fusion (DF) for information retrieval are explored. Based on a geometrical model of DF, it is shown that in the ideal case, performance of DF for a pair of information retrieval schemes may be approximated by a quadratic polynomial. Compares counter-intuitive cases of DF with cases that behave according to the geometric model. (AEF)

  1. An Expressive and Efficient Language for XML Information Retrieval.

    Science.gov (United States)

    Chinenyanga, Taurai Tapiwa; Kushmerick, Nicholas

    2002-01-01

    Discusses XML and information retrieval and describes a query language, ELIXIR (expressive and efficient language for XML information retrieval), with a textual similarity operator that can be used for similarity joins. Explains the algorithm for answering ELIXIR queries to generate intermediate relational data. (Author/LRW)

  2. Here's an idea: ask the users! Young people's views on navigation, design and content of a health information website.

    Science.gov (United States)

    Franck, Linda S; Noble, Genevieve

    2007-12-01

    Use of the internet to provide health information to young people is a relatively recent development. Few studies have explored young people's views on how they use internet health websites. This study investigated the navigation, design and content preferences of young people using the Children First for Health (CFfH) website. Young people from five secondary schools completed an internet site navigation exercise, website evaluation questionnaire and participated in informal discussions. Of the participants, 45 percent visited the website section aimed at older adolescents within their first two clicks, regardless of their age. There were conflicting preferences for design and strong preference for gender-specific information on topics such as appearance, relationships, fitness and sexual health. The findings indicate the importance of gaining young people's views to ensure that health information websites meet the needs of their intended audience. Cooperation from schools can facilitate the process of gaining young people's views on internet website navigation, design and content.

  3. An Ethnographic Analysis of Adolescent Sexual Minority Website Usage: Exploring Notions of Information Seeking and Sexual Identity Development

    Science.gov (United States)

    Sulfridge, Rocky M.

    2012-01-01

    This dissertation explores the website usage of adolescent sexual minorities, examining notions of information seeking and sexual identity development. Sexual information seeking is an important element within human information behavior and is uniquely problematic for young sexual minorities. Utilizing a contemporary gay teen website, this…

  5. Information retrieval for children based on the aggregated search paradigm

    NARCIS (Netherlands)

    Duarte Torres, Sergio

    2011-01-01

    This report presents research to develop information services for children by expanding and adapting current Information retrieval technologies according to the search characteristics and needs of children. Concretely, we will employ the aggregated search paradigm as theoretical framework. The objec

  6. Attitudes Toward Automated Information Retrieval Services Among RASD Members

    Science.gov (United States)

    Nitecki, Danuta A.

    1976-01-01

    Summary of survey of the American Library Association Reference and Adult Services Division (RASD) members concerning attitudes toward, need for, and preferences in acquiring information on automated information retrieval services. (KP)

  7. Evaluation of Quality and Readability of Health Information Websites Identified through India’s Major Search Engines

    Directory of Open Access Journals (Sweden)

    S. Raj

    2016-01-01

    Full Text Available Background. Health information available on websites should be reliable and accurate so that the community can make informed decisions. This study was done to assess the quality and readability of health information websites on the World Wide Web in India. Methods. This cross-sectional study was carried out in June 2014. The key words “Health” and “Information” were used on the search engines “Google” and “Yahoo.” Out of 50 websites (25 from each search engine), 32 websites remained after exclusion and were evaluated. The LIDA tool was used to assess quality, whereas readability was assessed using the Flesch Reading Ease Score (FRES), Flesch-Kincaid Grade Level (FKGL), and SMOG. Results. Forty percent of websites (n=13) were sponsored by government. Health On the Net Code of Conduct (HONcode) certification was present on 50% (n=16) of websites. The mean LIDA score (74.31) was average. Only 3 websites scored high on the LIDA score. Only five had readability scores at the recommended sixth-grade level. Conclusion. Most health information websites had average quality, especially in terms of usability and reliability, and were written at high readability levels. Efforts are needed to develop health information websites that can help the general population in informed decision making.
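    The three readability measures named in this abstract have standard published formulas, which can be sketched as follows. The abstract does not say how words, sentences and syllables were counted, so the functions take precomputed counts, and the sample counts are hypothetical values chosen only to exercise the formulas.

```python
import math

def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease Score: higher values indicate easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)

def flesch_kincaid_grade(words, sentences, syllables):
    """Flesch-Kincaid Grade Level: approximate US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59

def smog_grade(polysyllables, sentences):
    """SMOG grade from the count of words with three or more syllables."""
    return 1.0430 * math.sqrt(polysyllables * (30 / sentences)) + 3.1291

# Hypothetical counts for one web page (not data from the study).
words, sentences, syllables, polysyllables = 300, 20, 450, 30
fres = flesch_reading_ease(words, sentences, syllables)
fkgl = flesch_kincaid_grade(words, sentences, syllables)
smog = smog_grade(polysyllables, sentences)
print(round(fres, 2), round(fkgl, 2), round(smog, 2))
```

    Under these formulas a page meets a sixth-grade target when FKGL or SMOG is about 6 or lower; FRES runs in the opposite direction, with higher scores meaning easier text.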

  8. Information Retrieval and Graph Analysis Approaches for Book Recommendation.

    Science.gov (United States)

    Benkoussas, Chahinez; Bellot, Patrice

    2015-01-01

    A combination of multiple information retrieval approaches is proposed for the purpose of book recommendation. In this paper, book recommendation is based on complex user queries. We used different theoretical retrieval models, namely the probabilistic InL2 model (a Divergence from Randomness model) and a language model, and tested their interpolated combination. Graph analysis algorithms such as PageRank have been successful in Web environments. We consider the application of this algorithm in a new retrieval approach to a related-document network comprised of social links. We call this network, constructed from documents and the social information provided by each of them, the Directed Graph of Documents (DGD). Specifically, this work tackles the problem of book recommendation in the context of the INEX (Initiative for the Evaluation of XML retrieval) Social Book Search track. A series of reranking experiments demonstrates that combining retrieval models yields significant improvements in terms of standard ranked retrieval metrics. These results extend the applicability of link analysis algorithms to different environments.
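    The two ingredients mentioned in this abstract, interpolating the scores of two retrieval models and running PageRank over a document graph, can be illustrated with a small sketch. It assumes a plain linear interpolation and a textbook power-iteration PageRank; the weight lam, the damping factor, the book identifiers, the scores and the toy graph are all hypothetical and are not taken from the paper.

```python
def interpolate(scores_a, scores_b, lam=0.5):
    """Linear interpolation lam * a + (1 - lam) * b over all scored documents."""
    docs = set(scores_a) | set(scores_b)
    return {d: lam * scores_a.get(d, 0.0) + (1 - lam) * scores_b.get(d, 0.0)
            for d in docs}

def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank on {node: [linked nodes]}; every node is a key."""
    nodes = list(graph)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        new_rank = {v: (1 - damping) / n for v in nodes}
        for v in nodes:
            targets = graph[v] or nodes  # a node without links spreads rank evenly
            share = damping * rank[v] / len(targets)
            for t in targets:
                new_rank[t] += share
        rank = new_rank
    return rank

# Hypothetical scores from two retrieval models and a toy document graph.
model_1 = {"book1": 0.9, "book2": 0.4}
model_2 = {"book1": 0.2, "book2": 0.7, "book3": 0.5}
combined = interpolate(model_1, model_2, lam=0.6)
links = {"book1": ["book2"], "book2": ["book3"], "book3": ["book1", "book2"]}
pr = pagerank(links)
reranked = sorted(combined, key=lambda d: combined[d] * pr[d], reverse=True)
print(reranked)
```

    Multiplying the interpolated score by the PageRank value is only one possible way to rerank; a weighted sum is an equally plausible choice, and the abstract does not say which combination was used.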

  9. Rare disease diagnosis as an information retrieval task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina;

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend...... search and offline retrieval from a rare disease collection indicate that the retrieval of rare diseases is an open problem with room for improvement....

  10. Science information systems: Archive, access, and retrieval

    Science.gov (United States)

    Campbell, William J.

    1991-01-01

    The objective of this research is to develop technology for the automated characterization and interactive retrieval and visualization of very large, complex scientific data sets. Technologies will be developed for the following specific areas: (1) rapidly archiving data sets; (2) automatically characterizing and labeling data in near real-time; (3) providing users with the ability to browse contents of databases efficiently and effectively; (4) providing users with the ability to access and retrieve system independent data sets electronically; and (5) automatically alerting scientists to anomalies detected in data.

  11. SCHOOL WEBSITE AS A FACTOR OF DEVELOPMENT OF SCHOOL INFORMATION EDUCATIONAL ENVIRONMENT

    Directory of Open Access Journals (Sweden)

    Olga P. Pinchuk

    2013-03-01

    Full Text Available The article analyses the current state of school openness and presents an experience of creating a school website as a means to resolve the contradiction between the appearance of various forms of information and the limited ways to use these forms in educational systems. The website is regarded as a component of a common information educational space in Ukraine, as an important factor in its development, and as a tool to develop cooperation among all members of the educational process (students, teachers, psychologists, principals and parents). The article considers the following elements: operative informing of parents, open exchange of teaching experience, and reporting of monitoring results. Prospective directions of research, including information security of school operation, are identified.

  12. High school students' perspective on the features of consumer health information websites

    Directory of Open Access Journals (Sweden)

    Vahideh Zarea

    2016-06-01

    Full Text Available The main aim of the study was to identify the primary source of health information seeking among high school students and the characteristics of quality consumer health information from their perspective. A cross-sectional descriptive survey was conducted using a valid questionnaire. The first source of health information seeking for most of the high school students (79%) was the Internet rather than books, journals or family members. The majority of boys (87%) go to the Internet for pathology and definitions of diseases, but the girls (82%) usually search for lifestyle, exercise, nutrition, mental health and maturity, and then for general health information such as physiology, anatomy, and calculations. All of the students recognize content accuracy as a quality criterion and believe that the involvement of information specialists in the management of websites may guarantee the quality of a website. It is concluded that development of a quality consumer health information website is essential to meet the health information needs of students and to promote health literacy among high school students and adolescents in Iran.

  13. MIREX: MapReduce Information Retrieval Experiments

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Hauff, Claudia

    2010-01-01

    We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use a cluster of 15 low cost machines to search a web crawl of 0.5 billion pages showing that sequential scanning is a viabl
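    The idea of answering queries by sequentially scanning every document in a map phase and merging results in a reduce phase can be sketched in plain Python, simulated here in a single process. This is not MIREX code: the function names, the term-count scoring and the toy collection are assumptions made for illustration.

```python
import heapq
from collections import defaultdict

def map_document(doc_id, text, queries):
    """Map step: scan one document and emit (query_id, (score, doc_id)) pairs."""
    tokens = text.lower().split()
    for query_id, terms in queries.items():
        score = sum(tokens.count(term) for term in terms)  # stand-in scoring
        if score > 0:
            yield query_id, (score, doc_id)

def reduce_query(pairs, k=2):
    """Reduce step: keep the k highest-scoring documents for one query."""
    return heapq.nlargest(k, pairs)

def run(docs, queries, k=2):
    """Simulate the shuffle by grouping mapper output on the query id."""
    grouped = defaultdict(list)
    for doc_id, text in docs.items():
        for query_id, pair in map_document(doc_id, text, queries):
            grouped[query_id].append(pair)
    return {query_id: reduce_query(pairs, k) for query_id, pairs in grouped.items()}

# Toy collection and queries (hypothetical).
docs = {
    "d1": "web search engines index web pages",
    "d2": "ozone retrieval algorithms",
    "d3": "search the web",
}
queries = {"q1": ["web", "search"], "q2": ["ozone"]}
results = run(docs, queries)
print(results)
```

    Because each document is scored independently in the map step, no index has to be built before a new scoring function is tried; changing the retrieval approach means changing only the scoring line.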

  14. The JPL Library information retrieval system

    Science.gov (United States)

    Walsh, J.

    1975-01-01

    The development, capabilities, and products of the computer-based retrieval system of the Jet Propulsion Laboratory Library are described. The system handles books and documents, produces a book catalog, and provides a machine search capability. Programs and documentation are available to the public through NASA's computer software dissemination program.

  15. Tools for assessing the quality and accessibility of online health information: initial testing among breast cancer websites.

    Science.gov (United States)

    Whitten, Pamela; Nazione, Samantha; Lauckner, Carolyn

    2013-12-01

    Health websites are used frequently, but there are many concerns about their value as information sources. Additionally, there are numerous personal barriers that prevent individuals from wholly benefitting from them. In order to assess the quality of health websites and their accessibility to users, we created tools based on previous research that examine design aspects, information validity, motivational health content and literacy content. To test these tools, we examined 155 breast cancer websites and created scores for each assessment tool to describe the percent of constructs on the average website. Results demonstrated that websites performed best on the design tool followed by the information validity, motivational health content and literacy assessment tools. The average website contained the majority of the design and information validity constructs, but only about a third of the motivational health or literacy constructs. Multiple items from the motivational health content and literacy assessment tools were not found on any of the websites, and many were only represented on a handful of sites. Overall, the assessment tools were useful in evaluating the quality of websites, and could serve as valuable resources for health website developers in the future.

  16. An Evaluation of Automatically Constructed Hypertexts for Information Retrieval.

    Science.gov (United States)

    Melucci, Massimo

    1999-01-01

    Assesses the retrieval effectiveness of automatically constructed interdocument hypertext links in information retrieval (IR). Describes experiments using statistical and probabilistic techniques that were designed to obtain evidence concerning the usefulness of querying and browsing automatically constructed IR hypertexts. Results indicate a…

  17. Improving Performance Support Systems through Information Retrieval Evaluation

    Science.gov (United States)

    Schatz, Steven

    2006-01-01

    This study examines existing and new methods for evaluating the success of information retrieval systems. The theory underlying current methods is not robust enough to allow testing retrieval using different meta-tagging schemas. Traditional measures rely on judgments of whether a document is relevant to a particular question. A good system…

  18. Bibliometric-Enhanced Information Retrieval. Editorial for the workshop.

    NARCIS (Netherlands)

    Mayr, Philipp; Schaer, Philipp; Scharnhorst, Andrea; Mutschke, Peter; de Rijke, Maarten; Kenter, Tom; de Vries, Arjen P.; Zhai, ChengXiang; de Jong, Franciska; Radinsky, Kira; Hofmann, Katja

    2014-01-01

    This first "Bibliometric-enhanced Information Retrieval" (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offe

  19. A Semantic Medical Multimedia Retrieval Approach Using Ontology Information Hiding

    OpenAIRE

    2013-01-01

    Searching for useful information in unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users’ query intent. Firstly, semantic annotations are given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represents the semantic information is hidden in the head of the multimedia documents. The main innovations of ...

  20. Intelligent Agent-Based System for Digital Library Information Retrieval

    Institute of Scientific and Technical Information of China (English)

    师雪霖; 牛振东; 宋瀚涛; 宋丽哲

    2003-01-01

    A new information search model is reported, and the design and implementation of a system based on intelligent agents are presented. The system is an assistant information retrieval system that helps users find what they need. It consists of four main components: an interface agent, an information retrieval agent, a broker agent and a learning agent, which collaborate to implement the system functions. The agents apply learning mechanisms based on an improved ID3 algorithm.

  1. Understanding information retrieval systems management, types, and standards

    CERN Document Server

    Bates, Marcia J

    2011-01-01

    In order to be effective for their users, information retrieval (IR) systems should be adapted to the specific needs of particular environments. The huge and growing array of types of information retrieval systems in use today is on display in Understanding Information Retrieval Systems: Management, Types, and Standards, which addresses over 20 types of IR systems. These various system types, in turn, present both technical and management challenges, which are also addressed in this volume. In order to be interoperable in a networked environment, IR systems must be able to use various types of

  2. A feature-centric view of information retrieval

    CERN Document Server

    Metzler, Donald

    2011-01-01

    Commercial Web search engines such as Google, Yahoo, and Bing are used every day by millions of people across the globe. With their ever-growing refinement and usage, it has become increasingly difficult for academic researchers to keep up with the collection sizes and other critical research issues related to Web search, which has created a divide between the information retrieval research being done within academia and industry. Such large collections pose a new set of challenges for information retrieval researchers. In this work, Metzler describes highly effective information retrieval mod

  3. An Integrated Information Retrieval Support System for Campus Network

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    This paper presents a new integrated information retrieval support system (IIRSS) which can help Web search engines retrieve cross-lingual information from heterogeneous resources stored in multiple databases on an intranet. The IIRSS, with a three-layer architecture, can cooperate with other application servers running on the intranet. By using intelligent agents to collect information and create indexes on the fly, using an access control strategy to confine users to browsing the documents accessible to them through a single portal, and using a new cross-lingual translation tool to help the search engine retrieve documents, the new system provides controllable information access with different authorizations, personalized services, and real-time information retrieval.

  4. Assessing the nutritional information for children younger than two years old available on popular websites

    Directory of Open Access Journals (Sweden)

    Gisele da Silva Gomes Monteiro

    Full Text Available Abstract Objective: To analyze whether the information found on popular Internet sites is in accordance with the steps recommended by the Food Guide for Children Younger than Two Years of the Ministry of Health (2010). Methods: Descriptive/comparative study, carried out between August and October 2014, which searched for popular sites (for lay people) in Portuguese containing information on the nutrition of children younger than two years. The Google search engine was used. The findings were compared with the Food Guide for Children Younger than Two Years of the Ministry of Health (2010) to verify whether the information shown on the websites was in accordance with the Guide. Results: A total of 50 sites were analyzed, including blogs, food company websites and websites specialized in child nutrition. Only 10% of those pages correctly showed every step of the Food Guide. The recommendations were: exclusive breastfeeding up to six months of life (80%); complementary feeding from six months of life (36%); baby food consistency according to the guide (48%); encouraging the consumption of fruits and vegetables daily (60%). Regarding complementary feeding safety and hygiene, 26% contained correct information. Only 36% correctly warned about which foods should be avoided in the first years of life. Conclusions: The information found on the sites is largely in disagreement with the Ministry of Health recommendations, which can lead to misconceptions in the nutritional care of children younger than two years.

  5. Order effect in interactive information retrieval evaluation

    DEFF Research Database (Denmark)

    Clemmensen, Melanie Landvad; Borlund, Pia

    2016-01-01

    , the phenomenon is not yet fully understood or investigated in relation to IIR; hence the objective is to increase the knowledge of this phenomenon in the context of IIR as it has implications for test design of IIR studies. Design/methodology/approach – Order effect is studied via partly a literature review...... and partly an empirical IIR study. The empirical IIR study is designed as a classic between-groups design. The IIR search behaviour was logged and complementary post-search interviews were conducted. Findings – The order effect between groups and within search tasks were measured against nine classic IIR...... performance parameters of search interaction behaviour. Order effect is seen with respect to three performance parameters (website changes, visit of webpages, and formulation of queries) shown by an increase in activity on the last performed search. Further the theories with respect to motivation, fatigue...

  6. Interfering effects of retrieval in learning new information.

    Science.gov (United States)

    Finn, Bridgid; Roediger, Henry L

    2013-11-01

    In 7 experiments, we explored the role of retrieval in associative updating, that is, in incorporating new information into an associative memory. We tested the hypothesis that retrieval would facilitate incorporating a new contextual detail into a learned association. Participants learned 3 pieces of information-a person's face, name, and profession (in Experiments 1-5). In the 1st phase, participants in all conditions learned faces and names. In the 2nd phase, participants either restudied the face-name pair (the restudy condition) or were given the face and asked to retrieve the name (the test condition). In the 3rd phase, professions were presented for study just after restudy or testing. Our prediction was that the new information (the profession) would be more readily learned following retrieval of the face-name association compared to restudy of the face-name association. However, we found that the act of retrieval generally undermined acquisition of new associations rather than facilitating them. This detrimental effect emerged on both immediate and delayed tests. Further, the effect was not due to selective attention to feedback because we found impairment whether or not feedback was provided after the Phase 2 test. The data are novel in showing that the act of retrieving information can inhibit the ability to learn new information shortly thereafter. The results are difficult to accommodate within current theories that mostly emphasize benefits of retrieval for learning.

  7. Design Issues and Information Contents of the Provincial Government Websites of Indonesia: A Content Analysis on Visual Messages

    Directory of Open Access Journals (Sweden)

    Achmad Syarief

    2009-07-01

    Full Text Available A website does not merely act as an object for displaying information; it also represents a contextual medium of communication through visuals and contents. The interplay of website design elements builds up meanings that affect users beyond what previous communication practices have uncovered. Previous research acknowledges that visuals and contents have significant effects in attracting users’ attention and trust. The ability of a website to provide credible information through visuals and contents to target users therefore plays an important role in the success of a website. However, although a considerable amount of research on website design has been performed, studies of the characteristics of sites’ visual appearance and information contents for the purpose of promoting local investment in Indonesia have been very limited. This paper addresses the visual design issues and information contents of eighteen provincial government websites of Indonesia. Through content analysis, the paper comparatively examines the visual appearance, information contents, and functions of each website in order to determine the visual characteristics and contents that suit the purpose of promoting local potential. The paper focuses on commonalities, discrepancies, and patterns of content, and provides suggestions to improve the design of Indonesian provincial government websites.

  8. Vector space model for document representation in information retrieval

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2007-12-01

    Full Text Available This paper presents the basics of information retrieval: the vector space model for document representation with Boolean and term weighted models, ranking methods based on the cosine factor and evaluation measures: recall, precision and combined measure.
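
    The ingredients this abstract lists can be sketched in a few lines of Python. This is a minimal illustration, not the paper's formulation: the tf-idf weighting with a +1 smoothing term is an assumed choice of term weighting, and the "combined measure" is assumed here to be the balanced F-measure (harmonic mean of precision and recall). All function names are hypothetical.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Turn tokenised documents into sparse {term: weight} vectors."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Assumed weighting: log(N / df) + 1, so terms in every document keep some weight.
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    vectors = [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]
    return vectors, idf

def cosine(u, v):
    """Cosine of the angle between two sparse vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def rank(query, docs):
    """Return indices of matching documents, best cosine score first."""
    vectors, idf = tfidf_vectors(docs)
    q = {t: tf * idf.get(t, 0.0) for t, tf in Counter(query).items()}
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    return [i for s, i in sorted(scored, key=lambda p: (-p[0], p[1])) if s > 0]

def precision_recall_f(retrieved, relevant):
    """Precision, recall and their harmonic mean for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    p = hits / len(retrieved) if retrieved else 0.0
    r = hits / len(relevant) if relevant else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f
```

    A Boolean model corresponds to replacing the weights with 0/1 term presence; the cosine ranking and the evaluation measures are unchanged.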

  9. Private Information Retrieval and Connections to Coding Theory

    OpenAIRE

    Horlemann, Anna-Lena

    2017-01-01

    We give an introduction to the problem of private information retrieval and show simple first ideas for how to achieve it. We then generalize these ideas and show how known techniques from coding theory are helpful in this regard.
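
    The abstract does not say which "simple first ideas" it covers, so the following is only an illustration of one classic construction: a two-server scheme over a database of bits, assuming the two servers hold identical copies and do not collude. The user sends each server a random-looking subset of positions; the two subsets differ only at the wanted index, so XOR-ing the two answers yields the wanted bit. Function names are hypothetical.

```python
import secrets

def make_queries(n, index):
    """Build two 0/1 selection vectors that differ only at position `index`."""
    q1 = [secrets.randbelow(2) for _ in range(n)]  # uniformly random subset
    q2 = list(q1)
    q2[index] ^= 1  # flip membership of the wanted position
    return q1, q2

def server_answer(database, query):
    """Each server returns the XOR of the database bits its query selects."""
    answer = 0
    for bit, selected in zip(database, query):
        if selected:
            answer ^= bit
    return answer

def retrieve(database, index):
    """Recover database[index] from the two servers' one-bit answers."""
    q1, q2 = make_queries(len(database), index)
    a1 = server_answer(database, q1)  # computed by server 1
    a2 = server_answer(database, q2)  # computed by server 2
    return a1 ^ a2
```

    Each query on its own is a uniformly random bit vector, so neither server learns the index by itself. The cost is that each query is as long as the database, which is the kind of overhead more refined schemes aim to reduce.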

  10. A Question Answering service for information retrieval in Cooper

    NARCIS (Netherlands)

    Giesbers, Bas; Taddeo, Antonio; Van der Vegt, Wim; Van Bruggen, Jan; Koper, Rob

    2007-01-01

    Giesbers, B., Taddeo, A., van der Vegt, W., van Bruggen, J., Koper, R. (2007). A Question Answering service for information retrieval in Cooper. Paper presented at the Cooper workshop, September 18, Crete, Greece.

  11. Classification in Information Retrieval: The Twenty Years Following Dorking.

    Science.gov (United States)

    Coates, E. J.

    1978-01-01

    Discusses theoretical and practical progress made in the classification of information for retrieval in the last 20 years and suggests alternatives to the Dewey Decimal and Library of Congress classification systems. (JVP)

  12. GRAMMAR RULE BASED INFORMATION RETRIEVAL MODEL FOR BIG DATA

    Directory of Open Access Journals (Sweden)

    T. Nadana Ravishankar

    2015-07-01

    Full Text Available Though Information Retrieval (IR) in big data has been an active field of research for the past few years, the popularity of native languages presents a unique challenge for big data information retrieval systems. There is a need to retrieve information that is present in English and display it in the user's native language. This aim of cross-language information retrieval is complicated by unique features of native languages such as morphology, compound word formation, spelling variation, ambiguity, synonymy and the influence of other languages. To overcome some of these issues, the native language is modeled using a grammar rule based approach in this work. The advantage of this approach is that the native language is modeled and its unique features are encoded using a set of inference rules. This rule base, coupled with the customized ontological system, shows considerable potential and is found to give better precision and recall.

  13. INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY

    Directory of Open Access Journals (Sweden)

    Thinn Mya Mya Swe

    2011-07-01

    Full Text Available A digital library is a type of information retrieval (IR) system. Existing information retrieval methodologies generally have problems with keyword searching. We propose a model that addresses the problem with a concept-based approach (ontology) and a metadata case base. The model identifies domain concepts in the user’s query and applies expansion to them. The system aims to improve the relevance of results retrieved from digital libraries by proposing conceptual query expansion for intelligent concept-based retrieval. It imports the concept of ontology to take advantage of its rich semantics and standardized concepts. Domain-specific ontology can be used to move information retrieval from the traditional keyword-based level to a knowledge-based (or concept-based) level, and to change the retrieval process from traditional keyword matching to semantic matching. One approach is query expansion using domain ontology; the other is a case-based similarity measure for metadata information retrieval using a Case Based Reasoning (CBR) approach. Results show improvements over the classic method, query expansion using general-purpose ontology and a number of other approaches.

  14. Information retrieval in digital libraries: bringing search to the net.

    Science.gov (United States)

    Schatz, B R

    1997-01-17

    A digital library enables users to interact effectively with information distributed across a network. These network information systems support search and display of items from organized collections. In the historical evolution of digital libraries, the mechanisms for retrieval of scientific literature have been particularly important. Grand visions in 1960 led first to the development of text search, from bibliographic databases to full-text retrieval. Next, research prototypes catalyzed the rise of document search, from multimedia browsing across local-area networks to distributed search on the Internet. By 2010, the visions will be realized, with concept search enabling semantic retrieval across large collections.

  15. Noun-Phrase Analysis in Unrestricted Text for Information Retrieval

    OpenAIRE

    Evans, David A.; Zhai, Chengxiang

    1996-01-01

    Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of a few simple, yet robust and efficient noun-phrase analysis techniques to create better indexing phrases for information retrieval. In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from ...

  16. Care episode retrieval: distributional semantic models for information retrieval in the clinical domain.

    Science.gov (United States)

    Moen, Hans; Ginter, Filip; Marsi, Erwin; Peltonen, Laura-Maria; Salakoski, Tapio; Salanterä, Sanna

    2015-01-01

    Patients' health related information is stored in electronic health records (EHRs) by health service providers. These records include sequential documentation of care episodes in the form of clinical notes. EHRs are used throughout the health care sector by professionals, administrators and patients, primarily for clinical purposes, but also for secondary purposes such as decision support and research. The vast amounts of information in EHR systems complicate information management and increase the risk of information overload. Therefore, clinicians and researchers need new tools to manage the information stored in the EHRs. A common use case is, given a (possibly unfinished) care episode, to retrieve the most similar care episodes among the records. This paper presents several methods for information retrieval, focusing on care episode retrieval, based on textual similarity, where similarity is measured through domain-specific modelling of the distributional semantics of words. Models include variants of random indexing and the semantic neural network model word2vec. Two novel methods are introduced that utilize the ICD-10 codes attached to care episodes to better induce domain-specificity in the semantic model. We report on experimental evaluation of care episode retrieval that circumvents the lack of human judgements regarding episode relevance. Results suggest that several of the methods proposed outperform a state-of-the-art search engine (Lucene) on the retrieval task.

  17. The KNOTTIN website and database: a new information system dedicated to the knottin scaffold

    OpenAIRE

    Gelly, Jean-Christophe; Gracy, Jérôme; Kaas, Quentin; Le-Nguyen, Dung; Heitz, Annie; Chiche, Laurent

    2004-01-01

    The KNOTTIN website and database organize information about knottins or inhibitor cystine knots, small disulfide-rich proteins with a knotted topology. Thanks to their small size and high stability, knottins provide appealing scaffolds for protein engineering and drug design. Static pages present the main historical and recent results about knottin discoveries, sequences, structures, folding, functions, applications and bibliography. Database searches provide dynamically generated tabular rep...

  18. Research of SEO in information retrieval

    Institute of Scientific and Technical Information of China (English)

    王庆福

    2016-01-01

    Search engines are currently the main information retrieval tools, and improving a website's ranking in search engines can bring it a very large amount of traffic, which in turn converts into economic benefit. SEO technology mainly uses a number of technical means to improve the degree of matching between a website and users' search terms, thereby improving its ranking in the results, which is of great significance for enterprise promotion.

  19. Semantic Annotation for Biological Information Retrieval System

    Directory of Open Access Journals (Sweden)

    Mohamed Marouf Z. Oshaiba

    2015-01-01

    Full Text Available Online literature is growing at a tremendous rate, and the biological domain is one of the fastest growing. Biological researchers have difficulty finding what they are searching for effectively and efficiently. The aim of this research is to find documents that contain any combination of biological process and/or molecular function and/or cellular component. This research proposes a framework that helps researchers retrieve meaningful documents related to their asserted terms based on the gene ontology (GO). The system utilizes GO by semantically decomposing it into three subontologies (cellular component, biological process, and molecular function). Researchers have the flexibility to choose search terms from any combination of the three subontologies. Document annotation is used to create an index of biological terms in documents to speed up the search process. Query expansion is used to infer terms semantically related to the asserted terms; it increases the number of meaningful search results by using term synonyms and term relationships. The system uses a ranking method to order the retrieved documents by their ranking weights. The proposed system meets researchers’ need to find documents that fit the asserted terms semantically.

  20. A comparison of Boolean-based retrieval to the WAIS system for retrieval of aeronautical information

    Science.gov (United States)

    Marchionini, Gary; Barlow, Diane

    1994-01-01

    An evaluation of an information retrieval system using a Boolean-based retrieval engine and inverted file architecture and WAIS, which uses a vector-based engine, was conducted. Four research questions in aeronautical engineering were used to retrieve sets of citations from the NASA Aerospace Database which was mounted on a WAIS server and available through Dialog File 108 which served as the Boolean-based system (BBS). High recall and high precision searches were done in the BBS and terse and verbose queries were used in the WAIS condition. Precision values for the WAIS searches were consistently above the precision values for high recall BBS searches and consistently below the precision values for high precision BBS searches. Terse WAIS queries gave somewhat better precision performance than verbose WAIS queries. In every case, a small number of relevant documents retrieved by one system were not retrieved by the other, indicating the incomplete nature of the results from either retrieval system. Relevant documents in the WAIS searches were found to be randomly distributed in the retrieved sets rather than distributed by ranks. Advantages and limitations of both types of systems are discussed.
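
    For readers unfamiliar with the Boolean side of this comparison, the sketch below shows the general idea of an inverted file with Boolean operators. It is an illustration only and does not reproduce Dialog or the system evaluated in the study; the whitespace tokenisation and the function names are assumptions.

```python
from collections import defaultdict

def build_inverted_file(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index

def boolean_and(index, *terms):
    """Documents containing every term (narrow, precision-oriented)."""
    postings = [index.get(t, set()) for t in terms]
    return set.intersection(*postings) if postings else set()

def boolean_or(index, *terms):
    """Documents containing at least one term (broad, recall-oriented)."""
    return set().union(*(index.get(t, set()) for t in terms))
```

    A Boolean engine of this kind returns an unordered set, so the searcher trades precision against recall by choosing operators; a vector-based engine instead returns a ranked list, as in the WAIS condition described above.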

  1. Adaptive multi-agent system for information retrieval

    Science.gov (United States)

    Maleki-dizaji, Saeedeh; Nyongesa, H. O.; Siddiqqi, J.

    2001-10-01

    The current exponential growth of the Internet precipitates a need for improved tools to help people cope with the volume of information available. Existing search engines such as Yahoo, AltaVista and Excite are efficient in terms of high recall (the percentage of relevant documents that are retrieved from the Internet) and fast response time, at the cost of poor precision (the percentage of retrieved documents that are considered relevant). The problem is due to the lack of filtering, specialisation, relevance feedback, adaptation and exploration. One solution to these problems is to use intelligent agents, which can operate autonomously and improve over time. The agents rely on a user model to improve their performance in retrieving information. This paper presents an adaptive information retrieval (IR) system that learns from user feedback through an evolutionary method, namely genetic algorithms (GA).

  2. Information seeking and information retrieval curriculum development for courses taught in two LIS schools

    OpenAIRE

    Bates, Jessica; Vilar, Polona; Žumer, Maja

    2015-01-01

    Introduction. This paper shows how the set of Information Seeking and Retrieval topics (for devising a curriculum) relates to the curriculum of two modules taught at two different institutions: the Department of Library and Information Science and Book Studies at the University of Ljubljana, Slovenia and the School of Information and Library Studies at University College Dublin, Ireland. Method. The information seeking and retrieval framework is compared to the str...

  3. Adaptive Visualization for Focused Personalized Information Retrieval

    Science.gov (United States)

    Ahn, Jae-wook

    2010-01-01

    The new trend on the Web has totally changed today's information access environment. The traditional information overload problem has evolved into the qualitative level beyond the quantitative growth. The mode of producing and consuming information is changing and we need a new paradigm for accessing information. Personalized search is one of…

  5. Quality of web-based medical information on stable COPD: comparison of non-commercial and commercial websites.

    Science.gov (United States)

    Kunst, Heinke; Khan, Khalid S

    2002-03-01

    The Internet provides an easy and accessible way to deliver medical information about the management of various diseases, both to practitioners and to their patients. As there is no control over who posts information on the Web, there is a risk that the interests of the web producer may bias the quality of information. The quality of medical information on the management of chronic obstructive pulmonary disease (COPD) on the Internet was evaluated, comparing non-commercial and commercial websites. An internet search was conducted to locate relevant websites using a metasearch engine. The quality of websites was scored on a scale of 0-10, based on three items about the credibility of the site and seven items about the accuracy of the information provided by the site. Quality differences between commercial and non-commercial websites were explored. The search revealed 23 relevant websites (12 noncommercial and 11 commercial). The overall quality of non-commercial websites was better than that of commercial websites (median score 7 vs. 4, P = 0.01). Compared to commercial sites, non-commercial websites more often provided information about cessation of smoking (100% vs. 64%, P = 0.03), preventative influenza vaccinations (42% vs. 9%, P = 0.07) and use of long-term oxygen therapy (92% vs. 45%, P = 0.02). Among websites providing information on COPD, commercial sites were much more likely to be of poorer quality compared to sites of non-commercial organizations. In particular, commercial sites do not provide information about simple preventative treatments. There is a need to be vigilant about the quality of health information about COPD on the Internet.

  6. Learning about Potential Users of Collaborative Information Retrieval Systems

    CERN Document Server

    Reddy, Madhu

    2009-01-01

    One of the key components of designing usable and useful collaborative information retrieval systems is to understand the needs of the users of these systems. Our research team has been exploring collaborative information behavior in a variety of organizational settings. Our research goals have been two-fold: first, to develop a conceptual understanding of collaborative information behavior and, second, to gather requirements for the design of collaborative information retrieval systems. In this paper, we present a brief overview of our fieldwork in three different organizational settings, discuss our methodology for collecting data on collaborative information behavior, and highlight some lessons that we are learning about potential users of collaborative information retrieval systems in these domains.

  7. Optimizing XML Information Retrieval Query Execution at the Physical Level

    NARCIS (Netherlands)

    Os, van R.

    2007-01-01

    XML is emerging as a standard format for information interchange and storage of structured information. The wide-spread use of XML has sparked the interest of both the database and information retrieval research communities. XML databases are designed to store and query large volumes of XML data. St

  8. Associative conceptual space-based information retrieval systems

    NARCIS (Netherlands)

    M.J. Schuemie (Martijn); J.H. van den Berg (Jan)

    1998-01-01

    In this 'Information Era' with the availability of large collections of books, articles, journals, CD-ROMs, video films and so on, there exists an increasing need for intelligent information retrieval systems that enable users to find the information desired easily. Many attempts have be

  10. Bibliometrics and Information Retrieval - Creating Knowledge through Research Synergies

    NARCIS (Netherlands)

    Bar-Ilan, Judit; Koopman, Rob; Wang, Shenghui; Scharnhorst, Andrea; John, Marcus; Mayr, Philipp; Wolfram, Dietmar

    2016-01-01

    This panel brings together experts in bibliometrics and information retrieval to discuss how each of these two important areas of information science can help to inform the research of the other. There is a growing body of literature that capitalizes on the synergies created by combining methodologi

  11. Successfully Changing the Landscape of Information Distribution: Extension Food Website Reaches People Locally and Globally

    Directory of Open Access Journals (Sweden)

    Alice Henneman

    2016-02-01

    Full Text Available The goal of the Food website was to develop Internet-based content that was relevant and reached the general public and multiplier groups, such as educators, health professionals, and media outlets. The purpose of this paper was to examine whether a multi-modal approach to information delivery through increases in and changes to content, electronic mailing list creation, and social media posting impacted user access, traffic channels, and referrals from 2010 to 2014. When comparing 2010-2011 versus 2013-2014, there was a 150% increase in total pageviews, 197% increase in unique pageviews, and a 39% increase in average time spent on a page. Since 2010, the website had over 5.2 million total pageviews, 3.1 million sessions, and 2.6 million users. In 2014, top social media referrals included Pinterest, Facebook, LinkedIn, and Twitter. Age of visitors ranged from 18 to 65+, with 45% being 18-34 years old. Approximately 70% were female. Visitors came from 229 countries/territories and 18,237 different cities. The website connects Nebraska and the world to the exciting food research and information generated at the University of Nebraska-Lincoln and is playing an increasingly important role in shaping the future of food in the local and global community.

  12. The KNOTTIN website and database: a new information system dedicated to the knottin scaffold.

    Science.gov (United States)

    Gelly, Jean-Christophe; Gracy, Jérôme; Kaas, Quentin; Le-Nguyen, Dung; Heitz, Annie; Chiche, Laurent

    2004-01-01

    The KNOTTIN website and database organize information about knottins or inhibitor cystine knots, small disulfide-rich proteins with a knotted topology. Thanks to their small size and high stability, knottins provide appealing scaffolds for protein engineering and drug design. Static pages present the main historical and recent results about knottin discoveries, sequences, structures, folding, functions, applications and bibliography. Database searches provide dynamically generated tabular reports or sequence alignments for knottin three-dimensional structures or sequences. BLAST/HMM searches are also available. A simple nomenclature, based on loop lengths between cysteines, is proposed and is complemented by a uniform numbering scheme. This standardization is applied to all knottin structures in the database, facilitating comparisons. Renumbered and structurally fitted knottin PDB files are available for download. The standardized numbering is used for automatic drawing of two-dimensional Colliers de Perles. The KNOTTIN website and database are available at http://knottin.cbs.cnrs.fr and http://knottin.com.

  13. Roogle: an information retrieval engine for clinical data warehouse.

    Science.gov (United States)

    Cuggia, Marc; Garcelon, Nicolas; Campillo-Gimenez, Boris; Bernicot, Thomas; Laurent, Jean-François; Garin, Etienne; Happe, André; Duvauferrier, Régis

    2011-01-01

    A large amount of relevant information is contained in reports stored in electronic patient records and in their associated metadata. R-oogle is a project that aims to develop information retrieval engines adapted to these reports and designed for clinicians. The system consists of a data warehouse (full-text reports and structured data) imported from two different hospital information systems. Information retrieval is performed using metadata-based semantic and full-text search methods (as in Google). Applications may include biomarker identification in a translational approach, search for specific cases, constitution of cohorts, professional practice evaluation, and quality control assessment.

  14. Knowledge Maps and Information Retrieval (KMIR II)

    NARCIS (Netherlands)

    Mutschke, Peter; Scharnhorst, Andrea; Mayr, Philipp; Slavic, Aida; Hansen, Preben

    2015-01-01

    Information systems usually show as a particular point of failure the vagueness between user search terms and the knowledge orders of the information space in question. Some kind of guided searching therefore becomes more and more important in order to more precisely discover information without kno

  15. Visualization for Information Retrieval in Regional Distributed Environment

    Directory of Open Access Journals (Sweden)

    Amany Salama

    2013-09-01

    Full Text Available Information retrieval (IR) is the task of representing, storing, organizing, and offering access to information items. The problem for search engines is not only to find topic-relevant results, but also results consistent with the user's information need. How to retrieve desired information from the Internet with high efficiency and good effectiveness has become the main concern of Internet users. The interfaces of current systems do not help users perceive the precision of the results. Speed, resource consumption, and the searching and retrieving process are also not optimal. The aim of the search engine is to develop and improve the performance of the information retrieval system and to serve the user whatever his cultural level. The proposed system uses information visualization to address interface problems; to address the other problems of web IR systems, it uses a regional crawler in a distributed search environment with conceptual query processing and an enhanced vector space information retrieval model (VSM). It is an effective attempt to match users' changing needs and to obtain better performance than an ordinary system.

  16. Information Literacy on the Web: How College Students Use Visual and Textual Cues to Assess Credibility on Health Websites

    OpenAIRE

    Katrina L. Pariera

    2012-01-01

    One of the most important literacy skills in today’s information society is the ability to determine the credibility of online information. Users sort through a staggering number of websites while discerning which will provide satisfactory information. In this study, 70 college students assessed the credibility of health websites with a low and high design quality, in either low or high credibility groups. The study’s purpose was to understand if students relied more on textual or visual cues...

  17. Locally decodable codes and private information retrieval schemes

    CERN Document Server

    Yekhanin, Sergey

    2010-01-01

    Locally decodable codes (LDCs) are codes that simultaneously provide efficient random access retrieval and high noise resilience by allowing reliable reconstruction of an arbitrary bit of a message by looking at only a small number of randomly chosen codeword bits. Local decodability comes with a certain loss in terms of efficiency - specifically, locally decodable codes require longer codeword lengths than their classical counterparts. Private information retrieval (PIR) schemes are cryptographic protocols designed to safeguard the privacy of database users. They allow clients to retrieve rec

  18. Graph-based term weighting for information retrieval

    DEFF Research Database (Denmark)

    Blanco, Roi; Lioma, Christina

    2012-01-01

    A standard approach to Information Retrieval (IR) is to model text as a bag of words. Alternatively, text can be modelled as a graph, whose vertices represent words, and whose edges represent relations between the words, defined on the basis of any meaningful statistical or linguistic relation......, flow and density during retrieval. We experimentally show that this type of ranking performs comparably to BM25, and can even outperform it, across different TREC (Voorhees and Harman in TREC: Experiment and evaluation in information retrieval, MIT Press, 2005) datasets and evaluation measures. © 2011...... weights and (2) integrating discourse aspects into retrieval. Given a text graph, whose vertices denote terms linked by co-occurrence and grammatical modification, we use graph ranking computations (e.g. PageRank Page et al. in The pagerank citation ranking: Bringing order to the Web. Technical report...
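
    The graph-based weighting described in this record can be illustrated with a minimal sketch. It assumes an undirected co-occurrence graph built with a fixed window and a plain power-iteration PageRank; the window size, damping factor, iteration count and sample sentence are illustrative assumptions, not values taken from the paper, and the grammatical-modification links the abstract mentions are left out.

```python
from collections import defaultdict

def cooccurrence_graph(tokens, window=2):
    """Undirected graph linking terms that co-occur within `window` positions."""
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return graph

def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank; every node here has at least one neighbour."""
    nodes = list(graph)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        rank = {
            n: (1 - damping) / len(nodes)
            + damping * sum(rank[m] / len(graph[m]) for m in graph[n])
            for n in nodes
        }
    return rank

# Hypothetical sample text; the resulting scores serve as term weights.
tokens = "graph ranking weights terms in a text graph by graph structure".split()
weights = pagerank(cooccurrence_graph(tokens))
top_term = max(weights, key=weights.get)
```

    In this sketch a term's weight grows with the number and weight of the terms it co-occurs with, so a well-connected term scores highest even though no raw frequency count enters the ranking step.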

  19. Foundations of Large-Scale Multimedia Information Management and Retrieval

    CERN Document Server

    Chang, Edward Y

    2011-01-01

    "Foundations of Large-Scale Multimedia Information Management and Retrieval - Mathematics of Perception" covers knowledge representation and semantic analysis of multimedia data and scalability in signal extraction, data mining, and indexing. The book is divided into two parts: Part I - Knowledge Representation and Semantic Analysis focuses on the key components of mathematics of perception as it applies to data management and retrieval. These include feature selection/reduction, knowledge representation, semantic analysis, distance function formulation for measuring similarity, and

  20. Managing Event Information Modeling, Retrieval, and Applications

    CERN Document Server

    Gupta, Amarnath

    2011-01-01

    With the proliferation of citizen reporting, smart mobile devices, and social media, an increasing number of people are beginning to generate information about events they observe and participate in. A significant fraction of this information contains multimedia data to share the experience with their audience. A systematic information modeling and management framework is necessary to capture this widely heterogeneous, schemaless, potentially humongous information produced by many different people. This book is an attempt to examine the modeling, storage, querying, and applications of such an

  1. Hypertext and hypermedia systems in information retrieval

    Science.gov (United States)

    Kaye, K. M.; Kuhn, A. D.

    1992-01-01

    This paper opens with a brief history of hypertext and hypermedia in the context of information management during the 'information age.' Relevant terms are defined and the approach of the paper is explained. Linear and hypermedia information access methods are contrasted. A discussion of hyperprogramming in the handling of complex scientific and technical information follows. A selection of innovative hypermedia systems is discussed. An analysis of the Clinical Practice Library of Medicine NASA STI Program hypermedia application is presented. The paper concludes with a discussion of the NASA STI Program's future hypermedia project plans.

  2. 108 Information Retrieval Methods in Libraries and Information ...

    African Journals Online (AJOL)

    User

    Indexed African Journals Online: www.ajol.info .... internet resources is catalogued according to DDC on all academic sources. Users can search the ... retrieval could be based on a structure of semantic relationship. Macleod .... References.

  3. Can We Retrieve the Information Which Was Intentionally Forgotten? Electrophysiological Correlates of Strategic Retrieval in Directed Forgetting.

    Science.gov (United States)

    Mao, Xinrui; Tian, Mengxi; Liu, Yi; Li, Bingcan; Jin, Yan; Wu, Yanhong; Guo, Chunyan

    2017-01-01

    The retrieval inhibition hypothesis of directed forgetting effects assumes that TBF (to-be-forgotten) items are not retrieved intentionally, while the selective rehearsal hypothesis assumes that the memory representation of retrieved TBF items is weaker than that of TBR (to-be-remembered) items. Previous studies indicated that directed forgetting effects in the item-cueing method resulted from selective rehearsal at encoding, but the mechanism by which retrieval inhibition affected directed forgetting of TBF items was not clear. Strategic retrieval is a control process allowing the selective retrieval of target information, which includes retrieval orientation and strategic recollection. Retrieval orientation, assessed via the comparison of tasks, refers to the specific form of processing that results from retrieval efforts. Strategic recollection is the type of strategy used to recollect studied items for the successful retrieval of targets. Using a "directed forgetting" paradigm combined with a memory exclusion task, our investigation of strategic retrieval in directed forgetting helped to explore how retrieval inhibition plays a role in directed forgetting effects. When TBF items were targeted, retrieval orientation showed more positive ERPs to new items, indicating that TBF items demanded more retrieval effort. The results of strategic recollection indicated that: (a) when TBR items were retrieval targets, late parietal old/new effects were evoked only by TBR items and not by TBF items, indicating the retrieval inhibition of TBF items; (b) when TBF items were retrieval targets, the late parietal old/new effect was evoked by both TBR items and TBF items, indicating that strategic retrieval could overcome retrieval inhibition of TBF items. These findings suggest that strategic retrieval modulates the retrieval inhibition in directed forgetting, supporting the view that directed forgetting effects are caused not only by selective rehearsal but also by retrieval inhibition.

  4. Local Area Networks for Information Retrieval.

    Science.gov (United States)

    Kibirige, Harry M.

    This examination of the use of local area networks (LANs) by libraries summarizes the findings of a nationwide survey of 600 libraries and information centers and 200 microcomputer networking system manufacturers and vendors, which was conducted to determine the relevance of currently available networking systems for library and information center…

  5. Cosmos: An Information Retrieval System that Works.

    Science.gov (United States)

    Clay, Katherine; Grossman, Alvin

    1980-01-01

    Briefly described is the County of San Mateo Online System (COSMOS) which was developed and is used by the San Mateo Educational Resources Center (SMERC) to access the Educational Resources Information Center (ERIC) and Fugitive Information Data Organizer (FIDO) databases as well as the curriculum guides housed at SMERC. (TG)

  6. An ergonomic study on the navigation structure and information units of websites with multimedia content. A case study of the Xbox 360 promotional website.

    Science.gov (United States)

    Ariel, Eduardo; de Moraes, Anamaria

    2012-01-01

    This paper presents an ergonomic study on the navigation structures and information units of entertainment sites with multimedia content. This research is a case study on the XBOX 360 promotional website. It analyzes the presentation of the content on a grid that simulates the spatial displacement of the screen's elements and evaluates the interaction that the page allows for, from the users' point of view.

  7. Assessment of the quality and variability of health information on chronic pain websites using the DISCERN instrument

    Directory of Open Access Journals (Sweden)

    Buckley Norman

    2010-10-01

    Full Text Available Abstract Background: The Internet is used increasingly by providers as a tool for disseminating pain-related health information and by patients as a resource about health conditions and treatment options. However, health information on the Internet remains unregulated and varies in quality, accuracy and readability. The objective of this study was to determine the quality of pain websites, and explain variability in quality and readability between pain websites. Methods: Five key terms (pain, chronic pain, back pain, arthritis, and fibromyalgia) were entered into the Google, Yahoo and MSN search engines. Websites were assessed using the DISCERN instrument as a quality index. Grade level readability ratings were assessed using the Flesch-Kincaid Readability Algorithm. Univariate (using alpha = 0.20) and multivariable regression (using alpha = 0.05) analyses were used to explain the variability in DISCERN scores and grade level readability using potential for commercial gain, health related seals of approval, language(s) and multimedia features as independent variables. Results: A total of 300 websites were assessed; 21 were excluded in accordance with the exclusion criteria and 110 were duplicates, leaving 161 unique sites. About 6.8% (11/161 websites) of the websites offered patients commercial products for their pain condition, 36.0% (58/161 websites) had a health related seal of approval, 75.8% (122/161 websites) presented information in English only and 40.4% (65/161 websites) offered an interactive multimedia experience. In assessing the quality of the unique websites, of a maximum score of 80, the overall average DISCERN Score was 55.9 (13.6) and readability (grade level) was 10.9 (3.9). The multivariable regressions demonstrated that website seals of approval (P = 0.015) and potential for commercial gain (P = 0.189) were contributing factors to higher DISCERN scores, while seals of approval (P = 0.168) and interactive multimedia (P = 0.244) contributed to

  8. Semantic-Sensitive Web Information Retrieval Model for HTML Documents

    CERN Document Server

    Bassil, Youssef

    2012-01-01

    With the advent of the Internet, a new era of digital information exchange has begun. Currently, the Internet encompasses more than five billion online sites and this number is exponentially increasing every day. Fundamentally, Information Retrieval (IR) is the science and practice of storing documents and retrieving information from within these documents. Mathematically, IR systems are at their core based on a feature vector model coupled with a term weighting scheme that weights terms in a document according to their significance with respect to the context in which they appear. Practically, the Vector Space Model (VSM), Term Frequency (TF), and Inverse Document Frequency (IDF) are among other long-established techniques employed in mainstream IR systems. However, present IR models only target generic-type text documents, in that they do not consider specific file formats such as HTML web documents. This paper proposes a new semantic-sensitive web information retrieval model for HTML documents. It consists of a...
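
    As an illustration of the long-established baseline this record refers to (a vector space model with TF and IDF weighting), the following is a minimal sketch rather than the model proposed in the paper. The length-normalised TF, the unsmoothed logarithmic IDF and the three toy documents are assumptions chosen for brevity.

```python
import math
from collections import Counter

def tf_idf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append(
            {t: (count / len(doc)) * math.log(n_docs / df[t]) for t, count in tf.items()}
        )
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

# Hypothetical toy collection.
docs = [
    "information retrieval stores documents".split(),
    "html documents carry markup".split(),
    "retrieval of information from html".split(),
]
vectors = tf_idf_vectors(docs)
similarity_0_2 = cosine(vectors[0], vectors[2])
similarity_0_1 = cosine(vectors[0], vectors[1])
```

    The sketch treats every document as flat text, which is the limitation the abstract points out: nothing in it distinguishes a term in an HTML title from the same term in body text.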

  9. Music information retrieval in compressed audio files: a survey

    Science.gov (United States)

    Zampoglou, Markos; Malamos, Athanasios G.

    2014-07-01

    In this paper, we present an organized survey of the existing literature on music information retrieval systems in which descriptor features are extracted directly from the compressed audio files, without prior decompression to pulse-code modulation format. Avoiding the decompression step and utilizing the readily available compressed-domain information can significantly lighten the computational cost of a music information retrieval system, allowing application to large-scale music databases. We identify a number of systems relying on compressed-domain information and form a systematic classification of the features they extract, the retrieval tasks they tackle and the degree to which they achieve an actual increase in the overall speed, as well as any resulting loss in accuracy. Finally, we discuss recent developments in the field, and the potential research directions they open toward ultra-fast, scalable systems.

  10. Efficient medical information retrieval in encrypted Electronic Health Records.

    Science.gov (United States)

    Pruski, Cédric; Wisniewski, François

    2012-01-01

    The recent development of eHealth platforms across the world, whose main objective is to centralize patients' healthcare information to ensure the best continuity of care, requires the development of advanced tools and techniques for supporting health professionals in retrieving relevant information from this vast quantity of data. However, to preserve patients' privacy, some countries have decided to de-identify and encrypt the data contained in shared Electronic Health Records, which makes it more complex to propose an efficient medical information retrieval approach. In this paper, we describe an original approach exploiting standards metadata as well as knowledge organizing systems to overcome the barriers of data encryption and improve the results of medical information retrieval in centralized and encrypted Electronic Health Records. This is done through the exploitation of semantic properties provided by knowledge organizing systems, which enable query expansion. Furthermore, we provide an overview of the approach together with illustrative examples and a discussion of the advantages and limitations of the provided framework.

  11. The Paradox of the Fuzzy Disambiguation in the Information Retrieval

    Directory of Open Access Journals (Sweden)

    Anna Bryniarska

    2013-09-01

    Full Text Available Current methods of data mining, word sense disambiguation in information retrieval, semantic relations, fuzzy set theory, fuzzy description logic, and fuzzy ontologies and their implementations overlook the existence of a paradox, called here the paradox of fuzzy disambiguation. The paradox lies in the fact that precise knowledge can be obtained from fuzzy data and experts' knowledge. In this paper, a conceptual apparatus is introduced to describe this paradox. Moreover, an information retrieval logic is formulated, and certain applications of this logic to searching for information on the Web are suggested.

  12. A novel dependency language model for information retrieval

    Institute of Scientific and Technical Information of China (English)

    CAI Ke-ke; BU Jia-jun; CHEN Chun; QIU Guang

    2007-01-01

    This paper explores the application of term dependency in information retrieval (IR) and proposes a novel dependency retrieval model. This retrieval model suggests an extension to the existing language modeling (LM) approach to IR by introducing dependency models for both query and document. Relevance between document and query is then evaluated by reference to the Kullback-Leibler divergence between their dependency models. This paper introduces a novel hybrid dependency structure, which allows integration of various forms of dependency within a single framework. A pseudo relevance feedback based method is also introduced for constructing the query dependency model. The basic idea is to use query-relevant top-ranking sentences extracted from the top documents at retrieval time as the augmented representation of the query, from which the relationships between query terms are identified. A Markov Random Field (MRF) based approach is presented to ensure the relevance of the extracted sentences, which utilizes the association features between query terms within a sentence to evaluate the relevance of each sentence. This dependency retrieval model was compared with other traditional retrieval models. Experiments indicated that it produces significant improvements in retrieval effectiveness.
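
    The Kullback-Leibler ranking step mentioned in this record can be sketched as follows. This is a simplification under stated assumptions: it uses additively smoothed unigram models in place of the paper's dependency models, and the smoothing constant, query and documents are hypothetical.

```python
import math
from collections import Counter

def language_model(tokens, vocab, alpha=0.1):
    """Additively smoothed unigram model over a fixed vocabulary."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {t: (counts[t] + alpha) / total for t in vocab}

def kl_divergence(p, q):
    """KL(p || q) for two distributions defined over the same vocabulary."""
    return sum(p[t] * math.log(p[t] / q[t]) for t in p if p[t] > 0)

# Hypothetical query and documents.
query = "term dependency retrieval".split()
docs = {
    "d1": "dependency models for retrieval use term dependency".split(),
    "d2": "music melody and lyrics recognition".split(),
}
vocab = set(query).union(*docs.values())
query_model = language_model(query, vocab)
scores = {
    name: kl_divergence(query_model, language_model(tokens, vocab))
    for name, tokens in docs.items()
}
# A lower divergence means the document model is closer to the query model.
ranking = sorted(scores, key=scores.get)
```

    Smoothing keeps every probability non-zero, so the divergence stays finite even when a document contains none of the query terms.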

  13. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Shozo Makino

    2007-01-01

    Full Text Available Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  14. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Suzuki Motoyuki

    2007-01-01

    Full Text Available Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  15. Teaching Skills in Medical Information Retrieval to Medical Students.

    Science.gov (United States)

    Kolner, Stuart J.; And Others

    1986-01-01

    A project that attempts to overcome the principal obstacles and to provide an efficient and effective method of teaching information retrieval skills to second-year medical students is described. The method includes a pretest, a diagnosis of deficiencies in information skills, a self-paced learning module, and a posttest. (Author/MLW)

  16. Research and Development of Information Retrieval Models and Their Applications.

    Science.gov (United States)

    Fox, Edward A.

    1989-01-01

    This introduction to a special issue devoted to modeling data, information, and knowledge briefly describes the origins of the papers presented and the topics covered, which include: Boolean logic; probability theory; artificial intelligence; organizing and encoding information and data; and characteristics of users of retrieval systems. (12…

  17. The Internet and Information Retrieval Research: A Brief Review.

    Science.gov (United States)

    Chowdhury, G. G.

    1999-01-01

    A survey of recent publications shows that frequent topics of Internet and information retrieval research are the effectiveness of search engines, information validation and quality, user studies, design of user interfaces, data structures and metadata, classification and vocabulary based aids, and indexing and search agents. The changing balance…

  18. The Physical and Cognitive Paradigms in Information Retrieval Research.

    Science.gov (United States)

    Ellis, David

    1992-01-01

    Explores the role of paradigms in information retrieval research and discusses the nature of a paradigm and the applicability of the paradigm concept to a multidisciplinary field such as information science. The features of the physical paradigm and the cognitive paradigm are outlined, and their origins, nature, and role are examined. (55…

  19. Learning to merge search results for efficient Distributed Information Retrieval

    NARCIS (Netherlands)

    Tjin-Kam-Jet, Kien; Hiemstra, Djoerd

    2010-01-01

    Merging search results from different servers is a major problem in Distributed Information Retrieval. We used Regression-SVM and Ranking-SVM which would learn a function that merges results based on information that is readily available: i.e. the ranks, titles, summaries and URLs contained in the

  20. Rare Disease Diagnosis as an Information Retrieval Task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina;

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend to be long lists of symptoms, often containing phrases, whereas web IR systems typically expect very short keyword-based queries. Motivated by such differences, this work uses a preliminary study of 30 clinical cases to reflect on rare disease retrieval as an IR task. Initial experiments using both Google web search and offline retrieval from a rare disease collection indicate that the retrieval of rare diseases is an open problem with room for improvement.

  1. Harvesting all matching information to a given query from a deep website

    NARCIS (Netherlands)

    Khelghati, Mohammadreza; Hiemstra, Djoerd; Keulen, van Maurice; Armano, Giuliano; Bozzon, Alessandro; Giuliani, Alessandro

    2015-01-01

    In this paper, the goal is harvesting all documents matching a given (entity) query from a deep web source. The objective is to retrieve all information about for instance "Denzel Washington", "Iran Nuclear Deal", or "FC Barcelona" from data hidden behind web forms. Policies of web search engines us

  2. Harvesting All Matching Information To A Given Query From a Deep Website

    NARCIS (Netherlands)

    Khelghati, Mohammadreza; Hiemstra, Djoerd; van Keulen, Maurice; Armano, Giuliano; Bozzon, Alessandro; Giuliani, Alessandro

    In this paper, the goal is harvesting all documents matching a given (entity) query from a deep web source. The objective is to retrieve all information about for instance "Denzel Washington", "Iran Nuclear Deal", or "FC Barcelona" from data hidden behind web forms. Policies of web search engines

  3. Incidental retrieval-induced forgetting of location information.

    Science.gov (United States)

    Gómez-Ariza, Carlos J; Fernandez, Angel; Bajo, M Teresa

    2012-06-01

    Retrieval-induced forgetting (RIF) has been studied with different types of tests and materials. However, RIF has always been tested on the items' central features, and there is no information on whether inhibition also extends to peripheral features of the events in which the items are embedded. In two experiments, we specifically tested the presence of RIF in a task in which recall of peripheral information was required. After a standard retrieval practice task oriented to item identity, participants were cued with colors (Exp. 1) or with the items themselves (Exp. 2) and asked to recall the screen locations where the items had been displayed during the study phase. RIF for locations was observed after retrieval practice, an effect that was not present when participants were asked to read instead of retrieving the items. Our findings provide evidence that peripheral location information associated with an item during study can be also inhibited when the retrieval conditions promote the inhibition of more central, item identity information.

  4. Storage and retrieval of mass spectral information

    Science.gov (United States)

    Hohn, M. E.; Humberston, M. J.; Eglinton, G.

    1977-01-01

    Computer handling of mass spectra serves two main purposes: the interpretation of the occasional, problematic mass spectrum, and the identification of the large number of spectra generated in the gas-chromatographic-mass spectrometric (GC-MS) analysis of complex natural and synthetic mixtures. Methods available fall into the three categories of library search, artificial intelligence, and learning machine. Optional procedures for coding, abbreviating and filtering a library of spectra minimize time and storage requirements. Newer techniques make increasing use of probability and information theory in accessing files of mass spectral information.

  5. Distributed Systems and Applications of Information Filtering and Retrieval

    CERN Document Server

    Giuliani, Alessandro; Semeraro, Giovanni; DART 2012

    2014-01-01

    This volume focuses on new challenges in distributed Information Filtering and Retrieval. It collects invited chapters and extended research contributions from the special session on Information Filtering and Retrieval: Novel Distributed Systems and Applications (DART) of the 4th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2012), held in Barcelona, Spain, on 4-7 October 2012. The main focus of DART was to discuss and compare suitable novel solutions based on intelligent techniques and applied to real-world applications. The chapters of this book present a comprehensive review of related works and the state of the art. Authors, both practitioners and researchers, shared their results on several topics such as "Multi-Agent Systems", "Natural Language Processing", "Automatic Advertisement", "Customer Interaction Analytics", and "Opinion Mining". Contributions have been carefully reviewed by experts in the area, who also gave useful suggestions to improve the quality of the volume.

  6. Noun-Phrase Analysis in Unrestricted Text for Information Retrieval

    CERN Document Server

    Evans, David A.; Zhai, Chengxiang

    1996-01-01

    Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of a few simple, yet robust and efficient noun-phrase analysis techniques to create better indexing phrases for information retrieval. In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics. Results of experiments show that indexing based on such extracted subcompounds improves both recall and precision in an information retrieval system. The noun-phrase analysis techniques are also potentially useful for book indexing and automatic thesaurus extraction.

  7. Political environment in the effect of the regional government financial performance on disclosure of financial information on website

    Directory of Open Access Journals (Sweden)

    Yustina Hiola

    2016-07-01

    Full Text Available This study aims to analyze the effect of the financial performance of local governments on disclosure compliance of financial information on the website, as well as the political environment as a moderating variable for the effect of the financial performance of local governments on disclosure compliance of financial information on the website. The study was conducted on local governments in Sulawesi, with a sample consisting of 53 governments. The data were analyzed by partial least squares (PLS). The results showed that good financial performance of local governments can encourage disclosure compliance of financial information on the website. This study also found that the political environment cannot moderate the effect of financial performance on disclosure compliance of financial information on the website. This is because people are more interested in paper-based reporting. The implication of this study is to encourage related research as well as to encourage local governments to use the website as a medium for financial information reporting. The Gorontalo district government is a local government that has excellent financial performance with complete disclosure of financial information on the website.

  8. Ontology Based Information Retrieval in Semantic Web: A Survey

    Directory of Open Access Journals (Sweden)

    Vishal Jain

    2013-09-01

    Full Text Available In the present age of computers, there are various resources for gathering information related to a given query, such as radio stations, television, the Internet and many more. Among them, the Internet is considered a major source for obtaining information about a given domain. When a user wants to find some information, he/she enters a query and results are produced via hyperlinks linked to various documents available on the web. But the information that is retrieved may or may not be relevant. This irrelevance is caused by the huge collection of documents available on the web. Traditional search engines are based on keyword-based searching, which is unable to transform raw data into a knowledge representation. It is a cumbersome task to extract relevant information from a large collection of web documents. These shortcomings have brought the concepts of Semantic Web (SW) and Ontology into existence. The Semantic Web (SW) is a well-defined portal that helps in extracting relevant information using many Information Retrieval (IR) techniques. Current Information Retrieval (IR) techniques are not advanced enough to exploit semantic knowledge within documents and give precise results. The terms Information Retrieval (IR), Semantic Web (SW) and Ontology are used differently, but they are interconnected with each other. Information Retrieval (IR) technology and web-based indexing contribute to the existence of the Semantic Web. The use of Ontology also contributes to building a new generation of the web, the Semantic Web. With the help of ontologies, web content can be marked up by means of Semantic Web documents (SWDs). Ontology is considered the backbone of a software system. It improves understanding between concepts used in the Semantic Web (SW). So, there is a need to build an ontology using a well-defined methodology, and the process of developing an ontology is called Ontology Development.

  9. Semantic knowledge representation for information retrieval

    CERN Document Server

    Gödert, Winfried; Nagelschmidt, Matthias

    2014-01-01

    This book covers the basics of semantic web technologies and indexing languages, and describes their contribution to improve languages as a tool for subject queries and knowledge exploration. The book is relevant to information scientists, knowledge workers and indexers. It provides a suitable combination of theoretical foundations and practical applications.

  10. Peer-to-peer information retrieval

    NARCIS (Netherlands)

    Tigelaar, Almer S.

    2012-01-01

    The Internet has become an integral part of our daily lives. However, the essential task of finding information is dominated by a handful of large centralised search engines. In this thesis we study an alternative to this approach. Instead of using large data centres, we propose using the machines th

  11. Image-based information, communication, and retrieval

    Science.gov (United States)

    Bryant, N. A.; Zobrist, A. L.

    1980-01-01

    IBIS/VICAR system combines video image processing and information management. Flexible programs require user to supply only parameters specific to particular application. Special-purpose input/output routines transfer image data with reduced memory requirements. New application programs are easily incorporated. Program is written in FORTRAN IV, Assembler, and OS JCL for batch execution and has been implemented on IBM 360.

  12. MIRANDA - Music Information Retrieval And Data Acquisition

    DEFF Research Database (Denmark)

    Lehn-Schiøler, Tue; Petersen, Kaare Brandt; Hansen, Lars Kai

    2006-01-01

    In this report we present a music data harvesting system based on a plug-in for a popular music player. When a user is playing a song using the plug-in, information about the song is anonymously submitted to a server. The data gathered using MIRANDA is intended to be released to the MIR community...

  13. Assessment of the contents related to screening on Portuguese language websites providing information on breast and prostate cancer

    Directory of Open Access Journals (Sweden)

    Daniel Ferreira

    2013-11-01

    Full Text Available The objective of this study was to assess the quality of the contents related to screening in a sample of websites providing information on breast and prostate cancer in the Portuguese language. The first 200 results of each cancer-specific Google search were considered. The accuracy of the screening contents was defined in accordance with the state of the art, and its readability was assessed. Most websites mentioned mammography as a method for breast cancer screening (80%), although only 28% referred to it as the only recommended method. Almost all websites mentioned PSA evaluation as a possible screening test, but correct information regarding its effectiveness was given in less than 10%. For both breast and prostate cancer screening contents, the potential for overdiagnosis and false positive results was seldom addressed, and the median readability index was approximately 70. There is ample margin for improving the quality of websites providing information on breast and prostate cancer in Portuguese.

  14. Information Literacy on the Web: How College Students Use Visual and Textual Cues to Assess Credibility on Health Websites

    Directory of Open Access Journals (Sweden)

    Katrina L. Pariera

    2012-12-01

    Full Text Available One of the most important literacy skills in today’s information society is the ability to determine the credibility of online information. Users sort through a staggering number of websites while discerning which will provide satisfactory information. In this study, 70 college students assessed the credibility of health websites with a low and high design quality, in either low or high credibility groups. The study’s purpose was to understand if students relied more on textual or visual cues in determining credibility, and to understand if this affected their recall of those cues later. The results indicate that when viewing a high credibility website, high design quality will bolster the credibility perception, but design quality will not compensate for a low credibility website. The recall test also indicated that credibility does impact the participants’ recall of visual and textual cues. Implications are discussed in light of the Elaboration Likelihood Model.

  15. Assessment of the contents related to screening on Portuguese language websites providing information on breast and prostate cancer.

    Science.gov (United States)

    Ferreira, Daniel; Carreira, Helena; Silva, Susana; Lunet, Nuno

    2013-11-01

    The objective of this study was to assess the quality of the contents related to screening in a sample of websites providing information on breast and prostate cancer in the Portuguese language. The first 200 results of each cancer-specific Google search were considered. The accuracy of the screening contents was defined in accordance with the state of the art, and its readability was assessed. Most websites mentioned mammography as a method for breast cancer screening (80%), although only 28% referred to it as the only recommended method. Almost all websites mentioned PSA evaluation as a possible screening test, but correct information regarding its effectiveness was given in less than 10%. For both breast and prostate cancer screening contents, the potential for overdiagnosis and false positive results was seldom addressed, and the median readability index was approximately 70. There is ample margin for improving the quality of websites providing information on breast and prostate cancer in Portuguese.

  16. Natural Resource Knowledge and Information Management via the Victorian Resources Online Website

    Directory of Open Access Journals (Sweden)

    Christopher Pettit

    2011-11-01

    Full Text Available Since 1997, the Victorian Resources Online (VRO) website (http://www.dpi.vic.gov.au/vro) has been a key means for the dissemination of landscape-based natural resources information via the internet in Victoria, Australia. The website currently consists of approximately 11,000 web pages, including 1900 maps and 1000 downloadable documents. Information is provided at a range of scales, from statewide and regional overviews to more detailed catchment and sub-catchment levels. At all these levels of generalisation, information is arranged in an organisationally agnostic way around key knowledge “domains” (e.g., soil, landform, water). VRO represents a useful model for the effective dissemination of a wide range of natural resources information, relying on partnerships with key subject matter experts and data custodians, including a “knowledge network” of retired land resource assessment specialists. In this paper, case studies are presented that illustrate various approaches to information and knowledge management with a focus on presentation of spatially contexted soil and landscape information at different levels of generalisation. Examples are provided of adapting site-based information into clickable maps that reveal site-specific details, as well as “spatialising” data from specialist internal databases to improve accessibility to a wider audience. Legacy information sources have also been consolidated and spatially referenced. More recent incorporation of interactive visualisation products (such as landscape panoramas, videos and animations) is providing interactive rich media content. Currently the site attracts an average of 1190 user visits per day, and user evaluation has indicated a wide range of users, including students, teachers, consultants, researchers and extension staff. The wide range of uses for information and, in particular, the benefits for natural resource education, research and extension have also been identified.

  17. Physicists' Information Tasks: Structure, Length and Retrieval Performance

    DEFF Research Database (Denmark)

    Lykke, Marianne; Ingwersen, Peter; Bogers, Toine;

    2010-01-01

    In this poster, we describe central aspects of 65 natural information tasks from 23 senior researchers, PhDs, and experienced MSc students from three different university departments of physics. We analyze 1) the main purpose of the information task, 2) which and how many search facets were used to describe the tasks, 3) what semantic categories were used to express the search facets, and 4) retrieval performance. Results show variety in structure and length across task descriptions and task purposes. The results indicate effect of length and, in particular, of task purpose on retrieval performance.

  18. Learning to rank for information retrieval and natural language processing

    CERN Document Server

    Li, Hang

    2014-01-01

    Learning to rank refers to machine learning techniques for training a model in a ranking task. Learning to rank is useful for many applications in information retrieval, natural language processing, and data mining. Intensive studies have been conducted on its problems recently, and significant progress has been made. This lecture gives an introduction to the area including the fundamental problems, major approaches, theories, applications, and future work. The author begins by showing that various ranking problems in information retrieval and natural language processing can be formalized as tw

  19. Physicists' Information Tasks: Structure, Length and Retrieval Performance

    DEFF Research Database (Denmark)

    Lykke, Marianne; Ingwersen, Peter; Bogers, Toine

    2010-01-01

    In this poster, we describe central aspects of 65 natural information tasks from 23 senior researchers, PhDs, and experienced MSc students from three different university departments of physics. We analyze 1) the main purpose of the information task, 2) which and how many search facets were used to describe the tasks, 3) what semantic categories were used to express the search facets, and 4) retrieval performance. Results show variety in structure and length across task descriptions and task purposes. The results indicate effect of length and, in particular, of task purpose on retrieval performance.

  20. Information retrieval pathways for health information exchange in multiple care settings

    DEFF Research Database (Denmark)

    Kierkegaard, Patrick; Kaushal, Rainu; Vest, Joshua R.

    2014-01-01

    Objectives To determine which health information exchange (HIE) technologies and information retrieval pathways healthcare professionals relied on to meet their information needs in the context of laboratory test results, radiological images and reports, and medication histories. Study Design...

  1. Intelligent Information Retrieval: Part IV. Testing the Timing of Two Information Retrieval Devices in a Naturalistic Setting.

    Science.gov (United States)

    Cole, Charles

    2001-01-01

    Reports the results of two studies of undergraduates that tested an uncertainty expansion information retrieval device and an uncertainty reduction device in naturalistic settings, designed to be given at different stages of Kuhlthau's information search process. Concludes that the timing of the device interventions is crucial to their potential…

  2. Content-based Image Retrieval by Information Theoretic Measure

    Directory of Open Access Journals (Sweden)

    Madasu Hanmandlu

    2011-09-01

    Full Text Available Content-based image retrieval focuses on intuitive and efficient methods for retrieving images from databases based on the content of the images. A new entropy function that serves as a measure of information content in an image, termed 'an information theoretic measure', is devised in this paper. Among the various query paradigms, 'query by example' (QBE) is adopted to set a query image for retrieval from a large image database. In this paper, colour and texture features are extracted using the new entropy function and the dominant colour is considered as a visual feature for a particular set of images. Thus colour and texture features constitute the two-dimensional feature vector for indexing the images. The low dimensionality of the feature vector speeds up the atomic query. Indices in a large database system help retrieve the images relevant to the query image without looking at every image in the database. The entropy values of colour and texture and the dominant colour are considered for measuring the similarity. The utility of the proposed image retrieval system based on the information theoretic measures is demonstrated on a benchmark dataset. Defence Science Journal, 2011, 61(5), pp.415-430, DOI: http://dx.doi.org/10.14429/dsj.61.1177

  3. MIRANDA - Music Information Retrieval And Data Acquisition

    DEFF Research Database (Denmark)

    Lehn-Schiøler, Tue; Petersen, Kaare Brandt; Hansen, Lars Kai

    2006-01-01

    In this report we present a music data harvesting system based on a plug-in for a popular music player. When a user is playing a song using the plug-in, information about the song is anonymously submitted to a server. The data gathered using MIRANDA is intended to be released to the MIR community. We argue that even though content-based data is of interest to the community, also meta data and usage data can be important for research in music similarity.

  4. Improving life sciences information retrieval using semantic web technology.

    Science.gov (United States)

    Quan, Dennis

    2007-05-01

    The ability to retrieve relevant information is at the heart of every aspect of research and development in the life sciences industry. Information is often distributed across multiple systems and recorded in a way that makes it difficult to piece together the complete picture. Differences in data formats, naming schemes and network protocols amongst information sources, both public and private, must be overcome, and user interfaces not only need to be able to tap into these diverse information sources but must also assist users in filtering out extraneous information and highlighting the key relationships hidden within an aggregated set of information. The Semantic Web community has made great strides in proposing solutions to these problems, and many efforts are underway to apply Semantic Web techniques to the problem of information retrieval in the life sciences space. This article gives an overview of the principles underlying a Semantic Web-enabled information retrieval system: creating a unified abstraction for knowledge using the RDF semantic network model; designing semantic lenses that extract contextually relevant subsets of information; and assembling semantic lenses into powerful information displays. Furthermore, concrete examples of how these principles can be applied to life science problems including a scenario involving a drug discovery dashboard prototype called BioDash are provided.

  5. A cognitive approach for design of a multimedia informed consent video and website in pediatric research.

    Science.gov (United States)

    Antal, Holly; Bunnell, H Timothy; McCahan, Suzanne M; Pennington, Chris; Wysocki, Tim; Blake, Kathryn V

    2017-02-01

    Poor participant comprehension of research procedures following the conventional face-to-face consent process for biomedical research is common. We describe the development of a multimedia informed consent video and website that incorporates cognitive strategies to enhance comprehension of study related material directed to parents and adolescents. A multidisciplinary team was assembled for development of the video and website that included human subjects professionals; psychologist researchers; institutional video and web developers; bioinformaticians and programmers; and parent and adolescent stakeholders. Five learning strategies that included Sensory-Modality view, Coherence, Signaling, Redundancy, and Personalization were integrated into a 15-min video and website material that describes a clinical research trial. A diverse team collaborated extensively over 15 months to design and build a multimedia platform for obtaining parental permission and adolescent assent for participation in an asthma clinical trial. Examples of the learning principles included having a narrator describe what was being viewed on the video (sensory-modality); eliminating unnecessary text and graphics (coherence); having the initial portion of the video explain the sections of the video to be viewed (signaling); avoiding simultaneous presentation of text and graphics (redundancy); and having a consistent narrator throughout the video (personalization). Existing conventional and multimedia processes for obtaining research informed consent have not actively incorporated basic principles of human cognition and learning in the design and implementation of these processes. The present paper illustrates how this can be achieved, setting the stage for rigorous evaluation of potential benefits such as improved comprehension, satisfaction with the consent process, and completion of research objectives. New consent strategies that have an integrated cognitive approach need to be developed and

  6. Adding dimensions to the analysis of the quality of health information of websites returned by Google. Cluster analysis identifies patterns of websites according to their classification and the type of intervention described.

    Directory of Open Access Journals (Sweden)

    Pietro Ghezzi

    2015-08-01

    Full Text Available Background and aims: Most of the instruments used to assess the quality of health information on the Web (e.g. the JAMA criteria) only analyze one dimension of information quality, trustworthiness. We try to compare these characteristics with the type of treatments the websites describe, whether evidence-based medicine or not, and correlate this with the established criteria. Methods: We searched Google for migraine cure and analyzed the first 200 websites for: 1) JAMA criteria (authorship, attribution, disclosure, currency); 2) class of websites (commercial, health portals, professional, patient groups, no-profit); and 3) type of intervention described (approved drugs, alternative medicine, food, procedures, lifestyle, drugs still at the research stage). We used hierarchical cluster analysis to assess associations between classes of websites and types of intervention described. Subgroup analysis on the first 10 websites returned was performed. Results: Google returned health portals (44%), followed by commercial websites (31%) and journalism websites (11%). The type of intervention mentioned most often was alternative medicine (55%), followed by procedures (49%), lifestyle (42%), food (41%) and approved drugs (35%). Cluster analysis indicated that health portals are more likely to describe more than one type of treatment while commercial websites most often describe only one. The average JAMA score of commercial websites was significantly lower than for health portals or journalism websites, and this was mainly due to lack of information on the authors of the text and indication of the date the information was written. Looking at the first 10 websites from Google, commercial websites are under-represented and approved drugs over-represented. Conclusions: This approach allows the appraisal of the quality of health-related information on the Internet focusing on the type of therapies/prevention methods that are shown to the patient.

  7. Information Retrieval Systems Adapted to the Biomedical Domain

    CERN Document Server

    Marrero, Mónica; Urbano, Julián; Morato, Jorge; Moreiro, José-Antonio; 10.3145/epi.2010.may.04

    2012-01-01

    The terminology used in Biomedicine shows lexical peculiarities that have required the elaboration of terminological resources and information retrieval systems with specific functionalities. The main characteristics are the high rates of synonymy and homonymy, due to phenomena such as the proliferation of polysemic acronyms and their interaction with common language. Information retrieval systems in the biomedical domain use techniques oriented to the treatment of these lexical peculiarities. In this paper we review some of the techniques used in this domain, such as the application of Natural Language Processing (BioNLP), the incorporation of lexical-semantic resources, and the application of Named Entity Recognition (BioNER). Finally, we present the evaluation methods adopted to assess the suitability of these techniques for retrieving biomedical resources.

  8. Annotation of Scientific Summaries for Information Retrieval

    CERN Document Server

    Ibekwe-Sanjuan, Fidelia; Eric, Sanjuan; Eric, Charton

    2011-01-01

    We present a methodology combining surface NLP and Machine Learning techniques for ranking abstracts and generating summaries based on annotated corpora. The corpora were annotated with meta-semantic tags indicating the category of information a sentence is bearing (objective, findings, newthing, hypothesis, conclusion, future work, related work). The annotated corpus is fed into an automatic summarizer for query-oriented abstract ranking and multi-abstract summarization. To adapt the summarizer to these two tasks, two novel weighting functions were devised in order to take into account the distribution of the tags in the corpus. Results, although still preliminary, are encouraging us to pursue this line of work and find better ways of building IR systems that can take into account semantic annotations in a corpus.

  9. Concept Tree Based Information Retrieval Model

    Directory of Open Access Journals (Sweden)

    Chunyan Yuan

    2014-05-01

    Full Text Available This paper proposes a novel concept-based query expansion technique named the Markov concept tree model (MCTM), which discovers term relationships through the concept tree deduced from a term Markov network. We address two important issues for query expansion: the selection and the weighting of expansion search terms. In contrast to earlier methods, queries are expanded by adding those terms that are most similar to the concept of the query, rather than selecting terms that are similar to a single query term. Utilizing a Markov network constructed according to the co-occurrence information of the terms in the collection, the model generates a concept tree for each original query term, removes the redundant and irrelevant nodes in the concept tree, and then adjusts the weight of the original query and the weight of each expansion term based on a pruning algorithm. We use this model for query expansion and evaluate its effectiveness by examining the accuracy and robustness of the expansion methods. Compared with the baseline model, the experiments on a standard dataset reveal that this method can achieve better query quality.

  10. Information Retrieval in Telemedicine: a Comparative Study on Bibliographic Databases

    Science.gov (United States)

    Ahmadi, Maryam; Sarabi, Roghayeh Ershad; Orak, Roohangiz Jamshidi; Bahaadinbeigy, Kambiz

    2015-01-01

    Background and Aims: The first step in each systematic review is selection of the most valid database that can provide the highest number of relevant references. This study was carried out to determine the most suitable database for information retrieval in the telemedicine field. Methods: Cinhal, PubMed, Web of Science and Scopus databases were searched for telemedicine matched with education, cost benefit and patient satisfaction. After analysis of the obtained results, the accuracy coefficient, sensitivity, uniqueness and overlap of databases were calculated. Results: The studied databases differed in the number of retrieved articles. PubMed was identified as the most suitable database for retrieving information on the selected topics, with accuracy and sensitivity ratios of 50.7% and 61.4% respectively. The uniqueness percentage of retrieved articles ranged from 38% for PubMed to 3.0% for Cinhal. The highest overlap rate (18.6%) was found between PubMed and Web of Science. Less than 1% of articles have been indexed in all searched databases. Conclusion: PubMed is suggested as the most suitable database for starting a search in telemedicine, and after PubMed, Scopus and Web of Science can retrieve about 90% of the relevant articles. PMID:26236086

  11. From the Field and into the Classroom: Information Architecture Assessment and Website Usability Tests

    Science.gov (United States)

    Clayton, Michael J.; Hettche, Matt

    2012-01-01

    Although it is difficult these days to find a company that does not have a website, you do not have to look very far to find a website with significant design and architecture flaws. Getting a visitor to your website is one thing, making the experience effortless and allowing them to find exactly what they need is another story. That being…

  12. "BRAIN": Baruch Retrieval of Automated Information for Negotiations.

    Science.gov (United States)

    Levenstein, Aaron, Ed.

    1981-01-01

    A data processing program that can be used as a research and collective bargaining aid for colleges is briefly described and the fields of the system are outlined. The system, known as BRAIN (Baruch Retrieval of Automated Information for Negotiations), is designed primarily as an instrument for quantitative and qualitative analysis. BRAIN consists…

  13. Development of Information Retrieval Skills for Freshman Medical Students.

    Science.gov (United States)

    Moore, Gerald F.

    1988-01-01

    A study, using a specific patient encounter as the focal point for each student's research, is described that documents the skills of entering freshmen medical students before and immediately after a short course emphasizing information retrieval and at follow-up one year later. (MLW)

  14. Learning to Rank for Information Retrieval from User Interactions

    NARCIS (Netherlands)

    Hofmann, K.; Whiteson, S.; Schuth, A.; de Rijke, M.

    2014-01-01

    In this article we give an overview of our recent work on online learning to rank for information retrieval (IR). This work addresses IR from a reinforcement learning (RL) point of view, with the aim to enable systems that can learn directly from interactions with their users. Learning directly from

  15. Information Storage and Retrieval Scientific Report No. ISR-22.

    Science.gov (United States)

    Salton, Gerard

    The twenty-second in a series, this report describes research in information organization and retrieval conducted by the Department of Computer Science at Cornell University. The report covers work carried out during the period summer 1972 through summer 1974 and is divided into four parts: indexing theory, automatic content analysis, feedback…

  16. Rare disease diagnosis as an information retrieval task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend...

  17. Cross-Language Information Retrieval: An Analysis of Errors.

    Science.gov (United States)

    Ruiz, Miguel E.; Srinivasan, Padmini

    1998-01-01

    Investigates an automatic method for Cross Language Information Retrieval (CLIR) that utilizes the multilingual Unified Medical Language System (UMLS) Metathesaurus to translate Spanish natural-language queries into English. Results indicate that for Spanish, the UMLS Metathesaurus-based CLIR method is at least equivalent to if not better than…

  18. A Survey of Query Auto Completion in Information Retrieval

    NARCIS (Netherlands)

    Cai, F.; de Rijke, M.

    2016-01-01

    In information retrieval, query auto completion (QAC), also known as type-ahead [Xiao et al., 2013, Cai et al., 2014b] and auto-complete suggestion [Jain and Mishne, 2010], refers to the following functionality: given a prefix consisting of a number of characters entered into a search box, the user i

  19. Bibliometric-enhanced Information Retrieval : 2nd International BIR Workshop

    NARCIS (Netherlands)

    Mayr, Philipp; Frommholz, Ingo; Scharnhorst, Andrea; Mutschke, Peter

    2015-01-01

    This workshop brings together experts of communities which often have been perceived as different ones: bibliometrics / scientometrics / informetrics on the one side and information retrieval on the other. Our motivation as organizers of the workshop started from the observation that main discourses

  20. On Inference Rules of Logic-Based Information Retrieval Systems.

    Science.gov (United States)

    Chen, Patrick Shicheng

    1994-01-01

    Discussion of relevance and the needs of the users in information retrieval focuses on a deductive object-oriented approach and suggests eight inference rules for the deduction. Highlights include characteristics of a deductive object-oriented system, database and data modeling language, implementation, and user interface. (Contains 24…

  1. Creating an Information Retrieval test corpus for Dutch

    NARCIS (Netherlands)

    Hiemstra, D.; Leeuwen, van D.A.; Theune, M.; Nijholt, A.; Hondorp, G.H.W.

    2002-01-01

    This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual test corpus, and give an overview of the experimental results of

  2. Vocabulary Mining for Information Retrieval: Rough Sets and Fuzzy Sets.

    Science.gov (United States)

    Srinivasan, Padmini; Ruiz, Miguel E.; Kraft, Donald H.; Chen, Jianhua

    2001-01-01

    Explains vocabulary mining in information retrieval and describes a framework for vocabulary mining that allows the use of rough set-based approximations even when documents and queries are described using weighted, or fuzzy, representations. Examines coordination between multiple vocabulary views and applies the framework to the Unified Medical…

  3. Towards an Intelligent Possibilistic Web Information Retrieval Using Multiagent System

    Science.gov (United States)

    Elayeb, Bilel; Evrard, Fabrice; Zaghdoud, Montaceur; Ahmed, Mohamed Ben

    2009-01-01

    Purpose: The purpose of this paper is to make a scientific contribution to web information retrieval (IR). Design/methodology/approach: A multiagent system for web IR is proposed based on new technologies: Hierarchical Small-Worlds (HSW) and Possibilistic Networks (PN). This system is based on a possibilistic qualitative approach which extends the…

  5. Fault-tolerant symmetrically-private information retrieval

    Science.gov (United States)

    Wang, Tian-Yin; Cai, Xiao-Qiu; Zhang, Rui-Ling

    2016-08-01

    We propose two symmetrically-private information retrieval protocols based on quantum key distribution, which provide a good degree of database and user privacy while being flexible, loss-resistant and easily generalized to a large database similar to the precedent works. Furthermore, one protocol is robust to a collective-dephasing noise, and the other is robust to a collective-rotation noise.

  6. A cross-lingual framework for monolingual biomedical information retrieval

    NARCIS (Netherlands)

    Trieschnigg, D.; Hiemstra, D.; Jong, F. de; Kraaij, W.

    2010-01-01

    An important challenge for biomedical information retrieval (IR) is dealing with the complex, inconsistent and ambiguous biomedical terminology. Frequently, a concept-based representation defined in terms of a domain-specific terminological resource is employed to deal with this challenge. In this p

  7. Information Theoretic Similarity Measures for Content Based Image Retrieval.

    Science.gov (United States)

    Zachary, John; Iyengar, S. S.

    2001-01-01

    Content-based image retrieval is based on the idea of extracting visual features from images and using them to index images in a database. Proposes similarity measures and an indexing algorithm based on information theory that permits an image to be represented as a single number. When used in conjunction with vectors, this method displays…

  9. Videodisc: A New Resource for Library Information Storage and Retrieval.

    Science.gov (United States)

    Sonnemann, Sabine S.

    1984-01-01

    Details a National Library of Canada project to produce a videodisc, show its value as a research tool, and demonstrate its viability as an information storage and retrieval medium. An overview and time sequence of the project, disc contents, resource materials for production, production and postproduction techniques, and project results are…

  10. Automotive websites

    CERN Document Server

    Jensen, Todd A

    2006-01-01

    For anyone buying a new car, restoring an old favorite, collecting license plates or looking for motorsports information, the internet is the place to go and this is the book to help you get there. Now with over 650 internet addresses, this expanded and updated guide provides detailed descriptions and reviews of the biggest, best and most interesting automotive websites on the net. Beginning with a brief internet history and helpful hints, it aids the novice (or not so novice) user in picking through the countless automotive sites on the internet. Websites are arranged by topics such as afterm

  11. Experiences with automated categorization in e-government information retrieval

    DEFF Research Database (Denmark)

    Jonasen, Tanja Svarre; Lykke, Marianne

    2014-01-01

    High-precision search results are essential for supporting e-government employees’ information tasks. Prior studies have shown that existing features of e-government retrieval systems need improvement in terms of search facilities (e.g., Goh et al. 2008), navigation (e.g., de Jong and Lentz 2006...... documents were retrieved. The findings emphasise the importance of simultaneous search options for e-government IR systems, and reveal that automated categorization is valuable in improving search facilities in e-government....

  12. Automatic Data Extraction from Websites for Generating Aquatic Product Market Information

    Institute of Scientific and Technical Information of China (English)

    YUAN Hong-chun; CHEN Ying; SUN Yue-fu

    2006-01-01

    The massive web-based information resources have led to an increasing demand for effective automatic retrieval of target information for web applications. This paper introduces a web-based data extraction tool that deploys various algorithms to locate, extract and filter tabular data from HTML pages and to transform them into new web-based representations. The tool has been applied in an aquaculture web application platform for extracting and generating aquatic product market information. Results prove that this tool is very effective in extracting the required data from web pages.
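
    The locate, extract and filter steps mentioned above can be sketched with Python's standard-library HTML parser. This is only a minimal illustration, not the tool the paper describes: the class and function names, the sample markup and the price threshold are all hypothetical, and nested tables are not handled.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by row and by table."""

    def __init__(self):
        super().__init__()
        self.tables = []   # each table is a list of rows; each row a list of cell strings
        self._row = None   # cells of the row currently being parsed, or None
        self._cell = None  # text fragments of the cell currently being parsed, or None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None and self._row is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None
            self._cell = None


def extract_tables(html):
    """Locate and extract all tables in an HTML string."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    return parser.tables


def filter_rows(table, column, predicate):
    """Keep the header row plus the data rows whose `column` value satisfies `predicate`."""
    header, *rows = table
    idx = header.index(column)
    return [header] + [row for row in rows if predicate(row[idx])]


# Hypothetical market-price page fragment.
SAMPLE = """
<table>
  <tr><th>Product</th><th>Price</th></tr>
  <tr><td>Carp</td><td>12.5</td></tr>
  <tr><td>Shrimp</td><td>48.0</td></tr>
</table>
"""

tables = extract_tables(SAMPLE)
expensive = filter_rows(tables[0], "Price", lambda p: float(p) > 20)
print(expensive)
```

    Returning plain lists of rows leaves the final transformation step (for example, rendering a new page) to the caller.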

  13. Semantic Information Retrieval from Distributed Heterogeneous Data Sources

    CERN Document Server

    Munir, K; McClatchey, R; Khan, S; Habib, I

    2007-01-01

    Information retrieval from distributed heterogeneous data sources remains a challenging issue. As the number of data sources increases, more intelligent retrieval techniques, focusing on information content and semantics, are required. Currently ontologies are being widely used for managing semantic knowledge, especially in the field of bioinformatics. In this paper we describe an ontology-assisted system that allows users to query distributed heterogeneous data sources by hiding details like location, information structure, access pattern and semantic structure of the data. Our goal is to provide an integrated view on biomedical information sources for the Health-e-Child project with the aim to overcome the lack of sufficient semantic-based reformulation techniques for querying distributed data sources. In particular, this paper examines the problem of query reformulation across biomedical data sources, based on merged ontologies and the underlying heterogeneous descriptions of the respective data sources.

  14. Building an Automatic Thesaurus to Enhance Information Retrieval

    Directory of Open Access Journals (Sweden)

    Essam Said Hanandeh

    2013-01-01

    Full Text Available One of the major problems of modern Information Retrieval (IR) systems is the vocabulary problem, which concerns the discrepancies between the terms used to describe documents and the terms researchers use to describe their information needs. We have implemented an automatic thesaurus; the system was built using the Vector Space Model (VSM) with the cosine similarity measure. In this paper we use 242 selected Arabic abstracts, all of which concern computer science and information systems. The main goal of this paper is to design and build an automatic Arabic thesaurus using term-term similarity that can be used in any special field or domain to improve the expansion process and to retrieve more relevant documents for the user's query. The study concluded that the similarity thesaurus improved recall and precision compared with a traditional information retrieval system.
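
    As a rough illustration of a term-term similarity thesaurus in the vector space model, the sketch below represents each term by its counts across documents and ranks other terms by cosine similarity. It is a minimal example under stated assumptions, not the authors' system: it uses raw counts rather than any particular weighting scheme, whitespace tokenization, and a toy English corpus in place of the Arabic abstracts. All function names are hypothetical.

```python
import math
from collections import Counter


def term_vectors(docs):
    """Map each term to a vector of its raw counts across the documents."""
    counts = [Counter(doc.lower().split()) for doc in docs]
    vocab = sorted(set().union(*counts))
    return {term: [c[term] for c in counts] for term in vocab}


def cosine(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def related_terms(term, vectors, top_n=3):
    """Rank the other terms by cosine similarity to `term`, best first."""
    target = vectors[term]
    scored = [(cosine(target, vec), other)
              for other, vec in vectors.items() if other != term]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(other, round(score, 3)) for score, other in scored[:top_n]]


def expand_query(query, vectors, top_n=1):
    """Append the closest thesaurus terms for each known query term."""
    terms = query.lower().split()
    extra = [other for t in terms if t in vectors
             for other, _ in related_terms(t, vectors, top_n)]
    return terms + [t for t in dict.fromkeys(extra) if t not in terms]


docs = [
    "database query index",
    "database index storage",
    "network protocol packet",
    "query database",
]
vectors = term_vectors(docs)
print(related_terms("database", vectors))
print(expand_query("storage", vectors))
```

    Terms that co-occur in the same documents end up with similar vectors, which is what lets the thesaurus suggest expansion terms for a query.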

  15. Toward higher effectiveness for recall-oriented information retrieval: A patent retrieval case study

    OpenAIRE

    Magdy, Walid

    2012-01-01

    Research in information retrieval (IR) has largely been directed towards tasks requiring high precision. Recently, other IR applications which can be described as recall-oriented IR tasks have received increased attention in the IR research domain. Prominent among these IR applications are patent search and legal search, where users are typically ready to check hundreds or possibly thousands of documents in order to find any possible relevant document. The main concerns in this kind of applic...

  16. Iterative Filtering of Retrieved Information to Increase Relevance

    Directory of Open Access Journals (Sweden)

    Robert Zeidman

    2007-12-01

    Full Text Available Efforts have been underway for years to find more effective ways to retrieve information from large knowledge domains. This effort is now being driven particularly by the Internet and the vast amount of information that is available to unsophisticated users. In the early days of the Internet, some effort involved allowing users to enter Boolean equations of search terms into search engines, for example, rather than just a list of keywords. More recently, effort has focused on understanding a user's desires from past search histories in order to narrow searches. Also there has been much effort to improve the ranking of results based on some measure of relevancy. This paper discusses using iterative filtering of retrieved information to focus in on useful information. This work was done for finding source code correlation and the author extends his findings to Internet searching and e-commerce. The paper presents specific information about a particular filtering application and then generalizes it to other forms of information retrieval.

  17. Software Helps Retrieve Information Relevant to the User

    Science.gov (United States)

    Mathe, Natalie; Chen, James

    2003-01-01

    The Adaptive Indexing and Retrieval Agent (ARNIE) is a code library, designed to be used by an application program, that assists human users in retrieving desired information in a hypertext setting. Using ARNIE, the program implements a computational model for interactively learning what information each human user considers relevant in context. The model, called a "relevance network," incrementally adapts retrieved information to users' individual profiles on the basis of feedback from the users regarding specific queries. The model also generalizes such knowledge for subsequent derivation of relevant references for similar queries and profiles, thereby assisting users in filtering information by relevance. ARNIE thus enables users to categorize and share information of interest in various contexts. ARNIE encodes the relevance and structure of information in a neural network dynamically configured with a genetic algorithm. ARNIE maintains an internal database, wherein it saves associations, and from which it returns associated items in response to a query. A C++ compiler for a platform on which ARNIE will be utilized is necessary for creating the ARNIE library but is not necessary for the execution of the software.

  18. Designing Health Websites Based on Users' Web-Based Information-Seeking Behaviors: A Mixed-Method Observational Study.

    Science.gov (United States)

    Pang, Patrick Cheong-Iao; Chang, Shanton; Verspoor, Karin; Pearce, Jon

    2016-06-06

    Laypeople increasingly use the Internet as a source of health information, but finding and discovering the right information remains problematic. These issues are partially due to the mismatch between the design of consumer health websites and the needs of health information seekers, particularly the lack of support for "exploring" health information. The aim of this research was to create a design for consumer health websites by supporting different health information-seeking behaviors. We created a website called Better Health Explorer with the new design. Through the evaluation of this new design, we derive design implications for future implementations. Better Health Explorer was designed using a user-centered approach. The design was implemented and assessed through a laboratory-based observational study. Participants tried to use Better Health Explorer and another live health website. Both websites contained the same content. A mixed-method approach was adopted to analyze multiple types of data collected in the experiment, including screen recordings, activity logs, Web browsing histories, and audiotaped interviews. Overall, 31 participants took part in the observational study. Our new design showed a positive result for improving the experience of health information seeking, by providing a wide range of information and an engaging environment. The results showed better knowledge acquisition, a higher number of page reads, and more query reformulations in both focused and exploratory search tasks. In addition, participants spent more time to discover health information with our design in exploratory search tasks, indicating higher engagement with the website. Finally, we identify 4 design considerations for designing consumer health websites and health information-seeking apps: (1) providing a dynamic information scope; (2) supporting serendipity; (3) considering trust implications; and (4) enhancing interactivity. Better Health Explorer provides strong

  19. Retrieving self-vocalized information: An event-related potential (ERP) study on the effect of retrieval orientation.

    Science.gov (United States)

    Rosburg, Timm; Johansson, Mikael; Sprondel, Volker; Mecklinger, Axel

    2014-11-18

    Retrieval orientation refers to a pre-retrieval process and conceptualizes the specific form of processing that is applied to a retrieval cue. In the current event-related potential (ERP) study, we sought to find evidence for an involvement of the auditory cortex when subjects attempt to retrieve vocalized information, and hypothesized that adopting retrieval orientation would be beneficial for retrieval accuracy. During study, participants saw object words that they subsequently vocalized or visually imagined. At test, participants had to identify object names of one study condition as targets and to reject object names of the second condition together with new items. Target category switched after half of the test trials. Behaviorally, participants responded less accurately and more slowly to targets of the vocalize condition than to targets of the imagine condition. ERPs to new items varied at a single left electrode (T7) between 500 and 800 ms, indicating a moderate retrieval orientation effect in the subject group as a whole. However, whereas the effect was strongly pronounced in participants with high retrieval accuracy, it was absent in participants with low retrieval accuracy. A current source density (CSD) mapping of the retrieval orientation effect indicated a source over left temporal regions. Independently from retrieval accuracy, the ERP retrieval orientation effect was surprisingly also modulated by test order. Findings are suggestive of an involvement of the auditory cortex in retrieval attempts of vocalized information and confirm that adopting retrieval orientation is potentially beneficial for retrieval accuracy. The effects of test order on retrieval-related processes might reflect a stronger focus on the newness of items in the more difficult test condition when participants started with this condition.

  20. Acute low back pain information online: an evaluation of quality, content accuracy and readability of related websites.

    Science.gov (United States)

    Hendrick, Paul A; Ahmed, Osman H; Bankier, Shane S; Chan, Tze Jieh; Crawford, Sarah A; Ryder, Catherine R; Welsh, Lisa J; Schneiders, Anthony G

    2012-08-01

    The internet is increasingly being used as a source of health information by the general public. Numerous websites exist that provide advice and information on the diagnosis and management of acute low back pain (ALBP); however, the accuracy and utility of this information have yet to be established. The aim of this study was to establish the quality, content and readability of online information relating to the treatment and management of ALBP. The internet was systematically searched using Google search engines from six major English-speaking countries. In addition, relevant national and international low back pain-related professional organisations were also searched. A total of 22 relevant websites were identified. The accuracy of the content of the ALBP information was established using a 13-point guide developed from international guidelines. Website quality was evaluated using the HONcode, and the Flesch-Kincaid Grade Level (FKGL) was used to establish readability. The majority of websites lacked accurate information, resulting in an overall mean content accuracy score of 6.3/17. Only 3 websites had a high content accuracy score (>14/17) along with an acceptable readability score (FKGL 6-8), with the majority of websites providing information which exceeded the recommended level for the average person to comprehend. The most accurately reported category was "Education and reassurance" (98%), while information regarding "manipulation" (50%), "massage" (9%) and "exercise" (0%) was amongst the lowest scoring categories. These results demonstrate the need for more accurate and readable internet-based ALBP information specifically centred on evidence-based guidelines.
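
    For readers unfamiliar with the readability measure named above, the Flesch-Kincaid Grade Level is 0.39 × (words per sentence) + 11.8 × (syllables per word) − 15.59. The sketch below applies that formula with a deliberately crude syllable counter (vowel groups, minus a trailing silent "e"), which is an assumption of this example and not the tool used in the study. Dedicated readability software will give somewhat different scores.

```python
import re


def count_syllables(word):
    """Rough heuristic: count vowel groups, ignoring a trailing silent 'e'."""
    word = word.lower()
    if len(word) > 2 and word.endswith("e") and not word.endswith("le"):
        word = word[:-1]
    return max(1, len(re.findall(r"[aeiouy]+", word)))


def fkgl(text):
    """Flesch-Kincaid Grade Level of `text` using the heuristic syllable counter."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return 0.39 * len(words) / len(sentences) + 11.8 * syllables / len(words) - 15.59


plain = "Stay active. Rest is not needed."
dense = "Prolonged immobilisation is contraindicated in uncomplicated presentations."
print(round(fkgl(plain), 1), round(fkgl(dense), 1))
```

    Short sentences built from short words score low, which is why the two hypothetical advice snippets land far apart on the scale.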

  1. Use of information-retrieval languages in automated retrieval of experimental data from long-term storage

    Science.gov (United States)

    Khovanskiy, Y. D.; Kremneva, N. I.

    1975-01-01

    Problems and methods are discussed of automating information retrieval operations in a data bank used for long term storage and retrieval of data from scientific experiments. Existing information retrieval languages are analyzed along with those being developed. The results of studies discussing the application of the descriptive 'Kristall' language used in the 'ASIOR' automated information retrieval system are presented. The development and use of a specialized language of the classification-descriptive type, using universal decimal classification indices as the main descriptors, is described.

  2. BiLingual Information Retrieval System for English and Tamil

    CERN Document Server

    Saraswathi, S; K, Kalaimagal; M, Kalaiyarasi

    2010-01-01

    This paper addresses the design and implementation of a BiLingual Information Retrieval system for the Festivals domain. A generic platform is built for BiLingual Information Retrieval which can be extended to any foreign or Indian language working with the same efficiency. The search for the answer to a query is not done in a specific predefined set of standard languages; the language is chosen dynamically by processing the user's query. This paper deals with the Indian language Tamil in addition to English. The task is to retrieve the solution for the user-given query in the same language as that of the query. In this process, an ontological tree is built for the domain in such a way that every node of the tree has entries in both of these languages. A Part-Of-Speech (POS) tagger is used to determine the keywords from the given query. Based on the context, the keywords are translated to the appropriate language using the ontological tree. A search is performed and documents are retrieved based on the keywords. With...

  3. Speech-recognition interfaces for music information retrieval

    Science.gov (United States)

    Goto, Masataka

    2005-09-01

    This paper describes two hands-free music information retrieval (MIR) systems that enable a user to retrieve and play back a musical piece by saying its title or the artist's name. Although various interfaces for MIR have been proposed, speech-recognition interfaces suitable for retrieving musical pieces have not been studied. Our MIR-based jukebox systems employ two different speech-recognition interfaces for MIR, speech completion and speech spotter, which exploit intentionally controlled nonverbal speech information in original ways. The first is a music retrieval system with the speech-completion interface that is suitable for music stores and car-driving situations. When a user only remembers part of the name of a musical piece or an artist and utters only a remembered fragment, the system helps the user recall and enter the name by completing the fragment. The second is a background-music playback system with the speech-spotter interface that can enrich human-human conversation. When a user is talking to another person, the system allows the user to enter voice commands for music playback control by spotting a special voice-command utterance in face-to-face or telephone conversations. Experimental results from use of these systems have demonstrated the effectiveness of the speech-completion and speech-spotter interfaces. (Video clips: http://staff.aist.go.jp/m.goto/MIR/speech-if.html)

  4. Case retrieval in medical databases by fusing heterogeneous information.

    Science.gov (United States)

    Quellec, Gwénolé; Lamard, Mathieu; Cazuguel, Guy; Roux, Christian; Cochener, Béatrice

    2011-01-01

    A novel content-based heterogeneous information retrieval framework, particularly well suited to browse medical databases and support new generation computer aided diagnosis (CADx) systems, is presented in this paper. It was designed to retrieve possibly incomplete documents, consisting of several images and semantic information, from a database; more complex data types such as videos can also be included in the framework. The proposed retrieval method relies on image processing, in order to characterize each individual image in a document by their digital content, and information fusion. Once the available images in a query document are characterized, a degree of match, between the query document and each reference document stored in the database, is defined for each attribute (an image feature or a metadata). A Bayesian network is used to recover missing information if need be. Finally, two novel information fusion methods are proposed to combine these degrees of match, in order to rank the reference documents by decreasing relevance for the query. In the first method, the degrees of match are fused by the Bayesian network itself. In the second method, they are fused by the Dezert-Smarandache theory: the second approach lets us model our confidence in each source of information (i.e., each attribute) and take it into account in the fusion process for a better retrieval performance. The proposed methods were applied to two heterogeneous medical databases, a diabetic retinopathy database and a mammography screening database, for computer aided diagnosis. Precisions at five of 0.809 ± 0.158 and 0.821 ± 0.177, respectively, were obtained for these two databases, which is very promising.
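
    The "precision at five" figures reported above measure how many of the first five retrieved cases are relevant to the query. The sketch below shows that metric and its aggregation over several queries. The case IDs and relevance sets are invented for illustration, and the choice of the population standard deviation for the "±" term is an assumption, since the abstract does not say which one was used.

```python
from statistics import mean, pstdev


def precision_at_k(ranked_ids, relevant_ids, k=5):
    """Fraction of the top-k ranked items that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    top = ranked_ids[:k]
    return sum(1 for item in top if item in relevant_ids) / k


def summarize(per_query_scores):
    """Mean and (population) standard deviation across queries."""
    return mean(per_query_scores), pstdev(per_query_scores)


# Hypothetical ranked results and ground-truth relevant cases for two queries.
queries = [
    (["c7", "c2", "c9", "c4", "c1", "c3"], {"c2", "c4", "c1", "c8"}),
    (["c5", "c6", "c8", "c2", "c0"], {"c5", "c6", "c8", "c2", "c0"}),
]
scores = [precision_at_k(ranked, relevant) for ranked, relevant in queries]
avg, spread = summarize(scores)
print(scores, round(avg, 3), round(spread, 3))
```

    Dividing by k rather than by the number of items returned penalizes a system that returns fewer than k results.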

  5. A Survey on Web Text Information Retrieval in Text Mining

    Directory of Open Access Journals (Sweden)

    Tapaswini Nayak

    2015-08-01

    Full Text Available In this study we have analyzed different techniques for information retrieval in text mining. The aim of the study is to identify techniques for web text information retrieval. Text mining, roughly equivalent to text analytics, is the process of deriving high-quality information from text. High-quality information is typically derived by devising patterns and trends through means such as statistical pattern learning. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, creation of coarse taxonomies, sentiment analysis, document summarization and entity relation modeling. It is used to mine hidden information from unstructured or semi-structured data. This capability is necessary because a large amount of Web information is semi-structured due to the nested structure of HTML code, is linked and is redundant. Web content categorization with a content database is the most important tool for the efficient use of search engines. A customer requesting information on a particular subject or item would otherwise have to search through hundreds of results to find the information most relevant to the query. Text mining reduces these hundreds of results, which eliminates the aggravation and improves the navigation of information on the Web.

  6. Risk communication and informed consent in the medical tourism industry: A thematic content analysis of Canadian broker websites

    Science.gov (United States)

    2011-01-01

    Background: Medical tourism, thought of as patients seeking non-emergency medical care outside of their home countries, is a growing industry worldwide. Canadians are amongst those engaging in medical tourism, and many are helped in the process of accessing care abroad by medical tourism brokers - agents who specialize in making international medical care arrangements for patients. As a key source of information for these patients, brokers are likely to play an important role in communicating the risks and benefits of undergoing surgery or other procedures abroad to their clientele. This raises important ethical concerns regarding processes such as informed consent and the liability of brokers in the event that complications arise from procedures. The purpose of this article is to examine the language, information, and online marketing of Canadian medical tourism brokers' websites in light of such ethical concerns. Methods: An exhaustive online search using multiple search engines and keywords was performed to compile a comprehensive directory of English-language Canadian medical tourism brokerage websites. These websites were examined using thematic content analysis, which included identifying informational themes, generating frequency counts of these themes, and comparing trends in these counts to the established literature. Results: Seventeen websites were identified for inclusion in this study. It was found that Canadian medical tourism broker websites varied widely in scope, content, professionalism and depth of information. Three themes emerged from the thematic content analysis: training and accreditation, risk communication, and business dimensions. Third party accreditation bodies of debatable regulatory value were regularly mentioned on the reviewed websites, and discussion of surgical risk was absent on 47% of the websites reviewed, with limited discussion of risk on the remaining ones. Terminology describing brokers' roles was somewhat inconsistent across

  7. Risk communication and informed consent in the medical tourism industry: A thematic content analysis of Canadian broker websites

    Directory of Open Access Journals (Sweden)

    Crooks Valorie A

    2011-09-01

    Full Text Available Background: Medical tourism, thought of as patients seeking non-emergency medical care outside of their home countries, is a growing industry worldwide. Canadians are amongst those engaging in medical tourism, and many are helped in the process of accessing care abroad by medical tourism brokers - agents who specialize in making international medical care arrangements for patients. As a key source of information for these patients, brokers are likely to play an important role in communicating the risks and benefits of undergoing surgery or other procedures abroad to their clientele. This raises important ethical concerns regarding processes such as informed consent and the liability of brokers in the event that complications arise from procedures. The purpose of this article is to examine the language, information, and online marketing of Canadian medical tourism brokers' websites in light of such ethical concerns. Methods: An exhaustive online search using multiple search engines and keywords was performed to compile a comprehensive directory of English-language Canadian medical tourism brokerage websites. These websites were examined using thematic content analysis, which included identifying informational themes, generating frequency counts of these themes, and comparing trends in these counts to the established literature. Results: Seventeen websites were identified for inclusion in this study. It was found that Canadian medical tourism broker websites varied widely in scope, content, professionalism and depth of information. Three themes emerged from the thematic content analysis: training and accreditation, risk communication, and business dimensions. Third party accreditation bodies of debatable regulatory value were regularly mentioned on the reviewed websites, and discussion of surgical risk was absent on 47% of the websites reviewed, with limited discussion of risk on the remaining ones. Terminology describing brokers' roles was

  8. Risk communication and informed consent in the medical tourism industry: a thematic content analysis of Canadian broker websites.

    Science.gov (United States)

    Penney, Kali; Snyder, Jeremy; Crooks, Valorie A; Johnston, Rory

    2011-09-26

    Medical tourism, thought of as patients seeking non-emergency medical care outside of their home countries, is a growing industry worldwide. Canadians are amongst those engaging in medical tourism, and many are helped in the process of accessing care abroad by medical tourism brokers - agents who specialize in making international medical care arrangements for patients. As a key source of information for these patients, brokers are likely to play an important role in communicating the risks and benefits of undergoing surgery or other procedures abroad to their clientele. This raises important ethical concerns regarding processes such as informed consent and the liability of brokers in the event that complications arise from procedures. The purpose of this article is to examine the language, information, and online marketing of Canadian medical tourism brokers' websites in light of such ethical concerns. An exhaustive online search using multiple search engines and keywords was performed to compile a comprehensive directory of English-language Canadian medical tourism brokerage websites. These websites were examined using thematic content analysis, which included identifying informational themes, generating frequency counts of these themes, and comparing trends in these counts to the established literature. Seventeen websites were identified for inclusion in this study. It was found that Canadian medical tourism broker websites varied widely in scope, content, professionalism and depth of information. Three themes emerged from the thematic content analysis: training and accreditation, risk communication, and business dimensions. Third party accreditation bodies of debatable regulatory value were regularly mentioned on the reviewed websites, and discussion of surgical risk was absent on 47% of the websites reviewed, with limited discussion of risk on the remaining ones. Terminology describing brokers' roles was somewhat inconsistent across the websites. Finally

  9. [Information quality in general public French-speaking websites dedicated to oral cancer detection].

    Science.gov (United States)

    Vivien, A; Kowalski, V; Chatellier, A; Babin, E; Bénateau, H; Veyssière, A

    2017-02-01

    The goal set by the highest French national authorities in the 2014-2019 Cancer Plan is to "heal more sick persons by promoting early diagnosis through screening". Screening requires information. Nowadays, the Internet allows for access to information "in one click". The aim of our study was to evaluate the quality of information found on the Internet. Several sites dedicated to oral cavity cancer screening were selected on Google. The quality of health information found in these sites was evaluated by the DISCERN questionnaire. The quality of decision support provided by the sites was evaluated by the IPDAS checklist. Twenty-seven sites were selected. The average DISCERN score was 25.1/75 (15/75 to 40/75). Eighteen sites (66.6%) had very poor, 8 sites (29.6%) had poor and 1 site had average information quality. IPDAS scores ranged from 11.1 to 38.1. Eight sites (29.6%) had less than 20%, 14 sites (51.9%) had between 20 and 30% and 5 sites (18.5%) had 30% or more validated criteria. No site achieved the pass mark. The quality of general public French-speaking websites dedicated to oral cancer detection is very poor. The role of health professionals such as general practitioners and head and neck surgeons remains essential. Copyright © 2016 Elsevier Masson SAS. All rights reserved.

  10. Using cognitive and affective illustrations to enhance older adults’ website satisfaction and recall of online cancer-related information.

    NARCIS (Netherlands)

    Bol, R.; van Weert, J.C.M.; de Haes, J.C.J.M.; Loos, Eugène; De Heer, S.; Sikkel, D.; Smets, E.M.A.

    2014-01-01

    This study examined the effect of adding cognitive and affective illustrations to online health information (vs. text only) on older adults’ website satisfaction and recall of cancer-related information. Results of an online experiment among younger and older adults showed that illustrations increas

  11. Topic Map: An Ontology Framework for Information Retrieval

    CERN Document Server

    Kannan, Rajkumar

    2010-01-01

    The basic classification techniques for organizing information are thesauri, taxonomy and faceted classification. Topic maps are a relatively new entrant to this information space. The topic map standard describes how complex relationships between abstract concepts and real world resources can be represented using XML syntax. This paper explores how topic maps incorporate the traditional techniques and what their advantages and disadvantages are in several dimensions such as content management, indexing, knowledge representation, constraint specification and query languages in the context of information retrieval. The constructs of topic maps are illustrated with a use-case implemented in XTM

  12. 8th International Workshop on Information Filtering and Retrieval

    CERN Document Server

    Giuliani, Alessandro; Semeraro, Giovanni

    2017-01-01

    This book focuses on new research challenges in intelligent information filtering and retrieval. It collects invited chapters and extended research contributions from DART 2014 (the 8th International Workshop on Information Filtering and Retrieval), held in Pisa (Italy), on December 10, 2014, and co-hosted with the XIII AI*IA Symposium on Artificial Intelligence. The main focus of DART was to discuss and compare suitable novel solutions based on intelligent techniques and applied to real-world contexts. The chapters of this book present a comprehensive review of related works and the current state of the art. The contributions from both practitioners and researchers have been carefully reviewed by experts in the area, who also gave useful suggestions to improve the quality of the book.

  13. A new approach to query expansion in information retrieval

    Institute of Scientific and Technical Information of China (English)

    Li Weijiang; Zhao Tiejun; Wang Xiangang

    2008-01-01

    To eliminate the mismatch between the words of relevant documents and the user's query, and the serious negative effect this mismatch has on information retrieval performance, a query expansion method based on a new term co-occurrence representation was put forward by analyzing the process of producing a query. The expansion terms were selected according to their correlation with the whole query. At the same time, the positional information between terms was considered. Experimental results on Text REtrieval Conference (TREC) data collections show that the proposed method consistently improves on the language modeling method without expansion by 5% to 19%. Compared with the popular query expansion approach, pseudo feedback, the precision of the proposed method is competitive.
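
    The general idea described above (choosing expansion terms by their correlation with the whole query while taking term positions into account) can be illustrated with a minimal Python sketch. This is not the authors' method: the function names, the window size and the 1/distance proximity weight are all assumptions made purely for illustration.

```python
from collections import defaultdict

def cooccurrence_scores(docs, window=3):
    """Position-weighted co-occurrence (hypothetical weighting: 1/distance)."""
    scores = defaultdict(float)
    for doc in docs:
        tokens = doc.lower().split()
        for i, a in enumerate(tokens):
            # Only look at neighbours within `window` positions to the right.
            for j in range(i + 1, min(i + window + 1, len(tokens))):
                b = tokens[j]
                if a != b:
                    w = 1.0 / (j - i)  # closer terms count more
                    scores[(a, b)] += w
                    scores[(b, a)] += w
    return scores

def expand_query(query, docs, k=2, window=3):
    """Append the k candidate terms most correlated with the query as a whole."""
    q_terms = query.lower().split()
    scores = cooccurrence_scores(docs, window)
    candidates = defaultdict(float)
    for (a, b), w in scores.items():
        if a in q_terms and b not in q_terms:
            # Summing over every query term scores b against the whole query,
            # not against a single query word.
            candidates[b] += w
    ranked = sorted(candidates.items(), key=lambda kv: (-kv[1], kv[0]))
    return q_terms + [term for term, _ in ranked[:k]]
```

    Calling expand_query("query retrieval", docs, k=1) on a small list of document strings returns the two original terms followed by the single best-scoring expansion term.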

  14. Designing Health Websites Based on Users’ Web-Based Information-Seeking Behaviors: A Mixed-Method Observational Study

    Science.gov (United States)

    Pang, Patrick Cheong-Iao; Verspoor, Karin; Pearce, Jon

    2016-01-01

    Background Laypeople increasingly use the Internet as a source of health information, but finding and discovering the right information remains problematic. These issues are partially due to the mismatch between the design of consumer health websites and the needs of health information seekers, particularly the lack of support for “exploring” health information. Objective The aim of this research was to create a design for consumer health websites by supporting different health information–seeking behaviors. We created a website called Better Health Explorer with the new design. Through the evaluation of this new design, we derive design implications for future implementations. Methods Better Health Explorer was designed using a user-centered approach. The design was implemented and assessed through a laboratory-based observational study. Participants tried to use Better Health Explorer and another live health website. Both websites contained the same content. A mixed-method approach was adopted to analyze multiple types of data collected in the experiment, including screen recordings, activity logs, Web browsing histories, and audiotaped interviews. Results Overall, 31 participants took part in the observational study. Our new design showed a positive result for improving the experience of health information seeking, by providing a wide range of information and an engaging environment. The results showed better knowledge acquisition, a higher number of page reads, and more query reformulations in both focused and exploratory search tasks. In addition, participants spent more time to discover health information with our design in exploratory search tasks, indicating higher engagement with the website. Finally, we identify 4 design considerations for designing consumer health websites and health information–seeking apps: (1) providing a dynamic information scope; (2) supporting serendipity; (3) considering trust implications; and (4) enhancing interactivity

  15. Informative Top-k Retrieval for Advanced Skill Management

    Science.gov (United States)

    Colucci, Simona; di Noia, Tommaso; Ragone, Azzurra; Ruta, Michele; Straccia, Umberto; Tinelli, Eufemia

    The paper presents a knowledge-based framework for skills and talent management based on an advanced matchmaking between profiles of candidates and available job positions. Interestingly, the informative content of top-k retrieval is enriched through semantic capabilities. The proposed approach makes it possible to: (1) express a requested profile in terms of both hard constraints and soft ones; (2) provide a ranking function based also on qualitative attributes of a profile; (3) explain the resulting outcomes (given a job request, a motivation for the obtained score of each selected profile is provided). Top-k retrieval allows the most promising candidates to be selected according to an ontology formalizing the domain knowledge. Such knowledge is further exploited to provide a semantic-based explanation of missing or conflicting features in retrieved profiles. They also indicate additional profile characteristics emerging from the retrieval procedure for a further request refinement. A concrete case study followed by an exhaustive experimental campaign is reported to demonstrate the effectiveness of the approach.

  16. Latest Trends in Web Information Retrieval and in SEO Factors

    Directory of Open Access Journals (Sweden)

    Carlos Gonzalo

    2015-07-01

    Full Text Available Latest trends in web information retrieval and in SEO factors are increasingly focused on signals from users, such as the profile of the person performing the search and the interpretation of user intent. The objective of search engines is twofold: focusing as much as possible on users while making the composition of the search engine results page (SERP) ever less predictable, and combating spam.

  17. Information overload, retrieval strategies and Internet user empowerment

    OpenAIRE

    Carlson, Christopher N.

    2003-01-01

    Initial user benefits from search engine technology have been critically degraded over time by the rapid increase of Internet pages. Traditional retrieval strategies therefore yield increasingly poor results due to a dramatic increase in ballast in the results. Search engine users thus increasingly experience information overload. Technical approaches to dealing with this problem have caused an initial euphoria, yet have proven ineffective in solving the problem. Enhancement of user empow...

  18. The Use of a Context-Based Information Retrieval Technique

    Science.gov (United States)

    2009-07-01

    Carlson, 2004). However, in order to reduce plagiarism and manipulation, the specific details of these algorithms are closely protected and changed...age, academic background and gender can affect performance using information retrieval systems (Borgman, 1989). These factors can result in...and academic qualifications, a large proportion of the sample were recruited from a third year level or higher. 2.2 Materials 2.2.1 Demographic

  19. Web Structure Mining: Exploring Hyperlinks and Algorithms for Information Retrieval

    Directory of Open Access Journals (Sweden)

    P. R. Kumar

    2010-01-01

    Full Text Available Problem statement: A study on hyperlink analysis and the algorithms used for link analysis in Web information retrieval was done. Approach: This research was initiated because of the dependence on search engines for information retrieval on the web. The aim was to understand web structure mining and determine the importance of hyperlinks in web information retrieval, particularly using the Google search engine. Hyperlink analysis is an important methodology used by the famous search engine Google to rank pages. Results: The different algorithms used for link analysis, namely the PageRank (PR), Weighted PageRank (WPR) and Hyperlink-Induced Topic Search (HITS) algorithms, are discussed and compared. The PageRank algorithm was implemented in a Java program and the convergence of the PageRank values is shown in chart form. Conclusion: This study was done basically to explore the link structure algorithms for ranking and to compare those algorithms. Further research in this area will address the problems facing the PageRank algorithm and how to handle them.
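
    For readers unfamiliar with the PageRank iteration mentioned above, a minimal power-iteration sketch follows. It is written in Python rather than the Java used in the study, and it is not the authors' program: the function name, the damping factor of 0.85, the convergence tolerance and the even redistribution of rank from pages without outgoing links are all assumptions for illustration.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Power iteration over a dict mapping each page to the pages it links to."""
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(nodes)
    rank = {u: 1.0 / n for u in nodes}  # start from a uniform distribution
    for _ in range(max_iter):
        # Rank held by pages with no out-links is spread evenly over all pages.
        dangling = sum(rank[u] for u in nodes if not links.get(u))
        new = {u: (1 - damping) / n + damping * dangling / n for u in nodes}
        for u in nodes:
            outs = links.get(u, [])
            for v in outs:
                new[v] += damping * rank[u] / len(outs)
        delta = sum(abs(new[u] - rank[u]) for u in nodes)
        rank = new
        if delta < tol:  # stop once the values have converged
            break
    return rank
```

    On the three-page graph {"A": ["B", "C"], "B": ["C"], "C": ["A"]}, the ranks sum to 1 and page C, which receives links from both A and B, ends up ranked highest.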

  20. Models of a Distributed Information Retrieval System Based on Thesauri with Weights.

    Science.gov (United States)

    Mazur, Zygmunt

    1994-01-01

    Discusses distributed information retrieval systems that take into account the weights of descriptors from thesauri. Topics addressed include a mathematical model for information retrieval subsystems; organization of inverted files; models for the distributed homogeneous information systems; a distributed information retrieval system based on…

  1. [Malaria websites].

    Science.gov (United States)

    Genton, B

    2007-05-16

    One click on google.com, key-word "Malaria", 24,900,000 entries. How to choose among this jungle of websites? Ten sites are proposed to meet the needs of the general practitioner. They are categorized by focus of interest, namely 1) detailed information on pre- and post-travel advice and management of travelers with illness upon return, 2) the essentials on the parasite, the diagnosis and the treatment, 3) the malaria problem worldwide and 4) malaria maps.

  2. Lower-Cost ε-Private Information Retrieval

    Directory of Open Access Journals (Sweden)

    Toledo Raphael R.

    2016-10-01

    Full Text Available Private Information Retrieval (PIR), despite being well studied, is computationally costly and arduous to scale. We explore lower-cost relaxations of information-theoretic PIR, based on dummy queries, sparse vectors, and compositions with an anonymity system. We prove the security of each scheme using a flexible differentially private definition for private queries that can capture notions of imperfect privacy. We show that basic schemes are weak, but some of them can be made arbitrarily safe by composing them with large anonymity systems.
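
    As background for the information-theoretic PIR that the paper relaxes, the classic two-server XOR construction can be sketched as below. This is the textbook baseline, not one of the paper's lower-cost schemes, and it assumes two non-colluding servers holding identical copies of a database of integer records; the function names are hypothetical.

```python
import secrets

def server_answer(db, subset):
    """Each server returns the XOR of the records whose indices it was sent."""
    acc = 0
    for idx in subset:
        acc ^= db[idx]
    return acc

def make_queries(n, i):
    """Build two index sets that differ only in the wanted index i."""
    s1 = {j for j in range(n) if secrets.randbits(1)}  # uniformly random subset
    s2 = s1 ^ {i}  # symmetric difference: toggles membership of i
    return s1, s2

def retrieve(db, i):
    """Client side: XOR the two answers; every record except db[i] cancels out."""
    s1, s2 = make_queries(len(db), i)
    # In a real deployment s1 and s2 would go to two different servers.
    return server_answer(db, s1) ^ server_answer(db, s2)
```

    Each server sees only a uniformly random subset of indices, so neither learns i on its own. The cost is that every query touches about half the database, which is the kind of overhead the relaxations above aim to reduce.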

  3. Semantic Information Retrieval Using Ontology in University Domain

    Directory of Open Access Journals (Sweden)

    Swathi Rajasurya

    2012-11-01

    Full Text Available Today’s conventional search engines hardly provide the essential content relevant to the user’s search query. This is because the context and semantics of the request made by the user are not analyzed to the full extent. So here the need for a semantic web search arises. SWS is upcoming in the area of web search which combines Natural Language Processing and Artificial Intelligence. The objective of the work done here is to design, develop and implement a semantic search engine, SIEU (Semantic Information Extraction in University Domain), confined to the university domain. SIEU uses ontology as a knowledge base for the information retrieval process. It is not just a mere keyword search. It is one layer above what Google or any other search engines retrieve by analyzing just the keywords. Here the query is analyzed both syntactically and semantically. The developed system retrieves the web results more relevant to the user query through keyword expansion. The results obtained here will be accurate enough to satisfy the request made by the user. The level of accuracy will be enhanced since the query is analyzed semantically. The system will be of great use to the developers and researchers who work on web. The Google results are re-ranked and optimized for providing the relevant links. For ranking an algorithm has been applied which fetches more apt results for the user query

  4. Multi-lingual Information Retrieval in Digital Libraries

    Directory of Open Access Journals (Sweden)

    Hsiao-Tieh Pu

    1997-12-01

    Full Text Available With the advancement of the Internet and the Digital Library Initiatives in the U.S.A., research on digital libraries has flourished around the world. Recently the increasing availability of networked access to multilingual text collections within such an environment has drawn much attention to the development of cross-language retrieval technology. This article structures a comprehensive discussion of published research and known commercial practice in the western world on the topic. In addition to the focus on the characteristics of Chinese text collections, some brief observations on the potential for multilingual information retrieval are also discussed in detail. [Article content in Chinese]

  5. Estimating Missing Features to Improve Multimedia Information Retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Bagherjeiran, A; Love, N S; Kamath, C

    2006-09-28

    Retrieval in a multimedia database usually involves combining information from different modalities of data, such as text and images. However, all modalities of the data may not be available to form the query. The retrieval results from such a partial query are often less than satisfactory. In this paper, we present an approach to complete a partial query by estimating the missing features in the query. Our experiments with a database of images and their associated captions show that, with an initial text-only query, our completion method has similar performance to a full query with both image and text features. In addition, when we use relevance feedback, our approach outperforms the results obtained using a full query.

  6. An integrated information retrieval and document management system

    Science.gov (United States)

    Coles, L. Stephen; Alvarez, J. Fernando; Chen, James; Chen, William; Cheung, Lai-Mei; Clancy, Susan; Wong, Alexis

    1993-01-01

    This paper describes the requirements and prototype development for an intelligent document management and information retrieval system that will be capable of handling millions of pages of text or other data. Technologies for scanning, Optical Character Recognition (OCR), magneto-optical storage, and multiplatform retrieval using a Standard Query Language (SQL) will be discussed. The semantic ambiguity inherent in the English language is somewhat compensated-for through the use of coefficients or weighting factors for partial synonyms. Such coefficients are used both for defining structured query trees for routine queries and for establishing long-term interest profiles that can be used on a regular basis to alert individual users to the presence of relevant documents that may have just arrived from an external source, such as a news wire service. Although this attempt at evidential reasoning is limited in comparison with the latest developments in AI Expert Systems technology, it has the advantage of being commercially available.

  7. Retrieving Nuclear Information from Protons Propagating through A Thick Target

    CERN Document Server

    Giraud, B G

    2007-01-01

    The multiple scattering of high-energy particles in a thick target is formulated in an impact parameter representation. A formalism similar but not identical to that of Moliere is obtained. We show that calculations of particle beam broadening due to multiple Coulomb scattering alone can be given in closed form. The focus of this study is on whether or not the broadening of the Coulomb angular distribution prevents the retrieval of nuclear-interaction information from measuring the angular distributions of charged particles scattered from a thick target. For this purpose, we study multiple scatterings with both the nuclear and Coulomb interactions included and we do not make a small-angle expansion. Conditions for retrieving nuclear information from high-energy protons propagating through a block of material are obtained.

  8. INFORMATION RETRIEVAL SYSTEM USING MULTIWORDS EXPRESSIONS (MWE) AS DESCRIPTORS

    Directory of Open Access Journals (Sweden)

    Edson Marchetti da Silva

    2012-08-01

    Full Text Available This paper aims to propose an alternative method for retrieving documents using Multiwords Expressions (MWE) extracted from a document base to be used as descriptors in the search of an Information Retrieval System (IRS). In this sense, unlike methods that consider the text as a set of words (bag of words), we propose a method that takes into account the characteristics of the physical structure of the document in the MWE extraction process. The set of terms obtained with an exhaustive algorithmic technique proposed by the authors is compared with the results obtained for thirteen different statistical association measures generated by the Ngram Statistics Package (NSP) software. To perform this experiment, a corpus of documents in digital format was set up

  9. Tetrahydrocannabinol (THC) impairs encoding but not retrieval of verbal information.

    Science.gov (United States)

    Ranganathan, Mohini; Radhakrishnan, Rajiv; Addy, Peter H; Schnakenberg-Martin, Ashley M; Williams, Ashley H; Carbuto, Michelle; Elander, Jacqueline; Pittman, Brian; Andrew Sewell, R; Skosnik, Patrick D; D'Souza, Deepak Cyril

    2017-10-03

    Cannabis and agonists of the brain cannabinoid receptor (CB1R) produce acute memory impairments in humans. However, the extent to which cannabinoids impair the component processes of encoding and retrieval has not been established in humans. The objective of this analysis was to determine whether the administration of Δ(9)-Tetrahydrocannabinol (THC), the principal psychoactive constituent of cannabis, impairs encoding and/or retrieval of verbal information. Healthy subjects were recruited from the community. Subjects were administered the Rey-Auditory Verbal Learning Test (RAVLT) either before administration of THC (experiment #1) (n=38) or while under the influence of THC (experiment #2) (n=57). Immediate and delayed recall on the RAVLT was compared. Subjects received intravenous THC, in a placebo-controlled, double-blind, randomized manner at doses known to produce behavioral and subjective effects consistent with cannabis intoxication. Total immediate recall, short delayed recall, and long delayed recall were reduced in a statistically significant manner only when the RAVLT was administered to subjects while they were under the influence of THC (experiment #2) and not when the RAVLT was administered prior. THC acutely interferes with encoding of verbal memory without interfering with retrieval. These data suggest that learning information prior to the use of cannabis or cannabinoids is not likely to disrupt recall of that information. Future studies will be necessary to determine whether THC impairs encoding of non-verbal information, to what extent THC impairs memory consolidation, and the role of other cannabinoids in the memory-impairing effects of cannabis. Cannabinoids, Neural Synchrony, and Information Processing (THC-Gamma) http://clinicaltrials.gov/ct2/show/study/NCT00708994 NCT00708994 Pharmacogenetics of Cannabinoid Response http://clinicaltrials.gov/ct2/show/NCT00678730 NCT00678730. Copyright © 2017. Published by Elsevier Inc.

  10. Controlled Retrieval of Specific Context Information in Children and Adults.

    Science.gov (United States)

    Lorsbach, Thomas C; Friehe, Mary J; Teten, Amy Fair; Reimer, Jason F; Armendarez, Joseph J

    2015-01-01

    This study adapted a procedure used by Luo and Craik (2009) to examine whether developmental differences exist in the ability to use controlled retrieval processes to access the contextual details of memory representations. Participants from 3 age groups (mean ages 9, 12, and 25 years) were presented with words in 3 study contexts: with a black-and-white picture, with a color picture, or alone without a picture. Six recognition tests were then presented that varied in the demands (high or low) placed on the retrieval of specific contextual information. Each test consisted of a mixture of words that were old targets from 1 study context, distractors (i.e., previously studied words from a different context), and completely new words. A high-specificity and a low-specificity test list was paired with each test question, with high and low specificity being determined by the nature of the distractors used in a test list. High-specificity tests contained words that were studied in similar contexts: old targets (e.g., words studied with black-and-white pictures) and distractors (e.g., words studied with color pictures). In contrast, low-specificity tests contained words that were studied in dissimilar contexts: old targets (e.g., words studied with black-and-white pictures) and distractors (e.g., words previously studied without a picture). Relative to low-specificity tests, the retrieval conditions of high-specificity tests were assumed to place greater demands on the controlled access of specific contextual information. Analysis of recollection scores revealed that age differences were present on high- but not low-specificity tests, with the performance of 9-year-olds disproportionately affected by the retrieval demands of high-specificity tests.

  11. Diffused holographic information storage and retrieval using photorefractive optical materials

    Science.gov (United States)

    McMillen, Deanna Kay

    Holography offers a tremendous opportunity for dense information storage, theoretically one bit per cubic wavelength of material volume, with rapid retrieval, of up to thousands of pages of information simultaneously. However, many factors prevent the theoretical storage limit from being reached, including dynamic range problems and imperfections in recording materials. This research explores new ways of moving closer to practical holographic information storage and retrieval by altering the recording materials, in this case, photorefractive crystals, and by increasing the current storage capacity while improving the information retrieved. As an experimental example of the techniques developed, the information retrieved is the correlation peak from an optical recognition architecture, but the materials and methods developed are applicable to many other holographic information storage systems. Optical correlators can potentially solve any signal or image recognition problem. Military surveillance, fingerprint identification for law enforcement or employee identification, and video games are but a few examples of applications. A major obstacle keeping optical correlators from being universally accepted is the lack of a high quality, thick (high capacity) holographic recording material that operates with red or infrared wavelengths which are available from inexpensive diode lasers. This research addresses the problems from two positions: find a better material for use with diode lasers, and reduce the requirements placed on the material while maintaining an efficient and effective system. This research found that the solutions are new dopants introduced into photorefractive lithium niobate to improve wavelength sensitivities and the use of a novel inexpensive diffuser that reduces the dynamic range and optical element quality requirements (which reduces the cost) while improving performance. A uniquely doped set of 12 lithium niobate crystals was specified and

  12. An Examination of Doctoral Preparation Information in the United States: A Content Analysis of Counselor Education Doctoral Program Websites

    Science.gov (United States)

    Woo, Hongryun; Mulit, Cynthia J.; Visalli, Kelsea M.

    2016-01-01

    Counselor Education (CE) program websites play a role in program fit by helping prospective students learn about the profession, search for programs and apply for admission. Using the 2014 "ACA Code of Ethics'" nine categories of orientation content as its framework, this study explored the information provided on the 63…

  13. Web multimedia information retrieval using improved Bayesian algorithm

    Institute of Scientific and Technical Information of China (English)

    余铁军; 陈纯; 余铁民; 林怀忠

    2003-01-01

    The main thrust of this paper is application of a novel data mining approach on the log of user's feedback to improve web multimedia information retrieval performance. A user space model was constructed based on data mining, and then integrated into the original information space model to improve the accuracy of the new information space model. It can remove clutter and irrelevant text information and help to eliminate mismatch between the page author's expression and the user's understanding and expectation. User space model was also utilized to discover the relationship between high-level and low-level features for assigning weight. The authors proposed improved Bayesian algorithm for data mining. Experiment proved that the authors' proposed algorithm was efficient.

  14. Web multimedia information retrieval using improved Bayesian algorithm

    Institute of Scientific and Technical Information of China (English)

    余轶军; 陈纯; 余轶民; 林怀忠

    2003-01-01

    The main thrust of this paper is application of a novel data mining approach on the log of user's feedback to improve web multimedia information retrieval performance. A user space model was constructed based on data mining, and then integrated into the original information space model to improve the accuracy of the new information space model. It can remove clutter and irrelevant text information and help to eliminate mismatch between the page author's expression and the user's understanding and expectation. User space model was also utilized to discover the relationship between high-level and low-level features for assigning weight. The authors proposed improved Bayesian algorithm for data mining. Experiment proved that the authors' proposed algorithm was efficient.

  15. Information Content of Aerosol Retrievals in the Sunglint Region

    Science.gov (United States)

    Ottaviani, M.; Knobelspiesse, K.; Cairns, B.; Mishchenko, M.

    2013-01-01

    We exploit quantitative metrics to investigate the information content in retrievals of atmospheric aerosol parameters (with a focus on single-scattering albedo), contained in multi-angle and multi-spectral measurements with sufficient dynamical range in the sunglint region. The simulations are performed for two classes of maritime aerosols with optical and microphysical properties compiled from measurements of the Aerosol Robotic Network. The information content is assessed using the inverse formalism and is compared to that deriving from observations not affected by sunglint. We find that there indeed is additional information in measurements containing sunglint, not just for single-scattering albedo, but also for aerosol optical thickness and the complex refractive index of the fine aerosol size mode, although the amount of additional information varies with aerosol type.

  16. ERISTAR: Earth Resources Information Storage, Transformation, Analysis, and Retrieval

    Science.gov (United States)

    1972-01-01

    The National Aeronautics and Space Administration (NASA) and the American Society for Engineering Education (ASEE) have sponsored faculty fellowship programs in systems engineering design for the past several years. During the summer of 1972 four such programs were conducted by NASA, with Auburn University cooperating with Marshall Space Flight Center (MSFC). The subject for the Auburn-MSFC design group was ERISTAR, an acronym for Earth Resources Information Storage, Transformation, Analysis and Retrieval, which represents an earth resources information management network of state information centers administered by the respective states and linked to federally administered regional centers and a national center. The considerations for serving the users and the considerations that must be given to processing data from a variety of sources are described. The combination of these elements into a national network is discussed and an implementation plan is proposed for a prototype state information center. The compatibility of the proposed plan with the Department of Interior plan, RALI, is indicated.

  17. 15 CFR 950.9 - Computerized Environmental Data and Information Retrieval Service.

    Science.gov (United States)

    2010-01-01

    ... Information Retrieval Service. 950.9 Section 950.9 Commerce and Foreign Trade Regulations Relating to Commerce... Computerized Environmental Data and Information Retrieval Service. The Environmental Data Index (ENDEX... computerized, information retrieval service provides a parallel subject-author-abstract referral service....

  18. 42 CFR 433.116 - FFP for operation of mechanized claims processing and information retrieval systems.

    Science.gov (United States)

    2010-10-01

    ... FISCAL ADMINISTRATION Mechanized Claims Processing and Information Retrieval Systems § 433.116 FFP for operation of mechanized claims processing and information retrieval systems. (a) Subject to 42 CFR 433.113(c... and information retrieval systems. 433.116 Section 433.116 Public Health CENTERS FOR...

  19. Working in a developing communication space. Facebook and Twitter as journalistic tools for European information pure-player websites

    Directory of Open Access Journals (Sweden)

    Florian Tixier

    2014-06-01

    Full Text Available Since the creation of the European Union, European information has been a very important issue of communication. Numerous Europe-specialized information websites were born in the first decade of the 21st century, thus creating a European informational landscape on the Internet. In a context of journalistic technological and economical evolutions, journalists have to adapt rapidly their ways of working. A new function in terms of management of socio-numeric networks has appeared: community management. This research aims at analyzing the uses of Facebook and Twitter in the community management of online European information websites. We will be specifically observing how information makers integrate these technologies, which originally were not part of the journalistic work patterns, and how they use these new means of communication to circulate European ideas through self-promotion practices.

  20. Intimate partner violence among female service members and veterans: information and resources available through military and non-military websites.

    Science.gov (United States)

    Brown, Amy; Joshi, Manisha

    2014-01-01

    With the expansion of women's roles in the military, the number of female service members and veterans has increased. Considerable knowledge about intimate partner violence (IPV) in civilian couples exists but little is known about IPV among female service members and veterans. Prevalence rates of IPV range from 17% to 39% for female service members, and 21.9% to 74% for veterans. Most service members and veterans indicated using the Internet at least occasionally and expressed willingness to seek information about services via the Internet. Informed by data, we conducted a systematic review of military (Army, Navy, Air Force, and Marine Corps) and non-military (Veterans Affairs and Google) websites to explore the availability and presentation of information and resources related to IPV. The websites search revealed a variety of resources and information available, and important differences between sites with regard to what and how information is presented. Implications for practice and further research are discussed.

  2. Human Information Behaviour and Design, Development and Evaluation of Information Retrieval Systems

    Science.gov (United States)

    Keshavarz, Hamid

    2008-01-01

    Purpose: The purpose of this paper is to introduce the concept of human information behaviour and to explore the relationship between information behaviour of users and the existing approaches dominating design and evaluation of information retrieval (IR) systems and also to describe briefly new design and evaluation methods in which extensive…

  3. How well are health information websites displayed on mobile phones? Implications for the readability of health information.

    Science.gov (United States)

    Cheng, Christina; Dunn, Matthew

    2016-06-02

    Issue addressed: More than 87% of Australians own a mobile phone with Internet access and 82% of phone owners use their smartphones to search for health information, indicating that mobile phones may be a powerful tool for building health literacy. Yet, online health information has been found to be above the reading ability of the general population. As reading on a smaller screen may further complicate the readability of information, this study aimed to examine how health information is displayed on mobile phones and its implications for readability. Methods: Using a cross-sectional design with convenience sampling, a sample of 270 mobile webpages with information on 12 common health conditions was generated for analysis; they were categorised based on design and position of information display. Results: The results showed that 71.48% of webpages were mobile-friendly but only 15.93% were mobile-friendly webpages designed in a way to optimise readability, with a paging format and queried information displayed for immediate viewing. Conclusion: With inadequate evidence and lack of consensus on how webpage design can best promote reading and comprehension, it is difficult to draw a conclusion on the effect of current mobile health information presentation on readability. So what?: Building mobile-responsive websites should be a priority for health information providers and policy-makers. Research efforts are urgently required to identify how best to enhance readability of mobile health information and fully capture the capabilities of mobile phones as a useful device to increase health literacy.

  4. Survey the role of emotions in information retrieval

    Directory of Open Access Journals (Sweden)

    Hassan Behzadi

    2016-03-01

    Full Text Available The present study was conducted to identify users' emotions in the various stages of information retrieval, based on the model of information retrieval on the web. Methodologically, the study is experimental and applied. The population comprised all MA students in humanities fields at Imam Reza International University. The sample consisted of 30 participants; the sample size was determined through stratified random sampling using G*Power software. Data were collected using a questionnaire on demographics and prior experience of using the internet, a post-search questionnaire, and recorded videos of users' faces. The findings of the study demonstrated that: (1) during the initial stage of searching, apprehension was frequent; during the link tracking stage, negative emotions, at 49.3 percent overall, were more frequent than other emotions; and in the browsing and differentiation stages, happiness was more frequent than the other emotions. (2) These variances resulted in significant relations among the different emotions of the users throughout the four stages of information retrieval. (3) In simple searches, the respondents displayed happiness most frequently and aversion least frequently. In complicated searches, on the other hand, apprehension and aversion were the most and the least frequently cited emotions, respectively. Overall, negative emotions were reported more frequently in complicated searches than in simple searches. This demonstrated that any change in the difficulty level of the search task would cause users to exhibit different types of emotions.

  5. Semantic Annotation Framework For Intelligent Information Retrieval Using KIM Architecture

    Directory of Open Access Journals (Sweden)

    Sanjay Kumar Malik

    2010-11-01

    Full Text Available Due to the explosion of information and knowledge on the web and the wide use of search engines to find desired information, the role of knowledge management (KM) is becoming more significant in organizations. Knowledge management in an organization is used to create, capture, store, share, retrieve and manage information efficiently. The semantic web, an intelligent and meaningful web, tends to provide a promising platform for knowledge management systems and vice versa, since they have the potential to give each other the real substance for machine-understandable web resources, which in turn will lead to intelligent, meaningful and efficient information retrieval on the web. Today, the challenge for the web community is to integrate the distributed heterogeneous resources on the web with the objective of an intelligent web environment focusing on data semantics and user requirements. Semantic annotation (SA), which is about assigning to the entities in a text links to their semantic descriptions, is widely used. Various tools such as KIM and Amaya may be used for semantic annotation. In this paper, we introduce semantic annotation as one of the key technologies in an intelligent web environment, then review and discuss knowledge management and semantic annotation. A knowledge management framework and a framework for semantic annotation and semantic search with a knowledge base (GATE) and ontology are presented. Then the KIM annotation platform architecture, including the KIM Ontology (KIMO), the KIM Knowledge Base and the KIM front ends, is highlighted. Finally, intelligent pattern search and the relevant GATE framework are illustrated with a KIM annotation example, working towards intelligent information retrieval.

  6. Rapid automatic keyword extraction for information retrieval and analysis

    Science.gov (United States)

    Rose, Stuart J [Richland, WA]; Cowley, Wendy E [Richland, WA]; Crow, Vernon L [Richland, WA]; Cramer, Nicholas O [Richland, WA]

    2012-03-06

    Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.
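
The steps described in this abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the stop-word list, the tokenizer, the choice of degree divided by frequency as the word score, and the names `candidate_keywords`, `word_scores` and `extract_keywords` are all hypothetical choices made for this example.

```python
import re
from collections import defaultdict

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "and", "the", "of", "for", "in", "on",
              "is", "are", "to", "by", "or", "both"}

def candidate_keywords(text, stop_words=STOP_WORDS):
    """Split text at punctuation delimiters and stop words into candidate phrases."""
    candidates = []
    for fragment in re.split(r"[.,;:!?()\[\]\n]+", text.lower()):
        phrase = []
        for word in re.findall(r"[a-z0-9][a-z0-9\-]*", fragment):
            if word in stop_words:
                if phrase:                      # a stop word closes the current phrase
                    candidates.append(tuple(phrase))
                phrase = []
            else:
                phrase.append(word)
        if phrase:
            candidates.append(tuple(phrase))
    return candidates

def word_scores(candidates):
    """Score each word as co-occurrence degree / frequency (one possible function)."""
    freq = defaultdict(int)
    degree = defaultdict(int)
    for phrase in candidates:
        for word in phrase:
            freq[word] += 1
            degree[word] += len(phrase)         # co-occurrences, counting the word itself
    return {w: degree[w] / freq[w] for w in freq}

def extract_keywords(text, top_fraction=1 / 3):
    """Rank candidate phrases by the sum of their word scores; keep the top portion."""
    candidates = candidate_keywords(text)
    scores = word_scores(candidates)
    phrase_scores = {" ".join(p): sum(scores[w] for w in p) for p in set(candidates)}
    ranked = sorted(phrase_scores.items(), key=lambda kv: (-kv[1], kv[0]))
    keep = max(1, round(len(ranked) * top_fraction))
    return ranked[:keep]
```

Called on the title of this record, `extract_keywords` ranks the four-word phrase between the sentence start and the stop word "for" highest, because each of its words co-occurs with three others.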

  7. Web search: how the Web has changed information retrieval

    Directory of Open Access Journals (Sweden)

    Brooks Terrence A.

    2003-01-01

    Full Text Available Topical metadata are simultaneously hailed as building blocks of the semantic Web and derogated as spam. The significance of the metadata controversy depends on the technological appropriateness of adding them to Web pages. A survey of Web technology suggests that Web pages are both transient and volatile: poor hosts of topical metadata. A more supportive environment exists in the closed Web. The vast majority of Web pages, however, exist in the open Web, an environment that challenges the application of legacy information retrieval concepts and methods.

  8. Interdisciplinarity and Computer Music Modeling and Information Retrieval

    DEFF Research Database (Denmark)

    Grund, Cynthia M.

    2006-01-01

    Abstract This paper takes a look at computer music modeling and information retrieval (CMMIR) from the point of view of the humanities with emphasis upon areas relevant to the philosophy of music. The desire for more interdisciplinary research involving CMMIR and the humanities is expressed...... and some specific positive experiences are cited which have given this author reason to believe that such cooperation is beneficial for both sides. A short list of some contemporary areas of interest in the philosophy of music is provided, and it is suggested that these could be interesting areas...

  9. Computer programs: Information retrieval and data analysis, a compilation

    Science.gov (United States)

    1972-01-01

    The items presented in this compilation are divided into two sections. Section one treats of computer usage devoted to the retrieval of information that affords the user rapid entry into voluminous collections of data on a selective basis. Section two is a more generalized collection of computer options for the user who needs to take such data and reduce it to an analytical study within a specific discipline. These programs, routines, and subroutines should prove useful to users who do not have access to more sophisticated and expensive computer software.

  10. Information direction, website reputation and eWOM effect: A moderating role of product type

    National Research Council Canada - National Science Library

    Park, Cheol; Lee, Thae Min

    2009-01-01

    ...) and a website's reputation (established vs. unestablished) contribute to the eWOM effect. The article describes a study focusing on the moderating role of the product type (search vs. experience...

  11. AN EFFECTIVE INFORMATION RETRIEVAL SYSTEM USING KEYWORD SEARCH TECHNIQUE

    Directory of Open Access Journals (Sweden)

    Dhananjay A. Gholap

    2015-10-01

    Full Text Available Keyword search is a technique used for retrieving data or information. In information retrieval, keyword search is a type of search method that looks for matching documents which contain one or more keywords specified by a user. Applying keyword search to relational databases has become an interesting research area within IR and relational database systems. Inferring and analyzing user search goals can be very valuable in improving search engine relevance and user experience. When a user submits a query on the internet, the search engine returns many results related to that query. These results may depend on metadata or on full-text indexing; because of this, the user needs to spend a lot of time finding the information of interest. Therefore, this project infers user search goals by analyzing search engine query logs. The system uses a framework that discovers different user search goals for a query by clustering the proposed feedback sessions.
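
The basic notion of keyword search mentioned here, returning documents that contain one or more of the user's keywords, can be illustrated with a small inverted index. This is a generic sketch, not the system described in the article; the function names `build_index` and `keyword_search`, the whitespace tokenizer and the ranking by number of matched keywords are assumptions made for the example.

```python
from collections import defaultdict

def build_index(docs):
    """Map each lowercase token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for raw in text.lower().split():
            token = raw.strip(".,;:!?")
            if token:
                index[token].add(doc_id)
    return index

def keyword_search(index, query):
    """Return ids of documents containing at least one query keyword,
    ordered by how many distinct keywords they match (ties by id)."""
    matches = defaultdict(int)
    for keyword in set(query.lower().split()):
        for doc_id in index.get(keyword, ()):
            matches[doc_id] += 1
    return sorted(matches, key=lambda d: (-matches[d], d))
```

A query log analysis such as the one the abstract proposes would sit on top of a retrieval layer like this one; the clustering of feedback sessions is not sketched here because the abstract gives no details of it.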

  12. Information retrieval patterns and needs among practicing general surgeons: a statewide experience.

    OpenAIRE

    Shelstad, K R; Clevenger, F W

    1996-01-01

    Information retrieval has progressed from a reliance on traditional print sources to the modern era of computer databases and online networks. Surgeons, many from remote areas not served by professional medical libraries, must develop and maintain skills in information retrieval and management in both electronic and standard formats. One hundred thirty-three New Mexico general surgeons were surveyed to identify their information-seeking patterns in five areas: retrieval purposes, retrieval so...

  13. JANE, A new information retrieval system for the Radiation Shielding Information Center

    Energy Technology Data Exchange (ETDEWEB)

    Trubey, D.K.

    1991-05-01

    A new information storage and retrieval system has been developed for the Radiation Shielding Information Center (RSIC) at Oak Ridge National Laboratory to replace mainframe systems that have become obsolete. The database contains citations and abstracts of literature which were selected by RSIC analysts and indexed with terms from a controlled vocabulary. The database, begun in 1963, has been maintained continuously since that time. The new system, called JANE, incorporates automatic indexing techniques and on-line retrieval using the RSIC Data General Eclipse MV/4000 minicomputer. Automatic indexing and retrieval techniques based on fuzzy-set theory allow the presentation of results in order of Retrieval Status Value. The fuzzy-set membership function depends on term frequency in the titles and abstracts and on Term Discrimination Values which indicate the resolving power of the individual terms. These values are determined by the Cover Coefficient method. The use of a commercial database to store and retrieve the indexing information permits rapid retrieval of the stored documents. Comparisons of the new and presently-used systems for actual searches of the literature indicate that it is practical to replace the mainframe systems with a minicomputer system similar to the present version of JANE. 18 refs., 10 figs.

  14. Mathematical, Logical, and Formal Methods in Information Retrieval: An Introduction to the Special Issue.

    Science.gov (United States)

    Crestani, Fabio; Dominich, Sandor; Lalmas, Mounia; van Rijsbergen, Cornelis Joost

    2003-01-01

    Discusses the importance of research on the use of mathematical, logical, and formal methods in information retrieval to help enhance retrieval effectiveness and clarify underlying concepts of information retrieval. Highlights include logic; probability; spaces; and future research needs. (Author/LRW)

  15. Non-Compositional Term Dependence for Information Retrieval

    DEFF Research Database (Denmark)

    Lioma, Christina; Simonsen, Jakob Grue; Larsen, Birger

    2015-01-01

    We present two novel models of document coherence and their application to information retrieval (IR). Both models approximate document coherence using discourse entities, e.g. the subject or object of a sentence. Our first model views text as a Markov process generating sequences of discourse...... entities (entity n-grams); we use the entropy of these entity n-grams to approximate the rate at which new information appears in text, reasoning that as more new words appear, the topic increasingly drifts and text coherence decreases. Our second model extends the work of Guinaudeau & Strube [28......] that represents text as a graph of discourse entities, linked by different relations, such as their distance or adjacency in text. We use several graph topology metrics to approximate different aspects of the discourse flow that can indicate coherence, such as the average clustering or betweenness of discourse...

  16. An introduction to the Marshall information retrieval and display system

    Science.gov (United States)

    1974-01-01

    An on-line terminal oriented data storage and retrieval system is presented which allows a user to extract and process information from stored data bases. The use of on-line terminals for extracting and displaying data from the data bases provides a fast and responsive method for obtaining needed information. The system consists of general purpose computer programs that provide the overall capabilities of the total system. The system can process any number of data files via a Dictionary (one for each file) which describes the data format to the system. New files may be added to the system at any time, and reprogramming is not required. Illustrations of the system are shown, and sample inquiries and responses are given.

  17. Information retrieval pathways for health information exchange in multiple care settings

    DEFF Research Database (Denmark)

    Kierkegaard, Patrick; Kaushal, Rainu; Vest, Joshua R.

    2014-01-01

    Objectives To determine which health information exchange (HIE) technologies and information retrieval pathways healthcare professionals relied on to meet their information needs in the context of laboratory test results, radiological images and reports, and medication histories. Study Design...... Primary data was collected over a 2-month period across 3 emergency departments, 7 primary care practices, and 2 public health clinics in New York state. Methods Qualitative research methods were used to collect and analyze data from semi-structured interviews and participant observation. Results...... The study reveals that healthcare professionals used a complex combination of information retrieval pathways for HIE to obtain clinical information from external organizations. The choice for each approach was setting- and information-specific, but was also highly dynamic across users and their information...

  18. An integrated Korean biodiversity and genetic information retrieval system.

    Science.gov (United States)

    Lim, Jeongheui; Bhak, Jong; Oh, Hee-Mock; Kim, Chang-Bae; Park, Yong-Ha; Paek, Woon Kee

    2008-12-12

    On-line biodiversity information databases are growing quickly and being integrated into general bioinformatics systems due to the advances of fast gene sequencing technologies and the Internet. These can reduce the cost and effort of performing biodiversity surveys and genetic searches, which allows scientists to spend more time researching and less time collecting and maintaining data. This will increase the rate of knowledge build-up and improve conservation. The biodiversity databases in Korea have been scattered among several institutes and local natural history museums with incompatible data types. Therefore, a comprehensive database and a nationwide web portal for biodiversity information is necessary in order to integrate diverse information resources, including molecular and genomic databases. The Korean Natural History Research Information System (NARIS) was built and serviced as the central biodiversity information system to collect and integrate the biodiversity data of various institutes and natural history museums in Korea. This database aims to be an integrated resource that contains additional biological information, such as genome sequences and molecular level diversity. Currently, twelve institutes and museums in Korea are integrated by the DiGIR (Distributed Generic Information Retrieval) protocol, with Darwin Core 2.0 format as its metadata standard for data exchange. Data quality control and statistical analysis functions have been implemented. In particular, integrating molecular and genetic information from the National Center for Biotechnology Information (NCBI) databases with NARIS was recently accomplished. NARIS can also be extended to accommodate other institutes abroad, and the whole system can be exported to establish local biodiversity management servers. A Korean data portal, NARIS, has been developed to efficiently manage and utilize biodiversity data, which includes genetic resources.
NARIS aims to be integral in maximizing

  19. Visualization of website's link structure and information used in website

    Institute of Scientific and Technical Information of China (English)

    汪洁; 陈韬; 罗月童

    2012-01-01

    With the advent of the information age, it has become more and more difficult for users to locate information on websites, whose volume has grown explosively in recent years. As a result, website navigation based on information visualization has become a main method to solve this problem. Because of the complexity and variety of website information, it is difficult to present all kinds of website information together. The Radial View tree layout algorithm is used to draw the website topological structure composed of hyperlink information, and a visualization rule is then put forward that adds, on top of the structure diagram, a visualization of usage information such as web page association and popularity ("heat"). The effect of website information visualization in assisting users to navigate websites is analyzed on the basis of a practical application to a fusion database website.

  20. Engaging patients through your website.

    Science.gov (United States)

    Snyder, Kimberlee; Ornes, Lynne L; Paulson, Pat

    2014-01-01

    Legislation requires the healthcare industry to directly engage patients through technology. This paper proposes a model that can be used to review hospital websites for features that engage patients in their healthcare. The model describes four levels of patient engagement in website design. The sample consisted of 130 hospital websites from hospitals listed on 2010 and 2011 Most Wired Hospitals. Hospital websites were analyzed for features that encouraged patient interaction with their healthcare according to the levels in the model. Of the four levels identified in the model, websites ranged from "informing" to "collaborative" in website design. There was great variation of features offered on hospital websites with few being engaging and interactive.

  1. Website updates

    Data.gov (United States)

    National Aeronautics and Space Administration — Updates to Website: (Please add new items at the top of this description with the date of the website change) May 9, 2012: Uploaded experimental data in matlab...

  2. Contextual and Conceptual Information Retrieval and Navigation on the Web

    Science.gov (United States)

    Le Grand, Bénédicte; Aufaure, Marie-Aude; Soto, Michel

    The goal of this chapter is to propose a methodology and tools to enhance information retrieval and navigation on the Web through contextual and conceptual help. This methodology provides users with an extended navigation space by adding a conceptual and a semantic layer above Web data. The conceptual layer is made of Galois lattices which cluster Web pages into concepts according to their common features (in particular their textual content). These lattices represent the Global Conceptual Context of Web pages. An additional navigation layer is provided by ontologies which are connected to the conceptual level through specific concepts of the lattices. Users may navigate transparently within each of these three layers and go from one to another very easily.

  3. Issues in the use of neural networks in information retrieval

    CERN Document Server

    Iatan, Iuliana F

    2017-01-01

    This book highlights the ability of neural networks (NNs) to be excellent pattern matchers and their importance in information retrieval (IR), which is based on index term matching. The book defines a new NN-based method for learning image similarity and describes how to use fuzzy Gaussian neural networks to predict personality. It introduces the fuzzy Clifford Gaussian network, and two concurrent neural models: (1) concurrent fuzzy nonlinear perceptron modules, and (2) concurrent fuzzy Gaussian neural network modules. Furthermore, it explains the design of a new model of fuzzy nonlinear perceptron based on alpha level sets and describes a recurrent fuzzy neural network model with a learning algorithm based on the improved particle swarm optimization method.

  4. Algebraic Modeling of Information Retrieval in XML Documents

    Science.gov (United States)

    Georgiev, Bozhidar; Georgieva, Adriana

    2009-11-01

    This paper presents an approach to information retrieval in XML documents using tools based on linear algebra. Well-known transformation languages such as XSLT (XPath) are grounded on the features of higher-order logic for manipulating hierarchical trees. The presented conception is compared to existing higher-order logic formalisms, where the queries are realized by both languages, XSLT and XPath. The proposed linear algebraic model, combined with hierarchical data models, permits more efficient solutions for searching, extracting and manipulating semi-structured data with hierarchical structures, avoiding global navigation over the XML tree components. The main purpose of this algebraic model representation, applied to the hierarchical relationships in the XML data structures, is to make the implementation of linear algebra tools possible for XML data manipulations, to eliminate existing problems related to regular grammar theory, and to avoid the difficulties connected with higher-order logic (first-order logic, monadic second-order logic etc.).

  5. Efficient hardware-based private information retrieval using partial reshuffle

    Institute of Scientific and Technical Information of China (English)

    Lan Tian; Qin Zhiguang

    2010-01-01

    The paper proposes a novel hardware-based private information retrieval (HWPIR) protocol. By partially reshuffling previously accessed items in each round, instead of frequently reshuffling the whole database, the scheme makes better use of shuffled data copies and achieves a computation overhead of O(√(N/k)), where N and k are the sizes of the database and the secure storage respectively. For secure storage of moderate size, e.g. k = O(√N), the overhead is O(⁴√N). The result is much better than the state-of-the-art schemes (as compared to e.g. O(log² N)). Without increasing response time and communication cost, the proposed protocol is truly practicable regardless of the database size. The security and performance of the protocol are formally analyzed.
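
The step from the general overhead to the moderate-storage case can be written out explicitly. The derivation below assumes the stated overhead is read as the square root of N/k, which is the reading under which the two figures in the abstract agree; the constant c and the numeric example are illustrative and not taken from the paper.

```latex
\[
  k = c\sqrt{N}
  \;\Longrightarrow\;
  \sqrt{\frac{N}{k}}
  = \sqrt{\frac{N}{c\sqrt{N}}}
  = c^{-1/2}\, N^{1/4}
  = O\!\left(\sqrt[4]{N}\right).
\]
% Illustrative numbers (not from the paper): N = 10^8 and k = 10^4 give
% \sqrt{N/k} = \sqrt{10^4} = 100 = (10^8)^{1/4}.
```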

  6. How to retrieve additional information from the multiplicity distributions

    CERN Document Server

    Wilka, Grzegorz

    2016-01-01

    Multiplicity distributions $P(N)$ measured in multiparticle production processes are most frequently described by the Negative Binomial Distribution (NBD). However, with increasing collision energy some systematic discrepancies become more and more apparent. They are usually attributed to the possible multi-source structure of the production process and described using a multi-NBD form of the multiplicity distribution. We investigate the possibility of keeping a single NBD but with its parameters depending on the multiplicity $N$. This is done by modifying the widely known clan model of particle production leading to the NBD form of $P(N)$. This is then confronted with the approach based on the so-called cascade-stochastic formalism which is based on different types of recurrence relations defining $P(N)$. We demonstrate that a combination of both approaches allows the retrieval of additional valuable information from the multiplicity distributions, namely the oscillatory behavior of the counting statistics a...
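
For reference, the standard single-NBD form and the recurrence relation it satisfies can be written as follows. The symbols k (shape parameter), p and the function g(N) are conventional notation introduced here for illustration; the abstract itself does not fix a parametrization.

```latex
\[
  P(N) = \frac{\Gamma(N+k)}{\Gamma(N+1)\,\Gamma(k)}\; p^{N} (1-p)^{k},
  \qquad
  p = \frac{\langle N\rangle}{\langle N\rangle + k},
\]
\[
  (N+1)\,P(N+1) = g(N)\,P(N),
  \qquad
  g(N) = p\,(k+N).
\]
```

With constant k and p, the function g(N) is linear in N, and the mean follows as kp/(1-p), which reduces to the average multiplicity under the definition of p above.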

  7. Using Context to Improve the Evaluation of Information Retrieval Systems

    CERN Document Server

    Bouramoul, Abdelkrim; Doan, Bich-Lien; 10.5121/ijdms.2011.3202

    2011-01-01

    Evaluation plays a crucial role in the development of information retrieval tools: it provides evidence for improving the performance of these tools and the quality of the results they return. However, the classic evaluation approaches have limitations and shortcomings, especially regarding consideration of the user, the measure of the adequacy between the query and the returned documents, and the consideration of the characteristics, specifications and behaviors of the search tool. Therefore, we believe that the exploitation of contextual elements could be a very good way to evaluate search tools. This paper presents a new approach that takes the context into account during the evaluation process at three complementary levels. The experiments given at the end of this article have shown the applicability of the proposed approach to real search tools. The tests were performed with the most popular search engines (i.e. Google, Bing and Yahoo), selected in particular for their high selectivity. The obtaine...

  8. Cross Lingual Information Retrieval With SMT And Query Mining

    Directory of Open Access Journals (Sweden)

    Suneet Kumar Gupta

    2011-10-01

    Full Text Available In this paper, we take an English corpus and queries in both translated and transliterated form. We use a statistical machine translator to obtain results for the translated and transliterated queries and then analyze those results. The query-wise results are then mined, and a new list of queries is created. We have designed an experimental setup consisting of several steps that calculate Mean Average Precision. We used the open-source Terrier platform for information retrieval. On the basis of the new query list, we calculate the Mean Average Precision and obtain a significant result, i.e. 93.24%, which is very close to the monolingual results calculated for the English language.
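
    The abstract reports Mean Average Precision (MAP) without defining it. As a minimal sketch of the standard definition (not the authors' Terrier setup), the Python below averages the precision at each relevant hit per query and then averages over queries. The function names and the toy document IDs are illustrative assumptions.

```python
def average_precision(ranked_ids, relevant_ids):
    """Average precision for one query: mean of precision@rank at each relevant hit."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """runs: list of (ranked_ids, relevant_ids) pairs, one pair per query."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Hypothetical toy data: two queries with made-up document IDs.
toy_runs = [
    (["d1", "d2", "d3", "d4"], {"d1", "d3"}),
    (["d5", "d6", "d7"], {"d6"}),
]
toy_map = mean_average_precision(toy_runs)
```

    With this toy data the first query has average precision 5/6 and the second 1/2, so the MAP is 2/3.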

  9. Challenging Conventional Assumptions of Automated Information Retrieval with Real Users: Boolean Searching and Batch Retrieval Evaluations.

    Science.gov (United States)

    Hersh, William; Turpin, Andrew; Price, Susan; Kraemer, Dale; Olson, Daniel; Chan, Benjamin; Sacherek, Lynetta

    2001-01-01

    Describes research conducted at the TREC (Text Retrieval Conference) interactive track that compared Boolean and natural language searching, showing they achieved comparable results; and assessed the validity of batch-oriented retrieval evaluations, showing that the results from batch evaluations were not comparable to those obtained in…

  10. Teaching information retrieval using research questions to encourage creativity and assess understanding

    OpenAIRE

    Jones, Gareth J.F.

    2007-01-01

    The study of information retrieval has increased in interest and importance with the explosive growth of online information in recent years. Learning about information retrieval within formal courses of study enables users of search engines to use them more knowledgeably and effectively, while providing the starting point for the explorations of new researchers into novel search technologies. The nature of information retrieval as a topic also makes it an ideal subject for develop...

  11. The challenge of automated tutoring in Web-based learning environments for information retrieval instruction

    OpenAIRE

    Sormunen, Eero; Pennanen, Sami

    2004-01-01

    The need to enhance information literacy education increases demand for effective Web-based learning environments for information retrieval instruction. The paper introduces the Query Performance Analyser, a unique instructional tool for information retrieval learning environments. On top of an information retrieval system and within a given search assignment, the Query Performance Analyser supports learning by instantly visualizing achieved query performance. Although the Query Performance A...

  12. Cross-language information retrieval using PARAFAC2.

    Energy Technology Data Exchange (ETDEWEB)

    Bader, Brett William; Chew, Peter; Abdelali, Ahmed (New Mexico State University, Las Cruces, NM); Kolda, Tamara Gibson

    2007-05-01

    A standard approach to cross-language information retrieval (CLIR) uses Latent Semantic Analysis (LSA) in conjunction with a multilingual parallel aligned corpus. This approach has been shown to be successful in identifying similar documents across languages - or more precisely, retrieving the most similar document in one language to a query in another language. However, the approach has severe drawbacks when applied to a related task, that of clustering documents 'language-independently', so that documents about similar topics end up closest to one another in the semantic space regardless of their language. The problem is that documents are generally more similar to other documents in the same language than they are to documents in a different language, but on the same topic. As a result, when using multilingual LSA, documents will in practice cluster by language, not by topic. We propose a novel application of PARAFAC2 (which is a variant of PARAFAC, a multi-way generalization of the singular value decomposition [SVD]) to overcome this problem. Instead of forming a single multilingual term-by-document matrix which, under LSA, is subjected to SVD, we form an irregular three-way array, each slice of which is a separate term-by-document matrix for a single language in the parallel corpus. The goal is to compute an SVD for each language such that V (the matrix of right singular vectors) is the same across all languages. Effectively, PARAFAC2 imposes the constraint, not present in standard LSA, that the 'concepts' in all documents in the parallel corpus are the same regardless of language. Intuitively, this constraint makes sense, since the whole purpose of using a parallel corpus is that exactly the same concepts are expressed in the translations. We tested this approach by comparing the performance of PARAFAC2 with standard LSA in solving a particular CLIR problem. From our results, we conclude that PARAFAC2 offers a very promising alternative to

  13. Card sorting to evaluate the robustness of the information architecture of a protocol website

    NARCIS (Netherlands)

    Wentzel, J.; Müller, F.; Jong, de N.; Gemert-Pijnen, van J.

    2016-01-01

    Objectives A website on Methicillin-Resistant Staphylococcus Aureus, MRSA-net, was developed for Health Care Workers (HCWs) and the general public, in German and in Dutch. The website’s content was based on existing protocols and its structure was based on a card sort study. A Human Centered Design

  14. The EFSUMB website, a great source for ultrasound information and education.

    Science.gov (United States)

    Dietrich, Christoph F; Rudd, Lynne; Saftiou, Adrian; Gilja, Odd Helge

    2017-01-31

    The aim of this updated EFSUMB-website guide is to introduce readers to EFSUMB's wide ranging activities. The most recent are the guidelines on interventional ultrasound and intestinal ultrasound and updated CEUS Non-Liver and Elastography Liver Guidelines which can be freely downloaded. Hosting eBooks on our website is another new departure, most importantly the EFSUMB Course Book on Ultrasound available in a second edition as an eReader and an online Student Edition of the ECB. EFSUMB has been active with updating Guidelines; those mentioned above have all been revised or written in the last two years. Webinars have been introduced and participation is possible online but can be reviewed later along with recent recordings of Euroson Schools. The EFSUMB Newsletter in the EJU promotes our activities and topical articles intended to reach all our members with the online version hosted on our website. The Case of the Month continues to be one of EFSUMB's most visited sites and in the last few years has been translated into 14 different languages including Chinese. In conclusion, this article aims to provide an updated guide to the website educational sites of the European Federation of Societies for Ultrasound in Medicine and Biology (EFSUMB).

  15. Development of E-Info geneca: a website providing computer-tailored information and question prompt prior to breast cancer genetic counseling.

    NARCIS (Netherlands)

    Albada, A.; Dulmen, S. van; Otten, R.; Bensing, J.M.; Ausems, M.G.E.M.

    2009-01-01

    This article describes the stepwise development of the website ‘E-info geneca’. The website provides counselees in breast cancer genetic counseling with computer-tailored information and a question prompt prior to their first consultation. Counselees generally do not know what to expect from genetic

  16. A Real-Time and Dynamic Biological Information Retrieval and Analysis System (BIRAS)

    Institute of Scientific and Technical Information of China (English)

    Qi Zhou; Hong Zhang; Meiying Geng; Chenggang Zhang

    2003-01-01

    The aim of this study is to design a biological information retrieval and analysis system (BIRAS) based on the Internet. Using a specific network protocol, BIRAS can send information to and receive information from the Entrez search and retrieval system maintained by the National Center for Biotechnology Information (NCBI) in the USA. Literature, nucleotide sequences, protein sequences, and other resources matching a user-defined term can then be retrieved and sent to the user automatically by pop-up message or by e-mail. All the information retrieval and analysis processes are done in real time. As a robust system for intelligently and dynamically retrieving and analyzing user-defined information, BIRAS is expected to be used extensively to retrieve specific information from the large number of biological databases available nowadays. The program is available on request from the corresponding author.

  17. A Real-Time and Dynamic Biological Information Retrieval and Analysis System (BIRAS)

    Institute of Scientific and Technical Information of China (English)

    Qi Zhou; Hong Zhang; Meiying Geng; Chenggang Zhang

    2003-01-01

    The aim of this study is to design a biological information retrieval and analysis system (BIRAS) based on the Internet. Using a specific network protocol, BIRAS can send information to and receive information from the Entrez search and retrieval system maintained by the National Center for Biotechnology Information (NCBI) in the USA. Literature, nucleotide sequences, protein sequences, and other resources matching a user-defined term can then be retrieved and sent to the user automatically by pop-up message or by e-mail. All the information retrieval and analysis processes are done in real time. As a robust system for intelligently and dynamically retrieving and analyzing user-defined information, BIRAS is expected to be used extensively to retrieve specific information from the large number of biological databases available nowadays. The program is available on request from the corresponding author.

  18. Automatic Content Analysis; Part I of Scientific Report No. ISR-18, Information Storage and Retrieval...

    Science.gov (United States)

    Cornell Univ., Ithaca, NY. Dept. of Computer Science.

    Four papers are included in Part One of the eighteenth report on Salton's Magical Automatic Retriever of Texts (SMART) project. The first paper: "Content Analysis in Information Retrieval" by S. F. Weiss presents the results of experiments aimed at determining the conditions under which content analysis improves retrieval results as well…

  19. A Fuzzy Genetic Algorithm Approach to an Adaptive Information Retrieval Agent.

    Science.gov (United States)

    Martin-Bautista, Maria J.; Vila, Maria-Amparo; Larsen, Henrik Legind

    1999-01-01

    Presents an approach to a Genetic Information Retrieval Agent Filter (GIRAF) that filters and ranks documents retrieved from the Internet according to users' preferences by using a Genetic Algorithm and fuzzy set theory to handle the imprecision of users' preferences and users' evaluation of the retrieved documents. (Author/LRW)

  20. Organization of the Inverted Files in a Distributed Information Retrieval System Based on Thesauri.

    Science.gov (United States)

    Mazur, Zygmunt

    1986-01-01

    Describes how operations on local inverted files are to be modified in order to use them in distributed information retrieval systems based on thesauri. The presented rules may be viewed as the logical approach in implementing a distributed retrieval system consisting of n local retrieval systems. (Author/MBR)

  1. Query-Time Optimization Techniques for Structured Queries in Information Retrieval

    Science.gov (United States)

    Cartright, Marc-Allen

    2013-01-01

    The use of information retrieval (IR) systems is evolving towards larger, more complicated queries. Both the IR industrial and research communities have generated significant evidence indicating that in order to continue improving retrieval effectiveness, increases in retrieval model complexity may be unavoidable. From an operational perspective,…

  2. Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

    DEFF Research Database (Denmark)

    Cao, Xin; Cong, Gao; Cui, Bin

    2012-01-01

    of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. This article presents several new approaches to exploiting the category information of questions for improving the performance of question retrieval...

  3. Entropy of the Information Retrieved from Black Holes

    CERN Document Server

    Mersini-Houghton, Laura

    2015-01-01

    The retrieval of black hole information was recently presented in two interesting proposals in the 'Hawking Radiation' conference: a revised version by G. 't Hooft of a proposal he initially suggested 20 years ago and, a new proposal by S. Hawking. Both proposals address the problem of black hole information loss at the classical level and derive an expression for the scattering matrix. The former uses gravitation back reaction of incoming particles that imprints its information on the outgoing modes. The latter uses supertranslation symmetry of horizons to relate a phase delay of the outgoing wave packet compared to their incoming wave partners. The difficulty in both proposals is that the entropy obtained from them appears to be infinite. By including quantum effects into the Hawking and 't Hooft's proposals, I show that a subtlety arising from the inescapable measurement process, the Quantum Zeno Effect, not only tames divergences but it actually recovers the correct $1/4$ of the area Bekenstein-Hawking en...

  4. Analyzing traffic source impact on returning visitors ratio in information provider website

    Science.gov (United States)

    Prasetio, A.; Sari, P. K.; Sharif, O. O.; Sofyan, E.

    2016-04-01

    Web site performance, especially the number of returning visitors, is an important metric for an information provider web site. Since a high share of returning visitors is a good indication of visitor loyalty, it is important to find a way to improve this metric. This research investigated whether the returning visitor metric differs among three web traffic sources, namely direct, referral and search. Monthly returning visitors and total visitors from each source were retrieved from Google Analytics and used to calculate the returning visitor ratio. The period of data observation was July 2012 to June 2015, resulting in a total of 108 samples. These data were then analysed using one-way analysis of variance (ANOVA) to address our research question. The results showed that different traffic sources have significantly different returning visitor ratios, especially between the referral traffic source and the other two traffic sources. On the other hand, this research did not find any significant difference between the returning visitor ratios of the direct and search traffic sources. The owner of the web site can focus on multiplying referral links from other relevant sites.
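
    The analysis described above can be sketched in a few lines. The Python below computes a returning visitor ratio and the one-way ANOVA F statistic (between-group mean square divided by within-group mean square). The visitor counts and group values are invented for illustration and are not the study's data; the function names are assumptions.

```python
def returning_visitor_ratio(returning, total):
    """Share of visits in a month that came from returning visitors."""
    return returning / total if total else 0.0


def one_way_anova_f(groups):
    """F statistic for one-way ANOVA over a list of groups of observations."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    means = [sum(g) / len(g) for g in groups]
    grand_mean = sum(sum(g) for g in groups) / n_total
    # Variation of group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Variation of observations around their own group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


# Hypothetical monthly figures (returning, total) for one source.
example_ratio = returning_visitor_ratio(returning=300, total=1200)

# Hypothetical groups standing in for direct, referral and search ratios.
f_stat = one_way_anova_f([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
```

    In the study each group would hold 36 monthly ratios, one group per traffic source. The F statistic would then be compared against the F distribution with 2 and 105 degrees of freedom, and a post-hoc test would identify which pairs of sources differ.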

  5. Next-Generation Search Engines for Information Retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Devarakonda, Ranjeet [ORNL; Hook, Leslie A [ORNL; Palanisamy, Giri [ORNL; Green, James M [ORNL

    2011-01-01

    centralized index. The harvested files are indexed consistently against the SOLR search API, so that it can provide search capabilities such as simple, fielded, spatial and temporal searches across a span of projects ranging across land, atmosphere, and ocean ecology. Mercury also provides data sharing capabilities using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). In this paper we will discuss the best practices for archiving data and metadata, new searching techniques, efficient ways of data retrieval and information display.

  6. A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering.

    Science.gov (United States)

    Sarrouti, Mourad; Ouatik El Alaoui, Said

    2017-04-01

    Passage retrieval, the identification of top-ranked passages that may contain the answer for a given biomedical question, is a crucial component of any biomedical question answering (QA) system. Passage retrieval in open-domain QA is a longstanding challenge widely studied over the last decades. However, it still requires further effort in biomedical QA. In this paper, we present a new biomedical passage retrieval method based on Stanford CoreNLP sentence/passage length, a probabilistic information retrieval (IR) model and UMLS concepts. In the proposed method, we first use our document retrieval system, based on the PubMed search engine and UMLS similarity, to retrieve documents relevant to a given biomedical question. We then take the abstracts from the retrieved documents and use the Stanford CoreNLP sentence splitter to produce a set of sentences, i.e., candidate passages. Using stemmed words and UMLS concepts as features for the BM25 model, we finally compute the similarity scores between the biomedical question and each of the candidate passages and keep the N top-ranked ones. Experimental evaluations performed on large standard datasets, provided by the BioASQ challenge, show that the proposed method performs well compared with the current state-of-the-art methods. The proposed method significantly outperforms the current state-of-the-art methods by an average of 6.84% in terms of mean average precision (MAP). We have proposed an efficient passage retrieval method which can be used to retrieve relevant passages in biomedical QA systems with high mean average precision. Copyright © 2017 Elsevier Inc. All rights reserved.
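
    The final ranking step described above (score each candidate passage against the question with BM25, keep the N best) can be sketched as follows. This is a generic Okapi BM25 with a commonly used non-negative IDF variant, operating on plain lowercase tokens. The stemming and UMLS concept features of the paper are not reproduced, and the parameter values k1=1.2 and b=0.75, the function names and the toy passages are all assumptions.

```python
import math
from collections import Counter


def bm25_scores(query_tokens, passages, k1=1.2, b=0.75):
    """Score each tokenized passage against the query with Okapi BM25."""
    if not passages:
        return []
    n = len(passages)
    avg_len = sum(len(p) for p in passages) / n
    # Document frequency: number of passages containing each term.
    doc_freq = Counter()
    for p in passages:
        doc_freq.update(set(p))
    scores = []
    for p in passages:
        tf = Counter(p)
        score = 0.0
        for term in query_tokens:
            if term not in tf:
                continue
            idf = math.log((n - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5) + 1.0)
            length_norm = k1 * (1 - b + b * len(p) / avg_len)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + length_norm)
        scores.append(score)
    return scores


def top_n_passages(query_tokens, passages, n_top):
    """Indices of the n_top highest-scoring passages, best first."""
    scores = bm25_scores(query_tokens, passages)
    order = sorted(range(len(passages)), key=lambda i: scores[i], reverse=True)
    return order[:n_top]


# Hypothetical candidate passages, already tokenized.
toy_passages = [
    ["aspirin", "reduces", "fever"],
    ["insulin", "regulates", "glucose"],
    ["aspirin", "aspirin", "inhibits", "cox"],
]
toy_query = ["aspirin", "fever"]
best = top_n_passages(toy_query, toy_passages, n_top=2)
```

    Here the first passage ranks highest because it matches both query terms, including the rarer term, while the passage with no query term scores zero.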

  7. Trust in prescription drug brand websites: website trust cues, attitude toward the website, and behavioral intentions.

    Science.gov (United States)

    Huh, Jisu; Shin, Wonsun

    2014-01-01

    Direct-to-consumer (DTC) prescription drug brand websites, as a form of DTC advertising, are receiving increasing attention due to their growing number and importance as an advertising and consumer information source. This study examined consumer trust in a DTC website as an important factor influencing consumers' attitude toward the website and behavioral intention. Applying the conceptual framework of website trust, the particular focus of investigation was the effect of the website trust cue factor on consumers' perceived DTC website trust and subsequent attitudinal and behavioral responses. Results show a significant relation between the website trust cue factor and consumers' perceived DTC website trust. Perceived DTC website trust, in turn, was found to be significantly associated with consumers' attitude toward the DTC website and behavioral intention.

  8. Improving throughput and user experience for information intensive websites by applying HTTP compression technique.

    Science.gov (United States)

    Malla, Ratnakar

    2008-11-06

    HTTP compression is a technique specified as part of the W3C HTTP 1.0 standard. It allows HTTP servers to take advantage of GZIP compression technology that is built into the latest browsers. A brief survey of medical informatics websites shows that compression is not enabled. With compression enabled, downloaded file sizes are reduced by more than 50% and typical transaction time is also reduced from 20 to 8 minutes, thus providing a better user experience.
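
    As a rough illustration of why compression helps text-heavy pages, the sketch below gzips a repetitive HTML-like payload with Python's standard `gzip` module and reports the size ratio. In a real exchange the client advertises support with an `Accept-Encoding: gzip` request header and the server labels the compressed body with `Content-Encoding: gzip`. The payload is invented and the ratio it yields says nothing about the figures reported in the abstract.

```python
import gzip


def compression_ratio(payload: bytes) -> float:
    """Compressed size divided by original size (smaller is better)."""
    if not payload:
        return 1.0
    return len(gzip.compress(payload)) / len(payload)


# Hypothetical response body: a table with many similar rows.
html_body = ("<tr><td>patient record</td><td>value</td></tr>\n" * 500).encode("utf-8")

# What a server would send when the client accepts gzip.
compressed_body = gzip.compress(html_body)

# What the browser does on receipt: transparent decompression.
restored_body = gzip.decompress(compressed_body)

ratio = compression_ratio(html_body)
```

    Markup and tabular text are highly repetitive, so they typically compress far better than already-compressed content such as JPEG images.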

  9. Shared vision, shared vulnerability: A content analysis of corporate social responsibility information on tobacco industry websites.

    Science.gov (United States)

    McDaniel, Patricia A; Cadman, Brie; Malone, Ruth E

    2016-08-01

    Tobacco companies rely on corporate social responsibility (CSR) initiatives to improve their public image and advance their political objectives, which include thwarting or undermining tobacco control policies. For these reasons, implementation guidelines for the World Health Organization's Framework Convention on Tobacco Control (FCTC) recommend curtailing or prohibiting tobacco industry CSR. To understand how and where major tobacco companies focus their CSR resources, we explored CSR-related content on 4 US and 4 multinational tobacco company websites in February 2014. The websites described a range of CSR-related activities, many common across all companies, and no programs were unique to a particular company. The websites mentioned CSR activities in 58 countries, representing nearly every region of the world. Tobacco companies appear to have a shared vision about what constitutes CSR, due perhaps to shared vulnerabilities. Most countries that host tobacco company CSR programs are parties to the FCTC, highlighting the need for full implementation of the treaty, and for funding to monitor CSR activity, replace industry philanthropy, and enforce existing bans. Copyright © 2016 Elsevier Inc. All rights reserved.

  10. Disposal of Information Seeking and Retrieval Research: Replacement with a Radical Proposition

    Science.gov (United States)

    Budd, John M.; Anstaett, Ashley

    2013-01-01

    Introduction: Research and theory on the topics of information seeking and retrieval have been plagued by some fundamental problems for several decades. Many of the difficulties spring from mechanistic and instrumental thinking and modelling. Method: Existing models of information retrieval and information seeking are examined for efficacy in a…

  11. Comparing the quality of accessing medical literature using content-based visual and textual information retrieval

    Science.gov (United States)

    Müller, Henning; Kalpathy-Cramer, Jayashree; Kahn, Charles E., Jr.; Hersh, William

    2009-02-01

    Content-based visual information (or image) retrieval (CBIR) has been an extremely active research domain within medical imaging over the past ten years, with the goal of improving the management of visual medical information. Many technical solutions have been proposed, and application scenarios for image retrieval as well as image classification have been set up. However, in contrast to medical information retrieval using textual methods, visual retrieval has only rarely been applied in clinical practice. This is despite the large amount and variety of visual information produced in hospitals every day. This information overload imposes a significant burden upon clinicians, and CBIR technologies have the potential to help the situation. However, in order for CBIR to become an accepted clinical tool, it must demonstrate a higher level of technical maturity than it has to date. Since 2004, the ImageCLEF benchmark has included a task for the comparison of visual information retrieval algorithms for medical applications. In 2005, a task for medical image classification was introduced and both tasks have been run successfully for the past four years. These benchmarks allow an annual comparison of visual retrieval techniques based on the same data sets and the same query tasks, enabling the meaningful comparison of various retrieval techniques. The datasets used from 2004-2007 contained images and annotations from medical teaching files. In 2008, however, the dataset used was made up of 67,000 images (along with their associated figure captions and the full text of their corresponding articles) from two Radiological Society of North America (RSNA) scientific journals. This article describes the results of the medical image retrieval task of the ImageCLEF 2008 evaluation campaign. We compare the retrieval results of both visual and textual information retrieval systems from 15 research groups on the aforementioned data set. The results show clearly that, currently

  12. Determining the Content of a Pediatric Asthma Website from Parents’ Perspective: The Internet Use and Information Needs

    Directory of Open Access Journals (Sweden)

    Rezvan Ansari

    2017-06-01

    Full Text Available Background The acquisition of knowledge by parents of children with asthma plays an important role in the treatment of children. Thus, it is important to understand their needs and provide this information through available methods such as a website. The aim of this study was to determine the content of a pediatric asthma website based on an evaluation of parents' information needs. Materials and Methods This cross-sectional study was conducted using a descriptive-analytical approach in Kerman, Iran. Data were collected using a semi-structured questionnaire. The questionnaire was distributed among a sample of 300 parents visiting allergy and asthma specialists’ offices. Three experts confirmed the validity of the questionnaire. The reliability of the questionnaire was confirmed using the test-retest method on 40 participants (r = 0.82). Data were analyzed using descriptive and analytical statistics with SPSS version 20.0 software. Results Participants demanded information concerning asthma nutrition (79.0%), prevention (78.1%), treatment (77.1%), medications (72.4%), as well as general information (71.4%) and information about the etiology of the disease (70.5%), respectively. The results showed that fathers use the Internet significantly more than mothers (p=0.0001). There was a statistically significant relationship between participants’ educational level and the type of resources they use to obtain information (P

  13. A Problem in Information Retrieval with Fuzzy Sets.

    Science.gov (United States)

    Buell, Duncan A.

    1985-01-01

    Discussion of problems with fuzzy subsets in document retrieval highlights attempts to invent a system of weighted fuzzy queries in which weights correspond to relative importance of each term in query as whole, and use of Kantor's Logic for Retrieval as an alternative to Boolean queries. Six references are cited. (EJS)

  14. A method for the design and development of medical or health care information websites to optimize search engine results page rankings on Google.

    Science.gov (United States)

    Dunne, Suzanne; Cummins, Niamh Maria; Hannigan, Ailish; Shannon, Bill; Dunne, Colum; Cullen, Walter

    2013-08-27

    The Internet is a widely used source of information for patients searching for medical/health care information. While many studies have assessed existing medical/health care information on the Internet, relatively few have examined methods for design and delivery of such websites, particularly those aimed at the general public. This study describes a method of evaluating material for new medical/health care websites, or for assessing those already in existence, which is correlated with higher rankings on Google's Search Engine Results Pages (SERPs). A website quality assessment (WQA) tool was developed using criteria related to the quality of the information to be contained in the website in addition to an assessment of the readability of the text. This was retrospectively applied to assess existing websites that provide information about generic medicines. The reproducibility of the WQA tool and its predictive validity were assessed in this study. The WQA tool demonstrated very high reproducibility (intraclass correlation coefficient=0.95) between 2 independent users. A moderate to strong correlation was found between WQA scores and rankings on Google SERPs. Analogous correlations were seen between rankings and readability of websites as determined by Flesch Reading Ease and Flesch-Kincaid Grade Level scores. The use of the WQA tool developed in this study is recommended as part of the design phase of a medical or health care information provision website, along with assessment of readability of the material to be used. This may ensure that the website performs better on Google searches. The tool can also be used retrospectively to make improvements to existing websites, thus, potentially enabling better Google search result positions without incurring the costs associated with Search Engine Optimization (SEO) professionals or paid promotion.
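
    The two readability scores named above have standard published formulas: Flesch Reading Ease = 206.835 - 1.015 (words/sentences) - 84.6 (syllables/words), and Flesch-Kincaid Grade Level = 0.39 (words/sentences) + 11.8 (syllables/words) - 15.59. The sketch below applies them with a deliberately naive syllable counter (groups of consecutive vowels), which is an assumption of this example and not the procedure used in the study. The sample sentences are invented.

```python
import re


def count_syllables(word):
    """Naive heuristic: one syllable per group of consecutive vowels."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def readability(text):
    """Return (Flesch Reading Ease, Flesch-Kincaid Grade Level) for English text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    reading_ease = 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word
    grade_level = 0.39 * words_per_sentence + 11.8 * syllables_per_word - 15.59
    return reading_ease, grade_level


# Hypothetical sample texts.
plain_text = "The cat sat. The dog ran."
dense_text = ("Pharmacokinetic bioequivalence determination necessitates "
              "comprehensive statistical evaluation.")

plain_ease, plain_grade = readability(plain_text)
dense_ease, dense_grade = readability(dense_text)
```

    Higher Reading Ease and lower Grade Level both indicate text that is easier to read, so the plain sample scores as easier than the dense one on both measures.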

  15. Contextual Information Retrieval based on Algorithmic Information Theory and Statistical Outlier Detection

    CERN Document Server

    Martinez, Rafael; Rodriguez, Francisco de Borja; Camacho, David

    2007-01-01

    The main contribution of this paper is the design of an Information Retrieval (IR) technique based on Algorithmic Information Theory (using the Normalized Compression Distance, NCD), statistical techniques (outliers), and a novel organization of the database structure. The paper shows how they can be integrated to retrieve information from generic databases using long (text-based) queries. Two important problems are analyzed in the paper. On the one hand, how to detect "false positives" when the distance among the documents is very low and there is actual similarity. On the other hand, we propose a way to structure a document database in which the similarity distance estimation depends on the length of the selected text. Finally, the experimental evaluations that have been carried out to study the previous problems are shown.
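
    The Normalized Compression Distance is usually defined as NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C(.) is the compressed length under some real compressor. The sketch below uses `zlib` as that compressor, which is an assumption of this example; the abstract does not say which compressor the authors used. The sample texts are invented.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed data (stand-in for C(.))."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized Compression Distance between two byte strings."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


# Hypothetical documents: two near-duplicates and one unrelated byte string.
doc_a = b"information retrieval with compression distance " * 20
doc_b = doc_a + b"plus a few extra words"
doc_c = "".join(str(i * i) for i in range(300)).encode("ascii")

near_duplicate_distance = ncd(doc_a, doc_b)
unrelated_distance = ncd(doc_a, doc_c)
```

    Values near 0 indicate that one text adds little information beyond the other, and values near 1 indicate little shared structure. With real compressors the value can slightly exceed 1, and very short inputs are dominated by compressor overhead, which relates to the abstract's remark that the distance estimate depends on text length.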

  16. An automatic method for retrieving and indexing catalogues of biomedical courses.

    Science.gov (United States)

    Maojo, Victor; de la Calle, Guillermo; García-Remesal, Miguel; Bankauskaite, Vaida; Crespo, Jose

    2008-11-06

    Although a wide range of information about Biomedical Informatics education and courses is available on different websites, this information is usually not exhaustive and is difficult to update. We propose a new methodology based on information retrieval techniques for automatically extracting, indexing and retrieving information about educational offers. A web application has been developed to make such information available in an inventory of courses and educational offers.

  17. Stemmer Impact on Quranic Mobile Information Retrieval Performance

    Directory of Open Access Journals (Sweden)

    Huda Omar Aljaloud

    2016-12-01

    Full Text Available Stemming algorithms are employed in information retrieval (IR) to reduce the variety of variants of the same word with several endings to a standard stem. Stemmers can also help IR systems by unifying vocabulary, reducing term variants, reducing storage space, and increasing the likelihood of matching documents, all of which make stemming very attractive for use in IR. This paper aims to study the impact of using stemming techniques on mobile effectiveness. Two word-extraction stemming techniques are used: a light stemmer and a dictionary-lookup stemmer. Three sets of experiments were conducted in this research in order to raise the efficiency of mobile applications. Implementing the two stemming approaches and assessing their accuracy by calculating precision, recall, MAP, and F-measure produced results which show that the light10 stemmer outperforms the dictionary-lookup stemmer in precision and MAP. Furthermore, the mobile performance of the light10 stemmer exceeds that of the dictionary-based stemmer.
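
    To make the idea of light stemming and its evaluation concrete, the sketch below strips a few English suffixes and computes set-based precision, recall and F-measure. The suffix list is a toy assumption for English only; it is not the light10 Arabic stemmer or the dictionary-lookup stemmer evaluated in the paper, and the document IDs are invented.

```python
def toy_light_stem(word, suffixes=("ing", "ed", "es", "s"), min_stem=3):
    """Strip the first matching suffix, keeping at least min_stem characters."""
    for suffix in suffixes:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
            return word[: -len(suffix)]
    return word


def precision_recall_f1(retrieved, relevant):
    """Set-based precision, recall and balanced F-measure for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    true_positives = len(retrieved & relevant)
    precision = true_positives / len(retrieved) if retrieved else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Variants of one word collapse to a single index term after stemming.
stems = {toy_light_stem(w) for w in ["retrieving", "retrieved", "retrieves"]}

# Hypothetical result list and relevance judgments.
p, r, f1 = precision_recall_f1(["a", "b", "c", "d"], ["a", "c", "e"])
```

    Conflating variants this way tends to raise recall, because a query term matches more documents, while overly aggressive stripping can lower precision, which is the trade-off such experiments measure.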

  18. How to retrieve additional information from the multiplicity distributions

    Science.gov (United States)

    Wilk, Grzegorz; Włodarczyk, Zbigniew

    2017-01-01

    Multiplicity distributions (MDs) P(N) measured in multiparticle production processes are most frequently described by the negative binomial distribution (NBD). However, with increasing collision energy some systematic discrepancies have become more and more apparent. They are usually attributed to the possible multi-source structure of the production process and described using a multi-NBD form of the MD. We investigate the possibility of keeping a single NBD but with its parameters depending on the multiplicity N. This is done by modifying the widely known clan model of particle production leading to the NBD form of P(N). This is then confronted with the approach based on the so-called cascade-stochastic formalism which is based on different types of recurrence relations defining P(N). We demonstrate that a combination of both approaches allows the retrieval of additional valuable information from the MDs, namely the oscillatory behavior of the counting statistics apparently visible in the high energy data.

  19. Information retrieval for education: making search engines language aware

    Directory of Open Access Journals (Sweden)

    Niels Ott

    2010-01-01

    Full Text Available Search engines have been a major factor in making the web the successful and widely used information source it is today. Generally speaking, they make it possible to retrieve web pages on a topic specified by the keywords entered by the user. Yet web searching currently does not take into account which of the search results are comprehensible for a given user – an issue of particular relevance when considering students in an educational setting. And current search engines do not support teachers in searching for language properties relevant for selecting texts appropriate for language students at different stages in the second language acquisition process. At the same time, raising language awareness is a major focus in second language acquisition research and foreign language teaching practice, and research since the 20s has tried to identify indicators predicting which texts are comprehensible for readers at a particular level of ability. For example, the military has been interested in ensuring that workers at a given level of education can understand the manuals they need to read in order to perform their job. We present a new search engine approach which makes it possible for teachers to search for texts both in terms of contents and in terms of their reading difficulty and other language properties. The implemented prototype builds on state-of-the-art information retrieval technology and exemplifies how a range of readability measures can be integrated in a modular fashion.

  20. Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data

    OpenAIRE

    2005-01-01

    This paper aimed to design and build an Automatic Information Retrieval System to handle Arabic data. The paper also compares the retrieval results obtained with the vector space model under two different indexing methods: full-word indexing and root indexing. The proposed Automatic Information Retrieval system was implemented using a traditional model technique, the Vector Space Model (VSM), with the cosine similarity measure. The output resu...
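
    A minimal sketch of vector space ranking with the cosine measure mentioned above. The tokenization (lowercased whitespace splitting) and raw term-frequency weights are assumptions chosen for brevity; the abstract's full-word and root indexing for Arabic would replace `term_vector` with a proper indexer, which is not shown here.

```python
import math
from collections import Counter


def term_vector(text):
    """Raw term-frequency vector; whitespace tokenization is a simplifying assumption."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, documents):
    """Return (score, document index) pairs, highest cosine score first."""
    q = term_vector(query)
    scored = [(cosine_similarity(q, term_vector(d)), i) for i, d in enumerate(documents)]
    return sorted(scored, reverse=True)
```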

  1. A new website with real-time dissemination of information on fire activity and meteorological fire danger in Portugal

    Science.gov (United States)

    DaCamara, Carlos; Trigo, Ricardo; Nunes, Sílvia; Pinto, Miguel; Oliveira, Tiago; Almeida, Rui

    2017-04-01

    In Portugal, as in Mediterranean Europe, fire activity is a natural phenomenon linking climate, humans and vegetation and is therefore conditioned by natural and anthropogenic factors. Natural factors include topography, vegetation cover and prevailing weather conditions whereas anthropogenic factors encompass land management practices and fire prevention policies. Land management practices, in particular the inadequate use of fire, are a crucial anthropogenic factor that accounts for about 90% of fire ignitions. Fire prevention policies require adequate and timely information about wildfire potential assessment, which is usually based on fire danger rating systems that provide indices to be used on an operational and tactical basis in decision support systems. We present a new website designed to provide the user community with relevant real-time information on fire activity and meteorological fire danger that will allow the adoption of adequate measures to mitigate fire damage. The fire danger product consists of forecasts of fire danger over Portugal based on a statistical procedure that combines information about fire history derived from the Fire Radiative Power product disseminated by the Land Surface Analysis Satellite Application Facility (LSA SAF) with daily meteorological forecasts provided by the European Centre for Medium-Range Weather Forecasts (ECMWF). The aim of the website is fourfold: 1) to concentrate all available information (databases and maps) relevant to fire management in a single platform so that access by end users becomes easier, faster and friendlier; 2) to supervise the access of users to the different products available; 3) to control and assist access to the platform and obtain feedback from users for further improvements; 4) to reach out to the operational community and foster the use of better information that increases efficiency in risk management. The website is sponsored by The Navigator Company, a leading force in the global pulp

  2. Website Optimization

    CERN Document Server

    King, Andrew

    2008-01-01

    Remember when an optimized website was one that merely didn't take all day to appear? Times have changed. Today, website optimization can spell the difference between enterprise success and failure, and it takes a lot more know-how to achieve success. This book is a comprehensive guide to the tips, techniques, secrets, standards, and methods of website optimization. From increasing site traffic to maximizing leads, from revving up responsiveness to increasing navigability, from prospect retention to closing more sales, the world of 21st century website optimization is explored, exemplified a

  3. The Construction of Library Websites Based on Information Building and Readers' Experience

    Institute of Scientific and Technical Information of China (English)

    汤妙吉

    2016-01-01

    Currently, the construction of library websites in many universities and colleges is not closely aligned with readers' experience needs, which results in several problems concerning the navigational organization of websites, their usability, support for readers' personalized services, and website content. Developing library websites based on both information construction and readers' experience, using Java combined with an Mssql database, will not only guarantee the scientific rigor and standardization of library website construction, but also emphasize resource navigation and retrieval, personalized libraries and interaction through social media, improve the utilization rate of library collection resources, enhance the initiative of libraries in serving readers, and optimize readers' experience in using library websites.

  4. Generic information can retrieve known biological associations: implications for biomedical knowledge discovery.

    Directory of Open Access Journals (Sweden)

    Herman H H B M van Haagen

    Full Text Available MOTIVATION: Weighted semantic networks built from text-mined literature can be used to retrieve known protein-protein or gene-disease associations, and have been shown to anticipate associations years before they are explicitly stated in the literature. Our text-mining system recognizes over 640,000 biomedical concepts: some are specific (i.e., names of genes or proteins), others generic (e.g., 'Homo sapiens'). Generic concepts may play important roles in automated information retrieval, extraction, and inference but may also result in concept overload and confound retrieval and reasoning with low-relevance or even spurious links. Here, we attempted to optimize the retrieval performance for protein-protein interactions (PPI) by filtering generic concepts (node filtering) or links to generic concepts (edge filtering) from a weighted semantic network. First, we defined metrics based on network properties that quantify the specificity of concepts. Then using these metrics, we systematically filtered generic information from the network while monitoring retrieval performance of known protein-protein interactions. We also systematically filtered specific information from the network (inverse filtering), and assessed the retrieval performance of networks composed of generic information alone. RESULTS: Filtering generic or specific information induced a two-phase response in retrieval performance: initially the effects of filtering were minimal but beyond a critical threshold network performance suddenly drops. Contrary to expectations, networks composed exclusively of generic information demonstrated retrieval performance comparable to unfiltered networks that also contain specific concepts. Furthermore, an analysis using individual generic concepts demonstrated that they can effectively support the retrieval of known protein-protein interactions. For instance the concept "binding" is indicative for PPI retrieval and the concept "mutation abnormality" is

  5. Latent morpho-semantic analysis : multilingual information retrieval with character n-grams and mutual information.

    Energy Technology Data Exchange (ETDEWEB)

    Bader, Brett William; Chew, Peter A.; Abdelali, Ahmed (New Mexico State University)

    2008-08-01

    We describe an entirely statistics-based, unsupervised, and language-independent approach to multilingual information retrieval, which we call Latent Morpho-Semantic Analysis (LMSA). LMSA overcomes some of the shortcomings of related previous approaches such as Latent Semantic Analysis (LSA). LMSA has an important theoretical advantage over LSA: it combines well-known techniques in a novel way to break the terms of LSA down into units which correspond more closely to morphemes. Thus, it has a particular appeal for use with morphologically complex languages such as Arabic. We show through empirical results that the theoretical advantages of LMSA can translate into significant gains in precision in multilingual information retrieval tests. These gains are not matched either when a standard stemmer is used with LSA, or when terms are indiscriminately broken down into n-grams.

  6. Efficient Methods to Assimilate Satellite Retrievals Based on Information Content. Part 2; Suboptimal Retrieval Assimilation

    Science.gov (United States)

    Joiner, J.; Dee, D. P.

    1998-01-01

    One of the outstanding problems in data assimilation has been and continues to be how best to utilize satellite data while balancing the tradeoff between accuracy and computational cost. A number of weather prediction centers have recently achieved remarkable success in improving their forecast skill by changing the method by which satellite data are assimilated into the forecast model from the traditional approach of assimilating retrievals to the direct assimilation of radiances in a variational framework. The operational implementation of such a substantial change in methodology involves a great number of technical details, e.g., pertaining to quality control procedures, systematic error correction techniques, and tuning of the statistical parameters in the analysis algorithm. Although there are clear theoretical advantages to the direct radiance assimilation approach, it is not obvious at all to what extent the improvements that have been obtained so far can be attributed to the change in methodology, or to various technical aspects of the implementation. The issue is of interest because retrieval assimilation retains many practical and logistical advantages which may become even more significant in the near future when increasingly high-volume data sources become available. The central question we address here is: how much improvement can we expect from assimilating radiances rather than retrievals, all other things being equal? We compare the two approaches in a simplified one-dimensional theoretical framework, in which problems related to quality control and systematic error correction are conveniently absent. By assuming a perfect radiative transfer model and perfect knowledge of radiance and background error covariances, we are able to formulate a nonlinear local error analysis for each assimilation method. Direct radiance assimilation is optimal in this idealized context, while the traditional method of assimilating retrievals is suboptimal because it

  7. 45 CFR 205.35 - Mechanized claims processing and information retrieval systems; definitions.

    Science.gov (United States)

    2010-10-01

    ... retrieval systems; definitions. 205.35 Section 205.35 Public Welfare Regulations Relating to Public Welfare... claims processing and information retrieval systems; definitions. Section 205.35 through 205.38 contain State plan requirements for an automated statewide management information system, conditions for FFP...

  8. Information Retrieval eXperience (IRX): Towards a Human-Centered Personalized Model of Relevance

    NARCIS (Netherlands)

    Sluis, van der Frans; Broek, van den Egon L.; Dijk, van Betsy; Hoeber, O.; Li, Y.; Huang, X.J.

    2010-01-01

    We approach Information Retrieval (IR) from a User eXperience (UX) perspective. Through introducing a model for Information Retrieval eXperience (IRX), this paper operationalizes a perspective on IR that reaches beyond topicality. Based on a document's topicality, complexity, and emotional value, a

  9. A Parallel Relational Database Management System Approach to Relevance Feedback in Information Retrieval.

    Science.gov (United States)

    Lundquist, Carol; Frieder, Ophir; Holmes, David O.; Grossman, David

    1999-01-01

    Describes a scalable, parallel, relational database-driven information retrieval engine. To support portability across a wide range of execution environments, all algorithms adhere to the SQL-92 standard. By incorporating relevance feedback algorithms, accuracy is enhanced over prior database-driven information retrieval efforts. Presents…

  10. Nonmaterialized Relations and the Support of Information Retrieval Applications by Relational Database Systems.

    Science.gov (United States)

    Lynch, Clifford A.

    1991-01-01

    Describes several aspects of the problem of supporting information retrieval system query requirements in the relational database management system (RDBMS) environment and proposes an extension to query processing called nonmaterialized relations. User interactions with information retrieval systems are discussed, and nonmaterialized relations are…

  11. Experiments in Discourse Analysis Impact on Information Classification and Retrieval Algorithms.

    Science.gov (United States)

    Morato, Jorge; Llorens, J.; Genova, G.; Moreiro, J. A.

    2003-01-01

    Discusses the inclusion of contextual information in indexing and retrieval systems to improve results and the ability to carry out text analysis by means of linguistic knowledge. Presents research that investigated whether discourse variables have an impact on information retrieval and classification algorithms. (Author/LRW)

  12. On a Model of Distributed Information Retrieval Systems Based on Thesauri.

    Science.gov (United States)

    Mazur, Zygmunt

    1984-01-01

    Investigates the properties of a global model consisting of "n" local information retrieval systems based on thesaurus. Definitions of a distributed information retrieval system (thesaurus, documents set, set of queries) and proofs of theorems denoting further properties of the systems are highlighted. Five references are included. (EJS)

  13. A Domain Specific Lexicon Acquisition Tool for Cross-Language Information Retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Jong, de Franciska; Kraaij, Wessel

    1997-01-01

    With the recent enormous increase of information dissemination via the web as incentive there is a growing interest in supporting tools for cross-language retrieval. In this paper we describe a disclosure and retrieval approach that fulfils the needs of both information providers and users by offeri

  14. The Relative Effectiveness of Varied Visual Testing Formats in Retrieving Information Related to Different Educational Objectives

    Science.gov (United States)

    Williams, Jaison; Dwyer, Francis

    2004-01-01

    The purpose of this study is to: (1) examine the relative effectiveness with which different types of visual test formats facilitated information retrieval on tests measuring different educational objectives; (2) measure the effect that prior knowledge had on information retrieval; and (3) to determine whether an interaction existed between prior…

  15. Strong Similarity Measures for Ordered Sets of Documents in Information Retrieval.

    Science.gov (United States)

    Egghe, L.; Michel, Christine

    2002-01-01

    Presents a general method to construct ordered similarity measures in information retrieval based on classical similarity measures for ordinary sets. Describes a test of some of these measures in an information retrieval system that extracted ranked document sets and discusses the practical usability of the ordered similarity measures. (Author/LRW)

  16. Personalizing Information Retrieval Using Interaction Behaviors in Search Sessions in Different Types of Tasks

    Science.gov (United States)

    Liu, Chang

    2012-01-01

    When using information retrieval (IR) systems, users often pose short and ambiguous query terms. It is critical for IR systems to obtain more accurate representation of users' information need, their document preferences, and the context they are working in, and then incorporate them into the design of the systems to tailor retrieval to…

  17. A Parallel Relational Database Management System Approach to Relevance Feedback in Information Retrieval.

    Science.gov (United States)

    Lundquist, Carol; Frieder, Ophir; Holmes, David O.; Grossman, David

    1999-01-01

    Describes a scalable, parallel, relational database-driven information retrieval engine. To support portability across a wide range of execution environments, all algorithms adhere to the SQL-92 standard. By incorporating relevance feedback algorithms, accuracy is enhanced over prior database-driven information retrieval efforts. Presents…

  18. A Virtual Assistant for Websites

    Directory of Open Access Journals (Sweden)

    José Luiz Andrade Duizith

    2004-06-01

    Full Text Available This work presents a Virtual Assistant (VA) whose main goal is to supply information to Website users. A VA is a software system that interacts with people through a Web browser, receiving textual questions and answering automatically without human intervention. The VA supplies information by looking for similar questions in a knowledge base and giving the corresponding answer. Artificial Intelligence techniques are employed in this matching process, to compare the user’s question against questions stored in the base. The main advantage of using the VA is to minimize information overload when users get lost in Websites. The VA can guide the user across the web pages or directly supply information. This is especially important for customers visiting an enterprise site, looking for products, services or prices or needing information about some topic. The VA can also help in Knowledge Management processes inside enterprises, offering an easy way for people to store and retrieve knowledge. An extra advantage is to reduce the structure of Call Centers, since the VA can be given to customers on a CD-ROM. Furthermore, the VA provides Webmasters with statistics about the usage of the VA (most frequently asked themes, number of visitors, conversation time).

  19. Dissociable parietal regions facilitate successful retrieval of recently learned and personally familiar information.

    Science.gov (United States)

    Elman, Jeremy A; Cohn-Sheehy, Brendan I; Shimamura, Arthur P

    2013-03-01

    In fMRI analyses, the posterior parietal cortex (PPC) is particularly active during the successful retrieval of episodic memory. To delineate the neural correlates of episodic retrieval more succinctly, we compared retrieval of recently learned spatial locations (photographs of buildings) with retrieval of previously familiar locations (photographs of familiar campus buildings). Episodic retrieval of recently learned locations activated a circumscribed region within the ventral PPC (anterior angular gyrus and adjacent regions in the supramarginal gyrus) as well as medial PPC regions (posterior cingulate gyrus and posterior precuneus). Retrieval of familiar locations activated more posterior regions in the ventral PPC (posterior angular gyrus, LOC) and more anterior regions in the medial PPC (anterior precuneus and retrosplenial cortex). These dissociable effects define more precisely PPC regions involved in the retrieval of recent, contextually bound information as opposed to regions involved in other processes, such as visual imagery, scene reconstruction, and self-referential processing.

  20. Information retrieval patterns and needs among practicing general surgeons: a statewide experience.

    Science.gov (United States)

    Shelstad, K R; Clevenger, F W

    1996-10-01

    Information retrieval has progressed from a reliance on traditional print sources to the modern era of computer databases and online networks. Surgeons, many from remote areas not served by professional medical libraries, must develop and maintain skills in information retrieval and management in both electronic and standard formats. One hundred thirty-three New Mexico general surgeons were surveyed to identify their information-seeking patterns in five areas: retrieval purposes, retrieval sources, barriers to access, techniques used, and continuing education needs. Ninety-nine (74.4%) surgeons responded to the survey. Ninety-five percent utilize professional meetings, the medical literature, and physician colleagues as information sources. Only 17% utilize the outreach services of the state's only medical school library. Common retrieval barriers were practice demands (71%), isolation from medical schools (30%), computer illiteracy (28%), and rural environment (25%). Continuing education topics related to information management would be valuable to 61% of the surgeons. Sixty-nine percent believe their current ability to access biomedical information is adequate, despite most frequently accessing their personal libraries for information related to decision-making or patient management. These data suggest that, despite significant information needs, surgeons have not embraced newer forms of information retrieval. It is imperative that surgeons acquire and maintain modern information retrieval skills as a means of remaining up-to-date in their profession. Professional surgical organizations and medical librarians should collaborate on these continuing education ventures.

  1. Official Antimonopoly Website Opens

    Institute of Scientific and Technical Information of China (English)

    Guo Liqin

    2011-01-01

    China's antimonopoly law website opened on December 19, 2009. Netizens can log in at http://www.antimonopolylaw.org to see updated information on in-depth anti-monopoly law theory and case studies, according to the organizer.

  2. ILRS Website Update

    Science.gov (United States)

    Noll, Carey E.; Torrence, Mark H.; Pollack, Nathan H.; Tyahla, Lori J.

    2013-01-01

    The ILRS website, http://ilrs.gsfc.nasa.gov, is the central source of information for all aspects of the service. The website provides information on the organization and operation of the ILRS and descriptions of ILRS components, data, and products. Furthermore, the website provides an entry point to the archive of these data products available through the data centers. Links are provided to extensive information on the ILRS network stations including performance assessments and data quality evaluations. Descriptions of supported satellite missions (current, future, and past) are provided to aid in station acquisition and data analysis. The website was recently redesigned. Content was reviewed during the update process, ensuring information is current and useful. This poster will provide specific examples of key sections, applications, and webpages.

  3. Aerometric Information Retrieval System/AIRS Facility Subsystem (AIRS/AFS)

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Aerometric Information Retrieval System/AIRS Facility Subsystem (AIRS/AFS) is a database that provides information on air releases from various stationary...

  4. Subjective Probability and Information Retrieval: A Review of the Psychological Literature.

    Science.gov (United States)

    Thompson, Paul

    1988-01-01

    Reviews the subjective probability estimation literature of six schools of human judgement and decision making: decision theory, behavioral decision theory, psychological decision theory, social judgement theory, information integration theory, and attribution theory. Implications for probabilistic information retrieval are discussed, including…

  5. Intrasubtest scatter on the WAIS-III information subtest and psychometrically defined retrieval deficits.

    Science.gov (United States)

    Ryan, J J; Paul, C A; Arb, J D

    1999-12-01

    Milberg, et al. (1996) postulated that significant intrasubtest scatter on the Wechsler Information subtest reflects impaired retrieval. From a pool of 205 male referrals at a VA medical center with complete WAIS-III and WMS-III protocols, 28 participants with impaired retrieval (Group I) defined by a high Retrieval Composite score were identified. A sample (Group II) without similar evidence of impaired retrieval was matched to Group I on age, education, Full Scale IQ, race, and diagnosis. Intrasubtest scatter on the Information subtest was the same across groups (Group I M = 6.3, SD = 2.7; Group II M = 6.9, SD = 3.4). A second study identified impaired retrieval using the WMS-III Word Lists subtest. 21 participants (Group III) had impaired retrieval indicated by a Recognition scaled score being ≥ 4 points higher than the Delayed Recall scaled score. A matched sample (Group IV) of VA patients without similar evidence of impaired retrieval was constituted. Intrasubtest scatter on the Information subtest did not differ across groups (Group III M = 6.6, SD = 2.4; Group IV M = 6.0, SD = 2.5). Evaluations of the retrieval deficit hypothesis should be based on responses of participants whose Information performance is characterized by abnormal amounts of intrasubtest scatter. It is possible that a specific amount of response variability must be present within the subtest before retrieval problems can be detected.

  6. Library Website Usability Test Project

    KAUST Repository

    Ramli, Rindra M.

    2013-06-01

    This usability testing project was conducted to understand how our community uses the library website. The researchers wanted to know how our users interact with the library website and how easily they can obtain relevant information from it. The methodology was computer-based user testing, in which participants answer several questions and carry out the corresponding actions on the library website. Their actions are recorded with Techsmith Camtasia software for later analysis by the researchers.

  7. [Design and implementation of medical instrument standard information retrieval system based on ASP.NET].

    Science.gov (United States)

    Yu, Kaijun

    2010-07-01

    This paper analyzes the design goals of a medical instrumentation standard information retrieval system. Based on the B/S structure, we established a medical instrumentation standard retrieval system using ASP.NET with the C# programming language, an IIS Web server, and a SQL Server 2000 database, in the .NET environment. The paper also introduces the system structure, the retrieval system modules, the system development environment, and the detailed design of the system.

  8. Editorial for the Bibliometric-enhanced Information Retrieval Workshop at ECIR 2014

    OpenAIRE

    2014-01-01

    This first "Bibliometric-enhanced Information Retrieval" (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of co-authorship networks, can improve retrieval...

  9. Understanding Academic Information Seeking Habits through Analysis of Web Server Log Files: The Case of the Teachers College Library Website

    Science.gov (United States)

    Asunka, Stephen; Chae, Hui Soo; Hughes, Brian; Natriello, Gary

    2009-01-01

    Transaction logs of user activity on an academic library website were analyzed to determine general usage patterns on the website. This paper reports on insights gained from the analysis, and identifies and discusses issues relating to content access, interface design and general functionality of the website. (Contains 13 figures and 8 tables.)

  10. A Novel Approach for Information Content Retrieval and Analysis of Bio-Images using Datamining techniques

    Directory of Open Access Journals (Sweden)

    Ayyagari Sri Nagesh

    2012-11-01

    Full Text Available In the biomedical image processing domain, content-based analysis and information retrieval of bio-images is critical for disease diagnosis. Content-Based Image Analysis and Information Retrieval (CBIAIR) has become a significant part of information retrieval technology. One challenge in this area is that the ever-increasing number of bio-images acquired through the digital world makes brute-force searching almost impossible. The content and identification of structural objects in medical images play a significant role in image content analysis and information retrieval. There are three fundamental concepts in content-based bio-image retrieval: visual-feature extraction, multi-dimensional indexing, and the retrieval system process. Each image has three kinds of content features: colour, texture and shape. Colour and texture are both important visual features used in Content-Based Image Retrieval to improve results. In this paper, we present an effective image retrieval system, called CBIAIR, that uses texture, shape and colour features. First, we developed a new texture pattern feature as the pixel-based feature in the CBIAIR system. We then used a semantic colour feature as the colour-based feature, and shape-based feature selection is done using an existing technique. For retrieval, these features are extracted from the query image and matched against the feature library using the feature-weighted distance. After that, all feature vectors are stored in the database using an indexing procedure. Finally, the relevant images whose matched distance is below the predefined threshold value are retrieved from the image database after applying the K-NN classifier.

  11. Information Storage and Retrieval, Scientific Report No. ISR-15.

    Science.gov (United States)

    Salton, Gerard

    Several algorithms were investigated which would allow a user to interact with an automatic document retrieval system by requesting relevance judgments on selected sets of documents. Two viewpoints were taken in evaluation. One measured the movement of queries toward the optimum query as defined by Rocchio; the other measured the retrieval…
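
    For orientation, the standard Rocchio relevance-feedback update moves a query vector toward the centroid of documents judged relevant and away from the centroid of those judged non-relevant. The sketch below shows that textbook formulation only; it is not taken from the report, and the coefficient values and the clipping of negative weights are illustrative assumptions.

```python
def centroid(vectors, dim):
    """Component-wise mean of a list of vectors; zero vector if the list is empty."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

def rocchio_update(query, relevant, non_relevant,
                   alpha=1.0, beta=0.75, gamma=0.15):
    """Return alpha*q + beta*centroid(relevant) - gamma*centroid(non_relevant)."""
    dim = len(query)
    rel_c = centroid(relevant, dim)
    non_c = centroid(non_relevant, dim)
    updated = [alpha * q + beta * r - gamma * n
               for q, r, n in zip(query, rel_c, non_c)]
    # Clipping negative term weights to zero is a common convention (assumed here).
    return [max(0.0, w) for w in updated]

# Three-term vocabulary; the user judged two documents relevant and one not.
new_query = rocchio_update(
    query=[1.0, 0.0, 0.0],
    relevant=[[1.0, 1.0, 0.0], [1.0, 1.0, 0.0]],
    non_relevant=[[0.0, 0.0, 1.0]],
)
```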

  12. Data retrieval system provides unlimited hardware design information

    Science.gov (United States)

    Rawson, R. D.; Swanson, R. L.

    1967-01-01

    Data is input to magnetic tape on a single format card that specifies the system, location, and component, the test point identification number, the operator's initials, the date, a data code, and the data itself. This method is efficient for large volume data storage and retrieval, and permits output variations without continuous program modifications.
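
    As a rough modern analogue of the single-format record described above, the fields could be modelled as one fixed record type with a generic selection helper, so that new output views need no new program logic. The class, field names, and sample values below are hypothetical and are not drawn from the original system.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class FormatCardRecord:
    """One record per card; field names are illustrative assumptions."""
    system: str
    location: str
    component: str
    test_point_id: str
    operator_initials: str
    date: str
    data_code: str
    data: str

def select(records, **criteria):
    """Return the records whose fields equal every given criterion."""
    return [
        r for r in records
        if all(getattr(r, field) == value for field, value in criteria.items())
    ]

records = [
    FormatCardRecord("hydraulic", "bay 1", "pump", "TP-001", "RR", "1967-01-01", "P", "42.0"),
    FormatCardRecord("hydraulic", "bay 2", "valve", "TP-002", "RS", "1967-01-02", "P", "17.5"),
    FormatCardRecord("electrical", "bay 1", "relay", "TP-003", "RR", "1967-01-02", "V", "28.0"),
]

hydraulic = select(records, system="hydraulic")
by_operator_and_bay = select(records, operator_initials="RR", location="bay 1")
```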

  13. A method for the design and development of medical or health care information websites to optimize search engine results page rankings on Google.

    LENUS (Irish Health Repository)

    Dunne, Suzanne

    2013-01-01

    The Internet is a widely used source of information for patients searching for medical/health care information. While many studies have assessed existing medical/health care information on the Internet, relatively few have examined methods for design and delivery of such websites, particularly those aimed at the general public.

  14. An Information-Theoretic Privacy Criterion for Query Forgery in Information Retrieval

    CERN Document Server

    Rebollo-Monedero, David; Forné, Jordi

    2011-01-01

    In previous work, we presented a novel information-theoretic privacy criterion for query forgery in the domain of information retrieval. Our criterion measured privacy risk as a divergence between the user's and the population's query distribution, and contemplated the entropy of the user's distribution as a particular case. In this work, we make a twofold contribution. First, we thoroughly interpret and justify the privacy metric proposed in our previous work, elaborating on the intimate connection between the celebrated method of entropy maximization and the use of entropies and divergences as measures of privacy. Secondly, we attempt to bridge the gap between the privacy and the information-theoretic communities by substantially adapting some technicalities of our original work to reach a wider audience, not intimately familiar with information theory and the method of types.
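
    To make the relation between divergence and entropy concrete, the sketch below assumes the divergence is the Kullback-Leibler divergence (the abstract says only "a divergence") and uses made-up query distributions. When the population distribution is uniform over n query categories, the divergence equals log2(n) minus the entropy of the user's distribution, which is one way entropy can arise as a particular case.

```python
import math

def entropy(p):
    """Shannon entropy in bits of a discrete distribution p."""
    return -sum(x * math.log2(x) for x in p if x > 0)

def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) in bits; q must be positive where p is."""
    return sum(x * math.log2(x / y) for x, y in zip(p, q) if x > 0)

# Hypothetical distributions over three query categories.
user = [0.5, 0.25, 0.25]
population = [0.4, 0.4, 0.2]
uniform = [1 / 3] * 3

risk = kl_divergence(user, population)           # larger means more distinguishable
risk_vs_uniform = kl_divergence(user, uniform)   # equals log2(3) - entropy(user)
```

    Under this reading, a user whose query distribution matches the population's has zero divergence and is therefore hardest to single out.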

  15. Design and Implementation of Automatic Indexing for Information Retrieval with Arabic Documents.

    Science.gov (United States)

    Hmeidi, Ismail; Kanaan, Ghassan; Evens, Martha

    1997-01-01

    Describes automatic information retrieval system designed and built to handle Arabic data. Discusses cost-effectiveness of automatic indexing. Compares retrieval results using words as index terms versus stems and roots. Includes 19 tables; 60 queries using full words and relevance judgments are appended. (JAK)

  16. Cross-Language Information Retrieval: Experiments Based on CLEF 2000 Corpora.

    Science.gov (United States)

    Savoy, Jacques

    2003-01-01

    Discusses cross-language, multilingual, and bilingual information retrieval on the Web; evaluates retrieval effectiveness of indexing and search strategies based on test collections from CLEF (Cross-Language Evaluation Forum) in English, French, German, and Italian; and suggests and evaluates database merging strategies. Appendices include…

  17. Modeling the Time Course of Feature Perception and Feature Information Retrieval

    Science.gov (United States)

    Kent, Christopher; Lamberts, Koen

    2006-01-01

    Three experiments investigated whether retrieval of information about different dimensions of a visual object varies as a function of the perceptual properties of those dimensions. The experiments involved two perception-based matching tasks and two retrieval-based matching tasks. A signal-to-respond methodology was used in all tasks. A stochastic…

  18. Editorial for the Bibliometric-enhanced Information Retrieval Workshop at ECIR 2014

    NARCIS (Netherlands)

    Mayr, Philipp; Schaer, Philipp; Scharnhorst, Andrea; Mutschke, Peter

    2014-01-01

    This first "Bibliometric-enhanced Information Retrieval" (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offe

  19. Query-by-Example Music Information Retrieval by Score-Informed Source Separation and Remixing Technologies

    Directory of Open Access Journals (Sweden)

    Goto Masataka

    2010-01-01

    We describe a novel query-by-example (QBE) approach in music information retrieval that allows a user to customize query examples by directly modifying the volume of different instrument parts. The underlying hypothesis of this approach is that the musical mood of retrieved results changes in relation to the volume balance of different instruments. On the basis of this hypothesis, we aim to clarify the relationship between the change in the volume balance of a query and the genre of the retrieved pieces, called genre classification shift. Such an understanding would allow us to instruct users in how to generate alternative queries without finding other appropriate pieces. Our QBE system first separates all instrument parts from the audio signal of a piece with the help of its musical score, and then it allows users to remix these parts to change the acoustic features that represent the musical mood of the piece. Experimental results showed that the genre classification shift was actually caused by the volume change in the vocal, guitar, and drum parts.
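
    The remixing step can be pictured as scaling each separated instrument part by a user-chosen gain and summing the results. The sketch below illustrates only that idea under simplifying assumptions: the parts are already separated into equal-length sample lists, and the part names, gains, and function name are hypothetical. Source separation and mood-feature extraction are outside its scope.

```python
def remix(parts, gains):
    """Sum the separated parts after applying a per-part volume gain.

    parts: mapping of part name to a list of samples (all the same length).
    gains: mapping of part name to a linear gain; missing parts default to 1.0.
    """
    length = len(next(iter(parts.values())))
    mix = [0.0] * length
    for name, samples in parts.items():
        gain = gains.get(name, 1.0)
        for i, sample in enumerate(samples):
            mix[i] += gain * sample
    return mix

# Two-sample toy signals standing in for separated audio tracks.
parts = {
    "vocal": [1.0, 0.0],
    "guitar": [0.5, 0.5],
    "drums": [0.0, 1.0],
}

# Boost the vocal and mute the drums to form an alternative query.
new_query_audio = remix(parts, {"vocal": 2.0, "drums": 0.0})
```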

  20. Utilization of ontology look-up services in information retrieval for biomedical literature

    OpenAIRE

    2014-01-01

    With the vast amount of biomedical data, we face the necessity of improving information retrieval processes in the biomedical domain. The use of biomedical ontologies facilitated the combination of various data sources (e.g. scientific literature, clinical data repository) by increasing the quality of information retrieval and reducing the maintenance efforts. In this context, we developed Ontology Look-up services (OLS), based on NEWT and MeSH vocabularies. Our services were involved in some inform...