WorldWideScience

Sample records for bilingual information retrieval

  1. BiLingual Information Retrieval System for English and Tamil

    CERN Document Server

    Saraswathi, S; K, Kalaimagal; M, Kalaiyarasi

    2010-01-01

    This paper addresses the design and implementation of a BiLingual Information Retrieval system for the Festivals domain. A generic platform is built for bilingual information retrieval that can be extended to any foreign or Indian language with the same efficiency. The language searched is not drawn from a predefined set of standard languages but is chosen dynamically by processing the user's query. This paper deals with the Indian language Tamil in addition to English. The task is to retrieve the answer to the user's query in the same language as the query. To this end, an ontological tree is built for the domain such that every node of the tree holds entries in both languages. A Part-Of-Speech (POS) tagger is used to determine the keywords in the given query. Based on the context, the keywords are translated into the appropriate language using the ontological tree. A search is performed and documents are retrieved based on the keywords. With...

  2. Semi Automatic Ontology Based Bilingual Information Retrieval System (Pilgrimage Tourism in South India)

    Directory of Open Access Journals (Sweden)

    S. Saraswathi

    2012-04-01

    This paper focuses on the construction of a semi-automatic ontological tree in the domain of Pilgrimage Tourism in South India, with the aim of improving the efficiency of online information retrieval. The proposed system uses two languages, Tamil and English, for the input query and document retrieval. The user can pose the query in either Tamil or English, and the resulting document is displayed in the query language. In order to retrieve more relevant documents, a semi-automatic ontology tree has been constructed; this tree uses only the English language. A machine translation approach is used to translate the retrieved result into the language of the user's query. The system produces better results for simple user queries about Pilgrimage Tourism in South India whose answers can be retrieved from the updated semi-automatic ontological tree itself.

  3. Available Methods in Farsi-English Cross Language Information Retrieval Using Machine-readable, Bilingual Glossary

    Directory of Open Access Journals (Sweden)

    Hamid Alizadeh

    2009-12-01

    In this paper, the scope of the impact of Natural Language Processing (NLP) on translating search statements was determined by testing research hypotheses. The NLP techniques employed for search statement processing included text parsing, identification of linguistic forms, stopword removal, morphological analysis, and tokenization. Examination of the hypotheses indicated that translating with only the first equivalent term, as opposed to selecting all equivalent terms, would contribute to increased efficiency of the review; that while morphological analysis of the terms not translated by the glossary would increase the retrieval precision cutoff, the lack of such analysis would make no significant difference; and that sentence translation, as opposed to term-by-term translation, would increase the efficiency of Farsi-English proofreading. Other findings are also presented.
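
The contrast between translating with the first glossary equivalent and with all equivalents can be illustrated with a minimal sketch. The glossary entries, the stopword list, and the function name `translate_query` below are hypothetical toy values chosen for illustration; they are not taken from the paper.

```python
from typing import Dict, List

# Toy machine-readable glossary (hypothetical entries, romanized for readability).
GLOSSARY: Dict[str, List[str]] = {
    "ketab": ["book", "volume"],
    "khane": ["house", "home"],
}

# Illustrative stopword list (assumption): "va" stands in for "and".
STOPWORDS = {"va"}


def translate_query(query: str, all_equivalents: bool = False) -> List[str]:
    """Translate a query term by term using the glossary.

    With all_equivalents=False only the first listed equivalent is kept;
    with all_equivalents=True every listed equivalent is added to the query.
    """
    translated: List[str] = []
    for term in query.lower().split():  # whitespace tokenization
        if term in STOPWORDS:  # stopword removal
            continue
        equivalents = GLOSSARY.get(term)
        if equivalents is None:
            # Terms missing from the glossary pass through unchanged.
            translated.append(term)
        elif all_equivalents:
            translated.extend(equivalents)
        else:
            translated.append(equivalents[0])
    return translated
```

Selecting all equivalents widens the query, which tends to trade precision for recall; keeping only the first equivalent does the opposite.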

  4. Connectionist Interaction Information Retrieval.

    Science.gov (United States)

    Dominich, Sandor

    2003-01-01

    Discussion of connectionist views for adaptive clustering in information retrieval focuses on a connectionist clustering technique and activation spreading-based information retrieval model using the interaction information retrieval method. Presents theoretical as well as simulation results as regards computational complexity and includes…

  5. Evaluating personal information retrieval

    OpenAIRE

    Kelly, Liadh; Bunbury, Paul; Jones, Gareth J.F.

    2012-01-01

    Evaluation of personal search over an individual’s personal information space on the desktop or elsewhere is problematic for reasons relating both to the personal and private nature of the data and the associated personal information needs of collection owners. Indeed challenges associated with evaluation in this space are recognised as one of the key factors hindering the development of research in personal information retrieval. We present the “personal information retrieval evaluatio...

  6. Private information retrieval

    CERN Document Server

    Yi, Xun; Bertino, Elisa

    2013-01-01

    This book deals with Private Information Retrieval (PIR), a technique allowing a user to retrieve an element from a server in possession of a database without revealing to the server which element is retrieved. PIR has been widely applied to protect the privacy of the user in querying a service provider on the Internet. For example, by PIR, one can query a location-based service provider about the nearest car park without revealing his location to the server. The first PIR approach was introduced by Chor, Goldreich, Kushilevitz and Sudan in 1995 in a multi-server setting, where the user retriev
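
The multi-server idea can be sketched with the simple two-server XOR construction over a database of bits. This is a toy illustration under stated assumptions, not the book's presentation: both servers hold identical copies, they do not collude, and the function names (`make_queries`, `server_answer`, `retrieve_bit`) are hypothetical.

```python
import secrets
from typing import List, Tuple


def make_queries(n: int, index: int) -> Tuple[List[int], List[int]]:
    """Build one query per server for the bit at position `index`.

    Server A receives a uniformly random subset of positions; server B
    receives the same subset with `index` toggled. Each subset on its own
    is uniformly random, so neither server learns `index` by itself.
    """
    subset_a = [i for i in range(n) if secrets.randbits(1)]
    if index in subset_a:
        subset_b = [i for i in subset_a if i != index]
    else:
        subset_b = sorted(subset_a + [index])
    return subset_a, subset_b


def server_answer(database: List[int], subset: List[int]) -> int:
    """Server side: XOR of the database bits at the requested positions."""
    answer = 0
    for position in subset:
        answer ^= database[position]
    return answer


def retrieve_bit(database: List[int], index: int) -> int:
    """Client side: XOR the two answers; every position except `index` cancels."""
    query_a, query_b = make_queries(len(database), index)
    # In a real deployment the two calls would go to two separate servers.
    return server_answer(database, query_a) ^ server_answer(database, query_b)
```

The sketch sends one bit of query per database position to each server, so it shows the privacy mechanism only and makes no attempt at the communication savings that practical schemes aim for.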

  7. Information retrieval system

    Science.gov (United States)

    Berg, R. F.; Holcomb, J. E.; Kelroy, E. A.; Levine, D. A.; Mee, C., III

    1970-01-01

    Generalized information storage and retrieval system capable of generating and maintaining a file, gathering statistics, sorting output, and generating final reports for output is reviewed. File generation and file maintenance programs written for the system are general purpose routines.

  8. Information Retrieval Models

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Göker, Ayse; Davies, John

    2009-01-01

    Many applications that handle information on the internet would be completely inadequate without the support of information retrieval technology. How would we find information on the world wide web if there were no web search engines? How would we manage our email without spam filtering? Much of the

  10. Introduction to information retrieval

    CERN Document Server

    Manning, Christopher D; Schütze, Hinrich

    2008-01-01

    Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced un

  11. Interactive Information Retrieval

    DEFF Research Database (Denmark)

    Borlund, Pia

    2013-01-01

    The paper introduces the research area of interactive information retrieval (IIR) from a historical point of view. Further, the focus here is on evaluation, because much research in IR deals with IR evaluation methodology due to the core research interest in IR performance, system interaction...... and satisfaction with retrieved information. In order to position IIR evaluation, the Cranfield model and the series of tests that led to the Cranfield model are outlined. Three iconic user-oriented studies and projects that all have contributed to how IIR is perceived and understood today are presented......: The MEDLARS test, the Book House fiction retrieval system, and the OKAPI project. On this basis the call for alternative IIR evaluation approaches motivated by the three revolutions (the cognitive, the relevance, and the interactive revolutions) put forward by Robertson & Hancock-Beaulieu (1992) is presented...

  12. Grammatical category mediates the bilingual disadvantage in word retrieval

    Directory of Open Access Journals (Sweden)

    Yasmeen Faroqi-Shah

    2015-05-01

    Figure 1 shows the scores. Language Quotient (LQ, Kertesz, 2006) was treated as a covariate to account for overall language proficiency for both picture naming and fluency. There was an interaction between grammatical category and bilingualism for both picture naming accuracy (F(1, 69) = 37.5, p < .001) and verbal fluency (F(1, 60) = 5.2, p < .05), such that bilinguals scored lower than monolinguals for noun picture naming (F(1, 69) = 4.1, p < .05, LQ-corrected mean difference of 28.2%) and animal fluency (F(1, 60) = 13.9, p < .001, mean difference = 3.2), but not for actions. This finding of comparable verb retrieval accuracy in high proficiency bilingual speakers could be attributed to fewer cross-language competitors for verbs (Bultena et al., 2013). This study suggests that (1) bilingual lexical organization is influenced by grammatical category; and (2) action naming tasks may be more reliable for neuropsychological testing of high proficiency bilinguals.

  13. Music Information Retrieval.

    Science.gov (United States)

    Downie, J. Stephen

    2003-01-01

    Identifies MIR (Music Information Retrieval) computer system problems, historic influences, current state-of-the-art, and future MIR solutions through an examination of the multidisciplinary approach to MIR. Highlights include pitch; temporal factors; harmonics; tone; editorial, textual, and bibliographic facets; multicultural factors; locating…

  14. Information Retrieval Evaluation

    CERN Document Server

    Harman, Donna

    2011-01-01

    Evaluation has always played a major role in information retrieval, with the early pioneers such as Cyril Cleverdon and Gerard Salton laying the foundations for most of the evaluation methodologies in use today. The retrieval community has been extremely fortunate to have such a well-grounded evaluation paradigm during a period when most of the human language technologies were just developing. This lecture has the goal of explaining where these evaluation methodologies came from and how they have continued to adapt to the vastly changed environment in the search engine world today. The lecture

  15. Information, conservation and retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Eng, T. [Swedish Nuclear Fuel and Waste Management Co., Stockholm (Sweden); Norberg, E. [National Swedish Archives, Stockholm (Sweden); Torbacke, J. [Stockholm Univ. (Sweden). Dept. of History; Jensen, M. [Swedish Radiation Protection Inst., Stockholm (Sweden)

    1996-12-01

    The seminar took place on the Swedish ship for transportation of radioactive wastes, M/S Sigyn, which is used for exhibitions in summer. The seminar addressed topics related to general information needs in society and questions related to radioactive waste, i.e. how knowledge about a waste repository should be passed on to future generations. Three contributions are contained in the report from the seminar and are indexed separately: 'Active preservation - otherwise no achieves'; 'The conservation and dissemination of information - A democratic issue'; and 'Conservation and retrieval of information - Elements of a strategy to inform future societies about nuclear waste repositories'.

  16. Interactive Information Retrieval

    DEFF Research Database (Denmark)

    Borlund, Pia

    2013-01-01

    The paper introduces the research area of interactive information retrieval (IIR) from a historical point of view. Further, the focus here is on evaluation, because much research in IR deals with IR evaluation methodology due to the core research interest in IR performance, system interaction...... and satisfaction with retrieved information. In order to position IIR evaluation, the Cranfield model and the series of tests that led to the Cranfield model are outlined. Three iconic user-oriented studies and projects that all have contributed to how IIR is perceived and understood today are presented....... As a response to this call the ‘IIR evaluation model’ by Borlund (e.g., 2003a) is introduced. The objective of the IIR evaluation model is to facilitate IIR evaluation as close as possible to actual information searching and IR processes, though still in a relatively controlled evaluation environment, in which...

  17. Interactive Information Retrieval:

    DEFF Research Database (Denmark)

    Borlund, Pia

    This presentation addresses methodological issues of interactive information retrieval (IIR) evaluation in terms of what it entails to study users' use and interaction with IR systems, as well as their satisfaction with retrieved information. In particular, the presentation focuses on test design......, and it takes a look into the toolbox of IIR test design with reference to data collection methods and test procedure. It calls for careful and well-planned studies to qualify the knowledgebase generated as a result of the conducted IIR studies. The presentation further reflects on the need for an updated...... IIR from the perspective of search dedication and task load in order to also include everyday life information seeking? With this presentation, the IIR community is invited to an exchange of ideas and is encouraged to engage in collaborations with the solving of these (and other) issues to our joint...

  18. Introduction to information retrieval

    CERN Document Server

    Manning, Christopher D; Schütze, Hinrich

    2008-01-01

    Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.

  19. Changing Information Retrieval Behaviours

    DEFF Research Database (Denmark)

    Constantiou, Ioanna D.; Lehrer, Christiane; Hess, Thomas

    2014-01-01

    The introduction of smartphones and the accompanying profusion of mobile data services have had a profound effect on individuals' lives. One of the most influential service categories is location-based services (LBS). Based on insights from behavioural decision-making, a conceptual framework is d...... on the continuance of LBS use and indicate changes in individuals' information retrieval behaviours in everyday life. In particular, the distinct value dimension of LBS in specific contexts of use changes individuals' behaviours towards accessing location-related information....

  20. A Visual Information Retrieval Tool.

    Science.gov (United States)

    Zhang, Jin

    2000-01-01

    Discussion of visualization for information retrieval, that transforms unseen internal semantic representation of a document collection into visible geometric displays, focuses on DARE (Distance Angle Retrieval Environment). Highlights include expression of information need; interpretation and manipulation of information retrieval models; ranking…

  1. Multimedia Information Retrieval

    CERN Document Server

    Rueger, Stefan

    2009-01-01

    At its very core multimedia information retrieval means the process of searching for and finding multimedia documents; the corresponding research field is concerned with building the best possible multimedia search engines. The intriguing bit here is that the query itself can be a multimedia excerpt: For example, when you walk around in an unknown place and stumble across an interesting landmark, would it not be great if you could just take a picture with your mobile phone and send it to a service that finds a similar picture in a database and tells you more about the building -- and about its

  2. Intelligent Information Retrieval

    CERN Document Server

    Kurtz, Michael J.; Eichhorn, Guenther; Accomazzi, Alberto; Grant, Carolyn; Henneken, Edwin; Murray, Stephen S.

    2005-01-01

    Since it was first announced at ADASS 2, the Smithsonian/NASA Astrophysics System Abstract Service (ADS) has played a central role in the information seeking behavior of astronomers. Central to the ability of the ADS to act as a search and discovery tool is its role as metadata aggregator. Over the past 13 years the ADS has introduced many new techniques to facilitate information retrieval, broadly defined. We discuss some of these developments, with particular attention to how the ADS might interact with the virtual observatory, and to the new myADS-arXiv customized open access virtual journal. The ADS is at http://ads.harvard.edu

  3. Advanced Topics in Information Retrieval

    CERN Document Server

    Melucci, Massimo

    2011-01-01

    Information retrieval is the science concerned with the effective and efficient retrieval of documents starting from their semantic content. It is employed to fulfill some information need from a large number of digital documents. Given the ever-growing amount of documents available and the heterogeneous data structures used for storage, information retrieval has recently faced and tackled novel applications. In this book, Melucci and Baeza-Yates present a wide-spectrum illustration of recent research results in advanced areas related to information retrieval. Readers will find chapters on e.g

  4. Intelligent Information Retrieval: An Introduction.

    Science.gov (United States)

    Gauch, Susan

    1992-01-01

    Discusses the application of artificial intelligence to online information retrieval systems and describes several systems: (1) CANSEARCH, from MEDLINE; (2) Intelligent Interface for Information Retrieval (I3R); (3) Gausch's Query Reformulation; (4) Environmental Pollution Expert (EP-X); (5) PLEXUS (gardening); and (6) SCISOR (corporate…

  5. Information retrieval in cultural heritage

    NARCIS (Netherlands)

    Koolen, M.; Kamps, J.; de Keijzer, V.

    2009-01-01

    This article discusses the opportunities and challenges of applying modern information retrieval techniques to the cultural heritage domain. Although the field of information retrieval is closely associated with computer science, it originally emerged from library science — also one of the main disc

  6. Contextual Bandits for Information Retrieval

    NARCIS (Netherlands)

    Hofmann, K.; Whiteson, S.; de Rijke, M.

    2011-01-01

    In this paper we give an overview of and outlook on research at the intersection of information retrieval (IR) and contextual bandit problems. A critical problem in information retrieval is online learning to rank, where a search engine strives to improve the quality of the ranked result lists it

  7. Ontology-based Information Retrieval

    DEFF Research Database (Denmark)

    Styltsvig, Henrik Bulskov

    of concept similarity in query evaluation is discussed. A semantic expansion approach that incorporates concept similarity is introduced and a generalized fuzzy set retrieval model that applies expansion during query evaluation is presented. While not commonly used in present information retrieval systems......In this thesis, we will present methods for introducing ontologies in information retrieval. The main hypothesis is that the inclusion of conceptual knowledge such as ontologies in the information retrieval process can contribute to the solution of major problems currently found in information...... retrieval. This utilization of ontologies has a number of challenges. Our focus is on the use of similarity measures derived from the knowledge about relations between concepts in ontologies, the recognition of semantic information in texts and the mapping of this knowledge into the ontologies in use...

  8. Bibliometric-enhanced Information Retrieval

    CERN Document Server

    Mayr, Philipp; Larsen, Birger; Schaer, Philipp; Mutschke, Peter

    2013-01-01

    Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of coauthorship network, can improve retrieval services for specific communities, as well as for large, cross-domain collections. This workshop aims to raise awareness of the missing link between information retrieval (IR) and bibliometrics/scientometrics and to create a common ground for the incorporation of bibliometric-enhanced services into retrieval at the digital library interface.

  9. Second language experience modulates word retrieval effort in bilinguals: Evidence from pupillometry

    Directory of Open Access Journals (Sweden)

    Jens Schmidtke

    2014-02-01

    Bilingual speakers often have less language experience compared to monolinguals as a result of speaking two languages and/or a later age of acquisition of the second language. This may result in weaker and less precise phonological representations of words in memory, which may cause greater retrieval effort during spoken word recognition. To gauge retrieval effort, the present study compared the effects of word frequency, neighborhood density (ND), and level of English experience by testing monolingual English speakers and native Spanish speakers who differed in their age of acquisition of English (early/late). In the experimental paradigm, participants heard English words and matched them to one of four pictures while the pupil size, an indication of cognitive effort, was recorded. Overall, both frequency and ND effects could be observed in the pupil response, indicating that lower frequency and higher ND were associated with greater retrieval effort. Bilingual speakers showed an overall delayed pupil response and a larger ND effect compared to the monolingual speakers. The frequency effect was the same in early bilinguals and monolinguals but was larger in late bilinguals. Within the group of bilingual speakers, higher English proficiency was associated with an earlier pupil response in addition to a smaller frequency and ND effect. These results suggest that greater retrieval effort associated with bilingualism may be a consequence of reduced language experience rather than constitute a categorical bilingual disadvantage. Future avenues for the use of pupillometry in the field of spoken word recognition are discussed.

  10. Second language experience modulates word retrieval effort in bilinguals: evidence from pupillometry.

    Science.gov (United States)

    Schmidtke, Jens

    2014-01-01

    Bilingual speakers often have less language experience compared to monolinguals as a result of speaking two languages and/or a later age of acquisition of the second language. This may result in weaker and less precise phonological representations of words in memory, which may cause greater retrieval effort during spoken word recognition. To gauge retrieval effort, the present study compared the effects of word frequency, neighborhood density (ND), and level of English experience by testing monolingual English speakers and native Spanish speakers who differed in their age of acquisition of English (early/late). In the experimental paradigm, participants heard English words and matched them to one of four pictures while the pupil size, an indication of cognitive effort, was recorded. Overall, both frequency and ND effects could be observed in the pupil response, indicating that lower frequency and higher ND were associated with greater retrieval effort. Bilingual speakers showed an overall delayed pupil response and a larger ND effect compared to the monolingual speakers. The frequency effect was the same in early bilinguals and monolinguals but was larger in late bilinguals. Within the group of bilingual speakers, higher English proficiency was associated with an earlier pupil response in addition to a smaller frequency and ND effect. These results suggest that greater retrieval effort associated with bilingualism may be a consequence of reduced language experience rather than constitute a categorical bilingual disadvantage. Future avenues for the use of pupillometry in the field of spoken word recognition are discussed.

  11. Mobile medical visual information retrieval.

    Science.gov (United States)

    Depeursinge, Adrien; Duc, Samuel; Eggel, Ivan; Müller, Henning

    2012-01-01

    In this paper, we propose mobile access to peer-reviewed medical information based on textual search and content-based visual image retrieval. Web-based interfaces designed for limited screen space were developed to query via web services a medical information retrieval engine optimizing the amount of data to be transferred in wireless form. Visual and textual retrieval engines with state-of-the-art performance were integrated. Results obtained show a good usability of the software. Future use in clinical environments has the potential of increasing quality of patient care through bedside access to the medical literature in context.

  12. Information retrieval in digital environments

    CERN Document Server

    Dinet, Jérôme

    2014-01-01

    Information retrieval is a central and essential activity. It is indeed difficult to find a human activity that does not need to retrieve information in an environment which is often increasingly digital: moving and navigating, learning, having fun, communicating, informing, making a decision, etc. Most human activities are intimately linked to our ability to search quickly and effectively for relevant information, the stakes are sometimes extremely important: passing an exam, voting, finding a job, remaining autonomous, being socially connected, developing a critical spirit, or simply surviv

  13. Cross-language information retrieval

    CERN Document Server

    Nie, Jian-Yun

    2010-01-01

    Search for information is no longer exclusively limited within the native language of the user, but is more and more extended to other languages. This gives rise to the problem of cross-language information retrieval (CLIR), whose goal is to find relevant information written in a different language to a query. In addition to the problems of monolingual information retrieval (IR), translation is the key problem in CLIR: one should translate either the query or the documents from a language to another. However, this translation problem is not identical to full-text machine translation (MT): the

  14. Rhetorical relations for information retrieval

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Lu, Wei

    2012-01-01

    -called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document’s rhetorical...... relations be useful to IR? We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness...

  15. Information Retrieval for Ecological Syntheses

    Science.gov (United States)

    Bayliss, Helen R.; Beyer, Fiona R.

    2015-01-01

    Research syntheses are increasingly being conducted within the fields of ecology and environmental management. Information retrieval is crucial in any synthesis in identifying data for inclusion whilst potentially reducing biases in the dataset gathered, yet the nature of ecological information provides several challenges when compared with…

  16. The Ecosystem of Information Retrieval

    Science.gov (United States)

    Rodriguez-Munoz, Jose-Vicente; Martinez-Mendez, Francisco-Javier; Pastor-Sanchez, Juan-Antonio

    2012-01-01

    Introduction: This paper presents an initial proposal for a formal framework that, by studying the metric variables involved in information retrieval, can establish the sequence of events involved and how to perform it. Method: A systematic approach from the equations of Shannon and Weaver to establish the decidability of information retrieval…

  17. Evaluation of Information Retrieval Systems

    Directory of Open Access Journals (Sweden)

    Keneilwe Zuva

    2012-07-01

    One of the challenges of modern information retrieval is to adequately evaluate an Information Retrieval System (IRS) in order to estimate future performance in a specified application domain. Since there are many algorithms in the literature, the decision to select one for use depends mostly on the evaluation of the systems' performance in the domain. This paper presents how visual and scalar evaluation methods complement one another to adequately evaluate information retrieval systems. The visual evaluation methods are capable of indicating whether one IRS performs better than another IRS fully or partially. The overall performance of an IRS is revealed using scalar evaluation methods. The use of both types of evaluation methods gives a clear picture of the performance of the IRSs. The Receiver Operator Characteristic (ROC) curve and the Precision-Recall (P-R) curve were used to illustrate the visual evaluation methods. Scalar methods, notably precision, recall, Area Under Curve (AUC) and F measure, were used.
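
Three of the scalar measures named above (precision, recall and the F measure) can be computed from a retrieved set and a relevant set as in the following minimal sketch. The function name `precision_recall_f1` and the document identifiers in the usage note are hypothetical; the balanced F1 form of the F measure is assumed.

```python
from typing import Iterable, Tuple


def precision_recall_f1(retrieved: Iterable[str],
                        relevant: Iterable[str]) -> Tuple[float, float, float]:
    """Set-based precision, recall and balanced F measure (F1)."""
    retrieved_set, relevant_set = set(retrieved), set(relevant)
    hits = len(retrieved_set & relevant_set)  # relevant documents that were retrieved

    # Fraction of retrieved documents that are relevant.
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    # Fraction of relevant documents that were retrieved.
    recall = hits / len(relevant_set) if relevant_set else 0.0
    # Harmonic mean of precision and recall; defined as 0 when there are no hits.
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1
```

For example, retrieving `d1, d2, d3, d4` when `d1, d3, d5` are relevant gives two hits, so precision is 2/4, recall is 2/3 and F1 is 4/7.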

  18. Dublin City University at CLEF 2004: experiments in monolingual, bilingual and multilingual retrieval

    OpenAIRE

    Jones, Gareth J.F.; Burke, Michael; Judge, John; Khasin, Anna; Lam-Adesina, Adenike M.; Wagner, Joachim

    2005-01-01

    The Dublin City University group participated in the monolingual, bilingual and multilingual retrieval tasks this year. The main focus of our investigation this year was extending our retrieval system to document languages other than English, and completing the multilingual task comprising four languages: English, French, Russian and Finnish. Results from our French monolingual experiments indicate that working in French is more effective for retrieval than adopting document and topic translat...

  19. Cross-Language Information Retrieval: Experiments Based on CLEF 2000 Corpora.

    Science.gov (United States)

    Savoy, Jacques

    2003-01-01

    Discusses cross-language, multilingual, and bilingual information retrieval on the Web; evaluates retrieval effectiveness of indexing and search strategies based on test collections from CLEF (Cross-Language Evaluation Forum) in English, French, German, and Italian; and suggests and evaluates database merging strategies. Appendices include…

  20. Exeter at CLEF 2003: Experiments with machine translation for monolingual, bilingual and multilingual retrieval

    OpenAIRE

    Lam-Adesina, Adenike M.; Jones, Gareth J.F.

    2004-01-01

    The University of Exeter group participated in the monolingual, bilingual and multilingual-4 retrieval tasks this year. The main focus of our investigation this year was the small multilingual task comprising four languages, French, German, Spanish and English. We adopted a document translation strategy and tested four merging techniques to combine results from the separate document collections, as well as a merged collection strategy. For both the monolingual and bilingual tasks we explored ...

  1. ORDINAL REGRESSION FOR INFORMATION RETRIEVAL

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    This letter presents a new discriminative model for Information Retrieval (IR), referred to as the Ordinal Regression Model (ORM). ORM differs from most existing models in that it views IR as an ordinal regression problem (i.e. a ranking problem) instead of binary classification. Since the task of IR is to rank documents according to the user's information need, IR can be viewed as an ordinal regression problem. Two parameter learning algorithms for ORM are presented. One is a perceptron-based algorithm. The other is the ranking Support Vector Machine (SVM). The effectiveness of the proposed approach has been evaluated on the task of ad hoc retrieval using three English Text REtrieval Conference (TREC) sets and two Chinese TREC sets. Results show that ORM significantly outperforms the state-of-the-art language model approaches and the OKAPI system on all test sets, and that it is more appropriate to view IR as ordinal regression rather than binary classification.
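    To make the "ranking instead of classification" idea concrete, here is a minimal sketch of a generic pairwise ranking perceptron in Python. It is not the ORM algorithm from the letter, whose details are not given in this abstract; the feature vectors, function names and learning rate are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]


def dot(w: Vector, x: Vector) -> float:
    """Inner product of a weight vector and a feature vector."""
    return sum(wi * xi for wi, xi in zip(w, x))


def train_ranking_perceptron(
    pairs: List[Tuple[Vector, Vector]], dim: int, epochs: int = 10, lr: float = 1.0
) -> List[float]:
    """Learn weights so that score(better) > score(worse) for each training pair.

    Each pair holds (features of the more relevant document,
    features of the less relevant document) for the same query.
    """
    w = [0.0] * dim
    for _ in range(epochs):
        mistakes = 0
        for better, worse in pairs:
            if dot(w, better) <= dot(w, worse):  # pair is mis-ordered (or tied)
                for i in range(dim):
                    w[i] += lr * (better[i] - worse[i])
                mistakes += 1
        if mistakes == 0:  # every pair is ordered correctly
            break
    return w


def rank(w: Vector, docs: Dict[str, Vector]) -> List[str]:
    """Order document names by descending learned score."""
    return sorted(docs, key=lambda name: dot(w, docs[name]), reverse=True)


# Hypothetical two-feature training pairs (e.g. a term-match feature and a noise feature).
pairs = [((1.0, 0.0), (0.0, 1.0)), ((0.8, 0.1), (0.2, 0.9))]
w = train_ranking_perceptron(pairs, dim=2)
print(w, rank(w, {"a": (0.1, 0.9), "b": (0.9, 0.1)}))
```

    The update only fires when a preference pair is mis-ordered, so the model is trained on relative order rather than on a relevant/non-relevant label per document.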

  2. INFORMATION RETRIEVAL FOR SHORT DOCUMENTS

    Institute of Scientific and Technical Information of China (English)

    Qi Haoliang; Li Mu; Gao Jianfeng; Li Sheng

    2006-01-01

    The major problem of most current information retrieval models is that individual words provide unreliable evidence about the content of the texts. When the document is short, e.g. when only the abstract is available, the word-use variability problem has a substantial impact on Information Retrieval (IR) performance. To solve the problem, a new approach to short document retrieval named the Reference Document Model (RDM) is put forward in this letter. RDM obtains the statistical semantics of the query/document by pseudo feedback, for both the query and the document, from reference documents. The contributions of this model are three-fold: (1) pseudo feedback for both the query and the document; (2) building the query model and the document model from reference documents; (3) flexible indexing units, which can be any linguistic elements such as documents, paragraphs, sentences, n-grams, terms or characters. For short document retrieval, RDM achieves significant improvements over the classical probabilistic models on the task of ad hoc retrieval on Text REtrieval Conference (TREC) test sets. Results also show that the shorter the document, the better the RDM performance.

  3. EBMPracticeNet: A Bilingual National Electronic Point-Of-Care Project for Retrieval of Evidence-Based Clinical Guideline Information and Decision Support.

    Science.gov (United States)

    Van de Velde, Stijn; Vander Stichele, Robert; Fauquert, Benjamin; Geens, Siegfried; Heselmans, Annemie; Ramaekers, Dirk; Kunnamo, Ilkka; Aertgeerts, Bert

    2013-07-10

    In Belgium, the construction of a national electronic point-of-care information service, EBMPracticeNet, was initiated in 2011 to optimize quality of care by promoting evidence-based decision-making. The collaboration of the government, health care providers, evidence-based medicine (EBM) partners, and vendors of electronic health records (EHR) is unique to this project. All Belgian health care professionals get free access to an up-to-date database of validated Belgian and nearly 1000 international guidelines, incorporated in a portal that also provides EBM information from other sources than guidelines, including computerized clinical decision support that is integrated in the EHRs. The objective of this paper was to describe the development strategy, the overall content, and the management of EBMPracticeNet which may be of relevance to other health organizations creating national or regional electronic point-of-care information services. Several candidate providers of comprehensive guideline solutions were evaluated and one database was selected. Translation of the guidelines to Dutch and French was done with translation software, post-editing by translators and medical proofreading. A strategy is determined to adapt the guideline content to the Belgian context. Acceptance of the computerized clinical decision support tool has been tested and a randomized controlled trial is planned to evaluate the effect on process and patient outcomes. Currently, EBMPracticeNet is in "work in progress" state. Reference is made to the results of a pilot study and to further planned research including a randomized controlled trial. The collaboration of government, health care providers, EBM partners, and vendors of EHRs is unique. The potential value of the project is great. The link between all the EHRs from different vendors and a national database held on a single platform that is controlled by all EBM organizations in Belgium are the strengths of EBMPracticeNet.

  4. An Introduction to Information Retrieval.

    Science.gov (United States)

    International Business Machines Corp., White Plains, NY. Data Processing Div.

    The ways in which digital computers can be used in information storage and retrieval are presented in the language of the nonspecialist. Indexing methods, file organization, and search strategies are discussed and a brief bibliography containing 30 IBM publications is given. The manual is intended as a first reader for those interested in the…

  5. Information Retrieval in the Classroom.

    Science.gov (United States)

    Oley, Elizabeth

    1989-01-01

    Explores aspects of information retrieval skills such as end user training, indexing, controlled vocabulary systems, search protocol, boolean logic, problem analysis, and decision making. Suggests techniques for classroom instruction using simulations of online databases, CD-ROMs, and DIALOG's classroom instruction program. Describes several…

  6. Information Retrieval in Virtual Universities

    Science.gov (United States)

    Puustjärvi, Juha; Pöyry, Päivi

    2006-01-01

    Information retrieval in the context of virtual universities deals with the representation, organization, and access to learning objects. The representation and organization of learning objects should provide the learner with an easy access to the learning objects. In this article, we give an overview of the ONES system, and analyze the relevance…

  7. Automated information retrieval using CLIPS

    Science.gov (United States)

    Raines, Rodney Doyle, III; Beug, James Lewis

    1991-01-01

    Expert systems have considerable potential to assist computer users in managing the large volume of information available to them. One possible use of an expert system is to model the information retrieval interests of a human user and then make recommendations to the user as to articles of interest. At Cal Poly, a prototype expert system written in the C Language Integrated Production System (CLIPS) serves as an Automated Information Retrieval System (AIRS). AIRS monitors a user's reading preferences, develops a profile of the user, and then evaluates items returned from the information base. When prompted by the user, AIRS returns a list of items of interest to the user. In order to minimize the impact on system resources, AIRS is designed to run in the background during periods of light system use.

  8. Rhetorical relations for information retrieval

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Lu, Wei

    2012-01-01

    Typically, every part in most coherent text has some plausible reason for its presence, some function that it performs to the overall semantics of the text. Rhetorical relations, e.g. contrast, cause, explanation, describe how the parts of a text are linked to each other. Knowledge about this so-called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document’s rhetorical relations be useful to IR? We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness...

  9. Interactive information seeking, behaviour and retrieval

    CERN Document Server

    Ruthven, Ian

    2011-01-01

    Information retrieval (IR) is a complex human activity supported by sophisticated systems. This book covers the whole spectrum of information retrieval, including: history and background information; behaviour and seeking task-based information; searching and retrieval approaches to investigating information; and, evaluation interfaces for IR.

  10. Least Information Modeling for Information Retrieval

    CERN Document Server

    Ke, Weimao

    2012-01-01

    We proposed a Least Information theory (LIT) to quantify meaning of information in probability distribution changes, from which a new information retrieval model was developed. We observed several important characteristics of the proposed theory and derived two quantities in the IR context for document representation. Given probability distributions in a collection as prior knowledge, LI Binary (LIB) quantifies least information due to the binary occurrence of a term in a document whereas LI Frequency (LIF) measures least information based on the probability of drawing a term from a bag of words. Three fusion methods were also developed to combine LIB and LIF quantities for term weighting and document ranking. Experiments on four benchmark TREC collections for ad hoc retrieval showed that LIT-based methods demonstrated very strong performances compared to classic TF*IDF and BM25, especially for verbose queries and hard search topics. The least information theory offers a new approach to measuring semantic qua...
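    For context on the baselines mentioned in this abstract, the following is a minimal Python sketch of BM25 scoring. It is not the LIT model, whose formulas are not given here. The sketch assumes the common variant with a smoothed IDF of log(1 + (N - df + 0.5) / (df + 0.5)), default parameters k1 = 1.2 and b = 0.75, pre-tokenized non-empty documents, and hypothetical toy data.

```python
import math
from collections import Counter
from typing import Dict, List


def bm25_scores(
    query: List[str], docs: Dict[str, List[str]], k1: float = 1.2, b: float = 0.75
) -> Dict[str, float]:
    """Score every document against the query with a common BM25 variant.

    Assumes `docs` is non-empty and every document has at least one token.
    """
    n_docs = len(docs)
    avgdl = sum(len(tokens) for tokens in docs.values()) / n_docs

    # Document frequency: number of documents containing each term.
    df: Counter = Counter()
    for tokens in docs.values():
        df.update(set(tokens))

    scores: Dict[str, float] = {}
    for name, tokens in docs.items():
        tf = Counter(tokens)
        score = 0.0
        for term in query:
            if tf[term] == 0:
                continue
            idf = math.log(1.0 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Length-normalized term-frequency saturation.
            norm = tf[term] + k1 * (1.0 - b + b * len(tokens) / avgdl)
            score += idf * tf[term] * (k1 + 1.0) / norm
        scores[name] = score
    return scores


# Hypothetical toy collection.
docs = {
    "d1": ["black", "hole", "information"],
    "d2": ["information", "retrieval", "model"],
    "d3": ["music", "retrieval"],
}
print(bm25_scores(["information", "retrieval"], docs))
```

    The two parameters control the behaviour that term-weighting models compete on: k1 caps the contribution of repeated terms, and b sets how strongly long documents are penalized.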

  11. Information retrieval and individual differences

    Directory of Open Access Journals (Sweden)

    Polona Vilar

    2008-01-01

    Full Text Available The paper presents individual differences, which are found in studies of information retrieval with emphasis on models of personality traits, cognitive and learning styles. It pays special attention to those models which are most often included in studies of information behaviour, information seeking, perceptions of IR systems, etc., but also brings forward some models which have not yet been included in such studies. Additionally, the relationship between different individual characteristics and individual’s chosen profession or academic area is discussed. In this context, the paper presents how investigation of individual differences can be useful in the design of IR systems.

  12. Interactive Information Retrieval: Context and Basic Notions

    Directory of Open Access Journals (Sweden)

    David Robins

    2000-01-01

    Full Text Available This paper provides an introduction to interactive information retrieval--the study of human interaction with information retrieval systems. Interactive information retrieval may be contrasted with the "system-centered" view of information retrieval in which changes to information retrieval system variables are manipulated in isolation from users in laboratory situations. The paper elucidates current models of interactive information retrieval, namely, the episodic model, the stratified model, the interactive feedback and search process model, and the global model of polyrepresentation. Future directions for research in the field are discussed.

  13. Miracle’s 2005 Approach to Cross-lingual Information Retrieval

    OpenAIRE

    González Cristóbal, José Carlos; Goñi Menoyo, José Miguel; Villena Román, Julio

    2005-01-01

    This paper presents the Miracle team’s 2005 approach to Bilingual and Multilingual Information Retrieval. In the multilingual track, we have concentrated our work on the process of merging the results of monolingual runs to obtain the overall multilingual result, relying on available translations. In the bilingual and multilingual tracks, we have used available translation resources, and in some cases we have used a combined approach.

  14. Inhibitory control, word retrieval and bilingual aphasia: is there a relationship?

    Directory of Open Access Journals (Sweden)

    Yasmeen Faroqi-Shah

    2014-04-01

    Responses to incongruent trials were slower than congruent for linguistic (F(1,126.3=44.9, p.05. While confrontation naming and category fluency were highly correlated (r=.77, p<.001, correlations between linguistic and non-linguistic inhibition and between word retrieval and linguistic/non-linguistic inhibition were non-significant. This large group study poses challenges to theories of bilingual advantage and the role of non-lexical inhibitory measures in word retrieval in PWA.

  15. SEMANTIC TERM BASED INFORMATION RETRIEVAL USING ONTOLOGY

    OpenAIRE

    2014-01-01

    Information searching and retrieval is a challenging task in traditional keyword-based textual information retrieval systems. In the growing information age, huge amounts of data are added every day, which aggravates the search problem. Keyword-based retrieval systems return large numbers of junk documents irrelevant to the query. To address these limitations, this paper proposes using query terms along with semantic terms for information retrieval, with reference to multiple ontologies. User query sometimes reflects multiple ...

  16. Personalized Mobile Information Retrieval System

    Directory of Open Access Journals (Sweden)

    Okkyung Choi

    2012-04-01

    Full Text Available Building global network relations over the internet has greatly changed personal information systems, and even comments left on a webpage of an SNS (Social Network Service) are appreciated as important elements that can provide valuable information for someone. A social network is a relation between individuals or groups, represented in a graph model, which converts the concept of psychological and social relations into a logical structure by using nodes and links. However, most current personalized systems based on social networks are built mainly for the PC environment, and are neither designed nor implemented for the mobile environment. Hence, the objective of this study is to propose methods of providing a Personalized Mobile Information Retrieval System using NFC (Near Field Communication) smartphones, intended for smartphone users. In addition, this study aims to verify its efficiency through a comparative analysis with existing studies.

  18. Formal and Informal Academic Language Socialization of a Bilingual Child

    Science.gov (United States)

    Cho, Hyonsuk

    2016-01-01

    This ethnographic case study examines a bilingual child's academic socialization in both formal and informal academic communities. The study follows a high-achieving, bilingual student in a public US elementary school, who paradoxically is seen as a slow learner in her Korean-American Sunday school. From the academic socialization and community of…

  19. Biomedical information retrieval across languages.

    Science.gov (United States)

    Daumke, Philipp; Markó, Kornél; Poprat, Michael; Schulz, Stefan; Klar, Rüdiger

    2007-06-01

    This work presents a new dictionary-based approach to biomedical cross-language information retrieval (CLIR) that addresses many of the general and domain-specific challenges in current CLIR research. Our method is based on a multilingual lexicon that was generated partly manually and partly automatically, and currently covers six European languages. It contains morphologically meaningful word fragments, termed subwords. Using subwords instead of entire words significantly reduces the number of lexical entries necessary to sufficiently cover a specific language and domain. Mediation between queries and documents is based on these subwords as well as on lists of word-n-grams that are generated from large monolingual corpora and constitute possible translation units. The translations are then sent to a standard Internet search engine. This process makes our approach an effective tool for searching the biomedical content of the World Wide Web in different languages. We evaluate this approach using the OHSUMED corpus, a large medical document collection, within a cross-language retrieval setting.

  20. Information retrieval from black holes

    Science.gov (United States)

    Lochan, Kinjalk; Chakraborty, Sumanta; Padmanabhan, T.

    2016-08-01

    It is generally believed that, when matter collapses to form a black hole, the complete information about the initial state of the matter cannot be retrieved by future asymptotic observers, through local measurements. This is contrary to the expectation from a unitary evolution in quantum theory and leads to (a version of) the black hole information paradox. Classically, nothing else, apart from mass, charge, and angular momentum is expected to be revealed to such asymptotic observers after the formation of a black hole. Semiclassically, black holes evaporate after their formation through the Hawking radiation. The dominant part of the radiation is expected to be thermal and hence one cannot know anything about the initial data from the resultant radiation. However, there can be sources of distortions which make the radiation nonthermal. Although the distortions are not strong enough to make the evolution unitary, these distortions carry some part of information regarding the in-state. In this work, we show how one can decipher the information about the in-state of the field from these distortions. We show that the distortions of a particular kind—which we call nonvacuum distortions—can be used to fully reconstruct the initial data. The asymptotic observer can do this operationally by measuring certain well-defined observables of the quantum field at late times. We demonstrate that a general class of in-states encode all their information content in the correlation of late time out-going modes. Further, using a 1 +1 dimensional dilatonic black hole model to accommodate backreaction self-consistently, we show that observers can also infer and track the information content about the initial data, during the course of evaporation, unambiguously. Implications of such information extraction are discussed.

  1. Multimedia information retrieval theory and techniques

    CERN Document Server

    Raieli, Roberto

    2013-01-01

    Novel processing and searching tools for the management of new multimedia documents have been developed. Multimedia Information Retrieval (MMIR) is an organic system made up of Text Retrieval (TR); Visual Retrieval (VR); Video Retrieval (VDR); and Audio Retrieval (AR) systems. So that each type of digital document may be analysed and searched by the elements of language appropriate to its nature, search criteria must be extended. Such an approach is known as Content Based Information Retrieval (CBIR), and is the core of MMIR. This novel content-based concept of information handling needs to be integrated with more traditional semantics. Multimedia Information Retrieval focuses on the tools of processing and searching applicable to the content-based management of new multimedia documents. Translated from Italian by Giles Smith, the book is divided into two parts. Part one discusses MMIR and related theories, and puts forward new methodologies; part two reviews various experimental and operating MMIR systems, a...

  2. A Unified Mathematical Definition of Classical Information Retrieval.

    Science.gov (United States)

    Dominich, Sandor

    2000-01-01

    Presents a unified mathematical definition for the classical models of information retrieval and identifies a mathematical structure behind relevance feedback. Highlights include vector information retrieval; probabilistic information retrieval; and similarity information retrieval. (Contains 118 references.) (Author/LRW)

  3. Information retrieval from black holes

    CERN Document Server

    Lochan, Kinjalk; Padmanabhan, T

    2016-01-01

    It is generally believed that, when matter collapses to form a black hole, the complete information about the initial state of the matter cannot be retrieved by future asymptotic observers, through local measurements. This is contrary to the expectation from a unitary evolution in quantum theory and leads to (a version of) the black hole information paradox. Classically, nothing else, apart from mass, charge and angular momentum is expected to be revealed to such asymptotic observers after the formation of a black hole. Semi-classically, black holes evaporate after their formation through the Hawking radiation. The dominant part of the radiation is expected to be thermal and hence one cannot know anything about the initial data from the resultant radiation. However, there can be sources of distortions which make the radiation non-thermal. Although the distortions are not strong enough to make the evolution unitary, these distortions carry some part of information regarding the in-state. In this work, we show ...

  4. BIR 2014 - Bibliometric-enhanced Information Retrieval

    DEFF Research Database (Denmark)

    This first “Bibliometric-enhanced Information Retrieval” (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although...... for the incorporation of bibliometric-enhanced services into retrieval at the digital library interface. Our interests include information retrieval, information seeking, science modelling, network analysis, and digital libraries. The goal is to apply insights from bibliometrics, scientometrics, and informetrics...

  5. Probabilistic Modeling in Dynamic Information Retrieval

    OpenAIRE

    Sloan, M. C.

    2016-01-01

    Dynamic modeling is used to design systems that are adaptive to their changing environment and is currently poorly understood in information retrieval systems. Common elements in the information retrieval methodology, such as documents, relevance, users and tasks, are dynamic entities that may evolve over the course of several interactions, which is increasingly captured in search log datasets. Conventional frameworks and models in information retrieval treat these elements as static, or only...

  6. A High-Speed Information Retrieval System

    Institute of Scientific and Technical Information of China (English)

    SHI Shu-dong; LI Zhi-tang

    2004-01-01

    We developed a high-speed information retrieval system. The system, based on the IXP 2800, is a dedicated device. Its information retrieval throughput is 6.8 Gb/s. It supports various network protocols, including Telnet, FTP, SMTP, and POP3. Retrieval supports keywords and natural language processing. This paper explains the hardware system, the software system, and the performance indices.

  7. Query space reduction in information retrieval

    OpenAIRE

    Kelledy, Fergus

    1997-01-01

    Today’s rapidly expanding and dynamic information age, coupled with users who are becoming more discerning about what information they want and when they want it, poses a serious challenge to information retrieval systems in their attempt to match users’ information needs with information repositories. To date most research on information retrieval has concentrated on improving system effectiveness. However as the amount of online information and the number of users concurrently accessing t...

  8. Information retrieval implementing and evaluating search engines

    CERN Document Server

    Büttcher, Stefan; Cormack, Gordon V

    2016-01-01

    Information retrieval is the foundation for modern search engines. This textbook offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus -- a multiuser open-source information retrieval system developed by one of the authors and available online -- provides model implementations and a basis for student work. The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. In addition to its classroom use, Information Retrieval will be a valuable reference for professionals in computer science, computer engineering, and software engineering.

  9. Using Induction to Refine Information Retrieval Strategies

    Science.gov (United States)

    Baudin, Catherine; Pell, Barney; Kedar, Smadar

    1994-01-01

    Conceptual information retrieval systems use structured document indices, domain knowledge and a set of heuristic retrieval strategies to match user queries with a set of indices describing the document's content. Such retrieval strategies increase the set of relevant documents retrieved (increase recall), but at the expense of returning additional irrelevant documents (decrease precision). Usually in conceptual information retrieval systems this tradeoff is managed by hand and with difficulty. This paper discusses ways of managing this tradeoff by the application of standard induction algorithms to refine the retrieval strategies in an engineering design domain. We gathered examples of query/retrieval pairs during the system's operation using feedback from a user on the retrieved information. We then fed these examples to the induction algorithm and generated decision trees that refine the existing set of retrieval strategies. We found that (1) induction improved the precision on a set of queries generated by another user, without a significant loss in recall, and (2) in an interactive mode, the decision trees pointed out flaws in the retrieval and indexing knowledge and suggested ways to refine the retrieval strategies.

  10. Modelling and Retrieving Audiovisual Information - A Soccer Video Retrieval System

    NARCIS (Netherlands)

    Woudstra, A.; Velthausz, D.D.; Poot, de H.J.G.; Moelaart El-Hadidy, F.; Jonker, W.; Houtsma, M.A.W.; Heller, R.G.; Heemskerk, J.N.H.

    1998-01-01

    This paper describes the results of an ongoing collaborative project between KPN Research and the Telematics Institute on multimedia information handling. The focus of the paper is the modelling and retrieval of audiovisual information. The paper presents a general framework for modeling multimedia

  11. Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval

    Science.gov (United States)

    Zhang, Qingqing; Pan, Jielin; Lin, Yang; Shao, Jian; Yan, Yonghong

    In recent decades, there has been a great deal of research into the problem of bilingual speech recognition, that is, developing a recognizer that can handle inter- and intra-sentential language switching between two languages. This paper presents our recent work on the development of a grammar-constrained, Mandarin-English bilingual Speech Recognition System (MESRS) for real world music retrieval. Two of the main difficulties in building bilingual speech recognition systems for real world applications are tackled in this paper. One is to balance the performance and the complexity of the bilingual speech recognition system; the other is to deal effectively with the matrix language accents in the embedded language. In order to process intra-sentential language switching and reduce the amount of data required to robustly estimate statistical models, a compact single set of bilingual acoustic models derived by phone set merging and clustering is developed instead of using two separate monolingual models for each language. In our study, a novel Two-pass phone clustering method based on Confusion Matrix (TCM) is presented and compared with the log-likelihood measure method. Experiments show that TCM achieves better performance. Since the potential system users' native language is Mandarin, which is regarded as the matrix language in our application, their pronunciations of English as the embedded language usually contain Mandarin accents. In order to deal with the matrix language accents in the embedded language, different non-native adaptation approaches are investigated. Experiments show that the model retraining method outperforms other common adaptation methods such as Maximum A Posteriori (MAP). With the effective incorporation of approaches on phone clustering and non-native adaptation, the Phrase Error Rate (PER) of MESRS for English utterances was reduced by a relative 24.47% compared to the baseline monolingual English system while the PER on Mandarin utterances was

  12. Data Fusion in Information Retrieval

    CERN Document Server

    Wu, Shengli

    2012-01-01

    The technique of data fusion has been used extensively in information retrieval due to the complexity and diversity of tasks involved such as web and social networks, legal, enterprise, and many others. This book presents both a theoretical and empirical approach to data fusion. Several typical data fusion algorithms are discussed, analyzed and evaluated. A reader will find answers to the following questions, among others:
    - What are the key factors that affect the performance of data fusion algorithms significantly?
    - What conditions are favorable to data fusion algorithms?
    - CombSum and CombMNZ: which one is better, and why?
    - What is the rationale of using the linear combination method?
    - How can the best fusion option be found under any given circumstances?
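
    As a rough illustration of the two fusion methods named in this blurb, here is a minimal Python sketch of CombSUM and CombMNZ. It is not taken from the book. It assumes min-max score normalization per run, non-empty runs, and that CombMNZ multiplies the summed score by the number of runs that retrieved the document; the run contents are hypothetical.

```python
from typing import Dict, List

Run = Dict[str, float]  # document id -> raw retrieval score from one system


def min_max_normalize(run: Run) -> Run:
    """Rescale one run's scores to [0, 1] so that runs are comparable."""
    lo, hi = min(run.values()), max(run.values())
    if hi == lo:
        return {doc: 1.0 for doc in run}
    return {doc: (score - lo) / (hi - lo) for doc, score in run.items()}


def comb_sum(runs: List[Run]) -> Run:
    """CombSUM: add up the normalized scores a document gets across runs."""
    fused: Run = {}
    for run in runs:
        for doc, score in min_max_normalize(run).items():
            fused[doc] = fused.get(doc, 0.0) + score
    return fused


def comb_mnz(runs: List[Run]) -> Run:
    """CombMNZ: CombSUM score times the number of runs that retrieved the document."""
    sums = comb_sum(runs)
    counts: Dict[str, int] = {}
    for run in runs:
        for doc in run:
            counts[doc] = counts.get(doc, 0) + 1
    return {doc: sums[doc] * counts[doc] for doc in sums}


# Hypothetical runs from two retrieval systems with different score scales.
run_a = {"d1": 3.0, "d2": 1.0, "d3": 2.0}
run_b = {"d2": 10.0, "d4": 0.0}
print(comb_sum([run_a, run_b]))
print(comb_mnz([run_a, run_b]))
```

    On this toy input CombSUM ties d1 and d2, while CombMNZ ranks d2 first because both systems retrieved it, which is the kind of behavioural difference behind the "which one is better" question.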

  13. Arabic Studies’ Progress in Information Retrieval

    Directory of Open Access Journals (Sweden)

    Essam Hanandeh

    2016-02-01

    Full Text Available The field of information retrieval has witnessed tangible progress over the past decades in response to the expanded usage of the internet and the dire need of users to search for massive amounts of digital information. Given the steady increase of Arabic e-content, excellent information retrieval systems must be devised to suit the nature and requirements of the Arabic language. This paper sheds light on the current progress in the field of Arabic information retrieval, identifies the challenges that hinder the progress of this science, and proposes suggestions for further research. This paper uses the descriptive analytical method to examine the reality of Arabic studies in the field of information retrieval and to study the problems that are being faced in this area. Specifically, the previous literature on information retrieval is reviewed by searching the related databases and websites.

  14. Information Retrieval beyond the Text Document.

    Science.gov (United States)

    Rui, Yong; Ortega, Michael; Huang, Thomas S.; Mehrotra, Sharad

    1999-01-01

    Reports some of the progress made over the years toward exploring information beyond the text domain. Describes the Multimedia Analysis and Retrieval Systems (MARS), developed to increase access to non-textual information. Addresses the following aspects of MARS: (1) visual feature extraction; (2) retrieval models; (3) query reformulation…

  15. Applications of Optical Technology: Information Retrieval.

    Science.gov (United States)

    O'Connor, Mary Ann

    1991-01-01

    Discusses applications of optical technology, especially CD-ROMs, to information management needs. Information retrieval problems are discussed; design questions that concern the format of the data, indexing methods, and retrieval capabilities are presented; the need for updates is considered; access requirements are discussed; and the importance…

  16. Expert Systems and Intelligent Information Retrieval.

    Science.gov (United States)

    Brooks, H. M.

    1987-01-01

    Explores what an intelligent information retrieval system involves and why expert system techniques might interest system designers. Expert systems research is reviewed with emphasis on components, architecture, and computer interaction, and it is concluded that information retrieval is not an ideal problem domain for expert system application at…

  17. Progress in Documentation: Pictorial Information Retrieval.

    Science.gov (United States)

    Enser, P. G. B.

    1995-01-01

    Surveys theoretical and practical issues associated with pictorial information retrieval. Concentrating on still and moving pictorial forms of the visual image, this paper focuses on indexing pictorial material and discusses four models of pictorial information retrieval corresponding with permutations of the verbal and visual modes for the…

  18. Information Retrieval Interaction: an Analysis of Models

    Directory of Open Access Journals (Sweden)

    Farahnaz Sadoughi

    2012-03-01

    Full Text Available Information searching is an interactive process: users control the search process and can manage its results. During this process, the user's question matures in light of the retrieved results. In addition, on the side of the information retrieval system, some processes cannot be carried out without the user. In practice, this is most evident in “Interaction” (i.e. the process by which the user connects to the other system elements) and in “Relevance judgment”. This paper first looks at the role of “Interaction” in information retrieval. Then the traditional model of information retrieval and its strengths and weaknesses are reviewed. Finally, the current models of interactive information retrieval are elucidated: Belkin's episodic model, Ingwersen's cognitive model, Saracevic's stratified model, and Spink's interactive feedback model.

  19. SEMANTIC TERM BASED INFORMATION RETRIEVAL USING ONTOLOGY

    Directory of Open Access Journals (Sweden)

    J. Mannar Mannan

    2014-01-01

    Full Text Available Information searching and retrieval is a challenging task in traditional keyword-based textual information retrieval systems. In the growing information age, with huge amounts of data added every day, the search problem has grown as well. A keyword-based retrieval system returns a bulk of junk documents irrelevant to the query. To address these limitations, this paper proposes using query terms along with semantic terms for information retrieval, drawing on multiple ontologies. A user query sometimes reflects multiple domains of interest, which requires us to collect semantically related ontologies. If no related ontology exists, then the WordNet ontology is used to retrieve semantic terms related to the query term. In this approach, classes in the ontology are derived as semantically related text keywords, and these keywords are considered when ranking the documents.

  20. BIR 2014 - Bibliometric-enhanced Information Retrieval

    DEFF Research Database (Denmark)

    This first “Bibliometric-enhanced Information Retrieval” (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although...... analysis of co-authorship network, can improve retrieval services for specific communities, as well as for large, cross-domain collections. This workshop aims to raise awareness of the missing link between information retrieval (IR) and bibliometrics / scientometrics and to create a common ground...... for the incorporation of bibliometric-enhanced services into retrieval at the digital library interface. Our interests include information retrieval, information seeking, science modelling, network analysis, and digital libraries. The goal is to apply insights from bibliometrics, scientometrics, and informetrics...

  1. Bibliometric-enhanced information retrieval

    NARCIS (Netherlands)

    Mayr, Philipp; Scharnhorst, Andrea; Larsen, Birger; Schaer, Philipp; Mutschke, Peter

    2014-01-01

    Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of coauthorship network, can im

  2. Information Retrieval and the Philosophy of Language.

    Science.gov (United States)

    Blair, David C.

    2003-01-01

    Provides an overview of some of the main ideas in the philosophy of language that have relevance to the issues of information retrieval, focusing on the description of the intellectual content. Highlights include retrieval problems; recall and precision; words and meanings; context; externalism and the philosophy of language; and scaffolding and…

  3. Concept Framework for Audio Information Retrieval: ARF

    Institute of Scientific and Technical Information of China (English)

    LI GuoHui(李国辉); WU DeFeng(武德峰); ZHANG Jun(张军)

    2003-01-01

    Most research on content-based retrieval has focused on visual media. However, audio is also an important medium and information carrier from the viewpoint of human auditory perception, so retrieval over audio collections is needed. Audio is handled by conventional methods as an opaque stream medium, which is not suitable for information retrieval by its content. In fact, audio carries rich aural information in the form of speech, music, and sound effects, so it can be retrieved based on its aural content, such as acoustic features, musical melodies and associated semantics. In this paper, a concept framework (ARF) for content-based audio retrieval is proposed from a systematic perspective, which describes an audio content model, an audio retrieval architecture and audio query schemes. Audio contents are represented by a hierarchical model and a set of formal descriptions from the physical to the acoustic to the semantic level, which depict the acoustic features, logical structure and semantics of audio and audio objects. The architecture, consisting of an audio meta-database and populating and accessing modules, presents a system structure view of audio information retrieval. The query schemes give generalized approaches and modes concerning how users deliver audio information needs to audio collections. Finally, an implemented audio retrieval example is used to explain and specify the application of the components in the proposed ARF.

  4. Parsimonious Language Models for Information Retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Robertson, Stephen; Zaragoza, Hugo

    2004-01-01

    We systematically investigate a new approach to estimating the parameters of language models for information retrieval, called parsimonious language models. Parsimonious language models explicitly address the relation between levels of language models that are typically used for smoothing. As such,

  5. Current challenges in patent information retrieval

    CERN Document Server

    Lupu, Mihai; Kando, Noriko

    2017-01-01

    Intellectual property in the form of patents plays a vital role in today's increasingly knowledge-based economy. This book assembles state-of-the art research and is intended to illustrate innovative approaches to patent information retrieval.

  6. Introducing Multimedia Information Retrieval to libraries

    OpenAIRE

    2016-01-01

    The article aims to introduce libraries to the view that operating within the terms of traditional Information Retrieval (IR), using textual language alone, is limiting, and that taking into account the broader criteria of Multimedia Information Retrieval (MIR) is instead necessary. The article traces the history of the fundamental principles of MIR, from the early years of the debate on documentation to today's theories on semantic meanings. Also discussed are...

  7. Phase retrieval with prior information.

    Science.gov (United States)

    Irwan, R; Lane, R G

    1998-09-01

    An algorithm for phase retrieval with Bayesian statistics is discussed. It is shown how the statistics of Kolmogorov turbulence can be used to compute the likelihood for a particular phase screen. This likelihood is then added to that of the observed data to produce a functional that is maximized directly by use of conjugate gradient maximization. It is shown that although this can significantly improve the quality of the phase estimate, the issue is complicated by local maxima introduced by the possibility of phase wrapping. The causes of the local maxima are analyzed, and a method that increases the likelihood of convergence to the global maximum is presented.
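
    One way to read the functional described in this abstract is as a log-posterior. The notation below is an assumption made for illustration and is not taken from the paper: \phi is the phase screen, d the observed data, and C_\phi a phase covariance matrix derived from Kolmogorov statistics, with a zero-mean Gaussian prior assumed.

```latex
J(\phi) = \ln p(d \mid \phi) + \ln p(\phi),
\qquad
\ln p(\phi) = -\tfrac{1}{2}\,\phi^{\mathsf{T}} C_\phi^{-1}\,\phi + \text{const},
\qquad
\hat{\phi} = \arg\max_{\phi} J(\phi).
```

    Under these assumptions, if the data term depends on \phi only through e^{i\phi}, then adding a multiple of 2\pi to any phase sample leaves \ln p(d \mid \phi) unchanged while changing the prior term, which is one plausible way to see how phase wrapping can give rise to local maxima of J.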

  8. The Application of the Comparable Corpora in Chinese-English Cross-Lingual Information Retrieval

    Institute of Scientific and Technical Information of China (English)

    DU Lin; ZHANG Yibo; SUN Le; SUN Yufang

    2001-01-01

    This paper proposes a novel Chinese-English Cross-Lingual Information Retrieval (CECLIR) model PME, in which bilingual dictionary and comparable corpora are used to translate the query terms. The proximity and mutual information of the term-pairs in the Chinese and English comparable corpora are employed not only to resolve the translation ambiguities but also to perform the query expansion so as to deal with the out-of-vocabulary issues in the CECLIR. The evaluation results show that the query precision of PME algorithm is about 84.4% of the monolingual information retrieval.
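
    The idea of using mutual information between term pairs to resolve translation ambiguity can be sketched as follows. This is a generic illustration, not the PME model from the paper: the pointwise mutual information formula, the exhaustive search over candidate combinations, the `FLOOR` value, the romanized source terms and all counts are hypothetical, and the proximity component mentioned in the abstract is not modeled.

```python
import math
from itertools import combinations, product

FLOOR = -10.0  # arbitrary score for pairs never seen together (assumption)


def pmi(pair_count, count_a, count_b, total):
    """Pointwise mutual information of two terms from co-occurrence counts."""
    if pair_count == 0:
        return FLOOR
    return math.log(pair_count * total / (count_a * count_b))


def best_translation(candidates, term_counts, pair_counts, total):
    """Pick one target term per source term so the summed pairwise PMI is maximal."""
    best, best_score = None, float("-inf")
    for choice in product(*candidates.values()):
        score = sum(
            pmi(pair_counts.get(frozenset((a, b)), 0),
                term_counts[a], term_counts[b], total)
            for a, b in combinations(choice, 2)
        )
        if score > best_score:
            best, best_score = choice, score
    return dict(zip(candidates, best))


# Hypothetical dictionary candidates for two romanized Chinese query terms.
candidates = {"chengxu": ["program", "procedure"], "yunxing": ["run", "operate"]}
# Hypothetical counts from an English corpus of 1000 text windows.
term_counts = {"program": 100, "procedure": 50, "run": 80, "operate": 40}
pair_counts = {
    frozenset(("program", "run")): 40,
    frozenset(("procedure", "operate")): 2,
    frozenset(("program", "operate")): 4,
    frozenset(("procedure", "run")): 4,
}

chosen = best_translation(candidates, term_counts, pair_counts, total=1000)
print(chosen)
```

    The exhaustive search is only practical for short queries, since the number of combinations grows multiplicatively with the number of candidates per term; a real system would likely need a greedy or beam-style approximation.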

  9. Designing and Implementing a Cross-Language Information Retrieval System Using Linguistic Corpora

    Directory of Open Access Journals (Sweden)

    Amin Nezarat

    2012-03-01

    Full Text Available Information retrieval (IR) is a crucial area of natural language processing (NLP) and can be defined as finding documents whose content is relevant to the query need of a user. Cross-language information retrieval (CLIR) refers to a kind of information retrieval in which the language of the query and that of the searched documents are different. In fact, it is a retrieval process where the user presents queries in one language to retrieve documents in another language. This paper attempts to construct a bilingual lexicon of parallel chunks of English and Persian from two very large monolingual corpora and an English-Persian parallel corpus, which could be directly applied to cross-language information retrieval tasks. For this purpose, a statistical measure known as Association Score (AS) was used to compute the association value between every two corresponding chunks in the corpus using a couple of complicated algorithms. Once the CLIR system was developed using this bilingual lexicon, an experiment was performed on a set of one hundred English and Persian phrases and collocations to see to what extent this system was effective in helping users find the most relevant and suitable equivalents of their queries in either language.

  10. Using ontology for domain specific information retrieval

    Science.gov (United States)

    Shashirekha, H. L.; Murali, S.; Nagabhushan, P.

    2010-02-01

    This paper presents a system for retrieving information from a domain specific document collection made up of data rich unnatural language text documents. Instead of conventional keyword based retrieval, our system makes use of domain ontology to retrieve the information from a collection of documents. The system addresses the problem of representing unnatural language text documents and constructing a classifier model that helps in the efficient retrieval of relevant information. Query to this system may be either the key phrases in terms of concepts or a domain specific unnatural language text document. The classifier used in this system can also be used to assign multiple labels to the previously unseen text document belonging to the same domain. An empirical evaluation of the system is conducted on the domain of text documents describing the classified matrimonial advertisements to determine its performance.

  11. Web information retrieval based on ontology

    Science.gov (United States)

    Zhang, Jian

    2013-03-01

    The purpose of Information Retrieval (IR) is to find a set of documents that are relevant to a specific information need of a user. The traditional Information Retrieval model commonly used in commercial search engines is based on a keyword indexing system and Boolean logic queries. One big drawback of traditional information retrieval systems is that they typically retrieve information without an explicitly defined domain of interest to the user, so that a lot of irrelevant information is returned, which burdens the user with picking out useful answers from these irrelevant results. In order to tackle this issue, many semantic web information retrieval models have been proposed recently. The main advantage of the Semantic Web is that it enhances search mechanisms through the use of ontologies. In this paper, we present our approach to personalizing a web search engine based on ontology. In addition, key techniques are also discussed in our paper. Compared to previous research, our work concentrates on semantic similarity and on the whole process, including query submission and information annotation.

  12. Activities of information retrieval in Daicel Corporation : The roles and efforts of information retrieval team

    Science.gov (United States)

    Yamazaki, Towako

    In order to stabilize and improve the quality of its information retrieval service, the information retrieval team of Daicel Corporation has worked on standard operating procedures, an interview sheet for information retrieval, a structured format for search reports, and search expressions for some of Daicel's technological fields. These activities and efforts will also lead to the sharing and passing on of skills between searchers. In addition, skill improvements are needed not only for each searcher individually, but also for the information retrieval team as a whole as searchers take on new roles.

  13. Language-based multimedia information retrieval

    OpenAIRE

    De Jong; Gauvain, J.L.; Hiemstra, D; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material by use of human language technologies. Thus, in contrast to image or sound-based retrieval methods, where both the query language and the indexing methods build on non-linguistic data, these methods...

  14. Exploiting salient semantic analysis for information retrieval

    Science.gov (United States)

    Luo, Jing; Meng, Bo; Quan, Changqin; Tu, Xinhui

    2016-11-01

    Recently, many Wikipedia-based methods have been proposed to improve the performance of different natural language processing (NLP) tasks, such as semantic relatedness computation, text classification and information retrieval. Among these methods, salient semantic analysis (SSA) has been proven to be an effective way to generate conceptual representations for words or documents. However, its feasibility and effectiveness in information retrieval are mostly unknown. In this paper, we study how to efficiently use SSA to improve information retrieval performance, and propose an SSA-based retrieval method under the language model framework. First, the SSA model is adopted to build conceptual representations for documents and queries. Then, these conceptual representations and the bag-of-words (BOW) representations can be used in combination to estimate the language models of queries and documents. The proposed method is evaluated on several standard Text REtrieval Conference (TREC) collections. Experimental results show that the proposed models consistently outperform the existing Wikipedia-based retrieval methods.

  15. Hybrid Information Retrieval Model For Web Images

    CERN Document Server

    Bassil, Youssef

    2012-01-01

    The Big Bang of the Internet in the early 90's dramatically increased the number of images being distributed and shared over the web. As a result, image information retrieval systems were developed to index and retrieve image files spread over the Internet. Most of these systems are keyword-based: they search for images based on their textual metadata and are thus imprecise, as describing an image in human language is inherently vague. There also exist content-based image retrieval systems, which search for images based on their visual information. However, content-based systems are still immature and not very effective, as they suffer from low retrieval recall and precision. This paper proposes a new hybrid image information retrieval model for indexing and retrieving web images published in HTML documents. The distinguishing mark of the proposed model is that it is based on both graphical content and textual metadata. The graphical content is denoted by color features and color histogram of ...

  16. Learning to rank for information retrieval

    CERN Document Server

    Liu, Tie-Yan

    2011-01-01

    Due to the fast growth of the Web and the difficulties in finding desired information, efficient and effective information retrieval systems have become more important than ever, and the search engine has become an essential tool for many people. The ranker, a central component in every search engine, is responsible for the matching between processed queries and indexed documents. Because of its central role, great attention has been paid to the research and development of ranking technologies. In addition, ranking is also pivotal for many other information retrieval applications, such as coll

  17. Teaching a Heuristic Approach to Information Retrieval.

    Science.gov (United States)

    Ury, Connie Jo; And Others

    1997-01-01

    Discusses lifelong learning and the need for information retrieval skills, and describes how Northwest Missouri State University incorporates a heuristic model of library instruction in which students continually evaluate and refine information-seeking practices while progressing through all levels of courses in diverse disciplines. (Author/LRW)

  18. BIRS - Bioterrorism Information Retrieval System.

    Science.gov (United States)

    Tewari, Ashish Kumar; Rashi; Wadhwa, Gulshan; Sharma, Sanjeev Kumar; Jain, Chakresh Kumar

    2013-01-01

    Bioterrorism is the intentional use of pathogenic strains of microbes to spread terror in a population. There is a definite need to promote research for the development of vaccines, therapeutics and diagnostic methods as part of preparedness for any bioterror attack in the future. BIRS is an open-access database of collective information on the organisms related to bioterrorism. The database architecture uses current open-source technology, viz. PHP ver 5.3.19 and MySQL, with an IIS server on the Windows platform. The database stores information on literature, generic information and unique pathways of about 10 microorganisms involved in bioterrorism. This may serve as a collective repository to accelerate the drug discovery and vaccine design process against such bioterrorist agents (microbes). The available data has been validated from various online resources and literature mining in order to provide the user with a comprehensive information system. The database is freely available at http://www.bioterrorism.biowaves.org.

  19. Emergent web intelligence advanced information retrieval

    CERN Document Server

    Badr, Youakim; Abraham, Ajith; Hassanien, Aboul-Ella

    2010-01-01

    Web Intelligence explores the impact of artificial intelligence and advanced information technologies representing the next generation of Web-based systems, services, and environments, and designing hybrid web systems that serve wired and wireless users more efficiently. Multimedia and XML-based data are produced regularly and in increasing volume in our daily digital activities, and their retrieval must be explored and studied in this emergent web-based era. 'Emergent Web Intelligence: Advanced Information Retrieval' provides reviews of the related cutting-edge technologies and insights. It is v

  20. Role of Ontology in Information Retrieval

    Institute of Scientific and Technical Information of China (English)

    WU Dan; WANG Hui-lin

    2006-01-01

    Based on a comparison between ontology and thesaurus, and an analysis of an ontology-based Information Retrieval (IR) model, the potential advantages that ontology may contribute to IR are analyzed. Then a general architecture of an ontology-based Information Retrieval System (IRS) and an approach to constructing it are presented. Based on this research, the role of ontology in IR is summarized from four aspects and a typical system called Textpresso is analyzed. Finally, the conclusion is drawn that utilizing ontology is the trend in IR and can genuinely improve the IRS.

  1. Applications Of Informetrics To Information Retrieval Research

    Directory of Open Access Journals (Sweden)

    Dietmar Wolfram

    2000-01-01

    Full Text Available A non-technical overview of two primary areas of study within the discipline of information science, information retrieval (IR) and informetrics, is presented. Informetric properties of IR systems as the basis for understanding IR system structure and generalizing human information seeking in electronic environments are discussed. Applications of informetric study of IR systems for more efficient and effective design and evaluation of IR systems are also presented.

  2. Test OSIRIS (On Line Search Information Retrieval Information Storage).

    Science.gov (United States)

    Showalther, A. Kenneth

    The OSIRIS system is a prototype information retrieval system having the following components: an automated microfiche file having a capacity of 5000 punch card sized microfiche with a remote control 21 inch TV console for retrieving, magnifying (0-250X), and displaying any of the images on the microfiche; and a remote computer terminal for the…

  3. Teaching Fifth Graders Electronic Information Retrieval Skills.

    Science.gov (United States)

    Christy, Annette

    Fifth graders were taught to use an electronic card catalog to retrieve information and materials for class assignments and leisure reading materials. Groups of 10 or 12 students were seen twice a week for periods lasting up to 30 minutes. At these sessions they were introduced to computer components, proper handling, how to log into a network…

  4. Formalizing Evaluation in Music Information Retrieval

    DEFF Research Database (Denmark)

    Sturm, Bob L.

    2013-01-01

    We develop a formalism to disambiguate the evaluation of music information retrieval systems. We define a "system," what it means to "analyze" one, and make clear the aims, parts, design, execution, interpretation, and assumptions of its "evaluation." We apply this formalism to discuss...... the MIREX automatic mood classification task....

  5. Introduction: Natural Language Processing and Information Retrieval.

    Science.gov (United States)

    Smeaton, Alan F.

    1990-01-01

    Discussion of research into information and text retrieval problems highlights the work with automatic natural language processing (NLP) that is reported in this issue. Topics discussed include the occurrences of nominal compounds; anaphoric references; discontinuous language constructs; automatic back-of-the-book indexing; and full-text analysis.…

  6. Language-based multimedia information retrieval

    NARCIS (Netherlands)

    de Jong, Franciska M.G.; Gauvain, J.L.; Hiemstra, Djoerd; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material

  7. Strategies for Building Distributed Information Retrieval Systems.

    Science.gov (United States)

    Macleod, Ian A.; And Others

    1987-01-01

    Discussion of the need for distributed information retrieval systems focuses on a model system, Fulcrum FUL/Text. Differences from distributed database management systems are described; system design is discussed; implementation requirements are explained including remote operation calls (ROC's); and a prototype simulation model based on FUL/Text…

  8. Language-based multimedia information retrieval

    NARCIS (Netherlands)

    Jong, de F.M.G.; Gauvain, J.L.; Hiemstra, D.; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material b

  9. Millennial Students' Mental Models of Information Retrieval

    Science.gov (United States)

    Holman, Lucy

    2009-01-01

    This qualitative study examines first-year college students' online search habits in order to identify patterns in millennials' mental models of information retrieval. The study employed a combination of modified contextual inquiry and concept mapping methodologies to elicit students' mental models. The researcher confirmed previously observed…

  10. Language-based multimedia information retrieval

    NARCIS (Netherlands)

    de Jong, Franciska M.G.; Gauvain, J.L.; Hiemstra, Djoerd; Netter, K.

    2000-01-01

    This paper describes various methods and approaches for language-based multimedia information retrieval, which have been developed in the projects POP-EYE and OLIVE and which will be developed further in the MUMIS project. All of these projects aim at supporting automated indexing of video material b

  11. Challenges in Information Retrieval and Language Modeling

    NARCIS (Netherlands)

    Allen, J.; Aslam, J.; Belkin, N.; Buckley, C.; Callan, J.; Croft, W.B.; Dumais, S.; Fuhr, N.; Harman, D.; Harper, D.J.; Hiemstra, D.; Hofmann, T.; Hovey, E.; Kraaij, W.; Lafferty, J.; Lavrenko, V.; Lewis, D.; Liddy, L.; Manmatha, R.; McCallum, A.; Ponte, J.; Prager, J.; Radev, D.; Resnik, P.; Robertson, S.E.; Rosenfeld, R.; Roukos, S.; Sanderson, M.; Schwartz, R.; Singhal, A.; Smeaton, A.; Turtle, H.; Voorhees, E.M.; Weischedel, R.; Xu, J.; Zhai, B.C.

    2003-01-01

    Information retrieval (IR) research has reached a point where it is appropriate to assess progress and to define a research agenda for the next five to ten years. This report summarizes a discussion of IR research challenges that took place at a recent workshop. The attendees of the workshop conside

  12. Information retrieval models foundations and relationships

    CERN Document Server

    Roelleke, Thomas

    2013-01-01

    Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR). Regarding in
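
    As a concrete instance of one model named in this abstract, the following is a minimal sketch of BM25 scoring. It is not taken from the book: the IDF variant (the non-negative `log(1 + ...)` form), the default parameters `k1=1.2` and `b=0.75`, and the toy documents are assumptions chosen for illustration.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document against the query terms with BM25."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if tf[term] == 0:
                continue
            # Assumed IDF variant that stays non-negative for common terms.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Hypothetical toy collection of tokenized documents.
docs = [
    ["data", "fusion", "retrieval"],
    ["language", "model", "retrieval"],
    ["phase", "retrieval", "optics"],
]
scores = bm25_scores(["language", "retrieval"], docs)
print(scores)
```

    Because "retrieval" occurs in every toy document, its IDF is small and it contributes the same amount to each score; the rarer term "language" is what separates the second document from the others.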

  13. MIREX: MapReduce Information Retrieval Experiments

    CERN Document Server

    Hiemstra, Djoerd

    2010-01-01

    We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use a cluster of 15 low cost machines to search a web crawl of 0.5 billion pages showing that sequential scanning is a viable approach to running large-scale information retrieval experiments with little effort. The code is available to other researchers at: http://mirex.sourceforge.net
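
    The sequential-scanning idea can be illustrated with a single-process simulation of the map and reduce steps. This sketch is not the MIREX code: the function names, the term-count scoring, and the toy documents and queries are hypothetical, and the dictionary grouping merely stands in for the shuffle phase that a real MapReduce framework would provide.

```python
import heapq
from collections import defaultdict


def map_document(doc_id, text, queries):
    """Map step: scan one document and emit (query_id, (score, doc_id)) pairs."""
    tokens = text.lower().split()
    for query_id, terms in queries.items():
        score = sum(tokens.count(t) for t in terms)  # toy term-count score
        if score > 0:
            yield query_id, (score, doc_id)


def reduce_query(pairs, k=2):
    """Reduce step: keep the k best-scoring documents for one query."""
    return heapq.nlargest(k, pairs)


def run_job(docs, queries, k=2):
    """Scan every document once, group emitted pairs by query, then reduce."""
    grouped = defaultdict(list)  # stands in for the framework's shuffle phase
    for doc_id, text in docs.items():
        for query_id, pair in map_document(doc_id, text, queries):
            grouped[query_id].append(pair)
    return {q: reduce_query(pairs, k) for q, pairs in grouped.items()}


# Hypothetical toy collection and query set.
docs = {
    "d1": "information retrieval with MapReduce",
    "d2": "retrieval of retrieval experiments",
    "d3": "cooking recipes",
}
queries = {"q1": ["retrieval"], "q2": ["mapreduce", "experiments"]}

results = run_job(docs, queries)
print(results)
```

    Note that no index is built: every query is evaluated against every document in a single pass, which is what makes it cheap to try a new scoring function, at the cost of scanning the whole collection each time.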

  14. Method of and System for Information Retrieval

    DEFF Research Database (Denmark)

    2015-01-01

    This invention relates to a system for and a method (100) of searching a collection of digital information (150) comprising a number of digital documents (110), the method comprising receiving or obtaining (102) a search query, the query comprising a number of search terms, searching (103) an index...... (300) using the search terms thereby providing information (301) about which digital documents (110) of the collection of digital information (150) that contains a given search term and one or more search related metrics (302; 303; 304; 305; 306), ranking (105) at least a part of the search result......, a method of and a system for information retrieval or searching is readily provided that enhances the searching quality (i.e. the number of relevant documents retrieved and such documents being ranked high) when (also) using queries containing many search terms....

  15. Multilevel resistive information storage and retrieval

    Science.gov (United States)

    Lohn, Andrew; Mickel, Patrick R.

    2016-08-09

    The present invention relates to resistive random-access memory (RRAM or ReRAM) systems, as well as methods of employing multiple state variables to form degenerate states in such memory systems. The methods herein allow for precise write and read steps to form multiple state variables, and these steps can be performed electrically. Such an approach allows for multilevel, high density memory systems with enhanced information storage capacity and simplified information retrieval.

  16. Web information retrieval for health professionals.

    Science.gov (United States)

    Ting, S L; See-To, Eric W K; Tse, Y K

    2013-06-01

    This paper presents a Web Information Retrieval System (WebIRS), which is designed to assist healthcare professionals in obtaining up-to-date medical knowledge and information via the World Wide Web (WWW). The system leverages document classification and text summarization techniques to deliver highly correlated medical information to physicians. The system architecture of the proposed WebIRS is first discussed, and then a case study on an application of the proposed system in a Hong Kong medical organization is presented to illustrate the adoption process; a questionnaire is administered to collect feedback on the operation and performance of WebIRS in comparison with conventional information retrieval on the WWW. A prototype system has been constructed and implemented on a trial basis in a medical organization. It has proven to be of benefit to healthcare professionals through its automatic classification and summarization of the medical information that physicians need and are interested in. The results of the case study show that, with the use of the proposed WebIRS, a significant reduction in searching time and effort, together with retrieval of highly relevant materials, can be attained.

  17. Electronic publishing and intelligent information retrieval

    Science.gov (United States)

    Heck, A.

    1992-01-01

    Europeans are now taking steps to homogenize policies and standardize procedures in electronic publishing (EP) in astronomy and space sciences. This arose from an open meeting organized in Oct. 1991 at Strasbourg Observatory (France) and another business meeting held late Mar. 1992 with the major publishers and journal editors in astronomy and space sciences. The ultimate aim of EP might be considered as the so-called 'intelligent information retrieval' (IIR) or better named 'advanced information retrieval' (AIR), taking advantage of the fact that the material to be published appears at some stage in a machine-readable form. It is obvious that the combination of desktop and electronic publishing with networking and new structuring of knowledge bases will profoundly reshape not only our ways of publishing, but also our procedures of communicating and retrieving information. It should be noted that a world-wide survey among astronomers and space scientists carried out before the October 1991 colloquium on the various packages and machines used, indicated that TEX-related packages were already in majoritarian use in our community. It has also been stressed at each meeting that the European developments should be carried out in collaboration with what is done in the US (STELLAR project, for instance). American scientists and journal editors actually attended both meetings mentioned above. The paper will offer a review of the status of electronic publishing in astronomy and its possible contribution to advanced information retrieval in this field. It will also report on recent meetings such as the 'Astronomy from Large Databases-2 (ALD-2)' conference dealing with the latest developments in networking, in data, information, and knowledge bases, as well as in the related methodologies.

  18. The Oklahoma Geographic Information Retrieval System

    Science.gov (United States)

    Blanchard, W. A.

    1982-01-01

    The Oklahoma Geographic Information Retrieval System (OGIRS) is a highly interactive data entry, storage, manipulation, and display software system for use with geographically referenced data. Although originally developed for a project concerned with coal strip mine reclamation, OGIRS is capable of handling any geographically referenced data for a variety of natural resource management applications. A special effort has been made to integrate remotely sensed data into the information system. The timeliness and synoptic coverage of satellite data are particularly useful attributes for inclusion into the geographic information system.

  19. COMPUTATIONALLY EFFICIENT PRIVATE INFORMATION RETRIEVAL PROTOCOL

    Directory of Open Access Journals (Sweden)

    A. V. Afanasyeva

    2016-03-01

    Full Text Available This paper describes a new computationally efficient private information retrieval protocol for retrieving one q-ary symbol. The main advantage of the proposed solution lies in the low computational complexity of the information extraction procedure, as well as the constructive simplicity and flexibility in choosing the system parameters. These results are based on coset properties. The proposed protocol has communication complexity slightly worse than the best schemes at the moment, which are based on locally decodable codes, but it can be easily built for any parameters of the system, as opposed to codes. In comparison with similar solutions based on polynomials, the proposed method gains in computational complexity, which is important especially for servers which must service multiple requests from multiple users.

  20. Users guide for information retrieval using APL

    Science.gov (United States)

    Shapiro, A.

    1974-01-01

    A Programming Language (APL) is a precise, concise, and powerful computer programming language. Several features make APL useful to managers and other potential computer users. APL is interactive; therefore, the user can communicate with his program or data base in near real-time. This, coupled with the fact that APL has excellent debugging features, reduces program checkout time to minutes or hours rather than days or months. Of particular importance is the fact that APL can be utilized as a management science tool using such techniques as operations research, statistical analysis, and forecasting. The gap between the scientist and the manager could be narrowed by showing how APL can be used to do what the scientists and the manager each need to do, retrieve information. Sometimes, the information needs to be retrieved rapidly. In this case APL is ideally suited for this challenge.

  1. Information Retrieval Using a Middleware Approach

    Directory of Open Access Journals (Sweden)

    Danijela Boberić Krstićev

    2013-03-01

    Full Text Available This paper explores the use of a mediator/wrapper approach to enable the search of an existing library management system using different information retrieval protocols. It proposes an architecture for a software component that will act as an intermediary between the library system and search services. It provides an overview of different approaches to adding Z39.50 and Search/Retrieval via URL (SRU) functionality using a middleware approach that is implemented on the BISIS library management system. That wrapper performs transformation of Contextual Query Language (CQL) into Lucene query language. The primary aim of this software component is to enable search and retrieval of bibliographic records using the SRU and Z39.50 protocols, but the proposed architecture of the software components is also suitable for inclusion of the existing library management system into a library portal. The software component provides a single interface to server-side protocols for search and retrieval of records. Additional protocols could be used. This paper provides a practical demonstration of interest to developers of library management systems and those who are trying to use open-source solutions to make their local catalog accessible to other systems.

  2. Stylistic Variation in an Information Retrieval Experiment

    CERN Document Server

    Karlgren, J

    1996-01-01

    Texts exhibit considerable stylistic variation. This paper reports an experiment where a corpus of documents (N= 75 000) is analyzed using various simple stylistic metrics. A subset (n = 1000) of the corpus has been previously assessed to be relevant for answering given information retrieval queries. The experiment shows that this subset differs significantly from the rest of the corpus in terms of the stylistic metrics studied.

  3. Cognitive approach to information retrieval and communication

    Directory of Open Access Journals (Sweden)

    Saša Zupanič

    1997-01-01

    Full Text Available The cognitive approach (viewpoint/standpoint) to the retrieval and communication of information, as well as to librarianship and information science, started gaining importance in the 1970s. Today, it is present in literary and objective knowledge studies, as well as in studies of users, information brokers and systems of information retrieval. The cognitive approach exercises a strong impact on several scientific disciplines which are grouped under the roof of cognitive science. The cognitive approach has caused a split and the formation of a new paradigm, i.e. the cognitive paradigm, in many scientific disciplines. Within the frame of Kuhn's concept of paradigm, it is evident that librarianship and information science are on the pre-paradigmatic level. However, some authors mention the existence of at least two paradigms in library and information science, i.e. the physical and the cognitive paradigm. The historical overview of cognitively oriented research works of Brookes, De Mey, Belkin, Ingwersen and others enables insight into the development of library and information scientific thought up to the present.

  4. NLP Meets the Jabberwocky: Natural Language Processing in Information Retrieval.

    Science.gov (United States)

    Feldman, Susan

    1999-01-01

    Focuses on natural language processing (NLP) in information retrieval. Defines the seven levels at which people extract meaning from text/spoken language. Discusses the stages of information processing; how an information retrieval system works; advantages to adding full NLP to information retrieval systems; and common problems with information…

  5. Data Visualization in Information Retrieval and Data Mining (SIG VIS).

    Science.gov (United States)

    Efthimiadis, Efthimis

    2000-01-01

    Presents abstracts that discuss using data visualization for information retrieval and data mining, including immersive information space and spatial metaphors; spatial data using multi-dimensional matrices with maps; TREC (Text Retrieval Conference) experiments; users' information needs in cartographic information retrieval; and users' relevance…

  6. Bilingual Gestures: The "Manual" Way of Informing the Notion of "Balanced Bilinguals"

    Science.gov (United States)

    Sondermann, Kerstin

    2013-01-01

    Defining what makes a person bilingual is notoriously difficult and dependent on various factors like fluency, age of acquisition, and situational context, among others. The notion of a "balanced bilingual" is even more elusive and fraught with limitations regarding the proper assessment and linguistic profiling of bilinguals, leading to…

  7. Tag Clusters as Information Retrieval Interfaces

    CERN Document Server

    Knautz, Kathrin; Stock, Wolfgang G

    2010-01-01

    The paper presents our design of a next generation information retrieval system based on tag co-occurrences and subsequent clustering. We help users get access to digital data through information visualization in the form of tag clusters. Current problems like the absence of interactivity and semantics between tags or the difficulty of adding additional search arguments are solved. In the evaluation, based upon SERVQUAL and IT systems quality indicators, we found that tag clusters are perceived as more useful than tag clouds, are much more trustworthy, and are more enjoyable to use.

  8. Image Information Retrieval: An Overview of Current Research

    Directory of Open Access Journals (Sweden)

    Abby A. Goodrum

    2000-01-01

    Full Text Available This paper provides an overview of current research in image information retrieval and provides an outline of areas for future research. The approach is broad and interdisciplinary and focuses on three aspects of image research (IR): text-based retrieval, content-based retrieval, and user interactions with image information retrieval systems. The review concludes with a call for image retrieval evaluation studies similar to TREC.

  9. Image Information Retrieval: An Overview of Current Research

    OpenAIRE

    Abby A. Goodrum

    2000-01-01

    This paper provides an overview of current research in image information retrieval and provides an outline of areas for future research. The approach is broad and interdisciplinary and focuses on three aspects of image research (IR): text-based retrieval, content-based retrieval, and user interactions with image information retrieval systems. The review concludes with a call for image retrieval evaluation studies similar to TREC.

  10. QUERY TRANSLATION USING CONCEPTS SIMILARITY BASED ON QURAN ONTOLOGY FOR CROSS-LANGUAGE INFORMATION RETRIEVAL

    Directory of Open Access Journals (Sweden)

    Zulaini Yahya

    2013-01-01

    Full Text Available In the Cross-Language Information Retrieval (CLIR) process, translation quality has a direct impact on the accuracy of the subsequent retrieval results. In the dictionary-based approach, words that have more than one meaning can decrease retrieval performance if the query translation returns an incorrect translation. These issues need to be overcome using an efficient technique. In this study we propose a CLIR method based on a domain ontology, using Quran concepts to disambiguate the translation of the query and to improve dictionary-based query translation. For experimentation, we use a Quran ontology written in English and Malay as a bilingual parallel corpus, and Quran concepts as a resource for cross-language query translation along with dictionary-based translation. For evaluation, we measure the performance of three IR systems: IR1 is natural language query IR, IR2 is natural language query CLIR based on a dictionary (the baseline), and IR3 is retrieval with the proposed method. Performance is measured using Mean Average Precision (MAP) and average precision at 11 points of recall. The experimental results show that our proposed method brings significant improvement in retrieval accuracy for English document collections, but is deficient for Malay document collections. The proposed CLIR method can obtain a query expansion effect and improve retrieval performance in certain languages.
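
    The two evaluation measures named in this abstract can be sketched as follows. This is a minimal illustration using their standard textbook definitions; the function names and the toy ranking are assumptions and do not come from the paper.

```python
def average_precision(ranking, relevant):
    """Mean of the precision values at each rank where a relevant document appears."""
    hits, total = 0, 0.0
    for rank, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0

def mean_average_precision(runs):
    """runs: list of (ranking, relevant_set) pairs, one pair per query."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)

def eleven_point_precision(ranking, relevant):
    """Interpolated precision at recall levels 0.0, 0.1, ..., 1.0."""
    if not relevant:
        return [0.0] * 11
    points = []  # (recall, precision) after each retrieved document
    hits = 0
    for rank, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
        points.append((hits / len(relevant), hits / rank))
    levels = [i / 10 for i in range(11)]
    # Interpolation: take the best precision at any recall >= the level.
    return [max((p for r, p in points if r >= level), default=0.0) for level in levels]

# Hypothetical toy query: documents "a" and "c" are relevant.
ranking = ["a", "b", "c", "d"]
relevant = {"a", "c"}
ap = average_precision(ranking, relevant)
curve = eleven_point_precision(ranking, relevant)
```

    Averaging the 11 interpolated values for one query, and then averaging over queries, gives the "average precision at 11 points of recall" figure that is typically reported alongside MAP.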

  11. Graph-Based Interactive Bibliographic Information Retrieval Systems

    Science.gov (United States)

    Zhu, Yongjun

    2017-01-01

    In the big data era, we have witnessed the explosion of scholarly literature. This explosion has imposed challenges to the retrieval of bibliographic information. Retrieval of intended bibliographic information has become challenging due to the overwhelming search results returned by bibliographic information retrieval systems for given input…

  12. Four Challenges for Music Information Retrieval Researchers

    DEFF Research Database (Denmark)

    Sturm, Bob L.; Collins, Nick

    Exemplified in the substantial amount of published research in music genre recognition, mood recognition and autotagging, content-based music information retrieval (MIR) advances an "engineering approach": build a system producing the most "correct" answers in datasets appearing throughout...... might not even be considering the through it answers "correctly". It could thus be worthless for addressing real-world problems that must consider (e.g., music description). To emphasise the critical points above, and encourage new approaches to research that address real-world problems, we present...

  13. Random walk term weighting for information retrieval

    DEFF Research Database (Denmark)

    Blanco, R.; Lioma, Christina

    2007-01-01

    We present a way of estimating term weights for Information Retrieval (IR), using term co-occurrence as a measure of dependency between terms. We use the random walk graph-based ranking algorithm on a graph that encodes terms and co-occurrence dependencies in text, from which we derive term weights...... that represent a quantification of how a term contributes to its context. Evaluation on two TREC collections and 350 topics shows that the random walk-based term weights perform at least comparably to the traditional tf-idf term weighting, while they outperform it when the distance between co-occurring terms...
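
    The general idea of deriving term weights from a random walk over a co-occurrence graph can be sketched as below. The abstract does not give the graph construction or the parameters, so this sketch assumes an undirected graph built from a sliding window and a PageRank-style iteration; `window`, `damping`, `iterations` and the sample text are illustrative assumptions, not the authors' settings.

```python
from collections import defaultdict

def cooccurrence_graph(tokens, window=2):
    """Link each term to the distinct terms within `window` positions after it."""
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return graph

def random_walk_weights(graph, damping=0.85, iterations=50):
    """PageRank-style power iteration on an undirected graph."""
    n = len(graph)
    weights = {term: 1.0 / n for term in graph}
    for _ in range(iterations):
        # Each term receives an equal share of every neighbour's current weight.
        weights = {
            term: (1 - damping) / n
            + damping * sum(weights[nb] / len(graph[nb]) for nb in neighbours)
            for term, neighbours in graph.items()
        }
    return weights

tokens = "retrieval model ranks documents retrieval model scores terms".split()
graph = cooccurrence_graph(tokens)
weights = random_walk_weights(graph)
top_term = max(weights, key=weights.get)
```

    In this toy text, the term that co-occurs with the largest number of distinct neighbours ends up with the highest weight, which is the sense in which a term is credited for how it "contributes to its context".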

  14. Enhancing genomics information retrieval through dimensional analysis.

    Science.gov (United States)

    Hu, Qinmin; Huang, Jimmy Xiangji

    2013-06-01

    We propose a novel dimensional analysis approach to employing meta information in order to find the relationships within the unstructured or semi-structured document/passages for improving genomics information retrieval performance. First, we make use of the auxiliary information as three basic dimensions, namely "temporal", "journal", and "author". The reference section is treated as a commensurable quantity of the three basic dimensions. Then, the sample space and subspaces are built up and a set of events are defined to meet the basic requirement of dimensional homogeneity to be commensurable quantities. After that, the classic graph analysis algorithm in the Web environments is applied on each dimension respectively to calculate the importance of each dimension. Finally, we integrate all the dimension networks and re-rank the outputs for evaluation. Our experimental results show the proposed approach is superior and promising.

  15. A Theoretical Paradigm of Information Retrieval in Information Science and Computer Science

    Directory of Open Access Journals (Sweden)

    M. S. Saleem Basha

    2012-09-01

    Full Text Available This paper describes the theoretical paradigms of information retrieval in information science and computer science, and constructs the theoretical framework of information retrieval from three perspectives: user, information and technology. It evaluates the research priorities of the two disciplines and the cross-domain of information retrieval theory. Finally, it points out the status and development trend of information retrieval theory in information science and computer science, and provides directions for exploration in information retrieval theory.

  16. Recommender Systems by means of Information Retrieval

    CERN Document Server

    Costa, Alberto

    2010-01-01

    In this paper we present a method for reformulating the Recommender Systems problem as an Information Retrieval one. In our tests we have a dataset of users who give ratings for some movies; we hide some values from the dataset, and we try to predict them again using its remaining portion (the so-called "leave-n-out approach"). In order to use an Information Retrieval algorithm, we reformulate this Recommender Systems problem in this way: a user corresponds to a document, a movie corresponds to a term, the active user (whose rating we want to predict) plays the role of the query, and the ratings are used as weights, in place of the weighting scheme of the original IR algorithm. The output is the ranked list of the documents ("users") relevant for the query ("active user"). We use the ratings of these users, weighted according to the rank, to predict the rating of the active user. We carry out the comparison by means of a typical metric, namely the accuracy of the predictions returned by the algorithm, and we...
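
    The mapping described here (user as document, movie as term, active user as query, ratings as weights) can be sketched in a few lines. The abstract does not name the IR algorithm or the exact rank weighting, so the sketch assumes cosine similarity for ranking and a 1/rank weight for prediction; all names and the toy ratings are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse rating vectors stored as dicts."""
    dot = sum(u[m] * v[m] for m in u.keys() & v.keys())
    norm = math.sqrt(sum(x * x for x in u.values())) * math.sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0

def predict(active, others, movie, top_k=2):
    """Rank the other users ("documents") against the active user ("query"),
    then average their ratings for `movie` with weight 1/rank (an assumption)."""
    candidates = [(cosine(active, r), name) for name, r in others.items() if movie in r]
    ranked = sorted(candidates, reverse=True)[:top_k]
    if not ranked:
        return None
    rank_weights = [1.0 / rank for rank in range(1, len(ranked) + 1)]
    ratings = [others[name][movie] for _, name in ranked]
    return sum(w * r for w, r in zip(rank_weights, ratings)) / sum(rank_weights)

# Toy data: the active user's rating for movie "C" is hidden.
active = {"A": 5, "B": 1}
others = {
    "u1": {"A": 5, "B": 1, "C": 4},
    "u2": {"A": 1, "B": 5, "C": 2},
    "u3": {"A": 4, "B": 2},
}
prediction = predict(active, others, "C")
```

    Here `u1` is ranked first because its ratings resemble the active user's, so its rating of 4 counts twice as much as `u2`'s rating of 2 in the prediction.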

  17. MATCHING LSI FOR SCALABLE INFORMATION RETRIEVAL

    Directory of Open Access Journals (Sweden)

    Rajagopal Palsonkennedy

    2012-01-01

    Full Text Available Latent Semantic Indexing (LSI) is one of the popular techniques in the information retrieval field. Unlike traditional information retrieval techniques, LSI is not based simply on keyword matching; it uses statistics and algebraic computations. Based on Singular Value Decomposition (SVD), the higher-dimensional matrix is converted to a lower-dimensional approximate matrix, from which noise can be filtered. The issues of synonymy and polysemy in the traditional techniques can also be overcome based on investigation of the terms related to the documents. However, it is notable that LSI suffers from a scalability issue due to the computing complexity of SVD. This study presents a distributed LSI algorithm, MR-LSI, which can solve the scalability issue using the Hadoop framework based on the distributed computing model MapReduce. It also solves the overhead issue caused by the involved clustering algorithm by means of the k-means algorithm. The evaluations indicate that MR-LSI can gain noteworthy improvement compared to the other scheme when processing large-scale document collections. One significant advantage of Hadoop is that it supports various computing environments, so that the issue of unbalanced load among nodes is highlighted. Hence, a load balancing algorithm based on a genetic algorithm for balancing load in a static environment is proposed. The results show that it can improve the performance of a cluster according to different levels.

  18. Parallel Computing in Information Retrieval--An Updated Review.

    Science.gov (United States)

    Macfarlane, A.; And Others

    1997-01-01

    Reviews the progress of parallel computing in information retrieval (IR) and stresses the importance of the motivation in using parallel computing for text retrieval. Analyzes parallel IR systems using a classification defined by Rasmussen; describes retrieval models used in parallel information processing; and suggests areas of needed research.…

  19. Data Discretization for Novel Relationship Discovery in Information Retrieval.

    Science.gov (United States)

    Benoit, G.

    2002-01-01

    Describes an information retrieval, visualization, and manipulation model which offers the user multiple ways to exploit the retrieval set, based on weighted query terms, via an interactive interface. Outlines the mathematical model and describes an information retrieval application built on the model to search structured and full-text files.…

  20. 46 CFR 520.6 - Retrieval of information.

    Science.gov (United States)

    2010-10-01

    ... 46 Shipping 9 2010-10-01 2010-10-01 false Retrieval of information. 520.6 Section 520.6 Shipping FEDERAL MARITIME COMMISSION REGULATIONS AFFECTING OCEAN SHIPPING IN FOREIGN COMMERCE CARRIER AUTOMATED TARIFFS § 520.6 Retrieval of information. (a) General. Tariffs systems shall present retrievers with...

  1. Flexible method for Boolean information retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Salton, G.; Wu, H.

    1983-01-01

    A new flexible retrieval system is described which makes it possible to relax the strict conditions of Boolean query logic thereby retrieving useful items that are rejected in a conventional retrieval situation. The query structure inherent in the Boolean system is preserved, while at the same time weighted terms may be incorporated into both queries and stored documents; the retrieved output can also be ranked in strict similarity order with the user queries. A conventional retrieval system can be modified to make use of the flexible metric system. Laboratory tests indicate that the extended system produces better retrieval output than either the Boolean or the vector processing systems. 11 references.
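
    The abstract does not give the formula behind the "flexible metric system". One well-known formulation with the properties it describes (Boolean structure preserved, weighted terms, ranked output) is the p-norm extended Boolean model, used here purely as an illustrative assumption; the function names and the toy term weights are hypothetical.

```python
def p_norm_or(weights, p=2.0):
    """Soft OR: high when any term weight is high. Weights lie in [0, 1]."""
    return (sum(w ** p for w in weights) / len(weights)) ** (1.0 / p)

def p_norm_and(weights, p=2.0):
    """Soft AND: penalises distance from the ideal point where all weights are 1."""
    return 1.0 - (sum((1.0 - w) ** p for w in weights) / len(weights)) ** (1.0 / p)

# Document term weights for the two query terms (toy values).
docs = {
    "d1": [1.0, 0.0],  # contains only the first term
    "d2": [0.6, 0.6],  # contains both terms moderately
    "d3": [0.0, 0.0],  # contains neither term
}

# Rank documents for the query "term1 AND term2".
and_scores = {name: p_norm_and(w) for name, w in docs.items()}
and_ranking = sorted(and_scores, key=and_scores.get, reverse=True)
```

    A strict Boolean AND would reject `d1` outright; the relaxed version gives it a small non-zero score and ranks it below `d2`. As `p` grows, the scores approach strict Boolean behaviour, and with `p = 1` both operators reduce to a simple average of the weights.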

  2. Evaluating Digital Strategies for Storing and Retrieving Scholarly Information.

    Science.gov (United States)

    Getz, Malcolm

    1997-01-01

    Outlines the advantages of digital documents for scholars and offers considerations for designing systems for storing and retrieving digital information products. Discusses conventional and electronic storage and retrieval; network costs; digital storage; search strategies; acquisition prices; and digital initiatives. (AEF)

  3. Bilingualism and Cognition: Informing Research, Pedagogy, and Policy

    Science.gov (United States)

    Garcia, Eugene E.; Nanez, Jose E., Sr.

    2011-01-01

    In the United States, approximately 7% to 10% of children are raised in bilingual households. Despite inherent advantages to bilingualism, some bilingual children experience a significant lag in academic success relative to other groups. Bridging the fields of cognitive psychology and education, this volume presents research-based knowledge on…

  4. Formal Concept Analysis for Information Retrieval

    CERN Document Server

    Qadi, Abderrahim El; Ennouary, Yassine

    2010-01-01

    In this paper we describe a mechanism to improve Information Retrieval (IR) on the web. The method is based on Formal Concept Analysis (FCA), which establishes semantic relations during the queries and allows the answers provided by a search engine to be reorganized in the shape of a lattice of concepts. We propose an incremental IR algorithm based on the Galois lattice. This algorithm allows a formal clustering of the data sources, and the results it returns are classified by order of relevance. The control of relevance is exploited in clustering; we improve the results by using an ontology in the field of image processing and by reformulating the user queries, which makes it possible to return more relevant documents.

  5. An Effective Information Retrieval for Ambiguous Query

    CERN Document Server

    Roul, R K

    2012-01-01

    A search engine returns thousands of web pages for a single user query, most of which are not relevant. In this context, effective information retrieval from the expanding web is a challenging task, in particular if the query is ambiguous. The major question that arises here is how to get the relevant pages for an ambiguous query. We propose an approach for obtaining effective results for an ambiguous query by forming community vectors based on the association concept of data mining, using the vector space model and the freedictionary. We develop clusters by computing the similarity between community vectors and document vectors formed from the web pages extracted by the search engine. We use the Gensim package to implement the algorithm because of its simplicity and robust nature. Analysis shows that our approach is an effective way to form clusters for an ambiguous query.

  6. Distributed and Cooperative Information Retrieval on the World Wide Web

    Institute of Scientific and Technical Information of China (English)

    王继成; 金翔宇; 杨晓江; 张福炎

    2000-01-01

    A mass of heterogeneous, distributed and dynamic information on the World Wide Web (the Web) has resulted in "information overload". It's an important and urgent research issue to provide users with effective information retrieval service on the Web. Web search engines attempt to solve this problem, yet their effect is far from satisfying. In this paper, a distributed and cooperative strategy for information retrieval on the Web is proposed to substitute the centralized mode adopted by the current search engines. Then a new information retrieval system model IRSM is presented, which supports the retrieval of metadata about Web documents and uses Z39.50 standard protocol to unify the heterogeneous interfaces of different systems. Based on that, a distributed and cooperative information retrieval framework, called DCIRF, is designed to help users in fast and effective information retrieval on the Web.

  7. Visualization for Information Retrieval based on Fast Search Technology

    Directory of Open Access Journals (Sweden)

    Mamoon H. Mamoon

    2013-03-01

    Full Text Available The core of a search engine is its information retrieval technique. An information retrieval system returns many results, some of them more relevant than others, and some not relevant at all. While the use of search engines to retrieve information has grown very substantially, there remain problems with information retrieval systems: their interfaces do not help users to perceive the precision of the results. It is therefore not surprising that graphical visualizations have been employed in search engines to assist users. The main objective of Internet users is to find the required information with high efficiency and effectiveness. In this paper we briefly present the role of information visualization in enhancing web information retrieval systems, covering techniques such as tree view, title view, map view, bubble view and cloud view, and tools such as highlighting and Colored Query Result.

  8. TOFIR: A Tool of Facilitating Information Retrieval - Introduce a Visual Retrieval Model.

    Science.gov (United States)

    Zhang, Jin

    2001-01-01

    Introduces a new method for the visualization of information retrieval called TOFIR (Tool of Facilitating Information Retrieval). Discusses the use of angle attributes of a document to construct the angle-based visual space; two-dimensional and three-dimensional visual tools; ambiguity; and future research directions. (Author/LRW)

  9. A semantic medical multimedia retrieval approach using ontology information hiding.

    Science.gov (United States)

    Guo, Kehua; Zhang, Shigeng

    2013-01-01

    Searching for useful information in unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users' query intent. Firstly, semantic annotations are given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represents the semantic information is hidden in the head of the multimedia documents. The main innovations of this approach are cross-type retrieval support and semantic information preservation. Experimental results indicate good precision and efficiency of our approach for medical multimedia retrieval in comparison with some traditional approaches.

  10. Multimodal medical information retrieval with unsupervised rank fusion.

    Science.gov (United States)

    Mourão, André; Martins, Flávio; Magalhães, João

    2015-01-01

    Modern medical information retrieval systems are paramount to manage the insurmountable quantities of clinical data. These systems empower health care experts in the diagnosis of patients and play an important role in the clinical decision process. However, the ever-growing heterogeneous information generated in medical environments poses several challenges for retrieval systems. We propose a medical information retrieval system with support for multimodal medical case-based retrieval. The system supports medical information discovery by providing multimodal search, through a novel data fusion algorithm, and term suggestions from a medical thesaurus. Our search system compared favorably to other systems in 2013 ImageCLEFMedical.
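
    The abstract does not describe its data fusion algorithm. As a stand-in, the sketch below shows reciprocal rank fusion, a common unsupervised way to merge ranked lists from different modalities; it is not claimed to be the authors' method, and the constant `k = 60`, the function name and the toy rankings are assumptions.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists; each list adds 1 / (k + rank) to a document's score."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))

# Hypothetical result lists for the same medical case query.
text_ranking = ["case1", "case2", "case3"]
image_ranking = ["case2", "case3", "case1"]
fused = reciprocal_rank_fusion([text_ranking, image_ranking])
```

    No training data or score normalisation is needed, since only rank positions are used; this is what makes such a fusion step unsupervised.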

  11. A Survey of Stemming Algorithms in Information Retrieval

    Science.gov (United States)

    Moral, Cristian; de Antonio, Angélica; Imbert, Ricardo; Ramírez, Jaime

    2014-01-01

    Background: During the last fifty years, improved information retrieval techniques have become necessary because of the huge amount of information people have available, which continues to increase rapidly due to the use of new technologies and the Internet. Stemming is one of the processes that can improve information retrieval in terms of…

  12. Content-based retrieval of visual information

    NARCIS (Netherlands)

    Oerlemans, Adrianus Antonius Johannes

    2011-01-01

    In this dissertation, I investigate new approaches relevant to content-based image retrieval techniques. First, the MOD paradigm is proposed, a method for detecting salient points in images. These salient points are specifically designed to enhance image retrieval accuracy by maximizing distinctive

  13. Visualization of database structures for information retrieval

    Directory of Open Access Journals (Sweden)

    Grete Lisbjerg Jensen

    1994-12-01

    This paper describes the Book House system, which is designed to support children's information retrieval in libraries as part of their education. It is a shareware program available on CD-ROM or floppy disks, and comprises functionality for database searching as well as for classifying and storing book information in the database. The system concept is based on an understanding of children's domain structures and their capabilities for categorization of information needs in connection with their activities in schools, in school libraries or in public libraries. These structures are visualized in the interface by using metaphors and multimedia technology. Through the use of text, images and animation, the Book House encourages children - even at a very early age - to learn by doing in an enjoyable way, which plays on their previous experiences with computer games. Both words and pictures can be used for searching; this makes the system suitable for all age groups. Even children who have not yet learned to read properly can, by selecting pictures, search for and find those books they would like to have read aloud. Thus, at the very beginning of their school life, they can learn to search for books on their own. For the library community, such a system will provide an extended service which will increase the number of children's own searches and also improve the relevance, quality and utilization of the book collections in the libraries. A market research report on the need for an annual indexing service for books in the Book House format is in preparation by the Danish Library Centre A/S.

  14. Improving information retrieval in functional analysis.

    Science.gov (United States)

    Rodriguez, Juan C; González, Germán A; Fresno, Cristóbal; Llera, Andrea S; Fernández, Elmer A

    2016-12-01

    Transcriptome analysis is essential to understand the mechanisms regulating key biological processes and functions. The first step usually consists of identifying candidate genes; to find out which pathways are affected by those genes, however, functional analysis (FA) is mandatory. The most frequently used strategies for this purpose are Gene Set and Singular Enrichment Analysis (GSEA and SEA) over Gene Ontology. Several statistical methods have been developed and compared in terms of computational efficiency and/or statistical appropriateness. However, whether their results are similar or complementary, the sensitivity to parameter settings, or possible bias in the analyzed terms has not been addressed so far. Here, two GSEA and four SEA methods and their parameter combinations were evaluated in six datasets by comparing two breast cancer subtypes with well-known differences in genetic background and patient outcomes. We show that GSEA and SEA lead to different results depending on the chosen statistic, model and/or parameters. Both approaches provide complementary results from a biological perspective. Hence, an Integrative Functional Analysis (IFA) tool is proposed to improve information retrieval in FA. It provides a common gene expression analytic framework that grants a comprehensive and coherent analysis. Only a minimal user parameter setting is required, since the best SEA/GSEA alternatives are integrated. IFA utility was demonstrated by evaluating four prostate cancer and the TCGA breast cancer microarray datasets, which showed its biological generalization capabilities.

  15. An Abstraction-Based Data Model for Information Retrieval

    Science.gov (United States)

    McAllister, Richard A.; Angryk, Rafal A.

    Language ontologies provide an avenue for automated lexical analysis that may be used to supplement existing information retrieval methods. This paper presents a method of information retrieval that takes advantage of WordNet, a lexical database, to generate paths of abstraction, and uses them as the basis for an inverted index structure to be used in the retrieval of documents from an indexed corpus. We present this method as an entrée to a line of research on using ontologies to perform word-sense disambiguation and improve the precision of existing information retrieval techniques.

  16. Information content of ozone retrieval algorithms

    Science.gov (United States)

    Rodgers, C.; Bhartia, P. K.; Chu, W. P.; Curran, R.; Deluisi, J.; Gille, J. C.; Hudson, R.; Mateer, C.; Rusch, D.; Thomas, R. J.

    1989-01-01

    The algorithms used for production processing by the major suppliers of ozone data are characterized to show quantitatively: how the retrieved profile is related to the actual profile (this characterizes the altitude range and vertical resolution of the data); the nature of systematic errors in the retrieved profiles, including their vertical structure and relation to uncertain instrumental parameters; how trends in the real ozone are reflected in trends in the retrieved ozone profile; and how trends in other quantities (both instrumental and atmospheric) might appear as trends in the ozone profile. No serious deficiencies were found in the algorithms used in generating the major available ozone data sets. As the measurements are all indirect in some way, and the retrieved profiles have different characteristics, data from different instruments are not directly comparable.

  17. Innovations in information retrieval perspectives for theory and practice

    CERN Document Server

    Foster, Allen

    2011-01-01

    The advent of various information retrieval (IR) technologies and approaches to storage and retrieval provide communities with opportunities for mass documentation, digitization, and the recording of information in different forms. This book introduces and contextualizes these developments and looks at supporting research in IR.

  18. User-Centric Multi-Criteria Information Retrieval

    Science.gov (United States)

    Wolfe, Shawn R.; Zhang, Yi

    2009-01-01

    Information retrieval models usually represent content only, and not other considerations, such as authority, cost, and recency. How could multiple criteria be utilized in information retrieval, and how would they affect the results? In our experiments, using multiple user-centric criteria always produced better results than a single criterion.

  19. Brute Force Information Retrieval Experiments using MapReduce

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Hauff, Claudia

    2012-01-01

    MIREX (MapReduce Information Retrieval Experiments) is a software library initially developed by the Database Group of the University of Twente for running large scale information retrieval experiments on clusters of machines. MIREX has been tested on web crawls of up to half a billion web pages, to

  20. Prototyping a Distributed Information Retrieval System That Uses Statistical Ranking.

    Science.gov (United States)

    Harman, Donna; And Others

    1991-01-01

    Built using a distributed architecture, this prototype distributed information retrieval system uses statistical ranking techniques to provide better service to the end user. Distributed architecture was shown to be a feasible alternative to centralized or CD-ROM information retrieval, and user testing of the ranking methodology showed both…

  1. Problems of Music Information Retrieval in the Real World.

    Science.gov (United States)

    Byrd, Donald; Crawford, Tim

    2002-01-01

    Considers some of the most fundamental problems in music information retrieval, challenging the common assumption that searching on pitch alone is likely to be satisfactory for all purposes. Discusses special issues related to polyphonic music, user-interface issues, and the notion of relevance for music information retrieval. (Contains 52…

  2. Counter-intuitive Cases of Data Fusion in Information Retrieval.

    Science.gov (United States)

    Ibraev, Ulukbek; Kantor, Paul; Ng, K. B.

    2001-01-01

    Aspects of Data Fusion (DF) for information retrieval are explored. Based on a geometrical model of DF, it is shown that in the ideal case, performance of DF for a pair of information retrieval schemes may be approximated by a quadratic polynomial. Compares counter-intuitive cases of DF with cases that behave according to the geometric model. (AEF)

  3. An Expressive and Efficient Language for XML Information Retrieval.

    Science.gov (United States)

    Chinenyanga, Taurai Tapiwa; Kushmerick, Nicholas

    2002-01-01

    Discusses XML and information retrieval and describes a query language, ELIXIR (expressive and efficient language for XML information retrieval), with a textual similarity operator that can be used for similarity joins. Explains the algorithm for answering ELIXIR queries to generate intermediate relational data. (Author/LRW)

  4. Information retrieval for children based on the aggregated search paradigm

    NARCIS (Netherlands)

    Duarte Torres, Sergio

    2011-01-01

    This report presents research to develop information services for children by expanding and adapting current Information retrieval technologies according to the search characteristics and needs of children. Concretely, we will employ the aggregated search paradigm as theoretical framework. The objec

  5. Attitudes Toward Automated Information Retrieval Services Among RASD Members

    Science.gov (United States)

    Nitecki, Danuta A.

    1976-01-01

    Summary of survey of the American Library Association Reference and Adult Services Division (RASD) members concerning attitudes toward, need for, and preferences in acquiring information on automated information retrieval services. (KP)

  6. Information Retrieval and Graph Analysis Approaches for Book Recommendation.

    Science.gov (United States)

    Benkoussas, Chahinez; Bellot, Patrice

    2015-01-01

    A combination of multiple information retrieval approaches is proposed for the purpose of book recommendation. In this paper, book recommendation is based on complex user queries. We used different theoretical retrieval models, the probabilistic InL2 model (Divergence from Randomness) and a language model, and tested their interpolated combination. Graph analysis algorithms such as PageRank have been successful in Web environments. We consider the application of this algorithm in a new retrieval approach to a network of related documents connected by social links. We call this network, constructed from documents and the social information provided by each of them, a Directed Graph of Documents (DGD). Specifically, this work tackles the problem of book recommendation in the context of the INEX (Initiative for the Evaluation of XML retrieval) Social Book Search track. A series of reranking experiments demonstrates that combining retrieval models yields significant improvements in terms of standard ranked retrieval metrics. These results extend the applicability of link analysis algorithms to different environments.

  7. Rare disease diagnosis as an information retrieval task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina;

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend...... search and offline retrieval from a rare disease collection indicate that the retrieval of rare diseases is an open problem with room for improvement....

  8. Science information systems: Archive, access, and retrieval

    Science.gov (United States)

    Campbell, William J.

    1991-01-01

    The objective of this research is to develop technology for the automated characterization and interactive retrieval and visualization of very large, complex scientific data sets. Technologies will be developed for the following specific areas: (1) rapidly archiving data sets; (2) automatically characterizing and labeling data in near real-time; (3) providing users with the ability to browse contents of databases efficiently and effectively; (4) providing users with the ability to access and retrieve system independent data sets electronically; and (5) automatically alerting scientists to anomalies detected in data.

  9. MIREX: MapReduce Information Retrieval Experiments

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Hauff, Claudia

    2010-01-01

    We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use a cluster of 15 low cost machines to search a web crawl of 0.5 billion pages showing that sequential scanning is a viabl
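
    The sequential-scanning idea can be illustrated with a small in-process sketch. This is not MIREX code and does not use a cluster; the map and reduce functions, the raw term-frequency score and the toy documents are assumptions chosen to show the shape of the computation only.

```python
import heapq
from collections import defaultdict

def map_document(doc_id, text, queries):
    """Map step: scan one document and emit (query_id, (score, doc_id)) pairs."""
    tokens = text.lower().split()
    for query_id, terms in queries.items():
        score = sum(tokens.count(t) for t in terms)  # raw term-frequency score
        if score > 0:
            yield query_id, (score, doc_id)

def reduce_query(pairs, k=2):
    """Reduce step: keep the k best-scoring documents for one query."""
    return heapq.nlargest(k, pairs)

def run(docs, queries, k=2):
    grouped = defaultdict(list)  # stands in for the shuffle phase
    for doc_id, text in docs.items():
        for query_id, pair in map_document(doc_id, text, queries):
            grouped[query_id].append(pair)
    return {q: reduce_query(pairs, k) for q, pairs in grouped.items()}

# Hypothetical toy collection and query set.
docs = {
    "d1": "sequential scanning of web pages",
    "d2": "inverted index for web search",
    "d3": "web crawl web pages",
}
queries = {"q1": ["web", "pages"], "q2": ["index"]}
results = run(docs, queries)
```

    Every document is read exactly once and no index is built, so trying a new scoring function only means changing the body of the map step.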

  10. The JPL Library information retrieval system

    Science.gov (United States)

    Walsh, J.

    1975-01-01

    The development, capabilities, and products of the computer-based retrieval system of the Jet Propulsion Laboratory Library are described. The system handles books and documents, produces a book catalog, and provides a machine search capability. Programs and documentation are available to the public through NASA's computer software dissemination program.

  11. An Evaluation of Automatically Constructed Hypertexts for Information Retrieval.

    Science.gov (United States)

    Melucci, Massimo

    1999-01-01

    Assesses the retrieval effectiveness of automatically constructed interdocument hypertext links in information retrieval (IR). Describes experiments using statistical and probabilistic techniques that were designed to obtain evidence concerning the usefulness of querying and browsing automatically constructed IR hypertexts. Results indicate a…

  12. Improving Performance Support Systems through Information Retrieval Evaluation

    Science.gov (United States)

    Schatz, Steven

    2006-01-01

    This study examines existing and new methods for evaluating the success of information retrieval systems. The theory underlying current methods is not robust enough to allow testing retrieval using different meta-tagging schemas. Traditional measures rely on judgments of whether a document is relevant to a particular question. A good system…

  13. Bibliometric-Enhanced Information Retrieval. Editorial for the workshop.

    NARCIS (Netherlands)

    Mayr, Philipp; Schaer, Philipp; Scharnhorst, Andrea; Mutschke, Peter; de Rijke, Maarten; Kenter, Tom; de Vries, Arjen P.; Zhai, ChengXiang; de Jong, Franciska; Radinsky, Kira; Hofmann, Katja

    2014-01-01

    This first "Bibliometric-enhanced Information Retrieval" (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offe

  14. A Semantic Medical Multimedia Retrieval Approach Using Ontology Information Hiding

    OpenAIRE

    2013-01-01

    Searching useful information from unstructured medical multimedia data has been a difficult problem in information retrieval. This paper reports an effective semantic medical multimedia retrieval approach which can reflect the users’ query intent. Firstly, semantic annotations will be given to the multimedia documents in the medical multimedia database. Secondly, the ontology that represented semantic information will be hidden in the head of the multimedia documents. The main innovations of ...

  15. Intelligent Agent-Based System for Digital Library Information Retrieval

    Institute of Scientific and Technical Information of China (English)

    师雪霖; 牛振东; 宋瀚涛; 宋丽哲

    2003-01-01

    A new information search model is reported and the design and implementation of a system based on intelligent agent is presented. The system is an assistant information retrieval system which helps users to search what they need. The system consists of four main components: interface agent, information retrieval agent, broker agent and learning agent. They collaborate to implement system functions. The agents apply learning mechanisms based on an improved ID3 algorithm.

  16. Understanding information retrieval systems management, types, and standards

    CERN Document Server

    Bates, Marcia J

    2011-01-01

    In order to be effective for their users, information retrieval (IR) systems should be adapted to the specific needs of particular environments. The huge and growing array of types of information retrieval systems in use today is on display in Understanding Information Retrieval Systems: Management, Types, and Standards, which addresses over 20 types of IR systems. These various system types, in turn, present both technical and management challenges, which are also addressed in this volume. In order to be interoperable in a networked environment, IR systems must be able to use various types of

  17. A feature-centric view of information retrieval

    CERN Document Server

    Metzler, Donald

    2011-01-01

    Commercial Web search engines such as Google, Yahoo, and Bing are used every day by millions of people across the globe. With their ever-growing refinement and usage, it has become increasingly difficult for academic researchers to keep up with the collection sizes and other critical research issues related to Web search, which has created a divide between the information retrieval research being done within academia and industry. Such large collections pose a new set of challenges for information retrieval researchers. In this work, Metzler describes highly effective information retrieval mod

  18. An Integrated Information Retrieval Support System for Campus Network

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    This paper presents a new integrated information retrieval support system (IIRSS) which can help Web search engines retrieve cross-lingual information from heterogeneous resources stored in multi-databases in Intranet. The IIRSS, with a three-layer architecture, can cooperate with other application servers running in Intranet. By using intelligent agents to collect information and to create indexes on-the-fly, using an access control strategy to confine a user to browsing those accessible documents for him/her through a single portal, and using a new cross-lingual translation tool to help the search engine retrieve documents, the new system provides controllable information access with different authorizations, personalized services, and real-time information retrieval.

  19. Interfering effects of retrieval in learning new information.

    Science.gov (United States)

    Finn, Bridgid; Roediger, Henry L

    2013-11-01

    In 7 experiments, we explored the role of retrieval in associative updating, that is, in incorporating new information into an associative memory. We tested the hypothesis that retrieval would facilitate incorporating a new contextual detail into a learned association. Participants learned 3 pieces of information: a person's face, name, and profession (in Experiments 1-5). In the 1st phase, participants in all conditions learned faces and names. In the 2nd phase, participants either restudied the face-name pair (the restudy condition) or were given the face and asked to retrieve the name (the test condition). In the 3rd phase, professions were presented for study just after restudy or testing. Our prediction was that the new information (the profession) would be more readily learned following retrieval of the face-name association compared to restudy of the face-name association. However, we found that the act of retrieval generally undermined acquisition of new associations rather than facilitating them. This detrimental effect emerged on both immediate and delayed tests. Further, the effect was not due to selective attention to feedback because we found impairment whether or not feedback was provided after the Phase 2 test. The data are novel in showing that the act of retrieving information can inhibit the ability to learn new information shortly thereafter. The results are difficult to accommodate within current theories that mostly emphasize benefits of retrieval for learning.

  20. Vector space model for document representation in information retrieval

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2007-12-01

    This paper presents the basics of information retrieval: the vector space model for document representation with Boolean and term weighted models, ranking methods based on the cosine factor and evaluation measures: recall, precision and combined measure.
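
    As a minimal sketch of the concepts listed in this abstract, the code below builds term-weighted vectors, ranks documents by cosine similarity and computes precision, recall and a combined measure. The tf-idf weighting and the use of F1 as the combined measure are assumptions, since the abstract names neither; the documents and query are hypothetical.

```python
import math
from collections import Counter

def build_idf(docs):
    """Inverse document frequency for every term in a dict of tokenised documents."""
    n = len(docs)
    df = Counter(term for tokens in docs.values() for term in set(tokens))
    return {t: math.log(n / df[t]) for t in df}

def vectorize(tokens, idf):
    """Term-weighted vector: term frequency times idf (unknown terms get weight 0)."""
    return {t: tf * idf.get(t, 0.0) for t, tf in Counter(tokens).items()}

def cosine(u, v):
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def precision_recall_f1(retrieved, relevant):
    hits = len(set(retrieved) & set(relevant))
    p = hits / len(retrieved) if retrieved else 0.0
    r = hits / len(relevant) if relevant else 0.0
    f1 = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f1

docs = {
    "d1": ["vector", "space", "model"],
    "d2": ["boolean", "model"],
    "d3": ["cosine", "ranking", "vector"],
}
idf = build_idf(docs)
query = vectorize(["vector", "model"], idf)
ranked = sorted(docs, key=lambda d: cosine(query, vectorize(docs[d], idf)), reverse=True)
p, r, f1 = precision_recall_f1(ranked[:2], relevant=["d1", "d3"])
```

    A Boolean model is the special case in which every weight is 0 or 1 and a document either matches the query or does not.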

  1. Private Information Retrieval and Connections to Coding Theory

    OpenAIRE

    Horlemann, Anna-Lena

    2017-01-01

    We give an introduction to the problem of private information retrieval and show simple first ideas how to achieve this. Then we will generalize these ideas and show how known techniques from coding theory are helpful in this regard.
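
    One simple first idea of this kind is the classic two-server XOR scheme, sketched below. It assumes two servers that hold identical copies of a bit database and do not collude; whether the paper uses exactly this construction is not stated in the abstract, and all function names here are hypothetical.

```python
import secrets

def server_answer(database, subset):
    """A server returns the XOR of the requested bits; it sees only `subset`."""
    answer = 0
    for j in subset:
        answer ^= database[j]
    return answer

def make_queries(n, i):
    """Client: a uniformly random subset S, and S with index i toggled."""
    s1 = {j for j in range(n) if secrets.randbits(1)}
    s2 = s1 ^ {i}  # symmetric difference flips the membership of i
    return s1, s2

def retrieve(database, i):
    s1, s2 = make_queries(len(database), i)
    # In a real deployment the two calls go to different, non-colluding servers.
    return server_answer(database, s1) ^ server_answer(database, s2)

database = [1, 0, 1, 1, 0, 0, 1, 0]
recovered = [retrieve(database, i) for i in range(len(database))]
```

    Each query on its own is a uniformly random subset, so neither server learns the index, while every bit except bit i cancels when the two answers are XORed.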

  2. A Question Answering service for information retrieval in Cooper

    NARCIS (Netherlands)

    Giesbers, Bas; Taddeo, Antonio; Van der Vegt, Wim; Van Bruggen, Jan; Koper, Rob

    2007-01-01

    Giesbers, B., Taddeo, A., van der Vegt, W., van Bruggen, J., Koper, R. (2007). A Question Answering service for information retrieval in Cooper. Paper presented at the Cooper workshop, September 18, Crete, Greece.

  3. Classification in Information Retrieval: The Twenty Years Following Dorking.

    Science.gov (United States)

    Coates, E. J.

    1978-01-01

    Discusses theoretical and practical progress made in the classification of information for retrieval in the last 20 years and suggests alternatives to the Dewey Decimal and Library of Congress classification systems. (JVP)

  4. GRAMMAR RULE BASED INFORMATION RETRIEVAL MODEL FOR BIG DATA

    Directory of Open Access Journals (Sweden)

    T. Nadana Ravishankar

    2015-07-01

    Though Information Retrieval (IR) in big data has been an active field of research for the past few years, the popularity of native languages presents a unique challenge for big data information retrieval systems. There is a need to retrieve information which is present in English and display it in the native language for users. This aim of cross-language information retrieval is complicated by unique features of the native languages, such as morphology, compound word formations, word spelling variations, ambiguity, word synonyms, and the influence of other languages. To overcome some of these issues, the native language is modeled using a grammar rule based approach in this work. The advantage of this approach is that the native language is modeled and its unique features are encoded using a set of inference rules. This rule base, coupled with the customized ontological system, shows considerable potential and is found to give better precision and recall.

  5. INTELLIGENT INFORMATION RETRIEVAL WITHIN DIGITAL LIBRARY USING DOMAIN ONTOLOGY

    Directory of Open Access Journals (Sweden)

    Thinn Mya Mya Swe

    2011-07-01

    A digital library is a type of information retrieval (IR) system. Existing information retrieval methodologies generally have problems with keyword searching. We propose a model that addresses the problem using a concept-based approach (ontology and metadata case base). The model identifies domain concepts in the user's query and applies expansion to them. The system aims to improve the relevance of results retrieved from digital libraries by proposing a conceptual query expansion for intelligent concept-based retrieval. We need to import the concept of ontology, making use of its advantages of abundant semantics and standard concepts. Domain-specific ontology can be used to move information retrieval from the traditional level based on keywords to the level based on knowledge (or concepts), and to change the retrieval process from traditional keyword matching to semantic matching. One approach is query expansion using domain ontology; the other is a case-based similarity measure for metadata information retrieval using the Case Based Reasoning (CBR) approach. Results show improvements over the classic method, query expansion using general purpose ontology, and a number of other approaches.

  6. Information retrieval in digital libraries: bringing search to the net.

    Science.gov (United States)

    Schatz, B R

    1997-01-17

    A digital library enables users to interact effectively with information distributed across a network. These network information systems support search and display of items from organized collections. In the historical evolution of digital libraries, the mechanisms for retrieval of scientific literature have been particularly important. Grand visions in 1960 led first to the development of text search, from bibliographic databases to full-text retrieval. Next, research prototypes catalyzed the rise of document search, from multimedia browsing across local-area networks to distributed search on the Internet. By 2010, the visions will be realized, with concept search enabling semantic retrieval across large collections.

  7. Noun-Phrase Analysis in Unrestricted Text for Information Retrieval

    OpenAIRE

    Evans, David A.; Zhai, Chengxiang

    1996-01-01

    Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of a few simple, yet robust and efficient noun-phrase analysis techniques to create better indexing phrases for information retrieval. In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from ...

  8. Care episode retrieval: distributional semantic models for information retrieval in the clinical domain.

    Science.gov (United States)

    Moen, Hans; Ginter, Filip; Marsi, Erwin; Peltonen, Laura-Maria; Salakoski, Tapio; Salanterä, Sanna

    2015-01-01

    Patients' health related information is stored in electronic health records (EHRs) by health service providers. These records include sequential documentation of care episodes in the form of clinical notes. EHRs are used throughout the health care sector by professionals, administrators and patients, primarily for clinical purposes, but also for secondary purposes such as decision support and research. The vast amounts of information in EHR systems complicate information management and increase the risk of information overload. Therefore, clinicians and researchers need new tools to manage the information stored in the EHRs. A common use case is, given a--possibly unfinished--care episode, to retrieve the most similar care episodes among the records. This paper presents several methods for information retrieval, focusing on care episode retrieval, based on textual similarity, where similarity is measured through domain-specific modelling of the distributional semantics of words. Models include variants of random indexing and the semantic neural network model word2vec. Two novel methods are introduced that utilize the ICD-10 codes attached to care episodes to better induce domain-specificity in the semantic model. We report on experimental evaluation of care episode retrieval that circumvents the lack of human judgements regarding episode relevance. Results suggest that several of the methods proposed outperform a state-of-the-art search engine (Lucene) on the retrieval task.

  9. Semantic Annotation for Biological Information Retrieval System

    Directory of Open Access Journals (Sweden)

    Mohamed Marouf Z. Oshaiba

    2015-01-01

    Online literature is growing at a tremendous rate, and the biological domain is one of the fastest growing. Biological researchers have trouble finding what they are searching for effectively and efficiently. The aim of this research is to find documents that contain any combination of biological process and/or molecular function and/or cellular component. This research proposes a framework that helps researchers retrieve meaningful documents related to their asserted terms based on the Gene Ontology (GO). The system utilizes GO by semantically decomposing it into three subontologies (cellular component, biological process, and molecular function). The researcher has the flexibility to choose search terms from any combination of the three subontologies. Document annotation is used to create an index of the biological terms in documents to speed up the search process. Query expansion is used to infer terms semantically related to the asserted terms; it increases the number of meaningful search results by using term synonyms and term relationships. The system uses a ranking method to order the retrieved documents based on ranking weights. The proposed system meets researchers' needs to find documents that fit the asserted terms semantically.

  10. A comparison of Boolean-based retrieval to the WAIS system for retrieval of aeronautical information

    Science.gov (United States)

    Marchionini, Gary; Barlow, Diane

    1994-01-01

    An evaluation of an information retrieval system using a Boolean-based retrieval engine and inverted file architecture and WAIS, which uses a vector-based engine, was conducted. Four research questions in aeronautical engineering were used to retrieve sets of citations from the NASA Aerospace Database which was mounted on a WAIS server and available through Dialog File 108 which served as the Boolean-based system (BBS). High recall and high precision searches were done in the BBS and terse and verbose queries were used in the WAIS condition. Precision values for the WAIS searches were consistently above the precision values for high recall BBS searches and consistently below the precision values for high precision BBS searches. Terse WAIS queries gave somewhat better precision performance than verbose WAIS queries. In every case, a small number of relevant documents retrieved by one system were not retrieved by the other, indicating the incomplete nature of the results from either retrieval system. Relevant documents in the WAIS searches were found to be randomly distributed in the retrieved sets rather than distributed by ranks. Advantages and limitations of both types of systems are discussed.
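
    For readers unfamiliar with the Boolean side of this comparison, here is a minimal sketch of Boolean retrieval over an inverted file. It does not reproduce Dialog or WAIS internals; the documents, function names and whitespace tokenisation are assumptions for illustration.

```python
from collections import defaultdict

def build_inverted_file(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index

def boolean_and(index, terms):
    """Documents containing every term (a narrow query)."""
    postings = [index.get(t, set()) for t in terms]
    return set.intersection(*postings) if postings else set()

def boolean_or(index, terms):
    """Documents containing at least one term (a broad query)."""
    return set().union(*(index.get(t, set()) for t in terms))

# Hypothetical citation titles.
docs = {
    "a1": "wing flutter analysis",
    "a2": "wing icing",
    "a3": "engine icing analysis",
}
index = build_inverted_file(docs)
narrow = boolean_and(index, ["wing", "analysis"])
broad = boolean_or(index, ["wing", "icing"])
```

    A Boolean engine returns an unranked set, so the searcher trades precision against recall by tightening or loosening the query, whereas a vector-based engine returns a ranked list for a free-text query.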

  11. Adaptive multi-agent system for information retrieval

    Science.gov (United States)

    Maleki-dizaji, Saeedeh; Nyongesa, H. O.; Siddiqqi, J.

    2001-10-01

    The current exponential growth of the Internet precipitates a need for improved tools to help people cope with the volume of information available. Existing search engines such as Yahoo, AltaVista and Excite are efficient in terms of high recall (percentage of relevant documents that are retrieved from the Internet) and fast response time, at the cost of poor precision (percentage of documents retrieved that are considered relevant). The problem is due to the lack of filtering, lack of specialisation, lack of relevance feedback, lack of adaptation and lack of exploration. One solution to the above problems is to use intelligent agents, which can operate autonomously and become better over time. The agents rely on a user model to improve their performance in retrieving information. This paper presents an adaptive information retrieval (IR) system that learns from user feedback through an evolutionary method, namely genetic algorithms (GA).

  12. Information seeking and information retrieval curriculum development for courses taught in two LIS schools

    OpenAIRE

    Bates, Jessica; Vilar, Polona; Žumer, Maja

    2015-01-01

    Introduction. This paper shows how the set of information seeking and retrieval topics (for devising a curriculum) relates to the curriculum of two modules taught at two different institutions: the Department of Library and Information Science and Book Studies at the University of Ljubljana, Slovenia, and the School of Information and Library Studies at University College Dublin, Ireland. Method. The information seeking and retrieval framework is compared to the str...

  13. Adaptive Visualization for Focused Personalized Information Retrieval

    Science.gov (United States)

    Ahn, Jae-wook

    2010-01-01

    The new trend on the Web has totally changed today's information access environment. The traditional information overload problem has evolved into the qualitative level beyond the quantitative growth. The mode of producing and consuming information is changing and we need a new paradigm for accessing information. Personalized search is one of…

  15. Learning about Potential Users of Collaborative Information Retrieval Systems

    CERN Document Server

    Reddy, Madhu

    2009-01-01

    One of the key components of designing usable and useful collaborative information retrieval systems is to understand the needs of the users of these systems. Our research team has been exploring collaborative information behavior in a variety of organizational settings. Our research goals have been two-fold: first, to develop a conceptual understanding of collaborative information behavior and, second, to gather requirements for the design of collaborative information retrieval systems. In this paper, we present a brief overview of our fieldwork in three different organizational settings, discuss our methodology for collecting data on collaborative information behavior, and highlight some lessons that we are learning about potential users of collaborative information retrieval systems in these domains.

  16. Optimizing XML Information Retrieval Query Execution at the Physical Level

    NARCIS (Netherlands)

    Os, van R.

    2007-01-01

    XML is emerging as a standard format for information interchange and storage of structured information. The wide-spread use of XML has sparked the interest of both the database and information retrieval research communities. XML databases are designed to store and query large volumes of XML data. St

  17. Associative conceptual space-based information retrieval systems

    NARCIS (Netherlands)

    M.J. Schuemie (Martijn); J.H. van den Berg (Jan)

    1998-01-01

    In this 'Information Era' with the availability of large collections of books, articles, journals, CD-ROMs, video films and so on, there exists an increasing need for intelligent information retrieval systems that enable users to find the information desired easily. Many attempts have be

  19. Bibliometrics and Information Retrieval - Creating Knowledge through Research Synergies

    NARCIS (Netherlands)

    Bar-Ilan, Judit; Koopman, Rob; Wang, Shenghui; Scharnhorst, Andrea; John, Marcus; Mayr, Philipp; Wolfram, Dietmar

    2016-01-01

    This panel brings together experts in bibliometrics and information retrieval to discuss how each of these two important areas of information science can help to inform the research of the other. There is a growing body of literature that capitalizes on the synergies created by combining methodologi

  20. Roogle: an information retrieval engine for clinical data warehouse.

    Science.gov (United States)

    Cuggia, Marc; Garcelon, Nicolas; Campillo-Gimenez, Boris; Bernicot, Thomas; Laurent, Jean-François; Garin, Etienne; Happe, André; Duvauferrier, Régis

    2011-01-01

    A large amount of relevant information is contained in reports stored in electronic patient records and associated metadata. R-oogle is a project aiming to develop information retrieval engines adapted to these reports and designed for clinicians. The system consists of a data warehouse (full-text reports and structured data) imported from two different hospital information systems. Information retrieval is performed using metadata-based semantic and full-text search methods (as in Google). Applications may include biomarker identification in a translational approach, searching for specific cases, constitution of cohorts, professional practice evaluation, and quality control assessment.

  1. Knowledge Maps and Information Retrieval (KMIR II)

    NARCIS (Netherlands)

    Mutschke, Peter; Scharnhorst, Andrea; Mayr, Philipp; Slavic, Aida; Hansen, Preben

    2015-01-01

    Information systems usually show as a particular point of failure the vagueness between user search terms and the knowledge orders of the information space in question. Some kind of guided searching therefore becomes more and more important in order to more precisely discover information without kno

  2. Visualization for Information Retrieval in Regional Distributed Environment

    Directory of Open Access Journals (Sweden)

    Amany Salama

    2013-09-01

    Full Text Available Information retrieval (IR) is the task of representing, storing, organizing, and offering access to information items. The problem for search engines is not only to find topic-relevant results, but to find results consistent with the user's information need. How to retrieve desired information from the Internet efficiently and effectively has become the main concern of Internet users. The interfaces of existing systems do not help users perceive the precision of the results. Speed, resource consumption, and the searching and retrieving processes are also not optimal. The search engine's aim is to improve the performance of the information retrieval system and to serve users whatever their cultural level. The proposed system uses information visualization to address interface problems; to address other problems of web IR systems, it uses a regional crawler in a distributed search environment with conceptual query processing and an enhanced vector space information retrieval model (VSM). It is an effective attempt to match users' changing needs and to achieve better performance than an ordinary system.

  3. Locally decodable codes and private information retrieval schemes

    CERN Document Server

    Yekhanin, Sergey

    2010-01-01

    Locally decodable codes (LDCs) are codes that simultaneously provide efficient random access retrieval and high noise resilience by allowing reliable reconstruction of an arbitrary bit of a message by looking at only a small number of randomly chosen codeword bits. Local decodability comes with a certain loss in terms of efficiency - specifically, locally decodable codes require longer codeword lengths than their classical counterparts. Private information retrieval (PIR) schemes are cryptographic protocols designed to safeguard the privacy of database users. They allow clients to retrieve rec
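    To make local decodability concrete, here is a minimal sketch using the classical Hadamard code, a textbook example of a 2-query locally decodable code. It is offered as an illustration only and is not claimed to be a construction from this book; the function names are hypothetical. It also shows the cost mentioned in the abstract: k message bits become 2**k codeword bits.

```python
import random


def hadamard_encode(bits):
    """Encode k message bits into 2**k codeword bits.

    Position a (read as a k-bit mask) stores the parity of the message
    bits selected by a, i.e. the inner product <x, a> mod 2.
    """
    k = len(bits)
    x = sum(b << i for i, b in enumerate(bits))
    return [bin(x & a).count("1") % 2 for a in range(2 ** k)]


def decode_bit(codeword, k, i, rng=random):
    """Recover message bit i by reading only two codeword positions."""
    a = rng.randrange(2 ** k)   # uniformly random position
    b = a ^ (1 << i)            # same mask with bit i flipped
    # <x, a> + <x, a xor e_i> = x_i (mod 2)
    return codeword[a] ^ codeword[b]


def decode_bit_majority(codeword, k, i, trials=15, rng=random):
    """Repeat the 2-query decoder and take a majority vote."""
    votes = sum(decode_bit(codeword, k, i, rng) for _ in range(trials))
    return int(2 * votes > trials)


message = [1, 0, 1, 1]
codeword = hadamard_encode(message)
recovered = [decode_bit(codeword, len(message), i) for i in range(len(message))]
```

    Because each of the two queried positions is individually uniform, a small fraction of corrupted codeword bits spoils only a small fraction of the query pairs, which is why repeating the decoder and voting helps.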

  4. Graph-based term weighting for information retrieval

    DEFF Research Database (Denmark)

    Blanco, Roi; Lioma, Christina

    2012-01-01

    A standard approach to Information Retrieval (IR) is to model text as a bag of words. Alternatively, text can be modelled as a graph, whose vertices represent words, and whose edges represent relations between the words, defined on the basis of any meaningful statistical or linguistic relation... weights and (2) integrating discourse aspects into retrieval. Given a text graph, whose vertices denote terms linked by co-occurrence and grammatical modification, we use graph ranking computations (e.g. PageRank Page et al. in The pagerank citation ranking: Bringing order to the Web. Technical report... , flow and density during retrieval. We experimentally show that this type of ranking performs comparably to BM25, and can even outperform it, across different TREC (Voorhees and Harman in TREC: Experiment and evaluation in information retrieval, MIT Press, 2005) datasets and evaluation measures. © 2011...
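    The idea of deriving term weights from a text graph can be sketched as follows. This is a simplified illustration, not the authors' exact method: it links terms by co-occurrence only (no grammatical modification), and the window size, damping factor and iteration count are assumed values.

```python
from collections import defaultdict


def cooccurrence_graph(tokens, window=2):
    """Undirected graph linking terms that occur within `window` positions."""
    graph = defaultdict(set)
    for i, term in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                graph[term].add(other)
                graph[other].add(term)
    return graph


def pagerank(graph, damping=0.85, iterations=50):
    """Power iteration; every vertex in `graph` has at least one neighbour."""
    n = len(graph)
    rank = {v: 1.0 / n for v in graph}
    for _ in range(iterations):
        rank = {
            v: (1 - damping) / n
            + damping * sum(rank[u] / len(graph[u]) for u in graph[v])
            for v in graph
        }
    return rank


tokens = "graph based term weighting ranks each term in the text graph".split()
term_weights = pagerank(cooccurrence_graph(tokens))
```

    The resulting scores could stand in for term frequencies when ranking documents against a query; terms that co-occur with many other terms receive higher weight.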

  5. Foundations of Large-Scale Multimedia Information Management and Retrieval

    CERN Document Server

    Chang, Edward Y

    2011-01-01

    "Foundations of Large-Scale Multimedia Information Management and Retrieval - Mathematics of Perception" covers knowledge representation and semantic analysis of multimedia data and scalability in signal extraction, data mining, and indexing. The book is divided into two parts: Part I - Knowledge Representation and Semantic Analysis focuses on the key components of mathematics of perception as it applies to data management and retrieval. These include feature selection/reduction, knowledge representation, semantic analysis, distance function formulation for measuring similarity, and

  6. Personalized Multimedia Information Retrieval based on User Profile Mining

    Directory of Open Access Journals (Sweden)

    Pengyi Zhang

    2013-10-01

    Full Text Available This paper focuses on how to retrieve personalized multimedia information based on user interest, which can be mined from the user profile. After analyzing related work, a general structure of the personalized multimedia information retrieval system is given, which combines an online module and an offline module. Firstly, we collect a large-scale set of photos from multimedia information sharing websites. Then, we record the information of the users who upload the multimedia information. For a given user, we save the history data that could describe the multimedia data. Secondly, the relationship between the contents of multimedia data and semantic information is analyzed, and the user interest model is then constructed by a modified LDA model which can integrate all the influencing factors in the task of multimedia information retrieval. Thirdly, the query distributions of all the topics can be estimated by the proposed modified LDA model. Fourthly, based on the above offline computing process, the online personalized multimedia information ranking algorithm is given, which utilizes the user interest model and the query word. Fifthly, multimedia information retrieval results are obtained using the proposed personalized multimedia information ranking algorithm. Finally, performance evaluation is conducted through a series of experiments to test the performance of the proposed algorithm compared with other methods on different datasets.

  7. Managing Event Information Modeling, Retrieval, and Applications

    CERN Document Server

    Gupta, Amarnath

    2011-01-01

    With the proliferation of citizen reporting, smart mobile devices, and social media, an increasing number of people are beginning to generate information about events they observe and participate in. A significant fraction of this information contains multimedia data to share the experience with their audience. A systematic information modeling and management framework is necessary to capture this widely heterogeneous, schemaless, potentially humongous information produced by many different people. This book is an attempt to examine the modeling, storage, querying, and applications of such an

  8. Hypertext and hypermedia systems in information retrieval

    Science.gov (United States)

    Kaye, K. M.; Kuhn, A. D.

    1992-01-01

    This paper opens with a brief history of hypertext and hypermedia in the context of information management during the 'information age.' Relevant terms are defined and the approach of the paper is explained. Linear and hypermedia information access methods are contrasted. A discussion of hyperprogramming in the handling of complex scientific and technical information follows. A selection of innovative hypermedia systems is discussed. An analysis of the Clinical Practice Library of Medicine NASA STI Program hypermedia application is presented. The paper concludes with a discussion of the NASA STI Program's future hypermedia project plans.

  9. 108 Information Retrieval Methods in Libraries and Information ...

    African Journals Online (AJOL)

    User

    Indexed African Journals Online: www.ajol.info .... internet resources is catalogued according to DDC on all academic sources. Users can search the ... retrieval could be based on a structure of semantic relationship. Macleod .... References.

  10. Can We Retrieve the Information Which Was Intentionally Forgotten? Electrophysiological Correlates of Strategic Retrieval in Directed Forgetting.

    Science.gov (United States)

    Mao, Xinrui; Tian, Mengxi; Liu, Yi; Li, Bingcan; Jin, Yan; Wu, Yanhong; Guo, Chunyan

    2017-01-01

    The retrieval inhibition hypothesis of directed forgetting effects assumes that TBF (to-be-forgotten) items are not retrieved intentionally, while the selective rehearsal hypothesis assumes that the memory representation of retrieved TBF items is weaker than that of TBR (to-be-remembered) items. Previous studies indicated that directed forgetting effects in the item-cueing method resulted from selective rehearsal at encoding, but the mechanism by which retrieval inhibition affected directed forgetting of TBF items was not clear. Strategic retrieval is a control process allowing the selective retrieval of target information, which includes retrieval orientation and strategic recollection. Retrieval orientation, measured via the comparison of tasks, refers to the specific form of processing resulting from retrieval efforts. Strategic recollection is the type of strategy used to recollect studied items for the successful retrieval of targets. Using a "directed forgetting" paradigm combined with a memory exclusion task, our investigation of strategic retrieval in directed forgetting helped to explore how retrieval inhibition plays a role in directed forgetting effects. When TBF items were targeted, retrieval orientation showed more positive ERPs to new items, indicating that TBF items demanded more retrieval effort. The results of strategic recollection indicated that: (a) when TBR items were retrieval targets, late parietal old/new effects were evoked only by TBR items and not by TBF items, indicating the retrieval inhibition of TBF items; (b) when TBF items were retrieval targets, the late parietal old/new effect was evoked by both TBR items and TBF items, indicating that strategic retrieval could overcome retrieval inhibition of TBF items. These findings suggest that strategic retrieval modulates the retrieval inhibition of directed forgetting, supporting the view that directed forgetting effects are caused not only by selective rehearsal but also by retrieval inhibition.

  11. Local Area Networks for Information Retrieval.

    Science.gov (United States)

    Kibirige, Harry M.

    This examination of the use of local area networks (LANs) by libraries summarizes the findings of a nationwide survey of 600 libraries and information centers and 200 microcomputer networking system manufacturers and vendors, which was conducted to determine the relevance of currently available networking systems for library and information center…

  12. Cosmos: An Information Retrieval System that Works.

    Science.gov (United States)

    Clay, Katherine; Grossman, Alvin

    1980-01-01

    Briefly described is the County of San Mateo Online System (COSMOS) which was developed and is used by the San Mateo Educational Resources Center (SMERC) to access the Educational Resources Information Center (ERIC) and Fugitive Information Data Organizer (FIDO) databases as well as the curriculum guides housed at SMERC. (TG)

  13. Semantic-Sensitive Web Information Retrieval Model for HTML Documents

    CERN Document Server

    Bassil, Youssef

    2012-01-01

    With the advent of the Internet, a new era of digital information exchange has begun. Currently, the Internet encompasses more than five billion online sites and this number is exponentially increasing every day. Fundamentally, Information Retrieval (IR) is the science and practice of storing documents and retrieving information from within these documents. Mathematically, IR systems are at their core based on a feature vector model coupled with a term weighting scheme that weights terms in a document according to their significance with respect to the context in which they appear. Practically, the Vector Space Model (VSM), Term Frequency (TF), and Inverse Document Frequency (IDF) are among the long-established techniques employed in mainstream IR systems. However, present IR models only target generic text documents, in that they do not consider specific file formats such as HTML web documents. This paper proposes a new semantic-sensitive web information retrieval model for HTML documents. It consists of a...
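    The long-established baseline that this abstract refers to (a vector space model with TF-IDF term weighting) can be sketched in a few lines. This is the generic textbook scheme, not the HTML-aware model the paper proposes; the helper names and the toy collection are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (sparse TF-IDF vectors, idf table) for tokenised documents."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    idf = {t: math.log(n / df[t]) for t in df}
    vectors = [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank(query, docs):
    """Return (document indices by descending similarity, scores)."""
    vectors, idf = tfidf_vectors(docs)
    q_vec = {t: tf * idf.get(t, 0.0) for t, tf in Counter(query).items()}
    scores = [cosine(q_vec, vec) for vec in vectors]
    order = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)
    return order, scores


docs = [
    ["html", "tags", "web"],
    ["web", "search", "engine"],
    ["music", "audio", "files"],
]
order, scores = rank(["html", "web"], docs)
```

    Terms that occur in fewer documents get a larger IDF, so a match on "html" counts for more than a match on "web" in this toy collection.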

  14. Music information retrieval in compressed audio files: a survey

    Science.gov (United States)

    Zampoglou, Markos; Malamos, Athanasios G.

    2014-07-01

    In this paper, we present an organized survey of the existing literature on music information retrieval systems in which descriptor features are extracted directly from the compressed audio files, without prior decompression to pulse-code modulation format. Avoiding the decompression step and utilizing the readily available compressed-domain information can significantly lighten the computational cost of a music information retrieval system, allowing application to large-scale music databases. We identify a number of systems relying on compressed-domain information and form a systematic classification of the features they extract, the retrieval tasks they tackle and the degree to which they achieve an actual increase in overall speed, as well as any resulting loss in accuracy. Finally, we discuss recent developments in the field, and the potential research directions they open toward ultra-fast, scalable systems.

  15. Efficient medical information retrieval in encrypted Electronic Health Records.

    Science.gov (United States)

    Pruski, Cédric; Wisniewski, François

    2012-01-01

    The recent development of eHealth platforms across the world, whose main objective is to centralize patients' healthcare information to ensure the best continuity of care, requires the development of advanced tools and techniques for supporting health professionals in retrieving relevant information from this vast quantity of data. However, to preserve patients' privacy, some countries have decided to de-identify and encrypt the data contained in shared Electronic Health Records, which increases the complexity of proposing efficient medical information retrieval approaches. In this paper, we describe an original approach exploiting standards metadata as well as knowledge organizing systems to overcome the barriers of data encryption and improve the results of medical information retrieval in centralized and encrypted Electronic Health Records. This is done through the exploitation of semantic properties provided by knowledge organizing systems, which enable query expansion. Furthermore, we provide an overview of the approach together with illustrative examples and a discussion of the advantages and limitations of the provided framework.

  16. The Paradox of the Fuzzy Disambiguation in the Information Retrieval

    Directory of Open Access Journals (Sweden)

    Anna Bryniarska

    2013-09-01

    Full Text Available Current methods of data mining, word sense disambiguation in information retrieval, semantic relations, fuzzy set theory, fuzzy description logic, and fuzzy ontology, together with their implementations, overlook the existence of a paradox, called here the paradox of fuzzy disambiguation. The paradox lies in the fact that precise knowledge can be obtained from fuzzy data and expert knowledge. In this paper, a conceptual apparatus is introduced to describe this paradox. Moreover, an information retrieval logic is formulated, and certain applications of this logic to searching for information on the Web are suggested.

  17. A novel dependency language model for information retrieval

    Institute of Scientific and Technical Information of China (English)

    CAI Ke-ke; BU Jia-jun; CHEN Chun; QIU Guang

    2007-01-01

    This paper explores the application of term dependency in information retrieval (IR) and proposes a novel dependency retrieval model. This retrieval model suggests an extension to the existing language modeling (LM) approach to IR by introducing dependency models for both query and document. Relevance between document and query is then evaluated by reference to the Kullback-Leibler divergence between their dependency models. This paper introduces a novel hybrid dependency structure, which allows integration of various forms of dependency within a single framework. A pseudo relevance feedback based method is also introduced for constructing the query dependency model. The basic idea is to use query-relevant top-ranking sentences extracted from the top documents at retrieval time as the augmented representation of the query, from which the relationships between query terms are identified. A Markov Random Field (MRF) based approach is presented to ensure the relevance of the extracted sentences, which utilizes the association features between query terms within a sentence to evaluate the relevance of each sentence. This dependency retrieval model was compared with other traditional retrieval models. Experiments indicated that it produces significant improvements in retrieval effectiveness.
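    Ranking by Kullback-Leibler divergence between a query model and a document model can be illustrated with plain unigram language models. The sketch below deliberately omits the paper's dependency structures; the additive smoothing constant `mu` and all function names are assumptions made for the example.

```python
import math
from collections import Counter


def language_model(tokens, vocabulary, mu=0.5):
    """Additively smoothed unigram model over a fixed vocabulary."""
    counts = Counter(tokens)
    total = len(tokens) + mu * len(vocabulary)
    return {t: (counts[t] + mu) / total for t in vocabulary}


def kl_divergence(p, q):
    """KL(p || q) for two distributions defined on the same vocabulary."""
    return sum(p[t] * math.log(p[t] / q[t]) for t in p if p[t] > 0)


def rank_by_kl(query, docs):
    """Return document indices ordered from lowest to highest divergence."""
    vocabulary = set(query).union(*docs)
    q_model = language_model(query, vocabulary)
    scores = [kl_divergence(q_model, language_model(d, vocabulary)) for d in docs]
    return sorted(range(len(docs)), key=scores.__getitem__)


docs = [
    ["term", "dependency", "model", "term"],
    ["music", "audio", "retrieval"],
]
order = rank_by_kl(["term", "dependency"], docs)
```

    A lower divergence means the document model is closer to the query model, so the first index in `order` is the best match. Smoothing keeps every probability positive, which keeps the logarithm finite.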

  18. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Shozo Makino

    2007-01-01

    Full Text Available Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  19. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information

    Directory of Open Access Journals (Sweden)

    Suzuki Motoyuki

    2007-01-01

    Full Text Available Recently, several music information retrieval (MIR) systems which retrieve musical pieces by the user's singing voice have been developed. All of these systems use only melody information for retrieval, although lyrics information is also useful for retrieval. In this paper, we propose a new MIR system that uses both lyrics and melody information. First, we propose a new lyrics recognition method. A finite state automaton (FSA) is used as recognition grammar, and about 86% retrieval accuracy was obtained. We also develop an algorithm for verifying a hypothesis output by a lyrics recognizer. Melody information is extracted from an input song using several pieces of information of the hypothesis, and a total score is calculated from the recognition score and the verification score. From the experimental results, 95.0% retrieval accuracy was obtained with a query consisting of five words.

  20. Teaching Skills in Medical Information Retrieval to Medical Students.

    Science.gov (United States)

    Kolner, Stuart J.; And Others

    1986-01-01

    A project that attempts to overcome the principal obstacles and to provide an efficient and effective method of teaching information retrieval skills to second-year medical students is described. The method includes a pretest, a diagnosis of deficiencies in information skills, a self-paced learning module, and a posttest. (Author/MLW)

  1. Research and Development of Information Retrieval Models and Their Applications.

    Science.gov (United States)

    Fox, Edward A.

    1989-01-01

    This introduction to a special issue devoted to modeling data, information, and knowledge briefly describes the origins of the papers presented and the topics covered, which include: Boolean logic; probability theory; artificial intelligence; organizing and encoding information and data; and characteristics of users of retrieval systems. (12…

  2. The Internet and Information Retrieval Research: A Brief Review.

    Science.gov (United States)

    Chowdhury, G. G.

    1999-01-01

    A survey of recent publications shows that frequent topics of Internet and information retrieval research are the effectiveness of search engines, information validation and quality, user studies, design of user interfaces, data structures and metadata, classification and vocabulary based aids, and indexing and search agents. The changing balance…

  3. The Physical and Cognitive Paradigms in Information Retrieval Research.

    Science.gov (United States)

    Ellis, David

    1992-01-01

    Explores the role of paradigms in information retrieval research and discusses the nature of a paradigm and the applicability of the paradigm concept to a multidisciplinary field such as information science. The features of the physical paradigm and the cognitive paradigm are outlined, and their origins, nature, and role are examined. (55…

  4. Learning to merge search results for efficient Distributed Information Retrieval

    NARCIS (Netherlands)

    Tjin-Kam-Jet, Kien; Hiemstra, Djoerd

    2010-01-01

    Merging search results from different servers is a major problem in Distributed Information Retrieval. We used Regression-SVM and Ranking-SVM which would learn a function that merges results based on information that is readily available: i.e. the ranks, titles, summaries and URLs contained in the

  5. Rare Disease Diagnosis as an Information Retrieval Task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina;

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend to be long lists of symptoms, often containing phrases, whereas web IR systems typically expect very short keyword-based queries. Motivated by such differences, this work uses a preliminary study of 30 clinical cases to reflect on rare disease retrieval as an IR task. Initial experiments using both Google web search and offline retrieval from a rare disease collection indicate that the retrieval of rare diseases is an open problem with room for improvement.

  6. Incidental retrieval-induced forgetting of location information.

    Science.gov (United States)

    Gómez-Ariza, Carlos J; Fernandez, Angel; Bajo, M Teresa

    2012-06-01

    Retrieval-induced forgetting (RIF) has been studied with different types of tests and materials. However, RIF has always been tested on the items' central features, and there is no information on whether inhibition also extends to peripheral features of the events in which the items are embedded. In two experiments, we specifically tested the presence of RIF in a task in which recall of peripheral information was required. After a standard retrieval practice task oriented to item identity, participants were cued with colors (Exp. 1) or with the items themselves (Exp. 2) and asked to recall the screen locations where the items had been displayed during the study phase. RIF for locations was observed after retrieval practice, an effect that was not present when participants were asked to read instead of retrieving the items. Our findings provide evidence that peripheral location information associated with an item during study can also be inhibited when the retrieval conditions promote the inhibition of more central, item identity information.

  7. Storage and retrieval of mass spectral information

    Science.gov (United States)

    Hohn, M. E.; Humberston, M. J.; Eglinton, G.

    1977-01-01

    Computer handling of mass spectra serves two main purposes: the interpretation of the occasional, problematic mass spectrum, and the identification of the large number of spectra generated in the gas-chromatographic-mass spectrometric (GC-MS) analysis of complex natural and synthetic mixtures. Methods available fall into the three categories of library search, artificial intelligence, and learning machine. Optional procedures for coding, abbreviating and filtering a library of spectra minimize time and storage requirements. Newer techniques make increasing use of probability and information theory in accessing files of mass spectral information.

  8. Distributed Systems and Applications of Information Filtering and Retrieval

    CERN Document Server

    Giuliani, Alessandro; Semeraro, Giovanni; DART 2012

    2014-01-01

    This volume focuses on new challenges in distributed Information Filtering and Retrieval. It collects invited chapters and extended research contributions from the special session on Information Filtering and Retrieval: Novel Distributed Systems and Applications (DART) of the 4th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2012), held in Barcelona, Spain, on 4-7 October 2012. The main focus of DART was to discuss and compare suitable novel solutions based on intelligent techniques and applied to real-world applications. The chapters of this book present a comprehensive review of related works and the state of the art. Authors, both practitioners and researchers, shared their results on several topics such as "Multi-Agent Systems", "Natural Language Processing", "Automatic Advertisement", "Customer Interaction Analytics" and "Opinion Mining". Contributions have been carefully reviewed by experts in the area, who also gave useful suggestions to improve the quality of the volume.

  9. Noun-Phrase Analysis in Unrestricted Text for Information Retrieval

    CERN Document Server

    Evans, David A.; Zhai, Chengxiang

    1996-01-01

    Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of a few simple, yet robust and efficient noun-phrase analysis techniques to create better indexing phrases for information retrieval. In particular, we describe a hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics. Results of experiments show that indexing based on such extracted subcompounds improves both recall and precision in an information retrieval system. The noun-phrase analysis techniques are also potentially useful for book indexing and automatic thesaurus extraction.

  10. Ontology Based Information Retrieval in Semantic Web: A Survey

    Directory of Open Access Journals (Sweden)

    Vishal Jain

    2013-09-01

    Full Text Available In the present age of computers, there are various resources for gathering information related to a given query, such as radio stations, television, the Internet and many more. Among them, the Internet is considered the major source for obtaining information about a given domain. When a user wants to find some information, he/she enters a query, and results are produced via hyperlinks to various documents available on the web. But the information retrieved may or may not be relevant. This irrelevance is caused by the huge collection of documents available on the web. Traditional search engines rely on keyword-based searching, which is unable to transform raw data into a knowledge representation. It is a cumbersome task to extract relevant information from a large collection of web documents. These shortcomings have led to the concepts of the Semantic Web (SW) and Ontology. The Semantic Web (SW) is a well-defined portal that helps in extracting relevant information using many Information Retrieval (IR) techniques. Current Information Retrieval (IR) techniques are not advanced enough to exploit semantic knowledge within documents and give precise results. The terms Information Retrieval (IR), Semantic Web (SW) and Ontology are used differently, but they are interconnected. Information Retrieval (IR) technology and web-based indexing contribute to the existence of the Semantic Web. The use of Ontology also contributes to building a new generation of the web, the Semantic Web. With the help of ontologies, web content can be marked up as Semantic Web documents (SWDs). Ontology is considered the backbone of a software system. It improves understanding between the concepts used in the Semantic Web (SW). So, there is a need to build an ontology using a well-defined methodology; the process of developing an ontology is called Ontology Development.

  11. Semantic knowledge representation for information retrieval

    CERN Document Server

    Gödert, Winfried; Nagelschmidt, Matthias

    2014-01-01

    This book covers the basics of semantic web technologies and indexing languages, and describes their contribution to improve languages as a tool for subject queries and knowledge exploration. The book is relevant to information scientists, knowledge workers and indexers. It provides a suitable combination of theoretical foundations and practical applications.

  12. Peer-to-peer information retrieval

    NARCIS (Netherlands)

    Tigelaar, Almer S.

    2012-01-01

    The Internet has become an integral part of our daily lives. However, the essential task of finding information is dominated by a handful of large centralised search engines. In this thesis we study an alternative to this approach. Instead of using large data centres, we propose using the machines th

  13. Image-based information, communication, and retrieval

    Science.gov (United States)

    Bryant, N. A.; Zobrist, A. L.

    1980-01-01

    IBIS/VICAR system combines video image processing and information management. Flexible programs require user to supply only parameters specific to particular application. Special-purpose input/output routines transfer image data with reduced memory requirements. New application programs are easily incorporated. Program is written in FORTRAN IV, Assembler, and OS JCL for batch execution and has been implemented on IBM 360.

  14. MIRANDA - Music Information Retrieval And Data Acquisition

    DEFF Research Database (Denmark)

    Lehn-Schiøler, Tue; Petersen, Kaare Brandt; Hansen, Lars Kai

    2006-01-01

    In this report we present a music data harvesting system based on a plug-in for a popular music player. When a user is playing a song using the plug-in, information about the song is anonymously submitted to a server. The data gathered using MIRANDA is intended to be released to the MIR community...

  15. Acquisition and retrieval of ophthalmology academic information

    Directory of Open Access Journals (Sweden)

    Lei Li

    2014-06-01

    Full Text Available This article discusses how to search for and access ophthalmology information through specialized websites and resources, introducing databases, search engines, electronic journals, electronic books and so on. It aims to help ophthalmic practitioners carry out scientific research and clinical practice.

  16. Physicists' Information Tasks: Structure, Length and Retrieval Performance

    DEFF Research Database (Denmark)

    Lykke, Marianne; Ingwersen, Peter; Bogers, Toine

    2010-01-01

    In this poster, we describe central aspects of 65 natural information tasks from 23 senior researchers, PhDs, and experienced MSc students from three different university departments of physics. We analyze 1) the main purpose of the information task, 2) which and how many search facets were used to describe the tasks, 3) what semantic categories were used to express the search facets, and 4) retrieval performance. Results show variety in structure and length across task descriptions and task purposes. The results indicate effect of length and, in particular, of task purpose on retrieval performance...

  17. Learning to rank for information retrieval and natural language processing

    CERN Document Server

    Li, Hang

    2014-01-01

    Learning to rank refers to machine learning techniques for training a model in a ranking task. Learning to rank is useful for many applications in information retrieval, natural language processing, and data mining. Intensive studies have been conducted on its problems recently, and significant progress has been made. This lecture gives an introduction to the area including the fundamental problems, major approaches, theories, applications, and future work. The author begins by showing that various ranking problems in information retrieval and natural language processing can be formalized as tw

  19. Information retrieval pathways for health information exchange in multiple care settings

    DEFF Research Database (Denmark)

    Kierkegaard, Patrick; Kaushal, Rainu; Vest, Joshua R.

    2014-01-01

    Objectives To determine which health information exchange (HIE) technologies and information retrieval pathways healthcare professionals relied on to meet their information needs in the context of laboratory test results, radiological images and reports, and medication histories. Study Design...

  20. Intelligent Information Retrieval: Part IV. Testing the Timing of Two Information Retrieval Devices in a Naturalistic Setting.

    Science.gov (United States)

    Cole, Charles

    2001-01-01

    Reports the results of two studies of undergraduates that tested an uncertainty expansion information retrieval device and an uncertainty reduction device in naturalistic settings, designed to be given at different stages of Kuhlthau's information search process. Concludes that the timing of the device interventions is crucial to their potential…

  1. Content-based Image Retrieval by Information Theoretic Measure

    Directory of Open Access Journals (Sweden)

    Madasu Hanmandlu

    2011-09-01

    Full Text Available Content-based image retrieval focuses on intuitive and efficient methods for retrieving images from databases based on the content of the images. A new entropy function that serves as a measure of information content in an image, termed 'an information theoretic measure', is devised in this paper. Among the various query paradigms, 'query by example' (QBE) is adopted to set a query image for retrieval from a large image database. In this paper, colour and texture features are extracted using the new entropy function and the dominant colour is considered as a visual feature for a particular set of images. Thus colour and texture features constitute the two-dimensional feature vector for indexing the images. The low dimensionality of the feature vector speeds up the atomic query. Indices in a large database system help retrieve the images relevant to the query image without looking at every image in the database. The entropy values of colour and texture and the dominant colour are considered for measuring the similarity. The utility of the proposed image retrieval system based on the information theoretic measures is demonstrated on a benchmark dataset. Defence Science Journal, 2011, 61(5), pp. 415-430, DOI: http://dx.doi.org/10.14429/dsj.61.1177
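
    The abstract above does not define its new entropy function, so the following is only a rough sketch of the general idea, with plain Shannon entropy swapped in as a stand-in. It builds a two-dimensional (colour entropy, texture entropy) feature vector and ranks database images by distance to a query image. The function names, the toy quantized values and the Euclidean distance are all assumptions made for illustration, not details from the paper.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy (in bits) of the empirical distribution of `values`.

    Stand-in only: the paper devises its own entropy function, not shown here.
    """
    counts = Counter(values)
    total = len(values)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def feature_vector(colour_bins, texture_bins):
    """Two-dimensional feature: (colour entropy, texture entropy)."""
    return (shannon_entropy(colour_bins), shannon_entropy(texture_bins))


def rank_by_distance(query, database):
    """Sort image names by Euclidean distance between feature vectors."""
    return sorted(database, key=lambda name: math.dist(query, database[name]))


# Hypothetical quantized colour/texture values for two stored images.
database = {
    "flat": feature_vector([0, 0, 0, 0], [1, 1, 1, 1]),
    "mixed": feature_vector([0, 1, 2, 3], [0, 0, 1, 1]),
}
# Query by example: the query image is described by the same kind of vector.
query = feature_vector([0, 1, 2, 3], [0, 1, 0, 1])
ranking = rank_by_distance(query, database)
```

    Because each image is reduced to two numbers, comparing a query against the stored vectors is cheap, which is in the spirit of the abstract's remark about low dimensionality.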

  2. MIRANDA - Music Information Retrieval And Data Acquisition

    DEFF Research Database (Denmark)

    Lehn-Schiøler, Tue; Petersen, Kaare Brandt; Hansen, Lars Kai

    2006-01-01

    In this report we present a music data harvesting system based on a plug-in for a popular music player. When a user is playing a song using the plug-in, information about the song is anonymously submitted to a server. The data gathered using MIRANDA is intended to be released to the MIR community. We argue that even though content-based data is of interest to the community, metadata and usage data can also be important for research in music similarity.

  3. Improving life sciences information retrieval using semantic web technology.

    Science.gov (United States)

    Quan, Dennis

    2007-05-01

    The ability to retrieve relevant information is at the heart of every aspect of research and development in the life sciences industry. Information is often distributed across multiple systems and recorded in a way that makes it difficult to piece together the complete picture. Differences in data formats, naming schemes and network protocols amongst information sources, both public and private, must be overcome, and user interfaces not only need to be able to tap into these diverse information sources but must also assist users in filtering out extraneous information and highlighting the key relationships hidden within an aggregated set of information. The Semantic Web community has made great strides in proposing solutions to these problems, and many efforts are underway to apply Semantic Web techniques to the problem of information retrieval in the life sciences space. This article gives an overview of the principles underlying a Semantic Web-enabled information retrieval system: creating a unified abstraction for knowledge using the RDF semantic network model; designing semantic lenses that extract contextually relevant subsets of information; and assembling semantic lenses into powerful information displays. Furthermore, concrete examples of how these principles can be applied to life science problems including a scenario involving a drug discovery dashboard prototype called BioDash are provided.

  4. Information Retrieval Systems Adapted to the Biomedical Domain

    CERN Document Server

    Marrero, Mónica; Urbano, Julián; Morato, Jorge; Moreiro, José-Antonio (doi:10.3145/epi.2010.may.04)

    2012-01-01

    The terminology used in Biomedicine shows lexical peculiarities that have required the elaboration of terminological resources and information retrieval systems with specific functionalities. The main characteristics are the high rates of synonymy and homonymy, due to phenomena such as the proliferation of polysemic acronyms and their interaction with common language. Information retrieval systems in the biomedical domain use techniques oriented to the treatment of these lexical peculiarities. In this paper we review some of the techniques used in this domain, such as the application of Natural Language Processing (BioNLP), the incorporation of lexical-semantic resources, and the application of Named Entity Recognition (BioNER). Finally, we present the evaluation methods adopted to assess the suitability of these techniques for retrieving biomedical resources.

  5. Annotation of Scientific Summaries for Information Retrieval

    CERN Document Server

    Ibekwe-Sanjuan, Fidelia; Eric, Sanjuan; Eric, Charton

    2011-01-01

    We present a methodology combining surface NLP and Machine Learning techniques for ranking abstracts and generating summaries based on annotated corpora. The corpora were annotated with meta-semantic tags indicating the category of information a sentence is bearing (objective, findings, newthing, hypothesis, conclusion, future work, related work). The annotated corpus is fed into an automatic summarizer for query-oriented abstract ranking and multi-abstract summarization. To adapt the summarizer to these two tasks, two novel weighting functions were devised in order to take into account the distribution of the tags in the corpus. Results, although still preliminary, encourage us to pursue this line of work and find better ways of building IR systems that can take into account semantic annotations in a corpus.

  6. Concept Tree Based Information Retrieval Model

    Directory of Open Access Journals (Sweden)

    Chunyan Yuan

    2014-05-01

    Full Text Available This paper proposes a novel concept-based query expansion technique named the Markov concept tree model (MCTM), which discovers term relationships through a concept tree deduced from a term Markov network. We address two important issues for query expansion: the selection and the weighting of expansion search terms. In contrast to earlier methods, queries are expanded by adding those terms that are most similar to the concept of the query, rather than selecting terms that are similar to a single query term. Using a Markov network constructed from the co-occurrence information of the terms in the collection, the model generates a concept tree for each original query term, removes the redundant and irrelevant nodes in the concept tree, and then adjusts the weights of the original query terms and the expansion terms based on a pruning algorithm. We use this model for query expansion and evaluate its effectiveness by examining the accuracy and robustness of the expansion methods. Compared with the baseline model, experiments on a standard dataset reveal that this method can achieve better query quality.

  7. Information Retrieval in Telemedicine: a Comparative Study on Bibliographic Databases

    Science.gov (United States)

    Ahmadi, Maryam; Sarabi, Roghayeh Ershad; Orak, Roohangiz Jamshidi; Bahaadinbeigy, Kambiz

    2015-01-01

    Background and Aims: The first step in each systematic review is selection of the most valid database that can provide the highest number of relevant references. This study was carried out to determine the most suitable database for information retrieval in the telemedicine field. Methods: The CINAHL, PubMed, Web of Science and Scopus databases were searched for telemedicine matched with education, cost benefit and patient satisfaction. After analysis of the obtained results, the accuracy coefficient, sensitivity, uniqueness and overlap of the databases were calculated. Results: The studied databases differed in the number of retrieved articles. PubMed was identified as the most suitable database for retrieving information on the selected topics, with accuracy and sensitivity ratios of 50.7% and 61.4% respectively. The uniqueness percentage of retrieved articles ranged from 38% for PubMed to 3.0% for CINAHL. The highest overlap rate (18.6%) was found between PubMed and Web of Science. Less than 1% of articles were indexed in all searched databases. Conclusion: PubMed is suggested as the most suitable database for starting a search in telemedicine and, after PubMed, Scopus and Web of Science can retrieve about 90% of the relevant articles. PMID:26236086

  8. A Formative Study: Inquiry and Informational Text with Fifth-Grade Bilinguals

    Science.gov (United States)

    Moses, Lindsey

    2014-01-01

    This article includes the findings from a formative experiment implementing inquiry with informational texts in a fifth-grade bilingual classroom after the completion of state assessments. The pedagogical goals were focused on facilitating engaged reading and writing for native Spanish-speakers and building content knowledge and related academic…

  9. Synthesizing Information from Language Samples and Standardized Tests in School-Age Bilingual Assessment

    Science.gov (United States)

    Ebert, Kerry Danahy; Pham, Giang

    2017-01-01

    Purpose: Although language samples and standardized tests are regularly used in assessment, few studies provide clinical guidance on how to synthesize information from these testing tools. This study extends previous work on the relations between tests and language samples to a new population--school-age bilingual speakers with primary language…

  10. "BRAIN": Baruch Retrieval of Automated Information for Negotiations.

    Science.gov (United States)

    Levenstein, Aaron, Ed.

    1981-01-01

    A data processing program that can be used as a research and collective bargaining aid for colleges is briefly described and the fields of the system are outlined. The system, known as BRAIN (Baruch Retrieval of Automated Information for Negotiations), is designed primarily as an instrument for quantitative and qualitative analysis. BRAIN consists…

  11. Development of Information Retrieval Skills for Freshman Medical Students.

    Science.gov (United States)

    Moore, Gerald F.

    1988-01-01

    A study, using a specific patient encounter as the focal point for each student's research, is described that documents the skills of entering freshmen medical students before and immediately after a short course emphasizing information retrieval and at follow-up one year later. (MLW)

  12. Learning to Rank for Information Retrieval from User Interactions

    NARCIS (Netherlands)

    Hofmann, K.; Whiteson, S.; Schuth, A.; de Rijke, M.

    2014-01-01

    In this article we give an overview of our recent work on online learning to rank for information retrieval (IR). This work addresses IR from a reinforcement learning (RL) point of view, with the aim to enable systems that can learn directly from interactions with their users. Learning directly from

  13. Information Storage and Retrieval Scientific Report No. ISR-22.

    Science.gov (United States)

    Salton, Gerard

    The twenty-second in a series, this report describes research in information organization and retrieval conducted by the Department of Computer Science at Cornell University. The report covers work carried out during the period summer 1972 through summer 1974 and is divided into four parts: indexing theory, automatic content analysis, feedback…

  14. Rare disease diagnosis as an information retrieval task

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina

    2011-01-01

    Increasingly more clinicians use web Information Retrieval (IR) systems to assist them in diagnosing difficult medical cases, for instance rare diseases that they may not be familiar with. However, web IR systems are not necessarily optimised for this task. For instance, clinicians’ queries tend...

  15. Cross-Language Information Retrieval: An Analysis of Errors.

    Science.gov (United States)

    Ruiz, Miguel E.; Srinivasan, Padmini

    1998-01-01

    Investigates an automatic method for Cross Language Information Retrieval (CLIR) that utilizes the multilingual Unified Medical Language System (UMLS) Metathesaurus to translate Spanish natural-language queries into English. Results indicate that for Spanish, the UMLS Metathesaurus-based CLIR method is at least equivalent to if not better than…

  16. A Survey of Query Auto Completion in Information Retrieval

    NARCIS (Netherlands)

    Cai, F.; de Rijke, M.

    2016-01-01

    In information retrieval, query auto completion (QAC), also known as type-ahead [Xiao et al., 2013, Cai et al., 2014b] and auto-complete suggestion [Jain and Mishne, 2010], refers to the following functionality: given a prefix consisting of a number of characters entered into a search box, the user i

  17. Bibliometric-enhanced Information Retrieval : 2nd International BIR Workshop

    NARCIS (Netherlands)

    Mayr, Philipp; Frommholz, Ingo; Scharnhorst, Andrea; Mutschke, Peter

    2015-01-01

    This workshop brings together experts of communities which often have been perceived as different ones: bibliometrics / scientometrics / informetrics on the one side and information retrieval on the other. Our motivation as organizers of the workshop started from the observation that main discourses

  18. On Inference Rules of Logic-Based Information Retrieval Systems.

    Science.gov (United States)

    Chen, Patrick Shicheng

    1994-01-01

    Discussion of relevance and the needs of the users in information retrieval focuses on a deductive object-oriented approach and suggests eight inference rules for the deduction. Highlights include characteristics of a deductive object-oriented system, database and data modeling language, implementation, and user interface. (Contains 24…

  19. Creating an Information Retrieval test corpus for Dutch

    NARCIS (Netherlands)

    Hiemstra, D.; Leeuwen, van D.A.; Theune, M.; Nijholt, A.; Hondorp, G.H.W.

    2002-01-01

    This paper describes the first large-scale evaluation of information retrieval systems using Dutch documents and queries. We describe in detail the characteristics of the Dutch test data, which is part of the official CLEF multilingual test corpus, and give an overview of the experimental results of

  20. Vocabulary Mining for Information Retrieval: Rough Sets and Fuzzy Sets.

    Science.gov (United States)

    Srinivasan, Padmini; Ruiz, Miguel E.; Kraft, Donald H.; Chen, Jianhua

    2001-01-01

    Explains vocabulary mining in information retrieval and describes a framework for vocabulary mining that allows the use of rough set-based approximations even when documents and queries are described using weighted, or fuzzy, representations. Examines coordination between multiple vocabulary views and applies the framework to the Unified Medical…

  1. Towards an Intelligent Possibilistic Web Information Retrieval Using Multiagent System

    Science.gov (United States)

    Elayeb, Bilel; Evrard, Fabrice; Zaghdoud, Montaceur; Ahmed, Mohamed Ben

    2009-01-01

    Purpose: The purpose of this paper is to make a scientific contribution to web information retrieval (IR). Design/methodology/approach: A multiagent system for web IR is proposed based on new technologies: Hierarchical Small-Worlds (HSW) and Possibilistic Networks (PN). This system is based on a possibilistic qualitative approach which extends the…

  3. Fault-tolerant symmetrically-private information retrieval

    Science.gov (United States)

    Wang, Tian-Yin; Cai, Xiao-Qiu; Zhang, Rui-Ling

    2016-08-01

    We propose two symmetrically-private information retrieval protocols based on quantum key distribution, which provide a good degree of database and user privacy while being flexible, loss-resistant and, like previous works, easily generalized to a large database. Furthermore, one protocol is robust to collective-dephasing noise, and the other is robust to collective-rotation noise.

  4. A cross-lingual framework for monolingual biomedical information retrieval

    NARCIS (Netherlands)

    Trieschnigg, D.; Hiemstra, D.; Jong, F. de; Kraaij, W.

    2010-01-01

    An important challenge for biomedical information retrieval (IR) is dealing with the complex, inconsistent and ambiguous biomedical terminology. Frequently, a concept-based representation defined in terms of a domain-specific terminological resource is employed to deal with this challenge. In this p

  5. Information Theoretic Similarity Measures for Content Based Image Retrieval.

    Science.gov (United States)

    Zachary, John; Iyengar, S. S.

    2001-01-01

    Content-based image retrieval is based on the idea of extracting visual features from images and using them to index images in a database. Proposes similarity measures and an indexing algorithm based on information theory that permits an image to be represented as a single number. When used in conjunction with vectors, this method displays…

  7. Videodisc: A New Resource for Library Information Storage and Retrieval.

    Science.gov (United States)

    Sonnemann, Sabine S.

    1984-01-01

    Details a National Library of Canada project to produce a videodisc, show its value as a research tool, and demonstrate its viability as an information storage and retrieval medium. An overview and time sequence of the project, disc contents, resource materials for production, production and postproduction techniques, and project results are…

  8. Experiences with automated categorization in e-government information retrieval

    DEFF Research Database (Denmark)

    Jonasen, Tanja Svarre; Lykke, Marianne

    2014-01-01

    High-precision search results are essential for supporting e-government employees’ information tasks. Prior studies have shown that existing features of e-government retrieval systems need improvement in terms of search facilities (e.g., Goh et al. 2008), navigation (e.g., de Jong and Lentz 2006...... documents were retrieved. The findings emphasise the importance of simultaneous search options for e-government IR systems, and reveal that automated categorization is valuable in improving search facilities in e-government....

  9. Semantic Information Retrieval from Distributed Heterogeneous Data Sources

    CERN Document Server

    Munir, K; McClatchey, R; Khan, S; Habib, I

    2007-01-01

    Information retrieval from distributed heterogeneous data sources remains a challenging issue. As the number of data sources increases, more intelligent retrieval techniques, focusing on information content and semantics, are required. Currently ontologies are being widely used for managing semantic knowledge, especially in the field of bioinformatics. In this paper we describe an ontology-assisted system that allows users to query distributed heterogeneous data sources by hiding details like location, information structure, access pattern and semantic structure of the data. Our goal is to provide an integrated view on biomedical information sources for the Health-e-Child project with the aim to overcome the lack of sufficient semantic-based reformulation techniques for querying distributed data sources. In particular, this paper examines the problem of query reformulation across biomedical data sources, based on merged ontologies and the underlying heterogeneous descriptions of the respective data sources.

  10. Building an Automatic Thesaurus to Enhance Information Retrieval

    Directory of Open Access Journals (Sweden)

    Essam Said Hanandeh

    2013-01-01

    Full Text Available One of the major problems of modern Information Retrieval (IR) systems is the vocabulary problem, which concerns the discrepancies between the terms used to describe documents and the terms used by researchers to describe their information need. We have implemented an automatic thesaurus; the system was built using the Vector Space Model (VSM), with cosine similarity as the similarity measure. In this paper we use a selection of 242 Arabic abstracts, all of which involve computer science and information systems. The main goal of this paper is to design and build an automatic Arabic thesaurus using term-term similarity that can be used in any special field or domain to improve the expansion process and to retrieve more relevant documents for the user's query. The study concluded that the similarity thesaurus improved recall and precision compared with a traditional information retrieval system.
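
    As a minimal sketch of the term-term similarity idea in this abstract, the snippet below represents each term as a vector of its frequencies across documents and ranks other terms by cosine similarity. The function names, the whitespace tokenization and the three toy English documents are assumptions for illustration; the study itself worked on Arabic abstracts, and its weighting scheme is not given here.

```python
import math
from collections import Counter


def term_vectors(docs):
    """Map each term to a sparse {doc_index: term_frequency} vector."""
    vectors = {}
    for i, doc in enumerate(docs):
        for term, tf in Counter(doc.lower().split()).items():
            vectors.setdefault(term, {})[i] = tf
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v[k] for k, w in u.items() if k in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def similar_terms(term, vectors, top_k=3):
    """Return up to top_k other terms with non-zero similarity to `term`."""
    target = vectors[term]
    scored = [(other, cosine(target, vec))
              for other, vec in vectors.items() if other != term]
    scored = [(t, s) for t, s in scored if s > 0.0]
    # Highest similarity first; ties broken alphabetically for stable output.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:top_k]


# Toy collection (hypothetical): two database documents, one networking one.
docs = [
    "database index query",
    "database query optimizer",
    "network protocol packet",
]
vectors = term_vectors(docs)
related = similar_terms("database", vectors)
```

    In a query expansion setting, the terms returned for each query word would be candidate additions to the query; how many to add and how to weight them is a design choice this sketch leaves open.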

  11. Translation lexicon acquisition from bilingual dictionaries

    Science.gov (United States)

    Doermann, David S.; Ma, Huanfeng; Karagol-Ayan, Burcu; Oard, Douglas W.

    2001-12-01

    Bilingual dictionaries hold great potential as a source of lexical resources for training automated systems for optical character recognition, machine translation and cross-language information retrieval. In this work we describe a system for extracting term lexicons from printed copies of bilingual dictionaries. We describe our approach to page and definition segmentation and entry parsing. We have used the approach to parse a number of dictionaries and demonstrate the results for retrieval using a French-English Dictionary to generate a translation lexicon and a corpus of English queries applied to French documents to evaluate cross-language IR.

  12. Toward higher effectiveness for recall-oriented information retrieval: A patent retrieval case study

    OpenAIRE

    Magdy, Walid

    2012-01-01

    Research in information retrieval (IR) has largely been directed towards tasks requiring high precision. Recently, other IR applications which can be described as recall-oriented IR tasks have received increased attention in the IR research domain. Prominent among these IR applications are patent search and legal search, where users are typically ready to check hundreds or possibly thousands of documents in order to find any possible relevant document. The main concerns in this kind of applic...

  13. Iterative Filtering of Retrieved Information to Increase Relevance

    Directory of Open Access Journals (Sweden)

    Robert Zeidman

    2007-12-01

    Full Text Available Efforts have been underway for years to find more effective ways to retrieve information from large knowledge domains. This effort is now being driven particularly by the Internet and the vast amount of information that is available to unsophisticated users. In the early days of the Internet, some effort involved allowing users to enter Boolean equations of search terms into search engines, for example, rather than just a list of keywords. More recently, effort has focused on understanding a user's desires from past search histories in order to narrow searches. Also there has been much effort to improve the ranking of results based on some measure of relevancy. This paper discusses using iterative filtering of retrieved information to focus in on useful information. This work was done for finding source code correlation and the author extends his findings to Internet searching and e-commerce. The paper presents specific information about a particular filtering application and then generalizes it to other forms of information retrieval.

  14. Software Helps Retrieve Information Relevant to the User

    Science.gov (United States)

    Mathe, Natalie; Chen, James

    2003-01-01

    The Adaptive Indexing and Retrieval Agent (ARNIE) is a code library, designed to be used by an application program, that assists human users in retrieving desired information in a hypertext setting. Using ARNIE, the program implements a computational model for interactively learning what information each human user considers relevant in context. The model, called a "relevance network," incrementally adapts retrieved information to users' individual profiles on the basis of feedback from the users regarding specific queries. The model also generalizes such knowledge for subsequent derivation of relevant references for similar queries and profiles, thereby assisting users in filtering information by relevance. ARNIE thus enables users to categorize and share information of interest in various contexts. ARNIE encodes the relevance and structure of information in a neural network dynamically configured with a genetic algorithm. ARNIE maintains an internal database, wherein it saves associations, and from which it returns associated items in response to a query. A C++ compiler for a platform on which ARNIE will be utilized is necessary for creating the ARNIE library but is not necessary for the execution of the software.

  15. Retrieving self-vocalized information: An event-related potential (ERP) study on the effect of retrieval orientation.

    Science.gov (United States)

    Rosburg, Timm; Johansson, Mikael; Sprondel, Volker; Mecklinger, Axel

    2014-11-18

    Retrieval orientation refers to a pre-retrieval process and conceptualizes the specific form of processing that is applied to a retrieval cue. In the current event-related potential (ERP) study, we sought to find evidence for an involvement of the auditory cortex when subjects attempt to retrieve vocalized information, and hypothesized that adopting retrieval orientation would be beneficial for retrieval accuracy. During study, participants saw object words that they subsequently vocalized or visually imagined. At test, participants had to identify object names of one study condition as targets and to reject object names of the second condition together with new items. Target category switched after half of the test trials. Behaviorally, participants responded less accurately and more slowly to targets of the vocalize condition than to targets of the imagine condition. ERPs to new items varied at a single left electrode (T7) between 500 and 800ms, indicating a moderate retrieval orientation effect in the subject group as a whole. However, whereas the effect was strongly pronounced in participants with high retrieval accuracy, it was absent in participants with low retrieval accuracy. A current source density (CSD) mapping of the retrieval orientation effect indicated a source over left temporal regions. Independently from retrieval accuracy, the ERP retrieval orientation effect was surprisingly also modulated by test order. Findings are suggestive for an involvement of the auditory cortex in retrieval attempts of vocalized information and confirm that adopting retrieval orientation is potentially beneficial for retrieval accuracy. The effects of test order on retrieval-related processes might reflect a stronger focus on the newness of items in the more difficult test condition when participants started with this condition.

  16. Use of information-retrieval languages in automated retrieval of experimental data from long-term storage

    Science.gov (United States)

    Khovanskiy, Y. D.; Kremneva, N. I.

    1975-01-01

    Problems and methods are discussed of automating information retrieval operations in a data bank used for long term storage and retrieval of data from scientific experiments. Existing information retrieval languages are analyzed along with those being developed. The results of studies discussing the application of the descriptive 'Kristall' language used in the 'ASIOR' automated information retrieval system are presented. The development and use of a specialized language of the classification-descriptive type, using universal decimal classification indices as the main descriptors, is described.

  17. Speech-recognition interfaces for music information retrieval

    Science.gov (United States)

    Goto, Masataka

    2005-09-01

    This paper describes two hands-free music information retrieval (MIR) systems that enable a user to retrieve and play back a musical piece by saying its title or the artist's name. Although various interfaces for MIR have been proposed, speech-recognition interfaces suitable for retrieving musical pieces have not been studied. Our MIR-based jukebox systems employ two different speech-recognition interfaces for MIR, speech completion and speech spotter, which exploit intentionally controlled nonverbal speech information in original ways. The first is a music retrieval system with the speech-completion interface that is suitable for music stores and car-driving situations. When a user only remembers part of the name of a musical piece or an artist and utters only a remembered fragment, the system helps the user recall and enter the name by completing the fragment. The second is a background-music playback system with the speech-spotter interface that can enrich human-human conversation. When a user is talking to another person, the system allows the user to enter voice commands for music playback control by spotting a special voice-command utterance in face-to-face or telephone conversations. Experimental results from use of these systems have demonstrated the effectiveness of the speech-completion and speech-spotter interfaces. (Video clips: http://staff.aist.go.jp/m.goto/MIR/speech-if.html)

  18. Case retrieval in medical databases by fusing heterogeneous information.

    Science.gov (United States)

    Quellec, Gwénolé; Lamard, Mathieu; Cazuguel, Guy; Roux, Christian; Cochener, Béatrice

    2011-01-01

    A novel content-based heterogeneous information retrieval framework, particularly well suited to browse medical databases and support new generation computer aided diagnosis (CADx) systems, is presented in this paper. It was designed to retrieve possibly incomplete documents, consisting of several images and semantic information, from a database; more complex data types such as videos can also be included in the framework. The proposed retrieval method relies on image processing, in order to characterize each individual image in a document by their digital content, and information fusion. Once the available images in a query document are characterized, a degree of match, between the query document and each reference document stored in the database, is defined for each attribute (an image feature or a metadata). A Bayesian network is used to recover missing information if need be. Finally, two novel information fusion methods are proposed to combine these degrees of match, in order to rank the reference documents by decreasing relevance for the query. In the first method, the degrees of match are fused by the Bayesian network itself. In the second method, they are fused by the Dezert-Smarandache theory: the second approach lets us model our confidence in each source of information (i.e., each attribute) and take it into account in the fusion process for a better retrieval performance. The proposed methods were applied to two heterogeneous medical databases, a diabetic retinopathy database and a mammography screening database, for computer aided diagnosis. Precisions at five of 0.809 ± 0.158 and 0.821 ± 0.177, respectively, were obtained for these two databases, which is very promising.

  19. A Survey on Web Text Information Retrieval in Text Mining

    Directory of Open Access Journals (Sweden)

    Tapaswini Nayak

    2015-08-01

    Full Text Available In this study we have analyzed different techniques for information retrieval in text mining. The aim of the study is to identify web text information retrieval techniques. Text mining is much like analytics: it is a process of deriving high-quality information from text. High-quality information is typically derived by devising patterns and trends through means such as statistical pattern learning. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, creation of coarse taxonomies, sentiment analysis, document summarization and entity relation modeling. It is used to mine hidden information from unstructured or semi-structured data. This feature is necessary because a large amount of Web information is semi-structured due to the nested structure of HTML code, is linked and is redundant. Web content categorization with a content database is the most important tool for the efficient use of search engines. A customer requesting information on a particular subject or item would otherwise have to search through hundreds of results to find the information most relevant to the query. This step uses text mining to reduce those hundreds of results. This eliminates the aggravation and improves the navigation of information on the Web.

  20. A Novel Fuzzy Document Based Information Retrieval Model for Forecasting

    Directory of Open Access Journals (Sweden)

    Partha Roy

    2017-06-01

    Full Text Available Information retrieval systems are generally used to find documents that are most appropriate according to some query that comes dynamically from users. In this paper a novel Fuzzy Document based Information Retrieval Model (FDIRM) is proposed for the purpose of Stock Market Index forecasting. The novelty of the proposed approach is a modified tf-idf scoring scheme to predict the future trend of the stock market index. The contribution of this paper has two dimensions: (1) in the proposed system the simple time series is converted to an enriched fuzzy linguistic time series with a unique approach of incorporating market sentiment related information along with the price, and (2) a unique approach is followed while modeling the information retrieval (IR) system, which converts a simple IR system into a forecasting system. From the performance comparison of FDIRM with standard benchmark models it can be affirmed that the proposed model has the potential of becoming a good forecasting model. The stock market data provided by Standard & Poor’s CRISIL NSE Index 50 (CNX NIFTY-50) index of the National Stock Exchange of India (NSE) is used to experiment and validate the proposed model. The authentic data for validation and experimentation is obtained from http://www.nseindia.com which is the official website of NSE. A Java program is under construction to implement the model in real-time with a graphical user interface.
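
    The abstract refers to a modified tf-idf scoring scheme but does not give its form. For orientation only, the following is a minimal sketch of the standard, unmodified tf-idf weighting that such a scheme would presumably start from. The function name `tf_idf`, the whitespace tokenizer and the toy documents are assumptions made for illustration; this is not the authors' FDIRM.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per document using plain tf-idf.

    tf  = term count / document length
    idf = log(number of documents / number of documents containing the term)
    """
    n_docs = len(docs)
    tokenized = [doc.lower().split() for doc in docs]  # naive tokenizer (assumption)

    # Document frequency: in how many documents does each term occur?
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Toy corpus (hypothetical); a term found in every document gets weight 0.
docs = [
    "index rises on strong sentiment",
    "index falls on weak sentiment",
    "index flat",
]
for doc_weights in tf_idf(docs):
    print(doc_weights)
```

    A term that occurs in every document receives an idf of log(1) = 0, which is why rarer terms dominate the ranking; any modification of the scheme, such as the one the abstract mentions, would change how these two factors are computed or combined.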

  1. Topic Map: An Ontology Framework for Information Retrieval

    CERN Document Server

    Kannan, Rajkumar

    2010-01-01

    The basic classification techniques for organizing information are thesauri, taxonomy and faceted classification. The topic map is a relatively new entrant to this information space. The topic map standard describes how complex relationships between abstract concepts and real world resources can be represented using XML syntax. This paper explores how the topic map incorporates the traditional techniques and what its advantages and disadvantages are in several dimensions such as content management, indexing, knowledge representation, constraint specification and query languages in the context of information retrieval. The constructs of topic maps are illustrated with a use-case implemented in XTM.

  2. 8th International Workshop on Information Filtering and Retrieval

    CERN Document Server

    Giuliani, Alessandro; Semeraro, Giovanni

    2017-01-01

    This book focuses on new research challenges in intelligent information filtering and retrieval. It collects invited chapters and extended research contributions from DART 2014 (the 8th International Workshop on Information Filtering and Retrieval), held in Pisa (Italy), on December 10, 2014, and co-hosted with the XIII AI*IA Symposium on Artificial Intelligence. The main focus of DART was to discuss and compare suitable novel solutions based on intelligent techniques and applied to real-world contexts. The chapters of this book present a comprehensive review of related works and the current state of the art. The contributions from both practitioners and researchers have been carefully reviewed by experts in the area, who also gave useful suggestions to improve the quality of the book.

  3. A new approach to query expansion in information retrieval

    Institute of Scientific and Technical Information of China (English)

    Li Weijiang; Zhao Tiejun; Wang Xiangang

    2008-01-01

    To eliminate the mismatch between the words of relevant documents and the user's query, and the serious negative effects it has on the performance of information retrieval, a method of query expansion based on a new term co-occurrence representation was put forward by analyzing the process of producing a query. The expansion terms were selected according to their correlation to the whole query. At the same time, the position information between terms was considered. The experimental results on the Text REtrieval Conference (TREC) data collection show that the proposed method consistently achieves an improvement of 5%~19% over the language modeling method without expansion. Compared to the popular approach of query expansion, pseudo feedback, the precision of the proposed method is competitive.
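
    The idea of selecting expansion terms by their correlation with the whole query, rather than with individual query words, can be sketched as follows. This is only an illustration under simplifying assumptions: co-occurrence is counted at document level, the score is a plain sum over query terms, and positional information is ignored. The function name `expand_query` and the toy corpus are hypothetical and do not reproduce the representation proposed in the paper.

```python
from collections import Counter
from itertools import combinations

def expand_query(query_terms, docs, k=2):
    """Return up to k expansion terms ranked by co-occurrence with the whole query."""
    # Count how many documents each unordered pair of terms shares.
    cooc = Counter()
    vocab = set()
    for doc in docs:
        terms = set(doc.lower().split())
        vocab |= terms
        for a, b in combinations(sorted(terms), 2):
            cooc[(a, b)] += 1

    query = set(query_terms)
    scores = Counter()
    for cand in vocab - query:
        # Score against every query term, so the candidate must fit the query as a whole.
        for q in query:
            scores[cand] += cooc[tuple(sorted((cand, q)))]

    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, score in ranked[:k] if score > 0]

docs = [
    "patent search recall",
    "patent search legal",
    "music retrieval audio",
]
print(expand_query(["patent"], docs, k=1))
print(expand_query(["patent", "search"], docs, k=2))
```

    Summing over all query terms is the simplest way to favor candidates related to the query as a whole; a weighted or position-aware score would be needed to approximate the method the abstract describes.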

  4. Informative Top-k Retrieval for Advanced Skill Management

    Science.gov (United States)

    Colucci, Simona; di Noia, Tommaso; Ragone, Azzurra; Ruta, Michele; Straccia, Umberto; Tinelli, Eufemia

    The paper presents a knowledge-based framework for skills and talent management based on an advanced matchmaking between profiles of candidates and available job positions. Interestingly, the informative content of top-k retrieval is enriched through semantic capabilities. The proposed approach makes it possible to: (1) express a requested profile in terms of both hard constraints and soft ones; (2) provide a ranking function based also on qualitative attributes of a profile; (3) explain the resulting outcomes (given a job request, a motivation for the obtained score of each selected profile is provided). Top-k retrieval allows the most promising candidates to be selected according to an ontology formalizing the domain knowledge. Such knowledge is further exploited to provide a semantic-based explanation of missing or conflicting features in retrieved profiles. They also indicate additional profile characteristics emerging from the retrieval procedure for a further request refinement. A concrete case study followed by an exhaustive experimental campaign is reported to demonstrate the effectiveness of the approach.

  5. Latest Trends in Web Information Retrieval and in SEO Factors

    Directory of Open Access Journals (Sweden)

    Carlos Gonzalo

    2015-07-01

    Full Text Available Latest trends in web information retrieval and in SEO factors are increasingly focused on signals from users, such as the profile of who performs the search and the interpretation of user intent. The objective of search engines is twofold: focusing as much as possible on users while making the composition of the search engine results page (SERP) ever less predictable, and combating spam.

  6. Information overload, retrieval strategies and Internet user empowerment

    OpenAIRE

    Carlson, Christopher N.

    2003-01-01

    Initial user benefits from search engine technology have been critically degraded over time by the rapid increase of Internet pages. Traditional retrieval strategies therefore yield increasingly poor results due to a dramatic increase in ballast in the results. Search engine users thus increasingly experience information overload. Technical approaches to dealing with this problem have caused an initial euphoria, yet have proven ineffective in solving the problem. Enhancement of user empow...

  7. The Use of a Context-Based Information Retrieval Technique

    Science.gov (United States)

    2009-07-01

    Carlson, 2004). However, in order to reduce plagiarism and manipulation, the specific details of these algorithms are closely protected and changed...age, academic background and gender can affect performance using information retrieval systems (Borgman, 1989). These factors can result in...and academic qualifications, a large proportion of the sample were recruited from a third year level or higher. 2.2 Materials 2.2.1 Demographic

  8. Web Structure Mining: Exploring Hyperlinks and Algorithms for Information Retrieval

    Directory of Open Access Journals (Sweden)

    P. R. Kumar

    2010-01-01

    Full Text Available Problem statement: A study on hyperlink analysis and the algorithms used for link analysis in Web information retrieval was done. Approach: This research was initiated because of the dependability of search engines for information retrieval on the web. The aim was to understand web structure mining and determine the importance of hyperlinks in web information retrieval, particularly using the Google search engine. Hyperlink analysis is an important methodology used by the famous search engine Google to rank pages. Results: The different algorithms used for link analysis, namely the PageRank (PR), Weighted PageRank (WPR) and Hyperlink-Induced Topic Search (HITS) algorithms, are discussed and compared. The PageRank algorithm was implemented using a Java program and the convergence of the PageRank values is shown in chart form. Conclusion: This study was done basically to explore the link structure algorithms for ranking and compare those algorithms. Further research in this area will address the problems facing the PageRank algorithm and how to handle those problems.
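
    The abstract reports a Java implementation of PageRank and a chart of its convergence, neither of which is reproduced here. As a rough illustration of what such an implementation computes, the sketch below runs the usual power iteration with a damping factor. The function name `pagerank`, the damping value 0.85, the stopping tolerance and the three-page graph are all assumptions for the example, not details taken from the paper.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=100):
    """Power-iteration PageRank.

    links maps each page to the list of pages it links to.
    Pages with no out-links spread their rank evenly over all pages.
    """
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}

    for _ in range(max_iter):
        # Rank held by dangling pages is redistributed uniformly.
        dangling = sum(rank[node] for node in nodes if not links.get(node))
        new_rank = {
            node: (1.0 - damping) / n + damping * dangling / n
            for node in nodes
        }
        # Each page passes an equal share of its rank along every out-link.
        for src, targets in links.items():
            if targets:
                share = damping * rank[src] / len(targets)
                for target in targets:
                    new_rank[target] += share

        delta = sum(abs(new_rank[node] - rank[node]) for node in nodes)
        rank = new_rank
        if delta < tol:  # values have converged
            break
    return rank

# Hypothetical three-page web: A links to B and C, B links to C, C links to A.
graph = {"A": ["B", "C"], "B": ["C"], "C": ["A"]}
for page, score in pagerank(graph).items():
    print(page, round(score, 4))
```

    Recording `delta` at each iteration would give the kind of convergence curve the abstract mentions; Weighted PageRank and HITS differ in how the shares are distributed and are not covered by this sketch.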

  9. [SIBIL: an information tool for the information retrieval on bioethics].

    Science.gov (United States)

    Dracos, Adriana

    2004-01-01

    The article describes the main features of the website SIBIL (Sistema Informativo per la Bioetica In Linea) implemented within the framework of a research project of the ISS for collecting, indexing and disseminating Italian literature on bioethics since 1995 through an integrated electronic system. The site, addressed to a wide range of people interested at different degrees and levels in bioethics, offers a comprehensive overview of the activities, such as courses and meetings, on the major ethical issues at stake in Italy, as well as a survey of the most important activities both at national and international level. The main feature of SIBIL is a database of a large collection of documents retrieved through sources or exploitation of the most important international electronic databases. A thesaurus of 1,600 terms, available in Italian and English, was created in order to organize documents with standardized criteria currently adopted in the Italian scientific environment. Future trends of the website are also discussed for sharing experiences with other countries and laying the basis for a European portal on bioethics.

  10. Models of a Distributed Information Retrieval System Based on Thesauri with Weights.

    Science.gov (United States)

    Mazur, Zygmunt

    1994-01-01

    Discusses distributed information retrieval systems that take into account the weights of descriptors from thesauri. Topics addressed include a mathematical model for information retrieval subsystems; organization of inverted files; models for the distributed homogeneous information systems; a distributed information retrieval system based on…

  11. Lower-Cost ε-Private Information Retrieval

    Directory of Open Access Journals (Sweden)

    Toledo Raphael R.

    2016-10-01

    Full Text Available Private Information Retrieval (PIR), despite being well studied, is computationally costly and arduous to scale. We explore lower-cost relaxations of information-theoretic PIR, based on dummy queries, sparse vectors, and compositions with an anonymity system. We prove the security of each scheme using a flexible differentially private definition for private queries that can capture notions of imperfect privacy. We show that basic schemes are weak, but some of them can be made arbitrarily safe by composing them with large anonymity systems.
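
    For readers unfamiliar with the baseline being relaxed, the sketch below shows the classic two-server, XOR-based form of information-theoretic PIR: each server sees a uniformly random bit vector, so neither learns the requested index on its own, provided the two servers do not collude. This is the textbook construction, not one of the lower-cost schemes the abstract proposes. The function names and the integer records are assumptions made for the example.

```python
import secrets

def make_queries(n, index):
    """Build two bit vectors that differ only at the wanted index."""
    q1 = [secrets.randbelow(2) for _ in range(n)]  # uniformly random bits
    q2 = list(q1)
    q2[index] ^= 1  # flip the bit at the wanted position
    return q1, q2

def answer(database, query):
    """Server side: XOR together every record selected by the query bits."""
    acc = 0
    for record, bit in zip(database, query):
        if bit:
            acc ^= record
    return acc

def retrieve(database, index):
    """Client side: query two non-colluding replicas and combine the answers."""
    q1, q2 = make_queries(len(database), index)
    # All records selected by both queries cancel out; only the wanted one remains.
    return answer(database, q1) ^ answer(database, q2)

db = [7, 42, 13, 99]  # hypothetical records, replicated on both servers
print([retrieve(db, i) for i in range(len(db))])
```

    Each server touches about half the database per query, which is the kind of cost the abstract describes as hard to scale; sending sparser vectors lowers that cost at the price of weaker privacy.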

  12. Semantic Information Retrieval Using Ontology in University Domain

    Directory of Open Access Journals (Sweden)

    Swathi Rajasurya

    2012-11-01

    Full Text Available Today’s conventional search engines hardly provide the essential content relevant to the user’s search query. This is because the context and semantics of the request made by the user are not analyzed to the full extent. So here the need for a semantic web search arises. SWS is upcoming in the area of web search, which combines Natural Language Processing and Artificial Intelligence. The objective of the work done here is to design, develop and implement a semantic search engine, SIEU (Semantic Information Extraction in University Domain), confined to the university domain. SIEU uses ontology as a knowledge base for the information retrieval process. It is not just a mere keyword search. It is one layer above what Google or any other search engines retrieve by analyzing just the keywords. Here the query is analyzed both syntactically and semantically. The developed system retrieves the web results more relevant to the user query through keyword expansion. The results obtained here will be accurate enough to satisfy the request made by the user. The level of accuracy will be enhanced since the query is analyzed semantically. The system will be of great use to the developers and researchers who work on web. The Google results are re-ranked and optimized for providing the relevant links. For ranking an algorithm has been applied which fetches more apt results for the user query.

  13. Multi-lingual Information Retrieval in Digital Libraries

    Directory of Open Access Journals (Sweden)

    Hsiao-Tieh Pu

    1997-12-01

    Full Text Available With the advancements of the Internet and the Digital Library Initiatives in the U.S.A., research on digital libraries has flourished around the world. Recently the increasing availability of networked access to multilingual text collections within such an environment has drawn much attention to the development of cross-language retrieval technology. This article is used to structure a comprehensive discussion of published research and known commercial practice in the western world on the topic. In addition to the focus on the characteristics of Chinese text collections, some brief observations of the potential for multilingual information retrieval are also discussed in detail. [Article content in Chinese]

  14. Estimating Missing Features to Improve Multimedia Information Retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Bagherjeiran, A; Love, N S; Kamath, C

    2006-09-28

    Retrieval in a multimedia database usually involves combining information from different modalities of data, such as text and images. However, all modalities of the data may not be available to form the query. The retrieval results from such a partial query are often less than satisfactory. In this paper, we present an approach to complete a partial query by estimating the missing features in the query. Our experiments with a database of images and their associated captions show that, with an initial text-only query, our completion method has similar performance to a full query with both image and text features. In addition, when we use relevance feedback, our approach outperforms the results obtained using a full query.

  15. An integrated information retrieval and document management system

    Science.gov (United States)

    Coles, L. Stephen; Alvarez, J. Fernando; Chen, James; Chen, William; Cheung, Lai-Mei; Clancy, Susan; Wong, Alexis

    1993-01-01

    This paper describes the requirements and prototype development for an intelligent document management and information retrieval system that will be capable of handling millions of pages of text or other data. Technologies for scanning, Optical Character Recognition (OCR), magneto-optical storage, and multiplatform retrieval using a Standard Query Language (SQL) will be discussed. The semantic ambiguity inherent in the English language is somewhat compensated-for through the use of coefficients or weighting factors for partial synonyms. Such coefficients are used both for defining structured query trees for routine queries and for establishing long-term interest profiles that can be used on a regular basis to alert individual users to the presence of relevant documents that may have just arrived from an external source, such as a news wire service. Although this attempt at evidential reasoning is limited in comparison with the latest developments in AI Expert Systems technology, it has the advantage of being commercially available.

  16. Retrieving Nuclear Information from Protons Propagating through A Thick Target

    CERN Document Server

    Giraud, B G

    2007-01-01

    The multiple scattering of high-energy particles in a thick target is formulated in an impact parameter representation. A formalism similar but not identical to that of Moliere is obtained. We show that calculations of particle beam broadening due to multiple Coulomb scattering alone can be given in closed form. The focus of this study is on whether or not the broadening of the Coulomb angular distribution prevents the retrieval of nuclear-interaction information from measuring the angular distributions of charged particles scattered from a thick target. For this purpose, we study multiple scatterings with both the nuclear and Coulomb interactions included and we do not make a small-angle expansion. Conditions for retrieving nuclear information from high-energy protons propagating through a block of material are obtained.

  17. INFORMATION RETRIEVAL SYSTEM USING MULTIWORDS EXPRESSIONS (MWE) AS DESCRIPTORS

    Directory of Open Access Journals (Sweden)

    Edson Marchetti da Silva

    2012-08-01

    Full Text Available This paper aims to propose an alternative method for retrieving documents using Multiwords Expressions (MWE) extracted from a document base to be used as descriptors in the search of an Information Retrieval System (IRS). In this sense, unlike methods that consider the text as a set of words (a bag of words), we propose a method that takes into account the characteristics of the physical structure of the document in the extraction process of MWE. This set of terms, pre-processed using an exhaustive algorithmic technique proposed by the authors, is compared with the results obtained for thirteen different statistical measures of association generated by the software Ngram Statistics Package (NSP). To perform this experiment, a corpus of documents in digital format was set up.

  18. Tetrahydrocannabinol (THC) impairs encoding but not retrieval of verbal information.

    Science.gov (United States)

    Ranganathan, Mohini; Radhakrishnan, Rajiv; Addy, Peter H; Schnakenberg-Martin, Ashley M; Williams, Ashley H; Carbuto, Michelle; Elander, Jacqueline; Pittman, Brian; Andrew Sewell, R; Skosnik, Patrick D; D'Souza, Deepak Cyril

    2017-10-03

    Cannabis and agonists of the brain cannabinoid receptor (CB1R) produce acute memory impairments in humans. However, the extent to which cannabinoids impair the component processes of encoding and retrieval has not been established in humans. The objective of this analysis was to determine whether the administration of Δ(9)-Tetrahydrocannabinol (THC), the principal psychoactive constituent of cannabis, impairs encoding and/or retrieval of verbal information. Healthy subjects were recruited from the community. Subjects were administered the Rey-Auditory Verbal Learning Test (RAVLT) either before administration of THC (experiment #1) (n=38) or while under the influence of THC (experiment #2) (n=57). Immediate and delayed recall on the RAVLT was compared. Subjects received intravenous THC, in a placebo-controlled, double-blind, randomized manner at doses known to produce behavioral and subjective effects consistent with cannabis intoxication. Total immediate recall, short delayed recall, and long delayed recall were reduced in a statistically significant manner only when the RAVLT was administered to subjects while they were under the influence of THC (experiment #2) and not when the RAVLT was administered prior. THC acutely interferes with encoding of verbal memory without interfering with retrieval. These data suggest that learning information prior to the use of cannabis or cannabinoids is not likely to disrupt recall of that information. Future studies will be necessary to determine whether THC impairs encoding of non-verbal information, to what extent THC impairs memory consolidation, and the role of other cannabinoids in the memory-impairing effects of cannabis. Cannabinoids, Neural Synchrony, and Information Processing (THC-Gamma) http://clinicaltrials.gov/ct2/show/study/NCT00708994 NCT00708994 Pharmacogenetics of Cannabinoid Response http://clinicaltrials.gov/ct2/show/NCT00678730 NCT00678730. Copyright © 2017. Published by Elsevier Inc.

  19. Controlled Retrieval of Specific Context Information in Children and Adults.

    Science.gov (United States)

    Lorsbach, Thomas C; Friehe, Mary J; Teten, Amy Fair; Reimer, Jason F; Armendarez, Joseph J

    2015-01-01

    This study adapted a procedure used by Luo and Craik (2009) to examine whether developmental differences exist in the ability to use controlled retrieval processes to access the contextual details of memory representations. Participants from 3 age groups (mean ages 9, 12, and 25 years) were presented with words in 3 study contexts: with a black-and-white picture, with a color picture, or alone without a picture. Six recognition tests were then presented that varied in the demands (high or low) placed on the retrieval of specific contextual information. Each test consisted of a mixture of words that were old targets from 1 study context, distractors (i.e., previously studied words from a different context), and completely new words. A high-specificity and a low-specificity test list was paired with each test question, with high and low specificity being determined by the nature of the distractors used in a test list. High-specificity tests contained words that were studied in similar contexts: old targets (e.g., words studied with black-and-white pictures) and distractors (e.g., words studied with color pictures). In contrast, low-specificity tests contained words that were studied in dissimilar contexts: old targets (e.g., words studied with black-and-white pictures) and distractors (e.g., words previously studied without a picture). Relative to low-specificity tests, the retrieval conditions of high-specificity tests were assumed to place greater demands on the controlled access of specific contextual information. Analysis of recollection scores revealed that age differences were present on high- but not low-specificity tests, with the performance of 9-year-olds disproportionately affected by the retrieval demands of high-specificity tests.

  20. Diffused holographic information storage and retrieval using photorefractive optical materials

    Science.gov (United States)

    McMillen, Deanna Kay

    Holography offers a tremendous opportunity for dense information storage, theoretically one bit per cubic wavelength of material volume, with rapid retrieval, of up to thousands of pages of information simultaneously. However, many factors prevent the theoretical storage limit from being reached, including dynamic range problems and imperfections in recording materials. This research explores new ways of moving closer to practical holographic information storage and retrieval by altering the recording materials, in this case, photorefractive crystals, and by increasing the current storage capacity while improving the information retrieved. As an experimental example of the techniques developed, the information retrieved is the correlation peak from an optical recognition architecture, but the materials and methods developed are applicable to many other holographic information storage systems. Optical correlators can potentially solve any signal or image recognition problem. Military surveillance, fingerprint identification for law enforcement or employee identification, and video games are but a few examples of applications. A major obstacle keeping optical correlators from being universally accepted is the lack of a high quality, thick (high capacity) holographic recording material that operates with red or infrared wavelengths which are available from inexpensive diode lasers. This research addresses the problems from two positions: find a better material for use with diode lasers, and reduce the requirements placed on the material while maintaining an efficient and effective system. This research found that the solutions are new dopants introduced into photorefractive lithium niobate to improve wavelength sensitivities and the use of a novel inexpensive diffuser that reduces the dynamic range and optical element quality requirements (which reduces the cost) while improving performance. A uniquely doped set of 12 lithium niobate crystals was specified and

  1. Web multimedia information retrieval using improved Bayesian algorithm

    Institute of Scientific and Technical Information of China (English)

    余铁军; 陈纯; 余铁民; 林怀忠

    2003-01-01

    The main thrust of this paper is application of a novel data mining approach on the log of user's feedback to improve web multimedia information retrieval performance. A user space model was constructed based on data mining, and then integrated into the original information space model to improve the accuracy of the new information space model. It can remove clutter and irrelevant text information and help to eliminate mismatch between the page author's expression and the user's understanding and expectation. User space model was also utilized to discover the relationship between high-level and low-level features for assigning weight. The authors proposed improved Bayesian algorithm for data mining. Experiment proved that the authors' proposed algorithm was efficient.

  2. Web multimedia information retrieval using improved Bayesian algorithm

    Institute of Scientific and Technical Information of China (English)

    余轶军; 陈纯; 余轶民; 林怀忠

    2003-01-01

    The main thrust of this paper is application of a novel data mining approach on the log of user's feedback to improve web multimedia information retrieval performance. A user space model was constructed based on data mining, and then integrated into the original information space model to improve the accuracy of the new information space model. It can remove clutter and irrelevant text information and help to eliminate mismatch between the page author's expression and the user's understanding and expectation. User space model was also utilized to discover the relationship between high-level and low-level features for assigning weight. The authors proposed improved Bayesian algorithm for data mining. Experiment proved that the authors' proposed algorithm was efficient.

  3. Information Content of Aerosol Retrievals in the Sunglint Region

    Science.gov (United States)

    Ottaviani, M.; Knobelspiesse, K.; Cairns, B.; Mishchenko, M.

    2013-01-01

    We exploit quantitative metrics to investigate the information content in retrievals of atmospheric aerosol parameters (with a focus on single-scattering albedo), contained in multi-angle and multi-spectral measurements with sufficient dynamical range in the sunglint region. The simulations are performed for two classes of maritime aerosols with optical and microphysical properties compiled from measurements of the Aerosol Robotic Network. The information content is assessed using the inverse formalism and is compared to that deriving from observations not affected by sunglint. We find that there indeed is additional information in measurements containing sunglint, not just for single-scattering albedo, but also for aerosol optical thickness and the complex refractive index of the fine aerosol size mode, although the amount of additional information varies with aerosol type.

  4. ERISTAR: Earth Resources Information Storage, Transformation, Analysis, and Retrieval

    Science.gov (United States)

    1972-01-01

    The National Aeronautics and Space Administration (NASA) and the American Society for Engineering Education (ASEE) have sponsored faculty fellowship programs in systems engineering design for the past several years. During the summer of 1972 four such programs were conducted by NASA, with Auburn University cooperating with Marshall Space Flight Center (MSFC). The subject for the Auburn-MSFC design group was ERISTAR, an acronym for Earth Resources Information Storage, Transformation, Analysis and Retrieval, which represents an earth resources information management network of state information centers administered by the respective states and linked to federally administered regional centers and a national center. The considerations for serving the users and the considerations that must be given to processing data from a variety of sources are described. The combination of these elements into a national network is discussed and an implementation plan is proposed for a prototype state information center. The compatibility of the proposed plan with the Department of Interior plan, RALI, is indicated.

  5. 15 CFR 950.9 - Computerized Environmental Data and Information Retrieval Service.

    Science.gov (United States)

    2010-01-01

    ... Information Retrieval Service. 950.9 Section 950.9 Commerce and Foreign Trade Regulations Relating to Commerce... Computerized Environmental Data and Information Retrieval Service. The Environmental Data Index (ENDEX... computerized, information retrieval service provides a parallel subject-author-abstract referral service....

  6. 42 CFR 433.116 - FFP for operation of mechanized claims processing and information retrieval systems.

    Science.gov (United States)

    2010-10-01

    ... FISCAL ADMINISTRATION Mechanized Claims Processing and Information Retrieval Systems § 433.116 FFP for operation of mechanized claims processing and information retrieval systems. (a) Subject to 42 CFR 433.113(c... and information retrieval systems. 433.116 Section 433.116 Public Health CENTERS FOR...

  7. Cross Language Information Retrieval Model for Discovering WSDL Documents Using Arabic Language Query

    Directory of Open Access Journals (Sweden)

    Prof. Dr. Torkey I.Sultan

    2013-09-01

    Full Text Available Web service discovery is the process of finding a suitable Web service for a given user's query by analyzing the Web service's WSDL content and finding the best match for the query. The service query must be written in the same language as the WSDL, for example English. Cross-Language Information Retrieval (CLIR) techniques are not used in the Web service discovery process. The absence of CLIR methods limits the search language to English keywords only, which raises the following question: "How do people who do not know English find a Web service?" This paper proposes applying CLIR techniques and IR methods to support a bilingual Web service discovery process; the second language proposed here is Arabic. Text mining techniques were applied to the WSDL content and the user's query to prepare them for the CLIR methods. The proposed model was tested on a curated catalogue of Life Science Web Services (http://www.biocatalogue.org/) and solved the research problem with 99.87% accuracy and 95.06 precision.

  8. Human Information Behaviour and Design, Development and Evaluation of Information Retrieval Systems

    Science.gov (United States)

    Keshavarz, Hamid

    2008-01-01

    Purpose: The purpose of this paper is to introduce the concept of human information behaviour and to explore the relationship between information behaviour of users and the existing approaches dominating design and evaluation of information retrieval (IR) systems and also to describe briefly new design and evaluation methods in which extensive…

  9. How Information Communication Technology (ICT) can Support College Language Teachers in the Bilingual Context

    Institute of Scientific and Technical Information of China (English)

    曾葳

    2015-01-01

    With the rapid development of modern information technology, globalization is becoming an obvious feature of educational settings. Mastering the necessary bilingual competencies is therefore essential for educators. Language teachers in ethnically plural countries, for instance China and the United States, face various difficulties when teaching bilingual classes. Advanced technology is gradually being applied to language teaching and offers a series of advantages for improving its quality. The essay first analyses the barriers that exist in language classes at Chinese universities. It then systematically describes how ICT can help language teachers overcome teaching difficulties and presents diverse innovative technological tools for language teaching.

  10. Survey the role of emotions in information retrieval

    Directory of Open Access Journals (Sweden)

    Hassan Behzadi

    2016-03-01

    Full Text Available The present study was conducted to identify users' emotions in the various stages of information retrieval, based on a model of information retrieval on the web. Methodologically, the study is experimental and applied in type. The population comprised all MA students in the various humanities branches at Imam Reza International University. The sample consisted of 30 participants; the sample size was determined through stratified random sampling using G*Power software. Data were collected using a questionnaire on demographics and prior experience of internet use, a post-search questionnaire, and recorded videos of users' faces. The findings demonstrated that: (1) during the initial stages of searching, apprehension, and during the link-tracking stage negative emotions in general (49/3 percent overall), were more frequent than the other emotions, whereas in the browsing and differentiation stages happiness was more frequent than the other emotions. (2) These variances resulted in significant relations among the users' different emotions throughout the four stages of information retrieval. (3) In simple searches, the respondents displayed happiness most frequently and aversion least frequently. In complicated searches, on the other hand, apprehension and aversion were the most and the least frequently cited emotions, respectively. Overall, negative emotions were reported more frequently in complicated searches than in simple searches. This demonstrated that any change in the difficulty level of a search task causes users to exhibit different types of emotions.

  11. Semantic Annotation Framework For Intelligent Information Retrieval Using KIM Architecture

    Directory of Open Access Journals (Sweden)

    Sanjay Kumar Malik

    2010-11-01

    Full Text Available Due to the explosion of information and knowledge on the web and the wide use of search engines to find desired information, the role of knowledge management (KM) is becoming more significant in organizations. Knowledge management in an organization is used to create, capture, store, share, retrieve and manage information efficiently. The semantic web, an intelligent and meaningful web, tends to provide a promising platform for knowledge management systems and vice versa, since each has the potential to give the other real substance for machine-understandable web resources, which in turn will lead to intelligent, meaningful and efficient information retrieval on the web. Today, the challenge for the web community is to integrate the distributed heterogeneous resources on the web with the objective of an intelligent web environment focusing on data semantics and user requirements. Semantic annotation (SA), which is about assigning to the entities in a text links to their semantic descriptions, is widely used. Various tools, such as KIM and Amaya, may be used for semantic annotation. In this paper, we introduce semantic annotation as one of the key technologies in an intelligent web environment, then review and discuss knowledge management and semantic annotation. A knowledge management framework and a framework for semantic annotation and semantic search with a knowledge base (GATE) and ontology are presented. The KIM annotation platform architecture, including the KIM Ontology (KIMO), the KIM Knowledge Base and the KIM front ends, is then highlighted. Finally, intelligent pattern search and the related GATE framework are illustrated with a KIM annotation example, towards intelligent information retrieval.

  12. Rapid automatic keyword extraction for information retrieval and analysis

    Science.gov (United States)

    Rose, Stuart J [Richland, WA]; Cowley, Wendy E [Richland, WA]; Crow, Vernon L [Richland, WA]; Cramer, Nicholas O [Richland, WA]

    2012-03-06

    Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.
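
    The steps described in this record can be sketched in a few lines of Python. This is a minimal illustration, not the patented implementation: the stop-word list, the delimiter set, the function names, and the choice of degree divided by frequency as the word-score function are all assumptions made for the example (the abstract only says the score is a function of co-occurrence degree, co-occurrence frequency, or both).

```python
import re
from collections import defaultdict

# Assumed, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"a", "an", "and", "for", "in", "is", "of", "on", "or", "the", "to"}


def candidate_keywords(text):
    """Split text at delimiters and stop words into candidate keyword phrases."""
    candidates = []
    for fragment in re.split(r"[.,;:!?()\n]+", text.lower()):
        phrase = []
        for word in re.findall(r"[a-z0-9][a-z0-9'-]*", fragment):
            if word in STOP_WORDS:
                if phrase:
                    candidates.append(tuple(phrase))
                phrase = []
            else:
                phrase.append(word)
        if phrase:
            candidates.append(tuple(phrase))
    return candidates


def word_scores(candidates):
    """Score each word as co-occurrence degree / frequency (one assumed choice)."""
    freq = defaultdict(int)
    degree = defaultdict(int)
    for phrase in candidates:
        for word in phrase:
            freq[word] += 1
            degree[word] += len(phrase)  # counts the word itself plus its neighbours
    return {word: degree[word] / freq[word] for word in freq}


def extract_keywords(text, top_fraction=1 / 3):
    """Return the highest-scoring portion of the candidates with their scores."""
    candidates = candidate_keywords(text)
    scores = word_scores(candidates)
    keyword_scores = {p: sum(scores[w] for w in p) for p in set(candidates)}
    ranked = sorted(keyword_scores.items(), key=lambda kv: (-kv[1], kv[0]))
    keep = max(1, int(len(ranked) * top_fraction))
    return [(" ".join(phrase), score) for phrase, score in ranked[:keep]]
```

    Calling extract_keywords on a title or abstract returns a list of (phrase, score) pairs; the top_fraction parameter is a hypothetical knob standing in for "a portion of the candidate keywords".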

  13. Web search: how the Web has changed information retrieval

    Directory of Open Access Journals (Sweden)

    Brooks Terrence A.

    2003-01-01

    Full Text Available Topical metadata are simultaneously hailed as building blocks of the semantic Web and derogated as spam. The significance of the metadata controversy depends on the technological appropriateness of adding them to Web pages. A survey of Web technology suggests that Web pages are both transient and volatile: poor hosts of topical metadata. A more supportive environment exists in the closed Web. The vast majority of Web pages, however, exist in the open Web, an environment that challenges the application of legacy information retrieval concepts and methods.

  14. Interdisciplinarity and Computer Music Modeling and Information Retrieval

    DEFF Research Database (Denmark)

    Grund, Cynthia M.

    2006-01-01

    Abstract This paper takes a look at computer music modeling and information retrieval (CMMIR) from the point of view of the humanities with emphasis upon areas relevant to the philosophy of music. The desire for more interdisciplinary research involving CMMIR and the humanities is expressed...... and some specific positive experiences are cited which have given this author reason to believe that such cooperation is beneficial for both sides. A short list of some contemporary areas of interest in the philosophy of music is provided, and it is suggested that these could be interesting areas...

  15. Computer programs: Information retrieval and data analysis, a compilation

    Science.gov (United States)

    1972-01-01

    The items presented in this compilation are divided into two sections. Section one treats of computer usage devoted to the retrieval of information that affords the user rapid entry into voluminous collections of data on a selective basis. Section two is a more generalized collection of computer options for the user who needs to take such data and reduce it to an analytical study within a specific discipline. These programs, routines, and subroutines should prove useful to users who do not have access to more sophisticated and expensive computer software.

  16. AN EFFECTIVE INFORMATION RETRIEVAL SYSTEM USING KEYWORD SEARCH TECHNIQUE

    Directory of Open Access Journals (Sweden)

    Dhananjay A. Gholap

    2015-10-01

    Full Text Available Keyword search is a technique used for retrieving data or information. In information retrieval, keyword search is a type of search that looks for matching documents containing one or more keywords specified by a user. Keyword search over relational databases has become an interesting research area within IR and relational database systems. Inferring and analyzing user search goals can be very valuable for improving search engine relevance and user experience. When a user submits a query on the internet, the search engine returns a large number of results related to that query. These results may depend on metadata or on full-text indexing, so the user needs to spend a lot of time finding the information of interest. This project therefore infers user search goals by analyzing search engine query logs. The system uses a framework that discovers the different user search goals for a query by clustering the proposed feedback sessions.
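
    The basic notion of keyword search used in this record (return documents that contain one or more of the user's keywords) can be illustrated with a small inverted index. This is a generic sketch under assumed names (build_index, keyword_search) and an assumed ranking rule (number of distinct keywords matched); it does not reproduce the paper's system or its clustering of feedback sessions.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercase token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for raw in text.lower().split():
            token = raw.strip(".,;:!?()")
            if token:
                index[token].add(doc_id)
    return index


def keyword_search(index, query):
    """Return ids of documents matching at least one keyword, best match first."""
    hits = defaultdict(int)
    for keyword in set(query.lower().split()):
        for doc_id in index.get(keyword, ()):
            hits[doc_id] += 1  # one point per distinct keyword matched
    return sorted(hits, key=lambda d: (-hits[d], d))
```

    A document that matches more of the query's keywords is listed earlier; ties are broken by document id so the output order is deterministic.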

  17. Information retrieval patterns and needs among practicing general surgeons: a statewide experience.

    OpenAIRE

    Shelstad, K R; Clevenger, F W

    1996-01-01

    Information retrieval has progressed from a reliance on traditional print sources to the modern era of computer databases and online networks. Surgeons, many from remote areas not served by professional medical libraries, must develop and maintain skills in information retrieval and management in both electronic and standard formats. One hundred thirty-three New Mexico general surgeons were surveyed to identify their information-seeking patterns in five areas: retrieval purposes, retrieval so...

  18. JANE, A new information retrieval system for the Radiation Shielding Information Center

    Energy Technology Data Exchange (ETDEWEB)

    Trubey, D.K.

    1991-05-01

    A new information storage and retrieval system has been developed for the Radiation Shielding Information Center (RSIC) at Oak Ridge National Laboratory to replace mainframe systems that have become obsolete. The database contains citations and abstracts of literature which were selected by RSIC analysts and indexed with terms from a controlled vocabulary. The database, begun in 1963, has been maintained continuously since that time. The new system, called JANE, incorporates automatic indexing techniques and on-line retrieval using the RSIC Data General Eclipse MV/4000 minicomputer. Automatic indexing and retrieval techniques based on fuzzy-set theory allow the presentation of results in order of Retrieval Status Value. The fuzzy-set membership function depends on term frequency in the titles and abstracts and on Term Discrimination Values which indicate the resolving power of the individual terms. These values are determined by the Cover Coefficient method. The use of a commercial database to store and retrieve the indexing information permits rapid retrieval of the stored documents. Comparisons of the new and presently-used systems for actual searches of the literature indicate that it is practical to replace the mainframe systems with a minicomputer system similar to the present version of JANE. 18 refs., 10 figs.

  19. Web-based software system for processing bilingual digital resources

    Directory of Open Access Journals (Sweden)

    Ralitsa Dutsova

    2014-09-01

    Full Text Available The article describes a software management system developed at the Institute of Mathematics and Informatics, BAS, for the creation, storing and processing of digital language resources in Bulgarian. Independent components of the system are intended for the creation and management of bilingual dictionaries, for information retrieval and data mining from a bilingual dictionary, and for the presentation of aligned corpora. A module which connects these components is also being developed. The system, implemented as a web-application, contains tools for compilation, editing and search within all components.

  20. Word reading and translation in bilinguals: The impact of formal and informal translation expertise

    Directory of Open Access Journals (Sweden)

    Adolfo M. García

    2014-11-01

    Full Text Available Studies on bilingual word reading and translation have examined the effects of lexical variables (e.g., concreteness, cognate status) by comparing groups of non-translators with varying levels of L2 proficiency. However, little attention has been paid to another relevant factor: translation expertise (TI). To explore this issue, we administered word reading and translation tasks to two groups of non-translators possessing different levels of informal TI (Experiment 1), and to three groups of bilinguals possessing different levels of translation training (Experiment 2). Reaction-time recordings showed that in all groups reading was faster than translation and unaffected by concreteness and cognate effects. Conversely, in both experiments, all groups translated concrete and cognate words faster than abstract and non-cognate words, respectively. Notably, an advantage of backward over forward translation was observed only for low-proficiency non-translators (in Experiment 1). Also, in Experiment 2, the modifications induced by translation expertise were more marked in the early than in the late stages of training and practice. The results suggest that TI contributes to modulating inter-equivalent connections in bilingual memory.

  1. Mathematical, Logical, and Formal Methods in Information Retrieval: An Introduction to the Special Issue.

    Science.gov (United States)

    Crestani, Fabio; Dominich, Sandor; Lalmas, Mounia; van Rijsbergen, Cornelis Joost

    2003-01-01

    Discusses the importance of research on the use of mathematical, logical, and formal methods in information retrieval to help enhance retrieval effectiveness and clarify underlying concepts of information retrieval. Highlights include logic; probability; spaces; and future research needs. (Author/LRW)

  2. Bilingual Cancer Information: Access Is the First Line of Defense

    Science.gov (United States)

    Boudreault, Patrick; Palmer, Christina

    2015-01-01

    Information about cancer, the disease that kills more Americans than any other except heart disease, is essential. In some ways, information is our first line of defense. It allows us to identify individual risk factors, to note when a problem means we should see a professional, and to avoid activities that might put us at risk. However,…

  3. Non-Compositional Term Dependence for Information Retrieval

    DEFF Research Database (Denmark)

    Lioma, Christina; Simonsen, Jakob Grue; Larsen, Birger

    2015-01-01

    We present two novel models of document coherence and their application to information retrieval (IR). Both models approximate document coherence using discourse entities, e.g. the subject or object of a sentence. Our first model views text as a Markov process generating sequences of discourse...... entities (entity n-grams); we use the entropy of these entity n-grams to approximate the rate at which new information appears in text, reasoning that as more new words appear, the topic increasingly drifts and text coherence decreases. Our second model extends the work of Guinaudeau & Strube [28......] that represents text as a graph of discourse entities, linked by different relations, such as their distance or adjacency in text. We use several graph topology metrics to approximate different aspects of the discourse flow that can indicate coherence, such as the average clustering or betweenness of discourse...
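
    The first model in this record rests on the entropy of entity n-grams. A minimal sketch of that quantity follows, assuming the discourse entities have already been extracted into a list (the extraction step and the authors' exact estimator are not given in the abstract); the function names are hypothetical.

```python
import math
from collections import Counter


def entity_ngrams(entities, n=2):
    """Return the consecutive n-grams of a sequence of discourse entities."""
    return [tuple(entities[i:i + n]) for i in range(len(entities) - n + 1)]


def ngram_entropy(entities, n=2):
    """Shannon entropy (in bits) of the empirical entity n-gram distribution."""
    grams = entity_ngrams(entities, n)
    if not grams:
        return 0.0
    total = len(grams)
    counts = Counter(grams)
    # sum of p * log2(1/p) over the observed n-grams
    return sum((c / total) * math.log2(total / c) for c in counts.values())
```

    Under this sketch a text that keeps returning to the same entity pairs has low entropy, while a text whose entity pairs never repeat has the maximum entropy for its length, which matches the intuition stated in the abstract that more new material means more topic drift.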

  4. An introduction to the Marshall information retrieval and display system

    Science.gov (United States)

    1974-01-01

    An on-line terminal oriented data storage and retrieval system is presented which allows a user to extract and process information from stored data bases. The use of on-line terminals for extracting and displaying data from the data bases provides a fast and responsive method for obtaining needed information. The system consists of general purpose computer programs that provide the overall capabilities of the total system. The system can process any number of data files via a Dictionary (one for each file) which describes the data format to the system. New files may be added to the system at any time, and reprogramming is not required. Illustrations of the system are shown, and sample inquiries and responses are given.

  5. Information retrieval pathways for health information exchange in multiple care settings

    DEFF Research Database (Denmark)

    Kierkegaard, Patrick; Kaushal, Rainu; Vest, Joshua R.

    2014-01-01

    Objectives To determine which health information exchange (HIE) technologies and information retrieval pathways healthcare professionals relied on to meet their information needs in the context of laboratory test results, radiological images and reports, and medication histories. Study Design...... Primary data was collected over a 2-month period across 3 emergency departments, 7 primary care practices, and 2 public health clinics in New York state. Methods Qualitative research methods were used to collect and analyze data from semi-structured interviews and participant observation. Results...... The study reveals that healthcare professionals used a complex combination of information retrieval pathways for HIE to obtain clinical information from external organizations. The choice for each approach was setting- and information-specific, but was also highly dynamic across users and their information...

  6. Verb and Noun Word Retrieval in Bilingual Aphasia: A Case Study of Language- and Modality-Specific Levels of Breakdown

    Science.gov (United States)

    Kambanaros, Maria

    2016-01-01

    This study reports on the pattern of performance on spoken and written naming, spelling to dictation, and oral reading of single verbs and nouns in a bilingual speaker with aphasia in two first languages that differ in morphological complexity, orthographic transparency, and script: Greek (L1a) and English (L1b). The results reveal no verb/noun…

  7. An integrated Korean biodiversity and genetic information retrieval system.

    Science.gov (United States)

    Lim, Jeongheui; Bhak, Jong; Oh, Hee-Mock; Kim, Chang-Bae; Park, Yong-Ha; Paek, Woon Kee

    2008-12-12

    On-line biodiversity information databases are growing quickly and being integrated into general bioinformatics systems due to the advances of fast gene sequencing technologies and the Internet. These can reduce the cost and effort of performing biodiversity surveys and genetic searches, which allows scientists to spend more time researching and less time collecting and maintaining data. This will cause an increased rate of knowledge build-up and improve conservation. The biodiversity databases in Korea have been scattered among several institutes and local natural history museums with incompatible data types. Therefore, a comprehensive database and a nationwide web portal for biodiversity information is necessary in order to integrate diverse information resources, including molecular and genomic databases. The Korean Natural History Research Information System (NARIS) was built and serviced as the central biodiversity information system to collect and integrate the biodiversity data of various institutes and natural history museums in Korea. This database aims to be an integrated resource that contains additional biological information, such as genome sequences and molecular level diversity. Currently, twelve institutes and museums in Korea are integrated by the DiGIR (Distributed Generic Information Retrieval) protocol, with Darwin Core 2.0 format as its metadata standard for data exchange. Data quality control and statistical analysis functions have been implemented. In particular, integrating molecular and genetic information from the National Center for Biotechnology Information (NCBI) databases with NARIS was recently accomplished. NARIS can also be extended to accommodate other institutes abroad, and the whole system can be exported to establish local biodiversity management servers. A Korean data portal, NARIS, has been developed to efficiently manage and utilize biodiversity data, which includes genetic resources. NARIS aims to be integral in maximizing

  8. Editorial: Hyperlinks and Their Roles in Web Information Retrieval

    Directory of Open Access Journals (Sweden)

    Alireza Noruzi

    2005-10-01

    Full Text Available A web page generally includes elements such as text, hyperlinks, images, etc. A hyperlink represents a relationship between two web pages or between sections of the same page. Understanding the hyperlink structure is fundamental to understanding the Web connectivity structure, because hyperlinks have been used in web indexing and information retrieval, as well as page ranking. If the Web were a car, hyperlinks would be the engine, because without them, we are not going anywhere. It can be concluded that search engines consider any words used by other sites to describe a site to be particularly relevant, even if the keywords are not used in the backlinked site/page (the linked target destination). In other words, foreign-language text links give the linked sites a chance to be retrieved as relevant results in response to a search query. Many search engines judge the linking page partly based on the quality of the linked page, and if many sites backlinking to a site use keywords in their link text, search engines will raise its ranking for those keywords. Ultimately, backlinks from popular websites with a higher ranking have a higher weight than backlinks from smaller unknown websites.
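
    The point about link text can be illustrated with a toy anchor-text index: a page becomes retrievable for words that appear only in the links pointing to it. This is a generic sketch, not a description of any particular search engine; the function names and example URLs are hypothetical, and every backlink counts equally here, whereas the editorial notes that backlinks from popular sites carry more weight.

```python
from collections import defaultdict


def anchor_text_index(links):
    """Index target pages by the words in the anchor text of their backlinks.

    links: iterable of (source_url, target_url, anchor_text) triples.
    """
    index = defaultdict(lambda: defaultdict(int))
    for _source, target, anchor in links:
        for word in anchor.lower().split():
            index[word][target] += 1  # one vote per backlink using this word
    return index


def search(index, keyword):
    """Return target pages for a keyword, most backlink votes first."""
    targets = index.get(keyword.lower(), {})
    return sorted(targets, key=lambda t: (-targets[t], t))
```

    With a link whose anchor text is in another language, the target page is returned for that foreign-language keyword even though the page itself never contains the word.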

  9. Contextual and Conceptual Information Retrieval and Navigation on the Web

    Science.gov (United States)

    Le Grand, Bénédicte; Aufaure, Marie-Aude; Soto, Michel

    The goal of this chapter is to propose a methodology and tools to enhance information retrieval and navigation on the Web through contextual and conceptual help. This methodology provides users with an extended navigation space by adding a conceptual and a semantic layer above Web data. The conceptual layer is made of Galois lattices which cluster Web pages into concepts according to their common features (in particular their textual content). These lattices represent the Global Conceptual Context of Web pages. An additional navigation layer is provided by ontologies which are connected to the conceptual level through specific concepts of the lattices. Users may navigate transparently within each of these three layers and go from one to another very easily.
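
    The conceptual layer described in this record groups pages that share features into the concepts of a Galois lattice. A brute-force sketch of how such concepts can be enumerated for a very small collection is given below; the function name and the page/feature encoding are assumptions, the enumeration is exponential in the number of pages, and the chapter's own tools are not specified in the abstract.

```python
from itertools import combinations


def formal_concepts(context):
    """Enumerate the formal concepts of a small page-to-features context.

    context: dict mapping a page id to the set of features it has.
    Returns a set of (extent, intent) pairs of frozensets, where the extent is
    exactly the pages sharing the intent and the intent is exactly the features
    shared by the extent.
    """
    pages = list(context)
    all_features = set().union(*context.values())
    concepts = set()
    for size in range(len(pages) + 1):
        for group in combinations(pages, size):
            # Intent: features common to every page in the group
            # (all features when the group is empty).
            intent = set(all_features)
            for page in group:
                intent &= context[page]
            # Extent: every page that has all features of the intent.
            extent = frozenset(p for p in pages if intent <= context[p])
            concepts.add((extent, frozenset(intent)))
    return concepts
```

    Ordering the resulting concepts by inclusion of their extents gives the lattice that a user would navigate, from the most general concept (all pages, few shared features) to the most specific ones.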

  10. Issues in the use of neural networks in information retrieval

    CERN Document Server

    Iatan, Iuliana F

    2017-01-01

    This book highlights the ability of neural networks (NNs) to be excellent pattern matchers and their importance in information retrieval (IR), which is based on index term matching. The book defines a new NN-based method for learning image similarity and describes how to use fuzzy Gaussian neural networks to predict personality. It introduces the fuzzy Clifford Gaussian network, and two concurrent neural models: (1) concurrent fuzzy nonlinear perceptron modules, and (2) concurrent fuzzy Gaussian neural network modules. Furthermore, it explains the design of a new model of fuzzy nonlinear perceptron based on alpha level sets and describes a recurrent fuzzy neural network model with a learning algorithm based on the improved particle swarm optimization method.

  11. Algebraic Modeling of Information Retrieval in XML Documents

    Science.gov (United States)

    Georgiev, Bozhidar; Georgieva, Adriana

    2009-11-01

    This paper presents an information retrieval approach in XML documents using tools based on linear algebra. Well-known transformation languages such as XSLT (XPath) are grounded on the features of higher-order logic for manipulating hierarchical trees. The presented conception is compared to existing higher-order logic formalisms, where the queries are realized by both languages, XSLT and XPath. The possibilities of the proposed linear algebraic model combined with hierarchy data models permit more efficient solutions for searching, extracting and manipulating semi-structured data with hierarchical structures, avoiding global navigation over the XML tree components. The main purpose of this algebraic model representation, applied to the hierarchical relationships in the XML data structures, is to make the implementation of linear algebra tools possible for XML data manipulations, to eliminate existing problems related to regular grammar theory, and to avoid the difficulties connected with higher-order logic (first-order logic, monadic second-order logic, etc.).

  12. Efficient hardware-based private information retrieval using partial reshuffle

    Institute of Scientific and Technical Information of China (English)

    Lan Tian; Qin Zhiguang

    2010-01-01

    The paper proposes a novel hardware-based private information retrieval (HWPIR) protocol. By partially reshuffling previously accessed items in each round, instead of frequently reshuffling the whole database, the scheme makes better use of shuffled data copies and achieves a computation overhead of $O(\sqrt{N/k})$, where N and k are the sizes of the database and the secure storage, respectively. For secure storage of moderate size, e.g. $k = O(\sqrt{N})$, the overhead is $O(\sqrt[4]{N})$. The result is much better than that of state-of-the-art schemes (as compared to e.g. O(log2N)). Without increasing response time or communication cost, the proposed protocol is truly practicable regardless of the database size. The security and performance of the protocol are formally analyzed.

  13. How to retrieve additional information from the multiplicity distributions

    CERN Document Server

    Wilk, Grzegorz

    2016-01-01

    Multiplicity distributions $P(N)$ measured in multiparticle production processes are most frequently described by the Negative Binomial Distribution (NBD). However, with increasing collision energy some systematic discrepancies become more and more apparent. They are usually attributed to the possible multi-source structure of the production process and described using a multi-NBD form of the multiplicity distribution. We investigate the possibility of keeping a single NBD but with its parameters depending on the multiplicity $N$. This is done by modifying the widely known clan model of particle production leading to the NBD form of $P(N)$. This is then confronted with the approach based on the so-called cascade-stochastic formalism which is based on different types of recurrence relations defining $P(N)$. We demonstrate that a combination of both approaches allows the retrieval of additional valuable information from the multiplicity distributions, namely the oscillatory behavior of the counting statistics a...

  14. Using Context to Improve the Evaluation of Information Retrieval Systems

    CERN Document Server

    Bouramoul, Abdelkrim; Doan, Bich-Lien; 10.5121/ijdms.2011.3202

    2011-01-01

    Evaluation plays a crucial role in the development of information retrieval tools: it provides the evidence needed to improve their performance and the quality of the results they return. However, the classic evaluation approaches have limitations and shortcomings, especially regarding consideration of the user, measurement of the fit between the query and the returned documents, and consideration of the characteristics, specifications and behaviors of the search tool. Therefore, we believe that exploiting contextual elements could be a very good way to evaluate search tools. This paper presents a new approach that takes context into account during the evaluation process at three complementary levels. The experiments given at the end of this article show the applicability of the proposed approach to real search tools. The tests were performed with the most popular search engines (i.e. Google, Bing and Yahoo), selected in particular for their high selectivity. The obtaine...

  15. Cross Lingual Information Retrieval With SMT And Query Mining

    Directory of Open Access Journals (Sweden)

    Suneet Kumar Gupta

    2011-10-01

    In this paper, we take an English corpus and queries in both translated and transliterated form. We use a statistical machine translator to obtain results for the translated and transliterated queries and then analyze them. The query-wise results are then mined to create a new list of queries. We have designed an experimental setup consisting of several steps that calculate Mean Average Precision. We used the open-source Terrier platform for information retrieval. On the basis of the new query list, we calculate the Mean Average Precision and find a significant result, i.e. 93.24%, which is very close to the monolingual results calculated for the English language.
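
    Mean Average Precision, the measure this abstract reports, can be illustrated with a minimal sketch. The function names and the toy rankings below are hypothetical and are not taken from the paper or from Terrier; the code only shows the standard definition (the average, over queries, of the precision values at each rank where a relevant document appears).

```python
def average_precision(ranked_ids, relevant_ids):
    """Average precision of one ranked result list against a set of relevant ids."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant document is found.
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """Mean of the per-query average precisions; runs is a list of (ranking, relevant) pairs."""
    if not runs:
        return 0.0
    return sum(average_precision(ranking, rel) for ranking, rel in runs) / len(runs)


# Hypothetical toy data: two queries with their rankings and relevance judgments.
toy_runs = [
    (["d1", "d2", "d3", "d4"], {"d1", "d3"}),
    (["d5", "d6", "d7"], {"d6"}),
]
toy_map = mean_average_precision(toy_runs)
```

    Relevant documents that never appear in the ranking still count in the denominator, so a system is penalized for missing them.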

  16. Challenging Conventional Assumptions of Automated Information Retrieval with Real Users: Boolean Searching and Batch Retrieval Evaluations.

    Science.gov (United States)

    Hersh, William; Turpin, Andrew; Price, Susan; Kraemer, Dale; Olson, Daniel; Chan, Benjamin; Sacherek, Lynetta

    2001-01-01

    Describes research conducted at the TREC (Text Retrieval Conference) interactive track that compared Boolean and natural language searching, showing they achieved comparable results; and assessed the validity of batch-oriented retrieval evaluations, showing that the results from batch evaluations were not comparable to those obtained in…

  17. Teaching information retrieval using research questions to encourage creativity and assess understanding

    OpenAIRE

    Jones, Gareth J.F.

    2007-01-01

    The study of information retrieval has increased in interest and importance with the explosive growth of online information in recent years. Learning about information retrieval within formal courses of study enables users of search engines to use them more knowledgeably and effectively, while providing the starting point for the explorations of new researchers into novel search technologies. The nature of information retrieval as a topic also makes it an ideal subject for develop...

  18. The challenge of automated tutoring in Web-based learning environments for information retrieval instruction

    OpenAIRE

    Sormunen, Eero; Pennanen, Sami

    2004-01-01

    The need to enhance information literacy education increases demand for effective Web-based learning environments for information retrieval instruction. The paper introduces the Query Performance Analyser, a unique instructional tool for information retrieval learning environments. On top of an information retrieval system and within a given search assignment, the Query Performance Analyser supports learning by instantly visualizing achieved query performance. Although the Query Performance A...

  19. Cross-language information retrieval using PARAFAC2.

    Energy Technology Data Exchange (ETDEWEB)

    Bader, Brett William; Chew, Peter; Abdelali, Ahmed (New Mexico State University, Las Cruces, NM); Kolda, Tamara Gibson

    2007-05-01

    A standard approach to cross-language information retrieval (CLIR) uses Latent Semantic Analysis (LSA) in conjunction with a multilingual parallel aligned corpus. This approach has been shown to be successful in identifying similar documents across languages - or more precisely, retrieving the most similar document in one language to a query in another language. However, the approach has severe drawbacks when applied to a related task, that of clustering documents 'language-independently', so that documents about similar topics end up closest to one another in the semantic space regardless of their language. The problem is that documents are generally more similar to other documents in the same language than they are to documents in a different language, but on the same topic. As a result, when using multilingual LSA, documents will in practice cluster by language, not by topic. We propose a novel application of PARAFAC2 (which is a variant of PARAFAC, a multi-way generalization of the singular value decomposition [SVD]) to overcome this problem. Instead of forming a single multilingual term-by-document matrix which, under LSA, is subjected to SVD, we form an irregular three-way array, each slice of which is a separate term-by-document matrix for a single language in the parallel corpus. The goal is to compute an SVD for each language such that V (the matrix of right singular vectors) is the same across all languages. Effectively, PARAFAC2 imposes the constraint, not present in standard LSA, that the 'concepts' in all documents in the parallel corpus are the same regardless of language. Intuitively, this constraint makes sense, since the whole purpose of using a parallel corpus is that exactly the same concepts are expressed in the translations. We tested this approach by comparing the performance of PARAFAC2 with standard LSA in solving a particular CLIR problem. From our results, we conclude that PARAFAC2 offers a very promising alternative to

  20. A Real-Time and Dynamic Biological Information Retrieval and Analysis System (BIRAS)

    Institute of Scientific and Technical Information of China (English)

    Qi Zhou; Hong Zhang; Meiying Geng; Chenggang Zhang

    2003-01-01

    The aim of this study is to design an Internet-based biological information retrieval and analysis system (BIRAS). Using a specific network protocol, BIRAS can send information to and receive information from the Entrez search and retrieval system maintained by the National Center for Biotechnology Information (NCBI) in the USA. Literature, nucleotide sequences, protein sequences, and other resources matching a user-defined term can then be retrieved and sent to the user automatically, by pop-up message or by e-mail. All information retrieval and analysis is done in real time. As a robust system for intelligently and dynamically retrieving and analyzing user-defined information, BIRAS is expected to be widely used for retrieving specific information from the large number of biological databases now available. The program is available on request from the corresponding author.

  1. A Real-Time and Dynamic Biological Information Retrieval and Analysis System (BIRAS)

    Institute of Scientific and Technical Information of China (English)

    Qi Zhou; Hong Zhang; Meiying Geng; Chenggang Zhang

    2003-01-01

    The aim of this study is to design an Internet-based biological information retrieval and analysis system (BIRAS). Using a specific network protocol, BIRAS can send information to and receive information from the Entrez search and retrieval system maintained by the National Center for Biotechnology Information (NCBI) in the USA. Literature, nucleotide sequences, protein sequences, and other resources matching a user-defined term can then be retrieved and sent to the user automatically, by pop-up message or by e-mail. All information retrieval and analysis is done in real time. As a robust system for intelligently and dynamically retrieving and analyzing user-defined information, BIRAS is expected to be widely used for retrieving specific information from the large number of biological databases now available. The program is available on request from the corresponding author.

  2. Automatic Content Analysis; Part I of Scientific Report No. ISR-18, Information Storage and Retrieval...

    Science.gov (United States)

    Cornell Univ., Ithaca, NY. Dept. of Computer Science.

    Four papers are included in Part One of the eighteenth report on Salton's Magical Automatic Retriever of Texts (SMART) project. The first paper: "Content Analysis in Information Retrieval" by S. F. Weiss presents the results of experiments aimed at determining the conditions under which content analysis improves retrieval results as well…

  3. A Fuzzy Genetic Algorithm Approach to an Adaptive Information Retrieval Agent.

    Science.gov (United States)

    Martin-Bautista, Maria J.; Vila, Maria-Amparo; Larsen, Henrik Legind

    1999-01-01

    Presents an approach to a Genetic Information Retrieval Agent Filter (GIRAF) that filters and ranks documents retrieved from the Internet according to users' preferences by using a Genetic Algorithm and fuzzy set theory to handle the imprecision of users' preferences and users' evaluation of the retrieved documents. (Author/LRW)

  4. Organization of the Inverted Files in a Distributed Information Retrieval System Based on Thesauri.

    Science.gov (United States)

    Mazur, Zygmunt

    1986-01-01

    Describes how operations on local inverted files are to be modified in order to use them in distributed information retrieval systems based on thesauri. The presented rules may be viewed as the logical approach in implementing a distributed retrieval system consisting of n local retrieval systems. (Author/MBR)

  5. Query-Time Optimization Techniques for Structured Queries in Information Retrieval

    Science.gov (United States)

    Cartright, Marc-Allen

    2013-01-01

    The use of information retrieval (IR) systems is evolving towards larger, more complicated queries. Both the IR industrial and research communities have generated significant evidence indicating that in order to continue improving retrieval effectiveness, increases in retrieval model complexity may be unavoidable. From an operational perspective,…

  6. Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives

    DEFF Research Database (Denmark)

    Cao, Xin; Cong, Gao; Cui, Bin

    2012-01-01

    of CQA services, question retrieval in a CQA archive aims to retrieve historical question-answer pairs that are relevant to a query question. This article presents several new approaches to exploiting the category information of questions for improving the performance of question retrieval...

  7. Entropy of the Information Retrieved from Black Holes

    CERN Document Server

    Mersini-Houghton, Laura

    2015-01-01

    The retrieval of black hole information was recently presented in two interesting proposals at the 'Hawking Radiation' conference: a revised version by G. 't Hooft of a proposal he initially suggested 20 years ago, and a new proposal by S. Hawking. Both proposals address the problem of black hole information loss at the classical level and derive an expression for the scattering matrix. The former uses gravitational back reaction of incoming particles that imprints its information on the outgoing modes. The latter uses supertranslation symmetry of horizons to relate a phase delay of the outgoing wave packet compared to their incoming wave partners. The difficulty in both proposals is that the entropy obtained from them appears to be infinite. By including quantum effects in Hawking's and 't Hooft's proposals, I show that a subtlety arising from the inescapable measurement process, the Quantum Zeno Effect, not only tames divergences but actually recovers the correct $1/4$ of the area Bekenstein-Hawking en...

  8. Next-Generation Search Engines for Information Retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Devarakonda, Ranjeet [ORNL; Hook, Leslie A [ORNL; Palanisamy, Giri [ORNL; Green, James M [ORNL

    2011-01-01

    centralized index. The harvested files are indexed consistently against the SOLR search API, so that it can provide search capabilities such as simple, fielded, spatial and temporal searches across a span of projects covering land, atmosphere, and ocean ecology. Mercury also provides data sharing capabilities using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). In this paper we discuss best practices for archiving data and metadata, new searching techniques, and efficient ways of retrieving data and displaying information.

  9. A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering.

    Science.gov (United States)

    Sarrouti, Mourad; Ouatik El Alaoui, Said

    2017-04-01

    Passage retrieval, the identification of top-ranked passages that may contain the answer for a given biomedical question, is a crucial component for any biomedical question answering (QA) system. Passage retrieval in open-domain QA is a longstanding challenge widely studied over the last decades. However, it still requires further efforts in biomedical QA. In this paper, we present a new biomedical passage retrieval method based on Stanford CoreNLP sentence/passage length, a probabilistic information retrieval (IR) model and UMLS concepts. In the proposed method, we first use our document retrieval system, based on the PubMed search engine and UMLS similarity, to retrieve documents relevant to a given biomedical question. We then take the abstracts of the retrieved documents and use the Stanford CoreNLP sentence splitter to produce a set of sentences, i.e., candidate passages. Using stemmed words and UMLS concepts as features for the BM25 model, we finally compute the similarity scores between the biomedical question and each of the candidate passages and keep the N top-ranked ones. Experimental evaluations performed on large standard datasets, provided by the BioASQ challenge, show that the proposed method achieves good performance compared with current state-of-the-art methods. The proposed method significantly outperforms the current state-of-the-art methods by an average of 6.84% in terms of mean average precision (MAP). We have proposed an efficient passage retrieval method which can be used to retrieve relevant passages in biomedical QA systems with high mean average precision. Copyright © 2017 Elsevier Inc. All rights reserved.
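
    The final ranking step described above (score each candidate passage against the question with BM25 and keep the N top-ranked ones) might look roughly like the sketch below. It is a simplified illustration, not the authors' system: it uses lowercased whitespace tokens in place of stemmed words and UMLS concepts, the common defaults k1 = 1.2 and b = 0.75, and a non-negative variant of the IDF term. The function names and example passages are hypothetical.

```python
import math
from collections import Counter


def bm25_scores(query_tokens, tokenized_passages, k1=1.2, b=0.75):
    """Okapi BM25 score of every tokenized passage for the given query tokens."""
    n = len(tokenized_passages)
    if n == 0:
        return []
    avg_len = sum(len(p) for p in tokenized_passages) / n
    if avg_len == 0:
        return [0.0] * n
    # Document frequency: number of passages containing each term.
    df = Counter()
    for passage in tokenized_passages:
        df.update(set(passage))
    scores = []
    for passage in tokenized_passages:
        tf = Counter(passage)
        score = 0.0
        for term in query_tokens:
            if term not in tf:
                continue
            # Smoothed IDF variant that cannot go negative (an assumption of this sketch).
            idf = math.log((n - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            length_norm = k1 * (1 - b + b * len(passage) / avg_len)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + length_norm)
        scores.append(score)
    return scores


def top_passages(question, passages, n_top=2):
    """Return the n_top passages with the highest BM25 score for the question."""
    query = question.lower().split()
    tokenized = [p.lower().split() for p in passages]
    scores = bm25_scores(query, tokenized)
    order = sorted(range(len(passages)), key=lambda i: scores[i], reverse=True)
    return [passages[i] for i in order[:n_top]]


# Hypothetical candidate passages, for illustration only.
candidates = [
    "aspirin inhibits platelet aggregation",
    "the weather is sunny today",
    "platelet counts vary",
]
best = top_passages("what inhibits platelet aggregation", candidates, n_top=1)
```

    Swapping the whitespace tokenizer for a stemmer or a concept mapper changes only the token lists passed to `bm25_scores`; the scoring itself stays the same.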

  10. Disposal of Information Seeking and Retrieval Research: Replacement with a Radical Proposition

    Science.gov (United States)

    Budd, John M.; Anstaett, Ashley

    2013-01-01

    Introduction: Research and theory on the topics of information seeking and retrieval have been plagued by some fundamental problems for several decades. Many of the difficulties spring from mechanistic and instrumental thinking and modelling. Method: Existing models of information retrieval and information seeking are examined for efficacy in a…

  11. Comparing the quality of accessing medical literature using content-based visual and textual information retrieval

    Science.gov (United States)

    Müller, Henning; Kalpathy-Cramer, Jayashree; Kahn, Charles E., Jr.; Hersh, William

    2009-02-01

    Content-based visual information (or image) retrieval (CBIR) has been an extremely active research domain within medical imaging over the past ten years, with the goal of improving the management of visual medical information. Many technical solutions have been proposed, and application scenarios for image retrieval as well as image classification have been set up. However, in contrast to medical information retrieval using textual methods, visual retrieval has only rarely been applied in clinical practice. This is despite the large amount and variety of visual information produced in hospitals every day. This information overload imposes a significant burden upon clinicians, and CBIR technologies have the potential to help the situation. However, in order for CBIR to become an accepted clinical tool, it must demonstrate a higher level of technical maturity than it has to date. Since 2004, the ImageCLEF benchmark has included a task for the comparison of visual information retrieval algorithms for medical applications. In 2005, a task for medical image classification was introduced and both tasks have been run successfully for the past four years. These benchmarks allow an annual comparison of visual retrieval techniques based on the same data sets and the same query tasks, enabling the meaningful comparison of various retrieval techniques. The datasets used from 2004-2007 contained images and annotations from medical teaching files. In 2008, however, the dataset used was made up of 67,000 images (along with their associated figure captions and the full text of their corresponding articles) from two Radiological Society of North America (RSNA) scientific journals. This article describes the results of the medical image retrieval task of the ImageCLEF 2008 evaluation campaign. We compare the retrieval results of both visual and textual information retrieval systems from 15 research groups on the aforementioned data set. The results show clearly that, currently

  12. A Problem in Information Retrieval with Fuzzy Sets.

    Science.gov (United States)

    Buell, Duncan A.

    1985-01-01

    Discussion of problems with fuzzy subsets in document retrieval highlights attempts to invent a system of weighted fuzzy queries in which weights correspond to relative importance of each term in query as whole, and use of Kantor's Logic for Retrieval as an alternative to Boolean queries. Six references are cited. (EJS)

  13. Contextual Information Retrieval based on Algorithmic Information Theory and Statistical Outlier Detection

    CERN Document Server

    Martinez, Rafael; Rodriguez, Francisco de Borja; Camacho, David

    2007-01-01

    The main contribution of this paper is the design of an Information Retrieval (IR) technique based on Algorithmic Information Theory (using the Normalized Compression Distance, NCD), statistical techniques (outliers), and a novel organization of the database structure. The paper shows how they can be integrated to retrieve information from generic databases using long (text-based) queries. Two important problems are analyzed in the paper. On the one hand, how to detect "false positives" when the distance among the documents is very low and there is actual similarity. On the other hand, we propose a way to structure a document database in which the similarity distance estimation depends on the length of the selected text. Finally, the experimental evaluations carried out to study these problems are shown.
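
    The Normalized Compression Distance mentioned in this abstract has a compact standard definition, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is the compressed size. The sketch below is an illustration only: it assumes zlib as the compressor (the abstract does not say which one the authors used), and the function names and sample texts are hypothetical.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed data (compression level 9)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized Compression Distance: near 0 for similar inputs, near 1 for unrelated ones."""
    cx = compressed_size(x)
    cy = compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


# Hypothetical sample inputs: one text compared with itself and with unrelated bytes.
text = b"information retrieval systems rank documents by relevance to a query " * 5
unrelated = bytes(range(256)) * 2
d_same = ncd(text, text)
d_diff = ncd(text, unrelated)
```

    Because real compressors add headers and have bounded windows, the distance between a text and itself is small but not exactly zero, and very short inputs give unreliable values. That sensitivity to text length is the kind of effect the abstract says the database structure has to account for.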

  14. Propositions du Centre Mondial D'Information sur l'Education Bilingue (CMIEB) au Conseil de l'Europe [Proposals of the World Center of Information on Bilingual Education (CMIEB) to the Council of Europe].

    Science.gov (United States)

    Dalgalian, Gilbert

    The socio-economic, cultural and political situation in Europe necessitates a systematic and pedagogically sound promotion of bilingual education. Two prerequisites to this promotion are seen: (1) a political framework and support, and (2) a substructure for linguistic and pedagogical information and coordination. A proposal is presented to…

  15. Retrieving Information from the Invisible Web Using Mobile Agents

    Directory of Open Access Journals (Sweden)

    Fabien-Kenzo Sato

    2005-01-01

    This study proposes a model of information retrieval on the invisible Web by using the mobile agent paradigm. The developed architecture uses the power of a search engine to provide a list of sites of the invisible Web which are likely to be relevant and launches a dynamic search on these sites, thanks to mobile agents. To compare and experiment in real conditions, two versions were implemented: a version using the traditional client/server paradigm and a version using mobile agents. Client/server tests on actual Websites generated satisfactory qualitative results. A series of comparative experiments of the two versions implemented were carried out using a test site. Results show that the mobile agent version generates much less traffic and is thus faster than the client/server version, especially with low bandwidth. Moreover, as the mobile agents carry out calculations on the server rather than on the client’s site, this approach relieves the resources of the client terminal. Thus, the mobile agent approach seems particularly advantageous in the case of weak resource terminals such as PDAs.

  16. Stemmer Impact on Quranic Mobile Information Retrieval Performance

    Directory of Open Access Journals (Sweden)

    Huda Omar Aljaloud

    2016-12-01

    Stemming algorithms are employed in information retrieval (IR) to reduce variants of the same word with different endings to a standard stem. Stemmers can also help IR systems by unifying vocabulary, reducing term variants, reducing storage space, and increasing the likelihood of matching documents, all of which make stemming very attractive for use in IR. This paper studies the impact of stemming techniques on the effectiveness of mobile IR. Two word-extraction stemming techniques are used: a light stemmer and a dictionary-lookup stemmer. Three sets of experiments were conducted in order to raise the efficiency of mobile applications. Implementing the two stemming approaches and assessing their accuracy by calculating precision, recall, MAP, and F-measure produced results which show that the light10 stemmer outperforms the dictionary-lookup stemmer in precision and MAP. Furthermore, the mobile performance of the light10 stemmer exceeds that of the dictionary-based stemmer.

  17. How to retrieve additional information from the multiplicity distributions

    Science.gov (United States)

    Wilk, Grzegorz; Włodarczyk, Zbigniew

    2017-01-01

    Multiplicity distributions (MDs) P(N) measured in multiparticle production processes are most frequently described by the negative binomial distribution (NBD). However, with increasing collision energy some systematic discrepancies have become more and more apparent. They are usually attributed to the possible multi-source structure of the production process and described using a multi-NBD form of the MD. We investigate the possibility of keeping a single NBD but with its parameters depending on the multiplicity N. This is done by modifying the widely known clan model of particle production leading to the NBD form of P(N). This is then confronted with the approach based on the so-called cascade-stochastic formalism which is based on different types of recurrence relations defining P(N). We demonstrate that a combination of both approaches allows the retrieval of additional valuable information from the MDs, namely the oscillatory behavior of the counting statistics apparently visible in the high energy data.
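
    To make the notion of a recurrence relation defining P(N) concrete, the sketch below checks a textbook property of the negative binomial distribution: with P(N) = Γ(N+k) / (Γ(N+1) Γ(k)) p^N (1-p)^k, consecutive probabilities satisfy (N+1) P(N+1) = g(N) P(N) with the linear function g(N) = p (k + N). This is a standard identity, not necessarily the exact formalism used by the authors, and the parameter values and function names are hypothetical.

```python
import math


def nbd_pmf(n: int, k: float, p: float) -> float:
    """Negative binomial P(n) with shape k > 0 and 0 < p < 1, evaluated in log space."""
    log_pmf = (
        math.lgamma(n + k) - math.lgamma(n + 1) - math.lgamma(k)
        + n * math.log(p) + k * math.log1p(-p)
    )
    return math.exp(log_pmf)


def recurrence_g(n: int, k: float, p: float) -> float:
    """Linear function g(n) in the recurrence (n + 1) P(n + 1) = g(n) P(n)."""
    return p * (k + n)


# Hypothetical parameter values, chosen only for illustration.
k, p = 2.5, 0.6
total = sum(nbd_pmf(n, k, p) for n in range(2000))
mean = sum(n * nbd_pmf(n, k, p) for n in range(2000))
recurrence_holds = all(
    math.isclose((n + 1) * nbd_pmf(n + 1, k, p), recurrence_g(n, k, p) * nbd_pmf(n, k, p))
    for n in range(50)
)
```

    Letting the parameters depend on N, as the abstract proposes, amounts to replacing the linear g(N) with a more general function of N.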

  18. Information retrieval for education: making search engines language aware

    Directory of Open Access Journals (Sweden)

    Niels Ott

    2010-01-01

    Search engines have been a major factor in making the web the successful and widely used information source it is today. Generally speaking, they make it possible to retrieve web pages on a topic specified by the keywords entered by the user. Yet web searching currently does not take into account which of the search results are comprehensible for a given user – an issue of particular relevance when considering students in an educational setting. And current search engines do not support teachers in searching for language properties relevant for selecting texts appropriate for language students at different stages in the second language acquisition process. At the same time, raising language awareness is a major focus in second language acquisition research and foreign language teaching practice, and research since the 20s has tried to identify indicators predicting which texts are comprehensible for readers at a particular level of ability. For example, the military has been interested in ensuring that workers at a given level of education can understand the manuals they need to read in order to perform their job. We present a new search engine approach which makes it possible for teachers to search for texts both in terms of contents and in terms of their reading difficulty and other language properties. The implemented prototype builds on state-of-the-art information retrieval technology and exemplifies how a range of readability measures can be integrated in a modular fashion.

  19. Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data

    OpenAIRE

    2005-01-01

    This paper aims to design and build an Automatic Information Retrieval System to handle Arabic data. It also presents a comparison between the retrieval results obtained with the vector space model under two different indexing methods: full-word indexing and root indexing. The proposed Automatic Information Retrieval system was implemented and built using a traditional model technique, the Vector Space Model (VSM), with the cosine similarity measure. The output resu...
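
    The cosine measure used with the vector space model can be sketched as follows. This is a minimal illustration, assuming raw term-frequency weights and whitespace tokenization; the abstract does not specify the weighting scheme, and the indexing step (full words or roots) is represented here only by whatever tokens are passed in. All names and sample strings are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse term-weight vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_documents(query: str, documents: list) -> list:
    """Return (score, document) pairs sorted by decreasing cosine similarity to the query."""
    query_vector = Counter(query.split())
    scored = [(cosine_similarity(query_vector, Counter(d.split())), d) for d in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Hypothetical toy collection; index terms would be full words or roots in the paper's setting.
docs = ["retrieval of arabic text", "root based indexing of arabic words", "weather report"]
ranking = rank_documents("arabic retrieval", docs)
```

    Comparing the two indexing methods then means running the same ranking twice, once with full-word tokens and once with root tokens, and comparing the resulting retrieval quality.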

  20. Generic information can retrieve known biological associations: implications for biomedical knowledge discovery.

    Directory of Open Access Journals (Sweden)

    Herman H H B M van Haagen

    MOTIVATION: Weighted semantic networks built from text-mined literature can be used to retrieve known protein-protein or gene-disease associations, and have been shown to anticipate associations years before they are explicitly stated in the literature. Our text-mining system recognizes over 640,000 biomedical concepts: some are specific (i.e., names of genes or proteins), others generic (e.g., 'Homo sapiens'). Generic concepts may play important roles in automated information retrieval, extraction, and inference but may also result in concept overload and confound retrieval and reasoning with low-relevance or even spurious links. Here, we attempted to optimize the retrieval performance for protein-protein interactions (PPI) by filtering generic concepts (node filtering) or links to generic concepts (edge filtering) from a weighted semantic network. First, we defined metrics based on network properties that quantify the specificity of concepts. Then, using these metrics, we systematically filtered generic information from the network while monitoring retrieval performance of known protein-protein interactions. We also systematically filtered specific information from the network (inverse filtering), and assessed the retrieval performance of networks composed of generic information alone. RESULTS: Filtering generic or specific information induced a two-phase response in retrieval performance: initially the effects of filtering were minimal, but beyond a critical threshold network performance suddenly dropped. Contrary to expectations, networks composed exclusively of generic information demonstrated retrieval performance comparable to unfiltered networks that also contain specific concepts. Furthermore, an analysis using individual generic concepts demonstrated that they can effectively support the retrieval of known protein-protein interactions. For instance the concept "binding" is indicative for PPI retrieval and the concept "mutation abnormality" is

  1. Latent morpho-semantic analysis : multilingual information retrieval with character n-grams and mutual information.

    Energy Technology Data Exchange (ETDEWEB)

    Bader, Brett William; Chew, Peter A.; Abdelali, Ahmed (New Mexico State University)

    2008-08-01

    We describe an entirely statistics-based, unsupervised, and language-independent approach to multilingual information retrieval, which we call Latent Morpho-Semantic Analysis (LMSA). LMSA overcomes some of the shortcomings of related previous approaches such as Latent Semantic Analysis (LSA). LMSA has an important theoretical advantage over LSA: it combines well-known techniques in a novel way to break the terms of LSA down into units which correspond more closely to morphemes. Thus, it has a particular appeal for use with morphologically complex languages such as Arabic. We show through empirical results that the theoretical advantages of LMSA can translate into significant gains in precision in multilingual information retrieval tests. These gains are not matched either when a standard stemmer is used with LSA, or when terms are indiscriminately broken down into n-grams.

  2. Efficient Methods to Assimilate Satellite Retrievals Based on Information Content. Part 2; Suboptimal Retrieval Assimilation

    Science.gov (United States)

    Joiner, J.; Dee, D. P.

    1998-01-01

    One of the outstanding problems in data assimilation has been and continues to be how best to utilize satellite data while balancing the tradeoff between accuracy and computational cost. A number of weather prediction centers have recently achieved remarkable success in improving their forecast skill by changing the method by which satellite data are assimilated into the forecast model from the traditional approach of assimilating retrievals to the direct assimilation of radiances in a variational framework. The operational implementation of such a substantial change in methodology involves a great number of technical details, e.g., pertaining to quality control procedures, systematic error correction techniques, and tuning of the statistical parameters in the analysis algorithm. Although there are clear theoretical advantages to the direct radiance assimilation approach, it is not obvious at all to what extent the improvements that have been obtained so far can be attributed to the change in methodology, or to various technical aspects of the implementation. The issue is of interest because retrieval assimilation retains many practical and logistical advantages which may become even more significant in the near future when increasingly high-volume data sources become available. The central question we address here is: how much improvement can we expect from assimilating radiances rather than retrievals, all other things being equal? We compare the two approaches in a simplified one-dimensional theoretical framework, in which problems related to quality control and systematic error correction are conveniently absent. By assuming a perfect radiative transfer model and perfect knowledge of radiance and background error covariances, we are able to formulate a nonlinear local error analysis for each assimilation method. Direct radiance assimilation is optimal in this idealized context, while the traditional method of assimilating retrievals is suboptimal because it

  3. 45 CFR 205.35 - Mechanized claims processing and information retrieval systems; definitions.

    Science.gov (United States)

    2010-10-01

    ... retrieval systems; definitions. 205.35 Section 205.35 Public Welfare Regulations Relating to Public Welfare... claims processing and information retrieval systems; definitions. Section 205.35 through 205.38 contain State plan requirements for an automated statewide management information system, conditions for FFP...

  4. Information Retrieval eXperience (IRX): Towards a Human-Centered Personalized Model of Relevance

    NARCIS (Netherlands)

    Sluis, van der Frans; Broek, van den Egon L.; Dijk, van Betsy; Hoeber, O.; Li, Y.; Huang, X.J.

    2010-01-01

    We approach Information Retrieval (IR) from a User eXperience (UX) perspective. Through introducing a model for Information Retrieval eXperience (IRX), this paper operationalizes a perspective on IR that reaches beyond topicality. Based on a document's topicality, complexity, and emotional value, a

  5. A Parallel Relational Database Management System Approach to Relevance Feedback in Information Retrieval.

    Science.gov (United States)

    Lundquist, Carol; Frieder, Ophir; Holmes, David O.; Grossman, David

    1999-01-01

    Describes a scalable, parallel, relational database-driven information retrieval engine. To support portability across a wide range of execution environments, all algorithms adhere to the SQL-92 standard. By incorporating relevance feedback algorithms, accuracy is enhanced over prior database-driven information retrieval efforts. Presents…
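
    As a rough illustration of what a database-driven retrieval engine with relevance feedback can look like, the sketch below ranks documents with a plain SQL join and then boosts query terms that occur in a document marked relevant. The table names (`postings`, `query`), the toy data, the `boost` value, and the scoring formula are all assumptions made for this example; the record above does not describe the authors' schema or algorithms.

```python
import sqlite3

# Hypothetical toy schema (not taken from the record above): one row per
# (term, document) pair, plus a table holding the weighted query terms.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE postings (term VARCHAR(32), doc_id INTEGER, tf INTEGER)")
conn.execute("CREATE TABLE query (term VARCHAR(32), weight REAL)")
conn.executemany(
    "INSERT INTO postings VALUES (?, ?, ?)",
    [
        ("parallel", 1, 2), ("database", 1, 1),
        ("database", 2, 3), ("retrieval", 2, 1),
        ("retrieval", 3, 4),
    ],
)
conn.executemany(
    "INSERT INTO query VALUES (?, ?)",
    [("database", 1.0), ("retrieval", 1.0)],
)

# Score = sum over matching terms of (term frequency * query weight).
RANK_SQL = """
SELECT p.doc_id, SUM(p.tf * q.weight) AS score
FROM postings p JOIN query q ON p.term = q.term
GROUP BY p.doc_id
ORDER BY score DESC, p.doc_id
"""

def search(connection):
    """Return (doc_id, score) rows, best match first."""
    return connection.execute(RANK_SQL).fetchall()

def add_feedback(connection, relevant_doc_id, boost=0.5):
    """Simplified feedback: raise the weight of query terms that occur
    in a document the user judged relevant (no new terms are added)."""
    connection.execute(
        "UPDATE query SET weight = weight + ? "
        "WHERE term IN (SELECT term FROM postings WHERE doc_id = ?)",
        (boost, relevant_doc_id),
    )

first_pass = search(conn)
add_feedback(conn, relevant_doc_id=3)
second_pass = search(conn)
```

    In this toy run, documents 2 and 3 tie on the first pass, and marking document 3 as relevant moves it to the top on the second pass. A fuller feedback scheme would also add new terms drawn from the relevant documents.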

  6. Nonmaterialized Relations and the Support of Information Retrieval Applications by Relational Database Systems.

    Science.gov (United States)

    Lynch, Clifford A.

    1991-01-01

    Describes several aspects of the problem of supporting information retrieval system query requirements in the relational database management system (RDBMS) environment and proposes an extension to query processing called nonmaterialized relations. User interactions with information retrieval systems are discussed, and nonmaterialized relations are…

  7. Experiments in Discourse Analysis Impact on Information Classification and Retrieval Algorithms.

    Science.gov (United States)

    Morato, Jorge; Llorens, J.; Genova, G.; Moreiro, J. A.

    2003-01-01

    Discusses the inclusion of contextual information in indexing and retrieval systems to improve results and the ability to carry out text analysis by means of linguistic knowledge. Presents research that investigated whether discourse variables have an impact on information retrieval and classification algorithms. (Author/LRW)

  8. On a Model of Distributed Information Retrieval Systems Based on Thesauri.

    Science.gov (United States)

    Mazur, Zygmunt

    1984-01-01

    Investigates the properties of a global model consisting of "n" local information retrieval systems based on thesauri. Definitions of a distributed information retrieval system (thesaurus, documents set, set of queries) and proofs of theorems denoting further properties of the systems are highlighted. Five references are included. (EJS)

  9. A Domain Specific Lexicon Acquisition Tool for Cross-Language Information Retrieval

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Jong, de Franciska; Kraaij, Wessel

    1997-01-01

    With the recent enormous increase of information dissemination via the web as incentive there is a growing interest in supporting tools for cross-language retrieval. In this paper we describe a disclosure and retrieval approach that fulfils the needs of both information providers and users by offeri

  10. The Relative Effectiveness of Varied Visual Testing Formats in Retrieving Information Related to Different Educational Objectives

    Science.gov (United States)

    Williams, Jaison; Dwyer, Francis

    2004-01-01

    The purpose of this study is to: (1) examine the relative effectiveness with which different types of visual test formats facilitated information retrieval on tests measuring different educational objectives; (2) measure the effect that prior knowledge had on information retrieval; and (3) determine whether an interaction existed between prior…

  11. Strong Similarity Measures for Ordered Sets of Documents in Information Retrieval.

    Science.gov (United States)

    Egghe, L.; Michel, Christine

    2002-01-01

    Presents a general method to construct ordered similarity measures in information retrieval based on classical similarity measures for ordinary sets. Describes a test of some of these measures in an information retrieval system that extracted ranked document sets and discusses the practical usability of the ordered similarity measures. (Author/LRW)

  12. Personalizing Information Retrieval Using Interaction Behaviors in Search Sessions in Different Types of Tasks

    Science.gov (United States)

    Liu, Chang

    2012-01-01

    When using information retrieval (IR) systems, users often pose short and ambiguous query terms. It is critical for IR systems to obtain a more accurate representation of users' information need, their document preferences, and the context they are working in, and then incorporate them into the design of the systems to tailor retrieval to…

  13. A Parallel Relational Database Management System Approach to Relevance Feedback in Information Retrieval.

    Science.gov (United States)

    Lundquist, Carol; Frieder, Ophir; Holmes, David O.; Grossman, David

    1999-01-01

    Describes a scalable, parallel, relational database-driven information retrieval engine. To support portability across a wide range of execution environments, all algorithms adhere to the SQL-92 standard. By incorporating relevance feedback algorithms, accuracy is enhanced over prior database-driven information retrieval efforts. Presents…

  14. Children's use of linguistic information when learning in a bilingual context.

    Science.gov (United States)

    Atagi, Natsuki; Goldenberg, Elizabeth R; Sandhofer, Catherine M

    2016-04-01

    Children prefer to learn from people who are like themselves. However, who is considered "like themselves" is complex for bilingual children. Thus, the current study examined whether children's language experiences affect who they prefer to imitate. A sample of 3- to 5-year-old monolingual English-speaking children (n=16), Japanese-English bilingual children (n=16), and children bilingual in English and a non-Japanese language (n=16) watched videos of a monolingual English speaker and a Japanese-English bilingual speaker playing with novel toys and were asked to play with the same novel toys. Although all children--regardless of language background--imitated the monolingual speaker at similar rates, the two bilingual groups imitated the bilingual speaker more often than did the monolingual children. Such results suggest that experience in speaking two languages affects children's imitation behaviors.

  15. Dissociable parietal regions facilitate successful retrieval of recently learned and personally familiar information.

    Science.gov (United States)

    Elman, Jeremy A; Cohn-Sheehy, Brendan I; Shimamura, Arthur P

    2013-03-01

    In fMRI analyses, the posterior parietal cortex (PPC) is particularly active during the successful retrieval of episodic memory. To delineate the neural correlates of episodic retrieval more succinctly, we compared retrieval of recently learned spatial locations (photographs of buildings) with retrieval of previously familiar locations (photographs of familiar campus buildings). Episodic retrieval of recently learned locations activated a circumscribed region within the ventral PPC (anterior angular gyrus and adjacent regions in the supramarginal gyrus) as well as medial PPC regions (posterior cingulate gyrus and posterior precuneus). Retrieval of familiar locations activated more posterior regions in the ventral PPC (posterior angular gyrus, LOC) and more anterior regions in the medial PPC (anterior precuneus and retrosplenial cortex). These dissociable effects define more precisely PPC regions involved in the retrieval of recent, contextually bound information as opposed to regions involved in other processes, such as visual imagery, scene reconstruction, and self-referential processing.

  16. Information retrieval patterns and needs among practicing general surgeons: a statewide experience.

    Science.gov (United States)

    Shelstad, K R; Clevenger, F W

    1996-10-01

    Information retrieval has progressed from a reliance on traditional print sources to the modern era of computer databases and online networks. Surgeons, many from remote areas not served by professional medical libraries, must develop and maintain skills in information retrieval and management in both electronic and standard formats. One hundred thirty-three New Mexico general surgeons were surveyed to identify their information-seeking patterns in five areas: retrieval purposes, retrieval sources, barriers to access, techniques used, and continuing education needs. Ninety-nine (74.4%) surgeons responded to the survey. Ninety-five percent utilize professional meetings, the medical literature, and physician colleagues as information sources. Only 17% utilize the outreach services of the state's only medical school library. Common retrieval barriers were practice demands (71%), isolation from medical schools (30%), computer illiteracy (28%), and rural environment (25%). Continuing education topics related to information management would be valuable to 61% of the surgeons. Sixty-nine percent believe their current ability to access biomedical information is adequate, despite most frequently accessing their personal libraries for information related to decision-making or patient management. These data suggest that, despite significant information needs, surgeons have not embraced newer forms of information retrieval. It is imperative that surgeons acquire and maintain modern information retrieval skills as a means of remaining up-to-date in their profession. Professional surgical organizations and medical librarians should collaborate on these continuing education ventures.

  17. Aerometric Information Retrieval System/AIRS Facility Subsystem (AIRS/AFS)

    Data.gov (United States)

    U.S. Environmental Protection Agency — The Aerometric Information Retrieval System/AIRS Facility Subsystem (AIRS/AFS) is a database that provides information on air releases from various stationary...

  18. Subjective Probability and Information Retrieval: A Review of the Psychological Literature.

    Science.gov (United States)

    Thompson, Paul

    1988-01-01

    Reviews the subjective probability estimation literature of six schools of human judgement and decision making: decision theory, behavioral decision theory, psychological decision theory, social judgement theory, information integration theory, and attribution theory. Implications for probabilistic information retrieval are discussed, including…

  19. Intrasubtest scatter on the WAIS-III information subtest and psychometrically defined retrieval deficits.

    Science.gov (United States)

    Ryan, J J; Paul, C A; Arb, J D

    1999-12-01

    Milberg et al. (1996) postulated that significant intrasubtest scatter on the Wechsler Information subtest reflects impaired retrieval. From a pool of 205 male referrals at a VA medical center with complete WAIS-III and WMS-III protocols, 28 participants with impaired retrieval (Group I) defined by a high Retrieval Composite score were identified. A sample (Group II) without similar evidence of impaired retrieval was matched to Group I on age, education, Full Scale IQ, race, and diagnosis. Intrasubtest scatter on the Information subtest was the same across groups (Group I M = 6.3, SD = 2.7; Group II M = 6.9, SD = 3.4). A second study identified impaired retrieval using the WMS-III Word Lists subtest. 21 participants (Group III) had impaired retrieval indicated by a Recognition scaled score being ≥ 4 points higher than the Delayed Recall scaled score. A matched sample (Group IV) of VA patients without similar evidence of impaired retrieval was constituted. Intrasubtest scatter on the Information subtest did not differ across groups (Group III M = 6.6, SD = 2.4; Group IV M = 6.0, SD = 2.5). Evaluations of the retrieval deficit hypothesis should be based on responses of participants whose Information performance is characterized by abnormal amounts of intrasubtest scatter. It is possible that a specific amount of response variability must be present within the subtest before retrieval problems can be detected.

  20. [Design and implementation of medical instrument standard information retrieval system based on ASP.NET].

    Science.gov (United States)

    Yu, Kaijun

    2010-07-01

    This paper analyzes the design goals of a medical instrumentation standard information retrieval system. Based on the B/S (browser/server) structure, we established a medical instrumentation standard retrieval system using ASP.NET, the C# programming language, an IIS Web server, and a SQL Server 2000 database in the .NET environment. The paper also introduces the system structure, the retrieval system modules, the system development environment, and the detailed design of the system.

  1. Editorial for the Bibliometric-enhanced Information Retrieval Workshop at ECIR 2014

    OpenAIRE

    2014-01-01

    This first "Bibliometric-enhanced Information Retrieval" (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offer value-added effects for users. In this workshop we will explore how statistical modelling of scholarship, such as Bradfordizing or network analysis of co-authorship network, can improve retrieval...

  2. A Novel Approach for Information Content Retrieval and Analysis of Bio-Images using Datamining techniques

    Directory of Open Access Journals (Sweden)

    Ayyagari Sri Nagesh

    2012-11-01

    In the biomedical image processing domain, content-based analysis and information retrieval of bio-images is critical for disease diagnosis. Content-Based Image Analysis and Information Retrieval (CBIAIR) has become a significant part of information retrieval technology. One challenge in this area is that the ever-increasing number of digitally acquired bio-images makes brute-force searching almost impossible. The content of structural objects in medical images and the identification of those objects play a significant role in image content analysis and information retrieval. Content-based bio-image retrieval rests on three fundamental concepts: visual-feature extraction, multi-dimensional indexing, and the retrieval system process. Each image has three kinds of content features: colour, texture, and shape. Colour and texture are both important visual features used in content-based image retrieval to improve results. In this paper, we present an effective image retrieval system, called CBIAIR, that uses texture, colour, and shape features. First, we develop a new texture pattern feature as the pixel-based feature of the CBIAIR system. We then use a semantic colour feature as the colour-based feature, and shape-based feature selection is done using an existing technique. For retrieval, these features are extracted from the query image and matched against the feature library using a feature-weighted distance. All feature vectors are stored in the database using an indexing procedure. Finally, the relevant images whose matched distance is below a predefined threshold are retrieved from the image database after adapting the K-NN classifier.

  3. Information Storage and Retrieval, Scientific Report No. ISR-15.

    Science.gov (United States)

    Salton, Gerard

    Several algorithms were investigated which would allow a user to interact with an automatic document retrieval system by requesting relevance judgments on selected sets of documents. Two viewpoints were taken in evaluation. One measured the movement of queries toward the optimum query as defined by Rocchio; the other measured the retrieval…
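
    The relevance-feedback idea mentioned above can be sketched with the widely used Rocchio update, which moves the query vector toward the centroid of documents judged relevant and away from the centroid of those judged non-relevant. The parameter values (`alpha`, `beta`, `gamma`), the toy vectors, and the clipping of negative weights to zero are assumptions for illustration; the report's own algorithms are not given in this record.

```python
def centroid(vectors, dim):
    """Component-wise mean of a list of equal-length vectors."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

def rocchio_update(query, relevant, nonrelevant,
                   alpha=1.0, beta=0.75, gamma=0.15):
    """One round of Rocchio feedback over term-weight vectors.

    alpha, beta, gamma are illustrative defaults, not values from the
    report. Negative term weights are clipped to zero, a common
    simplification.
    """
    dim = len(query)
    rel_c = centroid(relevant, dim)
    non_c = centroid(nonrelevant, dim)
    return [
        max(0.0, alpha * q + beta * r - gamma * n)
        for q, r, n in zip(query, rel_c, non_c)
    ]

# Toy three-term vocabulary; the user judged two documents relevant
# and one non-relevant.
query = [1.0, 0.0, 0.0]
relevant_docs = [[1.0, 1.0, 0.0], [1.0, 1.0, 0.0]]
nonrelevant_docs = [[0.0, 0.0, 1.0]]

updated = rocchio_update(query, relevant_docs, nonrelevant_docs)
```

    Here the updated query gains weight on the second term, which the relevant documents share, while the term that appears only in the non-relevant document stays at zero.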

  4. Data retrieval system provides unlimited hardware design information

    Science.gov (United States)

    Rawson, R. D.; Swanson, R. L.

    1967-01-01

    Data is input to magnetic tape on a single format card that specifies the system, location, and component, the test point identification number, the operator's initials, the date, a data code, and the data itself. This method is efficient for large volume data storage and retrieval, and permits output variations without continuous program modifications.

  5. An Information-Theoretic Privacy Criterion for Query Forgery in Information Retrieval

    CERN Document Server

    Rebollo-Monedero, David; Forné, Jordi

    2011-01-01

    In previous work, we presented a novel information-theoretic privacy criterion for query forgery in the domain of information retrieval. Our criterion measured privacy risk as a divergence between the user's and the population's query distribution, and contemplated the entropy of the user's distribution as a particular case. In this work, we make a twofold contribution. First, we thoroughly interpret and justify the privacy metric proposed in our previous work, elaborating on the intimate connection between the celebrated method of entropy maximization and the use of entropies and divergences as measures of privacy. Secondly, we attempt to bridge the gap between the privacy and the information-theoretic communities by substantially adapting some technicalities of our original work to reach a wider audience, not intimately familiar with information theory and the method of types.
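
    To make the criterion above concrete, the sketch below computes privacy risk as the Kullback-Leibler divergence between a user's query distribution and the population's, and shows that mixing in forged queries lowers it. The four-category distributions, the forgery rate `rho`, and the choice to draw forged queries from the population distribution are assumptions for illustration, not the optimal forgery strategy studied by the authors.

```python
import math

def kl_divergence(p, q):
    """D(p || q) in bits; assumes q[i] > 0 wherever p[i] > 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def entropy(p):
    """Shannon entropy of p in bits."""
    return -sum(pi * math.log2(pi) for pi in p if pi > 0)

def apparent_profile(user, forged, rho):
    """Query distribution seen by an observer when a fraction rho of
    the submitted queries is forged (rho is a hypothetical parameter)."""
    return [(1 - rho) * u + rho * f for u, f in zip(user, forged)]

# Hypothetical distributions over four query categories.
population = [0.25, 0.25, 0.25, 0.25]
user = [0.70, 0.10, 0.10, 0.10]

risk_before = kl_divergence(user, population)

# Simple (not optimal) forgery: draw forged queries from the population.
mixed = apparent_profile(user, population, rho=0.5)
risk_after = kl_divergence(mixed, population)

# With a uniform population over n categories the divergence equals
# log2(n) - H(user), which is the entropy special case.
entropy_form = math.log2(len(user)) - entropy(user)
```

    Because the divergence is convex in its first argument, any mixture with the population distribution cannot increase the measured risk, which is why `risk_after` ends up below `risk_before` in this example.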

  6. Design and Implementation of Automatic Indexing for Information Retrieval with Arabic Documents.

    Science.gov (United States)

    Hmeidi, Ismail; Kanaan, Ghassan; Evens, Martha

    1997-01-01

    Describes an automatic information retrieval system designed and built to handle Arabic data. Discusses the cost-effectiveness of automatic indexing. Compares retrieval results using words as index terms versus stems and roots. Includes 19 tables; 60 queries using full words and relevance judgments are appended. (JAK)

  7. Modeling the Time Course of Feature Perception and Feature Information Retrieval

    Science.gov (United States)

    Kent, Christopher; Lamberts, Koen

    2006-01-01

    Three experiments investigated whether retrieval of information about different dimensions of a visual object varies as a function of the perceptual properties of those dimensions. The experiments involved two perception-based matching tasks and two retrieval-based matching tasks. A signal-to-respond methodology was used in all tasks. A stochastic…

  8. Editorial for the Bibliometric-enhanced Information Retrieval Workshop at ECIR 2014

    NARCIS (Netherlands)

    Mayr, Philipp; Schaer, Philipp; Scharnhorst, Andrea; Mutschke, Peter

    2014-01-01

    This first "Bibliometric-enhanced Information Retrieval" (BIR 2014) workshop aims to engage with the IR community about possible links to bibliometrics and scholarly communication. Bibliometric techniques are not yet widely used to enhance retrieval processes in digital libraries, although they offe

  9. Query-by-Example Music Information Retrieval by Score-Informed Source Separation and Remixing Technologies

    Directory of Open Access Journals (Sweden)

    Goto Masataka

    2010-01-01

    We describe a novel query-by-example (QBE) approach in music information retrieval that allows a user to customize query examples by directly modifying the volume of different instrument parts. The underlying hypothesis of this approach is that the musical mood of retrieved results changes in relation to the volume balance of different instruments. On the basis of this hypothesis, we aim to clarify the relationship between the change in the volume balance of a query and the genre of the retrieved pieces, called genre classification shift. Such an understanding would allow us to instruct users in how to generate alternative queries without finding other appropriate pieces. Our QBE system first separates all instrument parts from the audio signal of a piece with the help of its musical score, and then it allows users to remix these parts to change the acoustic features that represent the musical mood of the piece. Experimental results showed that the genre classification shift was actually caused by the volume change in the vocal, guitar, and drum parts.

  10. Utilization of ontology look-up services in information retrieval for biomedical literature

    OpenAIRE

    2014-01-01

    With the vast amount of biomedical data, we face the necessity to improve information retrieval processes in the biomedical domain. The use of biomedical ontologies facilitated the combination of various data sources (e.g. scientific literature, clinical data repository) by increasing the quality of information retrieval and reducing the maintenance efforts. In this context, we developed Ontology Look-up services (OLS), based on NEWT and MeSH vocabularies. Our services were involved in some inform...

  11. Retrieval practice is an efficient method of enhancing the retention of anatomy and physiology information.

    Science.gov (United States)

    Dobson, John L

    2013-06-01

    Although a great deal of empirical evidence has indicated that retrieval practice is an effective means of promoting learning and memory, very few studies have investigated the strategy in the context of an actual class. The primary purpose of this study was to determine if a series of very brief retrieval quizzes could significantly improve the retention of previously tested information throughout an anatomy and physiology course. A second purpose was to determine if there were any significant differences between expanding and uniform patterns of retrieval that followed a standardized initial retrieval delay. Anatomy and physiology students were assigned to either a control group or groups that were repeatedly prompted to retrieve a subset of previously tested course information via a series of quizzes that were administered on either an expanding or a uniform schedule. Each retrieval group completed a total of 10 retrieval quizzes, and the series of quizzes required (only) a total of 2 h to complete. Final retention of the exam subset material was assessed during the last week of the semester. There were no significant differences between the expanding and uniform retrieval groups, but both retained an average of 41% more of the subset material than did the control group (ANOVA, F = 129.8, P = 0.00, ηp² = 0.36). In conclusion, retrieval practice is a highly efficient and effective strategy for enhancing the retention of anatomy and physiology material.

  12. Intelligent Information Retrieval: Diagnosing Information Need. Part II. Uncertainty Expansion in a Prototype of a Diagnostic IR Tool.

    Science.gov (United States)

    Cole, Charles; Cantero, Pablo; Sauve, Diane

    1998-01-01

    Outlines a prototype of an intelligent information-retrieval tool to facilitate information access for an undergraduate seeking information for a term paper. Topics include diagnosing the information need, Kuhlthau's information-search-process model, Shannon's mathematical theory of communication, and principles of uncertainty expansion and…

  13. Accurate performance estimators for information retrieval based on span bound of support vector machines

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    Support vector machines have met with significant success in the information retrieval field, especially in handling text classification tasks. Although various performance estimators for SVMs have been proposed, these focus only on accuracy, which is based on the leave-one-out cross-validation procedure. Information-retrieval-related performance measures are always neglected in a kernel learning methodology. In this paper, we propose a set of information-retrieval-oriented performance estimators for SVMs, which are based on the span bound of the leave-one-out procedure. Experiments show that our proposed estimators are both effective and stable.

  14. An overview of selected information storage and retrieval issues in computerized document processing

    Science.gov (United States)

    Dominick, Wayne D. (Editor); Ihebuzor, Valentine U.

    1984-01-01

    The rapid development of computerized information storage and retrieval techniques has introduced the possibility of extending the word processing concept to document processing. A major advantage of computerized document processing is the relief of the tedious task of manual editing and composition usually encountered by traditional publishers through the immense speed and storage capacity of computers. Furthermore, computerized document processing provides an author with centralized control, the lack of which is a handicap of the traditional publishing operation. A survey of some computerized document processing techniques is presented with emphasis on related information storage and retrieval issues. String matching algorithms are considered central to document information storage and retrieval and are also discussed.

  15. Retrieval monitoring is influenced by information value: the interplay between importance and confidence on false memory.

    Science.gov (United States)

    McDonough, Ian M; Bui, Dung C; Friedman, Michael C; Castel, Alan D

    2015-10-01

    The perceived value of information can influence one's motivation to successfully remember that information. This study investigated how information value can affect memory search and evaluation processes (i.e., retrieval monitoring). In Experiment 1, participants studied unrelated words associated with low, medium, or high values. Subsequent memory tests required participants to selectively monitor retrieval for different values. False memory effects were smaller when searching memory for high-value than low-value words, suggesting that people more effectively monitored more important information. In Experiment 2, participants studied semantically-related words, and the need for retrieval monitoring was reduced at test by using inclusion instructions (i.e., endorsement of any word related to the studied words) compared with standard instructions. Inclusion instructions led to increases in false recognition for low-value, but not for high-value words, suggesting that under standard-instruction conditions retrieval monitoring was less likely to occur for important information. Experiment 3 showed that words retrieved with lower confidence were associated with more effective retrieval monitoring, suggesting that the quality of the retrieved memory influenced the degree and effectiveness of monitoring processes. Ironically, unless encouraged to do so, people were less likely to carefully monitor important information, even though people want to remember important memories most accurately.

  16. Research on Semantic Retrieval in Network Information Retrieval Environment

    Institute of Scientific and Technical Information of China (English)

    潘可

    2013-01-01

    The Internet has brought convenience to users' information retrieval, but it has also produced a large amount of redundant information, which makes retrieval inefficient. Semantic retrieval greatly improves retrieval efficiency and accuracy by semantically analyzing users' retrieval requirements. Based on an analysis of the basic elements and principles of semantic retrieval, this paper describes three common semantic retrieval systems, which demonstrate the advantages of semantic retrieval in a network information retrieval environment.

  17. Entropy Associated with Information Storage and Its Retrieval

    Directory of Open Access Journals (Sweden)

    Abu Mohamed Alhasan

    2015-08-01

    We provide an entropy analysis for light storage and light retrieval. In this analysis, entropy extraction and reduction in a typical light storage experiment are identified. The spatiotemporal behavior of entropy is presented for the D1 transition in cold sodium atoms. The governing equations are the reduced Maxwell field equations and the Liouville–von Neumann equation for the density matrix of the dressed atom.

  18. Confabulation in Alzheimer's disease: poor encoding and retrieval of over-learned information.

    Science.gov (United States)

    Attali, Eve; De Anna, Francesca; Dubois, Bruno; Dalla Barba, Gianfranco

    2009-01-01

    Patients who confabulate retrieve personal habits, repeated events or over-learned information and mistake them for actually experienced, specific unique events. Although some hypotheses favour a disruption of frontal/executive functions operating at retrieval, the respective involvement of encoding and retrieval processes in confabulation is still controversial. The present study sought to investigate experimentally the involvement of encoding and retrieval processes and the interference of over-learned information in the confabulation of Alzheimer's disease patients. Twenty Alzheimer's disease patients and 20 normal controls encoded and retrieved unknown stories, well-known fairy tales (e.g. Snow White) and modified well-known fairy tales (e.g. Little Red Riding Hood is not eaten by the wolf) under three experimental conditions: (i) full attention at encoding and at retrieval; (ii) divided attention at encoding (i.e. performing an attention demanding secondary task) and full attention at retrieval; (iii) full attention at encoding and divided attention at retrieval. We found that confabulations in Alzheimer's disease patients were more frequent for the modified well-known fairy tales and when encoding was weakened by a concurrent secondary task (61%), compared with the other types of stories and experimental conditions. Confabulations in the modified fairy tales always consisted of elements of the original version of the fairy tale (e.g. Little Red Riding Hood is eaten by the wolf). This is the first experimental evidence showing that poor encoding and over-learned information are involved in confabulation in Alzheimer's disease.

  19. Improving biomedical information retrieval by linear combinations of different query expansion techniques.

    Science.gov (United States)

    Abdulla, Ahmed AbdoAziz Ahmed; Lin, Hongfei; Xu, Bo; Banbhrani, Santosh Kumar

    2016-07-25

    Biomedical literature retrieval is becoming increasingly complex, and there is a fundamental need for advanced information retrieval systems. Information Retrieval (IR) programs scour unstructured materials such as text documents in large reserves of data that are usually stored on computers. IR is concerned with the representation, storage, and organization of information items, as well as access to them. One of the main problems in IR is determining which documents are relevant to the user's needs and which are not. Currently, users cannot construct queries precisely enough to retrieve particular pieces of data from large reserves of data, and basic information retrieval systems produce low-quality search results. In this paper we present a new technique that refines IR searches to better represent the user's information need: we apply different query expansion techniques and linearly combine their results, two expansion results at a time. Query expansion expands the search query, for example, by finding synonyms and reweighting original terms, and it provides significantly more focused, particularized search results than basic search queries do. Retrieval performance is measured by variants of MAP (Mean Average Precision). According to our experimental results, the combination of the best query expansion results improves the retrieved documents, outperforming our baseline by 21.06% and a previous study by 7.12%. We propose several query expansion techniques and their linear combinations to make user queries more cognizable to search engines and to produce higher-quality search results.
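
    A minimal sketch of the two ingredients named in this abstract, a pairwise linear combination of expansion results and an average-precision measure, is given below. The interpolation weight `alpha`, the document identifiers, and the scores are made up for illustration, and the scores are assumed to be already normalized to a comparable range; the abstract does not state the actual weights or expansion techniques used.

```python
def combine_linear(scores_a, scores_b, alpha=0.5):
    """Linearly interpolate two runs' document scores.

    alpha is a hypothetical weight; a document missing from one run
    contributes a score of 0 for that run.
    """
    docs = set(scores_a) | set(scores_b)
    return {
        d: alpha * scores_a.get(d, 0.0) + (1 - alpha) * scores_b.get(d, 0.0)
        for d in docs
    }

def rank(scores):
    """Document ids sorted by descending score (ties broken by id)."""
    return sorted(scores, key=lambda d: (-scores[d], d))

def average_precision(ranking, relevant):
    """Mean of precision@k over the ranks k where a relevant doc appears."""
    hits, total = 0, 0.0
    for k, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
            total += hits / k
    return total / len(relevant) if relevant else 0.0

def mean_average_precision(rankings, relevants):
    """MAP over parallel lists of rankings and relevance sets."""
    aps = [average_precision(r, rel) for r, rel in zip(rankings, relevants)]
    return sum(aps) / len(aps) if aps else 0.0

# Made-up scores from two query expansion runs for a single query.
run_a = {"d1": 0.9, "d2": 0.2, "d3": 0.4}
run_b = {"d2": 0.8, "d3": 0.7, "d4": 0.1}
relevant = {"d2", "d3"}

combined_ranking = rank(combine_linear(run_a, run_b, alpha=0.5))
ap_run_a = average_precision(rank(run_a), relevant)
ap_combined = average_precision(combined_ranking, relevant)
```

    In this toy case the combined run places both relevant documents at the top, so its average precision exceeds that of the first run alone. In practice `alpha` would be tuned on held-out queries.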

  20. Multimodal retrieval of autobiographical memories: sensory information contributes differently to the recollection of events.

    Science.gov (United States)

    Willander, Johan; Sikström, Sverker; Karlsson, Kristina

    2015-01-01

    Previous studies on autobiographical memory have focused on unimodal retrieval cues (i.e., cues pertaining to one modality). However, from an ecological perspective multimodal cues (i.e., cues pertaining to several modalities) are highly important to investigate. In the present study we investigated age distributions and experiential ratings of autobiographical memories retrieved with unimodal and multimodal cues. Sixty-two participants were randomized to one of four cue-conditions: visual, olfactory, auditory, or multimodal. The results showed that the peak of the distributions depends on the modality of the retrieval cue. The results indicated that multimodal retrieval seemed to be driven by visual and auditory information to a larger extent and to a lesser extent by olfactory information. Finally, no differences were observed in the number of retrieved memories or experiential ratings across the four cue-conditions.

  1. Episodic retrieval involves early and sustained effects of reactivating information from encoding.

    Science.gov (United States)

    Johnson, Jeffrey D; Price, Mason H; Leiker, Emily K

    2015-02-01

    Several fMRI studies have shown a correspondence between the brain regions activated during encoding and retrieval, consistent with the view that memory retrieval involves hippocampally-mediated reinstatement of cortical activity. With the limited temporal resolution of fMRI, the precise timing of such reactivation is unclear, calling into question the functional significance of these effects. Whereas reactivation influencing retrieval should emerge with neural correlates of retrieval success, that signifying post-retrieval monitoring would trail retrieval. The present study employed EEG to provide a temporal landmark of retrieval success from which we could investigate the sub-trial time course of reactivation. Pattern-classification analyses revealed that early-onsetting reactivation differentiated the outcome of recognition-memory judgments and was associated with individual differences in behavioral accuracy, while reactivation was also evident in a sustained form later in the trial. The EEG findings suggest that, whereas prior fMRI findings could be interpreted as reflecting the contribution of reinstatement to retrieval success, they could also indicate the maintenance of episodic information in service of post-retrieval evaluation.

  2. 07071 Report on Dagstuhl Seminar -- Web Information Retrieval and Linear Algebra Algorithms

    OpenAIRE

    Frommer, Andreas; Mahoney, Michael W.; Szyld, Daniel B.

    2007-01-01

    A seminar concentrating on the intersection of the fields of information retrieval and other web-related aspects with numerical and applied linear algebra techniques was held with the attendance of scientists from industry and academia.

  3. The IIR evaluation model: a framework for evaluation of interactive information retrieval systems

    Directory of Open Access Journals (Sweden)

    Borlund Pia

    2003-01-01

    Full Text Available An alternative approach to evaluation of interactive information retrieval (IIR) systems is proposed. The model provides a framework for the collection and analysis of IR interaction data.

  4. Scalability of Findability: Decentralized Search and Retrieval in Large Information Networks

    Science.gov (United States)

    Ke, Weimao

    2010-01-01

    Amid the rapid growth of information today is the increasing challenge for people to survive and navigate its magnitude. Dynamics and heterogeneity of large information spaces such as the Web challenge information retrieval in these environments. Collection of information in advance and centralization of IR operations are hardly possible because…

  5. The Effect of Bilingual Term List Size on Dictionary-Based Cross-Language Information Retrieval

    Science.gov (United States)

    2006-01-01

    CLIR) is to support the task of searching multilingual collections by allowing users to enter queries in a language that might be different from that...ELRA Basic Multilingual Lexicon covered common terms quite well, with 97% of the 1,000 most common English words being found (after splitting...English topic descriptions, and binary (yes-no) relevance judgments for topic-document pairs. We used this monolingual test collection with each spe

  6. A Generalized Framework for Ontology-Based Information Retrieval Application to a public-transportation system

    OpenAIRE

    2014-01-01

    In this paper we present a generic framework for ontology-based information retrieval. We focus on the recognition of semantic information extracted from data sources and the mapping of this knowledge into ontology. In order to achieve more scalability, we propose an approach for semantic indexing based on entity retrieval model. In addition, we have used ontology of public transportation domain in order to validate these proposals. Finally, we evaluated our system using ontology mapping and ...

  7. Experimental performance evaluation of using multiset metrics in information retrieval problems

    OpenAIRE

    Негурица, Дмитрий Сергеевич

    2014-01-01

    The paper considers the comparative experimental performance evaluation of multiset metrics in information retrieval problems. The main purpose of the study is to prove, on the basis of actual experimental data, the feasibility of using multisets, as a fundamentally new mathematical tool, in information retrieval problems. As a result of the preliminary statistical analysis of the available data set "Anonymous web data from www.microsoft.com", specific features and pro...

  8. Learning Better Context Characterizations: An Intelligent Information Retrieval Approach

    CERN Document Server

    Lorenzetti, Carlos M

    2010-01-01

    This paper proposes an incremental method that can be used by an intelligent system to learn better descriptions of a thematic context. The method starts with a small number of terms selected from a simple description of the topic under analysis and uses this description as the initial search context. Using these terms, a set of queries is built and submitted to a search engine. New documents and terms are used to refine the learned vocabulary. Evaluations performed on a large number of topics indicate that the learned vocabulary is much more effective than the original one for constructing queries to retrieve relevant material.

  9. Information Retrieval and Search Engines in Full-text Databases

    Directory of Open Access Journals (Sweden)

    Hartmut Zillmann

    2000-06-01

    Full Text Available OSIRIS and ELIB are two cooperative projects at the Osnabrück University supported by the German Research Society (DFG) and the Ministry of Science and Culture (MWK) of Niedersachsen (Germany). They deal with natural language retrieval systems and with indexing techniques for full-text databases using natural language processing. Very complex processes in the context of syntactical and semantical analysis of textual phrases, including sophisticated valuation criteria, could be implemented in large relational databases with online realtime indexing requirements.

  10. Lexical processing and organization in bilingual first language acquisition: Guiding future research.

    Science.gov (United States)

    DeAnda, Stephanie; Poulin-Dubois, Diane; Zesiger, Pascal; Friend, Margaret

    2016-06-01

    A rich body of work in adult bilinguals documents an interconnected lexical network across languages, such that early word retrieval is language independent. This literature has yielded a number of influential models of bilingual semantic memory. However, extant models provide limited predictions about the emergence of lexical organization in bilingual first language acquisition (BFLA). Empirical evidence from monolingual infants suggests that lexical networks emerge early in development as children integrate phonological and semantic information. These findings tell us little about the interaction between 2 languages in early bilingual memory. To date, an understanding of when and how languages interact in early bilingual development is lacking. In this literature review, we present research documenting lexical-semantic development across monolingual and bilingual infants. This is followed by a discussion of current models of bilingual language representation and organization and their ability to account for the available empirical evidence. Together, these theoretical and empirical accounts inform and highlight unexplored areas of research and guide future work on early bilingual memory. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  11. Visual working memory buffers information retrieved from visual long-term memory.

    Science.gov (United States)

    Fukuda, Keisuke; Woodman, Geoffrey F

    2017-05-16

    Human memory is thought to consist of long-term storage and short-term storage mechanisms, the latter known as working memory. Although it has long been assumed that information retrieved from long-term memory is represented in working memory, we lack neural evidence for this and need neural measures that allow us to watch this retrieval into working memory unfold with high temporal resolution. Here, we show that human electrophysiology can be used to track information as it is brought back into working memory during retrieval from long-term memory. Specifically, we found that the retrieval of information from long-term memory was limited to just a few simple objects' worth of information at once, and elicited a pattern of neurophysiological activity similar to that observed when people encode new information into working memory. Our findings suggest that working memory is where information is buffered when being retrieved from long-term memory and reconcile current theories of memory retrieval with classic notions about the memory mechanisms involved.

  12. The Necessity for Information Space Mapping for Information Retrieval on the Semantic Web

    Directory of Open Access Journals (Sweden)

    Gregory B. Newby

    2002-01-01

    Full Text Available The Semantic Web offers exciting possibilities for information retrieval (IR). In IR, we would like systems that go beyond simply matching words in documents and queries, and instead match based on topic, data type, relations among data, and many other qualities. The Semantic Web, through fuzzy matching of information spaces from different sources, will provide for much more specific information seeking than current Web-based search engines or other IR systems. In order to succeed, however, it is necessary to map between the differing schema, metadata standards, namespaces and so forth used by documents on the Semantic Web. This information space mapping may be accomplished by a simple match or table lookup when document sets come from similar or otherwise well-defined domains. When the match is less precise, sets of rules or algorithms may be employed to map between information spaces. When schema or metadata are inconsistent, though, we are left with a data environment similar to the modern Web, and must rely on the context of the documents themselves to determine the mapping between information spaces.

  13. Transfer and distortion of atmospheric information in the satellite temperature retrieval problem

    Science.gov (United States)

    Thompson, O. E.

    1981-01-01

    A systematic approach to investigating the transfer of basic ambient temperature information and its distortion by satellite systems and subsequent analysis algorithms is discussed. The retrieval analysis cycle is derived, the variance spectrum of information is examined as it takes different forms in that process, and the quality and quantity of information existing at each step is compared with the initial ambient temperature information. Temperature retrieval algorithms can smooth, add, or further distort information, depending on how stable the algorithm is and how heavily it is influenced by a priori data.

  14. Implications of Inter-Rater Agreement on a Student Information Retrieval Evaluation

    CERN Document Server

    Schaer, Philipp; Mutschke, Peter

    2010-01-01

    This paper is about an information retrieval evaluation on three different retrieval-supporting services. All three services were designed to compensate for typical problems that arise in metadata-driven Digital Libraries, which are not adequately handled by a simple tf-idf based retrieval. The services are: (1) a co-word analysis based query expansion mechanism and re-ranking via (2) Bradfordizing and (3) author centrality. The services are evaluated with relevance assessments conducted by 73 information science students. Since the students are neither information professionals nor domain experts, the question of inter-rater agreement is taken into consideration. Two important implications emerge: (1) the inter-rater agreement rates were mainly fair to moderate and (2) after a data-cleaning step which erased the assessments with poor agreement rates, the evaluation data shows that the three retrieval services returned disjoint but still relevant result sets.

  15. Event-related potential variations in the encoding and retrieval of different amounts of contextual information.

    Science.gov (United States)

    Estrada-Manilla, Cinthya; Cansino, Selene

    2012-06-15

    Episodic memory events occur within multidimensional contexts; however, the electrophysiological manifestations associated with processing of more than one context have been rarely investigated. The effect of the amount of context on the ERPs was studied using two single and one double source memory tasks and by comparing full and partial context retrieval within a double source task. The single source tasks elicited waveforms with a larger amplitude during successful encoding and retrieval than the double source task. Compared with the waveforms elicited with a full source response, a partial source response elicited waveforms with a smaller amplitude, probably because the retrieval success for one context was combined with the retrieval attempt processes for the missing source. Comparing the tasks revealed that the larger the amount of contextual information processed, the smaller the amplitude of the ERPs, indicating that greater effort or further control processes were required during double source retrieval.

  16. The validation of the Yonsei CArbon Retrieval algorithm with improved aerosol information using GOSAT measurements

    Science.gov (United States)

    Jung, Yeonjin; Kim, Jhoon; Kim, Woogyung; Boesch, Hartmut; Goo, Tae-Young; Cho, Chunho

    2017-04-01

    Although several CO2 retrieval algorithms have been developed to improve our understanding of the carbon cycle, limitations in spatial coverage and uncertainties due to aerosols and thin cirrus clouds remain a problem for monitoring CO2 concentration globally. Based on an optimal estimation method, the Yonsei CArbon Retrieval (YCAR) algorithm was developed to retrieve the column-averaged dry-air mole fraction of carbon dioxide (XCO2) using the Greenhouse Gases Observing SATellite (GOSAT) measurements with optimized a priori CO2 profiles and aerosol models over East Asia. In previous studies, the aerosol optical properties (AOPs) were the most important factors in CO2 retrievals, since AOPs were assumed to be fixed parameters during the retrieval process, resulting in significant XCO2 retrieval errors of up to 2.5 ppm. In this study, to reduce these errors caused by inaccurate aerosol optical information, the YCAR algorithm was improved by taking into account aerosol optical properties and aerosol vertical distribution simultaneously. The CO2 retrievals with two different aerosol approaches have been analyzed using the GOSAT spectra and evaluated through comparison with collocated ground-based observations at several Total Carbon Column Observing Network (TCCON) sites. The improved YCAR algorithm has biases of 0.59±0.48 ppm and 2.16±0.87 ppm at the Saga and Tsukuba sites, respectively, with smaller biases and higher correlation coefficients compared to the GOSAT operational algorithm. In addition, the XCO2 retrievals will be validated at other TCCON sites and an error analysis will be carried out. These results reveal that better aerosol information can improve the accuracy of the CO2 retrieval algorithm and provide more useful XCO2 information with reduced uncertainties. This study is expected to provide useful information for estimating carbon sources and sinks.

  17. Enhanced Trustworthy and High-Quality Information Retrieval System for Web Search Engines

    CERN Document Server

    Ramachandran, S; Joseph, S; Ramaraj, V

    2009-01-01

    The WWW is the most important source of information. However, there is no guarantee of information correctness: search engines retrieve a lot of conflicting information, and the quality of the information provided varies from low to high. We provide enhanced trustworthiness for both specific (entity) and broad (content) queries in web searching. Trustworthiness filtering is based on 5 factors: Provenance, Authority, Age, Popularity, and Related Links. The trustworthiness is calculated from these 5 factors and stored, which improves performance when retrieving trustworthy websites. The calculated trustworthiness is stored only for static websites. Quality is provided based on policies selected by the user. Quality-based ranking of the retrieved trusted information is provided using the WIQA (Web Information Quality Assessment) Framework.

  18. On-Line Retrieval System Design; Part V of Scientific Report No. ISR-18, Information Storage and Retrieval...

    Science.gov (United States)

    Cornell Univ., Ithaca, NY. Dept. of Computer Science.

    On-line retrieval system design is discussed in the two papers which make up Part Five of this report on Salton's Magical Automatic Retriever of Texts (SMART) project. The first paper: "A Prototype On-Line Document Retrieval System" by D. Williamson and R. Williamson outlines a design for a SMART on-line document retrieval system…

  19. Rank by Readability: Document Weighting for Information Retrieval

    Science.gov (United States)

    Newbold, Neil; McLaughlin, Harry; Gillam, Lee

    In this paper, we present a new approach to ranking that considers the reading ability (and motivation) of the user. Web pages can be, increasingly, badly written with unfamiliar words, poor use of syntax, ambiguous phrases and so on. Readability research suggests that experts and motivated readers may overcome confusingly written text, but nevertheless find it an irritation. We investigate using readability to re-rank web pages. We take an extended view of readability that considers the reading level of retrieved web pages using techniques that consider both textual and cognitive factors. Readability of a selection of query results is examined, and a re-ranking on readability is compared to the original ranking. Results to date suggest that considering a view of readability for each reader may increase the probability of relevance to a particular user.

  20. Children’s information retrieval: beyond examining search strategies and interfaces

    NARCIS (Netherlands)

    Jochmann-Mannak, H.E.; Huibers, T.W.C.; Sanders, T.J.M.

    2008-01-01

    The study of children’s information retrieval is still for the greater part untouched territory. Meanwhile, children can become lost in the digital information world, because they are confronted with search interfaces, both designed by and for adults. Most current research on children’s information

  1. Personalizing Information Retrieval Using Task Features, Topic Knowledge, and Task Products

    Science.gov (United States)

    Liu, Jingjing

    2010-01-01

    Personalization of information retrieval tailors search towards individual users to meet their particular information needs by taking into account information about users and their contexts, often through implicit sources of evidence such as user behaviors and contextual factors. The current study looks particularly at users' dwelling behavior,…

  2. Space shuttle program information control and retrieval system feasibility study report

    Science.gov (United States)

    Lingle, C. P.

    1973-01-01

    The feasibility of having a common information management network for space shuttle data is studied. Identified are the information types required, sources and users of the information, and existing techniques for acquiring, storing and retrieving the data. The study concluded that a decentralized system is feasible, and described a recommended development plan for it.

  3. Semi-Automatic Retrieval of Definitional Information: A Northern ...

    African Journals Online (AJOL)

    rbr

    minological data, providing information on the meaning, usage and appropriateness of ..... used to signal definitional information in texts can be identified, it would ..... special field experts, which in turn, might make them more willing to partici-.

  4. Analysis on the Factor of Missing Retrieval and Erroneous Retrieval in the Information Retrieval (信息检索中文献漏检误检因素分析)

    Institute of Scientific and Technical Information of China (English)

    俞平

    2011-01-01

    This paper analyzes the factors behind missed retrieval and erroneous retrieval of documents in information retrieval, and proposes countermeasures against missed and erroneous retrieval in terms of database selection, the determination of search terms, and the formulation of the retrieval strategy.

  5. Information Storage and Retrieval for Probe Storage using Optical Diffraction Patterns

    CERN Document Server

    van Honschoten, Joost; Koelmans, Wabe W; Parnell, Thomas P; Zaboronski, Oleg V

    2011-01-01

    A novel method for fast information retrieval from a probe storage device is considered. It is shown that information can be stored and retrieved using the optical diffraction patterns obtained by the illumination of a large array of cantilevers by a monochromatic light source. In thermo-mechanical probe storage, the information is stored as a sequence of indentations on the polymer medium. To retrieve the information, the array of probes is actuated by applying a bending force to the cantilevers. Probes positioned over indentations experience deflection by the depth of the indentation, probes over the flat media remain un-deflected. Thus the array of actuated probes can be viewed as an irregular optical grating, which creates a data-dependent diffraction pattern when illuminated by laser light. We develop a low complexity modulation scheme, which allows the extraction of information stored in the pattern of indentations on the media from Fourier coefficients of the intensity of the diffraction pattern. We th...

  6. Diachronic Analysis on Users' Information Retrieval Mechanism (用户情报检索机制的历时性分析研究)

    Institute of Scientific and Technical Information of China (English)

    韩毅

    2001-01-01

    The paper analyzes the information retrieval mechanism with a diachronic method, divides the information retrieval process into four stages, namely information need, information want, information demand and retrieval, and information assimilation and use, and, based on the Brooks Equation, examines the information retrieval mechanism in further detail.

  7. Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data

    Directory of Open Access Journals (Sweden)

    Ibrahiem M.M. El Emary

    2005-01-01

    Full Text Available This paper aims to design and build an Automatic Information Retrieval System to handle Arabic data. It also compares the retrieval results obtained with the vector space model under two different indexing methods: full-word indexing and root indexing. The proposed system was implemented using a traditional model technique, the Vector Space Model (VSM), with the cosine similarity measure. The results show that root indexing improved retrieval performance more than full-word indexing on the Arabic documents; furthermore, it reduces the size of the stored data and minimizes the system processing time.
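
    A minimal sketch of vector space ranking with the cosine measure follows. It assumes documents and queries arrive as lists of already-extracted index terms (full words or roots) and uses raw term frequencies as weights. The tokenization and root extraction steps, the function names, and the placeholder terms are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter
from typing import Dict, List


def cosine(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty vector matches nothing
    return dot / (norm_a * norm_b)


def rank_documents(query_terms: List[str],
                   docs: Dict[str, List[str]]) -> List[str]:
    """Order document ids by descending cosine similarity to the query."""
    q = Counter(query_terms)
    scored = [(cosine(q, Counter(terms)), doc_id)
              for doc_id, terms in docs.items()]
    return [doc_id for _, doc_id in sorted(scored, key=lambda p: (-p[0], p[1]))]


# Placeholder index terms standing in for extracted roots.
docs = {
    "a": ["ktb", "drs"],
    "b": ["ktb", "ktb"],
    "c": ["qlm"],
}
ranking = rank_documents(["ktb"], docs)
```

    Switching between full-word and root indexing would change only the term lists passed in, not the similarity computation.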

  8. Evaluation of some Information Retrieval models for Gujarati Ad hoc Monolingual Tasks

    OpenAIRE

    J., Joshi Hardik; Jyoti, Pareek

    2012-01-01

    This paper describes the work towards the Gujarati Ad hoc Monolingual Retrieval task for widely used Information Retrieval (IR) models. We present an indexing baseline for the Gujarati Language represented by Mean Average Precision (MAP) values. Our objective is to obtain a relative picture of a better IR model for the Gujarati Language. Results show that classical IR models like Term Frequency Inverse Document Frequency (TF_IDF) perform better than a few recent probabilistic IR models. Th...
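
    For readers unfamiliar with the metric, the standard (non-interpolated) definition of Mean Average Precision can be sketched as below. The abstract does not say which MAP variant or evaluation tool was used, so this is only the textbook form; the function names and sample runs are hypothetical.

```python
from typing import Iterable, List, Set, Tuple


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average of precision@k over the ranks k where a relevant doc appears.

    Divides by the total number of relevant documents, so relevant
    documents that are never retrieved lower the score.
    """
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for k, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / k
    return total / len(relevant)


def mean_average_precision(runs: Iterable[Tuple[List[str], Set[str]]]) -> float:
    """Mean of average precision over (ranking, relevant set) pairs."""
    runs = list(runs)
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Two hypothetical queries with their rankings and relevance judgments.
runs = [
    (["d1", "d2", "d3", "d4"], {"d1", "d3"}),
    (["d2", "d1"], {"d1"}),
]
map_score = mean_average_precision(runs)
```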

  9. User centered and ontology based information retrieval system for life sciences

    OpenAIRE

    2012-01-01

    Background: Because of the increasing number of electronic resources, designing efficient tools to retrieve and exploit them is a major challenge. Some improvements have been offered by semantic Web technologies and applications based on domain ontologies. In life science, for instance, the Gene Ontology is widely exploited in genomic applications and the Medical Subject Headings is the basis of biomedical publications indexation and information retrieval process proposed by PubMed. H...

  10. Translation Events in Cross-Language Information Retrieval: Lexical Ambiguity, Lexical Holes, Vocabulary Mismatch, and Correct Translations

    OpenAIRE

    Diekema, Anne R.

    2003-01-01

    Cross-Language Information Retrieval (CLIR) systems enable users to formulate queries in their native language to retrieve documents in foreign languages. Because queries and documents in CLIR do not necessarily share the same language, translation is needed before matching can take place. This translation step tends to cause a reduction in the retrieval performance of CLIR as compared to monolingual information retrieval. The prevailing CLIR approach and the focus of this study is quer...

  11. Infectious Cognition: Risk Perception Affects Socially Shared Retrieval-Induced Forgetting of Medical Information.

    Science.gov (United States)

    Coman, Alin; Berry, Jessica N

    2015-12-01

    When speakers selectively retrieve previously learned information, listeners often concurrently, and covertly, retrieve their memories of that information. This concurrent retrieval typically enhances memory for mentioned information (the rehearsal effect) and impairs memory for unmentioned but related information (socially shared retrieval-induced forgetting, SSRIF), relative to memory for unmentioned and unrelated information. Building on research showing that anxiety leads to increased attention to threat-relevant information, we explored whether concurrent retrieval is facilitated in high-anxiety real-world contexts. Participants first learned category-exemplar facts about meningococcal disease. Following a manipulation of perceived risk of infection (low vs. high risk), they listened to a mock radio show in which some of the facts were selectively practiced. Final recall tests showed that the rehearsal effect was equivalent between the two risk conditions, but SSRIF was significantly larger in the high-risk than in the low-risk condition. Thus, the tendency to exaggerate consequences of news events was found to have deleterious consequences.

  12. Distinct regions of prefrontal cortex are associated with the controlled retrieval and selection of social information.

    Science.gov (United States)

    Satpute, Ajay B; Badre, David; Ochsner, Kevin N

    2014-05-01

    Research in social neuroscience has uncovered a social knowledge network that is particularly attuned to making social judgments. However, the processes that are being performed by both regions within this network and those outside of this network that are nevertheless engaged in the service of making a social judgment remain unclear. To help address this, we drew upon research in semantic memory, which suggests that making a semantic judgment engages 2 distinct control processes: A controlled retrieval process, which aids in bringing goal-relevant information to mind from long-term stores, and a selection process, which aids in selecting the information that is goal-relevant from the information retrieved. In a neuroimaging study, we investigated whether controlled retrieval and selection for social information engage distinct portions of both the social knowledge network and regions outside this network. Controlled retrieval for social information engaged an anterior ventrolateral portion of the prefrontal cortex, whereas selection engaged both the dorsomedial prefrontal cortex and temporoparietal junction within the social knowledge network. These results suggest that the social knowledge network may be more involved with the selection of social information than the controlled retrieval of it and incorporates lateral prefrontal regions in accessing memory for making social judgments.

  13. Retrieval of air quality information using image processing technique.

    Science.gov (United States)

    Lim, H. S.; MatJafri, M. Z.; Abdullah, K.; Saleh, N. M.

    2007-04-01

    This paper presents an approach to retrieve the concentration of particulate matter of size less than 10 microns (PM10) from Landsat TM data over Penang Island. The objective of this study is to test the feasibility of using Landsat TM for PM10 mapping using our proposed algorithm. The algorithm was developed based on the aerosol characteristics in the atmosphere. PM10 measurements were collected using a DustTrak Aerosol Monitor 8520 simultaneously with the image acquisition. The station locations of the PM10 measurements were determined using a handheld GPS. The digital numbers were extracted corresponding to the ground-truth locations for each band and then converted into radiance and reflectance values. The surface reflectance was subtracted from the reflectance measured by the satellite [reflectance at the top of the atmosphere, ρ(TOA)] to obtain the atmospheric reflectance. The atmospheric reflectance was then related to PM10 using regression analysis. The surface reflectance values were created using the ATCOR2 image correction software in the PCI Geomatica 9.1.8 image processing software. The proposed algorithm produced high accuracy and showed good agreement (R = 0.8406) between the measured and estimated PM10. This study indicates that it is feasible to use Landsat TM data for mapping PM10 using the proposed algorithm.
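
    The subtraction and regression steps in this abstract can be written out as follows. The symbols for the atmospheric and surface reflectance, the band index i, and the coefficients a_0, a_i are notation introduced here for illustration. The linear multi-band form of the regression is an assumption, since the abstract states only that regression analysis was used.

```latex
% Atmospheric reflectance per band i: top-of-atmosphere minus surface reflectance
\rho_{\mathrm{atm}}(\lambda_i) = \rho_{\mathrm{TOA}}(\lambda_i) - \rho_{\mathrm{surf}}(\lambda_i)

% Assumed linear regression relating PM10 to the atmospheric reflectance in n bands
\mathrm{PM10} \approx a_0 + \sum_{i=1}^{n} a_i \, \rho_{\mathrm{atm}}(\lambda_i)
```

    Under this reading, the coefficients would be fitted against the ground-based PM10 measurements collected at the image acquisition time.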

  14. Information retrieval from holographic interferograms: Fundamentals and problems

    Science.gov (United States)

    Vest, Charles M.

    1987-01-01

    Holographic interferograms can contain large amounts of information about flow and temperature fields. Their information content can be very high because they can be viewed from many different directions. This multidirectionality and fringe localization add to the information contained in the fringe pattern if diffuse illumination is used. Additional information and increased accuracy can be obtained through the use of dual reference wave holography to add reference fringes or to effect discrete phase shift or heterodyne interferometry. Automated analysis of fringes is possible if interferograms are of simple structure and good quality. However, in practice a large number of practical problems can arise, so that a difficult image processing task results.

  15. A Compositional Relevance Model for Adaptive Information Retrieval

    Science.gov (United States)

    Mathe, Nathalie; Chen, James; Lu, Henry, Jr. (Technical Monitor)

    1994-01-01

    There is a growing need for rapid and effective access to information in large electronic documentation systems. Access can be facilitated if information relevant in the current problem solving context can be automatically supplied to the user. This includes information relevant to particular user profiles, tasks being performed, and problems being solved. However, most of this knowledge on contextual relevance is not found within the contents of documents, and current hypermedia tools do not provide any easy mechanism to let users add this knowledge to their documents. We propose a compositional relevance network to automatically acquire the context in which previous information was found relevant. The model records information on the relevance of references based on user feedback for specific queries and contexts. It also generalizes such information to derive relevant references for similar queries and contexts. This model lets users filter information by context of relevance, build personalized views of documents over time, and share their views with other users. It also applies to any type of multimedia information. Compared to other approaches, it is less costly and requires neither a priori statistical computation nor an extended training period. It is currently being implemented in the Computer Integrated Documentation system, which enables integration of various technical documents in a hypertext framework.

  16. NASA access mechanism: Graphical user interface information retrieval system

    Science.gov (United States)

    Hunter, Judy; Generous, Curtis; Duncan, Denise

    1993-01-01

    Access to online information sources of aerospace, scientific, and engineering data, a mission focus for NASA's Scientific and Technical Information Program, has always been limited by factors such as telecommunications, query language syntax, lack of standardization in the information, and the lack of adequate tools to assist in searching. Today, the NASA STI Program's NASA Access Mechanism (NAM) prototype offers a solution to these problems by providing the user with a set of tools that provide a graphical interface to remote, heterogeneous, and distributed information in a manner adaptable to both casual and expert users. Additionally, the NAM provides access to many Internet-based services such as Electronic Mail, the Wide Area Information Servers system, Peer Locating tools, and electronic bulletin boards.

  17. NASA Access Mechanism - Graphical user interface information retrieval system

    Science.gov (United States)

    Hunter, Judy F.; Generous, Curtis; Duncan, Denise

    1993-01-01

    Access to online information sources of aerospace, scientific, and engineering data, a mission focus for NASA's Scientific and Technical Information Program, has always been limited by factors such as telecommunications, query language syntax, lack of standardization in the information, and the lack of adequate tools to assist in searching. Today, the NASA STI Program's NASA Access Mechanism (NAM) prototype offers a solution to these problems by providing the user with a set of tools that provide a graphical interface to remote, heterogeneous, and distributed information in a manner adaptable to both casual and expert users. Additionally, the NAM provides access to many Internet-based services such as Electronic Mail, the Wide Area Information Servers system, Peer Locating tools, and electronic bulletin boards.

  18. The challenge of automated tutoring in Web-based learning environments for information retrieval instruction

    Directory of Open Access Journals (Sweden)

    Sormunen Eero; Pennanen

    2004-01-01

    Full Text Available The need to enhance information literacy education increases demand for effective Web-based learning environments for information retrieval instruction. The paper introduces the Query Performance Analyser, a unique instructional tool for information retrieval learning environments. On top of an information retrieval system and within a given search assignment, the Query Performance Analyser supports learning by instantly visualizing achieved query performance. Although the Query Performance Analyser is a useful tool in training searching skills, performance feedback is not enough for learners practicing alone in Web-based learning environments. The paper reports the findings of a log analysis on user problems in exercising Boolean and best-match queries. A blueprint of an automated tutoring system for IR instruction is presented.
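
    As a rough illustration of the kind of feedback a query performance tool might compute, the sketch below evaluates a Boolean AND query against a toy inverted index and reports precision and recall. The abstract does not say which measures the Query Performance Analyser visualizes, so precision and recall are assumed here as the standard IR measures; the index, the relevance judgments, and the function names are all hypothetical.

```python
def boolean_and(index, terms):
    """Return the set of document ids that contain every query term."""
    postings = [index.get(t, set()) for t in terms]
    return set.intersection(*postings) if postings else set()

def precision_recall(retrieved, relevant):
    """Precision = hits / retrieved; recall = hits / relevant."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

# Toy inverted index: term -> ids of documents containing it (assumed data)
index = {
    "boolean": {1, 2, 3},
    "query": {2, 3, 4},
    "ranking": {4, 5},
}
relevant = {2, 4}  # documents judged relevant for the search assignment

retrieved = boolean_and(index, ["boolean", "query"])
precision, recall = precision_recall(retrieved, relevant)
```

    Here the query retrieves documents 2 and 3, of which only document 2 is relevant, so a learner would see that the query misses half of the relevant documents and returns one irrelevant one.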

  19. NLP-SIR: A Natural Language Approach for Spreadsheet Information Retrieval

    CERN Document Server

    Flood, Derek; Caffery, Fergal Mc

    2009-01-01

    Spreadsheets are a ubiquitous software tool, used for a wide variety of tasks such as financial modelling, statistical analysis and inventory management. Extracting meaningful information from such data can be a difficult task, especially for novice users unfamiliar with the advanced data processing features of many spreadsheet applications. We believe that through the use of Natural Language Processing (NLP) techniques this task can be made considerably easier. This paper introduces NLP-SIR, a Natural language interface for spreadsheet information retrieval. The results of a recent evaluation which compared NLP-SIR with existing Information retrieval tools are also outlined. This evaluation has shown that NLP-SIR is a more effective method of spreadsheet information retrieval.

  20. Energy conservation choices for the City of Portland, Oregon: energy information retrieval system. Volume 2

    Energy Technology Data Exchange (ETDEWEB)

    1980-07-01

    Volume 2 describes the developmental history as well as the technical accomplishment and design of the non-computerized information retrieval system realized under the aegis of the Portland Energy Conservation Demonstration Project (PECDP). It describes an optical coincidence mechanism, a thesaurus, and an indexing procedure that combine to yield a method for selectively storing and retrieving information so that it is readily available in a format acceptable to local government decision makers, the City's capital budgeting process, planning agency personnel, and citizens. PECDP's Energy Information Retrieval System was housed in existing library space of the Portland Bureau of Planning, and since its establishment there has been a 500% increase in library use among Planning staff. (MCW)
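
    In a typical optical coincidence system, each thesaurus term has a card with a hole punched at the position of every document indexed under that term; stacking several cards and holding them to the light shows the documents indexed under all of the terms. The abstract gives no details of PECDP's own cards, so the sketch below only models the general idea as set intersection, with hypothetical terms and document numbers.

```python
def make_card(doc_numbers):
    """A term card: the document numbers punched under one thesaurus term."""
    return frozenset(doc_numbers)

def superimpose(*cards):
    """Stack cards; light passes only where every card has a hole."""
    if not cards:
        return frozenset()
    result = cards[0]
    for card in cards[1:]:
        result &= card  # keep only positions punched on every card so far
    return result

# Hypothetical thesaurus terms and document numbers, for illustration only
cards = {
    "insulation": make_card([3, 8, 15, 21]),
    "residential": make_card([8, 9, 15, 30]),
    "retrofit": make_card([2, 15, 21, 30]),
}

matches = superimpose(cards["insulation"], cards["residential"], cards["retrofit"])
```

    Only document 15 is punched on all three cards, so it is the single match for the combined search.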

  1. Natural Language Query System Design for Interactive Information Storage and Retrieval Systems. M.S. Thesis

    Science.gov (United States)

    Dominick, Wayne D. (Editor); Liu, I-Hsiung

    1985-01-01

    The currently developed multi-level language interfaces of information systems are generally designed for experienced users. These interfaces commonly ignore the nature and needs of the largest user group, i.e., casual users. This research identifies the importance of natural language query system research within information storage and retrieval system development; addresses the topics of developing such a query system; and finally, proposes a framework for the development of natural language query systems in order to facilitate the communication between casual users and information storage and retrieval systems.

  2. Coordinating Council. Tenth Meeting: Information retrieval: The role of controlled vocabularies

    Science.gov (United States)

    1993-01-01

    The theme of this NASA Scientific and Technical Information Program Coordinating Council meeting was the role of controlled vocabularies (thesauri) in information retrieval. Included are summaries of the presentations and the accompanying visuals. Dr. Raya Fidel addressed 'Retrieval: Free Text, Full Text, and Controlled Vocabularies.' Dr. Bella Hass Weinberg spoke on 'Controlled Vocabularies and Thesaurus Standards.' The presentations were followed by a panel discussion with participation from NASA, the National Library of Medicine, the Defense Technical Information Center, and the Department of Energy; this discussion, however, is not summarized in any detail in this document.

  3. MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL

    Directory of Open Access Journals (Sweden)

    G. A. Kukharev

    2017-01-01

    Full Text Available Subject of Research. The paper deals with methods and algorithms for mutual transformation of related pairs of images in order to enhance the capabilities of cross-modal multimedia retrieval (CMMR) technologies. We have thoroughly studied the problem of mutual transformation of face images of various kinds (e.g. photos and drawn pictures). This problem is widely represented in practice. Research in this area is based on existing datasets. The algorithms we have proposed in this paper can be applied to arbitrary pairs of related images due to the unified mathematical specification. Method. We have presented three image transformation algorithms. The first one is based on principal component analysis and the Karhunen-Loève transform (1DPCA/1DKLT). Unlike the existing solution, it does not use the training set during the transformation process. The second algorithm assumes generation of an image population. The third algorithm performs the transformation based on two-dimensional principal component analysis and the Karhunen-Loève transform (2DPCA/2DKLT). Main Results. The experiments on image transformation and population generation have revealed the main features of each algorithm. The first algorithm allows construction of an accurate and stable model of transition between two given sets of images. The second algorithm can be used to add new images to existing bases, and the third algorithm is capable of performing the transformation outside the training dataset. Practical Relevance. Taking into account the qualities of the proposed algorithms, we have provided recommendations concerning their application. Possible scenarios include construction of a transition model for related pairs of images, mutual transformation of the images inside and outside the dataset, as well as population generation in order to increase representativeness of existing datasets. Thus, the proposed algorithms can be used to improve reliability of face recognition performed on images

  4. Interactions among emotional attention, encoding, and retrieval of ambiguous information: An eye-tracking study.

    Science.gov (United States)

    Everaert, Jonas; Koster, Ernst H W

    2015-10-01

    Emotional biases in attention modulate encoding of emotional material into long-term memory, but little is known about the role of such attentional biases during emotional memory retrieval. The present study investigated how emotional biases in memory are related to attentional allocation during retrieval. Forty-nine individuals encoded emotionally positive and negative meanings derived from ambiguous information and then searched their memory for encoded meanings in response to a set of retrieval cues. The remember/know/new procedure was used to classify memories as recollection-based or familiarity-based, and gaze behavior was monitored throughout the task to measure attentional allocation. We found that a bias in sustained attention during recollection-based, but not familiarity-based, retrieval predicted subsequent memory bias toward positive versus negative material following encoding. Thus, during emotional memory retrieval, attention affects controlled forms of retrieval (i.e., recollection) but does not modulate relatively automatic, familiarity-based retrieval. These findings enhance understanding of how distinct components of attention regulate the emotional content of memories. Implications for theoretical models and emotion regulation are discussed.

  5. User centered and ontology based information retrieval system for life sciences

    Directory of Open Access Journals (Sweden)

    Sy Mohameth-François

    2012-01-01

    Full Text Available Abstract. Background: Because of the increasing number of electronic resources, designing efficient tools to retrieve and exploit them is a major challenge. Some improvements have been offered by semantic Web technologies and applications based on domain ontologies. In life science, for instance, the Gene Ontology is widely exploited in genomic applications, and the Medical Subject Headings are the basis of the indexing of biomedical publications and of the information retrieval process proposed by PubMed. However, current search engines suffer from two main drawbacks: there is limited user interaction with the list of retrieved resources, and no explanation of their adequacy to the query is provided. Users may thus be confused by the selection and have no idea how to adapt their queries so that the results match their expectations. Results: This paper describes an information retrieval system that relies on a domain ontology to widen the set of relevant documents retrieved and that uses a graphical rendering of query results to favor user interactions. Semantic proximities between ontology concepts and aggregating models are used to assess the adequacy of documents with respect to a query. The selection of documents is displayed in a semantic map to provide graphical indications that make explicit to what extent they match the user's query; this man/machine interface favors a more interactive and iterative exploration of the data corpus by facilitating query concept weighting and visual explanation. We illustrate the benefit of using this information retrieval system on two case studies, one of which aims at collecting human genes related to transcription factors involved in the hemopoiesis pathway. Conclusions: The ontology-based information retrieval system described in this paper (OBIRS) is freely available at: http://www.ontotoolkit.mines-ales.fr/ObirsClient/. This environment is a first step towards a user centred application in which the system enlightens

  6. Perspectives on Adaptivity in Information Retrieval Interaction (PAIRI)

    DEFF Research Database (Denmark)

    Ingwersen, Peter; Larsen, Birger; Kelly, Diane

    2010-01-01

    Adaptivity in IR interactions requires the IR systems adapting to users’ situations and the users adapting to the systems. System adaption entails dynamic user modeling, effective information architecture and enhanced search features such as search integration and relevance feedback; user adaptation through interactions entails mental model building and modification towards a coherent state of knowledge and learning. The panel is structured as follows. Initially we provide an overview of the panel contents, consisting of four central dimensions of adaptivity in IR interaction. These are adaption 1) through integration of information objects; 2) of information system to searcher; 3) of searcher to information system; and 4) to context and practice. The sequence follows the order of the panellists, i.e., each panellist is the prime mover of a particular dimension.

  7. Using Bayesian networks to support decision-focused information retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Lehner, P.; Elsaesser, C.; Seligman, L. [Mitre Corp., McLean, VA (United States)

    1996-12-31

    This paper describes an approach to controlling the process of pulling data/information from distributed databases in a way that is specific to a person's decision-making context. Our prototype implementation of this approach uses a knowledge-based planner to generate a plan; an automatically constructed Bayesian network to evaluate the plan; specialized processing of the network to derive key information items that would substantially impact the evaluation of the plan (e.g., determine that replanning is needed); and automated construction of Standing Requests for Information (SRIs), which are automated functions that monitor changes and trends in distributed databases that are relevant to the key information items. The emphasis of this paper is on how Bayesian networks are used.

  8. Context Disambiguation Based Semantic Web Search for Effective Information Retrieval

    National Research Council Canada - National Science Library

    M. Barathi; S. Valli

    2011-01-01

    .... To overcome this problem, some search engines suggest terms that are semantically related to the submitted queries, so that users can choose from the suggestions based on their information needs. Approach...

  9. Distributed Knowledge Base Systems for Diagnosis and Information Retrieval.

    Science.gov (United States)

    1988-04-08

    J. and J. D. Seader. (1981). Equilibrium-Stage Separation Operations in Chemical Engineering. New York: John Wiley & Sons, Inc. McDermott, J...representational commitments, and thus in principle they offer alternative information processing theories. Nevertheless, we argue that the hard work of...Information Processing Level Marr (1982) originated the method of information processing (IP) analysis as a way of separating the essential elements of a

  10. A Study of Organizational Information Search, Acquisition, Storage and Retrieval

    Science.gov (United States)

    1986-08-01

    literatures associated with the perspectives are described. The perspectives themselves are first contrasted and then, using a communications framework... corporation division. This selective distribution greatly reduces the information processing load of the many potential receiving units having...of Management, 11: 75-86. Shields, M.D. 1983. Effects of Information Supply and Demand on Judgment Accuracy: Evidence from Corporate Managers. The

  11. The Use of Metadata Visualisation to Assist Information Retrieval

    Science.gov (United States)

    2007-10-01

    aspect of the popularity scale (Ahlberg & Shneiderman, 1994). The different genres (including drama, mystery, comedy, western, horror, action etc...organised with metadata for each item within the library, providing information describing the author, the genre, the title, the publisher, the year it...album title, the track length and the genre of music. Again, any of these pieces of information can be used to quickly search and locate specific

  12. A context driven information retrieval for online urban disaster management system

    OpenAIRE

    Zreik, Khaldoun; Parfouru, Samuel

    2005-01-01

    In: Third International Conference on Innovation in Architecture, Engineering and Construction (AEC'05), Rotterdam, The Netherlands, June 2005. International audience. Most disaster management approaches depend heavily on information and communication management systems. Due to the advances in Informatio...

  13. Information Retrieval Strategies of Millennial Undergraduate Students in Web and Library Database Searches

    Science.gov (United States)

    Porter, Brandi

    2009-01-01

    Millennial students make up a large portion of undergraduate students attending colleges and universities, and they have a variety of online resources available to them to complete academically related information searches, primarily Web based and library-based online information retrieval systems. The content, ease of use, and required search…

  14. Networked Information Retrieval Tools in the Academic Environment: Towards a Cybernetic Library.

    Science.gov (United States)

    Brett, George H., II

    1993-01-01

    Reviews the literature of networked information retrieval tools to explore the concept of an interactive text-based virtual reality environment that would encompass resources currently available on the Internet. Highlights include academic libraries, electronic mail, hypertext navigation systems, wide area information servers, knowbots and…

  15. The Use of Fuzzy Set Theory in Information Retrieval and Databases: A Survey.

    Science.gov (United States)

    Kerre, Etienne E.; And Others

    1986-01-01

    Briefly surveys the numerous applications of fuzzy set theory on data representation and information retrieval. The importance of fuzzy set theory with respect to information systems is illustrated with a bibliography of 86 papers that describe data systems that are somehow "fuzzy." (Author/EM)
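
    To give a flavour of how fuzzy set theory can be applied to retrieval, the sketch below assigns each document a membership degree in [0, 1] for every index term and evaluates a query with the standard fuzzy operators (minimum for AND, maximum for OR, complement for NOT). The survey's abstract does not single out any particular model, so this is only one common formulation; the documents, weights, and function names are assumed for illustration.

```python
def fuzzy_and(*degrees):
    """Fuzzy intersection: the minimum membership degree."""
    return min(degrees)

def fuzzy_or(*degrees):
    """Fuzzy union: the maximum membership degree."""
    return max(degrees)

def fuzzy_not(degree):
    """Fuzzy complement."""
    return 1.0 - degree

# Membership degree of each document in each index term (assumed weights)
weights = {
    "doc1": {"fuzzy": 0.9, "database": 0.4},
    "doc2": {"fuzzy": 0.3, "database": 0.8},
    "doc3": {"fuzzy": 0.7, "database": 0.7},
}

def score(doc_weights):
    """Degree to which a document satisfies the query: fuzzy AND database."""
    return fuzzy_and(doc_weights.get("fuzzy", 0.0), doc_weights.get("database", 0.0))

# Rank documents by how well they satisfy the query, best first
ranking = sorted(weights, key=lambda d: score(weights[d]), reverse=True)
```

    Unlike a strict Boolean query, every document receives a graded score, so doc3 (moderately about both terms) ranks above doc1 (strongly about only one).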

  16. Guidelines for the Establishment and Development of Monolingual Thesauri for Information Retrieval.

    Science.gov (United States)

    United Nations Educational, Scientific, and Cultural Organization, Paris (France).

    These guidelines are applicable in the sector of information and documentation for the preparation and development of monolingual thesauri for information storage and retrieval, irrespective of the technical field being dealt with. No provision has been made for mathematical and structural chemical formulae. The guidelines are drafted for general…

  17. Information Retrieval Strategies of Millennial Undergraduate Students in Web and Library Database Searches

    Science.gov (United States)

    Porter, Brandi

    2009-01-01

    Millennial students make up a large portion of undergraduate students attending colleges and universities, and they have a variety of online resources available to them to complete academically related information searches, primarily Web based and library-based online information retrieval systems. The content, ease of use, and required search…

  18. Not on the Same Page: Undergraduates' Information Retrieval in Electronic and Print Books

    Science.gov (United States)

    Berg, Selinda Adelle; Hoffmann, Kristin; Dawson, Diane

    2010-01-01

    Academic libraries are increasingly collecting e-books, but little research has investigated how students use e-books compared to print texts. This study used a prompted think-aloud method to gain an understanding of the information retrieval behavior of students in both formats. Qualitative analysis identified themes that will inform instruction…

  19. Automatic Identification and Back-Transliteration of Foreign Words for Information Retrieval.

    Science.gov (United States)

    Jeong, Kil Soon; Myaeng, Sung Hyon; Lee, Jae Sung; Choi, Key-Sun

    1999-01-01

    Discusses problems that occur in information retrieval when foreign words appear in Korean texts. Describes an algorithm that identifies a phrase containing a foreign word and then extracts that word based on statistical information, and explains a method for back-transliteration of a foreign word to its English origin. (Author/LRW)

  20. Dissociations within human hippocampal subregions during encoding and retrieval of spatial information.

    Science.gov (United States)

    Suthana, Nanthia; Ekstrom, Arne; Moshirvaziri, Saba; Knowlton, Barbara; Bookheimer, Susan

    2011-07-01

    Although the hippocampus is critical for the formation and retrieval of spatial memories, it is unclear how subregions are differentially involved in these processes. Previous high-resolution functional magnetic resonance imaging (fMRI) studies have shown that CA2, CA3, and dentate gyrus (CA23DG) regions support the encoding of novel associations, whereas the subicular cortices support the retrieval of these learned associations. Whether these subregions are used in humans during encoding and retrieval of spatial information has yet to be explored. Using high-resolution fMRI (1.6 mm × 1.6-mm in-plane), we found that activity within the right CA23DG increased during encoding compared to retrieval. Conversely, right subicular activity increased during retrieval compared to encoding of spatial associations. These results are consistent with the previous studies illustrating dissociations within human hippocampal subregions and further suggest that these regions are similarly involved during the encoding and retrieval of spatial information.

  1. A database approach to information retrieval: The remarkable relationship between language models and region models

    CERN Document Server

    Hiemstra, Djoerd

    2010-01-01

    In this report, we unify two quite distinct approaches to information retrieval: region models and language models. Region models were developed for structured document retrieval. They provide a well-defined behaviour as well as a simple query language that allows application developers to rapidly develop applications. Language models are particularly useful to reason about the ranking of search results, and for developing new ranking approaches. The unified model allows application developers to define complex language modeling approaches as logical queries on a textual database. We show a remarkable one-to-one relationship between region queries and the language models they represent for a wide variety of applications: simple ad-hoc search, cross-language retrieval, video retrieval, and web search.
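
    The language-model side of this unification can be illustrated with a small query-likelihood ranker: each document is scored by the probability that its language model generates the query. This sketch does not reproduce the report's region queries or its unified model; it assumes linear interpolation (Jelinek-Mercer) smoothing with an arbitrary weight of 0.5, and the toy documents and function names are hypothetical.

```python
import math
from collections import Counter

def query_log_likelihood(query, doc, collection, lam=0.5):
    """log P(query | doc), smoothing the document model with the collection model."""
    doc_tf = Counter(doc)
    coll_tf = Counter(collection)
    score = 0.0
    for term in query:
        p_doc = doc_tf[term] / len(doc) if doc else 0.0
        p_coll = coll_tf[term] / len(collection)
        p = lam * p_doc + (1.0 - lam) * p_coll
        if p == 0.0:
            return float("-inf")  # term never occurs anywhere in the collection
        score += math.log(p)
    return score

# Toy collection (assumed data)
docs = {
    "d1": "language models rank search results".split(),
    "d2": "region models describe structured documents".split(),
}
collection = [term for tokens in docs.values() for term in tokens]
query = ["language", "models"]

ranked = sorted(
    docs,
    key=lambda d: query_log_likelihood(query, docs[d], collection),
    reverse=True,
)
```

    Both documents contain "models", but only d1 contains "language", so d1 ranks first; the smoothing keeps d2's score finite instead of discarding it outright.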

  2. Optically secured information retrieval using two authenticated phase-only masks.

    Science.gov (United States)

    Wang, Xiaogang; Chen, Wen; Mei, Shengtao; Chen, Xudong

    2015-10-23

    We propose an algorithm for jointly designing two phase-only masks (POMs) that allow for the encryption and noise-free retrieval of triple images. The images required for optical retrieval are first stored in quick-response (QR) codes for noise-free retrieval and flexible readout. Two sparse POMs are respectively calculated from two different images used as references for authentication, based on a modified Gerchberg-Saxton algorithm (GSA) and pixel extraction, and are then used as support constraints in a modified double-phase retrieval algorithm (MPRA), together with the above-mentioned QR codes. No visible information about the target images or the reference images can be obtained from each of these authenticated POMs. This approach allows users to authenticate the two POMs used for image reconstruction without visual observation of the reference images. It also allows user-friendly access and readout with mobile devices.

  3. 'Meatball searching' - The adversarial approach to online information retrieval

    Science.gov (United States)

    Jack, R. F.

    1985-01-01

    It is proposed that the different styles of online searching can be described as either formal (highly precise) or informal, with the needs of the client dictating which is most applicable at a particular moment. The background and personality of the searcher also come into play. Particular attention is focused on meatball searching, which is a form of online searching characterized by deliberate vagueness. It requires generally comprehensive searches, often on unusual topics and with tight deadlines. It is most likely to occur in search centers serving many different disciplines and levels of client information sophistication. Various information needs are outlined as well as the laws of meatball searching and the adversarial approach. Traits and characteristics important to successful searching include: (1) concept analysis, (2) flexibility of thinking, (3) ability to think in synonyms, and (4) anticipation of variant word forms and spellings.

  4. Utilization of ontology look-up services in information retrieval for biomedical literature.

    Science.gov (United States)

    Vishnyakova, Dina; Pasche, Emilie; Lovis, Christian; Ruch, Patrick

    2013-01-01

    Given the vast amount of biomedical data, we face the need to improve information retrieval processes in the biomedical domain. The use of biomedical ontologies has facilitated the combination of various data sources (e.g. scientific literature, clinical data repositories) by increasing the quality of information retrieval and reducing maintenance efforts. In this context, we developed Ontology Look-up Services (OLS), based on the NEWT and MeSH vocabularies. Our services were involved in several information retrieval tasks, such as gene/disease normalization. The implementation of the OLS significantly accelerated the extraction of particular biomedical facts by structuring and enriching the data context. Precision in normalization tasks was boosted by about 20%.

  5. Semantic annotation for concept-based cross-language medical information retrieval.

    Science.gov (United States)

    Volk, Martin; Ripplinger, Bärbel; Vintar, Spela; Buitelaar, Paul; Raileanu, Diana; Sacaleanu, Bogdan

    2002-12-04

    We present a framework for concept-based cross-language information retrieval in the medical domain, which is under development in the MUCHMORE project. Our approach is based on using the Unified Medical Language System (UMLS) as the primary source of semantic data. Documents and queries are annotated with multiple layers of linguistic information. Linguistic processing includes part-of-speech tagging, morphological analysis, phrase recognition and the identification of medical terms and semantic relations between them. The paper describes experiments in monolingual and cross-language document retrieval, performed on a corpus of medical abstracts. Results show that linguistic processing, especially lemmatization and compound analysis for German, is a crucial step in achieving a good baseline performance. On the other hand, they show that semantic information, specifically the combined use of concepts and relations, increases the performance in monolingual and cross-language retrieval.

  6. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative.

    Science.gov (United States)

    Stober, Sebastian

    2017-01-01

    As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition-such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, electroencephalography (EEG) recordings of music perception and imagination have been made publicly available, enabling MIR researchers to easily test and adapt their existing approaches for music analysis like fingerprinting, beat tracking or tempo estimation on this new kind of data. This paper reports on first results of MIIR experiments using these OpenMIIR datasets and points out how these findings could drive new research in cognitive neuroscience.

  7. Toward Studying Music Cognition with Information Retrieval Techniques: Lessons Learned from the OpenMIIR Initiative

    Directory of Open Access Journals (Sweden)

    Sebastian Stober

    2017-08-01

    Full Text Available As an emerging sub-field of music information retrieval (MIR), music imagery information retrieval (MIIR) aims to retrieve information from brain activity recorded during music cognition, such as listening to or imagining music pieces. This is a highly inter-disciplinary endeavor that requires expertise in MIR as well as cognitive neuroscience and psychology. The OpenMIIR initiative strives to foster collaborations between these fields to advance the state of the art in MIIR. As a first step, electroencephalography (EEG) recordings of music perception and imagination have been made publicly available, enabling MIR researchers to easily test and adapt their existing approaches for music analysis like fingerprinting, beat tracking or tempo estimation on this new kind of data. This paper reports on first results of MIIR experiments using these OpenMIIR datasets and points out how these findings could drive new research in cognitive neuroscience.

  8. Fast and reliable online learning to rank for information retrieval

    NARCIS (Netherlands)

    Hofmann, K.

    2013-01-01

    The amount of digital data we produce every day far surpasses our ability to process this data, and finding useful information in this constant flow of data has become one of the major challenges of the 21st century. Search engines are one way of accessing large data collections. Their algorithms ha

  9. Information Extraction and Linking in a Retrieval Context

    NARCIS (Netherlands)

    Moens, M.F.; Hiemstra, Djoerd

    We witness a growing interest and capabilities of automatic content recognition (often referred to as information extraction) in various media sources that identify entities (e.g. persons, locations and products) and their semantic attributes (e.g., opinions expressed towards persons or products,

  10. Online Cultural Heritage Exhibitions: A Survey of Information Retrieval Features

    Science.gov (United States)

    Liew, Chern Li

    2005-01-01

    Purpose: What kinds of online cultural heritage exhibitions are now available on the internet? How far have these cultural heritage institutions voyaged in terms of harnessing the power of information and communication technology and the interactivity of multimedia systems to exhibit cultural heritage resources? This study aims to highlight the…

  11. Content-Based Information Retrieval from Forensic Databases

    NARCIS (Netherlands)

    Geradts, Z.J.M.H.

    2002-01-01

    In forensic science, the number of image databases is growing rapidly. For this reason, it is necessary to have a proper procedure for searching in these images databases based on content. The use of image databases results in more solved crimes; furthermore, statistical information can be obtained

  12. Patent Database : A Methodology of Information Retrieval From PDF

    Directory of Open Access Journals (Sweden)

    Pawan Sharma

    2013-10-01

    Full Text Available A patent document holds a wealth of information in itself. A brief detail of Indian patent application information is published as an eighteen-month publication by the Indian Patent Office, in a weekly electronic gazette. To date, a proper database of Indian patents specifically for research determination has not been available, making it complicated for researchers to use this data for measuring any kind of research activity in terms of patents in India. To facilitate this, we constructed a comprehensive patent database which incorporates the information presented in the electronic gazette. This database includes information such as technology class, applicant, inventor, country of origin, etc., of the patent submitted. We present the methodology for the creation of this database and its basic features, along with its accuracy and reliability, in this research paper. The patent database has been developed and can be used for various innovation research activities.

  13. Energy for agriculture. A computerized information retrieval system

    Energy Technology Data Exchange (ETDEWEB)

    Stout, B.A.; Myers, C.A. (comps.)

    1979-12-01

    Energy may come from the sun or the earth or be the product of plant materials or agricultural wastes. Whatever its source, energy is indispensable to our way of life, beginning with the production, processing, and distribution of abundant, high quality food and fiber supplies. This specialized bibliography on the subject of energy for agriculture contains 2613 citations to the literature for 1973 through May 1979. Originally issued by Michigan State University (MSU), it is being reprinted and distributed by the U.S. Department of Agriculture. The literature citations will be incorporated into AGRICOLA (Agricultural On-Line Access), the comprehensive bibliographic data base maintained by Technical Information Systems (TIS), a component of USDA's Science and Education Administration (SEA). The citations and the listing of research projects will be combined with other relevant references to provide a continuously updated source of information on energy programs in the agricultural field. No abstracts are included.

  14. Analysis on Recommended System for Web Information Retrieval Using HMM

    Directory of Open Access Journals (Sweden)

    Himangni Rathore

    2014-11-01

    Full Text Available The Web is a rich domain of data and knowledge, spread over the world in an unstructured manner. Users continuously access information over the internet. Web mining is an application of data mining in which web-related data is extracted and manipulated to obtain knowledge. It is further divided into three major domains: web usage mining, web content mining and web structure mining. The proposed work is intended to work with web usage mining. The aim is to improve user feedback and the discovery of user navigation patterns for a CRM system. Finally, a new algorithm, HMM, is used to find patterns in the data, a method that promises to provide more accurate recommendations.

  15. Perspectives on Adaptivity in Information Retrieval Interaction (PAIRI)

    DEFF Research Database (Denmark)

    Ingwersen, Peter; Larsen, Birger; Kelly, Diane;

    2010-01-01

    Adaptivity in IR interactions requires the IR systems adapting to users’ situations and the users adapting to the systems. System adaption entails dynamic user modeling, effective information architecture and enhanced search features such as search integration and relevance feedback; user...... adaptation through interactions entails mental model building and modification towards a coherent state of knowledge and learning. The panel is structured as follows. Initially we provide an overview of the panel contents, consisting of four central dimensions of adaptivity in IR interaction...

  16. ONTOLOGY BASED WEB PAGE ANNOTATION FOR EFFECTIVE INFORMATION RETRIEVAL

    Directory of Open Access Journals (Sweden)

    S.Kalarani

    2010-11-01

    Full Text Available Today’s World Wide Web has a large volume of data – billions of documents. So it is a time-consuming process to discover effective knowledge from the input data. With today's keyword approach, the amount of time and effort required to find the right information is directly proportional to the amount of information on the web. The web has grown exponentially and people are forced to spend more and more time searching for the information they are looking for. Lack of personalization, as well as the inability to easily separate commercial from non-commercial searches, is among the other limitations of today's web search technologies. This paper proposes a prototype relation-based search engine, “OntoLook”, which has been designed in a virtual semantic web environment. The architecture has been proposed. The Semantic Web is well recognized as an effective infrastructure to enhance the visibility of knowledge on the Web. The core of the Semantic Web is “ontology”, which is used to explicitly represent our conceptualizations. Ontology engineering in the Semantic Web is primarily supported by languages such as RDF, RDFS and OWL. This paper discusses the requirements of ontology in the context of the Web, compares the above three languages with existing knowledge representation formalisms, and surveys tools for managing and applying ontology. Advantages of using ontology in both knowledge-base-style and database-style applications are demonstrated using one real-world application.

  17. Hybrid Quantum-Classical Protocol for Storage and Retrieval of Discrete-Valued Information

    Directory of Open Access Journals (Sweden)

    Abdullah M. Iliyasu

    2014-06-01

    Full Text Available In this paper we present a hybrid (i.e., quantum-classical) adaptive protocol for the storage and retrieval of discrete-valued information. The purpose of this paper is to introduce a procedure that exhibits how to store and retrieve unanticipated information values by using a quantum property, that of using different vector space bases for preparation and measurement of quantum states. This simple idea leads to an interesting old wish in Artificial Intelligence: the development of computer systems that can incorporate new knowledge on a real-time basis just by hardware manipulation.

  18. Integrating the humanities in the education of health professionals: implications for search and retrieval of information.

    Science.gov (United States)

    Polson, Robert G; Farmer, Elizabeth S

    2002-03-01

    This article examines the increasing use of the humanities in the education of health professionals and posits that the approach may be of use in teaching health professionals information search and retrieval skills. However little evidence exists to support the educational effectiveness of using the humanities. This lack of evidence raises concerns about the costs of financing this approach to learning. These costs include the issue of copyright which cannot be ignored. While the humanities might provide a more attractive approach to teaching information search and retrieval skills, further research is needed to justify the costs of this approach to learning in more general terms and urgent attention to.

  19. Access to opportunities for bilingualism for individuals with developmental disabilities: Key informant interviews

    NARCIS (Netherlands)

    Scherba de Valenzuela, J.; Kay-Raining Bird, E.; Parkington, K.; Mirenda, P.; Cain, K.; MacLeod, A.A.N.; Segers, P.C.J.

    2016-01-01

    The purpose of this article is to describe the results of a thematic analysis of 79 semi-structured interviews collected at six research sites in four countries in relation to the inclusion and exclusion of students with developmental disabilities (DD) in and from special education and bilingual

  20. Access to opportunities for bilingualism for individuals with developmental disabilities: Key informant interviews

    NARCIS (Netherlands)

    Scherba de Valenzuela, J.; Kay-Raining Bird, E.; Parkington, K.; Mirenda, P.; Cain, K.; MacLeod, A.A.N.; Segers, P.C.J.

    2016-01-01

    The purpose of this article is to describe the results of a thematic analysis of 79 semi-structured interviews collected at six research sites in four countries in relation to the inclusion and exclusion of students with developmental disabilities (DD) in and from special education and bilingual opp

  1. Are Bilingual Children Better at Ignoring Perceptually Misleading Information? A Novel Test

    Science.gov (United States)

    Goldman, Meghan C.; Negen, James; Sarnecka, Barbara W.

    2014-01-01

    Does speaking more than one language help a child perform better on certain types of cognitive tasks? One possibility is that bilingualism confers either specific or general cognitive advantages on tasks that require selective attention to one dimension over another (e.g. Bialystok, [Bialystok, E., 2001]; Hilchey & Klein, [Hilchey, M.D.,…

  2. An Intelligent Information Retrieval Approach Based on Two Degrees of Uncertainty Fuzzy Ontology

    Directory of Open Access Journals (Sweden)

    Maryam Hourali

    2011-01-01

    Full Text Available In spite of the voluminous studies in the field of intelligent retrieval systems, effective retrieval of information has remained an important unsolved problem. Incorporating conceptual knowledge such as ontology into the information retrieval process has been considered as a solution to enhance the quality of results. However, the conceptual formalism supported by a typical ontology may not be sufficient to represent uncertain information, due to the lack of clear-cut boundaries between the concepts of a domain. To tackle this type of problem, one possible solution is to introduce fuzzy logic into the ontology construction process. In this article, a novel approach for fuzzy ontology generation with two uncertainty degrees is proposed. By implementing linguistic variables, the uncertainty level in the domain's concepts (the Software Maintenance Engineering (SME) domain) has been modeled, and ontology relations have in turn been modeled by fuzzy theory. Then, we combined these uncertain models and proposed a new ontology with two degrees of uncertainty, both in concept expression and in relation expression. The generated fuzzy ontology was used to expand initial user queries in the SME domain. Experimental results showed that the proposed model has better overall retrieval performance compared to keyword-based or crisp ontology-based retrieval systems.
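
    The query expansion step described in this abstract can be illustrated with a minimal sketch. The abstract does not say how the two uncertainty degrees are combined, so the sketch assumes the min t-norm (fuzzy AND) of a concept membership degree and a relation degree; the function name, the threshold and the sample terms are hypothetical and are not taken from the paper.

```python
def expand_query(terms, concept_degree, relations, threshold=0.5):
    """Expand query terms using a fuzzy ontology with two uncertainty degrees.

    concept_degree: {concept: membership degree in the domain, 0..1}
    relations:      {term: {related concept: relation degree, 0..1}}
    Assumption: the two degrees are combined with min(), a common fuzzy AND.
    """
    # Original query terms keep full weight.
    expanded = {t: 1.0 for t in terms}
    for t in terms:
        for related, rel_degree in relations.get(t, {}).items():
            weight = min(concept_degree.get(related, 0.0), rel_degree)
            if weight >= threshold:
                # Keep the strongest weight if a concept is reachable twice.
                expanded[related] = max(expanded.get(related, 0.0), weight)
    return expanded


# Hypothetical SME-flavoured example data.
concepts = {"restructuring": 0.9, "debugging": 0.4}
rels = {"refactoring": {"restructuring": 0.8, "debugging": 0.9}}
print(expand_query(["refactoring"], concepts, rels))
```

    With a crisp ontology every related concept would be added with equal weight; here a weakly defined concept ("debugging" in the sample data) falls below the threshold even though its relation to the query term is strong.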

  3. Information retrieval for OCR documents: a content-based probabilistic correction model

    Science.gov (United States)

    Jin, Rong; Zhai, ChangXiang; Hauptmann, Alexander

    2003-01-01

    The difficulty with information retrieval for OCR documents lies in the fact that OCR documents contain a significant amount of erroneous words and unfortunately most information retrieval techniques rely heavily on word matching between documents and queries. In this paper, we propose a general content-based correction model that can work on top of an existing OCR correction tool to "boost" retrieval performance. The basic idea of this correction model is to exploit the whole content of a document to supplement any other useful information provided by an existing OCR correction tool for word corrections. Instead of making an explicit correction decision for each erroneous word as typically done in a traditional approach, we consider the uncertainties in such correction decisions and compute an estimate of the original "uncorrupted" document language model accordingly. The document language model can then be used for retrieval with a language modeling retrieval approach. Evaluation using the TREC standard testing collections indicates that our method significantly improves the performance compared with simple word correction approaches such as using only the top ranked correction.
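
    The idea of keeping correction uncertainty, rather than committing to the top-ranked correction, can be sketched as follows. This is an illustration under stated assumptions and not the authors' model: the document language model is built from expected word counts over hypothetical correction candidates, and documents are scored by query likelihood with Jelinek-Mercer smoothing. All names, probabilities and the smoothing weight are assumptions.

```python
import math
from collections import defaultdict


def expected_doc_model(ocr_tokens, candidates):
    """Unigram document model from expected counts over correction candidates.

    candidates: {ocr token: {candidate word: probability}}; a token with no
    entry is assumed to have been recognized correctly.
    """
    counts = defaultdict(float)
    for tok in ocr_tokens:
        for word, p in candidates.get(tok, {tok: 1.0}).items():
            counts[word] += p  # fractional count, weighted by correction probability
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def query_log_likelihood(query, doc_model, collection_model, lam=0.5):
    """Query likelihood with Jelinek-Mercer smoothing (lam is an assumed weight)."""
    score = 0.0
    for w in query:
        p = lam * doc_model.get(w, 0.0) + (1 - lam) * collection_model.get(w, 1e-6)
        score += math.log(p)
    return score


# Hypothetical OCR output with one corrupted token and two candidate corrections.
tokens = ["retrieva1", "model"]
soft = expected_doc_model(tokens, {"retrieva1": {"retrieval": 0.7, "retrieved": 0.3}})
hard = expected_doc_model(tokens, {"retrieva1": {"retrieval": 1.0}})  # top-1 only
background = {"retrieval": 0.01, "retrieved": 0.01, "model": 0.01}
print(query_log_likelihood(["retrieved"], soft, background))
print(query_log_likelihood(["retrieved"], hard, background))
```

    In the sample data, a query for the lower-ranked candidate still receives probability mass from the document under the soft model, whereas the top-1 model falls back on the background collection alone.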

  4. Understanding vaccination resistance: vaccine search term selection bias and the valence of retrieved information.

    Science.gov (United States)

    Ruiz, Jeanette B; Bell, Robert A

    2014-10-07

    Dubious vaccination-related information on the Internet leads some parents to opt out of vaccinating their children. To determine if negative, neutral and positive search terms retrieve vaccination information that differs in valence and confirms searchers' assumptions about vaccination. A content analysis of first-page Google search results was conducted using three negative, three neutral, and three positive search terms for the concepts "vaccine," "vaccination," and "MMR"; 84 of the 90 websites retrieved met inclusion requirements. Two coders independently and reliably coded for the presence or absence of each of 15 myths about vaccination (e.g., "vaccines cause autism"), statements that countered these myths, and recommendations for or against vaccination. Data were analyzed using descriptive statistics. Across all websites, at least one myth was perpetuated on 16.7% of websites and at least one myth was countered on 64.3% of websites. The mean number of myths perpetuated on websites retrieved with negative, neutral, and positive search terms, respectively, was 1.93, 0.53, and 0.40. The mean number of myths countered on websites retrieved with negative, neutral, and positive search terms, respectively, was 3.0, 3.27, and 2.87. Explicit recommendations regarding vaccination were offered on 22.6% of websites. A recommendation against vaccination was more often made on websites retrieved with negative search terms (37.5% of recommendations) than on websites retrieved with neutral (12.5%) or positive (0%) search terms. The concerned parent who seeks information about the risks of childhood immunizations will find more websites that perpetuate vaccine myths and recommend against vaccination than the parent who seeks information about the benefits of vaccination. This suggests that search term valence can lead to online information that supports concerned parents' misconceptions about vaccines. Copyright © 2014 Elsevier Ltd. All rights reserved.

  5. Better late than never: information retrieval from black holes.

    Science.gov (United States)

    Braunstein, Samuel L; Pirandola, Stefano; Życzkowski, Karol

    2013-03-08

    We show that, in order to preserve the equivalence principle until late times in unitarily evaporating black holes, the thermodynamic entropy of a black hole must be primarily entropy of entanglement across the event horizon. For such black holes, we show that the information entering a black hole becomes encoded in correlations within a tripartite quantum state, the quantum analogue of a one-time pad, and is only decoded into the outgoing radiation very late in the evaporation. This behavior generically describes the unitary evaporation of highly entangled black holes and requires no specially designed evolution. Our work suggests the existence of a matter-field sum rule for any fundamental theory.

  6. The man/machine interface in information retrieval: Providing access to the casual user

    Science.gov (United States)

    Dominick, Wayne D. (Editor); Granier, Martin

    1984-01-01

    This study is concerned with the difficulties encountered by casual users wishing to employ Information Storage and Retrieval Systems. A casual user is defined as a professional who has neither time nor desire to pursue in depth the study of the numerous and varied retrieval systems. His needs for on-line search are only occasional, and not limited to any particular system. The paper takes a close look at the state of the art of research concerned with aiding casual users of Information Storage and Retrieval Systems. Current experiments such as LEXIS, CONIT, IIDA, CITE, and CCL are presented and discussed. Comments and proposals are offered, specifically in the areas of training, learning and cost as experienced by the casual user. An extensive bibliography of recent works on the subject follows the text.

  7. An Ontology Based Crawler for Retrieving Information Distributed on the Web

    Directory of Open Access Journals (Sweden)

    Wael A. Gab–Allah

    2016-06-01

    Full Text Available One of the principal motivations for the creation of the Web was to retrieve information in a fast and easy way. So, building systems for retrieving distributed information is crucially essential. This paper introduces an ontology based focused crawling system that exhibits high recall and high precision. The reason behind the power of the system is two–fold. First, it is focused, thanks to the underlying ontology–based retrieval subsystem. Second, it operates in two phases, one to increase recall and the other to increase precision. We have implemented the proposed system using the Python language and the WordNet taxonomy. The results obtained by the system are given at the end of the paper and show clearly that it outperforms general purpose crawling systems built on approaches such as breadth first search.
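
    The contrast between a focused crawler and breadth first search can be sketched in a few lines. This sketch shows only the general idea of ontology-guided focusing; it does not reproduce the paper's two-phase design or its use of WordNet. The relevance measure (fraction of ontology terms found on a page), the threshold and the in-memory "web" are all hypothetical.

```python
import heapq


def relevance(text, ontology_terms):
    """Fraction of ontology terms that occur on the page (an assumed measure)."""
    words = set(text.lower().split())
    return len(words & ontology_terms) / len(ontology_terms)


def focused_crawl(seed, pages, links, ontology_terms, min_score=0.5, limit=10):
    """Best-first crawl over an in-memory graph; off-topic pages are not expanded."""
    frontier = [(-1.0, seed)]  # max-heap emulated with negated priorities
    seen = {seed}
    kept = []
    while frontier and len(kept) < limit:
        _, url = heapq.heappop(frontier)
        score = relevance(pages[url], ontology_terms)
        if score < min_score:
            continue  # neither keep the page nor follow its links
        kept.append(url)
        for nxt in links.get(url, []):
            if nxt not in seen:
                seen.add(nxt)
                # A link inherits the score of the page it was found on.
                heapq.heappush(frontier, (-score, nxt))
    return kept


# Hypothetical toy web: page "b" is off-topic, so "d" behind it is never fetched.
pages = {
    "a": "ontology crawler web",
    "b": "cooking recipes",
    "c": "web ontology retrieval",
    "d": "crawler ontology",
}
links = {"a": ["b", "c"], "b": ["d"], "c": []}
print(focused_crawl("a", pages, links, {"ontology", "crawler", "web"}))
```

    A breadth first crawler would fetch every reachable page in the toy graph, including those behind the off-topic one; pruning at off-topic pages trades some recall for precision, which is presumably why the system described above adds a separate recall-oriented phase.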

  8. Information storage and retrieval in a single levitating colloidal particle

    Science.gov (United States)

    Myers, Christopher J.; Celebrano, Michele; Krishnan, Madhavi

    2015-10-01

    The binary switch is a basic component of digital information. From phase-change alloys to nanomechanical beams, molecules and atoms, new strategies for controlled bistability hold great interest for emerging technologies. We present a generic methodology for precise and parallel spatiotemporal control of nanometre-scale matter in a fluid, and demonstrate the ability to attain digital functionalities such as switching, gating and data storage in a single colloid, with further implications for signal amplification and logic operations. This fluid-phase bit can be arrayed at high densities, manipulated by either electrical or optical fields, supports low-energy, high-speed operation and marks a first step toward ‘colloidal information’. The principle generalizes to any system where spatial perturbation of a particle elicits a differential response amenable to readout.

  9. Comparing the Scale of Web Subject Directories Precision in Technical-Engineering Information Retrieval

    Directory of Open Access Journals (Sweden)

    Mehrdokht Wazirpour Keshmiri

    2012-07-01

    Full Text Available The main purpose of this research was to compare the precision of web subject directories in retrieving technical-engineering information. Information gathering was documentary and webometric. Keywords in technical-engineering science were chosen for twenty different subjects from IEEE (Institute of Electrical and Electronics Engineers) and from engineering magazines hosted on the ScienceDirect site. These keywords were used in five high-utilization web subject directories: Yahoo, Google, Infomine, Intute, and Dmoz. Because the first results returned by search tools are usually the ones most closely related to the search keywords, the first ten results of every search were evaluated. The assessment consisted of the degree of precision, the degree of error, and the proportion of items retrieved in technical-engineering categories relative to all items retrieved. The criteria used to determine precision, drawn from widely used standards in different documents, consisted of the presence of the keywords in the title, the appearance of keywords in the body of the retrieved web pages, keyword adjacency, the URL of the page, the page description, and subject categories. The analysis used the Kruskal-Wallis test and Fisher's LSD. Results revealed a significant difference in the precision of web subject directories in technical-engineering information retrieval, so the hypothesis was confirmed. Ranked by precision, the web subject directories were: Google, Yahoo, Intute, Dmoz, and Infomine. The degree of error observed in the first results was another criterion used for comparing web subject directories. In this research, Yahoo had the lowest error and Infomine the highest. This research also compared the number of items retrieved across all categories of the web subject directories with the number retrieved in technical-engineering categories; results revealed a significant difference between them.
And

  10. A new model of information behaviour based on the Search Situation Transition schema Information searching, Information behaviour, Behavior, Information retrieval, Information seeking

    Directory of Open Access Journals (Sweden)

    Nils Pharo

    2004-01-01

    Full Text Available This paper presents a conceptual model of information behaviour. The model is part of the Search Situation Transition method schema. The method schema is developed to discover and analyse interplay between phenomena traditionally analysed as factors influencing either information retrieval or information seeking. In this paper the focus is on the model's five main categories: the work task, the searcher, the social/organisational environment, the search task, and the search process. In particular, the search process and its sub-categories search situation and transition and the relationship between these are discussed. To justify the method schema an empirical study was designed according to the schema's specifications. In the paper a subset of the study is presented analysing the effects of work tasks on Web information searching. Findings from this small-scale study indicate a strong relationship between the work task goal and the level of relevance used for judging resources during search processes.

  11. 42 CFR 433.127 - Termination of FFP for failure to provide access to claims processing and information retrieval...

    Science.gov (United States)

    2010-10-01

    ... claims processing and information retrieval systems. 433.127 Section 433.127 Public Health CENTERS FOR... PROGRAMS STATE FISCAL ADMINISTRATION Mechanized Claims Processing and Information Retrieval Systems § 433.127 Termination of FFP for failure to provide access to claims processing and information...

  12. Learning and Relevance in Information Retrieval: A Study in the Application of Exploration and User Knowledge to Enhance Performance

    Science.gov (United States)

    Hyman, Harvey

    2012-01-01

    This dissertation examines the impact of exploration and learning upon eDiscovery information retrieval; it is written in three parts. Part I contains foundational concepts and background on the topics of information retrieval and eDiscovery. This part informs the reader about the research frameworks, methodologies, data collection, and…

  13. Confabulation in healthy aging is related to poor encoding and retrieval of over-learned information.

    Science.gov (United States)

    Attali, Eve; Dalla Barba, Gianfranco

    2013-01-01

    Normal aging is characterized by deficits that cross multiple cognitive domains including episodic memory and attention. Compared to young adults (YA), older adults (OA) not only show reduction in true memories, but also an increase in false memories. In this study we aim to elucidate how the production of confabulation is influenced by encoding and retrieval processes. We hypothesized that in OA, compared to YA, over-learned information interferes with the recall of specific, unique past episodes and this interference should be more prominent when a concurrent task perturbs the encoding of the episodes to be recalled. We tested this hypothesis using an experimental paradigm in which a group of OA and a group of YA had to recall three different types of story: a previously unknown story, a well-known fairy tale (Snow White), and a modified well-known fairy tale (Little Red Riding Hood is not eaten by the wolf), in three different experimental conditions: (1) free encoding and free retrieval; (2) Divided attention (DA) at encoding and free retrieval; and (3) free encoding and DA at retrieval. Results showed that OA produced significantly more confabulations than YA, particularly, in the recall of the modified fairy tale. Moreover, DA at encoding markedly increased the number of confabulations, whereas DA at retrieval had no effect on confabulation. Our findings reveal the implications of two phenomena in the production of confabulation in normal aging: the effect of poor encoding and the interference of strongly represented, over-learned information in episodic memory recall.

  14. An overview of the National Space Science Data Center Standard Information Retrieval System (SIRS)

    Science.gov (United States)

    Shapiro, A.; Blecher, S.; Verson, E. E.; King, M. L. (Editor)

    1974-01-01

    A general overview is given of the National Space Science Data Center (NSSDC) Standard Information Retrieval System. A description is given, in general terms, of the information system that contains the data files and of the software system that processes and manipulates the files maintained at the Data Center. Emphasis is placed on providing users with an overview of the capabilities and uses of the NSSDC Standard Information Retrieval System (SIRS). Examples given are taken from the files at the Data Center. Detailed information about NSSDC data files is documented in a set of File Users Guides, with one user's guide prepared for each file processed by SIRS. Detailed information about SIRS is presented in the SIRS Users Guide.

  15. Hybrid ontology for semantic information retrieval model using keyword matching indexing system.

    Science.gov (United States)

    Uthayan, K R; Mala, G S Anandha

    2015-01-01

    Ontology is the process of growth and elucidation of concepts of an information domain being common for a group of users. Establishing ontology into information retrieval is a normal method to develop searching effects of relevant information users require. Keywords matching process with historical or information domain is significant in recent calculations for assisting the best match for specific input queries. This research presents a better querying mechanism for information retrieval which integrates the ontology queries with keyword search. The ontology-based query is changed into a primary order to predicate logic uncertainty which is used for routing the query to the appropriate servers. Matching algorithms characterize warm area of researches in computer science and artificial intelligence. In text matching, it is more dependable to study semantics model and query for conditions of semantic matching. This research develops the semantic matching results between input queries and information in ontology field. The contributed algorithm is a hybrid method that is based on matching extracted instances from the queries and information field. The queries and information domain is focused on semantic matching, to discover the best match and to progress the executive process. In conclusion, the hybrid ontology in semantic web is sufficient to retrieve the documents when compared to standard ontology.

  16. Hybrid Ontology for Semantic Information Retrieval Model Using Keyword Matching Indexing System

    Directory of Open Access Journals (Sweden)

    K. R. Uthayan

    2015-01-01

    Full Text Available Ontology is the process of growth and elucidation of concepts of an information domain being common for a group of users. Establishing ontology into information retrieval is a normal method to develop searching effects of relevant information users require. Keywords matching process with historical or information domain is significant in recent calculations for assisting the best match for specific input queries. This research presents a better querying mechanism for information retrieval which integrates the ontology queries with keyword search. The ontology-based query is changed into a primary order to predicate logic uncertainty which is used for routing the query to the appropriate servers. Matching algorithms characterize warm area of researches in computer science and artificial intelligence. In text matching, it is more dependable to study semantics model and query for conditions of semantic matching. This research develops the semantic matching results between input queries and information in ontology field. The contributed algorithm is a hybrid method that is based on matching extracted instances from the queries and information field. The queries and information domain is focused on semantic matching, to discover the best match and to progress the executive process. In conclusion, the hybrid ontology in semantic web is sufficient to retrieve the documents when compared to standard ontology.

  17. Correlates and Cross-Linguistic Comparisons of Informativeness and Efficiency on Nicholas and Brookshire Discourse Stimuli in Spanish/English Bilingual Adults

    Science.gov (United States)

    Edmonds, Lisa A.

    2013-01-01

    Purpose: The purpose of this study was to determine (a) correlates of informativeness and efficiency in discourse and (b) potential cross-linguistic and stimulus type (picture vs. nonpicture) differences in measures of informativeness and efficiency in Spanish/English bilingual adults in the United States. Method: Eighty-eight Spanish/English…

  18. Spectral unmixing techniques for retrieving plant foliar information

    Science.gov (United States)

    Themelis, Kostas; Sykioti, Olga; Rontogiannis, Athanasios; Koutroumbas, Konstantinos; Kyparissis, Aris

    2010-05-01

    In this study two novel approaches for supervised and semi-supervised hyperspectral unmixing are applied in the unmixing of CHRIS/PROBA data, in order to monitor seasonal land cover changes - in particular plant foliar coverage. Foliar coverage variations are directly linked to seasonal changes of the ecophysiological status of a plant (i.e. growth status, pigment concentrations, LAI etc). The high potential of using hyperspectral satellite data in monitoring plant biochemical and structural characteristics is important in ecophysiological studies. A reliable and efficient method to extract leaf and/or canopy information from a mixed pixel significantly contributes towards this direction. In this study, the development of two efficient algorithms in spectral unmixing enables the detection and mapping of leaf contribution to the overall pixel spectra and its seasonal variations. For this purpose, leaf spectra measured in the field, simultaneously to satellite acquisitions, are included in the endmember data set. The proposed unmixing techniques are performed on ground reflectances, assuming knowledge of the number and spectral signatures of the objects present in the images. An efficient estimation for their corresponding fractions in the pixels of the image is developed, based on a recently proposed maximum a posteriori probability (MAP) method. By exploiting the constraints naturally imposed to the problem, closed form expressions are derived for the statistical parameters required by the MAP estimator. In the semi-supervised scenario, we assume that a spectral library is given, containing spectral signatures of multiple endmembers. The objective in the latter case, is (a) to determine how many and which endmembers are present in the mixed pixel under study and (b) to use the selected endmembers to estimate the corresponding abundance fractions - especially the abundance of foliar coverage. 
The approach is based on a properly modified weighted l1-regularized least

  19. Millennial Undergraduate Research Strategies in Web and Library Information Retrieval Systems

    Science.gov (United States)

    Porter, Brandi

    2011-01-01

    This article summarizes the author's dissertation regarding search strategies of millennial undergraduate students in Web and library online information retrieval systems. Millennials bring a unique set of search characteristics and strategies to their research since they have never known a world without the Web. Through the use of search engines,…

  20. Calculation of the information content of retrieval procedures applied to mass spectral data bases

    NARCIS (Netherlands)

    Marlen, G. van; Dijkstra, Auke; Klooster, H.A. van 't

    1979-01-01

    A procedure has been developed for estimating the information content of retrieval systems with binary-coded mass spectra, as well as mass spectra coded by other methods, from the statistical properties of a reference file. For a reference file, binary-coded with a threshold of 1% of the intensity o