WorldWideScience

Sample records for current search engines

  1. Current Application of Search Engines and Their Developing Trend

    Institute of Scientific and Technical Information of China (English)

    ZHANG Li; SHAO Shi-huang; WU Xiao-qiong; ZENG Xian-hui; FAN Xiao-wen

    2002-01-01

    The basic types of current search engines, which help users perform laborious information-gathering tasks on the Internet, are presented. Search engines on the WWW information service can be classified into index engines, directory engines and agent engines. The key technologies of web mining, automatic document classification and ordering rules for feedback information are discussed. Finally, the developing trend of search engines is pointed out by analyzing their practical application on the World Wide Web.

  2. Sound Search Engine Concept

    DEFF Research Database (Denmark)

    2006-01-01

    Sound search is provided by the major search engines; however, indexing is text based, not sound based. We will establish a dedicated sound search service based on sound feature indexing. The current demo shows the concept of the sound search engine. The first engine will be released June...

  4. Internet Search Engines

    OpenAIRE

    Fatmaa El Zahraa Mohamed Abdou

    2004-01-01

    A general study of Internet search engines. The study deals with 7 main points: the difference between search engines and search directories, components of search engines, the percentage of sites covered by search engines, cataloging of sites, the time needed for sites to appear in search engines, search capabilities, and types of search engines.

  5. Internet Search Engines

    Directory of Open Access Journals (Sweden)

    Fatmaa El Zahraa Mohamed Abdou

    2004-09-01

    A general study of Internet search engines. The study deals with 7 main points: the difference between search engines and search directories, components of search engines, the percentage of sites covered by search engines, cataloging of sites, the time needed for sites to appear in search engines, search capabilities, and types of search engines.

  6. Social Work Literature Searching: Current Issues with Databases and Online Search Engines

    Science.gov (United States)

    McGinn, Tony; Taylor, Brian; McColgan, Mary; McQuilkan, Janice

    2016-01-01

    Objectives: To compare the performance of a range of search facilities; and to illustrate the execution of a comprehensive literature search for qualitative evidence in social work. Context: Developments in literature search methods and comparisons of search facilities help facilitate access to the best available evidence for social workers.…

  7. The Jungle Database Search Engine

    DEFF Research Database (Denmark)

    Bøhlen, Michael Hanspeter; Bukauskas, Linas; Dyreson, Curtis

    1999-01-01

    Information spread across databases cannot be found by current search engines. A database search engine is able to access and advertise databases on the WWW. Jungle is a database search engine prototype developed at Aalborg University. Operating through JDBC connections to remote databases, Jungle...

  8. Web Search Engines

    OpenAIRE

    Rajashekar, TB

    1998-01-01

    The World Wide Web is emerging as an all-in-one information source. Tools for searching Web-based information include search engines, subject directories and meta search tools. We take a look at key features of these tools and suggest practical hints for effective Web searching.

  9. Next-Gen Search Engines

    Science.gov (United States)

    Gupta, Amardeep

    2005-01-01

    Current search engines--even the constantly surprising Google--seem unable to leap the next big barrier in search: the trillions of bytes of dynamically generated data created by individual web sites around the world, or what some researchers call the "deep web." The challenge now is not information overload, but information overlook.…

  10. Improving Search Engine Reliability

    Science.gov (United States)

    Pruthi, Jyoti; Kumar, Ela

    2010-11-01

    Search engines on the Internet are used daily to access and find information. While these services are providing an easy way to find information globally, they are also suffering from artificially created false results. This paper describes two techniques that are being used to manipulate the search engines: spam pages (used to achieve higher rankings on the result page) and cloaking (used to feed falsified data into search engines). This paper also describes two proposed methods to fight this kind of misuse, algorithms for both of the formerly mentioned cases of spamdexing.

  11. Custom Search Engines: Tools & Tips

    Science.gov (United States)

    Notess, Greg R.

    2008-01-01

    Few have the resources to build a Google or Yahoo! from scratch. Yet anyone can build a search engine based on a subset of the large search engines' databases. Use Google Custom Search Engine or Yahoo! Search Builder or any of the other similar programs to create a vertical search engine targeting sites of interest to users. The basic steps to…

  12. Myanmar Language Search Engine

    Directory of Open Access Journals (Sweden)

    Pann Yu Mon

    2011-03-01

    With the enormous growth of the World Wide Web, search engines play a critical role in retrieving information from the borderless Web. Although many search engines are available for the major languages, they are not very proficient for less computerized languages, including Myanmar. The main reason is that those search engines do not consider the specific features of those languages. A search engine capable of searching Web documents written in those languages is highly needed, especially as more and more Web sites come up with localized content in multiple languages. In this study, the design and architecture of a language-specific search engine for the Myanmar language are proposed. The main features of the system are: (1) it can search the multiple encodings of Myanmar Web pages, and (2) it is designed to comply with the specific features of the Myanmar language. Finally, an experiment was carried out to test whether the system meets the design requirements.

  13. With News Search Engines

    Science.gov (United States)

    Gunn, Holly

    2005-01-01

    Although there are many news search engines on the Web, finding the news items one wants can be challenging. Choosing appropriate search terms is one of the biggest challenges. Unless one has seen the article that one is seeking, it is often difficult to select words that were used in the headline or text of the article. The limited archives of…

  14. Web Search Engines: Search Syntax and Features.

    Science.gov (United States)

    Ojala, Marydee

    2002-01-01

    Presents a chart that explains the search syntax, features, and commands used by the 12 most widely used general Web search engines. Discusses Web standardization, expanded types of content searched, size of databases, and search engines that include both simple and advanced versions. (LRW)

  16. Search Engine Bias and the Demise of Search Engine Utopianism

    Science.gov (United States)

    Goldman, E.

    Due to search engines' automated operations, people often assume that search engines display search results neutrally and without bias. However, this perception is mistaken. Like any other media company, search engines affirmatively control their users' experiences, which has the consequence of skewing search results (a phenomenon called "search engine bias"). Some commentators believe that search engine bias is a defect requiring legislative correction. Instead, this chapter argues that search engine bias is the beneficial consequence of search engines optimizing content for their users. The chapter further argues that the most problematic aspect of search engine bias, the "winner-take-all" effect caused by top placement in search results, will be mooted by emerging personalized search technology.

  17. A Feedback-Based Web Search Engine

    Institute of Scientific and Technical Information of China (English)

    ZHANG Wei-feng; XU Bao-wen; ZHOU Xiao-yu

    2004-01-01

    Web search engines are very useful information service tools on the Internet. Current web search engines produce search results relating to the search terms and the actual information collected by them. Since the selections of the search results cannot affect future ones, they may not cover most people's interests. In this paper, feedback information produced by the users' access lists is represented by a rough set and can reconstruct the query string and influence the search results. Thus the search engines can provide self-adaptability.

  18. Tag Based Audio Search Engine

    Directory of Open Access Journals (Sweden)

    Parameswaran Vellachu

    2012-03-01

    The volume of music databases is increasing day by day. Getting the required song as per the choice of the listener is a big challenge. Hence, it is really hard to manage this huge quantity, in terms of searching and filtering through the music database. It is surprising to see that the audio and music industry still relies on very simplistic metadata to describe music files. When searching audio resources, an efficient "Tag Based Audio Search Engine" is therefore necessary. The current research focuses on two aspects of musical databases: (1) tag-based semantic annotation generation, and (2) an audio search engine with which the user can retrieve songs based on the user's choice. The proposed method can be used to annotate and retrieve songs based on the musical instruments used, mood of the song, theme of the song, singer, music director, artist, film director, genre or style, and so on.

  19. NASA Indexing Benchmarks: Evaluating Text Search Engines

    Science.gov (United States)

    Esler, Sandra L.; Nelson, Michael L.

    1997-01-01

    The current proliferation of on-line information resources underscores the requirement for the ability to index collections of information and search and retrieve them in a convenient manner. This study develops criteria for analytically comparing the index and search engines and presents results for a number of freely available search engines. A product of this research is a toolkit capable of automatically indexing, searching, and extracting performance statistics from each of the focused search engines. This toolkit is highly configurable and has the ability to run these benchmark tests against other engines as well. Results demonstrate that the tested search engines can be grouped into two levels. Level one engines are efficient on small to medium sized data collections, but show weaknesses when used for collections 100MB or larger. Level two search engines are recommended for data collections up to and beyond 100MB.

  20. New generation of the multimedia search engines

    Science.gov (United States)

    Mijes Cruz, Mario Humberto; Soto Aldaco, Andrea; Maldonado Cano, Luis Alejandro; López Rodríguez, Mario; Rodríguez Vázqueza, Manuel Antonio; Amaya Reyes, Laura Mariel; Cano Martínez, Elizabeth; Pérez Rosas, Osvaldo Gerardo; Rodríguez Espejo, Luis; Flores Secundino, Jesús Abimelek; Rivera Martínez, José Luis; García Vázquez, Mireya Saraí; Zamudio Fuentes, Luis Miguel; Sánchez Valenzuela, Juan Carlos; Montoya Obeso, Abraham; Ramírez Acosta, Alejandro Álvaro

    2016-09-01

    Current search engines are based upon search methods that involve the combination of words (text-based search), which has been efficient until now. However, the Internet's growing demand indicates that there is more diversity on it with each passing day. Text-based searches are becoming limited, as most of the information on the Internet can be found in different types of content called multimedia content (images, audio files, video files). Indeed, what needs to be improved in current search engines is search content and precision, as well as an accurate display of the search results expected by the user. Any search can be more precise if it uses more text parameters, but this does not help improve the content or speed of the search itself. One solution is to improve them through the characterization of the content for search in multimedia files. In this article, an analysis of new-generation multimedia search engines is presented, focusing on the needs arising from new technologies. Multimedia content has become a central part of the flow of information in our daily life. This reflects the necessity of having multimedia search engines, as well as knowing the real tasks that they must fulfil. Through this analysis, it is shown that there are not many search engines that can perform content searches. Research on new-generation multimedia search engines is a multidisciplinary area in constant growth, generating tools that satisfy the different needs of new-generation systems.

  1. Multimedia Search Engines : Concept, Performance, and Types

    OpenAIRE

    Sayed Rabeh Sayed

    2005-01-01

    A study of multimedia search engines. It starts with a definition of search engines in general and of multimedia search engines, then explains how they work and divides them into video search engines, image search engines, and audio search engines. Finally, it reviews a sample of multimedia search engines.

  2. Multimedia Search Engines : Concept, Performance, and Types

    Directory of Open Access Journals (Sweden)

    Sayed Rabeh Sayed

    2005-12-01

    A study of multimedia search engines. It starts with a definition of search engines in general and of multimedia search engines, then explains how they work and divides them into video search engines, image search engines, and audio search engines. Finally, it reviews a sample of multimedia search engines.

  3. A Survey on Semantic Web Search Engine

    Directory of Open Access Journals (Sweden)

    G.Sudeepthi

    2012-03-01

    With the tremendous growth in the volume of data and in the number of web pages, traditional search engines are no longer appropriate or suitable. The search engine is the most important tool for discovering information on the World Wide Web. The semantic search engine was born of the traditional search engine to overcome this problem. The Semantic Web is an extension of the current web in which information is given well-defined meaning. Semantic web technologies are playing a crucial role in enhancing traditional web search, as they work to create machine-readable data, but they will not replace the traditional search engine. In this paper we make a brief survey of various promising features of some of the best semantic search engines developed so far, and we discuss the various approaches to semantic search. We summarize the techniques and advantages of some important semantic web search engines developed so far. Most prominently, we show how semantic search engines differ from traditional searches, with their results illustrated by giving a sample query as input.

  4. Search Engine Optimization

    CERN Document Server

    Davis, Harold

    2006-01-01

    SEO--short for Search Engine Optimization--is the art, craft, and science of driving web traffic to web sites. Web traffic is food, drink, and oxygen--in short, life itself--to any web-based business. Whether your web site depends on broad, general traffic, or high-quality, targeted traffic, this PDF has the tools and information you need to draw more traffic to your site. You'll learn how to effectively use PageRank (and Google itself); how to get listed, get links, and get syndicated; and much more. The field of SEO is expanding into all the possible ways of promoting web traffic. This

  5. Self-learning search engines

    NARCIS (Netherlands)

    Schuth, A.

    2015-01-01

    How does a search engine such as Google know which search results to display? There are many competing algorithms that generate search results, but which one works best? We developed a new probabilistic method for quickly comparing large numbers of search algorithms by examining the results users cl

  6. Credibility in Web Search Engines

    OpenAIRE

    Lewandowski, Dirk

    2012-01-01

    Web search engines apply a variety of ranking signals to achieve user satisfaction, i.e., results pages that provide the best-possible results to the user. While these ranking signals implicitly consider credibility (e.g., by measuring popularity), explicit measures of credibility are not applied. In this chapter, credibility in Web search engines is discussed in a broad context: credibility as a measure for including documents in a search engine's index, credibility as a ranking signal, cred...

  7. Da "Search engines" a "Shop engines"

    OpenAIRE

    Lupi, Mauro

    2001-01-01

    The change occurring in “search engines” is a move towards e-commerce, transforming all the main search engines into means of conveying information and commercial suggestions, and basing their business on this activity. In the near future we will find two main groups of search engines: on one side, the portals that will offer a general orientation guide, acting as conveying means for services and products to buy; on the other side, vertical portals able to offer information and products on specifi...

  8. Assessing Bias in Search Engines.

    Science.gov (United States)

    Mowshowitz, Abbe; Kawaguchi, Akira

    2002-01-01

    Addresses the measurement of bias in search engines on the Web, defining bias as the balance and representation of items in a collection retrieved from a database for a set of queries. Assesses bias by measuring the deviation from the ideal of the distribution produced by a particular search engine. (Author/LRW)
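
    The idea of measuring bias as deviation from an ideal distribution can be sketched in a few lines. The example below is only an illustration under stated assumptions: it takes the "ideal" to be the pooled distribution of items returned by all engines under comparison and uses one minus cosine similarity as the deviation. The names `distribution`, `cosine_similarity` and `bias`, and the sample URLs, are hypothetical and are not taken from the record.

```python
from collections import Counter
from math import sqrt


def distribution(result_lists):
    """Count how often each item (e.g. a URL) appears across a set of result lists."""
    counts = Counter()
    for results in result_lists:
        counts.update(results)
    return counts


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors (Counters)."""
    keys = set(a) | set(b)
    dot = sum(a[k] * b[k] for k in keys)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def bias(engine_results, ideal):
    """Assumed bias measure: 0 means the engine matches the ideal distribution exactly."""
    return 1.0 - cosine_similarity(distribution(engine_results), ideal)


# Hypothetical result lists: one list of URLs per query, for two engines.
engine_a = [["u1", "u2"], ["u1", "u3"]]
engine_b = [["u1", "u2"], ["u2", "u4"]]

# Assumption: the ideal is approximated by pooling the results of all engines.
ideal = distribution(engine_a + engine_b)

for name, results in (("engine_a", engine_a), ("engine_b", engine_b)):
    print(name, round(bias(results, ideal), 3))
```

    Pooling the engines' own results is a pragmatic stand-in for an ideal that cannot be observed directly; a different reference collection would give different bias values.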

  9. Evaluative Measures of Search Engines

    Directory of Open Access Journals (Sweden)

    Jitendra Nath Singh

    2012-03-01

    The ability to search and retrieve information from the web efficiently and effectively is a great challenge for search engines. Information retrieval on the Web is very different from retrieval in traditional indexed databases because of its hyper-linked character and the heterogeneity of document types and authoring styles. Thus, since Web retrieval is substantially different from traditional information retrieval, new or revised evaluative measures are required to assess retrieval performance using search engines. In this paper we suggest a number of evaluative measures to evaluate the effectiveness of search engines. The motivation behind each of these measures is presented, along with their descriptions and definitions.

  10. Evaluating search effectiveness of some selected search engines ...

    African Journals Online (AJOL)

    Evaluating search effectiveness of some selected search engines. ... seek information on the World Wide Web (WWW) using a variety of search engines.

  11. Judging the Capability of Search Engines and Search Terms

    National Research Council Canada - National Science Library

    Anna Kaushik

    2012-01-01

    .... The present study aims to judge the capability of five selected search engines and search terms on the basis of first ten results and to identify most appropriate search term and search engine...

  12. [Advanced online search techniques and dedicated search engines for physicians].

    Science.gov (United States)

    Nahum, Yoav

    2008-02-01

    In recent years search engines have become an essential tool in the work of physicians. This article will review advanced search techniques from the world of information specialists, as well as some advanced search engine operators that may help physicians improve their online search capabilities, and maximize the yield of their searches. This article also reviews popular dedicated scientific and biomedical literature search engines.

  13. Self-learning search engines

    OpenAIRE

    Schuth, A.

    2015-01-01

    How does a search engine such as Google know which search results to display? There are many competing algorithms that generate search results, but which one works best? We developed a new probabilistic method for quickly comparing large numbers of search algorithms by examining the results users click on. Our study was presented at SIGIR 2015, the leading international conference on information retrieval, held in Santiago (Chile) last summer.
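
    Comparing rankers from clicks can be illustrated with team-draft interleaving, a well-known technique for comparing two rankers. This is a swapped-in, simpler method chosen for illustration; it is not the probabilistic method the record describes, which compares many rankers at once. The function names, document ids and the simulated click are hypothetical.

```python
import random


def team_draft_interleave(ranking_a, ranking_b, rng):
    """Merge two rankings and remember which ranker ("A" or "B") contributed each document."""
    rankings = {"A": ranking_a, "B": ranking_b}
    picks = {"A": 0, "B": 0}
    team_of = {}
    interleaved = []
    total = len(set(ranking_a) | set(ranking_b))
    while len(interleaved) < total:
        # The ranker with fewer picks goes next; ties are broken by a coin flip.
        if picks["A"] != picks["B"]:
            team = "A" if picks["A"] < picks["B"] else "B"
        else:
            team = "A" if rng.random() < 0.5 else "B"
        doc = next((d for d in rankings[team] if d not in team_of), None)
        if doc is None:
            # This ranker has nothing new left, so the other one picks instead.
            team = "B" if team == "A" else "A"
            doc = next(d for d in rankings[team] if d not in team_of)
        team_of[doc] = team
        interleaved.append(doc)
        picks[team] += 1
    return interleaved, team_of


def credit_clicks(clicked_docs, team_of):
    """Give each click to the ranker that contributed the clicked document."""
    wins = {"A": 0, "B": 0}
    for doc in clicked_docs:
        if doc in team_of:
            wins[team_of[doc]] += 1
    return wins


rng = random.Random(0)  # fixed seed so the sketch is repeatable
ranking_a = ["d1", "d2", "d3", "d4"]
ranking_b = ["d3", "d1", "d5", "d2"]
interleaved, team_of = team_draft_interleave(ranking_a, ranking_b, rng)

# Hypothetical user behaviour: a single click on the top interleaved result.
wins = credit_clicks(interleaved[:1], team_of)
print(interleaved, wins)
```

    Over many queries, the ranker that collects more credited clicks is judged the better one; the coin flip keeps either ranker from being systematically favoured by position.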

  14. A Study on Semantic Searching, Semantic Search Engines and Technologies Used for Semantic Search Engines

    OpenAIRE

    Junaid Rashid; Muhammad Wasif Nisar

    2016-01-01

    Semantic search engines (SSE) are more efficient than other web engines because in this era of busy life everyone wants an exact answer to his question, which only semantic engines can provide. With the immense increase in the volume of data, traditional search engines have increased the number of answers returned to satisfy the user. This creates the problem of searching for the desired answer. To solve this problem, the trend of developing semantic search engines is increasing day by da...

  15. Conceptual Models for Search Engines

    Science.gov (United States)

    Hendry, D. G.; Efthimiadis, E. N.

    Search engines have entered popular culture. They touch people in diverse private and public settings and thus heighten the importance of such important social matters as information privacy and control, censorship, and equitable access. To fully benefit from search engines and to participate in debate about their merits, people necessarily appeal to their understandings for how they function. In this chapter we examine the conceptual understandings that people have of search engines by performing a content analysis on the sketches that 200 undergraduate and graduate students drew when asked to draw a sketch of how a search engine works. Analysis of the sketches reveals a diverse range of conceptual approaches, metaphors, representations, and misconceptions. On the whole, the conceptual models articulated by these students are simplistic. However, students with higher levels of academic achievement sketched more complete models. This research calls attention to the importance of improving students' technical knowledge of how search engines work so they can be better equipped to develop and advocate policies for how search engines should be embedded in, and restricted from, various private and public information settings.

  16. A Search Engine Features Comparison.

    Science.gov (United States)

    Vorndran, Gerald

    Until recently, the World Wide Web (WWW) public access search engines have not included many of the advanced commands, options, and features commonly available with the for-profit online database user interfaces, such as DIALOG. This study evaluates the features and characteristics common to both types of search interfaces, examines the Web search…

  17. Evaluative Measures of Search Engines

    OpenAIRE

    Jitendra Nath Singh; Dr. S.K. Dwivedi

    2012-01-01

    The ability to search and retrieve information from the web efficiently and effectively is a great challenge for search engines. Information retrieval on the Web is very different from retrieval in traditional indexed databases because of its hyper-linked character and the heterogeneity of document types and authoring styles. Thus, since Web retrieval is substantially different from traditional information retrieval, new or revised evaluative measures are required to assess retrieval performance using search engi...

  18. BIOMedical Search Engine Framework: Lightweight and customized implementation of domain-specific biomedical search engines.

    Science.gov (United States)

    Jácome, Alberto G; Fdez-Riverola, Florentino; Lourenço, Anália

    2016-07-01

    meaningful to that particular scope of research. Conversely, indirect concept associations, i.e. concepts related by other intermediary concepts, can be useful to integrate information from different studies and look into non-trivial relations. The BIOMedical Search Engine Framework supports the development of domain-specific search engines. The key strengths of the framework are modularity and extensibility in terms of software design, the use of open-source consolidated Web technologies, and the ability to integrate any number of biomedical text mining tools and information resources. Currently, the Smart Drug Search keeps over 1,186,000 documents, containing more than 11,854,000 annotations for 77,200 different concepts. The Smart Drug Search is publicly accessible at http://sing.ei.uvigo.es/sds/. The BIOMedical Search Engine Framework is freely available for non-commercial use at https://github.com/agjacome/biomsef. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  19. A Study on Semantic Searching, Semantic Search Engines and Technologies Used for Semantic Search Engines

    Directory of Open Access Journals (Sweden)

    Junaid Rashid

    2016-10-01

    Semantic search engines (SSE) are more efficient than other web engines because in this era of busy life everyone wants an exact answer to his question, which only semantic engines can provide. With the immense increase in the volume of data, traditional search engines have increased the number of answers returned to satisfy the user. This creates the problem of searching for the desired answer. To solve this problem, the trend of developing semantic search engines is increasing day by day. Semantic search engines work to extract the best answer to user queries, one which fits them exactly. Traditional search engines are keyword based, which means that they do not know the meaning of the words we type in our queries. For this reason, semantic search engines surpass conventional search engines, because they give us meaningful and well-defined information. In this paper, we discuss the background of semantic searching, semantic search engines and the technology used for them, and compare some of the existing semantic search engines on various factors.

  20. Date restricted queries in web search engines

    OpenAIRE

    Lewandowski, Dirk

    2004-01-01

    Search engines usually offer a date restricted search on their advanced search pages. But determining the actual update of a web page is not without problems. We conduct a study testing date restricted queries on the search engines Google, Teoma and Yahoo!. We find that these searches fail to work properly in the examined engines. We discuss implications of this for further research and search engine development.

  1. Search engines that learn from their users

    NARCIS (Netherlands)

    Schuth, A.G.

    2016-01-01

    More than half the world’s population uses web search engines, resulting in over half a billion search queries every single day. For many people web search engines are among the first resources they go to when a question arises. Moreover, search engines have for many become the most trusted route to

  2. Engineering Optimisation by Cuckoo Search

    CERN Document Server

    Yang, Xin-She

    2010-01-01

    A new metaheuristic optimisation algorithm, called Cuckoo Search (CS), was developed recently by Yang and Deb (2009). This paper presents a more extensive comparison study using some standard test functions and newly designed stochastic test functions. We then apply the CS algorithm to solve engineering design optimisation problems, including the design of springs and welded beam structures. The optimal solutions obtained by CS are far better than the best solutions obtained by an efficient particle swarm optimiser. We will discuss the unique search features used in CS and the implications for further research.

  3. [Development of domain specific search engines].

    Science.gov (United States)

    Takai, T; Tokunaga, M; Maeda, K; Kaminuma, T

    2000-01-01

    As cyberspace explodes at a pace that nobody ever imagined, it becomes very important to search cyberspace efficiently and effectively. One solution to this problem is search engines. A lot of commercial search engines have already been put on the market. However, these search engines respond with such cumbersome results that domain-specific experts cannot tolerate them. Using dedicated hardware and commercial software called OpenText, we have tried to develop several domain-specific search engines. These engines are for our institute's Web contents, drugs, chemical safety, endocrine disruptors, and emergency response for chemical hazards. These engines have been on our Web site for testing.

  4. A COMPARATIVE STUDY OF BYG SEARCH ENGINES

    Directory of Open Access Journals (Sweden)

    Kailash Kumar

    2013-01-01

    This paper compares the retrieval effectiveness of the Bing, Yahoo and Google (BYG) search engines. The precision and relative recall of each search engine were considered in evaluating effectiveness. General queries were tested. The results showed that the precision of Google was higher than that of the other two search engines, and that Yahoo had better precision than Bing.
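
    The two measures named in this record can be written down compactly. The sketch below uses the common definitions: precision is the share of an engine's retrieved results that are relevant, and relative recall is the engine's relevant results divided by the relevant results found by all engines together. Whether the study used exactly these definitions is an assumption, and the engine names, document ids and relevance judgments are hypothetical.

```python
def precision(retrieved, relevant):
    """Fraction of the retrieved documents that are relevant."""
    retrieved = set(retrieved)
    if not retrieved:
        return 0.0
    return len(retrieved & relevant) / len(retrieved)


def relative_recall(engine, runs, relevant):
    """Relevant documents found by one engine, relative to those found by all engines."""
    pooled_relevant = set().union(*runs.values()) & relevant
    if not pooled_relevant:
        return 0.0
    return len(set(runs[engine]) & relevant) / len(pooled_relevant)


# Hypothetical results for one query and hypothetical relevance judgments.
runs = {
    "engine_a": ["d1", "d2", "d3", "d4"],
    "engine_b": ["d2", "d5"],
    "engine_c": ["d6", "d7", "d1"],
}
relevant = {"d1", "d2", "d5", "d6"}

for name, retrieved in runs.items():
    print(name, precision(retrieved, relevant), relative_recall(name, runs, relevant))
```

    Relative recall is used because the true number of relevant pages on the Web is unknown, so the pooled relevant results of the compared engines serve as the denominator.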

  5. Quantum searching application in search based software engineering

    Science.gov (United States)

    Wu, Nan; Song, FangMin; Li, Xiangdong

    2013-05-01

    Search Based Software Engineering (SBSE) is widely used in software engineering for identifying optimal solutions. However, the traditional algorithms used for SBSE offer no polynomial-time solution, which makes the cost very high. In this paper, we analyze and compare several quantum search algorithms that could be applied to SBSE: the quantum adiabatic evolution searching algorithm, fixed-point quantum search (FPQS), quantum walks, and a rapid modified Grover quantum searching method. Grover's algorithm is thought to be the best choice for large-scale unstructured data searching, and theoretically it is applicable to any search-space structure and any type of searching problem.
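
    To make the Grover step concrete, the sketch below simulates the standard textbook algorithm classically on a small state vector; it is not the modified method analyzed in the paper. It assumes a single marked item and uses roughly (π/4)·√N iterations, where each iteration flips the sign of the marked amplitude and then reflects all amplitudes about their mean. The function name and the problem size are hypothetical.

```python
from math import floor, pi, sqrt


def grover_search(n_items, marked):
    """Classically simulate Grover's algorithm for one marked index among n_items."""
    # Start in the uniform superposition: every item has the same amplitude.
    amplitudes = [1 / sqrt(n_items)] * n_items
    iterations = floor(pi / 4 * sqrt(n_items))
    for _ in range(iterations):
        # Oracle: flip the sign of the marked item's amplitude.
        amplitudes[marked] = -amplitudes[marked]
        # Diffusion: reflect every amplitude about the mean amplitude.
        mean = sum(amplitudes) / n_items
        amplitudes = [2 * mean - a for a in amplitudes]
    probabilities = [a * a for a in amplitudes]
    return probabilities, iterations


probs, iterations = grover_search(16, marked=5)
print(iterations, round(probs[5], 3))
```

    The number of oracle calls grows with √N, whereas a classical scan of an unstructured list needs on the order of N checks; this is the speed-up that motivates applying quantum search to large search spaces.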

  6. Search Engine For Ebook Portal

    Directory of Open Access Journals (Sweden)

    Prashant Kanade

    2015-08-01

    The purpose of this paper is to establish the textual analytics involved in developing a search engine for an ebook portal. We extracted our dataset from Project Gutenberg using a robot harvester. Textual analytics is used for efficient search retrieval. The entire dataset is represented using the Vector Space Model, where each document is a vector in the vector space. Further, for computational purposes, we represent our dataset in the form of a Term Frequency-Inverse Document Frequency (tf-idf) matrix. The first step involves obtaining the most coherent sequence of words of the search query entered. The entered query is processed using front-end algorithms, which include spell checking, text segmentation and language modeling. Back-end processing includes similarity modeling, clustering, indexing and retrieval. The relationship between documents and words is established using cosine similarity measured between the documents and words in the vector space. The clustering is used to suggest books that are similar to the search query entered by the user. Lastly, the Lucene-based Elasticsearch engine is used for indexing the documents. This allows faster retrieval of data. Elasticsearch returns a dictionary and creates a tf-idf matrix. The processed query is compared with the dictionary obtained, and the tf-idf matrix is used to calculate the score for each match to give the most relevant result.
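
    The vector space model with tf-idf weighting and cosine similarity described in this record can be sketched in plain Python. This is a minimal illustration, not the paper's pipeline: it skips spell checking, segmentation, clustering and Elasticsearch, tokenizes on whitespace, and uses a smoothed idf of log(N/df) + 1, which is an assumption. The sample titles and function names are hypothetical.

```python
import math
from collections import Counter


def build_index(docs):
    """Return tf-idf vectors for the documents and the idf table used to build them."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    # Assumed smoothing: log(N / df) + 1 keeps terms that occur everywhere above zero.
    idf = {term: math.log(n_docs / df) + 1.0 for term, df in doc_freq.items()}
    vectors = [vectorize(tokens, idf) for tokens in tokenized]
    return vectors, idf


def vectorize(tokens, idf):
    """tf-idf vector as a dict; terms unknown to the index are ignored."""
    counts = Counter(t for t in tokens if t in idf)
    length = len(tokens)
    return {term: (count / length) * idf[term] for term, count in counts.items()}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, vectors, idf):
    """Rank document indices by cosine similarity to the query, best first."""
    query_vector = vectorize(query.lower().split(), idf)
    scores = [(i, cosine(query_vector, vec)) for i, vec in enumerate(vectors)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical mini-corpus of book descriptions.
docs = [
    "the adventures of sherlock holmes",
    "pride and prejudice",
    "the return of sherlock holmes detective stories",
]
vectors, idf = build_index(docs)
ranking = search("sherlock holmes", vectors, idf)
print(ranking)
```

    Because cosine similarity normalizes by vector length, the shorter title that is mostly about the query terms ranks above the longer one, and the unrelated title scores zero.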

  7. How Do Search Engines Handle Chinese Queries?

    Directory of Open Access Journals (Sweden)

    Hong Cui

    2005-10-01

    Full Text Available The use of languages other than English has been growing exponentially on the Web. However, the major search engines have been lagging behind in providing indexes and search features to handle these languages. This article explores the characteristics of the Chinese language and how queries in this language are handled by different search engines. Queries were entered in two major search engines (Google and AlltheWeb) and two search engines developed for Chinese (Sohu and Baidu). Criteria such as handling word segmentation, number of retrieved documents, and correct display and identification of Chinese characters were used to examine how the search engines handled the queries. The results showed that the performance of the two major search engines was not on a par with that of the search engines developed for Chinese.

  8. Chemical-text hybrid search engines.

    Science.gov (United States)

    Zhou, Yingyao; Zhou, Bin; Jiang, Shumei; King, Frederick J

    2010-01-01

    As the amount of chemical literature increases, it is critical that researchers be enabled to accurately locate documents related to a particular aspect of a given compound. Existing solutions, based on text and chemical search engines alone, suffer from the inclusion of "false negative" and "false positive" results, and cannot accommodate diverse repertoire of formats currently available for chemical documents. To address these concerns, we developed an approach called Entity-Canonical Keyword Indexing (ECKI), which converts a chemical entity embedded in a data source into its canonical keyword representation prior to being indexed by text search engines. We implemented ECKI using Microsoft Office SharePoint Server Search, and the resultant hybrid search engine not only supported complex mixed chemical and keyword queries but also was applied to both intranet and Internet environments. We envision that the adoption of ECKI will empower researchers to pose more complex search questions that were not readily attainable previously and to obtain answers at much improved speed and accuracy.

  9. Children's Search Engines from an Information Search Process Perspective.

    Science.gov (United States)

    Broch, Elana

    2000-01-01

    Describes cognitive and affective characteristics of children and teenagers that may affect their Web searching behavior. Reviews literature on children's searching in online public access catalogs (OPACs) and using digital libraries. Profiles two Web search engines. Discusses some of the difficulties children have searching the Web, in the…

  10. Relevance of Search Engines for Modern Generations

    Directory of Open Access Journals (Sweden)

    Trilok Gupta

    2014-02-01

    Full Text Available Web search engines have a major impact on people's everyday lives. It is of great importance to test the retrieval effectiveness of search engines. However, it is labour-intensive to judge the relevance of search results for a large number of queries, and these relevance judgments may not be reusable since Web data change all the time. Experiments on major search engines show that our approach mines many high-confidence rules that help understand search engines and detect suspicious search results.

  11. New Architectures for Presenting Search Results Based on Web Search Engines Users Experience

    Science.gov (United States)

    Martinez, F. J.; Pastor, J. A.; Rodriguez, J. V.; Lopez, Rosana; Rodriguez, J. V., Jr.

    2011-01-01

    Introduction: The Internet is a dynamic environment which is continuously being updated. Search engines have been, currently are and in all probability will continue to be the most popular systems in this information cosmos. Method: In this work, special attention has been paid to the series of changes made to search engines up to this point,…

  12. Evaluation of Query Generators for Entity Search Engines

    CERN Document Server

    Endrullis, Stefan; Rahm, Erhard

    2010-01-01

    Dynamic web applications such as mashups need efficient access to web data that is only accessible via entity search engines (e.g. product or publication search engines). However, most current mashup systems and applications only support simple keyword searches for retrieving data from search engines. We propose the use of more powerful search strategies building on so-called query generators. For a given set of entities query generators are able to automatically determine a set of search queries to retrieve these entities from an entity search engine. We demonstrate the usefulness of query generators for on-demand web data integration and evaluate the effectiveness and efficiency of query generators for a challenging real-world integration scenario.

  13. Internet search engines - Fluctuations in document accessibility

    NARCIS (Netherlands)

    Mettrop, W.; Nieuwenhuysen, P.

    2001-01-01

    An empirical investigation of the consistency of retrieval through Internet search engines is reported. Thirteen engines are evaluated: AltaVista, EuroFerret, Excite, HotBot, InfoSeek, Lycos, MSN, NorthernLight, Snap, WebCrawler and three national Dutch engines: Ilse, Search.nl and Vindex. The focus

  14. Overview of the Web Search Engine

    Institute of Scientific and Technical Information of China (English)

    张卫丰; 徐宝文; 周晓宇; 许蕾; 李东

    2001-01-01

    With the explosive growth of network information, it is increasingly difficult for people to find information. The emergence of Web search engines overcomes this problem to some degree. This paper recounts the history of the search engine and its current state. Some guidelines for search engines are analysed, and the related checking methods are also given. On this basis, we introduce the trends of search engines.

  15. Capacity Planning for Vertical Search Engines

    CERN Document Server

    Badue, Claudine; Almeida, Virgilio; Baeza-Yates, Ricardo; Ribeiro-Neto, Berthier; Ziviani, Artur; Ziviani, Nivio

    2010-01-01

    Vertical search engines focus on specific slices of content, such as the Web of a single country or the document collection of a large corporation. Despite this, like general open web search engines, they are expensive to maintain, expensive to operate, and hard to design. Because of this, predicting the response time of a vertical search engine is usually done empirically through experimentation, requiring a costly setup. An alternative is to develop a model of the search engine for predicting performance. However, this alternative is of interest only if its predictions are accurate. In this paper we propose a methodology for analyzing the performance of vertical search engines. Applying the proposed methodology, we present a capacity planning model based on a queueing network for search engines with a scale typically suitable for the needs of large corporations. The model is simple and yet reasonably accurate and, in contrast to previous work, considers the imbalance in query service times among homogeneous...
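
    To make the idea of predicting response time from a performance model concrete, here is a deliberately simplified sketch. It is not the queueing network proposed in the paper: it assumes a single M/M/1 queue (Poisson arrivals, exponential service times), for which the mean response time is R = 1 / (mu - lambda). The function names and the example rates are hypothetical.

```python
def mm1_response_time(arrival_rate, service_rate):
    """Mean response time (seconds) of an M/M/1 queue: R = 1 / (mu - lambda).

    arrival_rate (lambda) and service_rate (mu) are in queries per second.
    Returns infinity when the server is saturated (lambda >= mu).
    """
    if arrival_rate < 0 or service_rate <= 0:
        raise ValueError("rates must be non-negative, service rate positive")
    if arrival_rate >= service_rate:
        return float("inf")
    return 1.0 / (service_rate - arrival_rate)


def max_arrival_rate(service_rate, target_response_time):
    """Largest arrival rate that keeps the mean response time within the target.

    Solving 1 / (mu - lambda) <= T for lambda gives lambda <= mu - 1 / T.
    """
    if service_rate <= 0 or target_response_time <= 0:
        raise ValueError("service rate and target must be positive")
    return max(0.0, service_rate - 1.0 / target_response_time)
```

    Under these assumptions, a server that completes 100 queries per second and receives 80 queries per second has a mean response time of 0.05 s; a real capacity plan would need a model that captures the imbalance among servers that the abstract mentions.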

  16. Optimization of web pages for search engines

    OpenAIRE

    Harej, Anže

    2011-01-01

    The thesis describes the most important elements of a Web Page and outside factors that affect Search Engine Optimization. The basic structure of a Web page, structure and functionality of a modern Search Engine is described at the beginning. The first section deals with the start of Search Engine Optimization, including planning, analysis of web space and the selection of the most important keywords for which the site will be optimized. The next section Web Page Optimization describes...

  17. Google Patents: The global patent search engine

    OpenAIRE

    Alireza Noruzi; Mohammadhiwa Abdekhoda

    2014-01-01

    Google Patents (www.google.com/patents) includes over 8 million full-text patents. Google Patents works in the same way as the Google search engine. Google Patents is the global patent search engine that lets users search through patents from the USPTO (United States Patent and Trademark Office), EPO (European Patent Office), etc. This study begins with an overview of how to use Google Patent and identifies advanced search techniques not well-documented by Google Patent. It makes several sug...

  18. A dynamic knowledge base based search engine

    Institute of Scientific and Technical Information of China (English)

    WANG Hui-jin; HU Hua; LI Qing

    2005-01-01

    Search engines have greatly helped us to find the desired information from the Internet. Most search engines use a keyword-matching technique. This paper discusses a Dynamic Knowledge Base based Search Engine (DKBSE), which can expand the user's query using the keywords' concept or meaning. To do this, the DKBSE needs to construct and maintain the knowledge base dynamically via the system's search results and the user's feedback information. The DKBSE expands the user's initial query using the knowledge base, and returns the information found with the expanded query.

  19. Database Search Engines: Paradigms, Challenges and Solutions.

    Science.gov (United States)

    Verheggen, Kenneth; Martens, Lennart; Berven, Frode S; Barsnes, Harald; Vaudel, Marc

    2016-01-01

    The first step in identifying proteins from mass spectrometry based shotgun proteomics data is to infer peptides from tandem mass spectra, a task generally achieved using database search engines. In this chapter, the basic principles of database search engines are introduced with a focus on open source software, and the use of database search engines is demonstrated using the freely available SearchGUI interface. This chapter also discusses how to tackle general issues related to sequence database searching and shows how to minimize their impact.

  20. Empirical Evidences in Citation-Based Search Engines: Is Microsoft Academic Search dead?

    OpenAIRE

    Orduna-Malea, Enrique; Ayllon, Juan Manuel; Martin-Martin, Alberto; Lopez-Cozar, Emilio Delgado

    2014-01-01

    The goal of this working paper is to summarize the main empirical evidence provided by the scientific community regarding the comparison between the two main citation-based academic search engines, Google Scholar and Microsoft Academic Search, paying special attention to the following issues: coverage, correlations between journal rankings, and usage of these academic search engines. Additionally, self-elaborated data are offered, which are intended to provide current evidence about the popul...

  1. Distributed search engine architecture based on topic specific searches

    Science.gov (United States)

    Abudaqqa, Yousra; Patel, Ahmed

    2015-05-01

    Indisputably, search engines (SEs) abound. The monumental growth in the number of users performing online searches on the Web is a pressing issue today. For example, tens of billions of searches are performed every day, and they typically return many irrelevant results, which is time-consuming and costly for the user. As a result, it has become a herculean task for existing Web SEs to provide complete, relevant and up-to-date responses to users' search queries. To overcome this problem, we developed the Distributed Search Engine Architecture (DSEA), a new means of smart information query and retrieval for the World Wide Web (WWW). In DSEA, multiple autonomous search engines, owned by different organizations or individuals, cooperate and act as a single search engine. This paper reports the development of DSEA based on topic-specific specialised search engines. In DSEA, the results for a specific query may be provided by any of the participating search engines, without the user being aware of which one. The important design goal of using topic-specific search engines is to build systems that can be used effectively by a larger number of users simultaneously. Efficient and effective usage with good response is important because it involves leveraging the vast amount of searched data from the World Wide Web by categorising it into condensed, focused, topic-specific results that meet the user's queries. The design model and the development of DSEA adopt a Service Directory (SD) to route queries towards topic-specific document-hosting SEs. It displays the most acceptable performance, which is consistent with the requirements of the users. The evaluation results of the model return a very high priority score, which is associated with each frequency of a keyword.

  2. Subject Gateway Sites and Search Engine Ranking.

    Science.gov (United States)

    Thelwall, Mike

    2002-01-01

    Discusses subject gateway sites and commercial search engines for the Web and presents an explanation of Google's PageRank algorithm. The principal question addressed is the conditions under which a gateway site will increase the likelihood that a target page is found in search engines. (LRW)
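
    For readers unfamiliar with PageRank, the following is a minimal power-iteration sketch of the standard published formulation, not Google's production algorithm and not code from the article. The damping factor of 0.85, the fixed iteration count, the handling of pages without outlinks, and the toy page names are assumptions made for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Compute PageRank by power iteration.

    links maps each page to the list of pages it links to.
    Pages without outlinks spread their rank evenly over all pages.
    """
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Every page receives the same baseline share from random jumps.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank
```

    In a toy graph where two pages link to a gateway and the gateway links to a target page, both the gateway and the target end up with a higher score than the pages that receive no links, which illustrates why a link from a well-linked gateway can raise a target page's visibility.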

  3. Human Flesh Search Engine and Online Privacy.

    Science.gov (United States)

    Zhang, Yang; Gao, Hong

    2016-04-01

    The human flesh search engine can be a double-edged sword, bringing convenience on the one hand and leading to infringement of personal privacy on the other. This paper discusses the ethical problems brought about by the human flesh search engine, as well as possible solutions.

  4. Arabic Stemmer for Search Engines Information Retrieval

    Directory of Open Access Journals (Sweden)

    Ahmed Khalid

    2016-01-01

    Full Text Available Arabic differs from other languages and has a more difficult structure because it is a very rich language with complex morphology. Many stemmers have been developed for Arabic, but many weaknesses and problems remain, and Arabic stemming is still rarely used in search engines. This paper introduces a rooted-word Arabic stemmer technique. The results of the introduced technique for six Arabic sentences are used in the famous search engines Google Chrome, Internet Explorer and Mozilla Firefox to check the effect of using Arabic stemming in these search engines in terms of the total number of searched pages and the search time ratio for the actual sentences and their stemming results. The results show that Arabic word stemming increases and accelerates the search engines' output.

  5. OncoSearch: cancer gene search engine with literature evidence.

    Science.gov (United States)

    Lee, Hee-Jin; Dang, Tien Cuong; Lee, Hyunju; Park, Jong C

    2014-07-01

    In order to identify genes that are involved in oncogenesis and to understand how such genes affect cancers, abnormal gene expressions in cancers are actively studied. For an efficient access to the results of such studies that are reported in biomedical literature, the relevant information is accumulated via text-mining tools and made available through the Web. However, current Web tools are not yet tailored enough to allow queries that specify how a cancer changes along with the change in gene expression level, which is an important piece of information to understand an involved gene's role in cancer progression or regression. OncoSearch is a Web-based engine that searches Medline abstracts for sentences that mention gene expression changes in cancers, with queries that specify (i) whether a gene expression level is up-regulated or down-regulated, (ii) whether a certain type of cancer progresses or regresses along with such gene expression change and (iii) the expected role of the gene in the cancer. OncoSearch is available through http://oncosearch.biopathway.org.

  6. Using Advanced Search Operators on Web Search Engines.

    Science.gov (United States)

    Jansen, Bernard J.

    Studies show that the majority of Web searchers enter extremely simple queries, so a reasonable system design approach would be to build search engines to compensate for this user characteristic. One hundred representative queries were selected from the transaction log of a major Web search service. These 100 queries were then modified using the…

  8. Search Engines for Tomorrow's Scholars

    Science.gov (United States)

    Fagan, Jody Condit

    2011-01-01

    Today's scholars face an outstanding array of choices when choosing search tools: Google Scholar, discipline-specific abstracts and index databases, library discovery tools, and more recently, Microsoft's re-launch of their academic search tool, now dubbed Microsoft Academic Search. What are these tools' strengths for the emerging needs of…

  9. Comparative analysis of some search engines

    Directory of Open Access Journals (Sweden)

    Taiwo O. Edosomwan

    2010-10-01

    Full Text Available We compared the information retrieval performances of some popular search engines (namely, Google, Yahoo, AlltheWeb, Gigablast, Zworks, AltaVista and Bing/MSN) in response to a list of ten queries, varying in complexity. These queries were run on each search engine and the precision and response time of the retrieved results were recorded. The first ten documents on each retrieval output were evaluated as being ‘relevant’ or ‘non-relevant’ for evaluation of the search engine’s precision. To evaluate response time, normalised recall ratios were calculated at various cut-off points for each query and search engine. This study shows that Google appears to be the best search engine in terms of both average precision (70%) and average response time (2 s). Gigablast and AlltheWeb performed the worst overall in this study.
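
    The precision measure described in this abstract (relevance judgments on the first ten results, averaged over queries) can be sketched as follows. This is an illustrative reading of the abstract, not the authors' evaluation script; the function names and the boolean encoding of judgments are assumptions.

```python
def precision_at_k(judgments, k=10):
    """Fraction of the first k results judged relevant.

    judgments is a list of booleans in rank order (True = relevant).
    Results missing from a shorter list count as non-relevant.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for judged in judgments[:k] if judged) / k


def average_precision_at_k(judgments_per_query, k=10):
    """Mean of precision@k over all queries run on one search engine."""
    if not judgments_per_query:
        raise ValueError("at least one query is required")
    total = sum(precision_at_k(judgments, k) for judgments in judgments_per_query)
    return total / len(judgments_per_query)
```

    With seven relevant documents among the first ten results, `precision_at_k` returns 0.7, the same scale as the 70% average precision reported above.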

  10. Google Patents: The global patent search engine

    Directory of Open Access Journals (Sweden)

    Alireza Noruzi

    2014-06-01

    Full Text Available Google Patents (www.google.com/patents) includes over 8 million full-text patents. Google Patents works in the same way as the Google search engine. Google Patents is the global patent search engine that lets users search through patents from the USPTO (United States Patent and Trademark Office), EPO (European Patent Office), etc. This study begins with an overview of how to use Google Patent and identifies advanced search techniques not well-documented by Google Patent. It makes several suggestions for improving Google Patents. This study also compares the citation counts provided by Google Patents for journals in the field of library and information science (LIS). Finally, it concludes that Google Patents provides a free alternative or complement to other patent databases. It also addresses the advantages of Google Patents, for example, an easy-to-use search interface and fast search engine; convenient access to patent images in PDF format; and fast downloads of PDF patent documents.

  11. Web Search Studies: Multidisciplinary Perspectives on Web Search Engines

    Science.gov (United States)

    Zimmer, Michael

    Perhaps the most significant tool of our internet age is the web search engine, providing a powerful interface for accessing the vast amount of information available on the world wide web and beyond. While still in its infancy compared to the knowledge tools that precede it - such as the dictionary or encyclopedia - the impact of web search engines on society and culture has already received considerable attention from a variety of academic disciplines and perspectives. This article aims to organize a meta-discipline of “web search studies,” centered around a nucleus of major research on web search engines from five key perspectives: technical foundations and evaluations; transaction log analyses; user studies; political, ethical, and cultural critiques; and legal and policy analyses.

  12. The Anatomy of Mitos Web Search Engine

    CERN Document Server

    Papadakos, Panagiotis; Theoharis, Yannis; Armenatzoglou, Nikos; Kopidaki, Stella; Marketakis, Yannis; Daskalakis, Manos; Karamaroudis, Kostas; Linardakis, Giorgos; Makrydakis, Giannis; Papathanasiou, Vangelis; Sardis, Lefteris; Tsialiamanis, Petros; Troullinou, Georgia; Vandikas, Kostas; Velegrakis, Dimitris; Tzitzikas, Yannis

    2008-01-01

    Engineering a Web search engine offering effective and efficient information retrieval is a challenging task. This document presents our experiences from designing and developing a Web search engine offering a wide spectrum of functionalities and we report some interesting experimental results. A rather peculiar design choice of the engine is that its index is based on a DBMS, while some of the distinctive functionalities that are offered include advanced Greek language stemming, real time result clustering, and advanced link analysis techniques (also for spam page detection).

  13. Real-time earthquake monitoring using a search engine method

    Science.gov (United States)

    Zhang, Jie; Zhang, Haijiang; Chen, Enhong; Zheng, Yi; Kuang, Wenhuan; Zhang, Xiong

    2014-12-01

    When an earthquake occurs, seismologists want to use recorded seismograms to infer its location, magnitude and source-focal mechanism as quickly as possible. If such information could be determined immediately, timely evacuations and emergency actions could be undertaken to mitigate earthquake damage. Current advanced methods can report the initial location and magnitude of an earthquake within a few seconds, but estimating the source-focal mechanism may require minutes to hours. Here we present an earthquake search engine, similar to a web search engine, that we developed by applying a computer fast search method to a large seismogram database to find waveforms that best fit the input data. Our method is several thousand times faster than an exact search. For an Mw 5.9 earthquake on 8 March 2012 in Xinjiang, China, the search engine can infer the earthquake’s parameters in <1 s after receiving the long-period surface wave data.

  14. Real-time earthquake monitoring using a search engine method.

    Science.gov (United States)

    Zhang, Jie; Zhang, Haijiang; Chen, Enhong; Zheng, Yi; Kuang, Wenhuan; Zhang, Xiong

    2014-12-04

    When an earthquake occurs, seismologists want to use recorded seismograms to infer its location, magnitude and source-focal mechanism as quickly as possible. If such information could be determined immediately, timely evacuations and emergency actions could be undertaken to mitigate earthquake damage. Current advanced methods can report the initial location and magnitude of an earthquake within a few seconds, but estimating the source-focal mechanism may require minutes to hours. Here we present an earthquake search engine, similar to a web search engine, that we developed by applying a computer fast search method to a large seismogram database to find waveforms that best fit the input data. Our method is several thousand times faster than an exact search. For an Mw 5.9 earthquake on 8 March 2012 in Xinjiang, China, the search engine can infer the earthquake's parameters in <1 s after receiving the long-period surface wave data.

  15. JPL Small Body Database Search Engine

    Data.gov (United States)

    National Aeronautics and Space Administration — Use this search engine to generate custom tables of orbital and/or physical parameters for all asteroids and comets (or a specified sub-set) in our small-body...

  16. Short-term Internet search using makes people rely on search engines when facing unknown issues.

    Science.gov (United States)

    Wang, Yifan; Wu, Lingdan; Luo, Liang; Zhang, Yifen; Dong, Guangheng

    2017-01-01

    Internet search engines, which have powerful search/sort functions and ease-of-use features, have become an indispensable tool for many individuals. The current study tests whether short-term Internet search training can make people more dependent on them. Thirty-one of forty subjects completed the search training study, which included a pre-test, six days of Internet search training, and a post-test. During the pre- and post-tests, subjects were asked to search online for the answers to 40 unusual questions, remember the answers and recall them in the scanner. Un-learned questions were randomly presented at the recall stage in order to elicit search impulse. Compared to the pre-test, subjects in the post-test reported a higher impulse to use search engines to answer un-learned questions. Consistently, subjects showed higher brain activations in the dorsolateral prefrontal cortex and anterior cingulate cortex in the post-test than in the pre-test. In addition, there were significant positive correlations between self-reported search impulse and brain responses in the frontal areas. The results suggest that a simple six-day Internet search training can make people dependent on search tools when facing unknown issues. People easily become dependent on Internet search engines.

  17. PRIVATE MOBILE SEARCH ENGINE USING VARIOUS LOCATIONS

    Directory of Open Access Journals (Sweden)

    Mayuri A. Auti

    2015-10-01

    Full Text Available The mobile search engine is a metasearch engine that captures the user's preferences in the form of concepts by mining their clickthrough data. It recognizes the importance of location information in mobile search and classifies these concepts into content concepts and location concepts. The user's locations, positioned by GPS, are used to supplement the location concepts in the search engine. The user preferences are organized in an ontology-based, multi-facet user profile, which is used to adapt a personalized ranking function for re-ranking search results. The mobile search engine characterizes the diversity of the concepts associated with a query and their relevance to the user's need. Four entropies are introduced to balance the weights between the content and location facets. Based on the client-server model, the paper presents a detailed architecture and design for implementation. In this design, the client collects and stores the clickthrough data locally to protect privacy. The design addresses the privacy issue by restricting the information in the user profile exposed to the server with two privacy parameters, minDistance and expRatio. The search engine is prototyped on the Google Android platform. It is an innovative approach to personalizing web search results. By mining content and location concepts for user profiling, it uses both the content and location preferences to personalize search results for a user. The mobile search engine incorporates a user's physical locations in the personalization process, and using GPS locations helps to improve retrieval effectiveness for location queries.

  18. Combining Search Engines for Comparative Proteomics

    Science.gov (United States)

    Tabb, David

    2012-01-01

    Many proteomics laboratories have found spectral counting to be an ideal way to recognize biomarkers that differentiate cohorts of samples. This approach assumes that proteins that differ in quantity between samples will generate different numbers of identifiable tandem mass spectra. Increasingly, researchers are employing multiple search engines to maximize the identifications generated from data collections. This talk evaluates four strategies to combine information from multiple search engines in comparative proteomics. The “Count Sum” model pools the spectra across search engines. The “Vote Counting” model combines the judgments from each search engine by protein. Two other models employ parametric and non-parametric analyses of protein-specific p-values from different search engines. We evaluated the four strategies in two different data sets. The ABRF iPRG 2009 study generated five LC-MS/MS analyses of “red” E. coli and five analyses of “yellow” E. coli. NCI CPTAC Study 6 generated five concentrations of Sigma UPS1 spiked into a yeast background. All data were identified with X!Tandem, Sequest, MyriMatch, and TagRecon. For both sample types, “Vote Counting” appeared to manage the diverse identification sets most effectively, yielding heightened discrimination as more search engines were added.

  19. The AXES-lite video search engine

    NARCIS (Netherlands)

    Chen, Shu; McGuinness, Kevin; Aly, Robin; Jong, de Franciska; O'Connor, Noel E.

    2012-01-01

    The aim of AXES is to develop tools that provide various types of users with new engaging ways to interact with audiovisual libraries, helping them discover, browse, navigate, search, and enrich archives. This paper describes the initial (lite) version of the AXES search engine, which is targeted at

  20. Music Search Engines: Specifications and Challenges

    DEFF Research Database (Denmark)

    Nanopoulos, Alexandros; Rafilidis, Dimitrios; Manolopoulos, Yannis

    2009-01-01

    Nowadays we have a proliferation of music data available over the Web. One of the imperative challenges is how to search these vast, global-scale musical resources to find preferred music. Recent research has envisaged the notion of music search engines (MSEs) that allow for searching preferred music over the Web. In this paper, we examine the growing research topic of MSEs, and provide potential specifications to follow and challenges to face.

  1. Automatic Planning of External Search Engine Optimization

    Directory of Open Access Journals (Sweden)

    Vita Jasevičiūtė

    2015-07-01

    Full Text Available This paper describes an investigation of an external search engine optimization (SEO) action planning tool, designed to automatically extract a small set of the most important keywords for each month over a whole year. The keywords in the set are extracted according to externally measured parameters, such as the average number of searches during the year and for every month individually. Additionally, the position of the optimized web site for each keyword is taken into account. The generated optimization plan is similar to the optimization plans prepared manually by SEO professionals and can be successfully used as a support tool for web site search engine optimization.

  2. Adding a visualization feature to web search engines: it's time.

    Science.gov (United States)

    Wong, Pak Chung

    2008-01-01

    It's widely recognized that all Web search engines today are almost identical in presentation layout and behavior. In fact, the same presentation approach has been applied to depicting search engine results pages (SERPs) since the first Web search engine launched in 1993. In this Visualization Viewpoints article, I propose to add a visualization feature to Web search engines and suggest that the new addition can improve search engines' performance and capabilities, which in turn leads to better Web search technology.

  3. Editorial: Link Spam and Search Engines

    Directory of Open Access Journals (Sweden)

    Alireza Noruzi

    2006-03-01

    Full Text Available The growing number of blogs has caused problems for search engines, problems such as the highly frequent blog spam. Spammers use blogs to promote their websites. Spammers are trying to win the attention of search engines, not of bloggers or their readers. Link spam dishonestly and deliberately manipulates link-based ranking algorithms of search engines like Google's PageRank to increase the rank of a web site or page so that it is placed as close to the top of search results as possible. A link-based ranking algorithm gives a higher ranking to a site that has many backlinks, especially from highly-ranked sites/pages.

  4. Making the road by searching - A search engine based on Swarm Information Foraging

    CERN Document Server

    Gayo-Avello, Daniel

    2009-01-01

    Search engines are nowadays one of the most important entry points for Internet users and a central tool to solve most of their information needs. Still, a substantial number of users' searches obtain unsatisfactory results. Needless to say, several lines of research aim to increase the relevancy of the results users retrieve. In this paper the authors frame this problem within the much broader (and older) one of information overload. They argue that users' dissatisfaction with search engines is a currently common manifestation of such a problem, and propose a different angle from which to tackle it. As will be discussed, their approach shares goals with a current hot research topic (namely, learning to rank for information retrieval) but, unlike the techniques commonly applied in that field, their technique cannot be exactly considered machine learning and, additionally, it can be used to change the search engine's response in real-time, driven by the users' behavior. Their proposal ...

  5. Scheduling in a Meta Search Engine by Genetic Algorithm

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    The meta search engines provide service to the users by dispensing the users' requests to the existing search engines. The existing search engines selected by meta search engine determine the searching quality. Because the performance of the existing search engines and the users' requests are changed dynamically, it is not favorable for the fixed search engines to optimize the holistic performance of the meta search engine. This paper applies the genetic algorithm (GA) to realize the scheduling strategy of agent manager in our meta search engine, GSE (general search engine), which can simulate the evolution process of living things more lively and more efficiently. By using GA, the combination of search engines can be optimized and hence the holistic performance of GSE can be improved dramatically.
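
    As a rough illustration of the idea, the sketch below uses a small genetic algorithm to choose which underlying engines a meta search engine should query. The abstract does not describe GSE's actual encoding, fitness function, or operators, so everything here is an assumption: each engine gets a hypothetical quality score and latency cost, a candidate is a bit mask over the engines, and fitness is the total quality of the selected engines provided their combined latency stays within a budget.

```python
import random


def fitness(mask, quality, cost, budget):
    """Total quality of the selected engines, or 0.0 if over the cost budget."""
    total_cost = sum(c for bit, c in zip(mask, cost) if bit)
    if total_cost > budget:
        return 0.0
    return sum(q for bit, q in zip(mask, quality) if bit)


def evolve(quality, cost, budget, pop_size=30, generations=40,
           mutation_rate=0.1, seed=0):
    """Return the best engine selection found (a list of 0/1 flags)."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    n = len(quality)           # assumes at least two engines

    def score(mask):
        return fitness(mask, quality, cost, budget)

    population = [[rng.randint(0, 1) for _ in range(n)]
                  for _ in range(pop_size)]
    best = max(population, key=score)
    for _ in range(generations):
        next_gen = [best[:]]  # elitism: keep the best candidate so far
        while len(next_gen) < pop_size:
            # Tournament selection: the fitter of three random candidates.
            parent1 = max(rng.sample(population, 3), key=score)
            parent2 = max(rng.sample(population, 3), key=score)
            # One-point crossover.
            cut = rng.randint(1, n - 1)
            child = parent1[:cut] + parent2[cut:]
            # Bit-flip mutation.
            child = [1 - b if rng.random() < mutation_rate else b
                     for b in child]
            next_gen.append(child)
        population = next_gen
        best = max(population, key=score)
    return best


# Hypothetical per-engine quality scores and latency costs.
quality = [0.9, 0.6, 0.4, 0.8]
latency = [0.5, 0.2, 0.1, 0.6]
print(evolve(quality, latency, budget=1.0))
```

    Because engine performance and user requests change over time, a real scheduler would presumably re-estimate the quality and cost values and re-run the search periodically; that part is left out of the sketch.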

  6. Comparison and Evaluation of Semantic Search Engines

    Directory of Open Access Journals (Sweden)

    Raheleh Dorri

    2015-02-01

    Full Text Available In this study, we evaluate the performance of five semantic search engines available on the web against 45 criteria, in the form of a researcher-made checklist. The checklist includes both common and semantic features: common features apply to all search engines, whereas semantic features apply only to semantic search engines. Findings show that the selected search engines do not deliver suitable performance or the expected efficiency. On regular features, DuckDuckGo has the most points, Cluuz is in second place with 20 points, and Hakia is third with 18 points, followed by Lexxe and Factbites with 15 and 10 points. On semantic features, DuckDuckGo is first with 10.65 points, Hakia second with 9.99 points, followed by Cluuz with 8.66 points, Lexxe with 8.65 points and Factbites with 7.32 points. Overall, considering both regular and semantic features, DuckDuckGo ranks highest with 31.65 points, followed by Cluuz with 28.66, Hakia with 27.99, Lexxe with 23.65 and Factbites with 17.32 points.

  7. Weblog Search Engine Based on Quality Criteria

    Directory of Open Access Journals (Sweden)

    F. Azimzadeh

    2011-01-01

    Full Text Available Nowadays, an increasing amount of human knowledge is placed in computerized repositories such as the World Wide Web. This gives rise to the problem of how to locate specific pieces of information in these often quite unstructured repositories. Search engines are the best solution. Some studies show that almost half of the traffic to blog servers comes from search engines. The more outgoing and informal social nature of the blogosphere opens the opportunity for exploiting more socially-oriented features. The nature of blogs, which are usually personal and informal, dynamic, and built on new relational links, requires new quality measures for blog search engines. Link analysis algorithms that exploit the Web graph may not work well in the blogosphere in general. Gonçalves et al. (2010) indicated that most of the popular blogs in the dataset (70%) have a PageRank value equal to -1, being thus almost invisible to the search engine. We expect that incorporating blog-specific quality criteria would lead to more desirable retrieval by search engines.

  8. Regulating Search Engines: Taking Stock And Looking Ahead

    OpenAIRE

    Gasser, Urs

    2006-01-01

    Since the creation of the first pre-Web Internet search engines in the early 1990s, search engines have become almost as important as email as a primary online activity. Arguably, search engines are among the most important gatekeepers in today's digitally networked environment. Thus, it does not come as a surprise that the evolution of search technology and the diffusion of search engines have been accompanied by a series of conflicts among stakeholders such as search operators, content crea...

  9. An Analysis of Chinese Search Engine Filtering

    CERN Document Server

    Zhu, Tao; Wallach, Dan S

    2011-01-01

    The imposition of government mandates upon Internet search engine operation is a growing area of interest for both computer science and public policy. Users of these search engines often observe evidence of censorship, but the government policies that impose this censorship are not generally public. To better understand these policies, we conducted a set of experiments on major search engines employed by Internet users in China, issuing queries against a variety of different words: some neutral, some with names of important people, some political, and some pornographic. We conducted these queries, in Chinese, against Baidu, Google (including google.cn, before it was terminated), Yahoo!, and Bing. We found remarkably aggressive filtering of pornographic terms, in some cases causing non-pornographic terms which use common characters to also be filtered. We also found that names of prominent activists and organizers as well as top political and military leaders, were also filtered in whole or in part. In some ca...

  10. Estimating Search Engine Index Size Variability

    DEFF Research Database (Denmark)

    Van den Bosch, Antal; Bogers, Toine; De Kunder, Maurice

    2016-01-01

    One of the determining factors of the quality of Web search engines is the size of their index. In addition to its influence on search result quality, the size of the indexed Web can also tell us something about which parts of the WWW are directly accessible to the everyday user. We propose a novel method of estimating the size of a Web search engine's index by extrapolating from document frequencies of words observed in a large static corpus of Web pages. In addition, we provide a unique longitudinal perspective on the size of Google and Bing's indices over a nine-year period, from March 2006 until January 2015. We find that index size estimates of these two search engines tend to vary dramatically over time, with Google generally possessing a larger index than Bing. This result raises doubts about the reliability of previous one-off estimates of the size of the indexed Web. We find...
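
    The extrapolation idea can be sketched as follows. If a word occurs in a known fraction of the documents in a static reference corpus, and a search engine reports some number of hits for that word, then dividing the hit count by that fraction gives one estimate of the engine's index size. The abstract gives no further detail, so the averaging over words, the function name, and all numbers below are assumptions for illustration only.

```python
from statistics import mean


def estimate_index_size(corpus_size, corpus_doc_freq, reported_hits):
    """Estimate an index size from per-word hit counts.

    corpus_size:      number of documents in the static reference corpus
    corpus_doc_freq:  word -> number of corpus documents containing the word
    reported_hits:    word -> hit count reported by the search engine
    """
    estimates = []
    for word, hits in reported_hits.items():
        doc_freq = corpus_doc_freq.get(word, 0)
        if doc_freq == 0:
            continue  # the word gives no usable frequency estimate
        # hits / (doc_freq / corpus_size), written to avoid tiny fractions
        estimates.append(hits * corpus_size / doc_freq)
    if not estimates:
        raise ValueError("no word has a usable corpus document frequency")
    return mean(estimates)


# Hypothetical numbers, not taken from the study.
corpus_doc_freq = {"the": 500_000, "search": 10_000}
reported_hits = {"the": 5_000_000_000, "search": 110_000_000}
print(estimate_index_size(1_000_000, corpus_doc_freq, reported_hits))
```

    Repeating such an estimate at regular intervals would yield a time series of the kind the abstract describes, and differences between per-word estimates give a sense of how noisy a single one-off estimate can be.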

  11. Correlation of Expert and Search Engine Rankings

    CERN Document Server

    Nelson, Michael L; Magudamudi, Manoranjan

    2008-01-01

    In previous research it has been shown that link-based web page metrics can be used to predict experts' assessment of quality. We are interested in a related question: do expert rankings of real-world entities correlate with search engine rankings of corresponding web resources? For example, each year US News & World Report publishes a list of (among others) top 50 graduate business schools. Does their expert ranking correlate with the search engine ranking of the URLs of those business schools? To answer this question we conducted 9 experiments using 8 expert rankings on a range of academic, athletic, financial and popular culture topics. We compared the expert rankings with the rankings in Google, Live Search (formerly MSN) and Yahoo (with list lengths of 10, 25, and 50). In 57 search engine vs. expert comparisons, only 1 strong and 4 moderate correlations were statistically significant. In 42 inter-search engine comparisons, only 2 strong and 4 moderate correlations were statistically significant. The ...
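
    One common way to compare two rankings of the same items is Spearman's rank correlation coefficient. The abstract does not say which correlation statistic the authors used, so the sketch below is only an assumed illustration; the school names are hypothetical placeholders. Items that appear in only one list are dropped, and the remaining items are re-ranked before comparison.

```python
def spearman_rho(expert_order, engine_order):
    """Spearman rank correlation between two orderings without ties.

    Both arguments list items from best to worst and contain no duplicates.
    Only items present in both lists are compared.
    """
    engine_set = set(engine_order)
    expert_set = set(expert_order)
    expert_common = [x for x in expert_order if x in engine_set]
    engine_common = [x for x in engine_order if x in expert_set]
    n = len(expert_common)
    if n < 2:
        raise ValueError("need at least two items in common")
    engine_rank = {item: r for r, item in enumerate(engine_common, start=1)}
    # Sum of squared rank differences over the shared items.
    d_squared = sum((r - engine_rank[item]) ** 2
                    for r, item in enumerate(expert_common, start=1))
    return 1.0 - 6.0 * d_squared / (n * (n * n - 1))


# Hypothetical expert ranking and search engine ranking of four schools.
expert = ["school_a", "school_b", "school_c", "school_d"]
engine = ["school_a", "school_b", "school_d", "school_c"]
print(spearman_rho(expert, engine))
```

    A value near 1 means the two rankings largely agree, a value near -1 means they are reversed, and a value near 0 means there is no monotonic relationship. Whether a given value is statistically significant depends on the list length, which is one reason the study varies list lengths.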

  12. Search engine optimization an hour a day

    CERN Document Server

    Grappone, Jennifer

    2011-01-01

    The third edition of the bestselling guide to do-it-yourself SEO. Getting seen on the first page of search engine result pages is crucial for businesses and online marketers. Search engine optimization helps improve Web site rankings, and it is often complex and confusing. This task-based, hands-on guide covers the concepts and trends and then lays out a day-by-day strategy for developing, managing, and measuring a successful SEO plan. With tools you can download and case histories to illustrate key points, it's the perfect solution for busy marketers, business owners, and others whose jobs in

  13. Weighting Relations Using Web Search Engine

    Science.gov (United States)

    Oka, Mizuki; Matsuo, Yutaka

    Measuring the weight of the relation between a pair of entities is necessary to use social networks for various purposes. Intuitively, a pair of entities has a stronger relation than another. It should therefore be weighted higher. We propose a method, using a Web search engine, to compute the weight of the relation existing between a pair of entities. Our method receives a pair of entities and various relations that exist between entities as input. It then outputs the weighted value for the pair of entities. The method explores how search engine results can be used as evidence for how strongly the two entities pertain to the relation.

  14. Evidence-based Medicine Search: a customizable federated search engine.

    Science.gov (United States)

    Bracke, Paul J; Howse, David K; Keim, Samuel M

    2008-04-01

    This paper reports on the development of a tool by the Arizona Health Sciences Library (AHSL) for searching clinical evidence that can be customized for different user groups. The AHSL provides services to the University of Arizona's (UA's) health sciences programs and to the University Medical Center. Librarians at AHSL collaborated with UA College of Medicine faculty to create an innovative search engine, Evidence-based Medicine (EBM) Search, that provides users with a simple search interface to EBM resources and presents results organized according to an evidence pyramid. EBM Search was developed with a web-based configuration component that allows the tool to be customized for different specialties. Informal and anecdotal feedback from physicians indicates that EBM Search is a useful tool with potential in teaching evidence-based decision making. While formal evaluation is still being planned, a tool such as EBM Search, which can be configured for specific user populations, may help lower barriers to information resources in an academic health sciences center.

  15. Relevance ranking for vertical search engines

    CERN Document Server

    Chang, Yi

    2014-01-01

    In plain, uncomplicated language, and using detailed examples to explain the key concepts, models, and algorithms in vertical search ranking, Relevance Ranking for Vertical Search Engines teaches readers how to manipulate ranking algorithms to achieve better results in real-world applications. This reference book for professionals covers concepts and theories from the fundamental to the advanced, such as relevance, query intention, location-based relevance ranking, and cross-property ranking. It covers the most recent developments in vertical search ranking applications, such as freshness-based relevance theory for new search applications, location-based relevance theory for local search applications, and cross-property ranking theory for applications involving multiple verticals. It introduces ranking algorithms and teaches readers how to manipulate ranking algorithms for the best results. It covers concepts and theories from the fundamental to the advanced. It discusses the state of the art: development of ...

  16. Reflections on New Search Engine 新型搜索引擎畅想

    OpenAIRE

    Huang, Jiannian

    2007-01-01

    [English abstract] The rapid growth in demand for Internet information resources has led to a rush of search engines. This article introduces some new types of search engines that are appearing or will appear. These include: grey document search engine, invisible web search engine, knowledge discovery search engine, clustering meta search engine, academic clustering search engine, conception comparison and conception analogy search engine, consultation search engine, teachi...

  17. Quality Dimensions of Internet Search Engines.

    Science.gov (United States)

    Xie, M.; Wang, H.; Goh, T. N.

    1998-01-01

    Reviews commonly used search engines (AltaVista, Excite, infoseek, Lycos, HotBot, WebCrawler), focusing on existing comparative studies; considers quality dimensions from the customer's point of view based on a SERVQUAL framework; and groups these quality expectations in five dimensions: tangibles, reliability, responsiveness, assurance, and…

  18. Research Trends with Cross Tabulation Search Engine

    Science.gov (United States)

    Yin, Chengjiu; Hirokawa, Sachio; Yau, Jane Yin-Kim; Hashimoto, Kiyota; Tabata, Yoshiyuki; Nakatoh, Tetsuya

    2013-01-01

    To help researchers in building a knowledge foundation of their research fields which could be a time-consuming process, the authors have developed a Cross Tabulation Search Engine (CTSE). Its purpose is to assist researchers in 1) conducting research surveys, 2) efficiently and effectively retrieving information (such as important researchers,…

  19. Visual search engine for product images

    Science.gov (United States)

    Lin, Xiaofan; Gokturk, Burak; Sumengen, Baris; Vu, Diem

    2008-01-01

    Nowadays there are many product comparison web sites, but most of them use only text information. This paper introduces a novel visual search engine for product images, which provides a brand-new way of visually locating products through Content-based Image Retrieval (CBIR) technology. We discuss the unique technical challenges, solutions, and experimental results in the design and implementation of this system.

  20. An Exploratory Survey of Student Perspectives Regarding Search Engines

    Science.gov (United States)

    Alshare, Khaled; Miller, Don; Wenger, James

    2005-01-01

    This study explored college students' perceptions regarding their use of search engines. The main objective was to determine how frequently students used various search engines, whether advanced search features were used, and how many search engines were used. Various factors that might influence student responses were examined. Results showed…

  1. The Use of Web Search Engines in Information Science Research.

    Science.gov (United States)

    Bar-Ilan, Judit

    2004-01-01

    Reviews the literature on the use of Web search engines in information science research, including: ways users interact with Web search engines; social aspects of searching; structure and dynamic nature of the Web; link analysis; other bibliometric applications; characterizing information on the Web; search engine evaluation and improvement; and…

  2. Multiple Presents: How Search Engines Re-write the Past

    NARCIS (Netherlands)

    Hellsten, I; Leydesdorff, L.; Wouters, P.

    2006-01-01

    Internet search engines function in a present which changes continuously. The search engines update their indices regularly, overwriting webpages with newer ones, adding new pages to the index and losing older ones. Some search engines can be used to search for information on the internet for specif

  4. Information retrieval implementing and evaluating search engines

    CERN Document Server

    Büttcher, Stefan; Cormack, Gordon V

    2016-01-01

    Information retrieval is the foundation for modern search engines. This textbook offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus -- a multiuser open-source information retrieval system developed by one of the authors and available online -- provides model implementations and a basis for student work. The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. In addition to its classroom use, Information Retrieval will be a valuable reference for professionals in computer science, computer engineering, and software engineering.

  5. Chemical Information in Scirus and BASE (Bielefeld Academic Search Engine)

    Science.gov (United States)

    Bendig, Regina B.

    2009-01-01

    The author sought to determine to what extent the two search engines, Scirus and BASE (Bielefeld Academic Search Engines), would be useful to first-year university students as the first point of searching for chemical information. Five topics were searched and the first ten records of each search result were evaluated with regard to the type of…

  6. Intelligent Semantic Web Search Engines: A Brief Survey

    CERN Document Server

    Madhu, G; Rajinikanth, Dr T V

    2011-01-01

    The World Wide Web (WWW) allows people to share information (data) from large database repositories globally. The amount of information is growing across billions of databases. To search this information we need specialized tools, known generically as search engines. Many search engines are available today, yet retrieving meaningful information remains difficult. To overcome this problem and retrieve meaningful information intelligently, semantic web technologies are playing a major role. In this paper we present a survey of search engine generations and the role of search engines in the intelligent web and semantic search technologies.

  7. Sexual information seeking on web search engines.

    Science.gov (United States)

    Spink, Amanda; Koricich, Andrew; Jansen, B J; Cole, Charles

    2004-02-01

    Sexual information seeking is an important element within human information behavior. Seeking sexually related information on the Internet takes many forms and channels, including chat rooms discussions, accessing Websites or searching Web search engines for sexual materials. The study of sexual Web queries provides insight into sexually-related information-seeking behavior, of value to Web users and providers alike. We qualitatively analyzed queries from logs of 1,025,910 Alta Vista and AlltheWeb.com Web user queries from 2001. We compared the differences in sexually-related Web searching between Alta Vista and AlltheWeb.com users. Differences were found in session duration, query outcomes, and search term choices. Implications of the findings for sexual information seeking are discussed.

  8. Finding Business Information on the "Invisible Web": Search Utilities vs. Conventional Search Engines.

    Science.gov (United States)

    Darrah, Brenda

    Researchers for small businesses, which may have no access to expensive databases or market research reports, must often rely on information found on the Internet, which can be difficult to find. Although current conventional Internet search engines are now able to index over one billion documents, there are many more documents existing in…

  9. LAILAPS: the plant science search engine.

    Science.gov (United States)

    Esch, Maria; Chen, Jinbo; Colmsee, Christian; Klapperstück, Matthias; Grafahrend-Belau, Eva; Scholz, Uwe; Lange, Matthias

    2015-01-01

    With the number of sequenced plant genomes growing, the number of predicted genes and functional annotations is also increasing. The association between genes and phenotypic traits is currently of great interest. Unfortunately, the information available today is widely scattered over a number of different databases. Information retrieval (IR) has become an all-encompassing bioinformatics methodology for extracting knowledge from complex, heterogeneous and distributed databases, and therefore can be a useful tool for obtaining a comprehensive view of plant genomics, from genes to traits. Here we describe LAILAPS (http://lailaps.ipk-gatersleben.de), an IR system designed to link plant genomic data in the context of phenotypic attributes for a detailed forward genetic research. LAILAPS comprises around 65 million indexed documents, encompassing >13 major life science databases with around 80 million links to plant genomic resources. The LAILAPS search engine allows fuzzy querying for candidate genes linked to specific traits over a loosely integrated system of indexed and interlinked genome databases. Query assistance and an evidence-based annotation system enable time-efficient and comprehensive information retrieval. An artificial neural network incorporating user feedback and behavior tracking allows relevance sorting of results. We fully describe LAILAPS's functionality and capabilities by comparing this system's performance with other widely used systems and by reporting both a validation in maize and a knowledge discovery use-case focusing on candidate genes in barley.

  10. Performance Oriented Query Processing In GEO Based Location Search Engines

    CERN Document Server

    Umamaheswari, M

    2010-01-01

    Geographic location search engines allow users to constrain and order search results in an intuitive manner by focusing a query on a particular geographic region. Geographic search technology, also called location search, has recently received significant interest from major search engine companies. Academic research in this area has focused primarily on techniques for extracting geographic knowledge from the web. In this paper, we study the problem of efficient query processing in scalable geographic search engines. Query processing is a major bottleneck in standard web search engines, and the main reason for the thousands of machines used by the major engines. Geographic search engine query processing is different in that it requires a combination of text and spatial data processing techniques. We propose several algorithms for efficient query processing in geographic search engines, integrate them into an existing web search query processor, and evaluate them on large sets of real data and query traces.

  11. An Ontology Based Personalised Mobile Search Engine

    Directory of Open Access Journals (Sweden)

    Mrs. Rashmi A. Jolhe

    2014-02-01

    Full Text Available As the amount of Web information grows rapidly, search engines must be able to retrieve information according to the user's preference. In this paper, we propose an Ontology Based Personalised Mobile Search Engine (OBPMSE) that captures the user's interests and preferences in the form of concepts by mining search results and their clickthroughs. OBPMSE profiles the user's interests and personalises the search results according to the user's profile. OBPMSE classifies these concepts into content concepts and location concepts. In addition, users' locations (positioned by GPS) are used to supplement the location concepts in OBPMSE. The user preferences are organized in an ontology-based, multifacet user profile, which is used to adapt a personalized ranking function, which in turn is used for rank adaptation of future search results. We propose to define personalization effectiveness based on the entropies and use it to balance the weights between the content and location facets. In our design, the client collects and stores the clickthrough data locally to protect privacy, whereas heavy tasks such as concept extraction, training, and reranking are performed at the OBPMSE server. OBPMSE provides a client-server architecture and distributes the tasks to individual components to decrease complexity.

  12. Thumbnail Images: Uncertainties, Infrastructures and Search Engines

    DEFF Research Database (Denmark)

    Thylstrup, Nanna; Teilmann, Stina

    2017-01-01

    This article argues that thumbnail images are infrastructural images that raise issues of uncertainty in two distinct, but interrelated, areas: a legal question of how to define, understand and govern visual information infrastructures, in particular image search systems in epistemological and strategic terms; and a cultural question of how human-computer interaction design works with navigational uncertainty, both as an experience to be managed and a resource to be exploited. This paper considers two copyright infringement cases that involved search engines as defendants, Kelly v. Arriba Soft ... been negotiated in legal terms, its cultural infrastructures, and the information behaviours they are designed to produce.

  13. Information retrieval for education: making search engines language aware

    Directory of Open Access Journals (Sweden)

    Niels Ott

    2010-01-01

    Full Text Available Search engines have been a major factor in making the web the successful and widely used information source it is today. Generally speaking, they make it possible to retrieve web pages on a topic specified by the keywords entered by the user. Yet web searching currently does not take into account which of the search results are comprehensible for a given user – an issue of particular relevance when considering students in an educational setting. And current search engines do not support teachers in searching for language properties relevant for selecting texts appropriate for language students at different stages in the second language acquisition process. At the same time, raising language awareness is a major focus in second language acquisition research and foreign language teaching practice, and research since the 20s has tried to identify indicators predicting which texts are comprehensible for readers at a particular level of ability. For example, the military has been interested in ensuring that workers at a given level of education can understand the manuals they need to read in order to perform their job. We present a new search engine approach which makes it possible for teachers to search for texts both in terms of contents and in terms of their reading difficulty and other language properties. The implemented prototype builds on state-of-the-art information retrieval technology and exemplifies how a range of readability measures can be integrated in a modular fashion.
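
    To make the notion of modular readability measures concrete, here is a minimal sketch in which each measure is a plain function registered under a name, so measures can be added or removed independently. The abstract does not list the measures the prototype uses; the Flesch Reading Ease formula is used here only as a well-known example, and the syllable counter is a crude vowel-group heuristic assumed for illustration.

```python
import re


def split_sentences(text):
    """Split on sentence-ending punctuation and drop empty fragments."""
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]


def split_words(text):
    return re.findall(r"[A-Za-z']+", text)


def count_syllables(word):
    """Rough heuristic: count groups of consecutive vowels, at least one."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def avg_sentence_length(text):
    sentences = split_sentences(text)
    words = split_words(text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and word")
    return len(words) / len(sentences)


def flesch_reading_ease(text):
    """Flesch Reading Ease; higher scores indicate easier text."""
    words = split_words(text)
    syllables = sum(count_syllables(w) for w in words)
    return (206.835
            - 1.015 * avg_sentence_length(text)
            - 84.6 * (syllables / len(words)))


# Registry of measures; adding a measure means adding one entry here.
MEASURES = {
    "flesch_reading_ease": flesch_reading_ease,
    "avg_sentence_length": avg_sentence_length,
}


def score_text(text, measures=MEASURES):
    """Apply every registered measure to the text."""
    return {name: measure(text) for name, measure in measures.items()}


print(score_text("The cat sat. The dog ran."))
```

    A search engine could compute such scores at indexing time and store them alongside each document, so that a teacher's query can filter or sort results by reading difficulty as well as by topic.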

  14. A Theoretical and Empirical Evaluation of Software Component Search Engines, Semantic Search Engines and Google Search Engine in the Context of COTS-Based Development

    CERN Document Server

    Yanes, Nacim; Ghezala, Henda Hajjami Ben

    2012-01-01

    COTS-based development is a component reuse approach promising to reduce costs and risks, and ensure higher quality. The growing availability of COTS components on the Web has concretized the possibility of achieving these objectives. In this multitude, a recurrent problem is the identification of the COTS components that best satisfy the user requirements. Finding an adequate COTS component implies searching among heterogeneous descriptions of the components within a broad search space. Thus, search engines are required to make COTS component identification more efficient. In this paper, we investigate, theoretically and empirically, the COTS component search performance of eight software component search engines, nine semantic search engines and a conventional search engine (Google). Our empirical evaluation is conducted with respect to precision and normalized recall. We defined ten queries for the assessed search engines. These queries were carefully selected to evaluate the capability of e...
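
    The two evaluation measures named above can be sketched as follows. The abstract does not give the formulas used in the study, so this is an assumption: precision is computed over the top k results, and normalized recall uses the classic rank-based form that compares the ranks of the relevant results with the ideal case where they all appear first. The document identifiers are hypothetical.

```python
def precision_at_k(ranked, relevant, k):
    """Fraction of the top-k results that are relevant."""
    top = ranked[:k]
    if not top:
        return 0.0
    return sum(1 for doc in top if doc in relevant) / len(top)


def normalized_recall(ranked, relevant):
    """Rank-based normalized recall.

    1.0 when all relevant results are ranked first, 0.0 when they are all
    ranked last. Assumes every relevant item appears in the ranked list.
    """
    total = len(ranked)
    ranks = [pos for pos, doc in enumerate(ranked, start=1)
             if doc in relevant]
    n = len(ranks)
    if n == 0 or n == total:
        raise ValueError("need at least one relevant and one irrelevant result")
    ideal = n * (n + 1) // 2  # sum of ranks 1..n, the best possible case
    return 1.0 - (sum(ranks) - ideal) / (n * (total - n))


# Hypothetical result list for one query.
ranked = ["d1", "d2", "d3", "d4", "d5"]
relevant = {"d1", "d3"}
print(precision_at_k(ranked, relevant, 2))
print(normalized_recall(ranked, relevant))
```

    Averaging both measures over all queries gives one score pair per engine, which is one straightforward way to compare engines of different kinds on the same query set.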

  15. A Semantic Query Transformation Approach Based on Ontology for Search Engine

    Directory of Open Access Journals (Sweden)

    SAJENDRA KUMAR

    2012-05-01

    Full Text Available These days we use popular web search engines, such as Google, Yahoo!, and Live Search, for information retrieval in all areas to obtain initial helpful information. The information retrieved via a search engine may not be relevant to the search target in the user's mind. When users do not find relevant information, they have to shortlist the results. These search engines use a traditional search service based on "static keywords", which requires the users to type in the exact keywords. This approach clearly puts the users in the critical situation of guessing the exact keyword. The users may want to define their search by using attributes of the search target. But the relevancy of results in most cases may not be satisfactory, and the users may not be patient enough to browse through the complete list of pages to get a relevant result. The reason behind this is that search engines perform search based on syntax, not on semantics. They seem to be less efficient at understanding the relationship between the keywords, which has an adverse effect on the results they produce. Semantic search engines, which return concepts rather than documents matching the user query, are the only solution to this. In this paper we propose a semantic query interface that creates a semantic query from the user's input query, and we study current semantic search engine techniques for semantic search.

  16. An open-source, mobile-friendly search engine for public medical knowledge.

    Science.gov (United States)

    Samwald, Matthias; Hanbury, Allan

    2014-01-01

    The World Wide Web has become an important source of information for medical practitioners. To complement the capabilities of currently available web search engines we developed FindMeEvidence, an open-source, mobile-friendly medical search engine. In a preliminary evaluation, the quality of results from FindMeEvidence proved to be competitive with those from TRIP Database, an established, closed-source search engine for evidence-based medicine.

  17. The end of meta search engines in Europe?

    NARCIS (Netherlands)

    Husovec, Martin

    2015-01-01

    The technology behind meta search engines supports countless Internet services, ranging from price and quality comparison websites to more sophisticated traffic connection finders and general search engines like Google. Meta search engines generally increase market transparency, int

  18. Text Retrieval Online: Historical Perspective on Web Search Engines.

    Science.gov (United States)

    Hahn, Trudi Bellardo

    1998-01-01

    Provides an overview of online systems and search engines, highlighting search (relationships between terms and interpretation of words), browse, and Web search engine capabilities, iterative searches, canned or stored queries, vocabulary browsing, delivery of full source documents, simple and advanced user interfaces, and global access. Notes…

  19. Can an Ad-hoc ontology Beat a Medical Search Engine? The Chronious Search Engine case

    CERN Document Server

    Giacomelli, Piero; Rosso, Roberto

    2012-01-01

    Chronious is an Open, Ubiquitous and Adaptive Chronic Disease Management Platform for Chronic Obstructive Pulmonary Disease (COPD), Chronic Kidney Disease (CKD) and Renal Insufficiency. It consists of several modules: an ontology-based literature search engine, a rule-based decision support system, remote sensors interacting with lifestyle interfaces (PDA, monitor touch-screen) and a machine learning module. All these modules interact with each other to allow the monitoring of two types of chronic diseases and to help clinicians in taking decisions for care purposes. This paper illustrates how the ontology search engine was created and fed and how some comparative tests indicated that the ontology-based approach gives better results, on some estimation parameters, than the main reference web search engine.

  20. Semantic Clustering of Search Engine Results.

    Science.gov (United States)

    Soliman, Sara Saad; El-Sayed, Maged F; Hassan, Yasser F

    2015-01-01

    This paper presents a novel approach for search engine results clustering that relies on the semantics of the retrieved documents rather than the terms in those documents. The proposed approach takes into consideration both lexical and semantic similarities among documents and applies an activation spreading technique in order to generate semantically meaningful clusters. This approach allows documents that are semantically similar to be clustered together rather than clustering documents based on similar terms. A prototype is implemented and several experiments are conducted to test the proposed solution. The results of the experiments confirmed that the proposed solution achieves remarkable results in terms of precision.

  1. Semantic Clustering of Search Engine Results

    Directory of Open Access Journals (Sweden)

    Sara Saad Soliman

    2015-01-01

    Full Text Available This paper presents a novel approach for search engine results clustering that relies on the semantics of the retrieved documents rather than the terms in those documents. The proposed approach takes into consideration both lexical and semantic similarities among documents and applies an activation spreading technique in order to generate semantically meaningful clusters. This approach allows documents that are semantically similar to be clustered together rather than clustering documents based on similar terms. A prototype is implemented and several experiments are conducted to test the proposed solution. The results of the experiments confirmed that the proposed solution achieves remarkable results in terms of precision.

  2. Towards two-dimensional search engines

    OpenAIRE

    Ermann, Leonardo; Chepelianskii, Alexei D.; Shepelyansky, Dima L.

    2011-01-01

    We study the statistical properties of various directed networks using ranking of their nodes based on the dominant vectors of the Google matrix known as PageRank and CheiRank. On average PageRank orders nodes proportionally to the number of ingoing links, while CheiRank orders nodes proportionally to the number of outgoing links. In this way the ranking of nodes becomes two-dimensional, which paves the way for the development of two-dimensional search engines of a new type. Statistical properties of inf...

  3. Shifts in search engine development: A review of past, present and future trends in research on search engines

    Directory of Open Access Journals (Sweden)

    Hamid R. Jamali

    2004-12-01

    Full Text Available The World Wide Web has developed fast and many people use search engines to capture information from the Web. This article reviews past, present and future of search engines. Papers published in four major Web and information management conferences were surveyed to track research interests in the last five years. Web search and information retrieval topics such as ranking, filtering and query formulation are still hot topics among researchers. The most important shifts and issues of the future of search engines are mentioned too. Search engine companies are trying to capture the Deep Web and extract structured data to offer high quality results. Using Web page structure, shared search engines, expert recommendations and different mobile search facilities seem to be features of the next generation of search engines.

  4. Full Elastic Waveform Search Engine for Near Surface Imaging

    Science.gov (United States)

    Zhang, J.; Zhang, X.

    2014-12-01

    For processing land seismic data, the near-surface problem is often very complex and may severely affect our capability to image the subsurface. The current state-of-the-art technology for near-surface imaging is early arrival waveform inversion, which solves an acoustic wave-equation problem. However, fitting land seismic data with an acoustic wavefield is sometimes invalid. On the other hand, performing elastic waveform inversion is very time-consuming. Similar to a web search engine, we develop a full elastic waveform search engine that includes a large database of synthetic elastic waveforms accounting for a wide range of interval velocity models in the CMP domain. With each CMP gather of real data as an entry, the search engine applies the Multiple-Randomized K-Dimensional (MRKD) tree method to find approximate best matches to the entry in about a second. Interpolation of the velocity models at CMP positions creates 2D or 3D Vp, Vs, and density models for the near-surface area. The method does not just return one solution; it gives a series of best matches in a solution space. Therefore, the results can help us to examine the resolution and nonuniqueness of the final solution. Further, this full waveform search method can avoid the issues of initial model and cycle skipping that full waveform inversion has difficulty dealing with.

  5. VALIDATING THE PERFORMANCE OF PERSONALIZATION TECHNIQUES IN SEARCH ENGINE

    Directory of Open Access Journals (Sweden)

    A. Suruliandi

    2015-04-01

    Full Text Available User profiling is an important and basic component of a personalized search engine. Search engines respond to a user’s query by using the bag-of-words model, which matches keywords between the query and web documents but ignores context and users’ preferences. Personalized search greatly improves the search results compared with the results provided by a search engine without personalization. In this paper, the performance of personalized search based on content analysis and of personalized search based on user group is evaluated. In personalized search based on content analysis, the contents are traced by finding the user’s browsed documents and search history, which reduces the user’s search time. In the user profile, only the user’s preference is taken into consideration. The experimental results show that the personalized search based on user group has higher precision and recall than the content analysis method.
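
    The bag-of-words matching and the precision and recall measures mentioned in this abstract can be illustrated with a minimal sketch. The function names, sample texts and relevance judgments below are hypothetical and are not taken from the paper; the scoring is plain term counting, not the authors' personalization method.

```python
from collections import Counter


def bag_of_words(text):
    """Represent a text as word counts, discarding word order and context."""
    return Counter(text.lower().split())


def keyword_score(query, document):
    """Sum the document's counts for each distinct query term."""
    query_terms = bag_of_words(query)
    doc_terms = bag_of_words(document)
    return sum(doc_terms[term] for term in query_terms)


def precision_recall(retrieved, relevant):
    """Return (precision, recall) for retrieved ids against relevant ids."""
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


# Hypothetical documents: the second one wins on raw counts even though
# a user interested in animals would prefer the first.
animal_page = "the jaguar is a fast cat with high speed"
car_page = "jaguar cars top speed jaguar"
print(keyword_score("jaguar speed", animal_page))
print(keyword_score("jaguar speed", car_page))

# Hypothetical relevance judgments for one query.
p, r = precision_recall(["d1", "d2", "d3", "d4"], {"d1", "d3", "d7"})
print(f"precision={p:.2f} recall={r:.2f}")
```

    The example shows why keyword counts alone cannot reflect a user's preference: both pages match the query, and only information outside the query (such as a profile) could separate them.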

  6. Mashup Based Content Search Engine for Mobile Devices

    Directory of Open Access Journals (Sweden)

    Kohei Arai

    2013-05-01

    Full Text Available A mashup-based content search engine for mobile devices is proposed. An example of the proposed search engine is implemented with Yahoo!JAPAN Web SearchAPI, Yahoo!JAPAN Image searchAPI, YouTube Data API, and Amazon Product Advertising API. The retrieved results are also merged and linked to each other. Therefore, different types of content can be referred to once an e-learning content item is retrieved. The implemented search engine is evaluated with 20 students. The results show usefulness and effectiveness for e-learning content searches with a variety of content types: image, document, PDF files, and moving picture.

  7. Search Engine Advertising Effectiveness in a Multimedia Campaign

    NARCIS (Netherlands)

    Zenetti, German; Bijmolt, Tammo H. A.; Leeflang, Peter S. H.; Klapper, Daniel

    2014-01-01

    Search engine advertising has become a multibillion-dollar business and one of the dominant forms of advertising on the Internet. This study examines the effectiveness of search engine advertising within a multimedia campaign, with explicit consideration of the interaction effects between search eng

  8. Search Engine Advertising Effectiveness in a Multimedia Campaign

    NARCIS (Netherlands)

    Zenetti, German; Bijmolt, Tammo H. A.; Leeflang, Peter S. H.; Klapper, Daniel

    2014-01-01

    Search engine advertising has become a multibillion-dollar business and one of the dominant forms of advertising on the Internet. This study examines the effectiveness of search engine advertising within a multimedia campaign, with explicit consideration of the interaction effects between search eng

  9. The effective use of search engines on the Internet.

    Science.gov (United States)

    Younger, P

    This article explains how nurses can get the most out of researching information on the internet using the search engine Google. It also explores some of the other types of search engines that are available. Internet users are shown how to find text, images and reports and search within sites. Copyright issues are also discussed.

  10. Design and Implementation of a Simple Web Search Engine

    CERN Document Server

    Mirzal, Andri

    2011-01-01

    We present a simple web search engine for indexing and searching html documents using python programming language. Because python is well known for its simple syntax and strong support for main operating systems, we hope it will be beneficial for learning information retrieval techniques, especially web search engine technology.
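
    The core of such an engine, indexing HTML documents and answering keyword queries, can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names and sample pages are hypothetical, tags are stripped with a crude regular expression, and queries use boolean AND without ranking.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def strip_tags(html):
    """Crudely remove HTML tags; a real indexer would use a proper parser."""
    return re.sub(r"<[^>]+>", " ", html)


def build_index(docs):
    """Build an inverted index mapping each term to the ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, html in docs.items():
        for term in tokenize(strip_tags(html)):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return sorted ids of documents that contain every query term."""
    terms = tokenize(query)
    if not terms:
        return []
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return sorted(result)


# Hypothetical sample pages, not taken from the paper.
docs = {
    "a.html": "<html><body><h1>Python search</h1><p>A simple engine.</p></body></html>",
    "b.html": "<html><body><p>Indexing HTML documents with Python.</p></body></html>",
}
index = build_index(docs)
print(search(index, "python"))
print(search(index, "simple engine"))
```

    An inverted index is used here because it lets a query touch only the posting sets of its own terms instead of scanning every document.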

  11. Search Engine Advertising Effectiveness in a Multimedia Campaign

    NARCIS (Netherlands)

    Zenetti, German; Bijmolt, Tammo H. A.; Leeflang, Peter S. H.; Klapper, Daniel

    2014-01-01

    Search engine advertising has become a multibillion-dollar business and one of the dominant forms of advertising on the Internet. This study examines the effectiveness of search engine advertising within a multimedia campaign, with explicit consideration of the interaction effects between search

  12. People searching for people: analysis of a people search engine log

    NARCIS (Netherlands)

    Weerkamp, W.; Berendsen, R.; Kovachev, B.; Meij, E.; Balog, K.; de Rijke, M.

    2011-01-01

    Recent years show an increasing interest in vertical search: searching within a particular type of information. Understanding what people search for in these "verticals" gives direction to research and provides pointers for the search engines themselves. In this paper we analyze the search logs of o

  13. An Empirical Analysis of Search Engine Advertising: Sponsored Search in Electronic Markets

    OpenAIRE

    Anindya Ghose; Sha Yang

    2009-01-01

    The phenomenon of sponsored search advertising--where advertisers pay a fee to Internet search engines to be displayed alongside organic (nonsponsored) Web search results--is gaining ground as the largest source of revenues for search engines. Using a unique six-month panel data set of several hundred keywords collected from a large nationwide retailer that advertises on Google, we empirically model the relationship between different sponsored search metrics such as click-through rates, conve...

  14. Search Engine Optimization through Spanning Forest Generation Algorithm

    Directory of Open Access Journals (Sweden)

    SATYA PAVAN KUMAR SOMAYAJULA

    2011-09-01

    Full Text Available Search engine technology has had to scale dramatically to keep up with the growth of the web. With the tremendous growth of information available to end users through the Web, search engines come to play an ever more critical role. Determining the user intent of Web searches is a difficult problem due to the sparse data available concerning the searcher. We qualitatively analyze samples of queries from seven transaction logs from three different Web search engines containing more than five million queries. The following are our research objectives: Isolate characteristics of informational, navigational, and transactional Web search queries by identifying characteristics of each query type that will lead to real-world classification. Validate the taxonomy by automatically classifying a large set of queries from a Web search engine. In this paper we deal with semantic web search engines and their layered architecture, which we use with a relation-based page rank algorithm.

  15. Multiple Presents: How Search Engines Re-write the Past

    CERN Document Server

    Hellsten, Iina; Wouters, Paul

    2009-01-01

    Internet search engines function in a present which changes continuously. The search engines update their indices regularly, overwriting Web pages with newer ones, adding new pages to the index, and losing older ones. Some search engines can be used to search for information on the internet for specific periods of time. However, these 'date stamps' are not determined by the first occurrence of the pages in the Web, but by the last date at which a page was updated or a new page was added, and the search engine's crawler updated this change in the database. This has major implications for the use of search engines in scholarly research as well as theoretical implications for the conceptions of time and temporality. We examine the interplay between the different updating frequencies by using AltaVista and Google for searches at different moments of time. Both the retrieval of the results and the structure of the retrieved information erode over time.

  16. Software engineering the current practice

    CERN Document Server

    Rajlich, Vaclav

    2011-01-01

    INTRODUCTION History of Software Engineering; Software Properties; Origins of Software; Birth of Software Engineering; Third Paradigm: Iterative Approach; Software Life Span Models; Staged Model; Variants of Staged Model; Software Technologies Programming Languages and Compilers; Object-Oriented Technology; Version Control System; Software Models; Class Diagrams; UML Activity Diagrams; Class Dependency Graphs and Contracts; SOFTWARE CHANGE Introduction to Software Change; Characteristics of Software Change; Phases of Software Change; Requirements and Their Elicitation; Requirements Analysis and Change Initiation; Concepts and Concept

  17. A Case Study of Search Engine on World Wide Web for Chemical Fiber Engineering

    Institute of Scientific and Technical Information of China (English)

    张利; 邵世煌; 曾献辉; 尹美华

    2001-01-01

    A search engine is an effective approach to promote the service quality of the World Wide Web. Based on an analysis of search engines at home and abroad, the developing principle of search engines is given according to the requirements of Web information for chemical fiber engineering. The implementation method for the communication and dynamic refreshment of information on the home page of the search engine is elaborated using the programming technology of Active Server Page 3.0 (ASP3.0). The query of chemical fiber information and automatic linking of chemical fiber Web sites can be easily realized by the developed search engine in an Internet environment according to users' requirements.

  18. Getting Off the Beaten Track: Specialized Web Search Engines.

    Science.gov (United States)

    Sullivan, Danny

    1998-01-01

    Describes specialty or vertical Web search engines that may provide more relevant results for information retrieval. Highlights include regional services, including filtering by domain and custom crawling; language searching; family-safe listings, including the pros and cons of filtering; news searches; and subject-oriented searching. (LRW)

  19. Utilization of a radiology-centric search engine.

    Science.gov (United States)

    Sharpe, Richard E; Sharpe, Megan; Siegel, Eliot; Siddiqui, Khan

    2010-04-01

    Internet-based search engines have become a significant component of medical practice. Physicians increasingly rely on information available from search engines as a means to improve patient care, provide better education, and enhance research. Specialized search engines have emerged to more efficiently meet the needs of physicians. Details about the ways in which radiologists utilize search engines have not been documented. The authors categorized every 25th search query in a radiology-centric vertical search engine by radiologic subspecialty, imaging modality, geographic location of access, time of day, use of abbreviations, misspellings, and search language. Musculoskeletal and neurologic imagings were the most frequently searched subspecialties. The least frequently searched were breast imaging, pediatric imaging, and nuclear medicine. Magnetic resonance imaging and computed tomography were the most frequently searched modalities. A majority of searches were initiated in North America, but all continents were represented. Searches occurred 24 h/day in converted local times, with a majority occurring during the normal business day. Misspellings and abbreviations were common. Almost all searches were performed in English. Search engine utilization trends are likely to mirror trends in diagnostic imaging in the region from which searches originate. Internet searching appears to function as a real-time clinical decision-making tool, a research tool, and an educational resource. A more thorough understanding of search utilization patterns can be obtained by analyzing phrases as actually entered as well as the geographic location and time of origination. This knowledge may contribute to the development of more efficient and personalized search engines.

  20. Location-Based Search Engines Tasks and Capabilities: A Comparative Study

    Directory of Open Access Journals (Sweden)

    Hossein Vakili Mofrad

    2007-12-01

    Full Text Available Location-based web searching is one of the popular tasks expected from the search engines. A location-based query consists of a topic and a reference location. Unlike general web search, in location-based search it is expected to find and rank documents which are not only related to the query topic but also geographically related to the location which the query is associated with. There are several issues for developing effective geographic search engines and so far, no global location-based search engine has been reported. Location ambiguity, lack of geographic information on web pages, language-based and country-dependent addressing styles, and multiple locations related to a single web resource are notable difficulties. Search engine companies have started to develop and offer location-based services. However, they are still geographically limited and have not become as successful and popular as general search engines. This paper reviews the architecture and tasks of location-based search engines and compares the capabilities, functionalities and coverage of the current geographic search engines with a user-oriented approach.
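
    A location-based query, a topic plus a reference location, can be scored by combining topical match with geographic proximity. The sketch below is an assumption-laden illustration and not a method from the paper: it presumes each document already carries coordinates (which the abstract notes is often not the case), uses the haversine great-circle distance, and combines the two signals with a hypothetical `scale_km` decay parameter.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def rank_by_topic_and_location(query_terms, ref_lat, ref_lon, docs, scale_km=50.0):
    """Rank documents by (fraction of query terms matched) x (distance decay).

    docs is a list of dicts with 'id', 'text', 'lat' and 'lon' keys.
    Documents matching no query term are dropped.
    """
    wanted = {term.lower() for term in query_terms}
    scored = []
    for doc in docs:
        words = set(doc["text"].lower().split())
        topical = len(wanted & words) / len(wanted) if wanted else 0.0
        if topical == 0.0:
            continue
        distance = haversine_km(ref_lat, ref_lon, doc["lat"], doc["lon"])
        proximity = 1.0 / (1.0 + distance / scale_km)  # assumed decay, not from the paper
        scored.append((topical * proximity, doc["id"]))
    return [doc_id for _, doc_id in sorted(scored, reverse=True)]


# Hypothetical documents with known coordinates.
docs = [
    {"id": "lyon-pizza", "text": "pizza restaurant lyon", "lat": 45.7640, "lon": 4.8357},
    {"id": "paris-books", "text": "paris book shop", "lat": 48.8500, "lon": 2.3400},
    {"id": "paris-pizza", "text": "best pizza restaurant in paris", "lat": 48.8600, "lon": 2.3500},
]
ranking = rank_by_topic_and_location(["pizza", "restaurant"], 48.8566, 2.3522, docs)
print(ranking)
```

    The multiplicative combination is only one possible design; it ensures that a document must be both on topic and nearby to rank highly, which matches the expectation stated in the abstract.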

  1. Study of a Quantum Framework for Search Based Software Engineering

    Science.gov (United States)

    Wu, Nan; Song, Fangmin; Li, Xiangdong

    2013-06-01

    The Search Based Software Engineering (SBSE) is widely used in the software engineering to identify optimal solutions. The traditional methods and algorithms used in SBSE are criticized due to their high costs. In this paper, we propose a rapid modified-Grover quantum searching method for SBSE, and theoretically this method can be applied to any search-space structure and any type of searching problems.
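
    For orientation, the standard Grover iteration on which such methods build can be simulated classically on a small search space. The sketch below shows textbook Grover search, not the modified variant proposed in the paper; the function name and parameters are hypothetical, and the amplitudes are kept as plain real numbers.

```python
import math


def grover_success_probability(n_items, marked, iterations=None):
    """Simulate standard Grover iterations and return the probability
    of measuring the marked index.

    If iterations is None, about (pi / 4) * sqrt(n_items) rounds are used.
    """
    if iterations is None:
        iterations = int(math.floor(math.pi / 4 * math.sqrt(n_items)))
    # Start in the uniform superposition over all items.
    amplitudes = [1.0 / math.sqrt(n_items)] * n_items
    for _ in range(iterations):
        # Oracle: flip the sign of the marked item's amplitude.
        amplitudes[marked] = -amplitudes[marked]
        # Diffusion: reflect every amplitude about the mean.
        mean = sum(amplitudes) / n_items
        amplitudes = [2.0 * mean - a for a in amplitudes]
    return amplitudes[marked] ** 2


for size in (16, 64, 256):
    print(size, round(grover_success_probability(size, marked=3), 4))
```

    The number of iterations grows with the square root of the search-space size, which is the source of the speed-up over exhaustive classical search that motivates applying the approach to SBSE.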

  2. Getting to the top of Google: search engine optimization.

    Science.gov (United States)

    Maley, Catherine; Baum, Neil

    2010-01-01

    Search engine optimization is the process of making your Web site appear at or near the top of popular search engines such as Google, Yahoo, and MSN. This is not done by luck or knowing someone working for the search engines but by understanding the process of how search engines select Web sites for placement on top or on the first page. This article will review the process and provide methods and techniques to use to have your site rated at the top or very near the top.

  3. Runtime analysis of search heuristics on software engineering problems

    Institute of Scientific and Technical Information of China (English)

    Per Kristian LEHRE; Xin YAO

    2009-01-01

    Many software engineering tasks can potentially be automated using search heuristics. However, much work is needed in designing and evaluating search heuristics before this approach can be routinely applied to a software engineering problem. Experimental methodology should be complemented with theoretical analysis to achieve this goal. Recently, there have been significant theoretical advances in the runtime analysis of evolutionary algorithms (EAs) and other search heuristics in other problem domains. We suggest that these methods could be transferred and adapted to gain insight into the behaviour of search heuristics on software engineering problems while automating software engineering.

  4. Copyright over Works Reproduced and Published Online by Search Engines

    Directory of Open Access Journals (Sweden)

    Ernesto Rengifo García

    2016-12-01

    Full Text Available Search engines are an important technological tool that facilitates the dissemination of and access to information on the Internet. However, when it comes to works protected by authors' rights, in the case of continental law, or copyright, in the Anglo-Saxon tradition, it is difficult to define whether search engines infringe the rights of the owners of these works. In the face of this situation, the US and Europe have employed the exceptions to authors' rights and Fair Use to decide whether search engines infringe owners' rights. This article carries out a comparative analysis of the different judicial decisions in the US and Europe on search engines and protected works.

  5. Towards two-dimensional search engines

    CERN Document Server

    Ermann, Leonardo; Shepelyansky, Dima L

    2011-01-01

    We study the statistical properties of various directed networks using ranking of their nodes based on the dominant vectors of the Google matrix known as PageRank and CheiRank. On average PageRank orders nodes proportionally to the number of ingoing links, while CheiRank orders nodes proportionally to the number of outgoing links. In this way the ranking of nodes becomes two-dimensional, which paves the way for the development of two-dimensional search engines of a new type. Information flow properties on the PageRank-CheiRank plane are analyzed for networks of British, French and Italian universities, Wikipedia, Linux Kernel, gene regulation and other networks. Methods of spam links control are also analyzed.

  6. Toward two-dimensional search engines

    Science.gov (United States)

    Ermann, L.; Chepelianskii, A. D.; Shepelyansky, D. L.

    2012-07-01

    We study the statistical properties of various directed networks using ranking of their nodes based on the dominant vectors of the Google matrix known as PageRank and CheiRank. On average PageRank orders nodes proportionally to the number of ingoing links, while CheiRank orders nodes proportionally to the number of outgoing links. In this way, the ranking of nodes becomes two-dimensional, which paves the way for the development of two-dimensional search engines of a new type. Statistical properties of information flow on the PageRank-CheiRank plane are analyzed for networks of British, French and Italian universities, Wikipedia, Linux Kernel, gene regulation and other networks. Special emphasis is placed on British university networks, using the large database publicly available in the UK. Methods of spam links control are also analyzed.
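
    The two rankings can be illustrated with a short power-iteration sketch. It rests on assumptions not stated in the abstract: a damping factor of 0.85, uniform redistribution of weight from nodes without outgoing links, and a hypothetical four-node toy network. CheiRank is obtained by running the same procedure on the network with every link reversed.

```python
def pagerank(links, n, alpha=0.85, iterations=100):
    """Approximate the dominant vector of the Google matrix by power iteration.

    links is a list of (source, target) pairs over nodes 0..n-1.
    """
    out = [[] for _ in range(n)]
    for src, dst in links:
        out[src].append(dst)
    rank = [1.0 / n] * n
    for _ in range(iterations):
        new = [(1.0 - alpha) / n] * n  # teleportation term
        for src in range(n):
            if out[src]:
                share = alpha * rank[src] / len(out[src])
                for dst in out[src]:
                    new[dst] += share
            else:
                # Dangling node: spread its weight uniformly over all nodes.
                share = alpha * rank[src] / n
                for dst in range(n):
                    new[dst] += share
        rank = new
    return rank


def cheirank(links, n, alpha=0.85, iterations=100):
    """CheiRank: PageRank of the network with all link directions inverted."""
    reversed_links = [(dst, src) for src, dst in links]
    return pagerank(reversed_links, n, alpha, iterations)


# Hypothetical toy network: node 3 collects ingoing links, node 0 emits outgoing links.
links = [(0, 1), (0, 2), (0, 3), (1, 3), (2, 3)]
pr = pagerank(links, 4)
cr = cheirank(links, 4)
for node in range(4):
    print(node, round(pr[node], 3), round(cr[node], 3))
```

    Each node thus receives a pair of scores, one driven by ingoing links and one by outgoing links, which is the two-dimensional ranking the abstract refers to.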

  7. Mashup Based Content Search Engine for Mobile Devices

    OpenAIRE

    Kohei Arai

    2013-01-01

    A mashup-based content search engine for mobile devices is proposed. An example of the proposed search engine is implemented with Yahoo!JAPAN Web SearchAPI, Yahoo!JAPAN Image searchAPI, YouTube Data API, and Amazon Product Advertising API. The retrieved results are also merged and linked to each other. Therefore, different types of content can be referred to once an e-learning content item is retrieved. The implemented search engine is evaluated with 20 students. The results show usefulness and effectiv...

  8. A search engine for the engineering and equipment data management system (EDMS) at CERN

    Science.gov (United States)

    Tsyganov, A.; Amérigo, S. M.; Petit, S.; Pettersson, T.; Suwalska, A.

    2008-07-01

    CERN, the European Laboratory for Particle Physics, located in Geneva, Switzerland, is currently building the LHC (Large Hadron Collider), a 27 km particle accelerator. The equipment life-cycle management of this project is provided by the Engineering and Equipment Data Management System (EDMS [1] [2]) Service. Using an Oracle database, it supports the management and follow-up of different kinds of documentation through the whole life cycle of the LHC project: design, manufacturing, installation, commissioning data, etc. The equipment data collection phase is now slowing down and the project is getting closer to the 'As-Built' phase: the phase of the project consuming and exploring the large volumes of data stored since 1996. Searching through millions of items of information (documents, equipment parts, operations...) multiplied by dozens of points of view (operators, maintainers...) requires an efficient and flexible search engine. This paper describes the process followed by the team to implement the search engine for the LHC As-built project in the EDMS Service. The emphasis is put on the design decision to decouple the search engine from any user interface, potentially enabling other systems to also use it. Projections, algorithms, and the planned implementation are described in this paper. The implementation of the first version started in early 2007.

  9. Web Feet Guide to Search Engines: Finding It on the Net.

    Science.gov (United States)

    Web Feet, 2001

    2001-01-01

    This guide to search engines for the World Wide Web discusses selecting the right search engine; interpreting search results; major search engines; online tutorials and guides; search engines for kids; specialized search tools for various subjects; and other specialized engines and gateways. (LRW)

  10. Microbiome engineering: Current applications and its future.

    Science.gov (United States)

    Foo, Jee Loon; Ling, Hua; Lee, Yung Seng; Chang, Matthew Wook

    2017-03-01

    Microbiomes exist in all ecosystems and are composed of diverse microbial communities. Perturbation to microbiomes brings about undesirable phenotypes in the hosts, resulting in diseases and disorders, and disturbs the balance of the associated ecosystems. Engineering of microbiomes can be used to modify structures of the microbiota and restore ecological balance. Consequently, microbiome engineering has been employed for improving human health and agricultural productivity. The importance and current applications of microbiome engineering, particularly in humans, animals, plants and soil, are reviewed. Furthermore, we explore the challenges in engineering microbiomes and the future of this field, thus providing perspectives and outlook of microbiome engineering.

  11. Combining results of multiple search engines in proteomics.

    Science.gov (United States)

    Shteynberg, David; Nesvizhskii, Alexey I; Moritz, Robert L; Deutsch, Eric W

    2013-09-01

    A crucial component of the analysis of shotgun proteomics datasets is the search engine, an algorithm that attempts to identify the peptide sequence from the parent molecular ion that produced each fragment ion spectrum in the dataset. There are many different search engines, both commercial and open source, each employing a somewhat different technique for spectrum identification. The set of high-scoring peptide-spectrum matches for a defined set of input spectra differs markedly among the various search engine results; individual engines each provide unique correct identifications among a core set of correlative identifications. This has led to the approach of combining the results from multiple search engines to achieve improved analysis of each dataset. Here we review the techniques and available software for combining the results of multiple search engines and briefly compare the relative performance of these techniques.

  12. Combining Results of Multiple Search Engines in Proteomics*

    Science.gov (United States)

    Shteynberg, David; Nesvizhskii, Alexey I.; Moritz, Robert L.; Deutsch, Eric W.

    2013-01-01

    A crucial component of the analysis of shotgun proteomics datasets is the search engine, an algorithm that attempts to identify the peptide sequence from the parent molecular ion that produced each fragment ion spectrum in the dataset. There are many different search engines, both commercial and open source, each employing a somewhat different technique for spectrum identification. The set of high-scoring peptide-spectrum matches for a defined set of input spectra differs markedly among the various search engine results; individual engines each provide unique correct identifications among a core set of correlative identifications. This has led to the approach of combining the results from multiple search engines to achieve improved analysis of each dataset. Here we review the techniques and available software for combining the results of multiple search engines and briefly compare the relative performance of these techniques. PMID:23720762

  13. Brief Report: Consistency of Search Engine Rankings for Autism Websites

    Science.gov (United States)

    Reichow, Brian; Naples, Adam; Steinhoff, Timothy; Halpern, Jason; Volkmar, Fred R.

    2012-01-01

    The World Wide Web is one of the most common methods used by parents to find information on autism spectrum disorders and most consumers find information through search engines such as Google or Bing. However, little is known about how the search engines operate or the consistency of the results that are returned over time. This study presents the…

  14. Searching for safety: addressing search engine, website, and provider accountability for illicit online drug sales.

    Science.gov (United States)

    Liang, Bryan A; Mackey, Tim

    2009-01-01

    Online sales of pharmaceuticals are a rapidly growing phenomenon. Yet despite the dangers of purchasing drugs over the Internet, sales continue to escalate. These dangers include patient harm from fake or tainted drugs, lack of clinical oversight, and financial loss. Patients, and in particular vulnerable groups such as seniors and minorities, purchase drugs online either naïvely or because they lack the ability to access medications from other sources due to price considerations. Unfortunately, high risk online drug sources dominate the Internet, and virtually no accountability exists to ensure safety of purchased products. Importantly, search engines such as Google, Yahoo, and MSN, although purportedly requiring "verification" of Internet drug sellers using PharmacyChecker.com requirements, actually allow and profit from illicit drug sales from unverified websites. These search engines are not held accountable for facilitating clearly illegal activities. Both website drug seller anonymity and unethical physicians approving or writing prescriptions without seeing the patient contribute to rampant illegal online drug sales. Efforts in this country and around the world to stem the tide of these sales have had extremely limited effectiveness. Unfortunately, current congressional proposals are fractionated and do not address the key issues of demand by vulnerable patient populations, search engine accountability, and the ease with which financial transactions can be consummated to promote illegal online sales. 
To deal with the social scourge of illicit online drug sales, this article proposes a comprehensive statutory solution that creates a no-cost/low-cost national Drug Access Program to break the chain of demand from vulnerable patient populations and illicit online sellers, makes all Internet drug sales illegal unless the Internet pharmacy is licensed through a national Internet pharmacy licensing program, prohibits financial transactions for illegal online drug

  15. Considerations for the development of task-based search engines

    DEFF Research Database (Denmark)

    Petcu, Paula; Dragusin, Radu

    2013-01-01

    Based on previous experience from working on a task-based search engine, we present a list of suggestions and ideas for an Information Retrieval (IR) framework that could inform the development of next generation professional search systems. The specific task that we start from is the clinicians' information need in finding rare disease diagnostic hypotheses at the time and place where medical decisions are made. Our experience from the development of a search engine focused on supporting clinicians in completing this task has provided us valuable insights in what aspects should be considered by the developers of vertical search engines.

  16. Journey of Web Search Engines: Milestones, Challenges & Innovations

    Directory of Open Access Journals (Sweden)

    Mamta Kathuria

    2016-12-01

    The past few decades have witnessed an information big bang in the form of the World Wide Web, leading to a gigantic repository of heterogeneous data. A humble journey that started with a network connection between a few computers in the ARPANET project has reached a level at which almost all the computers and other communication devices of the world have joined together to form a huge global information network that makes available most of the information related to every possible heterogeneous domain. Not only is managing and indexing this repository a big concern, but providing a quick answer to the user's query is also of critical importance. Amazingly, rather miraculously, the task is being done quite efficiently by current web search engines. This miracle has been possible due to a series of mathematical and technological innovations continuously being carried out in the area of search techniques. This paper gives an overview of search engine evolution from the primitive to the present.

  17. What Major Search Engines Like Google, Yahoo and Bing Need to Know about Teachers in the UK?

    Science.gov (United States)

    Seyedarabi, Faezeh

    2014-01-01

    This article briefly outlines the current major search engines' approach to teachers' web searching. The aim of this article is to make Web searching easier for teachers when searching for relevant online teaching materials, in general, and UK teacher practitioners at primary, secondary and post-compulsory levels, in particular. Therefore, major…

  18. On development of search engine for geodata

    Directory of Open Access Journals (Sweden)

    David Procházka

    2010-01-01

    Effective management and sharing of geodata is one of the priorities of the European Union (INSPIRE activity) and of companies all around the world. Many different companies and organisations publish their geodata using web mapping services. This situation leads to multiple publishing of similar or identical geodata. On the other hand, it is frequently a problem to determine an appropriate mapserver with the required data. This paper presents a geodata search engine which addresses the problem of accessing geodata more effectively. The presented solution aggregates data from different mapservers and provides an interface according to the Open Geospatial Consortium Web Map Server specification. This allows our solution to be used in standard GIS tools as a common mapserver. A completely new feature is a request which allows selecting map layers that fulfil specified criteria. The selection can be given by keywords in a map layer description and by defining a bounding box on the Earth's surface. The response is a list of appropriate layers sorted according to their relevance. The presented solution could be, among other applications, a significant source of information for many data mining techniques. It allows processed data to be interconnected with their spatio-temporal context.
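
    The layer-selection request described in this record (keywords in the layer description plus a bounding box, with the response sorted by relevance) might be sketched as follows. This is only an illustration under stated assumptions: the `Layer` type, the `select_layers` function, and the relevance score (a count of matched keywords) are hypothetical and are not taken from the paper or from the OGC specification.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    # Hypothetical record for one map layer advertised by a mapserver.
    name: str
    description: str
    bbox: tuple  # (min_lon, min_lat, max_lon, max_lat)


def intersects(a, b):
    """Return True when two (min_lon, min_lat, max_lon, max_lat) boxes overlap."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def select_layers(layers, keywords, bbox):
    """Keep layers that overlap bbox and mention at least one keyword.

    Relevance is assumed here to be the number of distinct keywords found
    in the description; the paper's own ranking is not specified.
    """
    wanted = [k.lower() for k in keywords]
    scored = []
    for layer in layers:
        if not intersects(layer.bbox, bbox):
            continue
        text = layer.description.lower()
        score = sum(1 for k in wanted if k in text)
        if score > 0:
            scored.append((score, layer))
    # Stable sort: the highest score comes first, ties keep input order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [layer for _, layer in scored]
```

    Substring matching is used only to keep the sketch short; a real service would more likely tokenize the descriptions and weight the terms.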

  19. Perron vector optimization applied to search engines

    CERN Document Server

    Fercoq, Olivier

    2011-01-01

    In the last years, Google's PageRank optimization problems have been extensively studied. In that case, the ranking is given by the invariant measure of a stochastic matrix. In this paper, we consider the more general situation in which the ranking is determined by the Perron eigenvector of a nonnegative, but not necessarily stochastic, matrix, in order to cover Kleinberg's HITS algorithm. We also give some results for Tomlin's HOTS algorithm. The problem consists then in finding an optimal outlink strategy subject to design constraints and for a given search engine. We study the relaxed versions of these problems, which means that we should accept weighted hyperlinks. We provide an efficient algorithm for the computation of the matrix of partial derivatives of the criterion, that uses the low rank property of this matrix. We give a scalable algorithm that couples gradient and power iterations and gives a local minimum of the Perron vector optimization problem. We prove convergence by considering it as an app...
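
    As a rough illustration of the quantity this record discusses, the Perron eigenvector of a nonnegative matrix can be approximated by power iteration. The sketch below is not the authors' optimization algorithm. The function name `perron_vector` is hypothetical, the right eigenvector is computed, and the matrix is assumed to be primitive (irreducible and aperiodic) so that the iteration converges.

```python
def perron_vector(matrix, iterations=200, tol=1e-12):
    """Approximate the Perron (dominant) right eigenvector by power iteration.

    `matrix` is a square list of lists with nonnegative entries. The result
    is normalized so that its entries sum to 1.
    """
    n = len(matrix)
    v = [1.0 / n] * n  # start from the uniform vector
    for _ in range(iterations):
        # One multiplication step: w = matrix * v
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        total = sum(w)
        if total == 0:
            raise ValueError("matrix maps the current iterate to zero")
        w = [x / total for x in w]
        # Stop once successive iterates agree to within tol.
        if max(abs(a - b) for a, b in zip(w, v)) < tol:
            return w
        v = w
    return v
```

    For a column-stochastic matrix the same routine yields a stationary distribution of the PageRank kind; for a row-stochastic matrix the transpose would have to be passed in.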

  20. Folksonomies, the Web and Search Engines

    Directory of Open Access Journals (Sweden)

    Louise Spiteri

    2008-09-01

    The aim of this special issue of Webology is to explore developments in the design of folksonomies, knowledge organization systems, and search engines to reflect end user preferences for describing items of interest. Particular emphasis is placed on folksonomies, an area of study that has grown exponentially since the term was first coined by Thomas Vander Wal in 2004: "Folksonomy is the result of personal free tagging of information and objects (anything with a URL) for one's own retrieval. The tagging is done in a social environment (usually shared and open to others). Folksonomy is created from the act of tagging by the person consuming the information" (Vander Wal, 2007). Since 2004, social software applications and their use of tagging have continued to increase in popularity; in its site dedicated to such applications, Wikipedia (2008) lists no fewer than 11 extant media sharing sites and 26 social bookmarking sites. This list does not take into account the approximately 20 media cataloguing sites, not to mention the innumerable blogging sites that employ tagging.

  1. A search engine for the engineering and equipment data management system (EDMS) at CERN

    CERN Document Server

    Tsyganov, A; Petit, S; Pettersson, Thomas Sven; Suwalska, A

    2008-01-01

    CERN, the European Laboratory for Particle Physics, located in Geneva, Switzerland, is currently building the LHC (Large Hadron Collider), a 27 km particle accelerator. The equipment life-cycle management of this project is provided by the Engineering and Equipment Data Management System (EDMS [1] [2]) Service. Using an Oracle database, it supports the management and follow-up of different kinds of documentation through the whole life cycle of the LHC project: design, manufacturing, installation, commissioning data, etc. The equipment data collection phase is now slowing down and the project is getting closer to the 'As-Built' phase: the phase of the project consuming and exploring the large volumes of data stored since 1996. Searching through millions of items of information (documents, equipment parts, operations...) multiplied by dozens of points of view (operators, maintainers...) requires an efficient and flexible search engine. This paper describes the process followed by the team to implement the sear...

  2. Current Searching Methodology and Retrieval Issues: An Assessment

    Science.gov (United States)

    2008-03-01

    the most valuable part of the Web…” often referred to as the Deep Web or an Invisible Web. What is advocated is “fully customized vertical search...that will improve search results.” See discussion below. Scheffer’s display of deep web search sites by content type shows that some 54% are... [Chart residue: share across search engines (%): Google 63.1; Yahoo 21.4; MSN 10; Ask 3.5; AOL 0.5; Other 1.5] ...22% deep web search

  3. Shifts in Search Engine Development: A Review of Past, Present and Future Trends in Research on Search Engines

    OpenAIRE

    Hamid R. Jamali; Saeid Asadi

    2004-01-01

    The World Wide Web has developed fast and many people use search engines to capture information from the Web. This article reviews past, present and future of search engines. Papers published in four major Web and information management conferences were surveyed to track research interests in the last five years. Web search and information retrieval topics such as ranking, filtering and query formulation are still hot topics among researchers. The most important shifts and issues of the futur...

  4. Persuading consumers to form precise search engine queries.

    Science.gov (United States)

    Leroy, Gondy

    2009-11-14

    Today's search engines provide a single textbox for searching. This input method has not changed in decades and, as a result, consumer search behaviour has not changed either: few and imprecise keywords are used. Especially with health information, where incorrect information may lead to unwise decisions, it would be beneficial if consumers could search more precisely. We evaluated a new user interface that supports more precise searching by using query diagrams. In a controlled user study, using paper-based prototypes, we compared searching with a Google interface with drawing new or modifying template diagrams. We evaluated consumer willingness and ability to use diagrams and the impact on query formulation. Users had no trouble understanding the new search method. Moreover, they used more keywords and relationships between keywords with search diagrams. In comparison to drawing their own diagrams, modifying existing templates led to more searches being conducted and higher creativity in searching.

  5. Search Engines for Tomorrow's Scholars, Part Two

    Science.gov (United States)

    Fagan, Jody Condit

    2012-01-01

    This two-part article considers how well some of today's search tools support scholars' work. The first part of the article reviewed Google Scholar and Microsoft Academic Search using a modified version of Carole L. Palmer, Lauren C. Teffeau, and Carrie M. Pirmann's framework (2009). Microsoft Academic Search is a strong contender when…

  6. Pavideoge: A New Video Processing Method in Video Search Engine

    CERN Document Server

    Yang, Pu; Chen, Guang

    2009-01-01

    In this paper, we study the problems of video processing in video search engines. Video has now become a very important kind of data on the Internet, while searching for video is still a challenging task due to the inherent properties of video: requiring enormous storage space, being independent, and expressing information implicitly. To handle the properties of video more effectively, we propose a new video processing method for video search engines. In detail, the core of the new method is creating the pavideoge, a new data type that combines the advantages of videos and webpages. The pavideoge has four attributes: real link, videorank, text information and playnum. Each of them combines video's properties with webpage's. A video search engine based on the pavideoge can retrieve video more effectively. The experimental results show the encouraging performance of our approach. Based on the pavideoge, our video search engine can retrieve more precise videos in comparison with previous related ...

  7. Topical interests and the mitigation of search engine bias.

    Science.gov (United States)

    Fortunato, S; Flammini, A; Menczer, F; Vespignani, A

    2006-08-22

    Search engines have become key media for our scientific, economic, and social activities by enabling people to access information on the web despite its size and complexity. On the down side, search engines bias the traffic of users according to their page ranking strategies, and it has been argued that they create a vicious cycle that amplifies the dominance of established and already popular sites. This bias could lead to a dangerous monopoly of information. We show that, contrary to intuition, empirical data do not support this conclusion; popular sites receive far less traffic than predicted. We discuss a model that accurately predicts traffic data patterns by taking into consideration the topical interests of users and their searching behavior in addition to the way search engines rank pages. The heterogeneity of user interests explains the observed mitigation of search engines' popularity bias.

  8. Web Service Architecture for a Meta Search Engine

    Directory of Open Access Journals (Sweden)

    K.Srinivas

    2011-10-01

    With the rapid advancements in Information Technology, Information Retrieval on the Internet is gaining importance day by day. Nowadays there are millions of Websites and billions of homepages available on the Internet. Search Engines are the essential tools for retrieving the required information from the Web. But the existing search engines have many problems, such as limited scope and imbalance in accessing sites. So, the effectiveness of a search engine plays a vital role. Meta search engines, such as Dog Pile and Meta Crawler, are systems that can provide effective information by accessing multiple existing search engines, but most of them cannot successfully operate in a heterogeneous and fully dynamic web environment. In this paper we propose a Web Service Architecture for a Meta Search Engine to cater to the needs of a heterogeneous and dynamic web environment. The objective of our proposal is to exploit most of the features offered by Web Services through the implementation of a Web Service Meta Search Engine.

  9. Adding to the Students' Toolbox: Using Directories, Search Engines, and the Hidden Web in Search Processes.

    Science.gov (United States)

    Mardis, Marcia A.

    2002-01-01

    Discussion of searching for information on the Web focuses on resources that are not always found by traditional Web searches. Describes sources on the hidden Web, including full-text databases, clearinghouses, digital libraries, and learning objects; explains how search engines operate; and suggests that traditional print sources are still…

  10. Social media networking: YouTube and search engine optimization.

    Science.gov (United States)

    Jackson, Rem; Schneider, Andrew; Baum, Neil

    2011-01-01

    This is the third part of a three-part article on social media networking. This installment will focus on YouTube and search engine optimization. This article will explore the application of YouTube to the medical practice and how YouTube can help a practice retain its existing patients and attract new patients to the practice. The article will also describe the importance of search engine optimization and how to make your content appear on the first page of the search engines such as Google, Yahoo, and YouTube.

  11. Solar System Object Image Search: A precovery search engine

    Science.gov (United States)

    Gwyn, Stephen D. J.; Hill, Norman; Kavelaars, Jj

    2016-01-01

    While regular astronomical image archive searches can find images at a fixed location, they cannot find images of moving targets such as asteroids or comets. The Solar System Object Image Search (SSOIS) at the Canadian Astronomy Data Centre allows users to search for images of moving objects, allowing precoveries. SSOIS accepts as input either an object designation, a list of observations, a set of orbital elements, or a user-generated ephemeris for an object. It then searches for observations of that object over a range of dates. The user is then presented with a list of images containing that object from a variety of archives. Initially created to search the CFHT MegaCam archive, SSOIS has been extended to other telescopes including Gemini, Subaru/SuprimeCam, WISE, HST, the SDSS, AAT, the ING telescopes, the ESO telescopes, and the NOAO telescopes (KPNO/CTIO/WIYN), for a total of 24.5 million images. As the Pan-STARRS and Hyper Suprime-Cam archives become available, they will be incorporated as well. The SSOIS tool is located on the web at http://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/en/ssois/.

  12. PlateRunner: A Search Engine to Identify EMR Boilerplates.

    Science.gov (United States)

    Divita, Guy; Workman, T Elizabeth; Carter, Marjorie E; Redd, Andrew; Samore, Matthew H; Gundlapalli, Adi V

    2016-01-01

    Medical text contains boilerplated content, an artifact of pull-down forms from EMRs. Boilerplated content is the source of challenges for concept extraction on clinical text. This paper introduces PlateRunner, a search engine on boilerplates from the US Department of Veterans Affairs (VA) EMR. Boilerplates containing concepts should be identified and reviewed to recognize challenging formats, identify high yield document titles, and fine tune section zoning. This search engine has the capability to filter negated and asserted concepts, save and search query results. This tool can save queries, search results, and documents found for later analysis.

  13. ArraySearch: A Web-Based Genomic Search Engine.

    Science.gov (United States)

    Wilson, Tyler J; Ge, Steven X

    2012-01-01

    Recent advances in microarray technologies have resulted in a flood of genomics data. This large body of accumulated data could be used as a knowledge base to help researchers interpret new experimental data. ArraySearch finds statistical correlations between newly observed gene expression profiles and the huge source of well-characterized expression signatures deposited in the public domain. A search query of a list of genes will return experiments on which the genes are significantly up- or downregulated collectively. Searches can also be conducted using gene expression signatures from new experiments. This resource will empower biological researchers with a statistical method to explore expression data from their own research by comparing it with expression signatures from a large public archive.

  14. Assessment and Comparison of Search capabilities of Web-based Meta-Search Engines: A Checklist Approach

    Directory of Open Access Journals (Sweden)

    Alireza Isfandiyari Moghadam

    2010-03-01

    The present investigation concerns the evaluation, comparison and analysis of search options in web-based meta-search engines. 64 meta-search engines were identified. 19 meta-search engines that were free, accessible and compatible with the objectives of the present study were selected. An author-constructed checklist was used for data collection. Findings indicated that all meta-search engines studied offered the AND operator, phrase search, a setting for the number of results displayed, previous search query storage and help tutorials. Nevertheless, none of them offered any search options for hypertext searching or for displaying the size of the pages searched. 94.7% support features such as truncation, keywords in title and URL search, and text summary display. The checklist used in the study could serve as a model for investigating search options in search engines, digital libraries and other internet search tools.

  15. Searching for preeclampsia genes : the current position

    NARCIS (Netherlands)

    Lachmeijer, AMA; Dekker, GA; Pals, G; Aarnoudse, JG; ten Kate, LP; Arngrimsson, R

    2002-01-01

    Although there is substantial evidence that preeclampsia has a genetic background, the complexity of the processes involved and the fact that preeclampsia is a maternal-fetal phenomenon does not make the search for the molecular basis of preeclampsia genes easy. It is possible that the single

  16. Searching for a New Way to Reach Patrons: A Search Engine Optimization Pilot Project at Binghamton University Libraries

    Science.gov (United States)

    Rushton, Erin E.; Kelehan, Martha Daisy; Strong, Marcy A.

    2008-01-01

    Search engine use is one of the most popular online activities. According to a recent OCLC report, nearly all students start their electronic research using a search engine instead of the library Web site. Instead of viewing search engines as competition, however, librarians at Binghamton University Libraries decided to employ search engine…

  18. Review of Metadata Elements within the Web Pages Resulting from Searching in General Search Engines

    Directory of Open Access Journals (Sweden)

    Sima Shafi’ie Alavijeh

    2009-12-01

    The present investigation aimed to study the extent to which Dublin Core metadata elements and HTML meta tags are present in web pages. Ninety web pages were chosen by searching general search engines (Google, Yahoo and MSN). The extent of metadata elements (Dublin Core and HTML meta tags) present in these pages, as well as the existence of a significant correlation between the presence of meta elements and the type of search engine, were investigated. Findings indicated a very low presence of both Dublin Core metadata elements and HTML meta tags in the pages retrieved, which in turn illustrates the very low usage of metadata elements in web pages. Furthermore, findings indicated that there is no significant correlation between the type of search engine used and the presence of metadata elements. From the standpoint of including metadata in retrieval of web sources, search engines do not significantly differ from one another.
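
    Checking a page for the two kinds of metadata this record mentions could be done along the following lines with Python's standard `html.parser` module. This is a sketch, not the study's instrument: the class and function names are hypothetical, and treating any `meta` name that starts with `DC.` or `DCTERMS.` as Dublin Core is an assumption based on the common naming convention.

```python
from html.parser import HTMLParser


class MetaTagCollector(HTMLParser):
    """Collect <meta name=... content=...> pairs, split into two groups."""

    def __init__(self):
        super().__init__()
        self.dublin_core = {}  # names beginning with "DC." or "DCTERMS."
        self.html_meta = {}    # every other named meta tag

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name")
        content = attributes.get("content")
        if not name or content is None:
            return  # e.g. <meta charset="utf-8"> carries no name/content pair
        if name.lower().startswith(("dc.", "dcterms.")):
            self.dublin_core[name] = content
        else:
            self.html_meta[name] = content


def collect_meta(html):
    """Return (dublin_core, html_meta) dictionaries for one HTML document."""
    parser = MetaTagCollector()
    parser.feed(html)
    return parser.dublin_core, parser.html_meta
```

    Running `collect_meta` over each retrieved page and counting the non-empty dictionaries would give per-engine presence rates comparable in spirit to those the abstract reports.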

  19. Grokker, KartOO, Addict-o-Matic and More: Really Different Search Engines

    Science.gov (United States)

    Descy, Don E.

    2009-01-01

    There are hundreds of unique search engines in the United States and thousands of unique search engines around the world. If people get into search engines designed just to search particular web sites, the number is in the hundreds of thousands. This article looks at: (1) clustering search engines, such as KartOO (www.kartoo.com) and Grokker…

  20. Using Internet Search Engines to Obtain Medical Information: A Comparative Study

    Science.gov (United States)

    Wang, Liupu; Wang, Juexin; Wang, Michael; Li, Yong; Liang, Yanchun

    2012-01-01

    results highly overlapped between the search engines, and the overlap between any two search engines was about half or more. On the other hand, each search engine emphasized various types of content differently. In terms of user satisfaction analysis, volunteer users scored Bing the highest for its usefulness, followed by Yahoo!, Google, and Ask.com. Conclusions Google, Yahoo!, Bing, and Ask.com are by and large effective search engines for helping lay users get health and medical information. Nevertheless, the current ranking methods have some pitfalls and there is room for improvement to help users get more accurate and useful information. We suggest that search engine users explore multiple search engines to search different types of health information and medical knowledge for their own needs and get a professional consultation if necessary. PMID:22672889

  1. HOW DO RADIOLOGISTS USE THE HUMAN SEARCH ENGINE?

    Science.gov (United States)

    Wolfe, Jeremy M; Evans, Karla K; Drew, Trafton; Aizenman, Avigael; Josephs, Emilie

    2016-06-01

    Radiologists perform many 'visual search tasks' in which they look for one or more instances of one or more types of target item in a medical image (e.g. cancer screening). To understand and improve how radiologists do such tasks, it must be understood how the human 'search engine' works. This article briefly reviews some of the relevant work into this aspect of medical image perception. Questions include how attention and the eyes are guided in radiologic search? How is global (image-wide) information used in search? How might properties of human vision and human cognition lead to errors in radiologic search?

  2. Exploring Search Engine Optimization (SEO) Techniques for Dynamic Websites

    OpenAIRE

    Kanwal, Wasfa

    2011-01-01

    ABSTRACT Context: With growing number of online businesses, Search Engine Optimization (SEO) has become vital to capitalize a business because SEO is key factor for marketing an online business. SEO is the process to optimize a website so that it ranks well on Search Engine Result Pages (SERPs). Dynamic websites are commonly used for e-commerce because they are easier to update and expand; however they are subjected to indexing related problems. Objectives: This research aims to examine and a...

  3. Using Internet search engines to estimate word frequency.

    Science.gov (United States)

    Blair, Irene V; Urland, Geoffrey R; Ma, Jennifer E

    2002-05-01

    The present research investigated Internet search engines as a rapid, cost-effective alternative for estimating word frequencies. Frequency estimates for 382 words were obtained and compared across four methods: (1) Internet search engines, (2) the Kucera and Francis (1967) analysis of a traditional linguistic corpus, (3) the CELEX English linguistic database (Baayen, Piepenbrock, & Gulikers, 1995), and (4) participant ratings of familiarity. The results showed that Internet search engines produced frequency estimates that were highly consistent with those reported by Kucera and Francis and those calculated from CELEX, highly consistent across search engines, and very reliable over a 6-month period of time. Additional results suggested that Internet search engines are an excellent option when traditional word frequency analyses do not contain the necessary data (e.g., estimates for forenames and slang). In contrast, participants' familiarity judgments did not correspond well with the more objective estimates of word frequency. Researchers are advised to use search engines with large databases (e.g., AltaVista) to ensure the greatest representativeness of the frequency estimates.

  4. A longitudinal analysis of search engine index size

    NARCIS (Netherlands)

    Bosch, A.P.J. van den; Bogers, T.; Kunder, M. de

    2015-01-01

    One of the determining factors of the quality of Web search engines is the size and quality of their index. In addition to its influence on search result quality, the size of the indexed Web can also tell us something about which parts of the WWW are directly accessible to the everyday user. We prop

  5. FindZebra: A search engine for rare diseases

    DEFF Research Database (Denmark)

    Dragusin, Radu; Petcu, Paula; Lioma, Christina Amalia;

    2013-01-01

    Background: The web has become a primary information resource about illnesses and treatments for both medical and non-medical users. Standard web search is by far the most common interface for such information. It is therefore of interest to find out how well web search engines work for diagnosti...

  6. Search engines, the new bottleneck for content access

    NARCIS (Netherlands)

    van Eijk, N.; Preissl, B.; Haucap, J.; Curwen, P.

    2009-01-01

    The core function of a search engine is to make content and sources of information easily accessible (although the search results themselves may actually include parts of the underlying information). In an environment with unlimited amounts of information available on open platforms such as the inte

  7. Search engines and the production of academic knowledge

    NARCIS (Netherlands)

    van Dijck, J.

    2010-01-01

    This article argues that search engines in general, and Google Scholar in particular, have become significant co-producers of academic knowledge. Knowledge is not simply conveyed to users, but is co-produced by search engines’ ranking systems and profiling systems, none of which are open to the rule

  11. SEARCH FOR QUALITY IN BIOSYSTEM ENGINEERING

    Directory of Open Access Journals (Sweden)

    Bülent Eker

    2012-09-01

    Today, engineering has become a disciplined field. The demand for food products caused agricultural engineers to consider the matter in a different way. This consideration led the engineer to resolve biological issues together with the electronic and information disciplines, and also with advanced control, advanced technological materials and developed sensor systems. The subject has persuaded them to design solutions for problems related to living things and their environment. Bio-system engineering, which has been developed for this purpose, has become the application of technical knowledge aiming to fulfill human requirements. The pursuits of the bio-system engineering discipline are automation, newly developed technologies, information technologies and human interaction, sensitive agriculture techniques, power and work machines, post-harvest product technologies, structures and their relation with the environment, animal production technology, soil and water sources, and rural development and planning. Bio-system engineering, which covers such a wide area, should reach a solution by using its system engineering feature first and then determine the process parameters of the subjects that it resolves. It therefore has to attribute the cause-and-effect relation in every stage to quality parameters. In this announcement, the quality issues necessary for explaining the subjects dealt with in bio-system engineering are examined one by one, and solution models are created depending on these issues.

  12. On the Weakenesses of Correlation Measures used for Search Engines' Results (Unsupervised Comparison of Search Engine Rankings)

    CERN Document Server

    D'Alberto, Paolo

    2011-01-01

    The correlation of the result lists provided by search engines is fundamental and has deep and multidisciplinary ramifications. Here, we present automatic and unsupervised methods to assess whether or not search engines provide results that are comparable or correlated. We have two main contributions. First, we provide evidence that for more than 80% of the input queries, independently of their frequency, the two major search engines share only three or fewer URLs in their search results, leading to an increasing divergence. In this scenario (divergence), we show that even the most robust measures based on comparing lists are useless to apply; that is, the small contribution of too few common items will infer no confidence. Second, to overcome this problem, we propose the first content-based measures, i.e., direct comparison of the contents from search results; these measures are based on the Jaccard ratio and distribution similarity measures (CDF measures). We show that they are orthogonal to each other ...
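
    As a rough illustration of the Jaccard ratio mentioned in the abstract above, the sketch below compares two result lists first by shared URLs and then by the words in their snippets. The URLs, snippets, and the whitespace tokenizer are hypothetical stand-ins, and this is not the authors' implementation; the CDF-based measures are not shown.

```python
def jaccard(a, b):
    """Jaccard ratio |A & B| / |A | B| of two collections, treated as sets."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 1.0  # convention chosen here: two empty sets count as identical
    return len(set_a & set_b) / len(set_a | set_b)


def tokens(text):
    """Very rough tokenizer (an assumption): lowercase, split on whitespace."""
    return text.lower().split()


# Hypothetical top results from two engines as (URL, snippet) pairs.
engine_a = [("http://a.example/1", "jaguar big cat habitat"),
            ("http://b.example/2", "jaguar car dealer prices")]
engine_b = [("http://c.example/9", "big cat jaguar habitat range"),
            ("http://b.example/2", "jaguar car dealer prices")]

# List-based comparison: how many URLs do the engines share?
url_overlap = jaccard([u for u, _ in engine_a], [u for u, _ in engine_b])

# Content-based comparison: how similar is the text that was returned?
content_overlap = jaccard(
    [t for _, s in engine_a for t in tokens(s)],
    [t for _, s in engine_b for t in tokens(s)],
)
```

    In this toy input the URL overlap is low while the content overlap is high, which is the kind of situation where a content-based measure carries information that a list-based one cannot.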

  13. Can electronic search engines optimize screening of search results in systematic reviews: an empirical study

    Directory of Open Access Journals (Sweden)

    Clifford Tammy J

    2006-02-01

    Full Text Available Background: Most electronic search efforts directed at identifying primary studies for inclusion in systematic reviews rely on the optimal Boolean search features of search interfaces such as DIALOG® and Ovid™. Our objective is to test the ability of an Ultraseek® search engine to rank MEDLINE® records of the included studies of Cochrane reviews within the top half of all the records retrieved by the Boolean MEDLINE search used by the reviewers. Methods: Collections were created using the MEDLINE bibliographic records of included and excluded studies listed in the review and all records retrieved by the MEDLINE search. Records were converted to individual HTML files. Collections of records were indexed and searched through a statistical search engine, Ultraseek, using review-specific search terms. Our data sources, systematic reviews published in the Cochrane library, were included if they reported using at least one phase of the Cochrane Highly Sensitive Search Strategy (HSSS), provided citations for both included and excluded studies and conducted a meta-analysis using a binary outcome measure. Reviews were selected if they yielded between 1000–6000 records when the MEDLINE search strategy was replicated. Results: Nine Cochrane reviews were included. Included studies within the Cochrane reviews were found within the first 500 retrieved studies more often than would be expected by chance. Across all reviews, recall of included studies into the top 500 was 0.70. There was no statistically significant difference in ranking when comparing included studies with just the subset of excluded studies listed as excluded in the published review. Conclusion: The relevance ranking provided by the search engine was better than expected by chance and shows promise for the preliminary evaluation of large results from Boolean searches. A statistical search engine does not appear to be able to make fine discriminations concerning the relevance of
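
    The recall figure reported above (share of included studies ranked within the first 500 records) can be sketched as a recall-at-k calculation. The record identifiers and the cut-off below are made up for illustration; the study's own data and tooling are not reproduced.

```python
def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of relevant records that appear in the top k of a ranking."""
    relevant = set(relevant_ids)
    if not relevant:
        raise ValueError("need at least one relevant record")
    top_k = set(ranked_ids[:k])
    return len(top_k & relevant) / len(relevant)


# Hypothetical ranking of ten retrieved records, best first.
ranking = ["r7", "r2", "r9", "r4", "r1", "r5", "r3", "r8", "r6", "r0"]
# Hypothetical records that the review eventually included.
included = ["r2", "r4", "r6"]

recall_top5 = recall_at_k(ranking, included, k=5)
```

    With the real data, `k` would be 500 and the ranking would come from the relevance-ranked engine output for each review.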

  14. A Semantic Query Transformation Approach Based on Ontology for Search Engine

    OpenAIRE

    SAJENDRA KUMAR; RAM KUMAR RANA; PAWAN SINGH

    2012-01-01

    These days we use popular web search engines such as Google, Yahoo!, and Live Search for information retrieval in all areas, to obtain initial helpful information. The information retrieved via a search engine may not be relevant to the search target in the search engine user's mind. When the user does not find relevant information, he has to shortlist the results. These search engines use a traditional search service based on "static keywords", which require the users to ty...

  15. An advanced search engine for patent analytics in medicinal chemistry.

    Science.gov (United States)

    Pasche, Emilie; Gobeill, Julien; Teodoro, Douglas; Gaudinat, Arnaud; Vishnykova, Dina; Lovis, Christian; Ruch, Patrick

    2012-01-01

    Patent collections contain a substantial amount of medical-related knowledge, but existing tools were reported to lack useful functionalities. We present here the development of TWINC, an advanced search engine dedicated to patent retrieval in the domain of health and life sciences. Our tool embeds two search modes: an ad hoc search to retrieve relevant patents given a short query, and a related patent search to retrieve similar patents given a patent. Both search modes rely on tuning experiments performed during several patent retrieval competitions. Moreover, TWINC is enhanced with interactive modules, such as chemical query expansion, which is of primary importance for coping with the various ways of naming biomedical entities. While the related patent search showed promising performance, the ad hoc search produced fairly contrasted results. Nonetheless, TWINC performed well during the Chemathlon task of the PatOlympics competition and experts appreciated its usability.

  16. Search Result Merging and Ranking Strategies in Meta-Search Engines: A Survey

    Directory of Open Access Journals (Sweden)

    Hossein Jadidoleslamy

    2012-07-01

    Full Text Available MetaSearch uses multiple other search systems to perform a simultaneous search. A MetaSearch Engine (MSE) is a search system that enables MetaSearch. To perform a MetaSearch, the user query is sent to multiple search engines; once the search results are returned, they are received by the MSE, merged into a single ranked list, and presented to the user. When a query is submitted to an MSE, decisions are made about which underlying search engines to use, what modifications to make to the query, and how to score the results. These decisions are typically made by considering only the user's keyword query, neglecting the larger information need. The cornerstone of MSE technology is the rank aggregation method; in other words, result merging is a key component of an MSE, and the effectiveness of an MSE is closely related to the result merging algorithm it employs. In this paper, we investigate a variety of result merging methods based on a wide range of available information about the retrieved results, from their local ranks, their titles and snippets, to the full documents of these results.
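
    To make the result merging step concrete, here is a minimal sketch that merges ranked lists using only their local ranks. It uses a Borda count, which is one well-known rank aggregation scheme chosen here for illustration; the abstract does not say which methods the survey favours, and the engine lists are hypothetical.

```python
from collections import defaultdict


def borda_merge(result_lists):
    """Merge ranked lists with a Borda count over local ranks."""
    scores = defaultdict(float)
    for results in result_lists:
        n = len(results)
        for rank, doc in enumerate(results):
            scores[doc] += n - rank  # top item in a list of n gets n points
    # Highest score first; ties broken alphabetically to keep output stable.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical ranked results returned by three underlying engines.
engine_1 = ["a", "b", "c"]
engine_2 = ["b", "a", "d"]
engine_3 = ["b", "c", "a"]

merged = borda_merge([engine_1, engine_2, engine_3])
```

    Methods that also use titles, snippets, or full documents would replace the rank-based score with a content-based one, but the merge-then-sort shape stays the same.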

  17. "openness of search engine": A critical flaw in search systems; a case study on google, yahoo and bing

    CERN Document Server

    Chakravarthy, Katuru SM Kalyana

    2012-01-01

    There is no doubt that search engines play a great role in Internet usage. But all the top search engines, Google, Yahoo and Bing, have a critical flaw called "Openness of a Search Engine". An Internet user should be allowed to get search results only when they are requested through the search engine's own web page; the user must not be allowed to get search results when they are requested through any web page that does not belong to the search engine. Only the results of a search engine should be available to the Internet user, not the search engine itself. This paper explains the critical flaw called "Openness of Search Engine" with a case study on the top 3 search engines, 'Google', 'Yahoo' and 'Bing'. This paper conducts an attack-based test using the J2EE framework and shows that 'Google' passed the test and strongly protects its critical search system, whereas 'Yahoo' and 'Bing' failed to protect their search engines. But previously 'Google' also had other high severity issues with the Openness of search engine; this ...

  18. Performance Evaluation of search engines via user effort measures

    Directory of Open Access Journals (Sweden)

    Rajesh Kumar Goutam

    2012-07-01

    Full Text Available Many metrics exist for search engine evaluation; they rely either on expert judgments or on searchers' decisions about the relevance of web documents. However, search logs can provide information about how real users search. This paper explains our attempts to incorporate users' searching behavior in the formulation of a user-effort-centric evaluation metric. We also incorporate a two-dimensional user traversal approach in the ERR metric. After formulating the evaluation metric, the authors judge its goodness and find that the presented metric fulfills all the requirements needed for a metric to be mathematically accurate. The findings obtained from the experiments present a complete description of the search engine evaluation procedure.
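
    The abstract above does not expand "ERR". Assuming it refers to Expected Reciprocal Rank, the standard cascade-style metric, a baseline version can be sketched as follows. The graded relevance labels are hypothetical, and the authors' two-dimensional traversal extension is not reproduced here.

```python
def expected_reciprocal_rank(grades, max_grade):
    """Expected Reciprocal Rank for one ranked list of relevance grades.

    Each grade g is mapped to a stop probability (2**g - 1) / 2**max_grade.
    The user is modelled as scanning from the top and stopping at the first
    result that satisfies them.
    """
    err = 0.0
    p_continue = 1.0  # probability the user reaches the current rank
    for rank, g in enumerate(grades, start=1):
        p_stop = (2 ** g - 1) / (2 ** max_grade)
        err += p_continue * p_stop / rank
        p_continue *= 1 - p_stop
    return err


# Hypothetical grades on a 0..3 scale, listed in ranked order.
best_first = expected_reciprocal_rank([3, 0, 0], max_grade=3)
best_second = expected_reciprocal_rank([0, 3, 0], max_grade=3)
```

    Moving the highly relevant result from rank 1 to rank 2 halves the score in this example, which reflects the extra effort the user spends before being satisfied.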

  19. The Theory of Planned Behaviour Applied to Search Engines as a Learning Tool

    Science.gov (United States)

    Liaw, Shu-Sheng

    2004-01-01

    Search engines have been developed to help learners seek online information. Based on the theory of planned behaviour, this research intends to investigate the behaviour of using search engines as a learning tool. After factor analysis, the results suggest that perceived satisfaction of search engine, search engines as an information…

  1. Objectivity, Reliability, and Validity of Search Engine Count Estimates

    Directory of Open Access Journals (Sweden)

    Dietmar Janetzko

    2008-01-01

    Full Text Available Count estimates ("hits") provided by Web search engines have received much attention as a yardstick to measure a variety of phenomena of interest as diverse as, e.g., language statistics, popularity of authors, or similarity between words. Common to these activities is the intention to use Web search engines not only for search but for ad hoc measurement. Using search engine count estimates (SECEs) in this way means that a phenomenon of interest, e.g., the popularity of an author, is conceived of as a measurand, and SECEs are taken to be its quantitative measures. However, the data quality of SECEs has not yet been studied systematically, and concerns have been raised against the use of this kind of data. This article examines the data quality of SECEs focusing on classical goodness criteria, i.e., objectivity, reliability, and validity. The results of a series of studies indicate that, with the exception of Boolean queries that use disjunction or negation, objectivity as well as test-retest reliability and parallel-test reliability of SECEs is good for most types of browsers and search engines examined. Estimation of validity required model development (all-subsets regression), revealing satisfying results by using an explorative approach to feature selection. The findings are discussed in the light of previous objections, and perspectives for using Web search count estimates are delineated.

  2. DRUMS: a human disease related unique gene mutation search engine.

    Science.gov (United States)

    Li, Zuofeng; Liu, Xingnan; Wen, Jingran; Xu, Ye; Zhao, Xin; Li, Xuan; Liu, Lei; Zhang, Xiaoyan

    2011-10-01

    With the completion of the human genome project and the development of new methods for gene variant detection, the integration of mutation data and its phenotypic consequences has become more important than ever. Among all available resources, locus-specific databases (LSDBs) curate one or more specific genes' mutation data along with high-quality phenotypes. Although some genotype-phenotype data from LSDBs have been integrated into central databases, little effort has been made to integrate all these data by a search engine approach. In this work, we have developed the disease related unique gene mutation search engine (DRUMS), a search engine for human disease related unique gene mutations, as a convenient tool for biologists or physicians to retrieve gene variant and related phenotype information. Gene variant and phenotype information were stored in a gene-centred relational database. Moreover, the relationships between mutations and diseases were indexed by the uniform resource identifier from an LSDB or another central database. By querying DRUMS, users can access the most popular mutation databases under one interface. DRUMS could be treated as a domain specific search engine. By using web crawling, indexing, and searching technologies, it provides a competitively efficient interface for searching and retrieving mutation data and their relationships to diseases. The present system is freely accessible at http://www.scbit.org/glif/new/drums/index.html.

  3. TAGS EXTARCTION FROM SPATIAL DOCUMENTS IN SEARCH ENGINES

    Directory of Open Access Journals (Sweden)

    S. Borhaninejad

    2015-12-01

    Full Text Available Nowadays selective access to information on the Web is provided by search engines, but in cases where the data include spatial information the search task becomes more complex and search engines require special capabilities. The purpose of this study is to extract the information which lies in spatial documents. To that end, we implement and evaluate information extraction from GML documents and a retrieval method in an integrated approach. Our proposed system consists of three components: crawler, database and user interface. In the crawler component, GML documents are discovered and their text is parsed for information extraction and storage. The database component is responsible for indexing the information collected by the crawlers. Finally, the user interface component provides the interaction between system and user. We have implemented this system as a pilot on an application server as a simulation of the Web. Our system, as a spatial search engine, provides searching capability throughout the GML documents, and thus an important step towards improving the efficiency of search engines has been taken.

  4. Tags Extarction from Spatial Documents in Search Engines

    Science.gov (United States)

    Borhaninejad, S.; Hakimpour, F.; Hamzei, E.

    2015-12-01

    Nowadays selective access to information on the Web is provided by search engines, but in cases where the data include spatial information the search task becomes more complex and search engines require special capabilities. The purpose of this study is to extract the information which lies in spatial documents. To that end, we implement and evaluate information extraction from GML documents and a retrieval method in an integrated approach. Our proposed system consists of three components: crawler, database and user interface. In the crawler component, GML documents are discovered and their text is parsed for information extraction and storage. The database component is responsible for indexing the information collected by the crawlers. Finally, the user interface component provides the interaction between system and user. We have implemented this system as a pilot on an application server as a simulation of the Web. Our system, as a spatial search engine, provides searching capability throughout the GML documents, and thus an important step towards improving the efficiency of search engines has been taken.

  5. D-score: a search engine independent MD-score.

    Science.gov (United States)

    Vaudel, Marc; Breiter, Daniela; Beck, Florian; Rahnenführer, Jörg; Martens, Lennart; Zahedi, René P

    2013-03-01

    While peptides carrying PTMs are routinely identified in gel-free MS, the localization of the PTMs onto the peptide sequences remains challenging. Search engine scores of secondary peptide matches have been used in different approaches in order to infer the quality of site inference, by penalizing the localization whenever the search engine similarly scored two candidate peptides with different site assignments. In the present work, we show how the estimation of posterior error probabilities for peptide candidates allows the estimation of a PTM score called the D-score, for multiple search engine studies. We demonstrate the applicability of this score to three popular search engines: Mascot, OMSSA, and X!Tandem, and evaluate its performance using an already published high resolution data set of synthetic phosphopeptides. For those peptides with phosphorylation site inference uncertainty, the number of spectrum matches with correctly localized phosphorylation increased by up to 25.7% when compared to using Mascot alone, although the actual increase depended on the fragmentation method used. Since this method relies only on search engine scores, it can be readily applied to the scoring of the localization of virtually any modification at no additional experimental or in silico cost.

  6. Domain knowledge, search behaviour, and search effectiveness of engineering and science students: an exploratory study

    Directory of Open Access Journals (Sweden)

    Zhang X.

    2005-01-01

    Full Text Available Introduction. This study sought to answer three questions: (1) Would the level of domain knowledge significantly affect the user's search behaviour? (2) Would the level of domain knowledge significantly affect search effectiveness? (3) What would be the relationship between search behaviour and search effectiveness? Method. Participants were asked to rate their familiarity with 200 thesaurus terms to measure their level of domain knowledge. They also searched on three assigned topics using the COMPENDEX database. Data were collected through pre- and post-search questionnaires, thesaurus term rating form, computer logs, and search session printouts. Analysis. Twenty-two engineering and science students' data were analysed both quantitatively and qualitatively. Quantitative analysis included both descriptive statistics and statistical testing, while the qualitative analysis was on the use of terms in queries. Results. As the level of domain knowledge increases, the user tends to do more searches and to use more terms in queries. However, the search effectiveness remained the same for all participants. Conclusion. The level of domain knowledge seems to have an effect on search behaviour, but not on search effectiveness, and search behaviour does not seem to be related to search effectiveness. The findings are limited by the small sample size and need to be confirmed in further studies.

  7. A Survey of Meta Search Engine

    Institute of Scientific and Technical Information of China (English)

    张卫丰; 徐宝文; 周晓宇; 李东; 许蕾

    2001-01-01

    With the explosive increase of network information, it is more and more difficult for people to look up information. The emergence of Web search engines overcomes this problem to some degree. However, because different search engines use different mechanisms, scopes and algorithms, the overlap of the search results for the same query is no more than 34%. To get relatively full-scale, accurate search results, multiple search engines should be used, and so meta search engines emerged. In this paper, meta search engines are surveyed. First, the history, the principles and the elements of meta search engines are discussed. Then, the related criteria of meta search engines are analyzed and several typical meta search engines are compared. Finally, on this basis, the trend of the meta search engine is introduced.

  8. SEARCH ENGINE OPTIMIZATION: A CASE STUDY OF BENEFITS OF IT’S APPLICATION IN WEBSITES

    National Research Council Canada - National Science Library

    Christian Luís Ramos; Camilla Zanchin Caramigo; Vinícius Camargo Andrade; Gustavo Kimura Montanha; Fernando Henrique Campos

    2016-01-01

    .... The Search Engine Optimization (SEO) is a set of strategies and techniques that are aimed at improving the position where a website is displayed in the results list of search engines in the search of a particular subject...

  9. Analysis of the Temporal Behaviour of Search Engine Crawlers at Web Sites

    Directory of Open Access Journals (Sweden)

    Jeeva Jose

    2013-06-01

    Full Text Available Web log mining is the extraction of web logs to analyze user behaviour at web sites. In addition to user information, web logs provide immense information about search engine traffic and behaviour. Search engine crawlers are highly automated programs that periodically visit a web site to collect information. The behaviour of search engines can be used in analyzing server load, quality of search engines, dynamics of search engine crawlers, ethics of search engines, etc. The time spent by various crawlers is significant in identifying the server load, as a major proportion of the server load is constituted by search engine crawlers. A temporal analysis of the search engine crawlers was done to identify their behaviour. It was found that there is a significant difference in the total time spent by various crawlers. The presence of search engine crawlers at web sites on an hourly basis was also analysed to identify the dynamics of search engine crawlers at web sites.
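
    An hourly view of crawler presence, as described above, can be approximated by counting crawler requests per hour in a server access log. The sketch below assumes the common "combined" access log layout and a small, hand-picked list of crawler names; the sample lines are invented, and it counts requests rather than the time-spent measure used in the study.

```python
import re
from collections import Counter
from datetime import datetime

# Assumed "combined" log layout: host ident user [time] "request" status size "referer" "agent"
LOG_PATTERN = re.compile(
    r'\S+ \S+ \S+ \[(?P<time>[^\]]+)\] "[^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)
# Substrings used to recognise crawlers in the user agent (illustrative list).
CRAWLERS = ("Googlebot", "bingbot", "Baiduspider")


def hourly_crawler_hits(lines):
    """Count requests per (crawler, hour of day); other traffic is ignored."""
    hits = Counter()
    for line in lines:
        m = LOG_PATTERN.match(line)
        if not m:
            continue  # skip lines that do not follow the assumed layout
        agent = m.group("agent").lower()
        crawler = next((c for c in CRAWLERS if c.lower() in agent), None)
        if crawler is None:
            continue  # ordinary visitor, not a known crawler
        ts = datetime.strptime(m.group("time"), "%d/%b/%Y:%H:%M:%S %z")
        hits[(crawler, ts.hour)] += 1
    return hits


# Invented sample log lines.
sample = [
    '66.249.66.1 - - [10/Jun/2013:04:12:01 +0000] "GET /a HTTP/1.1" 200 512 "-" "Mozilla/5.0 (compatible; Googlebot/2.1)"',
    '66.249.66.1 - - [10/Jun/2013:04:47:30 +0000] "GET /b HTTP/1.1" 200 900 "-" "Mozilla/5.0 (compatible; Googlebot/2.1)"',
    '157.55.39.1 - - [10/Jun/2013:13:05:44 +0000] "GET /a HTTP/1.1" 200 512 "-" "Mozilla/5.0 (compatible; bingbot/2.0)"',
    '10.0.0.5 - - [10/Jun/2013:13:06:00 +0000] "GET /a HTTP/1.1" 200 512 "-" "Mozilla/5.0 (Windows NT 6.1)"',
]
hits = hourly_crawler_hits(sample)
```

    Summing the counts per crawler, or per hour across crawlers, gives the two views (per-crawler load and hourly dynamics) that the abstract describes.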

  10. Current state of biomedical engineering; Biomedical Engineering ha ima

    Energy Technology Data Exchange (ETDEWEB)

    Sakai, K. [Waseda Univ., Tokyo (Japan). School of Science and Engineering; Kanamori, T. [National Inst. of Materials and Chemical Research, Tsukuba (Japan)

    1996-11-05

    Medical science is divided into basic medical science and clinical medicine, and the technology of medical treatment is established as their combined strength. This concept can be compared with the existence of industrial engineering as a product of physical science and engineering. Basic medical science has become deeply combined with science as a result of the recent rise of molecular biology. As for clinical medicine, current highly advanced medical treatment can be said to be made up of scientific technology. Medical treatment can be considered to include prevention, diagnosis, remedy, and rehabilitation stages, and it is closely connected with engineering in each stage. The methods of approaching medical science are the elucidation of the functions of internal organs and tissue, treating a living body as a plant, and the offering of new therapeutic means by applying chemical devices to a living body. From the viewpoint of chemical engineering, the functions of artificial organs can be divided roughly into convection transport, mass transfer, structural members, and signal transfer. Medical treatment will be brought into close relation with scientific technology in the future. 10 refs., 3 figs., 2 tabs.

  11. LoyalTracker: Visualizing Loyalty Dynamics in Search Engines.

    Science.gov (United States)

    Shi, Conglei; Wu, Yingcai; Liu, Shixia; Zhou, Hong; Qu, Huamin

    2014-12-01

    The huge amount of user log data collected by search engine providers creates new opportunities to understand user loyalty and defection behavior at an unprecedented scale. However, it also makes it a great challenge to analyze the behavior and glean insights from the complex, large data. In this paper, we introduce LoyalTracker, a visual analytics system to track user loyalty and switching behavior towards multiple search engines from the vast amount of user log data. We propose a new interactive visualization technique (flow view) based on a flow metaphor, which conveys a proper visual summary of the dynamics of user loyalty of thousands of users over time. Two other visualization techniques, a density map and a word cloud, are integrated to enable analysts to gain further insights into the patterns identified by the flow view. Case studies and interviews with domain experts are conducted to demonstrate the usefulness of our technique in understanding user loyalty and switching behavior in search engines.

  12. Health literacy and usability of clinical trial search engines.

    Science.gov (United States)

    Utami, Dina; Bickmore, Timothy W; Barry, Barbara; Paasche-Orlow, Michael K

    2014-01-01

    Several web-based search engines have been developed to assist individuals to find clinical trials for which they may be interested in volunteering. However, these search engines may be difficult for individuals with low health and computer literacy to navigate. The authors present findings from a usability evaluation of clinical trial search tools with 41 participants across the health and computer literacy spectrum. The study consisted of 3 parts: (a) a usability study of an existing web-based clinical trial search tool; (b) a usability study of a keyword-based clinical trial search tool; and (c) an exploratory study investigating users' information needs when deciding among 2 or more candidate clinical trials. From the first 2 studies, the authors found that users with low health literacy have difficulty forming queries using keywords and have significantly more difficulty using a standard web-based clinical trial search tool compared with users with adequate health literacy. From the third study, the authors identified the search factors most important to individuals searching for clinical trials and how these varied by health literacy level.

  13. Query Recommendation by Coupling Personalization with Clustering for Search Engine

    Directory of Open Access Journals (Sweden)

    Dhiliphanrajkumar.Thambidurai

    2016-11-01

    Full Text Available In the present world, the internet and web search engines have become an important part of one's day-to-day life. For a user query, more than a few thousand web pages are retrieved, but most of them are irrelevant. A major problem in search engines is that user queries are usually short and ambiguous, and they are not sufficient to satisfy the precise user needs. Also, listing a large number of results makes users worry about finding the desired results, and it takes a large amount of time to search through the huge list of results. To overcome these problems, an effective approach is developed by capturing the users' click-through and bookmarking data to provide personalized query recommendation. For retrieving the results, the Google API is used. Experimental results show that the proposed method provides better query recommendation results than the existing query suggestion methods.

  14. A Longitudinal Analysis of Search Engine Index Size

    DEFF Research Database (Denmark)

    Van den Bosch, Antal; Bogers, Toine; De Kunder, Maurice

    2015-01-01

    One of the determining factors of the quality of Web search engines is the size of their index. In addition to its influence on search result quality, the size of the indexed Web can also tell us something about which parts of the WWW are directly accessible to the everyday user. We propose a novel method of estimating the size of a Web search engine’s index by extrapolating from document frequencies of words observed in a large static corpus of Web pages. In addition, we provide a unique longitudinal perspective on the size of Google and Bing’s indexes over a nine-year period, from March 2006 until January 2015. We find that index size estimates of these two search engines tend to vary dramatically over time, with Google generally possessing a larger index than Bing. This result raises doubts about the reliability of previous one-off estimates of the size of the indexed Web. We find...
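
    The extrapolation idea described above can be sketched in a few lines: if a word occurs in a known fraction of documents in a reference corpus, a search engine's reported hit count for that word, divided by that fraction, gives one estimate of the index size. The numbers below are invented, and combining per-word estimates with a median is an assumption made for this sketch; the abstract does not specify the authors' estimator.

```python
from statistics import median


def estimate_index_size(corpus_size, corpus_doc_freq, engine_hit_counts):
    """Extrapolate an index size from per-word hit counts.

    corpus_size:       number of documents in the reference corpus
    corpus_doc_freq:   word -> number of corpus documents containing it
    engine_hit_counts: word -> hit count reported by the search engine
    """
    estimates = []
    for word, hits in engine_hit_counts.items():
        df = corpus_doc_freq.get(word, 0)
        if df == 0:
            continue  # word not seen in the corpus, so no usable ratio
        # hits / (df / corpus_size), written to avoid an intermediate fraction
        estimates.append(hits * corpus_size / df)
    if not estimates:
        raise ValueError("no usable words")
    return median(estimates)  # median chosen here to dampen outlier words


# Invented figures for illustration only.
corpus_size = 1_000_000
corpus_doc_freq = {"the": 800_000, "search": 50_000, "index": 20_000}
engine_hit_counts = {"the": 8_000_000_000,
                     "search": 600_000_000,
                     "index": 180_000_000}

estimated_size = estimate_index_size(corpus_size, corpus_doc_freq,
                                     engine_hit_counts)
```

    Repeating such an estimate at regular intervals is what would turn it into the kind of longitudinal series the abstract reports.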

  16. A Longitudinal Analysis of Search Engine Index Size

    DEFF Research Database (Denmark)

    Van den Bosch, Antal; Bogers, Toine; De Kunder, Maurice

    2015-01-01

    ...until January 2015. We find that index size estimates of these two search engines tend to vary dramatically over time, with Google generally possessing a larger index than Bing. This result raises doubts about the reliability of previous one-off estimates of the size of the indexed Web. We find that much if not all of this variability can be explained by changes in the indexing and ranking infrastructure of Google and Bing. This casts further doubt on whether Web search engines can be used reliably for cross-sectional webometric studies.

  17. Research on the User Interest Modeling of Personalized Search Engine

    Institute of Scientific and Technical Information of China (English)

    LI Zhengwei; XIA Shixiong; NIU Qiang; XIA Zhanguo

    2007-01-01

    At present, how to enable a search engine to construct an initial model of a user's personal interests, keep track of the user's personalized information in a timely manner, and provide personalized services accurately has become a hotspot in search engine research. Aiming at the problems of constructing a user model, and combining the techniques of manual customization modeling and automatic analytical modeling, a User Interest Model (UIM) is proposed in the paper. On this basis, the corresponding establishment and update algorithms of the User Interest Profile (UIP) are presented. Simulation tests showed that the proposed UIM and corresponding algorithms could enhance retrieval precision effectively and have superior adaptability.

  18. An Introduction to Search Engines and Web Navigation

    CERN Document Server

    Levene, Mark

    2010-01-01

    This book is a second edition, updated and expanded to explain the technologies that help us find information on the web.  Search engines and web navigation tools have become ubiquitous in our day to day use of the web as an information source, a tool for commercial transactions and a social computing tool. Moreover, through the mobile web we have access to the web's services when we are on the move.  This book demystifies the tools that we use when interacting with the web, and gives the reader a detailed overview of where we are and where we are going in terms of search engine

  19. A Domain Specific Ontology Based Semantic Web Search Engine

    CERN Document Server

    Mukhopadhyay, Debajyoti; Mukherjee, Sreemoyee; Bhattacharya, Jhilik; Kim, Young-Chon

    2011-01-01

    Since its emergence in the 1990s, the World Wide Web (WWW) has rapidly evolved into a huge mine of global information, and it is growing in size every day. The presence of a huge amount of resources on the Web thus poses a serious problem for accurate search. This is mainly because today's Web is a human-readable Web where information cannot be easily processed by machine. The highly sophisticated, efficient keyword-based search engines that have evolved today have not been able to bridge this gap. Hence the concept of the Semantic Web, envisioned by Tim Berners-Lee as a Web of machine-interpretable information that provides a machine-processable form for expressing information. Based on semantic Web technologies, we present in this paper the design methodology and development of a semantic Web search engine which provides exact search results for a domain-specific search. This search engine is developed for an agricultural Website which hosts agricultural information about the state of West Bengal.

  20. A CLIR Interface to a Web search engine.

    Science.gov (United States)

    Daumke, Philipp; Schulz, Stefan; Markó, Kornél

    2005-01-01

    Medical document retrieval presents a unique combination of challenges for the design and implementation of retrieval engines. We introduce a method to meet these challenges by implementing a multilingual retrieval interface for biomedical content in the World Wide Web. To this end we developed an automated method for interlingual query construction by which a standard Web search engine is enabled to process non-English queries from the biomedical domain in order to retrieve English documents.

  1. BioCarian: search engine for exploratory searches in heterogeneous biological databases.

    Science.gov (United States)

    Zaki, Nazar; Tennakoon, Chandana

    2017-10-02

    There are a large number of biological databases publicly available to scientists on the web. There are also many private databases generated in the course of research projects. These databases are in a wide variety of formats. Web standards have evolved in recent times, and semantic web technologies are now available to interconnect diverse and heterogeneous sources of data. Therefore, integration and querying of biological databases can be facilitated by techniques used in the semantic web. Heterogeneous databases can be converted into the Resource Description Framework (RDF) and queried using the SPARQL language. Searching for exact queries in these databases is trivial. However, exploratory searches need customized solutions, especially when multiple databases are involved. This process is cumbersome and time consuming for those without a sufficient background in computer science. In this context, a search engine facilitating exploratory searches of databases would be of great help to the scientific community. We present BioCarian, an efficient and user-friendly search engine for performing exploratory searches on biological databases. The search engine is an interface for SPARQL queries over RDF databases. We note that many of the databases can be converted to tabular form. We first convert the tabular databases to RDF. The search engine provides a graphical interface based on facets to explore the converted databases. The facet interface is more advanced than conventional facets. It allows complex queries to be constructed, and has additional features such as ranking facet values based on several criteria, visually indicating the relevance of a facet value, and presenting the most important facet values when a large number of choices are available. For advanced users, SPARQL queries can be run directly on the databases. Using this feature, users will be able to incorporate federated searches of SPARQL endpoints.
We used the search engine to do an exploratory search

  2. Advances in the Kepler Transit Search Engine

    Science.gov (United States)

    Jenkins, Jon M.

    2016-10-01

    Twenty years ago, no planets were known outside our own solar system. Since then, the discoveries of ~1500 exoplanets have radically altered our views of planets and planetary systems. This revolution is due in no small part to the Kepler Mission, which has discovered >1000 of these planets and >4000 planet candidates. While Kepler has shown that small rocky planets and planetary systems are quite common, the quest to find Earth's closest cousins and characterize their atmospheres presses forward with missions such as NASA Explorer Program's Transiting Exoplanet Survey Satellite (TESS) slated for launch in 2017 and ESA's PLATO mission scheduled for launch in 2024. These future missions pose daunting data processing challenges in terms of the number of stars, the amount of data, and the difficulties in detecting weak signatures of transiting small planets against a roaring background. These complications include instrument noise and systematic effects as well as the intrinsic stellar variability of the subjects under scrutiny. In this paper we review recent developments in the Kepler transit search pipeline improving both the yield and reliability of detected transit signatures. Many of the phenomena in light curves that represent noise can also trigger transit detection algorithms. The Kepler Mission has expended great effort in suppressing false positives from its planetary candidate catalogs. Over 18,000 transit-like signatures can be identified for a search across 4 years of data. Most of these signatures are artifacts, not planets. Vetting all such signatures historically takes several months' effort by many individuals. We describe the application of machine learning approaches for the automated vetting and production of planet candidate catalogs. These algorithms can improve the efficiency of the human vetting effort as well as quantifying the likelihood that each candidate is truly a planet. 
This information is crucial for obtaining valid planet occurrence

  3. Uncovering the Hidden Web, Part I: Finding What the Search Engines Don't. ERIC Digest.

    Science.gov (United States)

    Mardis, Marcia

    Currently, the World Wide Web contains an estimated 7.4 million sites (OCLC, 2001). Yet even the most experienced searcher, using the most robust search engines, can access only about 16% of these pages (Dahn, 2001). The other 84% of the publicly available information on the Web is referred to as the "hidden," "invisible," or…

  4. Ranking of Wikipedia articles in search engines revisited: Fair ranking for reasonable quality?

    CERN Document Server

    Lewandowski, Dirk (DOI: 10.1002/asi.21423)

    2011-01-01

    This paper aims to review the fiercely discussed question of whether the ranking of Wikipedia articles in search engines is justified by the quality of the articles. After an overview of current research on information quality in Wikipedia, a summary of the extended discussion on the quality of encyclopedic entries in general is given. On this basis, a heuristic method for evaluating Wikipedia entries is developed and applied to Wikipedia articles that scored highly in a search engine retrieval effectiveness test and compared with the relevance judgment of jurors. In all search engines tested, Wikipedia results are unanimously judged better by the jurors than other results on the corresponding results position. Relevance judgments often roughly correspond with the results from the heuristic evaluation. Cases in which high relevance judgments are not in accordance with the comparatively low score from the heuristic evaluation are interpreted as an indicator of a high degree of trust in Wikipedia. One of the sy...

  5. Flavor Changing Neutral Current searches in the top quark sector

    CERN Document Server

    Bhowmik, Sandeep

    2017-01-01

    Flavor changing neutral current (FCNC) interactions of the top quark are highly suppressed in the Standard Model. Therefore, any measurable branching ratio for top FCNC decays is an indication of new physics. In this paper, searches for FCNC interactions in top quark production and decay at the LHC by the ATLAS Collaboration and the CMS Collaboration are presented. FCNC searches in t $\rightarrow$ qH, t $\rightarrow$ q$\gamma$ and t $\rightarrow$ qZ decays, and in top quark production in qg $\rightarrow$ t or q $\rightarrow$ tg are summarized. None of the searches yielded positive results, and exclusion limits on branching ratios, coupling strengths and cross-sections are obtained.

  6. Classifying queries submitted to a vertical search engine

    NARCIS (Netherlands)

    Berendsen, R.; Kovachev, B.; Meij, E.; de Rijke, M.; Weerkamp, W.

    2011-01-01

    We propose and motivate a scheme for classifying queries submitted to a people search engine. We specify a number of features for automatically classifying people queries into the proposed classes and examine the effectiveness of these features. Our main finding is that classification is feasible and that

  7. A Competitive and Experiential Assignment in Search Engine Optimization Strategy

    Science.gov (United States)

    Clarke, Theresa B.; Clarke, Irvine, III

    2014-01-01

    Despite an increase in ad spending and demand for employees with expertise in search engine optimization (SEO), methods for teaching this important marketing strategy have received little coverage in the literature. Using Bloom's cognitive goals hierarchy as a framework, this experiential assignment provides a process for educators who may be…

  8. A Competitive and Experiential Assignment in Search Engine Optimization Strategy

    Science.gov (United States)

    Clarke, Theresa B.; Clarke, Irvine, III

    2014-01-01

    Despite an increase in ad spending and demand for employees with expertise in search engine optimization (SEO), methods for teaching this important marketing strategy have received little coverage in the literature. Using Bloom's cognitive goals hierarchy as a framework, this experiential assignment provides a process for educators who may be new…

  9. The Role of Exploratory Talk in Classroom Search Engine Tasks

    Science.gov (United States)

    Knight, Simon; Mercer, Neil

    2015-01-01

    While search engines are commonly used by children to find information, and in classroom-based activities, children are not adept in their information seeking or evaluation of information sources. Prior work has explored such activities in isolated, individual contexts, failing to account for the collaborative, discourse-mediated nature of search…

  10. PR Students' Perceptions and Readiness for Using Search Engine Optimization

    Science.gov (United States)

    Moody, Mia; Bates, Elizabeth

    2013-01-01

    Enough evidence is available to support the idea that public relations professionals must possess search engine optimization (SEO) skills to assist clients in a full-service capacity; however, little research exists on how much college students know about the tactic and best practices for incorporating SEO into course curriculum. Furthermore, much…

  11. Pyndri: a Python Interface to the Indri Search Engine

    NARCIS (Netherlands)

    Van Gysel, C.; Kanoulas, E.; de Rijke, M.; Jose, J.M.; Hauff, C.; Altıngovde, I.S.; Song, D.; Albakour, D.; Watt, S.; Tait, J.

    2017-01-01

    We introduce pyndri, a Python interface to the Indri search engine. Pyndri allows to access Indri indexes from Python at two levels: (1) dictionary and tokenized document collection, (2) evaluating queries on the index. We hope that with the release of pyndri, we will stimulate reproducible, open

  12. Search Engines and Power: A Politics of Online (Mis-) Information

    Directory of Open Access Journals (Sweden)

    Elad Segev

    2008-06-01

    Media and communications have always been employed by dominant actors and played a crucial role in framing our knowledge and constructing certain orders. This paper examines the politics of search engines, suggesting that they increasingly become "authoritative" and popular information agents used by individuals, groups and governments to attain their position and shape the information order. Following the short evolution of search engines from small companies to global media corporations that commodify online information and control advertising spaces, this study brings attention to some of their important political, social, cultural and economic implications. This is indicated through their expanding operation and control over private and public informational spaces as well as through the structural bias of the information they attempt to organize. In particular, it is indicated that search engines are highly biased toward commercial and popular US-based content, supporting US-centric priorities and agendas. Consequently, it is suggested that together with their important role in "organizing the world's information" search engines reinforce certain inequalities and understandings of the world.

  15. Higgs Boson Searches @ LHC Dedicated to Engin

    CERN Document Server

    Kourkoumelis, C

    2008-01-01

    The Higgs boson is the only particle missing to complete the successful description of the elementary ingredients of our world. Its existence and its associated mechanism are predicted by the Standard Model (SM) in order to help give masses to the otherwise massless elementary particles. As of yet, it escapes experimental detection despite the enormous worldwide efforts. The new Large Hadron Collider (LHC) at CERN will provide enough collision energy for its formation, if it exists. The large experiments already installed are equipped with excellent detector capabilities in order to confirm (or reject) its existence. The late Professor Engin Arik and her group have strongly contributed to decade-long efforts of the ATLAS detector realization. The mass of the Higgs boson is not predicted by the SM, but in any case, it is above the existing experimental lower bound of 114.4 GeV/c^2 at 95% CL. The Higgs particle decays in a number of different ways. Depending on its mass, different decay modes and decay particle i...

  16. LAILAPS: The Plant Science Search Engine

    OpenAIRE

    2014-01-01

    With the number of sequenced plant genomes growing, the number of predicted genes and functional annotations is also increasing. The association between genes and phenotypic traits is currently of great interest. Unfortunately, the information available today is widely scattered over a number of different databases. Information retrieval (IR) has become an all-encompassing bioinformatics methodology for extracting knowledge from complex, heterogeneous and distributed databases, and therefore ...

  17. Credible Mechanism for More Reliable Search Engine Results

    Directory of Open Access Journals (Sweden)

    Mohammed Abdel Razek

    2015-02-01

    The number of websites on the Internet is growing haphazardly, thanks to the HTML language. Consequently, a diversity of information is available on the Web; however, its content is sometimes neither valuable nor trustworthy. This raises the problem of the credibility of the information on these websites. This paper investigates the aspects that affect website credibility and then uses them, along with the dominant meaning of the query, to improve information retrieval capabilities and to manage content effectively. It presents the design and development of a credibility mechanism that queries a Web search engine and then ranks sites according to their reliability. Our experiments show that the credibility terms on websites can affect the ranking of the Web search engine and greatly improve retrieval effectiveness.

  18. ERRATUM: TOWARDS ACTIVE SEO (SEARCH ENGINE OPTIMIZATION) 2.0

    Directory of Open Access Journals (Sweden)

    Charles-Victor Boutet

    2013-04-01

    In the age of the writable web, new skills and new practices are appearing. In an environment that allows everyone to communicate information globally, internet referencing (or SEO) is a strategic discipline that aims to generate visibility, internet traffic and maximum exploitation of a site's publications. Often misperceived as fraud, SEO has evolved into a facilitating tool for anyone who wishes to reference their website with search engines. In this article we show that it is possible to achieve the first rank in search results for keywords that are very competitive. We show methods that are quick, sustainable and legal, while applying the principles of active SEO 2.0. This article also clarifies some working functions of search engines and some advanced referencing techniques (that are completely ethical and legal), and we lay the foundations for an in-depth reflection on the qualities and advantages of these techniques.

  19. TOWARDS ACTIVE SEO (SEARCH ENGINE OPTIMIZATION) 2.0

    Directory of Open Access Journals (Sweden)

    Charles-Victor Boutet

    2012-12-01

    In the age of the writable web, new skills and new practices are appearing. In an environment that allows everyone to communicate information globally, internet referencing (or SEO) is a strategic discipline that aims to generate visibility, internet traffic and maximum exploitation of a site's publications. Often misperceived as fraud, SEO has evolved into a facilitating tool for anyone who wishes to reference their website with search engines. In this article we show that it is possible to achieve the first rank in search results for keywords that are very competitive. We show methods that are quick, sustainable and legal, while applying the principles of active SEO 2.0. This article also clarifies some working functions of search engines and some advanced referencing techniques (that are completely ethical and legal), and we lay the foundations for an in-depth reflection on the qualities and advantages of these techniques.

  20. Android Based Effective Search Engine Retrieval System Using Ontology

    Directory of Open Access Journals (Sweden)

    A. Praveena

    2014-05-01

    In the proposed model, users submit a query scoped either to a specified area or to the user’s location, and the server retrieves all the data to the user’s computer, where an ontology is applied. Applying the ontology classifies the results into two concepts: location based or content based. The user’s PC displays all the relevant keywords on the user’s mobile device so that the user can select the exact requirement. The client collects and stores clickthrough data locally to protect privacy, whereas tasks such as concept extraction, training, and reranking are performed at the search engine server. Ranking is then performed, and the exactly mapped information is delivered to the user’s mobile device. The privacy problem is addressed by restricting the information in the user profile that is exposed to the search engine server with two privacy parameters. Finally, the UDD algorithm is applied to eliminate duplicate records, which helps to minimize the number of URLs listed to the user.

  1. A Full-text Website Search Engine Powered by Lucene and The Depth First Search Algorithm

    Directory of Open Access Journals (Sweden)

    Modinat. A. Mabayoje

    2013-03-01

    With the amount of available text data on the web growing rapidly, the need for users to search such information is dramatically increasing. Full-text search engines and relational databases each have unique strengths as development tools but also have overlapping capabilities. Both can provide for storage and update of data, and both support search of the data. Full-text systems are better for quickly searching high volumes of unstructured text for the presence of any word or combination of words. They provide rich text search capabilities and sophisticated relevancy ranking tools for ordering results based on how well they match a potentially fuzzy search request. Relational databases, on the other hand, excel at storing and manipulating structured data: records of fields of specific types (text, integer, currency, etc.). They can do so with little or no redundancy. They support flexible search of multiple record types for specific values of fields, as well as strong tools for quickly and securely updating individual records. Because the web is a largely unstructured and ever-growing collection of documents, using an RDBMS to search it has become very costly. This paper describes the architecture, design and implementation of a prototype website search engine powered by Lucene to search through any website. The approach involves developing a small-scale web crawler to gather information from the desired website. The gathered information is then converted to Lucene documents and stored in the index. The time taken to search the index is very short compared with how long it takes a relational database to process a query.
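
    The crawl-then-index pipeline described in this abstract can be illustrated with a minimal sketch. It does not use Lucene and is not the authors' implementation: the in-memory `SITE` dictionary, the function names, and the plain inverted index with boolean AND matching are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical in-memory "website": URL -> (page text, outgoing links).
SITE = {
    "/": ("welcome to the library home page", ["/about", "/books"]),
    "/about": ("about the library and its staff", ["/"]),
    "/books": ("search the books catalogue", ["/books/lucene"]),
    "/books/lucene": ("lucene in action a book about full text search", []),
}


def crawl_dfs(site, start):
    """Visit pages depth-first from `start`; return URLs in visit order."""
    visited, order, stack = set(), [], [start]
    while stack:
        url = stack.pop()
        if url in visited or url not in site:
            continue
        visited.add(url)
        order.append(url)
        # Push links in reverse so the first link on the page is explored first.
        stack.extend(reversed(site[url][1]))
    return order


def build_index(site, urls):
    """Build an inverted index: term -> set of URLs whose text contains it."""
    index = defaultdict(set)
    for url in urls:
        for term in site[url][0].lower().split():
            index[term].add(url)
    return index


def search(index, query):
    """Return the sorted URLs that contain every query term (boolean AND)."""
    terms = query.lower().split()
    if not terms:
        return []
    return sorted(set.intersection(*(index.get(t, set()) for t in terms)))


pages = crawl_dfs(SITE, "/")
index = build_index(SITE, pages)
print(search(index, "full text"))
```

    A real deployment would replace the dictionary with HTTP fetching and HTML parsing, and the inverted index with a Lucene index that adds analysis and relevance ranking.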

  2. Health search engine with e-document analysis for reliable search results.

    Science.gov (United States)

    Gaudinat, Arnaud; Ruch, Patrick; Joubert, Michel; Uziel, Philippe; Strauss, Anne; Thonnet, Michèle; Baud, Robert; Spahni, Stéphane; Weber, Patrick; Bonal, Juan; Boyer, Celia; Fieschi, Marius; Geissbuhler, Antoine

    2006-01-01

    After reviewing the practical solutions currently available to citizens for retrieving eHealth documents, the paper describes an original specialized search engine, WRAPIN. WRAPIN uses advanced cross-lingual information retrieval technologies to check information quality by synthesizing medical concepts, conclusions and references contained in the health literature, in order to identify accurate, relevant sources. Thanks to MeSH terminology [1] (Medical Subject Headings from the U.S. National Library of Medicine) and advanced approaches such as conclusion extraction from structured documents and query reformulation, WRAPIN offers the user privileged access to navigate through multilingual documents without language or medical prerequisites. An evaluation conducted on the WRAPIN prototype shows that users perceive the results of the WRAPIN search engine as informative (65%, versus 59% for a general-purpose search engine) and as reliable and trustworthy (72%, versus 41% for the other engine). There is still room for improvement, such as increasing database coverage, explaining the original functionalities and adapting to the audience. Following the evaluation outcomes, WRAPIN is now in operation on the HON web site (http://www.healthonnet.org), free of charge. Intended for citizens, it is a good alternative to general-purpose search engines when the user looks up trustworthy health and medical information or wants to automatically check doubtful content on a Web page.

  3. Developing as new search engine and browser for libraries to search and organize the World Wide Web library resources

    OpenAIRE

    Sreenivasulu, V.

    2000-01-01

    Internet Granthalaya urges worldwide advocates and targets the task of creating a new search engine and dedicated browser. Internet Granthalaya may be the ultimate search engine exclusively dedicated to library use, for searching and organizing the World Wide Web library resources

  4. Developing as new search engine and browser for libraries to search and organize the World Wide Web library resources

    OpenAIRE

    SREENIVASULU, V.

    2000-01-01

    Internet Granthalaya urges worldwide advocates and targets the task of creating a new search engine and dedicated browser. Internet Granthalaya may be the ultimate search engine exclusively dedicated to library use, for searching and organizing the World Wide Web library resources

  5. The EBI Search engine: providing search and retrieval functionality for biological data from EMBL-EBI.

    Science.gov (United States)

    Squizzato, Silvano; Park, Young Mi; Buso, Nicola; Gur, Tamer; Cowley, Andrew; Li, Weizhong; Uludag, Mahmut; Pundir, Sangya; Cham, Jennifer A; McWilliam, Hamish; Lopez, Rodrigo

    2015-07-01

    The European Bioinformatics Institute (EMBL-EBI; https://www.ebi.ac.uk) provides free and unrestricted access to data across all major areas of biology and biomedicine. Searching and extracting knowledge across these domains requires a fast and scalable solution that addresses the requirements of domain experts as well as casual users. We present the EBI Search engine, referred to here as 'EBI Search', an easy-to-use fast text search and indexing system with powerful data navigation and retrieval capabilities. API integration provides access to analytical tools, allowing users to further investigate the results of their search. The interconnectivity that exists between data resources at EMBL-EBI provides easy, quick and precise navigation and a better understanding of the relationship between different data types including sequences, genes, gene products, proteins, protein domains, protein families, enzymes and macromolecular structures, together with relevant life science literature.

  6. A Search Engine That's Aware of Your Needs

    Science.gov (United States)

    2005-01-01

    Internet research can be compared to trying to drink from a firehose. Such a wealth of information is available that even the simplest inquiry can sometimes generate tens of thousands of leads, more information than most people can handle, and more burdensome than most can endure. Like everyone else, NASA scientists rely on the Internet as a primary search tool. Unlike the average user, though, NASA scientists perform some pretty sophisticated, involved research. To help manage the Internet and to allow researchers at NASA to gain better, more efficient access to the wealth of information, the Agency needed a search tool that was more refined and intelligent than the typical search engine. Partnership: NASA funded Stottler Henke, Inc., of San Mateo, California, a cutting-edge software company, with a Small Business Innovation Research (SBIR) contract to develop the Aware software for searching through the vast stores of knowledge quickly and efficiently. The partnership was through NASA's Ames Research Center.

  7. An efficient quantum search engine on unsorted database

    Science.gov (United States)

    Lu, Songfeng; Zhang, Yingyu; Liu, Fang

    2013-10-01

    We consider the problem of finding one or more desired items out of an unsorted database. Patel has shown that if the database permits quantum queries, then mere digitization is sufficient for efficient search for one desired item. The algorithm, called factorized quantum search algorithm, presented by him can locate the desired item in an unsorted database using O() queries to factorized oracles. But the algorithm requires that all the attribute values must be distinct from each other. In this paper, we discuss how to make a database satisfy the requirements, and present a quantum search engine based on the algorithm. Our goal is achieved by introducing auxiliary files for the attribute values that are not distinct, and converting every complex query request into a sequence of calls to factorized quantum search algorithm. The query complexity of our algorithm is O() for most cases.

  8. GOseek: a gene ontology search engine using enhanced keywords.

    Science.gov (United States)

    Taha, Kamal

    2013-01-01

    We propose in this paper a biological search engine called GOseek, which overcomes the limitation of current gene similarity tools. Given a set of genes, GOseek returns the most significant genes that are semantically related to the given genes. These returned genes are usually annotated to one of the Lowest Common Ancestors (LCA) of the Gene Ontology (GO) terms annotating the given genes. Most genes have several annotation GO terms. Therefore, there may be more than one LCA for the GO terms annotating the given genes. The LCA annotating the genes that are most semantically related to the given gene is the one that receives the most aggregate semantic contribution from the GO terms annotating the given genes. To identify this LCA, GOseek quantifies the contribution of the GO terms annotating the given genes to the semantics of their LCAs. That is, it encodes the semantic contribution into a numeric format. GOseek uses microarray experiment data to rank result genes based on their significance. We evaluated GOseek experimentally and compared it with a comparable gene prediction tool. Results showed marked improvement over the tool.

  9. Survey of formal and informal citation in Google search engine

    Directory of Open Access Journals (Sweden)

    Afsaneh Teymourikhani

    2016-03-01

    Aim: Informal citations are bibliographic information (a title or Internet address) citing information resources in informal scholarly communication, and they are always neglected in traditional citation databases. This study asks whether informal citations in the web environment are traceable. It aims to determine what proportion of the web citations found by the Google search engine are formal and informal citations. Research method: Webometrics is the method used. The study covers 1344 research articles from 98 open access journals, and web citations are extracted from the Google search engine using "Web/URL citation extraction". Findings: Ten percent of the web citations in the Google search engine are formal and informal citations. The highest share of formal citations in the Google search engine, 19.27%, is in library and information science, and the lowest, 1.54%, is in civil engineering. The highest share of informal citations, 3.57%, is in sociology, and the lowest, 0.39%, is in civil engineering. Journal citation is highest, at 94.12%, in surgery and lowest, at 5.26%, in philosophy. Result: Because formal and informal citations account for only about 10 percent of web citations in the Google search engine, a decrease compared with previous research, tracking citations with this engine should be treated with more caution. The share of formal citations varies across disciplines. Journal citation is highest in surgery and lowest in philosophy, which indicates that in philosophy, a subset of the social sciences, journals do not play a significant role in scientific communication. On the other hand, books have a key role in this field

  10. Current Status of the KSTAR Engineering

    Institute of Scientific and Technical Information of China (English)

    J. S. Bak; K. Kim; C. H. Choi; Y. K. Oh; B. C. Kim; N. I. Her; H. L. Yang; G. S. Lee; the KSTAR Team

    2004-01-01

    As there is substantial progress in KSTAR tokamak engineering, all the major structures and sub-systems are in the fabrication and procurement phase. The vacuum vessel, port, cryostat cylinder, lid, and bellows are being rigorously fabricated in the factory. Fabrication of the lower part of KSTAR, such as the cryostat base and gravity support, is almost finished. There has also been great progress, with significant results, in manufacturing the superconducting magnets, including four Toroidal Field (TF) coils and the lower and upper PF7 coils, which are the largest Poloidal Field (PF) coils. The TF00 coil, which was made for testing and as a back-up for the TF magnet system, was successfully tested in cool-down and current charging. As fabrication and procurement of the major structures have actively proceeded, assembly work was also launched in Aug. 2003. The status, results, and plans are described in more detail in this paper.

  11. Relevant Pages in semantic Web Search Engines using Ontology

    Directory of Open Access Journals (Sweden)

    Jemimah Simon

    2012-03-01

    Full Text Available In general, search engines are the most popular means of searching for any kind of information on the Internet. Generally, keywords are given to the search engine and the Web database returns the documents containing the specified keywords. In many situations, irrelevant results are returned for the user query, since different keywords are used in different forms in various documents. The development of the next generation Web, the Semantic Web, will change this situation. This paper proposes a prototype of a relation-based search engine which ranks pages according to the user query and annotated results. A page subgraph is computed for each annotated page in the result set by generating all possible combinations for the relation in the subgraph. A relevance score is computed for each annotated page using a probability measure. A relation-based ranking model is used which displays the pages in the final result set according to their relevance score. This ranking is provided by considering keyword-concept associations. Thus, the final result set contains pages in the order of their constrained relevance scores.

  12. First 20 Precision among World Wide Web Search Services (Search Engines).

    Science.gov (United States)

    Leighton, H. Vernon; Srivastava, Jaideep

    1999-01-01

    Compares five World Wide Web search engines for precision on the first 20 results returned for 15 queries, adding weight for ranking effectiveness. Discusses methods to lessen evaluator bias, evaluation criteria, definition of relevance, experimental design, the structure of queries, and future work. (Author/LRW)
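
    The measure named in this record can be illustrated with a minimal sketch. It assumes binary relevance judgments (1 = relevant, 0 = not) for one query's ranked results. The 1/rank weighting is an assumption made for illustration only; the abstract does not state which weighting the authors used, and the function names are hypothetical.

```python
def precision_at_k(relevance, k=20):
    """Fraction of the first k results that were judged relevant."""
    top = relevance[:k]
    return sum(top) / k


def weighted_precision_at_k(relevance, k=20):
    """Rank-weighted precision: a hit at rank r contributes 1/r.

    The 1/r weights are an assumed scheme, normalised so that a list
    with k relevant results in the first k positions scores 1.0.
    """
    top = relevance[:k]
    gained = sum(rel / rank for rank, rel in enumerate(top, start=1))
    best = sum(1 / rank for rank in range(1, k + 1))
    return gained / best


# Illustrative judgments for a single query: hits at ranks 1, 2 and 4.
judgments = [1, 1, 0, 1] + [0] * 16
print(precision_at_k(judgments))
print(weighted_precision_at_k(judgments))
```

    With weights of this kind, two engines with the same number of relevant hits are separated by where in the first 20 positions those hits appear.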

  13. Next-Generation Search Engines for Information Retrieval

    Energy Technology Data Exchange (ETDEWEB)

    Devarakonda, Ranjeet [ORNL; Hook, Leslie A [ORNL; Palanisamy, Giri [ORNL; Green, James M [ORNL

    2011-01-01

    In recent years, there have been significant advancements in the areas of scientific data management and retrieval techniques, particularly in terms of standards and protocols for archiving data and metadata. Scientific data is rich, and spread across different places. In order to integrate these pieces together, a data archive and associated metadata should be generated. Data should be stored in a format that is retrievable and, more importantly, in a format that will continue to be accessible as technology changes, such as XML. While general-purpose search engines (such as Google or Bing) are useful for finding many things on the Internet, they are often of limited usefulness for locating Earth Science data relevant (for example) to a specific spatiotemporal extent. By contrast, tools that search repositories of structured metadata can locate relevant datasets with fairly high precision, but the search is limited to that particular repository. Federated searches (such as Z39.50) have been used, but can be slow and the comprehensiveness can be limited by downtime in any search partner. An alternative approach to improve comprehensiveness is for a repository to harvest metadata from other repositories, possibly with limits based on subject matter or access permissions. Searches through harvested metadata can be extremely responsive, and the search tool can be customized with semantic augmentation appropriate to the community of practice being served. One such system is Mercury, a metadata harvesting, data discovery, and access system built for researchers to search for, share, and obtain spatiotemporal data used across a range of climate and ecological sciences. Mercury is an open-source toolset; its backend is built on Java, and its search capability is supported by popular open-source search libraries such as SOLR and LUCENE. Mercury harvests the structured metadata and key data from several data providing servers around the world and builds a

  14. Collection of Medical Original Data with Search Engine for Decision Support.

    Science.gov (United States)

    Orthuber, Wolfgang

    2016-01-01

    Medicine is becoming more and more complex, and humans can capture total medical knowledge only partially. For specific access, a high-resolution search engine is demonstrated which, besides conventional text search, also allows searching precise quantitative data on medical findings, therapies and results. Users can define metric spaces ("Domain Spaces", DSs) with all searchable quantitative data ("Domain Vectors", DVs). An implementation of the search engine is online at http://numericsearch.com. In future medicine, the doctor could first make a rough diagnosis and check which fine diagnostics (quantitative data) colleagues had collected in such a case. Then the doctor decides on fine diagnostics, and the results are sent (half automatically) to the search engine, which filters a group of patients that best fits these data. In this specific group, variable therapies can be checked with associated therapeutic results, as in an individual scientific study for the current patient. The statistical (anonymous) results could be used for specific decision support. Conversely, the therapeutic decision (in the best case with later results) could be used to enhance the collection of precise pseudonymous medical original data, which is used for better and better statistical (anonymous) search results.
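
    Searching quantitative data in a metric space, as this record describes, can be sketched as a nearest-neighbour lookup. The sketch below assumes Euclidean distance and a flat list of records; the record layout, the field names and the numbers are made up for illustration and are not taken from the numericsearch.com implementation.

```python
import math


def nearest_records(query, records, limit=3):
    """Return the `limit` records whose vectors lie closest to `query`.

    Euclidean distance is an assumption; any metric defined on the
    space could be substituted.
    """
    def distance(record):
        return math.dist(query, record["vector"])

    return sorted(records, key=distance)[:limit]


# Hypothetical records in a two-dimensional space (values are invented).
records = [
    {"id": "a", "vector": (120.0, 80.0)},
    {"id": "b", "vector": (150.0, 95.0)},
    {"id": "c", "vector": (118.0, 79.0)},
]

for record in nearest_records((119.0, 80.0), records, limit=2):
    print(record["id"])
```

    A linear scan like this is only practical for small collections; a large index would need a spatial data structure, which the sketch leaves out.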

  15. High dimensional search-based software engineering: finding tradeoffs among 15 objectives for automating software refactoring using NSGA-III

    OpenAIRE

    Mkaouer, Wiem; Kessentini, Marouane; Bechikh, Slim; Deb, Kalyanmoy; Ó Cinnéide, Mel

    2014-01-01

    peer-reviewed There is a growing need for scalable search-based software engineering approaches that address software engineering problems where a large number of objectives are to be optimized. Software refactoring is one of these problems where a refactoring sequence is sought that optimizes several software metrics. Most of the existing refactoring work uses a large set of quality metrics to evaluate the software design after applying refactoring operations, but current search-based sof...

  16. Antecedents and Outcomes of Search Engines Loyalty: Designing a model of Iranian Users Loyalty

    Directory of Open Access Journals (Sweden)

    Alireza Hadadian

    2013-09-01

    Full Text Available This paper aims to design a model that search engine designers can use to promote the loyalty of search engine users. This is a descriptive survey study. The statistical population consists of Ferdowsi University students, and the sample size was estimated at 347. The data-gathering instrument was a self-administered questionnaire, and structural equation modeling (SEM) was used for the data analysis. Findings indicate that user communication affects user satisfaction significantly and negatively. In the final research model, search engine image, perceived quality, and perceived value also affect user satisfaction. Moreover, search engine perceived value directly affects user loyalty and verbal advertising, and user satisfaction directly affects user loyalty. Finally, user loyalty leads to high expected switching costs, search engine revisits, and verbal advertising for search engines.

  17. Measuring the Utilization of On-Page Search Engine Optimization in Selected Domain

    National Research Council Canada - National Science Library

    Goran Matošević

    2015-01-01

    Search engine optimization (SEO) techniques involve „on-page“ and „off-page“ actions taken by web developers and SEO specialists with the aim of increasing the ranking of web pages in search engine results pages (SERP...

  18. A geometry-based image search engine for advanced RADARSAT-1/2 GIS applications

    Science.gov (United States)

    Kotamraju, Vinay; Rabus, Bernhard; Busler, Jennifer

    2012-06-01

    Space-borne Synthetic Aperture Radar (SAR) sensors, such as RADARSAT-1 and -2, enable a multitude of defense and security applications owing to their unique capabilities of cloud penetration, day/night imaging and multi-polarization imaging. As a result, advanced SAR image time series exploitation techniques such as Interferometric SAR (InSAR) and Radargrammetry are now routinely used in applications such as underground tunnel monitoring, infrastructure monitoring and DEM generation. Imaging geometry, as determined by the satellite orbit and imaged terrain, plays a critical role in the success of such techniques. This paper describes the architecture and the current status of development of a geometry-based search engine that allows the search and visualization of archived and future RADARSAT-1 and -2 images appropriate for a variety of advanced SAR techniques and applications. Key features of the search engine's scalable architecture include (a) Interactive GIS-based visualization of the search results; (b) A client-server architecture for online access that produces up-to-date searches of the archive images and that can, in future, be extended to acquisition planning; (c) A technique-specific search mode, wherein an expert user explicitly sets search parameters to find appropriate images for advanced SAR techniques such as InSAR and Radargrammetry; (d) A future application-specific search mode, wherein all search parameters implicitly default to preset values according to the application of choice such as tunnel monitoring, DEM generation and deformation mapping; (f) Accurate baseline calculations for InSAR searches, and, optimum beam configuration for Radargrammetric searches; (g) Simulated quick look images and technique-specific sensitivity maps in the future.

  19. Query log analysis of an electronic health record search engine.

    Science.gov (United States)

    Yang, Lei; Mei, Qiaozhu; Zheng, Kai; Hanauer, David A

    2011-01-01

    We analyzed a longitudinal collection of query logs of a full-text search engine designed to facilitate information retrieval in electronic health records (EHR). The collection, 202,905 queries and 35,928 user sessions recorded over a course of 4 years, represents the information-seeking behavior of 533 medical professionals, including frontline practitioners, coding personnel, patient safety officers, and biomedical researchers for patient data stored in EHR systems. In this paper, we present descriptive statistics of the queries, a categorization of information needs manifested through the queries, as well as temporal patterns of the users' information-seeking behavior. The results suggest that information needs in the medical domain are substantially more sophisticated than those that general-purpose web search engines need to accommodate. Therefore, we envision that there exists a significant challenge, along with significant opportunities, to provide intelligent query recommendations to facilitate information retrieval in EHR.

  20. The LAILAPS search engine: relevance ranking in life science databases.

    Science.gov (United States)

    Lange, Matthias; Spies, Karl; Bargsten, Joachim; Haberhauer, Gregor; Klapperstück, Matthias; Leps, Michael; Weinel, Christian; Wünschiers, Röbbe; Weissbach, Mandy; Stein, Jens; Scholz, Uwe

    2010-01-15

    Search engines and retrieval systems are popular tools at a life science desktop. The manual inspection of hundreds of database entries that reflect a life science concept or fact is time-intensive daily work. What matters here is not the number of query results but their relevance. In this paper, we present the LAILAPS search engine for life science databases. The concept is to combine a novel feature model for relevance ranking, a machine learning approach to model user relevance profiles, ranking improvement by user feedback tracking and an intuitive and slim web user interface that estimates relevance rank by tracking user interactions. Queries are formulated as simple keyword lists and will be expanded by synonyms. Supporting a flexible text index and a simple data import format, LAILAPS can easily be used both as a search engine for comprehensive integrated life science databases and for small in-house project databases. With a set of features extracted from each database hit, in combination with user relevance preferences, a neural network predicts user-specific relevance scores. Using expert knowledge as training data for a predefined neural network, or using users' own relevance training sets, a reliable relevance ranking of database hits has been implemented. In this paper, we present the LAILAPS system, the concepts, benchmarks and use cases. LAILAPS is publicly available for SWISSPROT data at http://lailaps.ipk-gatersleben.de.
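
    The query handling mentioned in this record (simple keyword lists expanded by synonyms) can be sketched as follows. This is a minimal illustration under assumed data structures, not the LAILAPS implementation: the function name and the synonym table are hypothetical.

```python
def expand_query(keywords, synonyms):
    """Expand a keyword list with synonyms, keeping order and dropping repeats.

    `synonyms` maps a lower-cased keyword to a list of alternative terms;
    this lookup-table format is an assumption made for the sketch.
    """
    expanded = []
    for word in keywords:
        for term in [word, *synonyms.get(word.lower(), [])]:
            if term not in expanded:
                expanded.append(term)
    return expanded


# Hypothetical synonym table with a single entry.
synonym_table = {"barley": ["Hordeum vulgare"]}
print(expand_query(["kinase", "barley"], synonym_table))
```

    Keeping the original keyword ahead of its synonyms lets a later ranking step favour exact matches if it chooses to.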

  1. Modification site localization scoring integrated into a search engine.

    Science.gov (United States)

    Baker, Peter R; Trinidad, Jonathan C; Chalkley, Robert J

    2011-07-01

    Large proteomic data sets identifying hundreds or thousands of modified peptides are becoming increasingly common in the literature. Several methods for assessing the reliability of peptide identifications both at the individual peptide or data set level have become established. However, tools for measuring the confidence of modification site assignments are sparse and are not often employed. A few tools for estimating phosphorylation site assignment reliabilities have been developed, but these are not integral to a search engine, so require a particular search engine output for a second step of processing. They may also require use of a particular fragmentation method and are mostly only applicable for phosphorylation analysis, rather than post-translational modifications analysis in general. In this study, we present the performance of site assignment scoring that is directly integrated into the search engine Protein Prospector, which allows site assignment reliability to be automatically reported for all modifications present in an identified peptide. It clearly indicates when a site assignment is ambiguous (and if so, between which residues), and reports an assignment score that can be translated into a reliability measure for individual site assignments.

  2. Improvement of natural image search engines results by emotional filtering

    Directory of Open Access Journals (Sweden)

    Patrice Denis

    2016-04-01

    Full Text Available In the Internet 2.0 era, managing user emotions is a problem that interests more and more actors. Historically, the first notions of emotion sharing were expressed and defined with emoticons. They allowed users to show their emotional status to others in an impersonal and emotionless digital world. Now, in the Internet of social media, users share large amounts of content with each other every day on Facebook, Twitter, Google+ and so on. Several popular new web sites, such as Flickr, Picasa, Pinterest, Instagram and DeviantArt, are now specifically based on sharing image content as well as personal emotional status. This kind of information is economically very valuable, as it can, for instance, help commercial companies sell more efficiently. With this kind of emotional information, companies can better target their customers' needs and/or sell them more products. Research has been, and still is, interested in mining emotional information from user data. In this paper, we focus on the impact of emotions from images that have been collected from image search engines. More specifically, we propose a filtering layer applied to the results of such image search engines. To our knowledge, this is the first attempt to filter image search engine results with an emotional filtering approach.

  3. Design and Implementation of the Personalized Search Engine Based on the Improved Behavior of User Browsing

    Directory of Open Access Journals (Sweden)

    Wei-Chao Li

    2013-02-01

    Full Text Available An improved user profile based on user browsing behavior is proposed in this study. The user profile takes into account the user's web page browsing behaviors, the level of interest in keywords, and the user's short-term and long-term interests. The improved user profile is embedded in a personalized search engine system. The basic framework and the basic functional modules of the system are described in detail in this study. A demonstration system of IUBPSES is developed on the .NET platform. The results of the simulation experiments indicate that the retrieval performance of IUBPSES, based on the improved user profile, surpasses that of the current mainstream search engines. Finally, directions for improvement and further research are proposed.

  4. Searching Choices: Quantifying Decision-Making Processes Using Search Engine Data.

    Science.gov (United States)

    Moat, Helen Susannah; Olivola, Christopher Y; Chater, Nick; Preis, Tobias

    2016-07-01

    When making a decision, humans consider two types of information: information they have acquired through their prior experience of the world, and further information they gather to support the decision in question. Here, we present evidence that data from search engines such as Google can help us model both sources of information. We show that statistics from search engines on the frequency of content on the Internet can help us estimate the statistical structure of prior experience; and, specifically, we outline how such statistics can inform psychological theories concerning the valuation of human lives, or choices involving delayed outcomes. Turning to information gathering, we show that search query data might help measure human information gathering, and it may predict subsequent decisions. Such data enable us to compare information gathered across nations, where analyses suggest, for example, a greater focus on the future in countries with a higher per capita GDP. We conclude that search engine data constitute a valuable new resource for cognitive scientists, offering a fascinating new tool for understanding the human decision-making process.

  5. The Effectiveness of Web Search Engines to Index New Sites from Different Countries

    Science.gov (United States)

    Pirkola, Ari

    2009-01-01

    Introduction: Investigates how effectively Web search engines index new sites from different countries. The primary interest is whether new sites are indexed equally or whether search engines are biased towards certain countries. If major search engines show biased coverage it can be considered a significant economic and political problem because…

  6. GeoSearcher: GeoSpatial Ranking of Search Engine Results.

    Science.gov (United States)

    Watters, Carolyn; Amoudi, Ghada

    2002-01-01

    Discusses search engines and describes a prototype system that provides dynamic ranking of search engine results for geospatial queries based on the URL of the host site. Evaluates this approach using user queries and random Web pages, making a contribution to Web retrieval by providing an alternative ranking order for search engine results.…

  7. Adding a Visualization Feature to Web Search Engines: It’s Time

    Energy Technology Data Exchange (ETDEWEB)

    Wong, Pak C.

    2008-11-11

    Since the first world wide web (WWW) search engine quietly entered our lives in 1994, the “information need” behind web searching has rapidly grown into a multi-billion dollar business that dominates the internet landscape, drives e-commerce traffic, propels global economy, and affects the lives of the whole human race. Today’s search engines are faster, smarter, and more powerful than those released just a few years ago. With the vast investment pouring into research and development by leading web technology providers and the intense emotion behind corporate slogans such as “win the web” or “take back the web,” I can’t help but ask why are we still using the very same “text-only” interface that was used 13 years ago to browse our search engine results pages (SERPs)? Why has the SERP interface technology lagged so far behind in the web evolution when the corresponding search technology has advanced so rapidly? In this article I explore some current SERP interface issues, suggest a simple but practical visual-based interface design approach, and argue why a visual approach can be a strong candidate for tomorrow’s SERP interface.

  8. Analysis of Search Engines and Meta Search Engines' Position by University of Isfahan Users Based on Rogers' Diffusion of Innovation Theory

    Directory of Open Access Journals (Sweden)

    Maryam Akbari

    2012-10-01

    Full Text Available The present study analyzed the adoption of search engines and meta search engines by University of Isfahan users during 2009-2010, based on Rogers' diffusion of innovation theory. The main aim of the research was to study the rate of adoption and to identify the potentials and effective tools in the adoption of search engines and meta search engines among University of Isfahan users. The research method was a descriptive survey. The study population was all postgraduate students of the University of Isfahan; 351 students were selected as the sample by stratified random sampling. A questionnaire was used for collecting data. The collected data were analyzed using SPSS 16 with both descriptive and analytic statistics. For descriptive statistics, frequency, percentage and mean were used, while for analytic statistics the t-test and the Kruskal-Wallis non-parametric test (H-test) were used. The t-test and Kruskal-Wallis findings indicated that the mean adoption of search engines and meta search engines did not differ statistically by gender, level of education or faculty. The adoption of special search engines differed by gender but not by level of education or faculty. Other results indicated that among general search engines, Google had the highest adoption rate. In addition, Google Scholar had the highest adoption rate among the special search engines, and Mamma among the meta search engines. Findings also showed that friends played an important role in how students adopted general search engines, while professors played an important role in how students adopted special search engines and meta search engines. Moreover, results showed that the university was the place where students became most acquainted with search engines and meta search engines. The findings showed that the curve of the adoption rate was not normal and was also not S-shaped. Moreover

  9. GeneView: a comprehensive semantic search engine for PubMed.

    Science.gov (United States)

    Thomas, Philippe; Starlinger, Johannes; Vowinkel, Alexander; Arzt, Sebastian; Leser, Ulf

    2012-07-01

    Research results are primarily published in scientific literature and curation efforts cannot keep up with the rapid growth of published literature. The plethora of knowledge remains hidden in large text repositories like MEDLINE. Consequently, life scientists have to spend a great amount of time searching for specific information. The enormous ambiguity among most names of biomedical objects such as genes, chemicals and diseases often produces too large and unspecific search results. We present GeneView, a semantic search engine for biomedical knowledge. GeneView is built upon a comprehensively annotated version of PubMed abstracts and openly available PubMed Central full texts. This semi-structured representation of biomedical texts enables a number of features extending classical search engines. For instance, users may search for entities using unique database identifiers or they may rank documents by the number of specific mentions they contain. Annotation is performed by a multitude of state-of-the-art text-mining tools for recognizing mentions from 10 entity classes and for identifying protein-protein interactions. GeneView currently contains annotations for >194 million entities from 10 classes for ∼21 million citations with 271,000 full text bodies. GeneView can be searched at http://bc3.informatik.hu-berlin.de/.

  10. This image smells good: Effects of image information scent in search engine results pages

    OpenAIRE

    Loumakis, F.; Stumpf, S.; Grayson, D

    2011-01-01

    Users are confronted with an overwhelming amount of web pages when they look for information on the Internet. Current search engines already aid the user in their information seeking tasks by providing textual results but adding images to results pages could further help the user in judging the relevance of a result. We investigated this problem from an Information Foraging perspective and we report on two empirical studies that focused on the information scent of images. Our results show tha...

  11. REPTREE CLASSIFIER FOR IDENTIFYING LINK SPAM IN WEB SEARCH ENGINES

    Directory of Open Access Journals (Sweden)

    S.K. Jayanthi

    2013-01-01

    Full Text Available Search engines are used for retrieving information from the web. Most of the time, attention goes to the top 10 results, and sometimes only the top 5, because of time constraints and reliance on the search engines. Users believe that the top 10 or 5 results are the most relevant. Here the problem of spamdexing arises: it is a method of degrading search result quality. Falsified metrics, such as inserting an enormous number of keywords or links in a website, may take that website to the top 10 or 5 positions. This paper proposes a classifier based on the Reptree (Regression tree representative). As an initial step, link-based features such as neighbors, pagerank, truncated pagerank, trustrank and assortativity-related attributes are inferred. Based on these features, a tree is constructed. The tree uses the feature inference to differentiate spam sites from legitimate sites. The WEBSPAM-UK-2007 dataset is taken as a base. It is preprocessed and converted into five datasets: FEATA, FEATB, FEATC, FEATD and FEATE. Only link-based features are used in the experiments; this paper focuses on link spam alone. Finally, a representative tree is created which classifies the web spam entries more precisely. Results are given. Regression tree classification seems to perform well, as shown through experiments.

  12. Cross-System Evaluation of Clinical Trial Search Engines

    Science.gov (United States)

    Jiang, Silis Y.; Weng, Chunhua

    2014-01-01

    Clinical trials are fundamental to the advancement of medicine but constantly face recruitment difficulties. Various clinical trial search engines have been designed to help health consumers identify trials for which they may be eligible. Unfortunately, knowledge of the usefulness and usability of their designs remains scarce. In this study, we used mixed methods, including time-motion analysis, think-aloud protocol, and survey, to evaluate five popular clinical trial search engines with 11 users. Differences in user preferences and time spent on each system were observed and correlated with user characteristics. In general, searching for applicable trials using these systems is a cognitively demanding task. Our results show that user perceptions of these systems are multifactorial. The survey indicated eTACTS being the generally preferred system, but this finding did not persist among all mixed methods. This study confirms the value of mixed-methods for a comprehensive system evaluation. Future system designers must be aware that different users groups expect different functionalities. PMID:25954590

  13. Cross-system evaluation of clinical trial search engines.

    Science.gov (United States)

    Jiang, Silis Y; Weng, Chunhua

    2014-01-01

    Clinical trials are fundamental to the advancement of medicine but constantly face recruitment difficulties. Various clinical trial search engines have been designed to help health consumers identify trials for which they may be eligible. Unfortunately, knowledge of the usefulness and usability of their designs remains scarce. In this study, we used mixed methods, including time-motion analysis, think-aloud protocol, and survey, to evaluate five popular clinical trial search engines with 11 users. Differences in user preferences and time spent on each system were observed and correlated with user characteristics. In general, searching for applicable trials using these systems is a cognitively demanding task. Our results show that user perceptions of these systems are multifactorial. The survey indicated eTACTS being the generally preferred system, but this finding did not persist among all mixed methods. This study confirms the value of mixed-methods for a comprehensive system evaluation. Future system designers must be aware that different users groups expect different functionalities.

  14. Teen smoking cessation help via the Internet: a survey of search engines.

    Science.gov (United States)

    Edwards, Christine C; Elliott, Sean P; Conway, Terry L; Woodruff, Susan I

    2003-07-01

    The objective of this study was to assess Web sites related to teen smoking cessation on the Internet. Seven Internet search engines were searched using the keywords teen quit smoking. The top 20 hits from each search engine were reviewed and categorized. The keywords teen quit smoking produced between 35 and 400,000 hits depending on the search engine. Of 140 potential hits, 62% were active, unique sites; 85% were listed by only one search engine; and 40% focused on cessation. Findings suggest that legitimate on-line smoking cessation help for teens is constrained by search engine choice and the amount of time teens spend looking through potential sites. Resource listings should be updated regularly. Smoking cessation Web sites need to be picked up on multiple search engine searches. Further evaluation of smoking cessation Web sites needs to be conducted to identify the most effective help for teens.

  15. Chemical Engineering Education - Current and Future Trends

    DEFF Research Database (Denmark)

    Gani, Rafiqul

    design, investigations, engineering practice and transferable skills) and a set guidelines (core curriculum, teaching and learning, industrial experience, review of the education process and student assessment) to achieve them, with special emphasis to the ability to solve problems. They also propose...... a leading role to define the chemical engineering curriculum. The result has been a set of recommendations for the first (BSc), second (MSc) and third (PhD) cycle chemical engineering education aligned to the Bologna Process. They recommend that students studying towards bachelor and masters qualifications...... a diversity of individual, academic and labour-market needs. Within Europe, two types of higher education in chemical engineering can be found: more research-oriented or more application-oriented first cycle programmes. Both types of studies cover a period of 3-4 academic years and 60 credits per year. After...

  16. Crossover phenomenon in the performance of an Internet search engine

    CERN Document Server

    Lacasa, Lucas; Berdahl, Andrew

    2012-01-01

    In this work we explore the ability of the Google search engine to find results for random N-letter strings. These random strings, dense over the set of possible N-letter words, address the existence of typos, acronyms, and other words without semantic meaning. Interestingly, we find that the probability of finding such strings sharply drops from one to zero at Nc = 6. The behavior of such order parameter suggests the presence of a transition-like phenomenon in the geometry of the search space. Furthermore, we define a susceptibility-like parameter which reaches a maximum in the neighborhood, suggesting the presence of criticality. We finally speculate on the possible connections to Ramsey theory.
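
    The kind of experiment this record describes can be sketched as a sampling loop: draw random N-letter strings and record the fraction for which a search returns results. The sketch below is an illustration under stated assumptions, not the authors' procedure. `has_results` is a hypothetical callback standing in for a real search-engine query, which the sketch does not perform, and the toy predicate at the end exists only to make the code runnable.

```python
import random
import string


def random_word(n, rng):
    """Draw one string of n lowercase letters, uniformly at random."""
    return "".join(rng.choice(string.ascii_lowercase) for _ in range(n))


def hit_fraction(n, has_results, samples=1000, seed=0):
    """Estimate the probability that a random n-letter string is found.

    `has_results` is a caller-supplied predicate (hypothetical here)
    that reports whether a search for the given string returns anything.
    """
    rng = random.Random(seed)
    hits = sum(bool(has_results(random_word(n, rng))) for _ in range(samples))
    return hits / samples


# Toy stand-in predicate: "found" if the string starts with a vowel.
def toy_index(word):
    return word[0] in "aeiou"


for length in range(3, 9):
    print(length, hit_fraction(length, toy_index, samples=200))
```

    Plotting the estimated fraction against N for a real engine is what would expose a sharp drop of the kind the abstract reports at Nc = 6; the toy predicate shows no such drop.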

  17. Stochastic Background Search Correlating ALLEGRO with LIGO Engineering Data

    CERN Document Server

    Whelan, J T; Heng, I S; McHugh, M P; Lazzarini, A; Whelan, John T; Daw, Edward; Heng, Ik Siong; Hugh, Martin P Mc; Lazzarini, Albert

    2003-01-01

    We describe the role of correlation measurements between the LIGO interferometer in Livingston, LA, and the ALLEGRO resonant bar detector in Baton Rouge, LA, in searches for a stochastic background of gravitational waves. Such measurements provide a valuable complement to correlations between interferometers at the two LIGO sites, since they are sensitive in a different, higher, frequency band. Additionally, the variable orientation of the ALLEGRO detector provides a means to distinguish gravitational wave correlations from correlated environmental noise. We describe the analysis underway to set a limit on the strength of a stochastic background at frequencies near 900 Hz using ALLEGRO data and data from LIGO's E7 Engineering Run.

  18. Searches for flavour changing neutral currents in the top sector

    CERN Document Server

    Araque, J P

    2016-01-01

    Flavour Changing Neutral Current (FCNC) processes are forbidden at tree level in the Standard Model and highly suppressed at higher orders. This makes FCNC one of the key processes to search for new physics since any small deviations from the Standard Model expectations could have a big impact. Both ATLAS and CMS Collaborations have designed a comprehensive strategy to search for FCNC in top quark physics both in the production and decay. The strategies followed by both collaborations are here described, using data from $pp$ collisions at the LHC collected at centre-of-mass energies of 7 and 8~TeV with integrated luminosities ranging from $5~\rm{fb}^{-1}$ to $20.3~\rm{fb}^{-1}$.

  19. Searches for flavour changing neutral currents in the top sector

    CERN Document Server

    AUTHOR|(INSPIRE)INSPIRE-00359999; The ATLAS collaboration

    2016-01-01

    Flavour Changing Neutral Current (FCNC) processes are forbidden at tree level in the Standard Model and highly suppressed at higher orders. This makes FCNC one of the key processes to search for new physics since any small deviations from the Standard Model expectations could have a big impact. Both ATLAS and CMS Collaborations have designed a comprehensive strategy to search for FCNC in top quark physics both in the production and decay. The strategies followed by both collaborations are here described, using data from $pp$ collisions at the LHC collected at centre-of-mass energies of 7 and 8~TeV with integrated luminosities ranging from $5~\rm{fb}^{-1}$ to $20.3~\rm{fb}^{-1}$.

  20. Current Results and Future Directions of the Pulsar Search Collaboratory

    Science.gov (United States)

    Heatherly, Sue Ann; Rosen, R.; McLaughlin, M.; Lorimer, D.

    2011-01-01

    The Pulsar Search Collaboratory (PSC) is a joint partnership between the National Radio Astronomy Observatory (NRAO) and West Virginia University (WVU). The ultimate goal of the PSC is to interest students in science, technology, engineering, and mathematics (STEM) fields by engaging them in conducting authentic scientific research, specifically the search for new pulsars. Of the 33 schools in the original PSC program, 13 come from rural school districts; one third of these are from schools where over 50% participate in the Free/Reduced School Lunch program. We are reaching first generation college-goers. For students, the program succeeds in building confidence, rapport with the scientists involved in the project, and greater comfort with team-work. We see additional gains in girls, as they see themselves more as scientists after participating in the PSC program, which is an important predictor of success in STEM fields. The PSC has had several scientific successes as well. To date, PSC students have made two astronomical discoveries: a 4.8-s pulsar and a bright radio burst of astrophysical origin, most likely from a sporadic neutron star. We will report on the status of the project including new evaluation data. We will also describe PSC-West, an experiment to involve schools in Illinois and Wisconsin using primarily online tools for professional development of teachers and coaching of students. Knowledge gained through our efforts with PSC-West will assist the PSC team in scaling up the project.

  1. Application of ARIMA(1,1,0) Model for Predicting Time Delay of Search Engine Crawlers

    Directory of Open Access Journals (Sweden)

    Jeeva JOSE

    2013-01-01

    The World Wide Web is growing at a tremendous rate in terms of the number of visitors and number of web pages. Search engine crawlers are highly automated programs that periodically visit the web and index web pages. The behavior of search engines could be used in analyzing server load, quality of search engines, dynamics of search engine crawlers, ethics of search engines etc. The more visits a crawler makes to a web site, the more it contributes to the workload. The time delay between two consecutive visits of a crawler determines the dynamicity of the crawlers. The ARIMA(1,1,0) model in time series analysis works well for forecasting the time delay between the visits of search crawlers at web sites. We considered 5 search engine crawlers, all of which could be modeled using ARIMA(1,1,0). The results of this study are useful in analyzing the server load.
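
    To make the model in the abstract above concrete: ARIMA(1,1,0) amounts to an AR(1) process on the first differences of the series. The following is a minimal sketch under stated assumptions, not the authors' procedure: it omits the constant term, estimates the AR coefficient by ordinary least squares, and the function names and the sample delays are hypothetical.

```python
def fit_arima_110(series):
    """Estimate phi for ARIMA(1,1,0) without a constant term.

    Model (an assumption of this sketch): d_t = phi * d_{t-1} + e_t,
    where d_t = y_t - y_{t-1} is the first difference of the series.
    phi is estimated by least squares on consecutive differences.
    """
    if len(series) < 3:
        raise ValueError("need at least three observations")
    diffs = [b - a for a, b in zip(series, series[1:])]
    numerator = sum(prev * cur for prev, cur in zip(diffs, diffs[1:]))
    denominator = sum(prev * prev for prev in diffs[:-1])
    if denominator == 0:
        raise ValueError("differences are all zero; phi is not identifiable")
    return numerator / denominator


def forecast_arima_110(series, phi, steps=1):
    """Forecast future values by iterating the fitted difference equation."""
    level = series[-1]
    diff = series[-1] - series[-2]
    forecasts = []
    for _ in range(steps):
        diff = phi * diff      # expected next difference
        level = level + diff   # integrate back to the original scale
        forecasts.append(level)
    return forecasts


# Hypothetical delays (in hours) between consecutive visits of one crawler.
delays = [0.0, 8.0, 12.0, 14.0, 15.0]
phi = fit_arima_110(delays)
next_delays = forecast_arima_110(delays, phi, steps=2)
```

    A production analysis would normally rely on a statistics package that also estimates a constant and reports diagnostics; the sketch only shows what the (1,1,0) order means.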

  2. Mobile Storage and Search Engine of Information Oriented to Food Cloud

    Directory of Open Access Journals (Sweden)

    Lifeng Wei

    2013-10-01

    The aim of this study is to establish a food cloud information search architecture. A food information search engine based on cloud computing architecture can not only achieve more personalized and intelligent search, but can also solve the problems of data processing and storage centralization caused by the mass of food and custom information. Following the design idea of a mobile search engine based on cloud computing architecture, the Map/Reduce algorithm and HDFS were analyzed in depth. Cloud parallel storage technology was introduced into the mobile search engine, which was designed and implemented on the open-source Hadoop framework. The system achieves parallel storage of mass information and mass data, overcoming the imbalance caused by data centralization and the overhead on storage server load caused by mass food data in a traditional search engine, thus achieving highly efficient mobile device search results and good user satisfaction.

  3. Indirect dark matter searches: current status and perspectives

    CERN Document Server

    CERN. Geneva

    2016-01-01

    Many theoretical ideas for the particle nature of dark matter exist. The most popular models often predict that dark matter particles self-annihilate or decay, giving rise to potentially detectable signatures in astronomical observations. I will summarize the current status of searches for such signatures and critically reassess recent claims for dark matter signals. I will further provide an outlook on anticipated developments in the next 10 years, and discuss new methods to facilitate strategy development.

  4. Current Status of Tissue Engineering Heart Valve.

    Science.gov (United States)

    Shinoka, Toshiharu; Miyachi, Hideki

    2016-11-01

    The development of surgically implantable heart valve prostheses has contributed to improved outcomes in patients with cardiovascular disease. However, there are drawbacks, such as risk of infection and lack of growth potential. Tissue-engineered heart valve (TEHV) holds great promise to address these drawbacks as the ideal TEHV is easily implanted, biocompatible, non-thrombogenic, durable, degradable, and ultimately remodels into native-like tissue. In general, three main components used in creating a tissue-engineered construct are (1) a scaffold material, (2) a cell type for seeding the scaffold, and (3) a subsequent remodeling process driven by cell accumulation and proliferation, and/or biochemical and mechanical signaling. Despite rapid progress in the field over the past decade, TEHVs have not been translated into clinical applications successfully. To successfully utilize TEHVs clinically, further elucidation of the mechanisms for TEHV remodeling and further translational research outcome evaluations will be required. Tissue engineering is a major breakthrough in cardiovascular medicine that holds amazing promise for the future of reconstructive surgical procedures. In this article, we review the history of regenerative medicine, advances in the field, and state-of-the-art in valvular tissue engineering. © The Author(s) 2016.

  5. The Information-Seeking Practices of Engineers: Searching for Documents as Well as for People.

    Science.gov (United States)

    Hertzum, Morten; Pejtersen, Annelise Mark

    2000-01-01

    Investigates engineers' information-seeking practices based on case studies in two organizations. Results show engineers search for documents to find people, search for people to get documents, and interact socially to get information without engaging in explicit searches. Discusses the design task and how computer systems could support searches…

  6. A Taxonomic Search Engine: Federating taxonomic databases using web services

    Directory of Open Access Journals (Sweden)

    Page Roderic DM

    2005-03-01

    Background: The taxonomic name of an organism is a key link between different databases that store information on that organism. However, in the absence of a single, comprehensive database of organism names, individual databases lack an easy means of checking the correctness of a name. Furthermore, the same organism may have more than one name, and the same name may apply to more than one organism. Results: The Taxonomic Search Engine (TSE) is a web application written in PHP that queries multiple taxonomic databases (ITIS, Index Fungorum, IPNI, NCBI, and uBIO) and summarises the results in a consistent format. It supports "drill-down" queries to retrieve a specific record. The TSE can optionally suggest alternative spellings the user can try. It also acts as a Life Science Identifier (LSID) authority for the source taxonomic databases, providing globally unique identifiers (and associated metadata) for each name. Conclusion: The Taxonomic Search Engine is available at http://darwin.zoology.gla.ac.uk/~rpage/portal/ and provides a simple demonstration of the potential of the federated approach to providing access to taxonomic names.
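
    The federated approach described in the abstract above can be sketched as a thin layer that forwards one query to several sources and normalises whatever comes back. This is an illustrative sketch only: the TSE itself is written in PHP, and the record fields, the stub sources and every name below are hypothetical, not the interfaces of the real databases.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class NameRecord:
    """One hit, normalised to a common shape regardless of its source."""
    source: str
    name: str
    record_id: str


# A backend is any callable that takes a query and returns raw hits.
Backend = Callable[[str], List[dict]]


def federated_search(query: str, backends: Dict[str, Backend]) -> List[NameRecord]:
    """Send the query to every backend and merge the normalised results."""
    merged: List[NameRecord] = []
    for source, lookup in backends.items():
        try:
            hits = lookup(query)
        except Exception:
            # One unavailable source should not fail the whole query.
            continue
        for hit in hits:
            merged.append(NameRecord(source, hit["name"], str(hit["id"])))
    return merged


# Hypothetical stand-ins for remote taxonomic databases.
def source_a(query: str) -> List[dict]:
    return [{"name": query, "id": 101}]


def source_b(query: str) -> List[dict]:
    return [{"name": query, "id": "b-7"}, {"name": query + " var. x", "id": "b-8"}]


def source_down(query: str) -> List[dict]:
    raise ConnectionError("simulated outage")


records = federated_search(
    "Homo sapiens",
    {"source_a": source_a, "source_b": source_b, "source_down": source_down},
)
```

    Keeping each source behind a plain callable means adding a database only requires one more adapter; the merge logic stays unchanged.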

  7. Improving sensitivity in proteome studies by analysis of false discovery rates for multiple search engines.

    Science.gov (United States)

    Jones, Andrew R; Siepen, Jennifer A; Hubbard, Simon J; Paton, Norman W

    2009-03-01

    LC-MS experiments can generate large quantities of data, for which a variety of database search engines are available to make peptide and protein identifications. Decoy databases are becoming widely used to place statistical confidence in result sets, allowing the false discovery rate (FDR) to be estimated. Different search engines produce different identification sets so employing more than one search engine could result in an increased number of peptides (and proteins) being identified, if an appropriate mechanism for combining data can be defined. We have developed a search engine independent score, based on FDR, which allows peptide identifications from different search engines to be combined, called the FDR Score. The results demonstrate that the observed FDR is significantly different when analysing the set of identifications made by all three search engines, by each pair of search engines or by a single search engine. Our algorithm assigns identifications to groups according to the set of search engines that have made the identification, and re-assigns the score (combined FDR Score). The combined FDR Score can differentiate between correct and incorrect peptide identifications with high accuracy, allowing on average 35% more peptide identifications to be made at a fixed FDR than using a single search engine.
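
    As background for the decoy-based estimate mentioned in the abstract above, here is a minimal sketch of a commonly used target-decoy calculation. It is not the paper's combined FDR Score; it assumes that higher scores are better and that the FDR above a threshold is estimated as decoy hits divided by target hits, which is one of several conventions in use. The function name and the sample scores are hypothetical.

```python
from typing import List, Tuple

# Each identification is (score, is_decoy); higher score means a better match.
Identification = Tuple[float, bool]


def estimate_fdr(identifications: List[Identification], threshold: float) -> float:
    """Estimate the false discovery rate among target hits at or above threshold.

    Assumption of this sketch: decoy hits model false target hits one-to-one,
    so FDR is approximated by (#decoys) / (#targets) above the threshold.
    """
    targets = sum(1 for score, is_decoy in identifications
                  if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in identifications
                 if score >= threshold and is_decoy)
    if targets == 0:
        return 0.0  # nothing accepted, so no false discoveries to report
    return decoys / targets


# Hypothetical scores from a single search engine.
hits = [(10.0, False), (9.0, False), (8.0, True),
        (7.0, False), (6.0, False), (5.0, True)]
fdr_loose = estimate_fdr(hits, threshold=6.0)
fdr_strict = estimate_fdr(hits, threshold=9.0)
```

    Raising the threshold trades identifications for a lower estimated FDR, which is why comparing engines at a fixed FDR, as the abstract does, is a fair basis for counting identifications.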

  8. Design and Implementation of a Threaded Search Engine for Tour Recommendation Systems

    Science.gov (United States)

    Lee, Junghoon; Park, Gyung-Leen; Ko, Jin-Hee; Shin, In-Hye; Kang, Mikyung

    This paper implements a threaded scan engine for the O(n!) search space and measures its performance, aiming at providing a responsive tour recommendation and scheduling service. As a preliminary step of integrating POI ontology, mobile object database, and personalization profile for the development of new vehicular telematics services, this implementation can give a useful guideline to design a challenging and computation-intensive vehicular telematics service. The implemented engine allocates the subtree to the respective threads and makes them run concurrently exploiting the primitives provided by the operating system and the underlying multiprocessor architecture. It also makes it easy to add a variety of constraints, for example, the search tree is pruned if the cost of partial allocation already exceeds the current best. The performance measurement result shows that the service can run even in the low-power telematics device when the number of destinations does not exceed 15, with an appropriate constraint processing.
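
    The subtree allocation and pruning rule described in the abstract above can be sketched as follows. This is a simplified illustration, not the authors' engine: it searches open tours that start at destination 0, gives each worker thread the subtree fixed by the first stop, and lets each subtree keep its own best cost instead of sharing one across threads. All names and the distance matrix are hypothetical.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def search_subtree(dist, first_stop):
    """Depth-first search over tours 0 -> first_stop -> ..., with pruning."""
    n = len(dist)
    best_cost = math.inf
    best_path = None

    def extend(path, visited, cost):
        nonlocal best_cost, best_path
        if cost >= best_cost:
            return  # prune: the partial cost already matches or exceeds the best
        if len(path) == n:
            best_cost, best_path = cost, path[:]
            return
        for nxt in range(n):
            if nxt not in visited:
                step = dist[path[-1]][nxt]
                visited.add(nxt)
                path.append(nxt)
                extend(path, visited, cost + step)
                path.pop()
                visited.remove(nxt)

    extend([0, first_stop], {0, first_stop}, dist[0][first_stop])
    return best_cost, best_path


def recommend_tour(dist, workers=4):
    """Search one subtree per first stop concurrently and keep the cheapest tour."""
    n = len(dist)
    if n == 1:
        return 0, [0]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(lambda stop: search_subtree(dist, stop),
                                range(1, n)))
    return min(results, key=lambda result: result[0])


# Hypothetical symmetric travel costs between four destinations.
costs = [
    [0, 1, 4, 3],
    [1, 0, 1, 5],
    [4, 1, 0, 1],
    [3, 5, 1, 0],
]
tour_cost, tour = recommend_tour(costs)
```

    Sharing the current best across threads would prune more aggressively but needs synchronisation; per-subtree bests keep the sketch free of shared mutable state. In CPython the threads will not run the search in parallel on multiple cores, so the example shows the decomposition rather than the speed-up.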

  9. GoWeb: a semantic search engine for the life science web.

    Science.gov (United States)

    Dietze, Heiko; Schroeder, Michael

    2009-10-01

    Current search engines are keyword-based. Semantic technologies promise a next generation of semantic search engines, which will be able to answer questions. Current approaches either apply natural language processing to unstructured text or they assume the existence of structured statements over which they can reason. Here, we introduce a third approach, GoWeb, which combines classical keyword-based Web search with text-mining and ontologies to navigate large results sets and facilitate question answering. We evaluate GoWeb on three benchmarks of questions on genes and functions, on symptoms and diseases, and on proteins and diseases. The first benchmark is based on the BioCreAtivE 1 Task 2 and links 457 gene names with 1352 functions. GoWeb finds 58% of the functional GeneOntology annotations. The second benchmark is based on 26 case reports and links symptoms with diseases. GoWeb achieves 77% success rate improving an existing approach by nearly 20%. The third benchmark is based on 28 questions in the TREC genomics challenge and links proteins to diseases. GoWeb achieves a success rate of 79%. GoWeb's combination of classical Web search with text-mining and ontologies is a first step towards answering questions in the biomedical domain. GoWeb is online at: http://www.gopubmed.org/goweb.

  10. A search engine to find the best data?

    CERN Multimedia

    Katarina Anthony

    2014-01-01

    What if you could see your experiment’s results in a “page rank” style? How would your workflow change if you could collaborate with your colleagues on a single platform? What if you could search all your event data for certain specifications? All of these ideas (and more) are being explored at the LHCb experiment in collaboration with Internet giant Yandex. [Image caption: An extremely rare B0s → μμ decay candidate event observed in the LHCb detector.] As the leading search provider in Russia, with over 60% of the market share, Yandex is to the East what Google is to the West. Their collaboration with CERN began back in 2011, when Yandex co-founder Ilya Segalovich was approached by then-LHCb spokesperson Andrei Golutvin. “Just as Yandex’s search engines sift through thousands of websites to find the right page, our experimentalists apply algorithms to find the best result in our data,” says Andrei Golutvin. “Perhaps the techn...

  11. General vs health specialized search engine: a blind comparative evaluation of top search results.

    Science.gov (United States)

    Pletneva, Natalia; Ruiz de Castaneda, Rafael; Baroz, Frederic; Boyer, Celia

    2014-01-01

    This paper presents the results of a blind comparison of top ten search results retrieved by Google.ch (French) and Khresmoi for everyone, a health specialized search engine. Participants, students of the Faculty of Medicine of the University of Geneva, had to complete three tasks and select their preferred results. The majority of the participants largely preferred Google results, while Khresmoi results showed potential to compete in specific topics. The coverage of the results seems to be one of the reasons; the second is that participants do not know how to select quality and transparent health web pages. More awareness, tools and education about the matter are required for students of medicine to be able to efficiently distinguish trustworthy online health information.

  12. In Search of Search Engine Marketing Strategy Amongst SME's in Ireland

    Science.gov (United States)

    Barry, Chris; Charleton, Debbie

    Researchers have identified the Web as a searcher's first port of call for locating information. Search Engine Marketing (SEM) strategies have been noted as a key consideration when developing, maintaining and managing Websites. A study presented here of SEM practices of Irish small to medium enterprises (SMEs) reveals they plan to spend more resources on SEM in the future. Most firms utilize an informal SEM strategy, where Website optimization is perceived as most effective in attracting traffic. Respondents cite the use of ‘keywords in title and description tags’ as the most used SEM technique, followed by the use of ‘keywords throughout the whole Website’, while ‘Pay for Placement’ was the most widely used Paid Search technique. In concurrence with the literature, measuring SEM performance remains a significant challenge, with many firms unsure if they measure it effectively. An encouraging finding is that Irish SMEs adopt a positive ethical posture when undertaking SEM.

  13. Advanced Metasearch Engine Technology

    CERN Document Server

    Meng, Weiyi

    2010-01-01

    Among the search tools currently on the Web, search engines are the most well known thanks to the popularity of major search engines such as Google and Yahoo!. While extremely successful, these major search engines do have serious limitations. This book introduces large-scale metasearch engine technology, which has the potential to overcome the limitations of the major search engines. Essentially, a metasearch engine is a search system that supports unified access to multiple existing search engines by passing the queries it receives to its component search engines and aggregating the returned

  14. Current Issues of Engineering Education under Globalized Society

    Science.gov (United States)

    Kim, Kwang Sun

    A global world has recently expedited the international collaboration and network among engineering education societies including their scholars. The current issues of engineering education societies have been raised and discussed and those are various topics such as accreditation issues, current trends in engineering and technology education, government policies, innovations, program and project based learning, social sciences in engineering and technology education, university-industry joint programs, human resource development and engineering education, university linkage with K-12, role of engineering education in sustainable development, and the others. Among the variety of issues and topics, the hottest topic is relating to “innovations” of engineering education system. The innovative direction of engineering education in Korea has been reported along with that of USA, whose role has been one of major parts in innovation for the global engineering education system. The recent survey by IFEES (International Federation of Engineering Education Societies) has also been analyzed to consider the current three biggest challenges of global engineering education societies.

  15. Utilizing mixed methods research in analyzing Iranian researchers’ information search behaviour in the Web and presenting current pattern

    Directory of Open Access Journals (Sweden)

    Maryam Asadi

    2015-12-01

    Using a mixed methods research design, the current study analyzed Iranian researchers’ information searching behaviour on the Web. Then, based on the extracted concepts, a model of their information searching behaviour was derived. Forty-four participants, including academic staff from universities and research centers, were recruited for this study by purposive sampling. Data were gathered from a questionnaire of ten questions and from semi-structured interviews. Each participant’s memos were analyzed using grounded theory methods adapted from Strauss & Corbin (1998). Results showed that the subjects’ main objectives in using the Web were doing research, writing a paper, studying, doing assignments, downloading files and acquiring public information. The most important ways of learning how to search and retrieve information were trial and error and getting help from friends. Information resources are identified through searching in information resources (e.g. search engines, references in papers, and online databases…), communication facilities and tools (e.g. contact with colleagues, seminars and workshops, social networking...), and information services (e.g. RSS, alerting, and SDI). Findings also indicated that searching with search engines, reviewing references, searching in online databases, contacting colleagues and studying the latest issue of electronic journals were the most important for searching. The most important strategies were using search engines and scientific tools such as Google Scholar. In addition, the simple (quick) search method was the most common among subjects. Topic, keywords and paper title were the most important elements for retrieving information. Analysis of the interviews showed that there were nine stages in researchers’ information searching behaviour: topic selection, initiating search, formulating search query, information retrieval, access to information

  16. EIIS: An Educational Information Intelligent Search Engine Supported by Semantic Services

    Science.gov (United States)

    Huang, Chang-Qin; Duan, Ru-Lin; Tang, Yong; Zhu, Zhi-Ting; Yan, Yong-Jian; Guo, Yu-Qing

    2011-01-01

    The semantic web brings a new opportunity for efficient information organization and search. To meet the special requirements of the educational field, this paper proposes an intelligent search engine enabled by educational semantic support service, where three kinds of searches are integrated into Educational Information Intelligent Search (EIIS)…

  17. MuZeeker - Adapting a music search engine for mobile phones

    DEFF Research Database (Denmark)

    Larsen, Jakob Eg; Halling, Søren Christian; Sigurdsson, Magnus Kristinn

    2010-01-01

    We describe MuZeeker, a search engine with domain knowledge based on Wikipedia. MuZeeker enables the user to refine a search in multiple steps by means of category selection. In the present version we focus on multimedia search related to music and we present two prototype search applications (web...

  18. A Survey on the Performance Evaluation of Various Meta Search Engines

    Directory of Open Access Journals (Sweden)

    K. Srinivas

    2011-05-01

    Though a Search Engine (SE) helps in retrieving the information the user requires, a Meta Search Engine (MSE), on the other hand, uses new methodologies or fusion schemes for information retrieval from the Web and helps the user to collect more relevant documents from the Web. This paper presents a survey of various Meta Search Engines and the various parameters on which the efficiency of an MSE depends.

  19. Creative Engineering Based Education with Autonomous Robots Considering Job Search Support

    Science.gov (United States)

    Takezawa, Satoshi; Nagamatsu, Masao; Takashima, Akihiko; Nakamura, Kaeko; Ohtake, Hideo; Yoshida, Kanou

    The Robotics Course in our Mechanical Systems Engineering Department offers “Robotics Exercise Lessons” as one of its Problem-Solution Based Specialized Subjects. This is intended to motivate students' learning, to help them acquire fundamental knowledge and skills in mechanical engineering, and to improve their understanding of Robotics Basic Theory. Our current curriculum was established to accomplish this objective based on two pieces of research in 2005: an evaluation questionnaire on the education of our Mechanical Systems Engineering Department for graduates and a survey on the kind of human resources which companies are seeking and their expectations for our department. This paper reports the academic results and reflections of job search support in recent years as inherited and developed from the previous curriculum.

  20. White Hat Search Engine Optimization (SEO): Structured Web Data for Libraries

    Directory of Open Access Journals (Sweden)

    Dan Scott

    2015-06-01

    “White hat” search engine optimization refers to the practice of publishing web pages that are useful to humans, while enabling search engines and web applications to better understand the structure and content of your website. This article teaches you to add structured data to your website so that search engines can more easily connect patrons to your library locations, hours, and contact information. A web page for a branch of the Greater Sudbury Public Library retrieved in January 2015 is used as the basis for examples that progressively enhance the page with structured data. Finally, some of the advantages structured data enables beyond search engine optimization are explored.
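
    To illustrate the kind of structured data the abstract above refers to, the following sketch builds a schema.org description of a library branch and wraps it for embedding in a web page. It is an assumption-laden example: the abstract does not say which syntax the article uses, so JSON-LD is chosen here for brevity, and the branch name, address, telephone number and opening hours are all hypothetical.

```python
import json


def library_jsonld(name, street, city, region, telephone, opening_hours):
    """Build a schema.org Library description as a plain dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "Library",
        "name": name,
        "address": {
            "@type": "PostalAddress",
            "streetAddress": street,
            "addressLocality": city,
            "addressRegion": region,
        },
        "telephone": telephone,
        "openingHours": opening_hours,
    }


def as_script_tag(data):
    """Serialise the description into a script element for a page's HTML."""
    body = json.dumps(data, indent=2)
    return '<script type="application/ld+json">\n' + body + "\n</script>"


# Hypothetical branch details, for illustration only.
branch = library_jsonld(
    name="Example Branch Library",
    street="1 Example Street",
    city="Exampleville",
    region="ON",
    telephone="+1-555-0100",
    opening_hours=["Mo-Fr 09:00-17:00", "Sa 10:00-14:00"],
)
snippet = as_script_tag(branch)
```

    The same facts (location, hours, contact details) stay visible to patrons in the page body; the structured copy only makes them unambiguous to search engines and other web applications.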

  1. Computing Semantic Similarity Measure Between Words Using Web Search Engine

    Directory of Open Access Journals (Sweden)

    Pushpa C N

    2013-05-01

    Semantic similarity measures between words play an important role in information retrieval, natural language processing and in various tasks on the web. In this paper, we have proposed a Modified Pattern Extraction Algorithm to compute the supervised semantic similarity measure between words by combining both the page count method and the web snippets method. Four association measures are used to find semantic similarity between words in the page count method using web search engines. We use a Sequential Minimal Optimization (SMO) support vector machine (SVM) to find the optimal combination of page-count-based similarity scores and top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous word-pairs and non-synonymous word-pairs. The proposed Modified Pattern Extraction Algorithm outperforms by 89.8 percent of correlation value.
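
    The page count method mentioned in the abstract above can be illustrated with co-occurrence measures computed from search hit counts. The abstract does not name its four association measures; the sketch below assumes the Jaccard, overlap, Dice and pointwise mutual information forms that are common in page-count work. The function names, the hit counts and the index size are hypothetical.

```python
import math


def web_jaccard(p, q, pq):
    """Hits for both words divided by hits for either word."""
    return pq / (p + q - pq) if pq else 0.0


def web_overlap(p, q, pq):
    """Hits for both words divided by hits for the rarer word."""
    return pq / min(p, q) if pq else 0.0


def web_dice(p, q, pq):
    """Twice the joint hits divided by the sum of individual hits."""
    return 2 * pq / (p + q) if pq else 0.0


def web_pmi(p, q, pq, n):
    """Pointwise mutual information, with n the assumed number of indexed pages."""
    if pq == 0:
        return 0.0
    return math.log2((pq / n) / ((p / n) * (q / n)))


# Hypothetical page counts: word P, word Q, the query "P AND Q", and index size.
hits_p, hits_q, hits_pq, index_size = 1000, 500, 250, 10**6
scores = {
    "jaccard": web_jaccard(hits_p, hits_q, hits_pq),
    "overlap": web_overlap(hits_p, hits_q, hits_pq),
    "dice": web_dice(hits_p, hits_q, hits_pq),
    "pmi": web_pmi(hits_p, hits_q, hits_pq, index_size),
}
```

    In a supervised setup such as the one the abstract describes, scores like these would become features alongside snippet-pattern features for the classifier.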

  2. Search for Flavour Changing Neutral Currents in single top events

    CERN Document Server

    CMS Collaboration

    2013-01-01

    A study of top-quark anomalous couplings is performed through the search for a single top-quark produced in association with a $Z$ boson. The event selection requires the presence of three isolated leptons, electrons or muons, and of at least one jet. The signal extraction is done using kinematic variables and information related to b-tagging, combined using a Boosted Decision Tree. The search is performed in a data sample corresponding to about 5 fb$^{-1}$ of proton-proton collisions at $\sqrt{s}=7$ TeV recorded with the CMS detector. No evidence of flavor-changing neutral currents is observed and upper limits at 95\% confidence level are determined. The corresponding upper limits on the coupling strengths of an effective model are found to be $\kappa_{gut}/\Lambda < 0.10$ TeV$^{-1}$, $\kappa_{gct}/\Lambda < 0.35$ TeV$^{-1}$, $\kappa_{Zut}/\Lambda < 0.45$ TeV$^{-1}$ and $\kappa_{Zct}/\Lambda < 2.27$ TeV$^{-1}$, where $\Lambda$ is the expected scale at which new physics could appear. The equivalen...

  3. Search for flavor-changing-neutral-current D meson decays

    CERN Document Server

    Abazov, V; Abolins, M; Acharya, B S; Adams, M; Adams, T; Aguiló, E; Ahn, S H; Ahsan, M; Alexeev, G D; Alkhazov, G; Alton, A; Alverson, G; Alves, G A; Anastasoaie, M; Ancu, L S; Andeen, T; Anderson, S; Andrieu, B; Anzelc, M S; Arnoud, Y; Arov, M; Arthaud, M; Askew, A; Åsman, B; Assis-Jesus, A C S; Atramentov, O; Autermann, C; Avila, C; Ay, C; Badaud, F; Baden, A; Bagby, L; Baldin, B; Bandurin, D V; Banerjee, S; Banerjee, P; Barberis, E; Barfuss, A F; Bargassa, P; Baringer, P; Barreto, J; Bartlett, J F; Bassler, U; Bauer, D; Beale, S; Bean, A; Begalli, M; Begel, M; Belanger-Champagne, C; Bellantoni, L; Bellavance, A; Benítez, J A; Beri, S B; Bernardi, G; Bernhard, R; Berntzon, L; Bertram, I; Besançon, M; Beuselinck, R; Bezzubov, V A; Bhat, P C; Bhatnagar, V; Biscarat, C; Blazey, G; Blekman, F; Blessing, S; Bloch, D; Bloom, K; Böhnlein, A; Boline, D; Bolton, T A; Borissov, G; Bos, K; Bose, T; Brandt, A; Brock, R; Brooijmans, G; Bross, A; Brown, D; Buchanan, N J; Buchholz, D; Bühler, M; Büscher, V; Burdin, S; Burke, S; Burnett, T H; Buszello, C P; Butler, J M; Calfayan, P; Calvet, S; Cammin, J; Caron, S; Carvalho, W; Casey, B C K; Cason, N M; Castilla-Valdez, H; Chakrabarti, S; Chakraborty, D; Chan, K M; Chan, K; Chandra, A; Charles, F; Cheu, E; Chevallier, F; Cho, D K; Choi, S; Choudhary, B; Christofek, L; Christoudias, T; Cihangir, S; Claes, D; Clement, B; Coadou, Y; Cooke, M; Cooper, W E; Corcoran, M; Couderc, F; Cousinou, M C; Crepe-Renaudin, S; Cutts, D; Cwiok, M; Da Motta, H; Das, A; Davies, G; De, K; De Jong, S J; de Jong, P; De La Cruz-Burelo, E; De Oliveira Martins, C; Degenhardt, J D; Déliot, F; Demarteau, M; Demina, R; Denisov, D; Denisov, S P; Desai, S; Diehl, H T; Diesburg, M; Dominguez, A; Dong, H; Dudko, L V; Duflot, L; Dugad, S R; Duggan, D; Duperrin, A; Dyer, J; Dyshkant, A; Eads, M; Edmunds, D; Ellison, J; Elvira, V D; Enari, Y; Eno, S; Ermolov, P; Evans, H; Evdokimov, A; Evdokimov, V N; Ferapontov, A V; Ferbel, T; Fiedler, F; Filthaut, F; Fisher, 
W; Fisk, H E; Ford, M; Fortner, M; Fox, H; Fu, S; Fuess, S; Gadfort, T; Galea, C F; Gallas, E; Galyaev, E; García, C; García-Bellido, A; Gavrilov, V; Gay, P; Geist, W; Gelé, D; Gerber, C E; Gershtein, Yu; Gillberg, D; Ginther, G; Gollub, N; Gómez, B; Goussiou, A; Grannis, P D; Greenlee, H; Greenwood, Z D; Gregores, E M; Grenier, G; Gris, P; Grivaz, J F; Grohsjean, A; Grünendahl, S; Grünewald, M W; Guo, J; Guo, F; Gutíerrez, P; Gutíerrez, G; Haas, A; Hadley, N J; Haefner, P; Hagopian, S; Haley, J; Hall, I; Hall, R E; Han, L; Hanagaki, K; Hansson, P; Harder, K; Harel, A; Harrington, R; Hauptman, J M; Hauser, R; Hays, J; Hebbeker, T; Hedin, D; Hegeman, J G; Heinmiller, J M; Heinson, A P; Heintz, U; Hensel, C; Herner, K; Hesketh, G; Hildreth, M D; Hirosky, R; Hobbs, J D; Hoeneisen, B; Hoeth, H; Hohlfeld, M; Hong, S J; Hooper, R; Hossain, S; Houben, P; Hu, Y; Hubacek, Z; Hynek, V; Iashvili, I; Illingworth, R; Ito, A S; Jabeen, S; Jaffré, M; Jain, S; Jakobs, K; Jarvis, C; Jesik, R; Johns, K; Johnson, C; Johnson, M; Jonckheere, A; Jonsson, P; Juste, A; Käfer, D; Kahn, S; Kajfasz, E; Kalinin, A M; Kalk, J R; Kalk, J M; Kappler, S; Karmanov, D; Kasper, J; Kasper, P; Katsanos, I; Kau, D; Kaur, R; Kaushik, V; Kehoe, R; Kermiche, S; Khalatyan, N; Khanov, A; Kharchilava, A; Kharzheev, Yu M; Khatidze, D; Kim, H; Kim, T J; Kirby, M H; Kirsch, M; Klima, B; Kohli, J M; Konrath, J P; Kopal, M; Korablev, V M; Kozelov, A V; Krop, D; Kryemadhi, A; Kühl, T; Kumar, A; Kunori, S; Kupco, A; Kurca, T; Kvita, J; Lacroix, F; Lam, D; Lammers, S; Landsberg, G; Lazoflores, J; Lebrun, P; Lee, W M; Leflat, A; Lehner, F; Lellouch, J; Lévêque, J; Lewis, P; Li, J; Li, Q Z; Li, L; Lietti, S M; Lima, J G R; Lincoln, D; Linnemann, J; Lipaev, V V; Lipton, R; Liu, Y; Liu, Z; Lobo, L; Lobodenko, A; Lokajícek, M; Lounis, A; Love, P; Lubatti, H J; Lyon, A L; Maciel, A K A; Mackin, D; Madaras, R J; Mättig, P; Magass, C; Magerkurth, A; Makovec, N; Mal, P K; Malbouisson, H B; Malik, S; Malyshev, V L; Mao, H S; 
Maravin, Y; Martin, B; McCarthy, R; Melnitchouk, A; Mendes, A; Mendoza, L; Mercadante, P G; Merkin, M; Merritt, K W; Meyer, J; Meyer, A; Michaut, M; Millet, T; Mitrevski, J; Molina, J; Mommsen, R K; Mondal, N K; Moore, R W; Moulik, T; Muanza, G S; Mulders, M; Mulhearn, M; Mundal, O; Mundim, L; Nagy, E; Naimuddin, M; Narain, M; Naumann, N A; Neal, H A; Negret, J P; Neustroev, P; Nilsen, H; Nomerotski, A; Novaes, S F; Nunnemann, T; O'Dell, V; O'Neil, D C; Obrant, G; Ochando, C; Onoprienko, D; Oshima, N; Osta, J; Otec, R; Oteroy-Garzon, G J; Owen, M; Padley, P; Pangilinan, M; Parashar, N; Park, S J; Park, S K; Parsons, J; Partridge, R; Parua, N; Patwa, A; Pawloski, G; Penning, B; Peters, K; Peters, Y; Petroff, P; Petteni, M; Piegaia, R; Piper, J; Pleier, M A; Podesta-Lerma, P L M; Podstavkov, V M; Pogorelov, Y; Pol, M E; Polozov, P; Pompo, A; Pope, B G; Popov, A V; Potter, C; Prado da Silva, W L; Prosper, H B; Protopopescu, S; Qian, J; Quadt, A; Quinn, B; Rakitine, A; Rangel, M S; Ranjan, K; Ratoff, P N; Renkel, P; Reucroft, S; Rich, P; Rijssenbeek, M; Ripp-Baudot, I; Rizatdinova, F; Robinson, S; Rodrigues, R F; Royon, C; Rubinov, P; Ruchti, R; Safronov, G; Sajot, G; Sánchez-Hernández, A; Sanders, M P; Santoro, A; Savage, G; Sawyer, L; Scanlon, T; Schaile, A D; Schamberger, R D; Scheglov, Y; Schellman, H; Schieferdecker, P; Schliephake, T; Schwanenberger, C; Schwartzman, A; Schwienhorst, R; Sekaric, J; Sen-Gupta, S; Severini, H; Shabalina, E; Shamim, M; Shary, V; Shchukin, A A; Shivpuri, R K; Shpakov, D; Siccardi, V; Simák, V; Sirotenko, V; Skubic, P; Slattery, P; Smirnov, D; Snow, J; Snow, G R; Snyder, S; Söldner-Rembold, S; Sonnenschein, L; Sopczak, A; Sosebee, M; Soustruznik, K; Souza, M; Spurlock, B; Stark, J; Steele, J; Stolin, V; Stone, A; Stoyanova, D A; Strandberg, J; Strandberg, S; Strang, M A; Strauss, M; Strauss, E; Ströhmer, R; Strom, D; Stutte, L; Sumowidagdo, S; Svoisky, P; Sznajder, A; Talby, M; Tamburello, P; Tanasijczuk, A; Taylor, W; Telford, P; 
Temple, J; Tiller, B; Tissandier, F; Titov, M; Tokmenin, V V; Toole, T; Torchiani, I; Trefzger, T; Tsybychev, D; Tuchming, B; Tully, C; Tuts, P M; Unalan, R; Uvarov, S; Uvarov, L; Uzunyan, S; Vachon, B; vanden Berg, P J; van Eijk, B; Van Kooten, R; Van Leeuwen, W M; Varelas, N; Varnes, E W; Vasilyev, I A; Vaupel, M; Verdier, P; Vertogradov, L S; Verzocchi, M; Villeneuve-Séguier, F; Vint, P; Vokac, P; Von Törne, E; Voutilainen, M; Vreeswijk, M; Wagner, R; Wahl, H D; Wang, L; SWang, M H L; Warchol, J; Watts, G; Wayne, M; Weber, M; Weber, G; Wenger, A; Wermes, N; Wetstein, M; White, A; Wicke, D; Wilson, G W; Wimpenny, S J; Wobisch, M; Wood, D R; Wyatt, T R; Xie, Y; Yacoob, S; Yamada, R; Yan, M; Yasuda, T; Yatsunenko, Y A; Yip, K; Yoo, H D; Youn, S W; Yu, J; Zatserklyaniy, A; Zeitnitz, C; Zhang, D; Zhao, T; Zhou, B; Zhu, J; Zielinski, M; Zieminska, D; Zieminski, A; Zivkovic, L; Zutshi, V; Zverev, E G

    2007-01-01

    We study the flavor-changing-neutral-current process c to u mu+ mu- using 1.3 fb^-1 of p p bar collisions at sqrt(s) = 1.96 TeV recorded by the D0 detector operating at the Fermilab Tevatron Collider. We see clear indications of the Ds+ and D+ to phi pi+ to mu+ mu- pi+ final states with significance greater than four standard deviations above background for the D+ state. We search for the continuum decay of D+ to pi+mu+mu- in the dimuon invariant mass spectrum away from the phi resonance. We see no evidence of signal above background and set a limit of B(D+ to pi+mu+mu-) < 3.9 x 10^-6 at the 90% C.L. This limit places the most stringent constraint on new phenomena in the c to u mu+ mu- transition.

  4. Finding current evidence: search strategies and common databases.

    Science.gov (United States)

    Gillespie, Lesley Diane; Gillespie, William John

    2003-08-01

    With more than 100 orthopaedic, sports medicine, or hand surgery journals indexed in MEDLINE, it is no longer possible to keep abreast of developments in orthopaedic surgery by reading a few journals each month. Electronic resources are easier to search and more current than most print sources. We provide a practical approach to finding useful information to guide orthopaedic practice. We focus first on where to find the information by providing details about many useful databases and web links. Sources for identifying guidelines, systematic reviews, and randomized controlled trials are identified. The second section discusses how to find the information, from the first stage of formulating a question and identifying the concepts of interest, through to writing a simple strategy. Sources for additional self-directed learning are provided.

  5. Predicting user click behaviour in search engine advertisements

    Science.gov (United States)

    Daryaie Zanjani, Mohammad; Khadivi, Shahram

    2015-10-01

    According to the specific requirements and interests of users, search engines select and display advertisements that match user needs and have a higher probability of attracting users' attention based on their previous search history. New objects such as a user, advertisement or query cause a deterioration of precision in targeted advertising due to their lack of history. This article surveys this challenge. In the case of new objects, we first extract observed objects similar to the new object and then use their history as the history of the new object. Similarity between objects is measured based on correlation, which is a relation between user and advertisement when the advertisement is displayed to the user. This method is used for all objects, so it has helped us to accurately select relevant advertisements for users' queries. In our proposed model, we assume that similar users behave in a similar manner. We find that users with few queries are similar to new users. We will show that correlation between users and advertisements' keywords is high. Thus, users who pay attention to advertisements' keywords click similar advertisements. In addition, users who pay attention to specific brand names might have similar behaviours too.

  6. A Framework for Hierarchical Clustering Based Indexing in Search Engines

    Directory of Open Access Journals (Sweden)

    Parul Gupta

    2011-01-01

    Granting efficient and fast access to the index is a key issue for the performance of Web Search Engines (WSEs). In order to enhance memory utilization and favor fast query resolution, WSEs use Inverted File (IF) indexes that consist of an array of posting lists, where each posting list is associated with a term and contains the term as well as the identifiers of the documents containing the term. Since the document identifiers are stored in sorted order, they can be stored as the differences between successive documents so as to reduce the size of the index. This paper describes a clustering algorithm that aims at partitioning the set of documents into ordered clusters so that the documents within the same cluster are similar and are assigned closer document identifiers. Thus the average value of the differences between successive documents will be minimized and hence storage space will be saved. The paper further presents an extension of this clustering algorithm to hierarchical clustering, in which similar clusters are clubbed to form a mega cluster and similar mega clusters are then combined to form a super cluster. Thus the paper describes the different levels of clustering, which optimize the search process by directing the search along a specific path from higher levels of clustering to the lower levels, i.e. from super clusters to mega clusters, then to clusters and finally to the individual documents, so that the user gets the best possible matching results in the minimum possible time.
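
The gap-based storage of document identifiers described in this abstract can be illustrated with a minimal sketch. The function names (`to_gaps`, `from_gaps`, `mean_gap`) and the sample posting lists are hypothetical and are not taken from the paper; the sketch only shows why assigning nearby identifiers to similar documents tends to produce smaller gaps.

```python
from typing import List


def to_gaps(doc_ids: List[int]) -> List[int]:
    """Encode a strictly increasing posting list as the first ID followed by gaps."""
    if not doc_ids:
        return []
    gaps = [doc_ids[0]]
    for prev, cur in zip(doc_ids, doc_ids[1:]):
        if cur <= prev:
            raise ValueError("posting list must be strictly increasing")
        gaps.append(cur - prev)
    return gaps


def from_gaps(gaps: List[int]) -> List[int]:
    """Decode a gap list back into absolute document identifiers."""
    doc_ids = []
    total = 0
    for gap in gaps:
        total += gap
        doc_ids.append(total)
    return doc_ids


def mean_gap(doc_ids: List[int]) -> float:
    """Average difference between successive identifiers (first ID excluded)."""
    gaps = to_gaps(doc_ids)[1:]
    return sum(gaps) / len(gaps) if gaps else 0.0


# Hypothetical posting lists for one term: identifiers assigned without
# regard to similarity versus identifiers assigned cluster by cluster.
scattered = [3, 250, 1021, 4800]
clustered = [100, 101, 103, 104]

print(to_gaps(scattered), mean_gap(scattered))
print(to_gaps(clustered), mean_gap(clustered))
```

Smaller gaps need fewer bits under a variable-length integer code, which is where the storage saving would come from; the choice of code is outside this sketch.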

  7. Lactose crystallization: current issues and promising Engineering solutions

    OpenAIRE

    Rjabova, A.; Kirsanov, V.; Strizhko, M.; Bredikhin, A.; Semipyatnyi, V.; Chervetsov, V.; Galstyan, A.

    2013-01-01

    Current technological aspects of lactose crystallization are considered. A promising lactose crystallization method involving simulation seed crystals is reported. Advanced engineering solutions for continuous crystallization using spraying in vacuo and scraped-surface heat exchangers are presented.

  8. The Gaze of the Perfect Search Engine: Google as an Infrastructure of Dataveillance

    Science.gov (United States)

    Zimmer, M.

    Web search engines have emerged as a ubiquitous and vital tool for the successful navigation of the growing online informational sphere. The goal of the world's largest search engine, Google, is to "organize the world's information and make it universally accessible and useful" and to create the "perfect search engine" that provides only intuitive, personalized, and relevant results. While intended to enhance intellectual mobility in the online sphere, this chapter reveals that the quest for the perfect search engine requires the widespread monitoring and aggregation of users' online personal and intellectual activities, threatening the values the perfect search engines were designed to sustain. It argues that these search-based infrastructures of dataveillance contribute to a rapidly emerging "soft cage" of everyday digital surveillance, where they, like other dataveillance technologies before them, contribute to the curtailing of individual freedom, affect users' sense of self, and present issues of deep discrimination and social justice.

  9. Engineering photorespiration: current state and future possibilities.

    Science.gov (United States)

    Peterhansel, C; Krause, K; Braun, H-P; Espie, G S; Fernie, A R; Hanson, D T; Keech, O; Maurino, V G; Mielewczik, M; Sage, R F

    2013-07-01

    Reduction of flux through photorespiration has been viewed as a major way to improve crop carbon fixation and yield since the energy-consuming reactions associated with this pathway were discovered. This view has been supported by the biomass increases observed in model species that expressed artificial bypass reactions to photorespiration. Here, we present an overview of the major current attempts to reduce photorespiratory losses in crop species and provide suggestions for future research priorities.

  10. Applications of Tissue Engineering in Joint Arthroplasty: Current Concepts Update.

    Science.gov (United States)

    Zeineddine, Hussein A; Frush, Todd J; Saleh, Zeina M; El-Othmani, Mouhanad M; Saleh, Khaled J

    2017-07-01

    Research in tissue engineering has undoubtedly achieved significant milestones in recent years. Although it is being applied in several disciplines, tissue engineering's application is particularly advanced in orthopedic surgery and in degenerative joint diseases. The literature is full of remarkable findings and trials using tissue engineering in articular cartilage disease. With the vast and expanding knowledge, and with the variety of techniques available at hand, the authors aimed to review the current concepts and advances in the use of cell sources in articular cartilage tissue engineering. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. [Biomedical information on the internet using search engines. A one-year trial].

    Science.gov (United States)

    Corrao, Salvatore; Leone, Francesco; Arnone, Sabrina

    2004-01-01

    The internet is a communication medium and content distributor that provides information in the general sense, but it could also be of great utility for the search and retrieval of biomedical information. Search engines are a valuable means of rapidly finding information on the net. However, we do not know whether general search engines and meta-search engines are reliable for finding useful and validated biomedical information. The aim of our study was to verify the reproducibility of a keyword search (pediatric or evidence) using 9 international search engines and 1 meta-search engine at baseline and after a one-year period. We analysed the first 20 citations returned by each search. We evaluated the formal quality of Web sites and their domain extensions. Moreover, we compared the output of each search at the start of the study and after one year, and we took the number of Web sites cited again as a criterion of reliability. We found some interesting results that are reported throughout the text. Our findings point to an extreme dynamism of information on the Web and, for this reason, we advise great caution when using search and meta-search engines as tools for searching for and retrieving reliable biomedical information. On the other hand, some search and meta-search engines could be very useful as a first step for better defining a search and, moreover, for finding institutional Web sites. This paper supports a more conscious approach to the universe of biomedical information on the internet.

  12. Discrimination of Inrush Currents from Faults Current in Power Transformers using Gravitational Search Algorithm (GSA)

    Directory of Open Access Journals (Sweden)

    Mohamad Kazem Daryabari

    2011-01-01

    The magnetizing inrush current phenomenon is a large transient condition that occurs when a transformer is energized. The inrush current magnitude may be as high as ten times the transformer rated current, which causes mal-operation of protection systems. Indeed, the similarity between the signatures of inrush current and internal fault conditions causes this failure. So, for safe running of a transformer, it is necessary to distinguish inrush current from fault currents. In this project, an Artificial Neural Network (ANN) trained by two different swarm-based algorithms, Gravitational Search Algorithm (GSA) and Particle Swarm Optimization (PSO), has been used to discriminate inrush current from fault currents in power transformers. GSA works based on gravity laws and, in contrast to other swarm-based algorithms, its particles have identity; PSO is based on the behavior of bird flocking. The proposed approach has two general stages: in the first step, data obtained from simulation are processed and applied to the ANN, and in the second step, the ANN is trained by GSA and PSO using the training data. The proposed method has been compared with one of the common training approaches, Back Propagation (BP), and the results show that the proposed method is quick and can perform the discrimination very accurately.

  13. Information access in the art history domain. Evaluating a federated search engine for Rembrandt research

    NARCIS (Netherlands)

    Verberne, S.; Boves, L.W.J.; Bosch, A.P.J. van den

    2016-01-01

    The art history domain is an interesting case for search engines tailored to the digital humanities, because the domain involves different types of sources (primary and secondary; text and images). One example of an art history search engine is RemBench, which provides access to information in four

  14. Design of personalized search engine based on user-webpage dynamic model

    Science.gov (United States)

    Li, Jihan; Li, Shanglin; Zhu, Yingke; Xiao, Bo

    2013-12-01

    Personalized search engine focuses on establishing a user-webpage dynamic model. In this model, users' personalized factors are introduced so that the search engine is better able to provide the user with targeted feedback. This paper constructs user and webpage dynamic vector tables, introduces singular value decomposition analysis in the processes of topic categorization, and extends the traditional PageRank algorithm.
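
For readers unfamiliar with the traditional PageRank algorithm that this abstract says it extends, a minimal power-iteration sketch follows. It covers only the textbook algorithm; the personalized extension and the user-webpage vector tables are not described in the abstract and are not reproduced here. The function name, the damping value of 0.85, the iteration count, and the toy link graph are all assumptions made for illustration.

```python
from typing import Dict, List


def pagerank(links: Dict[str, List[str]], damping: float = 0.85,
             iterations: int = 50) -> Dict[str, float]:
    """Textbook PageRank by power iteration over an adjacency-list graph."""
    # Collect every page that appears as a source or as a link target.
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without outgoing links is spread evenly.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for src in pages:
            targets = links.get(src, [])
            if targets:
                share = damping * rank[src] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical four-page web graph.
graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(graph)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

A personalized variant would typically replace the uniform `(1.0 - damping) / n` term with a user-specific distribution, but whether the paper does so is not stated in the abstract.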

  15. A Strategic Analysis of Search Engine Advertising in Web based-commerce

    Directory of Open Access Journals (Sweden)

    Ela Kumar

    2007-08-01

    The endeavor of this paper is to explore the role of search engines in the online business industry. The paper discusses search engine advertising programs and provides insight into the revenue generated online via search engines. It explores the growth of the online business industry in India and emphasizes the role of search engines as the major advertising vehicle. A case study on the revolution of the Indian advertising industry has been conducted and its impact on online revenue evaluated. Search engine advertising strategies are discussed in detail and the impact of search engines on the Indian advertising industry is analyzed. The paper also provides an analytical and competitive study of online advertising strategies against traditional advertising tools to evaluate their efficiency with respect to important advertising parameters. The paper concludes with a brief discussion of the malpractices that adversely affect the efficiency of the search engine advertising model and highlights the key hurdles the search engine industry is facing in the Indian business scenario.

  16. Index Compression and Efficient Query Processing in Large Web Search Engines

    Science.gov (United States)

    Ding, Shuai

    2013-01-01

    The inverted index is the main data structure used by all the major search engines. Search engines build an inverted index on their collection to speed up query processing. As the size of the web grows, the length of the inverted list structures, which can easily grow to hundreds of MBs or even GBs for common terms (roughly linear in the size of…

  18. MOMFER: A Search Engine of Thompson's Motif-Index of Folk Literature

    NARCIS (Netherlands)

    Karsdorp, F.B.; van der Meulen, Marten; Meder, Theo; van den Bosch, Antal

    2015-01-01

    More than fifty years after the first edition of Thompson's seminal Motif-Index of Folk Literature, we present an online search engine tailored to fully disclose the index digitally. This search engine, called MOMFER, greatly enhances the searchability of the Motif-Index and provides exciting new way

  19. Taking It to the Top: A Lesson in Search Engine Optimization

    Science.gov (United States)

    Frydenberg, Mark; Miko, John S.

    2011-01-01

    Search engine optimization (SEO), the promoting of a Web site so it achieves optimal position with a search engine's rankings, is an important strategy for organizations and individuals in order to promote their brands online. Techniques for achieving SEO are relevant to students of marketing, computing, media arts, and other disciplines, and many…

  20. Evaluation of proteomic search engines for the analysis of histone modifications.

    Science.gov (United States)

    Yuan, Zuo-Fei; Lin, Shu; Molden, Rosalynn C; Garcia, Benjamin A

    2014-10-03

    Identification of histone post-translational modifications (PTMs) is challenging for proteomics search engines. Including many histone PTMs in one search increases the number of candidate peptides dramatically, leading to low search speed and fewer identified spectra. To evaluate database search engines on identifying histone PTMs, we present a method in which one kind of modification is searched each time, for example, unmodified, individually modified, and multimodified, each search result is filtered with false discovery rate less than 1%, and the identifications of multiple search engines are combined to obtain confident results. We apply this method for eight search engines on histone data sets. We find that two search engines, pFind and Mascot, identify most of the confident results at a reasonable speed, so we recommend using them to identify histone modifications. During the evaluation, we also find some important aspects for the analysis of histone modifications. Our evaluation of different search engines on identifying histone modifications will hopefully help those who are hoping to enter the histone proteomics field. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium with the data set identifier PXD001118.

  1. Evaluating Open-Source Full-Text Search Engines for Matching ICD-10 Codes.

    Science.gov (United States)

    Jurcău, Daniel-Alexandru; Stoicu-Tivadar, Vasile

    2016-01-01

    This research presents the results of evaluating multiple free, open-source engines on matching ICD-10 diagnostic codes via full-text searches. The study investigates what it takes to get an accurate match when searching for a specific diagnostic code. For each code the evaluation starts by extracting the words that make up its text and continues with building full-text search queries from the combinations of these words. The queries are then run against all the ICD-10 codes until a query returns the code in question as the match with the highest relative score. This method identifies the minimum number of words that must be provided in order for the search engines to choose the desired entry. The engines analyzed include a popular Java-based full-text search engine, a lightweight engine written in JavaScript which can even execute in the user's browser, and two popular open-source relational database management systems.

  2. An assessment of the visibility of MeSH-indexed medical web catalogs through search engines.

    Science.gov (United States)

    Zweigenbaum, P; Darmoni, S J; Grabar, N; Douyère, M; Benichou, J

    2002-01-01

    Manually indexed Internet health catalogs such as CliniWeb or CISMeF provide resources for retrieving high-quality health information. Users of these quality-controlled subject gateways are most often referred to them by general search engines such as Google, AltaVista, etc. This raises several questions, among which the following: what is the relative visibility of medical Internet catalogs through search engines? This study addresses this issue by measuring and comparing the visibility of six major, MeSH-indexed health catalogs through four different search engines (AltaVista, Google, Lycos, Northern Light) in two languages (English and French). Over half a million queries were sent to the search engines; for most of these search engines, according to our measures at the time the queries were sent, the most visible catalog for English MeSH terms was CliniWeb and the most visible one for French MeSH terms was CISMeF.

  3. Balancing Efficiency and Effectiveness for Fusion-Based Search Engines in the "Big Data" Environment

    Science.gov (United States)

    Li, Jieyu; Huang, Chunlan; Wang, Xiuhong; Wu, Shengli

    2016-01-01

    Introduction: In the big data age, we have to deal with a tremendous amount of information, which can be collected from various types of sources. For information search systems such as Web search engines or online digital libraries, the collection of documents becomes larger and larger. For some queries, an information search system needs to…

  4. GeoSearcher: Location-Based Ranking of Search Engine Results.

    Science.gov (United States)

    Watters, Carolyn; Amoudi, Ghada

    2003-01-01

    Discussion of Web queries with geospatial dimensions focuses on an algorithm that assigns location coordinates dynamically to Web sites based on the URL. Describes a prototype search system that uses the algorithm to re-rank search engine results for queries with a geospatial dimension, thus providing an alternative ranking order for search engine…

  5. Curating the Web: Building a Google Custom Search Engine for the Arts

    Science.gov (United States)

    Hennesy, Cody; Bowman, John

    2008-01-01

    Google's first foray onto the web made search simple and results relevant. With its Co-op platform, Google has taken another step toward dramatically increasing the relevancy of search results, further adapting the World Wide Web to local needs. Google Custom Search Engine, a tool on the Co-op platform, puts one in control of his or her own search…

  6. Search for flavor-changing-neutral-current D meson decays

    Energy Technology Data Exchange (ETDEWEB)

    Abazov, V.M.; Abbott, B.; Abolins, M.; Acharya, B.S.; Adams, M.; Adams, T.; Aguilo, E.; Ahn, S.H.; Ahsan, M.; Alexeev, G.D.; Alkhazov, G.; /Buenos Aires U. /Rio de Janeiro, CBPF /Rio de Janeiro State U. /Sao Paulo, IFT /Alberta U. /Simon Fraser U. /York U., Canada /McGill U. /Hefei, CUST /Andes U., Bogota /Charles U.

    2007-08-01

    We study the flavor-changing-neutral-current process $c \to u\mu^+\mu^-$ using 1.3 fb$^{-1}$ of $p\bar{p}$ collisions at $\sqrt{s} = 1.96$ TeV recorded by the D0 detector operating at the Fermilab Tevatron Collider. We see clear indications of the $D_s^+$ and $D^+ \to \phi\pi^+ \to \mu^+\mu^-\pi^+$ final states with significance greater than four standard deviations above background for the $D^+$ state. We search for the continuum decay of $D^+ \to \pi^+\mu^+\mu^-$ in the dimuon invariant mass spectrum away from the $\phi$ resonance. We see no evidence of signal above background and set a limit of $B(D^+ \to \pi^+\mu^+\mu^-) < 3.9 \times 10^{-6}$ at the 90% CL. This limit places the most stringent constraint on new phenomena in the $c \to u\mu^+\mu^-$ transition.

  7. A Full-Text-Based Search Engine for Finding Highly Matched Documents Across Multiple Categories

    Science.gov (United States)

    Nguyen, Hung D.; Steele, Gynelle C.

    2016-01-01

    This report demonstrates the full-text-based search engine that works on any Web-based mobile application. The engine has the capability to search databases across multiple categories based on a user's queries and identify the most relevant or similar. The search results presented here were found using an Android (Google Co.) mobile device; however, it is also compatible with other mobile phones.

  8. CWM Global Search—The Internet Search Engine for Chemists and Biologists

    Directory of Open Access Journals (Sweden)

    Hans-Jürgen Himmler

    2010-12-01

    CWM Global Search is a meta-search engine allowing chemists and biologists to search the major chemical and biological databases on the Internet by structure, synonyms, CAS Registry Numbers and free text. A meta-search engine is a search tool that sends user requests to several other search engines and/or databases and aggregates the results into a single list or displays them according to their source [1]. CWM Global Search is a web application that has many of the characteristics of desktop applications (also known as a Rich Internet Application, RIA), and it runs on both Windows and Macintosh platforms. The application is one of the first RIAs for scientists. The application can be started using the URL http://cwmglobalsearch.com/gsweb.

  9. A Novel Search Engine to trace Medical Information Needs using Medical Domain Ontology

    Directory of Open Access Journals (Sweden)

    M.Revati

    2011-08-01

    Information retrieval in the medical domain now accounts for a major part of web search. Nowadays most people, especially adults, browse health care and medical information at home using the internet. A Medical Information Retrieval System (MIRS) provides, through search engines, positive information to the user based on fixed questionnaires. In this paper we build a model for naïve users, who have minimal knowledge, to give feedback to the system by selecting from a list of relevant questionnaires. Along with the framework, we also built an Intelligent Medical Search Engine (IMSE) for searching medical information on the World Wide Web (WWW). The implementation setup of IMSE uses a medical ontology and questionnaires to help naïve internet users search for medical information. IMSE introduces and extends expert system technology into the search engine domain. IMSE uses several key techniques to improve its usability and search result quality.

  10. Semantic similarity measure in biomedical domain leverage web search engine.

    Science.gov (United States)

    Chen, Chi-Huang; Hsieh, Sheau-Ling; Weng, Yung-Ching; Chang, Wen-Yung; Lai, Feipei

    2010-01-01

    Semantic similarity measures play an essential role in Information Retrieval and Natural Language Processing. In this paper we propose a page-count-based semantic similarity measure and apply it in biomedical domains. Previous research in semantic web related applications has deployed various semantic similarity measures. Despite the usefulness of the measurements in those applications, measuring semantic similarity between two terms remains a challenging task. The proposed method exploits page counts returned by the Web Search Engine. We define various similarity scores for two given terms P and Q, using the page counts for querying P, Q and P AND Q. Moreover, we propose a novel approach to compute semantic similarity using lexico-syntactic patterns with page counts. These different similarity scores are integrated using support vector machines, to leverage the robustness of semantic similarity measures. Experimental results on two datasets achieve correlation coefficients of 0.798 on the dataset provided by A. Hliaoutakis, 0.705 on the dataset provided by T. Pedersen with physician scores and 0.496 on the dataset provided by T. Pedersen et al. with expert scores.
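
The abstract does not name the individual scores it defines from the page counts for P, Q and P AND Q. As an illustration only, the sketch below computes three co-occurrence scores that are commonly derived from such counts (Jaccard, Dice, and pointwise mutual information). The function names, the page counts, and the assumed index size `n_pages` are hypothetical and should not be read as the paper's actual definitions.

```python
import math


def web_jaccard(h_p: int, h_q: int, h_pq: int) -> float:
    """Jaccard coefficient from page counts for P, Q and 'P AND Q'."""
    union = h_p + h_q - h_pq
    return h_pq / union if h_pq > 0 and union > 0 else 0.0


def web_dice(h_p: int, h_q: int, h_pq: int) -> float:
    """Dice coefficient from the same three page counts."""
    total = h_p + h_q
    return 2 * h_pq / total if total > 0 else 0.0


def web_pmi(h_p: int, h_q: int, h_pq: int, n_pages: int) -> float:
    """Pointwise mutual information; n_pages is an assumed index size."""
    if min(h_p, h_q, h_pq) <= 0:
        return 0.0
    joint = h_pq / n_pages
    return math.log2(joint / ((h_p / n_pages) * (h_q / n_pages)))


# Hypothetical page counts for two terms and their conjunction.
h_p, h_q, h_pq = 1000, 500, 250
print(web_jaccard(h_p, h_q, h_pq))
print(web_dice(h_p, h_q, h_pq))
print(web_pmi(h_p, h_q, h_pq, n_pages=1_000_000))
```

Scores of this kind could then serve as input features to a classifier or regressor such as the support vector machine mentioned in the abstract; that integration step is not sketched here.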

  11. Evaluating Search Engine Relevance with Click-Based Metrics

    Science.gov (United States)

    Radlinski, Filip; Kurup, Madhu; Joachims, Thorsten

    Automatically judging the quality of retrieval functions based on observable user behavior holds promise for making retrieval evaluation faster, cheaper, and more user centered. However, the relationship between observable user behavior and retrieval quality is not yet fully understood. In this chapter, we expand upon Radlinski et al. (How does clickthrough data reflect retrieval quality, In Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), 43-52, 2008), presenting a sequence of studies investigating this relationship for an operational search engine on the arXiv.org e-print archive. We find that none of the eight absolute usage metrics we explore (including the number of clicks observed, the frequency with which users reformulate their queries, and how often result sets are abandoned) reliably reflects retrieval quality for the sample sizes we consider. However, we find that paired experiment designs adapted from sensory analysis produce accurate and reliable statements about the relative quality of two retrieval functions. In particular, we investigate two paired comparison tests that analyze clickthrough data from an interleaved presentation of ranking pairs, and find that both give accurate and consistent results. We conclude that both paired comparison tests give substantially more accurate and sensitive evaluation results than the absolute usage metrics in our domain.

  12. A search algorithm for quantum state engineering and metrology

    Science.gov (United States)

    Knott, P. A.

    2016-07-01

    In this paper we present a search algorithm that finds useful optical quantum states which can be created with current technology. We apply the algorithm to the field of quantum metrology with the goal of finding states that can measure a phase shift to a high precision. Our algorithm efficiently produces a number of novel solutions: we find experimentally ready schemes to produce states that show significant improvements over the state-of-the-art, and can measure with a precision that beats the shot noise limit by over a factor of 4. Furthermore, these states demonstrate a robustness to moderate/high photon losses, and we present a conceptually simple measurement scheme that saturates the Cramér-Rao bound.

  13. Current Density Measurements of an Annular-Geometry Ion Engine

    Science.gov (United States)

    Shastry, Rohit; Patterson, Michael J.; Herman, Daniel A.; Foster, John E.

    2012-01-01

    The concept of the annular-geometry ion engine, or AGI-Engine, has been shown to have many potential benefits when scaling electric propulsion technologies to higher power. However, the necessary asymmetric location of the discharge cathode away from thruster centerline could potentially lead to non-uniformities in the discharge not present in conventional geometry ion thrusters. In an effort to characterize the degree of this potential nonuniformity, a number of current density measurements were taken on a breadboard AGI-Engine. Fourteen button probes were used to measure the ion current density of the discharge along a perforated electrode that replaced the ion optics during conditions of simulated beam extraction. Three Faraday probes spaced apart in the vertical direction were also used in a separate test to interrogate the plume of the AGI-Engine during true beam extraction. It was determined that both the discharge and the plume of the AGI-Engine are highly uniform, with variations under most conditions limited to 10% of the average current density in the discharge and 5% of the average current density in the plume. Beam flatness parameter measured 30 mm from the ion optics ranged from 0.85 to 0.95, and overall uniformity was shown to generally increase with increasing discharge and beam currents. These measurements indicate that the plasma is highly uniform despite the asymmetric location of the discharge cathode.

  14. A unified architecture for biomedical search engines based on semantic web technologies.

    Science.gov (United States)

    Jalali, Vahid; Matash Borujerdi, Mohammad Reza

    2011-04-01

    The volume of published biomedical research has grown enormously in recent years. Many medical search engines have been designed and developed to address the ever-growing information needs of biomedical experts and curators. Significant progress has been made in utilizing the knowledge embedded in medical ontologies and controlled vocabularies to assist these engines. However, the lack of a common architecture for the ontologies used and for the overall retrieval process hampers the evaluation of different search engines, and interoperability between them, under unified conditions. In this paper, a unified architecture for medical search engines is introduced. The proposed model contains standard schemas, declared in semantic web languages, for the ontologies and documents used by search engines. Unified models for the annotation and retrieval processes are the other parts of the introduced architecture. A sample search engine is also designed and implemented based on the proposed architecture. The search engine is evaluated using two test collections, and results are reported in terms of precision vs. recall and mean average precision for the different approaches used by this search engine.
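
    The evaluation measures named in this abstract (precision, recall, and mean average precision) can be sketched as follows. This is a minimal illustration using the standard textbook definitions; the function names and the toy data are assumptions and are not taken from the paper.

```python
def precision_recall_at_k(ranked, relevant, k):
    """Precision and recall over the top-k results of one ranked list."""
    hits = sum(1 for doc in ranked[:k] if doc in relevant)
    return hits / k, hits / len(relevant)


def average_precision(ranked, relevant):
    """Mean of the precision values at each rank where a relevant doc appears."""
    hits, total = 0, 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0


def mean_average_precision(runs):
    """runs: list of (ranked list, set of relevant docs), one pair per query."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Hypothetical toy data: two queries with known relevant documents.
runs = [
    (["a", "b", "c", "d"], {"a", "c"}),
    (["x", "y"], {"y"}),
]
print(mean_average_precision(runs))
```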

  15. Review on Distributed Search Engine Model

    Institute of Scientific and Technical Information of China (English)

    钱立兵; 季振洲

    2015-01-01

    This paper reviews the model, structure, and search methods of distributed search engines, and then discusses how search engines are evaluated. The basic modules of a search engine are discussed in terms of offline processing and online processing; the speed of online query processing is the key factor in search engine performance. Under the distributed search engine model, a search engine consists of four main subsystems: a Web crawler system, an index-building system, a retrieval system, and a log analysis system. The inverted index is composed of a dictionary and an inverted file, and its postings are ordered either by increasing document ID or by decreasing term frequency (or impact) score. The paper then discusses three typical query processing strategies used by current search engines and compares the conditions to which each is suited. Finally, it reviews the two important indicators for evaluating search engines, query efficiency and the quality of results, and lists quantitative evaluation formulas for them.
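
    The inverted index structure described above, a dictionary of terms pointing to posting lists, can be sketched in a few lines. This is a single-machine illustration only; the whitespace tokenizer, the function names, and the sample documents are assumptions and do not come from the reviewed paper.

```python
from collections import Counter, defaultdict


def build_inverted_index(docs):
    """docs: {doc_id: text}. Returns {term: [(doc_id, term_frequency), ...]}
    with each posting list ordered by increasing document ID."""
    index = defaultdict(list)
    for doc_id in sorted(docs):
        for term, tf in Counter(docs[doc_id].lower().split()).items():
            index[term].append((doc_id, tf))
    return dict(index)


def conjunctive_query(index, terms):
    """Return the IDs of documents that contain every query term."""
    postings = [{doc_id for doc_id, _ in index.get(t, [])} for t in terms]
    return sorted(set.intersection(*postings)) if postings else []


def frequency_ordered(postings):
    """Reorder one posting list by decreasing term frequency."""
    return sorted(postings, key=lambda p: (-p[1], p[0]))


docs = {2: "search engine index", 1: "distributed search search"}
index = build_inverted_index(docs)
print(index["search"])
print(conjunctive_query(index, ["search", "index"]))
```

    Document-ID ordering suits list intersection, while frequency ordering lets a query processor stop early once the remaining postings can no longer change the top results.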

  16. Stirling engines. (Latest citations from the Aerospace database). Published Search

    Energy Technology Data Exchange (ETDEWEB)

    1993-09-01

    The bibliography contains citations concerning fuel consumption, engine design and testing, computerized simulation, and lubrication systems relative to the Stirling cycle engine. Solar energy conversion research, thermodynamic efficiency, economics, and utilization for power generation and automobile engines are included. Materials used in Stirling engines are briefly evaluated. (Contains 250 citations and includes a subject term index and title list.)

  17. World Wide Web Search Engines: AltaVista and Yahoo.

    Science.gov (United States)

    Machovec, George S., Ed.

    1996-01-01

    Examines the history, structure, and search capabilities of Internet search tools AltaVista and Yahoo. AltaVista provides relevance-ranked feedback on full-text searches. Yahoo indexes Web "citations" only but does organize information hierarchically into predefined categories. Yahoo has recently become a publicly held company and…

  18. Federated search in the wild: the combined power of over a hundred search engines

    NARCIS (Netherlands)

    Nguyen, Dong-Phuong; Demeester, Thomas; Trieschnigg, Dolf; Hiemstra, Djoerd

    2012-01-01

    Federated search has the potential of improving web search: the user becomes less dependent on a single search provider and parts of the deep web become available through a unified interface, leading to a wider variety in the retrieved search results. However, a publicly available dataset for federa

  19. Federated Search in the Wild: the combined power of over a hundred search engines

    NARCIS (Netherlands)

    Nguyen, Dong-Phuong; Demeester, Thomas; Trieschnigg, Rudolf Berend; Hiemstra, Djoerd

    2012-01-01

    Federated search has the potential of improving web search: the user becomes less dependent on a single search provider and parts of the deep web become available through a unified interface, leading to a wider variety in the retrieved search results. However, a publicly available dataset for

  20. MSblender: A probabilistic approach for integrating peptide identifications from multiple database search engines.

    Science.gov (United States)

    Kwon, Taejoon; Choi, Hyungwon; Vogel, Christine; Nesvizhskii, Alexey I; Marcotte, Edward M

    2011-07-01

    Shotgun proteomics using mass spectrometry is a powerful method for protein identification but suffers limited sensitivity in complex samples. Integrating peptide identifications from multiple database search engines is a promising strategy to increase the number of peptide identifications and reduce the volume of unassigned tandem mass spectra. Existing methods pool statistical significance scores such as p-values or posterior probabilities of peptide-spectrum matches (PSMs) from multiple search engines after high scoring peptides have been assigned to spectra, but these methods lack reliable control of identification error rates as data are integrated from different search engines. We developed a statistically coherent method for integrative analysis, termed MSblender. MSblender converts raw search scores from search engines into a probability score for every possible PSM and properly accounts for the correlation between search scores. The method reliably estimates false discovery rates and identifies more PSMs than any single search engine at the same false discovery rate. Increased identifications increment spectral counts for most proteins and allow quantification of proteins that would not have been quantified by individual search engines. We also demonstrate that enhanced quantification contributes to improved sensitivity in differential expression analyses.

  1. Performance comparison of Word Sense Disambiguation Algorithm on Hindi Language Supporting Search Engines

    Directory of Open Access Journals (Sweden)

    Parul Rastogi

    2011-03-01

    Search engines are the basic tool for fetching information on the web. The IT revolution has affected not only technocrats but also native-language users, who nowadays also tend to look for information on the web. This creates a need for effective search engines that meet native users' needs and provide information in their native languages. A major part of India's population uses Hindi as a first language. Hindi-language web information retrieval is not in a satisfactory condition. Besides other technical setbacks, Hindi-language search engines face the problem of sense ambiguity. Our WSD method is based on Highest Sense Count (HSC). It works well with Google. The objective of the paper is a comparative analysis of the WSD algorithm's results on three Hindi-language search engines: Google, Raftaar, and Guruji. We took a test sample of 100 queries to check the performance of the WSD algorithm on the various search engines. The results show a promising improvement in the performance of the Google search engine, whereas the Guruji search engine showed the least improvement.

  2. Lessons learned from building the iMED intelligent medical search engine.

    Science.gov (United States)

    Luo, Gang

    2009-01-01

    Searching for medical information on the Web has become highly popular, but it remains a challenging task because searchers are often uncertain about their exact medical situations and unfamiliar with medical terminology. To address this challenge, we have built an intelligent medical Web search engine called iMed. iMed introduces and extends expert system technology into the search engine domain. It uses medical knowledge and an interactive questionnaire to help searchers form queries. This paper reports the lessons we learned from building the iMed system. We believe that many of these lessons can be applied to other medical search engines as well. We systematically discuss important issues in the new field of consumer-centric intelligent medical search, including input interface, output interface, search system, medical knowledge base, help system, and testing.

  3. Quality of healthcare websites: A comparison of a general-purpose vs. domain-specific search engine.

    Science.gov (United States)

    Abraham, Joanna; Reddy, Madhu

    2007-10-11

    In a pilot study, we had five typical Internet users evaluate the quality of health websites returned by a general-purpose search engine (Google) and a healthcare-specific search engine (Healthfinder). The evaluators used quality criteria developed by Mitretek/Health Information Technology Institute. Although both search engines provided high quality health websites, we found some important differences between the two types of search engines.

  4. How well they retrieve fresh news items: News search engine perspective

    Directory of Open Access Journals (Sweden)

    Mohammad Ubaidullah Bokhari

    2016-09-01

    Now that a number of specialized news search services have been developed, people increasingly choose news search engines over traditional web search engines when searching for news. It has therefore become necessary to evaluate these news search systems and help users select the best one. Much work has been done on measuring the traditional effectiveness of web search engines, chiefly relevance-based evaluation using precision-based measures, where topical relevance is often the main selection criterion; less work has been done on measuring the time-sensitive effectiveness of news search systems, where freshness matters. In this paper we use a scheme based on mathematical statistics to measure the time-sensitive effectiveness of four news search systems, i.e., how well they retrieve fresh documents. To our knowledge, there is no good measure that combines time-independent effectiveness with the relative freshness of news items. Our scheme uses the top ten results for 100 news queries on four news search engines. Its basic idea is to pool all the relevant results from the news search systems being compared into a single list ranked by recency and to analyse the relative positions of these results; it should help fill this gap.

  5. Segmentation Based Approach to Dynamic Page Construction from Search Engine Results

    OpenAIRE

    Kuppusamy, K.S.; Aghila, G.

    2012-01-01

    The results rendered by search engines are mostly a linear snippet list. With the prolific increase in the dynamism of web pages, there is a need for enhanced result lists from search engines in order to cope with the expectations of users. This paper proposes a model for the dynamic construction of a resultant page from the various results fetched by the search engine, based on the web page segmentation approach. With the incorporation of personalization through user profile during the can...

  6. Web-based Image Search Engines

    Institute of Scientific and Technical Information of China (English)

    陈立娜

    2001-01-01

    The operating principle of Web-based image search engines is briefly described. A detailed evaluation of several image search engines is made. Finally, the paper points out the deficiencies of present image search engines and their development trend.

  7. Win the game of Googleopoly unlocking the secret strategy of search engines

    CERN Document Server

    Bradley, Sean V

    2015-01-01

    Rank higher in search results with this guide to SEO and content building supremacy Google is not only the number one search engine in the world, it is also the number one website in the world. Only 5 percent of site visitors search past the first page of Google, so if you're not in those top ten results, you are essentially invisible. Winning the Game of Googleopoly is the ultimate roadmap to Page One Domination. The POD strategy is what gets you on that super-critical first page of Google results by increasing your page views. You'll learn how to shape your online presence for Search Engine

  8. Evaluation of Current Assessment Methods in Engineering Entrepreneurship Education

    Science.gov (United States)

    Purzer, Senay; Fila, Nicholas; Nataraja, Kavin

    2016-01-01

    Quality assessment is an essential component of education that allows educators to support student learning and improve educational programs. The purpose of this study is to evaluate the current state of assessment in engineering entrepreneurship education. We identified 52 assessment instruments covered in 29 journal articles and conference…

  9. Current opportunities and challenges in skeletal muscle tissue engineering

    NARCIS (Netherlands)

    Koning, Merel; Harmsen, Martin C; van Luyn, Marja J A; Werker, Paul M N

    The purpose of this article is to give a concise review of the current state of the art in tissue engineering (TE) of skeletal muscle and the opportunities and challenges for future clinical applicability. The endogenous progenitor cells of skeletal muscle, i.e. satellite cells, show a high

  10. Comparing image search behaviour in the ARRS GoldMiner search engine and a clinical PACS/RIS.

    Science.gov (United States)

    De-Arteaga, Maria; Eggel, Ivan; Do, Bao; Rubin, Daniel; Kahn, Charles E; Müller, Henning

    2015-08-01

    Information search has changed the way we manage knowledge and the ubiquity of information access has made search a frequent activity, whether via Internet search engines or increasingly via mobile devices. Medical information search is in this respect no different and much research has been devoted to analyzing the way in which physicians aim to access information. Medical image search is a much smaller domain but has gained much attention as it has different characteristics than search for text documents. While web search log files have been analysed many times to better understand user behaviour, the log files of hospital internal systems for search in a PACS/RIS (Picture Archival and Communication System, Radiology Information System) have rarely been analysed. Such a comparison between a hospital PACS/RIS search and a web system for searching images of the biomedical literature is the goal of this paper. Objectives are to identify similarities and differences in search behaviour of the two systems, which could then be used to optimize existing systems and build new search engines. Log files of the ARRS GoldMiner medical image search engine (freely accessible on the Internet) containing 222,005 queries, and log files of Stanford's internal PACS/RIS search called radTF containing 18,068 queries were analysed. Each query was preprocessed and all query terms were mapped to the RadLex (Radiology Lexicon) terminology, a comprehensive lexicon of radiology terms created and maintained by the Radiological Society of North America, so the semantic content in the queries and the links between terms could be analysed, and synonyms for the same concept could be detected. RadLex was mainly created for the use in radiology reports, to aid structured reporting and the preparation of educational material (Lanlotz, 2006) [1]. 
    In standard medical vocabularies such as MeSH (Medical Subject Headings) and UMLS (Unified Medical Language System) specific terms of radiology are often

  11. Perencanaan Search Engine E-commerce dengan Metode Latent Semantic Indexing Berbasis Multiplatform

    Directory of Open Access Journals (Sweden)

    Ni Made Ari Lestari

    2017-03-01

    E-commerce is the sale and purchase of goods through electronic systems such as the Internet, the WWW, or other computer networks. E-commerce involves electronic data interchange and automated data collection systems. Every e-commerce site provides a search box in which the user enters the desired items. E-commerce sites such as Tokopedia, Lazada, MatahariMall, and Amazon provide search engines that use only conventional search technology. With conventional search engines, the longer the input sentence, the larger and broader the set of search results. With semantic indexing technology, by contrast, the longer and clearer the description of the desired goods, the fewer and more accurate the results, which helps the user make a decision. This study discusses how to build a search engine for an e-commerce website using Latent Semantic Indexing. The process starts with Text Mining methods for word processing, followed by the Levenshtein Distance method for automatic word correction, and finally Latent Semantic Indexing for processing the information and producing the output.
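
    The automatic word correction step mentioned in this abstract relies on Levenshtein distance. The sketch below shows the standard dynamic-programming form of that distance and a naive nearest-word lookup; the `correct_word` helper and the sample vocabulary are hypothetical and are not described in the paper.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions, and
    substitutions needed to turn string a into string b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def correct_word(word, vocabulary):
    """Return the vocabulary entry closest to word (ties broken alphabetically)."""
    return min(vocabulary, key=lambda v: (levenshtein(word, v), v))


# Hypothetical product vocabulary for illustration.
print(correct_word("laptpo", ["laptop", "tablet", "phone"]))
```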

  12. Towards Identifying and Reducing the Bias of Disease Information Extracted from Search Engine Data.

    Science.gov (United States)

    Huang, Da-Cang; Wang, Jin-Feng; Huang, Ji-Xia; Sui, Daniel Z; Zhang, Hong-Yan; Hu, Mao-Gui; Xu, Cheng-Dong

    2016-06-01

    The estimation of disease prevalence in online search engine data (e.g., Google Flu Trends (GFT)) has received a considerable amount of scholarly and public attention in recent years. While the utility of search engine data for disease surveillance has been demonstrated, the scientific community still seeks ways to identify and reduce biases that are embedded in search engine data. The primary goal of this study is to explore new ways of improving the accuracy of disease prevalence estimations by combining traditional disease data with search engine data. A novel method, Biased Sentinel Hospital-based Area Disease Estimation (B-SHADE), is introduced to reduce search engine data bias from a geographical perspective. To monitor search trends on Hand, Foot and Mouth Disease (HFMD) in Guangdong Province, China, we tested our approach by selecting 11 keywords from the Baidu index platform, a Chinese big data analyst similar to GFT. The correlation between the number of real cases and the composite index was 0.8. After decomposing the composite index at the city level, we found that only 10 cities presented a correlation of close to 0.8 or higher. These cities were found to be more stable with respect to search volume, and they were selected as sample cities in order to estimate the search volume of the entire province. After the estimation, the correlation improved from 0.8 to 0.864. After fitting the revised search volume with historical cases, the mean absolute error was 11.19% lower than it was when the original search volume and historical cases were combined. To our knowledge, this is the first study to reduce search engine data bias levels through the use of rigorous spatial sampling strategies.

  13. Towards Identifying and Reducing the Bias of Disease Information Extracted from Search Engine Data.

    Directory of Open Access Journals (Sweden)

    Da-Cang Huang

    2016-06-01

    The estimation of disease prevalence in online search engine data (e.g., Google Flu Trends (GFT)) has received a considerable amount of scholarly and public attention in recent years. While the utility of search engine data for disease surveillance has been demonstrated, the scientific community still seeks ways to identify and reduce biases that are embedded in search engine data. The primary goal of this study is to explore new ways of improving the accuracy of disease prevalence estimations by combining traditional disease data with search engine data. A novel method, Biased Sentinel Hospital-based Area Disease Estimation (B-SHADE), is introduced to reduce search engine data bias from a geographical perspective. To monitor search trends on Hand, Foot and Mouth Disease (HFMD) in Guangdong Province, China, we tested our approach by selecting 11 keywords from the Baidu index platform, a Chinese big data analyst similar to GFT. The correlation between the number of real cases and the composite index was 0.8. After decomposing the composite index at the city level, we found that only 10 cities presented a correlation of close to 0.8 or higher. These cities were found to be more stable with respect to search volume, and they were selected as sample cities in order to estimate the search volume of the entire province. After the estimation, the correlation improved from 0.8 to 0.864. After fitting the revised search volume with historical cases, the mean absolute error was 11.19% lower than it was when the original search volume and historical cases were combined. To our knowledge, this is the first study to reduce search engine data bias levels through the use of rigorous spatial sampling strategies.

  14. Improving Scalability of Java Archive Search Engine through Recursion Conversion And Multithreading

    Directory of Open Access Journals (Sweden)

    Oscar Karnalim

    2016-05-01

    Because bytecode is always present in a Java archive, a bytecode-based Java archive search engine has been developed [1, 2]. Although this system is quite effective, it still lacks scalability, since many modules use recursive calls and the system uses only one core (a single thread). In this research, the Java archive search engine architecture is redesigned to improve its scalability. All recursions are converted to iterative form, although most of these modules are logically recursive and quite difficult to convert (e.g. Tarjan's strongly connected component algorithm). Each recursion can be converted by following its recursive pattern: it is broken down into four parts (the before and after actions of the current item and of its children) and converted to iteration with the help of a caller reference. This conversion improves scalability by avoiding stack overflow errors caused by method calls. Scalability is also improved by applying multithreading, which cuts processing time; shorter processing time may enable the system to handle larger data. Multithreading is applied to the major parts: the indexer, the vector space model (VSM) retriever, the low-rank vector space model (LRVSM) retriever, and the semantic relatedness calculator (which also involves multiprocessing). The correctness of both the recursion conversion and the multithreaded design is supported by the fact that all implementations yield similar results.
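
    The general idea of replacing recursion with an explicit stack, while keeping the "before" and "after" actions of each call, can be sketched with a depth-first traversal. This is a generic illustration written in Python rather than the paper's Java code; the function name and the sample graph are assumptions, and the paper's own conversion pattern (with a caller reference) may differ.

```python
_DONE = object()  # sentinel marking an exhausted child iterator


def dfs_iterative(graph, root):
    """Depth-first traversal without recursion.
    Returns (preorder, postorder) of the nodes reachable from root."""
    preorder, postorder = [], []
    visited = {root}
    preorder.append(root)  # "before" action of the root
    # Each stack frame stands in for one recursive call: (node, child iterator).
    stack = [(root, iter(graph.get(root, [])))]
    while stack:
        node, children = stack[-1]
        child = next(children, _DONE)
        if child is _DONE:
            stack.pop()
            postorder.append(node)  # "after" action, once all children are done
        elif child not in visited:
            visited.add(child)
            preorder.append(child)  # "before" action of the child
            stack.append((child, iter(graph.get(child, []))))
    return preorder, postorder


# Hypothetical graph for illustration.
graph = {"a": ["b", "c"], "b": ["d"], "c": ["d"], "d": []}
print(dfs_iterative(graph, "a"))
```

    Because the frames live on the heap instead of the call stack, a very deep graph no longer triggers a stack overflow (or, in Python, a `RecursionError`).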

  15. The History of the Internet Search Engine: Navigational Media and the Traffic Commodity

    Science.gov (United States)

    van Couvering, E.

    This chapter traces the economic development of the search engine industry over time, beginning with the earliest Web search engines and ending with the domination of the market by Google, Yahoo! and MSN. Specifically, it focuses on the ways in which search engines are similar to and different from traditional media institutions, and how the relations between traditional and Internet media have changed over time. In addition to its historical overview, a core contribution of this chapter is the analysis of the industry using a media value chain based on audiences rather than on content, and the development of traffic as the core unit of exchange. It shows that traditional media companies failed when they attempted to create vertically integrated portals in the late 1990s, based on the idea of controlling Internet content, while search engines succeeded in creating huge "virtually integrated" networks based on control of Internet traffic rather than Internet content.

  16. Search Engine Optimization Techniques Practiced in Organizations: A Study of Four Organizations

    CERN Document Server

    Akram, Muhammad; Hayat, Sikandar; Shafi, M Imran; Saeed, Umer

    2010-01-01

    Web spammers use Search Engine Optimization (SEO) techniques to increase the search ranking of web sites. In this paper we study the essential SEO techniques, such as directory submission, keyword generation, and link exchanges. SEO techniques can be applied as a marketing technique to obtain top listings in major search engines such as Google, Yahoo, and MSN. Our study examines these techniques from the perspectives of four different companies in the United Kingdom and Pakistan. According to these companies, the techniques are low in cost and have a high impact on profit: because most customers rely on the major search engines to find products on the internet, SEO provides the best opportunity to grow their business. This paper also describes the pros and cons of using these search engine optimization techniques in the four companies. We conclude that these techniques are essential for increasing their business profit and minimizing their marketing cost.

  17. Improving Ranking Persian Subjects in Search Engine Using Fuzzy Inference System

    Directory of Open Access Journals (Sweden)

    Elaheh Golzardi

    2013-09-01

    According to the research, search engines that rank Farsi content are much less efficient than English search engines. After reviewing the literature, we found that no Persian ranking system has so far been built on a fuzzy system, despite the proven performance of fuzzy systems, and that no search engine had been designed to accomplish this goal. To advance this goal, we therefore built a fuzzy inference system. It is constructed from the best available evidence, so that it can be regarded as largely bringing the intended page to the user. The proposed method displays the relevant pages to users so that they can reach their intended pages in less time and at lower cost. To evaluate the method, comparisons with other search engines were carried out.

  18. AN EFFICIENT APPROACH FOR KEYWORD SELECTION; IMPROVING ACCESSIBILITY OF WEB CONTENTS BY GENERAL SEARCH ENGINES

    Directory of Open Access Journals (Sweden)

    H. H. Kian

    2011-11-01

    General search engines often provide imprecise results even for detailed queries. There is therefore a vital need to elicit useful information, such as keywords, so that search engines can provide acceptable results for users' search queries. Although many methods have been proposed for extracting keywords automatically, all of them attempt to improve recall, precision, and other criteria that describe how well the method has done its job as an author would. This paper presents a new automatic keyword extraction method which improves the accessibility of web content by search engines. The proposed method defines coefficients that determine the efficiency of features and tries to optimize them using a genetic algorithm. Furthermore, it evaluates candidate keywords by a function that utilizes the results of search engines. Experiments demonstrate that, compared with other methods, the proposed method achieves a higher score from search engines without a noticeable loss of recall or precision.

  19. Using Search Engines Properly

    Institute of Scientific and Technical Information of China (English)

    刘鑫

    2015-01-01

    Search engines can help users find specific information on the Internet, but they also return a large amount of irrelevant information. This paper introduces how to choose a search engine and offers tips for using search engines, so that people can find exactly the information they need in as little time as possible.

  20. Optimizing Online Suicide Prevention: A Search Engine-Based Tailored Approach.

    Science.gov (United States)

    Arendt, Florian; Scherr, Sebastian

    2016-10-14

    Search engines are increasingly used to seek suicide-related information online, which can serve both harmful and helpful purposes. Google acknowledges this fact and presents a suicide-prevention result for particular search terms. Unfortunately, the result is only presented to a limited number of visitors. Hence, Google is missing the opportunity to provide help to vulnerable people. We propose a two-step approach to a tailored optimization: First, research will identify the risk factors. Second, search engines will reweight algorithms according to the risk factors. In this study, we show that the query share of the search term "poisoning" on Google shows substantial peaks corresponding to peaks in actual suicidal behavior. Accordingly, thresholds for showing the suicide-prevention result should be set to the lowest levels during the spring, on Sundays and Mondays, on New Year's Day, and on Saturdays following Thanksgiving. Search engines can help to save lives globally by utilizing a more tailored approach to suicide prevention.

  1. The MediaMill TRECVID 2008 semantic video search engine

    NARCIS (Netherlands)

    Snoek, C.G.M.; van de Sande, K.E.A.; de Rooij, O.; Huurnink, B.; van Gemert, J.C.; Uijlings, J.R.R.; He, J.; Li, X.; Everts, I.; Nedovic, V.; van Liempt, M.; van Balen, R.; Yan, F.; Tahir, M.A.; Mikolajczyk, K.; Kittler, J.; de Rijke, M.; Geusebroek, J.M.; Gevers, T.; Worring, M.; Smeulders, A.W.M.; Koelma, D.C.

    2008-01-01

    In this paper we describe our TRECVID 2008 video retrieval experiments. The MediaMill team participated in three tasks: concept detection, automatic search, and interactive search. Rather than continuing to increase the number of concept detectors available for retrieval, our TRECVID 2008 experiment

  2. The MediaMill TRECVID 2010 semantic video search engine

    NARCIS (Netherlands)

    Snoek, C.G.M.; van de Sande, K.E.A.; de Rooij, O.; Huurnink, B.; Gavves, E.; Odijk, D.; de Rijke, M.; Gevers, T.; Worring, M.; Koelma, D.C.; Smeulders, A.W.M.

    2010-01-01

    In this paper we describe our TRECVID 2010 video retrieval experiments. The MediaMill team participated in three tasks: semantic indexing, known-item search, and instance search. The starting point for the MediaMill concept detection approach is our top-performing bag-of-words system of TRECVID 2009

  3. Sagace: A web-based search engine for biomedical databases in Japan

    Directory of Open Access Journals (Sweden)

    Morita Mizuki

    2012-10-01

    Background: In the big data era, biomedical research continues to generate a large amount of data, and the generated information is often stored in a database and made publicly available. Although combining data from multiple databases should accelerate further studies, the current number of life sciences databases is too large to grasp the features and contents of each database. Findings: We have developed Sagace, a web-based search engine that enables users to retrieve information from a range of biological databases (such as gene expression profiles and proteomics data) and biological resource banks (such as mouse models of disease and cell lines). With Sagace, users can search more than 300 databases in Japan. Sagace offers features tailored to biomedical research, including manually tuned ranking, a faceted navigation to refine search results, and rich snippets constructed with retrieved metadata for each database entry. Conclusions: Sagace will be valuable for experts who are involved in biomedical research and drug development in both academia and industry. Sagace is freely available at http://sagace.nibio.go.jp/en/.

  4. A Review of Engine Seal Performance and Requirements for Current and Future Army Engine Platforms

    Science.gov (United States)

    Delgado, Irebert R.; Proctor, Margaret P.

    2008-01-01

    Sand ingestion continues to impact combat ground and air vehicles in military operations in the Middle East. The T-700 engine used in Apache and Blackhawk helicopters has been subjected to increased overhauls due to sand and dust ingestion during desert operations. Engine component wear includes compressor and turbine blades/vanes resulting in decreased engine power and efficiency. Engine labyrinth seals have also been subjected to sand and dust erosion resulting in tooth tip wear, increased clearances, and loss in efficiency. For the current investigation, a brief overview is given of the history of the T-700 engine development with respect to sand and dust ingestion requirements. The operational condition of labyrinth seals taken out of service from 4 different locations of the T-700 engine during engine overhauls are examined. Collaborative efforts between the Army and NASA to improve turbine engine seal leakage and life capability are currently focused on noncontacting, low leakage, compliant designs. These new concepts should be evaluated for their tolerance to sand laden air. Future R&D efforts to improve seal erosion resistance and operation in desert environments are recommended

  5. Website Traffics Acquisition Model for E-Business using Search Engine Optimization and Sitemap Submission (SEOSS)

    OpenAIRE

    ZULAZEZE SAHRI

    2016-01-01

    Abstract—It is inevitable to accept that today’s business trends focus on selling products online using the latest web application system and website marketing tools. This phenomenon creates a high competition on search engine ranking among website owners in gaining business leads, visitor traffics and acquisitions. This paper proposed a model on improving e-business traffics acquisitions in terms of the number of new visitors (traffics) and returning visitors using Search Engine Optimization...

  6. Model-based systems engineering in the execution of search and rescue operations

    OpenAIRE

    Hunt, Spencer S.

    2015-01-01

    Approved for public release; distribution is unlimited. Complex systems engineering problems require robust modeling early in the design process in order to analyze crucial design requirements and interactions. This thesis emphasizes the need for such modeling through multiple model-based systems engineering techniques as they apply to the execution of search and rescue. Through the development of a design reference mission, this thesis illustrates how a search and rescue architecture can u...

  7. Search Engines Comparison on the Basis of Session Duration and Click Hits

    Directory of Open Access Journals (Sweden)

    Rajesh Kumar Goutam

    2011-03-01

    The evaluation of search engines has greatly diversified in recent years. Evaluation campaigns must continuously reconsider their tasks and update their evaluation functions in order to satisfy users. We present two user-action-dependent approaches to ranking results, namely session duration and click hits. Furthermore, we conducted an experiment with 25 TREC queries to compare five popular search engines.

  8. DBLC_SPAMCLUST: SPAMDEXING DETECTION BY CLUSTERING CLIQUE-ATTACKS IN WEB SEARCH ENGIN

    OpenAIRE

    Dr.S.K.JAYANTHI,; Ms.S.Sasikala

    2011-01-01

    Search engines play an increasingly important role in discovering information on the web. Spam web pages, however, employ various tricks to deceive search engines and thereby achieve undeserved ranks. This paper proposes an algorithm, DBLCSPAMCLUST, for spam detection based on content and link attributes; it is an extension of DBSpamClust [1]. Experiments show that the method can filter out web spam effectively.

  9. DBLC_SPAMCLUST: SPAMDEXING DETECTION BY CLUSTERING CLIQUE-ATTACKS IN WEB SEARCH ENGIN

    Directory of Open Access Journals (Sweden)

    Dr.S.K.JAYANTHI,

    2011-06-01

    Search engines play an increasingly important role in discovering information on the web. Spam web pages, however, employ various tricks to deceive search engines and thereby achieve undeserved ranks. This paper proposes an algorithm, DBLCSPAMCLUST, for spam detection based on content and link attributes; it is an extension of DBSpamClust [1]. Experiments show that the method can filter out web spam effectively.

  10. Penerapan Teknik SEO (Search Engine Optimization) pada Website dalam Strategi Pemasaran Melalui Internet

    Directory of Open Access Journals (Sweden)

    Rony Baskoro Lukito

    2014-12-01

    The purpose of this research is to determine how to optimize a web design so that it attracts more visitors. The number of Internet users worldwide continues to grow in line with advances in information technology, and the marketing of products and services is no longer limited to print and electronic media. Moreover, the Internet is relatively inexpensive as a marketing medium compared with television, and it reaches different parts of the world 24 hours a day. However, for a site to be visited by many Internet users, a good outward appearance is not enough. Websites that serve as a marketing medium must be built according to the correct rules so that they become optimal marketing media. One such rule is that the content of the website should be indexed well by search engines such as Google. The optimization discussed here focuses on Google because, according to the authors, 83% of Internet users worldwide use Google as their search engine. Search engine optimization (SEO) is an important practice that makes a website easier for users to find with the desired keywords.

  11. Penerapan Teknik Seo (Search Engine Optimization) pada Website dalam Strategi Pemasaran melalui Internet

    Directory of Open Access Journals (Sweden)

    Rony Baskoro Lukito

    2014-12-01

    The purpose of this research is to determine how to optimize a web design so that it attracts more visitors. The number of Internet users worldwide continues to grow in line with advances in information technology, and the marketing of products and services is no longer limited to print and electronic media. Moreover, the Internet is relatively inexpensive as a marketing medium compared with television, and it reaches different parts of the world 24 hours a day. However, for a site to be visited by many Internet users, a good outward appearance is not enough. Websites that serve as a marketing medium must be built according to the correct rules so that they become optimal marketing media. One such rule is that the content of the website should be indexed well by search engines such as Google. The optimization discussed here focuses on Google because, according to the authors, 83% of Internet users worldwide use Google as their search engine. Search engine optimization (SEO) is an important practice that makes a website easier for users to find with the desired keywords.

  12. Bacteria engineered for fuel ethanol production: current status

    Energy Technology Data Exchange (ETDEWEB)

    Dien, B.S.; Cotta, M.A. [National Center for Agricultural Utilization Research, Agricultural Research Service, USDA, Peoria, IL (United States); Jeffries, T.W. [Inst. for Microbial and Biochemical Technology, Forest Service, Forest Products Lab., USDA, Madison, WI (United States)

    2004-07-01

    The lack of industrially suitable microorganisms for converting biomass into fuel ethanol has traditionally been cited as a major technical roadblock to developing a bioethanol industry. In the last two decades, numerous microorganisms have been engineered to selectively produce ethanol. Lignocellulosic biomass contains complex carbohydrates that necessitate utilizing microorganisms capable of fermenting sugars not fermentable by brewers' yeast. The most significant of these is xylose. The greatest successes have been in the engineering of gram-negative bacteria: Escherichia coli, Klebsiella oxytoca, and Zymomonas mobilis. E. coli and K. oxytoca are naturally able to use a wide spectrum of sugars, and work has concentrated on engineering these strains to selectively produce ethanol. Z. mobilis produces ethanol at high yields, but ferments only glucose and fructose. Work on this organism has concentrated on introducing pathways for the fermentation of arabinose and xylose. The history of constructing these strains and current progress in refining them are detailed in this review. (orig.)

  13. Current and future searches for neutrinoless double beta decay

    Science.gov (United States)

    Dolinski, Michelle J.

    2016-09-01

    With the discovery of neutrino oscillations and neutrino mass, it has become a pressing question whether neutrinos have distinct antiparticle states. The most practical experimental approach to answering this question is the search for neutrinoless double beta decay, a version of a rare nuclear process that would violate lepton number conservation. The observation of neutrinoless double beta decay would prove that neutrinos are their own antiparticles. Neutrinoless double beta decay experiments deploy large source masses consisting of a select few (usually enriched) isotopes of interest. Detectors must achieve extremely low levels of radioactive background to detect this rare decay. I will report on recent searches for neutrinoless double beta decay and discuss the technical challenges that the next generation of experiments will overcome.

  14. Impact of Internet Search Engines on OPAC Users: A Study of Punjabi University, Patiala (India)

    Science.gov (United States)

    Kumar, Shiv

    2012-01-01

    Purpose: The aim of this paper is to study the impact of internet search engine usage with special reference to OPAC searches in the Punjabi University Library, Patiala, Punjab (India). Design/methodology/approach: The primary data were collected from 352 users comprising faculty, research scholars and postgraduate students of the university. A…

  15. Inefficiency and Bias of Search Engines in Retrieving References Containing Scientific Names of Fossil Amphibians

    Science.gov (United States)

    Brown, Lauren E.; Dubois, Alain; Shepard, Donald B.

    2008-01-01

    Retrieval efficiencies of paper-based references in journals and other serials containing 10 scientific names of fossil amphibians were determined for seven major search engines. Retrievals were compared to the number of references obtained covering the period 1895-2006 by a Comprehensive Search. The latter was primarily a traditional…

  16. Search Engine Marketing (SEM): Financial & Competitive Advantages of an Effective Hotel SEM Strategy

    Directory of Open Access Journals (Sweden)

    Leora Halpern Lanz

    2015-05-01

    Search engine marketing (SEM) and search engine optimization (SEO) are keystones of a hotel's marketing strategy; in fact, research shows that 90% of travelers start their vacation planning with a Google search. Learn five strategies that can enhance a hotel's SEO and SEM efforts to boost bookings.

  17. SpEnD: Linked Data SPARQL Endpoints Discovery Using Search Engines

    OpenAIRE

    Yumusak, Semih; Dogdu, Erdogan; KODAZ, Halife; Kamilaris, Andreas

    2016-01-01

    In this study, a novel metacrawling method is proposed for discovering and monitoring linked data sources on the Web. We implemented the method in a prototype system, named SPARQL Endpoints Discovery (SpEnD). SpEnD starts with a "search keyword" discovery process for finding relevant keywords for the linked data domain and specifically SPARQL endpoints. Then, these search keywords are utilized to find linked data sources via popular search engines (Google, Bing, Yahoo, Yandex). By using this ...

  18. Search Engine Optimization for Flash Best Practices for Using Flash on the Web

    CERN Document Server

    Perkins, Todd

    2009-01-01

    Search Engine Optimization for Flash dispels the myth that Flash-based websites won't show up in a web search by demonstrating exactly what you can do to make your site fully searchable -- no matter how much Flash it contains. You'll learn best practices for using HTML, CSS and JavaScript, as well as SWFObject, for building sites with Flash that will stand tall in search rankings.

  19. A Hybrid Quantum Search Engine: A Fast Quantum Algorithm for Multiple Matches

    CERN Document Server

    Younes, A; Miller, J; Younes, Ahmed; Rowe, Jon; Miller, Julian

    2003-01-01

    In this paper we present a quantum algorithm that works very efficiently when there are multiple matches within the search space; when there are few matches, the algorithm performs classically. This allows us to propose a hybrid quantum search engine that integrates Grover's algorithm with the algorithm proposed here to achieve better general performance than any pure classical or quantum search algorithm.
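    As a rough illustration of why the number of matches matters, the sketch below evaluates the textbook success probability of Grover's algorithm, sin^2((2k + 1) * theta) with theta = arcsin(sqrt(M / N)), for N items, M matches and k iterations. It covers only standard Grover search, not the algorithm proposed in the paper; the function names and the example sizes are assumptions made for illustration.

```python
import math


def grover_success_probability(n_items: int, n_matches: int, iterations: int) -> float:
    """Probability of measuring a matching item after the given number of Grover iterations."""
    # theta is the angle between the uniform superposition and the non-matching subspace.
    theta = math.asin(math.sqrt(n_matches / n_items))
    return math.sin((2 * iterations + 1) * theta) ** 2


def best_iteration_count(n_items: int, n_matches: int) -> int:
    """Iteration count that brings (2k + 1) * theta closest to pi / 2."""
    theta = math.asin(math.sqrt(n_matches / n_items))
    return max(0, round(math.pi / (4 * theta) - 0.5))


if __name__ == "__main__":
    n_items = 1024  # hypothetical search-space size
    for n_matches in (1, 16, 256, 512):
        k = best_iteration_count(n_items, n_matches)
        p = grover_success_probability(n_items, n_matches, k)
        guess = n_matches / n_items  # success chance of a single random classical guess
        print(f"matches={n_matches:4d}  iterations={k:2d}  success={p:.3f}  random guess={guess:.3f}")
```

    With zero iterations the formula reduces to M / N, the chance of a single random guess, which gives a simple baseline for comparing the quantum and classical regimes.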

  20. Stirling engines. (Latest citations from the COMPENDEX database). Published Search

    Energy Technology Data Exchange (ETDEWEB)

    1992-12-01

    The bibliography contains citations concerning Stirling engine technology. Design, development, performance testing, and applications are discussed, including power generation, cryogenic cooling, solar power applications, and ground and marine vehicles. The citations also examine engine component design and material testing results. (Contains 250 citations and includes a subject term index and title list.)

  1. Reconsidering the Rhizome: A Textual Analysis of Web Search Engines as Gatekeepers of the Internet

    Science.gov (United States)

    Hess, A.

    Critical theorists have often drawn from Deleuze and Guattari's notion of the rhizome when discussing the potential of the Internet. While the Internet may structurally appear as a rhizome, its day-to-day usage by millions via search engines precludes experiencing the random interconnectedness and potential democratizing function. Through a textual analysis of four search engines, I argue that Web searching has grown hierarchies, or "trees," that organize data in tracts of knowledge and place users in marketing niches rather than assist in the development of new knowledge.

  2. Search Engine Optimization and Its Importance for Business Visibility and Branding

    OpenAIRE

    Vo, Tuan

    2016-01-01

    In the Information Age, it is common for a business to have an online presence. However, presence is not enough: the business has to be clearly visible on the Internet whenever people search for the product, service or resource it provides in order to survive and thrive in an increasingly competitive market. As a result, search engine marketing (SEM) in general and search engine optimization (SEO) in particular is an essential tool that can be applied to d...

  3. The invisible Web uncovering information sources search engines can't see

    CERN Document Server

    Sherman, Chris

    2001-01-01

    Enormous expanses of the Internet are unreachable with standard web search engines. This book provides the key to finding these hidden resources by identifying how to uncover and use invisible web resources. Mapping the invisible Web, when and how to use it, assessing the validity of the information, and the future of Web searching are topics covered in detail. Only 16 percent of Net-based information can be located using a general search engine. The other 84 percent is what is referred to as the invisible Web, made up of information stored in databases. Unlike pages on the visible Web, informa

  4. Meteorological Conditions Causing Jet-Engine Powerloss Events: Current Understanding

    Science.gov (United States)

    Strapp, J. W.; Ratvasky, T. P.

    2009-09-01

    The aviation industry is currently investigating a regular occurrence of jet engine-powerloss events which have now been attributed to the ingestion of atmospheric ice particles, usually in the vicinity of deep convection. There is a limited amount of information on the cloud microphysical properties near the cores of deep convection due to the potential hazards of flying in these areas, and due to the fact that it is a very challenging environment for current instrumentation. Most of the information that has been used to deduce the details of the conditions that cause engine powerloss has been extracted from the event-aircraft flight data recorders, pilot interviews, ground radar and satellite, a series of flight test programs in the 1950s and again in the 1990s, and the most recently available limited data from the cloud physics community. These have led to the conclusion that engine events occur due to flight through high mass concentrations of ice particles, probably with ice water contents (IWCs) in excess of 2 grams per cubic meter, and perhaps as high as 8. The limited microphysical data available has been used to suggest a median mass diameter of the ice particles of ~200 microns, with some evidence that it may be as low as 40 microns. These small particle sizes in the presence of high mass concentration are consistent with the lack of radar echoes > 20 dBZ observed on the pilot's radar, a consistent observation during engine events. The Engine Harmonization Working Group, an industry/regulator/government committee investigating engine powerloss, has concluded that the level of understanding of the properties of these clouds is inadequate to provide guidance to industry for engine design and testing. In order to address this issue, NASA and Environment Canada are planning to instrument an aircraft to make measurements in high IWC regions of tropical monsoon and continental convection. There is also a significant effort to upgrade and develop new

  5. Engineering Critical Current Density Improvement in Ag-Bi-2223 Tapes

    DEFF Research Database (Denmark)

    Wang, W. G.; Seifi, Behrouz; Eriksen, Morten;

    2000-01-01

    Ag alloy sheathed Bi-2223 multifilament tapes were produced by the powder-in-tube method. Engineering critical current density improvement has been achieved through both enhancement of critical current density by control of the thermal behavior of oxide powder and by an increase of the filling...... the superconductor composite sustaining large proportional oxide ceramics in the composite during drawing and rolling process. By optimization of the thermal and mechanical process, a Je of 12 kA/cm² has been achieved in a 0.18 × 3.1 mm² size tape which carried 67 A...
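    The quoted figures can be cross-checked with a short calculation. The sketch below assumes that the engineering critical current density Je is the critical current divided by the full tape cross-section, and that the tape measures 0.18 mm by 3.1 mm (our reading of the dimensions in the abstract); the function name is hypothetical.

```python
def engineering_critical_current_density(ic_amps: float, width_mm: float, thickness_mm: float) -> float:
    """Return Je in kA/cm^2 for a tape with a rectangular cross-section."""
    area_cm2 = (width_mm * thickness_mm) / 100.0  # 1 cm^2 = 100 mm^2
    return ic_amps / area_cm2 / 1000.0            # A/cm^2 -> kA/cm^2


if __name__ == "__main__":
    # Values taken from the abstract; the 0.18 mm x 3.1 mm split is an assumption.
    je = engineering_critical_current_density(ic_amps=67.0, width_mm=3.1, thickness_mm=0.18)
    print(f"Je = {je:.1f} kA/cm^2")
```

    Under these assumptions the result agrees with the Je of about 12 kA/cm² reported in the abstract.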

  6. On the Technological Improvement of Search Engines

    Institute of Scientific and Technical Information of China (English)

    赵丹群; 喀碧竹

    2003-01-01

    As a new and important tool for searching the huge amount of information on the Web, the search engine has made rapid progress in recent years. Meanwhile, it is confronted with various problems. This article discusses the improvement of search engines from three aspects: changing the search mode, applying the theory and knowledge of traditional information retrieval, and reinforcing post-processing of search results.

  7. FOAMSearch.net: A custom search engine for emergency medicine and critical care.

    Science.gov (United States)

    Raine, Todd; Thoma, Brent; Chan, Teresa M; Lin, Michelle

    2015-08-01

    The number of online resources read by and pertinent to clinicians has increased dramatically. However, most healthcare professionals still use mainstream search engines as their primary port of entry to the resources on the Internet. These search engines use algorithms that do not make it easy to find clinician-oriented resources. FOAMSearch, a custom search engine (CSE), was developed to find relevant, high-quality online resources for emergency medicine and critical care (EMCC) clinicians. Using Google™ algorithms, it searches a vetted list of >300 blogs, podcasts, wikis, knowledge translation tools, clinical decision support tools and medical journals. Utilisation has increased progressively to >3000 users/month since its launch in 2011. Further study of the role of CSEs to find medical resources is needed, and it might be possible to develop similar CSEs for other areas of medicine.

  8. A reliability measure of protein-protein interactions and a reliability measure-based search engine.

    Science.gov (United States)

    Park, Byungkyu; Han, Kyungsook

    2010-02-01

    Many methods developed for estimating the reliability of protein-protein interactions are based on the topology of protein-protein interaction networks. This paper describes a new reliability measure for protein-protein interactions, which does not rely on the topology of protein interaction networks, but expresses biological information on functional roles, sub-cellular localisations and protein classes as a scoring schema. The new measure is useful for filtering many spurious interactions, as well as for estimating the reliability of protein interaction data. In particular, the reliability measure can be used to search protein-protein interactions with the desired reliability in databases. The reliability-based search engine is available at http://yeast.hpid.org. We believe this is the first search engine for interacting proteins that is made available to the public. The search engine and the reliability measure of protein interactions should provide useful information for determining proteins to focus on.

  9. RSECM: Robust Search Engine using Context-based Mining for Educational Big Data

    Directory of Open Access Journals (Sweden)

    D. Pratiba

    2016-12-01

    With accelerating growth in the educational sector, aided by ICT and cloud-based services, there is a steady rise in educational big data, for which storage and processing are the prime challenges. Although many recent attempts have used open-source frameworks such as Hadoop for storage, issues are still reported with security management and data analysis. As a result, mining techniques are of limited applicability to upcoming search engines because educational data are unstructured. The proposed system introduces a technique called RSECM, i.e., Robust Search Engine using Context-based Modeling, that presents a novel archival and search engine. RSECM generates its own massive stream of educational big data and performs an efficient search of the data. The results show that RSECM outperforms SQL-based approaches in retrieval speed for dynamic user-defined queries.

  10. Using social annotation and web log to enhance search engine

    CERN Document Server

    Nguyen, Vu Thanh

    2009-01-01

    Search services have developed rapidly on the social Internet and help web users find documents easily, so finding the best search method remains an ongoing goal. This paper introduces a hybrid of the LPageRank algorithm and the Social Sim Rank algorithm. LPageRank uses link structure to rank the priority of a page; it does not consider the content of the page or of the query. Therefore, we use social annotations to create a latent semantic association between queries and annotations. In this model, we use the SocialPageRank and LPageRank algorithms to enhance the accuracy of the search system. To test and evaluate the proposed model, we applied it to the Music Machine website using its web logs.
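    To make the idea of ranking pages purely by link structure concrete, here is a minimal sketch of the standard PageRank power iteration. It is not the LPageRank or SocialPageRank variant from the paper, whose details the abstract does not give; the function name, the damping factor of 0.85 and the toy link graph are all assumptions made for illustration.

```python
def pagerank(links: dict[str, list[str]], damping: float = 0.85, iterations: int = 50) -> dict[str, float]:
    """Rank pages by link structure alone, ignoring page and query content."""
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Every page receives a small baseline share (the "random jump" term).
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its rank on equally to the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page without outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


if __name__ == "__main__":
    # Hypothetical four-page site: "d" links out but nothing links to it.
    toy_links = {"a": ["b"], "b": ["c"], "c": ["a"], "d": ["a"]}
    for page, score in sorted(pagerank(toy_links).items(), key=lambda item: -item[1]):
        print(f"{page}: {score:.4f}")
```

    Because the score depends only on who links to whom, a page that nobody links to ends up with the baseline share regardless of its content, which is the limitation that the social-annotation component described above is meant to address.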

  11. Current challenges for education of nuclear engineers. Beyond nuclear basics

    Energy Technology Data Exchange (ETDEWEB)

    Schoenfelder, Christian [AREVA GmbH, Offenbach (Germany). Training Center

    2014-07-15

    In past decades, curricula for the education of nuclear engineers (either as a major or minor subject) have been well established all over the world. However, from the point of view of a nuclear supplier, recent experiences in large and complex new build as well as modernization projects have shown that important competences required in these projects were not addressed during the education of young graduates. Consequently, in the past nuclear industry has been obliged to either accept long periods for job familiarization, or to develop and implement various dedicated internal training measures. Although the topics normally addressed in nuclear engineering education (like neutron and reactor physics, nuclear materials or thermohydraulics and the associated calculation methods) build up important competences, this paper shows that the current status of nuclear applications requires adaptations of educational curricula. As a conclusion, when academic nuclear engineering curricula start taking into account current competence needs in nuclear industry, it will be for the benefit of the current and future generation of nuclear engineers. They will be better prepared for their future job positions and career perspectives, especially on an international level. The recommendations presented should not only be of importance for the nuclear fission field, but also for the fusion community. Here, the Horizon 2020 Roadmap to Fusion as published in 2012 now is focusing on ITER and on a longer-term development of fusion technology for a future demonstration reactor DEMO. The very challenging work program is leading to a strong need for exactly those skills that are described in this article.

  12. Efficient Retrieval of Images for Search Engine by Visual Similarity and Re Ranking

    Directory of Open Access Journals (Sweden)

    Viswa S S

    2013-06-01

    Nowadays, web-scale image search engines (e.g. Google Image Search, Microsoft Live Image Search) rely almost purely on surrounding text features. Users type keywords in the hope of finding a certain type of image, and the search engine returns thousands of images ranked by the keywords extracted from the surrounding text. However, many of the returned images are noisy, disorganized, or irrelevant. Even Google and Microsoft do not use visual information when searching for images. The idea is to use visual information to re-rank and improve text-based image search results, improving the precision of the text-based ranking by incorporating the information conveyed by the visual modality. The typical assumption that the top-ranked images in the text-based search result are equally relevant is relaxed by linking the relevance of the images to their initial rank positions. Then, a number of images from the initial search result are employed as prototypes that visually represent the query and that are subsequently used to construct meta re-rankers; i.e., the most relevant images are found by visual similarity and the average scores are calculated. By applying different meta re-rankers to an image from the initial result, re-ranking scores are generated, which are then used to find the new rank position for the image in the re-ranked search result. Human supervision is introduced to learn the model weights offline, prior to the online re-ranking process. While model learning requires manual labelling of the results for a few queries, the resulting model is query independent and therefore applicable to any other query. The experimental results on a representative web image search dataset comprising 353 queries demonstrate that the proposed method outperforms the existing supervised and unsupervised re-ranking approaches. Moreover, it improves the performance over the text-based image search engine by more than 25.48%.

  13. Complex dynamics of our economic life on different scales: insights from search engine query data.

    Science.gov (United States)

    Preis, Tobias; Reith, Daniel; Stanley, H Eugene

    2010-12-28

    Search engine query data deliver insight into the behaviour of individuals who are the smallest possible scale of our economic life. Individuals are submitting several hundred million search engine queries around the world each day. We study weekly search volume data for various search terms from 2004 to 2010 that are offered by the search engine Google for scientific use, providing information about our economic life on an aggregated collective level. We ask the question whether there is a link between search volume data and financial market fluctuations on a weekly time scale. Both collective 'swarm intelligence' of Internet users and the group of financial market participants can be regarded as a complex system of many interacting subunits that react quickly to external changes. We find clear evidence that weekly transaction volumes of S&P 500 companies are correlated with weekly search volume of corresponding company names. Furthermore, we apply a recently introduced method for quantifying complex correlations in time series with which we find a clear tendency that search volume time series and transaction volume time series show recurring patterns.

  14. Exploring the Relevance of Search Engines: An Overview of Google as a Case Study

    Directory of Open Access Journals (Sweden)

    Ricardo Beltrán-Alfonso

    2017-08-01

    The huge amount of data on the Internet and the diverse strategies used to link this information with relevant searches through Linked Data have generated a revolution in data treatment and representation. Nevertheless, conventional search engines like Google remain well-received tools for carrying out searches. This article presents a study of the development and evolution of search engines, specifically analyzing the relevance of findings based on the number of results displayed in paging systems, with Google as a case study. Finally, it is intended to contribute to indexing criteria in search results, based on an approach to the Semantic Web as a stage in the evolution of the Web.

  15. Search for Flavor-Changing Neutral-Current Charm Decays

    CERN Document Server

    Aubert, B; Bóna, M; Boutigny, D; Couderc, F; Karyotakis, Yu; Lees, J P; Poireau, V; Tisserand, V; Zghiche, A; Graugès-Pous, E; Palano, A; Chen, J C; Qi, N D; Rong, G; Wang, P; Zhu, Y S; Eigen, G; Ofte, I; Stugu, B; Abrams, G S; Battaglia, M; Brown, D N; Button-Shafer, J; Cahn, R N; Charles, E; Gill, M S; Groysman, Y; Jacobsen, R G; Kadyk, J A; Kerth, L T; Kolomensky, Y G; Kukartsev, G; Lynch, G; Mir, L M; Orimoto, T J; Pripstein, M; Roe, N A; Ronan, M T; Wenzel, W A; Del Amo-Sánchez, P; Barrett, M; Ford, K E; Hart, A J; Harrison, T J; Hawkes, C M; Morgan, S E; Watson, A T; Held, T; Koch, H; Lewandowski, B; Pelizaeus, M; Peters, K; Schröder, T; Steinke, M; Boyd, J T; Burke, J P; Cottingham, W N; Walker, D; Asgeirsson, D J; Çuhadar-Dönszelmann, T; Fulsom, B G; Hearty, C; Knecht, N S; Mattison, T S; McKenna, J A; Khan, A; Kyberd, P; Saleem, M; Sherwood, D J; Teodorescu, L; Blinov, V E; Bukin, A D; Druzhinin, V P; Golubev, V B; Onuchin, A P; Serednyakov, S I; Skovpen, Y I; Solodov, E P; Todyshev, K Y; Best, D S; Bondioli, M; Bruinsma, M; Chao, M; Curry, S; Eschrich, I; Kirkby, D; Lankford, A J; Lund, P; Mandelkern, M A; Mommsen, R K; Röthel, W; Stoker, D P; Abachi, S; Buchanan, C; Foulkes, S D; Gary, J W; Long, O; Shen, B C; Wang, K; Zhang, L; Hadavand, H K; Hill, E J; Paar, H P; Rahatlou, S; Sharma, V; Berryhill, J W; Campagnari, C; Cunha, A; Dahmes, B; Hong, T M; Kovalskyi, D; Richman, J D; Beck, T W; Eisner, A M; Flacco, C J; Heusch, C A; Kroseberg, J; Lockman, W S; Nesom, G; Schalk, T; Schumm, B A; Seiden, A; Spradlin, P; Williams, D C; Wilson, M G; Albert, J; Chen, E; Dvoretskii, A; Fang, F; Hitlin, D G; Narsky, I; Piatenko, T; Porter, F C; Ryd, A; Samuel, A; Mancinelli, G; Meadows, B T; Mishra, K; Sokoloff, M D; Blanc, F; Bloom, P C; Chen, S; Ford, W T; Hirschauer, J F; Kreisel, A; Nagel, M; Nauenberg, U; Olivas, A; Ruddick, W O; Smith, J G; Ulmer, K A; Wagner, S R; Zhang, J; Chen, A; Eckhart, E A; Soffer, A; Toki, W H; Wilson, R J; Winklmeier, F; Zeng, Q; 
Altenburg, D D; Feltresi, E; Hauke, A; Jasper, H; Merkel, J; Petzold, A; Spaan, B; Brandt, T; Klose, V; Lacker, H M; Mader, W F; Nogowski, R; Schubert, J; Schubert, K R; Schwierz, R; Sundermann, J E; Volk, A; Bernard, D; Bonneaud, G R; Latour, E; Thiebaux, C; Verderi, M; Clark, P J; Gradl, W; Muheim, F; Playfer, S; Robertson, A I; Xie, Y; Andreotti, M; Bettoni, D; Bozzi, C; Calabrese, R; Cibinetto, G; Luppi, E; Negrini, M; Petrella, A; Piemontese, L; Prencipe, E; Anulli, F; Baldini-Ferroli, R; Calcaterra, A; De Sangro, R; Finocchiaro, G; Pacetti, S; Patteri, P; Peruzzi, I M; Piccolo, M; Rama, M; Zallo, A; Buzzo, A; Capra, R; Contri, R; Lo Vetere, M; Macri, M M; Monge, M R; Passaggio, S; Patrignani, C; Robutti, E; Santroni, A; Tosi, S; Brandenburg, G; Chaisanguanthum, K S; Morii, M; Wu, J; Dubitzky, R S; Marks, J; Schenk, S; Uwer, U; Bard, D J; Bhimji, W; Bowerman, D A; Dauncey, P D; Egede, U; Flack, R L; Nash, J A; Nikolich, M B; Panduro-Vazquez, W; Behera, P K; Chai, X; Charles, M J; Mallik, U; Meyer, N T; Ziegler, V; Cochran, J; Crawley, H B; Dong, L; Eyges, V; Meyer, W T; Prell, S; Rosenberg, E I; Rubin, A E; Gritsan, A V; Denig, A G; Fritsch, M; Schott, G; Arnaud, N; Davier, M; Grosdidier, G; Höcker, A; Le Diberder, F R; Lepeltier, V; Lutz, A M; Oyanguren, A; Pruvot, S; Rodier, S; Roudeau, P; Schune, M H; Stocchi, A; Wang, W F; Wormser, G; Cheng, C H; Lange, D J; Wright, D M; Chavez, C A; Forster, I J; Fry, J R; Gabathuler, E; Gamet, R; George, K A; Hutchcroft, D E; Payne, D J; Schofield, K C; Touramanis, C; Bevan, A J; Di Lodovico, F; Menges, W; Sacco, R; Cowan, G; Flächer, H U; Hopkins, D A; Jackson, P S; McMahon, T R; Ricciardi, S; Salvatore, F; Wren, A C; Davis, C L; Allison, J; Barlow, N R; Barlow, R J; Chia, Y M; Edgar, C L; Lafferty, G D; Naisbit, M T; Williams, J C; Yi, J I; Chen, C; Hulsbergen, W D; Jawahery, A; Lae, C K; Roberts, D A; Simi, G; Blaylock, G; Dallapiccola, C; Hertzbach, S S; Li, X; Moore, T B; Saremi, S; Stängle, H; Cowan, R; Sciolla, G; 
Sekula, S J; Spitznagel, M; Taylor, F; Yamamoto, R K; Kim, H; Mclachlin, S E; Patel, P M; Robertson, S H; Lazzaro, A; Lombardo, V; Palombo, F; Bauer, J M; Cremaldi, L; Eschenburg, V; Godang, R; Kroeger, R; Sanders, D A; Summers, D J; Zhao, H W; Brunet, S; Côté, D; Simard, M; Taras, P; Viaud, F B; Nicholson, H; Cavallo, N; De Nardo, Gallieno; Fabozzi, F; Gatto, C; Lista, L; Monorchio, D; Paolucci, P; Piccolo, D; Sciacca, C; Baak, M A; Raven, G; Snoek, H L; Jessop, C P; LoSecco, J M; Allmendinger, T; Benelli, G; Corwin, L A; Gan, K K; Honscheid, K; Hufnagel, D; Jackson, P D; Kagan, H; Kass, R; Rahimi, A M; Regensburger, J J; Ter-Antonian, R; Wong, Q K; Blount, N L; Brau, J E; Frey, R; Igonkina, O; Kolb, J A; Lu, M; Rahmat, R; Sinev, N B; Strom, D; Strube, J; Torrence, E; Gaz, A; Margoni, M; Morandin, M; Pompili, A; Posocco, M; Rotondo, M; Simonetto, F; Stroili, R; Voci, C; Benayoun, M; Briand, H; Chauveau, J; David, P; Del Buono, L; La Vaissière, C de; Hamon, O; Hartfiel, B L; John, M J J; Leruste, P; Malcles, J; Ocariz, J; Roos, L; Therin, G; Gladney, L; Panetta, J; Biasini, M; Covarelli, R; Angelini, C; Batignani, G; Bettarini, S; Bucci, F; Calderini, G; Carpinelli, M; Cenci, R; Forti, F; Giorgi, M A; Lusiani, A; Marchiori, G; Mazur, M A; Morganti, M; Neri, N; Paoloni, E; Rizzo, G; Walsh, J J; Haire, M; Judd, D; Wagoner, D E; Biesiada, J; Danielson, N; Elmer, P; Lau, Y P; Lü, C; Olsen, J; Smith, A J S; Telnov, A V; Bellini, F; Cavoto, G; D'Orazio, A; Del Re, D; Di Marco, E; Faccini, R; Ferrarotto, F; Ferroni, F; Gaspero, M; Li Gioi, L; Mazzoni, M A; Morganti, S; Piredda, G; Polci, F; Safai-Tehrani, F; Voena, C; Ebert, M; Schröder, H; Waldi, R; Adye, T; De Groot, N; Franek, B; Olaiya, E O; Wilson, F F; Aleksan, R; Emery, S; Gaidot, A; Ganzhur, S F; Hamel de Monchenault, G; Kozanecki, Witold; Legendre, M; Vasseur, G; Yéche, C; Zito, M; Chen, X R; Liu, H; Park, W; Purohit, M V; Wilson, J R; Allen, M T; Aston, D; Bartoldus, R; Bechtle, P; Berger, N; Claus, R; Coleman, 
J P; Convery, M R; Cristinziani, M; Dingfelder, J C; Dorfan, J; Dubois-Felsmann, G P; Dujmic, D; Dunwoodie, W M; Field, R C; Glanzman, T; Gowdy, S J; Graham, M T; Grenier, P; Halyo, V; Hast, C; Hrynóva, T; Innes, W R; Kelsey, M H; Kim, P; Leith, D W G S; Li, S; Luitz, S; Lüth, V; Lynch, H L; MacFarlane, D B; Marsiske, H; Messner, R; Müller, D R; O'Grady, C P; Ozcan, V E; Perazzo, A; Perl, M; Pulliam, T; Ratcliff, B N; Roodman, A; Salnikov, A A; Schindler, R H; Schwiening, J; Snyder, A; Stelzer, J; Su, D; Sullivan, M K; Suzuki, K; Swain, S K; Thompson, J M; Vavra, J; Van Bakel, N; Weaver, M; Weinstein, A J R; Wisniewski, W J; Wittgen, M; Wright, D H; Yarritu, A K; Yi, K; Young, C C; Burchat, P R; Edwards, A J; Majewski, S A; Petersen, B A; Roat, C; Wilden, L; Ahmed, S; Alam, M S; Bula, R; Ernst, J A; Jain, V; Pan, B; Saeed, M A; Wappler, F R; Zain, S B; Bugg, W; Krishnamurthy, M; Spanier, S M; Eckmann, R; Ritchie, J L; Satpathy, A; Schilling, C J; Schwitters, R F; Izen, J M; Lou, X C; Ye, S; Bianchi, F; Gallo, F; Gamba, D; Bomben, M; Bosisio, L; Cartaro, C; Cossutti, F; Della Ricca, G; Dittongo, S; Lanceri, L; Vitale, L; Azzolini, V; Lopez-March, N; Martínez-Vidal, F; Banerjee, S; Bhuyan, B; Brown, C M; Fortin, D; Hamano, K; Kowalewski, R V; Nugent, I M; Roney, J M; Sobie, R J; Back, J J; Harrison, P F; Latham, T E; Mohanty, G B; Pappagallo, M; Band, H R; Chen, X; Cheng, B; Dasu, S; Datta, M; Flood, K T; Hollar, J J; Kutter, P E; Mellado, B; Mihályi, A; Pan, Y; Pierini, M; Prepost, R; Wu, S L; Yu, Z; Neal, H

    2006-01-01

    We search for rare FCNC charm decays of the form $X_c^+ \to h^+ \ell \ell^{(\prime)}$, where $X_c^+$ is a charm hadron, $h$ is a pion, kaon or proton, and $\ell^{(\prime)}$ is an electron or a muon. In the pion and kaon modes, we study both $D^+$ and $D_s^+$ decays, while in the proton modes we study $\Lambda_c^+$ decays. Based on a data sample of 288 $\mathrm{fb}^{-1}$ of $e^+e^-$ collisions collected by BABAR, we set preliminary 90% confidence level limits between $4\times10^{-6}$ and $40\times10^{-6}$ for the branching fractions of the different decay modes. For most decay modes, our analysis provides a significant improvement over previous results.

  16. Semantic Based Efficient Retrieval of Relevant Resources and its Services using Search Engines

    Directory of Open Access Journals (Sweden)

    Pradeep Gurunathan

    2014-05-01

    The main objective of this paper is to propose an efficient mechanism for retrieving resources using a semantic approach and for exchanging information using Service Oriented Architecture. A framework has been developed to help users locate relevant resources and associated services through meaningful semantics. Resources are retrieved efficiently by a Modified Matchmaking Algorithm and dynamic ranking, which improves on existing search techniques. The retrieval performance of the proposed search mechanism is computed and compared with that of popular search engines such as Google and Yahoo, and shows a significant improvement.

  17. A World Wide Web Region-Based Image Search Engine

    DEFF Research Database (Denmark)

    Kompatsiaris, Ioannis; Triantafyllou, Evangelia; Strintzis, Michael G.

    2001-01-01

    information. These features along with additional information such as the URL location and the date of index procedure are stored in a database. The user can access and search this indexed content through the Web with an advanced and user friendly interface. The output of the system is a set of links...

  18. ClinicalKey: a point-of-care search engine.

    Science.gov (United States)

    Vardell, Emily

    2013-01-01

    ClinicalKey is a new point-of-care resource for health care professionals. Through controlled vocabulary, ClinicalKey offers a cross section of resources on diseases and procedures, from journals to e-books and practice guidelines to patient education. A sample search was conducted to demonstrate the features of the database, and a comparison with similar tools is presented.

  19. The MediaMill TRECVID 2006 semantic video search engine

    NARCIS (Netherlands)

    Snoek, C.G.M.; Gemert, J.C. van; Gevers, T.; Huurnink, B.; Koelma, D.C.; Liempt, M. van; Rooij, O. de; Sande, K.E.A. van de; Seinstra, F.J.; Smeulders, A.W.M.; Thean, A.H.C.; Veenman, C.J.; Worring, M.

    2006-01-01

    In this paper we describe our TRECVID 2006 experiments. The MediaMill team participated in two tasks: concept detection and search. For concept detection we use the MediaMill Challenge as experimental platform. The MediaMill Challenge divides the generic video indexing problem into a visual-only, te

  20. Application of Template Mining in Search Engines

    Institute of Scientific and Technical Information of China (English)

    王宁

    2001-01-01

    The definition, content and role of template mining and search engine are described. The popular search engines are summarized. The article also gives a minute description of the reasons and methods of applying template mining in search engines. Finally, it probes into the measures of the use of template mining in the Chinese search engines.

  1. Optimal Threshold Control by the Robots of Web Search Engines with Obsolescence of Documents

    CERN Document Server

    Avrachenkov, Konstantin; Klimenok, Valentina; Nain, Philippe; Semenova, Olga; 10.1016/j.comnet.2011.01.013

    2012-01-01

    A typical web search engine consists of three principal parts: crawling engine, indexing engine, and searching engine. The present work aims to optimize the performance of the crawling engine. The crawling engine finds new web pages and updates web pages existing in the database of the web search engine. The crawling engine has several robots collecting information from the Internet. We first calculate various performance measures of the system (e.g., probability of arbitrary page loss due to the buffer overflow, probability of starvation of the system, the average waiting time in the buffer). Intuitively, we would like to avoid system starvation and at the same time to minimize the information loss. We formulate the problem as a multi-criteria optimization problem, attributing a weight to each criterion. We solve it in the class of threshold policies. We consider a very general web page arrival process modeled by a Batch Marked Markov Arrival Process and a very general service time modeled by Phase-type dis...

  2. `Googling' Terrorists: Are Northern Irish Terrorists Visible on Internet Search Engines?

    Science.gov (United States)

    Reilly, P.

    In this chapter, the analysis suggests that Northern Irish terrorists are not visible on Web search engines when net users employ conventional Internet search techniques. Editors of mass media organisations traditionally have had the ability to decide whether a terrorist atrocity is `newsworthy,' controlling the `oxygen' supply that sustains all forms of terrorism. This process, also known as `gatekeeping,' is often influenced by the norms of social responsibility, or alternatively, with regard to the interests of the advertisers and corporate sponsors that sustain mass media organisations. The analysis presented in this chapter suggests that Internet search engines can also be characterised as `gatekeepers,' albeit without the ability to shape the content of Websites before it reaches net users. Instead, Internet search engines give priority retrieval to certain Websites within their directory, pointing net users towards these Websites rather than others on the Internet. Net users are more likely to click on links to the more `visible' Websites on Internet search engine directories, these sites invariably being the highest `ranked' in response to a particular search query. A number of factors including the design of the Website and the number of links to external sites determine the `visibility' of a Website on Internet search engines. The study suggests that Northern Irish terrorists and their sympathisers are unlikely to achieve a greater degree of `visibility' online than they enjoy in the conventional mass media through the perpetration of atrocities. Although these groups may have a greater degree of freedom on the Internet to publicise their ideologies, they are still likely to be speaking to the converted or members of the press. Although it is easier to locate Northern Irish terrorist organisations on Internet search engines by linking in via ideology, ideological description searches, such as `Irish Republican' and `Ulster Loyalist,' are more likely to

  3. NextSearch: A Search Engine for Mass Spectrometry Data against a Compact Nucleotide Exon Graph.

    Science.gov (United States)

    Kim, Hyunwoo; Park, Heejin; Paek, Eunok

    2015-07-02

    Proteogenomics research has been using six-frame translation of the whole genome or amino acid exon graphs to overcome the limitations of reference protein sequence database; however, six-frame translation is not suitable for annotating genes that span over multiple exons, and amino acid exon graphs are not convenient to represent novel splice variants and exon skipping events between exons of incompatible reading frames. We propose a proteogenomic pipeline NextSearch (Nucleotide EXon-graph Transcriptome Search) that is based on a nucleotide exon graph. This pipeline consists of constructing a compact nucleotide exon graph that systematically incorporates novel splice variations and a search tool that identifies peptides by directly searching the nucleotide exon graph against tandem mass spectra. Because our exon graph stores nucleotide sequences, it can easily represent novel splice variations and exon skipping events between incompatible reading frame exons. Searching for peptide identification is performed against this nucleotide exon graph, without converting it into a protein sequence in FASTA format, achieving an order of magnitude reduction in the size of the sequence database storage. NextSearch outputs the proteome-genome/transcriptome mapping results in a general feature format (GFF) file, which can be visualized by public tools such as the UCSC Genome Browser.

  4. Search engines: a first step to finding information: preliminary findings from a study of observed searches

    Directory of Open Access Journals (Sweden)

    A.D. Madden

    2007-01-01

    Introduction. This working paper presents the preliminary results of a study into the search behaviour of the general public. It reports on the findings of the first six months of an eighteen-month data collection exercise. Method. Detailed observations were made of nine volunteers engaged in a variety of search tasks. Some of the tasks were self-selected; others were set by the researchers. Most tasks, however, were designed to enable the volunteers to search within their own areas of interest and expertise. Analyses. A set of 'search dimensions' is proposed and qualitative findings based on these are presented. In addition, some initial quantitative findings are discussed. Result. Findings to date suggest that the best search strategy is a combination of simplicity and scrutiny. Volunteers who entered a few search terms but then carefully studied the results appeared to be more successful than those who attempted to be prescriptive and entered a long series of terms.

  5. An empirical study on website usability elements and how they affect search engine optimisation

    Directory of Open Access Journals (Sweden)

    Eugene B. Visser

    2011-03-01

    The primary objective of this research project was to identify and investigate the website usability attributes which are in contradiction with search engine optimisation elements. The secondary objective was to determine if these usability attributes affect conversion. Although the literature review identifies the contradictions, experts disagree about their existence. An experiment was conducted, whereby the conversion and/or traffic ratio results of an existing control website were compared to a usability-designed version of the control website, namely the experimental website. All optimisation elements were ignored, thus implementing only usability. The results clearly show that inclusion of the usability attributes positively affects conversion, indicating that usability is a prerequisite for effective website design. Search engine optimisation is also a prerequisite, for the very reason that if a website does not rank on the first page of the search engine result page for a given keyword, then that website might as well not exist. According to this empirical work, usability is in contradiction to search engine optimisation best practices. Therefore the two need to be weighed up in terms of their importance to search engines and visitors.

  6. Study of Search Engine Transaction Logs Shows Little Change in How Users use Search Engines. A review of: Jansen, Bernard J., and Amanda Spink. “How Are We Searching the World Wide Web? A Comparison of Nine Search Engine Transaction Logs.” Information Processing & Management 42.1 (2006): 248‐263.

    Directory of Open Access Journals (Sweden)

    David Hook

    2006-09-01

    Objective – To examine the interactions between users and search engines, and how they have changed over time. Design – Comparative analysis of search engine transaction logs. Setting – Nine major analyses of search engine transaction logs. Subjects – Nine web search engine studies (4 European, 5 American) over a seven‐year period, covering the search engines Excite, Fireball, AltaVista, BWIE and AllTheWeb. Methods – The results from individual studies are compared by year of study for percentages of single query sessions, one‐term queries, operator (and, or, not, etc.) usage and single result page viewing. As well, the authors group the search queries into eleven different topical categories and compare how the breakdown has changed over time. Main Results – Based on the percentage of single query sessions, it does not appear that the complexity of interactions has changed significantly for either the U.S.‐based or the European‐based search engines. As well, there was little change observed in the percentage of one‐term queries over the years of study for either the U.S.‐based or the European‐based search engines. Few users (generally less than 20%) use Boolean or other operators in their queries, and these percentages have remained relatively stable. One area of noticeable change is in the percentage of users viewing only one results page, which has increased over the years of study. Based on the studies of the U.S.‐based search engines, the topical categories of ‘People, Place or Things’ and ‘Commerce, Travel, Employment or Economy’ are becoming more popular, while the categories of ‘Sex and Pornography’ and ‘Entertainment or Recreation’ are declining. Conclusions – The percentage of users viewing only one results page increased during the years of the study, while the percentages of single query sessions, one‐term sessions and operator usage remained stable. The increase in single result page viewing

  7. Enrich the E-publishing Community Website with Search Engine Optimization Technique

    Directory of Open Access Journals (Sweden)

    Vadivel Rangasamy

    2011-09-01

    The Internet plays a vital role in online business. Every business needs to present its information to clients or end users, and search engines index millions of pages. Search engine optimization techniques have to be implemented for both static and dynamic web applications. Creating search-engine-optimized content for a static web application (one whose content does not change until the site is re-hosted) and keeping up with search engine optimization rules poses no problem. Dynamic content, however, poses a few significant challenges. The aim is to overcome these challenges so that a fully functional dynamic site is optimized as much as a static site can be, and so that users quickly get the information they search for. To that end we use a few search engine optimization methods for dynamic web applications, such as user-friendly URLs, a URL redirector and HTML Generic. Both internal and external elements of a site affect the way it is ranked in any given search engine, so all of these elements should be taken into consideration. We implement these concepts in the E-publishing Community Website, which has a large number of dynamic fields with dynamic validations, with the help of XML, XSL and JavaScript. A database plays a major role in accomplishing this functionality. We can use 3D (static, dynamic and meta) database structures. One of the advantages of the XML/XSLT combination is the ability to separate content from presentation. A data source can return an XML document; then, by using an XSLT, the data can be transformed into whatever HTML is needed, based on the data in the XML document. The flexibility of XML/XSLT can be combined with the power of ASP.NET server/client controls by using an XSLT to generate the server/client controls dynamically, thus leveraging the best of both worlds.
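
    The abstract does not say how its user-friendly URLs are built. As a minimal sketch of the general idea, assuming a hypothetical `/articles/<id>/<slug>` URL scheme and helper names invented for this example, a dynamic page's title can be turned into a readable path segment like this:

```python
import re
import unicodedata

def slugify(title):
    """Turn a page title into a lowercase, hyphen-separated URL segment."""
    # Decompose accented characters, then drop anything that is not ASCII.
    text = unicodedata.normalize("NFKD", title)
    text = text.encode("ascii", "ignore").decode("ascii").lower()
    # Collapse every run of non-alphanumeric characters into one hyphen.
    return re.sub(r"[^a-z0-9]+", "-", text).strip("-")

def friendly_url(base, article_id, title):
    """Build a readable URL instead of a query string such as ?id=42."""
    return f"{base.rstrip('/')}/articles/{article_id}/{slugify(title)}"

url = friendly_url("https://example.org/", 42, "Search Engine Optimization: A Primer!")
```

    Keeping the numeric identifier in the path lets the server look the record up by id, while the slug carries the keywords that a crawler and a reader can both see.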

  8. NOVEL IMPLEMENTATION OF SEARCH ENGINE FOR TELUGU DOCUMENTS WITH SYLLABLE N-GRAM MODEL

    Directory of Open Access Journals (Sweden)

    Dr. B. Padmaja Rani

    2010-08-01

    As technology advances, the number of documents posted on the web is increasing enormously, and users need an application that retrieves the information they want efficiently. Search engines are the key to finding specific information on the vast expanse of the World Wide Web. Without sophisticated search engines, it would be virtually impossible to locate anything on the Web without knowing a specific URL. A search engine is a program that searches documents for specified keywords and returns a list of the documents in which the keywords were found. The keywords are given as a query, and the search engine uses certain algorithms to list the documents that match them. The search engine also ranks the documents so that the more relevant documents are placed first in the results. Recently, there has been an enormous increase in non-English documents posted on the web. The largest share of these is in Chinese, while Indian-language text documents are also gaining popularity. Indian-language text documents need to be organized so that retrieval with a query is very fast. Telugu is the third most spoken language in India and one of the fifteen most spoken languages in the world. It is the official language of the state of Andhra Pradesh. The number of Telugu text documents is also increasing rapidly. Because of the complexity of the Telugu language, it is very difficult to search for and retrieve the documents needed. Hence, an application is needed that retrieves such documents efficiently.
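
    The keyword matching and ranking described above can be sketched with a small n-gram inverted index. This is an illustration only: it uses overlapping character n-grams as a stand-in for the syllable n-grams named in the title (splitting Telugu text into syllables needs script-aware rules that are not shown here), and all function names and sample documents are assumptions.

```python
from collections import Counter, defaultdict

def ngrams(text, n=2):
    """Return overlapping character n-grams of each whitespace-separated token."""
    grams = []
    for token in text.split():
        if len(token) < n:
            grams.append(token)  # keep short tokens whole
        else:
            grams.extend(token[i:i + n] for i in range(len(token) - n + 1))
    return grams

def build_index(docs, n=2):
    """Map each n-gram to a Counter of {doc_id: number of occurrences}."""
    index = defaultdict(Counter)
    for doc_id, text in docs.items():
        for gram in ngrams(text, n):
            index[gram][doc_id] += 1
    return index

def search(index, query, n=2):
    """Rank documents by how many occurrences of the query's n-grams they hold."""
    scores = Counter()
    for gram in set(ngrams(query, n)):
        for doc_id, count in index.get(gram, {}).items():
            scores[doc_id] += count
    return [doc_id for doc_id, _ in scores.most_common()]

docs = {"a": "search engine", "b": "steam engine design", "c": "cooking"}
index = build_index(docs)
ranked = search(index, "search")
```

    Because matching works on fragments of words, a document can be retrieved even when the query and the document use different inflected forms, which is one reason n-gram indexing is attractive for morphologically rich languages.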

  9. Search Engines and Search Technologies for Web-based Text Data

    Institute of Scientific and Technical Information of China (English)

    李勇

    2001-01-01

    This paper describes the functions, characteristics and operating principles of search engines based on Web text, and the searching and data mining technologies for Web-based text information. Methods of computer-aided text clustering and abstracting are also given. Finally, it gives some guidelines for the assessment of searching quality.

  10. An autonomous organic reaction search engine for chemical reactivity

    Science.gov (United States)

    Dragone, Vincenza; Sans, Victor; Henson, Alon B.; Granda, Jaroslaw M.; Cronin, Leroy

    2017-06-01

    The exploration of chemical space for new reactivity, reactions and molecules is limited by the need for separate work-up-separation steps searching for molecules rather than reactivity. Herein we present a system that can autonomously evaluate chemical reactivity within a network of 64 possible reaction combinations and aims for new reactivity, rather than a predefined set of targets. The robotic system combines chemical handling, in-line spectroscopy and real-time feedback and analysis with an algorithm that is able to distinguish and select the most reactive pathways, generating a reaction selection index (RSI) without need for separate work-up or purification steps. This allows the automatic navigation of a chemical network, leading to previously unreported molecules while needing only to do a fraction of the total possible reactions without any prior knowledge of the chemistry. We show the RSI correlates with reactivity and is able to search chemical space using the most reactive pathways.

  11. An autonomous organic reaction search engine for chemical reactivity.

    Science.gov (United States)

    Dragone, Vincenza; Sans, Victor; Henson, Alon B; Granda, Jaroslaw M; Cronin, Leroy

    2017-06-09

    The exploration of chemical space for new reactivity, reactions and molecules is limited by the need for separate work-up-separation steps searching for molecules rather than reactivity. Herein we present a system that can autonomously evaluate chemical reactivity within a network of 64 possible reaction combinations and aims for new reactivity, rather than a predefined set of targets. The robotic system combines chemical handling, in-line spectroscopy and real-time feedback and analysis with an algorithm that is able to distinguish and select the most reactive pathways, generating a reaction selection index (RSI) without need for separate work-up or purification steps. This allows the automatic navigation of a chemical network, leading to previously unreported molecules while needing only to do a fraction of the total possible reactions without any prior knowledge of the chemistry. We show the RSI correlates with reactivity and is able to search chemical space using the most reactive pathways.

  12. A real-time all-atom structural search engine for proteins.

    Science.gov (United States)

    Gonzalez, Gabriel; Hannigan, Brett; DeGrado, William F

    2014-07-01

    Protein designers use a wide variety of software tools for de novo design, yet their repertoire still lacks a fast and interactive all-atom search engine. To solve this, we have built the Suns program: a real-time, atomic search engine integrated into the PyMOL molecular visualization system. Users build atomic-level structural search queries within PyMOL and receive a stream of search results aligned to their query within a few seconds. This instant feedback cycle enables a new "designability"-inspired approach to protein design where the designer searches for and interactively incorporates native-like fragments from proven protein structures. We demonstrate the use of Suns to interactively build protein motifs, tertiary interactions, and to identify scaffolds compatible with hot-spot residues. The official web site and installer are located at http://www.degradolab.org/suns/ and the source code is hosted at https://github.com/godotgildor/Suns (PyMOL plugin, BSD license), https://github.com/Gabriel439/suns-cmd (command line client, BSD license), and https://github.com/Gabriel439/suns-search (search engine server, GPLv2 license).

  13. A real-time all-atom structural search engine for proteins.

    Directory of Open Access Journals (Sweden)

    Gabriel Gonzalez

    2014-07-01

    Protein designers use a wide variety of software tools for de novo design, yet their repertoire still lacks a fast and interactive all-atom search engine. To solve this, we have built the Suns program: a real-time, atomic search engine integrated into the PyMOL molecular visualization system. Users build atomic-level structural search queries within PyMOL and receive a stream of search results aligned to their query within a few seconds. This instant feedback cycle enables a new "designability"-inspired approach to protein design where the designer searches for and interactively incorporates native-like fragments from proven protein structures. We demonstrate the use of Suns to interactively build protein motifs, tertiary interactions, and to identify scaffolds compatible with hot-spot residues. The official web site and installer are located at http://www.degradolab.org/suns/ and the source code is hosted at https://github.com/godotgildor/Suns (PyMOL plugin, BSD license), https://github.com/Gabriel439/suns-cmd (command line client, BSD license), and https://github.com/Gabriel439/suns-search (search engine server, GPLv2 license).

  14. The search engine manipulation effect (SEME) and its possible impact on the outcomes of elections.

    Science.gov (United States)

    Epstein, Robert; Robertson, Ronald E

    2015-08-18

    Internet search rankings have a significant impact on consumer choices, mainly because users trust and choose higher-ranked results more than lower-ranked results. Given the apparent power of search rankings, we asked whether they could be manipulated to alter the preferences of undecided voters in democratic elections. Here we report the results of five relevant double-blind, randomized controlled experiments, using a total of 4,556 undecided voters representing diverse demographic characteristics of the voting populations of the United States and India. The fifth experiment is especially notable in that it was conducted with eligible voters throughout India in the midst of India's 2014 Lok Sabha elections just before the final votes were cast. The results of these experiments demonstrate that (i) biased search rankings can shift the voting preferences of undecided voters by 20% or more, (ii) the shift can be much higher in some demographic groups, and (iii) search ranking bias can be masked so that people show no awareness of the manipulation. We call this type of influence, which might be applicable to a variety of attitudes and beliefs, the search engine manipulation effect. Given that many elections are won by small margins, our results suggest that a search engine company has the power to influence the results of a substantial number of elections with impunity. The impact of such manipulations would be especially large in countries dominated by a single search engine company.

  15. Weight calculation for search engines and quality evaluation for ranking of search results

    Institute of Scientific and Technical Information of China (English)

    李超; 谢坤武

    2014-01-01

    In integrating the search results returned by multiple member search engines, the ranking of the merged results largely determines the service quality of a meta-search engine. Current techniques for integrating search results effectively mainly combine the search query, document content, the initial ranking and/or weights assigned to the member search engines. When weights are used, they are usually assigned subjectively, based on the experience of users and technical staff, and so cannot reflect users' real search preferences. This paper therefore proposes a method that dynamically assigns weights to the member search engines by mining users' searching and navigation behavior. It analyzes users' log data on clicking and downloading search results to obtain the degree of matching between search results and users' navigation. The statistical results demonstrate that the method is feasible and effective for ranking multi-engine search results.
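
    The idea of deriving member-engine weights from user behavior and using them to merge result lists can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the click-through-rate weighting, the reciprocal-rank scoring, and the names `engine_weights` and `merge_results` are all assumptions made for the example.

```python
from collections import defaultdict


def engine_weights(click_log):
    """Derive one weight per member engine from a click log.

    click_log is a list of (engine_name, was_clicked) pairs, one per
    result shown to a user. The weight is the engine's click-through
    rate, normalised so that all weights sum to 1 (an assumed scheme).
    """
    shown = defaultdict(int)
    clicked = defaultdict(int)
    for engine, was_clicked in click_log:
        shown[engine] += 1
        if was_clicked:
            clicked[engine] += 1
    rates = {engine: clicked[engine] / shown[engine] for engine in shown}
    total = sum(rates.values())
    if total == 0:
        # No clicks at all: fall back to equal weights.
        return {engine: 1 / len(rates) for engine in rates}
    return {engine: rate / total for engine, rate in rates.items()}


def merge_results(result_lists, weights):
    """Merge ranked URL lists from several engines into one ranking.

    Each URL scores weight(engine) / rank for every engine that
    returned it; URLs are sorted by total score, ties broken by URL.
    """
    scores = defaultdict(float)
    for engine, urls in result_lists.items():
        for rank, url in enumerate(urls, start=1):
            scores[url] += weights.get(engine, 0.0) / rank
    return sorted(scores, key=lambda url: (-scores[url], url))


if __name__ == "__main__":
    log = [("A", True), ("A", True), ("A", False), ("A", False),
           ("B", True), ("B", False), ("B", False), ("B", False)]
    w = engine_weights(log)
    print(merge_results({"A": ["x", "y", "z"], "B": ["z", "x"]}, w))
```

    Because the weights are recomputed from the log, they follow user behavior over time instead of being fixed by hand.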

  16. Measuring the Utilization of On-Page Search Engine Optimization in Selected Domain

    Directory of Open Access Journals (Sweden)

    Goran Matošević

    2015-12-01

    Search engine optimization (SEO) techniques involve "on-page" and "off-page" actions taken by web developers and SEO specialists with the aim of increasing the ranking of web pages in search engine results pages (SERP) by following recommendations from major search engine companies. In this paper we explore the possibility of creating a metric for evaluating the on-page SEO of a website. A novel "k-rank" metric is proposed which takes into account not only the presence of certain tags in the HTML of a page, but also how those tags are used with selected keywords in a selected domain. The "k-rank" metric is tested in the domain of education by inspecting 20 university websites and comparing the results with expert scores. The overview of results showed that "k-rank" can be used as a metric for on-page SEO.
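
    The abstract does not give the "k-rank" formula, so the sketch below is not that metric. It only illustrates the general idea of checking whether a selected keyword is used inside particular HTML tags. The choice of tags (title, h1, meta description), the equal weighting, and the names `TagTextCollector` and `keyword_tag_score` are assumptions.

```python
from html.parser import HTMLParser


class TagTextCollector(HTMLParser):
    """Collect the text of <title>, <h1> and the meta description."""

    TRACKED = {"title", "h1"}

    def __init__(self):
        super().__init__()
        self.text = {"title": "", "h1": "", "meta_description": ""}
        self._open = []  # tracked tags we are currently inside

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in self.TRACKED:
            self._open.append(tag)
        elif tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.text["meta_description"] += attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag in self._open:
            self._open.remove(tag)

    def handle_data(self, data):
        for tag in self._open:
            self.text[tag] += data


def keyword_tag_score(html, keyword):
    """Fraction of the inspected tags whose text contains the keyword."""
    collector = TagTextCollector()
    collector.feed(html)
    needle = keyword.lower()
    hits = [needle in text.lower() for text in collector.text.values()]
    return sum(hits) / len(hits)


if __name__ == "__main__":
    page = ('<html><head><title>Example University</title>'
            '<meta name="description" content="Study at our university">'
            '</head><body><h1>Welcome</h1></body></html>')
    print(keyword_tag_score(page, "university"))
```

    A score like this could be compared with expert ratings in the way the abstract describes, but any such comparison would need the paper's actual tag list and weights.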

  17. Search Engine Visibility and Language Availability of Travel and Tourism Websites in Serbia

    Directory of Open Access Journals (Sweden)

    Uglješa Stankov

    2009-01-01

    The visibility of websites on search engines is, on the one hand, a basic demand of Internet users and, on the other hand, a reflection of the way website owners create and maintain them. In a large number of tourist sites, the importance of the visibility of domestic tourist sites on the leading search engines is emphasized. The authors have found that travel and tourism websites in Serbia are most visible on the local search engines. For foreign users, in addition to good visibility, the availability of website content in foreign languages is also important. Only half of the tourist websites in Serbia are available in a foreign language. The research aims to point out the poor visibility and limited availability of foreign-language content of domestic tourist sites, and the need for their improvement.

  18. Segmentation Based Approach to Dynamic Page Construction from Search Engine Results

    CERN Document Server

    Kuppusamy, K S

    2012-01-01

    The results rendered by search engines are mostly a linear snippet list. With the prolific increase in the dynamism of web pages, there is a need for enhanced result lists from search engines in order to cope with the expectations of users. This paper proposes a model for dynamic construction of a resultant page from various results fetched by the search engine, based on the web page segmentation approach. With the incorporation of personalization through a user profile during candidate segment selection, the enriched resultant page is constructed. The benefits of this approach include instant, one-shot navigation to relevant portions of various result items, in contrast to a linear page-by-page visit approach. The experiments conducted on the prototype model with various levels of users quantify the improvements in terms of the amount of relevant information fetched.

  19. Segmentation Based Approach to Dynamic Page Construction from Search Engine Results

    Directory of Open Access Journals (Sweden)

    K.S. Kuppusamy,

    2011-03-01

    The results rendered by search engines are mostly a linear snippet list. With the prolific increase in the dynamism of web pages, there is a need for enhanced result lists from search engines in order to cope with the expectations of users. This paper proposes a model for dynamic construction of a resultant page from various results fetched by the search engine, based on the web page segmentation approach. With the incorporation of personalization through a user profile during candidate segment selection, the enriched resultant page is constructed. The benefits of this approach include instant, one-shot navigation to relevant portions of various result items, in contrast to a linear page-by-page visit approach. The experiments conducted on the prototype model with various levels of users quantify the improvements in terms of the amount of relevant information fetched.

  20. A Cooperative Schema between Web Sever and Search Engine for Improving Freshness of Web Repository

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    Because the web is huge and web pages are updated frequently, a search engine has to refresh the web pages in its index periodically. This is extremely resource consuming because the search engine needs to crawl the web and download web pages to refresh its index. Building on existing web refreshing technologies, we present a cooperative schema between web server and search engine for maintaining the freshness of the web repository. The web server provides meta-data, defined in XML, that describes its web sites. Before updating a web page, the crawler visits the meta-data files. If the meta-data indicates that the page has not been modified, the crawler does not update it. The schema can therefore save bandwidth. A primitive model based on the schema is implemented, and the cost and efficiency of the schema are analyzed.
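
    The crawler-side check described above can be sketched as follows. The abstract does not specify the meta-data format, so the XML layout (a `<site>` element with `<page url="..." modified="..."/>` children), the attribute names and the function `pages_to_refresh` are hypothetical. Timestamps are assumed to be ISO 8601 strings in a single fixed format, so that plain string comparison orders them correctly.

```python
import xml.etree.ElementTree as ET


def pages_to_refresh(metadata_xml, indexed_at):
    """Return the URLs the crawler still needs to download.

    metadata_xml: meta-data document published by the web server
                  (hypothetical layout, see the lead-in).
    indexed_at:   dict mapping URL -> timestamp of the copy that the
                  search engine already holds.
    A page is refreshed only if it was never indexed or the server
    reports a modification newer than the indexed copy.
    """
    root = ET.fromstring(metadata_xml)
    stale = []
    for page in root.iter("page"):
        url = page.get("url")
        modified = page.get("modified")
        last_indexed = indexed_at.get(url)
        if last_indexed is None or modified > last_indexed:
            stale.append(url)
    return stale


if __name__ == "__main__":
    metadata = (
        '<site>'
        '<page url="/a" modified="2006-01-02T00:00:00Z"/>'
        '<page url="/b" modified="2006-01-01T00:00:00Z"/>'
        '<page url="/c" modified="2006-01-03T00:00:00Z"/>'
        '</site>'
    )
    index = {"/a": "2006-01-01T00:00:00Z", "/b": "2006-01-01T00:00:00Z"}
    print(pages_to_refresh(metadata, index))
```

    Only one small meta-data file is fetched per site, and unchanged pages are skipped, which is where the bandwidth saving would come from.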

  1. An Intelligent Search Engine for WWW

    Institute of Scientific and Technical Information of China (English)

    廖明宏; 程光明; 吴翔虎

    2001-01-01

    To avoid the information overload and loss of useful information associated with traditional search engines, a new intelligent search engine is designed using new AI techniques such as ontologies, heuristic retrieval and user goals. With this search engine, the retrieved information can be turned into knowledge that is useful to users.

  2. Research on Search Engine Optimization

    Institute of Scientific and Technical Information of China (English)

    殷存举

    2014-01-01

    Search engine optimization is one of the effective methods for promoting enterprise websites online, and it is valued and used by more and more websites. The paper analyzes in detail the factors that constrain search engine optimization and puts forward search engine optimization strategies.

  3. Global polar geospatial information service retrieval based on search engine and ontology reasoning

    Science.gov (United States)

    Chen, Nengcheng; E, Dongcheng; Di, Liping; Gong, Jianya; Chen, Zeqiang

    2007-01-01

    To improve the access precision of polar geospatial information services on the web, a new methodology for retrieving global spatial information services based on geospatial service search and ontology reasoning is proposed. Geospatial service search is used to find coarse services on the web, and ontology reasoning is used to refine the coarse results. The proposed framework includes standardized distributed geospatial web services, a geospatial service search engine, an extended UDDI registry, and a multi-protocol geospatial information service client. Key technologies addressed include service discovery based on a search engine, and service ontology modeling and reasoning in the Antarctic geospatial context. Finally, an Antarctica multi-protocol OWS portal prototype based on the proposed methodology is introduced.

  4. A New Approach to Design Graph Based Search Engine for Multiple Domains Using Different Ontologies

    CERN Document Server

    Mukhopadhyay, Debajyoti

    2011-01-01

    Search engines have become a major tool for finding information on the World Wide Web (WWW). While searching the huge digital library available on the WWW, every effort is made to retrieve the most relevant results. However, the majority of Web pages are in HTML format, and there are no tags that tell the crawler which specific domain a page belongs to. To find more relevant results, we use an ontology for that particular domain. If we are working with multiple domains, we use multiple ontologies. To design a domain-specific search engine for multiple domains, the crawler must crawl through the domain-specific Web pages in the WWW according to the predefined ontologies.

  5. Andromeda - a peptide search engine integrated into the MaxQuant environment

    DEFF Research Database (Denmark)

    Cox, Jurgen; Neuhauser, Nadin; Michalski, Annette;

    2011-01-01

    A key step in mass spectrometry (MS)-based proteomics is the identification of peptides in sequence databases by their fragmentation spectra. Here we describe Andromeda, a novel peptide search engine using a probabilistic scoring model. On proteome data Andromeda performs as well as Mascot......, a widely used commercial search engine, as judged by sensitivity and specificity analysis based on target decoy searches. Furthermore, it can handle data with arbitrarily high fragment mass accuracy, is able to assign and score complex patterns of post-translational modifications, such as highly...... enables analysis of large data sets in a simple analysis workflow on a desktop computer. For searching individual spectra Andromeda is also accessible via a web server. We demonstrate the flexibility of the system by implementing the capability to identify co-fragmented peptides, significantly improving...

  6. Web Image Retrieval Search Engine based on Semantically Shared Annotation

    Directory of Open Access Journals (Sweden)

    Alaa Riad

    2012-03-01

    This paper presents a new majority voting technique that combines the two basic modalities of Web images, textual and visual features, in a re-annotation and search based framework. The proposed framework treats each web page as a voter on the relatedness of a keyword to the web image. The approach is not merely a combination of low-level image features and textual features; it also takes into consideration the semantic meaning of each keyword, which is expected to enhance retrieval accuracy. The approach is used not only to enhance the retrieval accuracy of web images but also to annotate unlabeled images.

  7. Google's pagerank and beyond the science of search engine rankings

    CERN Document Server

    Langville, Amy N

    2006-01-01

    Why doesn't your home page appear on the first page of search results, even when you query your own name? How do other Web pages always appear at the top? What creates these powerful rankings? And how? The first book ever about the science of Web page rankings, Google's PageRank and Beyond supplies the answers to these and other questions and more. The book serves two very different audiences: the curious science reader and the technical computational reader. The chapters build in mathematical sophistication, so that the first five are accessible to the general academic reader. While other cha

  8. Clinical evaluation of using semantic searching engine for radiological imaging services in RIS-integrated PACS

    Science.gov (United States)

    Ling, Tonghui; Zhang, Kai; Yang, Yuanyuan; Hua, Yanqing; Zhang, Jianguo

    2015-03-01

    We designed a semantic searching engine (SSE) for radiological imaging to search both reports and images in a RIS-integrated PACS environment. In this presentation, we present evaluation results for this SSE: how it affects radiologists' reporting behavior for different kinds of examinations, and how it improves the retrieval and use of historical images in RIS-integrated PACS.

  9. SUSTAINABLE ALLOY DESIGN: SEARCHING FOR RARE EARTH ELEMENT ALTERNATIVES THROUGH CRYSTAL ENGINEERING

    Science.gov (United States)

    2016-02-26

    ... Force Base, Dayton OH, March 20th 2013. 23. Informatics Aided Discovery of Energy Materials, 2013 Kentucky Workshop on Renewable Energy and Energy ... AFRL-AFOSR-VA-TR-2016-0122. Sustainable Alloy Design: Searching for Rare Earth Element Alternatives through Crystal Engineering. Krishna Rajan, IOWA ...

  10. Using internet search engines and library catalogs to locate toxicology information.

    Science.gov (United States)

    Wukovitz, L D

    2001-01-12

    The increasing importance of the Internet demands that toxicologists become acquainted with its resources. To find information, researchers must be able to use Internet search engines, directories, subject-oriented websites, and library catalogs effectively. The article explains these resources, explores their benefits and weaknesses, and identifies skills that help the researcher to improve search results and critically evaluate sources for their relevancy, validity, accuracy, and timeliness.

  11. ProThes: Thesaurus-based Meta-Search Engine for a Specific Application Domain

    OpenAIRE

    Braslavski, P.; Alshanski, G.; Shishkin, A.; П.И. Браславский

    2004-01-01

    In this poster we introduce ProThes, a pilot meta-search engine (MSE) for a specific application domain. ProThes combines three approaches: meta-search, graphical user interface (GUI) for query specification, and thesaurus-based query techniques. ProThes attempts to employ domain-specific knowledge, which is represented by both a conceptual thesaurus and results ranking heuristics. Since the knowledge representation is separated from the MSE core, adjusting the system to a specific domain is ...

  12. Conventional alternating-current generators and engine generator sets

    Energy Technology Data Exchange (ETDEWEB)

    Segaser, C.L.

    1978-04-01

    Available data and techniques relevant to the selection and analysis of appropriate electrical generating equipment for application in the ICES program are presented. Of the general classes of commercially available a-c generators, the synchronous, rotating field alternator is most suited to ICES applications, and the focus of this technology evaluation. Conventional 60-Hz, alternating-current generators, with standard ratings ranging from 1.25 kVA to 10,000 kVA at voltages from 125 single-phase to 14,400 volts three-phase and speeds up to 1800 rpm are covered. Technical data for representative diesel engine-generator sets for continuous prime power ratings up to 6445 kW are presented. Approximate 1976 costs of standard electrical generating equipment are given for: (1) standard conventional alternating current generators and (2) packaged engine-generator sets. The data indicate a decrease in unit costs as the power ratings increase, with the cost of the slow-speed units somewhat greater than that of the higher speed units. Maintenance data for a typical total energy plant presently in operation indicate that the average cost of maintenance amounts to 41 cents/kWh. A plot of available data also indicates a trend to decreasing operating costs with increasing unit size.

  13. In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.

    Science.gov (United States)

    Audain, Enrique; Uszkoreit, Julian; Sachsenberg, Timo; Pfeuffer, Julianus; Liang, Xiao; Hermjakob, Henning; Sanchez, Aniel; Eisenacher, Martin; Reinert, Knut; Tabb, David L; Kohlbacher, Oliver; Perez-Riverol, Yasset

    2017-01-06

    In mass spectrometry-based shotgun proteomics, protein identifications are usually the desired result. However, most of the analytical methods are based on the identification of reliable peptides and not the direct identification of intact proteins. Thus, assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is a critical step in proteomics research. Currently, different protein inference algorithms and tools are available for the proteomics community. Here, we evaluated five software tools for protein inference (PIA, ProteinProphet, Fido, ProteinLP, MSBayesPro) using three popular database search engines: Mascot, X!Tandem, and MS-GF+. All the algorithms were evaluated using a highly customizable KNIME workflow using four different public datasets with varying complexities (different sample preparation, species and analytical instruments). We defined a set of quality control metrics to evaluate the performance of each combination of search engines, protein inference algorithm, and parameters on each dataset. We show that the results for complex samples vary not only regarding the actual numbers of reported protein groups but also concerning the actual composition of groups. Furthermore, the robustness of reported proteins when using databases of differing complexities is strongly dependent on the applied inference algorithm. Finally, merging the identifications of multiple search engines does not necessarily increase the number of reported proteins, but does increase the number of peptides per protein and thus can generally be recommended.

  14. Using Exclusive Web Crawlers to Store Better Results in Search Engines' Database

    Directory of Open Access Journals (Sweden)

    Ali Tourani

    2013-05-01

    Crawler-based search engines are the most widely used search engines among web and Internet users. Their operation involves web crawling, storing in a database, ranking, indexing and displaying results to the user. Notably, because web sites change ever more frequently, search engines incur high time and transfer costs in checking whether each page exists in the database while crawling, in updating the database, and in re-checking its existence in every crawling operation. "Exclusive Web Crawler" proposes guidelines for crawling features, links, media and other elements, and for storing the crawling results in a certain table in its database on the web. Search engines then store each site's tables in their databases and apply their ranking to them. Thus the accuracy of the data in every table (and its being up to date) is ensured, and no 404 result is shown in search results, since this crawler in fact crawls data entered by the webmaster and the database stores whatever the webmaster wants to display.

  15. Estimating search engine index size variability: a 9-year longitudinal study.

    Science.gov (United States)

    van den Bosch, Antal; Bogers, Toine; de Kunder, Maurice

    One of the determining factors of the quality of Web search engines is the size of their index. In addition to its influence on search result quality, the size of the indexed Web can also tell us something about which parts of the WWW are directly accessible to the everyday user. We propose a novel method of estimating the size of a Web search engine's index by extrapolating from document frequencies of words observed in a large static corpus of Web pages. In addition, we provide a unique longitudinal perspective on the size of Google and Bing's indices over a nine-year period, from March 2006 until January 2015. We find that index size estimates of these two search engines tend to vary dramatically over time, with Google generally possessing a larger index than Bing. This result raises doubts about the reliability of previous one-off estimates of the size of the indexed Web. We find that much, if not all of this variability can be explained by changes in the indexing and ranking infrastructure of Google and Bing. This casts further doubt on whether Web search engines can be used reliably for cross-sectional webometric studies.
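
    The extrapolation step can be illustrated with a short sketch. It assumes that a word occurs in the same fraction of documents in the engine's index as in the static corpus, so each word yields one size estimate, and the estimates are averaged. The function name `estimate_index_size`, the use of a plain mean, and the sample numbers are assumptions; the abstract does not describe the authors' word selection or averaging procedure.

```python
from statistics import mean


def estimate_index_size(corpus_doc_freq, corpus_size, engine_hit_counts):
    """Estimate how many documents a search engine has indexed.

    corpus_doc_freq:   word -> number of corpus documents containing it
    corpus_size:       total number of documents in the static corpus
    engine_hit_counts: word -> hit count reported by the search engine

    For each word: index_size ~ hits / (doc_freq / corpus_size).
    Words that never occur in the corpus are skipped.
    """
    estimates = []
    for word, hits in engine_hit_counts.items():
        doc_freq = corpus_doc_freq.get(word, 0)
        if doc_freq == 0:
            continue
        estimates.append(hits * corpus_size / doc_freq)
    if not estimates:
        raise ValueError("no query word occurs in the reference corpus")
    return mean(estimates)


if __name__ == "__main__":
    # Made-up numbers, for illustration only.
    doc_freq = {"the": 900, "search": 100}
    hits = {"the": 9_000_000, "search": 1_100_000}
    print(estimate_index_size(doc_freq, 1000, hits))
```

    Repeating such an estimate at regular intervals gives a time series of the kind the longitudinal study analyzes.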

  16. How to Effectively Use Search Engines

    Institute of Scientific and Technical Information of China (English)

    陈怡君

    2011-01-01

    This paper briefly summarizes search engines, analyzes the shortcomings of existing search engines, and discusses how to select an appropriate search engine for a given search target in order to improve retrieval efficiency.

  17. A cross-disciplinary technology transfer for search-based evolutionary computing: from engineering design to software engineering design

    Science.gov (United States)

    Simons, C. L.; Parmee, I. C.

    2007-07-01

    Although object-oriented conceptual software design is difficult to learn and perform, computational tool support for the conceptual software designer is limited. In conceptual engineering design, however, computational tools exploiting interactive evolutionary computation (EC) have shown significant utility. This article investigates the cross-disciplinary technology transfer of search-based EC from engineering design to software engineering design in an attempt to provide support for the conceptual software designer. Firstly, genetic operators inspired by genetic algorithms (GAs) and evolutionary programming are evaluated for their effectiveness against a conceptual software design representation using structural cohesion as an objective fitness function. Building on this evaluation, a multi-objective GA inspired by a non-dominated Pareto sorting approach is investigated for an industrial-scale conceptual design problem. Results obtained reveal a mass of interesting and useful conceptual software design solution variants of equivalent optimality—a typical characteristic of successful multi-objective evolutionary search techniques employed in conceptual engineering design. The mass of software design solution variants produced suggests that transferring search-based technology across disciplines has significant potential to provide computationally intelligent tool support for the conceptual software designer.

  18. Web Spam, Social Propaganda and the Evolution of Search Engine Rankings

    Science.gov (United States)

    Metaxas, Panagiotis Takis

    Search Engines have greatly influenced the way we experience the web. Since the early days of the web, users have been relying on them to get informed and make decisions. When the web was relatively small, web directories were built and maintained using human experts to screen and categorize pages according to their characteristics. By the mid 1990's, however, it was apparent that the human expert model of categorizing web pages does not scale. The first search engines appeared and they have been evolving ever since, taking over the role that web directories used to play.

  19. A Method for Detecting the Real Location of Agency Website Based On Search Engine

    Directory of Open Access Journals (Sweden)

    Chou Xiao-Hui

    2016-01-01

    This paper provides a method to detect the real location of an agency website based on a search engine. We analyze and process the target agency website to obtain the server routing information, extract critical feature information from the web content, combine it with a search engine to acquire web data, and calculate word frequency. Through named entity recognition, web text matching calculation and syntactic analysis, we can infer the real-world location of the target agency website. The experimental results show that our approach is reliable, correct and effective.

  20. Research on Search Engine Technology

    Institute of Scientific and Technical Information of China (English)

    姬睿

    2015-01-01

    With the rapid development of the Internet and information technology, search engines have become the main means by which people obtain information on the Internet. This article discusses the concept, classification, working principles and development trends of search engines.

  1. Chemical compound navigator: a web-based chem-BLAST, chemical taxonomy-based search engine for browsing compounds.

    Science.gov (United States)

    Prasanna, M D; Vondrasek, Jiri; Wlodawer, Alexander; Rodriguez, H; Bhat, T N

    2006-06-01

    A novel technique to annotate, query, and analyze chemical compounds has been developed and is illustrated by using the inhibitor data on HIV protease-inhibitor complexes. In this method, all chemical compounds are annotated in terms of standard chemical structural fragments. These standard fragments are defined by using criteria, such as chemical classification; structural, chemical, or functional groups; and commercial, scientific or common names or synonyms. These fragments are then organized into a data tree based on their chemical substructures. Search engines have been developed to use this data tree to enable query on inhibitors of HIV protease (http://xpdb.nist.gov/hivsdb/hivsdb.html). These search engines use a novel technique, Chemical Block Layered Alignment of Substructure Technique (Chem-BLAST), to search on the fragments of an inhibitor to look for its chemical structural neighbors. This novel technique to annotate and query compounds lays the foundation for the use of the Semantic Web concept on chemical compounds to allow end users to group, sort, and search structural neighbors accurately and efficiently. During annotation, it enables the attachment of "meaning" (i.e., semantics) to data in a manner that far exceeds the current practice of associating "metadata" with data by creating a knowledge base (or ontology) associated with compounds. Intended users of the technique are the research community and pharmaceutical industry, for which it will provide a new tool to better identify novel chemical structural neighbors to aid drug discovery.

  2. Broadening the search for minority science and engineering doctoral starts

    Science.gov (United States)

    Brazziel, William E.; Brazziel, Marian E.

    1995-06-01

    This analysis looked at doctorate completion in science and engineering (S&E) by underrepresented minorities: blacks, Hispanics and Indian Americans. These are the groups we must increasingly depend upon to make up for shortfalls in science and engineering doctorate production among American citizens. These shortfalls derive from truncated birth rates among white people, for the most part. The analysis answered several questions officials will need to know the answers to if we are to plan effectively to develop the talents of these individuals. Specifically, the National Science Foundation asked us to look at the feasibility of involving nontraditional minority science and engineering graduates (baccalaureates at 25+) as doctoral starts, along with minority S&E graduates who had taken jobs with corporations to pay off student loans and military personnel involved in S&E study and S&E work (see NSF report of research under grant SED-9107756). We found that nontraditional minority S&E doctorate recipients matched their traditional counterparts in elapsed time to degree and similar indicators. They had less in the way of support for doctoral study, however. We found that minority S&E graduates who took jobs in corporations were keenly interested in returning to campus to complete degrees. We also found that many bright minority youngsters are studying S&E subjects in the Community College of the Air Force and in U.S. Army SOC colleges. Some have enrolled in baccalaureate programs on university campuses and plan to continue on to the PhD. We concluded that money is important in tapping these talent pools to make up for the demographically driven shortfalls discussed above.

  3. Resonance Search for a Heavy Photon in the 2015 Engineering Run Data of the Heavy Photon Search Experiment

    Science.gov (United States)

    Moreno, Omar; Heavy Photon Search Collaboration

    2017-01-01

    The Heavy Photon Search (HPS) experiment at Jefferson Lab is searching for a new U(1) vector boson ("heavy photon", "dark photon" or A') in the mass range of 20-500 MeV/c². An A' in this mass range is theoretically favorable and may also mediate dark matter interactions. The A' couples to the ordinary photon through kinetic mixing, which induces its coupling to electric charge. Since heavy photons couple to electrons, they can be produced through a process analogous to bremsstrahlung, subsequently decaying to an e+e- pair, which can be observed as a narrow resonance above the dominant QED trident background. For suitably small couplings, heavy photons travel detectable distances before decaying, providing a second signature. Using the CEBAF electron beam at Jefferson Lab incident on a thin tungsten target, along with a compact, large acceptance forward spectrometer consisting of a silicon vertex tracker and lead tungstate electromagnetic calorimeter, HPS is accessing unexplored regions in the mass-coupling phase space. The HPS engineering run took place in spring of 2015 using a 1.056 GeV, 50 nA beam and collected 1165 nb⁻¹ (7.29 mC) of data. This talk will present the results of a resonance search for a heavy photon using the engineering run data.

  4. Efficient Retrieval of Images for Search Engine by Visual Similarity and Re Ranking

    Directory of Open Access Journals (Sweden)

    Viswa S S

    2013-06-01

    Nowadays, web-scale image search engines (e.g., Google Image Search, Microsoft Live Image Search) rely almost purely on surrounding text features. Users type keywords in the hope of finding a certain type of image. The search engine returns thousands of images ranked by the text keywords extracted from the surrounding text. However, many of the returned images are noisy, disorganized, or irrelevant. Even Google and Microsoft do not use visual information when searching for images. The idea is to use visual information to re-rank and improve text-based image search results. This improves the precision of the text-based image search ranking by incorporating the information conveyed by the visual modality. The typical assumption that the top images in the text-based search result are equally relevant is relaxed by linking the relevance of the images to their initial rank positions. Then, a number of images from the initial search result are employed as prototypes that visually represent the query and are subsequently used to construct meta re-rankers; that is, the most relevant images are found by visual similarity and the average scores are calculated. By applying different meta re-rankers to an image from the initial result, re-ranking scores are generated, which are then used to find the new rank position for the image in the re-ranked search result. Human supervision is introduced to learn the model weights offline, prior to the online re-ranking process. While model learning requires manual labelling of the results for a few queries, the resulting model is query independent and therefore applicable to any other query. The experimental results on a representative web image search dataset comprising 353 queries demonstrate that the proposed method outperforms the existing supervised and unsupervised re-ranking approaches. Moreover, it improves the performance over the text-based image search engine by more than 25.48%.
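
    The prototype idea can be illustrated with a much simpler, unsupervised sketch than the learned meta re-rankers described in the abstract. Here the top-ranked images of the text-based result serve as prototypes, and every image is re-scored by its average cosine similarity to them. The feature vectors, the function names `cosine` and `rerank`, and the absence of learned weights are all assumptions made for the example.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rerank(initial_ranking, features, num_prototypes=2):
    """Re-rank a text-based image result list by visual similarity.

    initial_ranking: image ids, best first, from the text-based engine
    features:        image id -> visual feature vector
    The first num_prototypes images act as visual prototypes of the
    query; each image is scored by its mean similarity to them.
    Python's sort is stable, so ties keep their initial order.
    """
    prototypes = initial_ranking[:num_prototypes]

    def score(image_id):
        sims = [cosine(features[image_id], features[p]) for p in prototypes]
        return sum(sims) / len(sims)

    return sorted(initial_ranking, key=lambda image_id: -score(image_id))


if __name__ == "__main__":
    feats = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
    print(rerank(["a", "b", "c"], feats, num_prototypes=1))
```

    In the example, image "c" is visually closer to the prototype "a" than "b" is, so it moves ahead of "b" in the re-ranked list.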

  5. Posterior α EEG Dynamics Dissociate Current from Future Goals in Working Memory-Guided Visual Search

    Science.gov (United States)

    2017-01-01

    Current models of visual search assume that search is guided by an active visual working memory representation of what we are currently looking for. This attentional template for currently relevant stimuli can be dissociated from accessory memory representations that are only needed prospectively, for a future task, and that should be prevented from guiding current attention. However, it remains unclear what electrophysiological mechanisms dissociate currently relevant (serving upcoming selection) from prospectively relevant memories (serving future selection). We measured EEG of 20 human subjects while they performed two consecutive visual search tasks. Before the search tasks, a cue instructed observers which item to look for first (current template) and which second (prospective template). During the delay leading up to the first search display, we found clear suppression of α band (8–14 Hz) activity in regions contralateral to remembered items, comprising both local power and interregional phase synchronization within a posterior parietal network. Importantly, these lateralization effects were stronger when the memory item was currently relevant (i.e., for the first search) compared with when it was prospectively relevant (i.e., for the second search), consistent with current templates being prioritized over future templates. In contrast, event-related potential analysis revealed that the contralateral delay activity was similar for all conditions, suggesting no difference in storage. Together, these findings support the idea that posterior α oscillations represent a state of increased processing or excitability in task-relevant cortical regions, and reflect enhanced cortical prioritization of memory representations that serve as a current selection filter. SIGNIFICANCE STATEMENT Our days are filled with looking for relevant objects while ignoring irrelevant visual information. Such visual search activity is thought to be driven by current goals activated

  6. Posterior α EEG Dynamics Dissociate Current from Future Goals in Working Memory-Guided Visual Search.

    Science.gov (United States)

    de Vries, Ingmar E J; van Driel, Joram; Olivers, Christian N L

    2017-02-08

    Current models of visual search assume that search is guided by an active visual working memory representation of what we are currently looking for. This attentional template for currently relevant stimuli can be dissociated from accessory memory representations that are only needed prospectively, for a future task, and that should be prevented from guiding current attention. However, it remains unclear what electrophysiological mechanisms dissociate currently relevant (serving upcoming selection) from prospectively relevant memories (serving future selection). We measured EEG of 20 human subjects while they performed two consecutive visual search tasks. Before the search tasks, a cue instructed observers which item to look for first (current template) and which second (prospective template). During the delay leading up to the first search display, we found clear suppression of α band (8-14 Hz) activity in regions contralateral to remembered items, comprising both local power and interregional phase synchronization within a posterior parietal network. Importantly, these lateralization effects were stronger when the memory item was currently relevant (i.e., for the first search) compared with when it was prospectively relevant (i.e., for the second search), consistent with current templates being prioritized over future templates. In contrast, event-related potential analysis revealed that the contralateral delay activity was similar for all conditions, suggesting no difference in storage. Together, these findings support the idea that posterior α oscillations represent a state of increased processing or excitability in task-relevant cortical regions, and reflect enhanced cortical prioritization of memory representations that serve as a current selection filter. SIGNIFICANCE STATEMENT Our days are filled with looking for relevant objects while ignoring irrelevant visual information. Such visual search activity is thought to be driven by current goals activated in

  7. Diagnostics of the technical condition of gas turbine engines by the random search method

    Energy Technology Data Exchange (ETDEWEB)

    Shepel', V.T.; Kabashov, M.A.

    1981-01-01

    One of the methods of diagnosis of gas turbine engines according to a limited number of controllable thermodynamic parameters is considered. It makes use of a priori information about the object of diagnosis, including both the information about the assumed defect or a group of defects, and information about the region of possible variations of characteristics of engine components or elements. The random search method with adaptation serves as the basis for the procedures.

  8. Development and evaluation of a prototype search engine to meet public health information needs.

    Science.gov (United States)

    Keeling, Jonathan W; Turner, Anne M; Allen, Eileen E; Rowe, Steven A; Merrill, Jacqueline A; Liddy, Elizabeth D; Turtle, Howard R

    2011-01-01

    Grey literature is information not available through commercial publishers. It is a sizable and valuable information source for public health (PH) practice but because documents are not formally indexed the information is difficult to locate. Public Health Information Search (PHIS) was developed to address this problem. NLP techniques were used to create informative document summaries for an extensive collection of grey literature on PH topics. The system was evaluated with PH workers using the critical incident technique in a two stage field evaluation to assess effectiveness in comparison with Google. Document summaries were found to be both helpful and accurate. Increased document collection size and enhanced result rankings improved search effectiveness from 28% to 55%. PHIS would work best in conjunction with Google or another broad coverage Web search engine when searching for documents and reports as opposed to local health data and primary disease information. PHIS could enhance both the quality and quantity of PH search results.

  9. Seasonal trends in tinnitus symptomatology: evidence from Internet search engine query data.

    Science.gov (United States)

    Plante, David T; Ingram, David G

    2015-10-01

    The primary aim of this study was to test the hypothesis that the symptom of tinnitus demonstrates a seasonal pattern with worsening in the winter relative to the summer using Internet search engine query data. Normalized search volume for the term 'tinnitus' from January 2004 through December 2013 was retrieved from Google Trends. Seasonal effects were evaluated using cosinor regression models. Primary countries of interest were the United States and Australia. Secondary exploratory analyses were also performed using data from Germany, the United Kingdom, Canada, Sweden, and Switzerland. Significant seasonal effects for 'tinnitus' search queries were found in the United States and Australia, with greater search volume in the winter relative to the summer. Our findings indicate that there are significant seasonal trends for Internet search queries for tinnitus, with a zenith in winter months. Further research is indicated to determine the biological mechanisms underlying these findings, as they may provide insights into the pathophysiology of this common and debilitating medical symptom.
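
    As a rough illustration of the cosinor regression mentioned above, the sketch below fits y(t) = mesor + amplitude * cos(2*pi*t/period - acrophase) to a monthly series. It is not the authors' code: the function name `fit_cosinor` is hypothetical, and the closed-form solution assumes equally spaced samples that cover a whole number of periods, in which case the cosine and sine regressors are orthogonal and least squares reduces to simple projections.

```python
import math


def fit_cosinor(values, period=12):
    """Least-squares fit of a single-component cosinor model.

    Assumes equally spaced samples covering a whole number of periods
    (e.g. 120 monthly values for a 12-month cycle).
    Returns (mesor, amplitude, acrophase), with acrophase in radians.
    """
    n = len(values)
    if period < 3 or n == 0 or n % period != 0:
        raise ValueError("need a whole number of periods with period >= 3")
    omega = 2 * math.pi / period
    mesor = sum(values) / n  # rhythm-adjusted mean level
    # Projections onto the cosine and sine regressors.
    beta = 2 / n * sum(y * math.cos(omega * t) for t, y in enumerate(values))
    gamma = 2 / n * sum(y * math.sin(omega * t) for t, y in enumerate(values))
    amplitude = math.hypot(beta, gamma)
    acrophase = math.atan2(gamma, beta)  # the peak falls at t = acrophase / omega
    return mesor, amplitude, acrophase
```

    The time of the seasonal peak is recovered as acrophase * period / (2 * pi), in the same units as the sample index; a series that does not span whole years would need a general least-squares solver instead.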

  10. Start Your Search Engines. Part One: Taming Google--and Other Tips to Master Web Searches

    Science.gov (United States)

    Adam, Anna; Mowers, Helen

    2008-01-01

    There are a lot of useful tools on the Web, all those social applications, and the like. Still most people go online for one thing--to perform a basic search. For most fact-finding missions, the Web is there. But--as media specialists well know--the sheer wealth of online information can hamper efforts to focus on a few reliable references.…

  11. Defaming by Suggestion: Searching for Search Engine Liability in the Autocomplete Era

    OpenAIRE

    Cheung, ASY

    2015-01-01

    Whilst different jurisdictions have yet to reach consensus on search engines’ liability for defamation, Internet giant Google is confronting judges and academics with another challenge: the basis of liability for defamation arising from its Autocomplete function. In 2014, for example, the Hong Kong Court of First Instance held that a claimant whose name was often paired with ‘triad member’ in Autocomplete had a good arguable case of defamation to proceed with and dismissed a claim of summary ...

  12. Research on Search Engine Technology and Service and Its Enlightenment

    Institute of Scientific and Technical Information of China (English)

    符绍宏; 黄崑

    2000-01-01

    This article offers a macro-level theoretical discussion of the technologies and service features of the major foreign English-language search engines. It also examines the current state of Chinese-language search engines, analyzes their present deficiencies, looks ahead to future development trends, and finally offers several suggestions.

  13. Which Search Engine Is the Most Used One among University Students?

    Science.gov (United States)

    Cavus, Nadire; Alpan, Kezban

    2010-01-01

    The importance of information is increasing in the information age that we are living in with internet becoming the major information resource for people with rapidly increasing number of documents. This situation makes finding information on the internet without web search engines impossible. The aim of the study is revealing most widely used…

  14. A novel algorithm for validating peptide identification from a shotgun proteomics search engine.

    Science.gov (United States)

    Jian, Ling; Niu, Xinnan; Xia, Zhonghang; Samir, Parimal; Sumanasekera, Chiranthani; Mu, Zheng; Jennings, Jennifer L; Hoek, Kristen L; Allos, Tara; Howard, Leigh M; Edwards, Kathryn M; Weil, P Anthony; Link, Andrew J

    2013-03-01

    Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has revolutionized the proteomics analysis of complexes, cells, and tissues. In a typical proteomic analysis, the tandem mass spectra from a LC-MS/MS experiment are assigned to a peptide by a search engine that compares the experimental MS/MS peptide data to theoretical peptide sequences in a protein database. The peptide spectra matches are then used to infer a list of identified proteins in the original sample. However, the search engines often fail to distinguish between correct and incorrect peptides assignments. In this study, we designed and implemented a novel algorithm called De-Noise to reduce the number of incorrect peptide matches and maximize the number of correct peptides at a fixed false discovery rate using a minimal number of scoring outputs from the SEQUEST search engine. The novel algorithm uses a three-step process: data cleaning, data refining through a SVM-based decision function, and a final data refining step based on proteolytic peptide patterns. Using proteomics data generated on different types of mass spectrometers, we optimized the De-Noise algorithm on the basis of the resolution and mass accuracy of the mass spectrometer employed in the LC-MS/MS experiment. Our results demonstrate De-Noise improves peptide identification compared to other methods used to process the peptide sequence matches assigned by SEQUEST. Because De-Noise uses a limited number of scoring attributes, it can be easily implemented with other search engines.

  15. A search engine for retrieval and inspection of events with 48 human actions in realistic videos

    NARCIS (Netherlands)

    Burghouts, G.J.; Penning, H.L.H. de; Hove, R.J.M. ten; Landsmeer, S.; Broek, S.P. van den; Hollander, R.J.M.; Hanckmann, P.; Kruithof, M.C.; Leeuwen, C.J. van; Korzec, S.; Bouma, H.; Schutte, K.

    2013-01-01

    The contribution of this paper is a search engine that recognizes and describes 48 human actions in realistic videos. The core algorithms have been published recently, from the early visual processing (Bouma, 2012), discriminative recognition (Burghouts, 2012) and textual description (Hanckmann, 2012

  16. A study of medical and health queries to web search engines.

    Science.gov (United States)

    Spink, Amanda; Yang, Yin; Jansen, Jim; Nykanen, Pirrko; Lorence, Daniel P; Ozmutlu, Seda; Ozmutlu, H Cenk

    2004-03-01

    This paper reports findings from an analysis of medical or health queries to different web search engines. We report results: (i) comparing samples of 10000 web queries taken randomly from 1.2 million query logs from the AlltheWeb.com and Excite.com commercial web search engines in 2001 for medical or health queries, (ii) comparing the 2001 findings from Excite and AlltheWeb.com users with results from a previous analysis of medical and health related queries from the Excite Web search engine for 1997 and 1999, and (iii) medical or health advice-seeking queries beginning with the word 'should'. Findings suggest: (i) a small percentage of web queries are medical or health related, (ii) the top five categories of medical or health queries were: general health, weight issues, reproductive health and puberty, pregnancy/obstetrics, and human relationships, and (iii) over time, the medical and health queries may have declined as a proportion of all web queries, as the use of specialized medical/health websites and e-commerce-related queries has increased. Findings provide insights into medical and health-related web querying and suggest some implications for the use of the general web search engines when seeking medical/health information.

  17. A cognitive evaluation of four online search engines for answering definitional questions posed by physicians.

    Science.gov (United States)

    Yu, Hong; Kaufman, David

    2007-01-01

    The Internet is having a profound impact on physicians' medical decision making. One recent survey of 277 physicians showed that 72% of physicians regularly used the Internet to research medical information and 51% admitted that information from web sites influenced their clinical decisions. This paper describes the first cognitive evaluation of four state-of-the-art Internet search engines: Google (i.e., Google and Scholar.Google), MedQA, Onelook, and PubMed for answering definitional questions (i.e., questions with the format of "What is X?") posed by physicians. Onelook is a portal for online definitions, and MedQA is a question answering system that automatically generates short texts to answer specific biomedical questions. Our evaluation criteria include quality of answer, ease of use, time spent, and number of actions taken. Our results show that MedQA outperforms Onelook and PubMed in most of the criteria, and that MedQA surpasses Google in time spent and number of actions, two important efficiency criteria. Our results show that Google is the best system for quality of answer and ease of use. We conclude that Google is an effective search engine for medical definitions, and that MedQA exceeds the other search engines in that it provides users direct answers to their questions; while the users of the other search engines have to visit several sites before finding all of the pertinent information.

  18. Key word placing in Web page body text to increase visibility to search engines

    Directory of Open Access Journals (Sweden)

    W. T. Kritzinger

    2007-11-01

    Full Text Available The growth of the World Wide Web has spawned a wide variety of new information sources, which has also left users with the daunting task of determining which sources are valid. Many users rely on the Web as an information source because of the low cost of information retrieval. It is also claimed that the Web has evolved into a powerful business tool. Examples include highly popular business services such as Amazon.com and Kalahari.net. It is estimated that around 80% of users utilize search engines to locate information on the Internet. This, by implication, places emphasis on the underlying importance of Web pages being listed on search engines indices. Empirical evidence that the placement of key words in certain areas of the body text will have an influence on the Web sites' visibility to search engines could not be found in the literature. The result of two experiments indicated that key words should be concentrated towards the top, and diluted towards the bottom of a Web page to increase visibility. However, care should be taken in terms of key word density, to prevent search engine algorithms from raising the spam alarm.

  19. Search Engine Designer for Tomorrow: Interview with TextWise's Elizabeth Liddy.

    Science.gov (United States)

    Quint, Barbara

    1998-01-01

    Presents an interview with Elizabeth Liddy, president of TextWise, an information access and analytics company. Background on TextWise is provided; advanced search engines are discussed; TextWise products and projects are described; and the changing role of information professionals is considered. (MES)

  20. Eugene Garfield, Francis Narin, and PageRank: The Theoretical Bases of the Google Search Engine

    CERN Document Server

    Bensman, Stephen J

    2013-01-01

    This paper presents a test of the validity of using Google Scholar to evaluate the publications of researchers by comparing the premises on which its search engine, PageRank, is based, to those of Garfield's theory of citation indexing. It finds that the premises are identical and that PageRank and Garfield's theory of citation indexing validate each other.
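
    The premise shared by PageRank and citation indexing, that an item is important when important items point to it, can be illustrated with the textbook power-iteration form of PageRank. This is a generic sketch, not code from the paper: the function name `pagerank`, the damping factor of 0.85 and the uniform treatment of dangling nodes are assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Power-iteration PageRank.

    links: dict mapping each node to the list of nodes it links to (cites).
    Returns a dict of node -> rank; the ranks sum to 1.
    """
    nodes = set(links)
    for targets in links.values():
        nodes.update(targets)
    nodes = sorted(nodes)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing links is spread uniformly.
        dangling = sum(rank[node] for node in nodes if not links.get(node))
        new_rank = {
            node: (1 - damping) / n + damping * dangling / n for node in nodes
        }
        for node in nodes:
            targets = links.get(node, [])
            if targets:
                share = damping * rank[node] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank
```

    Read as a citation graph, a paper cited by two others ends up with a higher rank than either of the papers citing it, which is the same intuition that underlies citation counts weighted by the standing of the citing source.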

  1. Evaluation of three German search engines: Altavista.de, Google.de and Lycos.de

    Directory of Open Access Journals (Sweden)

    Joachim Griesbaum

    2004-01-01

    Full Text Available The goal of this study was to investigate the retrieval effectiveness of three popular German Web search services. For this purpose the engines Altavista.de, Google.de and Lycos.de were compared with each other in terms of the precision of their top 20 results. The test was based on a collection of 50 randomly selected queries, and relevance assessments were made by independent jurors. Relevance assessments were acquired separately (a) for the search results themselves and (b) for the result descriptions on the search engine results pages. The basic findings were: 1. Google reached the best result values. Statistical validation showed that Google performed significantly better than Altavista, but there was no significant difference between Google and Lycos. Lycos also attained better values than Altavista, but again the differences were not significant. In terms of top 20 precision, the experiment showed similar outcomes to the preceding retrieval test in 2002. Google, followed by Lycos and then Altavista, still performs best, but the gaps between the engines are narrower now. 2. There are big deviations between the relevance assignments based on the judgement of the results themselves and those based on the judgements of the result descriptions on the search engine results pages.
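
    The top 20 precision used in this comparison can be computed as in the sketch below. The function names and the toy judgements are hypothetical; the abstract does not say how result lists shorter than 20 were scored, so counting missing results as non-relevant is an assumption of this example.

```python
def precision_at_k(judgements, k=20):
    """Fraction of the top-k results judged relevant for one query.

    judgements: booleans in rank order (True = judged relevant).
    Lists shorter than k are scored as if the missing results were
    non-relevant (an assumption of this sketch).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for relevant in judgements[:k] if relevant) / k


def mean_precision_at_k(per_query_judgements, k=20):
    """Average precision-at-k over a collection of queries."""
    scores = [precision_at_k(j, k) for j in per_query_judgements]
    return sum(scores) / len(scores)
```

    Running `mean_precision_at_k` once on the jurors' judgements of the results themselves and once on their judgements of the result descriptions would give the two sets of values whose deviation the study reports.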

  2. A Novel Algorithm for Validating Peptide Identification from a Shotgun Proteomics Search Engine

    Science.gov (United States)

    Jian, Ling; Niu, Xinnan; Xia, Zhonghang; Samir, Parimal; Sumanasekera, Chiranthani; Zheng, Mu; Jennings, Jennifer L.; Hoek, Kristen L.; Allos, Tara; Howard, Leigh M.; Edwards, Kathryn M.; Weil, P. Anthony; Link, Andrew J.

    2013-01-01

    Liquid chromatography coupled with tandem mass spectrometry has revolutionized the proteomics analysis of complexes, cells, and tissues. In a typical proteomic analysis, the tandem mass spectra from a LC/MS/MS experiment are assigned to a peptide by a search engine that compares the experimental MS/MS peptide data to theoretical peptide sequences in a protein database. The peptide spectra matches are then used to infer a list of identified proteins in the original sample. However, the search engines often fail to distinguish between correct and incorrect peptides assignments. In this study, we designed and implemented a novel algorithm called De-Noise to reduce the number of incorrect peptide matches and maximize the number of correct peptides at a fixed false discovery rate using a minimal number of scoring outputs from the SEQUEST search engine. The novel algorithm uses a three step process: data cleaning, data refining through a SVM-based decision function, and a final data refining step based on proteolytic peptide patterns. Using proteomics data generated on different types of mass spectrometers, we optimized the De-Noise algorithm based on the resolution and mass accuracy of the mass spectrometer employed in the LC/MS/MS experiment. Our results demonstrate De-Noise improves peptide identification compared to other methods used to process the peptide sequence matches assigned by SEQUEST. Because De-Noise uses a limited number of scoring attributes, it can be easily implemented with other search engines. PMID:23402659

  3. Andromeda: a peptide search engine integrated into the MaxQuant environment.

    Science.gov (United States)

    Cox, Jürgen; Neuhauser, Nadin; Michalski, Annette; Scheltema, Richard A; Olsen, Jesper V; Mann, Matthias

    2011-04-01

    A key step in mass spectrometry (MS)-based proteomics is the identification of peptides in sequence databases by their fragmentation spectra. Here we describe Andromeda, a novel peptide search engine using a probabilistic scoring model. On proteome data, Andromeda performs as well as Mascot, a widely used commercial search engine, as judged by sensitivity and specificity analysis based on target decoy searches. Furthermore, it can handle data with arbitrarily high fragment mass accuracy, is able to assign and score complex patterns of post-translational modifications, such as highly phosphorylated peptides, and accommodates extremely large databases. The algorithms of Andromeda are provided. Andromeda can function independently or as an integrated search engine of the widely used MaxQuant computational proteomics platform and both are freely available at www.maxquant.org. The combination enables analysis of large data sets in a simple analysis workflow on a desktop computer. For searching individual spectra Andromeda is also accessible via a web server. We demonstrate the flexibility of the system by implementing the capability to identify cofragmented peptides, significantly improving the total number of identified peptides.
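
    Target-decoy searches of the kind mentioned above are commonly used to estimate a false discovery rate (FDR) for peptide-spectrum matches (PSMs). The sketch below uses the widely used decoys-over-targets estimate; it is not Andromeda's or MaxQuant's code, and the function names, the (score, is_decoy) tuple layout and the example threshold are assumptions.

```python
def fdr_at_threshold(psms, threshold):
    """Estimate the FDR among PSMs scoring at or above the threshold.

    psms: list of (score, is_decoy) pairs from a target-decoy search.
    Uses the simple estimate: decoy hits divided by target hits.
    """
    targets = sum(1 for s, decoy in psms if s >= threshold and not decoy)
    decoys = sum(1 for s, decoy in psms if s >= threshold and decoy)
    if targets == 0:
        return float("inf") if decoys else 0.0
    return decoys / targets


def score_cutoff(psms, max_fdr=0.01):
    """Lowest score threshold whose estimated FDR stays within max_fdr.

    Returns None when no threshold satisfies the limit.
    """
    best = None
    for threshold in sorted({score for score, _ in psms}, reverse=True):
        if fdr_at_threshold(psms, threshold) <= max_fdr:
            best = threshold  # keep lowering while the estimate still passes
    return best
```

    Choosing the lowest passing threshold keeps as many target matches as possible at the chosen error rate, which is the sense in which identifications are maximized at a fixed FDR.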

  4. A Multimodal Search Engine for Medical Imaging Studies.

    Science.gov (United States)

    Pinho, Eduardo; Godinho, Tiago; Valente, Frederico; Costa, Carlos

    2017-02-01

    The use of digital medical imaging systems in healthcare institutions has increased significantly, and the large amounts of data in these systems have led to the conception of powerful support tools: recent studies on content-based image retrieval (CBIR) and multimodal information retrieval in the field hold great potential in decision support, as well as for addressing multiple challenges in healthcare systems, such as computer-aided diagnosis (CAD). However, the subject is still under heavy research, and very few solutions have become part of Picture Archiving and Communication Systems (PACS) in hospitals and clinics. This paper proposes an extensible platform for multimodal medical image retrieval, integrated in an open-source PACS software with profile-based CBIR capabilities. In this article, we detail a technical approach to the problem by describing its main architecture and each sub-component, as well as the available web interfaces and the multimodal query techniques applied. Finally, we assess our implementation of the engine with computational performance benchmarks.

  5. Discussion on Search Engine Optimization

    Institute of Scientific and Technical Information of China (English)

    付雷

    2012-01-01

    SEO (Search Engine Optimization) can bring high-quality traffic to a site and improve the site's search ranking, thus realizing the commercial value of the site and achieving good online marketing results. Starting from the current state of SEO development, the paper describes the approaches to implementing SEO.

  6. Artificial neural network-based merging score for Meta search engine

    Institute of Scientific and Technical Information of China (English)

    P Vijaya; G Raju; Santosh Kumar Ray

    2016-01-01

    Many users rely on metasearch engines, directly or indirectly, to access and gather data from more than one data source. The effectiveness of a metasearch engine is largely determined by the quality of the results it returns in response to user queries. The rank aggregation methods proposed so far exploit a very limited set of parameters, such as the total number of resources used and the rankings achieved in each individual resource. In this work, we use a neural network to compute the merging score effectively. Initially, we submit a query to different search engines, and the top-n list from each search engine is chosen for further processing. We then merge the top-n lists based on unique links and perform several parameter calculations: title-based, snippet-based, content-based, domain, position and co-occurrence calculations. The results of these calculations, together with user-given rankings of the links, are fed to the neural network to train the system. The system then ranks and merges the links obtained from the different search engines for a given query. Experimental results report a retrieval effectiveness of about 80%, and a precision of about 79% for user queries and about 72% for benchmark queries. The proposed technique also achieves a response time of about 76 ms for 50 links and 144 ms for 100 links.
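
    The merging step can be illustrated with a much simpler, hand-set scoring rule. The sketch below is not the neural-network model described in the abstract: it uses only two of the listed signals (position and co-occurrence), and the function name `merge_results` and the linear position score are assumptions made for the example.

```python
def merge_results(result_lists):
    """Merge ranked top-n lists from several engines into one ranking.

    result_lists: one list of URLs per engine, best result first.
    Each engine contributes a position score between 1/n and 1 for every
    unique link it returns; summing over engines rewards co-occurrence.
    """
    scores = {}
    for results in result_lists:
        n = len(results)
        seen = set()
        for position, url in enumerate(results):
            if url in seen:  # count each link once per engine
                continue
            seen.add(url)
            scores[url] = scores.get(url, 0.0) + (n - position) / n
    # Highest merged score first; ties are broken alphabetically by URL.
    return sorted(scores, key=lambda url: (-scores[url], url))
```

    In the approach the abstract describes, the fixed sum would be replaced by a trained network that also takes the title, snippet, content and domain calculations as inputs.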

  7. GeNemo: a search engine for web-based functional genomic data.

    Science.gov (United States)

    Zhang, Yongqing; Cao, Xiaoyi; Zhong, Sheng

    2016-07-08

    A set of new data types emerged from functional genomic assays, including ChIP-seq, DNase-seq, FAIRE-seq and others. The results are typically stored as genome-wide intensities (WIG/bigWig files) or functional genomic regions (peak/BED files). These data types present new challenges to big data science. Here, we present GeNemo, a web-based search engine for functional genomic data. GeNemo searches user-input data against online functional genomic datasets, including the entire collection of ENCODE and mouse ENCODE datasets. Unlike text-based search engines, GeNemo's searches are based on pattern matching of functional genomic regions. This distinguishes GeNemo from text or DNA sequence searches. The user can input any complete or partial functional genomic dataset, for example, a binding intensity file (bigWig) or a peak file. GeNemo reports any genomic regions, ranging from hundred bases to hundred thousand bases, from any of the online ENCODE datasets that share similar functional (binding, modification, accessibility) patterns. This is enabled by a Markov Chain Monte Carlo-based maximization process, executed on up to 24 parallel computing threads. By clicking on a search result, the user can visually compare her/his data with the found datasets and navigate the identified genomic regions. GeNemo is available at www.genemo.org.

  8. The LAILAPS search engine: a feature model for relevance ranking in life science databases.

    Science.gov (United States)

    Lange, Matthias; Spies, Karl; Colmsee, Christian; Flemming, Steffen; Klapperstück, Matthias; Scholz, Uwe

    2010-03-25

    Efficient and effective information retrieval in the life sciences is one of the most pressing challenges in bioinformatics. The incredible growth of life science databases into a vast network of interconnected information systems is both a big challenge and a great opportunity for life science research. The knowledge found on the Web, in particular in life science databases, is a valuable major resource. In order to bring it to the scientist's desktop, it is essential to have well-performing search engines. Here, neither the response time nor the number of results is what matters most. The most crucial factor for millions of query results is the relevance ranking. In this paper, we present a feature model for relevance ranking in life science databases and its implementation in the LAILAPS search engine. Motivated by the observation of user behavior during the inspection of search engine results, we condensed a set of 9 relevance-discriminating features. These features are intuitively used by scientists, who briefly screen database entries for potential relevance. The features are both sufficient to estimate the potential relevance and efficiently quantifiable. The derivation of a relevance prediction function that computes the relevance from these features constitutes a regression problem. To solve this problem, we used artificial neural networks that have been trained with a reference set of relevant database entries for 19 protein queries. Supporting a flexible text index and a simple data import format, these concepts are implemented in the LAILAPS search engine. It can easily be used both as a search engine for comprehensive integrated life science databases and for small in-house project databases. LAILAPS is publicly available for SWISSPROT data at http://lailaps.ipk-gatersleben.de.

  9. PubMed vs. HighWire Press: a head-to-head comparison of two medical literature search engines.

    Science.gov (United States)

    Vanhecke, Thomas E; Barnes, Michael A; Zimmerman, Janet; Shoichet, Sandor

    2007-09-01

    PubMed and HighWire Press are both useful medical literature search engines available for free to anyone on the internet. We measured retrieval accuracy, number of results generated, retrieval speed, features and search tools on HighWire Press and PubMed using the quick search features of each. We found that using HighWire Press resulted in a higher likelihood of retrieving the desired article and higher number of search results than the same search on PubMed. PubMed was faster than HighWire Press in delivering search results regardless of search settings. There are considerable differences in search features between these two search engines.

  10. Precision and Recall of Five Search Engines for Retrieval of Scholarly Information in the Field of Biotechnology

    Directory of Open Access Journals (Sweden)

    Rafiq A. Rather

    2005-08-01

    Full Text Available This paper presents the results of a study of five search engines (AltaVista, Google, HotBot, Scirus and Bioweb) for retrieving scholarly information using biotechnology-related search terms. The search engines are evaluated by taking the first ten results pertaining to 'scholarly information' for estimation of precision and recall. The study shows that Scirus is the most comprehensive in retrieving 'scholarly information', followed by Google and HotBot. It also reveals that the search engines (except Bioweb) perform well on structured queries, while Bioweb performs better on unstructured queries.

  11. Ultrafast Band Engineering and Transient Spin Currents in Antiferromagnetic Oxides.

    Science.gov (United States)

    Gu, Mingqiang; Rondinelli, James M

    2016-04-29

    We report a dynamic structure and band engineering strategy with experimental protocols to induce indirect-to-direct band gap transitions and coherently oscillating pure spin-currents in three-dimensional antiferromagnets (AFM) using selective phononic excitations. In the Mott insulator LaTiO3, we show that a photo-induced nonequilibrium phonon mode amplitude destroys the spin and orbitally degenerate ground state, reduces the band gap by 160 meV and renormalizes the carrier masses. The time scale of this process is a few hundred femtoseconds. Then in the hole-doped correlated metallic titanate, we show how pure spin-currents can be achieved to yield spin-polarizations exceeding those observed in classic semiconductors. Last, we demonstrate the generality of the approach by applying it to the non-orbitally degenerate AFM CaMnO3. These results advance our understanding of electron-lattice interactions in structures out-of-equilibrium and establish a rational framework for designing dynamic phases that may be exploited in ultrafast optoelectronic and optospintronic devices.

  12. Current status of duplex surface engineered Ti-based materials

    Institute of Scientific and Technical Information of China (English)

    T.Bell

    2004-01-01

    Industrial exploitation of the high specific strength and corrosion resistance of titanium has historically been dominated by technological advances in gas-turbine engine and aircraft components. Realization of the possible benefits in general engineering has been limited by the absence of any proven and reliable means of overcoming the poor wear resistance and galling tendency suffered by titanium alloys when in contact with other materials. This problem can only be addressed by optimizing and demonstrating industrially viable surface engineering processes for titanium in general engineering. The status of single and duplex surface engineering systems is reviewed. In addition, to fully realize the potential of advanced surface engineering of titanium components, contact mechanics models are developed to enable automotive engineers to design dynamically loaded automotive engine and transmission components.

  13. The internet and intelligent machines: search engines, agents and robots; Radiologische Informationssuche im Internet: Datenbanken, Suchmaschinen und intelligente Agenten

    Energy Technology Data Exchange (ETDEWEB)

    Achenbach, S.; Alfke, H. [Marburg Univ. (Germany). Abt. fuer Strahlendiagnostik

    2000-04-01

    The internet plays an important role in a growing number of medical applications. Finding relevant information is not always easy, as the amount of available information on the Web is rising quickly. Even the best search engines can only collect links to a fraction of all existing Web pages. In addition, many of these indexed documents have been changed or deleted. The vast majority of information on the Web is not searchable with conventional methods. New search strategies, technologies and standards are combined in Intelligent Search Agents (ISA) and Robots, which can retrieve desired information in a specific approach. Conclusion: The article describes differences between ISAs and conventional search engines and how communication between agents improves their ability to find information. Examples of existing ISAs are given and the possible influences on current and future work in radiology are discussed. (orig.) [German abstract, translated] The internet is increasingly used in medical applications, but finding relevant information is not always easy. The number of documents available on the World Wide Web is growing so quickly that searching is becoming increasingly difficult: even good search engines cover only a few percent of the existing pages in their databases. In addition, constant changes mean that only some of these searchable documents still exist at all. Most of the internet therefore cannot be accessed with conventional methods. New standards, search strategies and technologies come together in search agents and robots, which can locate content in a more targeted and intelligent way. Conclusion: The article describes how an intelligent search agent (ISA) differs from a search engine and how it can better meet users' requirements by cooperating with other agents. In addition to the fundamentals, example applications that exist on the net today are shown, and an outlook

  14. Multi-lingual search engine to access PubMed monolingual subsets: a feasibility study.

    Science.gov (United States)

    Darmoni, Stéfan J; Soualmia, Lina F; Griffon, Nicolas; Grosjean, Julien; Kerdelhué, Gaétan; Kergourlay, Ivan; Dahamna, Badisse

    2013-01-01

    PubMed contains many articles in languages other than English, but it is difficult to find them using the English version of the Medical Subject Headings (MeSH) Thesaurus. The aim of this work is to propose a tool allowing access to a PubMed subset in one language, and to evaluate its performance. Translations of MeSH were enriched and gathered in the information system. PubMed subsets in the main European languages were also added to our database, using a dedicated parser. The CISMeF generic semantic search engine was evaluated on the response time for simple queries. MeSH descriptors are currently available in 11 languages in the information system. All 654,000 PubMed citations in French were integrated into the CISMeF database. None of the response times exceeded the threshold defined for usability (2 seconds). It is now possible to freely access biomedical literature in French using a tool in French; health professionals and lay people with limited English proficiency may find it useful. It will be expanded to several European languages: German, Spanish, Norwegian and Portuguese.

  15. Current Reports: Educating Scientists and Engineers: The View from OTA.

    Science.gov (United States)

    Morgan, Robert P.

    1989-01-01

    Compares two engineering education reports which urge the following needs and emphases: attract and retain minorities, retain students already in engineering school, and allow students to enter the engineering program at various levels. Criticizes the Office of Technology Assessment's report and supplies prescriptions for the future. (MVL)

  16. Start Your Search Engines. Part 2: When Image is Everything, Here are Some Great Ways to Find One

    Science.gov (United States)

    Adam, Anna; Mowers, Helen

    2008-01-01

    There is no doubt that Google is great for finding images. Simply head to its home page, click the "Images" link, enter criteria in the search box, and--voila! In this article, the authors share some of their other favorite search engines for finding images. To make sure the desired images are available for educational use, consider searching for…

  17. Efficient Proposed Framework for Semantic Search Engine using New Semantic Ranking Algorithm

    Directory of Open Access Journals (Sweden)

    M. M. El-gayar

    2015-08-01

    The amount of information grows across billions of databases every year, and there is an urgent need to search that information with a specialized tool called a search engine. Many search engines are available today, but the main challenge is that most of them cannot retrieve meaningful information intelligently. Semantic web technology is a solution that keeps data in a readable format, which helps machines match this data with related information based on meaning. In this paper, we introduce a proposed semantic framework that includes four phases: crawling, indexing, ranking and retrieval. This semantic framework operates over sorted RDF, using the proposed efficient ranking algorithm and an enhanced crawling algorithm. The enhanced crawling algorithm crawls relevant forum content from the web with minimal overhead. The proposed ranking algorithm orders and evaluates similar meaningful data to make the retrieval process faster, easier and more accurate. We applied our work to a standard database and achieved 99 percent effectiveness on semantic performance in minimum time, with an error rate of less than 1 percent compared with the other semantic systems.

  18. Developing a distributed HTML5-based search engine for geospatial resource discovery

    Science.gov (United States)

    ZHOU, N.; XIA, J.; Nebert, D.; Yang, C.; Gui, Z.; Liu, K.

    2013-12-01

    With the explosive growth of data, Geospatial Cyberinfrastructure (GCI) components are developed to manage geospatial resources, such as data discovery and data publishing. However, efficient discovery of geospatial resources remains challenging because: (1) existing GCIs are usually developed for users of specific domains, so users may have to visit a number of GCIs to find appropriate resources; (2) the complexity of the decentralized network environment usually results in slow response and poor user experience; (3) users who use different browsers and devices may have very different user experiences because of the diversity of front-end platforms (e.g. Silverlight, Flash or HTML). To address these issues, we developed a distributed, HTML5-based search engine. Specifically, (1) the search engine adopts a brokering approach to retrieve geospatial metadata from various distributed GCIs; (2) the asynchronous record retrieval mode enhances the search performance and user interactivity; (3) the search engine, being based on HTML5, is able to provide unified access capabilities for users with different devices (e.g. tablet and smartphone).
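
    The combination of brokering and asynchronous record retrieval can be sketched as follows. This is a minimal illustration under assumptions, not the authors' implementation: the catalog names, latencies and records are hypothetical, and `asyncio.sleep` stands in for real network requests to remote catalogs.

```python
import asyncio


async def query_catalog(name, delay, records):
    """Stand-in for a remote catalog request; the delay simulates network latency."""
    await asyncio.sleep(delay)
    return [(name, r) for r in records]


async def broker_search(catalogs):
    """Query all catalogs concurrently and collect each batch as soon as it arrives."""
    tasks = [asyncio.create_task(query_catalog(*c)) for c in catalogs]
    merged = []
    for finished in asyncio.as_completed(tasks):
        merged.extend(await finished)  # a user interface could render this batch at once
    return merged


# Hypothetical catalogs: (name, simulated latency in seconds, matching records).
catalogs = [("slow_gci", 0.05, ["r1"]), ("fast_gci", 0.01, ["r2", "r3"])]
results = asyncio.run(broker_search(catalogs))
```

    Because batches are consumed in completion order, records from the faster catalog appear first and a slow catalog does not block the rest of the result list.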

  19. A Federated Search Approach to Facilitate Systematic Literature Review in Software Engineering

    Directory of Open Access Journals (Sweden)

    Mohammad Ghafari

    2012-04-01

    To impact industry, researchers developing technologies in academia need to provide tangible evidence of the advantages of using them. Nowadays, Systematic Literature Review (SLR) has become a prominent methodology in evidence-based research. Although the adoption of SLR in software engineering has not gone far in practice, it has resulted in valuable research and is becoming more common. However, digital libraries and scientific databases, the best research resources, do not provide adequate mechanisms for SLRs, especially in software engineering. On the other hand, any loss of data may change the SLR results and lead to research bias. Accordingly, the search process and evidence collection in SLR is a critical point. This paper provides some tips to enhance the SLR process. The main contribution of this work is a federated search tool that provides an automatic integrated search mechanism across well-known software engineering databases. Results of a case study show that this approach not only reduces the time required to do an SLR and facilitates its search process, but also improves its reliability and contributes to the increasing trend to use SLRs.

  20. GGRNA: an ultrafast, transcript-oriented search engine for genes and transcripts.

    Science.gov (United States)

    Naito, Yuki; Bono, Hidemasa

    2012-07-01

    GGRNA (http://GGRNA.dbcls.jp/) is a Google-like, ultrafast search engine for genes and transcripts. The web server accepts arbitrary words and phrases, such as gene names, IDs, gene descriptions, annotations of gene and even nucleotide/amino acid sequences through one simple search box, and quickly returns relevant RefSeq transcripts. A typical search takes just a few seconds, which dramatically enhances the usability of routine searching. In particular, GGRNA can search sequences as short as 10 nt or 4 amino acids, which cannot be handled easily by popular sequence analysis tools. Nucleotide sequences can be searched allowing up to three mismatches, or the query sequences may contain degenerate nucleotide codes (e.g. N, R, Y, S). Furthermore, Gene Ontology annotations, Enzyme Commission numbers and probe sequences of catalog microarrays are also incorporated into GGRNA, which may help users to conduct searches by various types of keywords. GGRNA web server will provide a simple and powerful interface for finding genes and transcripts for a wide range of users. All services at GGRNA are provided free of charge to all users.
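
    The matching rule described above (up to three mismatches, with degenerate nucleotide codes allowed in the query) can be illustrated with a brute-force scan. This is only a sketch of the matching semantics: the abstract does not describe how the server indexes sequences, and the function name and the subset of IUPAC codes below are choices made for illustration.

```python
# IUPAC degenerate nucleotide codes (a subset is enough for this sketch).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "N": "ACGT",
}


def find_matches(query, target, max_mismatches=3):
    """Return (position, mismatches) for every window of target that matches
    query with at most max_mismatches mismatching bases."""
    query, target = query.upper(), target.upper()
    hits = []
    for start in range(len(target) - len(query) + 1):
        mismatches = 0
        for q, t in zip(query, target[start:start + len(query)]):
            if t not in IUPAC.get(q, ""):
                mismatches += 1
                if mismatches > max_mismatches:
                    break
        else:
            # The inner loop finished without exceeding the mismatch budget.
            hits.append((start, mismatches))
    return hits
```

    For example, `find_matches("ANRT", "ACGT", 0)` reports a hit at position 0, because `N` accepts any base and `R` accepts `A` or `G`. A linear scan like this does not scale to a transcript database; it only makes the matching rule concrete.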

  1. Refining comparative proteomics by spectral counting to account for shared peptides and multiple search engines.

    Science.gov (United States)

    Chen, Yao-Yi; Dasari, Surendra; Ma, Ze-Qiang; Vega-Montoto, Lorenzo J; Li, Ming; Tabb, David L

    2012-09-01

    Spectral counting has become a widely used approach for measuring and comparing protein abundance in label-free shotgun proteomics. However, when analyzing complex samples, the ambiguity of matching between peptides and proteins greatly affects the assessment of peptide and protein inventories, differentiation, and quantification. Meanwhile, the configuration of database searching algorithms that assign peptides to MS/MS spectra may produce different results in comparative proteomic analysis. Here, we present three strategies to improve comparative proteomics through spectral counting. We show that comparing spectral counts for peptide groups rather than for protein groups forestalls problems introduced by shared peptides. We demonstrate the advantage and flexibility of this new method in two datasets. We present four models to combine four popular search engines that lead to significant gains in spectral counting differentiation. Among these models, we demonstrate a powerful vote counting model that scales well for multiple search engines. We also show that semi-tryptic searching outperforms tryptic searching for comparative proteomics. Overall, these techniques considerably improve protein differentiation on the basis of spectral count tables.

  2. Durham Zoo: Powering a Search-&-Innovation Engine with Collective Intelligence

    Directory of Open Access Journals (Sweden)

    Richard Absalom

    2015-02-01

    Purpose – Durham Zoo (hereinafter DZ) is a project to design and operate a concept search engine for science and technology. In DZ, a concept includes a solution to a problem in a particular context. Design – Concept searching is rendered complex by the fuzzy nature of a concept, the many possible implementations of the same concept, and the many more ways that the many implementations can be expressed in natural language. An additional complexity is the diversity of languages and formats in which the concepts can be disclosed. Humans understand language, inference, implication and abstraction and, hence, concepts much better than computers, which in turn are much better at storing and processing vast amounts of data. We are 7 billion on the planet and we have the Internet as the backbone for Collective Intelligence. So, our concept search engine uses humans to store concepts via a shorthand that can be stored, processed and searched by computers: so, humans IN and computers OUT. The shorthand is classification: metadata in a structure that can define the content of a disclosure. The classification is designed to be powerful in terms of defining and searching concepts, whilst suited to a crowdsourcing effort. It is simple and intuitive to use. Most importantly, it is adapted to restrict ambiguity, which is the poison of classification, without imposing a restrictive centralised management. In the classification scheme, each entity is shown together in a graphical representation with related entities. The entities are arranged on a sliding scale of similarity. This sliding scale is effectively fuzzy classification. Findings – The authors of the paper have been developing a first classification scheme for the technology of traffic cones, this in preparation for a trial of a working system. The process has enabled the authors to further explore the practicalities of concept classification. The CmapTools knowledge modelling kit to develop the

  3. MuZeeker - Adapting a music search engine for mobile phones

    DEFF Research Database (Denmark)

    Larsen, Jakob Eg; Halling, Søren Christian; Sigurdsson, Magnus Kristinn

    2010-01-01

    Zeeker application. We report from two usability experiments using the think aloud protocol, in which N=20 participants performed tasks using MuZeeker and a customized Google search engine. In both experiments web-based and mobile user interfaces were used. The experiment shows that participants are capable...... of solving tasks slightly better using MuZeeker, while the "inexperienced" MuZeeker users perform slightly slower than experienced Google users. This was found in both the web-based and the mobile applications. It was found that task performance in the mobile search applications (MuZeeker and Google) was 2...

  4. The Library Search Engine: A Smart Solution for Integrating Resources Beyond Library Holdings

    Directory of Open Access Journals (Sweden)

    Karin Herm

    2008-09-01

    The Cooperative Library Network Berlin-Brandenburg (KOBV, Germany) addresses the problem of how to integrate resources found outside the library and library holdings into a single discovery tool. It presents a solution that uses open source technology to develop a next-generation catalog interface called the Library Search Engine. This pilot project was launched in 2007 with the library of Albert Einstein Science Park, Potsdam. The idea was to design and develop a fast and convenient search tool, integrating local holdings (books, journals, journal articles) as well as relevant scientific subject information such as open access publications and bibliographies.

  5. GeNemo: a search engine for web-based functional genomic data

    OpenAIRE

    Zhang, Yongqing; Cao, Xiaoyi; Zhong, Sheng

    2016-01-01

    A set of new data types emerged from functional genomic assays, including ChIP-seq, DNase-seq, FAIRE-seq and others. The results are typically stored as genome-wide intensities (WIG/bigWig files) or functional genomic regions (peak/BED files). These data types present new challenges to big data science. Here, we present GeNemo, a web-based search engine for functional genomic data. GeNemo searches user-input data against online functional genomic datasets, including the entire collection of E...

  6. Comparison of Four Search Engines and their efficacy With Emphasis on Literature Research in Addiction (Prevention and Treatment)

    Science.gov (United States)

    Samadzadeh, Gholam Reza; Rigi, Tahereh; Ganjali, Ali Reza

    2013-01-01

    Background: Retrieving valuable and up-to-date information from the internet has become vital for researchers and scholars, because every day thousands, perhaps millions, of scientific works are published as digital resources on the internet. Researchers cannot ignore this resource when searching the literature, since it offers related documents that may not be found in any library. Given the variety of documents available on the internet, search engines are among the most effective tools for finding information. Objectives: The aim of this study is to evaluate four search engines (PubMed, Science Direct, Google Scholar and the federated search of the Iranian National Medical Digital Library) on three criteria (recall, preciseness and importance) in the field of addiction (prevention and treatment), in order to select the most effective search engine for literature research. Materials and Methods: This was a cross-sectional study in which four popular search engines in the medical sciences were evaluated. Medical Subject Headings (MeSH) were used to select keywords. We entered the keywords into the search engines and evaluated the first 10 entries returned. Direct observation was used for data collection, and the data were analyzed with descriptive statistics (number, percentage and mean) and inferential statistics (one-way analysis of variance (ANOVA) and post hoc Tukey test) in SPSS 15. P Value Science Direct and Google Scholar were the best in recall, preciseness and importance respectively. Conclusions: Because literature research is one of the most important stages of research, researchers, especially those studying substance-related disorders, should use several search engines with the best recall, preciseness and importance in their subject field rather than depend on just one. PMID:24971257

  7. Towards a portal and search engine to facilitate academic and research collaboration in engineering and education

    Science.gov (United States)

    Bonilla Villarreal, Isaura Nathaly

    While international academic and research collaborations are of great importance at this time, it is not easy to find researchers in the engineering field who publish in languages other than English. Because of this disconnect, there is a need for a portal to find Who's Who in Engineering Education in the Americas. The objective of this thesis is to build an object-oriented architecture for this proposed portal. The Unified Modeling Language (UML) model developed in this thesis incorporates the basic structure of a social network for academic purposes. Reverse engineering of three social network portals yielded important aspects of their structures that have been incorporated into the proposed UML model. Furthermore, the present work includes a pattern for academic social networks.

  8. Perivascular cells and tissue engineering: Current applications and untapped potential.

    Science.gov (United States)

    Avolio, Elisa; Alvino, Valeria V; Ghorbel, Mohamed T; Campagnolo, Paola

    2017-03-01

    The recent development of tissue engineering provides exciting new perspectives for the replacement of failing organs and the repair of damaged tissues. Perivascular cells, including vascular smooth muscle cells, pericytes and other tissue-specific populations residing around blood vessels, have been isolated from many organs and are known to participate in the in situ repair process and angiogenesis. Their potential has been harnessed for cell therapy of numerous pathologies; however, in this Review we will discuss the potential of perivascular cells in the development of tissue engineering solutions for healthcare. We will examine their application in the engineering of vascular grafts, cardiac patches and bone substitutes as well as other tissue engineering applications and we will focus on their extensive use in the vascularization of engineered constructs. Additionally, we will discuss the emerging potential of human pericytes for the development of efficient, vascularized and non-immunogenic engineered constructs. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  9. Genetic Networks of Complex Disorders: from a Novel Search Engine for PubMed Article Database.

    Science.gov (United States)

    Jung, Jae-Yoon; Wall, Dennis Paul

    2013-01-01

    Finding genetic risk factors of complex disorders may involve reviewing hundreds of genes or thousands of research articles iteratively, but few tools have been available to facilitate this procedure. In this work, we built a novel publication search engine that can identify target-disorder specific, genetics-oriented research articles and extract the genes with significant results. Preliminary test results showed that the output of this engine has better coverage in terms of genes or publications, than other existing applications. We consider it as an essential tool for understanding genetic networks of complex disorders.

  10. Impact of Commercial Search Engines and International Databases on Engineering Teaching and Research

    Science.gov (United States)

    Chanson, Hubert

    2007-01-01

    For the last three decades, the engineering higher education and professional environments have been completely transformed by the "electronic/digital information revolution" that has included the introduction of the personal computer, the development of email and the World Wide Web, and broadband Internet connections at home. Herein the writer compares…

  11. Crescendo: A Protein Sequence Database Search Engine for Tandem Mass Spectra.

    Science.gov (United States)

    Wang, Jianqi; Zhang, Yajie; Yu, Yonghao

    2015-07-01

    A search engine that reliably discovers more peptides is essential to progress in computational proteomics. We propose two new scoring functions (L- and P-scores), which aim to capture similar characteristics of a peptide-spectrum match (PSM) as Sequest and Comet do. Crescendo, introduced here, is a software program that implements these two scores for peptide identification. We applied Crescendo to test datasets and compared its performance with widely used search engines, including Mascot, Sequest, and Comet. The results indicate that Crescendo identifies a similar or larger number of peptides at various predefined false discovery rates (FDR). Importantly, it also provides a better separation between the true and decoy PSMs, warranting the future development of a companion post-processing filtering algorithm.
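
    Comparing search engines "at a predefined FDR" is usually done with a target-decoy estimate. The sketch below assumes the common convention of estimating FDR as the number of decoy PSMs divided by the number of target PSMs above a score cutoff; the abstract does not say which estimator Crescendo's evaluation used, and the function name and data layout are hypothetical.

```python
def psms_at_fdr(psms, max_fdr=0.01):
    """Count target PSMs accepted at the lowest score cutoff whose estimated
    FDR (decoys / targets above the cutoff) stays within max_fdr.

    psms is an iterable of (score, is_decoy) pairs; higher scores are better.
    """
    targets = decoys = accepted = 0
    for score, is_decoy in sorted(psms, key=lambda p: p[0], reverse=True):
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        # Remember the largest target count seen while the estimate is acceptable.
        if targets and decoys / targets <= max_fdr:
            accepted = targets
    return accepted
```

    Running this on the PSM lists of two engines with the same `max_fdr` gives directly comparable identification counts. A scoring function that separates true from decoy matches more cleanly pushes decoys toward the bottom of the ranking and so accepts more targets at the same cutoff.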

  12. Web search engine: characteristics of user behaviors and their implication

    Institute of Scientific and Technical Information of China (English)

    王建勇; 单松巍; 雷鸣; 谢正茂; 李晓明

    2001-01-01

    This paper first studies the distribution characteristics of user behaviors based on log data from a massive web search engine. Analysis shows that the stochastic distribution of user queries follows a power-law function and exhibits strong similarity, and that users' queries and clicked URLs present dramatic locality, which implies that a query cache and a 'hot click' cache can be employed to improve system performance. Then three typical cache replacement policies are compared: LRU, FIFO, and LFU with attenuation. In addition, the distribution characteristics of web information are analyzed, which demonstrates that the link popularity and replica popularity of a URL have a positive influence on its importance. Finally, the variance between link popularity and user popularity, and the variance between replica popularity and user popularity, are analyzed, which gives some important insight that helps to improve the ranking algorithms in a search engine.
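
    To make the cache comparison concrete, the sketch below measures the hit rate of two of the named policies, LRU and FIFO, on the same query stream. The stream and the cache capacity are invented for illustration; the paper's log data and its "LFU with attenuation" variant are not reproduced here.

```python
from collections import OrderedDict, deque


def hit_rate_lru(queries, capacity):
    """Hit rate of a least-recently-used cache over a query stream."""
    cache, hits = OrderedDict(), 0
    for q in queries:
        if q in cache:
            hits += 1
            cache.move_to_end(q)           # mark as most recently used
        else:
            if len(cache) >= capacity:
                cache.popitem(last=False)  # evict the least recently used entry
            cache[q] = True
    return hits / len(queries) if queries else 0.0


def hit_rate_fifo(queries, capacity):
    """Hit rate of a first-in-first-out cache over the same stream."""
    cache, order, hits = set(), deque(), 0
    for q in queries:
        if q in cache:
            hits += 1                      # a hit does not change eviction order
        else:
            if len(cache) >= capacity:
                cache.discard(order.popleft())
            cache.add(q)
            order.append(q)
    return hits / len(queries) if queries else 0.0


# Hypothetical skewed stream: one "hot" query recurs between rarer ones.
stream = ["a", "b", "a", "c", "a", "d", "a", "b"]
lru, fifo = hit_rate_lru(stream, 2), hit_rate_fifo(stream, 2)
```

    On this stream LRU keeps the hot query resident and reaches a hit rate of 0.375, while FIFO evicts it periodically and reaches 0.25. This is the kind of effect that strong query locality would be expected to produce.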

  13. Origin of Disagreements in Tandem Mass Spectra Interpretation by Search Engines.

    Science.gov (United States)

    Tessier, Dominique; Lollier, Virginie; Larré, Colette; Rogniaux, Hélène

    2016-10-07

    Several proteomic database search engines that interpret LC-MS/MS data do not identify the same set of peptides. These disagreements occur even when the scores of the peptide-to-spectrum matches suggest good confidence in the interpretation. Our study shows that these disagreements observed for the interpretations of a given spectrum are almost exclusively due to the variation of what we call the "peptide space", i.e., the set of peptides that are actually compared to the experimental spectra. We discuss the potential difficulties of precisely defining the "peptide space." Indeed, although several parameters that are generally reported in publications can easily be set to the same values, many additional parameters, with much less straightforward user access, might impact the "peptide space" used by each program. Moreover, in a configuration where each search engine identifies the same candidates for each spectrum, the inference of the proteins may remain quite different depending on the false discovery rate selected.

  14. FPS-RAM: Fast Prefix Search RAM-Based Hardware for Forwarding Engine

    Science.gov (United States)

    Zaitsu, Kazuya; Yamamoto, Koji; Kuroda, Yasuto; Inoue, Kazunari; Ata, Shingo; Oka, Ikuo

    Ternary content addressable memory (TCAM) is becoming very popular for designing high-throughput forwarding engines on routers. However, TCAM has potential problems in terms of hardware and power costs, which limit the capacity that can be deployed in IP routers. In this paper, we propose a new hardware architecture for fast forwarding engines, called fast prefix search RAM-based hardware (FPS-RAM). We designed FPS-RAM with the intent of maintaining the same search performance and physical user interface as TCAM, because our objective is to replace the TCAM in the market. Our RAM-based hardware architecture is completely different from that of TCAM and has dramatically reduced the costs and power consumption to 62% and 52%, respectively. We implemented FPS-RAM on an FPGA to examine its lookup operation.
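
    The lookup that a forwarding engine has to perform, whether in TCAM or in RAM-based hardware, is a longest-prefix match. The software sketch below uses a plain binary trie to show that operation; it is not the FPS-RAM architecture, which the abstract does not detail, and the class name and route table are hypothetical.

```python
import ipaddress


class PrefixTrie:
    """Binary trie over IPv4 prefixes; lookup returns the next hop of the
    longest prefix that matches the address."""

    def __init__(self):
        self.root = {}

    def insert(self, prefix, next_hop):
        net = ipaddress.ip_network(prefix)
        bits = format(int(net.network_address), "032b")[:net.prefixlen]
        node = self.root
        for b in bits:
            node = node.setdefault(b, {})
        node["hop"] = next_hop

    def lookup(self, address):
        bits = format(int(ipaddress.ip_address(address)), "032b")
        node, best = self.root, self.root.get("hop")
        for b in bits:
            node = node.get(b)
            if node is None:
                break
            best = node.get("hop", best)  # keep the deepest match seen so far
        return best


# Hypothetical route table.
table = PrefixTrie()
table.insert("0.0.0.0/0", "default")
table.insert("10.0.0.0/8", "A")
table.insert("10.1.0.0/16", "B")
```

    With this table, `table.lookup("10.1.2.3")` returns `"B"` and `table.lookup("10.2.3.4")` returns `"A"`. A trie walks up to 32 bits one at a time, whereas TCAM compares all prefixes in parallel, which is why matching TCAM's search performance in RAM calls for a dedicated hardware design.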

  16. Quantitative evaluation of recall and precision of CAT Crawler, a search engine specialized on retrieval of Critically Appraised Topics

    Directory of Open Access Journals (Sweden)

    Loh Marie

    2004-12-01

    Background: Critically Appraised Topics (CATs) are a useful tool that helps physicians to make clinical decisions as healthcare moves towards the practice of Evidence-Based Medicine (EBM). The fast-growing World Wide Web has provided a place for physicians to share their appraised topics online, but an increasing amount of time is needed to find a particular topic within such a rich repository. Methods: A web-based application, the CAT Crawler, was developed by Singapore's Bioinformatics Institute to allow physicians to adequately access available appraised topics on the Internet. A meta-search engine, the core component of the application, finds relevant topics following keyword input. The primary objective of the work presented here is to evaluate the quantity and quality of search results obtained from the meta-search engine of the CAT Crawler by comparing them with those obtained from two individual CAT search engines. From the CAT libraries at these two sites, all possible keywords were extracted using a keyword extractor. Of those common to both libraries, ten were randomly chosen for evaluation. All ten were submitted to the two search engines individually, and through the meta-search engine of the CAT Crawler. Search results were evaluated for relevance both by medical amateurs and professionals, and the respective recall and precision were calculated. Results: While achieving an identical recall, the meta-search engine showed a precision of 77.26% (±14.45) compared to the individual search engines' 52.65% (±12.0) (p Conclusion: The results demonstrate the validity of the CAT Crawler meta-search engine approach. The improved precision due to inherent filters underlines the practical usefulness of this tool for clinicians.

  17. Automatic sorting of toxicological information into the IUCLID (International Uniform Chemical Information Database) endpoint-categories making use of the semantic search engine Go3R.

    Science.gov (United States)

    Sauer, Ursula G; Wächter, Thomas; Hareng, Lars; Wareing, Britta; Langsch, Angelika; Zschunke, Matthias; Alvers, Michael R; Landsiedel, Robert

    2014-06-01

    The knowledge-based search engine Go3R, www.Go3R.org, has been developed to assist scientists from industry and regulatory authorities in collecting comprehensive toxicological information with a special focus on identifying available alternatives to animal testing. The semantic search paradigm of Go3R makes use of expert knowledge on 3Rs methods and regulatory toxicology, laid down in the ontology, a network of concepts, terms, and synonyms, to recognize the contents of documents. Search results are automatically sorted into a dynamic table of contents presented alongside the list of documents retrieved. This table of contents allows the user to quickly filter the set of documents by topics of interest. Documents containing hazard information are automatically assigned to a user interface following the endpoint-specific IUCLID5 categorization scheme required, e.g. for REACH registration dossiers. For this purpose, complex endpoint-specific search queries were compiled and integrated into the search engine (based upon a gold standard of 310 references that had been assigned manually to the different endpoint categories). Go3R sorts 87% of the references concordantly into the respective IUCLID5 categories. Currently, Go3R searches in the 22 million documents available in the PubMed and TOXNET databases. However, it can be customized to search in other databases including in-house databanks.

  18. Conceptual framework and system house approach to designing search engine controversy

    Directory of Open Access Journals (Sweden)

    В.В. Шкурко

    2006-04-01

    Full Text Available. Conceptual and systems-engineering principles for designing a system that searches for contradictions. The paper outlines the range of tasks that must be solved in order to organize computer support for logical-linguistic professional findings in legislative documents. It presents a model of the information process for carrying out such findings, proposes a scheme for the interworking of information elements, and develops a model of the contradictions base.

  19. An effective hybrid cuckoo search and genetic algorithm for constrained engineering design optimization

    Science.gov (United States)

    Kanagaraj, G.; Ponnambalam, S. G.; Jawahar, N.; Mukund Nilakantan, J.

    2014-10-01

    This article presents an effective hybrid cuckoo search and genetic algorithm (HCSGA) for solving engineering design optimization problems involving problem-specific constraints and mixed variables such as integer, discrete and continuous variables. The proposed algorithm, HCSGA, is first applied to 13 standard benchmark constrained optimization functions and subsequently used to solve three well-known design problems reported in the literature. The numerical results obtained by HCSGA show competitive performance with respect to recent algorithms for constrained design optimization problems.
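
    The abstract above does not describe how HCSGA combines its two components, so the following is only a generic sketch of one way cuckoo search and a genetic algorithm can be hybridized. It is not the authors' algorithm. The toy constrained problem, the static penalty, the replacement rule, and every parameter value (`n_nests`, `pa`, the step and mutation scales) are assumptions chosen for illustration.

```python
import math
import random


def levy_step(rng, beta=1.5):
    """Draw one heavy-tailed step length using Mantegna's algorithm."""
    num = math.gamma(1 + beta) * math.sin(math.pi * beta / 2)
    den = math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2)
    sigma = (num / den) ** (1 / beta)
    u = rng.gauss(0, sigma)
    v = rng.gauss(0, 1) or 1e-12  # guard against a zero divisor
    return u / abs(v) ** (1 / beta)


def penalized(x):
    """Toy problem: minimize x0^2 + x1^2 subject to x0 + x1 >= 1.

    The constraint is handled with a static quadratic penalty (an assumption).
    """
    violation = max(0.0, 1.0 - x[0] - x[1])
    return x[0] ** 2 + x[1] ** 2 + 1e3 * violation ** 2


def hybrid_search(objective, bounds, n_nests=20, iters=300, pa=0.25, seed=0):
    rng = random.Random(seed)

    def clip(x):
        return [min(max(xi, lo), hi) for xi, (lo, hi) in zip(x, bounds)]

    def tournament():
        a, b = rng.sample(range(n_nests), 2)
        return nests[a] if scores[a] < scores[b] else nests[b]

    nests = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_nests)]
    scores = [objective(n) for n in nests]
    n_replace = max(1, int(pa * n_nests))

    for _ in range(iters):
        best = nests[scores.index(min(scores))]

        # Cuckoo phase: Levy-flight move relative to the current best nest;
        # the trial replaces a randomly chosen nest only if it is better.
        for i in range(n_nests):
            trial = clip([xi + 0.1 * levy_step(rng) * (xi - bi)
                          for xi, bi in zip(nests[i], best)])
            j = rng.randrange(n_nests)
            f_trial = objective(trial)
            if f_trial < scores[j]:
                nests[j], scores[j] = trial, f_trial

        # GA phase: instead of random re-initialization, the worst nests are
        # rebuilt by blend crossover of tournament winners plus mutation.
        order = sorted(range(n_nests), key=scores.__getitem__)
        for k in order[-n_replace:]:
            p1, p2 = tournament(), tournament()
            w = rng.random()
            child = clip([w * a + (1 - w) * b + rng.gauss(0, 0.05 * (hi - lo))
                          for a, b, (lo, hi) in zip(p1, p2, bounds)])
            nests[k], scores[k] = child, objective(child)

    i_best = scores.index(min(scores))
    return nests[i_best], scores[i_best]


bounds = [(-2.0, 2.0), (-2.0, 2.0)]
best_x, best_f = hybrid_search(penalized, bounds)
print(best_x, best_f)
```

    For this toy problem the constrained optimum lies at x0 = x1 = 0.5 with an objective value of 0.5, so the returned point should be close to that. Handling mixed integer, discrete and continuous variables, as the article does, would require additional encoding steps that are not shown.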

  20. Current State and Development Strategy of CNPC Engineering Technology Service

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    Challenges to CNPC engineering technology service. Internal contradictions and problems: (1) Pricing mechanism for technology service. CNPC engineering technology companies have run a deficit for many years since restructuring; eight subordinate companies lost 239 million RMB in 2008 and as much as 770 million RMB in 2009.